Algorithm, Data Analysis and EDA - Data Science Current

Exploratory Data Analysis (EDA) – Credit Card Fraud Detection Case Study

Analytics Vidhya

MARCH 23, 2022

Overview: Credit card fraud causes large financial losses every year. In response, the financial industry has switched from an a posteriori investigation approach to an a priori predictive approach, designing fraud detection algorithms to warn and help fraud investigators. […].

Exploratory Data Analysis

Exploratory Data Analysis, EDA, Data Analysis

The ultimate guide to the Machine Learning Model Deployment

Data Science Dojo

JULY 5, 2023

The development of a Machine Learning Model can be divided into three main stages: Building your ML data pipeline: This stage involves gathering data, cleaning it, and preparing it for modeling. Exploratory data analysis (EDA): EDA is a process of exploring data to gain insights into its distribution, relationships, and patterns.

Machine Learning

Machine Learning, EDA, ML

Practicing Machine Learning with Imbalanced Dataset

Analytics Vidhya

JANUARY 31, 2023

But are they still useful without the data? The answer is no. Machine learning algorithms rely heavily on the data that we feed to them. The quality of data we feed to the algorithms […]

Machine Learning

Machine Learning, Artificial Intelligence

Understanding Data Science and Data Analysis Life Cycle

Pickl AI

MAY 30, 2024

Summary: The Data Science and Data Analysis life cycles are systematic processes crucial for uncovering insights from raw data. Quality data is foundational for accurate analysis, ensuring businesses stay competitive in the digital landscape. billion INR by 2026, with a CAGR of 27.7%.

Data Analysis

Data Analysis, Data Science, Exploratory Data Analysis

Data Analysis vs. Data Visualization – More Than Just Pretty Charts

Pickl AI

APRIL 3, 2025

Summary: Data Analysis focuses on extracting meaningful insights from raw data using statistical and analytical methods, while data visualization transforms these insights into visual formats like graphs and charts for better comprehension. Is Data Analysis just about crunching numbers?

Data Analysis

Data Analysis, Data Visualization, EDA

Exploring Different Types of Data Analysis: Methods and Applications

Pickl AI

OCTOBER 14, 2024

Summary: This article explores different types of Data Analysis, including descriptive, exploratory, inferential, predictive, diagnostic, and prescriptive analysis. Introduction Data Analysis transforms raw data into valuable insights that drive informed decisions. What is Data Analysis?

Data Analysis

Data Analysis, EDA, Data Mining

11 Open Source Data Exploration Tools You Need to Know in 2023

ODSC - Open Data Science

FEBRUARY 24, 2023

There are many well-known libraries and platforms for data analysis such as Pandas and Tableau, in addition to analytical databases like ClickHouse, MariaDB, Apache Druid, Apache Pinot, Google BigQuery, Amazon RedShift, etc. These tools will help make your initial data exploration process easy.

Exploratory Data Analysis

Exploratory Data Analysis, Data Visualization, Data Analysis

How To Learn Python For Data Science?

Pickl AI

NOVEMBER 4, 2024

This article will guide you through effective strategies to learn Python for Data Science, covering essential resources, libraries, and practical applications to kickstart your journey in this thriving field. Key Takeaways Python’s simplicity makes it ideal for Data Analysis. in 2022, according to the PYPL Index.

Data Science

Data Science, Python, Machine Learning

LLMOps demystified: Why it’s crucial and best practices for 2023

Data Science Dojo

AUGUST 28, 2023

Some projects may necessitate a comprehensive LLMOps approach, spanning tasks from data preparation to pipeline production. Exploratory Data Analysis (EDA) Data collection: The first step in LLMOps is to collect the data that will be used to train the LLM.

Exploratory Data Analysis

Exploratory Data Analysis, Data Preparation, Machine Learning

The AI Process

Towards AI

AUGUST 16, 2023

We can apply a data-centric approach by using AutoML or coding a custom test harness to evaluate many algorithms (say 20–30) on the dataset and then choose the top performers (perhaps top 3) for further study, being sure to give preference to simpler algorithms (Occam’s Razor).

AI

AI, Machine Learning

Life of modern-day alchemists: What does a data scientist do?

Dataconomy

AUGUST 16, 2023

Data scientists are the master keyholders, unlocking this portal to reveal the mysteries within. They wield algorithms like ancient incantations, summoning patterns from the chaos and crafting narratives from raw numbers. Model development: Crafting magic from algorithms!

Data Scientist

Data Scientist, Data Science, Machine Learning

Navigating the Exciting Stages: The Journey of a Machine Learning Project Life Cycle

Towards AI

FEBRUARY 3, 2024

From predicting the behavior of a customer to automating many tasks, machine learning has shown its capacity to convert raw data into actionable insights. However, converting raw data into actionable insights is not determined by ML algorithms alone. This process is called Exploratory Data Analysis (EDA).

Machine Learning

Machine Learning, Exploratory Data Analysis, Data Scientist

ML | Data Preprocessing in Python

Pickl AI

DECEMBER 3, 2024

In Python, commonly used libraries include: Pandas: For data manipulation and analysis, particularly for handling structured data. Scikit-learn: For Machine Learning algorithms and preprocessing utilities. Matplotlib/Seaborn: For data visualization. During EDA, you can: Check for missing values.
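As a minimal sketch of the "check for missing values" step mentioned above, the snippet below uses Pandas on a small made-up DataFrame. The column names, the toy values, and the choice to fill numeric gaps with the median are illustrative assumptions, not something the article prescribes.

```python
import pandas as pd

# Hypothetical toy data; None marks a missing entry.
df = pd.DataFrame({
    "age": [25, None, 31, 40],
    "city": ["Pune", "Delhi", None, "Mumbai"],
})

# Count missing values per column.
missing_counts = df.isna().sum()
print(missing_counts)

# One possible treatment (an assumption): fill numeric gaps with the median.
df["age"] = df["age"].fillna(df["age"].median())
print(df)
```

Whether to fill, drop, or flag missing entries depends on the dataset; the median fill here is only one common option.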

Python

Python, ML, Exploratory Data Analysis

Roadmap to Learn Data Science for Beginners and Freshers in 2023

Becoming Human

MAY 15, 2023

These communities will help you stay updated in the field, because experienced data scientists post there, and you can talk with them so that they can guide you in your journey. Data Analysis: After learning math, you are now able to talk with your data.

Data Science

Data Science, Machine Learning, Database

Turn the face of your business from chaos to clarity

Dataconomy

JULY 28, 2023

In the digital age, the abundance of textual information available on the internet, particularly on platforms like Twitter, blogs, and e-commerce websites, has led to an exponential growth in unstructured data. Text data is often unstructured, making it challenging to directly apply machine learning algorithms for sentiment analysis.

Power BI

Power BI, Data Preparation, Exploratory Data Analysis, Machine Learning

How to Calculate the Correlation Between Categorical and Continuous Values

Mlearning.ai

FEBRUARY 28, 2023

Theoretical Explanations and Practical Examples of Correlation between Categorical and Continuous Values Without any doubt, after obtaining the dataset, giving entire data to any ML model without any data analysis methods such as missing data analysis, outlier analysis, and correlation analysis.
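The excerpt does not show which measure the article uses, so the following is only a sketch of one common option for relating a categorical variable to a continuous one: the correlation ratio (eta), the square root of between-group variation divided by total variation. The function name and the sample data are hypothetical.

```python
from collections import defaultdict
from math import sqrt


def correlation_ratio(categories, values):
    """Return eta in [0, 1]: sqrt(between-group SS / total SS)."""
    groups = defaultdict(list)
    for cat, val in zip(categories, values):
        groups[cat].append(val)

    overall_mean = sum(values) / len(values)
    # Variation explained by the group means.
    ss_between = sum(
        len(g) * (sum(g) / len(g) - overall_mean) ** 2 for g in groups.values()
    )
    # Total variation around the overall mean.
    ss_total = sum((v - overall_mean) ** 2 for v in values)
    return sqrt(ss_between / ss_total) if ss_total else 0.0


# Hypothetical data: the category fully determines the value here.
cats = ["a", "a", "b", "b"]
vals = [1.0, 1.0, 3.0, 3.0]
eta = correlation_ratio(cats, vals)
print(eta)
```

A value near 1 suggests the category explains most of the variation in the continuous variable; a value near 0 suggests it explains little.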

Data Analysis

Data Analysis, EDA, ML

Decoding METAR Data: Insights from the Ocean Protocol Data Challenge

Ocean Protocol

MARCH 11, 2024

METAR, Miami International Airport (KMIA) on March 9, 2024, at 15:00 UTC. In the recently concluded data challenge hosted on Desights.ai, participants used exploratory data analysis (EDA) and advanced artificial intelligence (AI) techniques to enhance aviation weather forecasting accuracy.

Exploratory Data Analysis

Exploratory Data Analysis, Machine Learning, EDA

Machine Learning Project in Python Step-By-Step – Predicting Employee Attrition

Towards AI

FEBRUARY 21, 2023

First of all, HR needs to collect comprehensive data about an employee, such as education, salary, experience… We also need data from supervisors such as performance, relationships, promotions… After that, HR can use this information to predict employees’ tendency to leave and take preventive action. TRAIN ==Staying Rate: 83.87%Leaving

Machine Learning

Machine Learning, Python, Exploratory Data Analysis

Feature Engineering in Machine Learning

Pickl AI

JANUARY 3, 2024

Feature engineering in machine learning is a pivotal process that transforms raw data into a format comprehensible to algorithms. Through Exploratory Data Analysis, imputation, and outlier handling, robust models are crafted. Time features Objective: Extracting valuable information from time-related data.
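To illustrate the "time features" idea, here is a small sketch that derives a few calendar features from a timestamp using only the standard library. The function name and the particular features chosen are assumptions for illustration; the article may extract different ones.

```python
from datetime import datetime


def time_features(ts: datetime) -> dict:
    """Derive simple calendar features from a single timestamp."""
    return {
        "hour": ts.hour,
        "day_of_week": ts.weekday(),      # Monday = 0, Sunday = 6
        "month": ts.month,
        "is_weekend": ts.weekday() >= 5,  # Saturday or Sunday
    }


# Hypothetical timestamp: Saturday, 6 January 2024, 14:30.
features = time_features(datetime(2024, 1, 6, 14, 30))
print(features)
```

Features like these let a model pick up daily or weekly patterns that a raw timestamp hides.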

Machine Learning

Machine Learning, Exploratory Data Analysis, Cross Validation

Explaining PCA

Mlearning.ai

MARCH 22, 2023

Principal Component Analysis(PCA) is an essential algorithm in a data scientist's toolkit. This makes it particularly useful for analyzing large datasets with many variables, where it can be difficult to visualize and interpret the data. This shows the data will likely be classified using linear algorithms.

Data Scientist

Data Scientist, Machine Learning, EDA

Unveiling Market Dynamics: Winners of the Google Trends Analysis and Predictive Modeling

Ocean Protocol

MAY 24, 2024

The challenge required a detailed analysis of Google Trends data, integration of additional data sources, and the application of advanced ML methods to predict market behaviors. Data scientists across various expertise levels engaged in this challenge to determine Google Trends’ impact on cryptocurrency valuations.

EDA

EDA, Exploratory Data Analysis, Data Scientist, ML

The Data Dilemma: Exploring the Key Differences Between Data Science and Data Engineering

Pickl AI

JULY 25, 2023

Collaborating with data scientists, to ensure optimal model performance in real-world applications. With expertise in Python, machine learning algorithms, and cloud platforms, machine learning engineers optimize models for efficiency, scalability, and maintenance. Data Visualization: Matplotlib, Seaborn, Tableau, etc.

Data Engineering

Data Engineering, Data Engineer

Curve Finance Data Challenge Review & Insights Research

Ocean Protocol

DECEMBER 12, 2023

Abstract: This research report encapsulates the findings from the Curve Finance Data Challenge, a competition that engaged 34 participants in a comprehensive analysis of the decentralized finance protocol. Part 1: Exploratory Data Analysis (EDA). MEV: Over 25,000 MEV-related transactions have been executed through Curve.

Exploratory Data Analysis

Exploratory Data Analysis, Predictive Analytics, Data Analysis

Formula 1 Racing Challenge: 2024 Strategy Analysis

Ocean Protocol

SEPTEMBER 9, 2024

F1 :: 2024 Strategy Analysis Poster ‘The Formula 1 Racing Challenge’ challenges participants to analyze race strategies during the 2024 season. They will work with lap-by-lap data to assess how pit stop timing, tire selection, and stint management influence race performance.

EDA

EDA, Exploratory Data Analysis, Hypothesis Testing, Data Science

Artificial Intelligence Using Python: A Comprehensive Guide

Pickl AI

JULY 12, 2024

Jupyter notebooks are widely used in AI for prototyping, data visualisation, and collaborative work. Their interactive nature makes them suitable for experimenting with AI algorithms and analysing data. Importance of Data in AI Quality data is the lifeblood of AI models, directly influencing their performance and reliability.

Artificial Intelligence

Artificial Intelligence, Python, Natural Language Processing

Meet the winners of the Kelp Wanted challenge

DrivenData Labs

APRIL 10, 2024

In the Kelp Wanted challenge, participants were called upon to develop algorithms to help map and monitor kelp forests. Winning algorithms will not only advance scientific understanding, but also equip kelp forest managers and policymakers with vital tools to safeguard these vulnerable and vital ecosystems.

Deep Learning

Deep Learning, Machine Learning

Data Science Career FAQs Answered: Educational Background

Mlearning.ai

MAY 23, 2023

Blind 75 LeetCode Questions - LeetCode Discuss Data Manipulation and Analysis Proficiency in working with data is crucial. This includes skills in data cleaning, preprocessing, transformation, and exploratory data analysis (EDA).

Data Science

Data Science, Data Scientist, Machine Learning

Basic Data Science Terms Every Data Analyst Should Know

Pickl AI

SEPTEMBER 12, 2024

Data Cleaning: Raw data often contains errors, inconsistencies, and missing values. Data cleaning identifies and addresses these issues to ensure data quality and integrity. Data Visualisation: Effective communication of insights is crucial in Data Science.

Data Analyst

Data Analyst, Data Science, Machine Learning

AI in Time Series Forecasting

Pickl AI

DECEMBER 16, 2024

Summary: AI in Time Series Forecasting revolutionizes predictive analytics by leveraging advanced algorithms to identify patterns and trends in temporal data. Advanced algorithms recognize patterns in temporal data effectively. Making Data Stationary: Many forecasting models assume stationarity.
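One widely used way to make a trending series closer to stationary is differencing, that is, replacing each value with its change from an earlier value. The sketch below is a plain-Python illustration under that assumption; the excerpt does not say which technique the article recommends, and the function name and sample series are hypothetical.

```python
def difference(series, lag=1):
    """Return the lag-differenced series: series[i] - series[i - lag]."""
    return [series[i] - series[i - lag] for i in range(lag, len(series))]


# Hypothetical series with a steady upward trend.
trend = [10, 12, 14, 16, 18]
diffed = difference(trend)
print(diffed)  # the linear trend becomes a constant series
```

A seasonal pattern can be handled the same way by setting `lag` to the season length, for example 12 for monthly data with a yearly cycle.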

AI

AI, Machine Learning

Is your model good? A deep dive into Amazon SageMaker Canvas advanced metrics

AWS Machine Learning Blog

JULY 31, 2023

Model-centric approach In this approach, the data always remains the same and is used to iteratively improve the model to meet desired results. We use the model preview functionality to perform an initial EDA.

ML

ML, Data Preparation, Machine Learning

Top 10 Data Science Projects on GitHub

Pickl AI

JUNE 7, 2023

Face Recognition One of the most effective Github Projects on Data Science is a Face Recognition project that makes use of Deep Learning and Histogram of Oriented Gradients (HOG) algorithm. You can make use of HOG algorithm for orientation gradients and use Python library for creating and viewing HOG representations.

Data Science

Data Science, Deep Learning, Clustering

Building ML Platform in Retail and eCommerce

The MLOps Blog

MAY 31, 2023

The ML platform can utilize historic customer engagement data, also called “clickstream data”, and transform it into features essential for the success of the search platform. From an algorithmic perspective, Learning To Rank (LeToR) and Elastic Search are some of the most popular algorithms used to build a Search system.

ML

ML, Algorithm, Machine Learning

Machine Learning Project in Python Step-By-Step – Credit Card Fraud Detection

Towards AI

MARCH 1, 2023

Business questions to brainstorm: Since all features are anonymous, we will focus our analysis on non-anonymized features: Time, Amount How different is the amount of money used in different transaction classes? Exploratory Data Analysis — EDA Let us now check the missing values in the dataset.
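The question of how transaction amounts differ between classes can be explored with a simple per-class summary. The sketch below uses made-up rows rather than the real dataset; the `Amount` field comes from the excerpt, while the `Class` field name and its 0/1 coding (0 = genuine, 1 = fraud) are assumptions.

```python
from collections import defaultdict
from statistics import mean, median

# Hypothetical rows; the real dataset is not reproduced here.
transactions = [
    {"Amount": 10.0, "Class": 0},
    {"Amount": 20.0, "Class": 0},
    {"Amount": 30.0, "Class": 0},
    {"Amount": 100.0, "Class": 1},
    {"Amount": 300.0, "Class": 1},
]


def amount_summary(rows):
    """Group amounts by class and report count, mean, and median."""
    by_class = defaultdict(list)
    for row in rows:
        by_class[row["Class"]].append(row["Amount"])
    return {
        cls: {"count": len(a), "mean": mean(a), "median": median(a)}
        for cls, a in by_class.items()
    }


summary = amount_summary(transactions)
print(summary)

# Missing-value check on the same rows: count entries that are None.
missing = sum(1 for row in transactions if row["Amount"] is None)
print(missing)
```

On a heavily imbalanced dataset, the per-class counts in this summary also make the imbalance visible before any modeling starts.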

Machine Learning

Machine Learning, Python, Exploratory Data Analysis

Harnessing Machine Learning on Big Data with PySpark on AWS

ODSC - Open Data Science

AUGUST 9, 2023

Understanding the Session In this engaging and interactive session, we will delve into PySpark MLlib, an invaluable resource in the field of machine learning, and explore how various classification algorithms can be implemented using AWS Glue/EMR as our platform. But this session goes beyond just concepts and algorithms.

Machine Learning

Machine Learning, AWS, Big Data

Generative AI in Software Development

Mlearning.ai

JUNE 16, 2023

GPT-4 Data Pipelines: Transform JSON to SQL Schema Instantly Blockstream’s public Bitcoin API. The data would be interesting to analyze. From Data Engineering to Prompt Engineering Prompt to do data analysis BI report generation/data analysis In BI/data analysis world, people usually need to query data (small/large).

AI

AI, Data Analysis

Predicting new and existing product sales in semiconductors using Amazon Forecast

AWS Machine Learning Blog

APRIL 6, 2023

We also demonstrate the performance of our state-of-the-art point cloud-based product lifecycle prediction algorithm. Challenges One of the challenges we faced while using fine-grained or micro-level modeling like product-level models for sale prediction was missing sales data.

Machine Learning

Machine Learning, ML

F1 Racing 2024 Strategy Analysis Challenge— Final Classification

Ocean Protocol

OCTOBER 18, 2024

Provided information included telemetry data covering each race, including variables like tire choices, stint lengths, lap times, and pit stop durations. Each participant performed Exploratory Data Analysis (EDA) to uncover relationships between variables like tire degradation and race performance.

Exploratory Data Analysis

Exploratory Data Analysis, Data Scientist, EDA, Data Analysis

Top 15 Data Analytics Projects in 2023 for beginners to Experienced

Pickl AI

JULY 20, 2023

Predictive Analytics Projects: Predictive analytics involves using historical data to predict future events or outcomes. Techniques like regression analysis, time series forecasting, and machine learning algorithms are used to predict customer behavior, sales trends, equipment failure, and more.

Analytics

Analytics, Big Data

Forecasting Carbon Emission Across Continents Research & Data Challenge Review

Ocean Protocol

JANUARY 10, 2024

Here we use data science to diagnose the issues and propose better practices to treat our planet better than the last 30 years. Exploratory Data Analysis (EDA) In Asia, the surge in CO2 and GHG emissions is closely linked to rapid population growth, industrialization, and the rise of emerging economies.

Data Science

Data Science, Exploratory Data Analysis, Support Vector Machines, Data Analysis

Large Language Models: A Complete Guide

Heartbeat

MAY 29, 2023

It is therefore important to carefully plan and execute data preparation tasks to ensure the best possible performance of the machine learning model. It is also essential to evaluate the quality of the dataset by conducting exploratory data analysis (EDA), which involves analyzing the dataset’s distribution, frequency, and diversity of text.

Machine Learning

Machine Learning, Natural Language Processing, Data Preparation

Scaling Kaggle Competitions Using XGBoost: Part 2

PyImageSearch

DECEMBER 12, 2022

With the completion of AdaBoost, we are one more step closer to understanding the XGBoost algorithm.

    import pandas as pd

    # load the data in the form of a csv
    estData = pd.read_csv("/content/realtor-data.csv")

    # drop NaN values from the dataset
    estData = estData.dropna()

    # split the labels and remove non-numeric data
    y = estData["price"].values

Decision Trees

Decision Trees, Deep Learning, Exploratory Data Analysis

Natural Language Processing (NLP) Concepts With NLTK

Heartbeat

MARCH 22, 2023

In this article, let’s dive deep into the Natural Language Toolkit (NLTK) data processing concepts for NLP data. Before building our model, we will also see how we can visualize this data with Kangas as part of exploratory data analysis (EDA). This, in turn, reduces the time complexity and space.

Natural Language Processing

Natural Language Processing, Deep Learning, Machine Learning

MyShell AI is another app to create custom chatbots

Dataconomy

AUGUST 5, 2024

For instance: “Data Consultant bot is designed to assist you with all your data analysis needs. Whether you’re looking to interpret complex datasets, forecast trends, or gain insights from your data, this bot provides expert guidance and practical solutions. This is how others will get to know your bot.

AI

AI, Exploratory Data Analysis, Data Analysis
