This article was published as a part of the Data Science Blogathon. In this blog, we are going to talk about some of the advanced and most-used charts in Plotly for data analysis. Table of contents: Description of Dataset, Data Exploration, Data Cleaning, Data Visualization […].
In this blog, we will discuss exploratory data analysis, also known as EDA, and why it is important. EDA is an iterative process of interrelated activities, including data cleaning, manipulation, and visualization. We will also share code snippets so you can try out different analysis techniques yourself.
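The three activities named above can be sketched in a few lines of pandas. This is a minimal illustration on a hypothetical toy DataFrame (the column names and values are invented for the example):

```python
import pandas as pd

# Hypothetical toy dataset standing in for any tabular data you might explore
df = pd.DataFrame({
    "age": [25, 32, None, 47, 32],
    "income": [40000, 55000, 62000, None, 55000],
})

# Cleaning: drop exact duplicate rows, then fill missing values with column medians
df = df.drop_duplicates()
df = df.fillna(df.median(numeric_only=True))

# Manipulation: derive a new feature from existing columns
df["income_per_year_of_age"] = df["income"] / df["age"]

# Summarize: the usual first look in any EDA pass
print(df.describe())
```

In practice you would iterate: each summary or plot suggests the next round of cleaning or feature work, which is what makes EDA an iterative process rather than a single step.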
ChatGPT plugins can be used to extend the capabilities of ChatGPT in a variety of ways, such as accessing and processing external data, performing complex computations, and using third-party services. In this article, we'll dive into the top 6 ChatGPT plugins tailored for data science.
Introduction Analytics Vidhya DataHour is designed to provide valuable insights and knowledge to individuals looking to build a career in the data-tech industry. These sessions cover a wide range of topics, from artificial intelligence and machine learning to various areas of data science.
This article was published as a part of the Data Science Blogathon. A considerably low vaccination rate has been observed in low-income countries of the world. In this blog, we study […]. The post A Detailed Study on COVID 19 Vaccinations Data appeared first on Analytics Vidhya.
This article was published as a part of the Data Science Blogathon. Hi all, this is my first blog; I hope you all like it. The post Performing Exploratory Data Analysis with SAS and Python appeared first on Analytics Vidhya.
This blog explores ChatGPT, the AI (Artificial Intelligence) technology that has taken the world by storm, and tries to unravel the underlying phenomenon that makes up this seemingly complex technology. What is ChatGPT? What purpose does it serve? Fret not, we are here to answer those questions in this blog.
Many beginners in data science and machine learning focus only on the data analysis and model development part, which is understandable, as deployment is often handled by another team. We will walk through it together, from data analysis to automatic retraining. 1. Establish a Data Science Project 2. […]
Comet is an MLOps platform that offers a suite of tools for machine-learning experimentation and data analysis. It is designed to make it easy to track and monitor experiments and conduct exploratory data analysis (EDA) using popular Python visualization frameworks. Please consider signing up using my referral link.
This is particularly important for relational databases, where data is stored in tables with defined relationships. Another interesting read: Master EDA. Importance of Data Normalization: so, we defined data normalization, and hopefully you've got the idea. Most of these challenges have workarounds.
Data can be scraped from a variety of sources, such as online databases, sensor data, or social media. Cleaning data: once the data has been gathered, it needs to be cleaned. This involves removing any errors or inconsistencies in the data. This information can be used to inform the design of the model.
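A common inconsistency in scraped data is the same value appearing with different casing and stray whitespace. A minimal sketch of that cleaning step, on invented example records:

```python
# Hypothetical city names scraped from different sources, formatted inconsistently
raw = ["  New York", "new york ", "NEW YORK", "Boston", "boston"]

# Cleaning: strip whitespace and normalize case so duplicates become detectable
cleaned = [s.strip().lower() for s in raw]

# Remove duplicates while preserving first-seen order
seen, unique = set(), []
for city in cleaned:
    if city not in seen:
        seen.add(city)
        unique.append(city)

print(unique)  # ['new york', 'boston']
```

Real pipelines layer more checks on top (type coercion, range validation, deduplication keys), but the shape is the same: normalize first, then detect and remove inconsistencies.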
Data science is a popular and vast field; to date, there are many opportunities in it, and most people, whether working professionals or students, want a transition into data science because of its scope. How much to learn? What to do next?
We give recommendations and examples below, with instructors of college- or graduate-level data science or applied statistics courses in mind. Variations: for practice with data wrangling, students can find, download, and prepare data for analysis as part of the assignment, e.g. data from the Snowcast Showdown.
The importance of EDA in the machine learning world is well known to its users. Making visualizations is one of the finest ways for data scientists to explain data analysis to people outside the business. Exploratory data analysis can help you comprehend your data better, which can aid in future data preprocessing.
Embark on Your Data Science Journey through In-Depth Projects and Hands-on Learning. Photo by Wes Hicks on Unsplash. Data science, as an emerging field, is constantly evolving and bringing forth innovative solutions to complex problems. I’ve handpicked a few Kaggle projects covering a range of data science concepts.
Today’s question is, “What does a data scientist do?” Step into the realm of data science, where numbers dance like fireflies and patterns emerge from the chaos of information. In this blog post, we’re embarking on a thrilling expedition to demystify the enigmatic role of data scientists.
This blog explores the difference between mutable and immutable objects in Python. Python is a powerful programming language with a wide range of applications in various industries. Want to start your EDA journey? You can always register for Python for Data Science.
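The mutable/immutable distinction shows up as soon as two names refer to the same object. A short illustration (all values here are made up for the example):

```python
# Immutable types (int, str, tuple): "modifying" one creates a brand-new object
s = "data"
t = s
s += " science"   # rebinds s to a new string; t is untouched
print(t)          # 'data'

# Mutable types (list, dict, set): changes are visible through every reference
a = [1, 2, 3]
b = a
a.append(4)       # mutates the single shared list in place
print(b)          # [1, 2, 3, 4]
print(a is b)     # True: still one object with two names
```

This is why passing a list into a function can have side effects while passing a string cannot, and why immutable objects like tuples may serve as dictionary keys.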
How to create a data science project on GitHub? Data science is one of the most in-demand career fields today, with millions of job opportunities flooding the market. To ensure you have a great career in data science, one of the major requirements is to create and maintain a GitHub data science project.
Text to Speech Dash app IBM Watson’s text-to-speech model is built using machine learning techniques and deep neural networks, trained on large amounts of speech and text data. This blog gives an overview of how to convert text data into speech and how to control speech rate & voice pitch using Watson Speech libraries.
Together, data engineers, data scientists, and machine learning engineers form a cohesive team that drives innovation and success in data analytics and artificial intelligence. Their collective efforts are indispensable for organizations seeking to harness data’s full potential and achieve business growth.
Some projects may necessitate a comprehensive LLMOps approach, spanning tasks from data preparation to pipeline production. Exploratory Data Analysis (EDA). Data collection: the first step in LLMOps is to collect the data that will be used to train the LLM.
Importantly, if extracting the data itself is easy and cheap, naively annotating it can look like a quick solution compared with thinking about how to make use of limited labels. In that case, your task has its own problems, and you have to be careful about your EDA, data cleaning, and labeling.
In this blog, I will walk through AWS SageMaker's capabilities in addressing these questions. An MLOps workflow consists of a series of steps, from data acquisition and feature engineering to training and deployment. [Collaboration] How can multiple data scientists collaborate in real time on the same dataset?
Exploratory Data Analysis (EDA) on Biological Data: A Hands-On Guide. Unraveling the Structural Data of Proteins, Part II: Exploratory Data Analysis. Photo from Pexels. In a previous post, I covered the background of this protein structure resolution data set, including an explanation of key data terminology and details on how to acquire the data.
This is a unique opportunity for data people to dive into real-world data and uncover insights that could shape the future of aviation safety, airline efficiency, and how pilots fly planes. Stay tuned for updates and discussions on our blog page, blog.oceanprotocol.com, for progress throughout the year!
I consider myself a machine learning engineer who enjoys taking part in various machine learning competitions. I initially conducted detailed exploratory data analysis (EDA) to understand the dataset, identifying challenges like duplicate entries and missing Coordinate Reference System (CRS) information.
The early days of the effort were spent on EDA and exchanging ideas with other members of the community. Before models could be built, we needed to understand the data, the strengths and weaknesses of the dataset, and what researchers were looking for out of the CORD-19 dataset.
We will carry out some EDA on our dataset, and then we will log the visualizations onto the Comet experimentation platform. Time series models are statistical models used to analyze and make predictions about data that is collected over time. Without further ado, let’s begin.
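One of the simplest time series models is a moving average over recent observations. A minimal sketch with pandas, on an invented monthly sales series:

```python
import pandas as pd

# Hypothetical monthly sales figures (values invented for the example)
sales = pd.Series(
    [10, 12, 13, 15, 14, 16, 18],
    index=pd.date_range("2023-01-01", periods=7, freq="MS"),
)

# 3-month simple moving average: a minimal baseline time series model
sma = sales.rolling(window=3).mean()

# Naive one-step-ahead forecast: the most recent moving-average value
forecast = sma.iloc[-1]
print(round(forecast, 2))  # 16.0 (mean of the last three months: 14, 16, 18)
```

Real forecasting work would compare such a baseline against richer models (exponential smoothing, ARIMA, and so on), but a moving average is the standard first yardstick.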
Who This Book Is For: this book is for practitioners in charge of building, managing, maintaining, and operationalizing the ML process end to end. Data science / AI / ML leaders: Heads of Data Science, VPs of Advanced Analytics, AI Leads, etc. The book contains a full chapter dedicated to generative AI. Key Takeaways: 1. […]
Python data visualisation libraries offer powerful visualisation tools , ranging from simple charts to interactive dashboards. In this blog, we aim to explore the most popular Python data visualisation libraries, highlight their unique features, and guide you on how to use them effectively.
In the digital age, the abundance of textual information available on the internet, particularly on platforms like Twitter, blogs, and e-commerce websites, has led to an exponential growth in unstructured data. Integration also helps avoid duplication and redundancy of data, providing a comprehensive view of the information.
But they need a lot of labeled training data, and the dataset could be biased. In this article, we’ll learn how to link Comet with Disneyland sentiment analysis. To accomplish this, we will perform some EDA on the Disneyland dataset, and then we will view the visualization on the Comet experimentation platform.
In this blog post, I’m going to show you how to use the lazypredict library on your dataset. You may need to import more libraries for EDA, preprocessing, and so on, depending on the dataset you’re dealing with. Call to action: enjoyed this blog post? Give it a clap and share it with your fellow data enthusiasts!
In a typical MLOps project, similar scheduling is essential to handle new data and track model performance continuously. Load and Explore Data We load the Telco Customer Churn dataset and perform exploratory data analysis (EDA). Experiment Tracking in CometML (Image by the Author) 2.
Introduction: welcome back! Let's continue with our data science journey to create the Stock Price Prediction web application. The scope of this article is quite big; we will exercise the core steps of data science. Let's get started… Project Layout: here are the high-level steps for this project.
Vertex AI combines data engineering, data science, and ML engineering into a single, cohesive environment, making it easier for data scientists and ML engineers to build, deploy, and manage ML models. Data Preparation: begin by ingesting and analysing your dataset.
We use the model preview functionality to perform an initial EDA. This provides us with a baseline that we can use to perform data augmentation, generate a new baseline, and finally get the best model with a model-centric approach using the standard build functionality.
For ML model development, the size of a SageMaker notebook instance depends on the amount of data you need to load in-memory for meaningful exploratory data analyses (EDA) and the amount of computation required. We recommend starting small with general-purpose instances (such as T or M families) and scaling up as needed.
Create DataGrids with image data using Kangas, and load and visualize image data from Hugging Face. Photo by Genny Dimitrakopoulou on Unsplash. Visualizing data to carry out a detailed EDA, especially for image data, is critical. Be sure to explore more on the Kangas repo.
Central to Pandas is the DataFrame object, a versatile structure for managing and analysing data in tabular form. This blog introduces the Pandas DataFrame.loc method, which is crucial for data selection and manipulation.
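`DataFrame.loc` selects by label (row index labels and column names) rather than by position. A small sketch on an invented DataFrame:

```python
import pandas as pd

# Hypothetical data with string row labels, to show label-based selection
df = pd.DataFrame(
    {"name": ["Ann", "Ben", "Cal"], "score": [85, 62, 91]},
    index=["a", "b", "c"],
)

# A single cell, addressed by row label and column label
top = df.loc["c", "score"]            # 91

# Boolean selection: names of rows where score >= 80
high = df.loc[df["score"] >= 80, "name"]

# .loc also supports assignment in place
df.loc["b", "score"] = 70
```

The positional counterpart is `DataFrame.iloc`; mixing the two up is a classic source of off-by-one selection bugs, so it is worth keeping the label/position distinction firmly in mind.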
Photo by Juraj Gabriel on Unsplash Data analysis is a powerful tool that helps businesses make informed decisions. In today’s blog, we will explore the Netflix dataset using Python and uncover some interesting insights. The platform has gained a massive following in recent years, and its popularity shows no signs of slowing down.
From the above EDA, it is clear that the room's temperature, light, and CO2 levels are good occupancy indicators. The exploratory data analysis found that changes in room temperature, CO2 levels, and light intensity can be used to predict the occupancy of the room in place of humidity and humidity ratio.
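Those three indicators feed naturally into a simple classifier. This is a hedged sketch, not the article's actual model, and the sensor readings below are invented toy values chosen to mimic an empty room versus an occupied one:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical readings: [temperature_C, light_lux, co2_ppm]; label 1 = occupied
X = np.array([
    [21.0,  50, 450], [21.5,  40, 470], [22.0,  60, 480],   # empty room
    [23.5, 400, 900], [24.0, 420, 950], [23.8, 380, 880],   # occupied room
])
y = np.array([0, 0, 0, 1, 1, 1])

# A minimal occupancy classifier built on the three indicators from the EDA
model = LogisticRegression().fit(X, y)
print(model.predict([[23.9, 410, 920]]))  # warm, bright, high CO2: occupied
```

On real sensor data you would hold out a test split and check metrics before trusting the model, but this shows how EDA findings translate directly into feature choices.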
It is therefore important to carefully plan and execute data preparation tasks to ensure the best possible performance of the machine learning model. It is also essential to evaluate the quality of the dataset by conducting exploratory data analysis (EDA), which involves analyzing the dataset’s distribution, frequency, and diversity of text.
Now you need to perform some EDA and cleaning on the data after loading it into the notebook. EDA and Data Cleaning: first, check the frequency of the target variable, Category. This variable denotes the type of emotion represented by the Reddit threads.
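Checking the frequency of a target variable is a one-liner with `value_counts`. A sketch on an invented stand-in for the Reddit data (the labels here are hypothetical, not from the actual dataset):

```python
import pandas as pd

# Hypothetical threads, each labeled with an emotion category
df = pd.DataFrame({"Category": ["joy", "anger", "joy", "sadness", "joy", "anger"]})

# First EDA step: the class distribution of the target variable
counts = df["Category"].value_counts()
print(counts)  # joy: 3, anger: 2, sadness: 1
```

A skewed distribution here is an early warning of class imbalance, which would shape later choices such as stratified splitting or class weighting.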