5 Portfolio Projects for Final Year Data Science Students
KDnuggets
SEPTEMBER 5, 2023
From cleaning data to wowing recruiters - this blog shares 5 killer data science projects to launch your data science career and get hired!
Analytics Vidhya
SEPTEMBER 4, 2021
This article was published as a part of the Data Science Blogathon. In this blog, we are going to talk about some of the advanced and most used charts in Plotly for analysis. Table of contents: Description of Dataset, Data Exploration, Data Cleaning, Data Visualization […].
DataRobot Blog
DECEMBER 6, 2022
With a goal to help data science teams learn about the application of AI and ML, DataRobot shares helpful, educational blogs based on work with the world’s most strategic companies. Explore these 10 popular blogs that help data scientists drive better data decisions.
Data Science Dojo
JANUARY 31, 2023
Big data is conventionally understood in terms of its scale. This one-dimensional approach, however, runs the risk of simplifying the complexity of big data. In this blog, we discuss the 10 Vs as metrics to gauge the complexity of big data.
Data Science Dojo
OCTOBER 23, 2023
In this blog post, we are going to share the top 10 YouTube videos for learning about LLMs. ChatGPT is a large language model that can be used for a variety of tasks, including data analysis and visualization. LLMs can be used to build a variety of applications, such as chatbots, virtual assistants, and translation tools.
Data Science Dojo
JANUARY 22, 2023
In this blog, we will discuss exploratory data analysis, also known as EDA, and why it is important. This can be useful for identifying patterns and trends in the data. We will also be sharing code snippets so you can try out different analysis techniques yourself. So, without any further ado let’s dive right in.
phData
NOVEMBER 4, 2024
Snowflake excels in efficient data storage and governance, while Dataiku provides the tooling to operationalize advanced analytics and machine learning models. Together they create a powerful, flexible, and scalable foundation for modern data applications.
Towards AI
JANUARY 11, 2024
PandasAI uses the power of LLMs to help us explore and clean data. It is a conversational tool that we can use to ask Pandas to manipulate data the way we want. To use PandasAI, we need to install… Read the full blog for free on Medium.
Dataconomy
APRIL 17, 2025
Pro Tip: “Treat AI like a new hire: train it with clean data, document its decisions, and supervise its work.” […] However, if you just let things be and do not train AI, you may face dire consequences from the risks you let grow in your own backyard.
Tableau
JUNE 4, 2021
Welcome to our monthly highlight of data viz tips, tricks and inspiration produced by the Tableau Community. Avinash Reddy Munnangi recently wrote a blog post on 10 Reasons Why You Need a Tableau Public Profile , and it’s spot on! Will Sutton, guest blog on The Flerlage Twins : Tableau Public APIs Plus a VOTD Data Set.
Data Science Dojo
JULY 5, 2023
The following steps are involved in pipeline development. Gathering data: The first step is to gather the data that will be used to train the model. Data can be scraped from a variety of sources, such as online databases, sensor data, or social media. Cleaning data: Once the data has been gathered, it needs to be cleaned.
Data Science Blog
AUGUST 22, 2024
The effectiveness of generative AI is linked to the data it uses. Similar to how a chef needs fresh ingredients to prepare a meal, generative AI needs well-prepared, clean data to produce outputs. Businesses need to understand the trends in data preparation to adapt and succeed.
NYU Center for Data Science
JULY 18, 2024
This entry is part of our Meet the Fellow blog series, which introduces and highlights Faculty Fellows who have recently joined CDS. Colner received his PhD in Political Science from the University of California, Davis in 2024, and has a keen interest in leveraging data science to understand local political institutions.
SAS Software
JULY 19, 2024
“How will we catch up when technology seems to change overnight, nearly every night?” It’s a surprisingly common […] The post “The one constant in our AI future? Data” appeared first on SAS Blogs.
ML @ CMU
MARCH 22, 2024
This blog post will delve into the unique challenges presented by off-road racing environments, describe our efforts in creating datasets that capture these conditions, and discuss methods and benchmarks for improving computer vision models to robustly handle the extreme variability inherent in off-road racing.
ML @ CMU
MARCH 25, 2024
This blog post will delve into the unique challenges presented by off-road racing environments, describe our efforts in creating datasets that capture these conditions, and discuss methods and benchmarks for improving computer vision models to robustly handle the extreme variability inherent in off-road racing.
IBM Journey to AI blog
NOVEMBER 6, 2023
This method requires the enterprise to have clean data flows from central sources of truth to accurately track and reflect usage. Watsonx.data allows enterprises to centrally gather, categorize and filter data from multiple sources. With usage-based pricing of products, SMBs pay for only what they use.
AWS Machine Learning Blog
MAY 13, 2024
In part 1 of this blog series, we discussed how a large language model (LLM) available on Amazon SageMaker JumpStart can be fine-tuned for the task of radiology report impression generation. For more details on the definition of various forms of this score, please refer to part 1 of this blog. 5708 and dev2=.4525)
Towards AI
AUGUST 25, 2023
By using amplified features generated from trustworthy data sources, even simple linear regressions can yield highly accurate results. In this blog post, I will discuss the importance of data in solving real-world… Read the full blog for free on Medium.
IBM Journey to AI blog
JUNE 13, 2024
And it’s critical for us to have clean data in the system.” As her team ingests data, they are constantly studying it and verifying it – because if your data is stale, nothing else will be accurate. “We are very diligent about governing the platform we have.
Towards AI
OCTOBER 18, 2023
Let’s see how good and bad it can be. A big part of most data-related jobs is cleaning the data. There is usually no standard way of cleaning data, as it can arrive in many different forms.
Towards AI
OCTOBER 18, 2023
In-depth data analysis using GPT-4’s data visualization toolset. Efficiency is everything for coders and data analysts. With GPT-4’s Advanced Data Analysis (ADA) toolset, this process becomes significantly more streamlined. Let’s get to it.
Alation
JANUARY 20, 2022
Monitor and Measure with data quality remediation plans. These are useful in finding repeatable data issues, which will influence how you adapt your data governance framework. It also informs how you clean data and reeducate personnel at the data source within the data catalog.
Towards AI
FEBRUARY 21, 2023
AI being in the limelight has spawned a deluge of thought pieces, articles, videos, blog posts, and podcasts. With this narrowed scope in mind, our approach will be to use ChatGPT to write custom quality metrics through Encord Active that we can run over the data, labels, and model predictions to filter and clean data in our panda problem.
Dataconomy
AUGUST 16, 2023
Today’s question is, “What does a data scientist do?” Step into the realm of data science, where numbers dance like fireflies and patterns emerge from the chaos of information. In this blog post, we’re embarking on a thrilling expedition to demystify the enigmatic role of data scientists.
AWS Machine Learning Blog
NOVEMBER 29, 2023
With over 300 built-in transformations powered by SageMaker Data Wrangler, SageMaker Canvas empowers you to rapidly wrangle the loan data. For this dataset, use Drop missing and Handle outliers to clean data, then apply One-hot encode, and Vectorize text to create features for ML.
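The cleaning and encoding steps named in this excerpt can be illustrated outside of SageMaker. The following is a minimal sketch in plain Python, not the Data Wrangler implementation: the loan records, the field names, and the median-based outlier cap are all assumptions made for illustration, and text vectorization is left out.

```python
from statistics import median

# Hypothetical loan records; field names are illustrative only.
loans = [
    {"amount": 5000.0, "purpose": "car"},
    {"amount": None, "purpose": "home"},        # missing value
    {"amount": 7000.0, "purpose": "home"},
    {"amount": 6000.0, "purpose": "education"},
    {"amount": 900000.0, "purpose": "car"},     # extreme outlier
]


def drop_missing(rows):
    """Keep only rows in which every field has a value."""
    return [r for r in rows if all(v is not None for v in r.values())]


def clip_outliers(rows, field, factor=3.0):
    """Cap values above factor * median (an assumed, simple outlier rule)."""
    cap = factor * median(r[field] for r in rows)
    return [{**r, field: min(r[field], cap)} for r in rows]


def one_hot(rows, field):
    """Replace a categorical field with one 0/1 column per category."""
    categories = sorted({r[field] for r in rows})
    encoded = []
    for r in rows:
        new_row = {k: v for k, v in r.items() if k != field}
        for c in categories:
            new_row[f"{field}_{c}"] = 1 if r[field] == c else 0
        encoded.append(new_row)
    return encoded


# Chain the steps: drop missing rows, cap outliers, then encode categories.
features = one_hot(clip_outliers(drop_missing(loans), "amount"), "purpose")
```

The order matters in this sketch: rows with missing amounts are dropped first so that the median used for the cap is computed only from real values.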
Mlearning.ai
FEBRUARY 21, 2023
Data wrangling prepares raw data for analysis by cleaning, converting, and manipulating it. It might be a time-consuming operation but it is a necessary stage in data analysis. This blog article will look at manipulating data using Python and Jupyter Notebooks.
JULY 10, 2023
During training, the input data is intentionally corrupted by adding noise, while the target remains the original, uncorrupted data. The autoencoder learns to reconstruct the clean data from the noisy input, making it useful for image denoising and data preprocessing tasks.
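The training setup described here (noisy input, clean target) can be sketched with a tiny linear autoencoder. This is an illustrative toy under stated assumptions, not the model from the original post: the synthetic data, the 2-unit bottleneck, the noise level, the learning rate, and the plain gradient-descent loop are all choices made for this example, and NumPy is assumed to be installed.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic "clean" data: points along one direction in 4-D space (assumption).
t = rng.normal(size=(200, 1))
direction = np.array([[1.0, -1.0, 0.5, 2.0]])
clean = t @ direction

# Corrupt the input with Gaussian noise; the target stays clean.
noisy = clean + 0.3 * rng.normal(size=clean.shape)

# Linear encoder (4 -> 2) and decoder (2 -> 4) with small random weights.
W_enc = 0.1 * rng.normal(size=(4, 2))
W_dec = 0.1 * rng.normal(size=(2, 4))


def reconstruction_loss(w_enc, w_dec):
    """Mean squared error between the reconstruction and the clean target."""
    return float(np.mean((noisy @ w_enc @ w_dec - clean) ** 2))


initial_loss = reconstruction_loss(W_enc, W_dec)

learning_rate = 0.01
for _ in range(500):
    hidden = noisy @ W_enc            # encode the noisy input
    error = hidden @ W_dec - clean    # compare the output to the clean target
    grad_dec = hidden.T @ error / len(clean)
    grad_enc = noisy.T @ (error @ W_dec.T) / len(clean)
    W_dec -= learning_rate * grad_dec
    W_enc -= learning_rate * grad_enc

final_loss = reconstruction_loss(W_enc, W_dec)
```

The key detail is in the loop: the network only ever sees `noisy` as input, while the error is measured against `clean`, which is what pushes it toward denoising rather than copying its input.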
IBM Journey to AI blog
DECEMBER 4, 2023
Accurate, clean data and workflows prevent disruptions and downtime once the system goes live. Specifically, to ensure the accuracy of data, organizations should test the following variables: Data archive: Make sure older data that might not have been imported to Oracle is archived securely and is easy to access.
Pickl AI
MARCH 10, 2023
However, despite being a lucrative career option, Data Scientists face several challenges occasionally. The following blog will discuss the familiar Data Science challenges professionals face daily. Conclusion Thus, the above blog has provided you with the everyday challenges in Data Science.
Mlearning.ai
APRIL 25, 2023
Data analysis is a powerful tool that helps businesses make informed decisions. In today’s blog, we will explore the Netflix dataset using Python and uncover some interesting insights. Let’s explore the dataset further by cleaning data and creating some visualizations. df.isnull().sum()
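The `df.isnull().sum()` call in this excerpt counts missing values per column. A minimal sketch of how it might be used is shown below; the small DataFrame is a hypothetical stand-in for the Netflix dataset, and the column names and cleaning choices are assumptions rather than the original author's steps.

```python
import pandas as pd

# Hypothetical stand-in for the Netflix dataset; column names are assumptions.
df = pd.DataFrame({
    "title": ["A", "B", "C", "D"],
    "director": ["X", None, None, "Y"],
    "release_year": [2019, 2020, None, 2021],
})

# Count missing values in each column.
missing = df.isnull().sum()

# One possible cleaning choice: fill missing directors with a placeholder,
# then drop rows that have no release year.
cleaned = df.fillna({"director": "Unknown"}).dropna(subset=["release_year"])
```

Whether to fill or drop depends on the column: a placeholder keeps the row available for other analyses, while dropping is safer when the missing field is needed for the plot or model at hand.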
IBM Journey to AI blog
AUGUST 15, 2023
Building and training foundation models: Creating foundation models starts with clean data. This includes building a process to integrate, cleanse, and catalog the full lifecycle of your AI data. A hybrid multicloud environment offers this, giving you choice and flexibility across your enterprise.
Pickl AI
JULY 12, 2023
Furthermore, with the ability to manipulate data efficiently, companies can unlock their true potential, which can eventually help in boosting their productivity and gain a competitive edge. Key Features of Data Manipulation Data Filtering Filtering of data is an integral aspect of data manipulation.
IBM Journey to AI blog
FEBRUARY 23, 2024
Clean data is fundamental for training your AI. The quality of data fed into your AI system directly impacts its learning and accuracy. Helping to ensure that the data is relevant, comprehensive, and free from biases is crucial for practical AI training.
Heartbeat
NOVEMBER 6, 2023
Imagine that this is a directed cyclic graph (DCG), as shown in the image below, in which the clean data task depends on the extract weather data task while, in turn, the extract weather data task depends on the clean data task. Weather Pipeline as a Directed Cyclic Graph (DCG). So, how does a DAG solve this problem?
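The circular dependency described here can be made concrete with Python's standard-library `graphlib` module (Python 3.9+). This is a small sketch, not the scheduler the original post uses; the task names are taken from the excerpt and the helper `execution_order` is a hypothetical name introduced for this example.

```python
from graphlib import CycleError, TopologicalSorter

# Each key maps a task to the set of tasks it depends on.
cyclic = {
    "clean_data": {"extract_weather_data"},
    "extract_weather_data": {"clean_data"},   # circular dependency
}
acyclic = {
    "extract_weather_data": set(),
    "clean_data": {"extract_weather_data"},
}


def execution_order(dependencies):
    """Return a valid run order, or None if the graph contains a cycle."""
    try:
        return list(TopologicalSorter(dependencies).static_order())
    except CycleError:
        return None


cyclic_order = execution_order(cyclic)
acyclic_order = execution_order(acyclic)
```

With the cycle present there is no task that can run first, so no order exists; once the graph is acyclic, the extract task is scheduled before the clean task.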
Snorkel AI
OCTOBER 23, 2023
Wayfair and Snorkel developed a workflow that incorporated data preprocessing, curation, and iterative development to extract and apply visual data to product labels. Using Snorkel Flow, Wayfair can clean data, remove outliers and duplicates, and quickly prepare training and evaluation datasets with strategic sampling and prompting.
Pickl AI
DECEMBER 3, 2024
According to a report from Statista, the global big data market is expected to grow to over $103 billion by 2027, highlighting the increasing importance of data handling practices. Key Takeaways Data preprocessing is crucial for effective Machine Learning model training.
Pickl AI
MARCH 12, 2023
Significantly, the use of Excel in Data Analysis is beneficial in keeping records of data over time and enabling data visualization effectively. How to use Excel in Data Analysis and why is it important? Let’s find out in the blog! What is Data Analysis?
Pickl AI
JULY 25, 2023
The following blog is a complete guide to algorithmic bias, what it is and how to avoid it, helping you learn about bias in ML. Algorithmic bias refers to the presence of unfair or discriminatory outcomes produced by algorithms or machine learning models due to biased data or design choices. What is Algorithmic Bias?
Tableau
JULY 28, 2020
Ryan Cairnes Senior Manager, Product Management, Tableau Hannah Kuffner July 28, 2020 - 10:43pm March 20, 2023 Tableau Prep is a citizen data preparation tool that brings analytics to anyone, anywhere. With Prep, users can easily and quickly combine, shape, and clean data for analysis with just a few clicks.
Pickl AI
AUGUST 28, 2023
Direct Query and Import: Users can import data into Power BI or create direct connections to databases for real-time data analysis. Data Transformation and Modeling: Power Query: This feature enables users to shape, transform, and clean data from various sources before visualization.