Remove Clean Data Remove Definition Remove K-nearest Neighbors
article thumbnail

Debugging data to build better and more fair ML applications

Snorkel AI

You can approximate your machine learning training components into some simpler classifiers—for example, a k-nearest neighbors classifier. Here’s one application where you have a 100% clean data set that also has some fairness issues, meaning that if you clean up the whole dataset, the model could be unfair.

ML 52
article thumbnail

Debugging data to build better and more fair ML applications

Snorkel AI

You can approximate your machine learning training components into some simpler classifiers—for example, a k-nearest neighbors classifier. Here’s one application where you have a 100% clean data set that also has some fairness issues, meaning that if you clean up the whole dataset, the model could be unfair.

ML 52
article thumbnail

[Updated] 100+ Top Data Science Interview Questions

Mlearning.ai

The following figure represents the life cycle of data science. It starts with gathering the business requirements and relevant data. Once the data is acquired, it is maintained by performing data cleaning, data warehousing, data staging, and data architecture. Why is data cleaning crucial?