Article and Data Quality - Data Science Current

Why Data Quality is the Secret Ingredient to AI Success

insideBIGDATA

NOVEMBER 1, 2024

In this contributed article, engineering leader Uma Uppin emphasizes that high-quality data is fundamental to effective AI systems, as poor data quality leads to unreliable and potentially costly model outcomes.

Data Quality

Data Quality Data Governance AI

Scaling Data Quality with Computer Vision on Spatial Data

insideBIGDATA

FEBRUARY 12, 2024

In this contributed article, editorial consultant Jelani Harper discusses a number of hot topics today: computer vision, data quality, and spatial data. The utility of computer vision for data quality is evident from several high-profile use cases.

Data Quality

Data Quality Machine Learning Deep Learning

Cloud Migration Alone Won’t Solve Data Quality. Here’s Why CDOs Need a More Holistic Approach

insideBIGDATA

APRIL 1, 2024

In this contributed article, Emmet Townsend, VP of Engineering at Inrupt, discusses how cloud migration is just one step to achieving comprehensive data quality programs, not the entire strategy.

Data Quality

Data Quality Big Data

Innovations in Analytics: Elevating Data Quality with GenAI

Towards AI

OCTOBER 31, 2024

Data analytics has become a key driver of commercial success in recent years. The ability to turn large data sets into actionable insights can mean the difference between a successful campaign and missed opportunities. Flipping the paradigm: Using AI to enhance data quality. What if we could change the way we think about data quality?

Data Quality

Data Quality Analytics Clean Data

Business Leaders Must Prioritize Data Quality to Ensure Lasting AI Implementation

insideBIGDATA

JULY 24, 2024

In this contributed article, Subbiah Muthiah, CTO of Emerging Technologies at Qualitest, takes a deep dive into how raw data can throw specialized AI into disarray. While raw data has its uses, properly processed data is vital to the success of niche AI.

Data Quality

Data Quality AI Big Data

State of Data Quality Report

insideBIGDATA

JUNE 13, 2023

Bigeye, the data observability company, announced the results of its 2023 State of Data Quality survey. The report sheds light on the most pervasive problems in data quality today. The report, which was researched and authored by Bigeye, consisted of answers from 100 survey respondents.

Data Quality

Data Quality Data Observability Big Data

The Importance of Data Quality in Benefits

insideBIGDATA

JUNE 6, 2023

In this contributed article, Peter Nagel, VP of Engineering at Noyo, addresses the benefits/insurance industry’s roadblocks and opportunities — and why some of the most interesting data innovations will soon be happening in benefits.

Data Quality

Data Quality Big Data

Data Errors in Financial Services: Addressing the Real Cost of Poor Data Quality

The Data Administration Newsletter

NOVEMBER 20, 2024

Data quality issues continue to plague financial services organizations, resulting in costly fines, operational inefficiencies, and damage to reputations. Key Examples of Data Quality Failures — […]

Data Quality

Data Quality Data Silos Data Governance

Implementing Data Quality Assurance in Data Science Pipelines with Great Expectations

KDnuggets

JANUARY 8, 2025

This article shows how to use Great Expectations to check data quality in data science projects.

Data Quality

Data Quality Data Science

10 Most Common Data Quality Issues and How to Fix Them

KDnuggets

NOVEMBER 22, 2022

Ensuring data quality guarantees more data-informed decisions. Hence, this article highlights the common data quality issues and ways to overcome them.

Data Quality

Data Quality Data Science

Data Quality Dimensions: Assuring Your Data Quality with Great Expectations

KDnuggets

MARCH 23, 2023

This article highlights the significance of ensuring high-quality data and presents six key dimensions for measuring it. These dimensions include Completeness, Consistency, Integrity, Timeliness, Uniqueness, and Validity.

Data Quality

Data Quality Data Engineering Data Engineer

In 2024, Data Quality and AI Will Open New Doors

insideBIGDATA

MAY 1, 2024

In this contributed article, Stephany Lapierre, Founder and CEO of Tealbook, discusses how AI can help streamline procurement processes, reduce costs and improve supplier management, while also addressing common concerns and challenges related to AI implementation like data privacy, ethical considerations and the need for human oversight.

Data Quality

Data Quality AI Big Data

The Problem with ‘Dirty Data’ — How Data Quality Can Impact Life Science AI Adoption

insideBIGDATA

JUNE 15, 2023

Jason Smith, Chief Technology Officer, AI & Analytics at Within3, highlights how many life science data sets contain unclean, unstructured, or highly regulated data that reduces the effectiveness of AI models. Life science companies must first clean and harmonize their data for effective AI adoption.

Data Quality

Data Quality AI Analytics

Why Reinforcement Learning Will Save Generative AI

insideBIGDATA

AUGUST 12, 2023

In this contributed article, Kim Stagg, VP of Product for Appen, explains that the only way to achieve functional AI models is to use high-quality data in every stage of deployment.

Data Quality

Data Quality AI Big Data

Study Finds Data Quality is Still the Largest Obstacle for Successful AI and Greater Human Expertise Needed Across ML Ops Lifecycle

insideBIGDATA

MAY 28, 2023

iMerit, a leading artificial intelligence (AI) data solutions company, released its 2023 State of ML Ops report, which includes a study outlining the impact of data on wide-scale commercial-ready AI projects.

Data Quality

Data Quality ML Artificial Intelligence

Data Quality Metrics Best Practices

Dataversity

MARCH 17, 2025

The amount of data we deal with has increased rapidly (close to 50TB, even for a small company), whereas 75% of leaders don’t trust their data for business decision-making. Though these are two different stats, the common denominator playing a role could be data quality. With new data flowing from almost every direction, there must be a yardstick or […]

Data Quality

Data Quality Data Governance

Unit Test framework and Test Driven Development (TDD) in Python

Analytics Vidhya

SEPTEMBER 2, 2021

This article was published as a part of the Data Science Blogathon. Overview: Running data projects takes a lot of time. Poor data results in poor judgments. Running unit tests in data science and data engineering projects assures data quality. You know your code does what you want it to do.

Python

Python Data Science Data Quality Data Engineering

What I Learned from Executing Data Quality Projects

The Data Administration Newsletter

MAY 4, 2022

Getting to great data quality need not be a blood sport! This article aims to provide some practical insights gained from enterprise master data quality projects undertaken within the past […].

Data Quality

Knowledge Enhanced Machine Learning: Techniques & Types

Analytics Vidhya

DECEMBER 30, 2022

This article was published as a part of the Data Science Blogathon. Introduction: In machine learning, data is an essential part of training machine learning algorithms. The amount and quality of the data strongly affect the results from the machine learning algorithms.

Machine Learning

Machine Learning Algorithm Data Quality

Sigmoid Function: Derivative and Working Mechanism

Analytics Vidhya

DECEMBER 28, 2022

This article was published as a part of the Data Science Blogathon. Choosing the most appropriate activation function can help one get better results even with reduced data quality; hence, […].

Deep Learning

Deep Learning Data Quality Data Science

Alation Unveils AI Governance Solution to Power Safe and Reliable AI for Enterprises

insideBIGDATA

OCTOBER 12, 2024

The solution ensures that AI models are developed using secure, compliant, and well-documented data.

Data Quality

Data Quality AI Data Governance

The Cool Kids Corner: Data Quality Is Not a Fish You Can Catch

Dataversity

NOVEMBER 6, 2023

This is my monthly check-in to share with you the people and ideas I encounter as a data evangelist with DATAVERSITY. This month we’re talking about Data Quality (DQ). (Read last month’s column here.)

Data Quality

Data Quality Data Governance

Good Data Quality Is the Secret to Successful GenAI Implementation

Dataversity

APRIL 22, 2024

So why are many technology leaders attempting to adopt GenAI technologies before ensuring their data quality can be trusted? Reliable and consistent data is the bedrock of a successful AI strategy.

Data Quality

Data Quality AI Data Governance

Mind the Gap: Did You Know About the ISO 25000 Series Data Quality Standards? Me Neither

Dataversity

APRIL 3, 2025

This is the first in a two-part series exploring Data Quality and the ISO 25000 standard. Ripper orders a nuclear strike on the USSR. Despite efforts to recall the bombers, one plane successfully drops a […]

Data Quality

Data Quality Data Governance

How Solving the Big Data Problem Can Fix B2B Ecommerce

insideBIGDATA

APRIL 3, 2024

In this contributed article, Jonathan Taylor, CTO of Zoovu, highlights how many B2B executives believe ecommerce is broken in their organizations due to data quality issues.

Big Data

Big Data Data Quality AI

Data-Driven Companies Leverage OCR for Optimal Data Quality

Smart Data Collective

SEPTEMBER 29, 2022

Find out in this article how your company can benefit from the use of OCR. This article reveals all! Even so, it takes time and can quickly become an obstacle to the smooth running of your business.

Data Quality

Data Quality Big Data Artificial Intelligence

How to Implement a Data Quality Framework

Dataversity

AUGUST 22, 2022

They have the data they need, but due to the presence of intolerable defects, they cannot use it as needed. These defects – also called Data Quality issues – must be fetched and fixed so that data can be used for successful business […].

Data Quality

Data Quality Data Governance

Data Quality Dimensions: How Do You Measure Up? (+ Downloadable Scorecard)

Precisely

JANUARY 9, 2024

Data can only deliver business value if it has high levels of data integrity. That starts with good data quality, contextual richness, integration, and sound data governance tools and processes. This article focuses primarily on data quality. How can you assess your data quality?

Data Quality

Data Quality Database Data Governance Analytics

Data Speaks for Itself: Data Quality Management in the Age of Language Models

The Data Administration Newsletter

FEBRUARY 5, 2025

Unsurprisingly, my last two columns discussed artificial intelligence (AI), specifically the impact of language models (LMs) on data curation. My August 2024 column, The Shift from Syntactic to Semantic Data Curation and What It Means for Data Quality, and my November 2024 column, Data Validation, the Data Accuracy Imposter or Assistant?

Data Quality

Data Quality Artificial Intelligence AI

Why Is Data Quality Still So Hard to Achieve?

Dataversity

OCTOBER 25, 2023

In fact, it’s been more than three decades of innovation in this market, resulting in the development of thousands of data tools and a global data preparation tools market size that’s set […]

Data Quality

Data Quality Data Preparation Algorithm Data Silos

Through the Looking Glass: What Does Data Quality Mean for Unstructured Data?

The Data Administration Newsletter

DECEMBER 4, 2024

We have lots of data conferences here. I’ve taken to asking a question at these conferences: What does data quality mean for unstructured data? Over the years, I’ve seen a trend — more and more emphasis on AI. This is my version of […]

Data Quality

Data Quality AI Artificial Intelligence

How to Assess Data Quality Readiness for Modern Data Pipelines

Dataversity

FEBRUARY 13, 2023

The key to being truly data-driven is having access to accurate, complete, and reliable data. In fact, Gartner recently found that organizations believe […]

Data Pipeline

Data Pipeline Data Quality Data Silos Data Governance

#47 Building a NotebookLM Clone, Time Series Clustering, Instruction Tuning, and More!

Towards AI

OCTOBER 31, 2024

Meme shared by ghost_in_the_machine. TAI Curated section, Article of the week: How I Developed a NotebookLM Clone? By Vatsal Saglani. This article explores the creation of PDF2Pod, a NotebookLM clone that transforms PDF documents into engaging, multi-speaker podcasts. Our must-read articles: 1. Meme of the week!

Clustering

Clustering AI Machine Learning

RAG (Retrieval Augmented Generation) Architecture for Data Quality Assessment

Dataversity

JULY 12, 2024

At their core, LLMs are trained on large amounts of content and data, and the architecture […] It is estimated that by 2025, 50% of digital work will be automated through these LLM models.

Data Quality

Data Quality Artificial Intelligence AI

Who Done It? 3 Possible Suspects in this Halloween’s Bad Data Horror Movie, And How Data Teams Can Make It Out Alive

insideBIGDATA

OCTOBER 31, 2023

In this contributed article, Lior Gavish, CTO and Co-Founder of Monte Carlo, outlines some of the ways companies can erase themselves from ever appearing in these bad data horror stories, ranging from simple tips to bolster governance within their organization, to tools and best practices that will save data teams the time, hassle, and headache that […]

Data Quality

Data Quality Big Data

LLM stack layers

Dataconomy

APRIL 24, 2025

Data layer: The data layer serves as the bedrock of LLM development, emphasizing the critical importance of data quality and variety. Importance of the data layer: The effectiveness of an LLM relies heavily on the data it is trained on.

Data Quality

How to Ensure Data Quality and Consistency in Master Data Management

Dataversity

APRIL 1, 2024

This reliance has spurred a significant shift across industries, driven by advancements in artificial intelligence (AI) and machine learning (ML), which thrive on comprehensive, high-quality data.

Data Quality

Data Quality Artificial Intelligence Machine Learning

Elevating customer experience: The rise of generative AI and conversational data analytics

Flipboard

JUNE 15, 2023

This article is part of a VB special issue. Read the full series here: Building the foundation for customer data quality. The rapid advancement of artificial intelligence (AI) and machine learning (ML) technologies is pushing the boundaries of what can be achieved in marketing, customer experience …

Artificial Intelligence

Artificial Intelligence Data Quality Machine Learning

Change Data Capture and the Value of Real-Time Data Integration

Dataversity

APRIL 24, 2025

Business insights are only as good as the accuracy of the data on which they are built. According to Gartner, data quality is important to organizations in part because poor data quality costs organizations at least $12.9 million a year on average.

Data Quality

Data Quality Data Pipeline ETL Database

DGIQ + AIGov Conference: Takeaways and Trending Topics in AI Governance

Dataversity

APRIL 4, 2025

These takeaways include my overall professional impressions and a high-level review of the most prominent topics discussed in the conference’s core subject areas: data governance, data quality, and AI governance.

Data Governance

Data Governance Data Quality AI

Training-serving skew

Dataconomy

APRIL 29, 2025

Understanding how discrepancies between training data and operational data can impact model performance is essential for developing robust systems. This article explores the concept of training-serving skew, illustrating its implications and offering strategies to mitigate it. What is training-serving skew?

Machine Learning

Machine Learning Data Preparation Data Quality

Data Sips: Interview with Tom Redman

Dataversity

JANUARY 31, 2025

Data Sips is a new video miniseries presented by Ippon Technologies and DATAVERSITY that showcases quick conversations with industry experts from last month’s Data Governance & Information Quality (DGIQ) Conference in Washington, D.C.

Data Governance

Data Governance Data Quality

Data Integrity: The Last Mile Problem of Data Observability

Dataversity

NOVEMBER 7, 2022

Data quality issues have been a long-standing challenge for data-driven organizations. Even with significant investments, the trustworthiness of data in most organizations is questionable at best. Gartner reports that companies lose an average of $14 million per year due to poor data quality.

Data Observability

Data Observability Data Quality Data Governance

Documenting Critical Data Elements

The Data Administration Newsletter

FEBRUARY 21, 2024

Many Data Governance or Data Quality programs focus on “critical data elements,” but what are they and what are some key features to document for them? A critical data element is any data element in your organization that has a high impact on your organization’s ability to execute its business strategy.

Data Governance

Data Governance Data Quality
