Data Governance, Data Models and Data Pipeline

CI/CD for Data Pipelines: A Game-Changer with AnalyticsCreator

Data Science Blog

MAY 20, 2024

Continuous Integration and Continuous Delivery (CI/CD) for Data Pipelines: It is a Game-Changer with AnalyticsCreator! The need for efficient and reliable data pipelines is paramount in data science and data engineering. They transform data into a consistent format for users to consume.

Data Pipeline

Data Pipeline Data Warehouse Azure Data Lakes

Essential data engineering tools for 2023: Empowering for management and analysis

Data Science Dojo

JULY 6, 2023

Data engineering tools are software applications or frameworks specifically designed to facilitate the process of managing, processing, and transforming large volumes of data. Spark offers a rich set of libraries for data processing, machine learning, graph processing, and stream processing.

Data Engineer

Data Engineer Data Engineering Data Engineering Data Engineering

Testing and Monitoring Data Pipelines: Part Two

Dataversity

JUNE 19, 2023

In part one of this article, we discussed how data testing can specifically test a data object (e.g., table, column, metadata) at one particular point in the data pipeline.

Data Pipeline

Data Pipeline Database Data Modeling Data Models

Webinars

How to Achieve High-Accuracy Results When Using LLMs

MORE WEBINARS

Discover the Most Important Fundamentals of Data Engineering

Pickl AI

NOVEMBER 4, 2024

Summary: The fundamentals of Data Engineering encompass essential practices like data modelling, warehousing, pipelines, and integration. Understanding these concepts enables professionals to build robust systems that facilitate effective data management and insightful analysis. What is Data Engineering?

Data Engineer

Data Engineer Data Engineering Data Engineering Data Engineering

The Evolution of Customer Data Modeling: From Static Profiles to Dynamic Customer 360

phData

SEPTEMBER 27, 2024

Introduction: The Customer Data Modeling Dilemma You know, that thing we’ve been doing for years, trying to capture the essence of our customers in neat little profile boxes? For years, we’ve been obsessed with creating these grand, top-down customer data models. Yeah, that one.

Data Modeling

Data Modeling Data Models Apache Kafka Data Lakes

Self-Service Analytics for Google Cloud, now with Looker and Tableau

Tableau

OCTOBER 8, 2021

Leveraging Looker’s semantic layer will provide Tableau customers with trusted, governed data at every stage of their analytics journey. With its LookML modeling language, Looker provides a unique, modern approach to define governed and reusable data models to build a trusted foundation for analytics.

Tableau

Tableau Analytics Analytics Machine Learning

The Data Dilemma: Exploring the Key Differences Between Data Science and Data Engineering

Pickl AI

JULY 25, 2023

Data engineers are essential professionals responsible for designing, constructing, and maintaining an organization’s data infrastructure. They create data pipelines, ETL processes, and databases to facilitate smooth data flow and storage. Big Data Processing: Apache Hadoop, Apache Spark, etc.

Data Engineer

Data Engineer Data Engineering Data Engineering Data Engineering

DataOps vs. DevOps: What’s the Difference?

Alation

AUGUST 3, 2021

It brings together business users, data scientists , data analysts, IT, and application developers to fulfill the business need for insights. DataOps then works to continuously improve and adjust data models, visualizations, reports, and dashboards to achieve business goals. The Agile Connection.

DataOps

DataOps Data Pipeline Data Analyst Analytics

Maximize the Power of dbt and Snowflake to Achieve Efficient and Scalable Data Vault Solutions

phData

AUGUST 10, 2023

In data vault implementations, critical components encompass the storage layer, ELT technology, integration platforms, data observability tools, Business Intelligence and Analytics tools, Data Governance , and Metadata Management solutions. The most important reason for using DBT in Data Vault 2.0

SQL

SQL Data Observability Data Quality Data Pipeline

Self-Service Analytics for Google Cloud, now with Looker and Tableau

Tableau

OCTOBER 8, 2021

Leveraging Looker’s semantic layer will provide Tableau customers with trusted, governed data at every stage of their analytics journey. With its LookML modeling language, Looker provides a unique, modern approach to define governed and reusable data models to build a trusted foundation for analytics.

Tableau

Tableau Analytics Analytics Machine Learning

MLOps Landscape in 2023: Top Tools and Platforms

The MLOps Blog

JUNE 27, 2023

Model versioning, lineage, and packaging : Can you version and reproduce models and experiments? Can you see the complete model lineage with data/models/experiments used downstream? It sits between the data lake and cloud object storage, allowing you to version and control changes to data lakes at scale.

Machine Learning

Machine Learning Machine Learning ML ML

Implementing Gen AI for Financial Services

Iguazio

FEBRUARY 20, 2024

This includes management vision and strategy, resource commitment, data and tech and operating model alignment, robust risk management and change management. The required architecture includes a data pipeline, ML pipeline, application pipeline and a multi-stage pipeline. Read more here.

AI

AI AI Data Pipeline Data Quality

Who is a BI Developer: Role, Responsibilities & Skills

Pickl AI

JULY 3, 2023

It is the process of converting raw data into relevant and practical knowledge to help evaluate the performance of businesses, discover trends, and make well-informed choices. Data gathering, data integration, data modelling, analysis of information, and data visualization are all part of intelligence for businesses.

Business Intelligence

Business Intelligence Business Intelligence SQL Data Visualization

Data architecture strategy for data quality

IBM Journey to AI blog

JANUARY 5, 2023

What does a modern data architecture do for your business? A modern data architecture like Data Mesh and Data Fabric aims to easily connect new data sources and accelerate development of use case specific data pipelines across on-premises, hybrid and multicloud environments.

Data Quality

Data Quality Data Lakes Data Warehouse Big Data

What Industries are Hiring for Different Jobs in AI

ODSC - Open Data Science

APRIL 26, 2023

Data Engineer Data engineers are the authors of the infrastructure that stores, processes, and manages the large volumes of data an organization has. The main aspect of their profession is the building and maintenance of data pipelines, which allow for data to move between sources.

Data Analyst

Data Analyst Machine Learning Machine Learning Power BI

How to Manage Unstructured Data in AI and Machine Learning Projects

DagsHub

OCTOBER 23, 2024

With proper unstructured data management, you can write validation checks to detect multiple entries of the same data. Continuous learning: In a properly managed unstructured data pipeline, you can use new entries to train a production ML model, keeping the model up-to-date.

Machine Learning

Machine Learning Machine Learning AI AI

Discover the Snowflake Architecture With All its Pros and Cons- NIX United

Mlearning.ai

FEBRUARY 16, 2023

Thus, the solution allows for scaling data workloads independently from one another and seamlessly handling data warehousing, data lakes , data sharing, and engineering. Data Security and Governance Maintaining data security is crucial for any company.

Data Warehouse

Data Warehouse Business Intelligence Business Intelligence Database

The Data Engineer’s Roadmap

Dataversity

SEPTEMBER 28, 2022

Data engineering is a fascinating and fulfilling career – you are at the helm of every business operation that requires data, and as long as users generate data, businesses will always need data engineers. The journey to becoming a successful data engineer […]. In other words, job security is guaranteed.

Data Engineer

Data Engineer Data Engineering Data Engineering Data Engineering

Capital One’s data-centric solutions to banking business challenges

Snorkel AI

MAY 12, 2023

The reason is that most teams do not have access to a robust data ecosystem for ML development. billion is lost by Fortune 500 companies because of broken data pipelines and communications. Publishing standards for data and governance of that data is either missing or very widely far from an ideal.

Machine Learning

Machine Learning Machine Learning ML ML

Capital One’s data-centric solutions to banking business challenges

Snorkel AI

MAY 12, 2023

The reason is that most teams do not have access to a robust data ecosystem for ML development. billion is lost by Fortune 500 companies because of broken data pipelines and communications. Publishing standards for data and governance of that data is either missing or very widely far from an ideal.

Machine Learning

Machine Learning Machine Learning ML ML

Data Governance for Dummies: Your Questions, Answered

Alation

FEBRUARY 17, 2023

This past week, I had the pleasure of hosting Data Governance for Dummies author Jonathan Reichental for a fireside chat , along with Denise Swanson , Data Governance lead at Alation. Can you have proper data management without establishing a formal data governance program?

Data Governance

Data Governance Data Quality Data Analyst Data Pipeline

The Ultimate Modern Data Stack Migration Guide

phData

JULY 18, 2023

Enter dbt dbt provides SQL-centric transformations for your data modeling and transformations, which is efficient for scrubbing and transforming your data while being an easy skill set to hire for and develop within your teams. However, merely knowing what it consists of isn’t enough.

Data Warehouse

Data Warehouse Analytics Analytics SQL

Data Science Current

CI/CD for Data Pipelines: A Game-Changer with AnalyticsCreator

Essential data engineering tools for 2023: Empowering for management and analysis

Webinars

Trending Sources

Testing and Monitoring Data Pipelines: Part Two

Webinars

Discover the Most Important Fundamentals of Data Engineering

The Evolution of Customer Data Modeling: From Static Profiles to Dynamic Customer 360

Self-Service Analytics for Google Cloud, now with Looker and Tableau

The Data Dilemma: Exploring the Key Differences Between Data Science and Data Engineering

DataOps vs. DevOps: What’s the Difference?

Maximize the Power of dbt and Snowflake to Achieve Efficient and Scalable Data Vault Solutions

Self-Service Analytics for Google Cloud, now with Looker and Tableau

MLOps Landscape in 2023: Top Tools and Platforms

Implementing Gen AI for Financial Services

Who is a BI Developer: Role, Responsibilities & Skills

Data architecture strategy for data quality

What Industries are Hiring for Different Jobs in AI

How to Manage Unstructured Data in AI and Machine Learning Projects

Discover the Snowflake Architecture With All its Pros and Cons- NIX United

The Data Engineer’s Roadmap

Capital One’s data-centric solutions to banking business challenges

Capital One’s data-centric solutions to banking business challenges

Data Governance for Dummies: Your Questions, Answered

The Ultimate Modern Data Stack Migration Guide

Stay Connected