Artificial Intelligence, Data Engineering and Data Warehouse

A Quick Overview of Data Engineering

Analytics Vidhya

MARCH 17, 2022

Machine learning and artificial intelligence, which are at the top of the list of data science capabilities, aren’t just buzzwords; many companies are keen to implement them. Prior to developing intelligent data products, however, the frequently overlooked core work required to make it happen, […].

Data Engineer

Data Engineer Data Engineering Data Engineering Data Engineering

AWS re:Invent 2023 Amazon Redshift Sessions Recap

Flipboard

DECEMBER 18, 2023

Amazon Redshift powers data-driven decisions for tens of thousands of customers every day with a fully managed, AI-powered cloud data warehouse, delivering the best price-performance for your analytics workloads. Learn more about the AWS zero-ETL future with newly launched AWS database integrations with Amazon Redshift.

AWS

AWS Data Warehouse ETL SQL

Shaping the future: OMRON’s data-driven journey with AWS

AWS Machine Learning Blog

APRIL 3, 2025

OMRON’s data strategy, represented on ODAP, also allowed the organization to unlock generative AI use cases focused on tangible business outcomes and enhanced productivity. When needed, the system can access an ODAP data warehouse to retrieve additional information.

AWS

AWS Data Governance Data Silos SQL

Future trends in ETL

Dataconomy

FEBRUARY 12, 2024

ELT advocates for loading raw data directly into storage systems, often cloud-based, before transforming it as necessary. This shift leverages the capabilities of modern data warehouses, enabling faster data ingestion and reducing the complexities associated with traditional transformation-heavy ETL processes.
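The load-first, transform-later ordering described above can be sketched in a few lines. This is a minimal illustration, not any vendor's implementation: an in-memory SQLite database stands in for a cloud data warehouse, and the table names (`raw_events`, `daily_revenue`) and sample rows are hypothetical.

```python
import sqlite3

# Hypothetical raw events, kept exactly as they might arrive from a source system.
RAW_EVENTS = [
    ("2024-02-01", "alice", "19.99"),
    ("2024-02-01", "bob", "5.00"),
    ("2024-02-02", "alice", "  7.50 "),
]


def load_raw(conn, rows):
    """Load step: store rows untouched, as text, in a staging table."""
    conn.execute("CREATE TABLE raw_events (day TEXT, customer TEXT, amount TEXT)")
    conn.executemany("INSERT INTO raw_events VALUES (?, ?, ?)", rows)


def transform_in_warehouse(conn):
    """Transform step: runs after loading, as SQL inside the storage engine."""
    conn.execute(
        """
        CREATE TABLE daily_revenue AS
        SELECT day, ROUND(SUM(CAST(TRIM(amount) AS REAL)), 2) AS revenue
        FROM raw_events
        GROUP BY day
        """
    )


def run_elt():
    # SQLite in memory is an assumption standing in for a real warehouse.
    conn = sqlite3.connect(":memory:")
    load_raw(conn, RAW_EVENTS)
    transform_in_warehouse(conn)
    result = conn.execute(
        "SELECT day, revenue FROM daily_revenue ORDER BY day"
    ).fetchall()
    conn.close()
    return result


if __name__ == "__main__":
    for day, revenue in run_elt():
        print(day, revenue)
```

Because the raw table is kept as loaded, the transformation can be rewritten and rerun later without going back to the source, which is one reason this ordering suits storage-rich warehouses.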

ETL

ETL Data Governance Machine Learning Machine Learning

What Does a Data Engineering Job Involve in 2024?

ODSC - Open Data Science

JANUARY 30, 2024

Data engineering is a hot topic in the AI industry right now. And as data’s complexity and volume grow, its importance across industries will only become more noticeable. But what exactly do data engineers do? So let’s do a quick overview of the job of data engineer, and maybe you might find a new interest.

Data Engineer

Data Engineer Data Engineering Data Engineering Data Engineering

Most Common Use Cases of Data Engineering in Healthcare

phData

AUGUST 11, 2023

Data engineering in healthcare is taking a giant leap forward with rapid industrial development. Artificial Intelligence (AI) and Machine Learning (ML) are buzzwords these days with developments of ChatGPT, Bard, and Bing AI, among others. Thus, using data engineering is a must in 2023 for hospitals.

Data Engineer

Data Engineer Data Engineering Data Engineering Data Engineering

Why optimize your warehouse with a data lakehouse strategy

IBM Journey to AI blog

APRIL 25, 2023

We also made the case that query and reporting, provided by big data engines such as Presto, need to work with the Spark infrastructure framework to support advanced analytics and complex enterprise data decision-making. To do so, Presto and Spark need to readily work with existing and modern data warehouse infrastructures.

Data Warehouse

Data Warehouse Data Engineer Data Engineering Data Engineering

Interview: Developing a Data Strategy and Data Teams!

Data Science Blog

MARCH 3, 2023

Building a data platform, perhaps a data warehouse for data consolidation, process mining for process analysis, or predictive analytics for building a specific forecasting system, AI for anomaly detection, or something else entirely depending on the goal. But there are many young people who would like to get into this field.

Data Warehouse

Data Warehouse Data Lakes Data Engineer Data Engineering

11 Open-Source Data Engineering Tools Every Pro Should Use

ODSC - Open Data Science

FEBRUARY 6, 2024

Data engineering has become an integral part of the modern tech landscape, driving advancements and efficiencies across industries. So let’s explore the world of open-source tools for data engineers, shedding light on how these resources are shaping the future of data handling, processing, and visualization.

Data Engineer

Data Engineer Data Engineering Data Engineering Data Engineering

How to Shift from Data Science to Data Engineering

ODSC - Open Data Science

JANUARY 18, 2024

Data engineering is a rapidly growing field, and there is a high demand for skilled data engineers. If you are a data scientist, you may be wondering if you can transition into data engineering. In this blog post, we will discuss how you can become a data engineer if you are a data scientist.

Data Engineer

Data Engineer Data Engineering Data Engineering Data Engineering

The Data Dilemma: Exploring the Key Differences Between Data Science and Data Engineering

Pickl AI

JULY 25, 2023

Unfolding the difference between data engineer, data scientist, and data analyst. Data engineers are essential professionals responsible for designing, constructing, and maintaining an organization’s data infrastructure. Read more to know.

Data Engineer

Data Engineer Data Engineering Data Engineering Data Engineering

Announcing the First Speakers for the 2024 Data Engineering Summit

ODSC - Open Data Science

FEBRUARY 15, 2024

We couldn’t be more excited to announce the first sessions for our second annual Data Engineering Summit, co-located with ODSC East this April. Join us for 2 days of talks and panels from leading experts and data engineering pioneers. Is Gen AI A Data Engineering or Software Engineering Problem?

Data Engineer

Data Engineer Data Engineering Data Engineering Data Engineering

Sneak peek at Microsoft Fabric price and its promising features

Dataconomy

JUNE 1, 2023

By automating the integration of all Fabric workloads into OneLake, Microsoft eliminates the need for developers, analysts, and business users to create their own data silos. This approach not only improves performance by eliminating the need for separate data warehouses but also results in substantial cost savings for customers.

Power BI

Power BI Data Lakes Azure Data Silos

Supercharge your data strategy: Integrate and innovate today leveraging data integration

IBM Journey to AI blog

OCTOBER 22, 2024

Data is the differentiator as business leaders look to utilize their competitive edge as they implement generative AI (gen AI). Leaders feel the pressure to infuse their processes with artificial intelligence (AI) and are looking for ways to harness the insights in their data platforms to fuel this movement.

Data Silos

Data Silos Data Pipeline DataOps Business Intelligence

How to use foundation models and trusted governance to manage AI workflow risk

IBM Journey to AI blog

OCTOBER 16, 2023

Artificial intelligence (AI) adoption is still in its early stages. The Stanford Institute for Human-Centered Artificial Intelligence’s Center for Research on Foundation Models (CRFM) recently outlined the many risks of foundation models, as well as opportunities. Trustworthiness is critical.

AI

AI AI Data Warehouse ML

Podcast: Deciphering Data Architectures with James Serra

ODSC - Open Data Science

MAY 7, 2024

In this episode, James Serra, author of “Deciphering Data Architectures: Choosing Between a Modern Data Warehouse, Data Fabric, Data Lakehouse, and Data Mesh” joins us to discuss his book and dive into the current state and possible future of data architectures.

Data Warehouse

Data Warehouse Data Lakes Data Science Big Data

Building an efficient MLOps platform with OSS tools on Amazon ECS with AWS Fargate

AWS Machine Learning Blog

SEPTEMBER 18, 2024

The ZMP analyzes billions of structured and unstructured data points to predict consumer intent by using sophisticated artificial intelligence (AI) to personalize experiences at scale. Additionally, Feast promotes feature reuse, so the time spent on data preparation is reduced greatly.

AWS

AWS Machine Learning Machine Learning ML

Maximising Efficiency with ETL Data: Future Trends and Best Practices

Pickl AI

OCTOBER 17, 2024

Introduction: ETL plays a crucial role in Data Management. This process enables organisations to gather data from various sources, transform it into a usable format, and load it into data warehouses or databases for analysis. Loading: The transformed data is loaded into the target destination, such as a data warehouse.
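The gather, transform and load sequence described above can be sketched as three small functions. This is an illustrative sketch only: the CSV content, the `orders` table and the cleaning rules are hypothetical, and an in-memory SQLite database stands in for the target warehouse.

```python
import csv
import io
import sqlite3

# Hypothetical source data; a real pipeline would read files, APIs or databases.
SOURCE_CSV = """order_id,country,amount
1,us,10.50
2,DE,
3,de,4.25
"""


def extract(text):
    """Extract: read rows from the source as dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: drop rows without an amount, upper-case countries, cast types."""
    cleaned = []
    for row in rows:
        if not row["amount"].strip():
            continue  # skip incomplete records (an assumed cleaning rule)
        cleaned.append(
            (int(row["order_id"]), row["country"].upper(), float(row["amount"]))
        )
    return cleaned


def load(conn, rows):
    """Load: write the already-cleaned rows into the target table."""
    conn.execute("CREATE TABLE orders (order_id INTEGER, country TEXT, amount REAL)")
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()


def run_etl():
    # SQLite in memory is an assumption standing in for a data warehouse.
    conn = sqlite3.connect(":memory:")
    load(conn, transform(extract(SOURCE_CSV)))
    rows = conn.execute(
        "SELECT order_id, country, amount FROM orders ORDER BY order_id"
    ).fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    print(run_etl())
```

Here the data is cleaned before it reaches the target, so only validated rows are ever stored; the order with the missing amount never lands in `orders`.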

ETL

ETL Data Warehouse Data Quality Data Governance

The Modern Data Stack Explained: What The Future Holds

Alation

JANUARY 17, 2023

The modern data stack is a combination of various software tools used to collect, process, and store data on a well-integrated cloud-based data platform. It is known to have benefits in handling data due to its robustness, speed, and scalability. A typical modern data stack consists of the following: A data warehouse.

Data Warehouse

Data Warehouse ETL Tableau Cloud Data

Data science vs data analytics: Unpacking the differences

IBM Journey to AI blog

SEPTEMBER 19, 2023

Overview: Data science vs data analytics. Think of data science as the overarching umbrella that covers a wide range of tasks performed to find patterns in large datasets, structure data for use, train machine learning models and develop artificial intelligence (AI) applications.

Data Science

Data Science Analytics Analytics Data Scientist

How OLAP and AI can enable better business

IBM Journey to AI blog

DECEMBER 7, 2023

Online analytical processing (OLAP) database systems and artificial intelligence (AI) complement each other and can help enhance data analysis and decision-making when used in tandem. Today, OLAP database systems have become comprehensive and integrated data analytics platforms, addressing the diverse needs of modern businesses.

Data Preparation

Data Preparation Database Data Analysis Data Analysis

How Reveal’s Logikcull used Amazon Comprehend to detect and redact PII from legal documents at scale

AWS Machine Learning Blog

NOVEMBER 1, 2023

Jeff Newburn is a Senior Software Engineering Manager leading the Data Engineering team at Logikcull – A Reveal Technology. He oversees the company’s data initiatives, including data warehouses, visualizations, analytics, and machine learning. Outside of work, he enjoys playing lawn tennis and reading books.

AWS

AWS Machine Learning Machine Learning ML

Exploring the AI and data capabilities of watsonx

IBM Journey to AI blog

JULY 17, 2023

Within watsonx.ai, users can take advantage of open-source frameworks like PyTorch, TensorFlow and scikit-learn alongside IBM’s entire machine learning and data science toolkit and its ecosystem tools for code-based and visual data science capabilities.

AI

AI AI Machine Learning Machine Learning

Scale knowledge management use cases with generative AI

IBM Journey to AI blog

JULY 27, 2023

Artificial intelligence is disrupting many different areas of business. Powering a knowledge management system with a data lakehouse: Organizations need a data lakehouse to target data challenges that come with deploying an AI-powered knowledge management system. A data lakehouse is a fit-for-purpose data store.

AI

AI AI Data Scientist Data Quality

The Ultimate Modern Data Stack Migration Guide

phData

JULY 18, 2023

With the birth of cloud data warehouses, data applications, and generative AI, processing large volumes of data faster and cheaper is more approachable and desired than ever. First up, let’s dive into the foundation of every Modern Data Stack, a cloud-based data warehouse.

Data Warehouse

Data Warehouse Analytics Analytics Cloud Data

8 Data Lake Vendors to Make Your Data Life Easier in 2023

ODSC - Open Data Science

JUNE 7, 2023

Data has to be stored somewhere. Data warehouses are repositories for your cleaned, processed data, but what about all that unstructured data your organization is starting to notice? What is a data lake? Snowflake: Snowflake is a cross-cloud platform that looks to break down data silos.

Data Lakes

Data Lakes Azure Data Warehouse Hadoop

AI-Powered ETL Pipeline Orchestration: Multi-Agent Systems in the Era of Generative AI

ODSC - Open Data Science

FEBRUARY 19, 2025

Well, according to Brij Kishore Pandey, it stands for Extract, Transform, Load and is a fundamental process in data engineering, ensuring data moves efficiently from raw sources to structured storage for analysis. The steps include: Extraction: Data is collected from multiple sources (databases, APIs, flat files).

ETL

ETL AI AI Data Warehouse

Building Robust Data Pipelines: 9 Fundamentals and Best Practices to Follow

Alation

MAY 16, 2023

This makes it easier to compare and contrast information and provides organizations with a unified view of their data. Machine Learning: Data pipelines feed all the necessary data into machine learning algorithms, thereby making this branch of Artificial Intelligence (AI) possible.

Data Pipeline

Data Pipeline Data Governance Data Lakes Data Warehouse

Booths and Demos Coming to the ODSC West 2024 Expo Hall

ODSC - Open Data Science

OCTOBER 7, 2024

There you’ll hear from Ivan Nardini, Developer Relations Engineer at Google Cloud and discover the latest advancements in AI and learn how to leverage Google Cloud’s powerful tools and infrastructure to drive innovation in your organization.

Data Scientist

Data Scientist Data Science Machine Learning Machine Learning

Transitioning off Amazon Lookout for Metrics

AWS Machine Learning Blog

OCTOBER 9, 2024

Using Amazon Redshift ML for anomaly detection: Amazon Redshift ML makes it easy to create, train, and apply machine learning models using familiar SQL commands in Amazon Redshift data warehouses. To learn more, see the documentation.

AWS

AWS ML ML Data Quality

Data science vs. machine learning: What’s the difference?

IBM Journey to AI blog

JULY 6, 2023

Data science solves a business problem by understanding the problem, knowing the data that’s required, and analyzing the data to help solve the real-world problem. Machine learning (ML) is a subset of artificial intelligence (AI) that focuses on learning from what the data science comes up with.

Machine Learning

Machine Learning Machine Learning Data Science Big Data

Tackling AI’s data challenges with IBM databases on AWS

IBM Journey to AI blog

MARCH 14, 2024

Businesses face significant hurdles when preparing data for artificial intelligence (AI) applications. The existence of data silos and duplication, alongside apprehensions regarding data quality, presents a multifaceted environment for organizations to manage.

AWS

AWS Database ETL AI

How IBM Data Product Hub helps you unlock business intelligence potential

IBM Journey to AI blog

OCTOBER 2, 2024

Inconsistent data quality: The uncertainty surrounding the accuracy, consistency and reliability of data pulled from various sources can lead to risks in analysis and reporting. These products are curated with key attributes such as business domain, access level, delivery methods, recommended usage and data contracts.

Business Intelligence

Business Intelligence Business Intelligence Power BI Data Quality

Data Analytics in the Age of AI, When to Use RAG, Examples of Data Visualization with D3 and Vega…

ODSC - Open Data Science

APRIL 4, 2024

ODSC Highlights: Announcing the Keynote and Featured Speakers for ODSC East 2024. The keynotes and featured speakers for ODSC East 2024 have won numerous awards, authored books and widely cited papers, and shaped the future of data science and AI with their research. Learn more about them here!

Data Visualization

Data Visualization Analytics Analytics Big Data Analytics

Snowflake Snowpark: cloud SQL and Python ML pipelines

Snorkel AI

MAY 26, 2023

Ahmad Khan, head of artificial intelligence and machine learning strategy at Snowflake, gave a presentation entitled “Scalable SQL + Python ML Pipelines in the Cloud” about his company’s Snowpark service at Snorkel AI’s Future of Data-Centric AI virtual conference in August 2022.

SQL

SQL ML ML Python

Your Complete Roadmap to Become an Azure Data Scientist

Pickl AI

SEPTEMBER 5, 2024

Data Preparation: Cleaning, transforming, and preparing data for analysis and modelling. Collaborating with Teams: Working with data engineers, analysts, and stakeholders to ensure data solutions meet business needs.

Azure

Azure Data Scientist Data Science Machine Learning

Top Advanced Text Data Labeling Techniques: A Comprehensive Guide

DagsHub

JANUARY 27, 2025

It enables efficient active learning by iteratively selecting the most valuable data points for labeling, reducing manual effort while improving model performance. LabelBox: LabelBox is an AI-powered data engine platform that supports text annotation along with other data types.
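One common way to pick "the most valuable data points" is uncertainty sampling: send the examples the current model is least sure about to annotators first. The sketch below shows a single selection round under that assumption. It is not taken from the article or from any labeling platform's API, and `toy_predict_proba` is a hypothetical stand-in for a trained binary text classifier.

```python
def uncertainty(prob_positive):
    """Score is 1.0 when the model outputs 0.5 and 0.0 when it outputs 0 or 1."""
    return 1.0 - abs(prob_positive - 0.5) * 2.0


def select_for_labeling(unlabeled, predict_proba, batch_size):
    """Return the batch_size texts with the most uncertain predictions."""
    ranked = sorted(
        unlabeled,
        key=lambda text: uncertainty(predict_proba(text)),
        reverse=True,
    )
    return ranked[:batch_size]


def toy_predict_proba(text):
    """Hypothetical classifier: keyword counts mapped to a probability in [0, 1]."""
    positive = {"great", "good", "love"}
    negative = {"bad", "awful", "hate"}
    words = text.lower().split()
    score = sum(w in positive for w in words) - sum(w in negative for w in words)
    return min(1.0, max(0.0, 0.5 + 0.25 * score))


# Hypothetical pool of unlabeled texts.
POOL = ["great great product", "good but bad", "awful", "it arrived"]

if __name__ == "__main__":
    print(select_for_labeling(POOL, toy_predict_proba, batch_size=2))
```

In a full loop, the selected texts would be labeled, added to the training set, and the model retrained before the next round of selection.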

Machine Learning

Machine Learning Machine Learning Natural Language Processing Supervised Learning
