Apache Kafka and Artificial Intelligence

Maximizing your event-driven architecture investments: Unleashing the power of Apache Kafka with IBM Event Automation

IBM Journey to AI blog

FEBRUARY 12, 2024

At the forefront of this event-driven revolution is Apache Kafka, the widely recognized and dominant open-source technology for event streaming. While most enterprises have already recognized how Apache Kafka provides a strong foundation for EDA, they often fall behind in unlocking its true potential.

Apache Kafka

Apache Kafka EDA SQL Database

Real-time artificial intelligence and event processing

IBM Journey to AI blog

NOVEMBER 29, 2023

Artificial intelligence is also key for businesses, helping provide capabilities for both streamlining business processes and improving strategic decisions. Events as fuel for AI Models: Artificial intelligence models rely on big data to refine the effectiveness of their capabilities.

Artificial Intelligence

Artificial Intelligence Artificial Intelligence Apache Kafka AI

Streaming Machine Learning Without a Data Lake

ODSC - Open Data Science

MAY 31, 2023

Be sure to check out his talk, “ Apache Kafka for Real-Time Machine Learning Without a Data Lake ,” there! The combination of data streaming and machine learning (ML) enables you to build one scalable, reliable, but also simple infrastructure for all machine learning tasks using the Apache Kafka ecosystem.

Data Lakes

Data Lakes Machine Learning Machine Learning Apache Kafka

Webinars

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

MORE WEBINARS

22 Widely Used Data Science and Machine Learning Tools in 2020

Analytics Vidhya

JUNE 27, 2020

Overview There are a plethora of data science tools out there – which one should you pick up? Here’s a list of over 20. The post 22 Widely Used Data Science and Machine Learning Tools in 2020 appeared first on Analytics Vidhya.

Data Science

Data Science Machine Learning Machine Learning Analytics

What Are AI Credits and How Can Data Scientists Use Them?

ODSC - Open Data Science

APRIL 23, 2025

With AI credits, teams can streamline the annotation process using intelligent suggestions and quality control mechanisms. Confluent Confluent provides a robust data streaming platform built around Apache Kafka. Amazon Web Services(AWS) AWS offers one of the most extensive AI and ML infrastructures in the world.

Data Scientist

Data Scientist Azure Apache Kafka ML

Machine Learning with MATLAB and Amazon SageMaker

Flipboard

NOVEMBER 21, 2023

MATLAB   is a popular programming tool for a wide range of applications, such as data processing, parallel computing, automation, simulation, machine learning, and artificial intelligence. It’s heavily used in many industries such as automotive, aerospace, communication, and manufacturing.

Machine Learning

Machine Learning Machine Learning AWS Decision Trees

All of the Free Virtual Sessions Coming to ODSC Europe 2023

ODSC - Open Data Science

JUNE 7, 2023

Wednesday, June 14th Me, my health, and AI: applications in medical diagnostics and prognostics: Sara Khalid | Associate Professor, Senior Research Fellow, Biomedical Data Science and Health Informatics | University of Oxford Iterated and Exponentially Weighted Moving Principal Component Analysis : Dr. Paul A.

Apache Kafka

Apache Kafka Machine Learning Machine Learning Data Science

Building a Pizza Delivery Service with a Real-Time Analytics Stack

ODSC - Open Data Science

JUNE 1, 2023

We’re going to assume that the pizza service already captures orders in Apache Kafka and is also keeping a record of its customers and the products that they sell in MySQL. Apache Pinot is a real-time OLAP database built at LinkedIn to deliver scalable real-time analytics with low latency.

Analytics

Analytics Analytics Apache Kafka Data Science

Bundesliga Match Facts Shot Speed – Who fires the hardest shots in the Bundesliga?

AWS Machine Learning Blog

NOVEMBER 3, 2023

m How it’s implemented In our quest to accurately determine shot speed during live matches, we’ve implemented a cutting-edge solution using Amazon Managed Streaming for Apache Kafka (Amazon MSK). He is passionate about enabling customers on their data and artificial intelligence (AI) journey to the cloud.

AWS

AWS Apache Kafka Data Scientist Data Science

11 Open-Source Data Engineering Tools Every Pro Should Use

ODSC - Open Data Science

FEBRUARY 6, 2024

Apache Kafka For data engineers dealing with real-time data, Apache Kafka is a game-changer. Spark offers a versatile range of functionalities, from batch processing to stream processing, making it a comprehensive solution for complex data challenges.

Data Engineering

Data Engineering Data Engineering Data Engineer Data Engineering

Anomaly detection in streaming time series data with online learning using Amazon Managed Service for Apache Flink

AWS Machine Learning Blog

SEPTEMBER 11, 2024

It initially sources input time series data from Amazon Managed Streaming for Apache Kafka (Amazon MSK) using this live stream for model training. The application, once deployed, constructs an ML model using the Random Cut Forest (RCF) algorithm. Post-training, the model continues to process incoming data points from the stream.

AWS

AWS ML ML Apache Kafka

Transitioning off Amazon Lookout for Metrics

AWS Machine Learning Blog

OCTOBER 9, 2024

Customers can use the CloudFormation template to bring up an application stack that receives time-series data from an Amazon Managed Streaming for Apache Kafka (Amazon MSK) streaming source and performs near-real-time anomaly detection in the streaming data.

AWS

AWS ML ML Data Quality

How Netflix Applies Big Data Across Business Verticals: Insights and Strategies

Pickl AI

SEPTEMBER 18, 2024

Data in Motion Technologies like Apache Kafka facilitate real-time processing of events and data, allowing Netflix to respond swiftly to user interactions and operational needs. Data at Rest This includes storage solutions such as S3 Data Warehouse and Cassandra. What Technologies Does Netflix Use for Its Big Data Infrastructure?

Big Data

Big Data Big Data Apache Kafka Big Data Analytics

What is Data Ingestion? Understanding the Basics

Pickl AI

JULY 25, 2024

Enhanced Data Utilisation Effective ingestion unlocks the full potential of data by making it available for advanced analytics, machine learning, and artificial intelligence applications, driving innovation and business growth. Apache Kafka An open-source platform designed for real-time data streaming.

Apache Kafka

Apache Kafka Data Lakes Data Warehouse Data Quality

Predicting the Future of Data Science

Pickl AI

DECEMBER 4, 2024

The rise of advanced technologies such as Artificial Intelligence (AI), Machine Learning (ML) , and Big Data analytics is reshaping industries and creating new opportunities for Data Scientists. Apache Kafka), organisations can now analyse vast amounts of data as it is generated. Here are five key trends to watch.

Data Science

Data Science Data Scientist Machine Learning Machine Learning

Bundesliga Match Fact Ball Recovery Time: Quantifying teams’ success in pressing opponents on AWS

AWS Machine Learning Blog

MARCH 30, 2023

To ensure real-time updates of ball recovery times, we have implemented Amazon Managed Streaming for Apache Kafka (Amazon MSK) as a central solution for data streaming and messaging. This allows for seamless communication of positional data and various outputs of Bundesliga Match Facts between containers in real time.

AWS

AWS Machine Learning Machine Learning Apache Kafka

Pictures and Highlights from ODSC Europe 2023

ODSC - Open Data Science

JULY 22, 2023

Leverage Compound Sparsity to Achieve the Fastest Inference Performance on CPUs: Damian Bogunowicz | Neural Magic and Konstantin Gulin | Machine Learning Engineer | Neural Magic Apache Kafka for Real-Time Machine Learning Without a Data Lake: Kai Waehner | Global Field CTO | Author, International Speaker Time Series Forecasting for Managers — All Forecasts (..)

Apache Kafka

Apache Kafka Machine Learning Machine Learning Data Science

Watch the Top ODSC Europe 2023 Virtual Sessions Here

ODSC - Open Data Science

JULY 14, 2023

The session participants will learn the theory behind compound sparsity, state-of-the-art techniques, and how to apply it in practice using the Neural Magic platform.

Machine Learning

Machine Learning Machine Learning Apache Kafka Data Science

Unlock the knowledge in your Slack workspace with Slack connector for Amazon Q Business

AWS Machine Learning Blog

OCTOBER 9, 2024

I am currently using Apache Kafka. The #customerwork Slack channel is being used to communicate about an upcoming customer engagement, as shown in the following figure. Post the first question to Amazon Q Business. Can you list high level steps involved in migration to Amazon MSK?

AWS

AWS Apache Kafka Data Scientist Database Administration

How Thomson Reuters delivers personalized content subscription plans at scale using Amazon Personalize

AWS Machine Learning Blog

JANUARY 6, 2023

Then the events are ingested into TR’s centralized streaming platform, which is built on top of Amazon Managed Streaming for Kafka (Amazon MSK). Amazon MSK makes it easy to ingest and process streaming data in real time with fully managed Apache Kafka.

AWS

AWS Data Warehouse ML ML

Exploring Database Management Systems in Social Media Giants

Pickl AI

OCTOBER 21, 2024

In response, Twitter has implemented various solutions, including Apache Kafka, a distributed streaming platform that helps manage the data flow from user interactions. Using Kafka, Twitter can effectively handle high-throughput data streams, enabling users to receive timely notifications and updates.

Database

Database Apache Kafka Machine Learning Machine Learning

Why Software Engineers Should Be Embracing AI: A Guide to Staying Ahead

ODSC - Open Data Science

OCTOBER 9, 2024

Efficient Incremental Processing with Apache Iceberg and Netflix Maestro Dimensional Data Modeling in the Modern Era Building Big Data Workflows: NiFi, Hive, Trino, & Zeppelin An Introduction to Data Contracts From Data Mess to Data Mesh — Data Management in the Age of Big Data and Gen AI Introduction to Containers for Data Science / Data Engineering (..)

Apache Kafka

Apache Kafka AI AI Machine Learning

Big Data Syllabus: A Comprehensive Overview

Pickl AI

AUGUST 9, 2024

Data Streaming Learning about real-time data collection methods using tools like Apache Kafka and Amazon Kinesis. Future Trends Exploring emerging trends in Big Data, such as the rise of edge computing, quantum computing, and advancements in artificial intelligence.

Big Data

Big Data Big Data Big Data Analytics Big Data Analytics

A Comprehensive Guide to the main components of Big Data

Pickl AI

DECEMBER 2, 2024

Real-time processing allows organisations to make timely decisions based on current data rather than relying on historical information.Technologies enabling real-time analytics include: Stream Processing Frameworks: Tools like Apache Kafka facilitate the continuous ingestion and processing of streaming data.

Big Data

Big Data Big Data Data Lakes Apache Hadoop

Bundesliga Match Fact Keeper Efficiency: Comparing keepers’ performances objectively using machine learning on AWS

AWS Machine Learning Blog

MARCH 30, 2023

For every xSaves prediction, it produces a message with the prediction as a payload, which then gets distributed by a central message broker running on Amazon Managed Streaming for Apache Kafka (Amazon MSK). The information also gets stored in a data lake for future auditing and model improvements.

Machine Learning

Machine Learning Machine Learning AWS ML

Building a Business with a Real-Time Analytics Stack, Streaming ML Without a Data Lake, and…

ODSC - Open Data Science

MAY 24, 2023

Streaming Machine Learning Without a Data Lake The combination of data streaming and ML enables you to build one scalable, reliable, but also simple infrastructure for all machine learning tasks using the Apache Kafka ecosystem.

Data Lakes

Data Lakes ML ML Analytics

ML Pipeline Architecture Design Patterns (With 10 Real-World Examples)

The MLOps Blog

AUGUST 11, 2023

Apache Kafka, Amazon Kinesis) 2 Data Preprocessing (e.g., Today different stages exist within ML pipelines built to meet technical, industrial, and business requirements. This section delves into the common stages in most ML pipelines, regardless of industry or business function. 1 Data Ingestion (e.g.,

ML

ML ML Machine Learning Machine Learning

Data Science Current

Maximizing your event-driven architecture investments: Unleashing the power of Apache Kafka with IBM Event Automation

Real-time artificial intelligence and event processing

Webinars

Trending Sources

Streaming Machine Learning Without a Data Lake

Webinars

22 Widely Used Data Science and Machine Learning Tools in 2020

What Are AI Credits and How Can Data Scientists Use Them?

Machine Learning with MATLAB and Amazon SageMaker

All of the Free Virtual Sessions Coming to ODSC Europe 2023

Building a Pizza Delivery Service with a Real-Time Analytics Stack

Bundesliga Match Facts Shot Speed – Who fires the hardest shots in the Bundesliga?

11 Open-Source Data Engineering Tools Every Pro Should Use

Anomaly detection in streaming time series data with online learning using Amazon Managed Service for Apache Flink

Transitioning off Amazon Lookout for Metrics

How Netflix Applies Big Data Across Business Verticals: Insights and Strategies

What is Data Ingestion? Understanding the Basics

Predicting the Future of Data Science

Bundesliga Match Fact Ball Recovery Time: Quantifying teams’ success in pressing opponents on AWS

Pictures and Highlights from ODSC Europe 2023

Watch the Top ODSC Europe 2023 Virtual Sessions Here

Unlock the knowledge in your Slack workspace with Slack connector for Amazon Q Business

How Thomson Reuters delivers personalized content subscription plans at scale using Amazon Personalize

Exploring Database Management Systems in Social Media Giants

Why Software Engineers Should Be Embracing AI: A Guide to Staying Ahead

Big Data Syllabus: A Comprehensive Overview

A Comprehensive Guide to the main components of Big Data

Bundesliga Match Fact Keeper Efficiency: Comparing keepers’ performances objectively using machine learning on AWS

Building a Business with a Real-Time Analytics Stack, Streaming ML Without a Data Lake, and…

ML Pipeline Architecture Design Patterns (With 10 Real-World Examples)

Stay Connected