This article was published as a part of the Data Science Blogathon. The big data industry is growing daily and needs tools to process vast volumes of data. That’s why you need to know about Apache Kafka, a publish-subscribe messaging system you can use to build distributed applications.
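To make the publish-subscribe idea concrete, here is a minimal sketch using the kafka-python client; the broker address (localhost:9092) and topic name (user-events) are illustrative assumptions, not details from the article.

```python
# Minimal publish-subscribe sketch with the kafka-python client.
# Broker address and topic name are illustrative assumptions.
from kafka import KafkaProducer, KafkaConsumer

# Publisher: send an event to a topic.
producer = KafkaProducer(bootstrap_servers="localhost:9092")
producer.send("user-events", b"page_view:/pricing")
producer.flush()  # block until the broker has the message

# Subscriber: read events back from the same topic.
consumer = KafkaConsumer(
    "user-events",
    bootstrap_servers="localhost:9092",
    auto_offset_reset="earliest",  # start from the beginning of the topic
)
for message in consumer:
    print(message.topic, message.offset, message.value)
```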
Today, we expect web applications to respond to user queries quickly, if not immediately. Caching is used to solve […]. Source: kafka.apache.org. The post Apache Kafka Use Cases and Installation Guide appeared first on Analytics Vidhya.
Earlier, I introduced the basic concepts of Apache Kafka in my blog on Analytics Vidhya (the link is available under References). The post Exploring Partitions and Consumer Groups in Apache Kafka appeared first on Analytics Vidhya.
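The partition and consumer-group idea can be sketched as follows (hedged: the topic, group name, and broker address are illustrative). Consumers that share a group_id divide the topic's partitions among themselves, so each record is processed by exactly one member of the group.

```python
# Consumer-group sketch with the kafka-python client.
# Run this script in two terminals: both processes join the same group_id,
# so Kafka assigns each of them a disjoint subset of the topic's partitions
# and every record is handled by exactly one group member.
from kafka import KafkaConsumer

consumer = KafkaConsumer(
    "user-events",
    bootstrap_servers="localhost:9092",
    group_id="analytics-service",   # same group_id => partitions are shared
    auto_offset_reset="earliest",
)

for record in consumer:
    # record.partition shows which partition this member was assigned
    print(f"partition={record.partition} offset={record.offset} value={record.value}")
```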
All these sites use some event streaming tool to monitor user activities. […]. The post Introduction to Apache Kafka: Fundamentals and Working appeared first on Analytics Vidhya.
"Learning is an active process. We learn by doing. Only knowledge that is used sticks in your mind." - Dale Carnegie. Apache Kafka is a software framework for storing, reading, and analyzing streaming data.
Data engineers play a crucial role in managing and processing big data. They are responsible for designing, building, and maintaining the infrastructure and tools needed to manage and process large volumes of data effectively. What is data engineering?
Data engineering has become an integral part of the modern tech landscape, driving advancements and efficiencies across industries. So let’s explore the world of open-source tools for data engineers, shedding light on how these resources are shaping the future of data handling, processing, and visualization.
Summary: The fundamentals of Data Engineering encompass essential practices like data modelling, warehousing, pipelines, and integration. Understanding these concepts enables professionals to build robust systems that facilitate effective data management and insightful analysis. What is Data Engineering?
Refer to the Unlocking the Power of Big Data article to understand the use cases for the data collected from these various sources. Data Ingestion: Data is collected and funneled into the pipeline using batch or real-time methods, leveraging tools like Apache Kafka, AWS Kinesis, or custom ETL scripts.
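As a rough, hedged sketch of the batch vs. real-time choice above (the file path, topic, broker address, and the load_into_warehouse helper are all hypothetical), batch ingestion might read a source file in chunks on a schedule, while real-time ingestion pushes each event into a stream as it occurs:

```python
# Sketch of the two ingestion styles; the path, topic, broker address and
# the load_into_warehouse helper are illustrative assumptions.
import json
import pandas as pd
from kafka import KafkaProducer

def load_into_warehouse(frame: pd.DataFrame) -> None:
    # Placeholder: in practice this would write to a warehouse or data lake.
    print(f"loaded {len(frame)} rows")

# Batch ingestion: read a large file in chunks on a schedule.
def ingest_batch(csv_path: str) -> None:
    for chunk in pd.read_csv(csv_path, chunksize=10_000):
        load_into_warehouse(chunk)

# Real-time ingestion: push each event into a Kafka topic as it occurs.
producer = KafkaProducer(
    bootstrap_servers="localhost:9092",
    value_serializer=lambda event: json.dumps(event).encode("utf-8"),
)

def ingest_event(event: dict) -> None:
    producer.send("raw-events", event)
```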
Data engineering is a rapidly growing field that designs and develops systems to process and manage large amounts of data. There are various architectural design patterns in data engineering that are used to solve different data-related problems.
Many platforms and sources generate streaming data. In this article, we will go through the basics of streaming data: what it is and how it differs from traditional data. We will also get familiar with tools that can help record and further analyse this data.
Managing unstructured data is essential for the success of machine learning (ML) projects. Without structure, data is difficult to analyze, and extracting meaningful insights and patterns is challenging. This article will discuss how to properly manage unstructured data for AI and ML projects.
This article delves into how duplicate data can affect machine learning models, impacting their accuracy and other performance metrics. We'll try to uncover practical strategies to identify, analyze, and manage duplicate data effectively. We hope you find this article thought-provoking!
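As one hedged example of such a strategy (the column names and values are made up for illustration), pandas can count and drop exact duplicate rows before a dataset is used for training:

```python
# Identifying and removing exact duplicate rows with pandas.
# Column names and values are illustrative only.
import pandas as pd

df = pd.DataFrame({
    "user_id": [1, 2, 2, 3],
    "label":   [0, 1, 1, 0],
})

# Count rows that are exact copies of an earlier row.
num_duplicates = df.duplicated().sum()
print(f"{num_duplicates} duplicate row(s) found")

# Keep the first occurrence of each row and drop the rest, so the model
# does not see (and overweight) the same example more than once.
deduplicated = df.drop_duplicates(keep="first").reset_index(drop=True)
print(deduplicated)
```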