AI, Data Pipeline and Database - Data Science Current

Setup Mage AI with Postgres to Build and Manage Your Data Pipeline

Analytics Vidhya

SEPTEMBER 12, 2024

Introduction: Imagine yourself as a data professional tasked with creating an efficient data pipeline to streamline processes and generate real-time information. That’s where Mage AI comes in to ensure that the lenders operating online gain a competitive edge. Sounds challenging, right?

Data Pipeline

Data Pipeline AI AI Analytics

How to Build a SQL Agent with CrewAI and Composio?

Analytics Vidhya

JULY 1, 2024

It serves as the primary means for communicating with relational databases, where most organizations store crucial data. SQL plays a significant role in analyzing complex data, creating data pipelines, and efficiently managing data warehouses.

SQL

SQL Data Warehouse Data Pipeline Database

Relational Graph Transformers

Hacker News

APRIL 28, 2025

Relational Graph Transformers represent the next evolution in Relational Deep Learning, allowing AI systems to seamlessly navigate and learn from data spread across multiple tables.

Data Pipeline

Data Pipeline Deep Learning Deep Learning Database

Webinars

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?


Securing the data pipeline, from blockchain to AI

Dataconomy

OCTOBER 8, 2024

Almost every tech company today is up to its neck in generative AI, with Google focused on enhancing search, Microsoft betting the house on business productivity gains with its family of copilots, and startups like Runway AI and Stability AI going all-in on video and image creation. Why is data integrity important?

Data Pipeline

Data Pipeline AI AI Data Warehouse

What is Data Pipeline? A Detailed Explanation

Smart Data Collective

OCTOBER 17, 2022

Data pipelines automatically fetch information from various disparate sources for further consolidation and transformation into high-performing data storage. There are a number of challenges in data storage, which data pipelines can help address. Choosing the right data pipeline solution.

Data Pipeline

Data Pipeline Data Warehouse ETL Data Lakes

Matillion Democratizes GenAI with No-Code Cortex Components on Snowflake AI Data Cloud

insideBIGDATA

JUNE 4, 2024

Modern data pipeline platform provider Matillion today announced at Snowflake Data Cloud Summit 2024 that it is bringing no-code Generative AI (GenAI) to Snowflake users with new GenAI capabilities and integrations with Snowflake Cortex AI, Snowflake ML Functions, and support for Snowpark Container Services.

Data Pipeline

Data Pipeline ML ML AI

Architect a mature generative AI foundation on AWS

Flipboard

MAY 30, 2025

Generative AI applications seem simple: invoke a foundation model (FM) with the right context to generate a response. Many organizations have siloed generative AI initiatives, with development managed independently by various departments and lines of businesses (LOBs). This approach facilitates centralized governance and operations.

AWS

AWS AI AI Database

Create a generative AI-based application builder assistant using Amazon Bedrock Agents

AWS Machine Learning Blog

OCTOBER 24, 2024

Amazon Bedrock Agents helps you accelerate generative AI application development by orchestrating multistep tasks. With the power of AI automation, you can boost productivity and reduce cost. The generative AI–based application builder assistant from this post will help you accomplish tasks through all three tiers.

AWS

AWS SQL Database AI

Streaming Data Pipelines: What Are They and How to Build One

Precisely

DECEMBER 28, 2023

The concept of streaming data was born of necessity. More than ever, advanced analytics, ML, and AI are providing the foundation for innovation, efficiency, and profitability. But insights derived from day-old data don’t cut it. Business success is based on how we use continuously changing data.

Data Pipeline

Data Pipeline Apache Kafka Big Data Big Data

Unlocking generative AI for enterprises: How SnapLogic powers their low-code Agent Creator using Amazon Bedrock

AWS Machine Learning Blog

OCTOBER 23, 2024

The landscape of enterprise application development is undergoing a seismic shift with the advent of generative AI. This intuitive platform enables the rapid development of AI-powered solutions such as conversational interfaces, document summarization tools, and content generation apps through a drag-and-drop interface.

AI

AI AI AWS Database

The 6 best ChatGPT plugins for data science

Data Science Dojo

OCTOBER 2, 2023

This can be useful for data scientists who need to streamline their data science pipeline or automate repetitive tasks. It provides access to a vast database of scholarly articles and books, as well as tools for literature review and data analysis.

Data Science

Data Science Machine Learning Machine Learning Data Analysis

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

AWS Machine Learning Blog

DECEMBER 4, 2024

Building generative AI applications presents significant challenges for organizations: they require specialized ML expertise, complex infrastructure management, and careful orchestration of multiple services. The following diagram illustrates the conceptual architecture of an AI assistant with Amazon Bedrock IDE.

AWS

AWS AI AI SQL

The power of remote engine execution for ETL/ELT data pipelines

IBM Journey to AI blog

MAY 15, 2024

Business leaders risk compromising their competitive edge if they do not proactively implement generative AI (gen AI). However, businesses scaling AI face entry barriers. This situation will exacerbate data silos, increase costs and complicate the governance of AI and data workloads.

Data Pipeline

Data Pipeline ETL SQL Database

Enhanced diagnostics flow with LLM and Amazon Bedrock agent integration

Flipboard

JUNE 3, 2025

To solve this, Noodoe has integrated large language models (LLMs) through Amazon Bedrock and Amazon Bedrock Agents to deliver intelligent automation, real-time data access, and multilingual support. In this post, we explore how Noodoe uses AI and Amazon Bedrock to optimize EV charging operations.

AWS

AWS Apache Kafka Database AI

Designing generative AI workloads for resilience

AWS Machine Learning Blog

FEBRUARY 1, 2024

Resilience plays a pivotal role in the development of any workload, and generative AI workloads are no different. There are unique considerations when engineering generative AI workloads through a resilience lens. In this post, we discuss the different stacks of a generative AI workload and what those considerations should be.

AWS

AWS AI AI Database

Align and monitor your Amazon Bedrock powered insurance assistance chatbot to responsible AI principles with AWS Audit Manager

AWS Machine Learning Blog

JANUARY 7, 2025

Generative AI applications are gaining widespread adoption across various industries, including regulated industries such as financial services and healthcare. To address this need, the AWS generative AI best practices framework was launched within AWS Audit Manager, enabling auditing and monitoring of generative AI applications.

AWS

AWS AI AI Database

HCLTech’s AWS powered AutoWise Companion: A seamless experience for informed automotive buyer decisions with data-driven design

AWS Machine Learning Blog

JANUARY 15, 2025

This post introduces HCLTech’s AutoWise Companion, a transformative generative AI solution designed to enhance customers’ vehicle purchasing journey. Powered by generative AI services on AWS and the multimodal capabilities of large language models (LLMs), HCLTech’s AutoWise Companion provides a seamless and impactful experience.

AWS

AWS SQL AI AI

Build Data Pipelines: Comprehensive Step-by-Step Guide

Pickl AI

JULY 8, 2024

Summary: This blog explains how to build efficient data pipelines, detailing each step from data collection to final delivery. Introduction: Data pipelines play a pivotal role in modern data architecture by seamlessly transporting and transforming raw data into valuable insights.

Data Pipeline

Data Pipeline Data Quality Database Apache Kafka

Enhance your Amazon Redshift cloud data warehouse with easier, simpler, and faster machine learning using Amazon SageMaker Canvas

AWS Machine Learning Blog

OCTOBER 24, 2024

Database name: Enter dev. Database user: Enter awsuser. SageMaker Canvas integration with Amazon Redshift provides a unified environment for building and deploying machine learning models, allowing you to focus on creating value with your data rather than focusing on the technical details of building data pipelines or ML algorithms.

Data Warehouse

Data Warehouse Machine Learning Machine Learning Cloud Data

OfferUp improved local results by 54% and relevance recall by 27% with multimodal search on Amazon Bedrock and Amazon OpenSearch Service

AWS Machine Learning Blog

FEBRUARY 5, 2025

The following diagram illustrates the data pipeline for indexing and query in the foundational search architecture. These databases typically use k-nearest neighbor (k-NN) indexes built with advanced algorithms such as Hierarchical Navigable Small Worlds (HNSW) and Inverted File (IVF) systems.

K-nearest Neighbors

K-nearest Neighbors Machine Learning Machine Learning Database
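
The k-NN lookup mentioned in the excerpt above can be illustrated with a minimal sketch. This is an exact, brute-force search over a tiny in-memory index, not HNSW or IVF, which are approximate structures that avoid scanning every vector. The item names and vectors are made up for illustration and do not come from the OfferUp post.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def knn(query, index, k=2):
    """Return the ids of the k vectors most similar to the query.

    Brute force: every vector in the index is scored, which is the cost
    that approximate indexes such as HNSW or IVF are designed to avoid.
    """
    scored = [(cosine_similarity(query, vec), item_id) for item_id, vec in index.items()]
    scored.sort(reverse=True)
    return [item_id for _, item_id in scored[:k]]


# Hypothetical listing embeddings (3 dimensions for readability).
index = {
    "bike": [0.9, 0.1, 0.0],
    "scooter": [0.8, 0.2, 0.1],
    "sofa": [0.0, 0.1, 0.9],
}

nearest = knn([1.0, 0.0, 0.0], index, k=2)
print(nearest)
```

In practice a vector database would replace the dictionary and the linear scan; the query interface (a vector in, the k closest ids out) stays the same.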

Navigating the World of Data Engineering: A Beginner’s Guide.

Towards AI

MARCH 21, 2023

Last updated on March 21, 2023 by Editorial Team. Author(s): Data Science meets Cyber Security. Originally published on Towards AI. What are ETL and data pipelines?

Data Engineering

Data Engineering Data Engineer Data Engineering Data Engineering

Meet the Seattle-area startups that just graduated from Y Combinator

Flipboard

SEPTEMBER 25, 2023

The companies include: Talc AI, a service for assessing large language models. Watto AI, an AI program that generates consulting reports. Neum AI, a platform designed to assist companies in maintaining the relevancy of their AI applications with the latest data.

Data Pipeline

Data Pipeline AI AI Natural Language Processing

How SnapLogic built a text-to-pipeline application with Amazon Bedrock to translate business intent into action

Flipboard

NOVEMBER 24, 2023

Many customers are building generative AI apps on Amazon Bedrock and Amazon CodeWhisperer to create code artifacts based on natural language. Amazon Bedrock is the easiest way to build and scale generative AI applications with foundation models (FMs). This post was co-written with Greg Benson, Chief Scientist; Aaron Kesler, Sr.

Database

Database AWS ETL SQL

Feature Platforms - A New Paradigm in Machine Learning Operations (MLOps)

IBM Data Science in Practice

MARCH 8, 2023

The United States published a Blueprint for the AI Bill of Rights. The AI and Machine Learning (ML) industry has continued to grow at a rapid rate over recent years. Source: A Chat with Andrew on MLOps: From Model-centric to Data-centric AI. So how does this data-centric approach fit in with Machine Learning?

Machine Learning

Machine Learning Machine Learning ML ML

The 2021 Executive Guide To Data Science and AI

Applied Data Science

AUGUST 2, 2021

This post is a bitesize walk-through of the 2021 Executive Guide to Data Science and AI, a white paper packed with up-to-date advice for any CIO or CDO looking to deliver real value through data. Automation: Automating data pipelines and models. Download the free, unabridged version here.

Data Science

Data Science Data Scientist ML ML

Best Data Engineering Tools Every Engineer Should Know

Pickl AI

MARCH 19, 2025

Summary: Data engineering tools streamline data collection, storage, and processing. Learning these tools is crucial for building scalable data pipelines. offers Data Science courses covering these tools with a job guarantee for career growth. What Does a Data Engineer Do?

Data Engineering

Data Engineering Data Engineering Data Engineer Data Engineering

Data Fabric and Address Verification Interface

IBM Data Science in Practice

NOVEMBER 28, 2022

Implementing a data fabric architecture is the answer. What is a data fabric? Data fabric is defined by IBM as “an architecture that facilitates the end-to-end integration of various data pipelines and cloud environments through the use of intelligent and automated systems.”

Data Pipeline

Data Pipeline Data Quality Data Preparation Data Governance

The Shift from Models to Compound AI Systems

BAIR

FEBRUARY 17, 2024

AI caught everyone’s attention in 2023 with Large Language Models (LLMs) that can be instructed to perform general tasks, such as translation or coding, just by prompting. In this post, we analyze the trend toward compound AI systems and what it means for AI developers.

AI

AI AI DataOps Data Pipeline

Improving air quality with generative AI

AWS Machine Learning Blog

JUNE 18, 2024

This post presents a solution that uses generative artificial intelligence (AI) to standardize air quality data from low-cost sensors in Africa, specifically addressing the data integration problem of low-cost sensors. This allows for data to be aggregated for further manufacturer-agnostic analysis.

AWS

AWS Python AI AI

Discovering the Role of Data Science in a Cloud World

Pickl AI

DECEMBER 26, 2024

AI, serverless computing, and edge technologies redefine cloud-based Data Science workflows. Defining Cloud Computing in Data Science: Cloud computing provides on-demand access to computing resources such as servers, storage, databases, and software over the Internet. billion in 2023 to USD 1,266.4

Data Science

Data Science Cloud Computing Machine Learning Machine Learning

Accelerating AI/ML development at BMW Group with Amazon SageMaker Studio

Flipboard

NOVEMBER 24, 2023

Data scientists and ML engineers require capable tooling and sufficient compute for their work. To pave the way for the growth of AI, BMW Group needed to make a leap regarding scalability and elasticity while reducing operational overhead, software licensing, and hardware management.

ML

ML ML AWS AI

LLM app platforms

Dataconomy

MARCH 20, 2025

As AI technologies continue to evolve, understanding the functionalities and development stages of LLM applications is essential for both new and seasoned developers. Data annotation: Adding relevant metadata to enhance the model’s learning capabilities. KLU.ai: Offers no-code solutions for smooth data source integration.

Data Preparation

Data Preparation Data Pipeline Data Quality Database

Orchestration Frameworks 101: Simplifying LLM-App Interactions with LangChain and Llama Index

Data Science Dojo

SEPTEMBER 14, 2023

This orchestration process encompasses interactions with external APIs, retrieval of contextual data from vector databases, and maintaining memory across multiple LLM calls. This makes it easy to connect your data pipeline to the data sources that you need. It is known for its extensibility and modularity.

Data Pipeline

Data Pipeline Python Database AI

Demystifying Time Series Database: A Comprehensive Guide

Pickl AI

JULY 8, 2024

Summary: Time series databases (TSDBs) are built for efficiently storing and analyzing data that changes over time. This data, often from sensors or IoT devices, is typically collected at regular intervals. Buckle up as we navigate the intricacies of storing and analysing this dynamic data.

Database

Database Data Pipeline Analytics Machine Learning

The journey of PGA TOUR’s generative AI virtual assistant, from concept to development to prototype

AWS Machine Learning Blog

MARCH 14, 2024

Generative artificial intelligence (generative AI) has enabled new possibilities for building intelligent systems. Recent improvements in Generative AI based large language models (LLMs) have enabled their use in a variety of applications surrounding information retrieval.

SQL

SQL AWS AI AI

Future trends in ETL

Dataconomy

FEBRUARY 12, 2024

The acronym ETL—Extract, Transform, Load—has long been the linchpin of modern data management, orchestrating the movement and manipulation of data across systems and databases. This methodology has been pivotal in data warehousing, setting the stage for analysis and informed decision-making.

ETL

ETL Data Governance Machine Learning
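
The Extract, Transform, Load sequence named in the excerpt above can be sketched in a few lines. This is a minimal illustration using only the Python standard library; the CSV content, the `orders` table, and the function names are hypothetical and are not taken from the Dataconomy article.

```python
import csv
import io
import sqlite3

# Hypothetical raw input standing in for an upstream source system.
RAW_CSV = "order_id,amount\n1, 19.99\n2,5.00\n3,not_a_number\n"


def extract(text):
    """Extract: parse CSV text into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: cast fields to typed values and drop rows that fail to parse."""
    clean = []
    for row in rows:
        try:
            clean.append((int(row["order_id"]), float(row["amount"])))
        except ValueError:
            continue  # skip malformed rows instead of failing the whole batch
    return clean


def load(rows, conn):
    """Load: write the cleaned rows into a target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders (order_id INTEGER PRIMARY KEY, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?)", rows)
    conn.commit()


def run_etl(text, conn):
    """Run the three stages in order and return (row count, total amount)."""
    load(transform(extract(text)), conn)
    return conn.execute("SELECT COUNT(*), SUM(amount) FROM orders").fetchone()


conn = sqlite3.connect(":memory:")  # in-memory database as a stand-in warehouse
row_count, total = run_etl(RAW_CSV, conn)
print(row_count, round(total, 2))
```

Keeping each stage as its own function makes it straightforward to swap the source or the target without touching the transformation logic.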

Step-by-step guide: Generative AI for your business

IBM Journey to AI blog

JULY 30, 2024

Generative artificial intelligence (gen AI) is transforming the business world by creating new opportunities for innovation, productivity and efficiency. This guide offers a clear roadmap for businesses to begin their gen AI journey. Most teams should include at least four types of team members.

AI

AI AI Data Scientist Data Preparation

Building and Scaling Gen AI Applications with Simplicity, Performance and Risk Mitigation in Mind Using Iguazio (acquired by McKinsey) and MongoDB

Iguazio

JULY 22, 2024

AI and generative AI can lead to major enterprise advancements and productivity gains. One popular gen AI use case is customer service and personalization. Gen AI chatbots have quickly transformed the way that customers interact with organizations. Another less obvious use case is fraud detection and prevention.

AI

AI AI ML ML

How Fastweb fine-tuned the Mistral model using Amazon SageMaker HyperPod as a first step to build an Italian large language model

AWS Machine Learning Blog

DECEMBER 18, 2024

AI’s transformative impact extends throughout the modern business landscape, with telecommunications emerging as a key area of innovation. Fastweb, one of Italy’s leading telecommunications operators, recognized the immense potential of AI technologies early on and began investing in this area in 2019.

Clustering

Clustering AWS AI AI

Supercharge your data strategy: Integrate and innovate today leveraging data integration

IBM Journey to AI blog

OCTOBER 22, 2024

Data is the differentiator as business leaders look to utilize their competitive edge as they implement generative AI (gen AI). Leaders feel the pressure to infuse their processes with artificial intelligence (AI) and are looking for ways to harness the insights in their data platforms to fuel this movement.

Data Silos

Data Silos Data Pipeline DataOps Business Intelligence

How Dataiku and Snowflake Strengthen the Modern Data Stack

phData

NOVEMBER 4, 2024

It must integrate seamlessly across data technologies in the stack to execute various workflows—all while maintaining a strong focus on performance and governance. Two key technologies that have become foundational for this type of architecture are the Snowflake AI Data Cloud and Dataiku. Let’s say your company makes cars.

Machine Learning

Machine Learning Machine Learning Data Science ML

Evaluate large language models for your machine translation tasks on AWS

AWS Machine Learning Blog

JANUARY 7, 2025

It is critical for AI models to capture not only the context, but also the cultural specificities to produce a more natural sounding translation. Translation memory: A translation memory is a database that stores previously translated text segments (typically sentences or phrases) along with their corresponding translations.

AWS

AWS Python AI AI
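
The translation memory described in the excerpt above can be pictured as a lookup table with fuzzy matching. The sketch below is an assumption-laden toy: the `TranslationMemory` class, its methods, and the 0.8 similarity threshold are hypothetical and are not the implementation from the AWS post. It uses `difflib` from the Python standard library to find near matches.

```python
import difflib


class TranslationMemory:
    """Toy translation memory: source segments mapped to stored translations."""

    def __init__(self):
        self._segments = {}

    def add(self, source, target):
        """Store a previously translated segment."""
        self._segments[source] = target

    def lookup(self, source, threshold=0.8):
        """Return (translation, similarity) for the best match, or None.

        Exact matches score 1.0; otherwise the closest stored segment is
        used if its similarity ratio is at least the threshold.
        """
        if source in self._segments:
            return self._segments[source], 1.0
        candidates = difflib.get_close_matches(
            source, list(self._segments), n=1, cutoff=threshold
        )
        if not candidates:
            return None
        best = candidates[0]
        score = difflib.SequenceMatcher(None, source, best).ratio()
        return self._segments[best], score


tm = TranslationMemory()
tm.add("Hello, world", "Bonjour, le monde")

exact = tm.lookup("Hello, world")
fuzzy = tm.lookup("Hello world")
miss = tm.lookup("Completely different sentence")
print(exact, fuzzy, miss)
```

A retrieved near match could then be supplied to a language model as an example translation, which is one way such a store can be combined with LLM-based translation.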

RAG vs Fine-Tuning for Enterprise LLMs

Towards AI

FEBRUARY 17, 2025

Originally published on Towards AI. As the use of large language models (LLMs) grows within businesses to automate tasks, analyse data, and engage with customers, adapting these models to specific needs (e.g.,

Database

Database Data Pipeline Data Preparation Data Quality

University of British Columbia Cloud Innovation Centre: Prototyping generative AI solutions using AWS

Flipboard

MAY 21, 2025

The University of British Columbia (UBC) Cloud Innovation Centre (CIC) has become a hub for innovation by prototyping generative AI applications in collaboration with public sector sponsors. Importance of rapid prototyping in AI development: Rapid prototyping is critical in AI development.

AWS

AWS AI AI Database
