If you’re diving into the world of machine learning, AWS Machine Learning provides a robust and accessible platform to turn your data science dreams into reality. Whether you’re a solo developer or part of a large enterprise, AWS provides scalable solutions that grow with your needs.
Let’s assume the question is “What date will AWS re:Invent 2024 occur?” The corresponding answer is input as “AWS re:Invent 2024 takes place on December 2–6, 2024.” Now suppose the question is “What’s the schedule for AWS events in December?” This setup uses the AWS SDK for Python (Boto3) to interact with AWS services.
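The excerpt doesn’t show the setup itself, so here is a minimal Boto3 sketch assuming Amazon Bedrock is the backing model service; the region, model ID, and prompt are placeholders for illustration, not the article’s actual configuration.

```python
import json

import boto3

# Hypothetical example: region, model ID, and question are assumptions.
bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

body = json.dumps({
    "anthropic_version": "bedrock-2023-05-31",
    "max_tokens": 256,
    "messages": [
        {"role": "user", "content": "What date will AWS re:Invent 2024 occur?"}
    ],
})

response = bedrock.invoke_model(
    modelId="anthropic.claude-3-sonnet-20240229-v1:0",  # placeholder model ID
    body=body,
)
print(json.loads(response["body"].read())["content"][0]["text"])
```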
It seems straightforward at first for batch data, but the engineering gets even more complicated when you need to go from batch data to incorporating real-time and streaming data sources, and from batch inference to real-time serving. You can also find Tecton at AWS re:Invent.
Data engineers build data pipelines, also called data integration tasks or jobs, as incremental steps to perform data operations, and orchestrate these data pipelines in an overall workflow. Organizations can harness the full potential of their data while reducing risk and lowering costs.
It is used by businesses across industries for a wide range of applications, including fraud prevention, marketing automation, customer service, artificial intelligence (AI), chatbots, virtual assistants, and recommendations. Amazon SageMaker also has a CLI for model creation and management.
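The same model-creation operation behind the CLI (`aws sagemaker create-model`) is exposed through Boto3. A minimal sketch, assuming a container image and model artifact already exist; every name and ARN below is a placeholder:

```python
import boto3

sagemaker = boto3.client("sagemaker")

# Equivalent to `aws sagemaker create-model` on the CLI; all values are placeholders.
sagemaker.create_model(
    ModelName="demo-model",
    PrimaryContainer={
        "Image": "123456789012.dkr.ecr.us-east-1.amazonaws.com/demo:latest",
        "ModelDataUrl": "s3://my-bucket/model/model.tar.gz",
    },
    ExecutionRoleArn="arn:aws:iam::123456789012:role/SageMakerExecutionRole",
)
```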
In an era where cloud technology is not just an option but a necessity for competitive business operations, the collaboration between Precisely and Amazon Web Services (AWS) has set a new benchmark for mainframe and IBM i modernization. Precisely brings data integrity to the AWS cloud.
For more information about distributed training with SageMaker, refer to the AWS re:Invent 2020 video Fast training and near-linear scaling with DataParallel in Amazon SageMaker and The science behind Amazon SageMaker’s distributed-training engines. In a later post, we will do a deep dive into the DNNs used by ADAS systems.
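For context, enabling SageMaker’s distributed data parallel library is largely a matter of estimator configuration. A minimal sketch, assuming a PyTorch training script `train.py`, a placeholder execution role, and an S3 training path; versions and instance type are assumptions:

```python
from sagemaker.pytorch import PyTorch

# Sketch only: entry point, role ARN, versions, and S3 paths are placeholders.
estimator = PyTorch(
    entry_point="train.py",
    role="arn:aws:iam::123456789012:role/SageMakerExecutionRole",
    framework_version="1.13",
    py_version="py39",
    instance_count=2,
    instance_type="ml.p4d.24xlarge",  # the library requires specific GPU instance types
    distribution={"smdistributed": {"dataparallel": {"enabled": True}}},
)
estimator.fit({"training": "s3://my-bucket/train/"})
```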
Each platform offers unique capabilities tailored to varying needs, making the choice of platform a critical decision for any Data Science project. Major Cloud Platforms for Data Science: Amazon Web Services (AWS), Microsoft Azure, and Google Cloud Platform (GCP) dominate the cloud market with their comprehensive offerings.
Consider the following picture, which is an AWS view of the a16z emerging application stack for large language models (LLMs). This pipeline could be a batch pipeline if you prepare contextual data in advance, or a low-latency pipeline if you’re incorporating new contextual data on the fly.
On December 6–8, 2023, the non-profit organization Tech to the Rescue, in collaboration with AWS, organized the world’s largest Air Quality Hackathon, aimed at tackling one of the world’s most pressing health and environmental challenges: air pollution. As always, AWS welcomes your feedback.
In reviewing best practices for your AWS cloud migration, it’s crucial to define your business case first, and work from there. Migrating to AWS can unlock incredible value for your business, but it requires careful planning, risk management, and the right technical and organizational strategies.
SageMaker Feature Store now makes it effortless to share, discover, and access feature groups across AWS accounts. With this launch, account owners can grant other accounts access to select feature groups using AWS Resource Access Manager (AWS RAM). Their task is to construct and oversee efficient data pipelines.
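Cross-account sharing goes through AWS RAM. A minimal Boto3 sketch of creating a resource share; the ARN, account IDs, and share name are hypothetical, and the exact resource ARN to share depends on your Feature Store setup:

```python
import boto3

ram = boto3.client("ram")

# All ARNs and account IDs below are placeholders.
ram.create_resource_share(
    name="feature-group-share",
    resourceArns=[
        "arn:aws:sagemaker:us-east-1:111122223333:feature-group/customers"
    ],
    principals=["444455556666"],  # consumer account ID
)
```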
In this post, we will talk about how BMW Group, in collaboration with AWS Professional Services, built its Jupyter Managed (JuMa) service to address these challenges. For example, teams using these platforms lacked an easy path to migrate their AI/ML prototypes into industrialized solutions running on AWS.
Generative artificial intelligence (generative AI) has enabled new possibilities for building intelligent systems. Given the data sources, LLMs provided tools that would allow us to build a Q&A chatbot in weeks, rather than what may have taken years previously, and likely with worse performance.
AWS recently released Amazon SageMaker geospatial capabilities to provide you with satellite imagery and geospatial state-of-the-art machine learning (ML) models, reducing barriers for these types of use cases. In the following sections, we dive into each pipeline in more detail.
Whether logs are coming from Amazon Web Services (AWS), other cloud providers, on-premises, or edge devices, customers need to centralize and standardize security data. After the security log data is stored in Amazon Security Lake, the question becomes how to analyze it. Subscribe an AWS Lambda function to the SQS queue.
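Wiring a Lambda function to the queue is a single `create_event_source_mapping` call. A sketch with placeholder queue ARN and function name:

```python
import boto3

lambda_client = boto3.client("lambda")

# Queue ARN and function name are placeholders.
lambda_client.create_event_source_mapping(
    EventSourceArn="arn:aws:sqs:us-east-1:111122223333:security-lake-queue",
    FunctionName="process-security-logs",
    BatchSize=10,
)
```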
As one of the largest AWS customers, Twilio engages with data, artificial intelligence (AI), and machine learning (ML) services to run their daily workloads. Data is the foundational layer for all generative AI and ML applications. Access to Amazon Bedrock FMs isn’t granted by default.
This approach can help stroke patients, doctors, and researchers with faster diagnosis, enriched decision-making, and more informed, inclusive research work on stroke-related health issues, using a cloud-native approach with AWS services for lightweight lift and straightforward adoption. Stroke victims can lose around 1.9 million neurons for each minute treatment is delayed.
Building a deployment pipeline for generative artificial intelligence (AI) applications at scale is a formidable challenge because of the complexities and unique requirements of these systems. Generative AI applications require continuous ingestion, preprocessing, and formatting of vast amounts of data from various sources.
In this post, we discuss how to bring data stored in Amazon DocumentDB into SageMaker Canvas and use that data to build ML models for predictive analytics. Without creating and maintaining data pipelines, you will be able to power ML models with your unstructured data stored in Amazon DocumentDB.
AWS is especially well suited to provide enterprises the tools necessary for deploying LLMs at scale to enable critical decision-making. In their implementation of generative AI technology, enterprises have real concerns about data exposure and ownership of confidential information that may be sent to LLMs.
This process significantly benefits from the MLOps features of SageMaker, which streamline the data science workflow by harnessing the powerful cloud infrastructure of AWS. The following diagram illustrates the inference pipeline.
Big Data Technologies: Handling and processing large datasets using tools like Hadoop, Spark, and cloud platforms such as AWS and Google Cloud. Data Processing and Analysis: Techniques for data cleaning, manipulation, and analysis using libraries such as Pandas and NumPy in Python.
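As a quick illustration of the kind of cleaning step meant here; the file and column names are made up:

```python
import numpy as np
import pandas as pd

df = pd.read_csv("events.csv")  # hypothetical input file

df = df.drop_duplicates()
df["amount"] = pd.to_numeric(df["amount"], errors="coerce")   # bad values become NaN
df["amount"] = df["amount"].fillna(df["amount"].median())     # impute missing values
df["region"] = df["region"].str.strip().str.lower()           # normalize text categories
df["log_amount"] = np.log1p(df["amount"])                     # variance-stabilizing transform
```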
Cloud Computing, APIs, and Data Engineering: NLP experts don’t go straight into conducting sentiment analysis on their personal laptops. Data Engineering Platforms: Spark is still the leader for data pipelines, but other platforms are gaining ground. Google Cloud is starting to make a name for itself as well.
Instead, businesses tend to rely on advanced tools and strategies, namely artificial intelligence for IT operations (AIOps) and machine learning operations (MLOps), to turn vast quantities of data into actionable insights that can improve IT decision-making and, ultimately, the bottom line.
Give the features a try and send us feedback either through the AWS forum for Amazon Comprehend or through your usual AWS support contacts. About the Authors: Aman Tiwari is a General Solutions Architect working with Worldwide Commercial Sales at AWS.
With the emergence of cloud hyperscalers like AWS, Google, and Microsoft, the shift to the cloud has accelerated significantly. Instead of performing major surgery on their critical business systems, enterprises are opting for real-time data integration built around inherently reliable and scalable change data capture (CDC) technology.
Build and Run Data Pipelines with SageMaker Pipelines, by Jake Teo: This article shows how to run long-running, repetitive, centrally managed, and traceable data pipelines leveraging AWS’s MLOps platform, SageMaker, and its underlying services, SageMaker Pipelines and Studio.
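The article’s pipeline isn’t reproduced here, but the general shape of a SageMaker Pipelines definition looks like the following sketch; the script name, role ARN, and instance type are assumptions:

```python
from sagemaker.sklearn.processing import SKLearnProcessor
from sagemaker.workflow.pipeline import Pipeline
from sagemaker.workflow.steps import ProcessingStep

role = "arn:aws:iam::123456789012:role/SageMakerExecutionRole"  # placeholder

processor = SKLearnProcessor(
    framework_version="1.2-1",
    role=role,
    instance_type="ml.m5.xlarge",
    instance_count=1,
)

# A single processing step running a hypothetical preprocess.py script
step = ProcessingStep(name="Preprocess", processor=processor, code="preprocess.py")

pipeline = Pipeline(name="demo-pipeline", steps=[step])
pipeline.upsert(role_arn=role)   # create or update the pipeline definition
pipeline.start()                 # kick off a traceable execution
```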
Introduction: In the rapidly evolving field of Artificial Intelligence, datasets like the Pile play a pivotal role in training models to understand and generate human-like text. The dataset is openly accessible, making it a go-to resource for researchers and developers in Artificial Intelligence.
This field is often referred to as explainable artificial intelligence (XAI). Amazon SageMaker Clarify is a feature of Amazon SageMaker that enables data scientists and ML engineers to explain the predictions of their ML models. Solution overview: SageMaker algorithms have fixed input and output data formats.
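A condensed sketch of what running a SHAP explainability job with Clarify can look like; the role, data paths, model name, and baseline values are all placeholders, and a real job needs them matched to your dataset and deployed model:

```python
from sagemaker import clarify

processor = clarify.SageMakerClarifyProcessor(
    role="arn:aws:iam::123456789012:role/SageMakerExecutionRole",  # placeholder
    instance_count=1,
    instance_type="ml.m5.xlarge",
)

data_config = clarify.DataConfig(
    s3_data_input_path="s3://my-bucket/input.csv",     # placeholder dataset
    s3_output_path="s3://my-bucket/clarify-output/",
    label="target",
    dataset_type="text/csv",
)
model_config = clarify.ModelConfig(
    model_name="demo-model",                           # placeholder deployed model
    instance_type="ml.m5.xlarge",
    instance_count=1,
)
shap_config = clarify.SHAPConfig(
    baseline=[[0, 0, 0]],  # placeholder baseline row matching the feature count
    num_samples=100,
    agg_method="mean_abs",
)

processor.run_explainability(
    data_config=data_config,
    model_config=model_config,
    explainability_config=shap_config,
)
```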
Data engineers are essential professionals responsible for designing, constructing, and maintaining an organization’s data infrastructure. They create data pipelines, ETL processes, and databases to facilitate smooth data flow and storage. Big Data Processing: Apache Hadoop, Apache Spark, etc.
What if every decision, recommendation, and prediction made by artificial intelligence (AI) was as reliable as your most trusted team members? Next, you’ll see valuable AI use cases and how data integrity powers success. Technology-driven insights and capabilities depend on trusted data.
Das Kamhout, VP and Senior Principal Engineer of the Cloud and Enterprise Solutions Group at Intel. Watsonx.data supports our customers’ increasing needs around hybrid cloud deployments and is available on premises and across multiple cloud providers, including IBM Cloud and Amazon Web Services (AWS).
Cloud Services The only two to make multiple lists were Amazon Web Services (AWS) and Microsoft Azure. Most major companies are using one of the two, so excelling in one or the other will help any aspiring data scientist. Saturn Cloud is picking up a lot of momentum lately too thanks to its scalability.
It supports batch and real-time data processing, making it a preferred choice for large enterprises with complex data workflows. Informatica’s AI-powered automation helps streamline data pipelines and improve operational efficiency. AWS Glue: AWS Glue is a fully managed ETL service provided by Amazon Web Services.
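For orientation, a minimal Glue ETL script has roughly this shape; the database, table, and bucket names are placeholders:

```python
import sys

from awsglue.context import GlueContext
from awsglue.job import Job
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext

args = getResolvedOptions(sys.argv, ["JOB_NAME"])
glue_context = GlueContext(SparkContext())
job = Job(glue_context)
job.init(args["JOB_NAME"], args)

# Read a table from the Glue Data Catalog (names are placeholders)
orders = glue_context.create_dynamic_frame.from_catalog(
    database="sales_db", table_name="orders"
)

# Write curated output to S3 as Parquet
glue_context.write_dynamic_frame.from_options(
    frame=orders,
    connection_type="s3",
    connection_options={"path": "s3://my-bucket/curated/orders/"},
    format="parquet",
)
job.commit()
```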
If you’re in the market for a data integration solution, there are many things to consider – including the flexibility of integration solutions, the availability of a strong network of service providers, and the vendor’s reputation for thought leadership in the integration space.
The release of ChatGPT in late 2022 introduced generative artificial intelligence to the general public and triggered a new wave of AI-oriented companies, products, and open-source projects that provide tools and frameworks to enable enterprise AI.
It supports both batch and real-time data processing, making it highly versatile. Its ability to integrate with cloud platforms like AWS and Azure makes it an excellent choice for businesses moving to the cloud. Apache NiFi: Apache NiFi is an open-source ETL tool that automates data flow between systems.
Apache Kafka: For data engineers dealing with real-time data, Apache Kafka is a game-changer. This open-source streaming platform enables the handling of high-throughput data feeds, ensuring that data pipelines are efficient, reliable, and capable of handling massive volumes of data in real time.
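A minimal producer using the kafka-python client shows the idea; the broker address and topic name are assumptions:

```python
import json

from kafka import KafkaProducer  # kafka-python client

producer = KafkaProducer(
    bootstrap_servers="localhost:9092",  # placeholder broker address
    value_serializer=lambda v: json.dumps(v).encode("utf-8"),
)

# Publish a click event to a hypothetical topic
producer.send("clickstream", {"user_id": 42, "action": "page_view"})
producer.flush()  # block until the message is delivered
```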
Today, all leading CSPs, including Amazon Web Services (AWS Lambda), Microsoft Azure (Azure Functions) and IBM (IBM Cloud Code Engine) offer serverless platforms. Specifically, serverless helps enable something called event-driven AI, where a constant flow of intelligence informs real-time decision-making capabilities.
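The event-driven pattern reduces to a handler that the platform invokes per event. A minimal AWS Lambda sketch in Python, assuming an SQS trigger for the event shape:

```python
import json

def handler(event, context):
    """Invoked by AWS Lambda for each batch of SQS messages; no servers to manage."""
    for record in event.get("Records", []):
        payload = json.loads(record["body"])
        # ... run real-time decision logic on the payload ...
        print(payload)
    return {"statusCode": 200}
```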
This individual is responsible for building and maintaining the infrastructure that stores and processes data; the data can be diverse, spanning both structured and unstructured formats. They’ll also work with software engineers to ensure that the data infrastructure is scalable and reliable.
Improved Decision-making: By providing a consolidated and accessible view of data, organisations can identify trends, patterns, and anomalies more quickly, leading to better-informed and timely decisions. Data Ingestion Tools: To facilitate the process, various tools and technologies are available.
Automation: Automation plays a pivotal role in streamlining ETL processes, reducing the need for manual intervention, and ensuring consistent data availability. By automating key tasks, organisations can enhance efficiency and accuracy, ultimately improving the quality of their data pipelines.
By versioning datasets in the same way we version code, data teams can experiment, roll back changes, and merge data pipelines safely, all without duplicating data or slowing down operations.
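Tools like DVC and lakeFS implement this idea; the core of it can be sketched in a few lines of plain Python. The manifest format and function below are illustrative only, not any specific tool’s API:

```python
import datetime
import hashlib
import json
import pathlib

def snapshot(dataset_path: str, manifest: str = "data_versions.json") -> str:
    """Record a content hash for a dataset file, git-style, so any version can be pinned."""
    digest = hashlib.sha256(pathlib.Path(dataset_path).read_bytes()).hexdigest()
    manifest_path = pathlib.Path(manifest)
    entries = json.loads(manifest_path.read_text()) if manifest_path.exists() else []
    entries.append({
        "path": dataset_path,
        "sha256": digest,
        "recorded_at": datetime.datetime.now(datetime.timezone.utc).isoformat(),
    })
    manifest_path.write_text(json.dumps(entries, indent=2))
    return digest
```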