Artificial Intelligence, AWS and Deep Learning

Your guide to generative AI and ML at AWS re:Invent 2024

AWS Machine Learning Blog

NOVEMBER 19, 2024

The excitement is building for the fourteenth edition of AWS re:Invent, and as always, Las Vegas is set to host this spectacular event. Third, we’ll explore the robust infrastructure services from AWS powering AI innovation, featuring Amazon SageMaker , AWS Trainium , and AWS Inferentia under AI/ML, as well as Compute topics.

AWS

AWS ML ML AI

AWS Announces Generative AI Innovation Center with $100 million Investment

insideBIGDATA

JUNE 22, 2023

AWS), an Amazon.com, Inc. company (NASDAQ: AMZN), today announced the AWS Generative AI Innovation Center, a new program to help customers successfully build and deploy generative artificial intelligence (AI) solutions. Amazon Web Services, Inc.

AWS

AWS Artificial Intelligence Artificial Intelligence Machine Learning

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning Blog

DECEMBER 24, 2024

To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod. architectures/5.sagemaker-hyperpod/LifecycleScripts/base-config/

AWS

AWS Clustering Deep Learning Deep Learning

Webinars

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

MORE WEBINARS

AWS and NVIDIA Extend Collaboration to Advance Generative AI Innovation

insideBIGDATA

MARCH 22, 2024

GTC—Amazon Web Services (AWS), an Amazon.com company (NASDAQ: AMZN), and NVIDIA (NASDAQ: NVDA) today announced that the new NVIDIA Blackwell GPU platform—unveiled by NVIDIA at GTC 2024—is coming to AWS.

AWS

AWS Artificial Intelligence Artificial Intelligence AI

Top 10 AI and Data Science Trends in 2022

Analytics Vidhya

FEBRUARY 3, 2022

In this article, we shall discuss the upcoming innovations in the field of artificial intelligence, big data, machine learning and overall, Data Science Trends in 2022. Deep learning, natural language processing, and computer vision are examples […]. Times change, technology improves and our lives get better.

Data Science

Data Science Natural Language Processing Deep Learning Deep Learning

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning Blog

NOVEMBER 26, 2024

Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high performance inference and scalability. Deploy vLLM on AWS Trainium and Inferentia EC2 instances In these sections, you will be guided through using vLLM on an AWS Inferentia EC2 instance to deploy Meta’s newest Llama 3.2 You will use inf2.xlarge

AWS

AWS AI AI Artificial Intelligence

Unlocking insights and enhancing customer service: Intact’s transformative AI journey with AWS

AWS Machine Learning Blog

OCTOBER 16, 2024

The company developed an automated solution called Call Quality (CQ) using AI services from Amazon Web Services (AWS). It uses deep learning to convert audio to text quickly and accurately. To address this, Intact turned to AI and speech-to-text technology to unlock insights from calls and improve customer service.

AWS

AWS AI AI Machine Learning

Build a Search Engine: Setting Up AWS OpenSearch

Flipboard

MAY 5, 2025

Home Table of Contents Build a Search Engine: Setting Up AWS OpenSearch Introduction What Is AWS OpenSearch? What AWS OpenSearch Is Commonly Used For Key Features of AWS OpenSearch How Does AWS OpenSearch Work? Why Use AWS OpenSearch for Semantic Search? Looking for the source code to this post?

AWS

AWS Clustering Deep Learning Deep Learning

Get started quickly with AWS Trainium and AWS Inferentia using AWS Neuron DLAMI and AWS Neuron DLC

AWS Machine Learning Blog

JUNE 11, 2024

Starting with the AWS Neuron 2.18 release , you can now launch Neuron DLAMIs (AWS Deep Learning AMIs) and Neuron DLCs (AWS Deep Learning Containers) with the latest released Neuron packages on the same day as the Neuron SDK release. AWS Systems Manager Parameter Store support Neuron 2.18

AWS

AWS Deep Learning Deep Learning ML

Build a scalable AI assistant to help refugees using AWS

AWS Machine Learning Blog

JUNE 3, 2025

This post details our technical implementation using AWS services to create a scalable, multilingual AI assistant system that provides automated assistance while maintaining data security and GDPR compliance. Amazon Titan Embeddings also integrates smoothly with AWS, simplifying tasks like indexing, search, and retrieval.

AWS

AWS AI AI Machine Learning

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning Blog

MARCH 3, 2025

Hybrid architecture with AWS Local Zones To minimize the impact of network latency on TTFT for users regardless of their locations, a hybrid architecture can be implemented by extending AWS services from commercial Regions to edge locations closer to end users. Next, create a subnet inside each Local Zone. Amazon Linux 2).

AWS

AWS AI AI Deep Learning

AWS Machine Learning: A Beginner’s Guide

How to Learn Machine Learning

DECEMBER 24, 2024

If you’re diving into the world of machine learning, AWS Machine Learning provides a robust and accessible platform to turn your data science dreams into reality. Today, we’ll explore why Amazon’s cloud-based machine learning services could be your perfect starting point for building AI-powered applications.

Machine Learning

Machine Learning Machine Learning AWS ML

How to Become a Generative AI Engineer in 2025?

Towards AI

JANUARY 29, 2025

Photo by Andrea De Santis on Unsplash Artificial Intelligence (AI) has revolutionized the way we interact with technology, and Generative AI is at the forefront of this transformation. Machine Learning and Deep Learning: Supervised, Unsupervised, and Reinforcement Learning Neural Networks, CNNs, RNNs, GANs, and VAEs 4.

Deep Learning

Deep Learning Deep Learning Machine Learning Machine Learning

Accelerating large-scale neural network training on CPUs with ThirdAI and AWS Graviton

AWS Machine Learning Blog

FEBRUARY 29, 2024

Large-scale deep learning has recently produced revolutionary advances in a vast array of fields. is a startup dedicated to the mission of democratizing artificial intelligence technologies through algorithmic and software innovations that fundamentally change the economics of deep learning.

AWS

AWS Deep Learning Deep Learning ML

Open Protocols for Agent Interoperability Part 1: Inter-Agent Communication on MCP

Flipboard

MAY 19, 2025

At AWS, open standards run deep in our DNA, driving all that we do. Thats why we decided to build Amazon Elastic Cloud Compute (EC2) as a protocol-agnostic cloud computing service and Amazon SageMaker as a framework-agnostic deep learning service.

Cloud Computing

Cloud Computing Deep Learning Deep Learning AWS

Protect sensitive data in RAG applications with Amazon Bedrock

Flipboard

APRIL 23, 2025

To assist in this effort, AWS provides a range of generative AI security strategies that you can use to create appropriate threat models. For all data stored in Amazon Bedrock, the AWS shared responsibility model applies.

AWS

AWS ML ML AI

Enable Amazon Bedrock cross-Region inference in multi-account environments

AWS Machine Learning Blog

MARCH 27, 2025

Amazon Bedrock cross-Region inference capability that provides organizations with flexibility to access foundation models (FMs) across AWS Regions while maintaining optimal performance and availability. We provide practical examples for both SCP modifications and AWS Control Tower implementations.

AWS

AWS Machine Learning Machine Learning AI

Announcing New Tools for Building with Generative AI on AWS

Flipboard

APRIL 13, 2023

Prime Air (our drones) and the computer vision technology in Amazon Go (our physical retail experience that lets consumers select items off a shelf and leave the store without having to formally check out) use deep learning. AWS has the broadest and deepest portfolio of AI and ML services at all three layers of the stack.

AWS

AWS ML ML AI

Impel enhances automotive dealership customer experience with fine-tuned LLMs on Amazon SageMaker

Flipboard

JUNE 4, 2025

Impels R&D team partnered closely with various AWS teams, including its Account team, GenAI strategy team, and SageMaker service team. The tight collaboration between Impel and AWS was instrumental in realizing the full potential of Impels fine-tuned model hosted on SageMaker AI. Impels Sales AI reference architecture.

AWS

AWS ML ML AI

Speed up your AI inference workloads with new NVIDIA-powered capabilities in Amazon SageMaker

AWS Machine Learning Blog

DECEMBER 2, 2024

Today, we are introducing three key advancements that further expand our AI inference capabilities: NVIDIA NIM microservices are now available in AWS Marketplace for SageMaker Inference deployments , providing customers with easy access to state-of-the-art generative AI models. or Mixtral.

AWS

AWS AI AI Machine Learning

Auto-labeling module for deep learning-based Advanced Driver Assistance Systems on AWS

AWS Machine Learning Blog

JULY 3, 2023

It’s one of the prerequisite tasks to prepare training data to train a deep learning model. Specifically, for deep learning-based autonomous vehicle (AV) and Advanced Driver Assistance Systems (ADAS), there is a need to label complex multi-modal data from scratch, including synchronized LiDAR, RADAR, and multi-camera streams.

Deep Learning

Deep Learning Deep Learning AWS Machine Learning

Transitioning from Amazon Rekognition people pathing: Exploring other alternatives

AWS Machine Learning Blog

OCTOBER 24, 2024

Example code The following code example is a Python script that can be used as an AWS Lambda function or as part of your processing pipeline. Combined with AWS tool offerings such as AWS Lambda and Amazon SageMaker, you can implement such open source tools for your applications.

AWS

AWS Python Algorithm ML

10 Things AWS Can Do for Your SaaS Company

Smart Data Collective

FEBRUARY 20, 2022

AWS (Amazon Web Services), the comprehensive and evolving cloud computing platform provided by Amazon, is comprised of infrastructure as a service (IaaS), platform as a service (PaaS) and packaged software as a service (SaaS). With its wide array of tools and convenience, AWS has already become a popular choice for many SaaS companies.

AWS

AWS Cloud Computing Data Lakes Database

How Salesforce achieves high-performance model deployment with Amazon SageMaker AI

Flipboard

APRIL 17, 2025

This post is a joint collaboration between Salesforce and AWS and is being cross-published on both the Salesforce Engineering Blog and the AWS Machine Learning Blog. To learn more, see Revolutionizing AI: How Amazon SageMaker Enhances Einsteins Large Language Model Latency and Throughput.

AWS

AWS AI AI Machine Learning

Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models (LLMs) – Part 2

Flipboard

DECEMBER 2, 2024

About the Authors Melanie Li , PhD, is a Senior Generative AI Specialist Solutions Architect at AWS based in Sydney, Australia, where her focus is on working with customers to build solutions leveraging state-of-the-art AI and machine learning tools. Vivek Gangasani is a Senior GenAI Specialist Solutions Architect at AWS.

AWS

AWS ML ML Machine Learning

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

AWS Machine Learning Blog

NOVEMBER 1, 2024

Organizations can now label all Amazon Bedrock models with AWS cost allocation tags , aligning usage to specific organizational taxonomies such as cost centers, business units, and applications. By assigning AWS cost allocation tags, the organization can effectively monitor and track their Bedrock spend patterns.

AWS

AWS AI AI Deep Learning

Build an automated generative AI solution evaluation pipeline with Amazon Nova

Flipboard

APRIL 21, 2025

In this post, to address the aforementioned challenges, we introduce an automated evaluation framework that is deployable on AWS. We then present a typical evaluation workflow, followed by our AWS-based solution that facilitates this process. The UI service can be run locally in a Docker container or deployed to AWS Fargate.

AWS

AWS AI AI Machine Learning

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning Blog

NOVEMBER 27, 2024

Mixed Precision Training with FP8 As shown in figure below, FP8 is a datatype supported by NVIDIA’s H100 and H200 GPUs, enables efficient deep learning workloads. More details about FP8 can be found at FP8 Formats For Deep Learning. Surya Kari is a Senior Generative AI Data Scientist at AWS.

AWS

AWS Clustering ML ML

How Tealium built a chatbot evaluation platform with Ragas and Auto-Instruct using AWS generative AI services

AWS Machine Learning Blog

DECEMBER 11, 2024

In this post, we illustrate the importance of generative AI in the collaboration between Tealium and the AWS Generative AI Innovation Center (GenAIIC) team by automating the following: Evaluating the retriever and the generated answer of a RAG system based on the Ragas Repository powered by Amazon Bedrock. Create a SageMaker domain instance.

AWS

AWS AI AI Data Scientist

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

AWS Machine Learning Blog

OCTOBER 5, 2023

In this post, we walk through how to fine-tune Llama 2 on AWS Trainium , a purpose-built accelerator for LLM training, to reduce training times and costs. We review the fine-tuning scripts provided by the AWS Neuron SDK (using NeMo Megatron-LM), the various configurations we used, and the throughput results we saw.

AWS

AWS Machine Learning Machine Learning Deep Learning

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

AWS Machine Learning Blog

MAY 1, 2024

Large language models (LLMs) are making a significant impact in the realm of artificial intelligence (AI). Llama2 by Meta is an example of an LLM offered by AWS. To learn more about Llama 2 on AWS, refer to Llama 2 foundation models from Meta are now available in Amazon SageMaker JumpStart.

AWS

AWS ML ML Clustering

Build a medical imaging AI inference pipeline with MONAI Deploy on AWS

AWS Machine Learning Blog

NOVEMBER 8, 2023

AWS and NVIDIA have come together to make this vision a reality. AWS, NVIDIA, and other partners build applications and solutions to make healthcare more accessible, affordable, and efficient by accelerating cloud connectivity of enterprise imaging. AHI provides API access to ImageSet metadata and ImageFrames.

AWS

AWS AI AI ML

Build a Search Engine: Semantic Search System Using OpenSearch

PyImageSearch

MAY 19, 2025

run_opensearch.sh Running OpenSearch Locally A script to start OpenSearch using Docker for local testing before deploying to AWS. Register the Sentence Transformer model in AWS OpenSearch: AWS users must ensure that OpenSearch can access the model before indexing. These can be used for evaluation and comparison.

K-nearest Neighbors

K-nearest Neighbors AWS Deep Learning Deep Learning

Your guide to generative AI and ML at AWS re:Invent 2023

AWS Machine Learning Blog

NOVEMBER 22, 2023

Yes, the AWS re:Invent season is upon us and as always, the place to be is Las Vegas! Now all you need is some guidance on generative AI and machine learning (ML) sessions to attend at this twelfth edition of re:Invent. are the sessions dedicated to AWS DeepRacer ! Generative AI is at the heart of the AWS Village this year.

AWS

AWS ML ML AI

Easily deploy and manage hundreds of LoRA adapters with SageMaker efficient multi-adapter inference

AWS Machine Learning Blog

NOVEMBER 29, 2024

For example, marketing and software as a service (SaaS) companies can personalize artificial intelligence and machine learning (AI/ML) applications using each of their customer’s images, art style, communication style, and documents to create campaigns and artifacts that represent them. _region_name sm_client = boto3.client(service_name='sagemaker')

AWS

AWS ML ML Machine Learning

Frugality meets Accuracy: Cost-efficient training of GPT NeoX and Pythia models with AWS Trainium

AWS Machine Learning Blog

DECEMBER 12, 2023

In this post, we’ll summarize training procedure of GPT NeoX on AWS Trainium , a purpose-built machine learning (ML) accelerator optimized for deep learning training. M tokens/$) trained such models with AWS Trainium without losing any model quality. We’ll outline how we cost-effectively (3.2 billion in Pythia.

AWS

AWS Machine Learning Machine Learning Deep Learning

Develop and train large models cost-efficiently with Metaflow and AWS Trainium

AWS Machine Learning Blog

APRIL 29, 2024

For AWS and Outerbounds customers, the goal is to build a differentiated machine learning and artificial intelligence (ML/AI) system and reliably improve it over time. First, the AWS Trainium accelerator provides a high-performance, cost-effective, and readily available solution for training and fine-tuning large models.

AWS

AWS ML ML Python

Achieve high performance with lowest cost for generative AI inference using AWS Inferentia2 and AWS Trainium on Amazon SageMaker

AWS Machine Learning Blog

MAY 4, 2023

The world of artificial intelligence (AI) and machine learning (ML) has been witnessing a paradigm shift with the rise of generative AI models that can create human-like text, images, code, and audio. For the full list with versions, see Available Deep Learning Containers Images. petaflops for BF16/FP16.

AWS

AWS Deep Learning Deep Learning ML

Creating an artificial intelligence 101

Dataconomy

MARCH 13, 2023

How to create an artificial intelligence? The creation of artificial intelligence (AI) has long been a dream of scientists, engineers, and innovators. With advances in machine learning, deep learning, and natural language processing, the possibilities of what we can create with AI are limitless.

Artificial Intelligence

Artificial Intelligence Artificial Intelligence Natural Language Processing Algorithm

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

AWS Machine Learning Blog

JANUARY 17, 2024

Today, we’re excited to announce the availability of Llama 2 inference and fine-tuning support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. In this post, we demonstrate how to deploy and fine-tune Llama 2 on Trainium and AWS Inferentia instances in SageMaker JumpStart.

AWS

AWS Python Machine Learning Machine Learning

Building an efficient MLOps platform with OSS tools on Amazon ECS with AWS Fargate

AWS Machine Learning Blog

SEPTEMBER 18, 2024

The ZMP analyzes billions of structured and unstructured data points to predict consumer intent by using sophisticated artificial intelligence (AI) to personalize experiences at scale. Zeta’s AI innovation is powered by a proprietary machine learning operations (MLOps) system, developed in-house.

AWS

AWS Machine Learning Machine Learning ML

Llama 3.3 70B now available in Amazon SageMaker JumpStart

AWS Machine Learning Blog

DECEMBER 16, 2024

Melanie Li , PhD, is a Senior Generative AI Specialist Solutions Architect at AWS based in Sydney, Australia, where her focus is on working with customers to build solutions leveraging state-of-the-art AI and machine learning tools. Prior to joining AWS, Dr. Li held data science roles in the financial and retail industries.

AWS

AWS ML ML Python

Build high-performance ML models using PyTorch 2.0 on AWS – Part 1

AWS Machine Learning Blog

JUNE 6, 2023

PyTorch is a machine learning (ML) framework that is widely used by AWS customers for a variety of applications, such as computer vision, natural language processing, content creation, and more. release, AWS customers can now do same things as they could with PyTorch 1.x 24xlarge with AWS PyTorch 2.0 on AWS PyTorch2.0

AWS

AWS ML ML Deep Learning

Accelerate deep learning model training up to 35% with Amazon SageMaker smart sifting

AWS Machine Learning Blog

NOVEMBER 29, 2023

In today’s rapidly evolving landscape of artificial intelligence, deep learning models have found themselves at the forefront of innovation, with applications spanning computer vision (CV), natural language processing (NLP), and recommendation systems. If not, refer to Using the SageMaker Python SDK before continuing.

Deep Learning

Deep Learning Deep Learning Natural Language Processing Python

Your guide to generative AI and ML at AWS re:Invent 2024

AWS Announces Generative AI Innovation Center with $100 million Investment

Webinars

Trending Sources

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

Webinars

AWS and NVIDIA Extend Collaboration to Advance Generative AI Innovation

Top 10 AI and Data Science Trends in 2022

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

Unlocking insights and enhancing customer service: Intact’s transformative AI journey with AWS

Build a Search Engine: Setting Up AWS OpenSearch

Get started quickly with AWS Trainium and AWS Inferentia using AWS Neuron DLAMI and AWS Neuron DLC

Build a scalable AI assistant to help refugees using AWS

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning: A Beginner’s Guide

How to Become a Generative AI Engineer in 2025?

Accelerating large-scale neural network training on CPUs with ThirdAI and AWS Graviton

Open Protocols for Agent Interoperability Part 1: Inter-Agent Communication on MCP

Protect sensitive data in RAG applications with Amazon Bedrock

Enable Amazon Bedrock cross-Region inference in multi-account environments

Announcing New Tools for Building with Generative AI on AWS

Impel enhances automotive dealership customer experience with fine-tuned LLMs on Amazon SageMaker

Speed up your AI inference workloads with new NVIDIA-powered capabilities in Amazon SageMaker

Auto-labeling module for deep learning-based Advanced Driver Assistance Systems on AWS

Transitioning from Amazon Rekognition people pathing: Exploring other alternatives

10 Things AWS Can Do for Your SaaS Company

How Salesforce achieves high-performance model deployment with Amazon SageMaker AI

Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models (LLMs) – Part 2

Track, allocate, and manage your generative AI cost and usage with Amazon Bedrock

Build an automated generative AI solution evaluation pipeline with Amazon Nova

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

How Tealium built a chatbot evaluation platform with Ragas and Auto-Instruct using AWS generative AI services

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

Build a medical imaging AI inference pipeline with MONAI Deploy on AWS

Build a Search Engine: Semantic Search System Using OpenSearch

Your guide to generative AI and ML at AWS re:Invent 2023

Easily deploy and manage hundreds of LoRA adapters with SageMaker efficient multi-adapter inference

Frugality meets Accuracy: Cost-efficient training of GPT NeoX and Pythia models with AWS Trainium

Develop and train large models cost-efficiently with Metaflow and AWS Trainium

Achieve high performance with lowest cost for generative AI inference using AWS Inferentia2 and AWS Trainium on Amazon SageMaker

Creating an artificial intelligence 101

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

Building an efficient MLOps platform with OSS tools on Amazon ECS with AWS Fargate

Llama 3.3 70B now available in Amazon SageMaker JumpStart

Build high-performance ML models using PyTorch 2.0 on AWS – Part 1

Accelerate deep learning model training up to 35% with Amazon SageMaker smart sifting

Stay Connected