AWS, Deep Learning and ML - Data Science Current

Your guide to generative AI and ML at AWS re:Invent 2024

AWS Machine Learning Blog

NOVEMBER 19, 2024

The excitement is building for the fourteenth edition of AWS re:Invent, and as always, Las Vegas is set to host this spectacular event. Third, we’ll explore the robust infrastructure services from AWS powering AI innovation, featuring Amazon SageMaker , AWS Trainium , and AWS Inferentia under AI/ML, as well as Compute topics.

AWS

AWS ML ML AI

Enhanced observability for AWS Trainium and AWS Inferentia with Datadog

AWS Machine Learning Blog

NOVEMBER 26, 2024

Neuron is the SDK used to run deep learning workloads on Trainium and Inferentia based instances. AWS AI chips, Trainium and Inferentia, enable you to build and deploy generative AI models at higher performance and lower cost. To get started, see AWS Inferentia and AWS Trainium Monitoring.

AWS

AWS ML ML Data Pipeline

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning Blog

DECEMBER 24, 2024

To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod. architectures/5.sagemaker-hyperpod/LifecycleScripts/base-config/

AWS

AWS Clustering Deep Learning Deep Learning

Webinars

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

How to Modernize Manufacturing Without Losing Control

Mastering Apache Airflow® 3.0: What’s New (and What’s Next) for Data Orchestration

MORE WEBINARS

AWS Announces Generative AI Innovation Center with $100 million Investment

insideBIGDATA

JUNE 22, 2023

AWS), an Amazon.com, Inc. company (NASDAQ: AMZN), today announced the AWS Generative AI Innovation Center, a new program to help customers successfully build and deploy generative artificial intelligence (AI) solutions. Amazon Web Services, Inc.

AWS

AWS Artificial Intelligence Artificial Intelligence Machine Learning

Deploy Meta Llama 3.1 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

AWS Machine Learning Blog

NOVEMBER 25, 2024

8B and 70B inference support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. Trainium and Inferentia, enabled by the AWS Neuron software development kit (SDK), offer high performance and lower the cost of deploying Meta Llama 3.1 An AWS Identity and Access Management (IAM) role to access SageMaker.

AWS

AWS Python ML ML

Unlocking insights and enhancing customer service: Intact’s transformative AI journey with AWS

AWS Machine Learning Blog

OCTOBER 16, 2024

The company developed an automated solution called Call Quality (CQ) using AI services from Amazon Web Services (AWS). It uses deep learning to convert audio to text quickly and accurately. To address this, Intact turned to AI and speech-to-text technology to unlock insights from calls and improve customer service.

AWS

AWS AI AI Machine Learning

Accelerate your ML lifecycle using the new and improved Amazon SageMaker Python SDK – Part 1: ModelTrainer

AWS Machine Learning Blog

DECEMBER 12, 2024

The new SDK is designed with a tiered user experience in mind, where the new lower-level SDK ( SageMaker Core ) provides access to full breadth of SageMaker features and configurations, allowing for greater flexibility and control for ML engineers. Admins and users can also overwrite the defaults using the SDK defaults configuration file.

ML

ML ML Python AWS

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning Blog

NOVEMBER 26, 2024

Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high performance inference and scalability. Deploy vLLM on AWS Trainium and Inferentia EC2 instances In these sections, you will be guided through using vLLM on an AWS Inferentia EC2 instance to deploy Meta’s newest Llama 3.2 You will use inf2.xlarge

AWS

AWS AI AI Artificial Intelligence

Get started quickly with AWS Trainium and AWS Inferentia using AWS Neuron DLAMI and AWS Neuron DLC

AWS Machine Learning Blog

JUNE 11, 2024

Starting with the AWS Neuron 2.18 release , you can now launch Neuron DLAMIs (AWS Deep Learning AMIs) and Neuron DLCs (AWS Deep Learning Containers) with the latest released Neuron packages on the same day as the Neuron SDK release. PyTorch 1.13, Transformers NeuronX, and TensorFlow 2.10.

AWS

AWS Deep Learning Deep Learning ML

Fine-tune and host SDXL models cost-effectively with AWS Inferentia2

AWS Machine Learning Blog

FEBRUARY 6, 2025

We show how to then prepare the fine-tuned model to run on AWS Inferentia2 powered Amazon EC2 Inf2 instances , unlocking superior price performance for your inference workloads. After the model is fine-tuned, you can compile and host the fine-tuned SDXL on Inf2 instances using the AWS Neuron SDK. An Amazon Web Services (AWS) account.

AWS

AWS Machine Learning Machine Learning Deep Learning

Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models (LLMs) – Part 2

Flipboard

DECEMBER 2, 2024

To learn more about the ModelBuilder class, refer to Package and deploy classical ML and LLMs easily with Amazon SageMaker, part 1: PySDK Improvements. Prior to joining AWS, Dr. Li held data science roles in the financial and retail industries. Raghu Ramesha is an ML Solutions Architect with the Amazon SageMaker Service team.

AWS

AWS ML ML Machine Learning

Protect sensitive data in RAG applications with Amazon Bedrock

Flipboard

APRIL 23, 2025

To assist in this effort, AWS provides a range of generative AI security strategies that you can use to create appropriate threat models. For all data stored in Amazon Bedrock, the AWS shared responsibility model applies.

AWS

AWS ML ML AI

Your guide to generative AI and ML at AWS re:Invent 2023

AWS Machine Learning Blog

NOVEMBER 22, 2023

Yes, the AWS re:Invent season is upon us and as always, the place to be is Las Vegas! Now all you need is some guidance on generative AI and machine learning (ML) sessions to attend at this twelfth edition of re:Invent. This year, learn about LLMOps, not just MLOps! are the sessions dedicated to AWS DeepRacer !

AWS

AWS ML ML AI

Map Earth’s vegetation in under 20 minutes with Amazon SageMaker

AWS Machine Learning Blog

OCTOBER 16, 2024

Amazon SageMaker supports geospatial machine learning (ML) capabilities, allowing data scientists and ML engineers to build, train, and deploy ML models using geospatial data. SageMaker Processing provisions cluster resources for you to run city-, country-, or continent-scale geospatial ML workloads.

ML

ML ML Clustering Machine Learning

AWS Machine Learning: A Beginner’s Guide

How to Learn Machine Learning

DECEMBER 24, 2024

If you’re diving into the world of machine learning, AWS Machine Learning provides a robust and accessible platform to turn your data science dreams into reality. Today, we’ll explore why Amazon’s cloud-based machine learning services could be your perfect starting point for building AI-powered applications.

Machine Learning

Machine Learning Machine Learning AWS ML

Announcing New Tools for Building with Generative AI on AWS

Flipboard

APRIL 13, 2023

The seeds of a machine learning (ML) paradigm shift have existed for decades, but with the ready availability of scalable compute capacity, a massive proliferation of data, and the rapid advancement of ML technologies, customers across industries are transforming their businesses.

AWS

AWS ML ML AI

Harmonize data using AWS Glue and AWS Lake Formation FindMatches ML to build a customer 360 view

Flipboard

JUNE 26, 2023

These techniques utilize various machine learning (ML) based approaches. In this post, we look at how we can use AWS Glue and the AWS Lake Formation ML transform FindMatches to harmonize (deduplicate) customer data coming from different sources to get a complete customer profile to be able to provide better customer experience.

AWS

AWS ML ML ETL

Llama 3.3 70B now available in Amazon SageMaker JumpStart

AWS Machine Learning Blog

DECEMBER 16, 2024

Getting started with SageMaker JumpStart SageMaker JumpStart is a machine learning (ML) hub that can help accelerate your ML journey. About the authors Marc Karp is an ML Architect with the Amazon SageMaker Service team. He focuses on helping customers design, deploy, and manage ML workloads at scale.

AWS

AWS ML ML Python

Accelerating AI/ML development at BMW Group with Amazon SageMaker Studio

Flipboard

NOVEMBER 24, 2023

With that, the need for data scientists and machine learning (ML) engineers has grown significantly. Data scientists and ML engineers require capable tooling and sufficient compute for their work. Data scientists and ML engineers require capable tooling and sufficient compute for their work.

ML

ML ML AWS AI

Unlock cost savings with the new scale down to zero feature in SageMaker Inference

Flipboard

DECEMBER 2, 2024

Today at AWS re:Invent 2024, we are excited to announce a new feature for Amazon SageMaker inference endpoints: the ability to scale SageMaker inference endpoints to zero instances. This long-awaited capability is a game changer for our customers using the power of AI and machine learning (ML) inference in the cloud.

ML

ML ML AWS Machine Learning

Reduce energy consumption of your machine learning workloads by up to 90% with AWS purpose-built accelerators

Flipboard

JUNE 20, 2023

Machine learning (ML) engineers have traditionally focused on striking a balance between model training and deployment cost vs. performance. This is important because training ML models and then using the trained models to make predictions (inference) can be highly energy-intensive tasks.

AWS

AWS Machine Learning Machine Learning ML

Speed up your AI inference workloads with new NVIDIA-powered capabilities in Amazon SageMaker

AWS Machine Learning Blog

DECEMBER 2, 2024

Today, we are introducing three key advancements that further expand our AI inference capabilities: NVIDIA NIM microservices are now available in AWS Marketplace for SageMaker Inference deployments , providing customers with easy access to state-of-the-art generative AI models. or Mixtral.

AWS

AWS AI AI Machine Learning

How to Become a Generative AI Engineer in 2025?

Towards AI

JANUARY 29, 2025

Generative AI is powered by advanced machine learning techniques, particularly deep learning and neural networks, such as Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs). Programming Languages: Python (most widely used in AI/ML) R, Java, or C++ (optional but useful) 2.

Deep Learning

Deep Learning Deep Learning Machine Learning Machine Learning

Rad AI reduces real-time inference latency by 50% using Amazon SageMaker

AWS Machine Learning Blog

NOVEMBER 26, 2024

Challenges in deploying advanced ML models in healthcare Rad AI, being an AI-first company, integrates machine learning (ML) models across various functions—from product development to customer success, from novel research to internal applications. Rad AI’s ML organization tackles this challenge on two fronts.

ML

ML ML AI AI

AI/ML-driven actionable insights and themes for Amazon third-party sellers using AWS

Flipboard

MARCH 7, 2023

This post presents a solution that uses a workflow and AWS AI and machine learning (ML) services to provide actionable insights based on those transcripts. We use multiple AWS AI/ML services, such as Contact Lens for Amazon Connect and Amazon SageMaker , and utilize a combined architecture.

ML

ML ML AWS AI

Transitioning from Amazon Rekognition people pathing: Exploring other alternatives

AWS Machine Learning Blog

OCTOBER 24, 2024

Amazon Rekognition people pathing is a machine learning (ML)–based capability of Amazon Rekognition Video that users can use to understand where, when, and how each person is moving in a video. Example code The following code example is a Python script that can be used as an AWS Lambda function or as part of your processing pipeline.

AWS

AWS Python Algorithm ML

Intuitivo achieves higher throughput while saving on AI/ML costs using AWS Inferentia and PyTorch

AWS Machine Learning Blog

OCTOBER 26, 2023

Intuitivo, a pioneer in retail innovation, is revolutionizing shopping with its cloud-based AI and machine learning (AI/ML) transactional processing system. Unlocking high-performance and cost-effective inference using AWS Inferentia As retailers look to scale operations, cost of A-POPs becomes a consideration.

AWS

AWS ML ML AI

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

AWS Machine Learning Blog

DECEMBER 4, 2024

At AWS re:Invent 2024, we are excited to introduce Amazon Bedrock Marketplace. Through Bedrock Marketplace, organizations can use Nemotron’s advanced capabilities while benefiting from the scalable infrastructure of AWS and NVIDIA’s robust technologies. Marc Karp is an ML Architect with the Amazon SageMaker Service team.

AWS

AWS Machine Learning Machine Learning AI

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning Blog

DECEMBER 2, 2024

Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. This feature is only supported when using inference components.

AI

AI AI AWS Machine Learning

Easily deploy and manage hundreds of LoRA adapters with SageMaker efficient multi-adapter inference

AWS Machine Learning Blog

NOVEMBER 29, 2024

For example, marketing and software as a service (SaaS) companies can personalize artificial intelligence and machine learning (AI/ML) applications using each of their customer’s images, art style, communication style, and documents to create campaigns and artifacts that represent them. For details, refer to Create an AWS account.

AWS

AWS ML ML Machine Learning

Accelerating large-scale neural network training on CPUs with ThirdAI and AWS Graviton

AWS Machine Learning Blog

FEBRUARY 29, 2024

Large-scale deep learning has recently produced revolutionary advances in a vast array of fields. is a startup dedicated to the mission of democratizing artificial intelligence technologies through algorithmic and software innovations that fundamentally change the economics of deep learning. Founded in 2021, ThirdAI Corp.

AWS

AWS Deep Learning Deep Learning ML

How Booking.com modernized its ML experimentation framework with Amazon SageMaker

AWS Machine Learning Blog

FEBRUARY 12, 2024

Sharing in-house resources with other internal teams, the Ranking team machine learning (ML) scientists often encountered long wait times to access resources for model training and experimentation – challenging their ability to rapidly experiment and innovate. If it shows online improvement, it can be deployed to all the users.

ML

ML ML AWS Machine Learning

Accelerate NLP inference with ONNX Runtime on AWS Graviton processors

AWS Machine Learning Blog

MAY 15, 2024

ONNX is an open source machine learning (ML) framework that provides interoperability across a wide range of frameworks, operating systems, and hardware platforms. AWS Graviton3 processors are optimized for ML workloads, including support for bfloat16, Scalable Vector Extension (SVE), and Matrix Multiplication (MMLA) instructions.

AWS

AWS Natural Language Processing Python Deep Learning

Auto-labeling module for deep learning-based Advanced Driver Assistance Systems on AWS

AWS Machine Learning Blog

JULY 3, 2023

It’s one of the prerequisite tasks to prepare training data to train a deep learning model. Specifically, for deep learning-based autonomous vehicle (AV) and Advanced Driver Assistance Systems (ADAS), there is a need to label complex multi-modal data from scratch, including synchronized LiDAR, RADAR, and multi-camera streams.

Deep Learning

Deep Learning Deep Learning AWS Machine Learning

Build an automated generative AI solution evaluation pipeline with Amazon Nova

Flipboard

APRIL 21, 2025

In this post, to address the aforementioned challenges, we introduce an automated evaluation framework that is deployable on AWS. We then present a typical evaluation workflow, followed by our AWS-based solution that facilitates this process. The UI service can be run locally in a Docker container or deployed to AWS Fargate.

AWS

AWS AI AI Machine Learning

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

AWS Machine Learning Blog

NOVEMBER 27, 2024

Mixed Precision Training with FP8 As shown in figure below, FP8 is a datatype supported by NVIDIA’s H100 and H200 GPUs, enables efficient deep learning workloads. More details about FP8 can be found at FP8 Formats For Deep Learning. supports the Llama 3.1 (and and prior Llama models), Mixtral, and Mistral.

AWS

AWS Clustering ML ML

Scale and simplify ML workload monitoring on Amazon EKS with AWS Neuron Monitor container

AWS Machine Learning Blog

JUNE 25, 2024

Amazon Web Services is excited to announce the launch of the AWS Neuron Monitor container , an innovative tool designed to enhance the monitoring capabilities of AWS Inferentia and AWS Trainium chips on Amazon Elastic Kubernetes Service (Amazon EKS).

AWS

AWS ML ML Clustering

Train and deploy ML models in a multicloud environment using Amazon SageMaker

AWS Machine Learning Blog

SEPTEMBER 20, 2023

For example, you might have acquired a company that was already running on a different cloud provider, or you may have a workload that generates value from unique capabilities provided by AWS. We show how you can build and train an ML model in AWS and deploy the model in another platform.

ML

ML ML Azure AWS

Develop and train large models cost-efficiently with Metaflow and AWS Trainium

AWS Machine Learning Blog

APRIL 29, 2024

For AWS and Outerbounds customers, the goal is to build a differentiated machine learning and artificial intelligence (ML/AI) system and reliably improve it over time. First, the AWS Trainium accelerator provides a high-performance, cost-effective, and readily available solution for training and fine-tuning large models.

AWS

AWS ML ML Python

Announcing new Jupyter contributions by AWS to democratize generative AI and scale ML workloads

AWS Machine Learning Blog

MAY 10, 2023

Project Jupyter is a multi-stakeholder, open-source project that builds applications, open standards, and tools for data science, machine learning (ML), and computational science. Given the importance of Jupyter to data scientists and ML developers, AWS is an active sponsor and contributor to Project Jupyter.

ML

ML ML AWS AI

How Carrier predicts HVAC faults using AWS Glue and Amazon SageMaker

AWS Machine Learning Blog

SEPTEMBER 5, 2023

In order to improve our equipment reliability, we partnered with the Amazon Machine Learning Solutions Lab to develop a custom machine learning (ML) model capable of predicting equipment issues prior to failure. We first highlight how we use AWS Glue for highly parallel data processing.

AWS

AWS ML ML Machine Learning

Reinventing a cloud-native federated learning architecture on AWS

AWS Machine Learning Blog

OCTOBER 10, 2023

Machine learning (ML), especially deep learning, requires a large amount of data for improving model performance. Customers often need to train a model with data from different regions, organizations, or AWS accounts. Existing partner open-source FL solutions on AWS include FedML and NVIDIA FLARE.

AWS

AWS ML ML Algorithm

Fine-tune Llama 2 using QLoRA and Deploy it on Amazon SageMaker with AWS Inferentia2

AWS Machine Learning Blog

DECEMBER 13, 2023

In this post, we showcase fine-tuning a Llama 2 model using a Parameter-Efficient Fine-Tuning (PEFT) method and deploy the fine-tuned model on AWS Inferentia2. We use the AWS Neuron software development kit (SDK) to access the AWS Inferentia2 device and benefit from its high performance.

AWS

AWS Machine Learning Machine Learning Deep Learning

10 Things AWS Can Do for Your SaaS Company

Smart Data Collective

FEBRUARY 20, 2022

AWS (Amazon Web Services), the comprehensive and evolving cloud computing platform provided by Amazon, is comprised of infrastructure as a service (IaaS), platform as a service (PaaS) and packaged software as a service (SaaS). With its wide array of tools and convenience, AWS has already become a popular choice for many SaaS companies.

AWS

AWS Cloud Computing Data Lakes Database

Your guide to generative AI and ML at AWS re:Invent 2024

Enhanced observability for AWS Trainium and AWS Inferentia with Datadog

Webinars

Trending Sources

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

Webinars

AWS Announces Generative AI Innovation Center with $100 million Investment

Deploy Meta Llama 3.1 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

Unlocking insights and enhancing customer service: Intact’s transformative AI journey with AWS

Accelerate your ML lifecycle using the new and improved Amazon SageMaker Python SDK – Part 1: ModelTrainer

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

Get started quickly with AWS Trainium and AWS Inferentia using AWS Neuron DLAMI and AWS Neuron DLC

Fine-tune and host SDXL models cost-effectively with AWS Inferentia2

Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models (LLMs) – Part 2

Protect sensitive data in RAG applications with Amazon Bedrock

Your guide to generative AI and ML at AWS re:Invent 2023

Map Earth’s vegetation in under 20 minutes with Amazon SageMaker

AWS Machine Learning: A Beginner’s Guide

Announcing New Tools for Building with Generative AI on AWS

Harmonize data using AWS Glue and AWS Lake Formation FindMatches ML to build a customer 360 view

Llama 3.3 70B now available in Amazon SageMaker JumpStart

Accelerating AI/ML development at BMW Group with Amazon SageMaker Studio

Unlock cost savings with the new scale down to zero feature in SageMaker Inference

Reduce energy consumption of your machine learning workloads by up to 90% with AWS purpose-built accelerators

Speed up your AI inference workloads with new NVIDIA-powered capabilities in Amazon SageMaker

How to Become a Generative AI Engineer in 2025?

Rad AI reduces real-time inference latency by 50% using Amazon SageMaker

AI/ML-driven actionable insights and themes for Amazon third-party sellers using AWS

Transitioning from Amazon Rekognition people pathing: Exploring other alternatives

Intuitivo achieves higher throughput while saving on AI/ML costs using AWS Inferentia and PyTorch

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

Easily deploy and manage hundreds of LoRA adapters with SageMaker efficient multi-adapter inference

Accelerating large-scale neural network training on CPUs with ThirdAI and AWS Graviton

How Booking.com modernized its ML experimentation framework with Amazon SageMaker

Accelerate NLP inference with ONNX Runtime on AWS Graviton processors

Auto-labeling module for deep learning-based Advanced Driver Assistance Systems on AWS

Build an automated generative AI solution evaluation pipeline with Amazon Nova

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

Scale and simplify ML workload monitoring on Amazon EKS with AWS Neuron Monitor container

Train and deploy ML models in a multicloud environment using Amazon SageMaker

Develop and train large models cost-efficiently with Metaflow and AWS Trainium

Announcing new Jupyter contributions by AWS to democratize generative AI and scale ML workloads

How Carrier predicts HVAC faults using AWS Glue and Amazon SageMaker

Reinventing a cloud-native federated learning architecture on AWS

Fine-tune Llama 2 using QLoRA and Deploy it on Amazon SageMaker with AWS Inferentia2

10 Things AWS Can Do for Your SaaS Company

Stay Connected