AWS, Deep Learning and Download - Data Science Current

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning Blog

DECEMBER 24, 2024

To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod. architectures/5.sagemaker-hyperpod/LifecycleScripts/base-config/

AWS

AWS Clustering Deep Learning Deep Learning

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning Blog

MARCH 3, 2025

Hybrid architecture with AWS Local Zones To minimize the impact of network latency on TTFT for users regardless of their locations, a hybrid architecture can be implemented by extending AWS services from commercial Regions to edge locations closer to end users. Next, create a subnet inside each Local Zone. Amazon Linux 2).

AWS

AWS AI AI Deep Learning

Accelerate NLP inference with ONNX Runtime on AWS Graviton processors

AWS Machine Learning Blog

MAY 15, 2024

ONNX is an open source machine learning (ML) framework that provides interoperability across a wide range of frameworks, operating systems, and hardware platforms. AWS Graviton3 processors are optimized for ML workloads, including support for bfloat16, Scalable Vector Extension (SVE), and Matrix Multiplication (MMLA) instructions.

AWS

AWS Natural Language Processing Python Deep Learning

Webinars

How to Achieve High-Accuracy Results When Using LLMs

MORE WEBINARS

Achieve multi-Region resiliency for your conversational AI chatbots with Amazon Lex

AWS Machine Learning Blog

OCTOBER 30, 2024

Global Resiliency is a new Amazon Lex capability that enables near real-time replication of your Amazon Lex V2 bots in a second AWS Region. Additionally, we discuss how to handle integrations with AWS Lambda and Amazon CloudWatch after enabling Global Resiliency. We walk through the instructions to replicate the bot later in this post.

AWS

AWS AI AI Natural Language Processing

Auto-labeling module for deep learning-based Advanced Driver Assistance Systems on AWS

AWS Machine Learning Blog

JULY 3, 2023

It’s one of the prerequisite tasks to prepare training data to train a deep learning model. Specifically, for deep learning-based autonomous vehicle (AV) and Advanced Driver Assistance Systems (ADAS), there is a need to label complex multi-modal data from scratch, including synchronized LiDAR, RADAR, and multi-camera streams.

Deep Learning

Deep Learning Deep Learning AWS Machine Learning

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

AWS Machine Learning Blog

OCTOBER 5, 2023

In this post, we walk through how to fine-tune Llama 2 on AWS Trainium , a purpose-built accelerator for LLM training, to reduce training times and costs. We review the fine-tuning scripts provided by the AWS Neuron SDK (using NeMo Megatron-LM), the various configurations we used, and the throughput results we saw.

AWS

AWS Machine Learning Machine Learning Deep Learning

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

AWS Machine Learning Blog

MAY 1, 2024

Llama2 by Meta is an example of an LLM offered by AWS. To learn more about Llama 2 on AWS, refer to Llama 2 foundation models from Meta are now available in Amazon SageMaker JumpStart. Virginia) and US West (Oregon) AWS Regions, and most recently announced general availability in the US East (Ohio) Region.

AWS

AWS ML ML Clustering

Manage your Amazon Lex bot via AWS CloudFormation templates

AWS Machine Learning Blog

APRIL 16, 2024

It employs advanced deep learning technologies to understand user input, enabling developers to create chatbots, virtual assistants, and other applications that can interact with users in natural language. Version control – With AWS CloudFormation, you can use version control systems like Git to manage your CloudFormation templates.

AWS

AWS Deep Learning Deep Learning Artificial Intelligence

Frugality meets Accuracy: Cost-efficient training of GPT NeoX and Pythia models with AWS Trainium

AWS Machine Learning Blog

DECEMBER 12, 2023

In this post, we’ll summarize training procedure of GPT NeoX on AWS Trainium , a purpose-built machine learning (ML) accelerator optimized for deep learning training. M tokens/$) trained such models with AWS Trainium without losing any model quality. We’ll outline how we cost-effectively (3.2 billion in Pythia.

AWS

AWS Machine Learning Machine Learning Deep Learning

Process formulas and charts with Anthropic’s Claude on Amazon Bedrock

AWS Machine Learning Blog

MARCH 21, 2025

AWS Lambda AWS Lambda is a compute service that runs code in response to triggers such as changes in data, changes in application state, or user actions. Prerequisites If youre new to AWS, you first need to create and set up an AWS account. We download the documents and store them under a samples folder locally.

AWS

AWS AI AI Data Scientist

Accelerated PyTorch inference with torch.compile on AWS Graviton processors

AWS Machine Learning Blog

JULY 2, 2024

AWS optimized the PyTorch torch.compile feature for AWS Graviton3 processors. the optimizations are available in torch Python wheels and AWS Graviton PyTorch deep learning container (DLC). The goal for the AWS Graviton team was to optimize torch.compile backend for Graviton3 processors.

AWS

AWS Natural Language Processing Python ML

Reinventing a cloud-native federated learning architecture on AWS

AWS Machine Learning Blog

OCTOBER 10, 2023

Machine learning (ML), especially deep learning, requires a large amount of data for improving model performance. Customers often need to train a model with data from different regions, organizations, or AWS accounts. Federated learning (FL) is a distributed ML approach that trains ML models on distributed datasets.

AWS

AWS ML ML Algorithm

Achieve high performance with lowest cost for generative AI inference using AWS Inferentia2 and AWS Trainium on Amazon SageMaker

AWS Machine Learning Blog

MAY 4, 2023

AWS has been innovating with purpose-built chips to address the growing need for powerful, efficient, and cost-effective compute hardware. You can use ml.trn1 and ml.inf2 compatible AWS Deep Learning Containers (DLCs) for PyTorch, TensorFlow, Hugging Face, and large model inference (LMI) to easily get started.

AWS

AWS Deep Learning Deep Learning ML

Optimized PyTorch 2.0 inference with AWS Graviton processors

AWS Machine Learning Blog

MAY 3, 2023

AWS, Arm, Meta and others helped optimize the performance of PyTorch 2.0 As a result, we are delighted to announce that AWS Graviton-based instance inference performance for PyTorch 2.0 times the speed for BERT, making Graviton-based instances the fastest compute optimized instances on AWS for these models. is up to 3.5

AWS

AWS Cloud Computing Python Machine Learning

Build high-performance ML models using PyTorch 2.0 on AWS – Part 1

AWS Machine Learning Blog

JUNE 6, 2023

PyTorch is a machine learning (ML) framework that is widely used by AWS customers for a variety of applications, such as computer vision, natural language processing, content creation, and more. release, AWS customers can now do same things as they could with PyTorch 1.x 24xlarge with AWS PyTorch 2.0 on AWS PyTorch2.0

AWS

AWS ML ML Deep Learning

Harmonize data using AWS Glue and AWS Lake Formation FindMatches ML to build a customer 360 view

Flipboard

JUNE 26, 2023

These techniques utilize various machine learning (ML) based approaches. In this post, we look at how we can use AWS Glue and the AWS Lake Formation ML transform FindMatches to harmonize (deduplicate) customer data coming from different sources to get a complete customer profile to be able to provide better customer experience.

AWS

AWS ML ML ETL

Map Earth’s vegetation in under 20 minutes with Amazon SageMaker

AWS Machine Learning Blog

OCTOBER 16, 2024

With these hyperlinks, we can bypass traditional memory and storage-intensive methods of first downloading and subsequently processing images locally—a task made even more daunting by the size and scale of our dataset, spanning over 4 TB. See Amazon SageMaker geospatial capabilities to learn more. He is an ACM Fellow and IEEE Fellow.

ML

ML ML Clustering Machine Learning

Pre-training genomic language models using AWS HealthOmics and Amazon SageMaker

AWS Machine Learning Blog

MAY 31, 2024

In this blog post and open source project , we show you how you can pre-train a genomics language model, HyenaDNA , using your genomic data in the AWS Cloud. Amazon SageMaker Amazon SageMaker is a fully managed ML service offered by AWS, designed to reduce the time and cost associated with training and tuning ML models at scale.

AWS

AWS ML ML Machine Learning

Sprinklr improves performance by 20% and reduces cost by 25% for machine learning inference on AWS Graviton3

AWS Machine Learning Blog

JUNE 11, 2024

In this post, we describe the scale of our AI offerings, the challenges with diverse AI workloads, and how we optimized mixed AI workload inference performance with AWS Graviton3 based c7g instances and achieved 20% throughput improvement, 30% latency reduction, and reduced our cost by 25–30%.

Machine Learning

Machine Learning Machine Learning AWS Natural Language Processing

Maximize Stable Diffusion performance and lower inference costs with AWS Inferentia2

AWS Machine Learning Blog

JULY 26, 2023

In this post, we show how you can run Stable Diffusion models and achieve high performance at the lowest cost in Amazon Elastic Compute Cloud (Amazon EC2) using Amazon EC2 Inf2 instances powered by AWS Inferentia2. versions on AWS Inferentia2 cost-effectively. You can run both Stable Diffusion 2.1 The Stable Diffusion 2.1

AWS

AWS Deep Learning Deep Learning ML

How to extend the functionality of AWS Trainium with custom operators

AWS Machine Learning Blog

APRIL 27, 2023

Deep learning (DL) is a fast-evolving field, and practitioners are constantly innovating DL models and inventing ways to speed them up. Custom operators are one of the mechanisms developers use to push the boundaries of DL innovation by extending the functionality of existing machine learning (ML) frameworks such as PyTorch.

AWS

AWS Deep Learning Deep Learning ML

Scaling Large Language Model (LLM) training with Amazon EC2 Trn1 UltraClusters

Flipboard

FEBRUARY 16, 2023

In October 2022, we launched Amazon EC2 Trn1 Instances , powered by AWS Trainium , which is the second generation machine learning accelerator designed by AWS. Trn1 instances are purpose built for high-performance deep learning model training while offering up to 50% cost-to-train savings over comparable GPU-based instances.

Clustering

Clustering AWS Deep Learning Deep Learning

Deploy DeepSeek-R1 distilled models on Amazon SageMaker using a Large Model Inference container

AWS Machine Learning Blog

MARCH 11, 2025

By integrating this model with Amazon SageMaker AI , you can benefit from the AWS scalable infrastructure while maintaining high-quality language model capabilities. Solution overview You can use DeepSeeks distilled models within the AWS managed machine learning (ML) infrastructure. For details, refer to Create an AWS account.

AWS

AWS ML ML Natural Language Processing

Train and deploy ML models in a multicloud environment using Amazon SageMaker

AWS Machine Learning Blog

SEPTEMBER 20, 2023

For example, you might have acquired a company that was already running on a different cloud provider, or you may have a workload that generates value from unique capabilities provided by AWS. We show how you can build and train an ML model in AWS and deploy the model in another platform.

ML

ML ML Azure AWS

Analyze Amazon SageMaker spend and determine cost optimization opportunities based on usage, Part 4: Training jobs

AWS Machine Learning Blog

MAY 30, 2023

In 2021, we launched AWS Support Proactive Services as part of the AWS Enterprise Support plan. Since its introduction, we’ve helped hundreds of customers optimize their workloads, set guardrails, and improve the visibility of their machine learning (ML) workloads’ cost and usage.

AWS

AWS Deep Learning Deep Learning ML

Streamline diarization using AI as an assistive technology: ZOO Digital’s story

AWS Machine Learning Blog

FEBRUARY 20, 2024

With an aim to accelerate the localization of content workflows through machine learning, ZOO Digital engaged AWS Prototyping, an investment program by AWS to co-build workloads with customers. This S3 bucket was configured to emit an event when new files are detected within it, triggering an AWS Lambda function.

AWS

AWS AI AI Machine Learning

GenASL: Generative AI-powered American Sign Language avatars

AWS Machine Learning Blog

AUGUST 26, 2024

AWS makes it possible for organizations of all sizes and developers of all skill levels to build and scale generative AI applications with security, privacy, and responsible AI. In this post, we dive into the architecture and implementation details of GenASL, which uses AWS generative AI capabilities to create human-like ASL avatar videos.

AWS

AWS AI AI ML

Create a web UI to interact with LLMs using Amazon SageMaker JumpStart

AWS Machine Learning Blog

DECEMBER 12, 2023

The launch of ChatGPT and rise in popularity of generative AI have captured the imagination of customers who are curious about how they can use this technology to create new products and services on AWS, such as enterprise chatbots, which are more conversational. Optionally, deploy the application using AWS Amplify.

AWS

AWS ML ML AI

Scale LLMs with PyTorch 2.0 FSDP on Amazon EKS – Part 2

AWS Machine Learning Blog

APRIL 1, 2024

Machine learning (ML) research has proven that large language models (LLMs) trained with significantly large datasets result in better model quality. Solution overview In this post, we set up a compute cluster using Amazon EKS, which is a managed service to run Kubernetes in the AWS Cloud and on-premises data centers.

Clustering

Clustering AWS ML ML

Deploy Falcon-40B with large model inference DLCs on Amazon SageMaker

AWS Machine Learning Blog

JUNE 13, 2023

In this post, we demonstrate how to deploy Falcon for applications like language understanding and automated writing assistance using large model inference deep learning containers on SageMaker. SageMaker large model inference (LMI) deep learning containers (DLCs) can help.

Deep Learning

Deep Learning Deep Learning AWS Machine Learning

How LotteON built a personalized recommendation system using Amazon SageMaker and MLOps

AWS Machine Learning Blog

MAY 16, 2024

Therefore, we decided to introduce a deep learning-based recommendation algorithm that can identify not only linear relationships in the data, but also more complex relationships. However, it was necessary to upgrade the recommendation service to analyze each customer’s taste and meet their needs.

AWS

AWS ML ML Deep Learning

Boost inference performance for LLMs with new Amazon SageMaker containers

AWS Machine Learning Blog

NOVEMBER 27, 2023

of Large Model Inference (LMI) Deep Learning Containers (DLCs) and adds support for NVIDIA’s TensorRT-LLM Library. This file contains the required configurations for the Deep Java Library (DJL) model server to download and host the model. Qing Lan is a Software Development Engineer in AWS.

AWS

AWS Deep Learning Deep Learning Machine Learning

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

AWS Machine Learning Blog

FEBRUARY 12, 2025

Prerequisites To build the solution yourself, there are the following prerequisites: You need an AWS account with an AWS Identity and Access Management (IAM) role that has permissions to manage resources created as part of the solution (for example AmazonSageMakerFullAccess and AmazonS3FullAccess ).

AI

AI AI AWS SQL

Deploy a Hugging Face (PyAnnote) speaker diarization model on Amazon SageMaker as an asynchronous endpoint

AWS Machine Learning Blog

APRIL 25, 2024

We provide a comprehensive guide on how to deploy speaker segmentation and clustering solutions using SageMaker on the AWS Cloud. Solution overview Amazon Transcribe is the go-to service for speaker diarization in AWS. Hugging Face is a popular open source hub for machine learning (ML) models.

AWS

AWS ML ML Python

Generate unique images by fine-tuning Stable Diffusion XL with Amazon SageMaker

AWS Machine Learning Blog

JULY 8, 2024

Stable Diffusion XL by Stability AI is a high-quality text-to-image deep learning model that allows you to generate professional-looking images in various styles. AWS CodeCommit is a fully managed source control service that hosts private Git repositories. Kohya SS can be used with a GUI.

AWS

AWS ML ML AI

Build brand loyalty by recommending actions to your users with Amazon Personalize Next Best Action

AWS Machine Learning Blog

NOVEMBER 26, 2023

Amazon Personalize is excited to announce the new Next Best Action ( aws-next-best-action ) recipe to help you determine the best actions to suggest to your individual users that will enable you to increase brand loyalty and conversion.

AWS

AWS ML ML Machine Learning

Boost inference performance for Mixtral and Llama 2 models with new Amazon SageMaker containers

AWS Machine Learning Blog

APRIL 8, 2024

of Large Model Inference (LMI) Deep Learning Containers (DLCs). LMI-Distributed backend At AWS re:Invent 2023, LMI-Dist added new, optimized collective operations to speed up communication between GPUs, resulting in lower latency and higher throughput for models that are too big for a single GPU.

AWS

AWS Deep Learning Deep Learning Machine Learning

Deploy large models at high performance using FasterTransformer on Amazon SageMaker

AWS Machine Learning Blog

APRIL 17, 2023

However, as the size and complexity of the deep learning models that power generative AI continue to grow, deployment can be a challenging task. Then, we highlight how Amazon SageMaker large model inference deep learning containers (LMI DLCs) can help with optimization and deployment.

AWS

AWS Deep Learning Deep Learning Machine Learning

Revolutionize logo design creation with Amazon Bedrock: Embracing generative art, dynamic logos, and AI collaboration

AWS Machine Learning Blog

SEPTEMBER 18, 2024

Integrating it with the range of AWS serverless computing, networking, and content delivery services like AWS Lambda , Amazon API Gateway , and AWS Amplify facilitates the creation of an interactive tool to generate dynamic, responsive, and adaptive logos. We recommend using the us-east-1 Obtain access to the Stability SDXL 1.0

AWS

AWS AI AI ML

Accelerate PyTorch with DeepSpeed to train large language models with Intel Habana Gaudi-based DL1 EC2 instances

AWS Machine Learning Blog

JUNE 7, 2023

Libraries such as DeepSpeed (an open-source deep learning optimization library for PyTorch) address some of these challenges, and can help accelerate model development and training. Training setup We provisioned a managed compute cluster comprised of 16 dl1.24xlarge instances using AWS Batch.

AWS

AWS Clustering Deep Learning Deep Learning

Build ultra-low latency multimodal generative AI applications using sticky session routing in Amazon

AWS Machine Learning Blog

SEPTEMBER 13, 2024

Multimodal is a type of deep learning using multiple modalities of data, such as text, audio, or images. This feature is available in all AWS Regions where SageMaker is available. To learn more about deploying models on SageMaker, see Amazon SageMaker Model Deployment. gpu-py311-cu121-ubuntu20.04-sagemaker.

AWS

AWS AI AI Deep Learning

Build ML features at scale with Amazon SageMaker Feature Store using data from Amazon Redshift

Flipboard

AUGUST 17, 2023

Amazon Redshift uses SQL to analyze structured and semi-structured data across data warehouses, operational databases, and data lakes, using AWS-designed hardware and ML to deliver the best price-performance at any scale. Prerequisites To continue with the examples in this post, you need to create the required AWS resources.

ML

ML ML AWS Data Warehouse

Automatically redact PII for machine learning using Amazon SageMaker Data Wrangler

AWS Machine Learning Blog

OCTOBER 19, 2023

Customers increasingly want to use deep learning approaches such as large language models (LLMs) to automate the extraction of data and insights. For many industries, data that is useful for machine learning (ML) may contain personally identifiable information (PII). Download the SageMaker Data Wrangler flow.

Machine Learning

Machine Learning Machine Learning ML ML

Get started with the open-source Amazon SageMaker Distribution

AWS Machine Learning Blog

JUNE 8, 2023

Data scientists need a consistent and reproducible environment for machine learning (ML) and data science workloads that enables managing dependencies and is secure. AWS Deep Learning Containers already provides pre-built Docker images for training and serving models in common frameworks such as TensorFlow, PyTorch, and MXNet.

AWS

AWS ML ML Data Scientist

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

Reduce conversational AI response time through inference at the edge with AWS Local Zones

Webinars

Trending Sources

Accelerate NLP inference with ONNX Runtime on AWS Graviton processors

Webinars

Achieve multi-Region resiliency for your conversational AI chatbots with Amazon Lex

Auto-labeling module for deep learning-based Advanced Driver Assistance Systems on AWS

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

Manage your Amazon Lex bot via AWS CloudFormation templates

Frugality meets Accuracy: Cost-efficient training of GPT NeoX and Pythia models with AWS Trainium

Process formulas and charts with Anthropic’s Claude on Amazon Bedrock

Accelerated PyTorch inference with torch.compile on AWS Graviton processors

Reinventing a cloud-native federated learning architecture on AWS

Achieve high performance with lowest cost for generative AI inference using AWS Inferentia2 and AWS Trainium on Amazon SageMaker

Optimized PyTorch 2.0 inference with AWS Graviton processors

Build high-performance ML models using PyTorch 2.0 on AWS – Part 1

Harmonize data using AWS Glue and AWS Lake Formation FindMatches ML to build a customer 360 view

Map Earth’s vegetation in under 20 minutes with Amazon SageMaker

Pre-training genomic language models using AWS HealthOmics and Amazon SageMaker

Sprinklr improves performance by 20% and reduces cost by 25% for machine learning inference on AWS Graviton3

Maximize Stable Diffusion performance and lower inference costs with AWS Inferentia2

How to extend the functionality of AWS Trainium with custom operators

Scaling Large Language Model (LLM) training with Amazon EC2 Trn1 UltraClusters

Deploy DeepSeek-R1 distilled models on Amazon SageMaker using a Large Model Inference container

Train and deploy ML models in a multicloud environment using Amazon SageMaker

Analyze Amazon SageMaker spend and determine cost optimization opportunities based on usage, Part 4: Training jobs

Streamline diarization using AI as an assistive technology: ZOO Digital’s story

GenASL: Generative AI-powered American Sign Language avatars

Create a web UI to interact with LLMs using Amazon SageMaker JumpStart

Scale LLMs with PyTorch 2.0 FSDP on Amazon EKS – Part 2

Deploy Falcon-40B with large model inference DLCs on Amazon SageMaker

How LotteON built a personalized recommendation system using Amazon SageMaker and MLOps

Boost inference performance for LLMs with new Amazon SageMaker containers

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

Deploy a Hugging Face (PyAnnote) speaker diarization model on Amazon SageMaker as an asynchronous endpoint

Generate unique images by fine-tuning Stable Diffusion XL with Amazon SageMaker

Build brand loyalty by recommending actions to your users with Amazon Personalize Next Best Action

Boost inference performance for Mixtral and Llama 2 models with new Amazon SageMaker containers

Deploy large models at high performance using FasterTransformer on Amazon SageMaker

Revolutionize logo design creation with Amazon Bedrock: Embracing generative art, dynamic logos, and AI collaboration

Accelerate PyTorch with DeepSpeed to train large language models with Intel Habana Gaudi-based DL1 EC2 instances

Build ultra-low latency multimodal generative AI applications using sticky session routing in Amazon

Build ML features at scale with Amazon SageMaker Feature Store using data from Amazon Redshift

Automatically redact PII for machine learning using Amazon SageMaker Data Wrangler

Get started with the open-source Amazon SageMaker Distribution

Stay Connected