Neuron is the SDK used to run deep learning workloads on Trainium- and Inferentia-based instances. AWS AI chips, Trainium and Inferentia, enable you to build and deploy generative AI models at higher performance and lower cost. To get started, see AWS Inferentia and AWS Trainium Monitoring.
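As an illustration, here is a minimal sketch of compiling a PyTorch model with the Neuron SDK's torch-neuronx package; the ResNet-50 model and input shape are placeholders, not taken from the post.

```python
import torch
import torch_neuronx
from torchvision.models import resnet50

# Load a model in eval mode; any traceable PyTorch model works.
model = resnet50(weights=None).eval()
example = torch.rand(1, 3, 224, 224)

# Compile the model for Neuron (Trainium/Inferentia) with a fixed input shape.
neuron_model = torch_neuronx.trace(model, example)

# Save the compiled artifact for later loading with torch.jit.load.
torch.jit.save(neuron_model, "resnet50_neuron.pt")
```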
The excitement is building for the fourteenth edition of AWS re:Invent, and as always, Las Vegas is set to host this spectacular event. We'll also explore the robust infrastructure services from AWS powering AI innovation, featuring Amazon SageMaker, AWS Trainium, and AWS Inferentia under AI/ML, as well as Compute topics.
Amazon Web Services, Inc. (AWS), an Amazon.com, Inc. company (NASDAQ: AMZN), today announced the AWS Generative AI Innovation Center, a new program to help customers successfully build and deploy generative artificial intelligence (AI) solutions.
To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine-tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod. architectures/5.sagemaker-hyperpod/LifecycleScripts/base-config/
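The post's exact HyperPod and Trainium configuration isn't shown here, but a typical PEFT setup with the Hugging Face peft library looks roughly like this sketch; the model ID and LoRA hyperparameters are illustrative assumptions.

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

# Hypothetical model ID; the post fine-tunes a Meta Llama 3 model.
model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")

# LoRA trains small adapter matrices instead of the full weights.
lora_config = LoraConfig(
    r=16,                                  # adapter rank
    lora_alpha=32,                         # scaling factor
    target_modules=["q_proj", "v_proj"],   # attention projections to adapt
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

peft_model = get_peft_model(model, lora_config)
peft_model.print_trainable_parameters()  # typically under 1% of total parameters
```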
To address this, Intact turned to AI and speech-to-text technology to unlock insights from calls and improve customer service. The company developed an automated solution called Call Quality (CQ) using AI services from Amazon Web Services (AWS). It uses deep learning to convert audio to text quickly and accurately.
Starting with the AWS Neuron 2.18 release, you can now launch Neuron DLAMIs (AWS Deep Learning AMIs) and Neuron DLCs (AWS Deep Learning Containers) with the latest released Neuron packages on the same day as the Neuron SDK release. These include support for PyTorch 1.13, Transformers NeuronX, and TensorFlow 2.10.
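One way to locate the latest Neuron DLAMI programmatically is a boto3 image lookup; the name filter below is an assumption and may need adjusting to the exact DLAMI family you want.

```python
import boto3

ec2 = boto3.client("ec2", region_name="us-east-1")

# The name filter is an assumption; adjust it to the DLAMI variant you need.
response = ec2.describe_images(
    Owners=["amazon"],
    Filters=[{"Name": "name", "Values": ["Deep Learning AMI Neuron*"]}],
)

# Pick the most recently created image (CreationDate is an ISO timestamp).
latest = max(response["Images"], key=lambda img: img["CreationDate"])
print(latest["ImageId"], latest["Name"])
```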
The new SDK is designed with a tiered user experience in mind: the new lower-level SDK (SageMaker Core) provides access to the full breadth of SageMaker features and configurations, allowing for greater flexibility and control for ML engineers. Admins and users can also overwrite the defaults using the SDK defaults configuration file.
We show how to prepare the fine-tuned model to run on AWS Inferentia2 powered Amazon EC2 Inf2 instances, unlocking superior price performance for your inference workloads. After the model is fine-tuned, you can compile and host the fine-tuned SDXL on Inf2 instances using the AWS Neuron SDK. Prerequisites include an Amazon Web Services (AWS) account.
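As a hedged sketch of the compile-and-host step, the optimum-neuron library wraps the Neuron SDK for SDXL (it may not be the exact route the post takes); the model ID, static shapes, and output paths are illustrative.

```python
from optimum.neuron import NeuronStableDiffusionXLPipeline

# Compile SDXL for Neuron; static shapes must be fixed at export time.
pipe = NeuronStableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    export=True,
    batch_size=1,
    height=1024,
    width=1024,
)
pipe.save_pretrained("sdxl_neuron/")  # reload later without recompiling

image = pipe(prompt="a scenic mountain lake at sunrise").images[0]
image.save("out.png")
```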
If you’re diving into the world of machine learning, AWS Machine Learning provides a robust and accessible platform to turn your data science dreams into reality. Today, we’ll explore why Amazon’s cloud-based machine learning services could be your perfect starting point for building AI-powered applications.
Yes, the AWS re:Invent season is upon us and as always, the place to be is Las Vegas! Now all you need is some guidance on generative AI and machine learning (ML) sessions to attend at this twelfth edition of re:Invent. This year, learn about LLMOps, not just MLOps! Also back are the sessions dedicated to AWS DeepRacer!
These techniques utilize various machine learning (ML) based approaches. In this post, we look at how we can use AWS Glue and the AWS Lake Formation ML transform FindMatches to harmonize (deduplicate) customer data coming from different sources to get a complete customer profile to be able to provide better customer experience.
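For reference, a FindMatches transform can be created with boto3 roughly as follows; the database, table, role ARN, and threshold are hypothetical placeholders.

```python
import boto3

glue = boto3.client("glue")

# Names, role, and thresholds here are hypothetical placeholders.
response = glue.create_ml_transform(
    Name="customer-dedup-findmatches",
    Role="arn:aws:iam::123456789012:role/GlueFindMatchesRole",
    InputRecordTables=[{
        "DatabaseName": "customers_db",
        "TableName": "raw_customers",
    }],
    Parameters={
        "TransformType": "FIND_MATCHES",
        "FindMatchesParameters": {
            "PrimaryKeyColumnName": "customer_id",
            # Trade precision against recall for the matching model.
            "PrecisionRecallTradeoff": 0.9,
        },
    },
    GlueVersion="2.0",
)
print(response["TransformId"])
```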
Amazon SageMaker supports geospatial machine learning (ML) capabilities, allowing data scientists and ML engineers to build, train, and deploy ML models using geospatial data. SageMaker Processing provisions cluster resources for you to run city-, country-, or continent-scale geospatial ML workloads.
With that, the need for data scientists and machine learning (ML) engineers has grown significantly. Data scientists and ML engineers require capable tooling and sufficient compute for their work.
Machine learning (ML) engineers have traditionally focused on striking a balance between model training and deployment cost vs. performance. This is important because training ML models and then using the trained models to make predictions (inference) can be highly energy-intensive tasks.
Generative AI is powered by advanced machine learning techniques, particularly deep learning and neural networks, such as Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs). Programming languages: Python (most widely used in AI/ML); R, Java, or C++ (optional but useful).
This post presents a solution that uses a workflow and AWS AI and machine learning (ML) services to provide actionable insights based on those transcripts. We use multiple AWS AI/ML services, such as Contact Lens for Amazon Connect and Amazon SageMaker , and utilize a combined architecture.
Intuitivo, a pioneer in retail innovation, is revolutionizing shopping with its cloud-based AI and machine learning (AI/ML) transactional processing system. Unlocking high-performance and cost-effective inference using AWS Inferentia: as retailers look to scale operations, the cost of A-POPs becomes a consideration.
Challenges in deploying advanced ML models in healthcare: Rad AI, being an AI-first company, integrates machine learning (ML) models across various functions, from product development to customer success, from novel research to internal applications. Rad AI's ML organization tackles this challenge on two fronts.
Sharing in-house resources with other internal teams, the Ranking team's machine learning (ML) scientists often encountered long wait times to access resources for model training and experimentation, challenging their ability to rapidly experiment and innovate. If a model shows online improvement, it can be deployed to all users.
For AWS and Outerbounds customers, the goal is to build a differentiated machine learning and artificial intelligence (ML/AI) system and reliably improve it over time. First, the AWS Trainium accelerator provides a high-performance, cost-effective, and readily available solution for training and fine-tuning large models.
ONNX is an open source machine learning (ML) framework that provides interoperability across a wide range of frameworks, operating systems, and hardware platforms. AWS Graviton3 processors are optimized for ML workloads, including support for bfloat16, Scalable Vector Extension (SVE), and Matrix Multiplication (MMLA) instructions.
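A minimal ONNX Runtime inference sketch on a Graviton instance might look like this; the model file and input shape are assumptions.

```python
import numpy as np
import onnxruntime as ort

# Hypothetical model file; on Graviton the default CPU provider picks up
# the bfloat16/SVE/MMLA-optimized kernels when the build supports them.
session = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])

input_name = session.get_inputs()[0].name
x = np.random.rand(1, 3, 224, 224).astype(np.float32)

outputs = session.run(None, {input_name: x})
print(outputs[0].shape)
```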
For example, marketing and software as a service (SaaS) companies can personalize artificial intelligence and machine learning (AI/ML) applications using each of their customer’s images, art style, communication style, and documents to create campaigns and artifacts that represent them. For details, refer to Create an AWS account.
Amazon Rekognition people pathing is a machine learning (ML)-based capability of Amazon Rekognition Video that users can use to understand where, when, and how each person is moving in a video. Example code: the following code example is a Python script that can be used as an AWS Lambda function or as part of your processing pipeline.
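The post's script isn't reproduced here, but the people-pathing APIs are start_person_tracking and get_person_tracking; this sketch uses a hypothetical bucket and polls for completion instead of using an SNS notification as a production pipeline would.

```python
import time
import boto3

rekognition = boto3.client("rekognition")

# Bucket and key are hypothetical; the video must live in Amazon S3.
start = rekognition.start_person_tracking(
    Video={"S3Object": {"Bucket": "my-video-bucket", "Name": "lobby.mp4"}}
)

# Poll the asynchronous job until it finishes.
while True:
    result = rekognition.get_person_tracking(JobId=start["JobId"])
    if result["JobStatus"] != "IN_PROGRESS":
        break
    time.sleep(5)

for person in result.get("Persons", []):
    ts = person["Timestamp"]          # milliseconds into the video
    idx = person["Person"]["Index"]   # stable index for each tracked person
    print(f"person {idx} at {ts} ms")
```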
The JupyterLab application's flexible and extensible interface can be used to configure and arrange machine learning (ML) workflows. AWS Lambda is a compute service that runs code in response to triggers such as changes in data, changes in application state, or user actions.
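A minimal Lambda handler for one such trigger (an S3 object-created event) could look like this sketch; the event shape follows the standard S3 notification format.

```python
import json
import urllib.parse

def lambda_handler(event, context):
    """Triggered by an S3 ObjectCreated event; logs each new object's key."""
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        key = urllib.parse.unquote_plus(record["s3"]["object"]["key"])
        print(f"New object: s3://{bucket}/{key}")
    return {"statusCode": 200, "body": json.dumps("ok")}
```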
It’s one of the prerequisite tasks to prepare training data to train a deep learning model. Specifically, for deep learning-based autonomous vehicle (AV) and Advanced Driver Assistance Systems (ADAS), there is a need to label complex multi-modal data from scratch, including synchronized LiDAR, RADAR, and multi-camera streams.
PyTorch is a machine learning (ML) framework that is widely used by AWS customers for a variety of applications, such as computer vision, natural language processing, content creation, and more. With the PyTorch 2.0 release, AWS customers can now do the same things as they could with PyTorch 1.x, using the AWS PyTorch 2.0 DLAMI and DLC.
AWS (Amazon Web Services), the comprehensive and evolving cloud computing platform provided by Amazon, comprises infrastructure as a service (IaaS), platform as a service (PaaS), and packaged software as a service (SaaS). With its wide array of tools and convenience, AWS has already become a popular choice for many SaaS companies.
Mixed Precision Training with FP8: FP8 is a datatype supported by NVIDIA's H100 and H200 GPUs that enables efficient deep learning workloads. More details about FP8 can be found in FP8 Formats for Deep Learning. FP8 support covers Llama 3.1 (and prior Llama models), Mixtral, and Mistral.
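A rough sketch of FP8 mixed precision with NVIDIA's Transformer Engine (one common way to use FP8 in PyTorch, not necessarily the exact library the post uses) follows; it assumes the transformer-engine package and an FP8-capable GPU such as an H100.

```python
import torch
import transformer_engine.pytorch as te
from transformer_engine.common import recipe

# A single TE linear layer; on H100/H200 its matmuls can run in FP8.
layer = te.Linear(1024, 1024, bias=True).cuda()
x = torch.randn(16, 1024, device="cuda")

# DelayedScaling maintains per-tensor scaling factors from recent iterations.
fp8_recipe = recipe.DelayedScaling(margin=0, fp8_format=recipe.Format.HYBRID)

with te.fp8_autocast(enabled=True, fp8_recipe=fp8_recipe):
    y = layer(x)

y.sum().backward()  # gradients flow as usual; master weights stay in higher precision
```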
In this post, we showcase fine-tuning a Llama 2 model using a Parameter-Efficient Fine-Tuning (PEFT) method and deploy the fine-tuned model on AWS Inferentia2. We use the AWS Neuron software development kit (SDK) to access the AWS Inferentia2 device and benefit from its high performance.
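The post uses the Neuron SDK directly; as one convenient alternative, the optimum-neuron wrapper can compile and run a merged fine-tuned checkpoint on Inferentia2, roughly as in this sketch (checkpoint paths and shapes are hypothetical).

```python
from optimum.neuron import NeuronModelForCausalLM
from transformers import AutoTokenizer

# Hypothetical paths; export=True compiles the checkpoint for NeuronCores.
model = NeuronModelForCausalLM.from_pretrained(
    "my-merged-llama2-checkpoint",   # PEFT adapters merged into the base weights
    export=True,
    batch_size=1,
    sequence_length=2048,
    num_cores=2,                     # NeuronCores to use on the inf2 instance
)
tokenizer = AutoTokenizer.from_pretrained("my-merged-llama2-checkpoint")

inputs = tokenizer("Summarize: AWS Inferentia2 offers", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```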
Amazon Web Services is excited to announce the launch of the AWS Neuron Monitor container , an innovative tool designed to enhance the monitoring capabilities of AWS Inferentia and AWS Trainium chips on Amazon Elastic Kubernetes Service (Amazon EKS).
Llama 2 by Meta is an example of an LLM offered by AWS. To learn more about Llama 2 on AWS, refer to Llama 2 foundation models from Meta are now available in Amazon SageMaker JumpStart. It launched in the US East (N. Virginia) and US West (Oregon) AWS Regions, and most recently announced general availability in the US East (Ohio) Region.
Amazon SageMaker is a fully managed service that enables developers and data scientists to quickly and effortlessly build, train, and deploy machine learning (ML) models at any scale. Deploy traditional models to SageMaker endpoints In the following examples, we showcase how to use ModelBuilder to deploy traditional ML models.
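A minimal ModelBuilder sketch for a traditional model might look like the following; the XGBoost model, random data, and instance type are illustrative assumptions, not the post's exact example.

```python
import numpy as np
from xgboost import XGBClassifier
from sagemaker.serve import ModelBuilder, SchemaBuilder

# Train a small traditional model locally (hypothetical data).
X = np.random.rand(100, 4).astype(np.float32)
y = np.random.randint(0, 2, size=100)
clf = XGBClassifier(n_estimators=20).fit(X, y)

# SchemaBuilder infers serialization from sample input/output payloads.
schema = SchemaBuilder(sample_input=X[:1], sample_output=clf.predict(X[:1]))

# Assumes a SageMaker execution role is available (e.g., in SageMaker Studio).
model = ModelBuilder(model=clf, schema_builder=schema).build()
predictor = model.deploy(initial_instance_count=1, instance_type="ml.c5.xlarge")
print(predictor.predict(X[:1]))
```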
For example, you might have acquired a company that was already running on a different cloud provider, or you may have a workload that generates value from unique capabilities provided by AWS. We show how you can build and train an ML model in AWS and deploy the model in another platform.
Key Skills: Mastery in machine learning frameworks like PyTorch or TensorFlow is essential, along with a solid foundation in unsupervised learning methods. Stanford AI Lab recommends proficiency in deep learning, especially if working in experimental or cutting-edge areas.
In order to improve our equipment reliability, we partnered with the Amazon Machine Learning Solutions Lab to develop a custom machine learning (ML) model capable of predicting equipment issues prior to failure. We first highlight how we use AWS Glue for highly parallel data processing.
Machine learning (ML), especially deep learning, requires a large amount of data for improving model performance. Customers often need to train a model with data from different regions, organizations, or AWS accounts. Existing partner open-source FL solutions on AWS include FedML and NVIDIA FLARE.
Amazon SageMaker is a fully managed machine learning (ML) service. With SageMaker, data scientists and developers can quickly and easily build and train ML models, and then directly deploy them into a production-ready hosted environment. Store your Snowflake account credentials in AWS Secrets Manager.
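Retrieving those credentials at runtime is a one-call boto3 operation; the secret name and JSON fields below are assumptions about how you stored them.

```python
import json
import boto3

secrets = boto3.client("secretsmanager")

# Secret name and JSON layout are assumptions; store whatever fields
# your Snowflake connector needs (account, user, password, etc.).
value = secrets.get_secret_value(SecretId="snowflake/credentials")
creds = json.loads(value["SecretString"])

print(creds["account"], creds["user"])  # never log the password itself
```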
We’re thrilled to announce an expanded collaboration between AWS and Hugging Face to accelerate the training, fine-tuning, and deployment of large language and vision models used to create generative AI applications. AWS has a deep history of innovation in generative AI. Or they can self-manage on Amazon EC2.
Many practitioners are extending these Redshift datasets at scale for machine learning (ML) using Amazon SageMaker, a fully managed ML service, with requirements to develop features offline in a code-based or low-code/no-code way, store featured data from Amazon Redshift, and make this happen at scale in a production environment.
AWS optimized the PyTorch torch.compile feature for AWS Graviton3 processors. The optimizations are available in the torch Python wheels and the AWS Graviton PyTorch deep learning container (DLC). Eager mode is easier to use and more suitable for machine learning (ML) researchers, and hence is the default mode.
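Using the feature is a one-line change in user code, as in this sketch; the model and input are placeholders.

```python
import torch
import torchvision.models as models

model = models.resnet50(weights=None).eval()
x = torch.rand(1, 3, 224, 224)

# torch.compile JIT-compiles the model; on Graviton3 the AWS-optimized
# wheels can dispatch to bfloat16/SVE code paths under the hood.
compiled = torch.compile(model)

with torch.no_grad():
    compiled(x)        # first call triggers compilation (slow)
    out = compiled(x)  # subsequent calls run the optimized code
print(out.shape)
```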
In this post, we’ll summarize the training procedure of GPT NeoX on AWS Trainium, a purpose-built machine learning (ML) accelerator optimized for deep learning training. We’ll outline how we cost-effectively (3.2M tokens/$) trained such models with AWS Trainium without losing any model quality.
AWS Lambda functions for executing specific actions (such as submitting vacation requests or expense reports). With a strong background in AI/ML, Ishan specializes in building Generative AI solutions that drive business value. Maira Ladeira Tanke is a Senior Generative AI Data Scientist at AWS. Nitin Eusebius is a Sr.
This article describes a solution for a generative AI resume screener that got us 3rd place at the DataRobot & AWS Hackathon 2023. You can also set the environment variables on the notebook instance for things like your AWS access key. We used Anthropic Claude 2 in our solution.
Global Resiliency is a new Amazon Lex capability that enables near real-time replication of your Amazon Lex V2 bots in a second AWS Region. Additionally, we discuss how to handle integrations with AWS Lambda and Amazon CloudWatch after enabling Global Resiliency. We walk through the instructions to replicate the bot later in this post.
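Replication can be enabled programmatically through the lexv2-models API; the bot ID and Regions in this sketch are hypothetical placeholders.

```python
import boto3

# Global Resiliency calls go through the lexv2-models control plane.
lex = boto3.client("lexv2-models", region_name="us-east-1")

# Bot ID and replica Region are hypothetical placeholders.
resp = lex.create_bot_replica(
    botId="ABCDE12345",
    replicaRegion="us-west-2",
)
print(resp["botReplicaStatus"])  # e.g. Enabling, then Enabled

# Check replication status later.
status = lex.describe_bot_replica(botId="ABCDE12345", replicaRegion="us-west-2")
print(status["botReplicaStatus"])
```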