AWS provides a powerful set of tools and services that simplify the process of building and deploying generative AI applications, even for those with limited experience in frontend and backend development. The Python application uses the Streamlit library to provide a user-friendly interface for interacting with a generative AI model.
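As a hedged illustration of that pattern, here is a minimal Streamlit sketch that sends a prompt to a generative model. It assumes Amazon Bedrock as the backend; the model ID is an illustrative placeholder, not necessarily the post's actual choice.

    import json
    import boto3
    import streamlit as st

    st.title("Generative AI demo")
    prompt = st.text_area("Enter a prompt")
    if st.button("Generate") and prompt:
        bedrock = boto3.client("bedrock-runtime")
        # The Converse API works across Bedrock text models
        response = bedrock.converse(
            modelId="anthropic.claude-3-haiku-20240307-v1:0",  # placeholder model ID
            messages=[{"role": "user", "content": [{"text": prompt}]}],
        )
        st.write(response["output"]["message"]["content"][0]["text"])

Saved as app.py, this runs locally with streamlit run app.py and renders a simple prompt-and-response page.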
To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine-tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod.
It simplifies the often complex and time-consuming tasks involved in setting up and managing an MLflow environment, allowing ML administrators to quickly establish secure and scalable MLflow environments on AWS. For example, you can grant users permission to download popular packages and customize the development environment.
Streamlit is an open source framework that lets data scientists efficiently create interactive web-based data applications in pure Python. Prerequisites: to implement this solution, create and activate an AWS account, make sure your AWS credentials are configured correctly, and install Python 3.7.
Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high-performance inference and scalability. In these sections, you will be guided through using vLLM on an AWS Inferentia EC2 instance to deploy Meta's newest Llama 3.2 model; you will use an inf2.xlarge instance.
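The following is a minimal sketch of that setup, assuming a Neuron-enabled vLLM build on the inf2 instance; the model name and parameters are illustrative, not the post's exact configuration.

    from vllm import LLM, SamplingParams

    llm = LLM(
        model="meta-llama/Llama-3.2-1B",  # illustrative model choice
        device="neuron",                  # target AWS Inferentia via the Neuron SDK
        tensor_parallel_size=2,           # an inf2.xlarge exposes 2 NeuronCores
        max_num_seqs=4,
    )
    outputs = llm.generate(["What is AWS Inferentia?"], SamplingParams(max_tokens=64))
    print(outputs[0].outputs[0].text)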
Hybrid architecture with AWS Local Zones: to minimize the impact of network latency on TTFT for users regardless of their locations, a hybrid architecture can be implemented by extending AWS services from commercial Regions to edge locations closer to end users. Next, create a subnet inside each Local Zone.
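As a hedged sketch of that subnet step, the Boto3 call below creates a subnet pinned to a Local Zone; the VPC ID, CIDR block, and zone name are placeholders.

    import boto3

    ec2 = boto3.client("ec2", region_name="us-east-1")
    subnet = ec2.create_subnet(
        VpcId="vpc-0123456789abcdef0",        # placeholder VPC
        CidrBlock="10.0.10.0/24",             # placeholder CIDR
        AvailabilityZone="us-east-1-bos-1a",  # a Boston Local Zone, opted in beforehand
    )
    print(subnet["Subnet"]["SubnetId"])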
Today, we’re excited to announce the availability of Meta Llama 3 inference on AWS Trainium and AWS Inferentia based instances in Amazon SageMaker JumpStart. In this post, we demonstrate how easy it is to deploy Llama 3 on AWS Trainium and AWS Inferentia based instances in SageMaker JumpStart.
Deploying a 70B-parameter model through SageMaker JumpStart offers two convenient approaches: using the intuitive SageMaker JumpStart UI or deploying programmatically through the SageMaker Python SDK.
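A minimal sketch of the programmatic path follows, using the SageMaker Python SDK's JumpStart interface; the model ID and prompt are illustrative placeholders.

    from sagemaker.jumpstart.model import JumpStartModel

    model = JumpStartModel(model_id="meta-textgeneration-llama-3-70b")  # placeholder model ID
    predictor = model.deploy(accept_eula=True)  # Llama models require accepting the EULA
    response = predictor.predict({"inputs": "Explain AWS Trainium in one sentence."})
    print(response)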
AWS Graviton3 processors are optimized for ML workloads, including support for bfloat16, Scalable Vector Extension (SVE), and Matrix Multiply-Accumulate (MMLA) instructions. In this post, we show how to run ONNX Runtime inference on AWS Graviton3-based EC2 instances and how to configure them to use optimized GEMM kernels.
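A hedged sketch of that configuration is below; the bfloat16 fastmath session option follows AWS's published guidance for Graviton3, but verify the key against your onnxruntime version, and the model path is a placeholder.

    import onnxruntime as ort

    opts = ort.SessionOptions()
    # Enable the bfloat16 fastmath GEMM kernels on Graviton3 (assumed config key)
    opts.add_session_config_entry("mlas.enable_gemm_fastmath_arm64_bfloat16", "1")
    session = ort.InferenceSession(
        "model.onnx", sess_options=opts, providers=["CPUExecutionProvider"]
    )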
For AWS and Outerbounds customers, the goal is to build a differentiated machine learning and artificial intelligence (ML/AI) system and reliably improve it over time. First, the AWS Trainium accelerator provides a high-performance, cost-effective, and readily available solution for training and fine-tuning large models.
This article shows how to monitor a model deployed on Amazon SageMaker for quality, bias, and explainability, using IBM Watson OpenScale on the IBM Cloud Pak for Data platform. It uses the endpoint generated from a previous tutorial to demonstrate how to monitor the AWS deployment with Watson OpenScale.
In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. You can monitor costs with AWS Cost Explorer.
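As a hedged sketch, once a distilled model has been imported, it can be invoked through the Bedrock runtime like any Bedrock model; the model ARN and request body below are placeholders, and the body schema depends on the imported model's architecture.

    import json
    import boto3

    bedrock = boto3.client("bedrock-runtime")
    response = bedrock.invoke_model(
        modelId="arn:aws:bedrock:us-east-1:111122223333:imported-model/abc1234567",  # placeholder ARN
        body=json.dumps({"prompt": "Summarize model distillation.", "max_tokens": 256}),
    )
    print(json.loads(response["body"].read()))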
You can then export the model and deploy it on Amazon SageMaker on Amazon Web Services (AWS). If you are set up with the required systems, you can download the sample project and complete the steps for hands-on learning. This sample uses the following platforms: Watson Studio and Watson Machine Learning with Python 3.9.
AWS optimized the PyTorch torch.compile feature for AWS Graviton3 processors. The optimizations are available in the torch Python wheels and in the AWS Graviton PyTorch deep learning container (DLC). The goal for the AWS Graviton team was to optimize the torch.compile backend for Graviton3 processors.
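Because the optimizations ship in the standard wheels, using them on a Graviton3 instance is just the regular torch.compile API; here is a minimal sketch with an illustrative torchvision model.

    import torch
    import torchvision.models as models

    model = models.resnet50(weights="IMAGENET1K_V1").eval()
    compiled = torch.compile(model)  # uses the default inductor backend
    with torch.no_grad():
        output = compiled(torch.randn(1, 3, 224, 224))  # first call triggers compilation
    print(output.shape)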
Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. Container Caching addresses this scaling challenge by pre-caching the container image, eliminating the need to download it when scaling up.
In this post, we look at how we can use AWS Glue and the AWS Lake Formation ML transform FindMatches to harmonize (deduplicate) customer data coming from different sources into a complete customer profile, so we can provide a better customer experience. One step is to run the AWS Glue ML transform job, as sketched below.
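A minimal Glue ETL sketch follows; it assumes a FindMatches transform that has already been created and trained, and the transform ID, catalog names, and S3 path are placeholders.

    from pyspark.context import SparkContext
    from awsglue.context import GlueContext
    from awsglueml.transforms import FindMatches

    glue_context = GlueContext(SparkContext.getOrCreate())
    customers = glue_context.create_dynamic_frame.from_catalog(
        database="crm", table_name="customers"  # placeholder catalog entries
    )
    # FindMatches adds a match_id column that groups records for the same customer
    matched = FindMatches.apply(frame=customers, transformId="tfm-0123456789abcdef")  # placeholder ID
    glue_context.write_dynamic_frame.from_options(
        frame=matched,
        connection_type="s3",
        connection_options={"path": "s3://my-bucket/harmonized/"},  # placeholder bucket
        format="parquet",
    )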
In this post, we walk through how to fine-tune Llama 2 on AWS Trainium , a purpose-built accelerator for LLM training, to reduce training times and costs. We review the fine-tuning scripts provided by the AWS Neuron SDK (using NeMo Megatron-LM), the various configurations we used, and the throughput results we saw.
Additionally, you can use AWS Lambda directly to expose your models and deploy your ML applications using your preferred open-source framework, which can prove to be more flexible and cost-effective. FastAPI is a modern, high-performance web framework for building APIs with Python, and it is what we use to deploy our solution.
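One common way to run a FastAPI app on Lambda is through an ASGI adapter such as Mangum; the sketch below assumes that approach (the post's exact wiring may differ), and the inference logic is a placeholder.

    from fastapi import FastAPI
    from mangum import Mangum

    app = FastAPI()

    @app.get("/predict")
    def predict(text: str):
        # Placeholder inference logic; load and call your real model here
        return {"label": "positive" if "good" in text else "negative"}

    # Mangum adapts the ASGI app to Lambda's event/response format
    handler = Mangum(app)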
Today, we’re excited to announce the availability of Llama 2 inference and fine-tuning support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. In this post, we demonstrate how to deploy and fine-tune Llama 2 on Trainium and AWS Inferentia instances in SageMaker JumpStart.
Customers often need to train a model with data from different regions, organizations, or AWS accounts. Existing partner open-source FL solutions on AWS include FedML and NVIDIA FLARE. These open-source packages are deployed in the cloud by running in virtual machines, without using the cloud-native services available on AWS.
AWS, Arm, Meta, and others helped optimize the performance of PyTorch 2.0. As a result, we are delighted to announce that AWS Graviton-based instance inference performance for PyTorch 2.0 is up to 3.5 times the speed for BERT, making Graviton-based instances the fastest compute-optimized instances on AWS for these models.
Llama 2 by Meta is an example of an LLM offered by AWS. To learn more about Llama 2 on AWS, refer to Llama 2 foundation models from Meta are now available in Amazon SageMaker JumpStart. It is available in the US East (N. Virginia) and US West (Oregon) AWS Regions, and general availability in the US East (Ohio) Region was announced most recently.
VIIRS' Day/Night Band (DNB) sensor captures nightlight imagery that is useful for mapping populations. ECOSTRESS has been used in the midwestern United States to detect droughts early enough to recover and protect crops.
This post shows a way to do this using Snowflake as the data source, downloading the data directly from Snowflake into a SageMaker training job instance. We create a custom training container that downloads data directly from the Snowflake table into the training instance rather than first downloading the data into an S3 bucket.
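Inside the training container, the data pull can be as simple as the following sketch using the Snowflake Python connector; all connection parameters and the table name are placeholders (in practice, fetch credentials from AWS Secrets Manager rather than hardcoding them).

    import snowflake.connector

    conn = snowflake.connector.connect(
        account="my_account",      # placeholder connection parameters
        user="my_user",
        password="***",
        warehouse="my_wh",
        database="my_db",
        schema="public",
    )
    cur = conn.cursor()
    cur.execute("SELECT * FROM training_table")  # placeholder table
    df = cur.fetch_pandas_all()  # land rows directly in a pandas DataFrame
    df.to_csv("train.csv", index=False)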
The models are available in the US East (N. Virginia) AWS Region. Prerequisites: to try the Llama 4 models in SageMaker JumpStart, you need an AWS account that will contain all your AWS resources, an AWS Identity and Access Management (IAM) role to access SageMaker AI, and access to accelerated instances (GPUs) for hosting the LLMs.
AWS has been innovating with purpose-built chips to address the growing need for powerful, efficient, and cost-effective compute hardware. You can use ml.trn1 and ml.inf2 compatible AWS Deep Learning Containers (DLCs) for PyTorch, TensorFlow, Hugging Face, and large model inference (LMI) to easily get started.
With this launch, you can now deploy NVIDIA's optimized reranking and embedding models to build, experiment, and responsibly scale your generative AI ideas on AWS. As part of NVIDIA AI Enterprise, available in AWS Marketplace, NIM is a set of user-friendly microservices designed to streamline and accelerate the deployment of generative AI.
Terraform is an IaC tool that allows you to manage AWS resources, software as a service (SaaS) resources, datasets, and more, using declarative configuration. Prerequisites include Python v3.8 and Terraform v1.0.0 or newer installed. Configure your local Python virtual environment, then download the DialogSum public dataset and convert it to JSONL, as sketched below.
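A minimal sketch of that conversion step follows; it assumes the DialogSum copy on the Hugging Face Hub and its dialogue/summary field names, which you should verify against your actual download.

    import json
    from datasets import load_dataset

    # DialogSum mirror on the Hugging Face Hub (assumed source)
    ds = load_dataset("knkarthick/dialogsum", split="train")
    with open("train.jsonl", "w") as f:
        for row in ds:
            record = {"dialogue": row["dialogue"], "summary": row["summary"]}
            f.write(json.dumps(record) + "\n")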
In addition to Amazon Bedrock, you can use other AWS services like Amazon SageMaker JumpStart and Amazon Lex to create fully automated and easily adaptable generative AI order processing agents. In this post, we show you how to build a speech-capable order processing agent using Amazon Lex, Amazon Bedrock, and AWS Lambda.
This post demonstrates a strategy for fine-tuning publicly available LLMs for the task of radiology report summarization using AWS services. In the following sections, we demonstrate fine-tuning an LLM available on SageMaker JumpStart for this domain-specific summarization task via the SageMaker Python SDK. We use an ml.t3.medium instance.
Introduction to GitHub Actions for Python Projects: what is CI/CD? For Python projects, CI/CD pipelines ensure that your code is consistently integrated and delivered with high quality and reliability. Git is the most commonly used VCS for Python projects, enabling collaboration and version tracking.
The number of companies launching generative AI applications on AWS is substantial and growing quickly, including adidas, Booking.com, Bridgewater Associates, Clariant, Cox Automotive, GoDaddy, and LexisNexis Legal & Professional, to name just a few. Innovative startups like Perplexity AI are going all in on AWS for generative AI.
This post presents and compares options and recommended practices for managing Python packages and virtual environments in Amazon SageMaker Studio notebooks. You can manage app images via the SageMaker console, the AWS SDK for Python (Boto3), and the AWS Command Line Interface (AWS CLI); the first step is to define a Dockerfile.
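As a hedged Boto3 sketch of the app image registration step, the calls below attach a custom image built from that Dockerfile; the image name, role ARN, and ECR URI are placeholders.

    import boto3

    sm = boto3.client("sagemaker")
    sm.create_image(
        ImageName="my-custom-image",                               # placeholder name
        RoleArn="arn:aws:iam::111122223333:role/StudioImageRole",  # placeholder role
    )
    sm.create_image_version(
        ImageName="my-custom-image",
        BaseImage="111122223333.dkr.ecr.us-east-1.amazonaws.com/my-repo:latest",  # placeholder ECR URI
    )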
These recipes include a training stack validated by Amazon Web Services (AWS) , which removes the tedious work of experimenting with different model configurations, minimizing the time it takes for iterative evaluation and testing. Alternatively, you can also use AWS Systems Manager and run a command like the following to start the session.
PyTorch is a machine learning (ML) framework that is widely used by AWS customers for a variety of applications, such as computer vision, natural language processing, content creation, and more. With the PyTorch 2.0 release, AWS customers can now do the same things as they could with PyTorch 1.x.
To reduce the barrier to entry of ML at the edge, we wanted to demonstrate an example of deploying a pre-trained model from Amazon SageMaker to AWS Wavelength , all in less than 100 lines of code. In this post, we demonstrate how to deploy a SageMaker model to AWS Wavelength to reduce model inference latency for 5G network-based applications.
Discover your data and put it to work using familiar AWS tools to complete end-to-end development workflows, including data analysis, data processing, model training, generative AI app building, and more, in a single governed environment. You're redirected to the AWS CloudFormation console to deploy a stack to configure VPC resources.
AWS makes it possible for organizations of all sizes and developers of all skill levels to build and scale generative AI applications with security, privacy, and responsible AI. In this post, we dive into the architecture and implementation details of GenASL, which uses AWS generative AI capabilities to create human-like ASL avatar videos.
For example, you might have acquired a company that was already running on a different cloud provider, or you may have a workload that generates value from unique capabilities provided by AWS. We show how you can build and train an ML model in AWS and deploy the model in another platform.
Prerequisites: the proposed solution can be implemented in a personal AWS environment using the code that we provide. Before running the labs in this post, ensure you have an AWS account and the appropriate AWS Identity and Access Management (IAM) permissions to access the services used in the lab.
In this post, we show how you can run Stable Diffusion models and achieve high performance at the lowest cost in Amazon Elastic Compute Cloud (Amazon EC2) using Amazon EC2 Inf2 instances powered by AWS Inferentia2. You can run Stable Diffusion 2.1 and other versions on AWS Inferentia2 cost-effectively.
We use two AWS Media & Entertainment Blog posts as the sample external data, which we convert into embeddings with the BAAI/bge-small-en-v1.5 model. Prerequisites: to follow the steps in this post, you need to have an AWS account and an AWS Identity and Access Management (IAM) role with permissions to create and access the solution resources.
In this post, we help you understand the Python backend that is supported by Triton on SageMaker so that you can make an informed decision for your workloads and achieve great results. Triton supports instance types that support GPUs, CPUs, and AWS Inferentia chips, which allow you to maximize the performance for your workloads.
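For orientation, a Triton Python backend model is a model.py file exposing a TritonPythonModel class; the sketch below uses placeholder tensor names (INPUT0/OUTPUT0) and placeholder compute, and the names must match your config.pbtxt.

    import numpy as np
    import triton_python_backend_utils as pb_utils

    class TritonPythonModel:
        def initialize(self, args):
            # Load model artifacts here; args carries the model config and paths
            pass

        def execute(self, requests):
            responses = []
            for request in requests:
                data = pb_utils.get_input_tensor_by_name(request, "INPUT0").as_numpy()
                result = pb_utils.Tensor("OUTPUT0", (data * 2).astype(np.float32))  # placeholder compute
                responses.append(pb_utils.InferenceResponse(output_tensors=[result]))
            return responses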
The following example illustrates Studio Lab running a Jupyter notebook that downloads TCIA prostate MRI data, segments it using MONAI, and displays the results using itkWidgets. Make sure to choose the medical-image-ai Python kernel when running the TCIA notebooks in Studio Lab.