AWS and Download - Data Science Current

Build and deploy a UI for your generative AI applications with AWS and Python

AWS Machine Learning Blog

NOVEMBER 6, 2024

AWS provides a powerful set of tools and services that simplify the process of building and deploying generative AI applications, even for those with limited experience in frontend and backend development. The AWS deployment architecture makes sure the Python application is hosted and accessible from the internet to authenticated users.

AWS

AWS Python AI AI

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning Blog

DECEMBER 24, 2024

To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod. architectures/5.sagemaker-hyperpod/LifecycleScripts/base-config/

AWS

AWS Clustering Deep Learning Deep Learning

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

AWS Machine Learning Blog

NOVEMBER 13, 2024

Prerequisites: To implement the proposed solution, make sure that you have the following: an AWS account and a working knowledge of FMs, Amazon Bedrock, Amazon SageMaker, Amazon OpenSearch Service, Amazon S3, and AWS Identity and Access Management (IAM); and Amazon Titan Multimodal Embeddings model access in Amazon Bedrock.

AWS

AWS Database K-nearest Neighbors AI

Accelerating ML experimentation with enhanced security: AWS PrivateLink support for Amazon SageMaker with MLflow

AWS Machine Learning Blog

DECEMBER 9, 2024

It simplifies the often complex and time-consuming tasks involved in setting up and managing an MLflow environment, allowing ML administrators to quickly establish secure and scalable MLflow environments on AWS. For example, you can give users access permission to download popular packages and customize the development environment.

AWS

AWS ML ML Data Scientist

Accelerate digital pathology slide annotation workflows on AWS using H-optimus-0

AWS Machine Learning Blog

JANUARY 31, 2025

Solution overview: Our solution uses the AWS integrated ecosystem to create an efficient, scalable pipeline for digital pathology AI workflows. Prerequisites: We assume you have access to and are authenticated in an AWS account. The AWS CloudFormation template for this solution uses t3.medium

AWS

AWS Supervised Learning ML ML

How AWS sales uses Amazon Q Business for customer engagement

AWS Machine Learning Blog

DECEMBER 11, 2024

Earlier this year, we published the first in a series of posts about how AWS is transforming our seller and customer journeys using generative AI. Field Advisor serves four primary use cases: AWS-specific knowledge search. With Amazon Q Business, we've made internal data sources as well as public AWS content available in Field Advisor's index.

AWS

AWS Database AI AI

WordFinder app: Harnessing generative AI on AWS for aphasia communication

AWS Machine Learning Blog

MAY 2, 2025

David Copland, from QARC, and Scott Harding, a person living with aphasia, used AWS services to develop WordFinder, a mobile, cloud-based solution that helps individuals with aphasia increase their independence through the use of AWS generative AI technology. The following diagram illustrates the solution architecture on AWS.

AWS

AWS AI AI Machine Learning

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips

AWS Machine Learning Blog

NOVEMBER 26, 2024

Using vLLM on AWS Trainium and Inferentia makes it possible to host LLMs for high performance inference and scalability. Deploy vLLM on AWS Trainium and Inferentia EC2 instances: In these sections, you will be guided through using vLLM on an AWS Inferentia EC2 instance to deploy Meta’s newest Llama 3.2 You will use inf2.xlarge

AWS

AWS AI AI Artificial Intelligence

Amazon Q Business simplifies integration of enterprise knowledge bases at scale

Flipboard

FEBRUARY 11, 2025

Enhancing AWS Support Engineering efficiency: The AWS Support Engineering team faced the daunting task of manually sifting through numerous tools, internal sources, and AWS public documentation to find solutions for customer inquiries. Then we introduce the solution deployment using three AWS CloudFormation templates.

AWS

AWS ML ML Machine Learning

Automate invoice processing with Streamlit and Amazon Bedrock

AWS Machine Learning Blog

NOVEMBER 14, 2024

Prerequisites: To perform this solution, complete the following: Create and activate an AWS account. Make sure your AWS credentials are configured correctly. This tutorial assumes you have the necessary AWS Identity and Access Management (IAM) permissions. For this walkthrough, we will use the AWS CLI to trigger the processing.

AWS

AWS Python AI AI

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning Blog

MARCH 3, 2025

Hybrid architecture with AWS Local Zones: To minimize the impact of network latency on TTFT for users regardless of their locations, a hybrid architecture can be implemented by extending AWS services from commercial Regions to edge locations closer to end users. Next, create a subnet inside each Local Zone. Amazon Linux 2).

AWS

AWS AI AI Deep Learning

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

AWS Machine Learning Blog

DECEMBER 4, 2024

SageMaker Unified Studio combines various AWS services, including Amazon Bedrock, Amazon SageMaker, Amazon Redshift, AWS Glue, Amazon Athena, and Amazon Managed Workflows for Apache Airflow (MWAA), into a comprehensive data and AI development platform. Navigate to the AWS Secrets Manager console and find the secret -api-keys.

AWS

AWS AI AI SQL

Achieve multi-Region resiliency for your conversational AI chatbots with Amazon Lex

AWS Machine Learning Blog

OCTOBER 30, 2024

Global Resiliency is a new Amazon Lex capability that enables near real-time replication of your Amazon Lex V2 bots in a second AWS Region. Additionally, we discuss how to handle integrations with AWS Lambda and Amazon CloudWatch after enabling Global Resiliency. We walk through the instructions to replicate the bot later in this post.

AWS

AWS AI AI Natural Language Processing

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock

AWS Machine Learning Blog

JANUARY 29, 2025

In this post, we explore how to deploy distilled versions of DeepSeek-R1 with Amazon Bedrock Custom Model Import, making them accessible to organizations looking to use state-of-the-art AI capabilities within the secure and scalable AWS infrastructure at an effective cost. You can monitor costs with AWS Cost Explorer.

AWS

AWS ML ML AI

Monitor AWS Sagemaker model using IBM Watson OpenScale

IBM Data Science in Practice

APRIL 4, 2023

Introduction: This article shows how to monitor a model deployed on AWS Sagemaker for quality, bias and explainability, using IBM Watson OpenScale on the IBM Cloud Pak for Data platform. This article shows how to use the endpoint generated from that tutorial to demonstrate how to monitor the AWS deployment with Watson OpenScale.

AWS

AWS Machine Learning Machine Learning Data Science

Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker Inference

AWS Machine Learning Blog

DECEMBER 2, 2024

Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required to scale generative AI models for inference. Container Caching addresses this scaling challenge by pre-caching the container image, eliminating the need to download it when scaling up.

AI

AI AI AWS Machine Learning

Accelerate NLP inference with ONNX Runtime on AWS Graviton processors

AWS Machine Learning Blog

MAY 15, 2024

AWS Graviton3 processors are optimized for ML workloads, including support for bfloat16, Scalable Vector Extension (SVE), and Matrix Multiplication (MMLA) instructions. In this post, we show how to run ONNX Runtime inference on AWS Graviton3-based EC2 instances and how to configure them to use optimized GEMM kernels.

AWS

AWS Natural Language Processing Python Deep Learning

AWS Inferentia and AWS Trainium deliver lowest cost to deploy Llama 3 models in Amazon SageMaker JumpStart

AWS Machine Learning Blog

MAY 2, 2024

Today, we’re excited to announce the availability of Meta Llama 3 inference on AWS Trainium and AWS Inferentia based instances in Amazon SageMaker JumpStart. In this post, we demonstrate how easy it is to deploy Llama 3 on AWS Trainium and AWS Inferentia based instances in SageMaker JumpStart.

AWS

AWS ML ML Python

Configure Amazon Q Business with AWS IAM Identity Center trusted identity propagation

AWS Machine Learning Blog

JULY 30, 2024

Amazon Q Business uses AWS IAM Identity Center to record the workforce users you assign access to and their attributes, such as group associations. IAM Identity Center is used by many AWS managed applications such as Amazon Q. Why use trusted identity propagation? Promotes software design principles rooted in user privacy.

AWS

AWS AI AI Artificial Intelligence

Deploy an IBM Watson Studio Model on AWS Sagemaker

IBM Data Science in Practice

MARCH 8, 2023

You can then export the model and deploy it on Amazon SageMaker on Amazon Web Services (AWS). If you are set up with the required systems, you can download the sample project and complete the steps for hands-on learning. The example model predicts how likely a customer is to enroll in a Demand Response Program of a Utilities Company.

AWS

AWS Machine Learning Machine Learning Algorithm

Cohere Embed multimodal embeddings model is now available on Amazon SageMaker JumpStart

AWS Machine Learning Blog

NOVEMBER 15, 2024

Prerequisites: Make sure your SageMaker AWS Identity and Access Management (IAM) role has the AmazonSageMakerFullAccess permission policy attached. You may be prompted to subscribe to this model through AWS Marketplace. On the AWS Marketplace listing, choose Continue to subscribe.

AWS

AWS Computer Science Computer Science Database

Build an AI-powered document processing platform with open source NER model and LLM on Amazon SageMaker

Flipboard

APRIL 23, 2025

Solution overview: The NER & LLM Gen AI Application is a document processing solution built on AWS that combines NER and LLMs to automate document analysis at scale. Click here to open the AWS console and follow along. The endpoint lifecycle is orchestrated through dedicated AWS Lambda functions that handle creation and deletion.

AWS

AWS ML ML AI

Evaluate healthcare generative AI applications using LLM-as-a-judge on AWS

AWS Machine Learning Blog

FEBRUARY 27, 2025

Because we used only the radiology report text data, we downloaded just one compressed report file (mimic-cxr-reports.zip) from the MIMIC-CXR website. Data Scientist, Generative AI, Amazon Bedrock, where he contributes to cutting edge innovations in foundational models and generative AI applications at AWS.

AWS

AWS AI AI ML

Develop and train large models cost-efficiently with Metaflow and AWS Trainium

AWS Machine Learning Blog

APRIL 29, 2024

For AWS and Outerbounds customers, the goal is to build a differentiated machine learning and artificial intelligence (ML/AI) system and reliably improve it over time. First, the AWS Trainium accelerator provides a high-performance, cost-effective, and readily available solution for training and fine-tuning large models.

AWS

AWS ML ML Python

Reinventing a cloud-native federated learning architecture on AWS

AWS Machine Learning Blog

OCTOBER 10, 2023

Customers often need to train a model with data from different regions, organizations, or AWS accounts. Existing partner open-source FL solutions on AWS include FedML and NVIDIA FLARE. These open-source packages are deployed in the cloud by running in virtual machines, without using the cloud-native services available on AWS.

AWS

AWS ML ML Algorithm

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

AWS Machine Learning Blog

OCTOBER 5, 2023

In this post, we walk through how to fine-tune Llama 2 on AWS Trainium, a purpose-built accelerator for LLM training, to reduce training times and costs. We review the fine-tuning scripts provided by the AWS Neuron SDK (using NeMo Megatron-LM), the various configurations we used, and the throughput results we saw.

AWS

AWS Machine Learning Machine Learning Deep Learning

Enhance your Amazon Redshift cloud data warehouse with easier, simpler, and faster machine learning using Amazon SageMaker Canvas

AWS Machine Learning Blog

OCTOBER 24, 2024

Prerequisites: Before you begin, make sure you have the following prerequisites in place: An AWS account and role with the AWS Identity and Access Management (IAM) privileges to deploy the following resources: IAM roles. Open the AWS Management Console, go to Amazon Bedrock, and choose Model access in the navigation pane.

Data Warehouse

Data Warehouse Machine Learning Machine Learning Cloud Data

Manage your Amazon Lex bot via AWS CloudFormation templates

AWS Machine Learning Blog

APRIL 16, 2024

Managing your Amazon Lex bots using AWS CloudFormation allows you to create templates defining the bot and all the AWS resources it depends on. AWS CloudFormation provides and configures those resources on your behalf, removing the risk of human error when deploying bots to new environments. Resources: # 1.

AWS

AWS Deep Learning Deep Learning Artificial Intelligence

Accelerated PyTorch inference with torch.compile on AWS Graviton processors

AWS Machine Learning Blog

JULY 2, 2024

AWS optimized the PyTorch torch.compile feature for AWS Graviton3 processors. Starting with PyTorch 2.3.1, the optimizations are available in torch Python wheels and AWS Graviton PyTorch deep learning container (DLC). The goal for the AWS Graviton team was to optimize torch.compile backend for Graviton3 processors.

AWS

AWS Natural Language Processing Python ML

Harmonize data using AWS Glue and AWS Lake Formation FindMatches ML to build a customer 360 view

Flipboard

JUNE 26, 2023

In this post, we look at how we can use AWS Glue and the AWS Lake Formation ML transform FindMatches to harmonize (deduplicate) customer data coming from different sources to get a complete customer profile to be able to provide better customer experience. Run the AWS Glue ML transform job.

AWS

AWS ML ML ETL

Unify structured data in Amazon Aurora and unstructured data in Amazon S3 for insights using Amazon Q

AWS Machine Learning Blog

NOVEMBER 20, 2024

In this post, we explore how you can use Amazon Q Business, the AWS generative AI-powered assistant, to build a centralized knowledge base for your organization, unifying structured and unstructured datasets from different sources to accelerate decision-making and drive productivity. In this post, we use IAM Identity Center as the SAML 2.0-aligned

Database

Database AWS SQL ETL

Frugality meets Accuracy: Cost-efficient training of GPT NeoX and Pythia models with AWS Trainium

AWS Machine Learning Blog

DECEMBER 12, 2023

In this post, we’ll summarize the training procedure of GPT NeoX on AWS Trainium, a purpose-built machine learning (ML) accelerator optimized for deep learning training. We’ll outline how we cost-effectively (3.2 M tokens/$) trained such models with AWS Trainium without losing any model quality. [...] billion in Pythia.

AWS

AWS Machine Learning Machine Learning Deep Learning

Automate prior authorization using CRD with CDS Hooks and AWS HealthLake

AWS Machine Learning Blog

OCTOBER 4, 2023

Documentation Templates and Rules (DTR) – This allows providers to download smart questionnaires and rules, such as Clinical Quality Language (CQL), and provides a SMART on FHIR app or EHR app that runs the questionnaires and rules to gather information relevant to a performed or planned service.

AWS

AWS AI AI ML

Llama 3.3 70B now available in Amazon SageMaker JumpStart

AWS Machine Learning Blog

DECEMBER 16, 2024

This feature eliminates one of the major bottlenecks in deployment scaling by pre-caching container images, removing the need for time-consuming downloads when adding new instances. Prior to joining AWS, Dr. Li held data science roles in the financial and retail industries. Adriana Simmons is a Senior Product Marketing Manager at AWS.

AWS

AWS ML ML Python

Build enterprise-ready generative AI solutions with Cohere foundation models in Amazon Bedrock and Weaviate vector database on AWS Marketplace

AWS Machine Learning Blog

JANUARY 24, 2024

We demonstrate how to build an end-to-end RAG application using Cohere’s language models through Amazon Bedrock and a Weaviate vector database on AWS Marketplace. Additionally, you can securely integrate and easily deploy your generative AI applications using the AWS tools you are already familiar with.

AWS

AWS Database AI AI

Optimized PyTorch 2.0 inference with AWS Graviton processors

AWS Machine Learning Blog

MAY 3, 2023

AWS, Arm, Meta and others helped optimize the performance of PyTorch 2.0 As a result, we are delighted to announce that AWS Graviton-based instance inference performance for PyTorch 2.0 times the speed for BERT, making Graviton-based instances the fastest compute optimized instances on AWS for these models. is up to 3.5

AWS

AWS Cloud Computing Python Machine Learning

Discover insights from Gmail using the Gmail connector for Amazon Q Business

AWS Machine Learning Blog

OCTOBER 31, 2024

The web application that the user uses to retrieve answers is connected to an identity provider (IdP) or AWS IAM Identity Center. If you haven’t created one yet, refer to Build private and secure enterprise generative AI apps with Amazon Q Business and AWS IAM Identity Center for instructions. Access to AWS Secrets Manager.

AWS

AWS AI AI ML

Large language model inference over confidential data using AWS Nitro Enclaves

AWS Machine Learning Blog

MARCH 12, 2024

In this post, we discuss how Leidos worked with AWS to develop an approach to privacy-preserving large language model (LLM) inference using AWS Nitro Enclaves. The steps carried out during the inference are as follows: The chatbot app generates temporary AWS credentials and asks the user to input a question. hvm-2.0.20230628.0-x86_64-gp2

AWS

AWS Natural Language Processing Machine Learning Machine Learning

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

AWS Machine Learning Blog

MAY 1, 2024

Llama2 by Meta is an example of an LLM offered by AWS. To learn more about Llama 2 on AWS, refer to Llama 2 foundation models from Meta are now available in Amazon SageMaker JumpStart. Virginia) and US West (Oregon) AWS Regions, and most recently announced general availability in the US East (Ohio) Region.

AWS

AWS ML ML Clustering

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

AWS Machine Learning Blog

JANUARY 17, 2024

Today, we’re excited to announce the availability of Llama 2 inference and fine-tuning support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. In this post, we demonstrate how to deploy and fine-tune Llama 2 on Trainium and AWS Inferentia instances in SageMaker JumpStart.

AWS

AWS Python Machine Learning Machine Learning

Deploy a serverless ML inference endpoint of large language models using FastAPI, AWS Lambda, and AWS CDK

AWS Machine Learning Blog

JUNE 23, 2023

Additionally, you can use AWS Lambda directly to expose your models and deploy your ML applications using your preferred open-source framework, which can prove to be more flexible and cost-effective. We also show you how to automate the deployment using the AWS Cloud Development Kit (AWS CDK). Now, let’s set up the environment.

AWS

AWS ML ML Python

Process formulas and charts with Anthropic’s Claude on Amazon Bedrock

AWS Machine Learning Blog

MARCH 21, 2025

AWS Lambda: AWS Lambda is a compute service that runs code in response to triggers such as changes in data, changes in application state, or user actions. Prerequisites: If you're new to AWS, you first need to create and set up an AWS account. We download the documents and store them under a samples folder locally.

AWS

AWS AI AI Data Scientist

Unlock cost savings with the new scale down to zero feature in SageMaker Inference

Flipboard

DECEMBER 2, 2024

Today at AWS re:Invent 2024, we are excited to announce a new feature for Amazon SageMaker inference endpoints: the ability to scale SageMaker inference endpoints to zero instances. This long-awaited capability is a game changer for our customers using the power of AI and machine learning (ML) inference in the cloud.

ML

ML ML AWS Machine Learning
