AWS, Computer Science and Python - Data Science Current

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning Blog

DECEMBER 24, 2024

To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod. architectures/5.sagemaker-hyperpod/LifecycleScripts/base-config/

AWS

AWS Clustering Deep Learning Deep Learning

Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models (LLMs) – Part 2

Flipboard

DECEMBER 2, 2024

We explore two approaches: using the SageMaker Python SDK for programmatic implementation, and using the Amazon SageMaker Studio UI for a more visual, interactive experience. In this post, we walked through the step-by-step process of implementing this feature through both the SageMaker Python SDK and SageMaker Studio UI.

AWS

AWS ML ML Machine Learning

The Weather Company enhances MLOps with Amazon SageMaker, AWS CloudFormation, and Amazon CloudWatch

AWS Machine Learning Blog

JULY 8, 2024

MLOps practitioners have many options to establish an MLOps platform; one among them is cloud-based integrated platforms that scale with data science teams. AWS provides a full stack of services to establish an MLOps platform in the cloud that is customizable to your needs while reaping all the benefits of doing ML in the cloud.

AWS

AWS ML ML Data Scientist

DeepSeek-R1 model now available in Amazon Bedrock Marketplace and Amazon SageMaker JumpStart

AWS Machine Learning Blog

JANUARY 30, 2025

You can now use DeepSeek-R1 to build, experiment, and responsibly scale your generative AI ideas on AWS. To check if you have quotas for P5e, open the Service Quotas console and, under AWS Services, choose Amazon SageMaker, and confirm you’re using an ml.p5e.48xlarge instance in the AWS Region you are deploying in.

AWS

AWS Python AI AI

Enterprise-grade natural language to SQL generation using LLMs: Balancing accuracy, latency, and scale

Flipboard

APRIL 24, 2025

This post describes a pattern that AWS and Cisco teams have developed and deployed that is viable at scale and addresses a broad set of challenging enterprise use cases. AWS solution architecture: In this section, we illustrate how you might implement the architecture on AWS.

SQL

SQL Database AWS ML

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning Blog

OCTOBER 29, 2024

This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. Organizations typically can’t predict their call patterns, so the solution relies on AWS serverless services to scale during busy times.

AWS

AWS AI AI ML

TigerEye (YC S22) Is Hiring a Full Stack Engineer

Hacker News

NOVEMBER 19, 2024

Here are a few of the things that you might do as an AI Engineer at TigerEye: - Design, develop, and validate statistical models to explain past behavior and to predict future behavior of our customers’ sales teams - Own training, integration, deployment, versioning, and monitoring of ML components - Improve TigerEye’s existing metrics collection and (..)

Computer Science

Computer Science Computer Science ML ML

Connect to Amazon services using AWS PrivateLink in Amazon SageMaker

AWS Machine Learning Blog

JUNE 20, 2024

AWS customers that implement secure development environments often have to restrict outbound and inbound internet traffic. Therefore, accessing AWS services without leaving the AWS network can be a secure workflow.

AWS

AWS Machine Learning Machine Learning AI

Mistral-NeMo-Instruct-2407 and Mistral-NeMo-Base-2407 are now available on SageMaker JumpStart

AWS Machine Learning Blog

DECEMBER 6, 2024

The model is deployed in an AWS secure environment and under your virtual private cloud (VPC) controls, helping to support data security. His area of focus is generative AI and AWS AI Accelerators.

AWS

AWS ML ML AI

Llama 4 family of models from Meta are now available in SageMaker JumpStart

AWS Machine Learning Blog

APRIL 7, 2025

Virginia) AWS Region. Prerequisites: To try the Llama 4 models in SageMaker JumpStart, you need the following prerequisites: An AWS account that will contain all your AWS resources. An AWS Identity and Access Management (IAM) role to access SageMaker AI. Access to accelerated instances (GPUs) for hosting the LLMs.

AWS

AWS Machine Learning Machine Learning ML

NeMo Retriever Llama 3.2 text embedding and reranking NVIDIA NIM microservices now available in Amazon SageMaker JumpStart

AWS Machine Learning Blog

MARCH 18, 2025

With this launch, you can now deploy NVIDIA’s optimized reranking and embedding models to build, experiment, and responsibly scale your generative AI ideas on AWS. As part of NVIDIA AI Enterprise available in AWS Marketplace, NIM is a set of user-friendly microservices designed to streamline and accelerate the deployment of generative AI.

AWS

AWS AI AI Computer Science

Architecture to AWS CloudFormation code using Anthropic’s Claude 3 on Amazon Bedrock

AWS Machine Learning Blog

SEPTEMBER 27, 2024

Architecting specific AWS Cloud solutions involves creating diagrams that show relationships and interactions between different services. Instead of building the code manually, you can use Anthropic’s Claude 3’s image analysis capabilities to generate AWS CloudFormation templates by passing an architecture diagram as input.

AWS

AWS AI AI Computer Science

Generative AI and multi-modal agents in AWS: The key to unlocking new value in financial markets

AWS Machine Learning Blog

SEPTEMBER 19, 2023

Technical challenges with multi-modal data further include the complexity of integrating and modeling different data types, the difficulty of combining data from multiple modalities (text, images, audio, video), and the need for advanced computer science skills and sophisticated analysis tools.

AWS

AWS AI AI ML

Build a Search Engine: Semantic Search System Using OpenSearch

PyImageSearch

MAY 19, 2025

run_opensearch.sh (Running OpenSearch Locally): A script to start OpenSearch using Docker for local testing before deploying to AWS. Implement and analyze search results using Python scripts. Now, let’s implement a Python script to execute the neural search query in OpenSearch. These can be used for evaluation and comparison.

K-nearest Neighbors

K-nearest Neighbors AWS Deep Learning Deep Learning

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

AWS Machine Learning Blog

MAY 1, 2024

Llama2 by Meta is an example of an LLM offered by AWS. To learn more about Llama 2 on AWS, refer to Llama 2 foundation models from Meta are now available in Amazon SageMaker JumpStart. Virginia) and US West (Oregon) AWS Regions, and most recently announced general availability in the US East (Ohio) Region.

AWS

AWS ML ML Clustering

Pixtral 12B is now available on Amazon SageMaker JumpStart

AWS Machine Learning Blog

DECEMBER 10, 2024

You can now use state-of-the-art model architectures, such as language models, computer vision models, and more, without having to build them from scratch. Prerequisites: To try out Pixtral 12B in SageMaker JumpStart, you need the following prerequisites: An AWS account that will contain all your AWS resources.

AWS

AWS ML ML AI

Multi-tenancy in RAG applications in a single Amazon Bedrock knowledge base with metadata filtering

AWS Machine Learning Blog

APRIL 7, 2025

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies and AWS. Solution overview: The following diagram provides a high-level overview of AWS services and features through a sample use case. The response only cites sources that are relevant to the query.

Database

Database AWS Natural Language Processing AI

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

AWS Machine Learning Blog

MARCH 3, 2025

These recipes include a training stack validated by Amazon Web Services (AWS), which removes the tedious work of experimenting with different model configurations, minimizing the time it takes for iterative evaluation and testing. Alternatively, you can also use AWS Systems Manager and run a command like the following to start the session.

Clustering

Clustering AWS ML ML

Learn how to build and deploy tool-using LLM agents using AWS SageMaker JumpStart Foundation Models

AWS Machine Learning Blog

SEPTEMBER 15, 2023

In this post, we introduce LLM agents and demonstrate how to build and deploy an e-commerce LLM agent using Amazon SageMaker JumpStart and AWS Lambda. To power the LLM agent, we use a Flan-UL2 model deployed as a SageMaker endpoint and use data retrieval tools built with AWS Lambda.

AWS

AWS Database Python Computer Science

How VirtuSwap accelerates their pandas-based trading simulations with an Amazon SageMaker Studio custom container and AWS GPU instances

AWS Machine Learning Blog

SEPTEMBER 19, 2023

Prerequisites: To run this step-by-step guide, you need an AWS account with permissions to SageMaker, Amazon Elastic Container Registry (Amazon ECR), AWS Identity and Access Management (IAM), and AWS CodeBuild. Complete the following steps: Sign in to the AWS Management Console and open the IAM console. base-ubuntu18.04

AWS

AWS Data Science Data Mining Data Mining

Introduction to GitHub Actions for Python Projects

PyImageSearch

SEPTEMBER 30, 2024

For Python projects, CI/CD pipelines ensure that your code is consistently integrated and delivered with high quality and reliability. Git is the most commonly used VCS for Python projects, enabling collaboration and version tracking.

Python

Python Deep Learning Deep Learning AWS

Announcing new Jupyter contributions by AWS to democratize generative AI and scale ML workloads

AWS Machine Learning Blog

MAY 10, 2023

Project Jupyter is a multi-stakeholder, open-source project that builds applications, open standards, and tools for data science, machine learning (ML), and computational science. Given the importance of Jupyter to data scientists and ML developers, AWS is an active sponsor and contributor to Project Jupyter.

ML

ML ML AWS AI

Harness large language models in fake news detection

AWS Machine Learning Blog

NOVEMBER 14, 2023

The solution also uses Amazon Bedrock, a fully managed service that makes foundation models (FMs) from Amazon and third-party model providers accessible through the AWS Management Console and APIs. Prerequisites: For this tutorial, you need a bash terminal with Python 3.9 in computer science. - Dr. Liskov earned her Ph.D.

Computer Science

Computer Science Computer Science AWS Python

Customize Amazon Nova models to improve tool usage

AWS Machine Learning Blog

APRIL 28, 2025

Amazon Nova models and Amazon Bedrock: Amazon Nova models, unveiled at AWS re:Invent in December 2024, are optimized to deliver exceptional price-performance value, offering state-of-the-art performance on key text-understanding benchmarks at low cost. Choose us-east-1 as the AWS Region. gpus 2'] Ground truth pattern: python(3?)

AWS

AWS AI AI Computer Science

Manage Amazon SageMaker JumpStart foundation model access with private hubs

AWS Machine Learning Blog

JUNE 21, 2024

Solution overview: Starting today, with SageMaker JumpStart and its private hub feature, administrators can create repositories for a subset of models tailored to different teams, use cases, or license requirements using the Amazon SageMaker Python SDK. For a list of filters you can apply, refer to SageMaker Python SDK.

AWS

AWS Python Machine Learning Machine Learning

AWS performs fine-tuning on a Large Language Model (LLM) to classify toxic speech for a large gaming company

AWS Machine Learning Blog

AUGUST 7, 2023

In an effort to create and maintain a socially responsible gaming environment, AWS Professional Services was asked to build a mechanism that detects inappropriate language (toxic speech) within online gaming player interactions. Unfortunately, as in the real world, not all players communicate appropriately and respectfully.

AWS

AWS ML ML Data Science

Use Snowflake as a data source to train ML models with Amazon SageMaker

AWS Machine Learning Blog

MARCH 8, 2023

The workflow steps are as follows: Set up a SageMaker notebook and an AWS Identity and Access Management (IAM) role with appropriate permissions to allow SageMaker to access Amazon Elastic Container Registry (Amazon ECR), Secrets Manager, and other services within your AWS account. AWS Region Link us-east-1 (N.

ML

ML ML AWS Python

Customize DeepSeek-R1 671b model using Amazon SageMaker HyperPod recipes – Part 2

AWS Machine Learning Blog

MAY 14, 2025

You can execute each step in the training pipeline by initiating the process through the SageMaker control plane using APIs, AWS Command Line Interface (AWS CLI), or the SageMaker ModelTrainer SDK. In response, SageMaker launches training jobs with the requested number and type of compute instances to run specific tasks.

Clustering

Clustering AWS ML ML

Use zero-shot large language models on Amazon Bedrock for custom named entity recognition

AWS Machine Learning Blog

JUNE 18, 2024

Some examples include extracting players and positions in an NFL game summary, products mentioned in an AWS keynote transcript, or key names from an article on a favorite tech company. We extract the default generic entities through the AWS SDK for Python (Boto3) as follows: import pandas as pd comprehend_client = boto3.client("comprehend")

AWS

AWS Natural Language Processing Machine Learning Machine Learning

Falcon 2 11B is now available on Amazon SageMaker JumpStart

AWS Machine Learning Blog

MAY 31, 2024

You can discover and deploy the Falcon 2 11B model with a few clicks in Amazon SageMaker Studio or programmatically through the SageMaker Python SDK, enabling you to derive model performance and MLOps controls with SageMaker features such as Amazon SageMaker Pipelines, Amazon SageMaker Debugger, or container logs.

AWS

AWS Python ML ML

Building AI chatbots using Amazon Lex and Amazon Kendra for filtering query results based on user context

AWS Machine Learning Blog

FEBRUARY 14, 2023

For instance, faculty in an educational institution belong to different departments, and if a professor belonging to the computer science department signs in to the application and searches with the keywords “faculty courses,” then documents relevant to the same department come up as the top results, based on data source availability.

AWS

AWS Computer Science Computer Science Python

Build an automated insight extraction framework for customer feedback analysis with Amazon Bedrock and Amazon QuickSight

AWS Machine Learning Blog

JUNE 25, 2024

The customer review analysis workflow consists of the following steps: A user uploads a file to a dedicated data repository within your Amazon Simple Storage Service (Amazon S3) data lake, invoking the processing using AWS Step Functions. In the first step, an AWS Lambda function reads and validates the file, and extracts the raw data.

AWS

AWS Natural Language Processing Machine Learning Machine Learning

Build well-architected IDP solutions with a custom lens – Part 3: Reliability

AWS Machine Learning Blog

NOVEMBER 22, 2023

The IDP Well-Architected Custom Lens is intended for all AWS customers who use AWS to run intelligent document processing (IDP) solutions and are searching for guidance on how to build a secure, efficient, and reliable IDP solution on AWS.

AWS

AWS ML ML Machine Learning

40 Must-Know Data Science Skills and Frameworks for 2023

ODSC - Open Data Science

FEBRUARY 2, 2023

Just as a writer needs to know core skills like sentence structure, grammar, and so on, data scientists at all levels should know core data science skills like programming, computer science, algorithms, and so on. While knowing Python, R, and SQL are expected, you’ll need to go beyond that.

Data Science

Data Science Data Scientist Computer Science Computer Science

Advanced RAG patterns on Amazon SageMaker

AWS Machine Learning Blog

MARCH 28, 2024

For more information on Mixtral-8x7B Instruct on AWS, refer to Mixtral-8x7B is now available in Amazon SageMaker JumpStart. LangChain is an open source Python library designed to build applications with LLMs. Before you get started with the solution, create an AWS account. This identity is called the AWS account root user.

AWS

AWS Machine Learning Machine Learning AI

How SnapLogic built a text-to-pipeline application with Amazon Bedrock to translate business intent into action

Flipboard

NOVEMBER 24, 2023

This use case highlights how large language models (LLMs) are able to become a translator between human languages (English, Spanish, Arabic, and more) and machine interpretable languages (Python, Java, Scala, SQL, and so on) along with sophisticated internal reasoning.

Database

Database AWS ETL SQL

Build an end-to-end MLOps pipeline using Amazon SageMaker Pipelines, GitHub, and GitHub Actions

AWS Machine Learning Blog

DECEMBER 13, 2023

The built-in project templates provided by Amazon SageMaker include integration with some third-party tools, such as Jenkins for orchestration and GitHub for source control, and several utilize AWS native CI/CD tools such as AWS CodeCommit, AWS CodePipeline, and AWS CodeBuild. An AWS account.

AWS

AWS ML ML Data Preparation

Build a RAG-based QnA application using Llama3 models from SageMaker JumpStart

AWS Machine Learning Blog

SEPTEMBER 12, 2024

LangChain is a Python library designed to build applications with LLMs. Prerequisites: To implement this solution, you need the following: An AWS account with privileges to create AWS Identity and Access Management (IAM) roles and policies. Basic familiarity with SageMaker and AWS services that support LLMs. Python 3.10

AWS

AWS ML ML AI

Code Llama 70B is now available in Amazon SageMaker JumpStart

AWS Machine Learning Blog

FEBRUARY 16, 2024

The models excel in Python, C++, Java, PHP, C#, TypeScript, and Bash, and have the potential to save developers’ time and make software workflows more efficient. Because the models are hosted and deployed on AWS, you can rest assured that your data, whether used for evaluating or using the model at scale, is never shared with third parties.

Python

Python Machine Learning Machine Learning ML

Run your local machine learning code as Amazon SageMaker Training jobs with minimal code changes

AWS Machine Learning Blog

APRIL 25, 2023

With this new capability of the SageMaker Python SDK, data scientists can onboard their ML code to the SageMaker Training platform in a few minutes. In this release, you can run your local machine learning (ML) Python code as a single-node Amazon SageMaker training job or multiple parallel jobs.

Machine Learning

Machine Learning Machine Learning ML ML

Elevating the generative AI experience: Introducing streaming support in Amazon SageMaker hosting

AWS Machine Learning Blog

SEPTEMBER 1, 2023

In terms of security, both the input and output are secured with TLS using AWS SigV4 authentication. In this post, we showcase two container options to create a SageMaker endpoint with response streaming: using an AWS Large Model Inference (LMI) and Hugging Face Text Generation Inference (TGI) container.

AWS

AWS AI AI Machine Learning

Package and deploy classical ML and LLMs easily with Amazon SageMaker, part 1: PySDK Improvements

Flipboard

NOVEMBER 30, 2023

Although it provides various entry points like the SageMaker Python SDK, AWS SDKs, the SageMaker console, and Amazon SageMaker Studio notebooks to simplify the process of training and deploying ML models at scale, customers are still looking for better ways to deploy their models for playground testing and to optimize production deployments.

ML

ML ML AWS Python

Package and deploy classical ML and LLMs easily with Amazon SageMaker, part 2: Interactive User Experiences in SageMaker Studio

AWS Machine Learning Blog

NOVEMBER 30, 2023

The AWS SDK gives you most control and flexibility. It’s a low-level API available for Java, C++, Go, JavaScript, Node.js, PHP, Ruby, and Python. The SageMaker Python SDK is a high-level Python API that abstracts some of the steps and configuration, and makes it easier to deploy models.

ML

ML ML AWS Python

Harnessing the power of enterprise data with generative AI: Insights from Amazon Kendra, LangChain, and large language models

AWS Machine Learning Blog

NOVEMBER 7, 2023

These demos can be seamlessly deployed in your AWS account, offering foundational insights and guidance on utilizing AWS services to create a state-of-the-art LLM generative AI question and answer bot and content generation. Prerequisites: You must have the following prerequisites: An AWS account. Python 3.6.x or later.

AWS

AWS AI AI Database
