2018, AWS and Natural Language Processing

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

AWS Machine Learning Blog

OCTOBER 5, 2023

In this post, we walk through how to fine-tune Llama 2 on AWS Trainium, a purpose-built accelerator for LLM training, to reduce training times and costs. We review the fine-tuning scripts provided by the AWS Neuron SDK (using NeMo Megatron-LM), the various configurations we used, and the throughput results we saw.

Accelerating large-scale neural network training on CPUs with ThirdAI and AWS Graviton

AWS Machine Learning Blog

FEBRUARY 29, 2024

In this post, we investigate the potential of the AWS Graviton3 processor to accelerate neural network training for ThirdAI’s unique CPU-based deep learning engine. As shown in our results, we observed a significant training speedup with AWS Graviton3 over the comparable Intel and NVIDIA instances on several representative modeling workloads.

Generative AI and multi-modal agents in AWS: The key to unlocking new value in financial markets

AWS Machine Learning Blog

SEPTEMBER 19, 2023

Implementing a multi-modal agent with AWS consolidates key insights from diverse structured and unstructured data on a large scale. All this is achieved using AWS services, thereby increasing the financial analyst’s efficiency in analyzing multi-modal financial data (text, speech, and tabular data) holistically.

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

AWS Machine Learning Blog

JANUARY 17, 2024

Today, we’re excited to announce the availability of Llama 2 inference and fine-tuning support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. In this post, we demonstrate how to deploy and fine-tune Llama 2 on Trainium and AWS Inferentia instances in SageMaker JumpStart.

AWS performs fine-tuning on a Large Language Model (LLM) to classify toxic speech for a large gaming company

AWS Machine Learning Blog

AUGUST 7, 2023

In an effort to create and maintain a socially responsible gaming environment, AWS Professional Services was asked to build a mechanism that detects inappropriate language (toxic speech) within online gaming player interactions. The solution was to find and fine-tune an LLM to classify toxic language.

Accelerate your learning towards AWS Certification exams with automated quiz generation using Amazon SageMaker foundations models

AWS Machine Learning Blog

MAY 31, 2023

Getting AWS Certified can help you propel your career, whether you’re looking to find a new role, showcase your skills to take on a new project, or become your team’s go-to expert. Reading the FAQ page of the AWS services relevant for your certification exam is important in order to acquire a deeper understanding of the service.

Anthropic Claude 3.5 Sonnet ranks number 1 for business and finance in S&P AI Benchmarks by Kensho

AWS Machine Learning Blog

JULY 9, 2024

... of its consolidated revenues during the years ended December 31, 2019, 2018 and 2017, respectively. ... Sonnet made key improvements in visual processing and understanding, writing and content generation, natural language processing, coding, and generating insights. As pointed out in Anthropic’s Claude 3.5 ...

AI-powered assistants for investment research with multi-modal data: An application of Agents for Amazon Bedrock

AWS Machine Learning Blog

JUNE 26, 2024

This post is a follow-up to Generative AI and multi-modal agents in AWS: The key to unlocking new value in financial markets. Technical architecture and key steps: The multi-modal agent orchestrates various steps based on natural language prompts from business users to generate insights.

Deploy large language models for a healthtech use case on Amazon SageMaker

AWS Machine Learning Blog

FEBRUARY 6, 2024

We implemented the solution using the AWS Cloud Development Kit (AWS CDK). Transformers, BERT, and GPT: The transformer architecture is a neural network architecture that is used for natural language processing (NLP) tasks. The first GPT model was introduced in 2018 by OpenAI.

Customizing coding companions for organizations

AWS Machine Learning Blog

NOVEMBER 9, 2023

In these two studies, commissioned by AWS, developers were asked to create a medical software application in Java that required use of their internal libraries. About the authors: Qing Sun is a Senior Applied Scientist in AWS AI Labs and works on AWS CodeWhisperer, a generative AI-powered coding assistant.

The journey of PGA TOUR’s generative AI virtual assistant, from concept to development to prototype

AWS Machine Learning Blog

MARCH 14, 2024

Amazon Kendra uses natural language processing (NLP) to understand user queries and find the most relevant documents. The following figure shows the step-by-step procedure of how a query is processed for the text-to-SQL pipeline. This will require extensive testing, through collaboration between AWS and the PGA TOUR.

Beyond data: Cloud analytics mastery for business brilliance

Dataconomy

SEPTEMBER 4, 2023

It uses natural language processing (NLP) techniques to extract valuable insights from textual data. For instance, British Airways faced a fine of £183 million ($230 million) for a GDPR breach in 2018. Downtime, like the AWS outage in 2017 that affected several high-profile websites, can disrupt business operations.

A review of purpose-built accelerators for financial services

AWS Machine Learning Blog

SEPTEMBER 11, 2024

In 2018, other forms of PBAs became available, and by 2020, PBAs were being widely used for parallel problems, such as training neural networks (NNs). Examples of other PBAs now available include AWS Inferentia and AWS Trainium, Google TPU, and Graphcore IPU. In November 2023, AWS announced the next-generation Trainium2 chip.

Train self-supervised vision transformers on overhead imagery with Amazon SageMaker

AWS Machine Learning Blog

AUGUST 16, 2023

The images document the land cover, or physical surface features, of ten European countries between June 2017 and May 2018. Because we use true color images during DINO training, we only upload the red (B04), green (B03), and blue (B02) bands.

Best practices for building robust generative AI applications with Amazon Bedrock Agents – Part 1

AWS Machine Learning Blog

OCTOBER 2, 2024

Additionally, check out the service introduction video from AWS re:Invent 2023. About the Authors: Maira Ladeira Tanke is a Senior Generative AI Data Scientist at AWS. Mark Roy is a Principal Machine Learning Architect for AWS, helping customers design and build generative AI solutions.

How Sportradar used the Deep Java Library to build production-scale ML platforms for increased performance and efficiency

AWS Machine Learning Blog

APRIL 19, 2023

Since 2018, our team has been developing a variety of ML models to enable betting products for NFL and NCAA football. It has intuitive helpers and utilities for modalities like computer vision, natural language processing, audio, time series, and tabular data. We recently developed four more new models.

Train a Large Language Model on a single Amazon SageMaker GPU with Hugging Face and LoRA

AWS Machine Learning Blog

JUNE 5, 2023

In this post, we show you how to train the 7-billion-parameter BloomZ model using just a single graphics processing unit (GPU) on Amazon SageMaker, Amazon’s machine learning (ML) platform for preparing, building, training, and deploying high-quality ML models. BloomZ is a general-purpose natural language processing (NLP) model.

Top 5 Generative AI Integration Companies to drive Customer Support in 2023

Chatbots Life

MAY 16, 2023

Master of Code Global (MOCG) is a certified partner of Microsoft and AWS and has been recognized by LivePerson, Inc. Data Monsters can help companies deploy, train and test machine learning pipelines for natural language processing and computer vision. Elite Service Delivery partner of NVIDIA.

Identifying defense coverage schemes in NFL’s Next Gen Stats

AWS Machine Learning Blog

FEBRUARY 10, 2023

Quantitative evaluation: We utilize 2018–2020 season data for model training and validation, and 2021 season data for model evaluation. Prior to AWS, he obtained his MCS from West Virginia University and worked as a computer vision researcher at Midea. He is broadly interested in Deep Learning and Natural Language Processing.

Question answering using Retrieval Augmented Generation with foundation models in Amazon SageMaker JumpStart

AWS Machine Learning Blog

MAY 2, 2023

There are a few limitations of using off-the-shelf pre-trained LLMs: They’re usually trained offline, making the model agnostic to the latest information (for example, a chatbot trained from 2011–2018 has no information about COVID-19). Managed Spot Training is supported in all AWS Regions where Amazon SageMaker is currently available.

Instruction fine-tuning for FLAN T5 XL with Amazon SageMaker Jumpstart

AWS Machine Learning Blog

MAY 22, 2023

Prerequisites: To get started, all you need is an AWS account in which you can use Studio. It develops insights by recognizing the entities, key phrases, language, sentiments, and other common elements in a document. Baris Kurt is an Applied Scientist at AWS AI Labs. Jonas Kübler is an Applied Scientist at AWS AI Labs.

Optimize your machine learning deployments with auto scaling on Amazon SageMaker

AWS Machine Learning Blog

FEBRUARY 8, 2023

About the Authors: Mohan Gandhi is a Senior Software Engineer at AWS. He has been with AWS for the last 10 years and has worked on various AWS services like EMR, EFA and RDS. He is currently focused on natural language processing, responsible AI, inference optimization and scaling ML across the enterprise.

Deploying Conversational AI Products to Production With Jason Flaks

The MLOps Blog

JULY 18, 2023

First and foremost, let’s say that we have some parts of our stack, especially the audio componentry, that tend to require heavy GPU machines to operate. Some of the pure language side of the house, such as the natural language processing model, can be handled purely on CPU processing.

Large language models: their history, capabilities and limitations

Snorkel AI

MAY 25, 2023

A brief history of large language models: Large language models grew out of research and experiments with neural networks to allow computers to process natural language. From 2018 to the modern day, NLP researchers have engaged in a steady march toward ever-larger models. For example: I love this movie.

spaCy meets Transformers: Fine-tune BERT, XLNet and GPT-2

Explosion

AUGUST 1, 2019

Transformers and transfer-learning: Natural Language Processing (NLP) systems face a problem known as the “knowledge acquisition bottleneck”. Based on the (fairly vague) marketing copy, AWS might be doing something similar in SageMaker. We have updated our library and this blog post accordingly. ... and follows Devlin et al.

Unlocking complex problem-solving with multi-agent collaboration on Amazon Bedrock

Flipboard

JANUARY 14, 2025

The research team at AWS has worked extensively on building and evaluating the multi-agent collaboration (MAC) framework so customers can orchestrate multiple AI agents on Amazon Bedrock Agents. At AWS, he led the Dialog2API project, which enables large language models to interact with the external environment through dialogue.

Build a computer vision-based asset inventory application with low or no training

Flipboard

APRIL 16, 2025

Solution overview: The AI-powered asset inventory labeling solution aims to streamline the process of updating inventory databases by automatically extracting relevant information from asset labels through computer vision and generative AI capabilities. LLMs are large deep learning models that are pre-trained on vast amounts of data.

Best practices for building robust generative AI applications with Amazon Bedrock Agents – Part 2

AWS Machine Learning Blog

OCTOBER 21, 2024

Amazon Bedrock Agents allows you to write IaC code with AWS CloudFormation, the AWS Cloud Development Kit (AWS CDK), or Terraform. We provide blueprint templates of the most common capabilities of Amazon Bedrock Agents, which can be deployed and updated with a single AWS CDK command.

Fine-tune Meta Llama 3.2 text generation models for generative AI inference using Amazon SageMaker JumpStart

AWS Machine Learning Blog

NOVEMBER 11, 2024

Prerequisites: To try out this solution using SageMaker JumpStart, you’ll need the following prerequisites: An AWS account that will contain all of your AWS resources. An AWS Identity and Access Management (IAM) role to access SageMaker. He specializes in architecting AI/ML and generative AI services at AWS.
