Remove 2018 Remove AWS Remove Natural Language Processing
article thumbnail

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

AWS Machine Learning Blog

In this post, we walk through how to fine-tune Llama 2 on AWS Trainium , a purpose-built accelerator for LLM training, to reduce training times and costs. We review the fine-tuning scripts provided by the AWS Neuron SDK (using NeMo Megatron-LM), the various configurations we used, and the throughput results we saw.

AWS 126
article thumbnail

Accelerating large-scale neural network training on CPUs with ThirdAI and AWS Graviton

AWS Machine Learning Blog

In this post, we investigate of potential for the AWS Graviton3 processor to accelerate neural network training for ThirdAI’s unique CPU-based deep learning engine. As shown in our results, we observed a significant training speedup with AWS Graviton3 over the comparable Intel and NVIDIA instances on several representative modeling workloads.

AWS 133
professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Generative AI and multi-modal agents in AWS: The key to unlocking new value in financial markets

AWS Machine Learning Blog

Implementing a multi-modal agent with AWS consolidates key insights from diverse structured and unstructured data on a large scale. All this is achieved using AWS services, thereby increasing the financial analyst’s efficiency to analyze multi-modal financial data (text, speech, and tabular data) holistically.

AWS 133
article thumbnail

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

AWS Machine Learning Blog

Today, we’re excited to announce the availability of Llama 2 inference and fine-tuning support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. In this post, we demonstrate how to deploy and fine-tune Llama 2 on Trainium and AWS Inferentia instances in SageMaker JumpStart.

AWS 129
article thumbnail

AWS performs fine-tuning on a Large Language Model (LLM) to classify toxic speech for a large gaming company

AWS Machine Learning Blog

In an effort to create and maintain a socially responsible gaming environment, AWS Professional Services was asked to build a mechanism that detects inappropriate language (toxic speech) within online gaming player interactions. The solution was to find and fine-tune an LLM to classify toxic language.

AWS 98
article thumbnail

Accelerate your learning towards AWS Certification exams with automated quiz generation using Amazon SageMaker foundations models

AWS Machine Learning Blog

Getting AWS Certified can help you propel your career, whether you’re looking to find a new role, showcase your skills to take on a new project, or become your team’s go-to expert. Reading the FAQ page of the AWS services relevant for your certification exam is important in order to acquire a deeper understanding of the service.

AWS 97
article thumbnail

Anthropic Claude 3.5 Sonnet ranks number 1 for business and finance in S&P AI Benchmarks by Kensho

AWS Machine Learning Blog

of its consolidated revenues during the years ended December 31, 2019, 2018 and 2017, respectively. Sonnet made key improvements in visual processing and understanding, writing and content generation, natural language processing, coding, and generating insights. As pointed out in Anthropic’s Claude 3.5

AWS 123