Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

AWS Machine Learning Blog

Recognizing this need, we have developed a Chrome extension that harnesses the power of AWS AI and generative AI services, including Amazon Bedrock, an AWS managed service to build and scale generative AI applications with foundation models (FMs). The user signs in by entering a user name and a password.

AWS 132

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

AWS Machine Learning Blog

To simplify infrastructure setup and accelerate distributed training, AWS introduced Amazon SageMaker HyperPod in late 2023. In this blog post, we showcase how you can perform efficient supervised fine tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod. architectures/5.sagemaker-hyperpod/LifecycleScripts/base-config/

AWS 106

Accelerate custom labeling workflows in Amazon SageMaker Ground Truth without using AWS Lambda

AWS Machine Learning Blog

Previously, setting up a custom labeling job required specifying two AWS Lambda functions: a pre-annotation function, which is run on each dataset object before it’s sent to workers, and a post-annotation function, which is run on the annotations of each dataset object and consolidates multiple worker annotations if needed.

AWS 117

WordFinder app: Harnessing generative AI on AWS for aphasia communication

AWS Machine Learning Blog

David Copland, from QARC, and Scott Harding, a person living with aphasia, used AWS services to develop WordFinder, a mobile, cloud-based solution that helps individuals with aphasia increase their independence through the use of AWS generative AI technology. The following diagram illustrates the solution architecture on AWS.

AWS 88

From innovation to impact: How AWS and NVIDIA enable real-world generative AI success

AWS Machine Learning Blog

When the stakes are high, success requires not just cutting-edge technology, but the ability to operationalize it at scale, a challenge that AWS has consistently solved for customers. To train generative AI models at enterprise scale, ServiceNow uses NVIDIA DGX Cloud on AWS. The team achieved 97.1%

AWS 144

Build a Search Engine: Setting Up AWS OpenSearch

Flipboard

Table of Contents: Introduction; What Is AWS OpenSearch?; What AWS OpenSearch Is Commonly Used For; Key Features of AWS OpenSearch; How Does AWS OpenSearch Work?; Why Use AWS OpenSearch for Semantic Search?

AWS 119

Why Fugaku, Japan’s fastest supercomputer, went virtual on AWS

Hacker News

Now, the researchers behind one of the world’s fastest supercomputers, Fugaku, are trying to make the supercomputer just as accessible on the Amazon Web Services (AWS) Cloud. Dr. Matsuoka at AWS re:Invent 2023, where he held a session on Virtual Fugaku, a replication of the original environment on AWS.

AWS 181