We walk through the journey Octus took from managing multiple cloud providers and costly GPU instances to implementing a streamlined, cost-effective solution using AWS services including Amazon Bedrock, AWS Fargate, and Amazon OpenSearch Service. Along the way, Octus also simplified operations, since the company already runs largely on AWS.
For a user-specified patient population, the Smart Subgroups feature identifies clusters of patients with similar characteristics (for example, similar prevalence profiles of diagnoses, procedures, and therapies). The features are stored in Amazon S3 and encrypted with AWS Key Management Service (AWS KMS) for downstream use.
For reference, GPT-3, an earlier-generation LLM, has 175 billion parameters and requires months of non-stop training on a cluster of thousands of accelerated processors. The Carbontracker study estimates that training GPT-3 from scratch may emit up to 85 metric tons of CO2 equivalent, using clusters of specialized hardware accelerators.
Historically, natural language processing (NLP) was a primary research and development expense. In 2024, however, organizations are using large language models (LLMs), which require relatively little focus on NLP, shifting research and development from modeling to the infrastructure needed to support LLM workflows.
In this post, we walk through how to fine-tune Llama 2 on AWS Trainium, a purpose-built accelerator for LLM training, to reduce training times and costs. We review the fine-tuning scripts provided by the AWS Neuron SDK (using NeMo Megatron-LM), the various configurations we used, and the throughput results we saw.
Sprinklr’s specialized AI models streamline data processing, gather valuable insights, and enable workflows and analytics at scale to drive better decision-making and productivity. During this journey, we collaborated with our AWS technical account manager and the Graviton software engineering teams.
We demonstrate how to build an end-to-end RAG application using Cohere’s language models through Amazon Bedrock and a Weaviate vector database on AWS Marketplace. Cohere Embed comes in two forms, an English language model and a multilingual model, both of which are now available on Amazon Bedrock.
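As a companion to that workflow, here is a minimal sketch of generating document embeddings with Cohere Embed on Amazon Bedrock using boto3. The model ID, Region, and sample documents are assumptions for illustration; check the Bedrock console for the identifiers enabled in your account.

```python
import json
import boto3

# Sketch: embed documents with Cohere Embed on Amazon Bedrock (assumed model ID).
bedrock_runtime = boto3.client("bedrock-runtime", region_name="us-east-1")

docs = [
    "Amazon Bedrock is a fully managed service for foundation models.",
    "Weaviate is a vector database that stores embeddings for retrieval.",
]

response = bedrock_runtime.invoke_model(
    modelId="cohere.embed-english-v3",  # assumed identifier; verify in your account
    body=json.dumps({"texts": docs, "input_type": "search_document"}),
)

embeddings = json.loads(response["body"].read())["embeddings"]
print(len(embeddings), len(embeddings[0]))  # number of vectors, vector dimension
```

The resulting vectors would then be written to the Weaviate collection that backs the RAG retrieval step.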
Llama 2 by Meta is an example of an LLM offered by AWS. Llama 2 is an auto-regressive language model that uses an optimized transformer architecture and is intended for commercial and research use in English. It is available in the US East (N. Virginia) and US West (Oregon) AWS Regions, with general availability most recently announced in the US East (Ohio) Region.
To set up a MongoDB cluster, create a free tier MongoDB Atlas cluster by following the instructions in Create a Cluster, then specify the AWS Lambda function that will interact with MongoDB Atlas and the LLM to provide responses. When you're finished experimenting, delete the MongoDB Atlas cluster to avoid ongoing charges.
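A hypothetical sketch of that Lambda function is shown below: it pulls a few documents from the Atlas collection and passes them as context to an LLM. The environment variable, database, collection, and model ID names are all assumptions, and a production setup would use Atlas Vector Search rather than a plain find().

```python
import json
import os

import boto3
from pymongo import MongoClient

# Hypothetical handler: retrieve context from MongoDB Atlas, then ask an LLM on Bedrock.
mongo = MongoClient(os.environ["ATLAS_URI"])          # assumed env var with the Atlas URI
collection = mongo["docs_db"]["docs"]                  # assumed database/collection names
bedrock = boto3.client("bedrock-runtime")


def handler(event, context):
    question = event["question"]
    # Simplified retrieval; a real setup would run an Atlas Vector Search query here.
    context_docs = [d["text"] for d in collection.find().limit(3)]
    prompt = f"Context:\n{chr(10).join(context_docs)}\n\nQuestion: {question}"

    response = bedrock.invoke_model(
        modelId="anthropic.claude-3-haiku-20240307-v1:0",  # assumed model ID
        body=json.dumps({
            "anthropic_version": "bedrock-2023-05-31",
            "max_tokens": 256,
            "messages": [{"role": "user", "content": prompt}],
        }),
    )
    answer = json.loads(response["body"].read())["content"][0]["text"]
    return {"statusCode": 200, "body": json.dumps({"answer": answer})}
```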
Retailers can deliver more frictionless experiences on the go with natural language processing (NLP), real-time recommendation systems, and fraud detection. In this post, we demonstrate how to deploy a SageMaker model to AWS Wavelength to reduce model inference latency for 5G network-based applications.
Machine learning (ML) research has shown that large language models (LLMs) trained on significantly larger datasets yield better model quality. Distributed model training requires a cluster of worker nodes that can scale. The following figure shows how FSDP works for two data parallel processes.
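For intuition, here is a minimal PyTorch sketch of Fully Sharded Data Parallel (FSDP): each data parallel rank holds only a shard of the parameters and gathers full parameters just in time for each layer's forward and backward pass. The toy model and hyperparameters are placeholders, not the configuration from the post.

```python
import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

# Launch with: torchrun --nproc_per_node=2 fsdp_sketch.py
dist.init_process_group("nccl")
torch.cuda.set_device(dist.get_rank() % torch.cuda.device_count())

model = torch.nn.Sequential(
    torch.nn.Linear(1024, 4096),
    torch.nn.ReLU(),
    torch.nn.Linear(4096, 1024),
).cuda()

sharded_model = FSDP(model)  # parameters are now sharded across ranks
optimizer = torch.optim.AdamW(sharded_model.parameters(), lr=1e-4)

inputs = torch.randn(8, 1024, device="cuda")
loss = sharded_model(inputs).sum()  # placeholder loss for the sketch
loss.backward()                     # gradients are reduced and re-sharded
optimizer.step()
```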
The MoE architecture allows activation of 37 billion parameters, enabling efficient inference by routing queries to the most relevant expert clusters. By integrating this model with Amazon SageMaker AI, you can benefit from the scalable AWS infrastructure while maintaining high-quality language model capabilities.
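One hedged way to try such a model is deploying it through SageMaker JumpStart with the SageMaker Python SDK, as sketched below. The model_id and instance type are illustrative assumptions; look up the exact identifier for the model you want in the JumpStart model catalog.

```python
from sagemaker.jumpstart.model import JumpStartModel

# Sketch only: deploy a large MoE model from the JumpStart catalog.
model = JumpStartModel(model_id="deepseek-llm-r1")  # hypothetical identifier

predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.p5e.48xlarge",  # assumed; large models need accelerated instances
)

response = predictor.predict({
    "inputs": "Explain mixture-of-experts routing in two sentences.",
    "parameters": {"max_new_tokens": 128},
})
print(response)
```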
Genomic language models are a new and exciting field in the application of large language models to challenges in genomics. In this blog post and open source project, we show you how you can pre-train a genomics language model, HyenaDNA, using your genomic data in the AWS Cloud.
Cost optimization – The serverless nature of the integration means you only pay for the compute resources you use, rather than having to provision and maintain a persistent cluster. This same interface is also used for provisioning EMR clusters.
Embeddings capture the information content in bodies of text, allowing natural language processing (NLP) models to work with language in a numeric form. Then we use K-Means to identify a set of cluster centers. A visual representation of the silhouette score can be seen in the following figure.
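Below is a minimal scikit-learn sketch of that clustering step: embeddings (one vector per document) are grouped with K-Means, and the silhouette score indicates how well separated the resulting clusters are. The random vectors and cluster count are placeholders standing in for real text embeddings.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score

# Placeholder embeddings: 500 documents, 384-dimensional vectors.
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(500, 384))

# Fit K-Means to find cluster centers, then score cluster separation.
kmeans = KMeans(n_clusters=8, n_init=10, random_state=0).fit(embeddings)
score = silhouette_score(embeddings, kmeans.labels_)

print(f"cluster centers: {kmeans.cluster_centers_.shape}, silhouette: {score:.3f}")
```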
Key AWS services used include Amazon Bedrock, with Anthropic's Claude 3.5 Sonnet model for natural language processing. This means artists can focus on their creative process rather than worrying about precise phrasing or navigating complex menu structures.
Embeddings play a key role in natural language processing (NLP) and machine learning (ML). Text embedding refers to the process of transforming text into numerical representations that reside in a high-dimensional vector space. You can use it via either the Amazon Bedrock REST API or the AWS SDK.
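The following is a minimal sketch of generating a text embedding with the AWS SDK (boto3). The model ID shown (Amazon Titan Text Embeddings) and the Region are assumptions; any embedding model you have enabled in Amazon Bedrock can be substituted.

```python
import json
import boto3

bedrock_runtime = boto3.client("bedrock-runtime", region_name="us-east-1")

# Request an embedding for a single piece of text (assumed Titan model ID).
response = bedrock_runtime.invoke_model(
    modelId="amazon.titan-embed-text-v1",
    body=json.dumps({"inputText": "Embeddings map text into a vector space."}),
)

embedding = json.loads(response["body"].read())["embedding"]
print(f"dimension: {len(embedding)}")  # Titan v1 returns 1536-dimensional vectors
```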
AWS offers tools such as RStudio on SageMaker and Amazon Redshift to help tackle these challenges. Note: if you already have an RStudio domain and an Amazon Redshift cluster, you can skip this step.
To accomplish this, eSentire built AI Investigator, a natural language query tool for their customers to access security platform data by using AWS generative artificial intelligence (AI) capabilities. An additional benefit of SageMaker notebook instances is their streamlined integration with eSentire’s AWS environment.
Using RStudio on SageMaker and Amazon EMR together, you can continue to use the RStudio IDE for analysis and development, while using Amazon EMR managed clusters for larger data processing. In this post, we demonstrate how you can connect your RStudio on SageMaker domain with an EMR cluster.
We provide a comprehensive guide on how to deploy speaker segmentation and clustering solutions using SageMaker on the AWS Cloud. Amazon Transcribe is the go-to service for speaker diarization in AWS. Make sure the AWS account has a service quota for hosting a SageMaker endpoint on an ml.g5.2xlarge instance.
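For comparison with the SageMaker-hosted approach, here is a short sketch of speaker diarization with the managed Amazon Transcribe service via boto3. The bucket, object key, job name, and speaker count are placeholders you would replace with your own values.

```python
import boto3

transcribe = boto3.client("transcribe")

# Start a transcription job with speaker diarization enabled.
transcribe.start_transcription_job(
    TranscriptionJobName="diarization-demo",                     # placeholder name
    Media={"MediaFileUri": "s3://my-bucket/meetings/call.wav"},  # placeholder S3 URI
    MediaFormat="wav",
    LanguageCode="en-US",
    Settings={
        "ShowSpeakerLabels": True,  # enable diarization
        "MaxSpeakerLabels": 2,      # expected number of speakers
    },
)

status = transcribe.get_transcription_job(TranscriptionJobName="diarization-demo")
print(status["TranscriptionJob"]["TranscriptionJobStatus"])
```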
These factors require training an LLM over large clusters of accelerated machine learning (ML) instances. In the past few years, numerous customers have been using the AWS Cloud for LLM training. We recommend working with your AWS account team or contacting AWS Sales to determine the appropriate Region for your LLM workload.
However, building large distributed training clusters is a complex and time-intensive process that requires in-depth expertise. Clusters are provisioned with the instance type and count of your choice and can be retained across workloads. As a result of this flexibility, you can adapt to various scenarios.
The clustered regularly interspaced short palindromic repeat (CRISPR) technology holds the promise to revolutionize gene editing, transforming the way we understand and treat diseases. We also provided code that can help you jumpstart your biology applications on AWS.
With AWS generative AI services like Amazon Bedrock, developers can create systems that expertly manage and respond to user requests. It is hosted on Amazon Elastic Container Service (Amazon ECS) with AWS Fargate, and it is accessed using an Application Load Balancer. It serves as the data source to the knowledge base.
In our solution, we implement a hyperparameter grid search on an EKS cluster for tuning a bert-base-cased model to classify positive or negative sentiment in stock market data headlines. The desired cluster can simply be configured in the eks.conf file and launched by running the eks-create.sh script.
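As an illustrative sketch of how such a grid expands into individual training jobs (one per pod on the EKS cluster), consider the snippet below. The parameter names and values are assumptions, not the grid used in the post.

```python
from itertools import product

# Assumed hyperparameter grid for the sentiment fine-tuning jobs.
grid = {
    "learning_rate": [2e-5, 3e-5, 5e-5],
    "batch_size": [16, 32],
    "epochs": [3, 4],
}

# Expand the grid into one configuration per job.
jobs = [dict(zip(grid, values)) for values in product(*grid.values())]

for i, params in enumerate(jobs):
    # In the real solution, each combination would become a Kubernetes job that
    # fine-tunes bert-base-cased on the stock headline dataset.
    print(f"job-{i:02d}: {params}")
```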
However, when using traditional natural language processing (NLP) models, they found that these solutions struggled to fully understand the nuanced feedback found in open-ended survey responses.
We stored the embeddings in a vector database and then used the Large Language-and-Vision Assistant (LLaVA 1.5-7b) model. We used AWS services including Amazon Bedrock, Amazon SageMaker, and Amazon OpenSearch Serverless in this solution. In this post, we demonstrate a different approach. The models are enabled for use immediately.
IAM role – SageMaker requires an AWS Identity and Access Management (IAM) role to be assigned to a SageMaker Studio domain or user profile to manage permissions effectively. The built-in SQL browsing and execution capabilities of SageMaker Studio are enhanced by AWS Glue connections.
One of the several challenges faced was adapting the existing on-premises pipeline solution for use on AWS. The solution involved two key components: Modifying and extending existing code – The first part of our solution involved the modification and extension of our existing code to make it compatible with AWS infrastructure.
To try out this solution using SageMaker JumpStart, you need the following prerequisites: an AWS account that will contain all of your AWS resources, and an AWS Identity and Access Management (IAM) role to access SageMaker. Language models such as Meta Llama can be more than 10 GB or even 100 GB in size.
This makes the models especially powerful at tasks such as clustering for long documents like legal text or product documentation. Support for bilingual text input – Recent research shows that multilingual models without specific language training show strong biases towards English grammatical structures in embeddings.
Many AWS media and entertainment customers license IMDb data through AWS Data Exchange to improve content discovery and increase customer engagement and retention. We downloaded the data from AWS Data Exchange and processed it in AWS Glue to generate KG files.
Given this mission, Talent.com and AWS joined forces to create a job recommendation engine using state-of-the-art natural language processing (NLP) and deep learning model training techniques with Amazon SageMaker to provide an unrivaled experience for job seekers. The recommendation system has driven an 8.6%
In this post, Reveal experts showcase how they used Amazon Comprehend in their document processing pipeline to detect and redact individual pieces of PII. Amazon Comprehend is a fully managed and continuously trained natural language processing (NLP) service that can extract insight about the content of a document or text.
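A minimal sketch of that detect-and-redact pattern with boto3 is shown below: detect_pii_entities returns character offsets for each entity, which can be used to mask the original text. The sample text and masking format are illustrative, not Reveal's pipeline.

```python
import boto3

comprehend = boto3.client("comprehend")

text = "Contact Jane Doe at jane.doe@example.com or 555-0100."
entities = comprehend.detect_pii_entities(Text=text, LanguageCode="en")["Entities"]

redacted = text
# Replace from the end of the string so earlier offsets stay valid.
for entity in sorted(entities, key=lambda e: e["BeginOffset"], reverse=True):
    begin, end = entity["BeginOffset"], entity["EndOffset"]
    redacted = redacted[:begin] + f"[{entity['Type']}]" + redacted[end:]

print(redacted)  # e.g. "Contact [NAME] at [EMAIL] or [PHONE]."
```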
PBAs, such as graphics processing units (GPUs), have an important role to play in both these phases. The following figure illustrates the idea of a large cluster of GPUs being used for learning, followed by a smaller number for inference. Suppliers of data center GPUs include NVIDIA, AMD, Intel, and others.
Natural language processing (NLP) has been growing in awareness over the last few years, and with the popularity of ChatGPT and GPT-3 in 2022, NLP is now at the top of people’s minds when it comes to AI. Java has numerous libraries designed for the language, including CoreNLP, OpenNLP, and others.
Furthermore, the dynamic nature of a customer’s data can also result in a large variance of the processing time and resources required to optimally complete the feature engineering. AWS customer Vericast is a marketing solutions company that makes data-driven decisions to boost marketing ROIs for its clients.
Cohere’s multilingual embedding model generates vector representations of documents for over 100 languages and is available on Amazon Bedrock. This allows AWS customers to access it as an API, which eliminates the need to manage the underlying infrastructure and ensures that sensitive information remains securely managed and protected.
SageMaker also offers a range of built-in algorithms for common use cases like computer vision, natural language processing, and tabular data, making it easy to get started with pre-built models for various tasks. For this walkthrough, you should have an AWS account set up.
We use a combination of different AWS services, open-source foundation models (FLAN-T5 XXL for text generation and GPT-J-6B for embeddings), and packages such as LangChain for interfacing with all the components and Streamlit for building the bot frontend. Amazon SageMaker Processing jobs handle large-scale data ingestion into OpenSearch.
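A hedged sketch of the query path follows: embed the question with the GPT-J-6B endpoint, retrieve matching passages from OpenSearch (omitted here), and ask the FLAN-T5 XXL endpoint to answer from that context. The endpoint names and payload shapes are assumptions, since JumpStart endpoints differ by model version.

```python
import json
import boto3

runtime = boto3.client("sagemaker-runtime")


def embed(question: str) -> list[float]:
    # Assumed endpoint name and request/response format for the embedding model.
    response = runtime.invoke_endpoint(
        EndpointName="gptj-6b-embeddings",  # hypothetical endpoint name
        ContentType="application/json",
        Body=json.dumps({"text_inputs": [question]}),
    )
    return json.loads(response["Body"].read())["embedding"][0]


def generate(question: str, context: str) -> str:
    # Assumed endpoint name and payload for the FLAN-T5 XXL text generation model.
    prompt = f"Answer the question using the context.\nContext: {context}\nQuestion: {question}"
    response = runtime.invoke_endpoint(
        EndpointName="flan-t5-xxl",  # hypothetical endpoint name
        ContentType="application/json",
        Body=json.dumps({"text_inputs": prompt, "max_length": 200}),
    )
    return json.loads(response["Body"].read())["generated_texts"][0]
```

In the post itself, LangChain wires these pieces together; the sketch above just makes the two endpoint calls explicit.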
Tensor Processing Units (TPUs) Developed by Google, TPUs are optimized for Machine Learning tasks, providing even greater efficiency than traditional GPUs for specific applications. The demand for advanced hardware continues to grow as organisations seek to develop more sophisticated Generative AI applications.
Nodes run the pods and are usually grouped in a Kubernetes cluster, abstracting the underlying physical hardware resources. Kubernetes’s declarative, API-driven infrastructure has helped free up DevOps and other teams from manually driven processes so they can work more independently and efficiently to achieve their goals.
The broad potential is why companies including AWS , IBM , Glean , Google, Microsoft, NVIDIA, Oracle and Pinecone are adopting RAG. When complete, the work, which ran on a cluster of NVIDIA GPUs, showed how to make generative AI models more authoritative and trustworthy.