The excitement is building for the fourteenth edition of AWS re:Invent, and as always, Las Vegas is set to host this spectacular event. Third, we’ll explore the robust infrastructure services from AWS powering AI innovation, featuring Amazon SageMaker, AWS Trainium, and AWS Inferentia under AI/ML, as well as Compute topics.
Prerequisites: Before you begin, make sure you have the following prerequisites in place: an AWS account and role with the AWS Identity and Access Management (IAM) privileges to deploy the following resources: IAM roles. For this post, we use a provisioned Amazon Redshift cluster and a SageMaker domain.
AWS was delighted to present to and connect with over 18,000 in-person and 267,000 virtual attendees at NVIDIA GTC, a global artificial intelligence (AI) conference that took place in March 2024 in San Jose, California, returning to a hybrid, in-person experience for the first time since 2019.
Orchestrate with Tecton-managed EMR clusters – After features are deployed, Tecton automatically creates the scheduling, provisioning, and orchestration needed for pipelines that can run on Amazon EMR compute engines. You can view and create EMR clusters directly through the SageMaker notebook.
Because Amazon Bedrock is serverless, you don’t have to manage infrastructure, and you can securely integrate and deploy generative AI capabilities into your applications using the AWS services you are already familiar with. AWS Prototyping developed an AWS Cloud Development Kit (AWS CDK) stack for deployment following AWS best practices.
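The post does not reproduce the stack code, so the following is only a rough, hypothetical sketch of the AWS CDK (v2, Python) pattern such a deployment follows; the stack and resource names here are invented, not the prototype's:

```python
# Minimal, hypothetical AWS CDK v2 stack sketch; names are placeholders.
import aws_cdk as cdk
from aws_cdk import aws_s3 as s3
from constructs import Construct

class GenAiDemoStack(cdk.Stack):
    def __init__(self, scope: Construct, construct_id: str, **kwargs) -> None:
        super().__init__(scope, construct_id, **kwargs)
        # Example resource: an encrypted S3 bucket for application assets.
        s3.Bucket(
            self,
            "AssetsBucket",
            encryption=s3.BucketEncryption.S3_MANAGED,
            removal_policy=cdk.RemovalPolicy.DESTROY,  # demo-friendly cleanup
        )

app = cdk.App()
GenAiDemoStack(app, "GenAiDemoStack")
app.synth()
```

Running cdk deploy against an app like this synthesizes a CloudFormation template and provisions the declared resources.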
In April 2023, AWS unveiled Amazon Bedrock, which provides a way to build generative AI-powered apps via pre-trained models from startups including AI21 Labs, Anthropic, and Stability AI. Amazon Bedrock also offers access to Titan foundation models, a family of models trained in-house by AWS. Deploy the AWS CDK application.
Building foundation models (FMs) requires building, maintaining, and optimizing large clusters to train models with tens to hundreds of billions of parameters on vast amounts of data. SageMaker HyperPod integrates the Slurm Workload Manager for cluster and training job orchestration.
To reduce the barrier to entry of ML at the edge, we wanted to demonstrate an example of deploying a pre-trained model from Amazon SageMaker to AWS Wavelength, all in less than 100 lines of code. In this post, we demonstrate how to deploy a SageMaker model to AWS Wavelength to reduce model inference latency for 5G network-based applications.
Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies and AWS. Solution overview The following diagram provides a high-level overview of AWS services and features through a sample use case.
Amazon Redshift uses SQL to analyze structured and semi-structured data across data warehouses, operational databases, and data lakes, using AWS-designed hardware and ML to deliver the best price-performance at any scale. Prerequisites To continue with the examples in this post, you need to create the required AWS resources.
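As one concrete way to run such a query from Python, here is a minimal sketch using the Redshift Data API; the cluster identifier, database, user, and table are hypothetical:

```python
# Hedged sketch: query a provisioned Redshift cluster via the Data API.
import time

import boto3

client = boto3.client("redshift-data")
resp = client.execute_statement(
    ClusterIdentifier="my-redshift-cluster",  # placeholder cluster ID
    Database="dev",                           # placeholder database
    DbUser="awsuser",                         # placeholder user
    Sql="SELECT COUNT(*) FROM sales;",        # placeholder table
)

# Poll until the statement reaches a terminal state, then fetch rows.
terminal = {"FINISHED", "FAILED", "ABORTED"}
while client.describe_statement(Id=resp["Id"])["Status"] not in terminal:
    time.sleep(1)
print(client.get_statement_result(Id=resp["Id"])["Records"])
```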
Check out the following demo to see how it works. It’s straightforward to deploy in your AWS account. Prerequisites You need to have an AWS account and an AWS Identity and Access Management (IAM) role and user with permissions to create and manage the necessary resources and components for this application.
Amazon Titan Text Embeddings is a text embeddings model that converts natural language text—consisting of single words, phrases, or even large documents—into numerical representations that can be used to power use cases such as search, personalization, and clustering based on semantic similarity.
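A minimal sketch of calling this model through the Amazon Bedrock runtime; the input sentence is invented, and the model ID shown is the v1 Titan embeddings model:

```python
# Hedged sketch: embed a sentence with Amazon Titan Text Embeddings.
import json

import boto3

bedrock = boto3.client("bedrock-runtime")
response = bedrock.invoke_model(
    modelId="amazon.titan-embed-text-v1",
    body=json.dumps({"inputText": "waterproof hiking jacket"}),  # example query
)
embedding = json.loads(response["body"].read())["embedding"]
print(len(embedding))  # vector dimension (1536 for this model)
```

Vectors like this can be indexed in a vector store and compared by cosine similarity to power semantic search or clustering.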
Prerequisites To implement this solution, complete the following prerequisites: Have AWS Cloud admin access with an AWS Identity and Access Management (IAM) user with permissions required to complete the integration. Enter a connection name such as demo and choose your desired Amazon DocumentDB cluster.
You can also use an AWS CloudFormation template by following the GitHub instructions to create a domain. By using an interface VPC endpoint (interface endpoint), the communication between your VPC and Studio is conducted entirely and securely within the AWS network. For demo purposes, we use approximately 1,600 products.
Today, we’re pleased to announce the preview of Amazon SageMaker Profiler, a capability of Amazon SageMaker that provides a detailed view into the AWS compute resources provisioned while training deep learning models on SageMaker. The following table provides the links to the supported AWS Deep Learning Containers for SageMaker.
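Based on the pattern shown in the launch materials, enabling the profiler on a training job looks roughly like the following; the training script, role, and instance type are placeholders:

```python
# Hedged sketch: attach SageMaker Profiler to a PyTorch training job.
from sagemaker import Profiler, ProfilerConfig
from sagemaker.pytorch import PyTorch

profiler_config = ProfilerConfig(
    profile_params=Profiler(cpu_profiling_duration=3600)  # seconds of CPU profiling
)

estimator = PyTorch(
    entry_point="train.py",  # placeholder training script
    role="arn:aws:iam::111122223333:role/SageMakerRole",  # placeholder role
    framework_version="2.0.0",
    py_version="py310",
    instance_count=1,
    instance_type="ml.p4d.24xlarge",
    profiler_config=profiler_config,
)
estimator.fit()
```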
AWS is uniquely positioned to help you address these challenges through generative AI, with a broad and deep range of AI/ML services and over 20 years of experience in developing AI/ML technologies. Under Connect Amazon Q to IAM Identity Center, choose Create account instance to create a custom credential set for this demo.
Usually, if the dataset or model is too large to be trained on a single instance, distributed training allows multiple instances within a cluster to be used, distributing either data or model partitions across those instances during the training process. Each account or Region has its own training instances.
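One concrete way to express this on SageMaker is its distributed data parallel library, which shards the dataset across the instances of a single job; a sketch, with the training script and role as placeholders:

```python
# Hedged sketch: data-parallel training across 4 GPU instances on SageMaker.
from sagemaker.pytorch import PyTorch

estimator = PyTorch(
    entry_point="train.py",  # placeholder training script
    role="arn:aws:iam::111122223333:role/SageMakerRole",  # placeholder role
    framework_version="2.0.0",
    py_version="py310",
    instance_count=4,        # data partitions are spread across 4 instances
    instance_type="ml.p4d.24xlarge",
    distribution={"smdistributed": {"dataparallel": {"enabled": True}}},
)
```

Model-parallel training follows the same estimator pattern with a different distribution configuration.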
Then we needed to Dockerize the application, write a deployment YAML file, deploy the gRPC server to our Kubernetes cluster, and make sure it’s reliable and auto-scalable. It also includes support for new hardware like ARM (both in servers like AWS Graviton and laptops with Apple M1) and AWS Inferentia.
The MLOps Management Agent provides a framework to automate the entire model deployment lifecycle in any environment or infrastructure, such as Azure, GCP, AWS, or your own on-premises Kubernetes cluster. On the What’s New page, you can find a demo video to see how it works. Request a Demo. It is available in Release 7.1.
To try out the solution in your own account, make sure that you have the following in place: an AWS account. To run this JumpStart solution and have the infrastructure deploy to your AWS account, you must create an active Amazon SageMaker Studio instance (see Onboard to Amazon SageMaker Studio).
The code for all the steps in this demo is available in the following notebook. These attributes are only default values; you can override them and retain granular control over the AWS models you create. LMI is an AWS-built LLM software stack (container) that offers easy-to-use functions and performance gain on generative AI models.
Managed Spot Training is supported in all AWS Regions where Amazon SageMaker is currently available. In this demo, we use a JumpStart Flan-T5 XXL model endpoint. SageMaker Savings Plans apply only to SageMaker ML Instance usage.
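A minimal sketch of enabling Managed Spot Training on a generic estimator; the image URI, role, and checkpoint bucket are placeholders:

```python
# Hedged sketch: Managed Spot Training with checkpointing for interruptions.
from sagemaker.estimator import Estimator

estimator = Estimator(
    image_uri="<training-image-uri>",  # placeholder container image
    role="arn:aws:iam::111122223333:role/SageMakerRole",  # placeholder role
    instance_count=1,
    instance_type="ml.g5.2xlarge",
    use_spot_instances=True,  # request Spot capacity for the training job
    max_run=3600,             # cap on actual training seconds
    max_wait=7200,            # cap on total seconds, including waiting for Spot
    checkpoint_s3_uri="s3://my-bucket/checkpoints/",  # resume point after interruption
)
```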
You can efficiently deploy the pre-trained J2-jumbo-instruct, or other Jurassic-2 models available on AWS Marketplace, into your own virtual private cloud (VPC) using Amazon SageMaker. 24xlarge") # Create a SageMaker endpoint, then deploy a pre-trained J2-jumbo-instruct-v1 model from AWS Marketplace.
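The deployment itself follows the standard Marketplace model-package pattern; a hedged sketch, with the package ARN and role as placeholders:

```python
# Hedged sketch: deploy a Marketplace model package to a SageMaker endpoint.
import sagemaker
from sagemaker import ModelPackage

session = sagemaker.Session()
model = ModelPackage(
    role="arn:aws:iam::111122223333:role/SageMakerRole",  # placeholder role
    model_package_arn=(
        "arn:aws:sagemaker:us-east-1:111122223333:"
        "model-package/j2-jumbo-instruct"  # placeholder ARN from AWS Marketplace
    ),
    sagemaker_session=session,
)
predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.g5.24xlarge",
    endpoint_name="j2-jumbo-instruct",  # placeholder endpoint name
)
```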
We tackle that by learning these clusters in the foundation model’s embedding space and providing those clusters as the subgroups—and basically learning a weak supervision model on each of those clusters. You can register for a live demo of Snorkel Flow on February 16, which will feature the platform’s new FM capabilities.
We cover prompts for the following NLP tasks: text summarization, common-sense reasoning, question answering, sentiment classification, translation, pronoun resolution, text generation based on an article, and imaginary article generation based on a title. Code for all the steps in this demo is available in the following notebook.
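To make those prompt formats concrete, here is a hypothetical set of templates; the payload shape shown is the common JumpStart text-generation format and may differ by model:

```python
# Hedged sketch: zero-shot prompt templates for a few of the tasks above.
prompts = {
    "summarization": "Summarize the following text:\n{text}",
    "question_answering": (
        "Answer the question based on the context.\n"
        "Context: {context}\nQuestion: {question}"
    ),
    "sentiment": "Classify the sentiment of this review as positive or negative:\n{review}",
    "translation": "Translate to German:\n{text}",
}

payload = {
    "text_inputs": prompts["sentiment"].format(review="I loved this product!"),
    "max_length": 50,
}
# `payload` would be passed to the deployed endpoint, e.g. predictor.predict(payload).
```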
How will AI adopters react when the cost of renting infrastructure from AWS, Microsoft, or Google rises? Second, while OpenAI’s GPT-4 announcement last March demoed generating website code from a hand-drawn sketch, that capability wasn’t available until after the survey closed. But they may back off on AI development.
Then, I would use clustering techniques such as k-means or hierarchical clustering to group customers based on similarities in their purchasing behaviour. Have you worked with cloud-based data platforms like AWS, Google Cloud, or Azure? What approach would you take?
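As a small illustration of that clustering step (the features and values here are invented):

```python
# Hedged sketch: k-means customer segmentation on simple behavioural features.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Hypothetical features per customer: [annual_spend, order_count, avg_basket_size]
X = np.array([[1200, 24, 50], [300, 4, 75], [5000, 60, 83], [150, 2, 75]])
X_scaled = StandardScaler().fit_transform(X)  # scale so no feature dominates

kmeans = KMeans(n_clusters=2, n_init=10, random_state=42).fit(X_scaled)
print(kmeans.labels_)  # cluster assignment for each customer
```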
All the steps in this demo are available in the accompanying notebook Fine-tuning text generation GPT-J 6B model on a domain specific dataset. We serve developers and enterprises of all sizes through AWS, which offers a broad set of global compute, storage, database, and other service offerings.
For example, if you use AWS, you may prefer Amazon SageMaker as an MLOps platform that integrates with other AWS services. SageMaker Studio offers built-in algorithms, automated model tuning, and seamless integration with AWS services, making it a powerful platform for developing and deploying machine learning solutions at scale.
For example, you can use BigQuery, AWS, or Azure. It can be a cluster run by Kubernetes or maybe something else. In terms of the interaction, ideally, the data scientists shouldn’t have to be setting up infrastructure like a Spark cluster.
Generative AI on AWS can transform user experiences for customers while maintaining brand consistency and your desired customization. Here, we also prompted the LLM to use the company logo (which is the unicorn of AWS GameDay) to demonstrate incorporating existing design elements into the design. Prerequisites include having the AWS SDK for Python (Boto3) set up.
Quickly build and deploy an end-to-end ML pipeline with Kubeflow Pipelines on AWS. Kubeflow Pipelines is an orchestration tool for building and deploying portable, scalable, and reproducible end-to-end machine learning workflows directly on Kubernetes clusters. If you don’t already have an AWS account, create one.
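A minimal sketch of what a pipeline definition looks like with the Kubeflow Pipelines v2 SDK; the component bodies and output path are placeholders:

```python
# Hedged sketch: a two-step Kubeflow Pipelines v2 pipeline.
from kfp import compiler, dsl

@dsl.component
def preprocess() -> str:
    # Placeholder: would clean data and return its output location.
    return "s3://my-bucket/processed"

@dsl.component
def train(data_path: str):
    # Placeholder: would launch training against the processed data.
    print(f"Training on {data_path}")

@dsl.pipeline(name="demo-ml-pipeline")
def pipeline():
    step1 = preprocess()
    train(data_path=step1.output)  # wires step1's output into the train step

compiler.Compiler().compile(pipeline, "pipeline.yaml")
```

The compiled pipeline.yaml can then be uploaded to a Kubeflow Pipelines deployment running on the Kubernetes cluster.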
The following demo shows Agent Creator in action. SnapLogic uses Amazon Bedrock to build its platform, capitalizing on the proximity to data already stored in Amazon Web Services (AWS). To address customers’ requirements about data privacy and sovereignty, SnapLogic deploys the data plane within the customer’s VPC on AWS.
In this post, we show you how SnapLogic, an AWS customer, used Amazon Bedrock to power their SnapGPT product through automated creation of these complex DSL artifacts from human language. SnapLogic background: SnapLogic is an AWS customer on a mission to bring enterprise automation to the world.
We are excited to announce a new version of the Amazon SageMaker Operators for Kubernetes using the AWS Controllers for Kubernetes (ACK). ACK is a framework for building Kubernetes custom controllers, where each controller communicates with an AWS service API. They are also supported by AWS CloudFormation. Release v1.2.9
The demo implementation code is available in the following GitHub repo. Utilizing the latest Hugging Face LLM modules on Amazon SageMaker, AWS customers can now tap into the power of SageMaker deep learning containers (DLCs). With SageMaker, you can seamlessly deploy IDEFICS-9b-instruct on a g5.2xlarge instance for inference tasks.
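A sketch of that deployment, assuming the Hugging Face LLM inference container; the container version and environment settings are assumptions, not taken from the post:

```python
# Hedged sketch: deploy IDEFICS-9b-instruct with the Hugging Face LLM DLC.
import sagemaker
from sagemaker.huggingface import HuggingFaceModel, get_huggingface_llm_image_uri

role = sagemaker.get_execution_role()  # assumes a SageMaker execution context
image_uri = get_huggingface_llm_image_uri("huggingface", version="1.1.0")  # assumed version

model = HuggingFaceModel(
    image_uri=image_uri,
    env={
        "HF_MODEL_ID": "HuggingFaceM4/idefics-9b-instruct",  # Hugging Face Hub model ID
        "SM_NUM_GPUS": "1",  # one GPU on a g5.2xlarge
    },
    role=role,
)
predictor = model.deploy(initial_instance_count=1, instance_type="ml.g5.2xlarge")
```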
In this post, we explore how organizations can address these challenges and cost-effectively customize and adapt FMs using AWS managed services such as Amazon SageMaker training jobs and Amazon SageMaker HyperPod. After the training is complete, SageMaker spins down the cluster and the customer is billed for the net training time in seconds.
For multiple-choice reasoning, we prompt AI21 Labs Jurassic-2 Mid on a small sample of questions from the AWS Certified Solutions Architect – Associate exam. Prerequisites: This walkthrough assumes the following prerequisites: an AWS account with an ml.t3.medium. We use Cohere Command and AI21 Labs Jurassic-2 Mid for this demo.
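Assuming access to Jurassic-2 Mid through Amazon Bedrock (the demo may use a different access path), a minimal multiple-choice prompt could look like the following; the sample question is invented:

```python
# Hedged sketch: multiple-choice prompting of Jurassic-2 Mid via Bedrock.
import json

import boto3

bedrock = boto3.client("bedrock-runtime")
body = json.dumps({
    "prompt": (
        "Answer with a single letter.\n"
        "Which AWS service is used to define a virtual network?\n"
        "A) Amazon VPC\nB) AWS Direct Connect\nC) Amazon Route 53\nAnswer:"
    ),
    "maxTokens": 5,
    "temperature": 0,  # deterministic output for grading
})
response = bedrock.invoke_model(modelId="ai21.j2-mid-v1", body=body)
completion = json.loads(response["body"].read())["completions"][0]["data"]["text"]
print(completion.strip())  # the model's selected choice
```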
A GPU machine on GCP or AWS also has a CPU on it. How do you look at an on-premises GPU cluster, managed by the NVIDIA AI Enterprise software suite in combination with Red Hat OpenShift or VMware Tanzu, versus something like the AWS stack or Azure stack for the same GPU cluster managed by EKS, for example?
We ask this during product demos, user and support calls, and on our MLOps LIVE podcast. Orchestrators are concerned with lower-level abstractions like machines, instances, clusters, service-level grouping, replication, and so on. If your organization runs its workloads on AWS, it might be worth it to leverage Amazon SageMaker.
On the Add additional capacity page, select Developer edition (for this demo) and choose Next. Under Authentication, if you already have credentials stored in AWS Secrets Manager, choose it on the dropdown. Otherwise, choose Create and add new secret. Choose Next. Under User-group expansion, select None.
Prerequisites: You should have the following prerequisites: an AWS account, a SageMaker notebook instance, and an S3 bucket to store the input data. Process the data: To start, upload the log dataset to an S3 bucket in your AWS account. In the notebook, the excerpted commands resolve the account ID via boto3.client("sts").get_caller_identity().get("Account") and authenticate to Amazon ECR via !$(aws ecr get-login --region $region --registry-ids $account_id --no-include-email).