AWS, Data Models and ML - Data Science Current

Accelerating ML experimentation with enhanced security: AWS PrivateLink support for Amazon SageMaker with MLflow

AWS Machine Learning Blog

DECEMBER 9, 2024

With access to a wide range of generative AI foundation models (FM) and the ability to build and train their own machine learning (ML) models in Amazon SageMaker , users want a seamless and secure way to experiment with and select the models that deliver the most value for their business.

AWS

AWS ML ML Data Scientist

Unstructured data management and governance using AWS AI/ML and analytics services

Flipboard

OCTOBER 25, 2023

Unstructured data is information that doesn’t conform to a predefined schema or isn’t organized according to a preset data model. Text, images, audio, and videos are common examples of unstructured data. Additionally, we show how to use AWS AI/ML services for analyzing unstructured data.

AWS

AWS ML ML Analytics

How Rocket Companies modernized their data science solution on AWS

AWS Machine Learning Blog

FEBRUARY 21, 2025

The Hadoop environment was hosted on Amazon Elastic Compute Cloud (Amazon EC2) servers, managed in-house by Rockets technology team, while the data science experience infrastructure was hosted on premises. Communication between the two systems was established through Kerberized Apache Livy (HTTPS) connections over AWS PrivateLink.

Data Science

Data Science AWS Hadoop Data Scientist

Webinars

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Governing the ML lifecycle at scale, Part 1: A framework for architecting ML workloads using Amazon SageMaker

AWS Machine Learning Blog

OCTOBER 20, 2023

Customers of every size and industry are innovating on AWS by infusing machine learning (ML) into their products and services. Recent developments in generative AI models have further sped up the need of ML adoption across industries.

ML

ML ML AWS Data Lakes

Develop and train large models cost-efficiently with Metaflow and AWS Trainium

AWS Machine Learning Blog

APRIL 29, 2024

In 2024, however, organizations are using large language models (LLMs), which require relatively little focus on NLP, shifting research and development from modeling to the infrastructure needed to support LLM workflows. Metaflow’s coherent APIs simplify the process of building real-world ML/AI systems in teams.

AWS

AWS ML ML Python

Modernizing data science lifecycle management with AWS and Wipro

AWS Machine Learning Blog

JANUARY 5, 2024

This post was written in collaboration with Bhajandeep Singh and Ajay Vishwakarma from Wipro’s AWS AI/ML Practice. Many organizations have been using a combination of on-premises and open source data science solutions to create and manage machine learning (ML) models.

AWS

AWS Data Science ML ML

Building an efficient MLOps platform with OSS tools on Amazon ECS with AWS Fargate

AWS Machine Learning Blog

SEPTEMBER 18, 2024

The ZMP analyzes billions of structured and unstructured data points to predict consumer intent by using sophisticated artificial intelligence (AI) to personalize experiences at scale. Hosted on Amazon ECS with tasks run on Fargate, this platform streamlines the end-to-end ML workflow, from data ingestion to model deployment.

AWS

AWS Machine Learning Machine Learning ML

How Carrier predicts HVAC faults using AWS Glue and Amazon SageMaker

AWS Machine Learning Blog

SEPTEMBER 5, 2023

In order to improve our equipment reliability, we partnered with the Amazon Machine Learning Solutions Lab to develop a custom machine learning (ML) model capable of predicting equipment issues prior to failure. Our teams developed a framework for processing over 50 TB of historical sensor data and predicting faults with 91% precision.

AWS

AWS ML ML Machine Learning

TigerEye (YC S22) Is Hiring a Full Stack Engineer

Hacker News

NOVEMBER 19, 2024

Here are a few of the things that you might do as an AI Engineer at TigerEye: - Design, develop, and validate statistical models to explain past behavior and to predict future behavior of our customers’ sales teams - Own training, integration, deployment, versioning, and monitoring of ML components - Improve TigerEye’s existing metrics collection and (..)

Computer Science

Computer Science Computer Science ML ML

Security best practices to consider while fine-tuning models in Amazon Bedrock

AWS Machine Learning Blog

JANUARY 24, 2025

In this post, we delve into the essential security best practices that organizations should consider when fine-tuning generative AI models. Security in Amazon Bedrock Cloud security at AWS is the highest priority. Amazon Bedrock prioritizes security through a comprehensive approach to protect customer data and AI workloads.

AWS

AWS AI AI Machine Learning

Remote Data Science Jobs: 5 High-Demand Roles for Career Growth

Data Science Dojo

OCTOBER 31, 2024

Growth Outlook: Companies like Google DeepMind, NASA’s Jet Propulsion Lab, and IBM Research actively seek research data scientists for their teams, with salaries typically ranging from $120,000 to $180,000. With the continuous growth in AI, demand for remote data science jobs is set to rise.

Data Science

Data Science Data Scientist Machine Learning Machine Learning

Derive meaningful and actionable operational insights from AWS Using Amazon Q Business

AWS Machine Learning Blog

JULY 17, 2024

As a customer, you rely on Amazon Web Services (AWS) expertise to be available and understand your specific environment and operations. Amazon Q Business is a fully managed, secure, generative-AI powered enterprise chat assistant that enables natural language interactions with your organization’s data.

AWS

AWS AI AI Artificial Intelligence

Frugality meets Accuracy: Cost-efficient training of GPT NeoX and Pythia models with AWS Trainium

AWS Machine Learning Blog

DECEMBER 12, 2023

In this post, we’ll summarize training procedure of GPT NeoX on AWS Trainium , a purpose-built machine learning (ML) accelerator optimized for deep learning training. M tokens/$) trained such models with AWS Trainium without losing any model quality. We’ll outline how we cost-effectively (3.2 billion in Pythia.

AWS

AWS Machine Learning Machine Learning Deep Learning

How Light & Wonder built a predictive maintenance solution for gaming machines on AWS

AWS Machine Learning Blog

JUNE 22, 2023

Working with AWS, Light & Wonder recently developed an industry-first secure solution, Light & Wonder Connect (LnW Connect), to stream telemetry and machine health data from roughly half a million electronic gaming machines distributed across its casino customer base globally when LnW Connect reaches its full potential.

AWS

AWS ML ML Machine Learning

Enhance speech synthesis and video generation models with RLHF using audio and video segmentation in Amazon SageMaker

AWS Machine Learning Blog

NOVEMBER 21, 2024

We guide you through deploying the necessary infrastructure using AWS CloudFormation , creating an internal labeling workforce, and setting up your first labeling job. This precision helps models learn the fine details that separate natural from artificial-sounding speech. We demonstrate how to use Wavesurfer.js

AWS

AWS AI AI Natural Language Processing

How Axfood enables accelerated machine learning throughout the organization using Amazon SageMaker

AWS Machine Learning Blog

FEBRUARY 27, 2024

In this post, we share how Axfood, a large Swedish food retailer, improved operations and scalability of their existing artificial intelligence (AI) and machine learning (ML) operations by prototyping in close collaboration with AWS experts and using Amazon SageMaker.

Machine Learning

Machine Learning Machine Learning ML ML

Automate mortgage document fraud detection using an ML model and business-defined rules with Amazon Fraud Detector: Part 3

AWS Machine Learning Blog

FEBRUARY 7, 2024

In the first post of this three-part series, we presented a solution that demonstrates how you can automate detecting document tampering and fraud at scale using AWS AI and machine learning (ML) services for a mortgage underwriting use case. Add rules to interpret model scores. Deploy the API to make predictions.

ML

ML ML AWS Data Profiling

Build well-architected IDP solutions with a custom lens – Part 4: Performance efficiency

AWS Machine Learning Blog

NOVEMBER 22, 2023

The AWS Well-Architected Framework provides a systematic way for organizations to learn operational and architectural best practices for designing and operating reliable, secure, efficient, cost-effective, and sustainable workloads in the cloud. These resources introduce common AWS services for IDP workloads and suggested workflows.

AWS

AWS ML ML Machine Learning

Apply fine-grained data access controls with AWS Lake Formation in Amazon SageMaker Data Wrangler

AWS Machine Learning Blog

AUGUST 21, 2023

Amazon SageMaker Data Wrangler reduces the time it takes to collect and prepare data for machine learning (ML) from weeks to minutes. Data is frequently kept in data lakes that can be managed by AWS Lake Formation , giving you the ability to implement fine-grained access control using a straightforward grant or revoke procedure.

AWS

AWS Data Lakes Clustering Data Preparation

Deploy a Hugging Face (PyAnnote) speaker diarization model on Amazon SageMaker as an asynchronous endpoint

AWS Machine Learning Blog

APRIL 25, 2024

We provide a comprehensive guide on how to deploy speaker segmentation and clustering solutions using SageMaker on the AWS Cloud. Solution overview Amazon Transcribe is the go-to service for speaker diarization in AWS. Hugging Face is a popular open source hub for machine learning (ML) models.

AWS

AWS ML ML Python

How InsuranceDekho transformed insurance agent interactions using Amazon Bedrock and generative AI

AWS Machine Learning Blog

NOVEMBER 18, 2024

The key reasons that influenced this decision were: Managed service – Amazon Bedrock is a fully serverless offering that offers a choice of industry leading FMs without provisioning infrastructure, procuring GPUs around the clock, or configuring ML frameworks.

AI

AI AI Database AWS

MLOps and DevOps: Why Data Makes It Different

O'Reilly Media

OCTOBER 19, 2021

As with many burgeoning fields and disciplines, we don’t yet have a shared canonical infrastructure stack or best practices for developing and deploying data-intensive applications. What does a modern technology stack for streamlined ML processes look like? Why: Data Makes It Different. All ML projects are software projects.

ML

ML ML Data Scientist AWS

How Earth.com and Provectus implemented their MLOps Infrastructure with Amazon SageMaker

AWS Machine Learning Blog

JUNE 27, 2023

When machine learning (ML) models are deployed into production and employed to drive business decisions, the challenge often lies in the operation and management of multiple models. That is where Provectus , an AWS Premier Consulting Partner with competencies in Machine Learning, Data & Analytics, and DevOps, stepped in.

ML

ML ML AWS Machine Learning

Transition your Amazon Forecast usage to Amazon SageMaker Canvas

AWS Machine Learning Blog

JULY 29, 2024

Amazon Forecast is a fully managed service that uses statistical and machine learning (ML) algorithms to deliver highly accurate time series forecasts. With SageMaker Canvas, you get faster model building , cost-effective predictions, advanced features such as a model leaderboard and algorithm selection, and enhanced transparency.

ML

ML ML Algorithm AWS

Implementing Knowledge Bases for Amazon Bedrock in support of GDPR (right to be forgotten) requests

AWS Machine Learning Blog

MAY 31, 2024

With the Amazon Bedrock serverless experience, you can get started quickly, privately customize FMs with your own data, and integrate and deploy them into your applications using the Amazon Web Services (AWS) tools without having to manage infrastructure. However, this is beyond the scope of this post.

AWS

AWS Machine Learning Machine Learning Database

Fine-tune Meta Llama 3.1 models using torchtune on Amazon SageMaker

AWS Machine Learning Blog

SEPTEMBER 19, 2024

In this post, AWS collaborates with Meta’s PyTorch team to showcase how you can use Meta’s torchtune library to fine-tune Meta Llama-like architectures while using a fully-managed environment provided by Amazon SageMaker Training. SageMaker Training is a comprehensive, fully managed ML service that enables scalable model training.

AWS

AWS ML ML Machine Learning

Automate the deployment of an Amazon Forecast time-series forecasting model

AWS Machine Learning Blog

MAY 4, 2023

Simple methods for time series forecasting use historical values of the same variable whose future values need to be predicted, whereas more complex, machine learning (ML)-based methods use additional information, such as the time series data of related variables. All of this is possible without having to write or compile code.

AWS

AWS ML ML Data Scientist

The Essential Tools for ML Evaluation and Responsible AI

ODSC - Open Data Science

OCTOBER 21, 2024

Fortunately, there are many tools for ML evaluation and frameworks designed to support responsible AI development and evaluation. But let’s first take a look at some of the tools for ML evaluation that are popular for responsible AI. It includes methods for addressing fairness issues by adjusting training data, models, or outputs.

ML

ML ML Machine Learning Machine Learning

MLOps Landscape in 2023: Top Tools and Platforms

The MLOps Blog

JUNE 27, 2023

For example, if you use AWS, you may prefer Amazon SageMaker as an MLOps platform that integrates with other AWS services. Knowledge and skills in the organization Evaluate the level of expertise and experience of your ML team and choose a tool that matches their skill set and learning curve.

Machine Learning

Machine Learning Machine Learning ML ML

Scaling Thomson Reuters’ language model research with Amazon SageMaker HyperPod

AWS Machine Learning Blog

SEPTEMBER 12, 2024

Thomson Reuters , a global content and technology-driven company, has been using artificial intelligence and machine learning (AI/ML) in its professional information products for decades. LLMs disrupt the industry Towards the end of 2022, groundbreaking LLMs were released that realized drastic improvements over previous model capabilities.

Clustering

Clustering AWS ML ML

Mastering Version Control for ML Models: Best Practices You Need to Know

DagsHub

AUGUST 29, 2024

Source: Author Introduction Machine learning (ML) models, like other software, are constantly changing and evolving. Version control systems (VCS) play a key role in this area by offering a structured method to track changes made to models and handle versions of data and code used in these ML projects.

ML

ML ML Python Machine Learning

Implement a custom AutoML job using pre-selected algorithms in Amazon SageMaker Automatic Model Tuning

AWS Machine Learning Blog

NOVEMBER 15, 2023

AutoML allows you to derive rapid, general insights from your data right at the beginning of a machine learning (ML) project lifecycle. Understanding up front which preprocessing techniques and algorithm types provide best results reduces the time to develop, train, and deploy the right model.

Algorithm

Algorithm AWS ML ML

MLOps for IoT Edge Ecosystems: Building an MLOps Environment on AWS

The MLOps Blog

JANUARY 11, 2023

This can enable the company to leverage the data generated by its IoT edge devices to drive business decisions and gain a competitive advantage. AWS offers a three-layered machine learning stack to choose from based on your skill set and team’s requirements for implementing workloads to execute machine learning tasks.

AWS

AWS Machine Learning Machine Learning ML

Say Goodbye to Costly BERT Inference: Turbocharge with AWS Inferentia2 and Hugging Face…

Mlearning.ai

JUNE 7, 2023

AWS Inferentia accelerators are custom-built machine learning inference chips designed by Amazon Web Services (AWS) to optimize inference workloads on the AWS platform. The AWS Inferentia chips are designed with a focus on delivering high performance, low latency, and cost efficiency for inference workloads.

AWS

AWS Deep Learning Deep Learning AI

Explore advanced techniques for hyperparameter optimization with Amazon SageMaker Automatic Model Tuning

AWS Machine Learning Blog

NOVEMBER 10, 2023

Creating high-performance machine learning (ML) solutions relies on exploring and optimizing training parameters, also known as hyperparameters. It provides key functionality that allows you to focus on the ML problem at hand while automatically keeping track of the trials and results. We use a Random Forest from SkLearn.

ML

ML ML Algorithm Python

Fine-tune Mixtral 8x7b on AWS SageMaker and Deploy to RunPod

Mlearning.ai

DECEMBER 22, 2023

You can jump AWS authentication steps if you're already in AWS 's environment. Tip: Only include libraries with the most updated version in the requirements.txt file, specially the transformerslibrary as it may miss newly updated models in older versions. You can find here more about it. 1.46k [00:00<?

AWS

AWS ML ML Python

Train a Large Language Model on a single Amazon SageMaker GPU with Hugging Face and LoRA

AWS Machine Learning Blog

JUNE 5, 2023

In this post, we show you how to train the 7-billion-parameter BloomZ model using just a single graphics processing unit (GPU) on Amazon SageMaker , Amazon’s machine learning (ML) platform for preparing, building, training, and deploying high-quality ML models. Then, it starts the training job.

AWS

AWS ML ML Machine Learning

Best practices for prompt engineering with Meta Llama 3 for Text-to-SQL use cases

AWS Machine Learning Blog

AUGUST 30, 2024

With the rapid growth of generative artificial intelligence (AI), many AWS customers are looking to take advantage of publicly available foundation models (FMs) and technologies. This includes Meta Llama 3, Meta’s publicly available large language model (LLM).

SQL

SQL AWS Database AI

Scalable training platform with Amazon SageMaker HyperPod for innovation: a video generation case study

AWS Machine Learning Blog

SEPTEMBER 26, 2024

It removes the undifferentiated heavy lifting involved in building and optimizing machine learning (ML) infrastructure for training foundation models (FMs). In this post, we share an ML infrastructure architecture that uses SageMaker HyperPod to support research team innovation in video generation.

Clustering

Clustering Algorithm ML ML

Amazon SageMaker Automatic Model Tuning now automatically chooses tuning configurations to improve usability and cost efficiency

AWS Machine Learning Blog

JUNE 5, 2023

Hyperparameter overview When training any machine learning (ML) model, you are generally dealing with three types of data: input data (also called the training data), model parameters, and hyperparameters. You use the input data to train your model, which in effect learns your model parameters.

Machine Learning

Machine Learning Machine Learning ML ML

Learnings From Building the ML Platform at Mailchimp

The MLOps Blog

OCTOBER 3, 2023

This article was originally an episode of the ML Platform Podcast , a show where Piotr Niedźwiedź and Aurimas Griciūnas, together with ML platform professionals, discuss design choices, best practices, example tool stacks, and real-world learnings from some of the best ML platform professionals. How do I develop my body of work?

ML

ML ML Data Scientist Machine Learning

Why is Git Not the Best for ML Model Version Control

The MLOps Blog

NOVEMBER 30, 2022

Data science teams currently struggle with managing multiple experiments and models and need an efficient way to store, retrieve, and utilize details like model versions, hyperparameters, and performance metrics. ML model versioning: where are we at? The short answer is we are in the middle of a data revolution.

ML

ML ML Machine Learning Machine Learning

Career in Python: Trending Job Roles

Pickl AI

JANUARY 30, 2023

The role of Python is not just limited to Data Science. It’s a universal programming language that finds application in different technologies like AI, ML, Big Data and others. Data Modeling : Using libraries like scikit-learn and Tensorflow, one can build and evaluate predictive models.

Python

Python Machine Learning Machine Learning Data Science

High-quality human feedback for your generative AI applications from Amazon SageMaker Ground Truth Plus

AWS Machine Learning Blog

MAY 30, 2023

All you do is share data along with labeling requirements, and Ground Truth Plus sets up and manages your data labeling workflow based on these requirements. From there, an expert workforce that is trained on a variety of machine learning (ML) tasks labels your data.

Machine Learning

Machine Learning Machine Learning AI AI

Accelerating ML experimentation with enhanced security: AWS PrivateLink support for Amazon SageMaker with MLflow

Unstructured data management and governance using AWS AI/ML and analytics services

Webinars

Trending Sources

How Rocket Companies modernized their data science solution on AWS

Webinars

Governing the ML lifecycle at scale, Part 1: A framework for architecting ML workloads using Amazon SageMaker

Develop and train large models cost-efficiently with Metaflow and AWS Trainium

Modernizing data science lifecycle management with AWS and Wipro

Building an efficient MLOps platform with OSS tools on Amazon ECS with AWS Fargate

How Carrier predicts HVAC faults using AWS Glue and Amazon SageMaker

TigerEye (YC S22) Is Hiring a Full Stack Engineer

Security best practices to consider while fine-tuning models in Amazon Bedrock

Remote Data Science Jobs: 5 High-Demand Roles for Career Growth

Derive meaningful and actionable operational insights from AWS Using Amazon Q Business

Frugality meets Accuracy: Cost-efficient training of GPT NeoX and Pythia models with AWS Trainium

How Light & Wonder built a predictive maintenance solution for gaming machines on AWS

Enhance speech synthesis and video generation models with RLHF using audio and video segmentation in Amazon SageMaker

How Axfood enables accelerated machine learning throughout the organization using Amazon SageMaker

Automate mortgage document fraud detection using an ML model and business-defined rules with Amazon Fraud Detector: Part 3

Build well-architected IDP solutions with a custom lens – Part 4: Performance efficiency

Apply fine-grained data access controls with AWS Lake Formation in Amazon SageMaker Data Wrangler

Deploy a Hugging Face (PyAnnote) speaker diarization model on Amazon SageMaker as an asynchronous endpoint

How InsuranceDekho transformed insurance agent interactions using Amazon Bedrock and generative AI

MLOps and DevOps: Why Data Makes It Different

How Earth.com and Provectus implemented their MLOps Infrastructure with Amazon SageMaker

Transition your Amazon Forecast usage to Amazon SageMaker Canvas

Implementing Knowledge Bases for Amazon Bedrock in support of GDPR (right to be forgotten) requests

Fine-tune Meta Llama 3.1 models using torchtune on Amazon SageMaker

Automate the deployment of an Amazon Forecast time-series forecasting model

The Essential Tools for ML Evaluation and Responsible AI

MLOps Landscape in 2023: Top Tools and Platforms

Scaling Thomson Reuters’ language model research with Amazon SageMaker HyperPod

Mastering Version Control for ML Models: Best Practices You Need to Know

Implement a custom AutoML job using pre-selected algorithms in Amazon SageMaker Automatic Model Tuning

MLOps for IoT Edge Ecosystems: Building an MLOps Environment on AWS

Say Goodbye to Costly BERT Inference: Turbocharge with AWS Inferentia2 and Hugging Face…

Explore advanced techniques for hyperparameter optimization with Amazon SageMaker Automatic Model Tuning

Fine-tune Mixtral 8x7b on AWS SageMaker and Deploy to RunPod

Train a Large Language Model on a single Amazon SageMaker GPU with Hugging Face and LoRA

Best practices for prompt engineering with Meta Llama 3 for Text-to-SQL use cases

Scalable training platform with Amazon SageMaker HyperPod for innovation: a video generation case study

Amazon SageMaker Automatic Model Tuning now automatically chooses tuning configurations to improve usability and cost efficiency

Learnings From Building the ML Platform at Mailchimp

Why is Git Not the Best for ML Model Version Control

Career in Python: Trending Job Roles

High-quality human feedback for your generative AI applications from Amazon SageMaker Ground Truth Plus

Stay Connected