2016, AWS and Python - Data Science Current

Enterprise-grade natural language to SQL generation using LLMs: Balancing accuracy, latency, and scale

Flipboard

APRIL 24, 2025

This post describes a pattern that AWS and Cisco teams have developed and deployed that is viable at scale and addresses a broad set of challenging enterprise use cases. AWS solution architecture In this section, we illustrate how you might implement the architecture on AWS.

SQL

SQL Database AWS ML

Llama 4 family of models from Meta are now available in SageMaker JumpStart

AWS Machine Learning Blog

APRIL 7, 2025

Virginia) AWS Region. Prerequisites To try the Llama 4 models in SageMaker JumpStart, you need the following prerequisites: An AWS account that will contain all your AWS resources. An AWS Identity and Access Management (IAM) role to access SageMaker AI. Access to accelerated instances (GPUs) for hosting the LLMs.

AWS

AWS Machine Learning Machine Learning AI

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

AWS Machine Learning Blog

JANUARY 17, 2024

Today, we’re excited to announce the availability of Llama 2 inference and fine-tuning support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. In this post, we demonstrate how to deploy and fine-tune Llama 2 on Trainium and AWS Inferentia instances in SageMaker JumpStart.

AWS

AWS Python Machine Learning Machine Learning

Webinars

Going Beyond Chatbots: Connecting AI to Your Tools, Systems, & Data

Automation, Evolved: Your New Playbook for Smarter Knowledge Work

Smart Tech + Human Expertise = How to Modernize Manufacturing Without Losing Control

MORE WEBINARS

Reinventing a cloud-native federated learning architecture on AWS

AWS Machine Learning Blog

OCTOBER 10, 2023

Customers often need to train a model with data from different regions, organizations, or AWS accounts. Existing partner open-source FL solutions on AWS include FedML and NVIDIA FLARE. These open-source packages are deployed in the cloud by running in virtual machines, without using the cloud-native services available on AWS.

AWS

AWS ML ML Algorithm

Announcing new Jupyter contributions by AWS to democratize generative AI and scale ML workloads

AWS Machine Learning Blog

MAY 10, 2023

Given the importance of Jupyter to data scientists and ML developers, AWS is an active sponsor and contributor to Project Jupyter. In parallel to these open-source contributions, we have AWS product teams who are working to integrate Jupyter with products such as Amazon SageMaker.

ML

ML ML AWS AI

Improving air quality with generative AI

AWS Machine Learning Blog

JUNE 18, 2024

On December 6 th -8 th 2023, the non-profit organization, Tech to the Rescue , in collaboration with AWS, organized the world’s largest Air Quality Hackathon – aimed at tackling one of the world’s most pressing health and environmental challenges, air pollution. This is done to optimize performance and minimize cost of LLM invocation.

AWS

AWS AI AI Python

LLM continuous self-instruct fine-tuning framework powered by a compound AI system on Amazon SageMaker

AWS Machine Learning Blog

FEBRUARY 21, 2025

We use DSPy (Declarative Self-improving Python) to demonstrate the workflow of Retrieval Augmented Generation (RAG) optimization, LLM fine-tuning and evaluation, and human preference alignment for performance improvement. Examples are similar to Python dictionaries but with added utilities such as the dspy.Prediction as a return value.

AI

AI AI AWS Data Scientist

Geospatial generative AI with Amazon Bedrock and Amazon Location Service

AWS Machine Learning Blog

NOVEMBER 22, 2023

The task involved writing Python code to read data, transform it, and then visualize it in an interesting map. You’ll need access to an AWS account with an access key or AWS Identity and Access Management (IAM) role with permissions to Amazon Bedrock and Amazon Location. The following are a few of the prompts we included.

Python

Python AWS Data Scientist AI

A review of purpose-built accelerators for financial services

AWS Machine Learning Blog

SEPTEMBER 11, 2024

In terms of resulting speedups, the approximate order is programming hardware, then programming against PBA APIs, then programming in an unmanaged language such as C++, then a managed language such as Python. Examples of other PBAs now available include AWS Inferentia and AWS Trainium , Google TPU, and Graphcore IPU.

AWS

AWS ML ML Clustering

TensorFlow vs. PyTorch: What’s Better for a Deep Learning Project?

Towards AI

AUGUST 8, 2024

It can also be used in a variety of languages, such as Python, C++, JavaScript, and Java. Other Cloud Providers: TensorFlow works well with other cloud platforms such as AWS and Azure, supporting scalable deployment and training in cloud environments. The basic data structure for TensorFlow are tensors.

Deep Learning

Deep Learning Deep Learning Machine Learning Machine Learning

Extract non-PHI data from Amazon HealthLake, reduce complexity, and increase cost efficiency with Amazon Athena and Amazon SageMaker Canvas

AWS Machine Learning Blog

FEBRUARY 28, 2023

AWS provides the most complete set of services for the entire end-to-end data journey for all workloads, all types of data, and all desired business outcomes. The high-level steps involved in the solution are as follows: Use AWS Step Functions to orchestrate the health data anonymization pipeline.

ML

ML ML AWS Machine Learning

Amazon Textract’s new Layout feature introduces efficiencies in general purpose and generative AI document processing tasks

AWS Machine Learning Blog

NOVEMBER 21, 2023

python -m pip install -q amazon-textract-prettyprinter You have the option to format the text in markdown format, exclude text from within figures in the document, and exclude page header, footer, and page number extractions from the linearized output. She is focused on building machine learning–based services for AWS customers.

AI

AI AI Database AWS

Top 10 Deep Learning Platforms in 2024

DagsHub

JULY 25, 2024

A good understanding of Python and machine learning concepts is recommended to fully leverage TensorFlow's capabilities. Further Reading TensorFlow Documentation TensorFlow Tutorials PyTorch PyTorch, developed by Facebook's AI Research Lab (FAIR) , was released in 2016. It is well-suited for both research and production environments.

Deep Learning

Deep Learning Deep Learning Machine Learning Machine Learning

Financial text generation using a domain-adapted fine-tuned large language model in Amazon SageMaker JumpStart

AWS Machine Learning Blog

APRIL 18, 2023

Solution overview In the following sections, we provide a step-by-step demonstration for fine-tuning an LLM for text generation tasks via both the JumpStart Studio UI and Python SDK. The Companys net income attributable to the Company for the year ended December 31, 2016 was $4,816,000, or $0.28

ML

ML ML Deep Learning Deep Learning

Data Analysis at Warp Speed: Explore the World of Polars

Mlearning.ai

JULY 9, 2023

Abstract Polars is a fast-growing open-source data frame library that is rapidly becoming the preferred choice for data scientists and data engineers in Python. It is available in multiple languages: Python, Rust, and NodeJS. If you are looking for a fast and intuitive data frame library for Python, then Polars is a great option.

Data Analysis

Data Analysis Data Analysis Python Data Scientist

7 Best Machine Learning Workflow and Pipeline Orchestration Tools 2024

DagsHub

APRIL 7, 2024

The project was created in 2014 by Airbnb and has been developed by the Apache Software Foundation since 2016. Thanks to its various operators, it is integrated with Python, Spark, Bash, SQL, and more. Programming language: It offers a simple way to transform Python code into an interactive workflow application.

Machine Learning

Machine Learning Machine Learning ML ML

Comparative Analysis: PyTorch vs TensorFlow vs Keras

Pickl AI

AUGUST 22, 2024

First released in 2016, it quickly gained traction due to its intuitive design and robust capabilities. Discover its dynamic computational graphs, ease of debugging, strong community support, and seamless integration with popular Python libraries for enhanced development.

Deep Learning

Deep Learning Deep Learning Machine Learning Machine Learning

Domain-adaptation Fine-tuning of Foundation Models in Amazon SageMaker JumpStart on Financial data

AWS Machine Learning Blog

APRIL 18, 2023

Solution overview In the following sections, we provide a step-by-step demonstration for fine-tuning an LLM for text generation tasks via both the JumpStart Studio UI and Python SDK. The Companys net income attributable to the Company for the year ended December 31, 2016 was $4,816,000, or $0.28

ML

ML ML Deep Learning Deep Learning

spaCy meets Transformers: Fine-tune BERT, XLNet and GPT-2

Explosion

AUGUST 1, 2019

Based on the (fairly vague) marketing copy, AWS might be doing something similar in SageMaker. 2019) have shown that a transformer models trained on only 1% of the IMDB sentiment analysis data (just a few dozen examples) can exceed the pre-2016 state-of-the-art. Modern transfer learning techniques are bearing this out.

Natural Language Processing

Natural Language Processing Machine Learning Machine Learning AWS

Fine-tune Meta Llama 3.2 text generation models for generative AI inference using Amazon SageMaker JumpStart

AWS Machine Learning Blog

NOVEMBER 11, 2024

Prerequisites To try out this solution using SageMaker JumpStart, you’ll need the following prerequisites: An AWS account that will contain all of your AWS resources. An AWS Identity and Access Management (IAM) role to access SageMaker. We then also cover how to fine-tune the model using SageMaker Python SDK.

AI

AI AI ML ML

Data Science Current

Enterprise-grade natural language to SQL generation using LLMs: Balancing accuracy, latency, and scale

Llama 4 family of models from Meta are now available in SageMaker JumpStart

Webinars

Trending Sources

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

Webinars

Reinventing a cloud-native federated learning architecture on AWS

Announcing new Jupyter contributions by AWS to democratize generative AI and scale ML workloads

Improving air quality with generative AI

LLM continuous self-instruct fine-tuning framework powered by a compound AI system on Amazon SageMaker

Geospatial generative AI with Amazon Bedrock and Amazon Location Service

A review of purpose-built accelerators for financial services

TensorFlow vs. PyTorch: What’s Better for a Deep Learning Project?

Extract non-PHI data from Amazon HealthLake, reduce complexity, and increase cost efficiency with Amazon Athena and Amazon SageMaker Canvas

Amazon Textract’s new Layout feature introduces efficiencies in general purpose and generative AI document processing tasks

Top 10 Deep Learning Platforms in 2024

Financial text generation using a domain-adapted fine-tuned large language model in Amazon SageMaker JumpStart

Data Analysis at Warp Speed: Explore the World of Polars

7 Best Machine Learning Workflow and Pipeline Orchestration Tools 2024

Comparative Analysis: PyTorch vs TensorFlow vs Keras

Domain-adaptation Fine-tuning of Foundation Models in Amazon SageMaker JumpStart on Financial data

spaCy meets Transformers: Fine-tune BERT, XLNet and GPT-2

Fine-tune Meta Llama 3.2 text generation models for generative AI inference using Amazon SageMaker JumpStart

Stay Connected