Large-scale data ingestion is crucial for applications such as document analysis, summarization, research, and knowledge management. These tasks often involve processing vast amounts of documents, which can be time-consuming and labor-intensive. The Process Data Lambda function redacts sensitive data using Amazon Comprehend.
The platform helped the agency digitize and process forms, pictures, and other documents. Using the platform, which uses Amazon Textract, AWS Fargate, and other services, the agency gained a four-fold productivity improvement by streamlining and automating labor-intensive manual processes.
The new SDK is designed with a tiered user experience in mind, where the new lower-level SDK (SageMaker Core) provides access to the full breadth of SageMaker features and configurations, allowing for greater flexibility and control for ML engineers. For the detailed list of pre-set values, refer to the SDK documentation.
Large language models (LLMs) have revolutionized the field of natural language processing, enabling machines to understand and generate human-like text with remarkable accuracy. However, despite their impressive language capabilities, LLMs are inherently limited by the data they were trained on.
Research papers and engineering documents often contain a wealth of information in the form of mathematical formulas, charts, and graphs. Navigating these unstructured documents to find relevant information can be a tedious and time-consuming task, especially when dealing with large volumes of data.
You can try out the models with SageMaker JumpStart, a machine learning (ML) hub that provides access to algorithms, models, and ML solutions so you can quickly get started with ML. To learn more, refer to the API documentation. Both models support a context window of 32,000 tokens, which is roughly 50 pages of text.
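The "roughly 50 pages" figure can be checked with a back-of-envelope calculation, assuming the common rules of thumb of about 0.75 words per token and about 500 words per printed page (both are rough estimates, not model specifications):

```python
# Back-of-envelope: how many pages fit in a 32,000-token context window?
# WORDS_PER_TOKEN and WORDS_PER_PAGE are rough, assumed constants.
TOKENS = 32_000
WORDS_PER_TOKEN = 0.75
WORDS_PER_PAGE = 500

words = TOKENS * WORDS_PER_TOKEN   # about 24,000 words
pages = words / WORDS_PER_PAGE     # about 48 pages, i.e. "roughly 50"
print(round(pages))
```

Actual capacity varies with the tokenizer and the text itself; dense prose and code tokenize quite differently.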
Both have the potential to transform the way organizations operate, enabling them to streamline processes, improve efficiency, and drive business outcomes. However, while RPA and ML share some similarities, they differ in functionality, purpose, and the level of human intervention required. What is machine learning (ML)?
With the ability to analyze a vast amount of data in real-time, identify patterns, and detect anomalies, AI/ML-powered tools are enhancing the operational efficiency of businesses in the IT sector. Why does AI/ML deserve to be the future of the modern world? Let’s understand the crucial role of AI/ML in the tech industry.
Most companies produce and consume unstructured data such as documents, emails, web pages, engagement center phone calls, and social media. However, with the help of AI and machine learning (ML), new software tools are now available to unearth the value of unstructured data.
In today’s information age, the vast volumes of data housed in countless documents present both a challenge and an opportunity for businesses. Traditional document processing methods often fall short in efficiency and accuracy, leaving room for innovation, cost-efficiency, and optimizations.
Moreover, interest in small language models (SLMs) that enable resource-constrained devices to perform complex functions, such as natural language processing and predictive automation, is growing. These documents are chunked by the application and are sent to the embedding model.
Hyper automation, which uses cutting-edge technologies like AI and ML, can help you automate even the most complex tasks. It’s also about using AI and ML to gain insights into your data and make better decisions. ML algorithms enable systems to identify patterns, make predictions, and take autonomous actions.
AWS customers in healthcare, financial services, the public sector, and other industries store billions of documents as images or PDFs in Amazon Simple Storage Service (Amazon S3). In this post, we focus on processing a large collection of documents into raw text files and storing them in Amazon S3.
Principal wanted to use existing internal FAQs, documentation, and unstructured data and build an intelligent chatbot that could provide quick access to the right information for different roles. For queries receiving negative feedback, less than 1% involved answers or documentation deemed irrelevant to the original question.
GPUs: The versatile powerhouses. Graphics Processing Units, or GPUs, have transcended their initial design purpose of rendering video game graphics to become key elements of Artificial Intelligence (AI) and Machine Learning (ML) efforts. However, it’s not time to discard your GPUs just yet.
Now, Amazon Translate offers real-time document translation to seamlessly integrate and accelerate content creation and localization. This feature eliminates the wait for documents to be translated in asynchronous batch mode.
This significant improvement showcases how the fine-tuning process can equip these powerful multimodal AI systems with specialized skills for excelling at understanding and answering natural language questions about complex, document-based visual information. For a detailed walkthrough on fine-tuning the Meta Llama 3.2
This is where ML CoPilot enters the scene. In this paper, the authors suggest the use of LLMs to make use of past ML experiences to suggest solutions for new ML tasks. Storing past ML insights to guide decision making: machine learning and deep learning models transform unstructured data into numerical vectors called embeddings.
Translating natural language into vectors reduces the richness of the information, potentially leading to less accurate answers. Also, end-user queries are not always aligned semantically to useful information in provided documents, leading to vector search excluding key data points needed to build an accurate answer.
With the introduction of EMR Serverless support for Apache Livy endpoints, SageMaker Studio users can now seamlessly integrate their Jupyter notebooks running sparkmagic kernels with the powerful data processing capabilities of EMR Serverless. This same interface is also used for provisioning EMR clusters.
Organizations can search for PII using methods such as keyword searches, pattern matching, data loss prevention tools, machine learning (ML), metadata analysis, data classification software, optical character recognition (OCR), document fingerprinting, and encryption.
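Pattern matching is the simplest of the techniques listed above. A minimal sketch with Python's `re` module follows; the patterns are illustrative only, and real PII detection needs far more robust rules (or an ML service such as Amazon Comprehend) to handle formatting variants and avoid false positives:

```python
import re

# Illustrative, assumed patterns -- not production-grade PII detection.
PII_PATTERNS = {
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "email": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.]+\b"),
    "phone": re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b"),
}

def find_pii(text):
    """Return a dict mapping PII type to the substrings that matched."""
    return {kind: pat.findall(text)
            for kind, pat in PII_PATTERNS.items()
            if pat.findall(text)}

hits = find_pii("Contact jane@example.com or 555-867-5309; SSN 123-45-6789.")
print(hits)
```

Keyword searches and regexes like these are cheap to run at scale but brittle, which is why they are usually combined with the ML-based and classification approaches the excerpt mentions.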
Such data often lacks the specialized knowledge contained in internal documents available in modern businesses, which is typically needed to get accurate answers in domains such as pharmaceutical research, financial investigation, and customer support. For example, imagine that you are planning next year’s strategy of an investment company.
Posted by Peter Mattson, Senior Staff Engineer, ML Performance, and Praveen Paritosh, Senior Research Scientist, Google Research, Brain Team. Machine learning (ML) offers tremendous potential, from diagnosing cancer to engineering safe self-driving cars to amplifying human productivity. Each step can introduce issues and biases.
Today, physicians spend about 49% of their workday documenting clinical visits, which impacts physician productivity and patient care. By using the solution, clinicians don’t need to spend additional hours documenting patient encounters. This blog post focuses on the Amazon Transcribe LMA solution for the healthcare domain.
The Retrieval-Augmented Generation (RAG) framework augments prompts with external data from multiple sources, such as document repositories, databases, or APIs, to make foundation models effective for domain-specific tasks. Amazon SageMaker enables enterprises to build, train, and deploy machine learning (ML) models.
The ability to effectively handle and process enormous amounts of documents has become essential for enterprises in the modern world. Due to the continuous influx of information that all enterprises deal with, manually classifying documents is no longer a viable option.
Extracts of AEP documentation, describing each Measure type covered, its input and output types, and how to use it. His career has focused on natural language processing, and he has experience applying machine learning solutions to various domains, from healthcare to social media.
Investment professionals face the mounting challenge of processing vast amounts of data to make timely, informed decisions. The traditional approach of manually sifting through countless research documents, industry reports, and financial statements is not only time-consuming but can also lead to missed opportunities and incomplete analysis.
It provides a common framework for assessing the performance of natural language processing (NLP)-based retrieval models, making it straightforward to compare different approaches. Amazon SageMaker is a comprehensive, fully managed machine learning (ML) platform that revolutionizes the entire ML workflow.
Natural Language Processing: Getting desirable data out of published reports and clinical trials and into systematic literature reviews (SLRs) — a process known as data extraction — is just one of a series of incredibly time-consuming, repetitive, and potentially error-prone steps involved in creating SLRs and meta-analyses.
See the primary sources “ REALM: Retrieval-Augmented Language Model Pre-Training ” by Kelvin Guu, et al. Here’s a simple rough sketch of RAG: start with a collection of documents about a domain, then split each document into chunks. One more embellishment is to use a graph neural network (GNN) trained on the documents.
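The rough sketch above can be rendered as toy code. Here a bag-of-words vector stands in for a real embedding model — a deliberate simplification; the chunk sizes, sample documents, and helper names are all assumptions for illustration:

```python
from collections import Counter
import math

def chunk(doc, size=8):
    """Split a document into fixed-size word chunks (real systems use
    smarter, often overlapping, token-based splitting)."""
    words = doc.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]

def embed(text):
    """Stand-in for an embedding model: a sparse bag-of-words vector."""
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# Index: chunk every document and store (chunk, vector) pairs.
docs = ["The billing API returns invoices as JSON documents",
        "Our deployment guide covers blue green rollouts in detail"]
index = [(c, embed(c)) for d in docs for c in chunk(d)]

# Query time: embed the question, retrieve the most similar chunk, and
# (in a full RAG system) stuff that chunk into the LLM prompt.
query = embed("how does the billing API return invoices")
best = max(index, key=lambda item: cosine(query, item[1]))
print(best[0])
```

A production pipeline swaps the bag-of-words stand-in for a trained embedding model and the in-memory list for a vector database, but the index-then-retrieve shape is the same.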
Hence, acting as a translator, it converts human language into a machine-readable form. Their impact on ML tasks has made them a cornerstone of AI advancements. These embeddings, when used particularly for natural language processing (NLP) tasks, are also referred to as LLM embeddings.
For a detailed breakdown of the features and implementation specifics, refer to the comprehensive documentation in the GitHub repository. You can follow the steps provided in the Deleting a stack on the AWS CloudFormation console documentation to delete the resources created for this solution.
In the recent past, using machine learning (ML) to make predictions, especially for data in the form of text and images, required extensive ML knowledge for creating and tuning deep learning models. Today, ML has become more accessible to any user who wants to use ML models to generate business value.
Raj specializes in Machine Learning with applications in Generative AI, Natural Language Processing, Intelligent Document Processing, and MLOps. With a strong background in AI/ML, Ishan specializes in building Generative AI solutions that drive business value.
A typical RAG solution for knowledge retrieval from documents uses an embeddings model to convert the data from the data sources to embeddings and stores these embeddings in a vector database. When a user asks a question, it searches the vector database and retrieves documents that are most similar to the user’s query.
This is because trades involve different counterparties and there is a high degree of variation among documents containing commercial terms (such as trade date, value date, and counterparties). Artificial intelligence and machine learning (AI/ML) technologies can help capital market organizations overcome these challenges.
Solution overview: You can use DeepSeek’s distilled models within the AWS managed machine learning (ML) infrastructure. Conclusion: Deploying DeepSeek models on SageMaker AI provides a robust solution for organizations seeking to use state-of-the-art language models in their applications. You can connect with Prasanna on LinkedIn.
The machine learning systems developed by Machine Learning Engineers are crucial components used across various big data jobs in the data processing pipeline. Additionally, Machine Learning Engineers are proficient in implementing AI or ML algorithms. Is ML engineering a stressful job?
An AI database is not merely a repository of information but a dynamic and specialized system meticulously crafted to cater to the intricate demands of AI and ML applications. Herein lies the crux of the AI database’s significance: it is tailored to meet the intricate requirements that underpin the success of AI and ML endeavors.
Machine learning (ML) engineers have traditionally focused on striking a balance between model training and deployment cost vs. performance. This is important because training ML models and then using the trained models to make predictions (inference) can be highly energy-intensive tasks.
Large language models (LLMs) have revolutionized the field of natural language processing with their ability to understand and generate human-like text. This blog post is co-written with Moran Beladev, Manos Stergiadis, and Ilya Gusev from Booking.com.
Embeddings play a key role in natural language processing (NLP) and machine learning (ML). Text embedding refers to the process of transforming text into numerical representations that reside in a high-dimensional vector space. Nitin Eusebius is a Sr.
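The text-to-vector transformation can be illustrated with a toy feature-hashing scheme. This is purely a sketch: real text embeddings come from trained models and capture semantics, whereas the function below only shows how arbitrary-length text maps to a fixed-length numerical vector:

```python
import hashlib

def hashed_embedding(text, dim=16):
    """Toy 'embedding': hash each word into one of `dim` buckets and count.
    Illustrates text -> fixed-length vector; carries no semantic meaning."""
    vec = [0.0] * dim
    for word in text.lower().split():
        bucket = int(hashlib.md5(word.encode()).hexdigest(), 16) % dim
        vec[bucket] += 1.0
    return vec

v = hashed_embedding("Embeddings map text into a vector space")
print(len(v))  # every input, regardless of length, yields a 16-dim vector
```

Model-produced embeddings differ in that nearby vectors correspond to semantically similar texts, which is what makes similarity search over them useful.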
Machine learning (ML) is an innovative tool that advances technology in every industry around the world. It entails deep learning with neural networks, natural language processing (NLP), and constant adaptation based on incoming information. Using ML can potentially reduce this number and prevent injuries, too.