2015, Clustering and Natural Language Processing

Evaluating Long-Context Question & Answer Systems

Eugene Yan

JUNE 21, 2025

Loong evaluates a model’s ability to locate, compare, cluster, and reason on evidence spread across multiple documents, typically ranging from 10,000 to over 250,000 tokens. Clustering : Aggregating and grouping relevant information from multiple sources based on specific criteria. © Eugene Yan 2015 - 2025 • Feedback • RSS

Clustering

Clustering Natural Language Processing AI AI

Announcing the ICDAR 2023 Competition on Hierarchical Text Detection and Recognition

Google Research AI blog

MARCH 7, 2023

books, magazines, newspapers, forms, street signs, restaurant menus) so that they can be indexed, searched, translated, and further processed by state-of-the-art natural language processing techniques. Middle: Illustration of line clustering. Right: Illustration paragraph clustering. HierText identifies 103.8

Clustering

Clustering Natural Language Processing Deep Learning Deep Learning

Top 6 Kubernetes use cases

IBM Journey to AI blog

NOVEMBER 13, 2023

Nodes run the pods and are usually grouped in a Kubernetes cluster, abstracting the underlying physical hardware resources. Kubernetes’s declarative, API -driven infrastructure has helped free up DevOps and other teams from manually driven processes so they can work more independently and efficiently to achieve their goals.

Machine Learning

Machine Learning Machine Learning ML ML

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

AWS Machine Learning Blog

OCTOBER 5, 2023

Our high-level training procedure is as follows: for our training environment, we use a multi-instance cluster managed by the SLURM system for distributed training and scheduling under the NeMo framework. From 2015–2018, he worked as a program director at the US NSF in charge of its big data program. Youngsuk Park is a Sr.

AWS

AWS Machine Learning Machine Learning Deep Learning

Robustness of a Markov Blanket Discovery Approach to Adversarial Attack in Image Segmentation: An…

Mlearning.ai

MARCH 9, 2023

Automated algorithms for image segmentation have been developed based on various techniques, including clustering, thresholding, and machine learning (Arbeláez et al., 2015; Huang et al., 2019) or by using input pre-processing techniques to remove adversarial perturbations (Xie et al., 2012; Otsu, 1979; Long et al.,

Deep Learning

Deep Learning Deep Learning Machine Learning Machine Learning

Financial text generation using a domain-adapted fine-tuned large language model in Amazon SageMaker JumpStart

AWS Machine Learning Blog

APRIL 18, 2023

Large language models (LLMs) with billions of parameters are currently at the forefront of natural language processing (NLP). These models are shaking up the field with their incredible abilities to generate text, analyze sentiment, translate languages, and much more.

ML

ML ML Deep Learning Deep Learning

Comparative Analysis: PyTorch vs TensorFlow vs Keras

Pickl AI

AUGUST 22, 2024

In industry, it powers applications in computer vision, natural language processing, and reinforcement learning. This allows users to change the network architecture on-the-fly, which is particularly useful for tasks that require variable input sizes, such as natural language processing and reinforcement learning.

Deep Learning

Deep Learning Deep Learning Machine Learning Machine Learning

Introducing spaCy

Explosion

FEBRUARY 18, 2015

spaCy is a new library for text processing in Python and Cython. I wrote it because I think small companies are terrible at natural language processing (NLP). The only problem is that the list really contains two clusters of words: one associated with the legal meaning of “pleaded”, and one for the more general sense.

Clustering

Clustering Natural Language Processing Machine Learning Machine Learning

Domain-adaptation Fine-tuning of Foundation Models in Amazon SageMaker JumpStart on Financial data

AWS Machine Learning Blog

APRIL 18, 2023

Large language models (LLMs) with billions of parameters are currently at the forefront of natural language processing (NLP). These models are shaking up the field with their incredible abilities to generate text, analyze sentiment, translate languages, and much more.

ML

ML ML Deep Learning Deep Learning

Meet the Winners of the Youth Mental Health Narratives Challenge

DrivenData Labs

FEBRUARY 3, 2025

His research focuses on applying natural language processing techniques to extract information from unstructured clinical and medical texts, especially in low-resource settings. I love participating in various competitions involving deep learning, especially tasks involving natural language processing or LLMs.

Machine Learning

Machine Learning Machine Learning Data Science Natural Language Processing

Data Science Current

Evaluating Long-Context Question & Answer Systems

Announcing the ICDAR 2023 Competition on Hierarchical Text Detection and Recognition

Trending Sources

Top 6 Kubernetes use cases

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

Robustness of a Markov Blanket Discovery Approach to Adversarial Attack in Image Segmentation: An…

Financial text generation using a domain-adapted fine-tuned large language model in Amazon SageMaker JumpStart

Comparative Analysis: PyTorch vs TensorFlow vs Keras

Introducing spaCy

Domain-adaptation Fine-tuning of Foundation Models in Amazon SageMaker JumpStart on Financial data

Meet the Winners of the Youth Mental Health Narratives Challenge

Stay Connected