2015, Natural Language Processing and Python

Zero-shot text classification with Amazon SageMaker JumpStart

AWS Machine Learning Blog

AUGUST 11, 2023

Natural language processing (NLP) is the field in machine learning (ML) concerned with giving computers the ability to understand text and spoken words in the same way as human beings can. For this solution, we use the 2015 New Year’s Resolutions dataset to classify resolutions.

Natural Language Processing

Natural Language Processing ML ML Machine Learning

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

AWS Machine Learning Blog

OCTOBER 5, 2023

For example, to use the RedPajama dataset, use the following command: wget [link] python nemo/scripts/nlp_language_modeling/preprocess_data_for_megatron.py His research interests are in the area of natural language processing, explainable deep learning on tabular data, and robust analysis of non-parametric space-time clustering.

AWS

AWS Machine Learning Machine Learning Deep Learning

Chatbot Development using SpaCy

Heartbeat

APRIL 23, 2023

One of the key components of chatbot development is natural language processing (NLP), which allows the bot to understand and respond to human language. SpaCy is a popular open-source NLP library developed in 2015 by Matthew Honnibal and Ines Montani, the founders of the software company Explosion.

Natural Language Processing

Natural Language Processing Machine Learning Machine Learning Deep Learning

Webinars

How to Achieve High-Accuracy Results When Using LLMs

MORE WEBINARS

Comparative Analysis: PyTorch vs TensorFlow vs Keras

Pickl AI

AUGUST 22, 2024

In industry, it powers applications in computer vision, natural language processing, and reinforcement learning. Discover its dynamic computational graphs, ease of debugging, strong community support, and seamless integration with popular Python libraries for enhanced development.

Deep Learning

Deep Learning Deep Learning Machine Learning Machine Learning

Top 10 Deep Learning Platforms in 2024

DagsHub

JULY 25, 2024

TensorFlow The Google Brain team created the open-source deep learning framework TensorFlow, which was made available in 2015. A good understanding of Python and machine learning concepts is recommended to fully leverage TensorFlow's capabilities. Before using Keras, ensure you have a basic understanding of Python and neural networks.

Deep Learning

Deep Learning Deep Learning Machine Learning Machine Learning

Dead Code Should Be Buried

Explosion

SEPTEMBER 3, 2015

Natural Language Processing moves fast, so maintaining a good library means constantly throwing things away. But most Natural Language Processing libraries do, and it’s terrible. Natural Language Processing (NLP) research moves very quickly. The new models supercede the old ones.

Natural Language Processing

Natural Language Processing Python Deep Learning Deep Learning

sense2vec reloaded: contextually-keyed word vectors

Explosion

NOVEMBER 21, 2019

In 2016 we trained a sense2vec model on the 2015 portion of the Reddit comments corpus, leading to a useful library and one of our most popular demos. Try the new interactive demo to explore similarities and compare them between 2015 and 2019 sense2vec (Trask et. Interestingly, “to ghost” wasn’t very common in 2015.

Natural Language Processing

Natural Language Processing Data Scientist Machine Learning Machine Learning

Robustness of a Markov Blanket Discovery Approach to Adversarial Attack in Image Segmentation: An…

Mlearning.ai

MARCH 9, 2023

2015; Huang et al., One approach involves incorporating adversarial training into the learning process, which involves generating adversarial examples during training and using them to augment the training set (Goodfellow et al., 2019) or by using input pre-processing techniques to remove adversarial perturbations (Xie et al.,

Deep Learning

Deep Learning Deep Learning Machine Learning Machine Learning

Evaluate the text summarization capabilities of LLMs for enhanced decision-making on AWS

AWS Machine Learning Blog

APRIL 25, 2024

Calculate a ROUGE-N score You can use the following steps to calculate a ROUGE-N score: Tokenize the generated summary and the reference summary into individual words or tokens using basic tokenization methods like splitting by whitespace or natural language processing (NLP) libraries.

AWS

AWS Algorithm Artificial Intelligence Artificial Intelligence

Multi-threading spaCy's parser and named entity recognizer

Explosion

MAY 10, 2016

The pay-off is the.pipe() method, which adds data-streaming capabilities to spaCy: import spacy nlp = spacy.load('de') for doc in nlp.pipe(texts, n_threads=16, batch_size=10000): analyse_text(doc) My favourite post on the Zen of Python iterators was written by Radim, the creator of Gensim. The Python unicode object is also very useful.

Python

Python Natural Language Processing Machine Learning Machine Learning

Extract non-PHI data from Amazon HealthLake, reduce complexity, and increase cost efficiency with Amazon Athena and Amazon SageMaker Canvas

AWS Machine Learning Blog

FEBRUARY 28, 2023

Use natural language processing (NLP) in Amazon HealthLake to extract non-sensitive data from unstructured blobs. When he’s not modernizing workloads for global enterprises, Yann plays piano, tinkers in React and Python, and regularly YouTubes about his cloud journey. Use SageMaker Canvas for analytics and predictions.

ML

ML ML AWS Machine Learning

MLOps and the evolution of data science

IBM Journey to AI blog

AUGUST 11, 2023

Origins of the MLOps process MLOps was born out of the realization that ML lifecycle management was slow and difficult to scale for business application. Using AutoML or AutoAI, opensource libraries such as scikit-learn and hyperopt, or hand coding in Python, ML engineers create and train the ML models.

Data Science

Data Science Machine Learning Machine Learning ML

Financial text generation using a domain-adapted fine-tuned large language model in Amazon SageMaker JumpStart

AWS Machine Learning Blog

APRIL 18, 2023

Large language models (LLMs) with billions of parameters are currently at the forefront of natural language processing (NLP). These models are shaking up the field with their incredible abilities to generate text, analyze sentiment, translate languages, and much more.

ML

ML ML Deep Learning Deep Learning

Explosion in 2019: Our Year in Review

Explosion

DECEMBER 28, 2019

Jul 18: After a brief rest following spaCy IRL, Ines took a minute to appear on the Python Bytes podcast with Michael Kennedy and Brian Okken]. Among other things, Ines discussed fast.ai ’s new course on Natural Language Processing and using Polyaxon for model training and experiment management. ?

Machine Learning

Machine Learning Machine Learning Python Natural Language Processing

Domain-adaptation Fine-tuning of Foundation Models in Amazon SageMaker JumpStart on Financial data

AWS Machine Learning Blog

APRIL 18, 2023

Large language models (LLMs) with billions of parameters are currently at the forefront of natural language processing (NLP). These models are shaking up the field with their incredible abilities to generate text, analyze sentiment, translate languages, and much more.

ML

ML ML Deep Learning Deep Learning

Introducing spaCy

Explosion

FEBRUARY 18, 2015

spaCy is a new library for text processing in Python and Cython. I wrote it because I think small companies are terrible at natural language processing (NLP). Labs and Emory University, to appear at ACL 2015. System Language Accuracy Speed spaCy v0.86 Higher is better. 13,963 ClearNLP Java 91.7

Clustering

Clustering Natural Language Processing Machine Learning Machine Learning

How ChatGPT really works and will it change the field of IT and AI??—?a deep dive

Chatbots Life

MAY 12, 2023

And as we could have already seen with the release of GPT-3 a few years ago casual language modelling can be used to perform various tasks and has proven to be universal. We can ask the model to generate a python function or a recipe for a cheesecake. Follow me on LinkedIn if you like my stories.

AI

AI AI ML ML

Evolving Trends in Data Science: Insights from ODSC Conference Sessions from 2015 to 2024

ODSC - Open Data Science

MARCH 10, 2025

Analyzing nearly a decades worth of conference sessions from 2015 to 2024 reveals fascinating shifts in focus areas, popular frameworks, and emerging trends that have shaped thefield. This blog dives deep into these changes of trends in data science, spotlighting how conference topics mirror the broader evolution of datascience.

Data Science

Data Science Deep Learning Deep Learning Machine Learning

Fine-tune Meta Llama 3.2 text generation models for generative AI inference using Amazon SageMaker JumpStart

AWS Machine Learning Blog

NOVEMBER 11, 2024

We then also cover how to fine-tune the model using SageMaker Python SDK. FMs through SageMaker JumpStart in the SageMaker Studio UI and the SageMaker Python SDK. Fine-tune using the SageMaker Python SDK You can also fine-tune Meta Llama 3.2 models using the SageMaker Python SDK. You can access the Meta Llama 3.2

AI

AI AI ML ML

Data Science Current

Zero-shot text classification with Amazon SageMaker JumpStart

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

Webinars

Trending Sources

Chatbot Development using SpaCy

Webinars

Comparative Analysis: PyTorch vs TensorFlow vs Keras

Top 10 Deep Learning Platforms in 2024

Dead Code Should Be Buried

sense2vec reloaded: contextually-keyed word vectors

Robustness of a Markov Blanket Discovery Approach to Adversarial Attack in Image Segmentation: An…

Evaluate the text summarization capabilities of LLMs for enhanced decision-making on AWS

Multi-threading spaCy's parser and named entity recognizer

Extract non-PHI data from Amazon HealthLake, reduce complexity, and increase cost efficiency with Amazon Athena and Amazon SageMaker Canvas

MLOps and the evolution of data science

Financial text generation using a domain-adapted fine-tuned large language model in Amazon SageMaker JumpStart

Explosion in 2019: Our Year in Review

Domain-adaptation Fine-tuning of Foundation Models in Amazon SageMaker JumpStart on Financial data

Introducing spaCy

How ChatGPT really works and will it change the field of IT and AI??—?a deep dive

Evolving Trends in Data Science: Insights from ODSC Conference Sessions from 2015 to 2024

Fine-tune Meta Llama 3.2 text generation models for generative AI inference using Amazon SageMaker JumpStart

Stay Connected