A visual representation of generative AI (Source: Analytics Vidhya). Generative AI is a growing area in machine learning, involving algorithms that create new content on their own. This approach involves techniques where the machine learns from massive amounts of data.
The quality of your training data in machine learning (ML) can make or break your entire project. This article explores real-world cases where poor-quality data led to model failures, and what we can learn from these experiences. Machine learning algorithms rely heavily on the data they are trained on.
AI has made significant contributions to various aspects of our lives in the last five years. How do AI technologies learn from the data we provide? AI technologies learn from the data we provide through a structured process known as training. Another form of machine learning algorithm is known as unsupervised learning.
Then, around 2018, the hype over Big Data flattened out again; the euphoria turned into disillusionment, at least for German mid-sized companies. Hardly anyone still talks about Data Science at conferences today; as a hype topic, it has been completely replaced by Machine Learning and Artificial Intelligence (AI).
Photo by Robo Wunderkind on Unsplash. In general, a data scientist should have a basic understanding of the following concepts related to kernels in machine learning: 1. Support Vector Machine. A Support Vector Machine (SVM) is a supervised learning algorithm used for classification and regression analysis.
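To make the kernel idea concrete, here is a minimal sketch of a kernel SVM classifier using scikit-learn; the dataset, parameters, and accuracy check are illustrative assumptions, not part of the original article.

```python
# A minimal sketch of kernel-based SVM classification with scikit-learn.
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Synthetic data that is not linearly separable (hypothetical example).
X, y = make_moons(n_samples=200, noise=0.2, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# An RBF kernel lets the SVM separate classes a linear boundary cannot.
clf = SVC(kernel="rbf", C=1.0, gamma="scale")
clf.fit(X_train, y_train)
score = clf.score(X_test, y_test)
print(score)
```

Swapping `kernel="rbf"` for `"linear"` on the same data illustrates why kernels matter: the linear model typically scores noticeably worse on this kind of dataset.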
Year and work published: Generative Pre-trained Transformer (GPT). In 2018, OpenAI introduced GPT, which showed that, with pre-training, transfer learning, and proper fine-tuning, transformers can achieve state-of-the-art performance. But the question is, how did all these concepts come together?
Recently, I became interested in machine learning, so I enrolled in the Yandex School of Data Analysis and the Computer Science Center. Machine learning is my passion, and I often participate in competitions. The semi-supervised learning was repeated using the gemma2-9b model as the soft-labeling model.
Once you're past prototyping and want to deliver the best system you can, supervised learning will often give you better efficiency, accuracy, and reliability than in-context learning for non-generative tasks, that is, tasks where there is a specific right answer that you want the model to find. That's not a path to improvement.
After processing an audio signal, an ASR system can use a language model to rank the probabilities of phonetically equivalent phrases. Starting in 2018, a new paradigm began to emerge. Using such data to train a model is called "supervised learning." Pretraining, on the other hand, requires no such human-labeled data.
Previously, he was a senior scientist at Amazon Web Services developing AutoML and deep learning algorithms that now power ML applications at hundreds of companies. A recent report by CloudFactory found that human annotators have an error rate between 7% and 80% when labeling data (depending on task difficulty and how much annotators are paid).
It's the underlying engine that gives generative models the enhanced reasoning and deep learning capabilities that traditional machine learning models lack. They can also perform self-supervised learning to generalize and apply their knowledge to new tasks. Google created BERT, an open-source model, in 2018.
Foundation Models (FMs), such as GPT-3 and Stable Diffusion, mark the beginning of a new era in machine learning and artificial intelligence. Foundation models are large AI models trained on enormous quantities of unlabeled data, usually through self-supervised learning. What is self-supervised learning?
During this process, they learn language patterns but are typically not capable of following instructions or answering questions. In the case of GPT models, this self-supervised learning includes predicting the next word (unidirectionally) based on their training data, which is often webpages.
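The next-word objective described above can be illustrated with a toy count-based model; the tiny corpus and helper function below are hypothetical, and a real GPT model uses a neural network rather than counts, but the key point is the same: the "labels" are simply the next tokens already present in the raw text.

```python
# Toy illustration of the self-supervised next-word objective:
# the supervision signal is just the next token in the text itself.
from collections import Counter, defaultdict

corpus = "the cat sat on the mat the cat ran".split()

# Count, for each word, which words follow it in the training text.
following = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    following[prev][nxt] += 1

def predict_next(word):
    """Return the most frequent continuation seen during training."""
    return following[word].most_common(1)[0][0]

print(predict_next("the"))  # "cat" follows "the" twice, "mat" once
```

No human ever labeled this data; the pairs (previous word, next word) fall out of the raw text for free, which is exactly why pretraining scales to web-sized corpora.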
Training machine learning (ML) models to interpret this data, however, is bottlenecked by costly and time-consuming human annotation efforts. One way to overcome this challenge is through self-supervised learning (SSL). Andrew Ang is a Sr. Machine Learning Engineer at AWS.
The introduction of ChatGPT capabilities has generated a lot of interest in generative AI foundation models (these are pre-trained on unlabeled datasets and leverage self-supervised learning, in the case of large language models using a neural network). The ROE ranges also varied by country, from –5% to +13% [1].
While we wouldn't say bootstrapping is for everyone, it's been a joy to build the company that we want to build, the way we want to build it: we've worked with some amazing companies and shipped custom, cutting-edge machine learning solutions for a range of exciting problems. spaCy's machine learning library for NLP in Python.
The transformer architecture was the foundation for two of the most well-known and popular LLMs in use today: the Bidirectional Encoder Representations from Transformers (BERT) 4 (Devlin, 2018) and the Generative Pretrained Transformer (GPT) 5 (Radford, 2018).
As per a recent report by Nasscom and Zynga, the number of data science jobs in India is set to grow from 2,720 in 2018 to 16,500 by 2025. Top 5 Colleges to Learn Data Science (Online Platforms): 1. It offers an immersive learning experience that is a blend of conceptual and practical expertise, and offers a host of courses.
One example is the Pairwise Inner Product (PIP) loss, a metric designed to measure the dissimilarity between embeddings using their unitary invariance (Yin and Shen, 2018). Yin and Shen (2018) accompany their research with a code implementation on GitHub. Fortunately, there is: use an embedding loss.
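The PIP loss can be sketched in a few lines of NumPy: compare two embedding matrices through the Frobenius norm of the difference of their inner-product (Gram) matrices. The random matrices below are illustrative; this is a sketch of the core idea from Yin and Shen (2018), not their reference implementation.

```python
# Sketch of the Pairwise Inner Product (PIP) loss: the Gram matrix
# E @ E.T is unchanged by any orthogonal rotation of the embedding
# space, so the loss compares embeddings up to unitary invariance.
import numpy as np

def pip_loss(E1, E2):
    """||E1 E1^T - E2 E2^T||_F for embedding matrices of shape (n, d)."""
    return np.linalg.norm(E1 @ E1.T - E2 @ E2.T, ord="fro")

rng = np.random.default_rng(0)
E = rng.standard_normal((5, 3))          # 5 items, 3-dim embeddings

# A random orthogonal matrix: rotating the space leaves PIP loss ~0.
Q, _ = np.linalg.qr(rng.standard_normal((3, 3)))
print(pip_loss(E, E @ Q))
```

Because `(E @ Q) @ (E @ Q).T == E @ Q @ Q.T @ E.T == E @ E.T` for orthogonal `Q`, the loss on a rotated copy is zero up to floating-point error, which is exactly the unitary invariance the metric is built on.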
I will begin with a discussion of language, computer vision, multi-modal models, and generative machine learning models. Language Models: the progress on larger and more powerful language models has been one of the most exciting areas of machine learning (ML) research over the last decade.
Data scientists and researchers train LLMs on enormous amounts of unstructured data through self-supervised learning. The model then predicts the missing words (see "What is self-supervised learning?"). From 2018 to the modern day, NLP researchers have engaged in a steady march toward ever-larger models.
Technology and methodology: DeepMind's approach revolves around sophisticated machine learning methods that enable AI to interact with its environment and learn from experience. AlphaGo Zero: this iteration used reinforcement learning without human game data, allowing the program to consistently exceed its predecessors.