Remove 2011 Remove Algorithm Remove Supervised Learning
article thumbnail

Understanding Reinforcement Learning and Multi-Agent Systems: A Beginner’s Guide to MARL (Part 1)

Towards AI

Photo by Hyundai Motor Group on Unsplash When we learn from labeled data, we call it supervised learning. When we learn by grouping similar items, we call it clustering. When we learn by observing rewards or gains, we call it reinforcement learning.

article thumbnail

What Is a Transformer Model?

Hacker News

Transformers made self-supervised learning possible, and AI jumped to warp speed,” said NVIDIA founder and CEO Jensen Huang in his keynote address this week at GTC. Transformers are in many cases replacing convolutional and recurrent neural networks (CNNs and RNNs), the most popular types of deep learning models just five years ago.

article thumbnail

What Is ChatGPT Doing … and Why Does It Work?

Hacker News

And in fact the big breakthrough in “deep learning” that occurred around 2011 was associated with the discovery that in some sense it can be easier to do (at least approximate) minimization when there are lots of weights involved than when there are fairly few. There’s the raw corpus of examples of language.