First and last authors of the GPT-2 paper
Alec Radford and Ilya Sutskever
This breakthrough AI architecture was first coded by its author in a single night after a spirited bar argument with classmates.
The GAN
The smallest of the Llama 3 herd
8 billion
Gemma vs Llama
Gemma: 3.5k
Llama: 59k
First author of the AlphaGo paper
David Silver
The name of the AI system that defeated Garry Kasparov in chess in 1997.
Deep Blue
GPT-2 Small
124 million
NumPy vs SciPy
NumPy: 30k
SciPy: 14k
First author of "Attention Is All You Need"
Ashish Vaswani
In this year, the famous Go player Lee Sedol was beaten by AlphaGo.
2016
The largest of the Gemma 3 models
27 billion
PyTorch vs TensorFlow
PyTorch: 91k
TensorFlow: 191k
First author (or authors) of AlexNet
Alex Krizhevsky
OpenAI was founded, in part, based on worries that this person would have too much power if AGI were achieved.
Demis Hassabis
BERT-base
110 million
huggingface/peft vs huggingface/trl
(parameter efficient fine-tuning vs transformer RL)
PEFT: 19k
TRL: 14.5k
First (and last) author of "Natural Adversarial Examples"
- the paper that introduced ImageNet-A, a set of unmodified real-world images that fool image classifiers
Dan Hendrycks and Dawn Song
This popular activation function was secretly named after the person who built the GPU cluster for AlexNet
ReLU
DeepSeek-V3
671 billion
MMLU vs MATH
MMLU: 1.4k stars
MATH: 1.1k stars