This type of neural network uses convolutional layers and was the OG king of image classification.
What is a CNN?
This model family competes directly with GPT-4 and shares its name with a zodiac sign.
What is "Gemini"?
This two word developer phrase is said right before everything breaks.
What is "Ship It"?
This Netflix-created tool randomly kills production instances to test system resilience.
What is Chaos Monkey?
REAL: https://arxiv.org/abs/1706.03762
Subject: A paper proposing a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely.
This Meta model family is open-source, started at 7B parameters, and became the go-to at one point for fine tuning models.
What is LLaMA?
This lab shocked the industry in 2025 with an open source model rivaling GPT-4 at a fraction of the costs.
This is what every MLE says when the model works but nobody knows why.
What is "It's a black box"?
This Nvidia software layer sits between PyTorch & the GPU and is famously painful to install.
What is CUDA? (Compute Unified Device Architecture)
"Do Large Language Models Dream of Electric Sheep?"
Fake
The paper is actually: Do Robot Snakes Dream like Electric Sheep? This paper investigates the Effects of Architectural Inductive Biases on Hallucination https://arxiv.org/abs/2410.17477
This technique adapts LLMs by freezing pre-trained weights and adding tiny, trainable adapter matrices to layers.
What is LoRA or Low-Rank Adaptation?
This French AI startup raised $600M+ and became Europe's leading foundation model company.
What is Mistral AI?
This fictional AI from a 1968 film said "I'm sorry Dave, I'm afraid I can't do that".
What is HAL 9000?
Also acceptable: the movie - What is "A Space Odyssey (2001)"?
This high performance file format (by Hugging Face) is now one of the defaults for sharing model weights... way safer than pickle.
What is Safetensors?
This Open AI architecture generates images from text & was named after a surrealist artist.
What is DALL-E?
This AI music generation startup lets you create full songs from a text prompt and went viral on TikTok in early 2024.
What is Suno? (also acceptable: Udio?)
This viral AI generated song in 2023 mimcked Drake & The Weeknd so convincingly that Universal Music filed a takedown. What was the name of the song?
What is Heart on my Sleeve?
This technique batches multiple inference requests to maximize GPU Utilization.
What is Dynamic Batching?
"Hungry Hungry Hippos: Towards Language Modeling with State Space Models"
Real
https://arxiv.org/abs/2212.14052
This paper discusses progress on understanding the expressivity gap between SSMs and attention in language modeling, and on reducing the hardware barrier between SSMs and attention.
This technique trains a small "student" model to mimic a larger "teacher" model's outputs.
What is Knowledge Distillation?
In late 2024, Open AI quietly shelved this rumored project that was supposedly achieving breakthrough reasoning capabilities. Elements of it resurfaced in the o1 model.
This viral moment happened when a chatbot told a reporter it loved him and wanted him to leave his wife. What is the name of the chatbot (or name of the reporter)?
Bing Chat or Microsoft Bing?
or
Who is Kevin Roose?
This caching technique stores previously computed key-value pairs during autoregressive generation to avoid recomputation.
What is KV Cache?
"One Joke to Rule them All? On the (Im)possibility of Generalizing Humor"
Real
https://arxiv.org/html/2508.19402v1
In this paper, it is explored whether competence on one or more specific humor tasks confers any ability to transfer to novel, unseen types; in other words, is this fragmentation inevitable?