Eyes
ChatGPT
Grading
Flailing Robots
Pseudorandom
100

The name of a news network, this type of neural network is used for vision-related tasks

What is a CNN?

100

Cars to robots and back, this type of model is used for language

What are transformers?

100
You need a _ understanding to do well on a test; A _ neural network has many layers

What is deep?

100

This type of learning is used to train game-playing agents

What is Reinforcement Learning?

100

Controls how fast you pick up new knowledge

What is learning rate?

200
Billiard, layers that merge nearby things together through averaging or max

What is Pooling?

200

This is what GPT stands for

What is the Generative Pretrained Transformer?

200

Most common activation function for intermediate layers

What is ReLU?

200

Learning from demonstrations; parrots do this

What is Imitation Learning?

200

Character in The Bible, well-known optimizer.

What is Adam?

300

Unclear speech, this layer applies a filter to sliding windows of the input image

What is a convolutional layer?

300

Throw a tennis ball and a dog will bring it back. Search chatbots like Bing are also known as _ augmented LLMs

What is Retrieval?

300
Grading on this, track these while your model is training

What are loss curves?

300
These values are estimates of future reward given the current state and action

What are Q-values?

300

French for together; averaging results from multiple models to get better performance

What is ensemble?

400

Something left - this technique helps to reduce performance degradation from having too deep neural networks

What are residuals?

400

Base language models are trained with this objective. The cat jumped over the _?

What is next token prediction?

400

Models that train too long on something might run into this

What is overfitting?

400

Foreign _, this is the entity that takes actions given states

What is a policy?

400

This technique is used to fit language models into small amounts of memory.

What is quantization?

500

Vision Transformers split images into these things, wear over eyes

What are image patches?

500

This is how you fine-tune with human preference data

What is Reinforcement Learning from Human Feedback?

500

Predictions often follow this pattern when RELUs die

What is identical?

500

Rotten Tomatoes; these things offer feedback to actors to further their learning

What are critics?

500

Plants, watermelons, and VC funding. This is set to ensure reproducebility of experiments

What is a seed?

M
e
n
u