Data
Algorithms
Experiments
Infrastructure
100

A flat, simplified version of an excel spreadsheet

What is a .csv?

100

You can find this on a fashion runway, or predicting weather patterns using inputs and outputs. 

What is a model?

100

A numerical measure used to assess model performance.

What is a metric?
100

In computer lingo. This is the time it takes to get a response.

What is latency?

200

Its not the rapper, but you can save lyrics in this data structure that uses key:value pairs.

What is .json?

200

Making computers complete tasks that normally required humans.

What is artificial intelligence?

200

Boo! It's not a ghost. It's just a model giving confident but incorrect outputs. 

What is a hallucination?

200

The ability of models to perform well on unseen data.

What is generalization?

300

Individual measurable inputs a model uses to make classifications or predictions.

What are features?

300

Computers learn and improve from data.

What is machine learning?

300

This test is named after this famous English computer scientists

What is the Turing Test?

300

Logging performance, feedback, drift, and errors over time.

What is monitoring?

400

In supervised learning, this is the correct answers or expected output the model tries to learn.

What are labels?

400

A failure mode where a model memorizes training data but performs poorly on new data.

What is overfitting?

400

The inductive knowledge process of observing, questioning, researching, hypothesizing, testing, analyzing, and communicating.

What is the scientific method?

400

Specialized hardware critical for modern machine learning due to parallel computation.

What is a GPU?

500

You don't use these at an arcade. These are small chunks of text that large language models actually process instead of full words.

What are tokens?

500

Numeric representations that convert text or objects into vectors capturing semantic meaning.

What are embeddings?

500

A measure of how wrong a model’s predictions are during learning.

What is a loss function?

500

This is defined by the 4 Vs: Volume, Variety, Velocity, and Veracity. 

What is big data?