Supervised
Unsupervised
Data
Modern Stuff
AI Bridgers
100

This piece of syntax trains a scikit-learn model to the provided dataset.

What is .fit()?

100

This vitally important piece of a dataset is not present when conducting unsupervised learning.

What are labels/outputs?

100

This aspect of good data ensures that there is high certainty and minimal divergence for both classes of a logistic regression classifier.

What is balanced data?

100

This quieting influence is the goal of a diffusion image generation model.

What is to denoise?

100

These are the names of all of the AIBridger TAs who have helped you this week.

Jack, Sam R, Lily, Michelle (Prof. Liu)

200

This is the name of the library that we have been using for our models.

What is scikit-learn (sklearn)?

200

This modification of the dataset is another goal of the PCA algorithm, allowing computations to take place at a dramatically faster rate.

What is dimensionality reduction?

200

Given one consistent dataset, adjusting this “hyper” part of the model might create different feature importance values.

What is a hyperparameter?

200

A large language model generates new content based on this unit of text, which is also often used as a voucher to exchange for certain goods or services.

What is a token?

200

Who's the best dancer among the AI Bridgers?

Lily

300

The formula for this evaluation metric is TP/(TP+FP).

What is precision?

300

The goal of PCA is to maximise this quantity, which explains how different the data within a feature is.

What is variance?

300

This part of a linear regression model allows us to take a look the model and understand how each of our data features affects the model.

What is a coefficient?

300

The ChatGPT-4 model currently has approximately this number of parameters.

What is 1.7 trillion? (give or take 0.2T)

300

This food item has been the most consumed food item for all 4 of the AIBridgers this past week.

What is sugar?

400

This probability equation was used as a starting point for one of the 5 classification algorithms.

What is the Bayes equation?

400

This term from your linear algebra class years ago is a representation of the direction of one principal component.

What is an eigenvector?

400

This method of dataset balancing allows you to generate new synthetic data to add to the existing dataset.

What is data augmentation?

400

On January 28, 2024, this major company successfully implanted the first brain-computer interface into patient Noland Arbaugh.

Neuralink

400

This major ice cream location in Davis town got the stamp of approval from all four AIBridgers. For each flavour you can name that we ordered, you get extra bonus points.

Davis Creamery


Flavours: ube, peach mango sorbet, cherry chocolate chip, Chips Ahoy cookies and cream, coffee oreo

500

This is the name of the assumption that we made when using linear regression.

What is dataset linearity?

500

K-means clustering is this type of algorithm, known for its built-in randomness.

What is indeterministic?

500

This bloody war gave the receiver operation curve its name.

What is WWII?

500

This major Anthropic LLM got a major update yesterday. For bonus points, name LLM.

What is Claude 3.5 (Sonnet)?

500

Name any of the room numbers that the AIBridgers are staying in this week. (floor 2)

What is 224, 230, or 232?