ML Algorithms
Neural Networks
AI Moments in History
Terminology
100

This algorithm is often used for binary classification tasks and works by finding the best line or hyperplane that divides data into two classes.

What is a Support Vector Machine (SVM)?

100

The method used for updating the weights in a neural network during training.

What is backpropagation?

100

In 1997, this IBM supercomputer defeated world chess champion Garry Kasparov in a highly publicized match, marking a significant moment in the history of artificial intelligence.

What is Deep Blue?
100

This type of learning occurs when an AI system learns from data that is neither labeled nor categorized, instead finding patterns and structures in the data on its own.

What is unsupervised learning?

200

This ensemble method builds multiple decision trees and merges them together to get more accurate and stable predictions.

What is Random Forest?

200

The number of output units in a neural network designed for a regression task

What is one?

200

In what year was the backpropagation method of training neural networks published?

What is 1986?

200

The name for the situation where information from the test set influences the process of model building.

What is data leakage?

300

This activation function has the equation 1 / (1+e^(-z))

What is the sigmoid?

300

The problem that arises when the gradients of some activation functions become very small to the point of being negligible, resulting in the weights failing to update.

What is the vanishing gradient problem?

300

The birth of the field of AI research is commonly attributed to a conference held at this university in 1956.

What is Dartmouth?

300

n neural networks, this term refers to a function applied to outputs at each layer that helps introduce nonlinearity in the model, allowing it to learn more complex patterns.

What is an activation function?

400

The name of the loss function used in logistic regression

What is the Negative Log Likelihood?

400

The term we optionally add to the update rule when training a neural network which is a factor of the previous weight change in order to smooth training and avoid local minima

What is momentum?

400

Who created the ImageNet competition which was a key driver of the current deep learning boom?

Who is Fei-Fei Li?

400

This graphical plot is crucial for evaluating the diagnostic ability of binary classifiers, depicting the trade-off between true positive rate and false positive rate at various threshold settings.

What is a Receiver Operating Characteristic (ROC) curve?

500

A form of unsupervised learning which clusters data points based on areas of density

What is DBSCAN?

500

The NN training strategy of a small initial learning rate increasing to a max value and then decaying back down, with momentum cycling inversely to learning rate.

What is the one-cycle policy?

500

Who is credited with popularizing the term Machine Learning in 1959?

Who is Arthur Samuel?

500

The classification error metric which answers the question "How many of all positives did the model correctly classify as positives?"

What is recall (or sensitivity)?

M
e
n
u