Feature Engineering & Pre-Processing
Machine Learning
Deep Learning
Statistics
Potluck
100

This technique is the process of replacing missing data with substituted values

What is Imputation

100

Models with high BLANK pay a lot of attention to training data and do not generalize on data they haven't seen before

What is Variance 

100

This function calculates a “weighted sum” of its input, adds a bias and then decides whether a node should be “fired” or not

What is an activation function 

100

An error that occurs when the null hypothesis is true, but is rejected

What is a Type 1 error 

100

This is a parameter whose value is set before the learning process begins

What is a hyperparameter 

200

This is the process by which categorical variables are converted into a form that could be provided to ML algorithms

What is One-Hot Encoding

200

An algorithm that takes unlabeled points and gradually learn how to cluster them into groups by computing the mean of the distance between different points.

What is K-means 

200

This algorithm minimizes the cost function in order to minimize an error.

What is gradient descent 

200

This visual plots the distribution of a numeric variable’s values as a series of bars

What is a Histogram 

200

This occurs when a model learns the details and noise in the training data to the degree that it adversely impacts the execution of the model on new information

What is overfitting 

300

This is a scaling technique in which values are shifted and rescaled so that they end up ranging between 0 and 1

What is Normalization

300

A graphical representation of the contrast between the true positive rate and the false positive rate at various thresholds

What is an ROC curve 

300

This technique is used to improve the performance of a network. It revises the error and updates the weights to reduce this error

What is backpropagation

300

If P(A∩B) = P(A) · P(B), then events A and B are BLANK

What is Independent

300

This function is used when you need to come up with a range of continuous random variables 

What is a Probability Density Function(PDF)

400

This is a data transformation method in which it replaces each variable x with a log(x)

What is Log-transform

400

This type of model can efficiently perform a non-linear classification using what is called the kernel trick, implicitly mapping their inputs into high-dimensional feature spaces

What is Support Vector Machine (SVM)

400

This activation function gives an output of X if X is positive and zeroes otherwise. It is often used for hidden layers.

What is ReLU 

400

If the BLANK is more than then the BLANK, then we fail to reject the H0


What is the p-value and critical value 

400

The decrease in entropy after a dataset is split on an attribute.

What is information gain 

500

This technique groups continuous data into a smaller number of groups

What is Binning

500

A technique that is used to discourage the complexity of a model. It does this by penalizing the loss function. 

What is regularization 

500

This technique is used to reduce the spatial dimensions of a CNN

What is pooling 

500

A measure that indicates the extent to which two random variables change in cycle

What is Covariance 

500

This is an iterative technique which adjusts the weight of an observation based on the last classification

What is boosting