The significance of the ŷ ("y hat")
What is the predicted value/s of y?
The problem when two or more independent variables are highly correlated to each other
What is multicollinearity?
The mapping of a set of outcomes to the number line
What is a random variable?
Transformer used to achieve a distribution with mean 0 and standard deviation 1
What is standard scaler?
Immutable, heterogeneous, ordered
What is a tuple?
Correlation ranges between these two values
R2 describes this
The set of all possible outcomes
What is the sample space?
Regularization method that can be used for feature selection
What is LASSO?
Immutable, homogeneous, ordered
What is a string?
How the the line of best fit is determined
What is the minimization of SSE (sum of squared errors)?
The parameters learned by fitting an OLS model
What are the beta coefficients?
The probability of rolling two six-sided die and receiving a sum of 12
What is 1/36?
The distribution used to model a single experiment with a binary outcome
What is the Bernoulli distribution?
Mutable, heterogeneous, unordered
What is a dictionary?
How to calculate R-squared
What is 1 - RSS/TSS?
Errors should have this type of variance
What is homoscedastic?
How to calculate the probability of some discrete event occurring
What is the probability mass function (PMF)?
The full name of LASSO
Mutable, heterogeneous, ordered
What is a list?
The five assumptions of multiple linear regression
What are:
L - linearity
I - independence of errors
N - normality of errors
E - equality of variance
M - no multicollinearity
The most commonly used correlation coefficient
What is Pearson's corr coeff, or Pearson's r?
In a basket with three Snickers and three Milky Ways, the probability of getting a Snickers and then a Milky Way
What is 3/6 * 3/5 = 9/30 = 3/10 = 30%?
The distribution we use to model number of events occurring in some interval
What is Poisson?
Method to retrieve a value from a dictionary, regardless if the key exists or not
What is .get()