The significance of ŷ ("y hat")
What is the predicted value (or values) of y?
The problem when two or more independent variables are highly correlated with each other
What is multicollinearity?
The mapping of a set of outcomes to the number line
What is a random variable?
A function stored within a class
What is a method?
Immutable, heterogeneous, ordered
What is a tuple?
Correlation ranges between these two values
What are -1 and 1?
R² describes this
What is the proportion of the variability in y explained by the predictor(s)?
The distribution used to model a single experiment with a binary outcome
What is the Bernoulli distribution?
An object stored within a class
What is an attribute?
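The method and attribute cards can be illustrated with a small sketch. The `Circle` class and its names are hypothetical, chosen only for this example.

```python
import math


class Circle:
    """Toy class for illustration only (hypothetical, not from the deck)."""

    def __init__(self, radius):
        # radius is an attribute: data stored on the instance
        self.radius = radius

    def area(self):
        # area is a method: a function defined inside the class
        return math.pi * self.radius ** 2


c = Circle(2)
print(c.radius)   # attribute access, no parentheses
print(c.area())   # method call, with parentheses
```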
Immutable, homogeneous, ordered
What is a string?
How the line of best fit is determined
What is the minimization of SSE (sum of squared errors)?
The parameters learned by fitting an OLS model
What are the beta coefficients?
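As a rough sketch of the two cards above, the closed-form solution for a single predictor finds the beta coefficients that minimize SSE. The function name `fit_simple_ols` and the toy data are assumptions made for illustration, and the one-predictor case is used only to keep the arithmetic visible.

```python
def fit_simple_ols(x, y):
    """Return (beta0, beta1) minimizing SSE for y ~ beta0 + beta1 * x."""
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    # Slope: covariance of x and y divided by variance of x
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    beta1 = sxy / sxx
    # Intercept: the fitted line passes through the point of means
    beta0 = y_mean - beta1 * x_mean
    return beta0, beta1


# Toy data lying exactly on y = 1 + 2x (made up for this example)
x = [1, 2, 3, 4, 5]
y = [3, 5, 7, 9, 11]

beta0, beta1 = fit_simple_ols(x, y)
y_hat = [beta0 + beta1 * xi for xi in x]            # predicted values
sse = sum((yi - yh) ** 2 for yi, yh in zip(y, y_hat))
print(beta0, beta1, sse)
```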
The probability of rolling two six-sided dice and receiving a sum of 12
What is 1/36?
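One way to check this answer is to enumerate all 36 equally likely outcomes. This is a minimal sketch assuming two fair, independent dice.

```python
from fractions import Fraction
from itertools import product

# Every ordered pair (first die, second die): 6 * 6 = 36 outcomes
outcomes = list(product(range(1, 7), repeat=2))

# Only (6, 6) sums to 12
favorable = [o for o in outcomes if sum(o) == 12]

p = Fraction(len(favorable), len(outcomes))
print(p)  # 1/36
```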
The pandas property to retrieve an entry by integer position
What is .iloc[]?
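A short sketch of how `.iloc[]` differs from its label-based sibling `.loc[]`, assuming pandas is installed. The Series contents are made up for illustration.

```python
import pandas as pd

# A Series whose index labels are strings, not positions
s = pd.Series([10, 20, 30], index=["a", "b", "c"])

first_by_position = s.iloc[0]   # integer position 0
by_label = s.loc["b"]           # index label "b"

print(first_by_position, by_label)  # 10 20
```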
Mutable, heterogeneous, keyed (insertion-ordered since Python 3.7, but not indexed by position)
What is a dictionary?
How to calculate R-squared
What is 1 - RSS/TSS?
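The formula can be sketched in plain Python. Here RSS is the residual sum of squares (the same quantity as SSE) and TSS is the total sum of squares around the mean of y. The function name and the toy numbers are assumptions for illustration.

```python
def r_squared(y, y_hat):
    """Compute R-squared as 1 - RSS/TSS."""
    y_mean = sum(y) / len(y)
    rss = sum((yi - yh) ** 2 for yi, yh in zip(y, y_hat))  # unexplained variation
    tss = sum((yi - y_mean) ** 2 for yi in y)              # total variation
    return 1 - rss / tss


# Made-up observations and predictions
y = [1.0, 2.0, 3.0, 4.0]
y_hat = [1.5, 1.5, 3.5, 3.5]

r2 = r_squared(y, y_hat)
print(r2)  # RSS = 1.0, TSS = 5.0, so R-squared is about 0.8
```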
Errors should have this type of variance
What is homoscedastic (constant) variance?
How to calculate the probability of some discrete event occurring
What is the probability mass function (PMF)?
A way to filter a pandas DataFrame
What is boolean masking?
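A minimal sketch of boolean masking, assuming pandas is installed. The column names and values are made up for illustration.

```python
import pandas as pd

df = pd.DataFrame({"name": ["a", "b", "c"], "score": [55, 80, 92]})

# The comparison yields a boolean Series, one True/False per row
mask = df["score"] > 60

# Indexing with the mask keeps only the rows where it is True
passed = df[mask]
print(passed)
```

Masks can be combined with `&` and `|`, with each condition wrapped in parentheses.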
Mutable, heterogeneous, ordered
What is a list?
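The tuple, string, dictionary, and list cards can be compared side by side. This is a small sketch with made-up values.

```python
t = (1, "two", 3.0)        # tuple: mixed types allowed, cannot be changed
s = "abc"                  # string: characters only, cannot be changed
lst = [1, "two", 3.0]      # list: mixed types allowed, can be changed
d = {"a": 1, 2: "b"}       # dictionary: values looked up by key

# Mutable containers accept in-place changes
lst[0] = 99
d["c"] = [1, 2]

# Immutable containers raise TypeError on item assignment
for immutable in (t, s):
    try:
        immutable[0] = "x"
    except TypeError as err:
        print(type(immutable).__name__, "is immutable:", err)
```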
The five assumptions of multiple linear regression
What are:
L - linearity
I - independence of errors
N - normality of errors
E - equality of variance
M - no multicollinearity
The most commonly used correlation coefficient
What is Pearson's correlation coefficient, or Pearson's r?
In a basket with three kittens and three puppies, the probability of drawing a kitten and then a puppy, without replacement
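Pearson's r can be sketched in plain Python as the covariance of x and y divided by the product of their standard deviations. The function name and the toy data are assumptions for illustration.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    # Sums of cross-products and squared deviations (the 1/n factors cancel)
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    syy = sum((yi - y_mean) ** 2 for yi in y)
    return sxy / (math.sqrt(sxx) * math.sqrt(syy))


r_pos = pearson_r([1, 2, 3, 4], [2, 4, 6, 8])   # perfectly increasing: close to 1
r_neg = pearson_r([1, 2, 3, 4], [8, 6, 4, 2])   # perfectly decreasing: close to -1
print(r_pos, r_neg)
```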
What is 3/6 * 3/5 = 9/30 = 3/10 = 30%?
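The arithmetic can be checked with exact fractions. This sketch assumes the first animal is not put back, which is what the 3/5 factor implies.

```python
from fractions import Fraction

kittens, puppies = 3, 3
total = kittens + puppies

# First draw: 3 kittens among 6 animals
p_kitten_first = Fraction(kittens, total)

# Second draw: 3 puppies among the 5 animals that remain
p_puppy_second = Fraction(puppies, total - 1)

p = p_kitten_first * p_puppy_second
print(p)  # 3/10
```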
The distribution we use to model the number of events occurring in a fixed interval
What is the Poisson distribution?
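Tying this card to the PMF card, the Poisson probability mass function can be sketched with the standard library. The rate `lam = 2.0` is an arbitrary value chosen for illustration.

```python
import math


def poisson_pmf(k, lam):
    """P(X = k) for a Poisson random variable with rate lam."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


lam = 2.0  # assumed average number of events per interval

p0 = poisson_pmf(0, lam)   # probability of zero events, equal to exp(-lam)
total = sum(poisson_pmf(k, lam) for k in range(50))  # PMF sums to about 1

print(p0, total)
```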
Method to retrieve a value from a dictionary, whether or not the key exists
What is .get()?
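A short sketch of `.get()` compared with square-bracket lookup. The dictionary contents are made up for illustration.

```python
ages = {"ana": 31}

found = ages.get("ana")            # key exists: returns its value
missing = ages.get("bob")          # key absent: returns None instead of raising
with_default = ages.get("bob", 0)  # key absent: returns the supplied default

# Square-bracket lookup raises KeyError for an absent key
try:
    ages["bob"]
    raised = False
except KeyError:
    raised = True

print(found, missing, with_default, raised)  # 31 None 0 True
```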