Python
Pandas/Numpy
Hypothesis Testing
Linear Regression
Potluck
100

A mutable sequence used to store an ordered collection of items

What is a list

100

A grid of values, all of the same type, indexed by a tuple of nonnegative integers

What is a NumPy array

100

These are the 3 measures of central tendency 

What are the mean, median and mode 

100

A variable that the researcher can control directly

What is an independent variable

100

Measures the strength and direction of the linear relationship between two variables

What is correlation 

200

This statement conditionally executes a block of code

What is an if statement 

200

This NumPy function creates an array of evenly spaced floating-point values over a specified interval

What is linspace() 
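
A minimal sketch of linspace() in use, assuming NumPy is installed. The interval and the number of points are made up for illustration.

```python
import numpy as np

# Five evenly spaced values from 0.0 to 1.0; both endpoints are included by default.
points = np.linspace(0.0, 1.0, num=5)

print(points.dtype)     # the result is a floating-point array
print(points.tolist())
```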

200

This measures how spread out a set of data is relative to the mean.

What is variance 
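
A small worked example using Python's standard `statistics` module. The data values are invented for illustration. Note the two variants: population variance divides by n, sample variance divides by n - 1.

```python
import statistics

data = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

# Average squared distance from the mean, dividing by n.
population_variance = statistics.pvariance(data)

# Same idea, but dividing by n - 1 (the usual choice for a sample).
sample_variance = statistics.variance(data)

print(population_variance, sample_variance)
```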

200

This is another word to represent the coefficients of a model 

What are parameters 

200

The differences between the observed values and the values predicted by the fitted line

What are residuals
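
A rough sketch of how residuals can be computed for a simple least-squares line, in plain Python. The x and y values are invented for illustration, and the closed-form slope and intercept formulas are the standard ones for one predictor.

```python
x = [1.0, 2.0, 3.0, 4.0]
y = [2.0, 4.0, 5.0, 8.0]

mean_x = sum(x) / len(x)
mean_y = sum(y) / len(y)

# Ordinary least-squares slope and intercept for a single predictor.
slope = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y)) / sum(
    (xi - mean_x) ** 2 for xi in x
)
intercept = mean_y - slope * mean_x

# Residual = observed value minus fitted value.
fitted = [intercept + slope * xi for xi in x]
residuals = [yi - fi for yi, fi in zip(y, fitted)]

print(residuals)
```

With an intercept in the model, least-squares residuals sum to zero (up to floating-point error).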

300

An anonymous function that can accept any number of arguments but can have only a single expression

What is a lambda function 
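
A short illustration; the names `add`, `total`, and `words` are made up for the example.

```python
# A lambda takes any number of arguments but contains exactly one expression.
add = lambda x, y: x + y
total = lambda *values: sum(values)

# Lambdas are often passed inline, for example as a sort key.
words = ["pandas", "numpy", "scipy", "io"]
by_length = sorted(words, key=lambda w: len(w))

print(add(2, 3), total(1, 2, 3, 4), by_length)
```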

300

This is a one-dimensional labeled array capable of holding data of any type (integer, string, float, Python objects, etc.)

What is a Pandas Series 

300

The probability of obtaining a sample outcome at least as extreme as the one observed, given that the null hypothesis is true.

What is a p-value 

300

A standardized measure of the goodness of fit for a regression model

What is R-Squared 

300

This pandas method is used to detect missing values for an array-like object.

What is isna() OR isnull() 
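
A minimal sketch, assuming pandas and NumPy are installed; the `scores` data is invented. In pandas, `isnull()` is an alias of `isna()`.

```python
import numpy as np
import pandas as pd

# Both np.nan and None are treated as missing values.
scores = pd.Series([90.0, np.nan, 75.0, None])

mask = scores.isna()            # boolean Series, True where a value is missing
missing_count = int(mask.sum()) # True counts as 1

print(mask.tolist(), missing_count)
```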

400

The conventional name in Python for the first parameter of an instance method, which refers to the instance the method is called on

What is self
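
A small sketch with a hypothetical `Counter` class. Python passes the instance as the first argument automatically; the name `self` is a strong convention rather than a reserved word.

```python
class Counter:
    def __init__(self, start=0):
        # self is the specific object being created, so each
        # instance gets its own count attribute.
        self.count = start

    def increment(self):
        self.count += 1
        return self.count


a = Counter()
b = Counter(10)
a.increment()

print(a.count, b.count)
```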

400

This Pandas method allows you to replace all null values with whatever value you pass into it 

What is the fillna() method 
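
A minimal sketch, assuming pandas and NumPy are installed; the data and the fill value of 0.0 are chosen only for illustration.

```python
import numpy as np
import pandas as pd

scores = pd.Series([90.0, np.nan, 75.0])

# fillna returns a new Series by default; the original is left unchanged.
filled = scores.fillna(0.0)

print(filled.tolist())
```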

400

This occurs when a researcher incorrectly rejects a true null hypothesis.

What is a Type I error

400

A situation where there is a high degree of correlation among the independent variables in a regression model

What is multicollinearity 

400

A function for continuous data where the value at any point can be interpreted as providing a relative likelihood that the value of the random variable would equal that sample.

What is a Probability Density Function 

500

This function takes a function as an argument and then applies that function to all the elements of an iterable

What is the map() function
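
A short illustration; `square` is a made-up helper. In Python 3, `map()` returns a lazy iterator, so it is wrapped in `list()` here to see the results.

```python
def square(n):
    return n * n


# map applies square to each element of the iterable in order.
squares = list(map(square, [1, 2, 3, 4]))

print(squares)
```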

500

This popular Pandas method splits the data into groups based on some criteria, applies a function to each group, and combines the results

What is groupby()
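
A minimal split-apply-combine sketch, assuming pandas is installed. The `sales` table and its column names are invented for illustration.

```python
import pandas as pd

sales = pd.DataFrame(
    {
        "region": ["east", "west", "east", "west"],
        "amount": [10, 20, 30, 40],
    }
)

# Split rows by region, sum the amounts within each group,
# and combine the per-group sums into one Series.
totals = sales.groupby("region")["amount"].sum()

print(totals.to_dict())
```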

500

This is known as the standard deviation of a sampling distribution

What is the standard error 
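
For the sample mean, the standard error is commonly estimated as the sample standard deviation divided by the square root of the sample size. A small sketch with the standard library; the sample values are invented.

```python
import math
import statistics

sample = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

# Sample standard deviation (divides by n - 1).
sample_sd = statistics.stdev(sample)

# Estimated standard error of the mean: s / sqrt(n).
standard_error = sample_sd / math.sqrt(len(sample))

print(standard_error)
```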

500

Pairwise products of the "original" independent variables.

What are interaction terms 

500

____ operators are used to combine conditional statements in Python 

What are logical operators (and, or, not)