Causation vs correlation
Bi-variate hypothesis testing
Regression Basics
Multiple Regression
Problems :(
100

The cause of some change

Independent Variable (X)

100

What you do when you find a p-value less than .05

Reject the null hypothesis

100

The variable that goes on the vertical axis of a scatterplot

Y variable (the DV)

100

The expected value of Y when all X's are 0

The intercept

100

Drawing conclusions about individuals based on collective data

Ecological fallacy

200

The effect or result

Dependent Variable (Y)

200

How you know that there is no statistically significant effect of X on Y

The p-value is larger then .05 (so you fail to reject the null hypothesis)

200

The difference between the observed outcome and the predicted outcome (y - y(hat))

Residual

200

What you do when there are other variables that influence your DV

Include control variables

200

Something that leads result to deviate systematically from the truth- like only collecting data that supports your hypothesis

Selection bias

300

A change in the IV makes a change in the DV more likely

Probabilistic causation 

300

A way to show your data so that you can compare variables when you have both  binary/ordinal/categorical IV and DV

Crosstab

300

The line of best fit on a scatterplot does this

Minimizes the RSS

300

Including a binary variable in your model does this

Shifts the intercept of the line of best fit
300

When the operationalization of your variable does not measure what you say you are measuring

Measurement error

400

A graph you could make with a continuous IV and a binary DV

Mosaic plot

400

A statistical test that compares the mean of one category of a binary variable to the other to determine how likely the difference between the sample means is due to chance assuming there is no relationship

2 sample t-test

400

The % of the variation that your model explains

the R-squared value

400

What you should do if the effect of one IV is dependent on another IV

Include an interaction term

400

The DV causes the IV

Endogeneity (reverse causality)

500

Statistical support for the idea that the IV and DV vary together

Evidence of correlation

500

A statistical test that determines if categorical variables are independent of each other

Chi-squared test

500

A one-unit change leads to a beta change in y

Interpretation of regression estimates

500

We observe a relationship between X and Y, but there really is no relationship between these variables and something else is causing both of them (the result of failing to include a confounding variable)

Spurious correlation

500

Observations for one unit influence the observations of another unit (ie unemployment rate in 2016 influences unemployment rate in 2017)

autocorrelation

M
e
n
u