Descriptive Statistics
Probability and Random Variables
Inference Methods
Correlation and LSRL's
Sampling and Experimenting
100

If the mean is greater than the median in a data set, the distribution most likely has this shape.

What is skewed to the right?

100

This is what researchers use to mimic a probability scenario that is often too difficult to actually calculate.

What is a simulation?

100

Our three types of test statistics are z, t, and...?

What is Chi-Squared?

100

The letter r tells us these two things about the correlation.

Strength and Direction
(Remember it does not confirm it's linear)

100

This group is not mandatory in an experiment, but is often used to help account for the confounding caused by lurking variables.

What is a control group?

200

Each of the names of all five values in a five number summary.

What is Min, Q1, Median, Q2, and Max?

200

If I want to find the probability that my first successful outcome will occur exactly on the 7th trial, this type of distribution would be best to use.

What is a Geomatric PDF?

200

Instead of the Normal Condition, Chi-Squared distributions use this condition.

What is the Large Sample Size Condition?

200
The formula for a residual (include the variables).

What is observed y - predicted y?
(Or y - "y hat")

200

This is the what we call data when it is extremely unliekly to happe by chance variation alone?

What is statisically significant?

300

Write out the exact calculator output for finding the barrier for the top 10% of heights for a population of kids at a daycare that have an average of 55 inches with a standard deviation of 2 inches.

What is InvNorm(.9, 55, 2)?

300

Binomial and Geometric Random Variables are both different types of this random variable, meaning they only have finite outcomes with definite probabilities.

What is Discrete Random Variables?

300

A sample of random teenagers were polled, and then seperately a random sample of adults were polled. Each age group was asked what their favorite movie genre was. Which inference method would be best to use to compare the two distributions of movie preferences?

What is a chi-squared distribution for homogeneity?

300

A LSRL is comparing the lengths of wolves (in meters) x, to the weight of the wolves (in kg) y. The LSRL is determine to have to a slope of 35.02. Interpret this value in context.

What is, for every one meter the wolf is in length, we predict the weight of the wolf to increase by 35.02 kgs?

300

This need to be present in a study to be able to tie inferences from the sample back to the population.

What is random selection/sampling?

400

This type of graph is best used when comparing two quantitative data sets of two different sizes.

What is a relative frequency histogram?

400

P(A) = 0.8
P(B) = 0.9

Find P(Ac & Bc) assuming A and B are independent events.

What is 2% or 0.02?

400

The pooled proportion should be only used in this inference scenario.

What is a two sample t-test for a difference in population proportions?

400

In a fathom output for a scatterplot, the bottom left corner will often have the variable "s" standing for this measurement.

What is the standard deviation of the residuals?

400

This sample is gathered by seperating all the individuals into groups based on simalarities across a shared variable. Then an SRS from each group is taken in order to form this type of sample.

What is a stratified random sample?

500

Given that Q1 = 80 and Q3 = 120, this would be the biggest number ending in a 5 that is considered an outlier.

What is 15?

500

Your friend asks you to guess the days of the week that each of their six family members were born on.

This is the probability that you get more than half of the guesses correct.

What is Binomcdf(6,1/7,4,6)

500

This seperate inference method can be used to help estimate what values of our null hypothesis would be rejected in a two sided test for a signfigance level at 1%.

What is a 99% confidence interval?

500

This variable tells us what proportion of the variability in our y-variable is accounted for by the least-squares regression line.

What is r2? 

500

This type of sampling error occurs when certain groups are not represented in a study often leading to biased results.

What is undercoverage?