Exploring Data
Sampling & Experiments
Probability & Distributions
Inference & Significance
Interpretations in Context.
100

This is the physical middle number in a data set when the numbers are ordered from least to greatest.

What is the median

100

This is the entire group of individuals that a researcher wants to gather information about.

What is the population

100

If the probability of an event happening is 0.3, this is the probability that the event does not happen.

What is 0.7?


100

You commit this type of error if you reject a true null hypothesis.

What is a Type I error?


100

Interpret a z score of -1.5 for a student's exam score

What is The students exam score is 1.5 standard deviations below the mean score of the whole class


200

This type of display uses a five-number summary to show the spread of data and easily spot outliers.

What is a boxplot (box whisker plot)?


200

This bad sampling method involves only asking people who are easy to reach, like people right next to you.

What is convenience sampling?


200

This is the term for two events if knowing that one happens does not change the probability that the other happens (like flipping a coin 2 times).

What are independent events?


200

This probability value is compared against alpha (α) to reject or fail to reject the null hypothesis.

What is a p value?

200

A newborn baby’s weight is in the 82nd percentile. Interpret this value.

What is 82% of babies weigh the same or less than this specific baby

300

If a distribution has a long tail pointing toward the right side of the graph, it is described as having this shape.

What is skewed right?


300

Unlike an observational study, this type of study actively imposes a treatment on subjects to observe a cause and effect type response.

What is an experiment

300

This famous bell shaped, symmetric distribution is defined by its mean and standard deviation.

What is a normal distribution?

300

You use this type of t-test when you compare the means of two distinct groups where the population standard deviations are unknown.

What is a two sample T test?


300

Assuming the true population proportion of successes is p0, the probability of obtaining a sample proportion p as extreme or more extreme than the one observed in our sample, purely by chance, is 0.034." This sentence correctly shows the formal interpretation of what statistic?



What is a P-value of 0.034?

400

This numerical value measures the strength and direction of a linear relationship, ranging from -1 to

What is the correlation coefficient or r?


400

This fake harmless treatment (IE: sugar pill) is given to a control group to see if changes are due to the actual drug or just the psychology of being treated

What is a placebo

400

The probability of a single event must always be a number between these two values, inclusive.

What are 0 and 1 (or 0% and 100%)?


400

This inference test is used to determine if an observed categorical distribution matches an expected distribution

What is a Chi Square goodness of fit test?


400

For a regression line predicting fuel efficiency in mpg from car weight, s = 2.4. Interpret this value.


What is fuel efficiency of cars will vary by 2.4 mpg?

500

This number represents the distance between the first quartile (Q1) and the third quartile (Q3), measuring the spread of the middle 50% of the data.

What is the interquartile range (IQR)?

500

This occurs when an outside variable is related to the explanatory variable and affects the response, making it impossible to tell which variable caused the result.

What is a confounding variable?

500

This term describes the longrun average outcome of a random variable if you were to repeat an experiment very many times.

What is the mean?

500

This condition must be met to ensure that the standard deviation formula for a sampling distribution is accurate when you sample without replacemnt

What is the 10% condition? (The population must be at least 10 times bigger than the sample size).

500

A population distribution of household water usage is strongly skewed to the right. Interpret what the Central Limit Theorem tells us about the sampling distribution of the sample mean x̄  for a random sample of n = 50 households.

What is normal sampling distribution?

M
e
n
u