DATA and DISTRIBUTIONS
SAMPLING and DESIGN
COLLECTING DATA
BINOMIAL GEOMETRIC
THE LINEAR MODEL
100

What 3 features must be mentioned when"describing and distribution"?

Shape, Center, and Spread (plus outliers)

100

This sampling method divides the population into groups and samples everyone in a few selected groups.

Cluster Sampling

100

This type of bias occurs when people give dishonest answers, often due to the wording of a question.

Response Bias

100

In a Binomial distribution, this letter represents the fixed number of trials.

n

100

In the equation ỳ = a +bx, what does b represent?

The Slope

200

Using the 1.5 IQR rule, how do you calculate the upper bound for outliers?

Q3 + 1.5(IQR)

200

A study where treatments are imposed to observe a response and establish causation.

Randomized Experiment

200

This is a list of every individual in the population from which a sample is actually drawn.

Sampling Frame

200

This "rule" states that np 10 and n(1 - p) ≥ 10 to use a normal approximation.

Large Counts Condition

200

This value, p?, tells you the% of variation in Y explained by the linear model with X.

Coefficient of Determination

300

If a distribution is heavily skewed to the right, which is typically larger: the Mean or the Median?

The Mean

300

The "baseline" group in an experiment that receives no treatment or a placebo.

Control Group

300

If you only sample people who are easy to reach, you are using this biased sampling method.

Convenience Sampling

300

In a Geometric setting, we are calculating the probability of the first success occurring on what?

The kth trial (first success)

300

This is the difference between an observed value and the value predicted by the regression line.

The Residual

400

A Z-score of -2.5 means the data point is located where?



2.5 standard deviations below the mean

400

This type of bias occurs when certain members of a population cannot be chosen for the sample.

Undercoverage

400

To reduce the effect of variation among subjects, researchers use this: the "blocking" of experiments.

Randomized Block Design

400

This is the formula for the Mean (Expected Value) of a Geometric random variable.

Population mean=1/p

400

If you switch the X and Y variables, which value remains unchanged: the slope or the correlation (r)?

Correlation (r)

500

What happens to the SD of a data set if you add 10 to every single value?

It stays the same

500
This experimental design pairs similar subjects together to reduce the effects of lurking variables?

Matched Pairs Design

500

This occurs when a large portion of the invited sample chooses not to respond or participate.

Nonresponse Bias

500

How do yo calculate the SD of a Binomial distribution?

Population = V--------- (square root btw)

                     np(1-p)

500

This plot must show "no clear pattern" for a linea model to be considered appropriate.

Residual Plot

M
e
n
u