Organizing Data
Data Relationships
Producing Data & Experiments
Probability & Distributions
Inference & Tests
100
This measure of center is more resistant to outliers than the mean.
What is the median?
100
observed y - predicted y
What is the residual?
100

This type of sampling involves randomly choosing a heterogeneous group(s) and surveying EVERYONE in that group.

What is cluster sampling?

100

We learned about binomial and geometric distribution right before winter break, this distribution requires waiting UNTIL a FIRST occurs.

What is a geometric distribution?

100

The type of significance test used for quantitative data where we are testing to find the truth about the mean of a single population when the standard deviation of the population is unknown.

What is a T test (or T procedure)?

200
To calculate, subtract the mean of the distribution from the observed x, then divide by the standard deviation.
What is the z-score (or standardized value)?
200
Measures the direction and strength of a linear relationship between two quantitative variables.
What is correlation (or r)?
200

This is given to a control group so that they are not aware if they received the treatment or not.

What is a placebo?

200

At the beginning of the year, we discussed quantitative data in detail. We said that discrete variables that can only take on certain values, and we also talked about this variable that can take on any value. 

What is a continuous variable?

200

We said this idea is SO cool, because no matter what shape the population is, if we take lots of large samples from that population, that sampling distribution will turn out to be a normal shape.

What is the Central Limit Theorem?

300

This rule was an excellent approximation of how much data is between 1, 2, and 3 standard deviations from the mean.

What is the 68-95-99.7 rule?

300

The fraction of the variation in y that is explained by the LSRL of y on x.

What is the coefficient of determination (or r squared)?

300

This experimental design involves the random assignment of units to treatments which are carried out separately within each group of units known to be similar in some way that is expected to affect the responses.

What is block design?

300
Events that have no outcomes in common and can never occur simultaneously, for which the addition rule is used.
What are disjoint events (or mutually exclusive events)?
300

This type of significance test is awesome because it runs multiple 1 prop z tests at once! They also have a condition for expected cell counts which we find in matrices after running the test on our calculator. 

What are the Chi-Square tests?

400

This measure of spread can be found by squaring standard deviation.

What is the variance?

400

The a in equation y-hat = a + bx

What is the y-intercept?

400
The 3 basic principles of experimental design.
What are control, randomize, and replicate?
400

This measures how many standard deviations above or below the mean that a particular value is positioned.

What is a z-score?

400

Some element or evidence of this condition is required in all significance tests

What is Random?

500
This calculator command can be used to find the area under a normal distribution and above an interval.
What is normalcdf?
500

This word or phrase needs to be mentioned when asked to interpret the slope of the LSRL.

What is "on average"?

500

This type of variable occurs in an experiment when we are not able to attribute the response of the experimental unit with the treatment because of it

What is a confounding variable?

500
Finding the mean of a probability distribution can also be called this

What is the expected value?

500

This test is required when we have quantitative data that has to go in a certain order because the values are connected in some way.

What is a matched pairs t-test?