Data Analysis & Modeling Distributions of Data
Probability & Distributions
Describing Relationships
Mystery
Inference Tests
100

This measure of center is more resistant to outliers than the mean.

What is the median?

100

This type of random variable requires a fixed number of trials.

What is a binomial random variable?

100

The difference between an actual value of the response variable and the value predicted by the regression line.

What is the residual?

100

The 3 basic principles of experimental design.

What are control, randomize, and replicate?

100

A test of the null hypothesis that a population mean is equal to a specified value.

What is a T-test for the mean?

200

To calculate, subtract the mean of the distribution from the observed x, then divide by the standard deviation.

What is the z-score (or standardized value)?

200

A number between 0 and 1 that describes the proportion of times an outcome of a chance process would occur in a very long series of repetitions.

What is probability?

200

Measures the direction and strength of a linear relationship between two quantitative variables.

What is correlation (r)?

200

The condition involving the population size that must be satisfied to use sigma divided by the square root of n as the standard deviation of a sampling distribution.

What is the 10% condition?

200

When the observed results of a study are too unusual to be explained by chance alone, the results are called what?

What is statistically significant?

300

This rule helps to determine if data is normally distributed by checking the number of observations within each interval.

What is the empirical rule?

300

Two events A and B that have no outcomes in common and so can never occur together.

What are mutually exclusive events?

300

A measure of the percent reduction in the sum of squared residuals when using the least-squares regression line to make predictions, rather than the mean value of y.

What is the coefficient of determination (r2)?

300

This experimental design involves forming blocks consisting of individuals that are similar in some way that is important to the response. Random assignment of treatments is then carried out separately within each block.

What is a randomized block design?

300

A test of the null hypothesis that a categorical variable has a specified distribution.

What is the goodness of fit (or chi-square) test?

400

The “average” squared deviation of the observations in a data set from their mean. 

What is the variance?

400

If knowing whether one event occurs does not alter the probability that the other event occurs.

What is independence?


400

Applying a logarithmic transformation to both variables causes this type of model to become linear.

What is a power model?

400

Variable that can take any value in an interval on the number line. The probability distribution of this variable is described by a density curve.

What is a continuous random variable?

400

Two of the conditions to be verified for inference about a proportion.

What are the 10% condition and the Large Counts condition? (Random condition could also be one of the two)

500

This calculator command can be used to find the area under a normal distribution and above an interval.

What is normalcdf?

500

If we observe more and more repetitions of any chance process, the proportion of times a specific outcome will occur approaches a single value (in the long run). In the short run, it is unpredictable.

Law of Large Numbers

500

When above-average values of one variable tend to accompany below-average values of the other.

What is negative association?

500

Use of a regression model for prediction far outside the interval of values of the explanatory variable x used to obtain the model. Such predictions are often not accurate.

What is extrapolation?

500

The probability that a test will reject  at a chosen significance level when a specified alternative value of the parameter is true.

What is power?