Descriptive Statistics
Probability
Inferential Statistics
Regression Analysis
Sampling and Surveys
100

This measurement describes the middle value of a data set when arranged in ascending order.

Median 

100

The probability of rolling a sum of 7 with two fair six-sided dice.

1/6
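
The 1/6 answer can be checked by listing all 36 equally likely outcomes of two dice and counting those that sum to 7. The sketch below is illustrative only; the helper name `prob_of_sum` is an assumption, not part of the original board.

```python
from fractions import Fraction
from itertools import product


def prob_of_sum(target, sides=6):
    """Exact probability that two fair dice with `sides` faces sum to `target`."""
    # Every ordered pair (first die, second die) is equally likely.
    outcomes = list(product(range(1, sides + 1), repeat=2))
    favorable = sum(1 for a, b in outcomes if a + b == target)
    return Fraction(favorable, len(outcomes))


p_seven = prob_of_sum(7)
print(p_seven)
```

Six of the 36 ordered pairs sum to 7: (1,6), (2,5), (3,4), (4,3), (5,2), (6,1).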

100

This is the term for the range of values within which you expect the population parameter to fall, based on the sample statistic.

confidence interval 

100

Type of line used to predict the value of a dependent variable based on the value of an independent variable.

Regression

100

A subset of a population used to represent the whole group.

Sample

200

The measure of spread calculated as the square root of variance.

Standard Deviation 
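
As a small sketch of the relationship in this clue, the standard deviation can be computed by taking the square root of the variance. The data values are made up for illustration, and the population (divide-by-n) variance is assumed here; a sample variance would divide by n - 1 instead.

```python
import math
import statistics

# Hypothetical data set, chosen only for illustration.
data = [2, 4, 4, 4, 5, 5, 7, 9]

# Population variance: mean of squared deviations from the mean.
variance = statistics.pvariance(data)

# Standard deviation is the square root of the variance.
std_dev = math.sqrt(variance)

print(variance, std_dev)
```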

200

For two independent events, the probability of both events occurring is found by doing this to their individual probabilities.

Multiplying them

200

This is the term for the error made when a null hypothesis that is actually true is rejected.

Type I Error

200

The correlation coefficient measures these two things between two variables.

strength and direction

200

A sampling method in which every member of the population, and every possible sample of a given size, has an equal chance of being selected.

Simple Random Sample

300

The sum of all data values divided by the number of values.

Mean 

300

The rule used to find the probability that one or the other of two mutually exclusive events occurs.

addition rule 

300

This hypothesis test is used to compare the means of two groups when the standard deviations are unknown but assumed to be equal.

two-sample t-test 

300

This value, often denoted as R², indicates the proportion of variability in the dependent variable explained by the independent variable.

Coefficient of Determination 
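
One way to see what "proportion of variability explained" means is to fit a least-squares line and compare the leftover (residual) variation with the total variation. This is a minimal sketch for simple linear regression; the function name `r_squared` and the data are assumptions made for illustration.

```python
def r_squared(x, y):
    """R^2 for a least-squares line fitted to paired data x, y."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n

    # Least-squares slope and intercept.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x

    # Residual = observed value minus predicted value.
    ss_res = sum((b - (intercept + slope * a)) ** 2 for a, b in zip(x, y))
    ss_tot = sum((b - mean_y) ** 2 for b in y)
    return 1 - ss_res / ss_tot


# Hypothetical data, for illustration only.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
print(r_squared(x, y))
```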

300

This type of bias occurs when a sample does not accurately represent the population.

Sampling Bias

400

The type of skewness when the tail on the right side of the data distribution is longer or fatter than the left side.

right skewed

400

The word used to read the vertical bar in a conditional probability such as P(A | B), where a probability is updated based on new evidence.

"given"

400

This statistical test is used to determine whether two categorical variables are likely to be related in one population.

Chi Square Test for Independence

400

This is the difference between an observed value and the value predicted by the model.

Residual 

400

The method used to divide a population into subgroups before sampling to ensure representation from each subgroup.

Stratified Sampling 

500

Type of plot used to display the distribution of a data set based on a five-number summary.

boxplot 

500

The name of the distribution that models the number of successes in a fixed number of independent trials, each with the same probability of success.

Binomial Distribution 
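
As a rough illustration of this distribution, the probability of exactly k successes in n independent trials with success probability p is C(n, k) p^k (1 - p)^(n - k). The function name `binomial_pmf` below is an assumption chosen for this sketch.

```python
from math import comb


def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials."""
    # comb(n, k) counts the ways to choose which k trials succeed.
    return comb(n, k) * p**k * (1 - p) ** (n - k)


# Example: exactly 2 heads in 4 fair coin flips.
print(binomial_pmf(2, 4, 0.5))
```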

500

The p-value is compared to this to decide whether to reject the null hypothesis.

significance level 

500

This type of pattern in a residual plot indicates that a linear regression model is not appropriate.

Curved 

500

The method of selecting a sample based on members of the population that are easiest to access.

Convenience 
