This measurement describes the middle value of a data set when arranged in ascending order.
Median
The probability of rolling a sum of 7 with two fair six-sided dice.
1/6
This is the term for the range of values within which you expect the population parameter to fall, based on the sample statistic.
confidence interval
Type of line used to predict the value of a dependent variable based on the value of an independent variable.
Regression
A subset of a population used to represent the whole group.
Sample
The measure of spread calculated as the square root of variance.
Standard Deviation
For two independent events, the probability of both events occurring is found by doing this to their individual probabilities.
Multiplying them
This is the term for the range of values within which you expect the population parameter to fall, based on the sample statistic.
Type I Error
The correlation coefficient measures these two things between two variables.
strength and direction
A sampling method where everyone in the population has an equal chance of being selected.
Simple Random Sample
The sum of all data values divided by the number of values.
Mean
The rule used to find the probability that one or the other of two mutually exclusive events occurs.
addition rule
This hypothesis test is used to compare the means of two groups when the standard deviations are unknown but assumed to be equal.
two-sample t-test
This value, often denoted as R2 , indicates the proportion of variability in the dependent variable explained by the independent variable.
Coefficient of Determination
This type of bias occurs when a sample does not accurately represent the population.
Sampling Bias
The type of skewness when the tail on the right side of the data distribution is longer or fatter than the left side.
right skewed
The word used for the conditional probability which involves updating probabilities based on new evidence.
"given"
Chi Square Test for Independence
This is the difference between observed values and the values predicted by the model
Residual
The method used to divide a population into subgroups before sampling to ensure representation from each subgroup.
Stratified Sampling
Type of plot used to display the distribution of a data set based on a five-number summary.
boxplot
The name of the distribution that models the number of successes in a fixed number of independent trials.
Binomial Distribution
The significance method that involves comparing the p-value to this to decide whether to reject the null hypothesis.
significance level
This is the type of pattern that makes a regression model not appropriate
Curved
The method of selecting a sample based on members of the population that are easiest to access.
Convenience