Measures of Central Tendency & Variability
Data Representation
Probability
Sampling and Inference
Linear Regression & Correlation
100

The measure of center that is the sum of all values divided by the number of values.

the mean

100

This type of plot displays the five-number summary: minimum, first quartile, median, third quartile, and maximum.

Box Plot

100

The probability of an event that is certain to happen.

 1 (or 100%)

100

The entire group of individuals that we want information about

The population

100

The name for the variable that we use to predict the value of another variable

What is the independent variable (or explanatory variable)

200

The middle value of an ordered dataset.

the median

200

 A graph that uses bars to show frequency distributions for continuous numerical data.

Histogram

200

The set of all possible outcomes of an experiment

the sample space

200

A smaller, representative subset of individuals selected from the population

A Sample

200

A measure of the strength and direction of the linear relationship between two variables.

correlation

300

A measure of spread calculated by subtracting the minimum value from the maximum value.

the range

300

This plot uses 'stems' and 'leaves' to show the frequency and shape of a distribution while retaining the original data values.

Stem-and-Leaf-plot

300

Tossing a coin and rolling a number cube at the same time are examples of these types of events.

independent events

300

A method of sampling where every individual in the population has an equal chance of being selected.

a random sample (or simple random sample)

300

A data point that lies far away from the general pattern of the other data points

outlier

400

 The most frequent value in a dataset

the mode

400

A chart using proportional slices to represent parts of a whole.

Pie Chart

400

The probability of rolling a number greater than 2 on a standard six-sided number cube.

4/6 or 2/3

400

Drawing conclusions about a population based on data collected from a sample.

inference (or making an inference)

400

The name of the specific line used to model a linear relationship between two variables and minimize the vertical distances to the data points.

 the least-squares regression line (LSRL) (or line of best fit)

500

This value describes the average distance of each data point from the mean.

standard deviation

500

This type of plot is used to display the relationship between two quantitative variables.

Scatterplot

500

A diagram with branches used to list all possible outcomes of a sequence of events.

tree diagram

500

A study where researchers observe individuals and measure variables of interest without attempting to influence the responses

an observational study

500

The process of using a regression line to predict values outside the range of the observed data.

extrapolation