Scatter Plots
Line of Best Fit
Correlation Coefficient
Residuals
Five-Number Summary
100

This type of graph shows the relationship between two quantitative variables.

Scatter plot

100

This line models the general trend of data points in a scatter plot.

The line of best fit 

100

This statistic measures the strength and direction of a linear relationship.

The correlation coefficient

100

A residual is found by subtracting this from the actual value.

The predicted value

100

This set includes the minimum, Q1, median, Q3, and maximum.

The five-number summary

200

When graphing a scatter plot, this variable belongs on the x-axis.

Independent variable

200

The goal of a line of best fit is to minimize these values.

Residuals

200

The correlation coefficient always falls between these two values.

−1 and 1

200

A positive residual means the actual data point is located where relative to the line?

Above the line

200

This value separates the lower 25% of the data from the rest.

Q1

300

A scatter plot where points trend upward from left to right shows this type of relationship.

Positive relationship

300

In the equation y=mx+b, this parameter represents the rate of change.

m / the slope

300

A correlation coefficient close to 0 indicates this type of relationship.

Weak or no linear relationship

300

If a residual is zero, this describes the data point’s location.

On the line

300

The median represents this percentile.

The 50th percentile

400

If points are randomly scattered with no clear pattern, the relationship is described this way.

No relationship/correlation

400

A line of best fit should have roughly this many points above and below it.

About half

400

If the correlation coefficient is −0.85, the relationship is best described this way.

A strong negative correlation

400

Large residuals may indicate this feature in the data.

An outlier

400

The difference between Q3 and Q1 is called this.

The interquartile range (IQR)

500

This feature of a scatter plot helps determine whether a linear model is reasonable to use.

The overall pattern or trend of the data

500

This is one reason a data point might not lie close to the line of best fit.

Natural variability, an outlier, or measurement error

500

This important idea explains why correlation alone cannot prove cause and effect.

Correlation does not imply causation

500

Residuals are used to judge how well this type of model fits the data.

A linear model

500

The five-number summary is most commonly displayed using this graph.

A box plot

M
e
n
u