Correlation and Determination
Scatterplots
Regression Lines
Association & Causation
Outliers & Residuals
100

What does a correlation coefficient of r = 0.98 indicate?

A very strong positive linear relationship.

100

Q: What is the “form” of a relationship shown in a scatterplot?

A: Whether it is linear or non-linear.

100

Q: In the regression line y=2.3x + 3, what does the slope represent?

A: The x increases by 2.3

100

Q: Shoe size and reading level—what type of association?

A: Common response (age).

100

Q: What is an outlier?

A: A data point that does not follow the pattern.

200

If r = 0.5, what is R² as a percentage?

R² = 0.25, which is 25%


200

Q: If the points slope downwards from left to right, what is the direction?

A: Negative.

200

Q: In the same line y=2.3x + 5, what does the y-intercept represent?

A: Predicted y value of 5 revenue when x = 0

200

Q: Number of fire trucks and property damage— what type of association?

A: Confounded (big fires cause both).

200

Q: What effect does an outlier have on r?

A: It weakens the correlation.

300

What does R² represent?

The percentage of variation explained by the model.

300

Q: Describe the association between health expenditure and infant mortality.

A: Moderate to strong negative linear relationship.

300

Q: What is the formula for a residual?

A: Residual = actual y – predicted y.

300

Q: Ice-cream sales and drowning incidents— what type of association?


A: Common response (hot weather).

300

Q: In a residual plot, what pattern indicates a poor fit?

A: A curved or systematic pattern.

400

What value of r indicates NO linear association?

r = 0.

400

Q: Name one reason it is appropriate to calculate r for the health vs mortality data.

A: The relationship is reasonably linear.

400

Q: If the slope is negative, what does this mean?

A: As x increases, y decreases.

400

Q: Pirates vs global temperature— what type of association??

A: Coincidental

400

Q: If an outlier lies far above the regression line, what is the residual?

A: A large positive residual.

500

Q: What does the remaining (1 – R²) represent?

A: Variation explained by other factors not in the model.

500

Q: What type of plot shows whether data points fit the regression line well?

A: A residual plot.

500

Q: If an outlier is removed, how will the slope and r typically change?

A: Both become stronger (slope steeper, |r| increases).

500

Q: Explain why correlation does not imply causation.

A: A third variable may be influencing both.

500

Q: Why should outliers be investigated rather than simply removed?

A: They may indicate errors or meaningful unusual cases.

M
e
n
u