Scatterplots & Correlation
Regression & Prediction
Cautions in Associations
Probability Basics
Conditional & Applied Probability
100

What type of variables are displayed in a scatterplot?

Two quantitative variables

100

What is the equation of a regression line?

ŷ = a + bx

100

What is extrapolation?

Using the regression line to predict values outside the observed range. 

100

Formula for relative frequency?

(# event occurrences) / (total trials).

100

 Interpret P(A | B) in words.

 “The probability of A given B.”

200

What does a positive correlation (r) indicate?

As one variable increases, the other also increases.

200

In the regression equation, what does b represent?

The slope, or the predicted change in y for each 1-unit change in x.

200

What is an influential point?

A data point that significantly affects the slope of the regression line.

200

Valid range of a probability?

From 0 to 1, inclusive.

200

In symbols, what does the vertical slash “|” mean?

“Given.”

300

What is the range of possible values for r?

From –1 to +1.

300

What does the y-intercept (a) represent?

The predicted y-value when x = 0.

300

Does correlation imply causation?

No, a strong correlation does not mean one variable causes the other.

300

When are two events independent?

When the outcome of one does not affect the other.

300

When are events disjoint (mutually exclusive)?

When they have no outcomes in common.

400

What does an r value close to 0 suggest?

A weak or no linear relationship.

400

What does the r² (coefficient of determination) tell us?

The percentage of variation in y explained by x.

400

What is a lurking variable?

An unmeasured variable that influences the relationship.

400

General addition rule: P(A or B) = ?

 A: P(A) + P(B) − P(A and B).

400

If P(A) = 0.4 and P(B) = 0.5 and A, B are independent, find P(A and B).

0.20

500

What kind of relationship does correlation not measure well?

Nonlinear relationships or those influenced by outliers.

500

How does the least squares method determine the regression line?

It minimizes the sum of squared residuals.

500

Difference between lurking and confounding variables?

Lurking: not measured; Confounding: measured but entangled with other explanatory variables.

500

Multiplication rule for independent A and B?

P(A and B) = P(A) × P(B).

500

Define a probability model.

A specification of possible outcomes and assumptions/probabilities for events in the sample space.

M
e
n
u