Mean vs Median
Comparing Distributions
Sampling and Prediction
Outliers
Real World Interpretation
200

In right-skewed data, which is larger?

mean

200

Same mean, different spread → what differs?

standard deviation

200

A Sample differs from a population because of…

Sampling variation

200

Which rule uses mean?

2σ rule

200

Best summary for skewed income data

Median

400

In left-skewed data, which is smaller?

mean

400

Same median, different IQR → what differs?

Variation

400

Larger samples are generally…

More accurate or representative of the population

400

Which rule uses quartiles?

IQR rule

400

If most values are clustered tightly, the data is…

Consistent and reliable

600

Why mean is not ideal for skewed data?

Sensitive to extreme values

600

What does a consistent distribution look like?

smaller spread / smaller IQR

600

A subset of the population

Sample

600

In skewed data, which rule is better?

IQR rule

600

A dataset has no outliers by the IQR rule but has extreme values. What does this suggest about the distribution?

likely skewed; extreme values are part of the distribution

800

Can mean = median in skewed data?

no (generally not)

800

Two datasets overlap heavily—what does this mean?

They are similar distributions. Similar values

800

Why might a sample of 50 fans give different shirt size proportions than the whole school?

random variation / not perfectly representative

800

Can a value be outlier by one rule but not the other?

yes

800

A long tail indicates…

skew

1000

A distribution has mean > median and a long right tail. Explain how the extreme values are affecting the mean.

High values pull the mean to the right (larger than the median)

1000

Why compare medians instead of means for skewed data?

The median is resistant to extreme values

1000

Best way to improve prediction...

larger/more representative sample

1000

Why 2σ rule may flag more outliers in skewed data

mean/SD affected by skew

1000

A dataset is right-skewed. Explain why the mean is not a good representation of a “typical” value in context.

mean is pulled by high values; median better represents typical values