Analyzing Data
Categorical Data
Describing Data
Variability
Modeling Data
100

Takes numerical values for a measured or counted quantity.

Quantitative Variable

100

Stack up bars to make 100%

Segmented bar graph

100

This acronym reminds you how to describe or compare a distribution with context.

S.O.C.S. or S.O.C.V.

Shape. Outlier. Center. Spread/Variability

100

A measure of center that is resistant to extreme values.

Median

100

Cumulative Relative Frequency

Percentile
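
As a rough illustration of reading a percentile as a cumulative relative frequency, the sketch below computes the percent of values at or below a given value. The function name `percentile_of` and the quiz scores are made up for this example, and the "less than or equal to" convention is an assumption (some textbooks use strictly "less than").

```python
def percentile_of(value, values):
    """Percent of observations less than or equal to `value` (assumed convention)."""
    count_at_or_below = sum(1 for x in values if x <= value)
    return 100 * count_at_or_below / len(values)


# Hypothetical quiz scores, not taken from the review board.
quiz_scores = [55, 60, 65, 70, 70, 75, 80, 85, 90, 95]

# Seven of the ten scores are at or below 80.
print(percentile_of(80, quiz_scores))  # 70.0
```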

200

This display shows frequency (how many) or relative frequency (percent).

Bar graph

200

Takes a fixed set of possible values with gaps between them.

Discrete Variable

200

$Q_1 - 1.5(\text{IQR})$

$Q_3 + 1.5(\text{IQR})$

Outlier rule
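
The 1.5(IQR) rule can be sketched in a few lines. This example assumes quartiles are found as the medians of the lower and upper halves of the sorted data (other software may use a different quartile method and give slightly different fences), and it assumes at least two data values. The helper names and the data are hypothetical.

```python
from statistics import median


def quartiles(values):
    """Return (Q1, Q3) as the medians of the lower and upper halves."""
    data = sorted(values)
    half = len(data) // 2          # the middle value is excluded when n is odd
    lower_half = data[:half]
    upper_half = data[-half:]
    return median(lower_half), median(upper_half)


def outlier_fences(values):
    """Return (lower fence, upper fence) from the 1.5(IQR) rule."""
    q1, q3 = quartiles(values)
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr


def find_outliers(values):
    """Return the values that fall outside the fences."""
    low, high = outlier_fences(values)
    return [x for x in values if x < low or x > high]


# Made-up data: Q1 = 4.5, Q3 = 9.5, IQR = 5.
scores = [2, 4, 5, 6, 7, 8, 9, 10, 30]
print(outlier_fences(scores))  # (-3.0, 17.0)
print(find_outliers(scores))   # [30]
```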

200

Mean and standard deviation are greatly affected by outliers.

Nonresistant

200

Describes how many standard deviations a value falls from the mean of the distribution and in what direction.

z-score (standardized score)
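
A minimal sketch of the z-score calculation, (value - mean) / standard deviation. The mean of 64 inches and standard deviation of 3 inches are invented numbers for illustration.

```python
def z_score(value, mean, sd):
    """Number of standard deviations `value` lies from `mean`; the sign gives the direction."""
    return (value - mean) / sd


# Hypothetical height distribution: mean 64 inches, standard deviation 3 inches.
print(z_score(70, 64, 3))  # 2.0  (two standard deviations above the mean)
print(z_score(61, 64, 3))  # -1.0 (one standard deviation below the mean)
```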

300

Assign labels that place individuals into groups.

Categorical Variables

300

The segmented bar graphs for the groups being compared look different.

Association (Knowing the value of one variable helps predict the other variable).

300

mean < median

Skewed left

300

min, $Q_1$, median, $Q_3$, max

Five-number summary 

300

Same shape, $\mu = 0$, $\sigma = 1$

Standardizing a distribution 
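
To see why standardizing gives a mean of 0 and a standard deviation of 1, the sketch below converts every value in a small made-up data set to a z-score and then recomputes both statistics. It treats the data as a whole population (so it uses `pstdev`), which is an assumption of this example.

```python
from statistics import mean, pstdev


def standardize(values):
    """Convert every value to a z-score using the data's own mean and SD."""
    mu = mean(values)
    sigma = pstdev(values)  # population standard deviation (assumed here)
    return [(x - mu) / sigma for x in values]


# Made-up data with mean 5 and population standard deviation 2.
data = [2, 4, 4, 4, 5, 5, 7, 9]
z_values = standardize(data)

print(z_values)          # each value shifted by the mean, scaled by the SD
print(mean(z_values))    # 0.0
print(pstdev(z_values))  # 1.0
```

The relative positions of the values are unchanged, which is why the shape of the distribution stays the same.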

400

Knowing the value of one variable helps predict the other variable.

Association

400

Takes any value in an interval on the number line (including decimals).

Continuous Variable

400

mean $\approx$ median

Roughly symmetric

400

The heights of students at our school typically vary by about 3.1 inches from the mean height.

Standard deviation

400

If a distribution is approximately normal, then approximately 68%, 95%, and 99.7% of the data will be within $\pm 1\sigma$, $\pm 2\sigma$, and $\pm 3\sigma$ of the mean, respectively.

Empirical Rule
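
The 68%, 95%, and 99.7% figures can be checked against an exact normal curve. This sketch uses `NormalDist` from Python's standard library; the helper name `area_within` is made up for the example.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)


def area_within(k):
    """Proportion of a normal distribution within k standard deviations of the mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)


for k in (1, 2, 3):
    print(k, round(area_within(k) * 100, 1))
# 1 68.3
# 2 95.4
# 3 99.7
```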

500

The vertical axis of a display does not start at 0.

Misleading graph

500

Segmented bar graph where the width of bars is proportional to group size.

Mosaic plot

500

Median, IQR, $Q_1$, $Q_3$

Resistant measures

500

$\sigma^2$

Variance

500

Used to find the x-value on a normal distribution given a specific area. When using technology, enter these parameters:

(area, mu, sigma)

invNorm
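
For readers without a graphing calculator, a comparable calculation can be sketched in Python. The wrapper `inv_norm` is a hypothetical name chosen to mirror the (area, mu, sigma) order above; it assumes the area is measured to the left of the x-value and relies on `NormalDist.inv_cdf` from the standard library. The mean of 64 and standard deviation of 3 are invented numbers.

```python
from statistics import NormalDist


def inv_norm(area, mu=0.0, sigma=1.0):
    """Return the x-value with the given area to its LEFT under a normal curve."""
    return NormalDist(mu, sigma).inv_cdf(area)


# Hypothetical distribution: mean 64, standard deviation 3.
print(round(inv_norm(0.50, 64, 3), 2))  # 64.0  (half the area lies below the mean)
print(round(inv_norm(0.90, 64, 3), 2))  # 67.84 (the 90th percentile)
```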
