Variables & Data
The Shape Of You
S.O.C.S
Measuring Center
The Outlier Files
100

This type of variable describes an attribute or group, like a Dum Dum flavor or a student’s gender.

What is a Categorical Variable?

100

If the right side of a dotplot is much longer than the left, we say the distribution is this.

What is Skewed Right?

100

In the acronym S.O.C.S., the "S" at the beginning stands for this (e.g., Skewed or Symmetric).

What is Shape?

100

You find this value by adding all observations and dividing by the total number of individuals ($n$).

What is the Mean?

100

An individual observation that falls outside the overall pattern of the rest of the data.

What is an Outlier?

200

This type of variable takes numerical values where it makes sense to find an average, like the number of pops in your bank.

What is a Quantitative Variable?

200

If the left and right sides of a graph are approximately mirror images, we describe it as this.

What is Symmetric?

200

The "C" in S.O.C.S. stands for this, which can be measured by the mean or median.

What is Center?

200

This is the middle value of a data set when the values are arranged in order from smallest to largest.

What is the Median?

200

This term describes a statistic that is NOT affected by extreme outliers.

What is Resistant?

300

This describes the pattern of variation of a variable—it tells us what values the variable takes and how often.

What is a Distribution?

300

A distribution with one single peak is called this.

What is Unimodal?

300

The final "S" in S.O.C.S. describes how much the data varies, also known as this.

What is Spread (or Variability)?

300

If you have 14 pieces of data (like the "Percent of Air in Chips" example), the median is the average of which two positions?

What are the 7th and 8th positions?

300

Between the Mean and the Median, this is the one that is considered Resistant.

What is the Median?

400

This is the specific "count" of individuals in a category divided by the total number of individuals.

What is Relative Frequency?

400

This graph is used for quantitative data and shows each value as a point above a number line.

What is a Dotplot?

400

To calculate this simple measure of spread, you subtract the Minimum from the Maximum.

What is Range?

400

In a Skewed Left distribution, this measure of center is usually pulled down and is smaller than the other.

What is the Mean?

400

In your Lesson 1.6 notes, adding "Mrs. Gallas" (65 hours) to the class data caused the Mean to do this.

What is Increase?

500

A table that shows the distribution of a categorical variable by giving the number of individuals in each category.

What is a Frequency Table?

500

If a dotplot has two distinct clusters or "humps," it is described by this term.

What is Bimodal?

500

When describing a distribution, you must always provide these two things to make your answer complete.

What are Context (units/variable name) and Comparison words (if comparing two sets)?

500

If a distribution is perfectly symmetric, what is the relationship between the Mean and the Median?

What is they are equal?

500

Why is the Median used to report "Typical Home Prices" in a city instead of the Mean?

Because home prices are often Skewed Right (due to a few mansions), and the Median is resistant to those high outliers.

M
e
n
u