Vocab 1
Vocab 2
Vocab 3
Vocab 4
100

The objects described by a set of data. They may be people, animals, or things.

Individuals  

100

A distribution is roughly ___ if the right and left sides of the graph are approximately mirror images of each other.

Symmetric

100

A distribution is skewed to the right if the right side of the graph (containing the half of the observations with larger values) is much longer than the left side. It is skewed to the left if the left side of the graph is much longer than the right side.

Skewed distributions

100

The ___ s measures the average distance of the observations from their mean. It is calculated by finding an average of the squared distances and then taking the square root. This average squared distance is called the variance. In symbols, the variance s² is given by s² = [(x1 − x̄)² + (x2 − x̄)² + ... + (xn − x̄)²] / (n − 1).

The standard deviation
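
The calculation in this clue can be sketched in a few lines of Python. This is a minimal illustration, assuming the usual sample convention of dividing the summed squared distances by n − 1; the function names and the data are made up for the example.

```python
import math


def sample_variance(values):
    """Average squared distance from the mean, using the n - 1 divisor."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations")
    mean = sum(values) / n
    squared_distances = [(x - mean) ** 2 for x in values]
    return sum(squared_distances) / (n - 1)


def sample_std_dev(values):
    """Square root of the sample variance."""
    return math.sqrt(sample_variance(values))


# Made-up observations, chosen only so the arithmetic is easy to follow.
scores = [2, 4, 4, 6, 9]

# The mean is 5, the squared distances sum to 28, and 28 / 4 = 7.0.
print(sample_variance(scores))
print(sample_std_dev(scores))
```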

200

Any characteristic of an individual. It can take different values for different individuals.

Variable 

200

An association between two variables that holds for each individual value of a third variable can be changed or even reversed when the data for all values of the third variable are combined. This reversal is called ____

Simpson's paradox
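
A small numeric sketch can make the reversal concrete. The counts below are invented for illustration: two hypothetical methods, X and Y, are compared on "easy" and "hard" cases (the third variable). X has the higher success rate within each case type, yet Y has the higher rate once the case types are combined.

```python
# Made-up (successes, attempts) counts for two hypothetical methods.
counts = {
    "X": {"easy": (18, 20), "hard": (40, 80)},
    "Y": {"easy": (68, 80), "hard": (8, 20)},
}


def rate(successes, attempts):
    """Success rate for one cell of the table."""
    return successes / attempts


def combined_rate(groups):
    """Success rate after pooling every level of the third variable."""
    successes = sum(s for s, _ in groups.values())
    attempts = sum(a for _, a in groups.values())
    return successes / attempts


# Within each level of the third variable, X beats Y.
for level in ("easy", "hard"):
    print(level, rate(*counts["X"][level]), rate(*counts["Y"][level]))

# After combining the levels, the comparison reverses.
print("combined", combined_rate(counts["X"]), combined_rate(counts["Y"]))
```

The reversal happens here because X was used mostly on hard cases while Y was used mostly on easy ones.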

200

To find the mean x̄ (pronounced "x-bar") of a set of observations, add their values and divide by the number of observations. If the n observations are x1, x2, ..., xn, their mean is x̄ = (x1 + x2 + ... + xn) / n.

The mean x̄

200

The ___ of one of the categorical variables in a two-way table of counts is the distribution of values of that variable among all individuals described by the table.

Marginal distribution
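
As a rough illustration, the sketch below computes both marginal distributions from a small two-way table of counts. The table, its variable names (`grade`, `has_job`), and the helper functions are all invented for this example.

```python
# Made-up two-way table: rows are grade, columns are whether the student has a job.
table = {
    "junior": {"yes": 20, "no": 30},
    "senior": {"yes": 35, "no": 15},
}


def row_marginal(table):
    """Proportion of all individuals falling in each row category."""
    grand_total = sum(sum(row.values()) for row in table.values())
    return {name: sum(row.values()) / grand_total for name, row in table.items()}


def column_marginal(table):
    """Proportion of all individuals falling in each column category."""
    grand_total = sum(sum(row.values()) for row in table.values())
    totals = {}
    for row in table.values():
        for column, count in row.items():
            totals[column] = totals.get(column, 0) + count
    return {column: total / grand_total for column, total in totals.items()}


print(row_marginal(table))     # distribution of grade among all 100 students
print(column_marginal(table))  # distribution of has_job among all 100 students
```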

300

A variable that places an individual into one of several groups.

Categorical variable 

300

We say that there is ___ between two variables if specific values of one variable tend to occur in common with specific values of the other.

Association

300

The ___ M is the midpoint of a distribution, the number such that half the observations are smaller and the other half are larger.

Median

300

Call an observation an outlier if it falls more than 1.5 × IQR above the third quartile or below the first quartile.

The 1.5 × IQR rule for outliers
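
The rule can be sketched in Python as follows. One assumption to note: quartiles are computed here as the medians of the lower and upper halves of the ordered data, leaving out the middle value when the count is odd. Other quartile conventions exist and can give slightly different fences. The helper names and data are made up for the example.

```python
def median(ordered):
    """Median of a list that is already sorted."""
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


def quartiles(values):
    """Q1 and Q3 as medians of the lower and upper halves (assumed convention)."""
    ordered = sorted(values)
    half = len(ordered) // 2
    if half == 0:
        raise ValueError("need at least two observations")
    q1 = median(ordered[:half])
    q3 = median(ordered[len(ordered) - half:])
    return q1, q3


def find_outliers(values):
    """Observations more than 1.5 x IQR below Q1 or above Q3."""
    q1, q3 = quartiles(values)
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    return [x for x in values if x < low_fence or x > high_fence]


# Made-up data: Q1 = 3.5, Q3 = 8.5, IQR = 5, so the fences are -4 and 16.
data = [1, 3, 4, 5, 6, 7, 8, 9, 30]
print(find_outliers(data))
```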

400

A variable that takes numerical values for which it makes sense to find an average.

Quantitative variable

400

A departure from symmetry in a distribution of data, in which one side of the graph is longer than the other.

Skew

400

Describes how similar or varied the values in a data set are. Measures include the range, quartiles, IQR, and standard deviation.

Spread

400

A ___ of a variable describes the values of that variable among individuals who have a specific value of another variable. There is a separate conditional distribution for each value of the other variable.

Conditional distribution
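
A short sketch of the idea, using an invented two-way table of counts: for each value of the row variable (`grade`, a made-up name), the counts in that row are divided by the row total, giving one conditional distribution of the column variable per row.

```python
# Made-up two-way table: rows are grade, columns are whether the student has a job.
table = {
    "junior": {"yes": 20, "no": 30},
    "senior": {"yes": 35, "no": 15},
}


def conditional_distribution(table, row_value):
    """Distribution of the column variable among individuals in one row."""
    row = table[row_value]
    row_total = sum(row.values())
    return {column: count / row_total for column, count in row.items()}


# One separate conditional distribution for each value of the row variable.
for grade in table:
    print(grade, conditional_distribution(table, grade))
```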

500

The ___ of a variable tells us what values the variable takes and how often it takes these values.

Distribution

500

The overall look of the data when displayed in a graph. Describing it includes noting skew, spread, center, and outliers.

Shape

500

A graph in which a circle is split into slices, each sized and labeled according to a category's share of the data.

Pie chart

500

The _____ of a distribution consists of the smallest observation, the first quartile, the median, the third quartile, and the largest observation, written in order from smallest to largest. In symbols, the five-number summary is Minimum, Q1, M, Q3, Maximum.

The five-number summary
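
A minimal Python sketch of the summary, assuming quartiles are taken as the medians of the lower and upper halves of the ordered data (the middle value is left out of both halves when the count is odd). The function names and the data are made up for the example, and other quartile conventions may give slightly different values.

```python
def median(ordered):
    """Median of a list that is already sorted."""
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for at least two observations."""
    ordered = sorted(values)
    half = len(ordered) // 2
    if half == 0:
        raise ValueError("need at least two observations")
    q1 = median(ordered[:half])                  # median of the lower half
    q3 = median(ordered[len(ordered) - half:])   # median of the upper half
    return ordered[0], q1, median(ordered), q3, ordered[-1]


# Made-up data; sorted it reads 5, 7, 8, 12, 15, 20.
data = [12, 5, 8, 20, 15, 7]
print(five_number_summary(data))
```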