Stats Term
General Questions
Normal Distribution
Categorical/Quantitative Variable
100

 Takes values that are category names or labels.

(What type of variable)


Categorical variable

100

What is a mean?

The average of a group of numbers: the sum of the values divided by how many values there are.

100

What is the Z score formula?

z = (X - X bar) / standard deviation
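
A minimal Python sketch of the z-score calculation, assuming the usual definition (the value minus the mean, divided by the standard deviation). The function name `z_score` and the numbers are made up for illustration.

```python
def z_score(x, mean, sd):
    """Return how many standard deviations x lies from the mean."""
    if sd <= 0:
        raise ValueError("standard deviation must be positive")
    return (x - mean) / sd

# Hypothetical example: a score of 85 when the mean is 70 and the SD is 10.
z = z_score(85, 70, 10)
print(z)  # 1.5
```

A positive result means the value is above the mean; a negative result means it is below.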

100

How can we represent categorical data?


With a frequency table or relative frequency table.
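
One possible way to build both tables in Python, shown as a sketch. The list `colors` is made-up survey data, not from the source.

```python
from collections import Counter

# Hypothetical survey responses (made-up data).
colors = ["red", "blue", "red", "green", "blue", "red"]

counts = Counter(colors)   # frequency of each category
total = len(colors)

print(f"{'Category':<10}{'Freq':>6}{'Rel. freq':>11}")
for category, freq in counts.most_common():
    # Relative frequency = category count / total count.
    print(f"{category:<10}{freq:>6}{freq / total:>11.2f}")
```

The relative frequencies should always add up to 1 (or 100%).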

200

 Takes numerical values for a measured or counted quantity.

(What type of variable)


Quantitative variable

200

What summary statistics can be used to describe the center and position of a distribution of quantitative data?

Center: mean, median

Position: Q1 and Q3

200

What is a percentile?

Percentile is the percent of data values less than or equal to a given value.
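
A short sketch that applies this definition directly. The helper name `percentile_of` and the scores are hypothetical.

```python
def percentile_of(value, data):
    """Percent of data values less than or equal to value."""
    at_or_below = sum(1 for x in data if x <= value)
    return 100 * at_or_below / len(data)

# Made-up test scores.
scores = [55, 60, 65, 70, 75, 80, 85, 90, 95, 100]
p = percentile_of(80, scores)
print(p)  # 60.0
```

Here 6 of the 10 scores are at or below 80, so a score of 80 sits at the 60th percentile.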


200

How can we represent categorical data graphically?


Bar chart or pie chart

300

Type of quantitative variable that is countable, with gaps between possible values.

Discrete variable

300

What summary statistics can be used to describe the variability of a distribution of quantitative data?


Variability: range, IQR, and standard deviation

300

What does a Z score tell us?

How many standard deviations a value lies above or below the mean.

300

What are the important characteristics to discuss when describing the distribution of quantitative data?



Shape, center, variability

400

Type of quantitative variable that is not countable and has no gaps between possible values.

Continuous variable

400

What is the five-number summary and how do we use it to make a boxplot?

Minimum, Q1, median, Q3, maximum

The five numbers split the data into quarters: draw a box from Q1 to Q3 with a line at the median, and extend whiskers to the minimum and maximum.
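
A sketch of computing the five-number summary with Python's `statistics` module. It assumes the "inclusive" quartile method; hand methods taught in class can give slightly different Q1 and Q3. The function name and data are made up.

```python
import statistics

def five_number_summary(data):
    """Return (minimum, Q1, median, Q3, maximum).

    Quartiles use statistics.quantiles with method="inclusive";
    other quartile conventions may differ slightly.
    """
    ordered = sorted(data)
    q1, median, q3 = statistics.quantiles(ordered, n=4, method="inclusive")
    return ordered[0], q1, median, q3, ordered[-1]

values = [2, 4, 4, 5, 7, 9, 10, 12]  # made-up data
summary = five_number_summary(values)
print(summary)
```

In a boxplot, the middle three numbers mark the box and its center line, and the first and last mark the ends of the whiskers.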

400

What is the empirical rule?

About 68% of the data is within 1 SD of the mean.

About 95% of the data is within 2 SD of the mean.

About 99.7% of the data is within 3 SD of the mean.
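
These percentages can be checked against a normal model. The sketch below uses `statistics.NormalDist` from the Python standard library; the variable names are illustrative.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

within = {}
for k in (1, 2, 3):
    # Area between -k and +k standard deviations from the mean.
    within[k] = standard_normal.cdf(k) - standard_normal.cdf(-k)
    print(f"within {k} SD: {within[k]:.1%}")
```

The rule applies to roughly bell-shaped (normal) distributions, not to skewed data.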


400

How can we determine if a value in a data set is an outlier?


More than 1.5 × IQR below Q1 or more than 1.5 × IQR above Q3.

2 or more standard deviations away from the mean
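
A rough sketch of the 1.5 × IQR rule: a value is flagged if it falls below Q1 - 1.5 × IQR or above Q3 + 1.5 × IQR. The function name, the data, and the "inclusive" quartile method are assumptions for illustration.

```python
import statistics

def iqr_outliers(data):
    """Return the values outside the 1.5 x IQR fences."""
    q1, _, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    return [x for x in data if x < low_fence or x > high_fence]

values = [2, 4, 4, 5, 7, 9, 10, 40]  # made-up data with one large value
flagged = iqr_outliers(values)
print(flagged)  # [40]
```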

500

What is one graphical way to decide whether there is a relationship between two categorical variables?


Side-by-side or segmented bar graph

500

How does the shape of a distribution affect the relationship between the mean and median?


Skewed right distribution, mean > median

Skewed left distribution, mean < median

500

How can we use the z-scores to find the percent of data values left, right, and between?


Left: area from the table

Right: 1 - (area from the table)

Between: larger table area - smaller table area
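
The same three lookups can be sketched in Python, with `statistics.NormalDist().cdf` standing in for the table (it returns the area to the left of a z-score). The function names are hypothetical.

```python
from statistics import NormalDist

Z = NormalDist()  # standard normal: mean 0, SD 1

def area_left(z):
    """Proportion of values below z (what the table gives)."""
    return Z.cdf(z)

def area_right(z):
    """Proportion of values above z: 1 minus the left area."""
    return 1 - Z.cdf(z)

def area_between(z_low, z_high):
    """Proportion between two z-scores: larger left area minus smaller."""
    return Z.cdf(z_high) - Z.cdf(z_low)

print(round(area_left(1.0), 4))
print(round(area_right(1.0), 4))
print(round(area_between(-1.0, 1.0), 4))
```

Multiply any of these areas by 100 to get a percent of data values.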

500

Which summary statistics are resistant, and which are nonresistant?


Resistant: median, IQR

Nonresistant: mean, standard deviation, range
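
A small illustration of what "resistant" means, using made-up data: adding one extreme value moves the mean a lot but barely moves the median.

```python
import statistics

data = [10, 12, 13, 15, 16]       # made-up values
with_outlier = data + [100]       # same data plus one extreme value

mean_shift = statistics.mean(with_outlier) - statistics.mean(data)
median_shift = statistics.median(with_outlier) - statistics.median(data)

print(f"mean shifts by {mean_shift:.2f}")
print(f"median shifts by {median_shift:.2f}")
```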
