Statistical questions
The value/response that occurs the most often when collecting data.
Mode
Class A mean = 90.5
Class B mean = 92
Which class performed better on the quiz? Why?
Class B, because their mean was larger.
What is the main data display we learned about called?
Box and whisker plot
What is an outlier?
A value that is very different from the rest of the data
Qualitative questions
What is the median of a data set?
The midpoint, or the middle, of a data set. Found by ordering data least to greatest and finding the midpoint of the list.
Class A median = 88
Class B median = 84
Which class performed better on the quiz? Why?
Class A performed better because their median was higher.
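The median steps above (order the data, then find the middle) can be sketched in Python. This is only an illustration: the function name and the quiz scores are made up, and the sketch assumes the usual rule that an even-sized list uses the average of its two middle values.

```python
def median(values):
    """Return the middle value of a list of numbers."""
    ordered = sorted(values)          # order the data least to greatest
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:                    # odd count: one middle value
        return ordered[mid]
    # even count: average the two middle values
    return (ordered[mid - 1] + ordered[mid]) / 2


# Hypothetical quiz scores, not data from the class
scores = [84, 92, 88, 79, 95]
print(median(scores))  # 88
```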
What measures of center can we NOT see on a box plot?
Mean and mode
What are the two types of outliers?
Lower and upper
This is the term for "not fair," or, favoring one value/response over others.
Bias
What is the range of a data set?
The range is a measure of how spread out our data set is.
Found by calculating maximum - minimum
Class A range = 16
Class B range = 20
Which class had more variability in their quiz scores? Why?
Class B had more variability since their range was greater.
This could also mean that Class A was more consistent than Class B.
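As a quick check of the maximum - minimum rule, here is a small Python sketch. The score lists are invented so that their ranges come out to 16 and 20; they are not the real class data.

```python
def data_range(values):
    """Range = maximum - minimum."""
    return max(values) - min(values)


# Invented scores chosen to give ranges of 16 and 20
class_a = [80, 84, 88, 90, 96]
class_b = [76, 80, 84, 92, 96]

print(data_range(class_a))  # 16
print(data_range(class_b))  # 20

# The larger range suggests more variability
more_variable = "Class B" if data_range(class_b) > data_range(class_a) else "Class A"
print(more_variable)  # Class B
```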
What are the 5 numbers in our 5 number summary?
Min
Q1
Med
Q3
Max
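One way to compute the 5 number summary is sketched below. It assumes the "median of each half" method for Q1 and Q3, leaving the middle value out of both halves when the count is odd. Calculators and software sometimes use slightly different quartile rules, so results may not always match. The function names and data are made up for illustration.

```python
def median(ordered):
    """Median of a list that is already sorted."""
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) using the median-of-halves method."""
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]                                   # values below the median
    upper = ordered[half + 1:] if n % 2 else ordered[half:]  # values above the median
    return (ordered[0], median(lower), median(ordered), median(upper), ordered[-1])


print(five_number_summary([1, 3, 5, 7, 9, 11, 13]))  # (1, 3, 7, 11, 13)
```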
What is the lower fence formula?
Q1 - 1.5(IQR)
NAME THAT SAMPLING TECHNIQUE:
You generate a list of 1000 numbers, with each number representing a name. You pick 20 random numbers, representing 20 random names.
Simple Random
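The simple random scenario above can be imitated with Python's standard `random` module. This is a sketch only: the function name and the optional `seed` parameter are made up for illustration, and each number from 1 to 1000 stands in for a name.

```python
import random


def simple_random_sample(population_size, sample_size, seed=None):
    """Pick sample_size different numbers from 1..population_size at random."""
    rng = random.Random(seed)  # a seed makes the picks repeatable; None means unpredictable
    return rng.sample(range(1, population_size + 1), sample_size)


# 20 random numbers out of 1000, each one representing a name
picks = simple_random_sample(1000, 20)
print(sorted(picks))
```

Every number has the same chance of being chosen, and no number can be picked twice.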
What is the mean of a data set?
The mean of a data set is the average. It is the measure of center that takes every data value into account.
It is calculated by summing up all data values in the data list, and then dividing by the number of values that make up the list.
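The calculation described above (add up every value, then divide by how many values there are) looks like this in Python. The scores are made up for the example.

```python
def mean(values):
    """Sum of all values divided by the number of values."""
    return sum(values) / len(values)


# Hypothetical quiz scores
scores = [90, 85, 100, 87]
print(mean(scores))  # 90.5
```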
Which city got generally more snow? Why?
Ithaca got more snow than Harrison because Ithaca's box plot sits farther to the right on the number line, where the larger snowfall values are.
OR
Ithaca got more snow than Harrison because Ithaca's median was larger than Harrison's.
True or false: a box plot must float ABOVE the number line
TRUE
What is the upper fence formula?
Q3 + 1.5(IQR)
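The fence formulas can be used to check for lower and upper outliers. The sketch below takes Q1 and Q3 as inputs so that it does not depend on any one quartile method. The function names and numbers are made up for illustration.

```python
def fences(q1, q3):
    """Return (lower fence, upper fence) from the two quartiles."""
    iqr = q3 - q1                        # interquartile range
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr


def find_outliers(values, q1, q3):
    """Values below the lower fence or above the upper fence."""
    lower, upper = fences(q1, q3)
    return [v for v in values if v < lower or v > upper]


# Hypothetical quartiles: Q1 = 80, Q3 = 90, so IQR = 10
print(fences(80, 90))                                # (65.0, 105.0)
print(find_outliers([60, 78, 85, 92, 110], 80, 90))  # [60, 110]
```

Here 60 is a lower outlier and 110 is an upper outlier.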
NAME THAT SAMPLING TECHNIQUE:
You generate a list of 1000 numbers, with each number representing a name. You start at the 7th name and then choose every 5th number after your starting point.
Systematic
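The systematic scenario above can be sketched with a stepped range. The function name is made up for this example; the starting point (7) and the step (5) come from the card.

```python
def systematic_sample(population_size, start, step):
    """Start at position `start`, then take every `step`-th number after it."""
    return list(range(start, population_size + 1, step))


# Start at the 7th name, then choose every 5th name after that
picks = systematic_sample(1000, 7, 5)
print(picks[:4])   # [7, 12, 17, 22]
print(len(picks))  # 199
```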
What is the IQR of a data set?
The IQR, or Interquartile Range, is a measure of how spread out the middle 50% of the data set is.
It is calculated by finding quartile 1 and quartile 3, and finding the difference between them.
If a box plot is wider, that means it has ______ variability than a box plot that is narrow.
more
Each section of the box plot represents ___% of the data.
25%
Is NOT