PS4-5
PS6-7
PS8 & PS10
PS9
PS11
100
This type of question allows there to be multiple, valid answers - type of question needed to gather data.

Statistical questions

100
What is the mode of a data set?

The value/response that occurs the most often when collecting data.

100

Class A mean = 90.5

Class B mean = 92

Which class performed better on the quiz? Why?

Class B, because their mean was larger.

100

What is the main data display we learned about called?

Box and whisker plot

100

What is an outlier?

A value that is much different than the rest of the data

200
This type of question receives answers in words/categories - like blue, car, and yes/no. NOT used for collecting numerical data.

Qualitative questions

200

What is the median of a data set?

The midpoint, or the middle, of a data set. Found by ordering data least to greatest and finding the midpoint of the list.

200

Class A median = 88

Class B median = 84

Which class performed better on the quiz? Why?

Class A performed better because their median was higher.

200

What measure of center can we NOT see on a box plot?

Mean and mode

200

What are the two types of outliers?

Lower and upper


300

This is the term for "not fair," or, favoring one value/response over others.

Biased
300

What is the range of a data set?

The range is a measure of how expansive our data set is.

Found by calculating maximum - minimum

300

Class A range - 16

Class B range - 20

Which class had more variability in their quiz scores? Why?

Class B had more variability since their range was greater. 

This could also mean that Class A was more consistent than Class B.

300

What are the 5 numbers in our 5 number summary?

Min

Q1

Med

Q3 

Max

300

What is the lower fence formula?

Q1 - 1.5(IQR)

400

NAME THAT SAMPLING TECHNIQUE:

You generate a list of 1000 numbers, with each number representing a name. You pick 20 random numbers, representing 20 random names.

Simple Random

400

What is the mean of a data set?

The mean of a data set is the average. It's the point that takes all data values into account most efficiently.

It is calculated by summing up all data values in the data list, and then dividing by the number of values that make up the list.

400
Figure A:


Which city got generally more snow? Why?

Ithaca got more snow than Harrison because Ithaca's box plot is more to the right where larger values of snow are than Harrison's.

OR

Ithaca got more snow than Harrison because Ithaca's median was larger than Harrison's.

400

True or false: a box plot must float ABOVE the number line

TRUE

400

What is the upper fence formula?

Q3 + 1.5(IQR)

500

NAME THAT SAMPLING TECHNIQUE:

You generate a list of 1000 numbers, with each number representing a name. You start at the 7th name and then choose every 5th number after your starting point.

Systematic

500

What is the IQR of a data set?

The IQR, or Interquartile Range, is a measure of how spread out the middle 50% of the data set is.

It is calculated by finding quartile 1 and quartile 3, and finding the difference between them.

500

If a box plot is wider, that means it has ______ variability than a box plot that is narrow.

more

500

Each section of the box plot represents ___%

25%

500
If a data value lands ON a fence, it ________ an outlier

Is NOT