Describe this distribution in two terms:
Skewed left and bimodal
Find the mode of the distribution
There isn't one!
Categorical or quantitative?
The average test score on McCool’s first stats test was an 82.
Quantitative
Which of these box plots has the highest IQR, and how do you know?
The third one, because the box is the longest
Describe three instances in which categorical data can hold numerical values.
Intervals, jersey numbers, credit card numbers, categories of age, etc.
A collection of observations or events such as test scores, ages of people in a study, or response times
Data set
What measure of center should be used for this distribution and why?
Median because it is resistant to outliers (skews)
How is mean calculated?
Categorical or quantitative?
Weight
Age
Phone number
Blood type
Q, Q, C, C
Name the following graphs:
Bar graph
Histogram
Stemplot
Box plot
Statistics is ________ and ________ data in order to better understand the world.
Collecting, analyzing
A characteristic to be measured or observed
Variable
On your white board, draw a normal distribution and describe where IQR lies in the data.
Find the mean, median, and mode of this set of data:
4, 26, 38, 95, 114, 201, 223, 268, 300, 332
Mean: 160
Median: 157.5 or 158
Mode: None
Give me a real-life example, besides a yes or no question, in which you would have a two-variable distribution.
Various answers.
Describe the shape, best measure of center, best measure of spread, and whether there are noticeable outliers.
Skewed right (or symmetrical with outliers), median as the measure of center, IQR as the measure of spread, and one possible outlier at 90+.
What does the stem on a stem plot stand for?
Minimum & maximum values, the first digit of every number in the set
If Alicia volunteers herself to participate in a study to test the physical effects of a newly developed medicine, what is she?
An individual of the study
What kind of graph is this?
Line graph
True or false: the mean, median, and mode are all appropriate measures of center for a normal distribution with outliers.
False; you can only use the mean, median, and mode as the measure of center for a normal distribution with no outliers.
Calculate the cumulative frequency for this table and identify how many values there are total in the set:
(F)
57
21
34
45
98
112
61
32
27
(CF)
57
78
112
157
255
367
428
460
487
What is the formula for calculating a z-score, and what does a z-score tell us?
(X - Mean) / SD
The z-score is the amount of standard deviations a value lies from the mean
What concept that we recently learned about helps us estimate a percentile?
Cumulative relative frequency
What is this, and which formula is it found in?
The Greek symbol meaning "the sum of", and it's found in the formula for Standard Deviation
What measure of central tendency can be used for this distribution of data and why?
Mode, because there are only two variables
Find the mean, median, and mode of this set of data:
1, 13, 36, 37, 37, 38, 44, 57, 59, 80, 91, 91, 91, 97, 103, 113
Mean: 61
Median: 58
Mode: 91
Find the individuals, variables (labeled categorical and quantitative), and the average weight.
Individuals: Mario, Lucy, Paul, Mary, Cory
Variables: Height and weight (Q), gender and occupation (C)
Average weight: 201
Draw a box plot for this set of data:
120,125,144,147,152,179,200,213
Minimum: 120
Q1: 134.5
Median: 149.5 or 150
Q3: 190
Maximum: 213
What is the difference between population and sample data?
Population describes an entire group. Example: all people who live in texas, all people who live in Mansfield.
Sample is a portion of a group collected to make inferences about a population.
A subset of a population
Sample
Describe three different names for this distribution
Find the mean, median, and mode of this set of data:
-7, -6, -6, -5, -3, 0, 4, 8, 9, 11, 11, 11, 13, 14, 14, 18, 36, 43, 74, 91, 112, 148
Mean: 26.8 or 27
Median: 11
Mode: 11
Make a frequency table and a cumulative frequency table from the data:
0 0 1 1 1 2 2 2 3 3 3 3 3 4 4 5 6 7 9 9 9 9 9
0 - 2 - 2
1 - 3 - 5
2 - 3 - 8
3 - 5 - 13
4 - 2 - 15
5 - 1 - 16
6 - 1 - 17
7 - 1 - 18
9 - 5 - 23
Draw a CRF line chart for the data below.
Time spent on HW # of students RF
0-20 6 .375
21-40 4 .25
41-60 3 .188
61-80 2 .125
81-100 1 .063
CRF values:
.375
.625
.813
.938
1.00
Give me an example of an outlier in real life, and why they're important to identify.
Various answers: one specifically being the bank fraud example
What is this called, and what does it stand for?
X-bar stands for the mean
Describe this distribution two different ways and give a real-life instance in which it applies:
Symmetrical and uniform; rolling dice, flipping a coin, pulling from a deck of cards
Find the mean, median, mode, range, and IQR of this data.
Mean: 3.64
Median: 3
Mode: 1
Range: 9
IQR: 4
What does n represent?
The number of values in a set of data
Draw the formula for standard deviation
To find IQR, you use ________. Then, _____ it to _____ and __________ it from ____ to find the _____________. Anything outside of the _____________ is an _________.
1.5 x IQR
add
Q3
subtract
Q1
boundaries
boundaries
outlier
Describe standard deviation
Standard deviation is a measure of variability, or how much each value strays from the mean.