Find the mode of this set:
{4, 7, 7, 7, 9, 9, 10, 11, 12}
7
What is the code to create a histogram of heights from the cdc data?
histogram(~height, data=cdc)
Give an example of a question that is NOT a statistical question.
(answers vary)
What does "mutate()" do in R?
It mutates a data set, perhaps by creating a new variable/column.
Who was your substitute teacher most often school this year?
A - Mr. Strope
D - Mr. Math
Find the median of this set:
{4, 7, 7, 7, 9, 9, 10, 11, 12}
9
What is the code to create a scatterplot of height as a function of weight from the cdc data?
xyplot(height~weight, data=cdc)
What are the four parts of the data cycle?
What does "set.seed()" do in R?
It takes the randomness out of a random process, making it replicable.
Where has Mr. Engelhart's daughter been for the past two weeks?
Germany
Find the mean of this set:
{4, 7, 7, 7, 9, 9, 10, 11, 12}
76/9 or 8.4444444444...
What is the code to create this plot from the cdc data?
histogram(~height|gender, data=cdc)
What does SD stand for?
Standard deviation
What does "diff()" do in R?
It calculates the difference between to variable values
Name the 5 classwide campaigns that we did this year.
food habits
personality color
stress and chill
time perception
time use
Find the IQR of this set:
{4, 7, 7, 7, 9, 9, 10, 11, 12}
3.5
What is the code to create this graph from the cdc data set?
bwplot(~height|gender, data=cdc, layout=c(1,2))
What does it mean if the mean of a data set is more than 2 standard deviations from the simulation mean?
It did not occur by chance.
What does "sample()" do in R?
It lets us take a random sample of values of a variable.
How many labs did we do this year?
20
Find the MAD of this set. (Answers within 10% of actual value are accepted.)
{4, 7, 7, 7, 9, 9, 10, 11, 12}
1.95. (range of accepted answers: 1.9 - 2.0)
What is the code to create this graph from the starwars data set? (Hint: the terms on the right are all species.)
xyplot(mass~height, groups = species, data=starwars)
What is the function used to create a random simulation in R.
Shuffle
The code "favstats()" generates 9 statistics. Name 8 of them. (Give the names of the statistics, not the numerical values)
min Q1. median Q3 max
mean sd n missing
What two things are necessary for a statistical ivestigative question?