Find the mode of this set:
{4, 7, 7, 7, 9, 9, 10, 11, 12}
7
What is the code to create a histogram of heights from the cdc data?
histogram(~height, data=cdc)
Give an example of a question that is NOT a statistical question.
(answers vary)
What does "mutate()" do in R?
It mutates a data set, perhaps by creating a new variable/column.
Who was your substitute teacher during the first week of school this year?
A - Ms Koles
D - Mr. Math
Find the median of this set:
{4, 7, 7, 7, 9, 9, 10, 11, 12}
9
What is the code to create a scatterplot of height as a function of weight from the cdc data?
xyplot(height~weight, data=cdc)
What are the four parts of the data cycle?
What does "set.seed()" do in R?
It takes the randomness out of a random process, making it replicable.
What was the treatment given to the experimental group in the Mythbusters episode we watched this year?
Someone yawned in front of them. (Statistical question: Is yawning contagious?)
Find the mean of this set:
{4, 7, 7, 7, 9, 9, 10, 11, 12}
76/9 or 8.4444444444...
What is the code to create this plot from the cdc data?
histogram(~height|gender, data=cdc)
What does MSE stand for?
Mean squared error
What does "str()" do in R?
It gives the structure of a data object, including things like the variable names and types.
Name the 6 classwide campaigns that we did this year.
food habits
personality color
stress and chill
time perception
time use
sports media (A) or commute (D)
Find the IQR of this set:
{4, 7, 7, 7, 9, 9, 10, 11, 12}
3.5
What is the code to create this graph from the cdc data set?
bwplot(~height|gender, data=cdc, layout=c(1,2))
What does it mean if the correlation coefficient between two variables is close to -1?
There is a very strong negative linear relationship between the variables.
What does "sample()" do in R?
It lets us take a random sample of values of a variable.
How many labs did we do this year?
31
Find the standard deviation of this set. (Answers within 10% of actual value are accepted.)
{4, 7, 7, 7, 9, 9, 10, 11, 12}
2.455. (range of accepted answers: 2.2095 - 2.7005)
What is the code to create this graph from the starwars data set? (Hint: the terms on the right are all species.)
xyplot(mass~height, groups = species, data=starwars)
Give an example of when it might make sense to merge two data sets, and how you might do it using R.
(Answers vary)
The code "favstats()" generates 9 statistics. Name 8 of them. (Give the names of the statistics, not the numerical values)
min Q1. median Q3 max
mean sd n missing
What data science term rhymes with the name of an iconic Robin Williams character?
Outlier rhymes with Mrs Doubtfire!