Statistics
Bell Curve
Distributions
Linear Models
Misc
100

What's the difference between the population mean and the sample mean?

The population mean is the average over the entire group; the sample mean is the average over a sample drawn from that group

100

If I have a bell curve, what rule can I assume applies? (it starts with an e)

Empirical rule

100

These are the two components of a distribution

What are relative frequencies and possible values?

100

Suppose I record the price of every pizza in Iowa City and fit a model that shows Y = 15 + 3*X, where Y is price and X is the number of toppings. If I add a topping to a pizza, will the price go up $3?

No: correlation does not imply causation

100

What are the possible values of a relative frequency or proportion?

Between 0 and 1

200

What are the possible values of the correlation r? When does it apply?

-1 to 1. It applies when you have bivariate data where both variables are quantitative

200

I have a normal distribution with mean = 30 and standard dev = 5. What is AUC(x <= 30)?

0.5

200

The possible values when rolling two dice and counting the number of dots facing up

What are 2, 3, ... 12?
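As a quick check, the possible values can be listed by brute force. This is a minimal sketch that assumes two ordinary six-sided dice; the name `sums` is just for illustration.

```python
# Enumerate every sum you can get from two six-sided dice.
sums = sorted({a + b for a in range(1, 7) for b in range(1, 7)})
print(sums)
```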

200

Suppose your TA fits a linear model of Price and Toilet Paper Holders and gets an r-squared value of 0.62. Interpret the r-squared.

62% of the variation in price is explained, for whatever reason, by the variation in toilet paper holders

200

I have a dataset of coffee order sizes: small, medium, large. What is my data type?

Qualitative ordinal

300

Your TA is 61 inches tall. Suppose she wants to calculate her Z score. What information does she need for that formula?

Population mean height and standard deviation

300

I have that AUC (x <= 5.4) = 0.6. What is the 60th percentile?

5.4

300

The type of distribution where the mode is greater than the median, which is greater than the mean

What is a left skewed distribution?

300

Suppose I have an r-squared of 1 and I know 2 of my data points are as follows: 

(30, 35)

(31, 34)

What is the correlation?

-1
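The reasoning can be sketched in a few lines: an r-squared of 1 means every point sits on the line, so the two known points give the slope, and r takes the sign of that slope. The variable names here are illustrative only.

```python
import math

# The two known data points from the question.
(x1, y1), (x2, y2) = (30, 35), (31, 34)
r_squared = 1

# With r-squared = 1 every point is on the line, so two points give its slope.
slope = (y2 - y1) / (x2 - x1)

# r has the same sign as the slope and magnitude sqrt(r-squared).
r = math.copysign(math.sqrt(r_squared), slope)
print(r)
```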

300

Suppose your TA takes a sample of animals in her neighborhood. Out of 10 animals, 3 are tan dogs. Of the tan animals, 3/7 are dogs. What proportion of animals are tan?

HINT: You can use that P(A | B) = P(A and B) / P(B)

7/10 animals are tan
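The hint can be worked through with exact fractions. This is a small sketch with A = dog and B = tan; the variable names are made up for illustration.

```python
from fractions import Fraction

p_tan_and_dog = Fraction(3, 10)   # 3 of the 10 animals are tan dogs
p_dog_given_tan = Fraction(3, 7)  # 3/7 of tan animals are dogs

# Rearranging P(A | B) = P(A and B) / P(B) gives P(B) = P(A and B) / P(A | B).
p_tan = p_tan_and_dog / p_dog_given_tan
print(p_tan)
```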

400

Let's say your TA does a study of Starbucks orders. She checks if they ordered a specialty drink (yes/no) and if they bought a food item (yes/no). What is the formula for LIFT for specialty drink? Suppose my LIFT is 3.2. Interpret that. 

Prop(food = yes | specialty drink = yes) / Prop(food = yes)

Orders that include a specialty drink are 3.2 times as likely to include food as orders overall.
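A quick way to see the LIFT formula in action is to plug in counts. The numbers below are hypothetical, not from any real study, and were chosen so the LIFT comes out to 3.2.

```python
from fractions import Fraction

# Hypothetical counts, made up for illustration only.
total_orders = 100
food_orders = 25          # orders that include food
specialty_orders = 10     # orders that include a specialty drink
specialty_and_food = 8    # orders that include both

prop_food = Fraction(food_orders, total_orders)
prop_food_given_specialty = Fraction(specialty_and_food, specialty_orders)

# LIFT = Prop(food | specialty drink) / Prop(food)
lift = prop_food_given_specialty / prop_food
print(float(lift))
```

Note that the denominator is the proportion of all orders with food, not just the orders without a specialty drink.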

400

Suppose I have a bell curve and I know the middle 80% of the AUC is between 3 and 5. What percentile is 3?

10th percentile
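The symmetry argument (the leftover 20% splits evenly into two tails) can be checked numerically. This sketch assumes a normal curve centered at 4, the midpoint of 3 and 5; that assumption is not stated in the question.

```python
from statistics import NormalDist

# Assumption: a normal curve centered at 4, the midpoint of 3 and 5.
# Pick the standard deviation so that 5 sits at the 90th percentile,
# which puts 80% of the area between 3 and 5.
sigma = (5 - 4) / NormalDist().inv_cdf(0.90)
curve = NormalDist(mu=4, sigma=sigma)

middle_area = curve.cdf(5) - curve.cdf(3)
area_below_3 = curve.cdf(3)
print(round(middle_area, 4), round(area_below_3, 4))
```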

400

Income has a mean of 30,000 and a standard deviation of 40,000. What number summary should we use?

5 number summary

400

Suppose I have the equation for the LSE line Y = b0 + b*X. I also have the data point (a, c). What would I do to calculate the residual?

Plug a into the LSE line and subtract that from c
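In code, the residual is the observed value minus the predicted value. The function below is a minimal sketch; the line and the data point in the example are hypothetical numbers.

```python
def residual(b0, b, a, c):
    """Observed value c minus the value the line predicts at X = a."""
    predicted = b0 + b * a
    return c - predicted

# Hypothetical numbers: the line Y = 15 + 3*X and the data point (2, 22).
print(residual(15, 3, 2, 22))
```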

400

My histogram bin is [30,40). I have the following data:

30, 30, 35, 36, 37, 40, 40, 32, 40, 46

What is my density?

0.06
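The answer can be reproduced step by step: count the values in the bin, turn that into a relative frequency, and divide by the bin width. A short sketch, with illustrative variable names:

```python
data = [30, 30, 35, 36, 37, 40, 40, 32, 40, 46]
low, high = 30, 40  # the bin [30, 40) includes 30 but not 40

count = sum(low <= x < high for x in data)
width = high - low

# Density = relative frequency / bin width = count / (n * width).
density = count / (len(data) * width)
print(count, density)
```

The three 40s fall outside the bin because the right endpoint is excluded.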

500

Milhouse scored 0.75 standard deviations below the mean on B1. The correlation between B1 and B2 is 0.6. How many standard deviations above or below the mean should Milhouse expect to score on B2?

0.6*0.75 = 0.45 sd below the mean
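The prediction is just the correlation times the score in standard units. A tiny sketch, where a negative number means "below the mean":

```python
r = 0.6        # correlation between B1 and B2
z_b1 = -0.75   # Milhouse's B1 score in standard units

# Predicted standard units on B2 = r * standard units on B1.
z_b2 = r * z_b1
print(round(z_b2, 2))
```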

500

I can model the price of throwing a party with b + cX + (a+d)Y. Suppose b = 4, c = 1, a = 2, d = 3, x bar = 6, y bar = 1. What is my mean?

HINT: use the fact that mean(a + bX + cY) = a + b* x bar + c * y bar

HINT: be very careful about what you choose for a, b, and c! 

a = 4, b = 1, c = 5

4 + 6 + 5 = 15
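The same arithmetic, keeping the question's letters so the mapping to the hint's a, b, and c is explicit. The name `mean_cost` is only for illustration.

```python
b, c, a, d = 4, 1, 2, 3
x_bar, y_bar = 6, 1

# In b + cX + (a+d)Y the intercept is b, the X coefficient is c,
# and the Y coefficient is a + d.
mean_cost = b + c * x_bar + (a + d) * y_bar
print(mean_cost)
```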

500

Let's say your TA does a study of Starbucks orders. She checks if they ordered a specialty drink (yes/no) and if they bought a food item (yes/no). What are the possible values of the conditional distribution of Specialty Drink | food item?

Specialty drink yes/no

500

Let's say I need to calculate a slope and I know r, sx, and sy. What slide do I need to go to for the formula?

L4.1 slide 16

500

This is the time your TA is in bed and you shouldn't count on an email response

What is 10pm?
