Descriptive Statistics
Probability Rules
Distributions and CLT
Inference for Proportions
100
a statistic that takes the sum of all the data points and divides by the total number of data points.
What is a sample mean?
100
P(A)+P(B)-P(A and B)
What is the general addition rule for P(A or B)?
100
The name of a distribution that is symmetric and captures approximately 68%, 95% and 99.7% when 1, 2 and 3 standard deviations away, respectively.
What is the Normal Distribution?
100
The square root of [pHat * qHat / n].
What is the standard error (standard deviation) of pHat used for a confidence interval?
200
A measure of spread that finds the distance between the middle 50% of data points.
What is the IQR or Interquartile Range?
200
P(A and B)/P(B)
What is the formula for P(A|B)?
200
The mean of all sample means is equal to this?
What is the population mean?
200
The test statistic formula for a one-proportion z-test?
What is z= (pHat - p0)/sqrt(p0q0/n)
300
A measure of central tendency that is robust to skewed data and outliers.
What is the median?
300
You must multiply P(A) by this in order to find the P(A and B)
What is P(B|A)?
300
The standard deviation of the population divided by the square root of the sample size, n, gives us this.
What is the standard deviation of sample means with sample size n?
300
The z* you would use for an 90% confidence interval.
What is 1.645 or 1.64?
400
The "average" distance from the mean of a distribution.
What is a standard deviation?
400
If you show that P(A|B) = P(A), we know this is true.
What is A and B are independent?
400
The sample size needed for an approximately normal distribution given the population is not already normal.
What is n >= 30?
400
The condition that says our sample size is large enough for a z-test?
What is np>=10 and nq>=10?
500
A calculation that uses the values from a set of SAMPLE data.
What is statistic?
500
It can be found using the square root of [Var(X)+Var(Y)], when X and Y are independent.
What is the SD(X+Y) or SD (X-Y)?
500
This tells us that the mean of our sample means is the mean of our population, the standard deviation of sample means is the standard deviation of the population divided by the square root of n, AND if the sample size is large, the distribution of sample means will be approximately normal.
What is the central limit theorem?
500
The conclusion we would use when we get a p-value of 0.06 with a level of significance of 0.05.
What is "Since the p-value is greater than 0.05, we fail to reject H0. There is insufficient evidence to show [our alternative true] or [null is false]."
M
e
n
u