Unlike continuous quantitative variables, these variables take only specific values that differ by some minumum amount.
True or False: if r=0, there is no association between the variables.
What is "False"?
A few things could cause a r=0, even when an association occurs, such as an influential point far from the trend of the rest of the data, or a non-linear association.
This is a study which measures a response that has already taken place within a sample, usually by a survey.
What is a retrospective study?
This is the quantity given by the formula
(# of events in A) / (# of events in S)
This is the expected number of heads you get in ten tosses of a fair coin.
This is how you should bet if your friend is tossing a fair coin and it lands heads ten times in a row.
What is "bet that your friend is lying, and the coin isn't really fair." If it is a fair coin, it doesn't matter how you bet- neither outcome is more likely, because tosses of a fair coin are independent.
This is what the residuals plot looks like when a linear model is appropriate for the data set.
What is "no apparent pattern"?
A residuals plot with an apparent pattern or curve is evidence that a linear model is not appropriate for the data.
This survey sampling method separates the sampling frame into homogeneous groups, then randomly selects subjects from within each group.
What is stratified sampling?
What are complementary events?
This is the calculator expression used to find the probability that a binomial random variable X is less than or equal to a.
What is binomialcdf(n,p,a)?
What is one?
A regression analysis of company profits and the amount of money the company spent on advertising found that r2=.72. Which of these is true?
I. This model can correctly predict the profit for 72% of companies.
II. On average, about 72% of a company’s profit results from advertising.
III. On average, companies spend about 72% of their profits on advertising.
What is "none of these?"
r2=.72 imploes that 72% of the variation in the profits is accounted for by the relationship between advertising spending and profits.
In an experiment, the purpose of blocking is to reduce the variability in the response due to the presence of one of these.
What is a confounding variable?
This described A and B whenever P(A and B) = P(A)*P(B).
What are independent events?
Otherwise, P(A and B) = P(A|B)*P(B).
This is the mean of the random variable X + Y if X~N(0,1) and Y~N(0,1).
What is 0?
In a normal distribution, this is how many standard deviations the 95th percentile is from the mean.
What is 1.645?
Trick question. The answer is not "two." 95 percent of the data is within two standard deviations of the mean. The 95th percentile of the standard normal distribution is 1.645.
invNorm(.95, 0, 1)= 1.645
When the correlation between variables x and y is r=0.4, this is the correlation between -y and 2x.
What is r=0.4? Linear changes (multiplication/division, addition/subtraction, switching values) do not alter the correlation between two variables.
This type of experiment design does not incorporate blocking, but randomly distributes subjects among treatment groups.
What is a completely randomized design?
In a contingency table, this is how to find the probability of A given B, when A is a column heading and B is a row heading.
What is the number in the cell in column A and row B divided by the total of row B?
This is the variance of the random variable X + Y if X~N(0,1) and Y~N(0,1).
What is 1.414?
Consider the data summary below:
min= 23, Q1= 40 , med= 48, Q3= 54, max= 61
These are thresholds for outliers in this data set.
What are 27 and 69?
IQR = 54 - 40 = 14
1.5*IQR = 21
med - 1.5*IQR = 27, med + 1.5*IQR = 69
If the constant changes in the explanatory variable reduce the response by the same percent, then this is how the data should be reshaped to make the relationship linear.
This is why blocks should be homogeneous with respect to the blocking variable.
What is "so that differences in response are only compared between subjects for which the value of the confounding variable is similar."
If blocks were heterogeneous, each block has the same problem as the whole sample: it's difficult to tell if variation in response is due to the factor or the confounding variable.
In a standard 52 card deck, this is the probability of drawing a face card or a ten, then drawing an ace.
What is (12/52 + 4/52)(4/51) = .024?
What is 1-binomcdf(5, 12/52, 2) = .084
It's Bernoulli:
P(S) = 12/52, P(F) = 1-12/52
P(S) is constant
Since cards are replaced, trials are independent.
n = 5,
we seek P(X >=3) = 1-P(X<3) = 1-P(X<=2)