Model & Assumptions
Coefficients & Interpretation
Inference
Model Fit
Coding
100

The name for the variable we are trying to explain or predict.

What is the response variable?

100

The slope of the regression line represents this kind of change.

What is the expected change in the mean response for a one-unit increase in X?

100

This parameter is tested in H0:β1= 0.

What is the slope?

100

This statistic tells us the proportion of variation in Y explained by X.

What is R^2?

100

Create pseudocode for estimating a simple linear regression model predicting Y from X from data set called data1.

What is lm(Y, X, data = data1)?

200

This assumption about the errors is required for least squares estimates to be unbiased.

What is mean zero errors?

200

If the intercept is 50 in the regression of exam score on hours studied, interpret it in context.

What is "The predicted exam score for a student who studied 0 hours is 50"?

200

A 95% CI for the slope that includes 0 means this.

What is "There is not strong evidence of a linear relationship"?

200

If R^2=0.70, interpret in context.

What is "70% of the variation in Y is explained by X"?

200

This function is used to create all diagnostic plots for a simple linear regression model. 

What is plot()?

300

Name two of the three conditions that must hold for a linear regression model to be valid.

What are linearity, independence, and constant variance?

300

A negative slope indicates this kind of relationship between X and Y.

What is a negative (or inverse) relationship?

300

The test statistic for slope is B_1 divided by this.

What is its standard error?

300

A regression of GPA on study hours has r=0.6. What is R^2?

What is 0.36 (36%)?

300

The line of code that we should use if randomizing or using a random number generator. 

What is set.seed()?

400

Outliers with extreme X-values that strongly influence the regression line are called this.

What are influential points?

400

A regression of body fat percentage on BMI yields slope = 0.9. Interpret this value.

What is "For each 1-unit increase in BMI, predicted body fat percentage increases by 0.9 on average"?

400

In regression, a prediction interval is always (wider/narrower) than a confidence interval at the same X.

What is wider?

400

SSE = 150 and SST = 600. Compute R^2.

What is 0.75?

400

Load the MASS library and access airquality dataset.

Create a simple linear regression on ozone with Wind as the predictor.

From the coefficients table of the model's summary, what is the estimated values of ozone?

96.8729

500

If residuals vs. fitted values show a curve, which regression assumption is violated?

What is linearity?

500

A student’s residual is -12. What does this mean about their observed and predicted values?

What is "Their actual Y is 12 lower than predicted"?

500
Name all three inference tests, as well their null hypotheses.

What are: 

t-test for slope: B_1 = 0

anova test: B_1 = 0

t-test for correlation: rho = 0

500

Residuals show increasing spread with fitted values. Which regression assumption is violated?

What is constant variance?

500

Load the MASS library and access airquality dataset.

Create a simple linear regression on ozone with Temp as the predictor.

From the coefficients table of the model's summary, what is the adjusted R-Squared of the model?

0.4832