Interpreting Values
Predicted Values and Residuals
LSRL
Describing
Review
100

A new process designed to increase the temperature inside steel girders shows great promise. Using an initial temperature (degree C), we can predict the final temperature (degree C) by using a slope of 1.095. Interpret that value.

For each additional degree in initial temperature there is a 1.095 degree predicted increase in final temperature.

100

A scatterplot displays the association between the size of a diamond (in carats) and its retail price (in Singapore dollars) for 48 observations. Use the model to predict the cost of a 0.25 carat diamond.

cost-hat=197+ 4,000(size)

1,197 Singapore Dollars

100

A bivariate set of data relates the amount of annual salary raise and previous performance rating. The least-squares regression equation 

 y-hat = 1,400 + 2,000x , 

where ^ y is the estimated raise and x is the performance rating. Identify the slope value and y-intercept value.

slope: 2,000

y-int: 1,400

100

This graph shows the relationship between two quantitative variables.

What is a scatterplot?

100

Bias introduced in a sample when individuals can choose on their own whether to participate is the sample.

What is voluntary response bias?

200

A new process designed to increase the temperature inside steel girders shows great promise. Using an initial temperature (degree C), we can predict the final temperature (degree C). The equation has a y-intercept of 37. Interpret the value.

There's a predicted final temperature of 37 degrees C, when the initial temperature is 0 degrees C.

200

Using the model:

cost-hat=197+ 4,000(size)

Calculate the residual for a diamond with 0.31 carats and a cost of 1,500 Singapore Dollars

1500-1437= 63

200

What is the sign of the slope? What is your estimate for the correlation?

Negative!


Anywhere from -0.8, -0.6 ish 

200
These are the four features that you describe when describing a scatterplot.
What is direction, form, strength, and outliers?
200

These are the four principles of experimental design. List the three that are imperative. 

What are control, randomization, replication, and blocking.

300

What does r= -.73 tell you about the correlation?

There's a strong, negative linear association between variables

300

Use the residual plot to answer whether a linear model is appropriate in this case.

A strong pattern indicates a linear model is NOT appropriate

300

If the average time to complete a task is 27.7 seconds,  with a standard deviation of 9.3 and the average age in years of participants is 34.1 with a standard deviation of 5.4, find the slope for the LSRL that would model using age to predict the time it would take to complete the task. Given r = - 0.83

b1= -0.482

300
This variable goes on the x-axis and this variable goes on the y-axis.
What is the explanatory on the x-axis and the response on the y-axis?
300
These are the four feature you describe when describing a distribution.
What is center, shape, spread, and outliers?
400

98% of the variation in weight gain can be explained by the variation in calories consumed. Name this value, and find correlation.

r-squared or coefficient of determination

r= -0.99

400

What is it called when you make a prediction using your model for an x-value outside your data set?

Extrapolation

400

What point does EVERY LSRL model go through? 

( x-bar, y-bar)

400

You would look at this plot to check to see whether a linear regression is appropriate.

What is the residual plot?

400
Using N(1152, 84), the Normal model for weights of Angus steers, this percent of steers weigh over 1250 pounds.
What is 12%?
500

What does a positive residual indicate?

The linear model has made an under prediction or the actual value is larger than the predicted value. 

500

  The smallest tree has an actual height of 312 cm. Find its predicted value, without the model.

-40=312-p

p=352 cm

500

The proper name for the line-of-best-fit is the "Least Squares Regression Line"... What goal does this line achieve that gives it its name?

It minimizes the sum of the squared residuals

500

This value is the fraction on variance of response variable accounted for by the variation of the explanatory variable.

What is r squared or coefficient of determination?

500

Given the following 5 number summary, state the shape of the distribution: 1, 10, 15, 16, 17

What is skewed to the left?