Vocab
LSRL, residual, & bounds
r and R-squared
Interpreting Slope & Y-intercept
Miscellaneous
100
A measure of how far a prediction is from what is actually observed.
What is a residual?
100
The LSRL for an experiment is y = 1.04x - 0.4. (4, 3.5) was one of the data points collected during the experiment. Calculate the residual.
Residual = -0.26
100
Interpret what a correlation coefficient of r = 0.9 means.
What is strong, positive association?
100
y = 1.25x + 10, where x is the amount of fertilizer in mL and y is the height of the plant (cm). Interpret the y-intercept.
When zero mL of fertilizer are used, the plant is expected to be 10 cm tall.
100
Describe the association in the graph comparing temperature and ice cream sales. https://www.mathsisfun.com/data/images/scatter-ice-cream1.gif
positive, linear, strong association with no apparent outliers
200
A unique line that has the smallest possible value for the sum of the squares of the residuals
What is the Least Squares Regression Line (LSRL)?
200
The LSRL is y = 4.2x + 6.5. The largest residual is 1.7. What are the equations for the upper and lower bounds?
Upper bound: y = 4.2x + 8.2 Lower bound: y = 4.2x + 4.8
200
If the correlation coefficient is zero, what does that mean about the association and the LSRL?
There is no association and the LSRL is a horizontal line.
200
y = 1.25x + 10, where x is the amount of fertilizer in mL and y is the height of the plant (cm). Interpret the slope.
For every additional mL of fertilizer used, the height of the plant increases by 1.25 cm.
200
Is a linear model appropriate for the data shown in the residual plot? Explain. http://oregonstate.edu/instruct/st352/kollath/handouts/simplereg/residuals_files/image006.gif
Yes, because there is no apparent pattern in the residual plot.
300
The measure of the strength of the association between two variables.
What is the correlation coefficient (r)?
300
The LSRL is y = 2.4x + 5.1. The largest residual is 0.8. What is an appropriate range of predictions for x = 4.
What is 13.9 to 15.5
300
What must be true for the correlation coefficient to be -1?
All of the points are on the LSRL and the slope is negative.
300
Given the interpretations of slope and y-intercept, write the equation of the LSRL: When the length of the paint on a pencil is 0 cm, the weight of the pencil is 2 grams. For every additional cm of paint, the weight of the pencil increases by 1.4 grams.
y = 1.4x + 2
300
How do you calculate the residuals on your graphing calculator? Write out the steps.
**You already inputted in all of your data and run linear regression. 1. Stat 2. Edit 3. Move to L3 4. Arrow up to L3 and press enter 5. 2nd Stat 6. Choose Resid list and press enter
400
A hidden variable that can explain the association between two other variables that are linked.
What is a lurking variable?
400
Determine the LSRL, lower bound, and upper bound for the following data set: (2, 1.9); (4, 3.5); (6, 6.3); (8, 6.9); (10, 10.6)
LSRL: y = 1.04x - 0.4 Lower Bound: y = 1.04x - 1.42 Upper Bound: y = 1.04x + 0.62
400
Given an LSRL of y = -0.5x + 17 and an R-squared value of .68, determine and interpret the correlation coefficient. Round to the nearest hundredth.
What is r = -0.82? This means there is a moderately strong, negative association.
400
y = 0.41x -14 where x is temperature (degrees F) and y is the number of people at the park (in 1000s). Interpret the y-intercept.
When the temperature is 0 degrees Fahrenheit, there are -14000 people in the park. (Doesn't make sense)
400
(3, -0.5) is the point furthest from the LSRL y = -1.2x + 4.9 Determine the upper and lower bounds for this data set.
Upper bound: y = -1.2x + 6.7 Lower bound: y = -1.2x + 3.1
500
The four categories that are used to describe association.
What is form, direction, strength, and outliers?
500
Determine the LSRL, lower bound, and upper bound for the following data set: (1, .9); (2, 2.5); (3, 5.3); (4, 5.9); (5, 9.6)
LSRL: y = 2.08x - 1.4 Lower Bound: y = 2.08x - 2.42 Upper Bound: y = 2.08x - 0.38
500
y = 1.25x + 10, where x is the amount of fertilizer in mL and y is the height of the plant (cm). r = 0.92 Determine and interpret R-squared.
What is R-squared = 0.8464 About 85% of the variability in plant height can be explained by a linear relationship with the amount of fertilizer used.
500
y = 0.41x -14 where x is temperature (degrees F) and y is the number of people at the park (in 1000s). Interpret the slope.
For every additional degree F in temperature, the number of people at the park increases by 410. When the temperature increases by 1 degree F, the number of people at the park increases by 410.
500

When comparing age and height, Eva calculated R-squared to be 0.76. Write a sentence to explain what R-squared means in this context and provide at least one other factor that can explain the variability in height.

76% of the variability in height can be explained by a linear relationship with age. The other 24% could be explained by genetics, general health, environmental factors, etc.

M
e
n
u