Lydia wanted to know if sons of taller fathers tend to be taller. Here are the heights of the fathers and sons in Lydia's sample:
Weak positive linear association with potential outliers
A long jump competition took place recently at a local high school. The coach is interested in performing as well as possible next time, so he is looking at the relationship between height and distance jumped (both measured in inches). He uses height to predict distance.
The least square solution is:
intercept = 6.4285, and slope = 1.0534.
What is the estimated regression line (in context)?
predicted distance = 6.4285 + 1.0534(height)
A person researching education levels within a company takes the full employee list and applies a random number algorithm to this in order to select people to interview. |
Simple Random Sample
What is this an example of?
Randomized Block Design
________________ refers to the fact that different random samples of the same size from the same population produce different estimates.
sampling variability
Abigail gathered data on different teams' winning percentages and the average yearly salary of their head coaches (in millions of dollars). Here are her results:
weak positive linear association with potential outliers
A long jump competition took place recently at a local high school. The coach is interested in performing as well as possible next time, so he is looking at the relationship between height and distance jumped (both measured in inches). He uses height to predict distance.
Interpret the slope when b0 = 1.0534.
As height increases by one inch, distance increases by 1.0534 inches.
A high school student who is studying year-ten attitudes in the school uses registration tuition classes as strata and studies a random selection of students from each of these classes. |
Stratified Sampling
A drug company wants to test a new pill that should make people's nails grow twice as quickly as normal. How can they make the study a blind study?
Give one group of people the actual pill and one group a placebo, but don't tell the participants which pill they are receiving.
When the observed results of a study are too unusual to be explained by chance alone, the results are called __________________.
statistically significant
The manager of a factory wants to see if there is a relationship between safety and output. For 111111 days, they track the output of the factory and give each day a safety rating. Here are results:
Strong positive linear association with no outliers
The city’s transportation department is interested in studying the relationship between the temperature and the number of passengers that ride the main bus line in order to better serve their customers. The manager recorded the temperature at the beginning of the hour, and then had a bus driver record the number of passengers that boarded the bus throughout the hour.The manager wishes to predict the number of passengers using temperature.
Variable Mean SD
temperature 43.2 5.94
passengers 175.2 23.52
Correlation: r = 0.9984
Use the above information to obtain a least squares regression equation (in context).
predicted passengers = 4.43 + 3.953(temperature)
In a study of the opinions of homeless across a country, rather than study a few homeless people in all towns, a number of towns are selected and all of homeless people are interviewed in each one. |
Cluster Sampling
Half of the participants are given the medication, and the remaining participants are given a placebo. By utilizing a placebo, the study remains free from being altered as neither the participants nor the researcher knows who is receiving the medication. This provides valid and reliable results.
What is this an example of?
Double blind experiment
A ____________ is the difference between the actual value of y and the value of y predicted by the regression line.
residual
Ian wants to figure out there is a relationship between how much a piece of furniture weighs and how much it costs. He samples some items and comes up with the following scatterplot:
Moderately strong positive linear association with no outliers
Predict the number of passengers when temperature is at 45 degrees for the least squares regression line:
passengers = 4.43 + 3.953 temperature
182 passengers (182.3 exact)
A study of people going to night-clubs first determines that there are about 250-300 people in the club (due to fire regulations). A sample size of 30 is selected, giving an interval of 300/30 = 10. A random number between 1 and 10 is generated and comes up with 7. Starting with the 7th person to enter the club, every 10th person is given a brief interview. |
Systematic Sampling
What is this an example of?
Matched Pairs Design
The _______________________ measures the size of a typical residual. That is, s measures the typical distance between the y values and the predicted y values.
standard deviation of the residuals s
An author wants to find out if there is a relationship between how many copies of a book they sell and how many ratings below three stars the book gets. Here are their results:
strong negative linear association with potential outliers
temperature passengers
42 173
37 149
46 185
30 123
50 201
43 174
43 175
46 188
46 186
49 198
Given the information above, can you predict the number of passengers when temperature is at 65◦F?
65◦F falls beyond the x data range. If you used the above regression equation to ‘predict’ the number of passengers, then you basically made a unproved assumption. (Extrapolation)
A group of students in a high school do a study about teacher attitudes. They interview teachers at the school, a couple of teachers in the family and few others who are known to their parents. |
Convenience Sampling
Twenty men and 20 women with migraine headaches were subjects in an experiment to determine the effectiveness of a new pain medication. Ten of the 20 men and 10 of the 20 women were chosen at random to receive the new drug. The remaining 10 men and 10 women received a placebo. The decrease in pain was measured for each subject. The design of this experiment is
randomized block, blocked by gender
A _______________ measures an outcome of a study.
response variable