All About Outliers
Understanding the Data
All for the Plot
Its Just a Phase (Shifts and Transformations)
Random!
100

A point that does not follow the general pattern of the data. 

What is an outlier?

100

Type of data that contains information on each of the two variables.

What is bivariate data?


100

The difference between the predicted value and the observed value.


What is a residual?

100

This plot allows you to choose the model you should use based on the most randomly scattered points. 

What is a scatter plot?

100

The variable whose values are explained/predicted by another variable.

What is the response variable?

200

Data points that have a significant impact on the results of a statistical model.  

What is an influential point? 

200

The variable that predicts, explains, or influences the response variable 

What is the Independent Variable? 

200

A graphical technique that attempts to show the relationship between a given independent variable and the response variable, given that other independent variables are also in the model. 

What is a residual plot?

200

A formula that graphs the independent and dependent variables

What is ŷ=b+ax?

200

The symbol that represents the typical value away from the LSRL 


What is S?

300

By removing an outlier, the R and R^2 values of a scatterplot will...

What is increase?

300

As the X-Values increase, the Y-Values tend to decrease

What is a negative correlation? 

300

A linear model that minimizes the sum of the squared residuals between the data and the model. Also called “line of best fit”.


What is a Linear Squares Regression Line (LSRL)?

300

The Goal of applying transformations to variables

What is linearity?

300

Describes a scatterplot with a correlation coefficient of r = 0 


Random, not associated

400

Data points that lie far away from the center of the data significantly impact the regression line. Also known as high-leverage points 

What is a horizontal outlier? 

400

A high degree of association between two variables, where changes in one variable tend to be closely followed by changes in the other 

What is a strong correlation?

400

A statistical measure in the regression model that determines the proportion of variance in the dependent variable that can be explained by the independent variable. 

What is R-Squared?

400

The two ways to transform variables to obtain a better fit. 

What are log and exponential?

400

The values of two or more variables tend to vary together in a predictable way. There is a pattern in the scatterplot that is too strong to be likely to arise by chance. 

What is association?

500

The points that cause the slope of the LSR to decrease. 

What is point A/B?

500

Given the data, a student with no stress level would have a score of...

Hint...the Y-Intercept

What is 91.67?

500

In a scatter plot, what does a curved pattern in the residual plot suggest about the linear regression model?

What is an inappropriate fit or low r squared value?

500



Transformation used to achieve linearity on the following table...

what is log(y)

500

A regression equation of a scatterplot is below: y = 2.5 + 8.3x. An actual point from this data set is (5,40). This is the residual for this given data point.

 

What is -4?