What does the correlation coefficient tell us about a data set?
It measures the strength and direction of the linear relationship between two variables, on a scale from −1 to 1.
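A minimal sketch of how such a coefficient can be computed, assuming the card means Pearson's r (the most common correlation coefficient; the card does not name one). The function name `pearson_r` and the sample data are made up for illustration.

```python
import math

def pearson_r(xs, ys):
    """Return Pearson's correlation coefficient for paired data."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of the products of each pair's deviations from the means
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations for each variable
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical data: hours studied vs. test score
hours = [1, 2, 3, 4, 5]
scores = [52, 58, 65, 70, 78]
r = pearson_r(hours, scores)
print(round(r, 3))  # close to 1: a strong positive relationship
```

A value near 1 or −1 indicates a strong relationship, a value near 0 a weak one, and the sign gives the direction.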
What is primary data?
Information you gather yourself.
What is quartile 1 (Q1)?
The median of the lower half of the data set.
What is categorical data?
Data recorded as labels or categories (words).
What is the mean of a data set?
The average value, found by adding all the values and dividing by the number of values.
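As a quick illustration, the calculation can be sketched in a couple of lines. The data values are made up.

```python
values = [4, 8, 6, 5, 7]          # hypothetical data set
mean = sum(values) / len(values)  # add everything, divide by the count
print(mean)  # 6.0
```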
What does a positive correlation mean?
As one variable increases, the other variable tends to increase as well.
What is secondary data?
Information you find or look up from other sources.
What is quartile 2 (Q2)?
The median of the entire data set.
What is numerical data?
Data recorded as numbers.
What is the median of a data set?
The middle value when the data is written in order (or the average of the two middle values when there is an even number of values).
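One possible way to sketch this in code, covering both the odd and even cases. The function name `median` and the sample values are illustrative only.

```python
def median(values):
    """Return the middle value of the sorted data."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd count: a single middle value exists
        return ordered[mid]
    # Even count: average the two middle values
    return (ordered[mid - 1] + ordered[mid]) / 2

print(median([7, 3, 9, 1, 5]))  # 5
print(median([7, 3, 9, 1]))     # 5.0
```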
What does a negative correlation mean?
As one variable increases, the other variable tends to decrease.
What is two-variable data?
Data that is written in ordered pairs (x, y).
What is quartile 3 (Q3)?
The median of the upper half of the data set.
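The three quartiles can be sketched together using the "median of each half" approach from the cards. This sketch assumes one common classroom convention: when the number of values is odd, the overall median is left out of both halves. Other conventions exist and can give slightly different answers. The function names and data are hypothetical.

```python
def _median(ordered):
    """Median of a list that is already sorted."""
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2

def quartiles(values):
    """Return (Q1, Q2, Q3) using the median-of-halves method."""
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]
    # Assumption: for an odd count, skip the middle value itself
    upper = ordered[half + 1:] if n % 2 == 1 else ordered[half:]
    return _median(lower), _median(ordered), _median(upper)

q1, q2, q3 = quartiles([2, 4, 4, 5, 7, 8, 9, 10, 12])
print(q1, q2, q3)  # 4.0 7 9.5
```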
What is single-variable data?
A data set that contains only one list of values (one variable).
What is the mode of a data set?
The value(s) that appear most often in the data set. A data set can have more than one mode.
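A short sketch using Python's standard library, which can report every mode when there is a tie. It assumes Python 3.8 or later, where `statistics.multimode` is available. The data is made up.

```python
from statistics import multimode

# Two values tie for most frequent, so both are modes
modes = multimode([3, 5, 3, 8, 5, 1])
print(modes)  # [3, 5]

# A single most frequent value gives a one-item list
single = multimode([2, 2, 7])
print(single)  # [2]
```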
What is interpolation?
Making a prediction between two known data points.
What is the difference between continuous and discrete data?
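Continuous data is measured and can take any value within a range, including decimals (for example, height). Discrete data is counted and can only take separate values, usually whole numbers (for example, the number of students).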
What is the range of a data set?
The difference between the highest and lowest values.
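In code this is a single subtraction; the sketch below uses made-up values.

```python
values = [12, 7, 3, 15, 9]               # hypothetical data set
data_range = max(values) - min(values)   # highest minus lowest
print(data_range)  # 12
```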
What is data?
Information collected through observations, surveys, or measurements.
What is extrapolation?
Making a prediction beyond the range of the known data.
What is regression?
The process of finding the line of best fit (trend line) for a scatterplot.
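A minimal sketch of one common way to find such a line, assuming the least-squares method (the card does not say which method is meant). It also uses the fitted line for an interpolation and an extrapolation. The names `fit_line` and `predict` and the data are hypothetical.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line y = slope*x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    # The least-squares line always passes through (mean_x, mean_y)
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Made-up data that lies exactly on y = 2x + 1
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]
slope, intercept = fit_line(xs, ys)

def predict(x):
    return slope * x + intercept

inside = predict(2.5)   # interpolation: x is between known points
outside = predict(8)    # extrapolation: x is beyond the data
print(slope, intercept, inside, outside)  # 2.0 1.0 6.0 17.0
```

Predictions made by extrapolation are generally less reliable, because nothing in the data confirms that the trend continues.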
What is the interquartile range (IQR)?
The difference between Q3 and Q1 (Q3 − Q1).
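A brief sketch using the standard library (Python 3.8 or later). Note the assumption: `statistics.quantiles` with its default "exclusive" method is one of several quartile conventions, so on some data sets it may differ slightly from a hand calculation. The data is made up.

```python
from statistics import quantiles

data = [2, 4, 4, 5, 7, 8, 9, 10, 12]   # hypothetical data set
q1, q2, q3 = quantiles(data, n=4)      # n=4 splits the data into quarters
iqr = q3 - q1
print(iqr)  # 5.5
```

The IQR describes the spread of the middle half of the data.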
What is a histogram used for?
Displaying numerical data using intervals (bins) to show frequency.
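The counting behind a histogram can be sketched without any plotting library. This assumes equal-width bins that include their lower edge and exclude their upper edge; `bin_counts` and the height data are made up for illustration.

```python
def bin_counts(values, start, width, n_bins):
    """Count how many values fall into each equal-width bin."""
    counts = [0] * n_bins
    for v in values:
        index = int((v - start) // width)
        # Ignore values that fall outside the chosen bins
        if 0 <= index < n_bins:
            counts[index] += 1
    return counts

# Hypothetical heights in cm
heights = [151, 158, 160, 162, 165, 167, 171, 174, 178, 183]
counts = bin_counts(heights, start=150, width=10, n_bins=4)
print(counts)  # [2, 4, 3, 1]

# Simple text histogram: one '#' per value in the bin
for i, count in enumerate(counts):
    low = 150 + i * 10
    print(f"{low} to <{low + 10}: {'#' * count}")
```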
What is a cluster in data?
A group of data points that are close together.