The science and art of collecting, analyzing, and drawing conclusions from data.
statistics
What graph is used to display the distribution of a single quantitative variable?
Histogram
What is the measure of center that is most affected by outliers?
The Mean
What does it mean for a distribution to be symmetric?
The left and right sides of the distribution are mirror images of each other.
What is a value that lies far away from the other data values?
Outlier
What type of data involves categories or labels?
Categorical Data
What type of graph is suitable for comparing the distribution of a quantitative variable across different groups?
Boxplot
What is the measure of spread that is resistant to outliers?
Interquartile range (IQR)
What does it mean for a distribution to be skewed right?
The tail of the distribution extends farther to the right.
What is the 1.5 x IQR rule used for?
Identifying potential outliers in a dataset.
What type of data involves numerical values that can be measured?
Quantitative Data
What graph is used to show the relationship between two quantitative variables?
Scatterplot
What is the formula for calculating the interquartile range (IQR)?
Q3-Q1=IQR
How do the mean and median compare in a right-skewed distribution?
The mean is greater than the median.
Why might you transform data?
To make the data more symmetric or to stabilize the variance.
What is the difference between discrete and continuous data?
Discrete data can only take on specific values, while continuous data can take on any value within a range.
A plot where each data value is divided into a "stem" and a "leaf" to display the shape of the distribution.
What is a stem-and-leaf plot?
What is the range?
The difference between the maximum and minimum values in a dataset.
What is the empirical rule?
Approximately 68% of the data falls within one standard deviation of the mean, 95% within two standard deviations, and 99.7% within three standard deviations in a normal distribution.
What is a common transformation used to make skewed data more symmetric?
Logarithmic Transformation
What gives the percentage or proportion of individuals with a specific value for one categorical variable among individuals who share the same value as another categorical variable (the condition)?
conditional relative frequency
A plot where each data value is represented by a dot above a number line.
What is a dot plot?
What is standard deviation?
A measure of the spread of data around the mean.
What are quartiles?
Values that divide the data into four equal parts (25% each).
The number of standard deviations a data point is from the mean.
What are z-scores?