Vocabulary
Graphical Displays
Descriptive Statistics
Shape, Center, and spread
Outliers and transformations
100

The science and art of collecting, analyzing, and drawing conclusions from data.

statistics

100


What graph is used to display the distribution of a single quantitative variable?



Histogram

100


What is the measure of center that is most affected by outliers?





The Mean

100


 What does it mean for a distribution to be symmetric?

   

 The left and right sides of the distribution are mirror images of each other.

100


What is a value that lies far away from the other data values?



Outlier

200

What type of data involves categories or labels?

Categorical Data

200


What type of graph is suitable for comparing the distribution of a quantitative variable across different groups?

 

Boxplot

200

What is the measure of spread that is resistant to outliers?


Interquartile range (IQR)

200


 What does it mean for a distribution to be skewed right?

  


The tail of the distribution extends farther to the right.

200


What is the 1.5 x IQR rule used for?



Identifying potential outliers in a dataset.

300

What type of data involves numerical values that can be measured?

  Quantitative Data

300

What graph is used to show the relationship between two quantitative variables?

Scatterplot

300


 What is the formula for calculating the interquartile range (IQR)?

  


Q3-Q1=IQR

300


 How do the mean and median compare in a right-skewed distribution?



The mean is greater than the median.

300


 Why might you transform data?



   To make the data more symmetric or to stabilize the variance.

400


What is the difference between discrete and continuous data?

    

Discrete data can only take on specific values, while continuous data can take on any value within a range.

400


 A plot where each data value is divided into a "stem" and a "leaf" to display the shape of the distribution.


What is a stem-and-leaf plot?

400


What is the range?


    The difference between the maximum and minimum values in a dataset.

400

 

What is the empirical rule?

 

Approximately 68% of the data falls within one standard deviation of the mean, 95% within two standard deviations, and 99.7% within three standard deviations in a normal distribution.

400


 What is a common transformation used to make skewed data more symmetric?


   Logarithmic Transformation

500

What gives the percentage or proportion of individuals with a specific value for one categorical variable among individuals who share the same value as another categorical variable (the condition)?

conditional relative frequency

500


 A plot where each data value is represented by a dot above a number line.

 What is a dot plot?

500


 What is standard deviation?


A measure of the spread of data around the mean.

500


 What are quartiles?

 

Values that divide the data into four equal parts (25% each).

500

   The number of standard deviations a data point is from the mean.  

 What are z-scores?