Exploring One-Variable Data
Exploring Two-Variable Data
Collecting Data
Probability, Random Variables, and Probability Distributions
Sampling Distribution
100

This is a graphical representation of a one-variable data set, where the horizontal axis represents the data values and the vertical axis represents the frequency of those values.

A histogram

100

This is a graphical representation of two-variable data, where the horizontal axis represents one variable and the vertical axis represents the other.

A scatterplot

100

This is a type of sampling method in which every member of a population has an equal chance of being selected.

Random sampling 

100

What is a measure of the likelihood that an event will occur, expressed as a number between 0 and 1?

Probability 

100

This is unbiased if, on average, its value equals the population parameter it estimates.

Estimator

200

This is a measure of the spread of a set of data, calculated as the square root of the average of the squared distances of the data values from their mean.

Standard deviation 
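
The calculation behind this answer can be sketched in a few lines of Python. This is a minimal illustration, not part of the original board: the function names and the data values are made up for the example. It shows both the population form (divide by n) and the sample form (divide by n - 1).

```python
import math

def population_sd(values):
    """Square root of the average squared distance from the mean."""
    mean = sum(values) / len(values)
    squared_distances = [(x - mean) ** 2 for x in values]
    return math.sqrt(sum(squared_distances) / len(values))

def sample_sd(values):
    """Same idea, but divides by n - 1, the usual choice for sample data."""
    mean = sum(values) / len(values)
    squared_distances = [(x - mean) ** 2 for x in values]
    return math.sqrt(sum(squared_distances) / (len(values) - 1))

data = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data, mean is 5

print(population_sd(data))  # 2.0
print(sample_sd(data))      # slightly larger, because of the n - 1 divisor
```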

200

This is a measure of the strength and direction of the linear relationship between two variables.

Correlation coefficient 
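
As a rough sketch of how the correlation coefficient r is computed, the Python below uses the standard formula (sum of products of deviations divided by the square root of the product of the summed squared deviations). The function name and the study-hours data are hypothetical.

```python
import math

def correlation(xs, ys):
    """Pearson correlation coefficient r for paired data."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations, and sums of squared deviations
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

hours = [1, 2, 3, 4, 5]        # hypothetical hours studied
scores = [52, 60, 71, 78, 89]  # hypothetical test scores

r = correlation(hours, scores)
print(round(r, 3))  # close to +1: strong, positive linear relationship
```

The sign of r gives the direction of the relationship, and its distance from 0 gives the strength. It describes linear association only.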

200

A subset of a population that is used to represent the whole population in research.

A sample

200

A random variable that can take any real value within a given range.

Continuous random variable 

200

What is the formula for the standard deviation of the sampling distribution of the sample mean?

The standard deviation of the population divided by the square root of the sample size.
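
A minimal sketch of this formula in Python, assuming independent observations (for example, a population much larger than the sample). The function name and the numbers are illustrative only.

```python
import math

def sd_of_sample_mean(population_sd, sample_size):
    """Standard deviation of the sampling distribution of the sample mean."""
    return population_sd / math.sqrt(sample_size)

# Hypothetical example: population SD of 15, samples of size 25
print(sd_of_sample_mean(15, 25))  # 3.0
```

Because the sample size sits under a square root, quadrupling the sample size only halves the standard deviation of the sample mean.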

300

This is a table or graph that shows the relative frequency of each value in a data set.

Relative frequency distribution

300

This is a statistical test used to determine whether there is a significant linear relationship between two variables.

Linear regression t-test (t-test for the slope)

300

What is a method of collecting data in which data are gathered from every member of the population?

A census

300

A variable whose value is determined by chance or probability 

A random variable 

300

When the sample size is sufficiently large, the sampling distribution of the mean of independent random observations will be approximately normal, regardless of the shape of the population from which the sample is drawn.

The central limit theorem
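
One way to see the theorem in action is a small simulation. The sketch below is an illustration under stated assumptions, not a proof: the population is an exponential distribution with mean 1 (strongly right-skewed), and the sample size, repetition count, and seed are arbitrary choices.

```python
import random
import statistics

def simulate_sample_means(sample_size, repetitions, seed=0):
    """Draw many samples from a skewed population and return their means."""
    rng = random.Random(seed)
    means = []
    for _ in range(repetitions):
        # Exponential population with mean 1 and standard deviation 1
        sample = [rng.expovariate(1.0) for _ in range(sample_size)]
        means.append(statistics.fmean(sample))
    return means

means = simulate_sample_means(sample_size=40, repetitions=5000)
center = statistics.fmean(means)
spread = statistics.pstdev(means)

# For a normal distribution, about 68% of values fall within 1 SD of the mean
within_one_sd = sum(abs(m - center) <= spread for m in means) / len(means)

print("mean of sample means:", round(center, 3))
print("SD of sample means:", round(spread, 3))
print("share within 1 SD:", round(within_one_sd, 3))
```

Even though individual values are far from normal, the sample means should center near 1, have a standard deviation near 1 divided by the square root of 40, and show roughly the 68% pattern expected of a normal distribution.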

400

What type of data is used to represent characteristics or qualities that cannot be measured numerically?

Categorical data 

400

What is a high-leverage point in linear regression?

High-leverage points have unusually high or low x-values

400

How can blocking improve the design of an experiment?

A randomized block design helps to separate natural variability from differences due to the blocking variable.

400

A set of possible outcomes of a random experiment and the probabilities associated with each outcome

Probability distribution

400

How do you calculate the variance of the sampling distribution of the sample mean?

The variance of the population divided by the sample size
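
This relationship can be checked exactly on a tiny example. The sketch below assumes sampling with replacement from a hypothetical population (the faces of a fair die) and lists every possible sample of size 2, so the resulting list of means is the full sampling distribution of the sample mean.

```python
from itertools import product
from statistics import fmean, pvariance

population = [1, 2, 3, 4, 5, 6]  # hypothetical population: faces of a fair die
sample_size = 2

# Every possible ordered sample of size 2, drawn with replacement
all_samples = list(product(population, repeat=sample_size))
sample_means = [fmean(sample) for sample in all_samples]

print(len(all_samples))                        # 36 equally likely samples
print(pvariance(population) / sample_size)     # population variance / n
print(pvariance(sample_means))                 # variance of the sample means
```

The last two printed values agree (up to floating-point rounding), which is the claim in the answer above.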

500

What type of data is used to represent numerical values that can be measured or counted?

Quantitative data

500

An observation point that lies outside the overall pattern of the data set

Outlier

500

Under what conditions can an experiment be blinded?

An experiment can be blinded when the treatments can be made indistinguishable (for example, by using a placebo), so that the subjects and/or researchers are unaware of the treatment being administered.

500

A random variable that can take on only a countable number of values.

A discrete random variable 

500

What is the difference between a population distribution and a sampling distribution?

A population distribution describes the values of a variable for every individual in the population, while a sampling distribution describes the values a statistic (such as the sample mean) takes across all possible samples of a given size from that population.
