This is a graphical representation of a one-variable data set, where the horizontal axis represents the data values and the vertical axis represents the frequency of those values.
A histogram
This is a graphical representation of two-variable data, where the horizontal axis represents one variable and the vertical axis represents the other.
A scatterplot
This is a type of sampling method in which every member of a population has an equal chance of being selected.
Simple random sampling
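A minimal sketch of drawing such a sample in Python, assuming the population can be listed in full. The population of ID numbers and the helper name `simple_random_sample` are hypothetical, chosen only for illustration.

```python
import random

def simple_random_sample(population, n, seed=None):
    """Return n distinct members, each member equally likely to be chosen."""
    rng = random.Random(seed)          # seeded generator makes the draw reproducible
    return rng.sample(population, n)   # sampling without replacement

# Hypothetical population: ID numbers 1 through 100.
population = list(range(1, 101))
sample = simple_random_sample(population, 10, seed=42)
print(sample)
```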
What is a measure of the likelihood that an event will occur, expressed as a number between 0 and 1?
Probability
A ____ is unbiased if, on average, its value is equal to the population parameter it is estimating.
Estimator
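One way to see unbiasedness is by simulation. The sketch below assumes a fair six-sided die as the population (variance 35/12) and compares two variance estimators: dividing by n - 1 and dividing by n. The die, the sample size, and the helper name `variance` are illustrative assumptions.

```python
import random

def variance(values, divisor_offset):
    """Sum of squared deviations from the sample mean, divided by n - divisor_offset."""
    n = len(values)
    mean = sum(values) / n
    return sum((x - mean) ** 2 for x in values) / (n - divisor_offset)

rng = random.Random(0)
POP_VARIANCE = 35 / 12   # variance of one roll of a fair six-sided die
n, reps = 5, 20000

with_n_minus_1 = []
with_n = []
for _ in range(reps):
    sample = [rng.randint(1, 6) for _ in range(n)]
    with_n_minus_1.append(variance(sample, 1))  # divides by n - 1
    with_n.append(variance(sample, 0))          # divides by n

avg_unbiased = sum(with_n_minus_1) / reps
avg_biased = sum(with_n) / reps
print(f"population variance:   {POP_VARIANCE:.3f}")
print(f"average with n - 1:    {avg_unbiased:.3f}")
print(f"average with n:        {avg_biased:.3f}")
```

Across many samples, the n - 1 version should average close to the population variance, while the n version should average noticeably lower.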
This is a measure of the spread of a set of data, calculated as the square root of the average squared deviation of the data from their mean.
Standard deviation
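A short sketch of the calculation, assuming a small made-up data set. The population version divides by n; the sample version divides by n - 1. The function names are hypothetical.

```python
import math

def population_sd(values):
    """Square root of the mean squared deviation from the mean (divides by n)."""
    mean = sum(values) / len(values)
    return math.sqrt(sum((x - mean) ** 2 for x in values) / len(values))

def sample_sd(values):
    """Same idea, but divides by n - 1, as is usual for sample data."""
    mean = sum(values) / len(values)
    return math.sqrt(sum((x - mean) ** 2 for x in values) / (len(values) - 1))

data = [2, 4, 4, 4, 5, 5, 7, 9]   # mean is 5, squared deviations sum to 32
print(population_sd(data))        # sqrt(32 / 8)
print(sample_sd(data))            # sqrt(32 / 7)
```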
This is a measure of the strength and direction of the linear relationship between two variables.
Correlation coefficient
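A minimal sketch of the Pearson correlation coefficient, computed from deviations about the means. The data values and the function name `pearson_r` are illustrative assumptions.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation: co-deviation of x and y scaled by their spreads."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

xs = [1, 2, 3, 4, 5]
increasing = [2, 4, 6, 8, 10]   # exact positive linear relationship
decreasing = [10, 8, 6, 4, 2]   # exact negative linear relationship
print(pearson_r(xs, increasing))
print(pearson_r(xs, decreasing))
```

The sign gives the direction of the relationship, and values closer to 1 or -1 indicate a stronger linear relationship.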
A subset of a population that is used to represent the whole population in research.
A sample
A random variable that can take any real value within a given interval.
Continuous random variable
What is the formula for calculating the standard deviation of the sampling distribution of the sample mean?
The standard deviation of the population divided by the square root of the sample size (σ/√n).
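The formula can be checked against a simulation. This sketch assumes the population is a fair six-sided die (standard deviation √(35/12)) and samples of size 25; both choices, and the helper name `standard_error`, are illustrative.

```python
import math
import random
import statistics

def standard_error(pop_sd, n):
    """Standard deviation of the sample mean for independent draws."""
    return pop_sd / math.sqrt(n)

pop_sd = math.sqrt(35 / 12)   # standard deviation of one fair die roll
n = 25
theoretical = standard_error(pop_sd, n)

# Simulate many samples of size n and record each sample mean.
rng = random.Random(1)
sample_means = [
    sum(rng.randint(1, 6) for _ in range(n)) / n
    for _ in range(20000)
]
simulated = statistics.pstdev(sample_means)

print(f"theoretical: {theoretical:.4f}")
print(f"simulated:   {simulated:.4f}")
```

The two values should agree closely, and squaring either one gives the corresponding variance.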
This is a table or graph that shows the relative frequency of each value in a data set.
Relative frequency distribution
This is a statistical test used to determine whether there is a significant linear relationship between two variables.
A t-test for the slope of a linear regression line
What is a method of collecting data in which data are recorded from every member of the population?
A census
A variable whose value is determined by chance or probability
A random variable
For a sufficiently large sample size, the sampling distribution of the mean of independent observations of a random variable with finite variance will be approximately normal, regardless of the shape of the population distribution from which the sample is drawn.
The central limit theorem
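A rough simulation sketch of the theorem, assuming a strongly right-skewed exponential population and samples of size 50. Skewness near 0 is used here as a simple stand-in for "approximately normal"; the population, the sample size, and the helper name `skewness` are assumptions made for illustration.

```python
import random

def skewness(values):
    """Third standardized moment; close to 0 for a symmetric distribution."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    return m3 / m2 ** 1.5

rng = random.Random(2)

# Individual draws from a skewed population (exponential with rate 1).
raw_draws = [rng.expovariate(1.0) for _ in range(5000)]

# Means of 5000 independent samples, each of size 50, from the same population.
sample_means = [
    sum(rng.expovariate(1.0) for _ in range(50)) / 50
    for _ in range(5000)
]

print(f"skewness of individual draws: {skewness(raw_draws):.2f}")
print(f"skewness of sample means:     {skewness(sample_means):.2f}")
```

The individual draws should stay clearly skewed, while the sample means should be far more symmetric.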
What type of data is used to represent characteristics or qualities that cannot be measured numerically?
Categorical data
What is a high-leverage point in linear regression?
A high-leverage point is an observation whose x-value is unusually high or low compared with the x-values of the other observations.
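For simple linear regression, leverage can be computed from the x-values alone as h_i = 1/n + (x_i - x̄)² / Σ(x_j - x̄)². The sketch below applies that formula to made-up x-values; the data and the function name `leverages` are hypothetical.

```python
def leverages(xs):
    """Leverage of each observation in a simple linear regression on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return [1 / n + (x - mean_x) ** 2 / sxx for x in xs]

xs = [1, 2, 3, 4, 20]   # the last x-value sits far from the others
h = leverages(xs)
for x, h_i in zip(xs, h):
    print(f"x = {x:>2}: leverage = {h_i:.3f}")
```

The leverages always sum to 2 in this setting (one for the intercept, one for the slope), so a single value near 1 marks a point that dominates the fit.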
How can blocking improve the design of an experiment?
A randomized block design helps to separate natural variability from differences due to the blocking variable.
A set of possible outcomes of a random experiment and the probabilities associated with each outcome
Probability distribution
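A small sketch of such a distribution as a mapping from outcomes to probabilities, assuming the random experiment is two fair coin flips and the outcome is the number of heads. The helper names `is_valid` and `expected_value` are hypothetical.

```python
from fractions import Fraction

# Hypothetical example: number of heads in two fair coin flips.
distribution = {
    0: Fraction(1, 4),
    1: Fraction(1, 2),
    2: Fraction(1, 4),
}

def is_valid(dist):
    """Each probability lies in [0, 1] and the probabilities sum to exactly 1."""
    probabilities = dist.values()
    return all(0 <= p <= 1 for p in probabilities) and sum(probabilities) == 1

def expected_value(dist):
    """Probability-weighted average of the outcomes."""
    return sum(outcome * p for outcome, p in dist.items())

print(is_valid(distribution))
print(expected_value(distribution))
```

Using `Fraction` keeps the arithmetic exact, so the sum-to-one check does not depend on floating-point rounding.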
How do you calculate the variance of the sampling distribution of the sample mean?
The variance of the population divided by the sample size (σ²/n).
What type of data is used to represent numerical values that can be measured or counted?
Quantitative data
An observation point that lies outside the overall pattern of the data set
Outlier
Under what conditions can an experiment be blinded?
An experiment can be blinded when the subjects and/or the researchers who interact with them can be kept unaware of which treatment each subject receives.
A random variable that can only take on a countable number of values.
A discrete random variable
What is the difference between a population distribution and a sampling distribution?
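A population distribution describes the values of a variable for every individual in the population, while a sampling distribution describes the values that a statistic (such as the sample mean) takes across all possible samples of a given size from that population.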