Categorical vs. Quantitative Data
Distributions
Linear Regression
Observational Studies vs. Experimental Design
Probability Rules
100

A sample of household income within a municipality is considered to be __________ data.

What is quantitative?

100

When a data set is essentially a "reflection" of itself about the axis of its center, we use this adjective to describe it.

What is symmetric?

100

This term refers to the strength of the linear relationship between the explanatory and response variables. It is highly misused in casual conversation as it only applies to linear associations and should never be confused with causation.

What is correlation?

100

The first and foremost principal in any observational study and well-designed experiment is that the data are collected or assigned at _______________.

What is random?

100
When flipping a coin, the outcome of Tails is known as an __________.

What is an event?

200

Quantitative data that "jump" from integer to integer, for example, AP Exam scores (1, 2, 3, 4, 5) are said to be ________________.

What is discrete?

200

A distribution that is not symmetric and has notable high upper outliers is said to be __________ _________.

What is right skewed?

200

In a linear model attempting to measure the relationship between days of sunshine and the length of the growing season, which variable should be used as the explanatory variable?

What are days of sunshine?

200
When your observational study is poorly designed in such a way that it leads to your results not being representative of the population, your study is said to be _________.

What is biased?

200

When two events cannot happen at the same time, they are said to be _________ or ________ _______?

What is disjoint or mutually exclusive?

300

Quantitative data that can take on any value over a finite or infinite interval are said to be _____________. An example of this would be outdoor temperature (in degrees).

What is continuous?

300

We use this term to describe a distribution that is roughly unimodal, symmetric, and obeys the 68-95-99.7% Rule.

What is Normal?

300

They say that height from parent to child is strong correlated across genetic lines. As such, if a linear model measuring the relationship between heights of parent and heights of children, which variable should be the response variable?

What is height of children?

300

An experimental design that features this recommended, but not required feature, in which the neither the participant nor administrator knows which treatment group they are in is said to be _______ ________.

What is double blinded?

300

When the occurrence or non-occurrence of one outcome influences the probability of another outcome, the events are not _________________.

What is independent?
400
Which one of the following is not appropriate for describing categorical data: bar charts, histograms, pie charts, two-way tables, and segmented bar charts.

What are histograms?

400

A standardized value from the formula:  


(value - mean)/(standard deviation)

is referred to a __-_________.

What is a z-score?

400

The difference between an observed value and its predicted value in a linear model is known as a __________.

What is a residual?

400
Every experimental design must have one of these groups in order to establish a baseline against which the treatment group effects can be measured.

What is a control group?

400

The long run average in a probability model after numerous trials is known as the expected ______.

What is value?

500
The four primary displays of quantitative data are: the histogram, boxplot, dot plot, and stemplot. However, which one of the four displays cannot show whether the data set is unimodal or multimodal ("1 or more centers")?

What is boxplot?

500

The area to the left of a calculated z-score is known as this term. It also refers to where 'you' standard in regards to the "performances" of others in your sample.

What is a percentile?

500

If linear regression is performed on two variables that have absolutely no relationship whatsoever, the slope of the line of best fit should be approximately _________.

What is zero?

500

A necessary feature of a well-designed experiment is that it has enough: participants and repeated trials to ensure that the first results were not simply due to chance. It is referred to as ___________.

What is replication?

500

If the probability that it will rain tomorrow is (2/3) and the probability that you roll a five on a six-sided die is (1/6), then the probability that both events will occur is: 

What is 1/9  or  2/18?