Data Collection
Probability
Organizing Data
4C's
Things you should know
100

When some groups in the population are left out of the process of choosing a sample


Undercoverage

100
What type of event is it when knowing whether or not one event occurs does not change the probability of the other event? 

Independent Events 

100

This measure of center is more resistant to outliers than the mean.


Median

100

What are the 4 C's?

Choose, Check, Calculate, Conclude

100

Does Correlation = Causation? 

NO!

200

A common form of blocking for comparing just two treatments.

Matched pairs


200

When two events cannot occur at the same time? 

Mutually Exclusive Events
200

What is the Z-Score formula? 

(Value - Mean) / SD

200

What is the procedure for the following scenario: 

"Construct and interpret a 90% confidence interval for the proportion of women in the United States who do not feel that they get enough time for themselves." 

One Sample Z-Interval for p

200

What is the interpretation structure for describing the mean? 

After many, many groups of n-trials, the average number of successful events is the mean value. 

300

Neither the subject nor those who measure the response variable know which treatment a subject received.


Double-blind Experiment


300

The probability Jared Rabina gets a goal in lacrosse is 0.56. How would you interpret the probability of Jared's shot being a goal?

After many, many shots taken, the percent Jared scoring a goal approaches 56% in the long run. 

300

What is name of the value after you square the standard deviation? 

Variance
300

What is the procedure for the following scenario: 

Construct and interpret a 99% confidence interval for the true mean difference (Version A – Version B) in final exam scores for Physics students in this district.


One Sample Z-interval for mean difference 

300

When creating histograms, bar graphs, or other that display data, what should the y-axis always start at?

400

The population is divided into groups. Some groups are randomly selected and all individuals in the chosen groups are sampled

Cluster Sampling

400

A roulette wheel has 38 pockets numbered 1through 36 as well as two pockets labeled “0” and “00”. There are 18 red pockets and 18 black pockets, and 2 green pockets (‘0’ and ‘00’). To play roulette, a metal ball is spun around the wheel and lands inside one of the pockets.

What is the probability that the ball lands in a pocket that contains the number ‘2’

13/38  = .3421 = 34.21%

400

What does each letter of SOCV and DUFS stand for?

Shape, Outliers, Center, Variability 

Direction, Unusual Features, Form, Strength 

400
What conditions do you check for a significance test for means

Random Sampling, 10% condition, CLT 

400

What does BINS stand for? 

Binary, Independent, Number of Trials, Success probability

500

Eli Kassoff wants to divide the Washington Commanders fans into two groups: bandwagoners and others. We then take random samples from each group. What sampling method occurred?

Stratified Random Sample

500

Write out the formula/rule for the following scenario: 

Given that a person has an education level of a Bachelor’s degree, what is the probability they identify as liberal?

P(Liberal & Bachelor) / P (Bachelor) 

500

How to identify outliers for univariate data


1.5 IQR Rule 

High Outlier > Q3 + 1.5*IQR

Low Outlier < Q1 - 1.5*IQR 

500

In one study, a random sample of 12 men had a mean of 7.95 days and a standard deviation of 6.2 days, and a random sample of 19 women had a mean of 7.1 days and a standard deviation of 5.0 days. The sample data will be used to construct a 95 percent confidence interval to estimate the difference between men and women in the mean number of days for the length of stay at a hospital. 

Have the conditions been met for inference with a confidence interval?

No, fails CLT. We would need to draw a dotplot to see if it normal, if not, we proceed with caution

500

Given S = 4.11, interpret the value. 

The actual y-variable, is typically about 4.11 units away from the number predicted by the LSRL

M
e
n
u