Variables and Data Types
Summarizing and Interpreting Data
Hypothesis Testing and Inference
Regression and Bivariate Analysis
Advanced Statistical Tools and Techniques
100

In a study examining the effect of exercise on weight loss, weight loss is this type of variable.

what is response variable

100

In a normal distribution, approximately 95% of data falls within this many standard deviations of the mean.

what is 2 standard deviations

100

When the p-value is smaller than the significance level, this is the decision made regarding the null hypothesis.

what is rejecting the null hypothesis

100

The distance between an observed value and the value predicted by the regression line.

what is residual

100

This method uses repeated resampling with replacement to estimate confidence intervals.

what is bootstrapping

200

What type of variable is zipcodes?

what is categorical

200

This measure of center is resistant to outliers

what is median

200

This term describes the smallest significance level at which a null hypothesis can be rejected, derived from a p-value.

what is critical value

200

A correlation coefficient of -0.8 suggests this type of relationship between two variables.

what is strong negative linear relationship

200

This type of distribution is generated by resampling from the observed data under the null hypothesis to assess the strength of evidence against the null.

what is randomized distribution

300

A study finds that more firefighters at a fire lead to greater property damage. The size of the fire is this type of variable.

what is confounding variable

300

In a five-number summary, this value represents the spread between the first quartile (Q1) and third quartile (Q3).

What is IQR (interquartile range)
300

This type of hypothesis test compares the proportions of two independent samples, and its test statistic is based on the difference of sample proportions.

what is z-test for 2 proportions

300

When residual plots exhibit a pattern, it suggests that the regression model might be this.

what is poor fit

300

This type of test compares expected and observed counts in a contingency table to determine association

what is chi-square test

400

This variable type can be visualized using a bar chart or pie chart.

what is categorical

400

If an outlier is added to a dataset, this measure of center will change the most.

what is mean

400

This approach is used when the population standard deviation is unknown and the sample size is small.

what is t-test

400

The relationship between X and Y in a regression model can be tested using this statistic for the slope coefficient.

what is t-statistic

400

This term describes the ratio of the probability of an event in one group to the probability in another group.

what is relative risk

500

In a study, this type of sampling involves randomly selecting individuals without regard to their subgroups or categories.

what is simple random sample

500

When comparing two datasets with the same mean, a dataset with a higher standard deviation has this characteristic.

what is greater variablilty/spread

500

For a two-tailed test, if the p-value is 0.1 and the significance level is 0.05, you would conclude this.

what is we fail to reject the null

500

The square of the correlation coefficient, this value measures the proportion of variability explained by the model.

what is r-squared value

500

In randomization testing, the p-value is determined by comparing the observed test statistic to this.

What is the distribution of the test statistic under the null hypothesis