Probability
Distributions
Sampling & Inference
Statistical Methods
Data Visualization
Statistical Terminology
100

When rolling a fair six-sided die, this is the probability of rolling an even number.

What is 1/2 (or 0.5 or 50%)?

100

This distribution has the classic 'bell curve' shape.

What is the normal distribution?

100

This term describes how well a sample represents the population from which it was drawn

What is representativeness?

100

This value represents the middle point of a data set where half the values are above and half below

What is the median?

100

This simple plot shows the frequency distribution of a single variable

What is a histogram?

100

This term describes how spread out the values in a dataset are from the mean.

What is standard deviation (or variance)?

200

This principle states that in a random experiment, the sum of the probabilities of all possible outcomes equals 1.

What is 1/2 (or 0.5 or 50%)?

200

his distribution models the number of successes in a fixed number of independent trials, each with the same probability of success.

What is the binomial distribution?

200

This type of sampling gives every member of the population an equal chance of being selected.

What is random sampling (or simple random sampling)?

200

This test is used to compare means between two independent groups

What is the t-test?

200

This plot shows the relationship between two numerical variables as points on a coordinate plane.

What is a scatter plot?

200

This value indicates the likelihood of obtaining test results at least as extreme as the observed results, assuming the null hypothesis is correct.

What is the p-value?

300

This type of probability calculation updates beliefs based on new evidence or information

What is Bayesian probability?

300

This distribution has parameters λ (lambda) and is often used to model the number of events occurring within a fixed time interval.

What is the Poisson distribution?

300

This is the difference between a sample statistic and the corresponding population parameter it estimates.

What is sampling error?

300

This type of analysis examines the relationship between a dependent variable and one or more independent variables

What is regression analysis?

300

This visualization displays the distribution of data through their quartiles and identifies outliers

What is a box plot (or box-and-whisker plot)?

300

This type of error occurs when we incorrectly reject a true null hypothesis.

What is Type I error?

400

This rule states that the probability of event A or event B occurring is the sum of their individual probabilities minus the probability of their intersection

What is the addition rule? (or What is P(A or B) = P(A) + P(B) - P(A and B)?)

400

This distribution is used to compare the variance of two samples and is commonly used in ANOVA.

What is the F-distribution?

400

This is the range of values within which a population parameter is estimated to lie, with a specified level of confidence.

What is a confidence interval?

400

This value tells us how far a data point is from the mean in terms of standard deviations

What is a z-score?

400

This type of graph uses vertical or horizontal bars to represent the frequency of different categories of data

What is a bar chart (or bar graph)?

400

This measure tells us how much two variables change together and ranges from -1 to +1.

What is correlation coefficient?

500

In probability theory, this is defined as the expected number of trials needed to get the first success in a Bernoulli experiment."

What is the geometric distribution?

500

When independent random variables are added together, their sum tends toward this distribution regardless of the original distributions.

What is the normal distribution? (Central Limit Theorem)

500

This principle states that as sample size increases, the sampling distribution of the mean approaches a normal distribution.

What is the Central Limit Theorem?

500

This type of analysis is used to determine if there are significant differences between the means of three or more groups

What is ANOVA (Analysis of Variance)?

500

This type of plot connects data points with straight lines and is often used to show changes over time.

What is a line graph (or line chart)?

500

This value represents the percentage of variation in the dependent variable that is explained by the independent variables in a regression model

What is R-squared (or coefficient of determination)?