Total collection of people we want to study
Population
Also known as the average, found by adding all data values and dividing by the number of data values
Mean
The likelihood that an event will occur between 0% and 100%
Probability
The two main characteristics are the probability is between 0 and 1 and the sum of all probabilities is 1
Categorical data; data that includes words
Qualitative data
The data value that occurs most often
Mode
The probability an event will occur given that another event already occurred, denoted P(A|B)
Conditional probability
PDF stands for...
Probability distribution function
Numerical data that is counted; only includes whole numbers
Quantitative discrete data
The middle data value
Median
The probability of an event not occurring, denoted P(A')
Complement
The two main characteristics are a fixed number of trials (n) and only two possible outcomes (p and q)
Binomial distribution
A sample that selects every “nth” participant
Systematic sample
The data values are approximately the same on both sides of a histogram, peaks in the middle
Symmetric or normal distribution
The probability of both events occurring at the same time is zero
Mutually exclusive
The two main characteristics are not having a fixed number of trials (repeat until the first success) and only two possible outcomes (p and q)
Geometric distribution
A sample that includes people that are easily accessible to you
Convenience sample
The data values peak in the left and drop down in the right side of a histogram
Skewed right
The set of all possible outcomes of an experiment
Sample space
Notation for binomial distribution...
X ~ B (n, p)