A company wants to estimate the average annual spending of its 12,000 customers. It randomly selects 300 customers and calculates their average spending.
Identify the population and the sample.
Population: All 12,000 customers
Sample: The 300 selected customers
Explanation:
The population is the entire group of interest. The sample is the subset used to estimate characteristics of the population.
A company records customer type as Retail, Wholesale, or Corporate.
Which graph is most appropriate to display this data?
Bar chart
Explanation:
Customer type is categorical data. Bar charts are used for categorical variables.
Data: 6, 8, 10, 12
Find the sample mean.
Mean = (6 + 8 + 10 + 12) / 4 = 36 / 4 = 9
Explanation:
The mean is the average of the observations.
If P(A) = 0.72, find P(Aᶜ).
P(Aᶜ) = 1 − 0.72 = 0.28
Explanation:
Complement rule: P(Aᶜ) = 1 − P(A).
What are the two key properties of a discrete probability distribution?
0 ≤ P(X = x) ≤ 1
The sum of all probabilities equals 1
Explanation:
All probabilities must be between 0 and 1, and the total probability across all outcomes must equal 1.
Which of the following is true for a continuous random variable?
A) P(X = 5) > 0
B) P(X = 5) = 0
C) P(X = 5) = 1
B) P(X = 5) = 0
Explanation:
For continuous variables, probability exists over intervals, not exact values.
The true average annual spending of all 12,000 customers is $4,850.
Is this value a parameter or a statistic?
Parameter
Explanation:
A parameter describes a population. Since it refers to all 12,000 customers, it is a population value.
A dataset contains employee salaries grouped into intervals ($30k–40k, $40k–50k, etc.).
Which graph is most appropriate?
Histogram
Explanation:
Salaries are quantitative data grouped into numeric intervals. Histograms show distributions of quantitative variables.
Data: 6, 8, 10, 12
Find the median .
Median = (8 + 10) / 2 = 9
Explanation:
For an even number of observations, average the two middle values.
If events A and B are mutually exclusive,
what is P(A ∩ B)?
0
Explanation:
Mutually exclusive events cannot occur at the same time.
A discrete random variable has the following distribution:

Find the expected value.
E(X) = (1)(0.20) + (2)(0.50) + (3)(0.30)
= 0.20 + 1.00 + 0.90
= 2.10
Explanation:
Expected value equals Σ x·P(x).
What does the area under a probability density curve represent?
Probability
Explanation:
The area between two values equals the probability the variable falls in that interval.
The average spending of the 300 selected customers is $4,720.
Is this value a parameter or a statistic?
Statistic
Explanation:
A statistic describes a sample. The value was calculated from only 300 customers.
In a histogram, what does the height of each bar represent?
Frequency or relative frequency
Explanation:
The height shows how many observations fall within each interval.
Data: 6, 8, 10, 12
Calculate the sample variance .
6.67
Explanation:
Excel =VAR.S()
Sample variance divides by n − 1.
If P(A) = 0.40 and P(B) = 0.50 and the events are independent, find P(A ∩ B).
0.40 × 0.50 = 0.20
Explanation:
For independent events: P(A ∩ B) = P(A)P(B).
A discrete random variable has the following distribution:

The mean is 2.10, find the variance.
Compute squared deviations:
(1 − 2.1)² = 1.21
(2 − 2.1)² = 0.01
(3 − 2.1)² = 0.81
Variance = (1.21)(0.20) + (0.01)(0.50) + (0.81)(0.30)
= 0.242 + 0.005 + 0.243
= 0.49
Explanation:
Variance = Σ (x − μ)² P(x).
The time of a call is uniformly distributed between 2 and 10 minutes.
Find the mean.
Formula:
Mean = (a + b) / 2
Mean = (2 + 10) / 2 = 6
Explanation:
The mean of a continuous uniform distribution is the midpoint.
A manager calculates the average, median, and standard deviation of monthly sales data.
Is this descriptive or inferential statistics?
Descriptive statistics
Explanation:
Descriptive statistics summarize observed data without making generalizations beyond the data.
If a distribution is right-skewed, how do the mean and median compare?
Mean > Median
Explanation:
The long right tail pulls the mean upward more than the median.
Using the variance 6.67, find the sample standard deviation.
Standard deviation = √6.67 ≈ 2.58
Explanation:
Standard deviation is the square root of variance.
State the addition rule for non-mutually exclusive events.
P(A ∪ B) = P(A) + P(B) − P(A ∩ B)
Explanation:
We subtract the overlap to avoid double counting.
What four conditions must be satisfied for a binomial experiment?
Fixed number of trials
Independent trials
Two possible outcomes per trial
Constant probability of success
Explanation:
If any of these fail, it is not binomial.
The time of a call is uniformly distributed between 2 and 10 minutes.
Find the variance.
Formula:
Variance = (b − a)² / 12
Variance = (10 − 2)² / 12
= 64 / 12
= 5.33
Explanation:
Standard Deviation is the square root of Variance. So Variance uses the same formula, without the square root.
A manager uses data from 300 customers to estimate the average spending of all 12,000 customers.
Is this descriptive or inferential statistics?
Inferential statistics
Explanation:
Inferential statistics use sample data to make conclusions about a population.
If a distribution is left-skewed, how do the mean and median compare?
Mean < Median
Explanation:
The long left tail pulls the mean downward.
When calculating sample variance, why do we divide by n − 1 instead of n?
To correct bias when estimating population variance.
Explanation:
Dividing by n − 1 adjusts for the fact that the sample mean is an estimate.
If P(A) = 0.60, P(B) = 0.50, and P(A ∩ B) = 0.30, find P(A ∪ B).
0.60 + 0.50 − 0.30 = 0.80
Explanation:
Apply the addition rule.
A production process produces non-defective parts 90% of the time.
If 5 parts are selected, what distribution applies?
Binomial distribution
Explanation:
Fixed trials, independent, two outcomes, constant probability.
Net income is uniformly distributed between $200,000 and $300,000.
Find P(X ≥ 275,000).
Formula:
(d − c) / (b − a)
= (300,000 − 275,000) / (300,000 − 200,000)
= 25,000 / 100,000
= 0.25
Explanation:
Uniform probability equals length of subinterval divided by total interval.
Sales revenue recorded each month for the past 24 months is what type of data?
Time-series data
Explanation:
Time-series data track the same variable over multiple time periods.
Given the following summary statistics:
Q1 = 30
Q3 = 50
Calculate the interquartile range.
IQR = 50 − 30 = 20
Explanation:
The interquartile range measures the spread of the middle 50% of the data.
If a dataset has a standard deviation of 0, what does that indicate?
All observations are identical.
Explanation:
There is no variability in the data.
Define conditional probability.
The probability of an event occurring given that another event has occurred.
Explanation:
Conditional probability adjusts probabilities based on new information.
If n = 10 and p = 0.30, find the mean and variance of the binomial distribution.
Mean = np = 10(0.30) = 3
Variance = np(1 − p) = 10(0.30)(0.70) = 2.1
Explanation:
These are standard binomial formulas.
A normal distribution has mean = 50 and standard deviation = 5.
Find the z-score for X = 60.
Formula:
z = (X − μ) / σ
z = (60 − 50) / 5 = 2
Explanation:
The value is 2 standard deviations above the mean.
Customer income levels collected from 500 customers on March 1 is what type of data?
Cross-sectional data
Explanation:
Cross-sectional data are collected at a single point in time.
Given the following summary statistics:
Q1 = 30
Q3 = 50
Calculate the lower outlier boundary.
Lower boundary = Q1 − 1.5(IQR)
= 30 − 1.5(20)
= 30 − 30
= 0
Explanation:
Values below this boundary are considered potential outliers.
A value has a z-score of 1.8. What does this mean?
The value is 1.8 standard deviations above the mean.
Explanation:
Positive z-scores indicate values above the mean.
If P(A ∩ B) = 0.24 and P(B) = 0.60, find P(A | B).
P(A | B) = 0.24 / 0.60 = 0.40
Explanation:
Conditional probability formula:
P(A | B) = P(A ∩ B) / P(B).
A company averages 4 customer complaints per day.
What distribution should be used to model the number of complaints per day?
Poisson distribution
Explanation:
Poisson models number of events in a fixed time interval.
For a standard normal distribution, what are:
a) Mean
b) Standard deviation
a) 0
b) 1
Explanation:
The standard normal distribution is centered at 0 with SD = 1.
Classify each variable as qualitative or quantitative:
a) Customer satisfaction rating (Excellent, Good, Fair, Poor)
b) Annual revenue in dollars
a) Qualitative
b) Quantitative
Explanation:
Qualitative data describe categories. Quantitative data are numerical and measurable.
Given the following summary statistics:
Q1 = 30
Q3 = 50
Calculate the upper outlier boundary.
Upper boundary = Q3 + 1.5(IQR)
= 50 + 1.5(20)
= 50 + 30
= 80
Explanation:
Values above this boundary are potential outliers.
A value has a z-score of −2.5. Interpret this.
The value is 2.5 standard deviations below the mean.
Explanation:
Negative z-scores indicate values below the mean.
If P(A | B) = 0.30 and P(B) = 0.50, find P(A ∩ B).
0.30 × 0.50 = 0.15
Explanation:
Rearranging conditional probability:
P(A ∩ B) = P(A | B)P(B).
If λ = 3, what is the probability of exactly 0 events?
Use the formula:
P(X = 0) = e⁻³ (3⁰ / 0!)
P(X = 0) = e⁻³ ≈ 0.0498
Explanation:
For Poisson, P(X = 0) = e⁻λ.
If Z = −1.28, what is P(Z < −1.28) approximately?
0.10
Explanation:
From standard normal tables or Excel:
NORM.DIST(−1.28,0,1,TRUE) ≈ 0.1003.
A company surveys only customers who made purchases in the last week.
What potential problem may occur?
Selection bias
Explanation:
The sample may not represent all customers, especially those who purchase less frequently.
A boxplot shows a long upper whisker and several high-value points beyond it.
What does this suggest about the distribution?
The distribution is right-skewed with high outliers
Explanation:
A longer upper tail indicates skewness toward higher values.
If every observation in a dataset increases by 10, what happens to:
a) The mean
b) The standard deviation
a) Mean increases by 10
b) Standard deviation stays the same
Explanation:
Adding a constant shifts the center but does not change spread.
What is prior probability?
The probability of an event before new evidence is considered.
Explanation:
It is the initial belief or probability estimate.
When drawing 5 cards from a deck without replacement and calculating the probability of getting 2 aces, which distribution applies?
Hypergeometric distribution
Explanation:
Sampling without replacement from a finite population.
Which Excel function finds cumulative probability for a normal distribution?
NORM.DIST with TRUE
Explanation:
TRUE returns cumulative probability.
Why is random sampling important when making inferences about a population?
It increases the likelihood that the sample is representative of the population.
Explanation:
Random sampling reduces systematic bias and improves the reliability of conclusions drawn from sample data.
A scatterplot of advertising spending vs. sales shows points rising from left to right.
What does this suggest?
A positive association
Explanation:
As advertising spending increases, sales tend to increase.
Two investments have the following:
Investment A: Mean = 8%, Standard Deviation = 4%
Investment B: Mean = 12%, Standard Deviation = 6%
Which measure should you use to compare relative risk?
Coefficient of variation
Explanation:
The coefficient of variation compares variability relative to the mean, making it appropriate when means differ.
What is posterior probability?
The updated probability of an event after considering new evidence.
Explanation:
Bayes’ Theorem converts prior probability into posterior probability using additional information.
Which Excel functions correspond to:
a) Binomial probability
b) Poisson probability
c) Hypergeometric probability
a) BINOM.DIST
b) POISSON.DIST
c) HYPGEOM.DIST
A machine fails on average once every 4 years.
Assuming exponential distribution, find the probability it lasts at least 6 years.
Step 1:
λ = 1 / mean = 1 / 4
Step 2:
=EXPON.DIST(6,1/4,TRUE)
≈ 0.2231
Explanation:
Exponential survival probability equals e^(−λx).