Category 1: Statistical Tests & Hypotheses
Category 2: Central Tendency & Variability
Category 3: Survival Analysis & Rates
Category 4: Data Types & Measurement
Category 5: Graphs & Visualization
100

A hypothesis that assumes there is no difference between two groups.

What is a Null Hypothesis?

Why:
A null hypothesis is a statement that there is no difference or no effect between groups. 

100

The most frequently occurring value in a dataset.

What is the Mode?

Why:
Mode is the value that appears most often in a dataset.

100

Describes the rate of deaths due to cancer in a given year.

What is (Cancer) Mortality Rate?

Why:
A mortality rate is the number of deaths in a population during a specific time period, often expressed per a standard population size.

100

Smoking status is a type of this data.

What is Nominal Data?

Why:
Nominal data consists of categories with no inherent order.

100

Organizes data in rows and columns.

What is a Table?

Why:
A table is a structured arrangement of data organized into rows and columns.

200

A test used to examine the relationship between categorical variables.

What is a Chi-Square Test?

Why:
A chi-square test is a statistical test used to determine whether there is an association between categorical variables. 

200

The middle value of an ordered dataset.

What is the Median?

Why:
Median is the value that divides an ordered dataset into two equal halves.

200

Rate that measures survival from cancer excluding deaths from unrelated causes.

What is Relative Survival Rate?

Why:
Relative survival rate is the ratio of observed survival in cancer patients to expected survival in a similar population without cancer.

200

This level of measurement has a true zero point and allows for meaningful ratios.  

What is Ratio Data? 

Why:
Ratio data is a level of measurement that has a true zero point and equal intervals between values, allowing for meaningful comparisons using multiplication or division.


200

Graph used to compare values between different groups or categories.

What is a Bar Graph?

Why:
A bar graph displays categorical data using bars to compare quantities across groups.

300

A test that analyzes how one continuous variable varies with another continuous variable.

What is Correlation?

Why:
Correlation is a statistical measure that describes the strength and direction of a relationship between two variables. 

300

The difference between the highest and lowest values in a dataset.

What is the Range?

Why:
Range is a measure of variability calculated as the maximum value minus the minimum value.

300

Survival measure that is based on a specific time after diagnosis.

What is Observed Survival Rate?

Why:
Observed survival rate is the proportion of patients alive after a specified period of time following diagnosis, regardless of cause of death.

300

This type of data can take on a range of values, including decimals.

What is Continuous Data?

Why:
Continuous data consists of numerical values that can take on any value within a given range.

300

Graph best used to display trends over time.

What is a Line Graph?

Why:
A line graph displays data points connected by lines to show changes over time. 

400

A very small p-value provides strong evidence against this.

 

What is the Null Hypothesis?

Why:
A p-value is the probability of obtaining the observed results assuming the null hypothesis is true.

400

A distribution where extreme values pull the mean away from the median.

What is a Skewed Distribution?

Why:
A skewed distribution is one in which the data are not symmetrical and the mean, median, and mode are not equal.

400

A term used when patients survive without any signs or symptoms of cancer after treatment.

What is Disease-Free Survival?

Why:
Disease-free survival is the length of time after treatment during which a patient remains free of signs or symptoms of cancer. 

400

This type of data consists of countable, whole-number values.

What is Discrete Data?

Why:
Discrete data consists of distinct, separate values that can be counted and do not include fractions or decimals.

400

A graph used to display the distribution of continuous data.

What is a Histogram?

Why:
A histogram displays the frequency distribution of continuous numerical data using grouped intervals.

500

A type of analysis used to predict a binary outcome while adjusting for multiple variables.

What is Logistic Regression?

Why:
Logistic regression is a statistical method used to model the relationship between one or more independent variables and a binary outcome. 

500

A narrow, steep bell-shaped curve indicates this about the data.

What is a Small Standard Deviation?

Why:
Standard deviation is a measure of variability that indicates how closely data values are clustered around the mean.

500

The group of patients who are still being followed and are at risk of experiencing the event.  

What is the Population at Risk?

Why:
Population at risk refers to individuals who are being observed and are susceptible to the outcome of interest.

500

A number obtained by dividing one value by another.

What is a Quotient?

Why:
A quotient is the result of dividing one number by another. 

500

A graph used to estimate survival probability over time and account for censored data.

What is a Kaplan-Meier Curve?

Why:
A Kaplan-Meier curve is a survival analysis graph that estimates the probability of survival over time, accounting for censored observations. 

M
e
n
u