Study that involves the collection, organization, description, analysis, and interpretation of data
Statistics
Variables that are typically integers representing a count of objects or abstract amounts
Discrete (variables)
Type of sample that involves a researcher taking the "easy way" when collecting data
Convenience sample
In Statistics, we analyze characteristics of an entire group of individuals by collecting and analyzing data from smaller subgroups of the whole group.
These small subgroups are known as _________.
Samples
This systematic discrepancy between a sample estimate and a parameter is called
Bias
Data in its original form
Raw Data
The type of variables used in a Likert scale: Below Average, Average, Above Average, etc.
Ordinal (variables)
Type of sample that involves separating population into clusters and selecting random members from each cluster
Stratified random sample
You'd like to know the proportion(%) of seniors at THS that are going to attend prom, so you randomly survey 50 of them asking if they are going. Identify the population and the sample.
Population = All THS seniors
Sample = 50 surveyed seniors
A survey asks a political question that attempts to sway people's opinion while reading. Name the type of bias present in this survey.
Questionnaire/interviewer bias
Data that is organized into rows and columns for easier analysis
Structured Data
Type of variable representing your birth year
Discrete
The type of sample that is the most ideal, where each member of the population has an equal likelihood of being selected
Simple random sample
You'd like to know the proportion(%) of seniors at THS that are going to attend prom, so you randomly survey 50 of them asking if they are going. Identify the parameter and the statistic.
Parameter= % of ALL seniors attending prom
Statistic= % of SAMPLED seniors attending prom
Typical type of bias that occurs in an internet survey
Voluntary response bias
What does each column in a data set represent?
The variables
Explain the difference between nominal and ordinal data.
Nominal= cannot be ordered (in a linear way)
Ordinal=can be ordered in a obvious (linear) way
This can show up anytime a sample is collected in a way that does not accurately represent the population it was collected from.
Bias
We'd like to know what % of Texas households own a dog. So we sample 100 of them and find that 47% of them own a dog. Identify the population and the parameter.
Population= All Texas households
Parameter= % of ALL Texas households that own a dog (unknown)
Imagine you want to measure the height of a tree using a measuring tape. The tree’s height is 10 feet, but due to variations in the measuring tape, the angle you look at the tape, the sun in your eyes, the wind blowing the tape, etc., you get slightly different measurements each time you measure it. Why is this? What is this error called>
An attempt to make a prediction based on the data we collect and analyze
Inference
Things like Social Security number, credit card number, and phone number would be considered what type of variable?
Nominal (They're numbers, but not used in a mathematical way.)
The natural variation from sample to sample that happens by chance. It cannot be avoided.
Sampling error
We'd like to know what % of Texas households own a dog. So we randomly sample 100 of them and find that 47% of them own a dog. Identify the sample and the statistic.
Sample= 100 households
Statistic= 47% (that own a dog)
A study posits a link between a specific medication and faster recovery time from a disease. However, the overall health of the patient, which can significantly affect recovery, serves as a _________
Confounding/Lurking Variable