Data structures
Variables
Connecting data to the world we wish to understand
The research cycle
Research design (plus descriptive statistics)
100

What is a table?

A two-dimensional object with a finite number of rows and a finite number of columns

100

What is the main difference between categorical and numerical variables?

For numerical variables, we can measure the size of difference between individuals using mathematical operations. For categorical variables, we cannot.

100

How are units of observation and units analysis similar?

Both are some kind of concrete object, abstract object (e.g., a group), event, or process.

100

Where do research questions come from?

Shared gaps in knowledge in a community of research (even the experts don't know); requires familiarity with one's discipline or area of research

100

What is the main difference between observational and experimental study designs?

Experimental study designs deliberately intervene on some of the focal variables (specifically, explanatory variables)

Observational study designs do not deliberately alter any of the focal variables; they attempt to observe focal variables as they would exist even if we were not observing them

200

What do rows of data tables represent?

Individuals (aka subjects, records, etc.)

200

What kind of variable can we think of as an answer to a "yes/no" question?

Indicator variables (aka binary or dichotomous variables)

200

How are units of observation and units analysis different?

Unit of analysis: the kind of object/event/process whose variability we wish to understand.

Unit of observation: the kind of object/event/process whose variability we directly observe and record in a data set.

200

Where do hypotheses come from?

Hypotheses are deduced from theories that are credible in particular areas of research (Mad Lips metaphor); requires familiarity with theory in one's discipline or area of research

200

What are the three principles of experiments, and what are two other characteristics of experiments that improve their quality? 

(1) Treatment and comparison between treatment arms

(2) Randomized assignment to treatment arms

(3) Equal treatment

(A) Blocking

(B) Replication

300

Every individual in a data table is the same kind of object, event, or process. What is term we give to this object, event, or process?

Unit of observation

300

Which kinds of variables can we use to answer "greater than, less than" questions? Which can we not?

Ordinal and numerical, but NOT nominal

300

What is a source population?

The population from which we draw our samples

300

How does the bait-and-switch metaphor apply to the research cycle?

Our research questions, hypotheses, and inferences focus on our target/study populations, but our research design, data collection, and data description focus on samples instead. If the sample isn't representative, our inferences about populations won't be valid.

300

What is confounding?

Anything that interferes with or prevents causal inference

400

What is a variable?

A characteristic, attribute, or trait that can (but need not) vary between individuals in a data table

400

Why do we often confuse continuous numerical variables with discrete ones?

Practically speaking, we must round continuous variables, making them appear as if they skip over values

400

What is a target or study population?

A population whose patterns of variability we wish to understand. Often, defined with geographic, temporal, or other conditional boundaries.

400

In what ways does the research cycle loop back on itself?

If our research supports our hypothesis, this often raises new research questions.

If our research does not support our hypothesis, we develop new hypotheses to answer our research questions.

If our research proves inconclusive, we return to research design.

400

What is a lurking variable?

A common cause of explanatory and response variables of interest that has not been observed

500

What is a unique identifier?

A unique combination of numbers and/or letters that helps us to avoid confusing the records of different individuals in a data set

500

What is operationalization?

The act of deciding how exactly to observe and record a variable.

500

What is an ecological study, and what is the ecological fallacy?

Ecological study: a study whose unit of observation and analysis is a group rather than an individual.

Ecological fallacy: the mistaken inference that what is true of the pattern of variability in groups is also true of the pattern of variability of individuals within those groups. 

500

How does the the "carts before horses" metaphor apply to the research cycle?

The normal research cycle assumes we design our data collection to collect data in a way that allows us to evaluate our hypotheses.

When we use found data, we don't have this control, so the source population of and the variables constituting found data may not match the needs of our research.

500

What are the major challenges of descriptive statistics: what are their names, what is challenging about them, and how do we solve them?

Forest vs. Trees: Figuring out how much information about samples to convey in our descriptions; we favor summaries that emphasize broad patterns of variability rather than getting lost in the minutia of information available about our samples

Know your audience: Communicating about variability in our samples to audiences in a way they can understand and meets their needs; identify who our audiences are, what their needs and limitations are regarding statistics, and tailoring our message to these