Categorizing Data
Describing Data
Visualizing Data
Interpreting Data
Statistical Tests
100

The values of this type of variable are typically recorded as text, as they have no numerical value

What is qualitative data?

100

Descriptive statistics give us a sense of (a) the ___ of the data and (b) the ___ of the data across the spectrum of values.

a) location b) spread

note: What do each of these mean?

100

This plot type is useful for exploring the relationship between a discrete dependent variable (y) and a qualitative independent variable (x)

What is a barplot?

100

What is the term for making inferences from our data about patterns and processes exhibited by a larger group of organisms?

Inferential statistics

100

When do we use a non-parametric test?

If the data are not normally distributed

200

These variables are quantitative; they still measure an
amount. Yet, they are limited to certain values, such as whole numbers.

What are discrete variables?

200

What is the pattern of a normally distributed data set?

Bell shaped curve

200

This plot type is useful for visualizing the relationship between a continuous dependent variable (y) and a qualitative independent variable (x)

What is a boxplot?

200

Results with p-values less than or equal to this value are called “statistically significant.”

p<0.05

200

Name two questions you should ask when determining the appropriate statistical test for your data

1) Is the dependent variable qualitative or quantitative?
2) Does the data for the dependent variable follow a normal distribution?
3) Is the independent variable qualitative or quantitative?
4) If the independent variable is qualitative, how many levels are there?

300

This is X, the variable potentially influencing
our variable (Y) in a hypothesis test

Independent Variable

300

This value helps quantify how accurately you
have measured the true mean of the whole population by obtaining the mean of a smaller sample

Standard error of the mean

300

This plot type explores the relationship between two quantitative variables (continuous or discrete)

What is a scatterplot?

300

True or false: The lower the variation in your data, the more confidence you can have that observed differences are from your treatment and not from other factors.

True

300

Name the appropriate test for this data: Is there a difference in dune plant height between 5 species?

ANOVA

400

Name the data type: sampling location, presence/absence data, habitat type

Nominal data

400

How do we assess the distribution of data (hint: it's a plot type)?

By constructing histograms, whereby the values of the data are on the x-axis and the frequency of those values that are in the dataset is on the y-axis

400

Name the appropriate plot for this data: distance in meters from the shore (x) vs mole crab length (y)

scatterplot (two quantitative variables)

400

What would be the Rvalue if our data fit perfectly to the model (for example, if all the data points fell exactly on the regression line)

R2=1

400

Name the appropriate test for this data: Does the number of plankton species differ between the Santa Monica and Venice piers?

independent t-test

500

ALL GROUPS

Name the independent and dependent variables and the data types collected for this study: Scientists collected snowy plover body weight and # of chicks in protected vs unprotected nesting sites

Independent: Protected vs unprotected sites

Dependent: body weight, # of chicks

Body weight: continuous

# of chicks: discrete


500

How do we calculate standard deviation?

By adding the squared deviations of each sample value from the mean, dividing by the sample size, and taking the square root

500

Name the appropriate plot for this data: tide pool location (x) vs number of species (y)

barplot

500

What does the p-value of our data tell us?

The p-value helps evaluate how likely it is that the observed differences between treatment and control (or other factors being evaluated) would occur if in reality, the treatment had no effect.

500

Name the appropriate test for this data: Does the size class of mussels affect the presence of drill holes from whelk predation? Bonus: what type of data are each of these variables?

Chi-square test, categorical

M
e
n
u