DPLYR VERBS
GGPLOT
Quantitative or Categorical
Study Design
Modeling
100
This function is used to pick specific columns in a dataframe

select()

100

This graph type is used to visualize the distribution of a quantitative variable

Histogram


100

Height

Quantitative


100

Individuals are randomly assigned to one of three groups and then asked to perform a task, which is then recorded and measured. Is this an observational or experimental study?

Experimental

100

What is the function used for a linear model in R

lm()

200

This function breaks a dataset into subsets based on a categorical variable so that future operations are applied to each category.

group_by()

200

This graph type is used to look at the interaction of two quantitative variables

Scatterplot

200

Gender

Categorical


200

Observational or Experimental: Survey responses of patrons of a coffee shop.

Observational

200

What is the primary difference between the equal and different slopes models?

Different slopes adds interactions between variables

300

This function allows you to modify and create columns

mutate()

300

This graph type is used to visualize the interaction of a categorical variable and a quantitative variable

Boxplot

300

Percentages

Quantitative


300

Polling individuals from certain neighborhoods within a city

Cluster

300

How do we encode categorical variables with more than 2 categories

Assign binary values to our X2, X3, etc

400

What dplyr verb was used to clean this dataset?

filter()

400

This ggplot layer removes the default grey background on a graph

theme_minimal()

400

Income (Levels)

Categorical

400

Students at Bucknell are randomly selected based on year and major

Stratified Sampling

400

What is the equation for the equal slopes model

y = B0 + B1X1 + B2X2 + E





500
What dplyr function(s) were used to clean this dataset?

count()

500

What two ggplot layers were added to create this graph?

geom_point() + facet_wrap()

500

survey ratings (1-5)

Categorical

500

A study is conducted where individuals of different age groups are given either a medication or a placebo. Would you feel comfortable concluding causal relationships?

No, because age isn't enough to capture diversity

500

Interpret B0

When sepal width = 0, Sepal Length is expected to be 6.526