R Symbols
Made-Up Words
Data Perils
Data Wrangling
Visualization
100

<- is used to do this

What is assign a value to a variable?

100

The tidyverse version of a data frame

What is a tibble?

100

Extreme values that can distort the mean of a data set

What are outliers?

100

In tidy data, each observation should correspond to one of these in a data frame

What is a row?

100

The little box at the side of a plot showing what the colors or shapes mean

What is a legend?

200

$ is used to do this

What is specify a column within a data frame?

200

The first g in ggplot2 stands for this

What is grammar?

200

A line of best fit doesn't mean much if this is close to zero

What is a correlation coefficient?

200

This function is used to keep only certain rows of a data frame

What is filter?

200

This sort of plot is good for visualizing time series

What is a line chart?

300

This is the name for %>%

What is the pipe?

300

This sort of operation can make a data frame longer or wider

What is a pivot?

300

These can be addressed by deleting rows or by imputation

What are missing data?

300

This function is used to keep only certain columns of a data frame

What is select?

300

In a histogram, data are divided into these

What are bins?

400

This is the value of the expression 3:6

What is the vector 3, 4, 5, 6?

400

The tidyverse package for data wrangling

What is dplyr?

400

This issue was responsible for the erroneous "Dewey Defeats Truman" headline

What is sampling bias?

400

This function is used to modify a column or add a column to a data frame

What is mutate?

400

In a box plot, the height of the box corresponds to this

What is the interquartile range?

500

This is the value of 1984 %% 10

What is 4?

500

This obscure term, also used in photography, is a command to access some documentation within R

What is vignette?

500

The danger of asking successful entrepreneurs for business advice

What is survivorship bias?

500

These rows would be kept as a result of left_join(a, b)

What are the rows that appear in a?

500

This sort of map might be used to show the level of COVID infection in each state

What is a choropleth map?