<- is used to do this
What is assigning a value to a variable?
The tidyverse version of a data frame
What is a tibble?
Extreme values that can distort the mean of a data set
What are outliers?
In tidy data, each observation should correspond to one of these in a data frame
What is a row?
The little box at the side of a plot showing what the colors or shapes mean
What is a legend?
$ is used to do this
What is specifying a column within a data frame?
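
A minimal base R sketch of `$` in use, assuming a small made-up data frame called `scores` (the name and values are illustrative only):

```r
# Hypothetical data frame, for illustration only
scores <- data.frame(name = c("Ana", "Ben"), points = c(10, 7))

# `$` pulls out a single column as a vector
scores$points

# Combined with `<-`, `$` adds or overwrites a column
scores$bonus <- scores$points + 1
```
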
The first g in ggplot2 stands for this
What is grammar?
A line of best fit doesn't mean much if this is close to zero
What is the correlation coefficient?
This function is used to keep only certain rows of a data frame
What is filter?
This sort of plot is good for visualizing time series
What is a line chart?
This is the name for %>%
What is the pipe?
This sort of operation can make a data frame longer or wider
What is a pivot?
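
One possible sketch of pivoting, assuming the tidyr package is installed; the data frame `wide` and its column names are invented for the example:

```r
library(tidyr)  # assumes tidyr is installed

# Hypothetical wide table: one column per year
wide <- data.frame(state = c("OH", "MI"), y2020 = c(1, 2), y2021 = c(3, 4))

# Longer: one row per state-year combination
long <- pivot_longer(wide, cols = c(y2020, y2021),
                     names_to = "year", values_to = "value")

# Wider: back to one column per year
back <- pivot_wider(long, names_from = year, values_from = value)
```
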
These can be addressed by deleting rows or by imputation
What are missing data?
This function is used to keep only certain columns of a data frame
What is select?
In a histogram, data are divided into these
What are bins?
This is the value of the expression 3:6
What is the vector 3, 4, 5, 6?
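
The colon operator can be checked directly at the console; the variable name `x` here is arbitrary:

```r
x <- 3:6    # integer sequence from 3 up to 6
x           # 3 4 5 6
length(x)   # 4
```
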
The tidyverse package for data wrangling
What is dplyr?
This issue was responsible for the erroneous "Dewey Defeats Truman" headline
What is sampling bias?
This function is used to modify a column or add a column to a data frame
What is mutate?
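
A short sketch that chains filter, select, and mutate with the pipe, assuming dplyr is installed. It uses R's built-in `mtcars` data; the new column name `kpl` and the conversion factor are illustrative choices:

```r
library(dplyr)  # assumes dplyr is installed

result <- mtcars %>%
  filter(cyl == 4) %>%        # keep only the rows with 4 cylinders
  select(mpg, wt) %>%         # keep only these two columns
  mutate(kpl = mpg * 0.425)   # add a column: roughly km per litre
```
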
In a vertical box plot, the height of the box corresponds to this
What is the interquartile range?
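
As a rough illustration in base R, the interquartile range is the third quartile minus the first. The sample values in `x` are made up:

```r
# Hypothetical sample with one extreme value
x <- c(2, 4, 4, 5, 7, 9, 30)

# First and third quartiles
q <- quantile(x, c(0.25, 0.75))

# The distance between them matches the built-in IQR()
box_height <- unname(q[2] - q[1])
IQR(x)
```
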
This is the value of 1984 %% 10
What is 4?
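
The modulo operator can be tried next to integer division for comparison; the variable names are arbitrary:

```r
last_digit <- 1984 %% 10    # remainder after dividing by 10
the_rest   <- 1984 %/% 10   # integer division, for comparison
last_digit                  # 4
the_rest                    # 198
```
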
This obscure term, also used in photography, is a command to access some documentation within R
What is vignette?
The danger of asking successful entrepreneurs for business advice
What is survivorship bias?
These rows would be kept as a result of left_join(a, b)
What are all the rows of a?
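
A small sketch of a left join, assuming dplyr is installed. The contents of `a` and `b` are invented for the example:

```r
library(dplyr)  # assumes dplyr is installed

# Hypothetical tables sharing an id column
a <- tibble(id = c(1, 2, 3), x = c("p", "q", "r"))
b <- tibble(id = c(2, 3, 4), y = c("Q", "R", "S"))

joined <- left_join(a, b, by = "id")
# joined keeps ids 1, 2, 3 from a; y is NA for id 1 (no match in b),
# and id 4, which appears only in b, is dropped
```
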
This sort of map might be used to show the level of COVID infection in each state
What is a choropleth map?