Python Packages
Professors
Class Names
Visualizations
Data Cleaning
100

Imported as pd

What is pandas?

100

Created the UIUC Grade Disparity visualization

Who is Wade Fagen-Ulmschneider?

100

STAT/CS/IS 107

What is Data Science Discovery?

100

A chart showing the distribution of a single numeric variable using contiguous bars

What is a histogram?

100

Values that are often filled or dropped

What are missing values?

200

The framework for many higher-level plotting interfaces

What is matplotlib?

200

The statistics half of the “data science duo”

Who is Karle Flanagan?

200

STAT 207

What is Data Science Exploration?

200

A plot of individual data points showing the relationship between two continuous variables

What is a scatter plot?

200

These extreme values can distort statistical summaries and models

What are outliers? 

300

Modeling, but not deep learning

What is scikit-learn?

300

Sports analytics expert known for his love of baseball player Shohei Ohtani

Who is David Dalpiaz?

300

CS 307

What is Modeling and Learning in Data Science?

300

A summary chart showing the median, quartiles, and potential outliers of a distribution

What is a box plot?

300

This type of scaling changes the max value to 1 and the minimum value to 0

What is min-max normalization?

400

Samuel Norman _______ from the American political drama, The West Wing

What is seaborn?

400

Had to give everyone A’s on his final exam because his wife gave birth at the same time

Who is Albert Yu?

400

IS 467

What is Ethics and Policy for Data Science?

400

A box plot which also shows the distribution of the data

What is a violin plot?

400

This type of scaling adjusts data to have a mean of 0 and a standard deviation of 1

What is standardization (or z-score normalization)?

500

TensorFlow is dead, instead use _______

What is pytorch?

500

Joined the Discord server for his class undercover to catch cheaters

Who is Alexey Stepanov?

500

IS 447

What is Data Management, Curation, and Reproducibility?

500

A colored grid showing values of a matrix or correlations between multiple variables

What is a heat map?

500

This method transforms categorical variables into numerical ones, often as binary columns

What is one-hot encoding?