Imported as pd
What is pandas?
Created the UIUC Grade Disparity visualization
Who is Wade Fagen-Ulmschneider?
STAT/CS/IS 107
What is Data Science Discovery?
A chart showing the distribution of a single numeric variable using contiguous bars
What is a histogram?
Absent entries in a dataset (such as NaN) that are often filled or dropped
What are missing values?
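The two usual treatments, dropping and filling, can be sketched in plain Python. This is a minimal illustration, not how any particular library does it: it assumes missing entries are represented as `None`, and the helper names `drop_missing` and `fill_missing` are hypothetical.

```python
from statistics import fmean

def drop_missing(values):
    """Return only the entries that are present (not None)."""
    return [v for v in values if v is not None]

def fill_missing(values, fill=None):
    """Replace None entries with `fill`; default to the mean of present values."""
    if fill is None:
        fill = fmean(drop_missing(values))
    return [fill if v is None else v for v in values]

data = [1.0, None, 3.0, None, 5.0]
dropped = drop_missing(data)   # shorter list, missing entries removed
filled = fill_missing(data)    # same length, gaps replaced by the mean
```

Dropping shortens the data, while filling keeps its length but invents values, so which one is appropriate depends on the analysis.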
The framework for many higher-level plotting interfaces
What is matplotlib?
The statistics half of the “data science duo”
Who is Karle Flanagan?
STAT 207
What is Data Science Exploration?
A plot of individual data points showing the relationship between two continuous variables
What is a scatter plot?
These extreme values can distort statistical summaries and models
What are outliers?
Modeling, but not deep learning
What is scikit-learn?
Sports analytics expert known for his love of baseball player Shohei Ohtani
Who is David Dalpiaz?
CS 307
What is Modeling and Learning in Data Science?
A summary chart showing the median, quartiles, and potential outliers of a distribution
What is a box plot?
This type of scaling rescales data so that the maximum value becomes 1 and the minimum value becomes 0
What is min-max normalization?
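As a rough sketch of that scaling, each value can be mapped to `(x - min) / (max - min)`. The function name `min_max_scale` is hypothetical, and returning all zeros for constant data is an assumption made here to avoid dividing by zero.

```python
def min_max_scale(values):
    """Rescale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    span = hi - lo
    if span == 0:
        # Assumption: constant data has no spread, so map everything to 0.
        return [0.0 for _ in values]
    return [(v - lo) / span for v in values]

scaled = min_max_scale([10, 20, 25, 50])
```

The relative spacing between values is preserved; only the range changes.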
Samuel Norman _______ from the American political drama, The West Wing
What is seaborn?
Had to give everyone A’s on his final exam because his wife gave birth at the same time
Who is Albert Yu?
IS 467
What is Ethics and Policy for Data Science?
A variation on the box plot that also shows the shape of the data's distribution
What is a violin plot?
This type of scaling adjusts data to have a mean of 0 and a standard deviation of 1
What is standardization (or z-score normalization)?
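A minimal sketch of the z-score computation, `(x - mean) / standard deviation`, using only the standard library. The function name `standardize` is hypothetical, and the choice of the population standard deviation (`pstdev`) rather than the sample version is an assumption; libraries differ on this default.

```python
from statistics import fmean, pstdev

def standardize(values):
    """Shift and scale values to have mean 0 and standard deviation 1."""
    mean = fmean(values)
    sd = pstdev(values)  # population standard deviation (assumption)
    if sd == 0:
        # Assumption: constant data has no spread, so every z-score is 0.
        return [0.0 for _ in values]
    return [(v - mean) / sd for v in values]

z = standardize([2, 4, 4, 4, 5, 5, 7, 9])
```

Unlike min-max scaling, the result is not bounded to a fixed range; values far from the mean get large positive or negative scores.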
TensorFlow is dead; instead, use _______
What is PyTorch?
Joined the Discord server for his class undercover to catch cheaters
Who is Alexey Stepanov?
IS 447
What is Data Management, Curation, and Reproducibility?
A colored grid showing values of a matrix or correlations between multiple variables
What is a heat map?
This method transforms categorical variables into numerical ones, often as binary columns
What is one-hot encoding?
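The idea can be sketched in plain Python: each distinct category becomes its own binary column, and every row has exactly one 1. The function name `one_hot` is hypothetical, and ordering the columns alphabetically is an arbitrary choice for this illustration.

```python
def one_hot(labels):
    """Return (categories, rows), where each row is a list of 0/1 indicators."""
    categories = sorted(set(labels))  # one column per distinct category
    rows = [[1 if label == c else 0 for c in categories] for label in labels]
    return categories, rows

categories, rows = one_hot(["red", "green", "red", "blue"])
```

Here `categories` gives the column order, so the first row marks the `"red"` column and leaves the others at 0.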