What is the mean of the numbers 12, 18, 24, and 30?
21
What does the acronym "CNN" stand for in the context of data science?
Convolutional Neural Network
What type of chart is commonly used to show the distribution of a single numerical variable?
Histogram
What is the mode of the following dataset: [5, 2, 3, 5, 8, 5, 1]?
5
In data science, what does "NaN" stand for when representing missing or undefined values in a dataset?
Not A Number
In statistics, what is the term for a data point that falls outside of the bound of a boxplot?
An outlier
What type of machine learning model is commonly used for classification tasks with two possible outcomes?(Hint: Its a type of regression)
Logistic Regression.
Which R function is used to create visualizations using a colorblind-friendly color palette?
viridis
What is the name of the widely used programming language for data analysis and manipulation in data science?
Python
Which SQL command is used to retrieve all records from a table?
SELECT * FROM table_name.
What is the purpose of data normalization in data analysis?
To clean data and make it more useable/standarized.
What is the most common use of Recurrent Neural Networks (RNNs)?
Voice/Audio recognition
Which data visualization library is known for its capability to create static, high-quality visualizations for publication?
Matplotlib
Which SQL clause is used to filter rows based on a specified condition?
WHERE clause
What are the three main components of the "data science pipeline"?
Data collection, data cleaning/preprocessing, and modeling.
What is the formula for calculating the standard deviation of a sample in relation to the variance?
It is the square root of the variance.
What is the purpose of the K-nearest neighbors (K-NN) algorithm in machine learning?
K-NN is used for classification and regression tasks by considering the 'k' nearest data points to make predictions.
What is the purpose of a heat map in data visualization?
To display data values as colors in a 2D matrix, making it easy to spot patterns and trends.
How many Eggheads are there on UC Davis campus?
7
What is the purpose of regularization techniques in machine learning?
To prevent overfitting and improve model generalization
What is the primary goal of exploratory data analysis (EDA) in data science?
To understand the main characteristics of the data and discover initial insights.
Which machine learning algorithm is inspired by the structure and function of the human brain?
Neural Networks (NNs)
What is the primary purpose of a word cloud in data visualization?
Represent the frequency of words in a textual dataset.
In data science, what is the purpose of feature selection, and how is it different from feature extraction?
Feature selection aims to choose the most relevant features, while feature extraction creates new features from the existing ones.
Where did UC Davis Statistics Department Advisor Jeff Katz go on his honeymoon this past year?
Thailand