Data Analysis

Machine Learning

Data Visualization
Miscellaneous
Data Science General
100

What is the mean of the numbers 12, 18, 24, and 30?

21

100

What does the acronym "CNN" stand for in the context of data science?

Convolutional Neural Network

100

What type of chart is commonly used to show the distribution of a single numerical variable?

Histogram

100

What is the mode of the following dataset: [5, 2, 3, 5, 8, 5, 1]?

5

100

In data science, what does "NaN" stand for when representing missing or undefined values in a dataset?

Not A Number

200

In statistics, what is the term for a data point that falls outside of the bound of a boxplot?

 An outlier

200

What type of machine learning model is commonly used for classification tasks with two possible outcomes?(Hint: Its a type of regression)

Logistic Regression.

200

Which R function is used to create visualizations using a colorblind-friendly color palette?

viridis



200

What is the name of the widely used programming language for data analysis and manipulation in data science?

Python

200

Which SQL command is used to retrieve all records from a table?

SELECT * FROM table_name.

300

What is the purpose of data normalization in data analysis?

To clean data and make it more useable/standarized.

300

What is the most common use of Recurrent Neural Networks (RNNs)?

Voice/Audio recognition

300

Which data visualization library is known for its capability to create static, high-quality visualizations for publication?

Matplotlib

300

Which SQL clause is used to filter rows based on a specified condition?

WHERE clause

300

 What are the three main components of the "data science pipeline"?

Data collection, data cleaning/preprocessing, and modeling.

400

What is the formula for calculating the standard deviation of a sample in relation to the variance?

 It is the square root of the variance.

400

What is the purpose of the K-nearest neighbors (K-NN) algorithm in machine learning?

K-NN is used for classification and regression tasks by considering the 'k' nearest data points to make predictions.

400

What is the purpose of a heat map in data visualization?

To display data values as colors in a 2D matrix, making it easy to spot patterns and trends. 

400

How many Eggheads are there on UC Davis campus?

7

400

 What is the purpose of regularization techniques in machine learning?

To prevent overfitting and improve model generalization


 

500

What is the primary goal of exploratory data analysis (EDA) in data science?

 To understand the main characteristics of the data and discover initial insights.

500

Which machine learning algorithm is inspired by the structure and function of the human brain?

Neural Networks (NNs)

500

What is the primary purpose of a word cloud in data visualization?

Represent the frequency of words in a textual dataset.

500

 In data science, what is the purpose of feature selection, and how is it different from feature extraction?

Feature selection aims to choose the most relevant features, while feature extraction creates new features from the existing ones.

500

Where did UC Davis Statistics Department Advisor Jeff Katz go on his honeymoon this past year?

Thailand