Arrays & Iteration
Visualizations
Probability
Hypothesis Testing/ML
Random Trivia
100

In Python, what would you use to go through every element in a list one by one?

For loop

100

What type of plot would you use to display the relationship between two continuous variables?

scatter plot

100

What is the probability of rolling 5 or greater on a fair die?

2/6 or 1/3

100

What does ML stand for in data science?

Machine learning

100

Guess the movie: 

 πŸ‘©πŸΌπŸ₯€πŸ‘ΉπŸŽΆ 

Beauty and the Beast

200

You have a list of test scores: [80, 90, 70, 100] called scores. What function would you use to get the length of scores?

len(scores)

200

What is the difference between a bar chart and a histogram?

Bar chart: categorical data with bars for each category

Histogram: distribution of continuous numerical data by grouping it into ranges (bins) and showing the frequency in each bin

200

What is the probability of flipping 2 heads in 3 fair coin tosses?

3/8

200

In hypothesis testing, what is the default assumption you are trying to find evidence against?

Null hypothesis

200

Who has won the most grammys of all time?

Beyonce

300

You have a list of numbers and want to find the largest one. What function would you use to get the largest number?

max()

300

What is the name of the library we use to make visualizations?

Seaborn

300

A bag contains 5 red balls, 3 blue balls, and 2 green balls. If two balls are drawn at random without replacement, what is the probability that both are red?


The probability is 2/9.

(First red: 5/10, then second red: 4/9 β†’ (5/10) * (4/9) = 20/90 = 2/9)

300

In hypothesis testing, if your p-value is 0.03 and your significance level (alpha) is 0.05, what conclusion do you draw?

Reject the null hypothesis, because 0.03 < 0.05.

300

What floor number is usually skipped in buildings because it is considered unlucky?

13

400

What Python method would you use to add a new item to a list?

append
400

In a boxplot, which component represents the range of the middle 50% of the data?

The interquartile range (IQR), which is the distance between Q1 and Q3.

400

Two fair six-sided dice are rolled. What is the probability that the sum is a prime number?

15/36.

(Prime sums: 2, 3, 5, 7, 11 β†’ combinations: 1, 2, 4, 6, 2 β†’ total = 15)

400

What type of learning is linear regression?

Supervised learning

400

Who said, "Life is like a box of chocolates, you never know what you're gonna get." Bonus points if you can name the movie that it was said in

Forrest Gump

500

You have the array [3, 1, 4, 1, 5]. Using a for loop, how many total iterations are made when summing all elements?


5 iterations.

(One for each element in the array)

500

What is the correct formatting to create a scatterplot using Python?

sns.scatterplot(x='column_x', y='column_y', data=dataframe)

500

How would you calculate the probability of both events A and B happening if the A and B are independent?

Multiply the probabilities of A and B

500

Name at least 2 out of the 7 steps of machine learning.

Data collection, preparing data, choosing model, training, evaluation, hyperparameter tuning, prediction

500

What are the full names of your HI and all the instructors of your bootcamp?

9:30am - Ethan Shohet, Raghavan Srinivas, Tyler Jung, Ayush Gupta

10:30am - Alankrita Malhotra, Adam Tay, Yatindran Dilip

11:30am - Bill Taing, Gillian Hu, Amber Yu, Sarvagya Goyal

M
e
n
u