What is ML?
Data
Supervised, unsupervised, reinforcement
Application
Ext./Misc.
100

The machine learning process could be compared to the analogy of

What is teaching a baby/a student?

100

A REPRESENTATIVE data collection is important because

What is data is what your model is trained on to solve your problem?

100

Reinforcement learning could be described as

What is a reward-based video game

100

You are looking for a dataset and go to well-known, diverse data repositories. You go to

What is Kaggle, UCI datasets, Keras, Tensorflow?

100

Disadvantages of too little data include

What is prone to bias & generalized trends?

200

The most critical part of the machine learning process is

What is data collection?

200

The sets data is split into is called 

What is training, validation, & testing?

200

Supervised learning is used _ data and could be described as 

What is labeled data and someone giving you the answers to identify?

200

Choose your favorite reward-based video game. Imagine you have started playing it and haven't been successful in your strategy to attain more rewards. What do you do?

What is [depends on the validity and soundness of your answer]?

200
One training/validation/testing split would be described as

What is k-fold, leave-one-out, stratified, time-series, random, etc?

300

AI originated from

What is the calculator?

300
Characteristics of a REPRESENTATIVE dataset include (name at least 3)

What is raw, diverse, recent, less noisy, set dimensional data?

300

Unsupervised learning is used for _ data and is described as

What is unlabeled data and pattern analysis?

300

You are taking care of a toddler. You are trying to teach the toddler some shapes with the books you have (the books include labeled data), however, the baby just seems to not learn anything and cry. For effective learning, you decide to

What is changing the books (to something more interactive) and/or giving the toddler a treat every time it learns a shape right?

300

Advantages of too much data include

What is improved accuracy metrics, higher efficiency for deep learning models, reduced bias (if REPRESENTATIVE), and better EDA results?

400

The machine learning process can be described as (Hint: 5 to 6 steps, think data)

What is data collection, data preparation, EDA (data analysis), machine learning model, and visualization? 

400

The types of data include

What is labeled, unlabeled, numeric, categorical, and ordinal?

400

Subcategories of supervised learning include

What is classification and regression?

400

You are a realtor in the area and want to use AI to your advantage to better understand current trends and projected house prices. The variables you would account for are

What is city, cost of living(#), quality of life(#), current house prices, population, rate of population growth, etc.?

400

Disadvantages of too much data include

What is computationally expensive, increased training times, increased complexity, and diminishing results?

500

Machine learning can be categorized into the 5 subcategories of

What is supervised, unsupervised, reinforcement, deep learning, and deep reinforcement learning?

500

The process(s) to choose a REPRESENTATIVE dataset includes

What is EDA, PCA, &/or LDA?

500

An example that could describe supervised learning is

What is any type of classification or regression?

500

You work in the biomedical industry and want to make an autonomous robots that organizes medicine based on time it needs to be taken and dosage. You have a container you can design, prescription, the medicines, a camera, and any robot material. You follow the machine learning process to make this robot and end up with

What is [we'll judge based on your answer]?

500

Advantages of too little data include

What is reduced storage requirements, faster training times, increased focus on data quality, & better suited for most quantum tasks?