Data Analytics Fundamentals
Machine Learning
Business Intelligence Tools
Stats in Analytics
Companies
& AI trends
100

This programming language is commonly used for database management and is essential for querying relational databases

SQL (Structured Query Language)

100

The method of splitting original data into training and test set

Holdout procedure

100

What term is used for transforming raw data into visual insights?

Data Visualization

100

What term describes the average of a data set?

Mean

100

Which streaming service uses AI to personalize movies and show recommendations?

Netflix

200

In Python, this package is commonly used for data manipulation and analysis, particularly for handling dataframes.

pandas

200

This provides counts of correct and incorrect predictions for each class in a classification problem

confusion matrix

200

What is the term for transforming data into a format suitable for analysis, including techniques such as normalization and transformation?

Data Preprocessing

200

In hypothesis testing, what is the term for the mistake made when failing to reject a false null hypothesis?

Type 2 Error

200

Which cloud service provider offers “Vertex AI,” a platform for building and managing machine learning models?

Google

300

The process of reducing a large amount of information into a summarized, digestible format.

data aggregation

300

This type of learning involves the model learning from labeled data to predict outcomes.

supervised learning

300

Which Microsoft Excel feature allows for complex data manipulation, including summarizing large data sets and creating visual reports?

Pivot Table

300

What technique allows you to examine the relationship between a dependent variable and multiple independent variables simultaneously?

Regression

300

Which cloud service provider offers the AI tool “SageMaker” for building machine learning models?

Amazon Web Services (AWS)

400

This analytic approach focuses on forecasting future outcomes based on historical data.

predictive analytics?

400

A method for reducing overfitting by training a model on multiple subsets of data and averaging the predictions.

cross-validation

400

What package is known for creating visualizations directly from programming languages like Python and R?

Matplotlib

400

What statistical method is used for assessing the goodness of fit of a model by dividing the sum of squares of residuals by the total sum of squares?

R-Squared

400

Which automaker uses AI in its “ProPILOT Assist” system to enhance driver safety and navigation?

Nissan

500

This step involves examining and cleaning the data to handle missing values, outliers, and inconsistencies before analysis.

data preprocessing

500

This form of learning uses a reward and penalty system, often used in robotics and gaming applications.

reinforcement learning

500

What is the term for AI systems that can interpret and generate human language, enabling tasks like translation and sentiment analysis?

Natural Language Processing (NPL)

500

This process removes repetitive or non-contributing features from a dataset, aiming to streamline models and improve performance.

feature selection

500

Which AI trend involves training models with massive datasets to understand and generate human-like text?

Large Language Models (LLMs)

M
e
n
u