This programming language is commonly used for database management and is essential for querying relational databases
SQL (Structured Query Language)
The method of splitting original data into training and test set
Holdout procedure
What term is used for transforming raw data into visual insights?
Data Visualization
What term describes the average of a data set?
Mean
Which streaming service uses AI to personalize movies and show recommendations?
Netflix
In Python, this package is commonly used for data manipulation and analysis, particularly for handling dataframes.
pandas
This provides counts of correct and incorrect predictions for each class in a classification problem
confusion matrix
What is the term for transforming data into a format suitable for analysis, including techniques such as normalization and transformation?
Data Preprocessing
In hypothesis testing, what is the term for the mistake made when failing to reject a false null hypothesis?
Type 2 Error
Which cloud service provider offers “Vertex AI,” a platform for building and managing machine learning models?
The process of reducing a large amount of information into a summarized, digestible format.
data aggregation
This type of learning involves the model learning from labeled data to predict outcomes.
supervised learning
Which Microsoft Excel feature allows for complex data manipulation, including summarizing large data sets and creating visual reports?
Pivot Table
What technique allows you to examine the relationship between a dependent variable and multiple independent variables simultaneously?
Regression
Which cloud service provider offers the AI tool “SageMaker” for building machine learning models?
Amazon Web Services (AWS)
This analytic approach focuses on forecasting future outcomes based on historical data.
predictive analytics?
A method for reducing overfitting by training a model on multiple subsets of data and averaging the predictions.
cross-validation
What package is known for creating visualizations directly from programming languages like Python and R?
Matplotlib
What statistical method is used for assessing the goodness of fit of a model by dividing the sum of squares of residuals by the total sum of squares?
R-Squared
Which automaker uses AI in its “ProPILOT Assist” system to enhance driver safety and navigation?
Nissan
This step involves examining and cleaning the data to handle missing values, outliers, and inconsistencies before analysis.
data preprocessing
This form of learning uses a reward and penalty system, often used in robotics and gaming applications.
reinforcement learning
What is the term for AI systems that can interpret and generate human language, enabling tasks like translation and sentiment analysis?
Natural Language Processing (NPL)
This process removes repetitive or non-contributing features from a dataset, aiming to streamline models and improve performance.
feature selection
Which AI trend involves training models with massive datasets to understand and generate human-like text?
Large Language Models (LLMs)