Data Management Fundamentals
Three Letters (Data Move)
Data Analytics Fundamentals
Cloud Knowledge
All About Capstone #2
100

A data type with only two possible values, usually true or false

Boolean

100

What does ETL stand for?

Extract, Transform, Load

100

A statistical record used to measure achievement or progress toward a goal

Scorecard

100

Name three cloud deployment models

Public, Private and Hybrid

100

How many Capstone do you need to complete for the program?

Two

200

Data organized in a certain format, like rows and columns

Structured data

200

In which stage of a data pipeline does a cloud data analyst check the data for duplicate entries or obvious errors?

Transform Stage

200

The graphical representation of data using charts, graphs, and other visual formats  

Data visualization

200

A cloud computing model that provides hardware and software tools to create an environment for the development of cloud applications, simplifying the application-development process

Platform as a service (PaaS)

200

What does UNSDG stands for?

United Nations Sustainable Development Goals

300

______ allows you to distribute your data store across different storage types

Database partitioning

300

What is the key difference between ETL and ELT?

ETL, data is transformed before loading into storage.

ELT, data is loaded first, then transformed inside the data warehouse

300

A technique that allows users to navigate to related visualizations

Drill through

300

A data warehouse on Google Cloud used to query data, filter large datasets, aggregate results, and perform complex operations

Google BigQuery

300

What day and time the Capstone #2 is due?

September 20th, 2025 at 5:00 PM EST

400

A fully managed service that maximizes open-source data tools for batch processing, querying, streaming, and machine learning  

Dataproc

400

What is one potential risk if the Extract step pulls data from multiple inconsistent sources without validation?

Data Mismatch
Incomplete data

400

An approach to business intelligence that allows both technical and non-technical users across an organization to access data, perform ad-hoc data analysis, and generate reports

Self-service analytics

400

There are six pillars of the Google Cloud Architecture Framework. Name 3.

1) System Design
2) Security, Privacy, and Compliance
3) Reliability
4) Cost Optimization
5) Performance Optimization

400

The final submission includes three components: _____ slides for PowerPoint, _____ minutes for video, and _____ pages for the written report.

8 slides

5 minutes

8 pages

500

________ organizes related fields into different tables, and maintains defined relationships between columns in these different tables

Normalized data

500

You notice that customer IDs are duplicated after loading the data. Which step in ETL should you investigate first, and why?

Transform


500

A specific and objective measure, like a number, quantity, or range

Numerical data (Quantitative data)

500

There are 5 best practices for cost optimization.
Name 3.


1) Delete unused resources and consolidate idle systems
2) Rightsize your system
3) Autoscale computing needs
4) Use heat maps
5) Single cloud vs multi cloud environment  

500

List all the UN goals that you can work for for the Capstone #2.

Goal 1 - No Poverty

Goal 2 - Zero Hunger

Goal 3 - Good Health and Well-being

Goal 10 - Reduced Inequality

Goal 13 - Climate Action

M
e
n
u