A data type with only two possible values, usually true or false
Boolean
What does ETL stand for?
Extract, Transform, Load
A statistical record used to measure achievement or progress toward a goal
Scorecard
Name three cloud deployment models
Public, Private and Hybrid
How many Capstone do you need to complete for the program?
Two
Data organized in a certain format, like rows and columns
Structured data
In which stage of a data pipeline does a cloud data analyst check the data for duplicate entries or obvious errors?
Transform Stage
The graphical representation of data using charts, graphs, and other visual formats
Data visualization
A cloud computing model that provides hardware and software tools to create an environment for the development of cloud applications, simplifying the application-development process
Platform as a service (PaaS)
What does UNSDG stands for?
United Nations Sustainable Development Goals
______ allows you to distribute your data store across different storage types
Database partitioning
What is the key difference between ETL and ELT?
ETL, data is transformed before loading into storage.
ELT, data is loaded first, then transformed inside the data warehouse
A technique that allows users to navigate to related visualizations
Drill through
A data warehouse on Google Cloud used to query data, filter large datasets, aggregate results, and perform complex operations
Google BigQuery
What day and time the Capstone #2 is due?
September 20th, 2025 at 5:00 PM EST
A fully managed service that maximizes open-source data tools for batch processing, querying, streaming, and machine learning
Dataproc
What is one potential risk if the Extract step pulls data from multiple inconsistent sources without validation?
Data Mismatch
Incomplete data
An approach to business intelligence that allows both technical and non-technical users across an organization to access data, perform ad-hoc data analysis, and generate reports
Self-service analytics
There are six pillars of the Google Cloud Architecture Framework. Name 3.
1) System Design
2) Security, Privacy, and Compliance
3) Reliability
4) Cost Optimization
5) Performance Optimization
The final submission includes three components: _____ slides for PowerPoint, _____ minutes for video, and _____ pages for the written report.
8 slides
5 minutes
8 pages
________ organizes related fields into different tables, and maintains defined relationships between columns in these different tables
Normalized data
You notice that customer IDs are duplicated after loading the data. Which step in ETL should you investigate first, and why?
Transform
A specific and objective measure, like a number, quantity, or range
Numerical data (Quantitative data)
There are 5 best practices for cost optimization.
Name 3.
1) Delete unused resources and consolidate idle systems
2) Rightsize your system
3) Autoscale computing needs
4) Use heat maps
5) Single cloud vs multi cloud environment
List all the UN goals that you can work for for the Capstone #2.
Goal 1 - No Poverty
Goal 2 - Zero Hunger
Goal 3 - Good Health and Well-being
Goal 10 - Reduced Inequality
Goal 13 - Climate Action