What is the month with the most jobs’ bytes processed
January 2025
In what item can we find an example for incremental inserts, an optimized MERGE is key. The efficiency comes from leveraging the created date.
1️⃣.5️⃣ Jobs Optimization
Example #1
Beyond individual services, what is the hierarchical structure that organizes all resources within GCP?
is Resource Hierarchy (Organization, Folders, Projects, Resources).
What is the fundamental type of data storage that GCP Cloud Storage provides?
object storage.
Which project experienced the highest percentage of errors
Digital Solutions
What are the three key aspects of project structure
Use a Modular Structure: Organize repository into logical directories
Follow a Naming Convention: Use consistent prefixes and suffixes
Keep SQL Modular: Use includes and macros to avoid redundant code and improve reusability.
What is the primary service that allows for the creation of isolated virtual networks within GCP, enabling secure and scalable resource deployment?
Virtual Private Cloud (VPC).
What is the primary organizational unit within Cloud Storage where data objects are held?
bucket
Which project experienced the lowest runtime on their jobs?
crossbow-engine
Explain the 2 main LookML Modeling Best Practices
Use a Modular Approach
Keep each table in a separate .view.lkml file.
Define reusable dimensions and measures to avoid duplication.
Avoid Hardcoding Values
Instead of hardcoding case statements, use Looker parameters (parameter) to allow users to select values dynamically.
Which GCP services enable the execution of code without explicit server provisioning or management, often used for event-driven functions or containerized applications?
Cloud Functions and Cloud Run.
Cloud Storage offers different classes based on access frequency and cost. Name at least three of these distinct storage classes.
Standard, Nearline, Coldline, and Archive.
What is the most common error type, representing over half of all errors?
Syntax error: Unexpected..
What are the four layers of multi-layered approach to data?
Raw Layer (raw_): Store untransformed data.
Staging Layer (stg_): Clean and standardize data.
Intermediate Layer (int_): Perform joins and aggregations.
Dimensional & Fact Layers (dim_, fact_): Final models for reporting.
Name at least three different database services offered by GCP, highlighting their primary use cases (e.g., relational, NoSQL, data warehousing).
What feature can be enabled on a bucket to retain previous versions of objects, even after they are replaced or deleted?
Object Versioning
which two tables have the same # of partitions?
WFM_DM_nice_factAgentsSchedule
WFM_DM_tbl_Factagentschedule
Explain the "Performance Tuning" based on guideline
Avoid N+1 Query Problems:
Reduce redundant queries by pre-aggregating data.
Use datagroup for Smart Caching:
Set caching policies based on dataset freshness.
Documentation: Datagroups
What are some key factors that influence the overall cost of running workloads on GCP, and what practices can help optimize these costs?
Key factors influencing cost include compute usage (CPU/memory), storage (amount/class), network egress, and database operations. Practices to optimize include rightsizing resources, using committed use discounts, leveraging cheaper storage classes, and monitoring spend with Cloud Billing.
1. Besides project-level IAM, what are two key methods for controlling who can access specific buckets and objects within Cloud Storage?
2. What dedicated GCP service is designed to facilitate transferring large volumes of data into or between Cloud Storage buckets, or even from other cloud providers?
1. Access Control Lists (ACLs) and signed URLs/signed policy documents.
2. Storage Transfer Service (or Transfer Appliance for very large, offline transfers).