GCP Monitoring
GCP Guidelines
General GCP
GCP Cloud Storage
100

What is the month with the most jobs’ bytes processed

January 2025

100

In what item can we find an example for incremental inserts, an optimized MERGE is key. The efficiency comes from leveraging the created date. 

1️⃣.5️⃣ Jobs Optimization

Example #1

100

Beyond individual services, what is the hierarchical structure that organizes all resources within GCP?

is Resource Hierarchy (Organization, Folders, Projects, Resources).

100

What is the fundamental type of data storage that GCP Cloud Storage provides?

object storage.

200

 Which project experienced the highest percentage of errors

Digital Solutions

200

What are the three key aspects of project structure

  • Use a Modular Structure: Organize repository into logical directories

  • Follow a Naming Convention: Use consistent prefixes and suffixes

  • Keep SQL Modular: Use includes and macros to avoid redundant code and improve reusability.

200

What is the primary service that allows for the creation of isolated virtual networks within GCP, enabling secure and scalable resource deployment?

Virtual Private Cloud (VPC).

200

What is the primary organizational unit within Cloud Storage where data objects are held?

bucket

300

Which project experienced the lowest runtime on their jobs?

crossbow-engine


300

Explain the 2 main LookML Modeling Best Practices

Use a Modular Approach

  • Keep each table in a separate .view.lkml file.

  • Define reusable dimensions and measures to avoid duplication.

Avoid Hardcoding Values

  • Instead of hardcoding case statements, use Looker parameters (parameter) to allow users to select values dynamically.

300

Which GCP services enable the execution of code without explicit server provisioning or management, often used for event-driven functions or containerized applications?

Cloud Functions and Cloud Run.

300

Cloud Storage offers different classes based on access frequency and cost. Name at least three of these distinct storage classes.

Standard, Nearline, Coldline, and Archive.

400

What is the most common error type, representing over half of all errors?

Syntax error: Unexpected..

400

What are the four layers of multi-layered approach to data?

  • Raw Layer (raw_): Store untransformed data.

  • Staging Layer (stg_): Clean and standardize data.

  • Intermediate Layer (int_): Perform joins and aggregations.

  • Dimensional & Fact Layers (dim_, fact_): Final models for reporting.

400

Name at least three different database services offered by GCP, highlighting their primary use cases (e.g., relational, NoSQL, data warehousing).

  • Cloud SQL (managed relational databases like MySQL, PostgreSQL, SQL Server)
  • Cloud Spanner (horizontally scalable relational database)
  • Firestore (NoSQL document database)
  • BigQuery (serverless data warehouse)
  • Cloud Bigtable (NoSQL wide-column database)
400

What feature can be enabled on a bucket to retain previous versions of objects, even after they are replaced or deleted?

Object Versioning

500

which two tables have the same # of partitions?

WFM_DM_nice_factAgentsSchedule
WFM_DM_tbl_Factagentschedule

500

Explain the "Performance Tuning" based on guideline

Avoid N+1 Query Problems:

  • Reduce redundant queries by pre-aggregating data.

Use datagroup for Smart Caching:

  • Set caching policies based on dataset freshness.

Documentation: Datagroups

500

What are some key factors that influence the overall cost of running workloads on GCP, and what practices can help optimize these costs?

Key factors influencing cost include compute usage (CPU/memory), storage (amount/class), network egress, and database operations. Practices to optimize include rightsizing resources, using committed use discounts, leveraging cheaper storage classes, and monitoring spend with Cloud Billing.

500

1. Besides project-level IAM, what are two key methods for controlling who can access specific buckets and objects within Cloud Storage?

2. What dedicated GCP service is designed to facilitate transferring large volumes of data into or between Cloud Storage buckets, or even from other cloud providers?

1. Access Control Lists (ACLs) and signed URLs/signed policy documents.

2. Storage Transfer Service (or Transfer Appliance for very large, offline transfers).