PowerBI
Google Analytics
BigQuery & SQL
Let's mix it up (Data Science / Engineering / Analytics)
100

Which visualization's sole purpose is to filter data?

Slicer.

100

What is the difference between User ID and User Pseudo ID?

A user pseudo ID (or "effective user ID") is a unique identifier for a user on a specific device or app instance, while a User ID is a custom identifier you set to track users across multiple device.

100

What is the difference between HAVING and WHERE?

The WHERE clause evaluates all rows, and the HAVING clause evaluates rows after they've been aggregated in the data set.

100

What is ETL?

ETL stands for “Extract, Transform, and Load” and describes the set of processes to extract data from one system, transform it, and load it into a target repository.

200

In Power Query, what language are data transformations coded in? 

M language.

200

This GA4 feature uses machine learning to measure conversions that aren’t directly observable.

Conversion Modelling.

200

🟢 DOUBLE POINTS! 

If you have many fields that you need to group, what is the quickest way to do that?

GROUP BY ALL

200

What is structured and unstructured data?

Structured data is organized and easily searchable, often stored in tables or databases with defined fields, while unstructured data lacks a predefined format or organization, such as images, audios or social media posts.

300

Which of the following is a way to access Power BI content offline?

  1. Using the Power BI desktop application
  2. Using the Power BI service
  3. Using the Power BI mobile app
  4. None of the above
The Power BI mobile app allows you to view and interact with dashboards and reports that you have accessed previously, even when you're offline. The data you see offline is based on what was cached on your device when you last viewed the report or dashboard online.

300

You collect data on your point-of-sale system that could complement the data you are sending to Google Analytics from your website and app.Which of these features allows you to collect and send events directly to Google Analytics servers?

  • Modify Event
  • Data Import
  • Measurement Protocol
  • HTTP Request

Measurement Protocol.

300

What are materialised views?

They are precomputed views that store query results for faster access. They can significantly improve performance for frequently accessed queries by reducing the need to recompute results.

300

What's the difference between Data Lake and Data Warehouse?

Data lakes store all types of raw data. Data warehouses store cleaned and processed data, which can then be used for analytics and reporting purposes.

400

What is a Dataflow?

It’s a reusable transformation logic that can be shared by many semantic models and reports inside Power BI.

400

Which feature, when enabled, associates event data collected from users on an app or website with the Google accounts of users who are signed in and have explicitly consented to sharing this information?

Google Signals.

400

What can be used in conjunction with partitioning to further optimize query performance?

Clustering. By clustering tables based on specific columns, BigQuery can reduce the amount of data scanned during queries.

400

What's a data layer?

It is a temporary data storage. It is a data structure available within a browser on a page and holds all data that you want to process and pass from your website to other applications such as GTM.

500

Which component of the Microsoft Power Platform can be used to allows users to create and manage automated workflows and notifications?

Power Automate

500

If you wanted to create a new data set with a wide view of your business across brands, products, or regions by combining data from multiple-source properties, you would use THIS Analytics 360 feature.

Roll-up Property

500

❌ - No question!

.

500

What is the difference between classification and regression?

Classification is a machine learning technique that assigns data to categories (e.g. spam detection, speech recognition) while regression predicts continuous values (e.g. house price prediction). Both are supervised learning algorithms that use labeled dataset.