Data Science
Data Security
Data Warehouse
Data Visualization
Data Modeling
100

The technical term for the average value of a data set

What is the mean?

100

This term refers to the overall management and control of data assets within an organization, ensuring data quality, compliance, and security

What is data governance?

100

Type of query language that is used to access data warehouses

What is SQL?

100

A chart that is used to show how two numeric variables are related

What is a scatterplot?

100

Name the two programming languages commonly used in analytics engineering for data manipulation and analysis

What are Python and SQL?

200

The most common measure of spread or dispersion in a data set

What is standard deviation?

200

This type of cyber attack involves gaining unauthorized access to systems

What is hacking?

200

Type of key uniquely identifying each record in a fact table

What is a surrogate key?

200

A type of plot that shows trends and cycles over time

What is a line graph?

200

This dbt command builds the documentation site that visualizes the DAG of model dependencies

What is dbt docs generate?

300

Coding language that is commonly used for statistical analysis and data science

What is R?

300

This EU law gives individuals control over how organizations use their personal data

What is GDPR?

300

Data warehouse design technique in which a central fact table is surrounded by dimension tables

What is a star schema?

300

This technique visualizes text data by word frequency

What is a word cloud?

300

This technique models data as nodes/edges in a graph database

What is a graph data model?

400

Statistical method that calculates a line that best fits a set of data points

What is linear regression?
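
As an illustration of the best-fit line, here is a minimal sketch of ordinary least squares for a single predictor. The function name `fit_line` and the sample points are assumptions made up for this example, not part of the quiz.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-style numerator and variance-style denominator (unnormalized).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical points that lie exactly on y = 2x + 1.
slope, intercept = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
```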

400

This technique obscures sensitive data like credit cards and social security numbers

What is masking?
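
One simple form of masking keeps only the last few digits visible. The sketch below assumes that policy; the helper name `mask_number` and the sample values are hypothetical, and real systems may use different rules.

```python
def mask_number(value, visible=4, mask_char="*"):
    """Replace every digit except the last `visible` ones with `mask_char`."""
    total_digits = sum(1 for c in value if c.isdigit())
    keep_from = total_digits - visible  # index of the first digit left visible
    out = []
    seen = 0
    for c in value:
        if c.isdigit():
            out.append(c if seen >= keep_from else mask_char)
            seen += 1
        else:
            out.append(c)  # separators such as "-" pass through unchanged
    return "".join(out)


# Made-up test values, not real account numbers.
masked_card = mask_number("4111-1111-1111-1111")
masked_ssn = mask_number("123-45-6789")
```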

400

Technique which creates aggregated views of data for reporting and analysis

What is OLAP (online analytical processing)?

400

This chart is commonly used to show hierarchical or tree-structured data

What is a dendrogram?

400

A data modeling technique that graphically represents the entities, relationships, and attributes within a system

What is an entity relationship diagram?

500

The formula for calculating variance in a data set

What is the sum of squared deviations from the mean, divided by n - 1 (the sample variance)?
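
The formula in this answer can be sketched directly in code. The function name `sample_variance` and the data values are assumptions chosen for illustration.

```python
def sample_variance(values):
    """Sum of squared deviations from the mean, divided by n - 1."""
    n = len(values)
    if n < 2:
        raise ValueError("sample variance needs at least two values")
    mean = sum(values) / n
    return sum((x - mean) ** 2 for x in values) / (n - 1)


# Hypothetical data set with mean 5 and squared deviations summing to 32.
data = [2, 4, 4, 4, 5, 5, 7, 9]
variance = sample_variance(data)
```

Dividing by n instead of n - 1 would give the population variance.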

500

This application layer firewall examines traffic before it reaches backend servers

What is a WAF (web application firewall)?

500

A NoSQL database that stores unstructured data as JSON-like documents of field-value pairs

What is MongoDB?

500

This principle states that extra info should not distract from key relations

What is the data-ink ratio?

500

This type of join is often fastest when joining large, unsorted data sets on an equality condition

What is a hash join?
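
To show the idea behind this answer, here is a minimal in-memory sketch of the build and probe phases. The function name `hash_join`, the row format (lists of dicts), and the sample tables are assumptions for illustration. A real database engine also handles memory limits and spilling to disk.

```python
from collections import defaultdict


def hash_join(left, right, key):
    """Inner-join two lists of dict rows on `key` using a hash table."""
    # Build phase: index the (ideally smaller) right side by join key.
    index = defaultdict(list)
    for row in right:
        index[row[key]].append(row)
    # Probe phase: look up each left row's key in the index.
    joined = []
    for lrow in left:
        for rrow in index.get(lrow[key], []):
            joined.append({**lrow, **rrow})
    return joined


# Hypothetical tables.
orders = [
    {"customer_id": 1, "amount": 10},
    {"customer_id": 2, "amount": 20},
    {"customer_id": 1, "amount": 5},
]
customers = [
    {"customer_id": 1, "name": "Ada"},
    {"customer_id": 3, "name": "Bo"},
]
result = hash_join(orders, customers, "customer_id")
```

Each row is touched once in each phase, so the work grows roughly with the combined size of the inputs rather than with their product, as it would in a nested loop join.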
