SQL Basics
Data Cleaning
Data Visualization (Tableau)
Statistics Fundamentals
Business Analysis
100

What does SELECT do in SQL?

It retrieves data from a table.

100

What is a “missing value”?

Data that wasn’t recorded or is blank/NULL.

100

What is a “dimension” in Tableau?

Categorical field used for slicing data.

100

What does the “mean” measure?

The average value.

100

What is a requirement?

A need or condition to meet for a solution.

200

What SQL keyword removes duplicates?

DISTINCT
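
A minimal sketch of DISTINCT in action, run through Python's built-in sqlite3 module. The customers table and its rows are made up for illustration.

```python
import sqlite3

# Hypothetical in-memory table, invented for this example.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (city TEXT)")
conn.executemany(
    "INSERT INTO customers VALUES (?)",
    [("Austin",), ("Boston",), ("Austin",), ("Denver",), ("Boston",)],
)

# DISTINCT collapses repeated city values into one row each.
cities = [row[0] for row in conn.execute(
    "SELECT DISTINCT city FROM customers ORDER BY city"
)]
print(cities)
```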

200

What is normalization?

Rescaling data to a common range or scale (e.g., min-max to 0 to 1, or z-scores) so fields are comparable.
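
Two common ways to rescale a numeric column, sketched with made-up values. The function names are illustrative, and min_max assumes the values are not all identical.

```python
import statistics

def min_max(values):
    """Rescale values to the 0-1 range (assumes max > min)."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

def z_scores(values):
    """Center on the mean and divide by the population standard deviation."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return [(v - mean) / sd for v in values]

incomes = [10, 20, 30, 40, 50]  # hypothetical sample values
scaled = min_max(incomes)
standardized = z_scores(incomes)
print(scaled)
print(standardized)
```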

200

What is a “measure”?

Numeric field used for aggregation.

200

What is correlation?

Strength and direction of the (typically linear) relationship between two variables.
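
One way to see strength and direction is the Pearson coefficient, which ranges from -1 to +1. The sketch below computes it by hand on invented data; pearson_r is an illustrative helper, not a library function.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

hours = [1, 2, 3, 4, 5]            # hypothetical study hours
score_up = [2, 4, 6, 8, 10]        # rises with hours
score_down = [10, 8, 6, 4, 2]      # falls with hours

r_up = pearson_r(hours, score_up)      # perfect positive relationship
r_down = pearson_r(hours, score_down)  # perfect negative relationship
print(r_up, r_down)
```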

200

What is a KPI?

A measurable value that indicates performance.

300

Which clause filters results after grouping?

HAVING

300

Name two ways to handle outliers.

Remove, cap (winsorize), transform, investigate source.
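
A rough sketch of two of these options, removing and capping, using the common 1.5 × IQR fence as the outlier rule. The rule and the data are assumptions for illustration; other thresholds are equally valid.

```python
import statistics

data = [10, 12, 11, 13, 12, 11, 95]  # hypothetical values; 95 looks suspicious

# Quartiles from the standard library (default "exclusive" method).
q1, _, q3 = statistics.quantiles(data, n=4)
iqr = q3 - q1
low_fence = q1 - 1.5 * iqr
high_fence = q3 + 1.5 * iqr

# Option 1: remove values outside the fences.
removed = [x for x in data if low_fence <= x <= high_fence]

# Option 2: cap (winsorize) values at the fences instead of dropping them.
capped = [min(max(x, low_fence), high_fence) for x in data]

print(removed)
print(capped)
```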

300

Which chart type is best for showing trends over time?

Line chart.

300

What is standard deviation?

How spread out values are from the mean.
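
A small worked example with Python's statistics module, on invented values. Note the two variants: pstdev treats the data as the whole population, while stdev treats it as a sample and divides by n - 1.

```python
import statistics

values = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data

mean = statistics.mean(values)             # 5
population_sd = statistics.pstdev(values)  # divides by n
sample_sd = statistics.stdev(values)       # divides by n - 1

print(mean, population_sd, sample_sd)
```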

300

Define “stakeholder.”

Anyone affected by or involved in a project.

400

Name one type of SQL JOIN.

INNER, LEFT, RIGHT, FULL OUTER
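
To make the difference concrete, here is a sketch comparing INNER and LEFT joins through Python's sqlite3 module. The tables and rows are hypothetical. RIGHT and FULL OUTER are left out only because older SQLite versions do not support them.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER, name TEXT)")
conn.execute("CREATE TABLE orders (customer_id INTEGER, amount INTEGER)")
conn.executemany("INSERT INTO customers VALUES (?, ?)",
                 [(1, "Ana"), (2, "Ben"), (3, "Cy")])
conn.executemany("INSERT INTO orders VALUES (?, ?)",
                 [(1, 10), (1, 20), (2, 5)])

query = """
    SELECT c.name, o.amount
    FROM customers c
    {join} JOIN orders o ON o.customer_id = c.id
    ORDER BY c.id, o.amount
"""

# INNER keeps only customers that have a matching order.
inner_rows = conn.execute(query.format(join="INNER")).fetchall()

# LEFT keeps every customer; those without orders get NULL (None) amounts.
left_rows = conn.execute(query.format(join="LEFT")).fetchall()

print(inner_rows)
print(left_rows)
```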

400

What is data type coercion?

Converting a value from one data type to another, often automatically (e.g., text → number).
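
A sketch of text-to-number conversion during cleaning. The to_number helper and the sample strings are invented for illustration; values that cannot be parsed become None rather than raising an error.

```python
def to_number(text):
    """Convert text to float; return None when it cannot be parsed."""
    try:
        # Strip whitespace and thousands separators before converting.
        return float(text.replace(",", "").strip())
    except ValueError:
        return None

raw = ["42", " 3.5 ", "1,200", "n/a", ""]  # hypothetical raw column
clean = [to_number(t) for t in raw]
print(clean)
```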

400

What is a calculated field?

A user-created field using formulas.

400

What is a p-value?

Probability of getting results at least as extreme as those observed, assuming the null hypothesis is true.
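
A worked example under assumed numbers: a coin is flipped 10 times and lands heads 9 times, and the null hypothesis is that the coin is fair. The p-value adds up the probability of every outcome at least as extreme as the one observed.

```python
from math import comb

def binom_pmf(k, n, p=0.5):
    """Probability of exactly k successes in n trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

n_flips, heads = 10, 9  # hypothetical experiment

# One-sided: 9 or more heads under a fair coin.
one_sided = sum(binom_pmf(k, n_flips) for k in range(heads, n_flips + 1))

# Two-sided: doubling is valid here because the fair-coin
# distribution is symmetric (9+ heads or 1- heads).
two_sided = 2 * one_sided

print(one_sided, two_sided)
```

With these numbers the two-sided value comes to 22/1024, roughly 0.021.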

400

What are functional vs. non-functional requirements?

  • Functional: What the system does

  • Non-functional: How the system behaves (quality, speed)

500

What’s the main difference between WHERE and HAVING?

  • WHERE filters rows before grouping

  • HAVING filters groups after grouping
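
Both clauses can appear in the same query. The sketch below uses Python's sqlite3 module with a hypothetical orders table: WHERE drops small orders first, then HAVING drops regions whose remaining total is too low.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("East", 50), ("East", 200), ("West", 300), ("West", 20), ("North", 80)],
)

rows = conn.execute(
    """
    SELECT region, SUM(amount) AS total
    FROM orders
    WHERE amount >= 40          -- row filter, applied before grouping
    GROUP BY region
    HAVING SUM(amount) > 100    -- group filter, applied after grouping
    ORDER BY region
    """
).fetchall()
print(rows)
```

West's 20 order is removed by WHERE before totals are computed, and North (total 80) is removed by HAVING afterwards.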

500

What is the difference between wide and long data?

  • Wide: One row per subject; each variable or time point has its own column

  • Long: One row per observation; values are stacked in rows with identifier columns
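
A plain-Python sketch of reshaping wide data to long. The store sales figures and the wide_to_long helper are hypothetical; libraries such as pandas offer built-in equivalents.

```python
# Wide: one row per store, one column per month.
wide = [
    {"store": "A", "jan": 10, "feb": 12},
    {"store": "B", "jan": 7, "feb": 9},
]

def wide_to_long(rows, id_key, var_name, value_name):
    """Stack every non-identifier column into (id, variable, value) rows."""
    long_rows = []
    for row in rows:
        for key, value in row.items():
            if key != id_key:
                long_rows.append(
                    {id_key: row[id_key], var_name: key, value_name: value}
                )
    return long_rows

# Long: one row per store-month combination.
long_data = wide_to_long(wide, "store", "month", "sales")
for record in long_data:
    print(record)
```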

500

What is the difference between blending and joining?

  • Join: Combines tables at the row level

  • Blend: Combines after aggregation, keeping sources separate

500

Explain the Central Limit Theorem.

For large samples, the sampling distribution of the sample mean becomes approximately normal, regardless of the original distribution (provided its variance is finite).
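
A quick simulation sketch of the idea. It draws many samples from a strongly skewed exponential distribution (mean 1, standard deviation 1) and looks at the sample means. The sample size, number of samples, and seed are arbitrary choices for illustration.

```python
import random
import statistics

random.seed(0)       # arbitrary seed so the run is repeatable
SAMPLE_SIZE = 50     # observations per sample
NUM_SAMPLES = 2000   # number of samples drawn

# Each entry is the mean of one sample from a skewed distribution.
sample_means = [
    statistics.fmean([random.expovariate(1.0) for _ in range(SAMPLE_SIZE)])
    for _ in range(NUM_SAMPLES)
]

center = statistics.fmean(sample_means)  # expected to be near 1
spread = statistics.stdev(sample_means)  # expected near 1 / sqrt(50)

# For a normal distribution, about 68% of values lie within one
# standard deviation of the center.
share_within_one_sd = sum(
    abs(m - center) <= spread for m in sample_means
) / NUM_SAMPLES

print(center, spread, share_within_one_sd)
```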

500

What is a use case?

A scenario showing how a user interacts with a system.