What is the difference between Big Data and Small Data?
Small = simple, manageable. Big = huge, fast, complex.
Which V refers to the amount of data?
Volume
What does IoT stand for?
Internet of Things
What are the 3 C’s of Data Quality?
Consistency, Completeness, Correctness
Name a dashboard tool.
Tableau, Power BI, Looker, Qlik, Domo
Give one reason Big Data is challenging to process.
Storage, speed, privacy, security.
Which V deals with the speed of data generation?
Velocity
Name one IoT challenge.
Security, privacy, compatibility, huge data volumes.
Why does poor-quality data cause problems?
Leads to bad decisions, errors, misleading results.
What is the purpose of a dashboard?
Communicate insights clearly + support decisions.
Name two industries heavily using Big Data.
Healthcare, retail, banking, marketing, tech, etc.
Which V refers to data formats like videos, text, images?
Variety
What is Data Mining?
The process of discovering patterns, correlations, trends, or predictions in large datasets using statistics and machine-learning methods.
What is Data Governance?
Rules for managing data ethically and properly.
One challenge of dashboards?
Bad data, too many KPIs, slow updates.
What makes Big Data “big”?
Size, speed, variety, and complexity.
Define Veracity.
Data accuracy + trustworthiness.
Name one real-world example of data mining.
Fraud detection, product recommendations, hospital risk predictions.
Name two components of a Data Governance framework.
Standards, roles, security, privacy, metadata, auditing.
What is Data Retention?
Rules for how long data is kept + when it’s deleted.
What is integration?
challenge that involves making sure data from multiple sources matches and works together.
What are the 5 V's of Big Data?
Volume, Velocity, Variety, Veracity, and Value
Name and describe one data mining technique.
Classification, clustering, regression.
How does strong data governance prevent misleading insights?
Ensures clean, accurate data and proper analysis.
What must be included in a Data Retention Policy?
Data types, timeframes, storage, legal rules, deletion procedures.