Data Ethics
Statistics & Modelling Concepts
Data & AI Infrastructure
Data Privacy & Ethical Research Practices
Time Series & Model Interpretability
100

What are the three pillars of data ethics?

Fairness, Accountability, and Transparency.

100

What is it called when a model is too complex?

Overfitting.

100

What type of software interface allows different applications to communicate and integrate services?

APIs.

100

This term refers to the unethical or illegal use of personal or sensitive data, often violating privacy, security, or consent agreements.

Data misuse. 

100

This term refers to understanding and explaining how a machine learning model makes its predictions.

Interpretability.

200

What ethical concern arises from AI-generated work? 

Plagiarism.

200

What technique prevents overfitting? 

Regularization. 

200

What Python library is widely used for data analysis?

Pandas.

200

What political consulting firm was involved in a 2018 scandal for improperly harvesting Facebook user data? 

Cambridge Analytica.

200

What statistical technique is used to ensure models generalize well by preventing overfitting?

Regularization. 

300

What document, established by the Association for Computing Machinery, provides a framework for responsible professional conduct?

ACM Code of Ethics

300

What measures the effect of an independent variable?

Coefficient. 

300

What process involves identifying and resolving errors in software to ensure correct functionality?

Debugging. 

300

This 1979 report laid out ethical principles for research involving human subjects, focusing on respect, beneficence, and justice.

The Belmont Report.

300

This type of visualization highlights which features of an input had the most influence on an AI model’s prediction.

Heatmap. 

400

This California law created a state agency to enforce data privacy rights and strengthen consumer protections.

CPPA

400

What process involves selecting a subset of individuals from a population to draw conclusions about the entire group? 

Sampling Methods.

400

What AI research lab, originating from China, focuses on open-source large language models?

Deepseek.

400

This committee reviews and approves research proposals to ensure ethical standards when human participants are involved. 

Institutional Review Board (IRB).

400

What term describes machine learning models whose decision-making processes are not easily interpretable?

Black-box models.

500

This ethical challenge arises when AI-generated content, such as deepfakes or synthetic media, is used to deceive or manipulate public opinion.

Misinformation. 

500

What statistical technique determines the minimum sample size required for detecting an effect?

Power Analysis.

500

Developed by Meta, this family of open-weight large language models is named after a South American animal. 

LLaMA.

500

This process provides participants with all necessary study information, ensuring voluntary agreement with full understanding of risks.

Informed Consent.

500

In time series analysis, this term refers to when past values are used as predictors for future values, introducing delays. 

Lagging.

M
e
n
u