A collection of related data points or values, often organized in rows and columns.
What is a data set?
The practice of collecting and analyzing data over time to identify patterns or trends.
What is trend analysis?
The average value of a data set, calculated by summing all values and dividing by the number of values.
What is the mean (in statistics)?
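The calculation on this card can be sketched in a few lines of Python. The function name `mean` and the sample `scores` list are illustrative assumptions, not part of the original card.

```python
def mean(values):
    """Arithmetic mean: the sum of the values divided by how many there are."""
    if not values:
        raise ValueError("the mean of an empty data set is undefined")
    return sum(values) / len(values)


# Hypothetical data set, used only for illustration.
scores = [4, 8, 15, 16, 23, 42]
print(mean(scores))  # 108 / 6 = 18.0
```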
The right of individuals to control their personal information and how it is used.
What is data privacy?
The process of removing or correcting inaccuracies and inconsistencies to improve data quality.
What is data cleansing?
Qualitative data describes characteristics, while quantitative data can be measured and expressed numerically.
What is the difference between qualitative and quantitative data?
The representation of data in graphical or pictorial format to make the information easier to understand.
What is data visualization?
Data points that differ significantly from other observations in a data set.
What are outliers?
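One common way to flag outliers is the 1.5 × IQR rule, sketched below. This is only one convention among several; the function name `find_outliers`, the multiplier default, and the sample `readings` are assumptions made for illustration.

```python
import statistics


def find_outliers(values, k=1.5):
    """Return values outside [Q1 - k*IQR, Q3 + k*IQR] (the 1.5 x IQR rule by default)."""
    q1, _median, q3 = statistics.quantiles(values, n=4)  # quartile cut points
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical readings; 95 sits far from the rest of the observations.
readings = [10, 12, 11, 13, 12, 11, 95]
print(find_outliers(readings))  # [95]
```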
The process of removing personally identifiable information from data sets to protect privacy.
What is data anonymization?
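A minimal sketch of the idea, assuming records are plain dictionaries: drop the fields that directly identify a person. The field names in `PII_FIELDS` and the sample record are hypothetical, and removing direct identifiers alone may not be enough to prevent re-identification in practice.

```python
# Hypothetical list of directly identifying fields; a real list depends on the data.
PII_FIELDS = {"name", "email", "phone"}


def strip_pii(record, pii_fields=PII_FIELDS):
    """Return a copy of the record without the fields listed in pii_fields."""
    return {key: value for key, value in record.items() if key not in pii_fields}


# Made-up record for illustration only.
patient = {"name": "A. Example", "email": "a@example.com", "age": 34, "diagnosis": "flu"}
print(strip_pii(patient))  # {'age': 34, 'diagnosis': 'flu'}
```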
A numerical measure that indicates the strength and direction of a relationship between two variables.
What is a correlation coefficient?
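The most widely used correlation coefficient is Pearson's r, which ranges from -1 (perfect negative linear relationship) to +1 (perfect positive linear relationship). The sketch below computes it from scratch; the function name `pearson_r` and the sample data are assumptions for illustration.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of cross-deviations and sums of squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical data: y rises in lockstep with x, so r is 1.
print(pearson_r([1, 2, 3, 4, 5], [2, 4, 6, 8, 10]))  # 1.0
```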
A document that describes the structure, attributes, and relationships of data elements in a database.
What is a data dictionary?
To determine the relationship between variables and predict outcomes.
What is regression analysis used for?
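As a sketch of the simplest case, the code below fits a straight line to paired data with ordinary least squares and uses it to predict a new value. Simple linear regression is just one kind of regression analysis; the function name `fit_line` and the sample data are assumptions for illustration.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("cannot fit a line when all x values are identical")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data that follows y = 2x + 1 exactly.
slope, intercept = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
print(slope, intercept)       # 2.0 1.0
print(slope * 5 + intercept)  # predicted y at x = 5: 11.0
```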
A determination that a result is unlikely to have occurred by chance, typically assessed with a p-value.
What is statistical significance?
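To make the p-value concrete, here is a small worked example: the exact one-sided probability of seeing at least 9 heads in 10 flips if the coin is fair. The coin-flip scenario, the function name `binomial_p_value`, and the 0.05 threshold mentioned afterwards are illustrative assumptions, not part of the original card.

```python
from math import comb


def binomial_p_value(successes, trials, p_null=0.5):
    """One-sided p-value: P(X >= successes) for X ~ Binomial(trials, p_null)."""
    return sum(
        comb(trials, k) * p_null**k * (1 - p_null) ** (trials - k)
        for k in range(successes, trials + 1)
    )


# Hypothetical experiment: 9 heads observed in 10 flips of a supposedly fair coin.
p_value = binomial_p_value(9, 10)
print(p_value)  # 11/1024, about 0.0107
```

With the conventional (but arbitrary) threshold of 0.05, this result would be called statistically significant, because a fair coin would produce an outcome this extreme only about 1% of the time.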
An EU regulation designed to protect the privacy and personal data of individuals within the EU.
What is the General Data Protection Regulation (GDPR)?