This does not equal causation
What is correlation?
Data about data
What is metadata?
The process of making data uniform with changing the meaning
What is cleaning data?
Abstractions of complex problems that mimic real-world events with the purpose of drawing hypotheses and investigating without the constraints of the real world.
What is a simulation?
Often a single source is not enough to answer the question so you must do this to data sources
What is combine?
If the data were an image, list 3 potential pieces of metadata
What is date of creation, file size, and author?
List three reasons why users may not submit uniform data
What is spelling, units, and capitalization?
Library used to simulate variability that exists in the real world
What is random?
An example of a program you can use to analyze data
What is a spreadsheet?
How primary data is affected if the metadata is changed
What is not at all?
Problems of bias are often created by the type or _____ of data being collected
What is source?
Because simulations are created by humans, they contain this
What is bias?
Programs and visuals can help us see these in data
What are patterns?
When locating a file in your file system, the metadata can help you find your file by doing this
What is filtering the data?
This can affect how much information can be extracted from a data set and how accurate the data is
What is sample size?
When experiments are this, it is better to use a simulation
What is impractical?
Looking at only certain data entries based on a condition
What is filtering data?
An example of a conclusion you could make by analyzing the metadata of a text file
Large data sets are difficult to process with a single computer and may require this
What is parallel processing?
An important part of designing a simulation is deciding what variables to ___
What is include and omit?