Storage
Compression
Extraction
Charts
Vocab
100

What is a bit?

building block of storage (0/1)

100

Why is compression helpful?

Reduce bandwidth and save disc space (make it smaller without losing important info)

100

What are large data sets called?

big data

100

What does a bar chart do?

shows data using horizontal and vertical bars

100
Abstraction

reduces complexity by only focusing on the most important parts & hiding the irrelevant parts from the user

200

How many bits make a byte?

8

200

What is lossless compression?

reduce file size without losing any information

200

What are outliers?

data points that do not fit with the typical trend of the data set

200

What do scatterplots help determine?

Correlation

200

Transforming Data

editing or modifying data (ex: doubling every number/graphing data points)

300

What is analog data?

measured continuously (volume of music)

300

What is lossy compression?

getting rid of excess data for greater compression (low resolution picture)
300
What is metadata?

data about data (does not affect the data)

300

What does a correlation coefficient of r=-.2 signify?

a weak negative correlation

300

Cleaning Data

making data uniform w/o changing meaning (ex: correcting misspelled words)

400

What is digital data?

finite set of possibilities

400

What is run-length encoding?

replacing a long string of the same value with the value and the number of times it appears

400

What is data mining?

the process of going through big data to find useful information and patterns

400

What do histograms usually represent?

frequencies and ranges

400

Hexadecimal

used for RGB color codes & it uses Base 16 Conversion Charts: Binary and Hexadecimal

500
How is the binary system used?

using zeroes and ones where 2^x is to represent numbers

500
Does fewer bits mean less information?

No

500

What 4 challenges does data pose?

you have to clean it, there can be incomplete data, there can be invalid data, and you have to combine different data sources

500

What does correlation not show?

causation
500

ASCII Code

converts text to binary format

M
e
n
u