Vocab
Vocab
Vocab
Vocab
Vocab
100

What is a crosstab chart?

It counts how many times combinations of values appear. Arrows show where that row in the data table would be counted in the chart.

100

What is a scatter chart?

Shows combinations of values from two columns

100

What is open data?

  • sharing data with others so they can can analyze it"

  • Open data is publicly available data shared by governments, organizations, and others

  • Making data open help spread useful knowledge or creates opportunities for others to use it to solve problems

100

What is big dtta?

Collect huge amounts of data so we can learn even more from it"

100

What is correlation?

a relationship between two pieces of data, typically referring to the amount that one

200

How are crosstab charts useful?

  • Finding the most / least common combinations of values in two columns

  • Finding patterns across two columns

  • Exploring two columns when one or both are strings.

200

How are scatter useful?

  • Seeing patterns and trends between two values

  • Numeric data with lots of different values

200

What is crowdsourcing?

Crowdsourcing is the practice of obtaining input or information from a large number of people via the Internet. Crowdsourcing offers new models for collaboration, such as connecting businesses or social causes with funding Both are examples of how human capabilities can be enhanced by collaboration via computing

200

What is data bias?

data that does not accurately reflect the full population or phenomenon being studied

200

What is citizen science?

Citizen science is research where some of the data collection is done by members of the public using own computing devices which leads to solving scientific problems.

300

Why are crosstab charts not useful?

  • If either column has too many values (the chart would be enormous)

300

Why are scatter not useful

Lots of repeated values

300

What is data filtering

choosing a smaller subset of a data set to use for analysis
ex: by eliminating / keeping only certain rows in a table

300

What did we learn about machine learning?

-artificial intelligence
-the extraction of knowledge from data based on algorithms created from training data

400

What is this? 

Crosstab

400

What is this?

Scatter

400

What is this?


Histogram 
400

When does Data need to be cleaned?

When data is incomplete, invalid, and multiple tables are combined into one



500

A town decides to publicize data it has collected about electricity usage around the city. The data is freely available for all to use and analyze in the hopes that it is possible to identify more efficient energy usage strategies.

Which of the following does this situation best demonstrate?

Open data

500

Which graphs are only useful for looking at one column of data?

Bar charts and histograms

500
What is metadata?

data about data