Fun with Formats
Legal Eagle
Uses and Abuses
____tion Nation
Potpourri
100

This Microsoft-owned format is great for data analysis, but not so great for data storage or sharing

XLSX, XLS, or Excel


100

By default, we release the data assets we fund under this open license

Creative Commons Attribution or CC BY

100

One of the Three Rs, we use open data licenses and formats so that others can do this with the data we fund

Reuse

100

Removing names and addresses from a dataset is an important step toward this privacy-protecting goal

Anonymization or de-identification


100

Determining if we'll collect a data asset starts at this phase of the grant process

Concept Note

200

Oxford or not, this file type is the best open format to store and share tabular or spreadsheet data

CSV or Comma-Separated Values file

200

When a file format is owned and controlled by a company, it's _______

Proprietary

200

If you fund a training dataset to teach a computer to recognize a pattern, you're using data to fuel ______

Innovation

200

If two funders share a dataset describing where they're building mini grids, they're using data for ______

Coordination

200

In today's world, data has real value. That is why we call the data we fund a(n) ______

An asset or data asset

300

This open format sounds a lot like a famous movie killer, but it's not scary at all for data scientists


JSON or JavaScript Object Notation


300

If you need proprietary data to reach your initiative goals, you can request this from Data & Tech and Legal

A waiver

300

New data challenging a conventional norm can spark a ______

Conversation


300

If you know the who, what, when, where, and how of a dataset's creation, you've probably read the _______

Documentation

300

By the end of 2018, each RF initiative should collaborate with the Data & Tech team to draft this document

Data strategy

400

This popular geographic data format consists of multiple files stored together in one folder

Shapefile or SHP


400

For the openest of open data, use this license

CC0, CC Zero, or Creative Commons Zero


400

Names, photos of faces, and DNA are all examples of this type of sensitive information

Personally Identifiable Information or PII

400

When you give credit to the original data collector, you're practicing this important process

Attribution

400

This type of data is data about data (whoa)

Metadata

500

This common document format, though very human-readable, is a nightmare for data analysts

PDF

500

Don't want pesky profit seekers using your data to make money? Use this license

CC BY-NC or Creative Commons Attribution-NonCommercial

500

One way to protect individual privacy is to do this to the responses at the state or district level

Aggregate

500

If you have to verify your identity to access data, the data owner set up this type of system

Authentication

500

If you want to serve up real-time, dynamic data, you need to build this type of interface

An API or Application Programming Interface