Most common channel for data collection
What is internal acquisition?
Well-organized info that is often stored in spreadsheets
What is structured data?
Type of variable that includes names or symbols of objects
What is a categorical variable?
Statistical method that focuses on discovery and exploration
What is data mining?
Used to describe trends over time
What is a line chart?
Data stored across a network of computer servers
What is a distributed file system?
A form of storage used for unconventional and unstructured data
What is a data lake?
Another name for categorical variables
What are a nominal variables?
In data cleaning, these are one-off observations where, at a glance, they do not appear to fit within the data you are analyzing.
Similar to a bar chart with no margin between bars
What is a histogram?
Outsourcing tasks to a remote and distributed workforce
What is crowdsourcing?
Large amounts of information that defy conventional methods of processing
What is big data?
Type of variable that categorizes values in a meaningful sequence
What is an ordinal variable?
This type of DA allows easy interpretation of large volumes of data to identify new opportunities.
What is business intelligence?
Used to show relationships between variables
What is a scatterplot?
Info mined from non-traditional sources
What is alternative data?
Defines where data can be placed inside relational databases
What is a schema?
Type of variable that is expressed and processed mathematically
What is a numeric variable?
Type of analytics that compresses info into easily readable format
What is descriptive analytics?
Used for displaying the distribution of a set of continuous data
What is a box plot?
Collecting info from the web using code and automation
What is web scraping?
A network of connected servers
What is a node?
Binary value that produces one of two set outcomes
What is a Boolean variable?
This is the study of collection, analysis, interpretation, presentation, and organization of data.
What is statistics?
Shows correlation between variables as colors in a matrix
What is a heatmap?