Most common channel for data collection
What is internal acquisition?
Well-organized info that is often stored in spreadsheets
What is structured data?
Type of variable that includes names or symbols of objects
What is a categorical variable?
Statistical method that focuses on discovery and exploration
What is data mining?
Used to describe trends over time
What is a line chart?
Data stored across a network of computer servers
What is a distributed file system?
A form of storage used for unconventional and unstructured data
What is a data lake?
Another name for categorical variables
What are a nominal variables?
Gives computers the ability to learn without being programmed
What is machine learning?
Similar to a bar chart with no margin between bars
What is a histogram?
Outsourcing tasks to a remote and distributed workforce
What is crowdsourcing?
Large amounts of information that defy conventional methods of processing
What is big data?
Type of variable that categorizes values in a meaningful sequence
What is an ordinal variable?
Machine learning that uncovers patterns between inputs/outputs
What is supervised learning?
Used to show relationships between variables
What is a scatterplot?
Info mined from non-traditional sources
What is alternative data?
Defines where data can be placed inside relational databases
What is a schema?
Type of variable that is expressed and processed mathematically
What is a numeric variable?
Type of analytics that compresses info into easily readable format
What is descriptive analytics?
Used for displaying the distribution of a set of continuous data
What is a box plot?
Collecting info from the web using code and automation
What is web scraping?
A network of connected servers
What is a node?
Binary value that produces one of two set outcomes
What is a Boolean variable?
Machine learning that achieves specific output through random trial
What is reinforcement learning?
Shows correlation between variables as colors in a matrix
What is a heatmap?