Ch 3
Ch 4
Ch 5
Ch 6
R
100

"data is always predictive"

data effect

100

technology that learns from experience (data) to predict the future behavior of individuals

predictive analytics

100

this was the name of the competition created by Netflix to improve their recommendations to customers

Netflix Price

100

the name of the AI that shares its name with a famous sidekick 

Watson

100

use this symbol to add a comment in RStudio

#

200

this measures the strength of a linear relationship between two variables

correlation

200

this breaks down data into smaller and smaller groups

decision trees

200

this was the amount of teams that made up BellKor's Pragmatic Chaos

3

200
this was the embarrassing answer Watson gave to the Jeopardy category "U.S. Cities"

"What is Toronto?"

200

you use this function to plot data

ggplot

300

these ranges indicate a strong linear relationship

-1 to -0.8 and 0.8 to 1

300

this is the result of overtraining a model

overfitting

300

this was the practice Netflix used to get competitors with different perspectives and backgrounds rather than experts in predictive analytics specifically

crowdsourcing

300

this was the year the infamous "man vs machine" Jeopardy! game took place

2011

300

this operator is used to chain multiple functions together 

%>% (pipe)

400

"there's always so much more today than yesterday"

"big data"

400

this is a type of model that considers multiple factors about an individual

multivariate model

400

this combines the strengths and weaknesses from models to create one supermodel 

ensemble effect/ensemble model

400

using this score, Watson would either play more aggressive or more conservatively

confidence score

400

you use this symbol as an easier/quicker way to subset

$

500

emotions weren't thought to be able to be measured by predictive analytics until this...

social media

500

art (human creativity) drives machine learning

induction effect

500

this was the data Netflix gave to the competitors to use for the competition

movie ratings

500

the name of the project IBM's Watson was a part of

DeepQA

500

you use this function to force a linear relationship (linear regression)

geom_smooth(method = lm)