THE NUMBER THAT OCCURS MOST FREQUENTLY IN A DISTRIBUTION
MODE
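A minimal Python sketch of finding the mode, assuming a small made-up sample. The standard-library `statistics` module is used here only for illustration; the values are hypothetical.

```python
from statistics import mode, multimode

# Hypothetical sample data, chosen only for illustration.
values = [2, 3, 3, 5, 7, 3, 5]

# mode() returns the single most frequent value.
most_common = mode(values)

# multimode() returns every value tied for most frequent,
# in the order each was first seen.
all_modes = multimode([1, 1, 2, 2, 3])
```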
WHAT ALGORITHM USES R^2 AS A METRIC?
REGRESSION
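As a rough illustration, R^2 can be computed by hand as 1 minus the ratio of the residual sum of squares to the total sum of squares. The function name `r_squared` and the numbers below are assumptions made for this sketch.

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    # Squared error left over after the model's predictions.
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    # Squared error of always predicting the mean.
    ss_tot = sum((y - mean_y) ** 2 for y in y_true)
    return 1 - ss_res / ss_tot

# Hypothetical observed values and regression predictions.
y_true = [1.0, 2.0, 3.0, 4.0]
y_pred = [1.1, 1.9, 3.2, 3.8]
score = r_squared(y_true, y_pred)
```

A score near 1 means the predictions explain most of the variation in the observed values.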
WHAT IS THE BASIC TRAIN/TEST SPLIT?
80/20
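One way an 80/20 split might be sketched in plain Python, assuming the rows can be shuffled freely (no time ordering or grouping). The helper name `train_test_split` and its parameters are hypothetical and are not taken from any particular library.

```python
import random

def train_test_split(rows, test_fraction=0.2, seed=0):
    """Shuffle a copy of rows and hold out test_fraction of them."""
    shuffled = rows[:]
    # A seeded generator keeps the split reproducible.
    random.Random(seed).shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    # The first n_test shuffled rows become the test set.
    return shuffled[n_test:], shuffled[:n_test]

train_rows, test_rows = train_test_split(list(range(100)))
```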
WHAT PLOT GROUPS BY RELATIONSHIP?
DENDROGRAM
DATA POINTS THAT FALL IN A SYMMETRICAL, BELL-SHAPED CURVE
NORMAL DISTRIBUTION
WHAT ALGORITHM OFTEN USES LOGLOSS AS A METRIC?
LOGISTIC REGRESSION
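A minimal sketch of binary log loss, assuming labels of 0 or 1 and predicted probabilities for class 1. The function name `log_loss` and the clipping constant `eps` are choices made for this example.

```python
import math

def log_loss(y_true, p_pred, eps=1e-15):
    """Average negative log-likelihood of binary labels."""
    total = 0.0
    for y, p in zip(y_true, p_pred):
        # Clip probabilities so log(0) is never evaluated.
        p = min(max(p, eps), 1 - eps)
        total += y * math.log(p) + (1 - y) * math.log(1 - p)
    return -total / len(y_true)

# Hypothetical labels and predicted probabilities.
loss = log_loss([1, 0, 1], [0.9, 0.1, 0.8])
```

Lower is better: confident predictions that turn out wrong are penalized heavily.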
HOW MANY TIMES IS CROSS-VALIDATION RUN?
K TIMES
What are the five values needed to make a box-and-whisker plot?
Minimum, 1st quartile, median, 3rd quartile, maximum
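A short Python sketch of the five-number summary. Quartile conventions differ between tools, so the `method="inclusive"` choice below is an assumption; other conventions can give slightly different quartiles on the same data.

```python
from statistics import quantiles

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum)."""
    # n=4 asks for the three cut points that divide the data into quarters.
    q1, q2, q3 = quantiles(values, n=4, method="inclusive")
    return min(values), q1, q2, q3, max(values)

# Hypothetical data.
summary = five_number_summary([1, 2, 3, 4, 5, 6, 7, 8, 9])
```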
HOW SPREAD OUT THE DATA ARE; I.E. HOW DIFFERENT THE VALUES ARE
VARIABILITY
WHAT ALGORITHM IS APPLIED IN DEEP LEARNING?
NEURAL NETWORKS
WHAT IS K USUALLY?
5 OR 10
WHAT ARE THE DENDROGRAM BRANCHES CALLED?
CLADES
HOW TO DESCRIBE A RELATIONSHIP WHEN HIGH LEVELS OF ONE VARIABLE ARE RELATED TO LOW LEVELS OF ANOTHER VARIABLE
NEGATIVE RELATIONSHIP
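A negative relationship shows up as a correlation coefficient below zero. The sketch below computes Pearson's r by hand; the variable names and data are invented for illustration.

```python
def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    # Sum of products of deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sum((x - mean_x) ** 2 for x in xs) ** 0.5
    spread_y = sum((y - mean_y) ** 2 for y in ys) ** 0.5
    return cov / (spread_x * spread_y)

# Hypothetical data: as one variable rises, the other falls.
hours_of_tv = [1, 2, 3, 4, 5]
exam_score = [90, 85, 70, 65, 50]
r = pearson_r(hours_of_tv, exam_score)
```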
WHAT ALGORITHM USES RULES TO CLASSIFY?
DECISION TREES
WHAT'S THE TECHNICAL NAME OF THE ERROR RATE?
MISCLASSIFICATION RATE
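The misclassification rate is the fraction of predictions that do not match the true label, which is 1 minus accuracy. A minimal sketch with made-up labels:

```python
def misclassification_rate(y_true, y_pred):
    """Fraction of predictions that differ from the true labels."""
    wrong = sum(1 for t, p in zip(y_true, y_pred) if t != p)
    return wrong / len(y_true)

# Hypothetical labels: one of the five predictions is wrong.
rate = misclassification_rate(
    ["a", "b", "a", "b", "a"],
    ["a", "b", "b", "b", "a"],
)
```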
WHAT PACKAGE IS USED IN R FOR DENDROGRAMS?
GGDENDRO
RANGE OF VALUES WITHIN WHICH THE PARAMETER IS ESTIMATED TO LIE, AT A SPECIFIED CONFIDENCE LEVEL, E.G., A 95% CI
CONFIDENCE INTERVAL
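A rough sketch of a 95% confidence interval for a mean, assuming the normal approximation (mean plus or minus z times the standard error). For small samples a t-based interval would usually be wider; the function name and sample values are hypothetical.

```python
from statistics import NormalDist, mean, stdev

def mean_confidence_interval(sample, confidence=0.95):
    """Normal-approximation confidence interval for the sample mean."""
    # For 95% confidence this z value is about 1.96.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    center = mean(sample)
    # Standard error of the mean: sample standard deviation / sqrt(n).
    half_width = z * stdev(sample) / len(sample) ** 0.5
    return center - half_width, center + half_width

# Hypothetical measurements.
sample = [4.8, 5.1, 5.0, 4.9, 5.2, 5.0, 5.1, 4.9]
low, high = mean_confidence_interval(sample)
```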
WHAT ALGORITHM BUILDS AN ENSEMBLE OF DECISION TREES?
RANDOM FOREST
WHAT ARE THE STEPS OF CROSS-VALIDATION?
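1) DIVIDE THE OBSERVATIONS INTO K GROUPS, OR FOLDS, OF APPROXIMATELY EQUAL SIZE
2) THE FIRST FOLD IS TREATED AS A VALIDATION SET, AND THE MODEL IS FIT ON THE REMAINING K - 1 FOLDS
3) THE PROCEDURE IS REPEATED K TIMES, EACH TIME WITH A DIFFERENT FOLD AS THE VALIDATION SET, AND THE K ERROR ESTIMATES ARE AVERAGED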
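The steps above can be sketched as a generator that yields one training/validation split per fold. This is an illustrative sketch only: the name `k_fold_indices` is hypothetical, and the indices are not shuffled, which real workflows often do first.

```python
def k_fold_indices(n_samples, k):
    """Yield (training, validation) index lists, once per fold."""
    indices = list(range(n_samples))
    # Spread any remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    start = 0
    for size in fold_sizes:
        # The current fold is held out as the validation set.
        validation = indices[start:start + size]
        # Every other fold is used for training.
        training = indices[:start] + indices[start + size:]
        yield training, validation
        start += size

# With 10 observations and k = 5, the procedure runs 5 times.
folds = list(k_fold_indices(10, 5))
```

A model would be fit on each `training` list and scored on the matching `validation` list, and the k scores averaged.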
List at least 2 libraries in R that are used for data visualization
ggplot2, lattice, leaflet, highcharter, RColorBrewer, plotly, sunburstR, rgl, dygraphs