The Analytical Process
Visualizations
Fomula
Terms
What the What!?!
100

The first step in the analytical process.

What is "Plan"?

100

A visualization for displaying a frequency count.

What is a bar chart?

100

= Max - Min

What is the formula for calculating range?

100

Contingency Table 

What is a count of data displayed across two variables in a table format?

100

Probability

What is the long-run relative frequency of independent random events?

200

The second step in the analytical process.

What is "Do"?

200

A visualization for displaying data from a contingency table.

What is a segmented bar chart?

200

= CORREL

What is the formula for calculating the Correlation Coefficient (R) of a dataset?

200

Time Series

What is a set of data distributed over regular intervals of time?

200

R Square (R2)

What is the term used to describe the percentage of data explained by a relationship in a regression analysis?

300

Combining data in new ways.

How do you find unique insight?

300

A visualization for displaying a time series.

What is a line chart?

300

= (value – mean) / STDEV

What is the formula for calculating z-scores?

300

Visualization

What is a way of displaying the results of statistical analysis to communicate the story revealed by the data?

300

Error or Insight

What is the significance of an outlier in a dataset?

400

The third step in the analytical process.

What is "Report"?

400

A visualization for comparing multiple data sets.

What is a boxplot?

400

P(A and B) = P(A)*P(B)

What is the multiplication rule?

400

Multimodal

What is a histogram with multiple "humps" or peaks?

400

Residuals

What is the term for the variance from the expected values for variables in a regression analysis?

500

Categorical and Analytical

What are the two types of data?

500

A visualization for describing a single data set.

What is a histogram?

500

=LOG10(y)

What is a formula for re-expressing data?

500

Normal Distribution

What is the term for describing a unimodal and symmetrical distribution curve?

500

99.7%

What is the percentage of values that fall within 3 standard deviations of the mean in a normal distribution?