Data Analysis Definitions
Data Science Lifecycle
SCRUM Events
Excel Functions
Excel Fx pt 2
100

The process of cleaning and unifying messy and complex data sets for easy access and analysis.

What is Data Wrangling?

100

Determining what the business needs from your analysis.

What is Business Understanding?

100

Scrum divides a larger project into a series of mini-projects, each with a consistent, fixed length of up to one month.

What is a Sprint?

100

Tests whether every condition you give it is true, for example whether a number is greater than, less than, or equal to another value. Returns TRUE or FALSE (a Boolean).

What is AND?

=AND(logical1, [logical2], ...)

100

Counts the cells in a range that are not empty, whether they hold numbers, text, or other values.

What is COUNTA?

=COUNTA(value1, [value2], ...)

200

Removing data that will not help in your analysis because it contains errors or misinformation.

What is Data Cleaning?

200

Determining what data you have/need and whether it needs to be cleaned.

What is Data Understanding?

200

The product owner explains the top backlog items (features). The development team then forecasts what it can deliver by the end of the sprint.

What is Sprint Planning?

200

Returns whatever result you specify for each case when you want something other than TRUE or FALSE.

What is AND using IF?

=IF(logical_test, [value_if_true], [value_if_false])
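In Excel the two functions are typically nested, for example =IF(AND(A1>=0, A1<=100), "valid", "invalid"). The same logic can be sketched in Python; the helper names `excel_and` and `excel_if` and the sample score are hypothetical, chosen only for illustration.

```python
def excel_and(*conditions):
    """Mimic AND: True only when every condition is true."""
    return all(conditions)


def excel_if(logical_test, value_if_true, value_if_false):
    """Mimic IF: pick one of two results based on a Boolean test."""
    return value_if_true if logical_test else value_if_false


score = 72  # hypothetical cell value

# AND alone yields a Boolean; wrapping it in IF swaps in custom labels.
is_in_range = excel_and(score >= 0, score <= 100)
label = excel_if(is_in_range, "valid", "invalid")
print(is_in_range, label)
```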

200

Returns the middle value in the data.

What is MEDIAN?

=MEDIAN(number1, [number2], ...)

300

Taking your raw data and transforming it into a form you can work with.

What is Structuring?

300

Organizing your data for modeling.

What is Data Preparation?

300

The team coordinates closely and develops a plan for the day.

What is the Daily Scrum (standup)?

300

Calculates the average of the cells in a range that meet a single condition.

What is AVERAGEIF?

=AVERAGEIF(range, criteria, [average_range])

300

Returns the number that occurs most often in the data.

What is MODE?

=MODE.SNGL(number1, [number2], ...)
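A quick way to sanity-check a MEDIAN or MODE.SNGL result outside Excel is Python's standard `statistics` module. This is a minimal sketch; the sample values are made up and have exactly one most-frequent value, so tie-handling is not exercised.

```python
import statistics

# Hypothetical worksheet values.
values = [1, 3, 3, 6, 7, 8, 9]

middle_value = statistics.median(values)   # middle of the sorted data
most_common = statistics.mode(values)      # value seen most often

print(middle_value, most_common)
```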

400

The process of looking at the data you currently have and deciding whether you need to add to it.

What is Enriching?

400

Determining which model from your analysis meets business objectives.

What is Evaluation?

400

At the end of the sprint, the team demonstrates the increments to stakeholders and solicits feedback. These increments should be potentially releasable and meet the pre-defined definition of done.

What is Sprint Review?

400

Calculates the average of a range based on one or more true/false conditions.

What is AVERAGEIFS?

=AVERAGEIFS(average_range, criteria_range1, criteria1, ...)
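The relationship between AVERAGEIF (one condition) and AVERAGEIFS (one or more conditions) can be sketched in Python. The function `average_ifs`, the column names, and the data are hypothetical; Excel expresses criteria as strings such as ">=10", whereas this sketch uses plain Python predicates.

```python
def average_ifs(average_range, *criteria_pairs):
    """Average the entries whose row satisfies every (range, predicate) pair."""
    selected = [
        value
        for i, value in enumerate(average_range)
        if all(predicate(criteria_range[i])
               for criteria_range, predicate in criteria_pairs)
    ]
    # With no matching rows this raises ZeroDivisionError;
    # Excel reports #DIV/0! in the same situation.
    return sum(selected) / len(selected)


# Hypothetical columns of equal length.
regions = ["East", "West", "East", "East"]
units = [10, 20, 30, 5]
prices = [4.0, 6.0, 8.0, 9.0]

# One condition, as AVERAGEIF would apply it.
east_avg = average_ifs(prices, (regions, lambda r: r == "East"))

# Two conditions, as AVERAGEIFS would apply them.
east_bulk_avg = average_ifs(
    prices,
    (regions, lambda r: r == "East"),
    (units, lambda u: u >= 10),
)
print(east_avg, east_bulk_avg)
```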

400

Calculates the standard deviation of a data set that represents the entire population.

What is STDEV.P?

=STDEV.P(number1, [number2], ...)
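Population standard deviation divides the summed squared deviations by N, while the sample version (STDEV.S in Excel) divides by N - 1. A minimal Python sketch with made-up values shows the difference, using `statistics.pstdev` and `statistics.stdev` as stand-ins for the two Excel functions.

```python
import statistics

# Hypothetical data treated as the whole population.
values = [2, 4, 4, 4, 5, 5, 7, 9]

population_sd = statistics.pstdev(values)  # divides by N, like STDEV.P
sample_sd = statistics.stdev(values)       # divides by N - 1, like STDEV.S

print(population_sd, sample_sd)
```

For the same data the population figure is never larger than the sample figure, because it divides by the larger number.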

500

Preparing your analysis for release to the public.

What is Publishing?

500

How a client/business accesses the results of your analysis.

What is Deployment?

500

To close a sprint, the team reflects and plans for how it can improve in the next sprint.

What is Sprint Retrospective?

500

Counts the cells in a range that contain numbers.

What is COUNT?

=COUNT(value1, [value2], ...)
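COUNT and COUNTA differ only in what they count: COUNT tallies numeric cells, while COUNTA tallies every non-empty cell. The Python sketch below illustrates this under simplifying assumptions; the helpers `count` and `counta` are hypothetical, and `None` stands in for an empty cell.

```python
def count(cells):
    """Mimic COUNT: numeric cells only (logical values are skipped)."""
    return sum(
        1 for cell in cells
        if isinstance(cell, (int, float)) and not isinstance(cell, bool)
    )


def counta(cells):
    """Mimic COUNTA: every cell that is not empty."""
    return sum(1 for cell in cells if cell is not None)


# Hypothetical range: three numbers, two text cells, one empty cell.
cells = [10, "apple", None, 3.5, "n/a", 7]

numeric_cells = count(cells)
non_empty_cells = counta(cells)
print(numeric_cells, non_empty_cells)
```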

500

Calculates the sum of the values in a range that meet a single condition.

What is SUMIF?

=SUMIF(range, criteria, [sum_range])
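The optional sum_range argument is what lets SUMIF test one column and add up another. A rough Python equivalent, with a hypothetical `sum_if` helper and made-up expense data, might look like this; as in the formula, omitting `sum_range` sums the tested range itself.

```python
def sum_if(criteria_range, predicate, sum_range=None):
    """Sum entries of sum_range whose matching criteria_range entry passes."""
    if sum_range is None:
        sum_range = criteria_range  # same fallback as SUMIF
    return sum(
        value
        for tested, value in zip(criteria_range, sum_range)
        if predicate(tested)
    )


# Hypothetical expense sheet.
categories = ["food", "rent", "food", "fuel"]
amounts = [25, 900, 40, 60]

food_total = sum_if(categories, lambda c: c == "food", amounts)
large_total = sum_if(amounts, lambda a: a > 50)
print(food_total, large_total)
```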
