The process of cleaning and unifying messy and complex data sets for easy access and analysis.
What is Data Wrangling?
Determining what business needs from your Analysis.
What is Business Understanding?
Scrum divides the larger project into a series of mini-projects, each of a consistent and fixed-length up to one month in length.
What is Sprint?
If a number is greater than or smaller than another number or is equal to something.Produces “True” or “False” (AKA Boolean)
What is AND?
=AND([logical1], [logical2], ...)
Calculates number of cells used within a range that have both numbers and letters.
What is COUNTA?
=COUNTA(value1:value2)
Removing data that will not help in your analysis because it contains errors or misinformation.
What is Data Cleaning?
Determining what data you have/need and whether it needs to be cleaned.
What is Data Understanding?
Product owner explains the top Backlog Items (features). Then, the development team forecasts what they can deliver by the end.
What is Sprint Planning?
Produces whatever outcome you need it to read if you don’t want “True” or “False”
What is AND using IF?
=IF(logical_test, [value_if_true], [value_if_false])
Returns the middle value in the data.
What is MEDIAN?
=MEDIAN(value1:value2)
Take your raw data and transforming it to what you can work with.
What is Structuring?
Organizing your data for modeling.
What is Data Preparation?
When the team closely coordinates and develops daily plans.
What is Daily SCRUM(standup)?
Combine averages from different cells.
What is AVERAGEIF?
=AVERAGEIF(range, criteria, [average_range])
Used to find the number seen most times.
What is MODE?
=MODE.SNGL(value1:value2)
The process of looking at the data that you currently have and deciding whether you are needing to add to it.
What is Enriching?
Determining which model from your analysis meets business objectives.
What is Evaluation?
At the end of the sprint, the team demonstrates the increments to stakeholders and solicit feedback. These increments should be potentially releasable and meet the pre-defined definition of done.
What is Sprint Review?
Calculates the average of a range based on one or more true/false conditions.
What is AVERAGEIFS?
=AVERAGEIFS(average_range, criteria_range1, criteria1, ...)
Calculates the Standard Deviation for the entire population
What is STDEV.P?
=STDEV.P
Creating your analysis for the public.
What is Publishing?
How a client/business accesses the results of your analysis.
What is Deployment?
To close a sprint, the team reflects and plans for how it can improve in the next sprint.
What is Sprint Retrospective?
Calculates number of cells used within a range that have numbers.
What is COUNT?
=COUNT(value1:value2)
Calculates the sum of values in a range based on true/false conditions.
What is SUMIF?
=SUMIF(range, criteria, [sum_range])