What percentage of their time do data practitioners spend getting their data ready through cleaning?
60% to 80%
What is the process of examining datasets to draw conclusions about the information they contain?
Data Analysis
What type of question should be visualized using a pie chart?
Yes/No Question
Why is it crucial to avoid overwhelming reports with too many visuals?
The fewer graphs a report has, the more effective it is.
What are quotes used for in qualitative data analysis?
To illustrate and support the identified trends and themes with direct responses from participants.
What is the process of identifying and correcting erroneous data within a dataset in preparation for analysis called?
Data cleaning
What is the term for processed data that provides context and meaning?
Information
What is the best visual for a multiple-choice question with several possible answers?
Clustered Column Chart
What should be linked to the graphs in a report for better understanding?
Interpretation of the data
In qualitative data analysis, what is created to show the respondents' responses to each question?
A qualitative data entry matrix
What are common types of data errors? (Mention 4)
Missing Data, NaNs, Nulls, Unwanted Outliers, Irrelevant Observations, Structural Issues, Duplicate Data
What is knowledge in the context of data analysis? You can mention the term or example)
Knowledge is what we know and how we apply the information to help us reach our goals, such as making recommendations in reports
What is a clustered column chart useful for?
Comparing different groups or categories within a dataset.
Why is it important to provide context to data visualizations in reports?
To help the audience understand the significance of the data and its implications.
Name three of the four steps for qualitative data analysis.
Data Entry, Coding & Categories, Identify Trends, Quotes
What should be done with irrelevant observations in a dataset?
They should be removed to ensure the dataset's relevance and quality.
What is the importance of identifying trends, similarities, and differences in data? (Mention 4 out of five evaluation criteria)
It helps in arranging priorities, informing decision-making, and enhancing relevance, efficiency, effectiveness, impact, and sustainability.
How should data be visualized for a ranking question to highlight the most top priorities?
In Descending Order
What is the role of data interpretation in the decision-making process?
It transforms data insights into actionable recommendations.
During qualitative data analysis, what is the purpose of coding?
To categorize and organize qualitative data into meaningful themes.
What is the purpose of addressing structural issues during data cleaning?
To ensure the dataset is properly organized and formatted for analysis.
What is the goal of descriptive analysis?
To describe and summarize the main features of a dataset.
..............on your graph are crucial for better understanding. (Three guesses only!)
Data labels
What are the two coding rules highlighted in red in the PPT
Rule 1: Coding is up to You, you think these codes are important for a reason, if you’re unbiased and you’re close to the data.
Rule 2: Code as many words / sentences as possible.
In a qualitative data entry matrix, what does each row typically represent?
Each row represents a respondent's answers to the questions.