W's, Categorical data, and Quantitative Data
Displaying Categorical Data
Distributions
Displaying and Summarizing Quantitative Data
100
List the W's and the function each one serves
Who was measured? What was measured? How was data collected? Where was the data collected? When was the study performed? Why was the study performed?
100
Construct a bar chart of the following data for popular movie genres of rented redbox movies Genre #of Rentals in one day Scary 3 Comedy 9 Drama 7 Family 8 Which genre was most common?
Must have: labels for axis, label categories, accurate chart, title
100
Page 41#29 part a and b
a) 9.3% b) 24.7%
100
Given the following 5-Number summary, Find the IQR, and the range Max 9.0 Q3 7.6 Median 7.0 Q1 6.6 Min 3.7
IQR = Q3-Q1 = 7.6 - 6.6 = 1 Range = max-min = 9.0 - 3.7 = 5.3
200
Identify the W’s and if the W is categorical or quantitative: Because of the difficulty of weighing a bear in the woods, researchers caught and measured 54 bears, recording their weight, neck size, length and gender. They hoped to find a way to estimate weight from the other, more easily determined quantities.
Who? 54 Bears What? Weight When? Not specified Where? Not specified Why? To estimate weight from easier to measure variables? How? Researchers collected data on 54 bears they were able to catch
200
The following is data shows the frequency of the ratings of movies that came out during Summer 2011. Rating Frequency G 8 PG 6 PG-13 14 R 7 Construct a pie chart based on this information
Pie charts must be labeled correctly, have a title, and show how a "whole" divides into categories by showing a wedge of a circle whose area corresponds to the proportion in each category
200
This distribution is of either variable alone. The counts or percentages are the totals found in the last row or column of the table.
What is marginal distribution?
200
In a distribution that has one high outlier as it's maximum, how will the following be affected? : Hint: INCREASE, DECREASE or REMAIN THE SAME mean median range IQR standard deviation Q1 Q3
With a high outlier in the distribution, Mean will increase Median will remain the same Range will increase IQR will remain about the same Standard deviation will increase Q1 and Q3 will remain the same
300
Identify the W’s and if the W is categorical or quantitative: In 2006, Consumer Reports published an article evaluating refrigerators. It listed 41 models, giving the brand, cost, size, type, annual energy cost, overall rating, and repair history for the brand
Who? 41 refrigerator models What? Brand, cost, size, type, energy cost, overall rating, repair history When? 2006 Where? United States Why? Provide information to readers of Consumer Reports How? N/a
300
Explain what a frequency table is
A frequency table lists the categories in a categorical variable and gives the count (or percentage if it is a relative frequency table) of observations for each category
300
This distribution is of a variable restricting the Who to consider only a smaller group of individuals.
What is the conditional distribution?
300
In a distribution with a low outlier as the minimum, how will the following be changed? (Increase, Decrease, Remain the same) Mean Median IQR Range Q1 Q3 Standard Deviation
Mean will decrease Median will remain the same IQR will remain the same Range will Increase Q1 and Q3 will remain the same Standard Deviation will increase
400
Identify categorical or quantitative: Political Party Grade Number in school Temperature Gallons of Gas Miles per gallon Minutes Gender
Political Party ~ categorical Grade Number in school ~Categorical Temperature ~Quantitative Gallons of Gas ~Quantitative Miles per gallon ~Quantitative Minutes ~Quantitative Gender ~Categorical
400
What is the area principle?
In a statistical display, each data value should be represented by the same amount of area.
400
The following shows results of a poll taken of what people are looking forward for the Super Bowl Game. Gender Male Female Total Game 279 200 479 Commercial 81 156 237 Won't Watch 132 160 292 Total 492 516 1008 How do the conditional distributions of interest in the commercials differ for men and women?
Men: 81/237 = 34.2% women 156/237 = 65.8% Women make up a szable majority of the adult Americans who look forward to seeing Super Bowl commercials more than the game itself. Nearly 66% of people who voiced a preference for the commercials were women, and only 34% were men
400
Describe the shape of a histogram that represents a nearly normal distribution
The histogram would have to be unimodal, symmetric
500
True or False If your variable’s values are numbers, then we can assume that it’s quantitative.
False- Categories are often given numerical labels. Don’t let that fool you into thinking they have quantitative meaning. Look at the context!
500
We use bar charts and pie charts to describe what type of data?
Categorical Data Make sure you know what the WHO is!
500
page 40 #19 parts b,c,d 2 minutes!
b) 31.7% c) 60% d) i:35.7% ii. can't tell iii. 0% iv. can't tell
500
IQR is usually reported with the mean for skewed distributions. True or False?
False 2 answers: (a)IQR is usually reported with MEDIAN for skewed distributions. or (b) STANDARD DEVIATION is usually reported with the mean for NORMAL distributions