R Fun
Supermodels
Analytics Toolbox
Back to the Titanic
Famous Marist Grades
100

This is best function in R to quickly find the mean of all of your continuous variables.

What is summary(df)?

100

What is the equation of a line?

What is y=mx+b?

100

In the Titanic data, pclass was this kind of data.

What is ordinal?

100
This is the year that the Titanic Set sail

What is 1912?

100

This Marist Grad is the youngest coach ever of a winning superbowl team.

Who is Sean McVay?

200

This is the best function in R to find the mode for a single categorical variable.

what is table(df$variable)?

200

What is the general form of a linear model?

What is y=bo + b1x1 + b2x2...?

200

In the Titanic data, this is the best graphical option to demonstrate the relationship between survived (yes/no) and gender (male/female).

What is a 100% Stacked Bar Chart?

200

The most passengers were in this class of service.

What is third class?

200

This Marist Grad is currently the Chief political Anchor on Fox news.

Who is Brett Baier?

300

This is the one line of code needed to run a simple linear model.

What is

model<-lm(dependent~independent, data = df)

300

Given the following results of a linear model predicting the price of a car, write out the full model using the general form y=bo+b1x1+b2x2...

What is:

Price  = $23,727 - $209*age - $0.07*miles

300

In the USED CAR dataset, "Fuel Type" (e.g., gas, electric, diesel) is a categorical variable.  This is the best graphical option to display this data.

What is a bar chart?

300

There were three cities where the passengers got on the ship - "embarked".  Name one.

What are Southhampton, UK, Queenstown Ireland, Cherbourg, France?

300

This Marist Grad was an all American Safety at Notre Dame and currently starts for the Baltimore Ravens.

Who is Kyle Hamilton?

400

This is the function that you used to find the mean values of all of the continuous variables against the binary dependent (outcome) variable in the credit dataset. 

what is tapply?

tapply(util6,outcome,mean)

400

Given that a modeling exercise generates the model

Price  = $23,727 - $209*age - $0.07*miles

What is the price of the car if the age is 2 and the miles are 25000?

What is $21,559?

400

In the USED CAR dataset, both "Price" and "Miles" were continuous variables.  This is the best graphical option to demonstrate their relationship.

What is a scatterplot?

400

Approximately this percentage of passengers survived (within 5%).

What is 38%?  (33% to 43%)

400

This Marist Grad is the State Attorney General and is running for Governor.

Who is Chris Carr?

500
To view the results of your model, you would run this line of code (assume you titled the output of your model "model").

what is summary(model)?

500

Who is the highest paid model in the world?

500

In the CREDITFILE - the variable UTIL6 represented the utilization rate over the last 6 months and was continuous.  You created a variable "outcome" that was binary.  This is the best graphical option to represent their relationship.

What is a series bar chart?

500

This was the average age of Titanic Passengers (within 2 years).

What is 30?

500

This person attended (but did not graduate from) Marist.  He co-founded Augusta National and the Masters Golf Tournament.  He won all four majors.

Who is Bobby Jones?