Assumptions
Multivariate Methods
Famous Statisticians
Miscellaneous
Data Visualization
100

I calcualte Cpk on a data set - My data ______?

Normal Distribution

100

It’s the first type of modeling we all learn in Stat 101

Simple Linear Regression

100

Author of Design and Analysis of Experiments

Douglas C. Montgomery

100

If you take the "U" out of this action you get this software?

JMP

100

Chart that sounds like a dessert

Pie Chart

200

If my sample size is large enough then I can estimate

Population parameters

200

Principal component analysis (PCA) can be used with variables of any mathematical types: quantitative, qualitative, or a mixture of these types

FALSE

200

She was elected to be the first female member of the Royal Statistical Society

Florence Nightingale

200

Let's say the average teenager watches 2.5 hours of TV a day. You believe this number is much too low and want to do a hypothesis test to see if that claim can be supported. What would be the alternative hypothesis for the population mean of daily teenager TV hours be here?

Teenage TV hours > 2.5 hrs

200

What is an appropriate graphical summary for displaying the relationship between bivariate quantitative variables?

Scatter plot

300

I have determined that the variables in my model are not correlated. Therefore my model is _____ ?

Orthogonal 

300

When the variance of residual is the same for any value of X.

Homoscedasticity

300

David Salsburg published a popular science book entitled The Lady Tasting Tea, a book on this statisticians experiment and "novel" idea on randomization?

Ronald Fisher

300

A given regression equation of 2006 adults in a chosen city is: Predicted IQ = 95 + 6(Years of College Attended). If someone attended four years of college and has an IQ of 117, what is their residual value?

-2

300

The NBA scores are normally distributed. Approximately what percent of these scores will fall between one standard deviation below the mean and two standard deviations above the mean?

81.5%

400

Certain assumptions should be satisfied and checked with residual plots in order to make valid inferences in regression analysis. What are these assumptions?

Residuals will have a constant variance, be approximately normally distributed, and be independent of one another

400

What is this model?
X=TPT + E
Y = UQT + F

Where X is a matrix of predictors and Y is a matrix of responses etc

Partial Least Squares (PLS)

400

The first journal of mathematical statistics and biostatistics was founded by these two people

Francis Glaton & Karl Pearson

400

Which of the following statements is INCORRECT about the moment generating function (MGF)? For reference, MGFx(t) = E[exp(tX)].
a. The MGF does not always exist for every distribution
b. The MGF uniquely determines the distribution
c. The MGF is a positive function
d. When every moments of X exists, then the MGF of X exists

d. When every moments of X exists, then the MGF of X exists

400

What is a mosaic plot?

It is a graphical display of the cell frequencies of a contingency table in which the area of boxes of the plot are proportional to the cell frequencies of the contingency table.

500

For Chi-Square, the levels (or categories) of the variables are ________, in terms of the variables' relationship with each other

Mutually exclusive

500

Distance 1: ||a-b||2 = √ [∑i(ai-bi)2 ]
Distance 2: ||a-b||1 = ∑i|ai-bi|
These are two ways to calculate distances between two points in this analysis method

Hierarchical Clustering

500

The earliest book on statistics is the 9th-century treatise Manuscript on Deciphering Cryptographic Messages was written by this person?

Al-Kindi ( Shortened Name)

500

Name the number that is 3 more than 1/5 of 1/10 of 1/2 of 5000

53

500

What is a steamgraph?

It shows how the size or proportions of groups vary over time, with vertical width of the “stream” representing the size of that entity