Validity
Types of Validity
Reliability
Types of Reliability
100
Mr. Stone gives a social studies test that accurately measures what he wants it to measure, depicting this term.

What is validity?

100

Mr. Brown creates a science test based on the science standards covered in class the past two weeks, depicting this type of validity.

What is content validity?

100

Ms. Fisher gives a universal math screener every two weeks that produces similar results for students each time, representing this term. 

What is high reliability?

100

Ms. Egleston has Tommy take the same science test twice in two days; his scores are almost the same, depicting this type of reliability.

What is test-retest reliability?

200

Mrs. Boseman saw her students’ achievement on a reading comprehension quiz and assumed that students who struggled on the quiz will also struggle understanding more difficult texts, depicting this term.

What is inference or interpretation?

200

Tasha's SAT math results indicate she will have success in a STEM field career in the future, depicting this type of validity.

What is predictive validity?

200

Mr. Ivy's test results varied greatly depending on whether his students took the test during 1st period or 7th period, depicting this.

What is low reliability?

200

The same group of students take the STAR Math test in early August and a similar version of the test again in late August; the results are similar and have a high correlation, depicting this type of reliability.

What are parallel-forms reliability?

300

Mrs. Thompson evaluates Stella's presentation for her class and gives her a certain number of points for different categories using this evaluation tool.

What is a rubric?

300

Ms. Greene briefly glances over an ELA test and says it looks like it does a good job of assessing reading comprehension, depicting this type of validity.

What is face validity?

300

Ms. Jones sees that most students' test science scores are similar, resulting in a low measure of this. If students scores varied greatly instead, it would be a much higher measure of this. 

What is standard deviation?

300

A 20 question algebra test is given to students; the teacher compares the results of the test's odd questions and even questions and determines a high correlation, depicting this type of reliability.

What is split-half reliability?

400

Mrs. Carswell maintains that the district should choose an assessment instrument that covers reading skills (fluency, reading comprehension, etc.) taught during the school year to measure students' reading ability, depicting this term.

What is validity argument?

400

Ms. Cunningham's new Georgia studies test results exhibit strong correlation with the district assessment results; both assessments were taken at the same time, depicting this type of validity.

What is concurrent validity?

400

Bella correctly answers 40 out of 50 questions on a test. If Bella takes the same test again and correctly answers about the same number of questions (give or take a few), her results depict this indicator of consistency.

What is standard error of measurement?

400

Mr. Devon creates an art assessment; from the results, he finds that Cronbach’s alpha is high (>.80), depicting this type of reliability.

What is internal consistency reliability?

500

Mrs. Smith changes her science instruction this week based on her students’ pop quiz results from last week so she can reteach her students the topics they did not understand, depicting this term.

What is instructional sensitivity?

500

An ELA test that measures reading comprehension has weak correlation with a math test that measures a student's ability to solve equations, depicting this type of validity.

What is discriminant validity?

500

Students take two alternate and parallel forms of a math exam over two consecutive days. 85% of the students either pass or fail both times, yielding a 15% measure of this.

What is the decision-consistency index?

500

Mrs. Barker gives a 10 question test over volumes of 3D shapes on Wednesday; she gives a 10 question test over volumes of 3D shapes on Friday. Tests have different questions, but all are multiple choice and have medium difficulty. Both tests produce similar results, depicting this type of reliability.

What is alternate form reliability?