Validity
Essentials
Reliability
Evidence
100

A teacher does this to determine students' knowledge.

What is testing/assessment?

100

A teacher creates a rubric to score students' research papers to assess their knowledge.

What is collecting a score-based inference?

100

A teacher gives a math test to one class but finds out that the test questions were leaked, so she gives an alternate test to the next class. The average scores for the two classes were consistent, with no significant difference.

What is reliability? 

100

A student who typically scores between 80% and 90% on history tests leaves early on test day with a stomach ache. He stays after school the next day to make up the test and scores 85%.

What is test-retest reliability?

200

An administrator claims that the test they are requiring is this type of test, which does not exist.

What is a valid test?

200

The teacher analyzes the research papers using the rubric to ascertain whether the rubric accurately measures what the research assignment is intended to assess.

What is collecting evidence of the inference's suitability for the test's purpose?

200

A school's academic coach claims that the new mathematics screening is this type of test, which does exist. 

What is a reliable test/assessment?

200

Another student who typically scores between 80 and 90 on history tests was absent the week of the history test. The teacher writes an alternate test for him so that the graded tests and scores can be returned to the rest of the class. The student scores an 85.

What is alternate-form reliability evidence?

300

A school district hires assessment experts and measurement specialists to determine this.

What is the validity of the district-wide assessments?

300

The teacher examines the evidence collected in the previous steps to determine whether her assessment matched her intended purpose and whether she can justify this to students, parents, and administration.

What is the generation of a reasoned validity argument based on evidence? 

300

The purpose of a kindergarten teacher's letter-sound screening was to determine students' letter name/sound correspondence. After completing the screening, the teacher has a clear picture of which students know their letter sounds and which students need interventions for this skill.

What is purpose-compatible reliability evidence?

300

The history test was designed to assess students' knowledge about the Civil War. All test questions address this topic, no points are deducted for grammatical errors, and none of the questions are written to assess other skills.

What is internal consistency reliability evidence?

400

An administrator uses test scores intended to measure student knowledge to evaluate the effectiveness of teachers. 

What is invalid testing?