Reliability vs. Validity Jeopardy Template

Definitions

Rule Breakers

Rules

Practices

Applications

100

This term describes whether a test actually measures what it's supposed to measure.

What is validity?

100

A test that cannot distinguish between well taught and poorly taught students lacks this.

What is instructional sensitivity.

100

The three types of reliability evidence are most definitely not__________________.

What is interchangeable.

100

Teachers must test in order to

What is identify students' covert knowledge or skills.

100

Each time a high-stakes test is constructed, those who construct it will prepare a test-development technical report seriously addressing _____________.

What is the topic of assessment validity.

200

When a math test requires advanced reading skills that aren't related to math ability, it lacks this quality.

What is validity?

200

The vast majority of educational tests we currently employ to evaluate instructional quality are unequivocally unfit for this function because _____

What is they cannot differentiate the impact of influencing internal and external factors on teaching quality.

200

Validity hinges on whether a score-based inference is both accurate and _______.

What is also contributes to the accomplishment of a particular test’s purpose.

200

Of the required Steps in the Building of an Educational Test’s Validity Argument, step 1 is _____________.

Collect score evidence, typically with the help of a rubric.

200

Teacher-made tests created by teachers who at least “think through” such validation steps will, almost always, turn out to be better tests than those produced less thoughtfully. True or False?

What is true.

300

This conception of consistency focused on the consistency among the test’s items in what they were measuring.

What is internal consistency reliability?

300

A test retest coefficient below this point is a cause for concern.

What is .70?

300

A synonym for reliability.

What is consistency?

300

Evidence is then collected regarding the degree to which a test-taker’s score appropriately contributes to accomplishing the test’s primary purpose.

What is Step 2?

300

The purpose for the test under consideration. More specifically, ... the specific decision(s) riding on a student’s test scores...helps... isolate the score-based inference to be made—

What is the score-based interpretation about a student that will, in turn, lead to an interpretation-based decision?

400

Inference-accuracy” evidence and the “purpose-accomplishment” evidence at hand, equal what?

What is a validation argument.

400

When a reading comprehension test requires advanced mathematics skills to interpret graphs and charts, it suffers from this problem where it measures unintended abilities instead of the target skill.

What is poor validity?

400

An indicator of consistency that can be used when focusing on the consistency of an individual student’s performance.

What is the standard error of measurement.

400

Using this method, you give the same test twice to the same group to see if scores are consistent over time.

What is test-retest reliability?

400

The fact that a person’s reading abilities often vary depending on the time of day when the reading is done, the nature of what’s read, or the reader’s perception regarding the importance of what’s read impacts what?

What is test reliability?

500

The most significant process in all of educational testing, culminates in the creation of a validity argument based on evidence of both the inference accuracy and the contribution of a test to the accomplishment of the chief purpose for which the test is used.

What is assessment validation?

500

Measuring reliability in the same way for all tests, including both classroom and high-stakes tests will ensure test reliability is high. True or False?

False - there are many reliability factors to consider, including the environment differences between a classroom and a testing facility.

500

What measurement of reliability is not concerned with test takers' scores, but is more concerned with what decisions are made what percentage of the time based on test takers' score?

What is Decision Consistence?

500

Controlling these external factors—like noise levels, lighting, temperature, and time of day helps reduce measurement error and improve test consistency. What does this impact?

What is reliability?

500

If parents want to know how likely their student is to score in the range of current results in the future, they are look at what measurement?

What is standard deviation?