Validity and Reliability in Assessment

Am I Valid?

Valid or Vibes?

You Can Rely on Me

Old Reliable

100

Any test or assessment can never be this.

What is completely valid?

100

This is considered the most important concept to consider when creating an assessment.

What is validity?

100

As a pair, this is considered the second most important concept in the creation of assessments.

What is reliability?

100

These types of assessments tend to have the highest measures of reliability.

What are high-stakes standardized tests?

200

The overt performance of a student on an assessment is meant to measure this.

What is their covert understanding of the content?

200

A conclusion about a test-taker's knowledge and skills.

What is an inference or interpretation?

200

Another term for reliability.

What is consistency?

200

A measure of whether a different version of an assessment is reliable.

What is alternate-form reliability?

300

This terms means that an assessment, as best as possible, measures the intended purpose.

What is accurate?

300

A measurement instrument for assessment that goes beyond right or wrong.

What is a rubric?

300

A statistical measurement of how closely related paired data is to determine relationships (in this case between test scores).

Correlation Coefficient

300

A consideration of whether or not test items consistently measure their intended purpose.

What is internal consistency reliability?

400

A description of how well an assessment accomplishes the measurement it attempts to measure.

What is fitness of purpose?

400

These types of assessments typically go through a much higher standard of validity argumentation than classroom tests.

What are standardized tests?

400

If the same assessment is given at different times, you are attempting to measure this.

Test-Retest Reliability

400

A consideration of reliability where binary performance is measured as opposed to a correlation analysis.

What is decision consistency estimates?

500

The formal process of determining if an assessment accomplishes the chief purpose for which it is being used.

What is Assessment Validation.

500

A description of how well an assessment measures the quality of instruction given to students taking it.

What is instructional sensitivity?

500

A correlation coefficient of paired scores lower than this number indicates that an assessment may lack reliability.

What is 0.7?

500

A reliability measurement where individual test-takers are given an estimated range of scores as measured by summary statistics.

What is standard error of measurement?