Validity vs. Reliability

Validity

Reliability

Class Connections

Errors

100

This term describes how well a test measures what it claims to measure

What is validity?

100

If a test gives students the same score again and again, but the score is wrong, it illustrates this danger of relying on reliability alone.

What is high reliability without validity?

100

A classroom quiz that uses poorly written questions might produce inconsistent scores, harming this important testing quality.

What is reliability?

100

Random factors—like a student being tired or distracted—that affect a student’s test score fall under this kind of measurement error.

What is systematic error?

200

Popham says this is the most important consideration in testing because it tells us whether our conclusions based on test scores are accurate.

What is validity evidence?

200

This type of reliability checks whether multiple graders score a student response the same way.

What is inter-rater reliability?

200

Teachers rely on test results to place students in groups or identify who needs support. This is why this form of test evidence must come before all others.

What is validity evidence?

200

A test written with confusing directions or misleading questions introduces this kind of threat to reliability.

What is item-related error?

300

A spelling test that ends up measuring handwriting neatness instead is suffering from this major validity issue.

What is construct-irrelevant variance?

300

This term refers to the consistency of test scores across time, forms, or raters.

What is reliability?

300

This tool helps teachers score performance tasks consistently, improving reliability.

What is a rubric?

300

This technical term describes the gap between a student’s true score and obtained score.

What is measurement error?

400

This type of validity asks whether the test content actually matches the skills or knowledge being assessed.

What is content validity?

400

According to Popham, a test cannot be valid without being at least somewhat this.

What is reliable?