Any test or assessment can never be this.
This is considered the most important concept to consider when creating an assessment.
What is validity?
As a pair, this is considered the second most important concept in the creation of assessments.
What is reliability?
These types of assessments tend to have the highest measures of reliability.
What are high-stakes standardized tests?
The overt performance of a student on an assessment is meant to measure this.
What is their covert understanding of the content?
A conclusion about a test-taker's knowledge and skills.
What is an inference or interpretation?
Another term for reliability.
What is consistency?
A measure of whether a different version of an assessment is reliable.
What is alternate-form reliability?
This terms means that an assessment, as best as possible, measures the intended purpose.
What is accurate?
A measurement instrument for assessment that goes beyond right or wrong.
What is a rubric?
A statistical measurement of how closely related paired data is to determine relationships (in this case between test scores).
Correlation Coefficient
A consideration of whether or not test items consistently measure their intended purpose.
What is internal consistency reliability?
A description of how well an assessment accomplishes the measurement it attempts to measure.
What is fitness of purpose?
These types of assessments typically go through a much higher standard of validity argumentation than classroom tests.
What are standardized tests?
If the same assessment is given at different times, you are attempting to measure this.
Test-Retest Reliability
A consideration of reliability where binary performance is measured as opposed to a correlation analysis.
What is decision consistency estimates?
The formal process of determining if an assessment accomplishes the chief purpose for which it is being used.
What is Assessment Validation.
A description of how well an assessment measures the quality of instruction given to students taking it.
What is instructional sensitivity?
A correlation coefficient of paired scores lower than this number indicates that an assessment may lack reliability.
What is 0.7?
A reliability measurement where individual test-takers are given an estimated range of scores as measured by summary statistics.
What is standard error of measurement?