Validity
Validity and Teaching
Reliability
Reliability and Teaching
Which is Weakened?
100

A teacher gives a math test aligned to county standards and uses it to evaluate students' understanding of numbers and counting.

What is a valid test?

100

In a kindergarten classroom, students have been working on rhyming during word work. After several days of instruction, the teacher gives an assessment on syllables. What is missing from this test?

What is validity?

100

Consistency

What is a one-word synonym for reliability in educational testing?

100

A student's ability to count to 50 is assessed on Monday, and the student scores the same on Wednesday.

What is high test-retest reliability?

100

When assessing counting forwards and backwards from 10, you use a read-aloud and the students follow along whilst counting. Several students miscount because they cannot follow the story.

What is validity?

200

A teacher collects evidence to justify using letter writing scores to report mastery on the report card.

What is a validity argument?

200

Students are taught 1:1 counting and the cross-off-and-count strategy. Assessment shows gains in counting arrays and circles, but not in naming shapes, which hasn't been taught.

What is instructional sensitivity?

200

You have two assessment sheets. Both have pictures of up to 10 items, scattered or in arrays. The difficulty, the size, and the arrangement of items are the same on each test. The tests are given on different days, with no additional teaching in between.

What is the reliability/consistency of a test?

200

An assessment in which students write numerals dictated to them has this.

What is an example of internal consistency reliability?

200

Two teachers score the same numeral-writing task. One teacher counts reversed numerals as correct, but the other does not, so the same paper receives two different scores.

What is reliability?

300

A teacher can say that a (named) student is able to identify all letter names after being given clear directions and shown upper and lower case letters in random order.

What is inference and interpretation?

300

Experienced teachers on the grade level collaboratively discuss each test item on an upcoming end-of-unit test and make judgments about each item's ability to evaluate instructional quality.

What is persuasive evidence of instructional sensitivity?

300

You have two quizzes, both of which use pictures to assess the beginning sounds of these picture words. The same beginning sounds are assessed in each quiz, but the pictures are different.

What is alternate-form reliability?

300

Test A and Test B are being used to assess students' ability to count objects in scattered arrangements. Test A assesses numbers to 10, Test B assesses numbers to 20. Students' scores vary considerably between tests.

What is reliability?


400

Judgments are made about a student's ability to write letters clearly and consistently, using a variety of tools, with different teachers assessing against a clear rubric.

What is human judgment?

400

Two tests are given to assess knowledge of positional words. The difficulty, the positional words being assessed, and the arrangement of items to show relative positions are the same on each test. The directions on the tests are long and wordy. 

What is validity?

500

A teacher needs evidence that a test is reliable and instructionally sensitive. This person will provide the evidence in plain English for teachers and caregivers.

What is the role of an assessment expert or measurement specialist?
