A teacher gives a math test aligned to county standards and uses it to evaluate students' understanding of numbers and counting.
What is a valid test
In a Kindergarten classroom, students have been working on rhyming during word work. After several days of instruction, the teacher gives an assessment on syllables. What is missing from this test?
What is vailidity?
Consistency
What is a one-word synonym for reliability in educational testing
A student's ability to count to 50 is assessed on Monday, and the student scores the same on Wednesday.
What is high test-retest reliability?
When assessing counting forwards and backwards from 10, you use a read-aloud and the students follow along whilst counting. Several students miscount because they cannot follow the story.
What is validity?
A teacher collects evidence to justify using letter writing scores to report mastery on the report card.
What is a validity argument
Students are taught 1:1 counting, and cross off and count. Assessment shows gains in counting arrays and circles, but not names of shapes, which haven’t been taught.
What is instructional sensitivity?
You have two assessment sheets. Both have pictures of up to 10 items scattered or in arrays. The difficulty, the size, and the arrangement of items are the same on each test. Tests are given on different days but there is no additional teaching.
What is the reliability/consistency of a test?
An assessment of writing numerals when dictated to the student has this.
What is an example of internal consistency reliability?
Two teachers score the same writing numerals task. One teacher counts reversals of numbers as correct, but the other teacher does not, giving a different score for the same paper.
What is reliability?
A teacher can say that a (named) student is able to identify all letter names after being given clear directions and shown upper and lower case letters in random order.
What is inference and interpretation?
Experienced teachers on the grade level collaboratively discuss each test item on an upcoming end-of-unit test and make judgments about each item's ability to evaluate instructional quality
What is persuasive evidence of instructional sensitivity?
You have two quizzes, both of which use pictures to assess the beginning sounds of these picture words. The same beginning sounds are assessed in each quiz, but the pictures are different.
What is alternate-form reliability?
Test A and Test B are being used to assess students' ability to count objects in scattered arrangements. Test A assesses numbers to 10, Test B assesses numbers to 20. Students' scores vary considerably between tests.
What is reliability?
Judgements are made about a student’s ability to write letters clearly and consistently, using a variety of tools and with different teachers to assess, using a clear rubric.
What is human judgment
Two tests are given to assess knowledge of positional words. The difficulty, the positional words being assessed, and the arrangement of items to show relative positions are the same on each test. The directions on the tests are long and wordy.
What is validity?
A teacher needs evidence that a test is reliable and instructionally sensitive. This person will provide the evidence in plain English for teachers and caregivers.
What is the role of an assessment expert or measurement specialist?