SlideShare ist ein Scribd-Unternehmen logo
1 von 25
By: Lauren Chadwick
        ED 450


Monday September 9, 2012
Bellwork
O Please use a pen or
 pencil and draw 2
 appointment slots      Once you are finished
 on the plate I gave    open the folded paper
 you when you           and fill in the blank
 entered the            with the type of word
 classroom:             written on the outside.

     1    2
                        Put paper in box.
SLO’s
O Students will …


    O Understand reliability in assessment.
    O Describe the 3 types of reliability evidence.
    O Explain the importance of standard error of
      measurement.
Let’s have some fun!

       O Everyone get up and
        stand in a circle!
Reliability = Consistency
O Reliability is an     O 2 important ways
  important criteria     reliability may be
                         useful:
  used when judging
                            O How reliable are
  standardized tests.         your assessments
                              for you and your
                              students?
                            O Parents may ask
                              how reliable
                              standardized tests
                              may be.
3 Types of Reliability Evidence
  O Stability
  O Alternate Form
  O Internal Consistency
Stability Reliability
O Consistency of test   O Example: Midterm on
  results over time      Tuesday
O The goal is to see      O A masked man grabs your briefcase
  close scores in           containing the students’ tests!
                          O The next day you tell the students
  tests administered        that their tests were stolen and that
  even if they are          they need to re-take the test!
  given on different      O Because there has been no new
  occasions.                information taught since the last
                            test, you expect that the test scores
                            would be similar.

                          O So, if you were able to see that the
                            scores were not similar, then your
                            exam would be judged to have little
                            stability reliability.
Test-Retest
O To see how stable an assessment’s
  results are over time, we usually test
  students, then in a week or two retest with
  the same form of assessment.
O It is important though that there is no
  significant event taking place in between
  the two assessments. Example: field
  trip, movie, speaker- exposing student to test-relevant
  information.
So, how are scores computed?
Correlation Coefficient
          (r)
                                         So…..
O Reflects the degree of
                                       O …if the test–retest r
  similarity between
                                         for two sets of test
  scores on two tests/                   scores were .84 this
O Ranges from:                           would show that the
    +1.0 to -1.0                         students’ performance
                                         on both tests were
                                         similar.
(close to 1.0 = strong relationship)
(closer to 0 = weak relationship)      O If the r was instead
                                         .23 this would show
                                         that the results were
                                         not as similar.
Classification- Consistency
O To decide which students
                               Student Test 1   Test 2
  would be exempted from
  further assessment.          Jill    84%      99%
O Example: If you decide on    Jack    65%      82%
  an 80% answers correct
  required for
  exemption, then on a test-
  retest approach, the
  teacher would find the %
  of students who scored ≥
  80% on both
  assessments.
How to determine
 Classification-Consistency
A       % of students exempt
42%
B       % of students not exempt
46%
C       % of students different classification
12%
D       % of classification – consistency ( A + B)
-----------------------------------------------------------------------
-----------------------
                              88%
        Classification- Consistency of that test
Alternate-Form Reliability
O Simply put:            O Usually found in
                           settings where a
                           student is testing to
  O Providing an           receive his/her diploma.
    equivalent retest    O BUT keep in mind when
                           you are reviewing a
    for students in        state-developed test
    another form other     which claims to have
    than the first and     equivalent forms, that
    failed assessment.     those tests assess the
                           same information as the
                           original.
Internal Consistency
            Reliability
O Reflects the degree    O Example:
  of consistency of
  the items on a given   20 item test on problem
  test.                  solving:
     O Homogeneous
                         Students who are strong
O Each of the test’s     problem solvers should
                         get most of the 20 items
  items should be        correct
  assessing the same
  variable.
FYI
O Tests containing items     O Tests containing items
  which a students can         where students can
  either be right or wrong     be given different
  such as multiple-            grades such as in
  choice, the most
  commonly used                essay items, the most
  approaches are the:          common approach is:


   Kuder-Richardson                  Cronbach’s
      procedures                 coefficient alpha
Just a thought…
O The more times you taste from a pot of
  soup, the more accurate will be your
  estimate of what the ingredients are.
O One taste might fool you, twenty tastes
  will give you a much better idea of what’s
  in the pot.

O More items on a test will yield more
  reliable estimates of understanding.
Choose one…
Stability Reliability ≠ Alternate-Form Reliability ≠ Internal Consistency Reliability




   These three forms of reliability
  evidence should not be used
  together. Instead, keep in mind
  the kind of educational decision
  linked to the results of the
  assessment.
Standard Error of
           Measurement
O Supplies information of test consistency of an
  individual student by estimating their score
  consistency.
      O Problem: Teacher see’s student results from a
      pervious IQ test. They expect that the student will
                             perform on that exact level
O SEM helps to remind teachers that scores
  earned on commercial or classroom tests are
  not always exact on student’s understanding.
O Another area where SEM is used is in
  reporting of student performance on state
  tests.
60 item Math test

                               O Jack answers 53
Classificatio   # correct
n
                                out of the 60
Advanced        54 – 60
                                questions correctly,
Proficient      44 – 53
                                so what would he
Basic           37 – 43
                                be classified as?
Below Basic     36 and below
                                     a proficient student.
Appointment Time!
Everyone find a classmate for your 1st appointment.
You have 3 minutes to decide whether the following is
stability, alternate-form, or internal consistency reliability.


Suppose you were trying to show that the items on
an educational assessment all measured the same
variable. What type of reliability evidence would
you use?


                          Internal consistency reliability
Appointment Time!
Everyone find a classmate for your 2nd appointment.
You have 3 minutes to decide whether the following is stability,
alternate-form, or internal consistency reliability.


A new nationally standardized reading test for
junior high students has been created, but the test
developers want to make sure that the three
different forms of tests are performing in essentially
the same way. What sort of reliability evidence
should they rely on?


                         Alternate-form reliability
Why do you need to know?

O You may be called on         O Understanding
  to explain to parents         standard error of
  the meaning of their          measurement will
  child’s test scores.          be helpful when you
                                need to explain to
O At least be                   parents and
  knowledgeable about           students how to
  the meaning of                make sense of their
  reliability in educational    scores on
  assessments.                  accountability tests.
Check your understanding..
O Do you understand assessment reliability?


O Do you know the 3 types of reliability
  evidence?

O Can you explain the importance of standard
  error of measurement?
Chapter 3 teachback

Weitere ähnliche Inhalte

Was ist angesagt? (19)

Item analysis
Item analysisItem analysis
Item analysis
 
254457273-LET-Review-Assessment-of-Learning-Test-Items.pptx
254457273-LET-Review-Assessment-of-Learning-Test-Items.pptx254457273-LET-Review-Assessment-of-Learning-Test-Items.pptx
254457273-LET-Review-Assessment-of-Learning-Test-Items.pptx
 
Analyzing and using test item data
Analyzing and using test item dataAnalyzing and using test item data
Analyzing and using test item data
 
Improving the test items
Improving the test itemsImproving the test items
Improving the test items
 
Item
ItemItem
Item
 
Administering, Analyzing, and Improving the Test or Assessment
Administering, Analyzing, and Improving the Test or AssessmentAdministering, Analyzing, and Improving the Test or Assessment
Administering, Analyzing, and Improving the Test or Assessment
 
Item analysis
Item analysis Item analysis
Item analysis
 
ITEM ANALYSIS
ITEM ANALYSISITEM ANALYSIS
ITEM ANALYSIS
 
MCQ test item analysis
MCQ test item analysisMCQ test item analysis
MCQ test item analysis
 
Item analysis
Item analysisItem analysis
Item analysis
 
Item analysis report
Item analysis report Item analysis report
Item analysis report
 
Item Analysis
Item AnalysisItem Analysis
Item Analysis
 
Item analysis and validation
Item analysis and validation Item analysis and validation
Item analysis and validation
 
Test item analysis
Test item analysisTest item analysis
Test item analysis
 
Item analysis
Item analysisItem analysis
Item analysis
 
Item analysis
Item analysisItem analysis
Item analysis
 
Item analysis
Item analysisItem analysis
Item analysis
 
Item Analysis - Discrimination and Difficulty Index
Item Analysis - Discrimination and Difficulty IndexItem Analysis - Discrimination and Difficulty Index
Item Analysis - Discrimination and Difficulty Index
 
Item Analysis
Item AnalysisItem Analysis
Item Analysis
 

Andere mochten auch (9)

Dimkov
DimkovDimkov
Dimkov
 
Introducion
IntroducionIntroducion
Introducion
 
Platos típicos Italianos
Platos típicos ItalianosPlatos típicos Italianos
Platos típicos Italianos
 
Sigpro
SigproSigpro
Sigpro
 
ceramicas alfagres
ceramicas alfagresceramicas alfagres
ceramicas alfagres
 
Latih1
Latih1Latih1
Latih1
 
26249464 ekosistem-rawa
26249464 ekosistem-rawa26249464 ekosistem-rawa
26249464 ekosistem-rawa
 
Rtlws2013
Rtlws2013Rtlws2013
Rtlws2013
 
през0710
през0710през0710
през0710
 

Ähnlich wie Chapter 3 teachback

CRITERIA AND VALIDITY PRESENTATIONS.pptx
CRITERIA AND VALIDITY PRESENTATIONS.pptxCRITERIA AND VALIDITY PRESENTATIONS.pptx
CRITERIA AND VALIDITY PRESENTATIONS.pptxMammeNkwelle
 
Six steps for avoiding misinterpretations
Six steps for avoiding misinterpretationsSix steps for avoiding misinterpretations
Six steps for avoiding misinterpretationsAbdul Majid
 
Testing in language programs (chapter 8)
Testing in language programs (chapter 8)Testing in language programs (chapter 8)
Testing in language programs (chapter 8)Tahere Bakhshi
 
Reliability and validity of Research Data
Reliability and validity of Research DataReliability and validity of Research Data
Reliability and validity of Research DataAida Arifin
 
With-Hershey-Marie-Abarri-4hshzjzhzhzhzhzhz.pptx
With-Hershey-Marie-Abarri-4hshzjzhzhzhzhzhz.pptxWith-Hershey-Marie-Abarri-4hshzjzhzhzhzhzhz.pptx
With-Hershey-Marie-Abarri-4hshzjzhzhzhzhzhz.pptxJunrivRivera
 
Lesson 21 designing the questionaire and establishing validity and reliabilty
Lesson 21 designing the questionaire and establishing validity and reliabiltyLesson 21 designing the questionaire and establishing validity and reliabilty
Lesson 21 designing the questionaire and establishing validity and reliabiltymjlobetos
 
Field Study 5 Episode 3
Field Study 5 Episode 3Field Study 5 Episode 3
Field Study 5 Episode 3Jundel Deliman
 
Relaibility in Assessment
Relaibility in AssessmentRelaibility in Assessment
Relaibility in AssessmentZoya Noor
 
How to make tests more reliable
How to make tests more reliableHow to make tests more reliable
How to make tests more reliableNawaphat Deelert
 
What makes a good testA test is considered good” if the .docx
What makes a good testA test is considered good” if the .docxWhat makes a good testA test is considered good” if the .docx
What makes a good testA test is considered good” if the .docxmecklenburgstrelitzh
 
Faith & ReasonFaith is not opposed to reason, but is sometime.docx
Faith & ReasonFaith is not opposed to reason, but is sometime.docxFaith & ReasonFaith is not opposed to reason, but is sometime.docx
Faith & ReasonFaith is not opposed to reason, but is sometime.docxmecklenburgstrelitzh
 

Ähnlich wie Chapter 3 teachback (20)

Reliability and validity
Reliability and validityReliability and validity
Reliability and validity
 
CRITERIA AND VALIDITY PRESENTATIONS.pptx
CRITERIA AND VALIDITY PRESENTATIONS.pptxCRITERIA AND VALIDITY PRESENTATIONS.pptx
CRITERIA AND VALIDITY PRESENTATIONS.pptx
 
Six steps for avoiding misinterpretations
Six steps for avoiding misinterpretationsSix steps for avoiding misinterpretations
Six steps for avoiding misinterpretations
 
Chapter 6: Validity
Chapter 6: ValidityChapter 6: Validity
Chapter 6: Validity
 
Testing in language programs (chapter 8)
Testing in language programs (chapter 8)Testing in language programs (chapter 8)
Testing in language programs (chapter 8)
 
Validity and reliability
Validity and reliabilityValidity and reliability
Validity and reliability
 
Reliability and validity of Research Data
Reliability and validity of Research DataReliability and validity of Research Data
Reliability and validity of Research Data
 
Assessment
AssessmentAssessment
Assessment
 
Assessment
AssessmentAssessment
Assessment
 
Norm tets
Norm tetsNorm tets
Norm tets
 
Wj definitions
Wj definitionsWj definitions
Wj definitions
 
With-Hershey-Marie-Abarri-4hshzjzhzhzhzhzhz.pptx
With-Hershey-Marie-Abarri-4hshzjzhzhzhzhzhz.pptxWith-Hershey-Marie-Abarri-4hshzjzhzhzhzhzhz.pptx
With-Hershey-Marie-Abarri-4hshzjzhzhzhzhzhz.pptx
 
Lesson 21 designing the questionaire and establishing validity and reliabilty
Lesson 21 designing the questionaire and establishing validity and reliabiltyLesson 21 designing the questionaire and establishing validity and reliabilty
Lesson 21 designing the questionaire and establishing validity and reliabilty
 
Field Study 5 Episode 3
Field Study 5 Episode 3Field Study 5 Episode 3
Field Study 5 Episode 3
 
Relaibility in Assessment
Relaibility in AssessmentRelaibility in Assessment
Relaibility in Assessment
 
Chapter 6: Validity
Chapter 6: ValidityChapter 6: Validity
Chapter 6: Validity
 
Assessment
AssessmentAssessment
Assessment
 
How to make tests more reliable
How to make tests more reliableHow to make tests more reliable
How to make tests more reliable
 
What makes a good testA test is considered good” if the .docx
What makes a good testA test is considered good” if the .docxWhat makes a good testA test is considered good” if the .docx
What makes a good testA test is considered good” if the .docx
 
Faith & ReasonFaith is not opposed to reason, but is sometime.docx
Faith & ReasonFaith is not opposed to reason, but is sometime.docxFaith & ReasonFaith is not opposed to reason, but is sometime.docx
Faith & ReasonFaith is not opposed to reason, but is sometime.docx
 

Kürzlich hochgeladen

“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptxPoojaSen20
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 

Kürzlich hochgeladen (20)

“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 

Chapter 3 teachback

  • 1. By: Lauren Chadwick ED 450 Monday September 9, 2012
  • 2. Bellwork O Please use a pen or pencil and draw 2 appointment slots Once you are finished on the plate I gave open the folded paper you when you and fill in the blank entered the with the type of word classroom: written on the outside. 1 2 Put paper in box.
  • 3. SLO’s O Students will … O Understand reliability in assessment. O Describe the 3 types of reliability evidence. O Explain the importance of standard error of measurement.
  • 4.
  • 5. Let’s have some fun! O Everyone get up and stand in a circle!
  • 6.
  • 7. Reliability = Consistency O Reliability is an O 2 important ways important criteria reliability may be useful: used when judging O How reliable are standardized tests. your assessments for you and your students? O Parents may ask how reliable standardized tests may be.
  • 8. 3 Types of Reliability Evidence O Stability O Alternate Form O Internal Consistency
  • 9. Stability Reliability O Consistency of test O Example: Midterm on results over time Tuesday O The goal is to see O A masked man grabs your briefcase close scores in containing the students’ tests! O The next day you tell the students tests administered that their tests were stolen and that even if they are they need to re-take the test! given on different O Because there has been no new occasions. information taught since the last test, you expect that the test scores would be similar. O So, if you were able to see that the scores were not similar, then your exam would be judged to have little stability reliability.
  • 10. Test-Retest O To see how stable an assessment’s results are over time, we usually test students, then in a week or two retest with the same form of assessment. O It is important though that there is no significant event taking place in between the two assessments. Example: field trip, movie, speaker- exposing student to test-relevant information.
  • 11. So, how are scores computed? Correlation Coefficient (r) So….. O Reflects the degree of O …if the test–retest r similarity between for two sets of test scores on two tests/ scores were .84 this O Ranges from: would show that the +1.0 to -1.0 students’ performance on both tests were similar. (close to 1.0 = strong relationship) (closer to 0 = weak relationship) O If the r was instead .23 this would show that the results were not as similar.
  • 12. Classification- Consistency O To decide which students Student Test 1 Test 2 would be exempted from further assessment. Jill 84% 99% O Example: If you decide on Jack 65% 82% an 80% answers correct required for exemption, then on a test- retest approach, the teacher would find the % of students who scored ≥ 80% on both assessments.
  • 13. How to determine Classification-Consistency A % of students exempt 42% B % of students not exempt 46% C % of students different classification 12% D % of classification – consistency ( A + B) ----------------------------------------------------------------------- ----------------------- 88% Classification- Consistency of that test
  • 14. Alternate-Form Reliability O Simply put: O Usually found in settings where a student is testing to O Providing an receive his/her diploma. equivalent retest O BUT keep in mind when you are reviewing a for students in state-developed test another form other which claims to have than the first and equivalent forms, that failed assessment. those tests assess the same information as the original.
  • 15. Internal Consistency Reliability O Reflects the degree O Example: of consistency of the items on a given 20 item test on problem test. solving: O Homogeneous Students who are strong O Each of the test’s problem solvers should get most of the 20 items items should be correct assessing the same variable.
  • 16. FYI O Tests containing items O Tests containing items which a students can where students can either be right or wrong be given different such as multiple- grades such as in choice, the most commonly used essay items, the most approaches are the: common approach is: Kuder-Richardson Cronbach’s procedures coefficient alpha
  • 17. Just a thought… O The more times you taste from a pot of soup, the more accurate will be your estimate of what the ingredients are. O One taste might fool you, twenty tastes will give you a much better idea of what’s in the pot. O More items on a test will yield more reliable estimates of understanding.
  • 18. Choose one… Stability Reliability ≠ Alternate-Form Reliability ≠ Internal Consistency Reliability These three forms of reliability evidence should not be used together. Instead, keep in mind the kind of educational decision linked to the results of the assessment.
  • 19. Standard Error of Measurement O Supplies information of test consistency of an individual student by estimating their score consistency. O Problem: Teacher see’s student results from a pervious IQ test. They expect that the student will perform on that exact level O SEM helps to remind teachers that scores earned on commercial or classroom tests are not always exact on student’s understanding. O Another area where SEM is used is in reporting of student performance on state tests.
  • 20. 60 item Math test O Jack answers 53 Classificatio # correct n out of the 60 Advanced 54 – 60 questions correctly, Proficient 44 – 53 so what would he Basic 37 – 43 be classified as? Below Basic 36 and below a proficient student.
  • 21. Appointment Time! Everyone find a classmate for your 1st appointment. You have 3 minutes to decide whether the following is stability, alternate-form, or internal consistency reliability. Suppose you were trying to show that the items on an educational assessment all measured the same variable. What type of reliability evidence would you use? Internal consistency reliability
  • 22. Appointment Time! Everyone find a classmate for your 2nd appointment. You have 3 minutes to decide whether the following is stability, alternate-form, or internal consistency reliability. A new nationally standardized reading test for junior high students has been created, but the test developers want to make sure that the three different forms of tests are performing in essentially the same way. What sort of reliability evidence should they rely on? Alternate-form reliability
  • 23. Why do you need to know? O You may be called on O Understanding to explain to parents standard error of the meaning of their measurement will child’s test scores. be helpful when you need to explain to O At least be parents and knowledgeable about students how to the meaning of make sense of their reliability in educational scores on assessments. accountability tests.
  • 24. Check your understanding.. O Do you understand assessment reliability? O Do you know the 3 types of reliability evidence? O Can you explain the importance of standard error of measurement?