SlideShare ist ein Scribd-Unternehmen logo
1 von 35
Downloaden Sie, um offline zu lesen
TEST DEVELOPMENT AND
EVALUATION (6462)
CLASSROOM TESTING AND HIGH-STAKE TESTING
Department of Secondary Teacher Education
ALLAMA IQBAL OPEN UNIVERSITY, ISLAMABAD
OBJECTIVES OF THE UNIT
After studying this unit, the students will have ability to demonstrate.
1. understand the concept of class room testing and its techniques
2. understand the need and scope of high stake testing
3. differentiate between teacher made tests/classroom tests/low stake tests and
standardized/high stake tests
4. enumerate advantages and limitations of the low stake and high stake tests
5. prepare tests using Bloom’s Taxonomy and SOLO Taxonomy
6. elaborate the procedure for test development
7. provide examples of standardized tests with characteristics with examples.
8. enlist few trends in high stake testing
3.1 CONCEPT OF CLASSROOM TESTING AND ITS TECHNIQUES
Classroom assessment is the process, usually conducted by teachers, of designing, collecting,
interpreting and applying information about student learning and attainment to make educational
decisions. There are four interrelated steps to the classroom assessment process.
 The first step is to define the purposes for the information. During this period, the teacher
considers how the information will be used and how the assessment fits in the students'
educational program.
 The next step in the assessment process is to measure student learning or attainment.
Measurement involves using tests, surveys, observation or interviews to produce either numeric
or verbal descriptions of the degree to which a student has achieved academic goals.
 The third step is to evaluate the measurement data, which entails making judgments about the
information. During this stage, the teacher interprets the measurement data to determine if
students have certain strengths or limitations or whether the student has sufficiently attained the
learning goals.
 In the last stage, the teacher applies the interpretations to fulfill the aims of assessment that
were defined in first stage. The teacher uses the data to guide instruction, render grades, or help
students with any particular learning deficiencies or barriers.
3.2 HIGH STAKE TESTING: ITS NATURE, NEED AND SCOPE
 High-stakes testing has consequences attached to the results. For example, highstakes tests
can be used to determine students’ promotion from grade to grade or graduation from high
school (Resnick, 2004; Cizek, 2001).
 The use and misuse of high-stakes tests are a controversial topic in public education, in
advanced countries and even in Pakistan as they are used not only to assess students but in
attempts to increase teacher accountability also.
Precisely we can say that a high-stakes test is a test that:
o is a single, defined assessment,
o has a clear line drawn between those who pass and those who fail, and
o has direct consequences for passing or failing (something "at stake").
• What is Need of High Stake Testing?
• What is Nature of the High Stake Testing?
Teacher made vs
Standardized test
EduTainment
EduTainment
Teacher made vs
Standardized test
Teacher made vs Standardized test
EduTainment
Differences Between Standard And Teachers Made Tests
EduTainment
EduTainment
Differences Between Standard And Teachers Made Tests
Differences Between Standard And Teachers Made Tests
3.5.2 Advantage and Disadvantage of High Stake Testing
 It holds teachers accountable for ensuring that all students learn what they are expected to learn.
 Motivates students to work harder, learn more, and take the tests more seriously, which can promote higher
student achievement.
 Establishes high expectations for both educators and students, which can help reverse the cycles of low
educational expectations, achievement, and attainment that have historically disadvantaged some student
groups, particularly students of color, and that have characterized some schools in poorer communities or
more troubled urban areas.
 Reveals areas of educational need that can be targeted for reform and improvement, such as programs for
students who may be underperforming academically or being underserved by schools.
 Provides easily understandable information about school and student performance in the form of numerical
test scores that reformers, educational leaders, elected officials and policy makers can use to develop new
laws, regulations, and school-improvement strategies.
 Gives parents, employers, colleges and others more confidence that students are learning at a high level or
that high school graduates have acquired the skills they will need to succeed in adulthood.
Disadvantage of High-Stakes Testing
 It forces educators to “teach to the test”—
 It promotes a more “narrow” academic program in schools—
 It may contribute to higher, or even much higher, rates of cheating—
 It has been correlated in some research studies to increase failure rates,
lower graduation rates, and higher dropout rates—
 May diminish the overall quality of teaching and learning—
 Exacerbates negative stereotypes about the intelligence and academic
ability of minority students—
3.6 CONCEPT OF USE OF TAXONOMIES IN TEST
DEVELOPMENT
Using Bloom’s Taxonomy in Test Development
Using SOLO Taxonomy in Test Development
Bloom’s Taxonomy (1956) question samples:
•Knowledge: How many…? Who was it that…? Can you name the…?
•Comprehension: Can you write in your own words…? Can you write a brief outline…? What do you
think could have happened next…?
•Application: Choose the best statements that apply Judge the effects of… What would result …?
•Analysis: Which events could have happened…? If … happened, how might the ending have been
different? How was this similar to…?
•Synthesis: Can you design a … to achieve …? Write a poem, song or creative presentation about…?
Can you see a possible solution to…?
•Evaluation: What criteria would you use to assess…? What data was used to evaluate…? How could
you verify…?
SOLO Taxonomy
 SOLO taxonomy was developed by Biggs and Collis (1982) Stands for Structure of Observed Learning Outcomes
3.7 PROCEDURE OR STEPS FOR A STANDARDIZED TEST
DEVELOPMENT PROCESS
Pilot
Forms, Scoring and Analysis
Development
Review
Purpose
Specifications
3.8 EXAMPLES OF STANDARDIZED TESTS WITH
CHARACTERISTICS
The Standardized tests can be classified as per their functions are
• Group and Individual Tests
• Norm-referenced
• Achievement Tests
• Criterion-referenced
• Aptitude
• Personality
• Projective
• Interest Inventories
• Intelligence tests
Reliability refers to the consistency of scores
obtained by the same individuals when re-
examined with test on different occasions, or
with different sets of equivalent items.
Reliability
Typesof Reliability
Inter-rater reliability by considering the similarity of the scores
awarded by the two observers.
Inter-Rater or Inter-ObserverReliability
⚫ It is used to judge the consistency of
results across items on the same test.
⚫ We estimate test-retest reliability when
we administer the same test to the same
sample on two different occasions.
⚫ The amount of time allowed between
measures is critical.
⚫ The shorter the time gap, the higher the
correlation; the longer the time gap, the
lower the correlation.
Test-RetestReliability
⚫ In split-half reliability we randomly divide all items that claim to
measure the same contents into two sets.
⚫ The split-half reliability estimate is simply the correlation between two
total scores.
Split-Half Reliability
⚫ In parallel form reliability we have to create two different tests from
the same contents to measure the same learning outcomes.
⚫ The correlation between the two parallel forms is the estimate of
reliability.
Parallel-FormReliability
● It is the degree to which items on an instrument are consistent among
themselves and with the instrument as a whole.
Internal ConsistencyReliability
Validity
 The validity of an assessment tool is the degree to which it measures
for what it is designed to measure.
 The concept refers to the appropriateness, meaningfulness, and
usefulness of the specific inferences made from test scores.
Methods of Measuring Validity
1 2
3
4 5
Content Validity
 Content validity evidence involves the degree to which the content of the test
matches a content domain associated with the construct.
 Items in a test appear to cover whole domain.
Face validity
It is an estimate of
whether a test appears
to measure a certain
criterion. It is
appearance of test.
Construct Validity
 Construct is the concept or the characteristic that a test is designed to measure.
 According to Howell (1992) Construct validity is a test’s ability to measure
factors which are relevant to the field of study.
Convergent
Convergent validity
refers to the degree to
which a measure is
correlated with other
measures.
Criterion Validity
 Criterion validity evidence involves
the correlation between the test and a
criterion variable (or variables) taken
as representative of the construct.
 It compares the test with other
measures or outcomes (the criteria)
already held to be valid.
Concurrent Validity
 Concurrent validity refers to the degree to which the scores taken at one point
correlates with other measures (test, observation or interview) of the same
construct that is measured at the same time.
Predictive Validity
 Predictive validity assures how well the
test predicts some future behaviour of the
examinee.
 If higher scores on the Boards Exams are
positively correlated with higher
G.P.A.’s in the Universities and vice
versa, then the Board exams is said to
have predictive validity.
Factors Affecting Validity
 Instructions to Take A Test
 Difficult Language Structure
 Inappropriate Level of Difficulty
 Poorly Constructed Test Items
 Ambiguity in Items Statements
 Length of the Test
 Improper Arrangement of Items
 Identifiable Pattern of Answers
Relationship between Validity and Reliability
 Reliability is a necessary requirement for validity
 Establishing good reliability is only the first part of establishing validity
 Reliability is necessary but not sufficient for validity.
3.8.3 Usability of Tests
 Usability testing refers to evaluating a product or service by testing it with
representative users. Typically, during a test, participants will try to complete
typical tasks while observers watch, listen and takes notes. You should also
select tests based on how easy the test is to use. In addition to reliability and
validity, you need to think about how much time you have to create a test, grade
it and administer it. You need to think about how you will interpret and use the
scores from the tests. And you need to check to make sure the test questions and
directions are written clearly, the test itself is short enough not to overwhelm the
students, the questions don't includes stereotypes or personal biases, and that
they are interesting and make the students think.
Department of Secondary Teacher Education
ALLAMA IQBAL OPEN UNIVERSITY, ISLAMABAD
Dr. Hina Jalal
hinansari23@gmail.com

Weitere ähnliche Inhalte

Was ist angesagt?

Continuous and comprehensive evaluation (cce)
Continuous and comprehensive evaluation (cce)Continuous and comprehensive evaluation (cce)
Continuous and comprehensive evaluation (cce)Waheeda Bushra
 
Ieva Stupans 2008
Ieva Stupans 2008Ieva Stupans 2008
Ieva Stupans 2008Diana Quinn
 
teaching material
teaching material teaching material
teaching material Kadek Astiti
 
Principles of student assessment in medical education 2017 SATYA
Principles of student assessment in medical education  2017 SATYA Principles of student assessment in medical education  2017 SATYA
Principles of student assessment in medical education 2017 SATYA sathyanarayanan varadarajan
 
Assessment: Achieving improved efficiency, effectiveness, educational integri...
Assessment:Achieving improved efficiency, effectiveness, educational integri...Assessment:Achieving improved efficiency, effectiveness, educational integri...
Assessment: Achieving improved efficiency, effectiveness, educational integri...Diana Quinn
 
Classroom assessment, cce, achievement test, dignostic test
Classroom assessment, cce, achievement test, dignostic testClassroom assessment, cce, achievement test, dignostic test
Classroom assessment, cce, achievement test, dignostic testsajeena81
 
Alternate mode of examination
Alternate  mode of examinationAlternate  mode of examination
Alternate mode of examinationgoggigupta
 
Continuous assessment as a relevant tool to quality products of learners in e...
Continuous assessment as a relevant tool to quality products of learners in e...Continuous assessment as a relevant tool to quality products of learners in e...
Continuous assessment as a relevant tool to quality products of learners in e...William Kapambwe
 
Tools n techniques of evaluation
Tools n techniques of evaluationTools n techniques of evaluation
Tools n techniques of evaluationjagannath Dange
 
Standardized Testing
Standardized TestingStandardized Testing
Standardized TestingMiss EAP
 
Ethics in assessment
Ethics in assessmentEthics in assessment
Ethics in assessmentSAIT
 
Classroom testing: Using tests to promote learning
Classroom testing: Using tests to promote learningClassroom testing: Using tests to promote learning
Classroom testing: Using tests to promote learningRichard P Phelps
 
Understanding the concept of continuous and comprehensive evauation.
Understanding the concept of continuous and comprehensive evauation.Understanding the concept of continuous and comprehensive evauation.
Understanding the concept of continuous and comprehensive evauation.Sarvodaya Kanya Vidhyalaya
 
The Use of Formative Assessment in Legal Education
The Use of Formative Assessment in Legal EducationThe Use of Formative Assessment in Legal Education
The Use of Formative Assessment in Legal EducationExamSoft
 
The concept continuous assessment record and its importance
The concept continuous assessment record and its importanceThe concept continuous assessment record and its importance
The concept continuous assessment record and its importanceVICTOR ESAU
 

Was ist angesagt? (20)

Continuous and comprehensive evaluation (cce)
Continuous and comprehensive evaluation (cce)Continuous and comprehensive evaluation (cce)
Continuous and comprehensive evaluation (cce)
 
Ieva Stupans 2008
Ieva Stupans 2008Ieva Stupans 2008
Ieva Stupans 2008
 
Open book examination
Open book examinationOpen book examination
Open book examination
 
teaching material
teaching material teaching material
teaching material
 
Evaluation
Evaluation Evaluation
Evaluation
 
Assessment Trends In Higher Education
Assessment Trends In Higher EducationAssessment Trends In Higher Education
Assessment Trends In Higher Education
 
Continuous Assessment
Continuous AssessmentContinuous Assessment
Continuous Assessment
 
Principles of student assessment in medical education 2017 SATYA
Principles of student assessment in medical education  2017 SATYA Principles of student assessment in medical education  2017 SATYA
Principles of student assessment in medical education 2017 SATYA
 
Assessment: Achieving improved efficiency, effectiveness, educational integri...
Assessment:Achieving improved efficiency, effectiveness, educational integri...Assessment:Achieving improved efficiency, effectiveness, educational integri...
Assessment: Achieving improved efficiency, effectiveness, educational integri...
 
Classroom assessment, cce, achievement test, dignostic test
Classroom assessment, cce, achievement test, dignostic testClassroom assessment, cce, achievement test, dignostic test
Classroom assessment, cce, achievement test, dignostic test
 
Alternate mode of examination
Alternate  mode of examinationAlternate  mode of examination
Alternate mode of examination
 
Continuous assessment as a relevant tool to quality products of learners in e...
Continuous assessment as a relevant tool to quality products of learners in e...Continuous assessment as a relevant tool to quality products of learners in e...
Continuous assessment as a relevant tool to quality products of learners in e...
 
Tools n techniques of evaluation
Tools n techniques of evaluationTools n techniques of evaluation
Tools n techniques of evaluation
 
Standardized Testing
Standardized TestingStandardized Testing
Standardized Testing
 
Ethics in assessment
Ethics in assessmentEthics in assessment
Ethics in assessment
 
ETHICS IN ASESSMENT
ETHICS IN ASESSMENTETHICS IN ASESSMENT
ETHICS IN ASESSMENT
 
Classroom testing: Using tests to promote learning
Classroom testing: Using tests to promote learningClassroom testing: Using tests to promote learning
Classroom testing: Using tests to promote learning
 
Understanding the concept of continuous and comprehensive evauation.
Understanding the concept of continuous and comprehensive evauation.Understanding the concept of continuous and comprehensive evauation.
Understanding the concept of continuous and comprehensive evauation.
 
The Use of Formative Assessment in Legal Education
The Use of Formative Assessment in Legal EducationThe Use of Formative Assessment in Legal Education
The Use of Formative Assessment in Legal Education
 
The concept continuous assessment record and its importance
The concept continuous assessment record and its importanceThe concept continuous assessment record and its importance
The concept continuous assessment record and its importance
 

Ähnlich wie TEST DEVELOPMENT AND EVALUATION (6462)

LESSON 6 JBF 361.pptx
LESSON 6 JBF 361.pptxLESSON 6 JBF 361.pptx
LESSON 6 JBF 361.pptxAdnanIssah
 
test construction in mathematics
test construction in mathematicstest construction in mathematics
test construction in mathematicsAlokBhutia
 
Standardized and non standardized tests
Standardized and non standardized testsStandardized and non standardized tests
Standardized and non standardized testsshaziazamir1
 
Standardized testing
Standardized testingStandardized testing
Standardized testingElLa Bee
 
construction and administration of unit test in science subject
construction and administration of unit test in science subjectconstruction and administration of unit test in science subject
construction and administration of unit test in science subjectAlokBhutia
 
B 190313162555
B 190313162555B 190313162555
B 190313162555pawanbais1
 
Construction of Tests
Construction of TestsConstruction of Tests
Construction of TestsDakshta1
 
Standardized and non standardized tests
Standardized and non standardized testsStandardized and non standardized tests
Standardized and non standardized testsvinoli_sg
 
testing and evaluation
testing and evaluation testing and evaluation
testing and evaluation AqsaSuleman1
 
Educational Assessment and Evaluation
Educational Assessment and Evaluation Educational Assessment and Evaluation
Educational Assessment and Evaluation HennaAnsari
 
Principles of language assessment.pptx
Principles of language assessment.pptxPrinciples of language assessment.pptx
Principles of language assessment.pptxNOELIAANALIPROAOTROY1
 
ASSESSMENT AND EVALUATION IN EDUCATION
ASSESSMENT AND EVALUATION IN EDUCATIONASSESSMENT AND EVALUATION IN EDUCATION
ASSESSMENT AND EVALUATION IN EDUCATIONJustin Knight
 
Assessment and evaluation_in_education
Assessment and evaluation_in_educationAssessment and evaluation_in_education
Assessment and evaluation_in_educationEstherDonnyKimsiong
 
evalution-151228155502 (1).pptx evalution-151228155502 (1).pptx
evalution-151228155502 (1).pptx evalution-151228155502 (1).pptxevalution-151228155502 (1).pptx evalution-151228155502 (1).pptx
evalution-151228155502 (1).pptx evalution-151228155502 (1).pptxLoyalZohaibKhattak
 

Ähnlich wie TEST DEVELOPMENT AND EVALUATION (6462) (20)

LESSON 6 JBF 361.pptx
LESSON 6 JBF 361.pptxLESSON 6 JBF 361.pptx
LESSON 6 JBF 361.pptx
 
test construction in mathematics
test construction in mathematicstest construction in mathematics
test construction in mathematics
 
Standardized and non standardized tests (1)
Standardized and non standardized tests (1)Standardized and non standardized tests (1)
Standardized and non standardized tests (1)
 
Evaluation in education
Evaluation in educationEvaluation in education
Evaluation in education
 
Standardized and non standardized tests
Standardized and non standardized testsStandardized and non standardized tests
Standardized and non standardized tests
 
Standardized testing
Standardized testingStandardized testing
Standardized testing
 
construction and administration of unit test in science subject
construction and administration of unit test in science subjectconstruction and administration of unit test in science subject
construction and administration of unit test in science subject
 
B 190313162555
B 190313162555B 190313162555
B 190313162555
 
Construction of Tests
Construction of TestsConstruction of Tests
Construction of Tests
 
Standardized and non standardized tests
Standardized and non standardized testsStandardized and non standardized tests
Standardized and non standardized tests
 
testing and evaluation
testing and evaluation testing and evaluation
testing and evaluation
 
Educational Assessment and Evaluation
Educational Assessment and Evaluation Educational Assessment and Evaluation
Educational Assessment and Evaluation
 
Language assessment
Language assessmentLanguage assessment
Language assessment
 
Principles of language assessment.pptx
Principles of language assessment.pptxPrinciples of language assessment.pptx
Principles of language assessment.pptx
 
ASSESSMENT AND EVALUATION IN EDUCATION
ASSESSMENT AND EVALUATION IN EDUCATIONASSESSMENT AND EVALUATION IN EDUCATION
ASSESSMENT AND EVALUATION IN EDUCATION
 
Assessment and evaluation_in_education
Assessment and evaluation_in_educationAssessment and evaluation_in_education
Assessment and evaluation_in_education
 
Summative Assessment
Summative AssessmentSummative Assessment
Summative Assessment
 
Evalution
Evalution Evalution
Evalution
 
Bab 3
Bab 3 Bab 3
Bab 3
 
evalution-151228155502 (1).pptx evalution-151228155502 (1).pptx
evalution-151228155502 (1).pptx evalution-151228155502 (1).pptxevalution-151228155502 (1).pptx evalution-151228155502 (1).pptx
evalution-151228155502 (1).pptx evalution-151228155502 (1).pptx
 

Mehr von HennaAnsari

Organizational Identification of Millennial employees working remotely: Quali...
Organizational Identification of Millennial employees working remotely: Quali...Organizational Identification of Millennial employees working remotely: Quali...
Organizational Identification of Millennial employees working remotely: Quali...HennaAnsari
 
Customer satisfaction with hotel services: A case study of the Ikos Aria
Customer satisfaction with hotel services: A case study of the Ikos AriaCustomer satisfaction with hotel services: A case study of the Ikos Aria
Customer satisfaction with hotel services: A case study of the Ikos AriaHennaAnsari
 
Content analysis of customers generated reviews about their satisfaction and ...
Content analysis of customers generated reviews about their satisfaction and ...Content analysis of customers generated reviews about their satisfaction and ...
Content analysis of customers generated reviews about their satisfaction and ...HennaAnsari
 
An Analysis of Memes the way the contents of memes as they are presented on t...
An Analysis of Memes the way the contents of memes as they are presented on t...An Analysis of Memes the way the contents of memes as they are presented on t...
An Analysis of Memes the way the contents of memes as they are presented on t...HennaAnsari
 
Type and Category of Memes used on social media
Type and Category of Memes used on social media Type and Category of Memes used on social media
Type and Category of Memes used on social media HennaAnsari
 
Qualitative analysis/cluster analysis/NVivo analysis /content analysis Interp...
Qualitative analysis/cluster analysis/NVivo analysis /content analysis Interp...Qualitative analysis/cluster analysis/NVivo analysis /content analysis Interp...
Qualitative analysis/cluster analysis/NVivo analysis /content analysis Interp...HennaAnsari
 
How to interpret NVivo/Cluster analysis/ results
How to interpret NVivo/Cluster analysis/ results How to interpret NVivo/Cluster analysis/ results
How to interpret NVivo/Cluster analysis/ results HennaAnsari
 
TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)HennaAnsari
 
TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)HennaAnsari
 
TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)HennaAnsari
 
TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)HennaAnsari
 
TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)HennaAnsari
 
Factor analysis in AMOS
Factor analysis in AMOSFactor analysis in AMOS
Factor analysis in AMOSHennaAnsari
 
Variance mean and intercept in AMOS
Variance mean and intercept in AMOSVariance mean and intercept in AMOS
Variance mean and intercept in AMOSHennaAnsari
 
Linear regression AMOS (R-Square)
Linear regression AMOS (R-Square)Linear regression AMOS (R-Square)
Linear regression AMOS (R-Square)HennaAnsari
 
Correlation/Covariance in AMOS
Correlation/Covariance in AMOS Correlation/Covariance in AMOS
Correlation/Covariance in AMOS HennaAnsari
 
AMOS tutorial (Estimated Effect in AMOS)
AMOS tutorial (Estimated Effect in AMOS)AMOS tutorial (Estimated Effect in AMOS)
AMOS tutorial (Estimated Effect in AMOS)HennaAnsari
 
Scale of measurement
Scale of measurementScale of measurement
Scale of measurementHennaAnsari
 
Quality enhancement, teaching quality, and students perceived satisfaction: c...
Quality enhancement, teaching quality, and students perceived satisfaction: c...Quality enhancement, teaching quality, and students perceived satisfaction: c...
Quality enhancement, teaching quality, and students perceived satisfaction: c...HennaAnsari
 

Mehr von HennaAnsari (20)

Organizational Identification of Millennial employees working remotely: Quali...
Organizational Identification of Millennial employees working remotely: Quali...Organizational Identification of Millennial employees working remotely: Quali...
Organizational Identification of Millennial employees working remotely: Quali...
 
Customer satisfaction with hotel services: A case study of the Ikos Aria
Customer satisfaction with hotel services: A case study of the Ikos AriaCustomer satisfaction with hotel services: A case study of the Ikos Aria
Customer satisfaction with hotel services: A case study of the Ikos Aria
 
Content analysis of customers generated reviews about their satisfaction and ...
Content analysis of customers generated reviews about their satisfaction and ...Content analysis of customers generated reviews about their satisfaction and ...
Content analysis of customers generated reviews about their satisfaction and ...
 
An Analysis of Memes the way the contents of memes as they are presented on t...
An Analysis of Memes the way the contents of memes as they are presented on t...An Analysis of Memes the way the contents of memes as they are presented on t...
An Analysis of Memes the way the contents of memes as they are presented on t...
 
Type and Category of Memes used on social media
Type and Category of Memes used on social media Type and Category of Memes used on social media
Type and Category of Memes used on social media
 
Qualitative analysis/cluster analysis/NVivo analysis /content analysis Interp...
Qualitative analysis/cluster analysis/NVivo analysis /content analysis Interp...Qualitative analysis/cluster analysis/NVivo analysis /content analysis Interp...
Qualitative analysis/cluster analysis/NVivo analysis /content analysis Interp...
 
How to interpret NVivo/Cluster analysis/ results
How to interpret NVivo/Cluster analysis/ results How to interpret NVivo/Cluster analysis/ results
How to interpret NVivo/Cluster analysis/ results
 
Existantialism
ExistantialismExistantialism
Existantialism
 
TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)
 
TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)
 
TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)
 
TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)
 
TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)TEST DEVELOPMENT AND EVALUATION (6462)
TEST DEVELOPMENT AND EVALUATION (6462)
 
Factor analysis in AMOS
Factor analysis in AMOSFactor analysis in AMOS
Factor analysis in AMOS
 
Variance mean and intercept in AMOS
Variance mean and intercept in AMOSVariance mean and intercept in AMOS
Variance mean and intercept in AMOS
 
Linear regression AMOS (R-Square)
Linear regression AMOS (R-Square)Linear regression AMOS (R-Square)
Linear regression AMOS (R-Square)
 
Correlation/Covariance in AMOS
Correlation/Covariance in AMOS Correlation/Covariance in AMOS
Correlation/Covariance in AMOS
 
AMOS tutorial (Estimated Effect in AMOS)
AMOS tutorial (Estimated Effect in AMOS)AMOS tutorial (Estimated Effect in AMOS)
AMOS tutorial (Estimated Effect in AMOS)
 
Scale of measurement
Scale of measurementScale of measurement
Scale of measurement
 
Quality enhancement, teaching quality, and students perceived satisfaction: c...
Quality enhancement, teaching quality, and students perceived satisfaction: c...Quality enhancement, teaching quality, and students perceived satisfaction: c...
Quality enhancement, teaching quality, and students perceived satisfaction: c...
 

Kürzlich hochgeladen

Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...DhatriParmar
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFEPART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFEMISSRITIMABIOLOGYEXP
 
Sulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their usesSulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their usesVijayaLaxmi84
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...Nguyen Thanh Tu Collection
 
Shark introduction Morphology and its behaviour characteristics
Shark introduction Morphology and its behaviour characteristicsShark introduction Morphology and its behaviour characteristics
Shark introduction Morphology and its behaviour characteristicsArubSultan
 
ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6Vanessa Camilleri
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWQuiz Club NITW
 
DiskStorage_BasicFileStructuresandHashing.pdf
DiskStorage_BasicFileStructuresandHashing.pdfDiskStorage_BasicFileStructuresandHashing.pdf
DiskStorage_BasicFileStructuresandHashing.pdfChristalin Nelson
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfPrerana Jadhav
 
Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...
Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...
Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...Osopher
 
Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...
Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...
Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...DrVipulVKapoor
 
Employablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptxEmployablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptxryandux83rd
 
The Emergence of Legislative Behavior in the Colombian Congress
The Emergence of Legislative Behavior in the Colombian CongressThe Emergence of Legislative Behavior in the Colombian Congress
The Emergence of Legislative Behavior in the Colombian CongressMaria Paula Aroca
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQuiz Club NITW
 
Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research DiscourseAnita GoswamiGiri
 
Unit :1 Basics of Professional Intelligence
Unit :1 Basics of Professional IntelligenceUnit :1 Basics of Professional Intelligence
Unit :1 Basics of Professional IntelligenceDr Vijay Vishwakarma
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxSayali Powar
 

Kürzlich hochgeladen (20)

Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFEPART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
 
Sulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their usesSulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their uses
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
 
Shark introduction Morphology and its behaviour characteristics
Shark introduction Morphology and its behaviour characteristicsShark introduction Morphology and its behaviour characteristics
Shark introduction Morphology and its behaviour characteristics
 
ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITW
 
DiskStorage_BasicFileStructuresandHashing.pdf
DiskStorage_BasicFileStructuresandHashing.pdfDiskStorage_BasicFileStructuresandHashing.pdf
DiskStorage_BasicFileStructuresandHashing.pdf
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdf
 
Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...
Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...
Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...
 
Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...
Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...
Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...
 
prashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Professionprashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Profession
 
Employablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptxEmployablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptx
 
Plagiarism,forms,understand about plagiarism,avoid plagiarism,key significanc...
Plagiarism,forms,understand about plagiarism,avoid plagiarism,key significanc...Plagiarism,forms,understand about plagiarism,avoid plagiarism,key significanc...
Plagiarism,forms,understand about plagiarism,avoid plagiarism,key significanc...
 
The Emergence of Legislative Behavior in the Colombian Congress
The Emergence of Legislative Behavior in the Colombian CongressThe Emergence of Legislative Behavior in the Colombian Congress
The Emergence of Legislative Behavior in the Colombian Congress
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
 
Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research Discourse
 
Unit :1 Basics of Professional Intelligence
Unit :1 Basics of Professional IntelligenceUnit :1 Basics of Professional Intelligence
Unit :1 Basics of Professional Intelligence
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
 

TEST DEVELOPMENT AND EVALUATION (6462)

  • 1. TEST DEVELOPMENT AND EVALUATION (6462) CLASSROOM TESTING AND HIGH-STAKE TESTING Department of Secondary Teacher Education ALLAMA IQBAL OPEN UNIVERSITY, ISLAMABAD
  • 2. OBJECTIVES OF THE UNIT After studying this unit, the students will have ability to demonstrate. 1. understand the concept of class room testing and its techniques 2. understand the need and scope of high stake testing 3. differentiate between teacher made tests/classroom tests/low stake tests and standardized/high stake tests 4. enumerate advantages and limitations of the low stake and high stake tests 5. prepare tests using Bloom’s Taxonomy and SOLO Taxonomy 6. elaborate the procedure for test development 7. provide examples of standardized tests with characteristics with examples. 8. enlist few trends in high stake testing
  • 3. 3.1 CONCEPT OF CLASSROOM TESTING AND ITS TECHNIQUES Classroom assessment is the process, usually conducted by teachers, of designing, collecting, interpreting and applying information about student learning and attainment to make educational decisions. There are four interrelated steps to the classroom assessment process.  The first step is to define the purposes for the information. During this period, the teacher considers how the information will be used and how the assessment fits in the students' educational program.  The next step in the assessment process is to measure student learning or attainment. Measurement involves using tests, surveys, observation or interviews to produce either numeric or verbal descriptions of the degree to which a student has achieved academic goals.  The third step is to evaluate the measurement data, which entails making judgments about the information. During this stage, the teacher interprets the measurement data to determine if students have certain strengths or limitations or whether the student has sufficiently attained the learning goals.  In the last stage, the teacher applies the interpretations to fulfill the aims of assessment that were defined in first stage. The teacher uses the data to guide instruction, render grades, or help students with any particular learning deficiencies or barriers.
  • 4. 3.2 HIGH STAKE TESTING: ITS NATURE, NEED AND SCOPE  High-stakes testing has consequences attached to the results. For example, highstakes tests can be used to determine students’ promotion from grade to grade or graduation from high school (Resnick, 2004; Cizek, 2001).  The use and misuse of high-stakes tests are a controversial topic in public education, in advanced countries and even in Pakistan as they are used not only to assess students but in attempts to increase teacher accountability also. Precisely we can say that a high-stakes test is a test that: o is a single, defined assessment, o has a clear line drawn between those who pass and those who fail, and o has direct consequences for passing or failing (something "at stake"). • What is Need of High Stake Testing? • What is Nature of the High Stake Testing?
  • 5. Teacher made vs Standardized test EduTainment
  • 7. Teacher made vs Standardized test EduTainment
  • 8. Differences Between Standard And Teachers Made Tests EduTainment
  • 10. Differences Between Standard And Teachers Made Tests
  • 11. 3.5.2 Advantage and Disadvantage of High Stake Testing  It holds teachers accountable for ensuring that all students learn what they are expected to learn.  Motivates students to work harder, learn more, and take the tests more seriously, which can promote higher student achievement.  Establishes high expectations for both educators and students, which can help reverse the cycles of low educational expectations, achievement, and attainment that have historically disadvantaged some student groups, particularly students of color, and that have characterized some schools in poorer communities or more troubled urban areas.  Reveals areas of educational need that can be targeted for reform and improvement, such as programs for students who may be underperforming academically or being underserved by schools.  Provides easily understandable information about school and student performance in the form of numerical test scores that reformers, educational leaders, elected officials and policy makers can use to develop new laws, regulations, and school-improvement strategies.  Gives parents, employers, colleges and others more confidence that students are learning at a high level or that high school graduates have acquired the skills they will need to succeed in adulthood.
  • 12. Disadvantage of High-Stakes Testing  It forces educators to “teach to the test”—  It promotes a more “narrow” academic program in schools—  It may contribute to higher, or even much higher, rates of cheating—  It has been correlated in some research studies to increase failure rates, lower graduation rates, and higher dropout rates—  May diminish the overall quality of teaching and learning—  Exacerbates negative stereotypes about the intelligence and academic ability of minority students—
  • 13. 3.6 CONCEPT OF USE OF TAXONOMIES IN TEST DEVELOPMENT Using Bloom’s Taxonomy in Test Development Using SOLO Taxonomy in Test Development
  • 14. Bloom’s Taxonomy (1956) question samples: •Knowledge: How many…? Who was it that…? Can you name the…? •Comprehension: Can you write in your own words…? Can you write a brief outline…? What do you think could have happened next…? •Application: Choose the best statements that apply Judge the effects of… What would result …? •Analysis: Which events could have happened…? If … happened, how might the ending have been different? How was this similar to…? •Synthesis: Can you design a … to achieve …? Write a poem, song or creative presentation about…? Can you see a possible solution to…? •Evaluation: What criteria would you use to assess…? What data was used to evaluate…? How could you verify…?
  • 15. SOLO Taxonomy  SOLO taxonomy was developed by Biggs and Collis (1982) Stands for Structure of Observed Learning Outcomes
  • 16. 3.7 PROCEDURE OR STEPS FOR A STANDARDIZED TEST DEVELOPMENT PROCESS Pilot Forms, Scoring and Analysis Development Review Purpose Specifications
  • 17. 3.8 EXAMPLES OF STANDARDIZED TESTS WITH CHARACTERISTICS The Standardized tests can be classified as per their functions are • Group and Individual Tests • Norm-referenced • Achievement Tests • Criterion-referenced • Aptitude • Personality • Projective • Interest Inventories • Intelligence tests
  • 18. Reliability refers to the consistency of scores obtained by the same individuals when re- examined with test on different occasions, or with different sets of equivalent items. Reliability
  • 20. Inter-rater reliability by considering the similarity of the scores awarded by the two observers. Inter-Rater or Inter-ObserverReliability
  • 21. ⚫ It is used to judge the consistency of results across items on the same test. ⚫ We estimate test-retest reliability when we administer the same test to the same sample on two different occasions. ⚫ The amount of time allowed between measures is critical. ⚫ The shorter the time gap, the higher the correlation; the longer the time gap, the lower the correlation. Test-RetestReliability
  • 22. ⚫ In split-half reliability we randomly divide all items that claim to measure the same contents into two sets. ⚫ The split-half reliability estimate is simply the correlation between two total scores. Split-Half Reliability
  • 23. ⚫ In parallel form reliability we have to create two different tests from the same contents to measure the same learning outcomes. ⚫ The correlation between the two parallel forms is the estimate of reliability. Parallel-FormReliability
  • 24. ● It is the degree to which items on an instrument are consistent among themselves and with the instrument as a whole. Internal ConsistencyReliability
  • 25. Validity  The validity of an assessment tool is the degree to which it measures for what it is designed to measure.  The concept refers to the appropriateness, meaningfulness, and usefulness of the specific inferences made from test scores.
  • 26. Methods of Measuring Validity 1 2 3 4 5
  • 27. Content Validity  Content validity evidence involves the degree to which the content of the test matches a content domain associated with the construct.  Items in a test appear to cover whole domain. Face validity It is an estimate of whether a test appears to measure a certain criterion. It is appearance of test.
  • 28. Construct Validity  Construct is the concept or the characteristic that a test is designed to measure.  According to Howell (1992) Construct validity is a test’s ability to measure factors which are relevant to the field of study. Convergent Convergent validity refers to the degree to which a measure is correlated with other measures.
  • 29. Criterion Validity  Criterion validity evidence involves the correlation between the test and a criterion variable (or variables) taken as representative of the construct.  It compares the test with other measures or outcomes (the criteria) already held to be valid.
  • 30. Concurrent Validity  Concurrent validity refers to the degree to which the scores taken at one point correlates with other measures (test, observation or interview) of the same construct that is measured at the same time.
  • 31. Predictive Validity  Predictive validity assures how well the test predicts some future behaviour of the examinee.  If higher scores on the Boards Exams are positively correlated with higher G.P.A.’s in the Universities and vice versa, then the Board exams is said to have predictive validity.
  • 32. Factors Affecting Validity  Instructions to Take A Test  Difficult Language Structure  Inappropriate Level of Difficulty  Poorly Constructed Test Items  Ambiguity in Items Statements  Length of the Test  Improper Arrangement of Items  Identifiable Pattern of Answers
  • 33. Relationship between Validity and Reliability  Reliability is a necessary requirement for validity  Establishing good reliability is only the first part of establishing validity  Reliability is necessary but not sufficient for validity.
  • 34. 3.8.3 Usability of Tests  Usability testing refers to evaluating a product or service by testing it with representative users. Typically, during a test, participants will try to complete typical tasks while observers watch, listen and takes notes. You should also select tests based on how easy the test is to use. In addition to reliability and validity, you need to think about how much time you have to create a test, grade it and administer it. You need to think about how you will interpret and use the scores from the tests. And you need to check to make sure the test questions and directions are written clearly, the test itself is short enough not to overwhelm the students, the questions don't includes stereotypes or personal biases, and that they are interesting and make the students think.
  • 35. Department of Secondary Teacher Education ALLAMA IQBAL OPEN UNIVERSITY, ISLAMABAD Dr. Hina Jalal hinansari23@gmail.com