Summer School 2013
• In today’s language classrooms, the term
assessment usually evokes images of an end-of-course
paper-and-pencil test designed to tell both
teachers and students how much material the
student doesn’t know or hasn’t yet mastered.
• In fact, assessment includes a broad range of activities and
tasks that teachers use to evaluate students’
progress and growth on a daily basis.
To make the use of evaluation, assessment, and test procedures more
effective, it is necessary to clarify what these concepts are and to
explain how they differ from one another.
Evaluation is all-inclusive and is the widest basis for collecting information
in education.
It involves looking at all factors that influence the learning process:
syllabus, objectives, course design, and materials.
A test is a subcategory of assessment: a formal, systematic
procedure used to gather information about student progress.
Assessment is part of evaluation because it is concerned with
the student and with what the student does. It refers to
the variety of ways of collecting information on a learner’s
language ability or achievement.
The most common use of language tests is to
identify strengths and weaknesses in students’
abilities.
Information gleaned from tests also assists us in
deciding who should be allowed to participate
in a particular course or program area.
Another common use of tests is to provide
information about the effectiveness of programs
of instruction.
• Placement tests assess students’ level of language ability so they can be placed
in an appropriate course or class. This type of test indicates the level
at which a student will learn most effectively. The primary aim is to
create groups of learners that are homogeneous in level.
• Aptitude tests measure capacity or general ability to learn a foreign language
(although they are not commonly used these days).
• Diagnostic tests identify language areas in which a student needs further help. The
information gained from diagnostic tests is crucial for planning further
course activities and providing students with remediation.
• Progress tests measure the progress that students are making
toward defined course or program goals. Progress tests
are generally teacher-produced because they cover less
material and assess fewer objectives.
• Achievement tests are similar to progress tests. They are usually
administered at the mid-point and end-point of the semester or
academic year.
• Their content is generally based on the specific course content
or on the course objectives.
• Proficiency tests assess the overall language ability of students at
varying levels.
• They tell us how capable a person is in a particular
language skill area.
• Objective versus subjective tests: sometimes tests
are distinguished by the manner in which they are
scored. An objective test is scored by comparing a
student’s responses with an established set of
acceptable/correct responses on an answer key.
With objectively scored tests, the scorer
does not require particular knowledge or training in
the examined area.
• In contrast, a subjective test, such as writing an
essay, requires scoring by opinion or personal
judgment, so the human element is very important.
• Even experienced scorers need moderated training
sessions to ensure inter-rater reliability.
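One way to check inter-rater reliability after a moderated session is to compare two raters' scores for the same set of performances. The sketch below is only an illustration: the essay scores are invented, and the two statistics shown (simple percent agreement and Cohen's kappa) are common choices, not ones prescribed by these slides.

```python
from collections import Counter

def percent_agreement(rater_a, rater_b):
    """Share of performances on which both raters gave the same score."""
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return matches / len(rater_a)

def cohens_kappa(rater_a, rater_b):
    """Agreement between two raters, corrected for chance agreement."""
    n = len(rater_a)
    observed = percent_agreement(rater_a, rater_b)
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    # Chance agreement: probability that both raters pick the same band
    # if each simply followed their own overall score distribution.
    expected = sum(counts_a[band] * counts_b[band] for band in counts_a) / (n * n)
    if expected == 1:
        return 1.0  # both raters used a single identical band throughout
    return (observed - expected) / (1 - expected)

# Hypothetical band scores (1-4) given by two raters to the same eight essays.
rater_a = [3, 2, 4, 3, 1, 2, 3, 4]
rater_b = [3, 2, 3, 3, 1, 2, 4, 4]

agreement = percent_agreement(rater_a, rater_b)
kappa = cohens_kappa(rater_a, rater_b)
print(f"Percent agreement: {agreement:.2f}")
print(f"Cohen's kappa: {kappa:.2f}")
```

Kappa is lower than raw agreement because some matches would occur even if the raters scored at random; a low value after training suggests the scoring criteria need further discussion.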
Criterion-referenced tests versus standardized
tests
• Criterion-referenced tests are usually developed to
measure mastery of well-defined instructional objectives
specific to a particular course or program. Their
purpose is to measure how much learning has occurred.
Students’ performance is compared only to the amount or
percentage of material learned.
• Standardized tests are designed to measure global
language abilities. Students’ scores are interpreted
relative to all other students who take the exam. Their
purpose is to spread students out along a continuum of
scores so that those with low abilities in a certain skill
are at one end of the normal distribution and those with
high scores are at the other end, with the majority of
the students falling between the extremes.
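The two ways of interpreting a score can be contrasted with a small calculation. This is a minimal sketch with invented numbers: the function names and the cohort scores are hypothetical, and the percentile rank uses one common definition (the percentage of test takers scoring below the given score).

```python
def percent_mastered(items_correct, items_total):
    """Criterion-referenced view: how much of the material was learned."""
    return 100 * items_correct / items_total

def percentile_rank(score, all_scores):
    """Norm-referenced view: percentage of test takers scoring below `score`."""
    below = sum(s < score for s in all_scores)
    return 100 * below / len(all_scores)

# Hypothetical data: one student's result on a 40-item course test,
# and the scores of ten test takers on a standardized exam.
mastery = percent_mastered(items_correct=34, items_total=40)
cohort = [42, 55, 61, 61, 68, 70, 74, 79, 85, 93]
rank = percentile_rank(74, cohort)

print(f"Mastery of course objectives: {mastery:.0f}%")
print(f"Percentile rank in cohort: {rank:.0f}")
```

The first figure depends only on the student and the material; the second changes whenever the comparison group changes, even if the student's own score stays the same.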
Summative versus formative tests
• Tests or tasks administered at the end of the course to
determine whether students have achieved the objectives set
out in the curriculum are called summative assessments.
They are often used to decide which students move on to
a higher level.
• Formative assessments, however, are carried out with
the aim of using the results to improve instruction, so
they are given during the course and feedback is provided to
students.
High-stakes versus low-stakes tests
• High-stakes tests are those in which the results are
likely to have a major impact on the lives of large numbers
of individuals or on large programs.
• Low-stakes tests are those in which the results have
a relatively minor impact on the lives of individuals or
on small programs. In-class progress tests and short
quizzes are examples of low-stakes tests.
Practicality, reliability, validity, authenticity, washback
Test reliability
• Designed items
• Subjective test
• Rating
• Test item itself
Test administration reliability
• Conditions of administration (noise, light, temperature, desks, chairs)
Rater reliability
• Human error
• Subjectivity
• Bias toward “good” and “bad” students
• Inexperience
• Inattention
Student-related reliability
• Temporary illness, fatigue, anxiety (other physical, psychological factors)
Validity
A valid test:
- Measures exactly what it proposes to measure.
- Involves performance that samples the test’s criterion.
- Offers useful, meaningful information about a test-taker’s abilities.
- Is supported by an argument.
Criterion-related validity
Construct-related validity
Consequential validity (impact)
Content-related validity
Introduction
• Objectives include four distinct components:
Audience, Behavior, Condition, and Degree.
• Objectives must be both observable and measurable to be effective.
• Words like "understand" and "learn" are generally not acceptable
in written objectives because they are difficult to measure.
• Written objectives are a vital part of instructional design because they
provide the roadmap for designing and delivering curriculum.
• Throughout the design and development of curriculum, a comparison
of the content to be delivered should be made to the objectives
identified for the program. This process, called performance
agreement, ensures that the final product meets the overall goal of
instruction identified in the first level objectives.
Audience
- Describes the intended learner or end user of the instruction
- Often the audience is identified only in the first-level objective
to avoid redundancy
Behavior
- Describes learner capability
- Must be observable and measurable (you will define the measurement elsewhere in the
goal)
- If it is a skill, it should be a real-world skill
- The “behavior” can include demonstration of knowledge or skills in any of the domains
of learning: cognitive, psychomotor, affective, or interpersonal
Condition
- Equipment or tools that may (or may not) be utilized in completion
of the behavior
- Environmental conditions may also be included
Degree
- States the standard for acceptable performance
(time, accuracy, proportion, quality, etc.)
The common mistakes have been
grouped into four categories as
follows:
- General examination characteristics.
- Item characteristics.
- Test validity concerns.
- Administrative and scoring issues.
General examination characteristics
• Too difficult or too easy
• Insufficient number of items
• Redundancy of test type
• Lack of confidence measure
• Negative washback through non-occurrent forms
Item characteristics
• Tricky questions
• Redundant wording
• Divergence cues
• Convergence cues
• Option number
Test-validity concerns
• Mixed content
• Wrong medium
• Common knowledge
• Syllabus mismatch
• Content matching
Administrative and scoring issues
• Lack of cheating control
• Inadequate instruction
• Administrative inequities
• Lack of piloting
• Subjectivity of scoring
Traditional assessment
• Pencil-and-paper test
• Answer the question
• Choose or produce a
correct grammatical form
or vocabulary item
• Good for checking reading and
listening comprehension
ability
Alternative assessment
• Reveals what students can
do with language
• It is scored differently
• Students can evaluate their
own learning and learn from
the evaluation process
• Gives instructors a way to
connect assessment with
review of learning
strategies
- They are built around
topics of interest to the
students
- They replicate real-world
communication contexts and
situations
- They require students to
produce a quality product or
performance
- The evaluation criteria and
standards are known to the
student
- They involve multi-stage
tasks and real problems that
require creative use of
language rather than simple
repetition
- They involve interaction
between assessor and
person assessed
- They allow for self-evaluation
Rubrics provide a measurement of the quality of a
performance on the basis of established
criteria.
There are four main types of rubrics:
• Holistic rubrics
• Analytic rubrics
• Primary trait rubrics
• Multi-trait rubrics
In holistic evaluation, raters
make judgments by forming an overall impression of a performance and
matching it to the best fit from among the descriptions on the scale.
• They are often written generically
and can be used with many tasks.
• They emphasize what learners can
do, rather than what they cannot do.
• They save time by minimizing the
number of decisions raters must
make.
• Trained raters tend to apply them
consistently, resulting in more reliable
measurement.
• They are easily understood by
younger learners.
• They do not provide specific
feedback to test takers about the
strengths and weaknesses of their
performance.
• Performances may meet criteria in
two or more categories, making it
difficult to select the one best
description. (If this occurs
frequently, the rubric may be poorly
written.)
Analytic scales are
usually associated with generic rubrics and tend to focus on broad dimensions of
writing or speaking performance. These dimensions may be the same as those found
in a generic, holistic scale, but they are presented in separate categories and rated
individually. Points may be assigned for performance on each of the dimensions and a
total score calculated.
• They provide useful feedback
to learners on areas of
strength and weakness.
• Their dimensions can be
weighted to reflect relative
importance.
• They can show learners that
they have made progress
over time in some or all
dimensions when the same
rubric categories are used
repeatedly
• They take more time to
create and use.
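The weighted total described for analytic rubrics can be worked through as simple arithmetic. The sketch below is illustrative only: the dimension names, the 1-4 band scale, and the weights are assumptions, not values taken from these slides.

```python
def analytic_score(ratings, weights):
    """Weighted total of per-dimension ratings on an analytic rubric."""
    if set(ratings) != set(weights):
        raise ValueError("ratings and weights must cover the same dimensions")
    return sum(ratings[dim] * weights[dim] for dim in ratings)

# Hypothetical rubric: each dimension is rated 1-4, and content counts double.
MAX_BAND = 4
weights = {"content": 2, "organization": 1, "vocabulary": 1, "grammar": 1}
ratings = {"content": 3, "organization": 4, "vocabulary": 2, "grammar": 3}

total = analytic_score(ratings, weights)
maximum = MAX_BAND * sum(weights.values())
print(f"Total: {total} out of {maximum}")

# Per-dimension ratings double as feedback on strengths and weaknesses.
weakest = min(ratings, key=ratings.get)
print(f"Area needing most work: {weakest}")
```

Keeping the per-dimension ratings alongside the total is what lets the same rubric show progress over time in individual dimensions.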
Primary trait scoring is strictly task-specific:
performance is evaluated on only one trait, such
as "Persuading an audience".
Ex. Primary Trait: Persuading an audience
0 Fails to persuade the audience.
1 Attempts to persuade but does not provide sufficient support.
2 Presents a somewhat persuasive argument but without consistent
development and support.
3 Develops a persuasive argument that is well developed and supported.
Multiple trait scoring rubrics build on the concept of primary trait
scoring and provide diagnostic feedback to learners about performance on
"context-appropriate and task-appropriate criteria" for a specified topic.
• The rubrics are aligned with the task and curriculum.
• Aligned and well-written primary and multiple trait rubrics can ensure
construct and content validity of criterion-referenced assessments.
• Feedback is focused on one or more dimensions that are important in the
current learning context.
• With a multiple trait rubric, learners receive information about their strengths
and weaknesses.
• Primary and multiple trait rubrics are generally written in language that
students understand.
• Teachers are able to rate performances quickly.
• Many rubrics of this type have been developed by teachers who are willing
to share them online, at conferences, and in materials available for
purchase.
Assessment

Weitere ähnliche Inhalte

Was ist angesagt?

Chapter 1 testing assessing and teaching
Chapter 1   testing assessing and teachingChapter 1   testing assessing and teaching
Chapter 1 testing assessing and teachingKlab Warna
 
Testing, assessing,& teaching
Testing, assessing,& teachingTesting, assessing,& teaching
Testing, assessing,& teachingAstrid Caballero
 
Introduction to Test and Assessment
Introduction to Test and Assessment Introduction to Test and Assessment
Introduction to Test and Assessment soerdepoer
 
Testing, assessing, and teaching
Testing, assessing, and teachingTesting, assessing, and teaching
Testing, assessing, and teachingSutrisno Evenddy
 
Course embedded assessment using goals, alignments and reporting
Course embedded assessment using goals, alignments and reportingCourse embedded assessment using goals, alignments and reporting
Course embedded assessment using goals, alignments and reportingKimberly Jordan Seeber
 
Principles of language assessment
Principles of language assessmentPrinciples of language assessment
Principles of language assessmentAstrid Caballero
 
Basic Principles of Assessment
Basic Principles of AssessmentBasic Principles of Assessment
Basic Principles of AssessmentYee Bee Choo
 
Test and types of tests
Test and types of testsTest and types of tests
Test and types of testsFousiya O P
 
Educational Assessment and Evaluation
Educational Assessment and EvaluationEducational Assessment and Evaluation
Educational Assessment and EvaluationAnjali Sharma
 
Assessment plan
Assessment planAssessment plan
Assessment planmskmoss
 
Feedback on summative assessment group pres
Feedback on summative assessment group presFeedback on summative assessment group pres
Feedback on summative assessment group prespardopgcert
 
Assessment &testing in the classroom
Assessment &testing in the classroomAssessment &testing in the classroom
Assessment &testing in the classroomCidher89
 
3 basic-principles_of_assessment
3  basic-principles_of_assessment3  basic-principles_of_assessment
3 basic-principles_of_assessmenthakim azman
 
Question bank preparation, validation & moderation by panel & utiliz...
Question  bank preparation, validation & moderation by panel & utiliz...Question  bank preparation, validation & moderation by panel & utiliz...
Question bank preparation, validation & moderation by panel & utiliz...Amrita Roy (Ex Capt.) (MSN,MBA-HCS,BSN)
 

Was ist angesagt? (20)

Chapter 1 testing assessing and teaching
Chapter 1   testing assessing and teachingChapter 1   testing assessing and teaching
Chapter 1 testing assessing and teaching
 
Testing, assessing,& teaching
Testing, assessing,& teachingTesting, assessing,& teaching
Testing, assessing,& teaching
 
Introduction to Test and Assessment
Introduction to Test and Assessment Introduction to Test and Assessment
Introduction to Test and Assessment
 
Testing, assessing, and teaching
Testing, assessing, and teachingTesting, assessing, and teaching
Testing, assessing, and teaching
 
Course embedded assessment using goals, alignments and reporting
Course embedded assessment using goals, alignments and reportingCourse embedded assessment using goals, alignments and reporting
Course embedded assessment using goals, alignments and reporting
 
Principles of language assessment
Principles of language assessmentPrinciples of language assessment
Principles of language assessment
 
Basic Principles of Assessment
Basic Principles of AssessmentBasic Principles of Assessment
Basic Principles of Assessment
 
Assessment Concepts
Assessment ConceptsAssessment Concepts
Assessment Concepts
 
Pollyana Magne "Inclusive Assessment"
Pollyana Magne "Inclusive Assessment"Pollyana Magne "Inclusive Assessment"
Pollyana Magne "Inclusive Assessment"
 
Test and types of tests
Test and types of testsTest and types of tests
Test and types of tests
 
Educational Assessment and Evaluation
Educational Assessment and EvaluationEducational Assessment and Evaluation
Educational Assessment and Evaluation
 
Assessment plan
Assessment planAssessment plan
Assessment plan
 
Feedback on summative assessment group pres
Feedback on summative assessment group presFeedback on summative assessment group pres
Feedback on summative assessment group pres
 
Language Testing
Language TestingLanguage Testing
Language Testing
 
Assessment &testing in the classroom
Assessment &testing in the classroomAssessment &testing in the classroom
Assessment &testing in the classroom
 
Testing and evaluation
Testing and evaluationTesting and evaluation
Testing and evaluation
 
INTRODUCTION TO ASSESSMENT
INTRODUCTION TO ASSESSMENTINTRODUCTION TO ASSESSMENT
INTRODUCTION TO ASSESSMENT
 
3 basic-principles_of_assessment
3  basic-principles_of_assessment3  basic-principles_of_assessment
3 basic-principles_of_assessment
 
Question bank preparation, validation & moderation by panel & utiliz...
Question  bank preparation, validation & moderation by panel & utiliz...Question  bank preparation, validation & moderation by panel & utiliz...
Question bank preparation, validation & moderation by panel & utiliz...
 
Diagnostic evaluation
Diagnostic evaluationDiagnostic evaluation
Diagnostic evaluation
 

Andere mochten auch

3547 politica nacional logística
3547 politica nacional logística3547 politica nacional logística
3547 politica nacional logísticaLuis Suarez
 
WordPressテンプレート階層を理解する。テーマカスタマイズに必要な5つのポイント!|WordPressもくもく勉強会 at コエド第6回
WordPressテンプレート階層を理解する。テーマカスタマイズに必要な5つのポイント!|WordPressもくもく勉強会 at コエド第6回WordPressテンプレート階層を理解する。テーマカスタマイズに必要な5つのポイント!|WordPressもくもく勉強会 at コエド第6回
WordPressテンプレート階層を理解する。テーマカスタマイズに必要な5つのポイント!|WordPressもくもく勉強会 at コエド第6回Yoshinori Kobayashi
 
De Dieu Butler App for dummies
De Dieu Butler App for dummiesDe Dieu Butler App for dummies
De Dieu Butler App for dummiesTom Hes
 
Don't Write Them Off, Cast Iron Boilers Still Have a Future
Don't Write Them Off, Cast Iron Boilers Still Have a FutureDon't Write Them Off, Cast Iron Boilers Still Have a Future
Don't Write Them Off, Cast Iron Boilers Still Have a FutureBuildingMech
 
Presentation to Business Wealth Club
Presentation to Business Wealth ClubPresentation to Business Wealth Club
Presentation to Business Wealth ClubAlastair Broom
 
Back to School
Back to SchoolBack to School
Back to Schoolmrdavispe
 
The presentation code
The presentation codeThe presentation code
The presentation codeDer Konijnen
 
Enfermedad del parkinson
Enfermedad del parkinsonEnfermedad del parkinson
Enfermedad del parkinsonvivita1070
 
グリッドレイアウトを簡単に行うJavaScript!Masonry.js
グリッドレイアウトを簡単に行うJavaScript!Masonry.jsグリッドレイアウトを簡単に行うJavaScript!Masonry.js
グリッドレイアウトを簡単に行うJavaScript!Masonry.jsYoshinori Kobayashi
 
Analyzing Sound in misfits S1E1
Analyzing Sound in misfits S1E1Analyzing Sound in misfits S1E1
Analyzing Sound in misfits S1E1IndySM
 
The Riley Files: Corrected
The Riley Files: CorrectedThe Riley Files: Corrected
The Riley Files: CorrectedRawleMurdy
 
WordPressのテンプレートタグを理解する
WordPressのテンプレートタグを理解するWordPressのテンプレートタグを理解する
WordPressのテンプレートタグを理解するYoshinori Kobayashi
 

Andere mochten auch (20)

3547 politica nacional logística
3547 politica nacional logística3547 politica nacional logística
3547 politica nacional logística
 
Qr code
Qr codeQr code
Qr code
 
Cairo1
Cairo1Cairo1
Cairo1
 
WordPressテンプレート階層を理解する。テーマカスタマイズに必要な5つのポイント!|WordPressもくもく勉強会 at コエド第6回
WordPressテンプレート階層を理解する。テーマカスタマイズに必要な5つのポイント!|WordPressもくもく勉強会 at コエド第6回WordPressテンプレート階層を理解する。テーマカスタマイズに必要な5つのポイント!|WordPressもくもく勉強会 at コエド第6回
WordPressテンプレート階層を理解する。テーマカスタマイズに必要な5つのポイント!|WordPressもくもく勉強会 at コエド第6回
 
De Dieu Butler App for dummies
De Dieu Butler App for dummiesDe Dieu Butler App for dummies
De Dieu Butler App for dummies
 
Mafalda
MafaldaMafalda
Mafalda
 
simple gift
simple giftsimple gift
simple gift
 
Don't Write Them Off, Cast Iron Boilers Still Have a Future
Don't Write Them Off, Cast Iron Boilers Still Have a FutureDon't Write Them Off, Cast Iron Boilers Still Have a Future
Don't Write Them Off, Cast Iron Boilers Still Have a Future
 
Presentation to Business Wealth Club
Presentation to Business Wealth ClubPresentation to Business Wealth Club
Presentation to Business Wealth Club
 
Back to School
Back to SchoolBack to School
Back to School
 
Photo slide
Photo slidePhoto slide
Photo slide
 
The presentation code
The presentation codeThe presentation code
The presentation code
 
Enfermedad del parkinson
Enfermedad del parkinsonEnfermedad del parkinson
Enfermedad del parkinson
 
グリッドレイアウトを簡単に行うJavaScript!Masonry.js
グリッドレイアウトを簡単に行うJavaScript!Masonry.jsグリッドレイアウトを簡単に行うJavaScript!Masonry.js
グリッドレイアウトを簡単に行うJavaScript!Masonry.js
 
Winter is amazing
Winter is amazingWinter is amazing
Winter is amazing
 
IAT made by me
IAT made by meIAT made by me
IAT made by me
 
Rph Ringkas
Rph RingkasRph Ringkas
Rph Ringkas
 
Analyzing Sound in misfits S1E1
Analyzing Sound in misfits S1E1Analyzing Sound in misfits S1E1
Analyzing Sound in misfits S1E1
 
The Riley Files: Corrected
The Riley Files: CorrectedThe Riley Files: Corrected
The Riley Files: Corrected
 
WordPressのテンプレートタグを理解する
WordPressのテンプレートタグを理解するWordPressのテンプレートタグを理解する
WordPressのテンプレートタグを理解する
 

Ähnlich wie Assessment

A4.Flores.Alisson.CatedraItegradora.pptx
A4.Flores.Alisson.CatedraItegradora.pptxA4.Flores.Alisson.CatedraItegradora.pptx
A4.Flores.Alisson.CatedraItegradora.pptxAlissonFlores20
 
Principles of language assessment.pptx
Principles of language assessment.pptxPrinciples of language assessment.pptx
Principles of language assessment.pptxNOELIAANALIPROAOTROY1
 
Learning_activity1_Navarro Luzuriaga_Joseph Andrés.pptx
Learning_activity1_Navarro Luzuriaga_Joseph Andrés.pptxLearning_activity1_Navarro Luzuriaga_Joseph Andrés.pptx
Learning_activity1_Navarro Luzuriaga_Joseph Andrés.pptxjosephnavarro38
 
Roles of Assessment in Classroom Instruction
Roles of Assessment in Classroom InstructionRoles of Assessment in Classroom Instruction
Roles of Assessment in Classroom InstructionJames Robert Villacorteza
 
La notes (1 7 & 9)
La notes (1 7 & 9)La notes (1 7 & 9)
La notes (1 7 & 9)hakim azman
 
Evaluate Student’s Performance.pptx
Evaluate Student’s Performance.pptxEvaluate Student’s Performance.pptx
Evaluate Student’s Performance.pptxBENITEZSAAVEDRADAYAN
 
MEASUREMENT ASSESSMENT evaluation 1.pptx
MEASUREMENT ASSESSMENT evaluation 1.pptxMEASUREMENT ASSESSMENT evaluation 1.pptx
MEASUREMENT ASSESSMENT evaluation 1.pptxSajan Ks
 
continous assessment (LH) for Jinela Teachers.pdf
continous assessment (LH)   for Jinela Teachers.pdfcontinous assessment (LH)   for Jinela Teachers.pdf
continous assessment (LH) for Jinela Teachers.pdfbeyeneyewondwossenDi
 
Classroom Based Assessment Tools and Techniques 27-09-2022.ppt
Classroom Based Assessment Tools and Techniques 27-09-2022.pptClassroom Based Assessment Tools and Techniques 27-09-2022.ppt
Classroom Based Assessment Tools and Techniques 27-09-2022.pptNasirMahmood976516
 
Evaluation of educational programs in nursing
Evaluation of educational programs in nursingEvaluation of educational programs in nursing
Evaluation of educational programs in nursingNavjyot Singh
 
Languange assessment principles and classroom practices
Languange assessment principles and classroom practicesLanguange assessment principles and classroom practices
Languange assessment principles and classroom practiceszkc8ygk5c9
 
PHYSICS ASSESSMENT General Types of Assessment and The Types of Scales
PHYSICS ASSESSMENT General Types of Assessment and The Types of ScalesPHYSICS ASSESSMENT General Types of Assessment and The Types of Scales
PHYSICS ASSESSMENT General Types of Assessment and The Types of ScalesMillathina Puji Utami
 
evalution-151228155502 (1).pptx evalution-151228155502 (1).pptx
evalution-151228155502 (1).pptx evalution-151228155502 (1).pptxevalution-151228155502 (1).pptx evalution-151228155502 (1).pptx
evalution-151228155502 (1).pptx evalution-151228155502 (1).pptxLoyalZohaibKhattak
 
Test, measurement, assessment & evaluation
Test, measurement, assessment & evaluationTest, measurement, assessment & evaluation
Test, measurement, assessment & evaluationDrSindhuAlmas
 

Ähnlich wie Assessment (20)

A4.Flores.Alisson.CatedraItegradora.pptx
A4.Flores.Alisson.CatedraItegradora.pptxA4.Flores.Alisson.CatedraItegradora.pptx
A4.Flores.Alisson.CatedraItegradora.pptx
 
Principles of language assessment.pptx
Principles of language assessment.pptxPrinciples of language assessment.pptx
Principles of language assessment.pptx
 
Learning_activity1_Navarro Luzuriaga_Joseph Andrés.pptx
Learning_activity1_Navarro Luzuriaga_Joseph Andrés.pptxLearning_activity1_Navarro Luzuriaga_Joseph Andrés.pptx
Learning_activity1_Navarro Luzuriaga_Joseph Andrés.pptx
 
Roles of Assessment in Classroom Instruction
Roles of Assessment in Classroom InstructionRoles of Assessment in Classroom Instruction
Roles of Assessment in Classroom Instruction
 
language and literature assessment
language and literature assessmentlanguage and literature assessment
language and literature assessment
 
La notes (1 7 & 9)
La notes (1 7 & 9)La notes (1 7 & 9)
La notes (1 7 & 9)
 
Evaluate Student’s Performance.pptx
Evaluate Student’s Performance.pptxEvaluate Student’s Performance.pptx
Evaluate Student’s Performance.pptx
 
ASSESSMENT.pptx
ASSESSMENT.pptxASSESSMENT.pptx
ASSESSMENT.pptx
 
Basics of assessment
Basics of assessmentBasics of assessment
Basics of assessment
 
MEASUREMENT ASSESSMENT evaluation 1.pptx
MEASUREMENT ASSESSMENT evaluation 1.pptxMEASUREMENT ASSESSMENT evaluation 1.pptx
MEASUREMENT ASSESSMENT evaluation 1.pptx
 
Assessment purposes and approaches
Assessment purposes and approachesAssessment purposes and approaches
Assessment purposes and approaches
 
continous assessment (LH) for Jinela Teachers.pdf
continous assessment (LH)   for Jinela Teachers.pdfcontinous assessment (LH)   for Jinela Teachers.pdf
continous assessment (LH) for Jinela Teachers.pdf
 
Classroom Based Assessment Tools and Techniques 27-09-2022.ppt
Classroom Based Assessment Tools and Techniques 27-09-2022.pptClassroom Based Assessment Tools and Techniques 27-09-2022.ppt
Classroom Based Assessment Tools and Techniques 27-09-2022.ppt
 
Evaluation of educational programs in nursing
Evaluation of educational programs in nursingEvaluation of educational programs in nursing
Evaluation of educational programs in nursing
 
Languange assessment principles and classroom practices
Languange assessment principles and classroom practicesLanguange assessment principles and classroom practices
Languange assessment principles and classroom practices
 
PHYSICS ASSESSMENT General Types of Assessment and The Types of Scales
PHYSICS ASSESSMENT General Types of Assessment and The Types of ScalesPHYSICS ASSESSMENT General Types of Assessment and The Types of Scales
PHYSICS ASSESSMENT General Types of Assessment and The Types of Scales
 
evalution-151228155502 (1).pptx evalution-151228155502 (1).pptx
evalution-151228155502 (1).pptx evalution-151228155502 (1).pptxevalution-151228155502 (1).pptx evalution-151228155502 (1).pptx
evalution-151228155502 (1).pptx evalution-151228155502 (1).pptx
 
Curriculum Evaluation
Curriculum EvaluationCurriculum Evaluation
Curriculum Evaluation
 
Evalution
Evalution Evalution
Evalution
 
Test, measurement, assessment & evaluation
Test, measurement, assessment & evaluationTest, measurement, assessment & evaluation
Test, measurement, assessment & evaluation
 

Assessment

  • 2. • In today’s language classrooms, the term assessment usually evokes images of an end-of- course paper-pencil test designed to tell both teachers and students how much material the student doesn’t know or hasn’t yet mastered • It includes a broad range of activities and tasks that teachers use to evaluate student’s progress and growth on a daily basis.
  • 3. To make use of evaluation, assessment and test procedures more effective it is necessary to clarify what these concepts are and to explain how they differ from one another It is all-inclusive and it is the widest basis for collecting information in education. It involves looking at all factors that influence the learning process: syllabus, objectives, course design, and materials. Test is a subcategory of assessment, it is a formal systematic procedure used to gather information about student progress. Assessment is part of evaluation because it is concerned with the student and with what the student does. It refers to the variety of ways of collecting information on a learner’s language ability or achievement.
  • 4.
  • 5. The most common use of language tests is to identify strengths and weaknesses in student’s abilities Information gleaned from tests also assist us in deciding who should be allowed to participate in a particular course or program area. Another common use of tests is to provide information about effectiveness of programs instructions
  • 6. • They asses student’s level of language abilities so they can be placed in an appropriate course or class. This type of test indicated the level at which a student will learn most effectively. The primary aim is to create groups of learners that are homogeneous in level • They measures capacity or general ability to learn a foreign language. (Although not commonly used these days) • They identify language area in which student needs further help. The information gained from diagnostic tests are crucial for further course activities and providing students with remediation.
  • 7. • They measures the progress that students are making toward defined course or program goals. Progress tests are generally teacher produced because they cover less material and assess fewer objectives • They are similar to progress tests. They are usually administrated at the mid- and end- point of the semester or academic year. • The content is generally based on the specific course content or on the course objectives. • They assess the overall language ability of students at varying levels. • They tell us how capable a person is in a particular language skill area.
  • 8. • Objective versus subjective tests- sometimes tests are distinguished by the manner in which they are scored by comparing a student’s responses with an established set of acceptable/correct responses on an answer key. With objectively scored tests, the scorer does not require particular knowledge or training in the examined area • In contrast, a subjective test, such as writing an essay, requires scoring by opinion or personal judgment so the human element is very important. • Even experienced scorer need moderated training sessions to ensure inter-rater reliability
  • 9. Criterion referenced tests versus Standardized tests- • Criterion referenced tests are usually developed to measure mastery of well-defined instructional objectives specific for a particular course or program. Their propose is to measure how much learning has occurred. Students performance is compared only to the amount or percentage of material learned. • Standardized tests are designed to measure global language abilities. Students’ scores are interpreted relative to all other students who take the exam. Their purpose is to spread students out along a continuum of scores so that those with low abilities in a certain skill are at one end of the normal distribution and those with high scores are at the other end, with the majority of the students falling between extremes.
  • 10. Summative versus formative tests- • Tests or tasks administered at the end of the course to determine if students have achieved the objectives set out in the curriculum are called summative assessments. they are often used to decide which students move on to a higher level • Formative assessments however, are carried out with the aim of using the results to improve instruction, so they are given during course and feedback is provided to students. High-stakes versus Low-stakes tests- • High-stakes tests are those in which the results are likely to have major impact on the lives of large number individuals or an large programs. • Low-stakes tests are those in which the results have relatively minor impact on the lives of the individual or on small programs. In class progress tests or short quizzes are examples of low-stakes tests
  • 11. • Practicality • Reliability • Validity • Authenticity • Washback
  • 12.
  • 13. • Test reliability: designed items, subjective test, rating, the test item itself • Test administration reliability: conditions of administration (noise, light, temperature, desks, chairs) • Rater reliability: human error, subjectivity, bias toward “good” and “bad” students, inexperience, inattention • Student-related reliability: temporary illness, fatigue, anxiety (other physical, psychological factors)
  • 14. Validity. A valid test: - Measures exactly what it proposes to measure. - Involves performance that samples the test’s criterion. - Offers useful, meaningful information about a test-taker’s abilities. - Is supported by an argument. Types of validity: • Content-related validity • Criterion-related validity • Construct-related validity • Consequential validity (impact)
  • 15. Introduction • Objectives will include 4 distinct components: Audience, Behavior, Condition and Degree. • Objectives must be both observable and measurable to be effective. • Use of words like understand and learn in writing objectives are generally not acceptable as they are difficult to measure. • Written objectives are a vital part of instructional design because they provide the roadmap for designing and delivering curriculum. • Throughout the design and development of curriculum, a comparison of the content to be delivered should be made to the objectives identified for the program. This process, called performance agreement, ensures that the final product meets the overall goal of instruction identified in the first level objectives.
  • 16. Audience: - Describes the intended learner or end user of the instruction - Often the audience is identified only in the first level of objective, to avoid redundancy. Behavior: - Describes learner capability - Must be observable and measurable (you will define the measurement elsewhere in the goal) - If it is a skill, it should be a real-world skill - The “behavior” can include demonstration of knowledge or skills in any of the domains of learning: cognitive, psychomotor, affective, or interpersonal. Condition: - Equipment or tools that may (or may not) be utilized in completion of the behavior - Environmental conditions may also be included. Degree: - States the standard for acceptable performance (time, accuracy, proportion, quality, etc.)
  • 17. The common mistakes have been grouped into four categories as follows: - General examination characteristics. - Item characteristics. - Test validity concerns. - Administrative and scoring issues.
  • 18. General examination characteristics: • Too difficult or too easy • Insufficient number of items • Redundancy of test type • Lack of confidence measure • Negative washback through non-occurrent forms. Item characteristics: • Tricky questions • Redundant wording • Divergence cues • Convergence cues • Option number. Test validity concerns: • Mixed content • Wrong medium • Common knowledge • Syllabus mismatch • Content matching. Administrative and scoring issues: • Lack of cheating control • Inadequate instruction • Administrative inequities • Lack of piloting • Subjectivity of scoring
  • 19. Traditional assessment • Pencil-and-paper test. • Answer the question. • Choose or produce a correct grammatical form or vocabulary item. • Good for checking reading and listening comprehension ability. Alternative assessment • Reveals what students can do with language • It is scored differently • Students can evaluate their own learning and learn from the evaluation process • Gives instructors a way to connect assessment with review of learning strategies
  • 20. - They are built around topics of interest to the students - They replicate real-world communication contexts and situations - They require students to produce a quality product or performance - The evaluation criteria and standards are known to the student - They involve multi-stage tasks and real problems that require creative use of language rather than simple repetition - They involve interaction between the assessor and the person assessed - They allow for self-evaluation
  • 21. Rubrics provide a measure of the quality of a performance on the basis of established criteria. There are four main types of rubrics: • Holistic rubrics • Analytic rubrics • Primary trait rubrics • Multi-trait rubrics
  • 22. In holistic evaluation, raters make judgments by forming an overall impression of a performance and matching it to the best fit from among the descriptions on the scale. • They are often written generically and can be used with many tasks. • They emphasize what learners can do, rather than what they cannot do. • They save time by minimizing the number of decisions raters must make. • Trained raters tend to apply them consistently, resulting in more reliable measurement. • They are easily understood by younger learners. • They do not provide specific feedback to test takers about the strengths and weaknesses of their performance. • Performances may meet criteria in two or more categories, making it difficult to select the one best description. (If this occurs frequently, the rubric may be poorly written.)
  • 23. Analytic scales are usually associated with generic rubrics and tend to focus on broad dimensions of writing or speaking performance. These dimensions may be the same as those found in a generic, holistic scale, but they are presented in separate categories and rated individually. Points may be assigned for performance on each of the dimensions and a total score calculated. • They provide useful feedback to learners on areas of strength and weakness. • Their dimensions can be weighted to reflect relative importance. • They can show learners that they have made progress over time in some or all dimensions when the same rubric categories are used repeatedly. • They take more time to create and use.
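The weighted total described on slide 23 can be sketched as follows. The dimension names, weights, and 0-4 point scale are hypothetical choices made for illustration; the slides do not prescribe any of them.

```python
def analytic_score(ratings, weights, max_points):
    """Weighted analytic total, expressed as a percentage of the maximum possible."""
    if set(ratings) != set(weights):
        raise ValueError("ratings and weights must cover the same dimensions")
    for dimension, points in ratings.items():
        if not 0 <= points <= max_points:
            raise ValueError(f"rating for {dimension} is outside 0..{max_points}")
    earned = sum(ratings[d] * weights[d] for d in ratings)
    possible = max_points * sum(weights.values())
    return 100.0 * earned / possible


# Hypothetical writing rubric: each dimension is rated 0-4,
# and content counts twice as much as the other dimensions.
ratings = {"content": 3, "organization": 4, "vocabulary": 2, "grammar": 3}
weights = {"content": 2, "organization": 1, "vocabulary": 1, "grammar": 1}
total = analytic_score(ratings, weights, max_points=4)

print(total)
```

Keeping the per-dimension ratings alongside the total preserves the diagnostic feedback that distinguishes an analytic scale from a holistic one.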
  • 24. Primary trait scoring would be strictly classified as task-specific, and performance would be evaluated on only one trait, such as “Persuading an audience”. Example. Primary trait: Persuading an audience. 0 Fails to persuade the audience. 1 Attempts to persuade but does not provide sufficient support. 2 Presents a somewhat persuasive argument but without consistent development and support. 3 Develops a persuasive argument that is well developed and supported.
  • 25. Multiple trait scoring rubrics are based on the concept of primary trait scoring and provide diagnostic feedback to learners about performance on "context-appropriate and task-appropriate criteria" for a specified topic. • The rubrics are aligned with the task and curriculum. • Aligned and well-written primary and multiple trait rubrics can ensure construct and content validity of criterion-referenced assessments. • Feedback is focused on one or more dimensions that are important in the current learning context. • With a multiple trait rubric, learners receive information about their strengths and weaknesses. • Primary and multiple trait rubrics are generally written in language that students understand. • Teachers are able to rate performances quickly. • Many rubrics of this type have been developed by teachers who are willing to share them online, at conferences, and in materials available for purchase.