PTLC2005 J. Szpyra-Kozłowska, J. Frankiewicz, M. Nowacka, L. Stadnicka, Assessing
Assessment Methods: 1


            Assessing assessment methods
     – on the reliability of pronunciation tests in EFL
             Jolanta Szpyra-Kozłowska, Justyna Frankiewicz,
                     Marta Nowacka, Lidia Stadnicka

           Maria Curie-Skłodowska University, Lublin, Poland

1. Introductory remarks
Teaching another language is inevitably tied to testing. Teachers have to assess
the learners’ linguistic ability, their progress and achievements. In this respect
pronunciation is no different from other language skills; if we regard it as an important
element of communicative competence which deserves a place in language
instruction, we should also be able to evaluate the process of teaching/learning it as
well as its outcome. Yet, as pointed out by Celce-Murcia et al. (1996: 341), ‘in the
existing literature on teaching pronunciation, little attention is paid to issues of testing
and evaluation.’ The major reason for this neglect is the fact that, as argued by
Heaton (1988: 88), speaking, which obviously comprises pronunciation, is too
complex a skill ‘to permit any reliable analysis to be made for the purpose of objective
testing.’

The present paper addresses the issue of the reliability of the most frequently
employed assessment methods of EFL learners’ pronunciation. First we examine
impression-based pronunciation testing in the internationally recognized Cambridge
English Examinations and point to its various shortcomings. Next we present a report
on an experiment which compares two approaches to pronunciation testing: holistic
(global, impressionistic) and atomistic (analytic). We point to their strengths and
weaknesses, and show that they are not equivalent and lead to different results.

2. Pronunciation assessment in Cambridge English Examinations
In evaluating different methods of pronunciation testing, it seems useful to start with
analyzing the way in which it is done in international English language examinations.
Pronunciation does not play any important role in the majority of them (for a detailed
analysis see Szpyra-Kozłowska 2003). Cambridge examinations are no exception to
this rule; candidates get only 5%-6% of the total score for this skill. The assessment
is impressionistic in nature. Thus, the following criteria have been adopted for the 5
basic examinations:
     • KET (Key English Test) – pronunciation is heavily influenced by L1 features
         and may at times be difficult to understand;
     • PET (Preliminary English Test) – pronunciation is generally intelligible, but L1
         features may put a strain on the listener;
     • FCE (First Certificate in English) – although pronunciation is easily
         understood, L1 features may be intrusive;
     • CAE (Certificate in Advanced English) – L1 accent may be evident but does
         not affect the clarity of the message;
     • CPE (Certificate of Proficiency in English) – pronunciation is easily
         understood and prosodic features are used effectively; many features,
         including pausing and hesitation, are ‘native-like’.
It is obvious that these requirements are very general and impression-based. Also, the
comments addressed to examiners make constant reference to the vague notions of
intelligibility and the amount of strain a candidate’s pronunciation puts on the listener.
In the manual, evaluators, who are usually experienced nonnative teachers of
English, are instructed as follows: ‘when assessing pronunciation, examiners should
try to put themselves in the position of a non-EFL specialist, native speaker of
English and assess the amount of strain on the listener and the degree of patience
and effort required to understand the candidate.’ This procedure raises the following
doubts:
    1. A professional teacher of English cannot be required to pretend to be a non-
       EFL specialist who, in addition, is a native speaker of English; not everyone
       has a talent for pretending to be a completely different person (what if the
       examiner fails?).
    2. It is not clear what kind of native speaker the examiner is supposed to
       impersonate – a well-travelled university professor, familiar with many
       nonnative varieties of English or a small-town housewife who has never left
       her birthplace?
    3. A nonnative teacher can in most cases understand even very poor English
       spoken by fellow countrymen because of frequent exposure to it. Such a teacher
       is, therefore, in no position to judge its intelligibility to users of English of
       nationalities other than his or her own.
    4. Having no precise criteria of pronunciation assessment, the examiner is likely
       to adopt his own subjective principles of evaluation (see section 3). This often
       happens in spite of standardization procedures and examiners’ training.

We can conclude that the examinations under analysis do not provide clear-cut
criteria for assessing the examinees’ pronunciation: they rely too heavily on very
imprecise impressionistic judgements and make unreasonable demands on
nonnative examiners. This, in turn, seriously undermines their inter-rater reliability.

3. Holistic versus atomistic pronunciation testing

As shown in the preceding section, Cambridge English Examinations, similarly to
many other language tests, employ rather objectionable impressionistic evaluation. It
is therefore crucial to examine its logical alternative, i.e. analytic testing. In this
section these two approaches to pronunciation assessment are compared and
verified.

In the holistic approach to language testing (Alderson et al. 1996:289), ‘examiners
are asked not to pay too much attention to any one aspect of a candidate’s
performance, but rather to judge its overall effectiveness.’ The greatest advantage of
this procedure is that it can be administered to large groups of learners within a short
period of time. Moreover, according to Underhill (1987:101), ‘impression marking is
used for the kind of categories that are very hard to define but everybody agrees are
important: fluency, ability to communicate, style, naturalness of speech, and so on.’
For these reasons it is advocated by many researchers (e.g. Celce-Murcia et al.
1996, Hughes 1991, Koren 1995).

Nevertheless, global pronunciation testing has many drawbacks. It is often too
general and imprecise since the assessment criteria in the rating scales, as has been
shown in section 2, tend to be vague. This means, in consequence, that different
raters might adopt their own criteria of evaluation. Finally, as pointed out by Underhill
(1987: 101), ‘making accurate impression-based assessments requires a lot of
experience. (…) Even experienced assessors find it difficult to make consistent
impression-based judgements.’ In other words, this procedure raises problems both
of intra-rater and inter-rater reliability.
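
The two kinds of reliability mentioned above can be made concrete with a small sketch. The marks below are hypothetical, and the two indicators (the range of marks across raters, and the mean absolute difference between one rater's two passes) are simple descriptive measures assumed here for illustration only; they are not statistics reported in this paper.

```python
from statistics import mean


def inter_rater_spread(marks_by_rater):
    """Range (max - min) of the marks different raters gave the same learner."""
    marks = list(marks_by_rater.values())
    return max(marks) - min(marks)


def intra_rater_drift(first_pass, second_pass):
    """Mean absolute difference between one rater's two passes over the same learners."""
    return mean(abs(first_pass[p] - second_pass[p]) for p in first_pass)


# Hypothetical holistic marks given to ONE learner by three raters.
one_learner = {"rater_A": 3.5, "rater_B": 4.5, "rater_C": 3}

# Hypothetical marks given by ONE rater to three pupils on two occasions.
first_pass = {"P1": 4, "P2": 3, "P3": 3.5}
second_pass = {"P1": 3.5, "P2": 3, "P3": 4.5}

spread = inter_rater_spread(one_learner)
drift = intra_rater_drift(first_pass, second_pass)
print(spread, drift)
```

A spread or drift of zero would mean perfect agreement; the larger the value, the less the marks can be relied upon.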

Analytic evaluation consists in establishing a detailed marking scheme in which
specific aspects of the learner’s performance are evaluated separately. Subsequently
these different ratings are combined to provide an overall mark. An atomistic
approach to pronunciation testing thus involves judgements on the correctness of the
learner’s production of particular vowels, consonants, stress, rhythm, intonation, etc.
This method of pronunciation testing is claimed to be more objective than the holistic
approach as it provides a more detailed diagnosis of the learner’s problems and
achievements. It is generally preferred by pronunciation specialists and phoneticians
(e.g. Vaughan-Rees 1989).

On the other hand, the atomistic procedure is not without its problems. It is extremely
time-consuming and requires recording the learners’ speech samples and
subsequent listening to them several times by the raters. For these reasons this
approach seems unsuitable for large classes and examinations with many
participants.

According to Hughes (1991), the choice between holistic and analytic scoring
depends to some extent on the purpose of testing; atomistic tests are more reliable
for diagnostic purposes in the language classroom and in the situations in which
scoring is carried out in many places by different judges, while holistic evaluation,
which is faster, is more appropriate for experienced scorers who are thoroughly familiar with
the grading system.

In order to compare both approaches, we have carried out an experiment whose
primary goal was to examine whether the holistic and atomistic procedures of
pronunciation testing are equivalent and bring about the same results.

In the experiment reported here 10 judges, all teachers of English, evaluated the
pronunciation of 10 randomly selected intermediate Polish learners, secondary
school pupils, who were asked to read aloud a short passage, which was
subsequently recorded. The raters were first asked to evaluate the pupils’ recorded
pronunciation holistically, using the ordinary scale of Polish school marks of
1, 2, 2.5, 3, 3.5, 4, 4.5, 5 and 6, where 1 = failure and 6 = excellent. After a break of
two weeks the same group of raters assessed the recordings once again. On this
occasion they were given the following 6 criteria to be employed in the evaluation:
pronunciation of individual words, vowel quality (the /i/ - /i:/ distinction in particular),
the interdental fricatives, the -ing suffix, word stress and other phonetic features.
Each of these aspects was rated individually using the same scoring scale as
before. Subsequently, the means were calculated. Finally, the assessors were asked
to comment on the strengths and weaknesses of both approaches.
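
The atomistic scoring step described above can be sketched as follows. The paper states only that the means were calculated, so the unweighted average over the six criteria is an assumption, as are the function name `atomistic_mark` and the example marks, which are hypothetical and not taken from the experiment.

```python
from statistics import mean

# The six criteria given to the raters in the second session.
CRITERIA = (
    "individual words",
    "vowel quality",
    "interdental fricatives",
    "-ing suffix",
    "word stress",
    "other phonetic features",
)

# The scale of marks as listed in the text (1 = failure, 6 = excellent).
VALID_MARKS = {1, 2, 2.5, 3, 3.5, 4, 4.5, 5, 6}


def atomistic_mark(marks):
    """Average one rater's six criterion marks for one learner (unweighted)."""
    missing = [c for c in CRITERIA if c not in marks]
    if missing:
        raise ValueError(f"missing criteria: {missing}")
    invalid = {c: m for c, m in marks.items() if m not in VALID_MARKS}
    if invalid:
        raise ValueError(f"marks outside the scale: {invalid}")
    return mean(marks[c] for c in CRITERIA)


# Hypothetical marks from one rater for one learner.
example = {
    "individual words": 3,
    "vowel quality": 3.5,
    "interdental fricatives": 2.5,
    "-ing suffix": 4,
    "word stress": 3,
    "other phonetic features": 2,
}

overall = atomistic_mark(example)
print(overall)
```

Averaging such per-rater marks over all raters would then give one atomistic mark per learner, comparable with the holistic mark.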

The questionnaires revealed that in making the holistic evaluation the raters
adopted, in fact, various analytic criteria (such as the pronunciation of ‘silent’ letters,
intonation, pauses, devoicing of final obstruents, etc.), which differed from person to
person. Moreover, 90% of the assessors regarded atomistic testing as more reliable and
objective.

The table below contains the results of the experiment. We provide averaged
atomistic and holistic marks given by the raters.

                                              ASSESSMENT
                               Learner     Holistic   Atomistic
                               L1          3.7        3.0
                               L2          3.0        2.5
                               L3          4.0        3.2
                               L4          3.2        3.1
                               L5          4.0        3.3
                               L6          4.4        4.1
                               L7          4.1        3.6
                               L8          3.0        3.0
                               L9          2.5        2.8
                               L10         3.4        3.1
                               Mean        3.53       3.17

                  Table 1. Results of holistic and atomistic assessment

As can clearly be observed, in 8 cases out of 10 the mean atomistic marks are lower
than the holistic marks. In one case the results are reversed and in one they are the same.
The obtained means are 3.53 in the holistic evaluation and 3.17 in the analytic
procedure.
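
The figures quoted above can be re-derived directly from Table 1. The short sketch below uses the per-learner means exactly as printed in the table; the function name `summarize` and the dictionary layout are illustrative choices and are not part of the original study.

```python
from statistics import mean

# Mean marks per learner, copied from Table 1: (holistic, atomistic).
TABLE_1 = {
    "L1": (3.7, 3.0), "L2": (3.0, 2.5), "L3": (4.0, 3.2), "L4": (3.2, 3.1),
    "L5": (4.0, 3.3), "L6": (4.4, 4.1), "L7": (4.1, 3.6), "L8": (3.0, 3.0),
    "L9": (2.5, 2.8), "L10": (3.4, 3.1),
}


def summarize(table):
    """Return the overall means and a count of how the two marks compare."""
    holistic = [h for h, _ in table.values()]
    atomistic = [a for _, a in table.values()]
    return {
        "holistic_mean": round(mean(holistic), 2),
        "atomistic_mean": round(mean(atomistic), 2),
        "atomistic_lower": sum(a < h for h, a in table.values()),
        "atomistic_higher": sum(a > h for h, a in table.values()),
        "equal": sum(a == h for h, a in table.values()),
    }


summary = summarize(TABLE_1)
print(summary)
```

The counts reproduce the 8 / 1 / 1 split and the two means reported in the text.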

To verify the obtained results, another experiment, a replica of the previous one, was
conducted with a different group of 5 raters and 5 other learners. This time the
mean scores were 3.56 in the holistic and 3.04 in the analytic assessment.

Thus, a conclusion can be drawn that the holistic and atomistic approaches to
pronunciation testing are not equivalent; the former usually results in higher scores
than analytic assessment. This means that raters generally tend to be more lenient in
their overall impressions than in judgements made on the basis of more specific
criteria. An explanation of this phenomenon can be sought in the likely assumption
that in atomistic testing the focus seems to be on error finding more than in the
holistic procedure, where the criterion of intelligibility is employed, which allows for a
more tolerant approach to phonetic inaccuracies.

4. Final remarks
Pronunciation is extremely difficult to test in an objective and reliable fashion. We
have demonstrated that Cambridge English Examinations, just like other similar
tests, are based entirely on impressionistic evaluation and raise many objections with
regard to their reliability. We have considered an alternative procedure of analytic
evaluation and demonstrated that the two methods are not exactly equivalent, holistic
evaluation being more lenient and permissive than analytic evaluation. The atomistic approach can
be regarded as more objective and reliable, and is particularly well-suited for
diagnostic purposes as it allows the teacher to identify specific pronunciation
problems of the learners to be dealt with in the course of subsequent instruction. It is,
however, time-consuming and not easy to execute with large groups of learners or
examinees. Holistic testing, on the other hand, is technically simpler to carry out. It is
invaluable in assessing the overall impression, the intelligibility of the learner’s
speech and other aspects of his pronunciation which cannot be easily expressed by
means of definite, clear-cut criteria. Its reliability, however, is questionable.
Apparently, neither of these two methods can be viewed as fulfilling all the necessary
requirements of objectivity, reliability and practicality.

References
Alderson, J. C., Wall, D. & C. Clapham. (1996). Language Test Construction and
Evaluation. Cambridge: Cambridge University Press.
Celce-Murcia, M., Brinton, D. & J. Goodwin. (1996). Teaching Pronunciation: a
Reference for Teachers of English to Speakers of Other Languages. Cambridge:
Cambridge University Press.
Heaton, J. B. (1988). Writing English Language Tests. London: Longman.
Hughes, A. (1991). Testing for Language Teachers. Cambridge: Cambridge
University Press.
Koren, S. (1995). “Foreign language pronunciation testing: a new approach.” System
23 (3). 387-400.
Szpyra-Kozłowska, J. (2003). “Miejsce i rola fonetyki w międzynarodowych
egzaminach Cambridge, TOEFL i TSE.” Zeszyty Naukowe PWSZ w Płocku.
Neofilologia. Tom V. 181-191.
Underhill, N. (1987). Testing Spoken Language. A handbook of oral testing
techniques. Cambridge: Cambridge University Press.
Vaughan-Rees, M. (1989). “The testing of pronunciation – receptive skills.” Speak
Out! 4. p. 8.

Weitere ähnliche Inhalte

Was ist angesagt?

Test production process - Approaches to language testing - Techniques of lang...
Test production process - Approaches to language testing - Techniques of lang...Test production process - Approaches to language testing - Techniques of lang...
Test production process - Approaches to language testing - Techniques of lang...Phạm Phúc Khánh Minh
 
Language Testing :kinds of tests
Language Testing :kinds of testsLanguage Testing :kinds of tests
Language Testing :kinds of testsahmedabbas1121
 
Language testing approaches & techniques
Language testing approaches & techniquesLanguage testing approaches & techniques
Language testing approaches & techniquesShin Chan
 
Approaches to Language Testing
Approaches to Language TestingApproaches to Language Testing
Approaches to Language Testingmpazhou
 
Communicative testing presentation
Communicative testing presentationCommunicative testing presentation
Communicative testing presentationsantoshector
 
Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee
Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee
Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee MissJillSmith
 
Testing oral ability
Testing oral abilityTesting oral ability
Testing oral abilityArfan rai
 
UTPL-LENGUAGE TESTING-I-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
UTPL-LENGUAGE TESTING-I-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)UTPL-LENGUAGE TESTING-I-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
UTPL-LENGUAGE TESTING-I-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)Videoconferencias UTPL
 
Testing speaking
Testing speakingTesting speaking
Testing speakingM B
 
Communicative Testing
Communicative  TestingCommunicative  Testing
Communicative TestingNingsih SM
 
Approaches to Language Testing
Approaches to Language TestingApproaches to Language Testing
Approaches to Language TestingAnn Liza Sanchez
 
Uses of language by Brown 1990
Uses of language by Brown 1990Uses of language by Brown 1990
Uses of language by Brown 1990Mahsa Farahanynia
 
Assessing grammar
Assessing grammarAssessing grammar
Assessing grammarjuliovangel
 
UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)Videoconferencias UTPL
 
Communicative testing
Communicative testingCommunicative testing
Communicative testingSamcruz5
 

Was ist angesagt? (20)

Test production process - Approaches to language testing - Techniques of lang...
Test production process - Approaches to language testing - Techniques of lang...Test production process - Approaches to language testing - Techniques of lang...
Test production process - Approaches to language testing - Techniques of lang...
 
Testing oral ability
Testing oral ability Testing oral ability
Testing oral ability
 
Tim McNamara
Tim McNamara   Tim McNamara
Tim McNamara
 
Language Testing :kinds of tests
Language Testing :kinds of testsLanguage Testing :kinds of tests
Language Testing :kinds of tests
 
Language testing approaches & techniques
Language testing approaches & techniquesLanguage testing approaches & techniques
Language testing approaches & techniques
 
Approaches to Language Testing
Approaches to Language TestingApproaches to Language Testing
Approaches to Language Testing
 
Communicative testing presentation
Communicative testing presentationCommunicative testing presentation
Communicative testing presentation
 
Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee
Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee
Summary on LANGUAGE TESTING & ASSESSMENT (Part I) Alderson & Banerjee
 
Testing oral ability
Testing oral abilityTesting oral ability
Testing oral ability
 
UTPL-LENGUAGE TESTING-I-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
UTPL-LENGUAGE TESTING-I-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)UTPL-LENGUAGE TESTING-I-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
UTPL-LENGUAGE TESTING-I-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
 
Testing speaking
Testing speakingTesting speaking
Testing speaking
 
Language assessment
Language assessmentLanguage assessment
Language assessment
 
Communicative Testing
Communicative  TestingCommunicative  Testing
Communicative Testing
 
Kinds of language tests
Kinds of language testsKinds of language tests
Kinds of language tests
 
Approaches to Language Testing
Approaches to Language TestingApproaches to Language Testing
Approaches to Language Testing
 
Uses of language by Brown 1990
Uses of language by Brown 1990Uses of language by Brown 1990
Uses of language by Brown 1990
 
ASSESSMENT OF LISTENING AND SPEAKING
ASSESSMENT OF LISTENING AND SPEAKINGASSESSMENT OF LISTENING AND SPEAKING
ASSESSMENT OF LISTENING AND SPEAKING
 
Assessing grammar
Assessing grammarAssessing grammar
Assessing grammar
 
UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
UTPL-LANGUAGE TESTING-II-BIMESTRE-(OCTUBRE 2011-FEBRERO 2012)
 
Communicative testing
Communicative testingCommunicative testing
Communicative testing
 

Andere mochten auch

Project based-learning by Liliana Nederita
Project based-learning by Liliana NederitaProject based-learning by Liliana Nederita
Project based-learning by Liliana NederitaIrina K
 
Siop presentation
Siop presentationSiop presentation
Siop presentationIrina K
 
Pedagogical management in ELT
Pedagogical management in ELTPedagogical management in ELT
Pedagogical management in ELTIrina K
 
System Of Education In Poland
System Of Education In PolandSystem Of Education In Poland
System Of Education In Polandsavetherainbow
 
Royalty in the uk
Royalty in the ukRoyalty in the uk
Royalty in the ukIrina K
 
Cooperative learning theory
Cooperative learning theoryCooperative learning theory
Cooperative learning theoryIrina K
 
Celebrating mother’s day
Celebrating mother’s dayCelebrating mother’s day
Celebrating mother’s dayIrina K
 
Games in English: Thanksgiving
Games in English: ThanksgivingGames in English: Thanksgiving
Games in English: ThanksgivingIrina K
 
Adapting to students needs
Adapting to students needsAdapting to students needs
Adapting to students needsIrina K
 
Assessing grammar & vocabulary
Assessing grammar & vocabularyAssessing grammar & vocabulary
Assessing grammar & vocabularyMusfera Nara Vadia
 
Tips on lesson planning
Tips on lesson planningTips on lesson planning
Tips on lesson planningIrina K
 
Assessing grammar
Assessing grammarAssessing grammar
Assessing grammarSamcruz5
 
Assessing speaking skills
Assessing speaking skills Assessing speaking skills
Assessing speaking skills nairubymata
 
Chapter 7(assessing speaking )
Chapter 7(assessing speaking )Chapter 7(assessing speaking )
Chapter 7(assessing speaking )Kheang Sokheng
 
Testing grammar and vocabulary
Testing grammar and vocabularyTesting grammar and vocabulary
Testing grammar and vocabularymarinasr_
 
Principles of Language Assessment
Principles of Language AssessmentPrinciples of Language Assessment
Principles of Language AssessmentA Faiz
 

Andere mochten auch (19)

Project based-learning by Liliana Nederita
Project based-learning by Liliana NederitaProject based-learning by Liliana Nederita
Project based-learning by Liliana Nederita
 
Assessing Vocabulary with Dr. John Read
Assessing Vocabulary with Dr. John ReadAssessing Vocabulary with Dr. John Read
Assessing Vocabulary with Dr. John Read
 
Siop presentation
Siop presentationSiop presentation
Siop presentation
 
Pedagogical management in ELT
Pedagogical management in ELTPedagogical management in ELT
Pedagogical management in ELT
 
System Of Education In Poland
System Of Education In PolandSystem Of Education In Poland
System Of Education In Poland
 
Royalty in the uk
Royalty in the ukRoyalty in the uk
Royalty in the uk
 
Cooperative learning theory
Cooperative learning theoryCooperative learning theory
Cooperative learning theory
 
Celebrating mother’s day
Celebrating mother’s dayCelebrating mother’s day
Celebrating mother’s day
 
LANGUAJE TESTING
LANGUAJE TESTINGLANGUAJE TESTING
LANGUAJE TESTING
 
Games in English: Thanksgiving
Games in English: ThanksgivingGames in English: Thanksgiving
Games in English: Thanksgiving
 
Adapting to students needs
Adapting to students needsAdapting to students needs
Adapting to students needs
 
Assessing grammar & vocabulary
Assessing grammar & vocabularyAssessing grammar & vocabulary
Assessing grammar & vocabulary
 
Tips on lesson planning
Tips on lesson planningTips on lesson planning
Tips on lesson planning
 
Assessing grammar
Assessing grammarAssessing grammar
Assessing grammar
 
Assessing speaking skills
Assessing speaking skills Assessing speaking skills
Assessing speaking skills
 
Chapter 7(assessing speaking )
Chapter 7(assessing speaking )Chapter 7(assessing speaking )
Chapter 7(assessing speaking )
 
Testing grammar and vocabulary
Testing grammar and vocabularyTesting grammar and vocabulary
Testing grammar and vocabulary
 
Assessing vocabulary
Assessing vocabularyAssessing vocabulary
Assessing vocabulary
 
Principles of Language Assessment
Principles of Language AssessmentPrinciples of Language Assessment
Principles of Language Assessment
 

Ähnlich wie Reliability of pronunciation tests seminar

Assessment &testing in the classroom
Assessment &testing in the classroomAssessment &testing in the classroom
Assessment &testing in the classroomCidher89
 
Summary of all the chapters
Summary of all the chaptersSummary of all the chapters
Summary of all the chapterskashmasardar
 
Introduction to language testing (wed, 23 sept 2014)
Introduction to language testing (wed, 23 sept 2014)Introduction to language testing (wed, 23 sept 2014)
Introduction to language testing (wed, 23 sept 2014)Widya Kurnia Arizona San
 
Language Proficiency Assessment :Oral Language
Language Proficiency Assessment :Oral LanguageLanguage Proficiency Assessment :Oral Language
Language Proficiency Assessment :Oral LanguageJill Frances Salinas
 
LED_207_Module 1_Basic Concepts.docx
LED_207_Module 1_Basic Concepts.docxLED_207_Module 1_Basic Concepts.docx
LED_207_Module 1_Basic Concepts.docxBayacaDebbie
 
Language proficiency assessment oral language
Language proficiency assessment oral languageLanguage proficiency assessment oral language
Language proficiency assessment oral languageJill Frances Salinas
 
Testing and assessment in elt
Testing and assessment in eltTesting and assessment in elt
Testing and assessment in eltCidher89
 
WAYS TO ASSESS PRONUNCIATION LEARNING.docx
WAYS TO ASSESS PRONUNCIATION LEARNING.docxWAYS TO ASSESS PRONUNCIATION LEARNING.docx
WAYS TO ASSESS PRONUNCIATION LEARNING.docxNikMan8
 
Testing : An important part of ELT
Testing : An important part of ELTTesting : An important part of ELT
Testing : An important part of ELTMd.Mahroof Hossain
 
Assesing speaking skills
Assesing speaking skillsAssesing speaking skills
Assesing speaking skillssyed ahmed
 
Language evaluation in science.pptx
Language evaluation in science.pptxLanguage evaluation in science.pptx
Language evaluation in science.pptxSubramanian Mani
 
Assessing Language Learning
Assessing Language LearningAssessing Language Learning
Assessing Language LearningMark Wallace
 
Ppg module tsl3105 topic 4 assessing l&s skills
Ppg module tsl3105 topic 4 assessing l&s skillsPpg module tsl3105 topic 4 assessing l&s skills
Ppg module tsl3105 topic 4 assessing l&s skillsJojo PaPat
 
Basic Assessment Concepts
Basic Assessment ConceptsBasic Assessment Concepts
Basic Assessment ConceptsAliAlZurfi
 
introducing language testing and assessment
 introducing language testing  and assessment introducing language testing  and assessment
introducing language testing and assessmentNajah M. Algolaip
 
Group 1 - Devini.AR , Henny, Wahyuni - Language Testing - Mrs.Tiara Dian Sari...
Group 1 - Devini.AR , Henny, Wahyuni - Language Testing - Mrs.Tiara Dian Sari...Group 1 - Devini.AR , Henny, Wahyuni - Language Testing - Mrs.Tiara Dian Sari...
Group 1 - Devini.AR , Henny, Wahyuni - Language Testing - Mrs.Tiara Dian Sari...tiara dian
 
languagetestingpresentationintroducinglangauageandassessment-210119155750.pdf
languagetestingpresentationintroducinglangauageandassessment-210119155750.pdflanguagetestingpresentationintroducinglangauageandassessment-210119155750.pdf
languagetestingpresentationintroducinglangauageandassessment-210119155750.pdfAttallah Alanazi
 

Ähnlich wie Reliability of pronunciation tests seminar (20)

Assessment &testing in the classroom
Assessment &testing in the classroomAssessment &testing in the classroom
Assessment &testing in the classroom
 
Summary of all the chapters
Summary of all the chaptersSummary of all the chapters
Summary of all the chapters
 
Introduction to language testing (wed, 23 sept 2014)
Introduction to language testing (wed, 23 sept 2014)Introduction to language testing (wed, 23 sept 2014)
Introduction to language testing (wed, 23 sept 2014)
 
Language Proficiency Assessment :Oral Language
Language Proficiency Assessment :Oral LanguageLanguage Proficiency Assessment :Oral Language
Language Proficiency Assessment :Oral Language
 
LED_207_Module 1_Basic Concepts.docx
LED_207_Module 1_Basic Concepts.docxLED_207_Module 1_Basic Concepts.docx
LED_207_Module 1_Basic Concepts.docx
 
Language proficiency assessment oral language
Language proficiency assessment oral languageLanguage proficiency assessment oral language
Language proficiency assessment oral language
 
Testing and assessment in elt
Testing and assessment in eltTesting and assessment in elt
Testing and assessment in elt
 
WAYS TO ASSESS PRONUNCIATION LEARNING.docx
WAYS TO ASSESS PRONUNCIATION LEARNING.docxWAYS TO ASSESS PRONUNCIATION LEARNING.docx
WAYS TO ASSESS PRONUNCIATION LEARNING.docx
 
Testing : An important part of ELT
Testing : An important part of ELTTesting : An important part of ELT
Testing : An important part of ELT
 
Assesing speaking skills
Assesing speaking skillsAssesing speaking skills
Assesing speaking skills
 
Language evaluation in science.pptx
Language evaluation in science.pptxLanguage evaluation in science.pptx
Language evaluation in science.pptx
 
Ket handbook2007
Ket handbook2007Ket handbook2007
Ket handbook2007
 
Assessing Language Learning
Assessing Language LearningAssessing Language Learning
Assessing Language Learning
 
Ppg module tsl3105 topic 4 assessing l&s skills
Ppg module tsl3105 topic 4 assessing l&s skillsPpg module tsl3105 topic 4 assessing l&s skills
Ppg module tsl3105 topic 4 assessing l&s skills
 
Rd connections14
Rd connections14Rd connections14
Rd connections14
 
A AND E.ppt
A AND E.pptA AND E.ppt
A AND E.ppt
 
Basic Assessment Concepts
Basic Assessment ConceptsBasic Assessment Concepts
Basic Assessment Concepts
 
Reliability of pronunciation tests seminar

PTLC2005 J. Szpyra-Kozłowska, J. Frankiewicz, M. Nowacka, L. Stadnicka, Assessing Assessment Methods

Assessing assessment methods – on the reliability of pronunciation tests in EFL
Jolanta Szpyra-Kozłowska, Justyna Frankiewicz, Marta Nowacka, Lidia Stadnicka
Maria Curie-Skłodowska University, Lublin, Poland

1. Introductory remarks
Teaching another language is inevitably tied to testing. Teachers have to assess the learners’ linguistic ability, their progress and achievements. In this respect pronunciation is no different from other language skills; if we regard it as an important element of communicative competence which deserves a place in language instruction, we should also be able to evaluate the process of teaching/learning it as well as its outcome. Yet, as pointed out by Celce-Murcia et al. (1996: 341), ‘in the existing literature on teaching pronunciation, little attention is paid to issues of testing and evaluation.’ The major reason for this neglect is the fact that, as argued by Heaton (1988: 88), speaking, which obviously comprises pronunciation, is too complex a skill ‘to permit any reliable analysis to be made for the purpose of objective testing.’

The present paper addresses the issue of the reliability of the most frequently employed assessment methods of EFL learners’ pronunciation. First we examine impression-based pronunciation testing in the internationally recognized Cambridge English Examinations and point to its various shortcomings. Next we present a report on an experiment which compares two approaches to pronunciation testing: holistic (global, impressionistic) and atomistic (analytic). We point to their strengths and weaknesses, and show that they are not equivalent and lead to different results.

2. Pronunciation assessment in Cambridge English Examinations
In evaluating different methods of pronunciation testing, it seems useful to start by analyzing the way in which it is done in international English language examinations. Pronunciation does not play any important role in the majority of them (for a detailed analysis see Szpyra-Kozłowska 2003). Cambridge examinations are no exception to this rule; candidates get only 5%-6% of the total score for this skill. The assessment is impressionistic in nature. Thus, the following criteria have been adopted for the 5 basic examinations:
• KET (Key English Test) – pronunciation is heavily influenced by L1 features and may at times be difficult to understand;
• PET (Preliminary English Test) – pronunciation is generally intelligible, but L1 features may put a strain on the listener;
• FCE (First Certificate in English) – although pronunciation is easily understood, L1 features may be intrusive;
• CAE (Certificate in Advanced English) – L1 accent may be evident but does not affect the clarity of the message;
• CPE (Certificate of Proficiency in English) – pronunciation is easily understood and prosodic features are used effectively; many features, including pausing and hesitation, are ‘native-like’.

It is obvious that these requirements are very general and impression-based. Comments addressed to examiners also make constant reference to the vague notions of intelligibility and the amount of strain a candidate’s pronunciation puts on the listener. In the manual, evaluators, who are usually experienced nonnative teachers of English, are instructed as follows: ‘when assessing pronunciation, examiners should try to put themselves in the position of a non-EFL specialist, native speaker of English and assess the amount of strain on the listener and the degree of patience and effort required to understand the candidate.’ This procedure raises the following doubts:
1. A professional teacher of English cannot be required to pretend to be a non-EFL specialist who, in addition, is a native speaker of English; not everyone has a talent for pretending to be a completely different person (what if he fails?).
2. It is not clear what kind of native speaker the examiner is supposed to impersonate – a well-travelled university professor, familiar with many nonnative varieties of English, or a small-town housewife who has never left her birthplace?
3. A nonnative teacher in most cases can understand even very bad English of his fellow-countrymen because of his/her frequent exposure to it. He is, therefore, in no position to judge its intelligibility to users of English of nationalities other than his own.
4. Having no precise criteria of pronunciation assessment, the examiner is likely to adopt his own subjective principles of evaluation (see section 3). This often happens in spite of standardization procedures and examiners’ training.

We can conclude that the examinations under analysis do not provide clear-cut criteria for assessing the examinees’ pronunciation: they rely too heavily on very imprecise impressionistic judgements and make unreasonable demands on nonnative examiners. This, in turn, seriously undermines their inter-rater reliability.

3. Holistic versus atomistic pronunciation testing
As shown in the preceding section, Cambridge English Examinations, similarly to many other language tests, employ rather objectionable impressionistic evaluation.
It is therefore crucial to examine its logical alternative, i.e. analytic testing. In this section these two approaches to pronunciation assessment are compared and verified.

In the holistic approach to language testing (Alderson et al. 1996: 289), ‘examiners are asked not to pay too much attention to any one aspect of a candidate’s performance, but rather to judge its overall effectiveness.’ The greatest advantage of this procedure is that it can be administered to large groups of learners within a short period of time. Moreover, according to Underhill (1987: 101), ‘impression marking is used for the kind of categories that are very hard to define but everybody agrees are important: fluency, ability to communicate, style, naturalness of speech, and so on.’ For these reasons it is advocated by many researchers (e.g. Celce-Murcia et al. 1996, Hughes 1991, Koren 1995).

Nevertheless, global pronunciation testing has many drawbacks. It is often too general and imprecise since the assessment criteria in the rating scales, as has been shown in section 2, tend to be vague. This means, in consequence, that different raters might adopt their own criteria of evaluation. Finally, as pointed out by Underhill (1987: 101), ‘making accurate impression-based assessments requires a lot of experience. (…) Even experienced assessors find it difficult to make consistent impression-based judgements.’ In other words, this procedure raises problems of both intra-rater and inter-rater reliability.

Analytic evaluation consists in establishing a detailed marking scheme in which specific aspects of the learner’s performance are evaluated separately. Subsequently these different ratings are combined to provide an overall mark. An atomistic approach to pronunciation testing thus involves judgements on the correctness of the learner’s production of particular vowels, consonants, stress, rhythm, intonation, etc. This method of pronunciation testing is claimed to be more objective than the holistic approach as it provides a more detailed diagnosis of the learner’s problems and achievements. It is generally preferred by pronunciation specialists and phoneticians (e.g. Vaughan-Rees 1989).

On the other hand, the atomistic procedure is not without its problems. It is extremely time-consuming: the learners’ speech samples have to be recorded and the raters then have to listen to them several times. For these reasons this approach seems unsuitable for large classes and examinations with many participants.

According to Hughes (1991), the choice between holistic and analytic scoring depends to some extent on the purpose of testing; atomistic tests are more reliable for diagnostic purposes in the language classroom and in situations in which scoring is carried out in many places by different judges, while holistic evaluation, which is faster, is more appropriate for experienced scorers who are well familiar with the grading system.

In order to compare both approaches, we have carried out an experiment whose primary goal was to examine whether the holistic and atomistic procedures of pronunciation testing are equivalent and bring about the same results. In the experiment reported here 10 judges, all teachers of English, evaluated the pronunciation of 10 randomly selected intermediate Polish learners, secondary school pupils, who were asked to read aloud a short passage, which was subsequently recorded.
The raters were first asked to evaluate the pupils’ recorded pronunciation holistically, using an ordinary scale of Polish school marks of 1, 2, 2.5, 3, 3.5, 4, 4.5, 5 and 6, where 1 = failure and 6 = excellent. After a break of two weeks the same group of raters assessed the recordings once again. On this occasion they were given the following 6 criteria to be employed in the evaluation: pronunciation of individual words, vowel quality (the /i/ - /i:/ distinction in particular), the interdental fricatives, the -ing suffix, word stress and other phonetic features. Each of these aspects was rated individually using the same scoring scale as before. Subsequently, the means were calculated. Finally, the assessors were asked to comment on the strengths and weaknesses of both approaches.

The questionnaires have revealed that in making holistic evaluations the raters adopted, in fact, various analytic criteria (such as the pronunciation of ‘silent’ letters, intonation, pauses, devoicing of final obstruents, etc.), which differed from person to person. Moreover, 90% of the assessors regarded atomistic testing as more reliable and objective.

The table below contains the results of the experiment. We provide averaged atomistic and holistic marks given by the raters.
            ASSESSMENT
Learners    Holistic    Atomistic
L1          3.7         3
L2          3           2.5
L3          4           3.2
L4          3.2         3.1
L5          4           3.3
L6          4.4         4.1
L7          4.1         3.6
L8          3           3
L9          2.5         2.8
L10         3.4         3.1
Mean        3.53        3.17

Table 1. Results of holistic and atomistic assessment

As can clearly be observed, in 8 cases out of 10 the mean atomistic marks are lower than the holistic marks. In one case the results are reversed and in one they are the same. The obtained means are 3.53 in the holistic evaluation and 3.17 in the analytic procedure. To verify the obtained results, another experiment, a replica of the previous one, was conducted with a different group of 5 raters and 5 other learners. This time the mean scores were 3.56 in the holistic and 3.04 in the analytic assessment.

Thus, a conclusion can be drawn that the holistic and atomistic approaches to pronunciation testing are not equivalent; the former usually results in higher scores than analytic assessment. This means that raters generally tend to be more lenient in their overall impressions than in judgements made on the basis of more specific criteria. An explanation of this phenomenon can be sought in the likely assumption that atomistic testing focuses more on error finding than the holistic procedure does; the latter employs the criterion of intelligibility, which allows for a more tolerant approach to phonetic inaccuracies.

4. Final remarks
Pronunciation is extremely difficult to test in an objective and reliable fashion. We have demonstrated that Cambridge English Examinations, just like other similar tests, are based entirely on impressionistic evaluation and raise many objections with regard to their reliability. We have considered an alternative procedure of analytic evaluation and demonstrated that the two methods are not exactly equivalent, the former being more lenient and permissive than the latter.
The atomistic approach can be regarded as more objective and reliable, and is particularly well-suited for diagnostic purposes as it allows the teacher to identify specific pronunciation problems of the learners to be dealt with in the course of subsequent instruction. It is, however, time-consuming and not easy to execute with large groups of learners or examinees. Holistic testing, on the other hand, is technically simpler to carry out. It is invaluable in assessing the overall impression, the intelligibility of the learner’s speech and other aspects of his pronunciation which cannot be easily expressed by means of definite, clear-cut criteria. Its reliability, however, is questionable. Apparently, neither of these two methods can be viewed as fulfilling all the necessary requirements of objectivity, reliability and practicality.
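
The summary figures reported for Table 1 can be rechecked with a few lines of Python. The sketch below is only an illustration added for the reader, not part of the original study: the per-learner marks are copied from Table 1, while the names TABLE_1 and summarize are hypothetical helpers introduced here.

```python
from statistics import mean

# Mean marks per learner, copied from Table 1 as (holistic, atomistic).
TABLE_1 = {
    "L1": (3.7, 3.0),
    "L2": (3.0, 2.5),
    "L3": (4.0, 3.2),
    "L4": (3.2, 3.1),
    "L5": (4.0, 3.3),
    "L6": (4.4, 4.1),
    "L7": (4.1, 3.6),
    "L8": (3.0, 3.0),
    "L9": (2.5, 2.8),
    "L10": (3.4, 3.1),
}


def summarize(table):
    """Return the overall means and the direction of each learner's difference."""
    pairs = list(table.values())
    return {
        "holistic_mean": round(mean(h for h, _ in pairs), 2),
        "atomistic_mean": round(mean(a for _, a in pairs), 2),
        # Number of learners whose atomistic mark is lower, higher or equal.
        "atomistic_lower": sum(a < h for h, a in pairs),
        "atomistic_higher": sum(a > h for h, a in pairs),
        "equal": sum(a == h for h, a in pairs),
    }


summary = summarize(TABLE_1)
print(summary)
```

Run on the marks in Table 1, the script reproduces the means of 3.53 and 3.17 and the 8/1/1 split between lower, higher and equal atomistic marks described in section 3.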

References
Alderson, J. C., Wall, D. & C. Clapham. (1996). Language Test Construction and Evaluation. Cambridge: Cambridge University Press.
Celce-Murcia, M., Brinton, D. & J. Goodwin. (1996). Teaching Pronunciation: a Reference for Teachers of English to Speakers of Other Languages. Cambridge: Cambridge University Press.
Heaton, J. B. (1988). Writing English Language Tests. London: Longman.
Hughes, A. (1991). Testing for Language Teachers. Cambridge: Cambridge University Press.
Koren, S. (1995). “Foreign language pronunciation testing: a new approach.” System 23 (3). 387-400.
Szpyra-Kozłowska, J. (2003). “Miejsce i rola fonetyki w międzynarodowych egzaminach Cambridge, TOEFL i TSE.” Zeszyty Naukowe PWSZ w Płocku. Neofilologia. Tom V. 181-191.
Underhill, N. (1987). Testing Spoken Language. A handbook of oral testing techniques. Cambridge: Cambridge University Press.
Vaughan-Rees, M. (1989). “The testing of pronunciation – receptive skills.” Speak Out! 4. p. 8.