SlideShare ist ein Scribd-Unternehmen logo
1 von 72
Pedagogical applications of
corpus data for English for
General and Specific Purposes
Université Catholique de Louvain
FIAL (conférence ouverte aux chercheurs et étudiants): Mercredi 4
décembr 12h45 (local ERAS 56).

Pascual Pérez-Paredes
Universidad de Murcia,
Campus Mare Nostrum
Pedagogical applications of
corpus data for English for
General and Specific Purposes
Université Catholique de Louvain
FIAL (conférence ouverte aux chercheurs et étudiants): Mercredi 4
décembr 12h45 (local ERAS 56).

perezparedes.blogspot.com
3

Outline
1. Background: corpora
2. The SACODEYL- BACKBONE approach at a
glance
3. Getting down to annotating
1.
2.
3.

Backbone Annotator: download and installation
Texts and CMT
Guided annotation

1. BACKBONE
2. Specific purposes: LADEX
4
5

Corpus
Principled collection of texts representative of a
given language or reprsentative of a particular
language domain.
-Language research purposes
-Applied purposes: teaching, learning, dictionary
making, testing…
view.byu.edu/corpora.asp
sacodeyl.inf.um.es/sacodeyl-search/
webapps.ael.uni-tuebingen.de/backbone-search/
6

Corpora in language education
ReCALL special issue: Researching uses of corpora for language teaching
and learning . Boulton & Pérez-Paredes (2014).

-Indirect uses: Thorndike and Lorge’s Teacher’s Word Book
of 30,000 Words (1944), West’s General Service List (1953),
or Gougenheim (e.g. 1958) and colleagues’ work on the
Français Fondamental
-Cobuild work led by John Sinclair (1987)
Routledge Frequency Dictionaries
Coxhead’s Academic Word List (2000)
Martinez and Schmitt’s (2012) Phrasal Expressions List
7

TaLC Lancaster 1994
(1) computers and storage at the time were
improving dramatically;
(2) there was a new interest in authentic data and
usage in language education; and
(3) there was a consensus that learners were
adopting new, more active roles in their
learning process.
8

Imagine …..today
9

• Braun (2005, 2007): pedagogically motivated
corpora
(a) provide a more systematic range of material
than individual texts or scattered collections of
activities and, if well-designed, (b) offer a wider
range of idiolects than the average material.
Braun (2006) : thematic annotation, including topic
keys and section titles, are particularly useful in the
implementation of pedagogically motivated corpora
10
11

• Pérez-Paredes & Alcaraz (2009)
For the time being, the natural corpus playground
continues to be tertiary education.
Our motivation:
CL in the language classroom.
The resulting annotated corpus can be seen as being
integrative of language data and annotated pedagogy.
Pedagogy can be annotated and, subsequently, accessed by
corpus users.
12

Linguistic analysis of interest in
FLT
------>
Linguistics comes first
------->

DDL materials
Concordances
and corpus

Researcher/Linguist
End user

What is possible..
(Alcáraz and
Pérez-Paredes
2008)
13

• Pedagogical analysis (and annotation)
of language corpora
------>
Pedagogy comes first
------->
Pedagogy-driven
DDL
What is feasible..
Material
(Alcáraz and
developer/Teacher
/ Learner
Pérez-Paredes
End user
2008)
14
15

www.um.es/sacodeyl
16
17
18
19

sacodeyl.inf.um.es/sacodeyl-search/
webapps.ael.uni-tuebingen.de/backbone-search/
20

• Default annotation
tree has been
developed by the
teachers &
researchers in
SACODEYL
21

What categories
does this default
category tree
contain ?
Topics
Grammatical
Lexical
Style
CEF Level
….
22

Annotator friendly GUI
23

Multilanguage
• Supports a real multilingual annotation
24

Outline
1. Background: corpora
2. The SACODEYL- BACKBONE approach at a
glance
3. Getting down to annotating
1.
2.
3.

Backbone Annotator: download and installation
Texts and CMT
Guided annotation

1. BACKBONE
2. Specific purposes: LADEX
25
26

What is XML TEI format?
▫ TEI  Text Encoding Initiative
▫ This is a format for storing corpora
▫ Has been promoted by OTA
(Oxford Text Archive)
▫ Is a continuously growing format (more than 50
versions released yet, currently TEI P5)
▫ Is rapidly spreading among the available tools
27

TEI Tools (Research)
• TeiPublisher
“This tool is a XML-based repository that
allows the publication of TEI corpora to the
public community and offers a search tool.”

• Dexter
“This is other annotator tool that used TEI as
the format for the annotated files.”
28

TEI Tools (Research)
• Oxygen XML Editor and XMLSpy
“These are XML Editors that allows the
modification of the TEI files without any
limitation”
(These are complex for non-advanced users)
29

TEI Tools (Research)
• TAPoR (http://portal.tapor.ca/)
“The Text Analysis Portal for Research
(TAPoR) is a gateway to tools for sophisticated
analysis and retrieval, along with representative
texts for experimentation.”
30

TEI Tools (Research)
• TokenX
http://www.unl.edu/libr/etext/tokenx.shtml
“Is a text visualization, analysis, and play
tool”
• WordHoard
http://wordhoard.northwestern.edu/userman/index.html

“Is a tool for annotating or tagging texts by
morphological, lexical, prosodic, and
narratological
criteria and for determining frequency
information”
31

TEI Tools (Research)
• XAIRA
XAIRA (XML Aware Information Retrieval
Architecture) is an open source tool for
constructing high-quality linguisticallymotivated search interfaces to large
collections of XML documents.
32

• The XAIRA search
33

TEI Tools (Classroom)
• A more interesting orientation.
How I can use the Annotation in the classroom?
Backbone Search Tool
www.um.es/backbone
34

Outline
1. Background: corpora
2. The SACODEYL- BACKBONE approach at a
glance
3. Getting down to annotating
1.
2.
3.

Backbone Annotator: download and installation
Texts and CMT
Guided annotation

1. BACKBONE
2. Specific purposes: LADEX
35

Download BACKBONE Annotator +
Install + CMT config
http://www.um.es/backbone/
Pérez-Paredes, P., and Alcaraz-Calero, J. M.
(2009). Developing annotation solutions for online
Data Driven Learning. ReCALL 21, 55..
36

How do I create a corpus?
37

How can I add a new document to
the current corpus?
1.

Add document …

2. Select the text
format/encoding
3. Select the new
document
38

What does the text format mean?
• Mainly 4 text formats are supported:
▫
▫
▫
▫

Plain text (written) .txt
Oral text in Backbone Transcriptor format
Oral text in SACODEYL Transcriptor format
XML text in TEI standard format
(text in special XML files)
39

What does the text encoding mean?
 This

is the form in which the text is stored
(related to the Multilanguage).
(In Windows ANSI by default)
40

Selecting the text to annotate
•

Select a document and annotate it

1.

Open document…

2. Select the document
41

Information shown in the working document
• Section Number
• Applied Categories to this section
(Annotations)
• Speaker (only in oral text)
• Transcription
42

What is a section?
• Is a stretch of text that is “whateverly”
motivated.
• A fragment that could be useful in whatever
context
• A section can be established in any kind of text
(oral and written) with the insertion of the
special char (#) for division of texts into
sections.
43

Intuitive Annotation Process
• Drag and Drop to Annotate a Section
44

What is a Keyword?
• “… [a] keyword is a stretch of language (a
word, more than one word or a whole
paragraph) that the annotator associates to a
category…”
Pérez-Paredes and Alcaraz, ReCALL, 2009 Vol
21. (1)
45

What are Keywords?
• BACKBONE Annotator supports the annotation
of keywords
• Just select text and apply a category by rightclicking
46

Selective View
• Offers a selective view of the information in
order to facilitate the organization.
47

Section title
• Drag and Drop the special “Title”
category to the desired section.
• The title is rendered by a
tool tip when placing the cursor on
the section.
(No tool tip = No title)
48

Extensible annotation
• Supports customization of the
annotation
• User can add his/her own
annotation taxonomy or
remove any annotation
category
49

How can I add a new category?
▫ Select the parent category.
(i.e. Topics)
▫ Press Add Cat. Button.
▫ Fill in
50
51

How can I remove a category?
 Select

the category to
remove (i.e. Topic)
 Be careful …
All

the associated children will
be removed also
All the annotation with the
tags will be removed also
 Press

Delete Cat. Button.
52

How can I reorder the categories?
 Select

the category to
reorder (i.e. Topic)
 Press Up Cat or Down Cat.
to move it.
53

How can I customize a category?
 Select

the category to
customize (i.e. Topic)
 Press double click
54

Can I manage metadata?
55

What if I find mistakes?
• Supports edition of the inserted texts.
• Uses XML TEI standard for encoding corpora.
56

Integration
• Backbone Annotator is integrated with
▫
▫
▫
▫

Backbone Transcriptor
Backbone CMT
Backbone Search
SACODEYL VRP
57

Resource Management

• Offers enrichment
of text with external
resources

• i.e. html links,
videos, audios, etc.
58

Where is the information stored?
• Remember: All the information is store in one
file. The corpus file which you have created.
59

Make your corpus collaborative
60

Make your corpus collaborative
61

Make your corpus collaborative
62

Outline
1. Background: corpora
2. The SACODEYL- BACKBONE approach at a
glance
3. Getting down to annotating
1.
2.
3.

Backbone Annotator: download and installation
Texts and CMT
Guided annotation

1. BACKBONE
2. Specific purposes: LADEX
63

Backbone
• Pedagogic Corpora for Content and Language
Integrated Learning. Insights from the
BACKBONE Project. The EUROCALL Review,
20, 2, September 2012
• Kurt Kohn, Applied English Linguistics,
University of Tübingen (Germany)
http://eurocall.webs.upv.es/index.php?
m=menu_00&n=news_20_2
64

webapps.ael.uni-tuebingen.de/backbone-search/
65

Outline
1. Background: corpora
2. The SACODEYL- BACKBONE approach at a
glance
3. Getting down to annotating
1.
2.
3.

Backbone Annotator: download and installation
Texts and CMT
Guided annotation

1. BACKBONE
2. Specific purposes: LADEX
66

Specific uses: Legal-administrative
language and immigration
This

project aims at filling the existing gap between the linguistic
studies combining legal language characterisation and the cultural
and social implications of immigration, from a multilingual angle
(English, Italian, French and Spanish).
The

project will contribute to the definition of the immigrant in
each society, encouraging the debate on solidarity from a linguistic
perspective.
Our

starting point is the compilation, tagging and annotation of a
multilingual corpus comprising a collection of representative
documents used in immigration (UE and non-UE citizens), issued
by the different Public Administrations and institutions in Spain,
UK, France and Italy, ranging from 2007 to 2011.
67

• 1. Compilation and organisation of legal-administrative
binding documents for immigrants in all the countries involved.
• 2. Contrastive analysis of all those terminological, phraseological
and discoursive aspects which can help us shape the cultural
identity of administrators and immigrants.
• 3. Multilingual study of the legal-administrative language analysed
in the research corpus textual typology.
• 4. Contrastive characterisation of the foreign user and cultural
implications.
68

• LADEX Annotator (Multilingual automatic
tagging) + Manual collaborative annotation

• http://www.um.es/languagecorpora
69

Annotation Aim

• Why are you annotating?
• What is the purpose of your annotation?
• What use are you giving to your annotation?
70

Discussion and debate
• Pedagogical annotation vs. Morphological
tagging paradigm
• Learner-centered vs. Researcher-oriented
• Indirect applications of language corpora vs.
Direct applications
• Constraints of traditional CL in the languagge
classroom
71

Discussion and debate
• Cognitive demands of traditional CL in the
language classroom: learner as a reseacher and
as a traveller
• Is CL an extra hassle in language classrooms?
(Mauranen 2004)
• Customization of language corpus/collection of
texts
• Mediation role of corpus-based resources in the
FLT classroom
• Authenticity issues (Widdowson)
72

References and further reading
• Braun, S. 2005. “From pedagogically relevant corpora to authentic
language learning contents”, ReCALL 17/1:47-64.
• Braun, S. 2006. “ELISA - a pedagogically enriched corpus for language
learning purposes”. In Corpus Technology and Language Pedagogy: New
Resources, New Tools, New Methods, Frankfurt M: Peter Lang. (eds) 2547.
• Braun, S. 2007. “Integrating corpus work into secondary education: from
data-driven learning to needs-driven corpora”. ReCALL 19/3: 307-328.
• Mauranen, A. 2004.” Spoken - general: Spoken corpus for an ordinary
learner”. In How to Use Corpora in Language Teaching, Sinclair, J. McH.
(Ed), 89–105.
• Pérez-Paredes, P. and Alcaraz, J.M. 2009. “Developing annotation
solutions for online data-driven learning”. ReCALL,21,1, .
• Römer, Ute. (2008). “Corpora and Language Teaching”. In Corpus
Linguistics. An International Handbook, Lüdeling, Anke & Merja Kytö
(eds.). Berlin: Mouton de Gruyter.
• Widdowson, H.G. 2003. Defining issues in English Language Teaching.
Oxford: Oxford University Press.

perezparedes.blogspot.com

Weitere ähnliche Inhalte

Was ist angesagt?

Task based syllabus
Task based syllabusTask based syllabus
Task based syllabus
Uspan Sayuti
 
Corpus Tools for Language Teaching
Corpus Tools for Language TeachingCorpus Tools for Language Teaching
Corpus Tools for Language Teaching
CALPER
 
Grammar translation method
Grammar translation methodGrammar translation method
Grammar translation method
otokonoko
 

Was ist angesagt? (20)

course and syllabus design
course and syllabus designcourse and syllabus design
course and syllabus design
 
SKILL BASED SYLLABUS
SKILL BASED SYLLABUSSKILL BASED SYLLABUS
SKILL BASED SYLLABUS
 
Implication of Contrastive Analysis in English Language Teaching
Implication of Contrastive Analysis in English Language TeachingImplication of Contrastive Analysis in English Language Teaching
Implication of Contrastive Analysis in English Language Teaching
 
Assessing speaking
Assessing speakingAssessing speaking
Assessing speaking
 
Corpus Linguistics
Corpus LinguisticsCorpus Linguistics
Corpus Linguistics
 
Task based syllabus
Task based syllabusTask based syllabus
Task based syllabus
 
Corpus Tools for Language Teaching
Corpus Tools for Language TeachingCorpus Tools for Language Teaching
Corpus Tools for Language Teaching
 
Esp
EspEsp
Esp
 
Applied linguistics
Applied linguisticsApplied linguistics
Applied linguistics
 
Needs Analysis and Evaluation In English Specific Purposes
Needs Analysis and Evaluation In English Specific PurposesNeeds Analysis and Evaluation In English Specific Purposes
Needs Analysis and Evaluation In English Specific Purposes
 
Grammar translation method
Grammar translation methodGrammar translation method
Grammar translation method
 
task based language teaching TBLT
task based language teaching TBLTtask based language teaching TBLT
task based language teaching TBLT
 
Grammar-Translation Method (GTM) edited by me
Grammar-Translation Method (GTM) edited by meGrammar-Translation Method (GTM) edited by me
Grammar-Translation Method (GTM) edited by me
 
English for Specific Purposes by Tony Dudley Evans
English for Specific Purposes by Tony Dudley EvansEnglish for Specific Purposes by Tony Dudley Evans
English for Specific Purposes by Tony Dudley Evans
 
What is applied linguistics
What is applied linguisticsWhat is applied linguistics
What is applied linguistics
 
Task-based syllabus design and task sequencing
Task-based syllabus design and task sequencingTask-based syllabus design and task sequencing
Task-based syllabus design and task sequencing
 
[ESP] Definitions, Characteristics, and Principles of English for Specific Pu...
[ESP] Definitions, Characteristics, and Principles of English for Specific Pu...[ESP] Definitions, Characteristics, and Principles of English for Specific Pu...
[ESP] Definitions, Characteristics, and Principles of English for Specific Pu...
 
Grammatical based syllabus. Akram Jabar Najim
Grammatical  based syllabus. Akram Jabar NajimGrammatical  based syllabus. Akram Jabar Najim
Grammatical based syllabus. Akram Jabar Najim
 
Esp and listening skills in applied linguistics
Esp and listening skills in applied linguisticsEsp and listening skills in applied linguistics
Esp and listening skills in applied linguistics
 
Krashen's Input Hypotheses
Krashen's Input HypothesesKrashen's Input Hypotheses
Krashen's Input Hypotheses
 

Andere mochten auch

English for Academic Purposes (EAP) vs. general English—a 101 crash course . ...
English for Academic Purposes (EAP) vs. general English—a 101 crash course. ...English for Academic Purposes (EAP) vs. general English—a 101 crash course. ...
English for Academic Purposes (EAP) vs. general English—a 101 crash course . ...
Macmillan Russia
 
Pedagogy Powerpoint
Pedagogy PowerpointPedagogy Powerpoint
Pedagogy Powerpoint
nompi
 
Approaches in teaching and learning k to 12
Approaches in teaching and learning k to 12 Approaches in teaching and learning k to 12
Approaches in teaching and learning k to 12
Charlyn David
 
Language Teaching Approaches and Methods
Language Teaching Approaches and MethodsLanguage Teaching Approaches and Methods
Language Teaching Approaches and Methods
emma.a
 
Principles of Teaching:Different Methods and Approaches
Principles of Teaching:Different Methods and ApproachesPrinciples of Teaching:Different Methods and Approaches
Principles of Teaching:Different Methods and Approaches
justindoliente
 

Andere mochten auch (14)

ESP and general programs
ESP and general programsESP and general programs
ESP and general programs
 
Pedagogy or child learning
Pedagogy or child learningPedagogy or child learning
Pedagogy or child learning
 
English for Academic Purposes (EAP) vs. general English—a 101 crash course . ...
English for Academic Purposes (EAP) vs. general English—a 101 crash course. ...English for Academic Purposes (EAP) vs. general English—a 101 crash course. ...
English for Academic Purposes (EAP) vs. general English—a 101 crash course . ...
 
Esp sep 2011
Esp sep 2011Esp sep 2011
Esp sep 2011
 
Session 3: Pedagogical Approaches
Session 3: Pedagogical ApproachesSession 3: Pedagogical Approaches
Session 3: Pedagogical Approaches
 
Principles of Pedagogy
Principles of PedagogyPrinciples of Pedagogy
Principles of Pedagogy
 
Pedagogical skills
Pedagogical skillsPedagogical skills
Pedagogical skills
 
ENGLISH FOR SPECIFIC PURPOSES
ENGLISH FOR SPECIFIC PURPOSESENGLISH FOR SPECIFIC PURPOSES
ENGLISH FOR SPECIFIC PURPOSES
 
Pedagogy Powerpoint
Pedagogy PowerpointPedagogy Powerpoint
Pedagogy Powerpoint
 
Approaches in teaching and learning k to 12
Approaches in teaching and learning k to 12 Approaches in teaching and learning k to 12
Approaches in teaching and learning k to 12
 
Methods For Teaching English
Methods For Teaching EnglishMethods For Teaching English
Methods For Teaching English
 
Interactive Teaching Strategies
Interactive Teaching StrategiesInteractive Teaching Strategies
Interactive Teaching Strategies
 
Language Teaching Approaches and Methods
Language Teaching Approaches and MethodsLanguage Teaching Approaches and Methods
Language Teaching Approaches and Methods
 
Principles of Teaching:Different Methods and Approaches
Principles of Teaching:Different Methods and ApproachesPrinciples of Teaching:Different Methods and Approaches
Principles of Teaching:Different Methods and Approaches
 

Ähnlich wie Pedagogical applications of corpus data for English for General and Specific Purposes

Learning and Text Analysis for Ontology Engineering
Learning and Text Analysis for Ontology EngineeringLearning and Text Analysis for Ontology Engineering
Learning and Text Analysis for Ontology Engineering
butest
 

Ähnlich wie Pedagogical applications of corpus data for English for General and Specific Purposes (20)

CALICO 2010 Workshop
CALICO 2010  Workshop CALICO 2010  Workshop
CALICO 2010 Workshop
 
TALC 2008 Workshop 1 - Teaching and Language Corpora
TALC 2008 Workshop 1 - Teaching and Language CorporaTALC 2008 Workshop 1 - Teaching and Language Corpora
TALC 2008 Workshop 1 - Teaching and Language Corpora
 
Using pedagogic corpora in ELT
Using pedagogic corpora in ELTUsing pedagogic corpora in ELT
Using pedagogic corpora in ELT
 
Learning and Text Analysis for Ontology Engineering
Learning and Text Analysis for Ontology EngineeringLearning and Text Analysis for Ontology Engineering
Learning and Text Analysis for Ontology Engineering
 
TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31
 
Generating Lexical Information for Terminology in a Bioinformatics Ontology
Generating Lexical Information for Terminologyin a Bioinformatics OntologyGenerating Lexical Information for Terminologyin a Bioinformatics Ontology
Generating Lexical Information for Terminology in a Bioinformatics Ontology
 
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...
 
Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...
Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...
Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...
 
Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...
Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...
Collaboratively Defining Widely Accepted Linguistic Data Categories in the IS...
 
eLanguage.net: Shifting the paradigm in Linguistics
eLanguage.net: Shifting the paradigm in LinguisticseLanguage.net: Shifting the paradigm in Linguistics
eLanguage.net: Shifting the paradigm in Linguistics
 
Terminology: tips and tricks to boost your terminology work
Terminology: tips and tricks to boost your terminology workTerminology: tips and tricks to boost your terminology work
Terminology: tips and tricks to boost your terminology work
 
NLP Tasks and Applications.ppt useful in
NLP Tasks and Applications.ppt useful inNLP Tasks and Applications.ppt useful in
NLP Tasks and Applications.ppt useful in
 
lect36-tasks.ppt
lect36-tasks.pptlect36-tasks.ppt
lect36-tasks.ppt
 
Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...
Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...
Multilingual vocabularies for the Web: Session on multilingual vocabularies, ...
 
Flexible, Free and Open Data-Driven Learning for the Masses (MOOCs)
Flexible, Free and Open Data-Driven Learning for the Masses (MOOCs)Flexible, Free and Open Data-Driven Learning for the Masses (MOOCs)
Flexible, Free and Open Data-Driven Learning for the Masses (MOOCs)
 
Presentation of Adaptive Software at CLIL 2010 Conference
Presentation of Adaptive Software at CLIL 2010 ConferencePresentation of Adaptive Software at CLIL 2010 Conference
Presentation of Adaptive Software at CLIL 2010 Conference
 
Library Boot Camp: Basic Cataloging, Part 2
Library Boot Camp: Basic Cataloging, Part 2Library Boot Camp: Basic Cataloging, Part 2
Library Boot Camp: Basic Cataloging, Part 2
 
Philippe Langlais - 2017 - Users and Data: The Two Neglected Children of Bili...
Philippe Langlais - 2017 - Users and Data: The Two Neglected Children of Bili...Philippe Langlais - 2017 - Users and Data: The Two Neglected Children of Bili...
Philippe Langlais - 2017 - Users and Data: The Two Neglected Children of Bili...
 
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...
Application of Ontology in Semantic Information Retrieval by Prof Shahrul Azm...
 
Antconc
AntconcAntconc
Antconc
 

Mehr von Pascual Pérez-Paredes

Rannsókn á lestrarvenjum og notkun bókmennta
Rannsókn á lestrarvenjum og notkun bókmenntaRannsókn á lestrarvenjum og notkun bókmennta
Rannsókn á lestrarvenjum og notkun bókmennta
Pascual Pérez-Paredes
 

Mehr von Pascual Pérez-Paredes (20)

TELL-OP App - How it works
TELL-OP  App - How it worksTELL-OP  App - How it works
TELL-OP App - How it works
 
TELL-OP App
TELL-OP  App TELL-OP  App
TELL-OP App
 
Developing corpus-based resources for language learning: looking back in "hope"
Developing corpus-based resources for language learning: looking back in "hope"Developing corpus-based resources for language learning: looking back in "hope"
Developing corpus-based resources for language learning: looking back in "hope"
 
A contrastive analysis of native and non-native speaker interviews
A contrastive analysis of native and non-native speaker interviewsA contrastive analysis of native and non-native speaker interviews
A contrastive analysis of native and non-native speaker interviews
 
Education as a multilingual and multicultural space
Education as a multilingual and multicultural spaceEducation as a multilingual and multicultural space
Education as a multilingual and multicultural space
 
Higher Education as a multilingual and multicultural space
Higher Education as a multilingual and multicultural spaceHigher Education as a multilingual and multicultural space
Higher Education as a multilingual and multicultural space
 
English-medium instruction as a transformation policy
English-medium instruction as a transformation policyEnglish-medium instruction as a transformation policy
English-medium instruction as a transformation policy
 
European Commission Erasmus – Facts, Figures & Trends.
European Commission Erasmus – Facts, Figures & Trends.European Commission Erasmus – Facts, Figures & Trends.
European Commission Erasmus – Facts, Figures & Trends.
 
Escribir ciencia en inglés
Escribir ciencia en inglésEscribir ciencia en inglés
Escribir ciencia en inglés
 
Aesla 2011 getting_things_done_pascual_pérez-paredes
Aesla 2011 getting_things_done_pascual_pérez-paredesAesla 2011 getting_things_done_pascual_pérez-paredes
Aesla 2011 getting_things_done_pascual_pérez-paredes
 
Los blogs en el área de humanidades
Los blogs en el área de humanidadesLos blogs en el área de humanidades
Los blogs en el área de humanidades
 
Kynnig á degi íslenskrar tungu
Kynnig á degi íslenskrar tunguKynnig á degi íslenskrar tungu
Kynnig á degi íslenskrar tungu
 
Rannsókn á lestrarvenjum og notkun bókmennta
Rannsókn á lestrarvenjum og notkun bókmenntaRannsókn á lestrarvenjum og notkun bókmennta
Rannsókn á lestrarvenjum og notkun bókmennta
 
Involvement in personal narratives-ma of learner language
Involvement in personal narratives-ma of learner languageInvolvement in personal narratives-ma of learner language
Involvement in personal narratives-ma of learner language
 
Jornada lectura lit. infantil September 28, 2011
Jornada lectura lit. infantil September 28, 2011Jornada lectura lit. infantil September 28, 2011
Jornada lectura lit. infantil September 28, 2011
 
Teaching and learning children litarature in europa ni̇han
Teaching and learning children litarature in europa ni̇hanTeaching and learning children litarature in europa ni̇han
Teaching and learning children litarature in europa ni̇han
 
UK Comenius project dissemination event
UK Comenius project dissemination eventUK Comenius project dissemination event
UK Comenius project dissemination event
 
Specialist genres
Specialist genresSpecialist genres
Specialist genres
 
What can a corpus tell us about discourse
What can a corpus tell us about discourseWhat can a corpus tell us about discourse
What can a corpus tell us about discourse
 
Discourse and corpus
Discourse and corpusDiscourse and corpus
Discourse and corpus
 

Kürzlich hochgeladen

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 

Kürzlich hochgeladen (20)

ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 

Pedagogical applications of corpus data for English for General and Specific Purposes

  • 1. Pedagogical applications of corpus data for English for General and Specific Purposes Université Catholique de Louvain FIAL (conférence ouverte aux chercheurs et étudiants): Mercredi 4 décembr 12h45 (local ERAS 56). Pascual Pérez-Paredes Universidad de Murcia, Campus Mare Nostrum
  • 2. Pedagogical applications of corpus data for English for General and Specific Purposes Université Catholique de Louvain FIAL (conférence ouverte aux chercheurs et étudiants): Mercredi 4 décembr 12h45 (local ERAS 56). perezparedes.blogspot.com
  • 3. 3 Outline 1. Background: corpora 2. The SACODEYL- BACKBONE approach at a glance 3. Getting down to annotating 1. 2. 3. Backbone Annotator: download and installation Texts and CMT Guided annotation 1. BACKBONE 2. Specific purposes: LADEX
  • 5. 5 Corpus Principled collection of texts representative of a given language or reprsentative of a particular language domain. -Language research purposes -Applied purposes: teaching, learning, dictionary making, testing… view.byu.edu/corpora.asp sacodeyl.inf.um.es/sacodeyl-search/ webapps.ael.uni-tuebingen.de/backbone-search/
  • 6. 6 Corpora in language education ReCALL special issue: Researching uses of corpora for language teaching and learning . Boulton & Pérez-Paredes (2014). -Indirect uses: Thorndike and Lorge’s Teacher’s Word Book of 30,000 Words (1944), West’s General Service List (1953), or Gougenheim (e.g. 1958) and colleagues’ work on the Français Fondamental -Cobuild work led by John Sinclair (1987) Routledge Frequency Dictionaries Coxhead’s Academic Word List (2000) Martinez and Schmitt’s (2012) Phrasal Expressions List
  • 7. 7 TaLC Lancaster 1994 (1) computers and storage at the time were improving dramatically; (2) there was a new interest in authentic data and usage in language education; and (3) there was a consensus that learners were adopting new, more active roles in their learning process.
  • 9. 9 • Braun (2005, 2007): pedagogically motivated corpora (a) provide a more systematic range of material than individual texts or scattered collections of activities and, if well-designed, (b) offer a wider range of idiolects than the average material. Braun (2006) : thematic annotation, including topic keys and section titles, are particularly useful in the implementation of pedagogically motivated corpora
  • 10. 10
  • 11. 11 • Pérez-Paredes & Alcaraz (2009) For the time being, the natural corpus playground continues to be tertiary education. Our motivation: CL in the language classroom. The resulting annotated corpus can be seen as being integrative of language data and annotated pedagogy. Pedagogy can be annotated and, subsequently, accessed by corpus users.
  • 12. 12 Linguistic analysis of interest in FLT ------> Linguistics comes first -------> DDL materials Concordances and corpus Researcher/Linguist End user What is possible.. (Alcáraz and Pérez-Paredes 2008)
  • 13. 13 • Pedagogical analysis (and annotation) of language corpora ------> Pedagogy comes first -------> Pedagogy-driven DDL What is feasible.. Material (Alcáraz and developer/Teacher / Learner Pérez-Paredes End user 2008)
  • 14. 14
  • 16. 16
  • 17. 17
  • 18. 18
  • 20. 20 • Default annotation tree has been developed by the teachers & researchers in SACODEYL
  • 21. 21 What categories does this default category tree contain ? Topics Grammatical Lexical Style CEF Level ….
  • 23. 23 Multilanguage • Supports a real multilingual annotation
  • 24. 24 Outline 1. Background: corpora 2. The SACODEYL- BACKBONE approach at a glance 3. Getting down to annotating 1. 2. 3. Backbone Annotator: download and installation Texts and CMT Guided annotation 1. BACKBONE 2. Specific purposes: LADEX
  • 25. 25
  • 26. 26 What is XML TEI format? ▫ TEI  Text Encoding Initiative ▫ This is a format for storing corpora ▫ Has been promoted by OTA (Oxford Text Archive) ▫ Is a continuously growing format (more than 50 versions released yet, currently TEI P5) ▫ Is rapidly spreading among the available tools
  • 27. 27 TEI Tools (Research) • TeiPublisher “This tool is a XML-based repository that allows the publication of TEI corpora to the public community and offers a search tool.” • Dexter “This is other annotator tool that used TEI as the format for the annotated files.”
  • 28. 28 TEI Tools (Research) • Oxygen XML Editor and XMLSpy “These are XML Editors that allows the modification of the TEI files without any limitation” (These are complex for non-advanced users)
  • 29. 29 TEI Tools (Research) • TAPoR (http://portal.tapor.ca/) “The Text Analysis Portal for Research (TAPoR) is a gateway to tools for sophisticated analysis and retrieval, along with representative texts for experimentation.”
  • 30. 30 TEI Tools (Research) • TokenX http://www.unl.edu/libr/etext/tokenx.shtml “Is a text visualization, analysis, and play tool” • WordHoard http://wordhoard.northwestern.edu/userman/index.html “Is a tool for annotating or tagging texts by morphological, lexical, prosodic, and narratological criteria and for determining frequency information”
  • 31. 31 TEI Tools (Research) • XAIRA XAIRA (XML Aware Information Retrieval Architecture) is an open source tool for constructing high-quality linguisticallymotivated search interfaces to large collections of XML documents.
  • 33. 33 TEI Tools (Classroom) • A more interesting orientation. How I can use the Annotation in the classroom? Backbone Search Tool www.um.es/backbone
  • 34. 34 Outline 1. Background: corpora 2. The SACODEYL- BACKBONE approach at a glance 3. Getting down to annotating 1. 2. 3. Backbone Annotator: download and installation Texts and CMT Guided annotation 1. BACKBONE 2. Specific purposes: LADEX
  • 35. 35 Download BACKBONE Annotator + Install + CMT config http://www.um.es/backbone/ Pérez-Paredes, P., and Alcaraz-Calero, J. M. (2009). Developing annotation solutions for online Data Driven Learning. ReCALL 21, 55..
  • 36. 36 How do I create a corpus?
  • 37. 37 How can I add a new document to the current corpus? 1. Add document … 2. Select the text format/encoding 3. Select the new document
  • 38. 38 What does the text format mean? • Mainly 4 text formats are supported: ▫ ▫ ▫ ▫ Plain text (written) .txt Oral text in Backbone Transcriptor format Oral text in SACODEYL Transcriptor format XML text in TEI standard format (text in special XML files)
  • 39. 39 What does the text encoding mean?  This is the form in which the text is stored (related to the Multilanguage). (In Windows ANSI by default)
  • 40. 40 Selecting the text to annotate • Select a document and annotate it 1. Open document… 2. Select the document
  • 41. 41 Information shown in the working document • Section Number • Applied Categories to this section (Annotations) • Speaker (only in oral text) • Transcription
  • 42. 42 What is a section? • Is a stretch of text that is “whateverly” motivated. • A fragment that could be useful in whatever context • A section can be established in any kind of text (oral and written) with the insertion of the special char (#) for division of texts into sections.
  • 43. 43 Intuitive Annotation Process • Drag and Drop to Annotate a Section
  • 44. 44 What is a Keyword? • “… [a] keyword is a stretch of language (a word, more than one word or a whole paragraph) that the annotator associates to a category…” Pérez-Paredes and Alcaraz, ReCALL, 2009 Vol 21. (1)
  • 45. 45 What are Keywords? • BACKBONE Annotator supports the annotation of keywords • Just select text and apply a category by rightclicking
  • 46. 46 Selective View • Offers a selective view of the information in order to facilitate the organization.
  • 47. 47 Section title • Drag and Drop the special “Title” category to the desired section. • The title is rendered by a tool tip when placing the cursor on the section. (No tool tip = No title)
  • 48. 48 Extensible annotation • Supports customization of the annotation • User can add his/her own annotation taxonomy or remove any annotation category
  • 49. 49 How can I add a new category? ▫ Select the parent category. (i.e. Topics) ▫ Press Add Cat. Button. ▫ Fill in
  • 50. 50
  • 51. 51 How can I remove a category?  Select the category to remove (i.e. Topic)  Be careful … All the associated children will be removed also All the annotation with the tags will be removed also  Press Delete Cat. Button.
  • 52. 52 How can I reorder the categories?  Select the category to reorder (i.e. Topic)  Press Up Cat or Down Cat. to move it.
  • 53. 53 How can I customize a category?  Select the category to customize (i.e. Topic)  Press double click
  • 54. 54 Can I manage metadata?
  • 55. 55 What if I find mistakes? • Supports edition of the inserted texts. • Uses XML TEI standard for encoding corpora.
  • 56. 56 Integration • Backbone Annotator is integrated with ▫ ▫ ▫ ▫ Backbone Transcriptor Backbone CMT Backbone Search SACODEYL VRP
  • 57. 57 Resource Management • Offers enrichment of text with external resources • i.e. html links, videos, audios, etc.
  • 58. 58 Where is the information stored? • Remember: All the information is store in one file. The corpus file which you have created.
  • 59. 59 Make your corpus collaborative
  • 60. 60 Make your corpus collaborative
  • 61. 61 Make your corpus collaborative
  • 62. 62 Outline 1. Background: corpora 2. The SACODEYL- BACKBONE approach at a glance 3. Getting down to annotating 1. 2. 3. Backbone Annotator: download and installation Texts and CMT Guided annotation 1. BACKBONE 2. Specific purposes: LADEX
  • 63. 63 Backbone • Pedagogic Corpora for Content and Language Integrated Learning. Insights from the BACKBONE Project. The EUROCALL Review, 20, 2, September 2012 • Kurt Kohn, Applied English Linguistics, University of Tübingen (Germany) http://eurocall.webs.upv.es/index.php? m=menu_00&n=news_20_2
  • 65. 65 Outline 1. Background: corpora 2. The SACODEYL- BACKBONE approach at a glance 3. Getting down to annotating 1. 2. 3. Backbone Annotator: download and installation Texts and CMT Guided annotation 1. BACKBONE 2. Specific purposes: LADEX
  • 66. 66 Specific uses: Legal-administrative language and immigration This project aims at filling the existing gap between the linguistic studies combining legal language characterisation and the cultural and social implications of immigration, from a multilingual angle (English, Italian, French and Spanish). The project will contribute to the definition of the immigrant in each society, encouraging the debate on solidarity from a linguistic perspective. Our starting point is the compilation, tagging and annotation of a multilingual corpus comprising a collection of representative documents used in immigration (UE and non-UE citizens), issued by the different Public Administrations and institutions in Spain, UK, France and Italy, ranging from 2007 to 2011.
  • 67. 67 • 1. Compilation and organisation of legal-administrative binding documents for immigrants in all the countries involved. • 2. Contrastive analysis of all those terminological, phraseological and discoursive aspects which can help us shape the cultural identity of administrators and immigrants. • 3. Multilingual study of the legal-administrative language analysed in the research corpus textual typology. • 4. Contrastive characterisation of the foreign user and cultural implications.
  • 68. 68 • LADEX Annotator (Multilingual automatic tagging) + Manual collaborative annotation • http://www.um.es/languagecorpora
  • 69. 69 Annotation Aim • Why are you annotating? • What is the purpose of your annotation? • What use are you giving to your annotation?
  • 70. 70 Discussion and debate • Pedagogical annotation vs. Morphological tagging paradigm • Learner-centered vs. Researcher-oriented • Indirect applications of language corpora vs. Direct applications • Constraints of traditional CL in the languagge classroom
  • 71. 71 Discussion and debate • Cognitive demands of traditional CL in the language classroom: learner as a reseacher and as a traveller • Is CL an extra hassle in language classrooms? (Mauranen 2004) • Customization of language corpus/collection of texts • Mediation role of corpus-based resources in the FLT classroom • Authenticity issues (Widdowson)
  • 72. 72 References and further reading • Braun, S. 2005. “From pedagogically relevant corpora to authentic language learning contents”, ReCALL 17/1:47-64. • Braun, S. 2006. “ELISA - a pedagogically enriched corpus for language learning purposes”. In Corpus Technology and Language Pedagogy: New Resources, New Tools, New Methods, Frankfurt M: Peter Lang. (eds) 2547. • Braun, S. 2007. “Integrating corpus work into secondary education: from data-driven learning to needs-driven corpora”. ReCALL 19/3: 307-328. • Mauranen, A. 2004.” Spoken - general: Spoken corpus for an ordinary learner”. In How to Use Corpora in Language Teaching, Sinclair, J. McH. (Ed), 89–105. • Pérez-Paredes, P. and Alcaraz, J.M. 2009. “Developing annotation solutions for online data-driven learning”. ReCALL,21,1, . • Römer, Ute. (2008). “Corpora and Language Teaching”. In Corpus Linguistics. An International Handbook, Lüdeling, Anke & Merja Kytö (eds.). Berlin: Mouton de Gruyter. • Widdowson, H.G. 2003. Defining issues in English Language Teaching. Oxford: Oxford University Press. perezparedes.blogspot.com

Hinweis der Redaktion

  1. Associar elo concepto con el tagging
  2. NOW, YOU CAN START