Using corpora to
enhance language
learning
Michael Barlow
Overview
wordlists
collocation lists
online concordancers
text analysis software
concordancers
ParaConc and Collocate
web-based exercises
data-driven learning materials
Wordlists – general and
specialised
Wordlists have been around since before the
invention of computers. General wordlists are
used for curriculum development, textbook
writing etc.
It is also possible to produce a wordlist for a
reading (or possibly a textbook)
Wordlists – general
Use existing wordlists such as West's General
Service List and its recent updates, Coxhead's
Academic Word List, and Kilgarriff's wordlists
based on the BNC.
Kilgarriff Page
Academic Word List
• receptive list (based on morphological
derivations)
• the list excludes high-frequency general words
(those in the General Service List), even if
they occur in academic texts
• do we need subject- or genre-specific
wordlists? (Hyland)
Specialised Word List
• Create a wordlist from a corpus (using a
concordancer or other utilities)
• May need to create your own corpus –
BootCaT (Silvia Bernardini)
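
As a rough illustration of creating a wordlist from a corpus, the sketch below counts word frequencies across a set of texts. The tokenizer, the function names and the two sample sentences are assumptions made for this example; a real concordancer will tokenize more carefully.

```python
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and return word tokens (letters, optional apostrophe part)."""
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())

def frequency_list(texts):
    """Return (word, count) pairs, most frequent first, ties in alphabetical order."""
    counts = Counter()
    for text in texts:
        counts.update(tokenize(text))
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))

# Hypothetical mini-corpus; in practice these would be the files of your own corpus.
corpus = [
    "The contract was signed by the supplier.",
    "The supplier's invoice refers to the contract.",
]

for word, count in frequency_list(corpus)[:5]:
    print(f"{count:3d}  {word}")
```

The same function can be run over a single reading or a whole textbook to get a text-specific wordlist.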
BootCaT
Vocab Profile
• Tom Cobb's Vocab Profile
• http://www.lextutor.ca/vp/eng/
Collocation lists
• More difficult to find – use a collocation
dictionary?
• Biber's work on lexical bundles
• Use a concordancer or utility to create n-gram
lists or locate collocations
• Collocate – shown below
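
An n-gram list of the kind mentioned above can be sketched in a few lines. This is a simplified illustration, not how any particular tool works: the function names are invented here, and the token sequence is treated as one stream, so n-grams may cross sentence boundaries.

```python
from collections import Counter

def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def ngram_list(tokens, n, min_freq=1):
    """Return (ngram, count) pairs with count >= min_freq, most frequent first."""
    counts = Counter(ngrams(tokens, n))
    return [(gram, c) for gram, c in counts.most_common() if c >= min_freq]

# Hypothetical token sequence used only for demonstration.
tokens = "on the other hand on the one hand".split()

for gram, count in ngram_list(tokens, 2, min_freq=2):
    print(count, " ".join(gram))
```

Raising min_freq is a simple way to keep only recurrent sequences, which is closer to the idea of lexical bundles.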
Concordancers
• Online concordancer
Concordancers
Concordancers –
americancorpus.org
Concordancers
• Using a concordancer in the classroom
• Corpus as a reference tool – query the corpus
– can you say “the government are”?
– what is the difference between “for
instance” and “for example”?
– Tim Johns – Data-driven Learning
• (...caused economic
development...)
Concordancers – text
reconstruction exercises
Data-driven learning
(deductive)
Data-driven learning
(inductive)
Concordance data
• DDL – highlighting/noticing/discovery learning
• Highlight unexpected (for the learner)
distinctions, uses etc.
• Sequence data to build up knowledge
Parallel concordance
data
• A parallel concordancer works on a translation
corpus
• Students need to share the same L1
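
A parallel concordance can be pictured as a search over sentence-aligned pairs. The sketch below assumes the alignment already exists as a list of (source, target) tuples; the function name and the English-French pairs are invented for illustration and do not reflect ParaConc's internals.

```python
import re

def parallel_concordance(pairs, query):
    """Return the aligned (source, target) pairs whose source contains query."""
    pattern = re.compile(r"\b" + re.escape(query) + r"\b", re.IGNORECASE)
    return [(src, tgt) for src, tgt in pairs if pattern.search(src)]

# Hypothetical aligned sentence pairs (English source, French target).
pairs = [
    ("I am looking for my keys.", "Je cherche mes clés."),
    ("She looked at the picture.", "Elle a regardé le tableau."),
    ("We are looking forward to it.", "Nous l'attendons avec impatience."),
]

for src, tgt in parallel_concordance(pairs, "looking"):
    print(src)
    print("   ", tgt)
```

Seeing the hits next to their translations lets students who share an L1 compare how one form is rendered in different contexts.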
Concordance data
issues
• KWIC format
• Google effect
• Data overload
• Reauthenticating data
– Sabine Braun – includes discourse
perspective (Why did the speaker use
that form?)
Parallel Corpora – DDL
(Chujo, Kiyomi)
Collocate
Software to extract collocations/terms
Word search + Span (2 words, 3 words etc.)
n-gram (bigram, trigram) list
Full extract -- collocations in a corpus
Search for analysis
(Span = 2)
analysis - frequency
analysis - t-score
analysis - MI
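
The three rankings above (frequency, t-score, MI) can be approximated with the sketch below. It uses one common textbook formulation, which is an assumption here: with O the observed co-occurrences inside the span and E = f(node) x f(collocate) x 2 x span / N the expected count, MI = log2(O / E) and t = (O - E) / sqrt(O). The figures Collocate reports may be computed differently.

```python
import math
from collections import Counter

def collocates(tokens, node, span=2):
    """Score each word found within span tokens of node (both sides)."""
    n = len(tokens)
    freq = Counter(tokens)
    observed = Counter()
    for i, tok in enumerate(tokens):
        if tok != node:
            continue
        for j in range(max(0, i - span), min(n, i + span + 1)):
            if j != i:
                observed[tokens[j]] += 1
    rows = []
    for word, o in observed.items():
        # Expected co-occurrences if the two words were independent (assumed model).
        expected = freq[node] * freq[word] * 2 * span / n
        rows.append({
            "word": word,
            "freq": o,
            "mi": math.log2(o / expected),
            "t": (o - expected) / math.sqrt(o),
        })
    return rows

# Hypothetical token sequence; real scores need a corpus of realistic size.
tokens = "statistical analysis of data and careful analysis of text".split()
rows = collocates(tokens, "analysis", span=2)

for key in ("freq", "t", "mi"):
    ranked = sorted(rows, key=lambda r: r[key], reverse=True)
    print(key, [r["word"] for r in ranked[:3]])
```

Sorting by different measures changes the ranking: MI tends to promote rare but exclusive partners, while t-score favours frequent ones.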
Trigram search
Trigram -- by freq
Trigram -- alphabetical
Trigram -- by MI
Using batch mode
Corpuslab.com
Familiar exercise authoring
Currently offline
Aims
Avoid duplication of tasks, e.g. identifying
common collocations in Business English
Provide corpus/analysis resources
Bring corpus resources together with
familiar exercise authoring
Student View
Exercise types
Matching
Fill-the-gap
Multiple Choice
Reorder
Categorise
Exercise types
Matching*
Fill-the-gap
Multiple Choice
Reorder
Categorise*
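
One way corpus data could feed a familiar exercise type is to turn sentences containing a target word into fill-the-gap items. The sketch below is only an illustration of that idea; the function name and sample sentences are invented, and it does not describe how Corpuslab.com builds its exercises.

```python
import re

def gap_fill(sentences, target, blank="_____"):
    """Blank out target in each sentence containing it; return (items, answers)."""
    pattern = re.compile(r"\b" + re.escape(target) + r"\b", re.IGNORECASE)
    items, answers = [], []
    for sentence in sentences:
        match = pattern.search(sentence)
        if match:
            items.append(pattern.sub(blank, sentence))
            answers.append(match.group(0))
    return items, answers

# Hypothetical sentences standing in for concordance lines.
sentences = [
    "We need to make a decision today.",
    "They will make an effort.",
    "He took a break.",
]

items, answers = gap_fill(sentences, "make")
for number, item in enumerate(items, start=1):
    print(f"{number}. {item}")
```

Sentences without the target word are skipped, so the same source lines can be reused for several target words.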
Teacher view
Teacher view -
Resources
Resources
Teacher-generated resources
uploaded frequency lists
worksheets
Tracking
Teachers can track their exercises
“Class teachers” track students in their class
Tracking
Report for exercise Cat1
Tracking of student
School view
Register as a school
Create class names
Assign teachers to classes
Track students in classes
Resources
Site resources
corpora and simple concordancer
text analysis utilities
Text analysis utilities
Create frequency lists
Text analysis in terms of frequency bands
Collocational analysis of texts
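
Text analysis in terms of frequency bands can be sketched as follows: each token is assigned to the first band whose wordlist contains it, and the share of tokens per band is reported. The band names and the tiny word sets are placeholders invented for this example, not the real General Service List or Academic Word List.

```python
import re
from collections import Counter

def band_profile(text, bands):
    """Return the proportion of tokens falling in each band, plus 'off-list'."""
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter()
    for tok in tokens:
        for name, words in bands.items():
            if tok in words:
                counts[name] += 1
                break
        else:
            counts["off-list"] += 1  # token not found in any band
    total = len(tokens) or 1  # avoid division by zero on empty text
    return {name: counts[name] / total for name in [*bands, "off-list"]}

# Placeholder bands, checked in this order; real lists hold hundreds of words each.
bands = {
    "K1": {"the", "of", "is", "a"},
    "AWL": {"analysis", "data"},
}

profile = band_profile("The analysis of the data is a triumph", bands)
for name, share in profile.items():
    print(f"{name:8s} {share:.1%}")
```

A high off-list share suggests a text may be lexically demanding for learners who know only the listed bands.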
Corpora
Teacher/Author resource
Sample corpus -- CSPAE
Add other corpora such as MICASE
Create various options for searching that
make use of corpus annotation
Simple searching
Aims
Create a language learning site
Encourage and facilitate use of corpus data
Matching exercise (up to 5 columns)
Provide access to wordlists etc.
Provide text analysis tools
Aims
Use traditional exercise types that teachers
are familiar with
Give examples of creative uses of these
standard exercises
Thank you
