SlideShare ist ein Scribd-Unternehmen logo
1 von 30
John Blake
Japan Advanced Institute of Science and Technology
Pedagogic application
of regular expressions
/bbetweenW+(?:w+W+){1,2}?tob/gi;
Overview
02
Introduction
• Probabilistic parsing
• Rule-based pattern matching
• Regular expressions
Pedagogic applications
• Modality detector
• Error detector
• Other: tagged corpora, pronunciation of “ed”
Probabilistic parsing
03
• Dynamic algorithms
• Machine learning
• Training sets
(e.g. Stanford POS parser)
Extremely powerful, but
requires significant knowledge
of computational linguistics and
huge time investment so…
Rule-based pattern matching
04
1. There is a man on your left. T / F
If true, a man is on your left. Stop.
If false, proceed to 2.
2. There is a woman on your left. T / F
If true, there is a woman on your left. Stop.
If false, there is nobody on your left. Stop.
True/false statements
Rule-based pattern matching
05
Decision-tree algorithm
There is a man on your left.
There is a woman on your left.
No.Yes. STOP
Yes. STOP No.
There is nobody on your left. STOP
Assumptions:
1. Only adults are present
2. There is no third gender
Rule-based pattern matching
06
There is a man. /bmanb/;
There is a woman. /bwomanb/;
Regular expressions (regexp|regex)
The discrete words “man” and “woman” will
be identified, generating a “true” result.
Regular expressions (Regex)
07
e.g. /bmaybeb/gi;
 – escape (from normal characters)
i – case insensitive
b – boundary
g – greedy
1. I think that maybe he can understand. T/F
2. He may be able to understand T/F
3. Maybe, he can understand. T/F
4. Maybelline is a company name. T/F
5. Maybe, he said maybe. T/F
Pedagogic applications
08
Modality detector
Online error detectors
- Common error detector (Morrall, 2000-14)
- Corpus-based error detector (Blake, 2012-15)
Other applications
- Annotation highlighter
- Ideas for pronunciation, grammar and vocab
09
Situation
App. 1
Students
graduate students, researchers
Aim
write research articles
Problems
lack of familiarity of genre,
lack of language,
lack of content.
10
Tentative language
& approximation
Type Examples
Modal verbs may, might, would, can
Lexical verbs seem, appear, suggest
Modal adverbs perhaps, probably, possibly,
Modal adjectives probable, possible, uncertain
Modal nouns assumption, claim, possibility
# Approximation
49% Almost a half, nearly 50%, less than 1 in 2
App. 1
11
Material mismatch
Students from different faculties studying
tentative language (hedging) and
approximation in academic writing use
generic materials prepared by teacher.
App. 1
12
Lack of face validity
Some students do not want to “waste
time” dealing with materials not
appropriate to their major. They expect
materials tailored to their exact needs.
App. 1
13
Solution: Modality detector
App. 1
14
Solution: Modality detector
Individualized instruction
• Student selects appropriate text
• Student inputs relevant text
• Regex identifies hedges & approximation
• Execute command labels & highlights
App. 1
15
Warning: False positives
More complex regex reduce false positives
App. 1
16
Piles of unmarked homework
Responding to written work takes too
much time, and is repetitive since many
students make the same surface-level
mistakes.
App. 2
17
No time to respond
Teachers are expected to:
• Identify the location of errors
• Explain the errors (if necessary)
• Correct the errors (if necessary)
All of which take lots of time.
App. 2
18
Solution: Error detector
Identification
Student inputs own work
Regex identifies expected errors
Explanation
Execute command selects and displays
prepared explanation
Correction
Student corrects work and submits
improved version
App. 2
19
Error classification
App. 2
Type Description
Accuracy factual and language errors
Brevity too many words
Clarity vague or ambiguous terms
Objectivity emotive language
Formality abbreviations, contractions, &
informal terms
An ethnographic survey of the literature on writing scientific research articles
revealed five key criteria (Blake & Blake, 2015)
20
App. 2
21
Specific example
Error
• One of the + singular noun
Regex
• /bone of theb/gi;
Execute
• Check that the phrase one of the
is followed by a plural noun
App. 2
22
False positives harnessed in learning
process by forcing student engagement
App. 2
23
Difficult-to-read tags
Introduction Purpose Method Results Discussion
<segment features='problem;introduction;rhetorical_moves' state='active'>We
address the problem of model-based object recognition.</segment> <segment
features='purpose;rhetorical_moves' state='active'>Our aim is to localize and
recognize road vehicles from monocular images or videos in calibrated traffic
scenes.</segment> <segment features='method;rhetorical_moves' state='active'>A
3-D deformable vehicle model with 12 shape parameters is set up as prior
information, and its pose is determined by three parameters, which are its position
on the ground plane and its orientation about the vertical axis under ground-plane
constraints.</segment> <segment features='purpose;rhetorical_moves'
state='active'>An efficient local gradient-based method is proposed to evaluate the
fitness between the projection of the vehicle model and image data, which is
combined into a novel evolutionary computing framework to estimate the 12 shape
parameters and three pose parameters by iterative evolution.</segment> <segment
features='background;introduction;rhetorical_moves' state='active'>The recovery of
pose parameters achieves vehicle localization, whereas the shape parameters are
used for vehicle recognition.</segment> <segment
features='method;rhetorical_moves' state='active'>Numerous experiments are
App. 3
24
Difficult-to-read tags
Introduction Purpose Method Results Discussion
<segment features='problem;introduction;rhetorical_moves' state='active'>We
address the problem of model-based object recognition.</segment> <segment
features='purpose;rhetorical_moves' state='active'>Our aim is to localize and
recognize road vehicles from monocular images or videos in calibrated traffic
scenes.</segment> <segment features='method;rhetorical_moves' state='active'>A
3-D deformable vehicle model with 12 shape parameters is set up as prior
information, and its pose is determined by three parameters, which are its position
on the ground plane and its orientation about the vertical axis under ground-plane
constraints.</segment> <segment features='purpose;rhetorical_moves'
state='active'>An efficient local gradient-based method is proposed to evaluate the
fitness between the projection of the vehicle model and image data, which is
combined into a novel evolutionary computing framework to estimate the 12 shape
parameters and three pose parameters by iterative evolution.</segment> <segment
features='background;introduction;rhetorical_moves' state='active'>The recovery of
pose parameters achieves vehicle localization, whereas the shape parameters are
used for vehicle recognition.</segment> <segment
features='method;rhetorical_moves' state='active'>Numerous experiments are
App. 3
25
Easy-to-read tags
Introduction Purpose Method Results Discussion
http://www.jaist.ac.jp/~johnb/Movehighlighter.html
App. 3
26
Ideas for you and your students
Pronunciation: Regular “ed”
• Regular “ed” /t/, /d/, /id/
• th [voiced or voiceless]
Grammar:
• Tenses: e.g. perfect continuous: been + ing
• Quantifiers : [U] much, little; [C] many, few; [U/C] lots of , a lot of
Vocabulary:
• Colours: red, blue crimson red, cobalt blue,
• Body parts: hand, eyes, leg hand out, eye up, leg it
27
Regular “ed”
False positives:
• learned /d/ /id/
Pron Preceeding sound Potential regex
/id/ d, t /(d|t)edb/gi;
/t/ voiceless consonants /(s|f)edb/gi;
/d/ voiced consonants /(z|v)edb/gi;
/d/ Vowel /(ow|i|ay)edb/gi;
Pronunciation of “ed” is dictated by the sound of the preceeding letter(s).
| – Boolean “or”
so x|y means either x or y
d|ted means d or ted but by adding brackets
(d|t)ed means ded or ted
28
Pronunciation of “th”
Pron Feature Potential regex
/𝜹/ Voiced initial th /btha(n|t|) b/gi;
/bthe(b|ir|m|re|se|y) b/gi;
/bthisb/gi;
/btho(se|ugh|) b/gi;
/bthusb/gi;
/𝜽/ Voiceless initial th /bth/gi;
/t/ th pronounced as t /bthomas|thames|thyme/gi;
Pronunciation of “th” can be predicted by the law that for function words
the initial th is pronounced as a voiced sound.
References
29
Blake, J. (2012, November 28-30). Corpus-based academic written error
detector. Conference proceedings of the 20th International Conference on
Computers in Education. Nanyang Technological University, Singapore.
Blake, X. and Blake, J. (2015, January 29-31). Academic literacy: Mentor and
mentee perspectives. Poster presented at 35th International Conference of
ThaiTESOL, Bangkok, Thailand.
Morrall, A. (2000-2014). Common Error Detector. [Online tool]
http://www2.elc.polyu.edu.hk/cill/errordetector.htm
Any questions, comments or
suggestions?
johnb@jaist.ac.jp

Weitere ähnliche Inhalte

Andere mochten auch

Open project 2011
Open project 2011Open project 2011
Open project 2011
Seaon Lee
 
有爱心的小和尚08 04-12 judy
有爱心的小和尚08 04-12 judy有爱心的小和尚08 04-12 judy
有爱心的小和尚08 04-12 judy
jboose
 
Materials design presentation 01 (2)
Materials design presentation 01 (2)Materials design presentation 01 (2)
Materials design presentation 01 (2)
Brenda Westtesten
 

Andere mochten auch (19)

Top 5 Reasons European companies keep outsourcing IT services
Top 5 Reasons European companies keep outsourcing IT servicesTop 5 Reasons European companies keep outsourcing IT services
Top 5 Reasons European companies keep outsourcing IT services
 
Fundraising by facebook app
Fundraising by facebook appFundraising by facebook app
Fundraising by facebook app
 
Años 80´s
Años 80´sAños 80´s
Años 80´s
 
Varbergs miljömål 2015-2025
Varbergs miljömål 2015-2025Varbergs miljömål 2015-2025
Varbergs miljömål 2015-2025
 
Primera guerra-mundial
Primera guerra-mundial Primera guerra-mundial
Primera guerra-mundial
 
MediaWhiz POV: Conversion Optimization for Lead-gen Paths
MediaWhiz POV: Conversion Optimization for Lead-gen PathsMediaWhiz POV: Conversion Optimization for Lead-gen Paths
MediaWhiz POV: Conversion Optimization for Lead-gen Paths
 
Open project 2011
Open project 2011Open project 2011
Open project 2011
 
Ks 141028 budget 2015
Ks 141028 budget 2015Ks 141028 budget 2015
Ks 141028 budget 2015
 
有爱心的小和尚08 04-12 judy
有爱心的小和尚08 04-12 judy有爱心的小和尚08 04-12 judy
有爱心的小和尚08 04-12 judy
 
SD stödpartiet till Alliansen
SD stödpartiet till AlliansenSD stödpartiet till Alliansen
SD stödpartiet till Alliansen
 
Materials design presentation 01 (2)
Materials design presentation 01 (2)Materials design presentation 01 (2)
Materials design presentation 01 (2)
 
51 ways to reduce your out-of-class marking
51 ways to reduce your out-of-class marking51 ways to reduce your out-of-class marking
51 ways to reduce your out-of-class marking
 
Pune OpenCoffee Club, an Introduction (2014)
Pune OpenCoffee Club, an Introduction (2014)Pune OpenCoffee Club, an Introduction (2014)
Pune OpenCoffee Club, an Introduction (2014)
 
Winning - I am not designed to finish 2nd or 3rd
Winning - I am not designed to finish 2nd or 3rdWinning - I am not designed to finish 2nd or 3rd
Winning - I am not designed to finish 2nd or 3rd
 
Guia 02....
Guia 02....Guia 02....
Guia 02....
 
C#
C#C#
C#
 
On Target eCommerce
On Target eCommerceOn Target eCommerce
On Target eCommerce
 
Waterloo some modern lessons
Waterloo some modern lessonsWaterloo some modern lessons
Waterloo some modern lessons
 
Corporate social responsibility
Corporate social responsibility Corporate social responsibility
Corporate social responsibility
 

Ähnlich wie Pedagogic application of regular expressions

Alapcae math activities 2014 ged test
Alapcae math activities 2014 ged testAlapcae math activities 2014 ged test
Alapcae math activities 2014 ged test
dware655
 
WebQuest for Biomes
WebQuest for BiomesWebQuest for Biomes
WebQuest for Biomes
smtester2
 
Ml approach toprogramming
Ml approach toprogrammingMl approach toprogramming
Ml approach toprogramming
RMani7
 
Problem solving content
Problem solving contentProblem solving content
Problem solving content
Timothy Welsh
 
problem_solving in physics
 problem_solving in physics problem_solving in physics
problem_solving in physics
Timothy Welsh
 
E as t-tle adv pp
 E as t-tle adv pp E as t-tle adv pp
E as t-tle adv pp
benkelsey
 

Ähnlich wie Pedagogic application of regular expressions (20)

Minimal Introduction to C++ - Part II
Minimal Introduction to C++ - Part IIMinimal Introduction to C++ - Part II
Minimal Introduction to C++ - Part II
 
Determining The Barriers Faced By Novice Programmers
Determining The Barriers Faced By Novice ProgrammersDetermining The Barriers Faced By Novice Programmers
Determining The Barriers Faced By Novice Programmers
 
FLOSS Pilot Studies
FLOSS Pilot StudiesFLOSS Pilot Studies
FLOSS Pilot Studies
 
ID Unit Report 3
ID Unit Report 3ID Unit Report 3
ID Unit Report 3
 
Alapcae math activities 2014 ged test
Alapcae math activities 2014 ged testAlapcae math activities 2014 ged test
Alapcae math activities 2014 ged test
 
Organization strategies to increase development productivity
Organization strategies to increase development productivityOrganization strategies to increase development productivity
Organization strategies to increase development productivity
 
WebQuest for Biomes
WebQuest for BiomesWebQuest for Biomes
WebQuest for Biomes
 
Biomes WebQuest
Biomes WebQuestBiomes WebQuest
Biomes WebQuest
 
MLApproachToProgramming.ppt
MLApproachToProgramming.pptMLApproachToProgramming.ppt
MLApproachToProgramming.ppt
 
Ml approach toprogramming
Ml approach toprogrammingMl approach toprogramming
Ml approach toprogramming
 
Problem solving content
Problem solving contentProblem solving content
Problem solving content
 
problem_solving in physics
 problem_solving in physics problem_solving in physics
problem_solving in physics
 
E as t-tle adv pp
 E as t-tle adv pp E as t-tle adv pp
E as t-tle adv pp
 
e-asTTle Staff Meeting PowerPoint
e-asTTle Staff Meeting PowerPointe-asTTle Staff Meeting PowerPoint
e-asTTle Staff Meeting PowerPoint
 
MATH 107 Great Stories /newtonhelp.com
MATH 107 Great Stories /newtonhelp.comMATH 107 Great Stories /newtonhelp.com
MATH 107 Great Stories /newtonhelp.com
 
Specs design
Specs designSpecs design
Specs design
 
Combining General and Genre-Specific Approaches to L2 Writing Instruction
Combining General and Genre-Specific Approaches to L2 Writing InstructionCombining General and Genre-Specific Approaches to L2 Writing Instruction
Combining General and Genre-Specific Approaches to L2 Writing Instruction
 
Introduction to programming languages part 2
Introduction to programming languages   part 2Introduction to programming languages   part 2
Introduction to programming languages part 2
 
Face Detection Using MATLAB (SUD)
Face Detection Using MATLAB (SUD)Face Detection Using MATLAB (SUD)
Face Detection Using MATLAB (SUD)
 
AshwiniCV- SAP Basis
AshwiniCV- SAP BasisAshwiniCV- SAP Basis
AshwiniCV- SAP Basis
 

Mehr von john6938

Mehr von john6938 (20)

Social Media Ethics.pptx
Social Media Ethics.pptxSocial Media Ethics.pptx
Social Media Ethics.pptx
 
Future of Information Ethics.pptx
Future of Information Ethics.pptxFuture of Information Ethics.pptx
Future of Information Ethics.pptx
 
Bioethics.pptx
Bioethics.pptxBioethics.pptx
Bioethics.pptx
 
Surveillance and security.pptx
Surveillance and security.pptxSurveillance and security.pptx
Surveillance and security.pptx
 
Introduction to Expert Systems.pptx
Introduction to Expert Systems.pptxIntroduction to Expert Systems.pptx
Introduction to Expert Systems.pptx
 
Starbuck.pptx
Starbuck.pptxStarbuck.pptx
Starbuck.pptx
 
Unit 4 Problem breakdown.pptx
Unit 4 Problem breakdown.pptxUnit 4 Problem breakdown.pptx
Unit 4 Problem breakdown.pptx
 
Image_recognition.pptx
Image_recognition.pptxImage_recognition.pptx
Image_recognition.pptx
 
Algorithms.pptx
Algorithms.pptxAlgorithms.pptx
Algorithms.pptx
 
Artificial_intelligence.pptx
Artificial_intelligence.pptxArtificial_intelligence.pptx
Artificial_intelligence.pptx
 
Image_generation.pptx
Image_generation.pptxImage_generation.pptx
Image_generation.pptx
 
Computer_Graphics.pptx
Computer_Graphics.pptxComputer_Graphics.pptx
Computer_Graphics.pptx
 
Security.pptx
Security.pptxSecurity.pptx
Security.pptx
 
Gravitational_wave_detection.pptx
Gravitational_wave_detection.pptxGravitational_wave_detection.pptx
Gravitational_wave_detection.pptx
 
Embedded_Systems.pptx
Embedded_Systems.pptxEmbedded_Systems.pptx
Embedded_Systems.pptx
 
Software_engineering.pptx
Software_engineering.pptxSoftware_engineering.pptx
Software_engineering.pptx
 
Quantum_computers.pptx
Quantum_computers.pptxQuantum_computers.pptx
Quantum_computers.pptx
 
NLP.pptx
NLP.pptxNLP.pptx
NLP.pptx
 
Sensors_SLAM.pptx
Sensors_SLAM.pptxSensors_SLAM.pptx
Sensors_SLAM.pptx
 
Maths.pptx
Maths.pptxMaths.pptx
Maths.pptx
 

Kürzlich hochgeladen

An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
SanaAli374401
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
MateoGardella
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 

Kürzlich hochgeladen (20)

Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 

Pedagogic application of regular expressions

  • 1. John Blake Japan Advanced Institute of Science and Technology Pedagogic application of regular expressions /bbetweenW+(?:w+W+){1,2}?tob/gi;
  • 2. Overview 02 Introduction • Probabilistic parsing • Rule-based pattern matching • Regular expressions Pedagogic applications • Modality detector • Error detector • Other: tagged corpora, pronunciation of “ed”
  • 3. Probabilistic parsing 03 • Dynamic algorithms • Machine learning • Training sets (e.g. Stanford POS parser) Extremely powerful, but requires significant knowledge of computational linguistics and huge time investment so…
  • 4. Rule-based pattern matching 04 1. There is a man on your left. T / F If true, a man is on your left. Stop. If false, proceed to 2. 2. There is a woman on your left. T / F If true, there is a woman on your left. Stop. If false, there is nobody on your left. Stop. True/false statements
  • 5. Rule-based pattern matching 05 Decision-tree algorithm There is a man on your left. There is a woman on your left. No.Yes. STOP Yes. STOP No. There is nobody on your left. STOP Assumptions: 1. Only adults are present 2. There is no third gender
  • 6. Rule-based pattern matching 06 There is a man. /bmanb/; There is a woman. /bwomanb/; Regular expressions (regexp|regex) The discrete words “man” and “woman” will be identified, generating a “true” result.
  • 7. Regular expressions (Regex) 07 e.g. /bmaybeb/gi; – escape (from normal characters) i – case insensitive b – boundary g – greedy 1. I think that maybe he can understand. T/F 2. He may be able to understand T/F 3. Maybe, he can understand. T/F 4. Maybelline is a company name. T/F 5. Maybe, he said maybe. T/F
  • 8. Pedagogic applications 08 Modality detector Online error detectors - Common error detector (Morrall, 2000-14) - Corpus-based error detector (Blake, 2012-15) Other applications - Annotation highlighter - Ideas for pronunciation, grammar and vocab
  • 9. 09 Situation App. 1 Students graduate students, researchers Aim write research articles Problems lack of familiarity of genre, lack of language, lack of content.
  • 10. 10 Tentative language & approximation Type Examples Modal verbs may, might, would, can Lexical verbs seem, appear, suggest Modal adverbs perhaps, probably, possibly, Modal adjectives probable, possible, uncertain Modal nouns assumption, claim, possibility # Approximation 49% Almost a half, nearly 50%, less than 1 in 2 App. 1
  • 11. 11 Material mismatch Students from different faculties studying tentative language (hedging) and approximation in academic writing use generic materials prepared by teacher. App. 1
  • 12. 12 Lack of face validity Some students do not want to “waste time” dealing with materials not appropriate to their major. They expect materials tailored to their exact needs. App. 1
  • 14. 14 Solution: Modality detector Individualized instruction • Student selects appropriate text • Student inputs relevant text • Regex identifies hedges & approximation • Execute command labels & highlights App. 1
  • 15. 15 Warning: False positives More complex regex reduce false positives App. 1
  • 16. 16 Piles of unmarked homework Responding to written work takes too much time, and is repetitive since many students make the same surface-level mistakes. App. 2
  • 17. 17 No time to respond Teachers are expected to: • Identify the location of errors • Explain the errors (if necessary) • Correct the errors (if necessary) All of which take lots of time. App. 2
  • 18. 18 Solution: Error detector Identification Student inputs own work Regex identifies expected errors Explanation Execute command selects and displays prepared explanation Correction Student corrects work and submits improved version App. 2
  • 19. 19 Error classification App. 2 Type Description Accuracy factual and language errors Brevity too many words Clarity vague or ambiguous terms Objectivity emotive language Formality abbreviations, contractions, & informal terms An ethnographic survey of the literature on writing scientific research articles revealed five key criteria (Blake & Blake, 2015)
  • 21. 21 Specific example Error • One of the + singular noun Regex • /bone of theb/gi; Execute • Check that the phrase one of the is followed by a plural noun App. 2
  • 22. 22 False positives harnessed in learning process by forcing student engagement App. 2
  • 23. 23 Difficult-to-read tags Introduction Purpose Method Results Discussion <segment features='problem;introduction;rhetorical_moves' state='active'>We address the problem of model-based object recognition.</segment> <segment features='purpose;rhetorical_moves' state='active'>Our aim is to localize and recognize road vehicles from monocular images or videos in calibrated traffic scenes.</segment> <segment features='method;rhetorical_moves' state='active'>A 3-D deformable vehicle model with 12 shape parameters is set up as prior information, and its pose is determined by three parameters, which are its position on the ground plane and its orientation about the vertical axis under ground-plane constraints.</segment> <segment features='purpose;rhetorical_moves' state='active'>An efficient local gradient-based method is proposed to evaluate the fitness between the projection of the vehicle model and image data, which is combined into a novel evolutionary computing framework to estimate the 12 shape parameters and three pose parameters by iterative evolution.</segment> <segment features='background;introduction;rhetorical_moves' state='active'>The recovery of pose parameters achieves vehicle localization, whereas the shape parameters are used for vehicle recognition.</segment> <segment features='method;rhetorical_moves' state='active'>Numerous experiments are App. 3
  • 24. 24 Difficult-to-read tags Introduction Purpose Method Results Discussion <segment features='problem;introduction;rhetorical_moves' state='active'>We address the problem of model-based object recognition.</segment> <segment features='purpose;rhetorical_moves' state='active'>Our aim is to localize and recognize road vehicles from monocular images or videos in calibrated traffic scenes.</segment> <segment features='method;rhetorical_moves' state='active'>A 3-D deformable vehicle model with 12 shape parameters is set up as prior information, and its pose is determined by three parameters, which are its position on the ground plane and its orientation about the vertical axis under ground-plane constraints.</segment> <segment features='purpose;rhetorical_moves' state='active'>An efficient local gradient-based method is proposed to evaluate the fitness between the projection of the vehicle model and image data, which is combined into a novel evolutionary computing framework to estimate the 12 shape parameters and three pose parameters by iterative evolution.</segment> <segment features='background;introduction;rhetorical_moves' state='active'>The recovery of pose parameters achieves vehicle localization, whereas the shape parameters are used for vehicle recognition.</segment> <segment features='method;rhetorical_moves' state='active'>Numerous experiments are App. 3
  • 25. 25 Easy-to-read tags Introduction Purpose Method Results Discussion http://www.jaist.ac.jp/~johnb/Movehighlighter.html App. 3
  • 26. 26 Ideas for you and your students Pronunciation: Regular “ed” • Regular “ed” /t/, /d/, /id/ • th [voiced or voiceless] Grammar: • Tenses: e.g. perfect continuous: been + ing • Quantifiers : [U] much, little; [C] many, few; [U/C] lots of , a lot of Vocabulary: • Colours: red, blue crimson red, cobalt blue, • Body parts: hand, eyes, leg hand out, eye up, leg it
  • 27. 27 Regular “ed” False positives: • learned /d/ /id/ Pron Preceeding sound Potential regex /id/ d, t /(d|t)edb/gi; /t/ voiceless consonants /(s|f)edb/gi; /d/ voiced consonants /(z|v)edb/gi; /d/ Vowel /(ow|i|ay)edb/gi; Pronunciation of “ed” is dictated by the sound of the preceeding letter(s). | – Boolean “or” so x|y means either x or y d|ted means d or ted but by adding brackets (d|t)ed means ded or ted
  • 28. 28 Pronunciation of “th” Pron Feature Potential regex /𝜹/ Voiced initial th /btha(n|t|) b/gi; /bthe(b|ir|m|re|se|y) b/gi; /bthisb/gi; /btho(se|ugh|) b/gi; /bthusb/gi; /𝜽/ Voiceless initial th /bth/gi; /t/ th pronounced as t /bthomas|thames|thyme/gi; Pronunciation of “th” can be predicted by the law that for function words the initial th is pronounced as a voiced sound.
  • 29. References 29 Blake, J. (2012, November 28-30). Corpus-based academic written error detector. Conference proceedings of the 20th International Conference on Computers in Education. Nanyang Technological University, Singapore. Blake, X. and Blake, J. (2015, January 29-31). Academic literacy: Mentor and mentee perspectives. Poster presented at 35th International Conference of ThaiTESOL, Bangkok, Thailand. Morrall, A. (2000-2014). Common Error Detector. [Online tool] http://www2.elc.polyu.edu.hk/cill/errordetector.htm
  • 30. Any questions, comments or suggestions? johnb@jaist.ac.jp