SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Determining the Types of Temporal Relations in
Discourse
Leon Derczynski
University of Sheffield
5 March, 2013
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
The Role of Time
Why is time important in language processing?
World state changes constantly
Every empirical assertion has temporal bounds
“The sky is blue”, but it was not always
Without it, na¨ıve knowledge extraction will fail (given an
Almanac of Presidents, who is President?)
By understanding temporal information, you will do better
knowledge extraction.
Overall goal
How do we automatically understand temporal information in
natural languages?
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Temporal Information Extraction
Existing state of the art
How can we categorise types of temporal information?
Events – e.g. occurrences, states
Temporal expressions (timexes) – e.g. dates, durations
Links – relations between pairs of events or times
Supporting texts – e.g. action cardinality, event ordering
We develop and use ISO-TimeML to annotate these entities.
Main dataset: TimeBank (about 180 annotated documents)
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
TimeML
Organizers
<EVENT eid="e2120" class="REPORTING">state</EVENT>
the
<TIMEX3 tid="t29" type="DURATION" value="P2D"
temporalFunction="false"
functionInDocument="NONE">two days</TIMEX3>
of music, dancing, and speeches is
<EVENT eid="e2123" class="I STATE">expected</EVENT>
to
<EVENT eid="e13" class="OCCURRENCE">draw</EVENT>
some two million people.
<TLINK eventID="e2123" relatedToTime="t29" relType="BEFORE"/>
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Times and Events
What are temporal expressions?
They refer to a time
Subtasks: recognition and interpretation; SotA recognition is
0.86 F1
What do we consider as events?
Verbal, nominal
State of the art: 0.90 F1 for recognition
Doesn’t cover complex structure; e.g. a music festival
Events are not very useful unless related to other temporal
entities
How can we describe this structural complexity?
Start by modeling the document as a graph
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Temporal relations
What are temporal relations?
They describe the links between times and events
Can capture both complex and partial orderings
What kinds of temporal relation are there?
1 Interval (before, after, included by, simultaneous)
2 Subordinate (reported speech, modal, conditional)
3 Aspectual (start, culmination – see Vendler, Comrie)
This work is concerned with the coarsest-grained information: the
first category
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Problem Definition
How are these relations represented?
Temporal interval algebra (Allen 1984) – a set of 14 relations
between a pair of intervals
TimeML defines a set of relation types and also types of
interval
What is our problem?
Assume discourse w/ perfect event and timex annotations
In fact, assume we know which intervals to link!
“Given an ordered pair of intervals (arg1, arg2), which relation in
the set Rallen describes them?”
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Relation Extraction
How can relations be labelled?
Machine learning
Using TimeML attributes: some success
Using syntactic relations: matches SotA in tree kernels
What’s the state of the art?
2007: Mani et al.: baseline 56%, system has 61% accuracy
2008: Bethard, Chambers: many sophisticated improvements
– ILP, timex-timex ordering. Improved on Mani et al. by 1.5%.
2010: TempEval-2: baseline 58%, best was 65% accuracy
Why do we find this performance ceiling?
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Sources of Temporal Relation Information
What are we missing?
There is a heterogeneous set of temporal information types,
including:
Explicit signals – subsequently, as soon as
Linguistic theory offers some models
What is the evidence these two types will help?
Conducted failure analysis: TempEval-2010 1
Multiple diverse approaches, same dataset
Find the set of difficult links
Characterise information supporting these links
1
Verhagen et al., 2010: Semeval Task 13 - TempEval-2
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Task C: event−timex intra−sentence relations
All systems correct 1 fails 2 fail 3 fail 4 fail 5 fail All systems fail
Task D: event−DCT relations
All systems correct 1 fails 2 fail 3 fail 4 fail All systems fail
Task E: main event inter−sentence relations
All systems correct 1 fails 2 fail 3 fail 4 fail 5 fail All systems fail
Task F: event−subordinate intra−sentence relations
All systems correct 1 fails 2 fail 3 fail 4 fail All systems fail
Figure: TempEval-2 relation labelling tasks, showing proportions of
relations according to the number of systems that gave correct labels.
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
C D E F
Proportion of links within a task that are difficult
Task
%difficult
010203040
The problem is difficult, and there is a consistently-difficult set of
links. Perhaps we are ignoring some critical information.
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
New sources of ordering information
Next step: manually characterise each “difficult” link.
Attempt to identify what kind of information could be used to
label it.
Sources to investigate
Explicit text – signals “After you pull the pin, throw the grenade”
Sources to investigate
Tensed relations “Having eaten, I left”
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Temporal Signals
What are these?
In TimeML, they are text annotated as being helpful to a
temporal relation
Used by 12.2% of TimeBank’s relations
Are temporal signals useful?
A resounding yes! 61% → 83% accuracy with simple
features 2
This level of performance on event-event links is above
general state-of-the-art
Existing corpora are under-annotated
2
Derczynski and Gaizauskas, 2010: Using signals for temporal relation
classification
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Temporal Signal Annotation
How can we automatically annotate temporal signals?
Define signals formally 3
Define a closed class of signals
Re-annotate TimeBank
Train discrimination and association
We included dependency information and function tagging.
3
Derczynski and Gaizauskas, 2011: A corpus based study of temporal signals
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Results
How well did our approach perform?
1 Discrimination: 92% accuracy, 75% accuracy on positives
(0.77 IAA)
2 Association: 99% accuracy / 80% error reduction
3 Inductive bias towards independence assumption was harmful
(MaxEnt, NBayes)
Results: 16% of links have signals (31% improvement) and can
now be labelled at high accuracy.
What remains to be done?
How can we remedy under-annotation at the source?
Clear links to spatial signal annotation (e.g. -LOC tags)
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Reichenbach’s Model of Verbs
How can we model tense in language?
Each verb happens at event time, E
The verb is uttered at speech time, S
Past tense: E < S John ran.
Present tense: E = S I’m free!
What differentiates simple past from past perfect?
John ran. is not the same as John had run.
Introduce abstract reference time, R
John had run. E < R < S
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Reasoning about tense
How is Reichenbach’s model helpful?
We can describe all verbal events as three points linked by
either equality or precedence
Automatic and quick inference for relating intervals
Does it work?
Conducted first corpus-driven validation of the framework
For reporting-type links, we used features based on pairwise
event-time relations
Add one feature representing the Reichenbachian ordering
Classifier reached 59% accuracy (48% MCC baseline) on 9%
of all temporal relations (above SotA)
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Extending the model
How else can we use the model?
Positional use
Timexes relate to reference points
Only consider cases where the event and time are linguistically
connected
Identify these using dependency parses
Add a feature hinting at the ordering
We reach 75% accuracy from a 67% baseline (above SotA)
Also useful for timex standard transduction 4
4
Derczynski, Llorens and Saquete 2012: Massively increasing TIMEX3
resources
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Contributions
A large part of the difficult relation set (roughly 60%) is catered
for by these new information sources.
Difficult task, with notable impact
Focus on automatic annotation of temporal relations
Pushed beyond SotA understanding of the problem
Creation of and contribution to language resources – e.g.
ISO-TimeML, RTMML, CAVaT (among others)
.. where could we go next?
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Future
Forensic analysis
How can we build a consistent event model from multiple
semi-reliable accounts of an event?
Challenges:
Multi-document event and actor co-reference
Story conflict resolution 5
Spatial and temporal IE from colloquial text
Building and resolving accurate co-constraining models from
unreliable data (belief networks)
5
Regneri, Koller and Pinkal 2010: Learning Script Knowledge with Web
Experiments
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Future
Assertion bounding
All assertions have temporal bounds. How can we determine these?
Challenges:
Accurate extraction of document temporal structure
Automated reasoning
High-precision timex normalisation
Doing temporal IE & IR at gigaword scale
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Future
Temporal dataset construction
Many current systems index whole documents by date, but
information is more nuanced than that
Challenges:
Mapping events to temporal data points
Storing and extracting events
Anchoring events with uncertain bounds (“last year’s fighting”
vs. “the fighting on April 23, 2011”)
Mining complex super-events; e.g. the Fukushima disaster;
what happened when?
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Recap
Temporality is ubiquitous, in the world around us and in the
language we use to describe our world
Processing it automatically is difficult
Doing high-performance temporal IE opens exciting research
avenues
Thank you for your time. Are there any questions?
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Labellings as probability distributions
Automated methods (e.g. classifiers) may have varying degrees of
confidence about a link’s label.
We could assign a set of labels and probabilities to each label.
Consistency constraints allow us to find the most-likely possible
graph.
A:B → before: 0.9; after 0.1
B:C → before: 0.5; simultaneous: 0.5
A:C → before: 1.0
Very time-consuming to compute
– optimisations welcome!
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse
Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion
Unuttered temporal orderings
Event/Time distance
“When I was brushing my teeth”
→ This event happens at least twice daily; assume this instance is
0-16 hours away
Complex events
“When we were putting up the tents for the festival”
→ near the beginning of / just before the “festival” event
Leon Derczynski University of Sheffield
Determining the Types of Temporal Relations in Discourse

Weitere ähnliche Inhalte

Ähnlich wie Determining the Types of Temporal Relations in Discourse

Determining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in DiscourseDetermining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in DiscourseLeon Derczynski
 
A Corpus-based Study of Temporal Signals
A Corpus-based Study of Temporal SignalsA Corpus-based Study of Temporal Signals
A Corpus-based Study of Temporal SignalsLeon Derczynski
 
Meaningful Interaction Analysis
Meaningful Interaction AnalysisMeaningful Interaction Analysis
Meaningful Interaction Analysisfridolin.wild
 
Foundations Defining Communication & Communication Study.docx
Foundations Defining Communication & Communication Study.docxFoundations Defining Communication & Communication Study.docx
Foundations Defining Communication & Communication Study.docxbudbarber38650
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processingpunedevscom
 
Semantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and RefinementSemantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and RefinementAndre Freitas
 
L2 endstate and_dynamic_l2_interlanguage.edited
L2 endstate and_dynamic_l2_interlanguage.editedL2 endstate and_dynamic_l2_interlanguage.edited
L2 endstate and_dynamic_l2_interlanguage.editedNigel Daly
 
The role of linguistic information for shallow language processing
The role of linguistic information for shallow language processingThe role of linguistic information for shallow language processing
The role of linguistic information for shallow language processingConstantin Orasan
 
Cognitive plausibility in learning algorithms
Cognitive plausibility in learning algorithmsCognitive plausibility in learning algorithms
Cognitive plausibility in learning algorithmsAndré Karpištšenko
 
05 handbook summ-hovy
05 handbook summ-hovy05 handbook summ-hovy
05 handbook summ-hovySagar Dabhi
 
Information Retrieval using Semantic Similarity
Information Retrieval using Semantic SimilarityInformation Retrieval using Semantic Similarity
Information Retrieval using Semantic SimilaritySaswat Padhi
 
Discourse analysis ca summary
Discourse analysis  ca summaryDiscourse analysis  ca summary
Discourse analysis ca summaryEdgar Lucero
 
Writing and Speech Recognition
Writing and Speech RecognitionWriting and Speech Recognition
Writing and Speech RecognitionTeresaLSoto
 
A DECADE OF USING HYBRID INFERENCE SYSTEMS IN NLP (2005 – 2015): A SURVEY
A DECADE OF USING HYBRID INFERENCE SYSTEMS IN NLP (2005 – 2015): A SURVEYA DECADE OF USING HYBRID INFERENCE SYSTEMS IN NLP (2005 – 2015): A SURVEY
A DECADE OF USING HYBRID INFERENCE SYSTEMS IN NLP (2005 – 2015): A SURVEYijaia
 
Organizational communication ch-1
Organizational communication  ch-1Organizational communication  ch-1
Organizational communication ch-1Waqas Sabir
 
Ca ppt
Ca pptCa ppt
Ca pptMACON
 
On Methods for the Formal Specification of Fault Tolerant Systems
On Methods for the Formal Specification of Fault Tolerant SystemsOn Methods for the Formal Specification of Fault Tolerant Systems
On Methods for the Formal Specification of Fault Tolerant SystemsMazzara1976
 
In line or out of control?
In line or out of control?In line or out of control?
In line or out of control?mareninhh
 

Ähnlich wie Determining the Types of Temporal Relations in Discourse (20)

Determining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in DiscourseDetermining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in Discourse
 
A Corpus-based Study of Temporal Signals
A Corpus-based Study of Temporal SignalsA Corpus-based Study of Temporal Signals
A Corpus-based Study of Temporal Signals
 
Meaningful Interaction Analysis
Meaningful Interaction AnalysisMeaningful Interaction Analysis
Meaningful Interaction Analysis
 
Foundations Defining Communication & Communication Study.docx
Foundations Defining Communication & Communication Study.docxFoundations Defining Communication & Communication Study.docx
Foundations Defining Communication & Communication Study.docx
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Semantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and RefinementSemantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and Refinement
 
L2 endstate and_dynamic_l2_interlanguage.edited
L2 endstate and_dynamic_l2_interlanguage.editedL2 endstate and_dynamic_l2_interlanguage.edited
L2 endstate and_dynamic_l2_interlanguage.edited
 
The role of linguistic information for shallow language processing
The role of linguistic information for shallow language processingThe role of linguistic information for shallow language processing
The role of linguistic information for shallow language processing
 
Cognitive plausibility in learning algorithms
Cognitive plausibility in learning algorithmsCognitive plausibility in learning algorithms
Cognitive plausibility in learning algorithms
 
05 handbook summ-hovy
05 handbook summ-hovy05 handbook summ-hovy
05 handbook summ-hovy
 
Information Retrieval using Semantic Similarity
Information Retrieval using Semantic SimilarityInformation Retrieval using Semantic Similarity
Information Retrieval using Semantic Similarity
 
Discourse analysis ca summary
Discourse analysis  ca summaryDiscourse analysis  ca summary
Discourse analysis ca summary
 
Writing and Speech Recognition
Writing and Speech RecognitionWriting and Speech Recognition
Writing and Speech Recognition
 
A DECADE OF USING HYBRID INFERENCE SYSTEMS IN NLP (2005 – 2015): A SURVEY
A DECADE OF USING HYBRID INFERENCE SYSTEMS IN NLP (2005 – 2015): A SURVEYA DECADE OF USING HYBRID INFERENCE SYSTEMS IN NLP (2005 – 2015): A SURVEY
A DECADE OF USING HYBRID INFERENCE SYSTEMS IN NLP (2005 – 2015): A SURVEY
 
Organizational communication ch-1
Organizational communication  ch-1Organizational communication  ch-1
Organizational communication ch-1
 
Ca ppt
Ca pptCa ppt
Ca ppt
 
On Methods for the Formal Specification of Fault Tolerant Systems
On Methods for the Formal Specification of Fault Tolerant SystemsOn Methods for the Formal Specification of Fault Tolerant Systems
On Methods for the Formal Specification of Fault Tolerant Systems
 
NLP
NLPNLP
NLP
 
The impact of standardized terminologies and domain-ontologies in multilingua...
The impact of standardized terminologies and domain-ontologies in multilingua...The impact of standardized terminologies and domain-ontologies in multilingua...
The impact of standardized terminologies and domain-ontologies in multilingua...
 
In line or out of control?
In line or out of control?In line or out of control?
In line or out of control?
 

Mehr von Leon Derczynski

Joint Rumour Stance and Veracity
Joint Rumour Stance and VeracityJoint Rumour Stance and Veracity
Joint Rumour Stance and VeracityLeon Derczynski
 
State of Tools for NLP in Danish: 2018
State of Tools for NLP in Danish: 2018State of Tools for NLP in Danish: 2018
State of Tools for NLP in Danish: 2018Leon Derczynski
 
Broad Twitter Corpus: A Diverse Named Entity Recognition Resource
Broad Twitter Corpus: A Diverse Named Entity Recognition ResourceBroad Twitter Corpus: A Diverse Named Entity Recognition Resource
Broad Twitter Corpus: A Diverse Named Entity Recognition ResourceLeon Derczynski
 
Handling and Mining Linguistic Variation in UGC
Handling and Mining Linguistic Variation in UGCHandling and Mining Linguistic Variation in UGC
Handling and Mining Linguistic Variation in UGCLeon Derczynski
 
Efficient named entity annotation through pre-empting
Efficient named entity annotation through pre-emptingEfficient named entity annotation through pre-empting
Efficient named entity annotation through pre-emptingLeon Derczynski
 
Leveraging the Power of Social Media
Leveraging the Power of Social MediaLeveraging the Power of Social Media
Leveraging the Power of Social MediaLeon Derczynski
 
Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
Corpus Annotation through Crowdsourcing: Towards Best Practice GuidelinesCorpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
Corpus Annotation through Crowdsourcing: Towards Best Practice GuidelinesLeon Derczynski
 
Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...
Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...
Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...Leon Derczynski
 
Starting to Process Social Media
Starting to Process Social MediaStarting to Process Social Media
Starting to Process Social MediaLeon Derczynski
 
Christmas Presentation at Aarhus: What I do
Christmas Presentation at Aarhus: What I doChristmas Presentation at Aarhus: What I do
Christmas Presentation at Aarhus: What I doLeon Derczynski
 
Recognising and Interpreting Named Temporal Expressions
Recognising and Interpreting Named Temporal ExpressionsRecognising and Interpreting Named Temporal Expressions
Recognising and Interpreting Named Temporal ExpressionsLeon Derczynski
 
TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog TextTwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog TextLeon Derczynski
 
Twitter Part-of-Speech Tagging for All: Overcoming Sparse and Noisy Data
 Twitter Part-of-Speech Tagging for All:  Overcoming Sparse and Noisy Data Twitter Part-of-Speech Tagging for All:  Overcoming Sparse and Noisy Data
Twitter Part-of-Speech Tagging for All: Overcoming Sparse and Noisy DataLeon Derczynski
 
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...Leon Derczynski
 
Microblog-genre noise and its impact on semantic annotation accuracy
Microblog-genre noise and its impact on semantic annotation accuracyMicroblog-genre noise and its impact on semantic annotation accuracy
Microblog-genre noise and its impact on semantic annotation accuracyLeon Derczynski
 
Empirical Validation of Reichenbach’s Tense Framework
Empirical Validation of Reichenbach’s Tense FrameworkEmpirical Validation of Reichenbach’s Tense Framework
Empirical Validation of Reichenbach’s Tense FrameworkLeon Derczynski
 
Towards Context-Aware Search and Analysis on Social Media Data
Towards Context-Aware Search and Analysis on Social Media DataTowards Context-Aware Search and Analysis on Social Media Data
Towards Context-Aware Search and Analysis on Social Media DataLeon Derczynski
 
TIMEN: An Open Temporal Expression Normalisation Resource
TIMEN: An Open Temporal Expression Normalisation ResourceTIMEN: An Open Temporal Expression Normalisation Resource
TIMEN: An Open Temporal Expression Normalisation ResourceLeon Derczynski
 
Review of: Challenges of migrating to agile methodologies
Review of: Challenges of migrating to agile methodologiesReview of: Challenges of migrating to agile methodologies
Review of: Challenges of migrating to agile methodologiesLeon Derczynski
 

Mehr von Leon Derczynski (20)

Joint Rumour Stance and Veracity
Joint Rumour Stance and VeracityJoint Rumour Stance and Veracity
Joint Rumour Stance and Veracity
 
State of Tools for NLP in Danish: 2018
State of Tools for NLP in Danish: 2018State of Tools for NLP in Danish: 2018
State of Tools for NLP in Danish: 2018
 
RumourEval
RumourEvalRumourEval
RumourEval
 
Broad Twitter Corpus: A Diverse Named Entity Recognition Resource
Broad Twitter Corpus: A Diverse Named Entity Recognition ResourceBroad Twitter Corpus: A Diverse Named Entity Recognition Resource
Broad Twitter Corpus: A Diverse Named Entity Recognition Resource
 
Handling and Mining Linguistic Variation in UGC
Handling and Mining Linguistic Variation in UGCHandling and Mining Linguistic Variation in UGC
Handling and Mining Linguistic Variation in UGC
 
Efficient named entity annotation through pre-empting
Efficient named entity annotation through pre-emptingEfficient named entity annotation through pre-empting
Efficient named entity annotation through pre-empting
 
Leveraging the Power of Social Media
Leveraging the Power of Social MediaLeveraging the Power of Social Media
Leveraging the Power of Social Media
 
Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
Corpus Annotation through Crowdsourcing: Towards Best Practice GuidelinesCorpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
 
Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...
Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...
Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...
 
Starting to Process Social Media
Starting to Process Social MediaStarting to Process Social Media
Starting to Process Social Media
 
Christmas Presentation at Aarhus: What I do
Christmas Presentation at Aarhus: What I doChristmas Presentation at Aarhus: What I do
Christmas Presentation at Aarhus: What I do
 
Recognising and Interpreting Named Temporal Expressions
Recognising and Interpreting Named Temporal ExpressionsRecognising and Interpreting Named Temporal Expressions
Recognising and Interpreting Named Temporal Expressions
 
TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog TextTwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
 
Twitter Part-of-Speech Tagging for All: Overcoming Sparse and Noisy Data
 Twitter Part-of-Speech Tagging for All:  Overcoming Sparse and Noisy Data Twitter Part-of-Speech Tagging for All:  Overcoming Sparse and Noisy Data
Twitter Part-of-Speech Tagging for All: Overcoming Sparse and Noisy Data
 
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
 
Microblog-genre noise and its impact on semantic annotation accuracy
Microblog-genre noise and its impact on semantic annotation accuracyMicroblog-genre noise and its impact on semantic annotation accuracy
Microblog-genre noise and its impact on semantic annotation accuracy
 
Empirical Validation of Reichenbach’s Tense Framework
Empirical Validation of Reichenbach’s Tense FrameworkEmpirical Validation of Reichenbach’s Tense Framework
Empirical Validation of Reichenbach’s Tense Framework
 
Towards Context-Aware Search and Analysis on Social Media Data
Towards Context-Aware Search and Analysis on Social Media DataTowards Context-Aware Search and Analysis on Social Media Data
Towards Context-Aware Search and Analysis on Social Media Data
 
TIMEN: An Open Temporal Expression Normalisation Resource
TIMEN: An Open Temporal Expression Normalisation ResourceTIMEN: An Open Temporal Expression Normalisation Resource
TIMEN: An Open Temporal Expression Normalisation Resource
 
Review of: Challenges of migrating to agile methodologies
Review of: Challenges of migrating to agile methodologiesReview of: Challenges of migrating to agile methodologies
Review of: Challenges of migrating to agile methodologies
 

Kürzlich hochgeladen

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 

Kürzlich hochgeladen (20)

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 

Determining the Types of Temporal Relations in Discourse

  • 1. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Determining the Types of Temporal Relations in Discourse Leon Derczynski University of Sheffield 5 March, 2013 Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 2. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion The Role of Time Why is time important in language processing? World state changes constantly Every empirical assertion has temporal bounds “The sky is blue”, but it was not always Without it, na¨ıve knowledge extraction will fail (given an Almanac of Presidents, who is President?) By understanding temporal information, you will do better knowledge extraction. Overall goal How do we automatically understand temporal information in natural languages? Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 3. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Temporal Information Extraction Existing state of the art How can we categorise types of temporal information? Events – e.g. occurrences, states Temporal expressions (timexes) – e.g. dates, durations Links – relations between pairs of events or times Supporting texts – e.g. action cardinality, event ordering We develop and use ISO-TimeML to annotate these entities. Main dataset: TimeBank (about 180 annotated documents) Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 4. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion TimeML Organizers <EVENT eid="e2120" class="REPORTING">state</EVENT> the <TIMEX3 tid="t29" type="DURATION" value="P2D" temporalFunction="false" functionInDocument="NONE">two days</TIMEX3> of music, dancing, and speeches is <EVENT eid="e2123" class="I STATE">expected</EVENT> to <EVENT eid="e13" class="OCCURRENCE">draw</EVENT> some two million people. <TLINK eventID="e2123" relatedToTime="t29" relType="BEFORE"/> Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 5. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Times and Events What are temporal expressions? They refer to a time Subtasks: recognition and interpretation; SotA recognition is 0.86 F1 What do we consider as events? Verbal, nominal State of the art: 0.90 F1 for recognition Doesn’t cover complex structure; e.g. a music festival Events are not very useful unless related to other temporal entities How can we describe this structural complexity? Start by modeling the document as a graph Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 6. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Temporal relations What are temporal relations? They describe the links between times and events Can capture both complex and partial orderings What kinds of temporal relation are there? 1 Interval (before, after, included by, simultaneous) 2 Subordinate (reported speech, modal, conditional) 3 Aspectual (start, culmination – see Vendler, Comrie) This work is concerned with the coarsest-grained information: the first category Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 7. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Problem Definition How are these relations represented? Temporal interval algebra (Allen 1984) – a set of 14 relations between a pair of intervals TimeML defines a set of relation types and also types of interval What is our problem? Assume discourse w/ perfect event and timex annotations In fact, assume we know which intervals to link! “Given an ordered pair of intervals (arg1, arg2), which relation in the set Rallen describes them?” Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 8. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Relation Extraction How can relations be labelled? Machine learning Using TimeML attributes: some success Using syntactic relations: matches SotA in tree kernels What’s the state of the art? 2007: Mani et al.: baseline 56%, system has 61% accuracy 2008: Bethard, Chambers: many sophisticated improvements – ILP, timex-timex ordering. Improved on Mani et al. by 1.5%. 2010: TempEval-2: baseline 58%, best was 65% accuracy Why do we find this performance ceiling? Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 9. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Sources of Temporal Relation Information What are we missing? There is a heterogeneous set of temporal information types, including: Explicit signals – subsequently, as soon as Linguistic theory offers some models What is the evidence these two types will help? Conducted failure analysis: TempEval-2010 1 Multiple diverse approaches, same dataset Find the set of difficult links Characterise information supporting these links 1 Verhagen et al., 2010: Semeval Task 13 - TempEval-2 Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 10. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Task C: event−timex intra−sentence relations All systems correct 1 fails 2 fail 3 fail 4 fail 5 fail All systems fail Task D: event−DCT relations All systems correct 1 fails 2 fail 3 fail 4 fail All systems fail Task E: main event inter−sentence relations All systems correct 1 fails 2 fail 3 fail 4 fail 5 fail All systems fail Task F: event−subordinate intra−sentence relations All systems correct 1 fails 2 fail 3 fail 4 fail All systems fail Figure: TempEval-2 relation labelling tasks, showing proportions of relations according to the number of systems that gave correct labels. Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 11. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion C D E F Proportion of links within a task that are difficult Task %difficult 010203040 The problem is difficult, and there is a consistently-difficult set of links. Perhaps we are ignoring some critical information. Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 12. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion New sources of ordering information Next step: manually characterise each “difficult” link. Attempt to identify what kind of information could be used to label it. Sources to investigate Explicit text – signals “After you pull the pin, throw the grenade” Sources to investigate Tensed relations “Having eaten, I left” Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 13. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Temporal Signals What are these? In TimeML, they are text annotated as being helpful to a temporal relation Used by 12.2% of TimeBank’s relations Are temporal signals useful? A resounding yes! 61% → 83% accuracy with simple features 2 This level of performance on event-event links is above general state-of-the-art Existing corpora are under-annotated 2 Derczynski and Gaizauskas, 2010: Using signals for temporal relation classification Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 14. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Temporal Signal Annotation How can we automatically annotate temporal signals? Define signals formally 3 Define a closed class of signals Re-annotate TimeBank Train discrimination and association We included dependency information and function tagging. 3 Derczynski and Gaizauskas, 2011: A corpus based study of temporal signals Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 15. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Results How well did our approach perform? 1 Discrimination: 92% accuracy, 75% accuracy on positives (0.77 IAA) 2 Association: 99% accuracy / 80% error reduction 3 Inductive bias towards independence assumption was harmful (MaxEnt, NBayes) Results: 16% of links have signals (31% improvement) and can now be labelled at high accuracy. What remains to be done? How can we remedy under-annotation at the source? Clear links to spatial signal annotation (e.g. -LOC tags) Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 16. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Reichenbach’s Model of Verbs How can we model tense in language? Each verb happens at event time, E The verb is uttered at speech time, S Past tense: E < S John ran. Present tense: E = S I’m free! What differentiates simple past from past perfect? John ran. is not the same as John had run. Introduce abstract reference time, R John had run. E < R < S Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 17. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Reasoning about tense How is Reichenbach’s model helpful? We can describe all verbal events as three points linked by either equality or precedence Automatic and quick inference for relating intervals Does it work? Conducted first corpus-driven validation of the framework For reporting-type links, we used features based on pairwise event-time relations Add one feature representing the Reichenbachian ordering Classifier reached 59% accuracy (48% MCC baseline) on 9% of all temporal relations (above SotA) Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 18. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Extending the model How else can we use the model? Positional use Timexes relate to reference points Only consider cases where the event and time are linguistically connected Identify these using dependency parses Add a feature hinting at the ordering We reach 75% accuracy from a 67% baseline (above SotA) Also useful for timex standard transduction 4 4 Derczynski, Llorens and Saquete 2012: Massively increasing TIMEX3 resources Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 19. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Contributions A large part of the difficult relation set (roughly 60%) is catered for by these new information sources. Difficult task, with notable impact Focus on automatic annotation of temporal relations Pushed beyond SotA understanding of the problem Creation of and contribution to language resources – e.g. ISO-TimeML, RTMML, CAVaT (among others) .. where could we go next? Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 20. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Future Forensic analysis How can we build a consistent event model from multiple semi-reliable accounts of an event? Challenges: Multi-document event and actor co-reference Story conflict resolution 5 Spatial and temporal IE from colloquial text Building and resolving accurate co-constraining models from unreliable data (belief networks) 5 Regneri, Koller and Pinkal 2010: Learning Script Knowledge with Web Experiments Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 21. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Future Assertion bounding All assertions have temporal bounds. How can we determine these? Challenges: Accurate extraction of document temporal structure Automated reasoning High-precision timex normalisation Doing temporal IE & IR at gigaword scale Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 22. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Future Temporal dataset construction Many current systems index whole documents by date, but information is more nuanced than that Challenges: Mapping events to temporal data points Storing and extracting events Anchoring events with uncertain bounds (“last year’s fighting” vs. “the fighting on April 23, 2011”) Mining complex super-events; e.g. the Fukushima disaster; what happened when? Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 23. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Recap Temporality is ubiquitous, in the world around us and in the language we use to describe our world Processing it automatically is difficult Doing high-performance temporal IE opens exciting research avenues Thank you for your time. Are there any questions? Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 24. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Labellings as probability distributions Automated methods (e.g. classifiers) may have varying degrees of confidence about a link’s label. We could assign a set of labels and probabilities to each label. Consistency constraints allow us to find the most-likely possible graph. A:B → before: 0.9; after 0.1 B:C → before: 0.5; simultaneous: 0.5 A:C → before: 1.0 Very time-consuming to compute – optimisations welcome! Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse
  • 25. Introduction Concepts and tools Relation Extraction Temporal Signals Modelling Tense Conclusion Unuttered temporal orderings Event/Time distance “When I was brushing my teeth” → This event happens at least twice daily; assume this instance is 0-16 hours away Complex events “When we were putting up the tents for the festival” → near the beginning of / just before the “festival” event Leon Derczynski University of Sheffield Determining the Types of Temporal Relations in Discourse