SlideShare ist ein Scribd-Unternehmen logo
1 von 25
An Analysis of Causality between Events and its Relation to 
Temporal Information 
The Story about Causelity and his friendship 
with Time 
Paramita Mirza Sara Tonelli 
paramita@fbk.eu satonelli@fbk.eu 
COLING 2014
One day at McRel Inc. 
Eve McRel, the head of Event department, is introducing the team’s 
new member… 
Let me introduce Casey, who will be responsible for extracting 
causality from text. Casey, Tim is the one responsible for temporal 
relations, you two should work together. 
Hello! I have some questions for you, Tim. 
Hi! Feel free to ask. We should have a drink sometimes. 
2 / 22
About their jobs 
Typhoon Haiyan struck the eastern Philippines on Friday, killing thousands of 
people. 
BEFORE 
CAUSE 
IS_INCLUDED 
So, when given a piece of text like that, my job is to tell that struck 
happened before killing, or that struck happened on Friday. 
I see. And my job is to determine that struck is the cause of killing. 
How do you learn to identify the temporal relations? 
Oh, I have this annotated corpus from TempEval-3. 
I learn a lot from that. 3 / 22
Casey is not as lucky as Tim, the TempEval-3 corpus that Tim has doesn’t have 
causal information. He asked Eve to provide him with a causality corpus so 
that he can learn from it. 
Eve ordered someone to investigate about resources on causality… 
Rink et al. (2010) 
Resources on causality 
Use Bethard’s corpus and 
Bethard et al. (2008) 
Causality between events 
under conjunction and 
shows that temporal information 
helps in identifying causal relations 
Girju et al. (2007) 
Causality between nominals 
(SemEval-2007 Task 4) 
Do et al. (2011) 
Causality between verb-verb, 
verb-noun, and noun-noun 
(20 news articles from CNN) 
Riaz and Girju (2013) 
Causality between verbal events with 
markers because and but 
(knowledge base of causal associations 
of verbs) 
4 / 22
Let’s create a causality corpus! 
Since the available resources on causality are not really what they 
wanted, Eve decided to create a new one, so she hired two interns… 
So guys, I want you to add causal information on top of the TempEval-3 
corpus. Can you do it? 
Banana? 
Argh, this won’t do. I need annotation guidelines for them. 
Eve then consulted some philosophers: Lewis, Cheng, Wolff and Talmy; 
and she decided to lean the guidelines on the Dynamics Model (Wolff), 
based on Talmy’s force dynamic account of causality. 
5 / 22
The Annotator’s Guide to the Causality 
(a trilogy in two parts) 
Part 1: CSIGNAL 
A textual element indicating the presence of a causal relation. Parallel to 
SIGNAL to mark the presence of a temporal relation in TimeML. 
• Prepositions 
because of, as a result of, due to, … 
• Conjunctions 
because, since, so that, … 
• Adverbial connectors 
as a result, so, therefore, … 
• Clause-integrated expressions 
the result is, that’s why, … 
6 / 22
The Annotator’s Guide to the Causality 
(a trilogy in two parts) 
Part 2: CLINK 
A directional one-to-one relation where source = causing event and 
target = caused event, (optional) c-signalID = ID of related CSIGNAL. 
Parallel to TLINK for temporal relations in TimeML. 
• Expressions containing affect verbs affect, influence, determine, change 
– Ogun CAN crisisS affects the launchT of the All Progressives Congress 
• Expressions containing link verbs link, lead, depend (on) 
– An earthquakeT in North America was linked to a tsunamiS in Japan 
• Basic construction involving causative verbs of CAUSE, ENABLE, PREVENT type 
– The purchaseS caused the creationT of the current building 7 / 22
The Annotator’s Guide to the Causality 
(a trilogy in two parts) 
Part 2: CLINK 
A directional one-to-one relation where source = causing event and 
target = caused event, (optional) c-signalID = ID of related C-SIGNAL. 
Parallel to TLINK for temporal relations in TimeML. 
• Periphrastic causatives involving causative verbs of CAUSE, ENABLE, PREVENT type 
– The blastS prompts the boat to heelT violently 
• Expressions containing CSIGNALs 
– Iraq said it invadedT Kuwait because of disputesS over oil and money 
8 / 22
The interns’ discussion 
Hmmm… wowee~ Evo in kalarel no anotatata! 
(Hmmm… weird. Some events involved in causal relations were not 
annotated.) 
Real?? May para temporel awali jengajenga 
Anotatata wuliloo! 
(Really? Maybe because it was originally built for temporal relations. 
Let’s annotate them!) 
9 / 22
The interns’ reports 
http://hlt.fbk.eu/technologies/causal-timebank 10 / 22
How to learn causality? 
Casey decided to divide the job into two tasks: 
1. Labeling CSIGNAL: given a text (annotated with events and time 
expressions), decide whether a token is part of causal signals or not 
2. Identifying CLINK: given a pair of events, decide whether the events 
are connected by an explicit causal link 
Both tasks are basically classification tasks. 
I will use the created causality corpus to learn from. 
To evaluate my learning ability, I will use the 5-fold cross-validation scheme. 
11 / 22
Inside Casey’s brain 
on labeling CSIGNAL 
• Text chunking task: a token is classified into B-CSIGNAL, I-CSIGNAL 
and O (for other) 
• Pre-processing: 
– TextPro tool (Pianta et al., 2008) to get NP chunking and named entity 
information 
– Stanford CoreNLP tool to get lemma, PoS tags and dependency relations 
between tokens 
– addDiscourse tool (Pitler and Nenkova, 2009) to get discourse connective 
type 
12 / 22
Inside Casey’s brain 
on labeling CSIGNAL 
• Classifier: 
– Built using SVM algorithm provided by YamCha 
– Features vectors: token, lemma, PoS tags, NP chunking, dependency 
relations, and several binary features indicating whether a token is: 
• part of an event or a temporal expression 
• part of a named entity 
• part of a specific discourse connective type 
13 / 22
Casey’s note 
on labeling CSIGNAL 
System Precision Recall F-score 
Rule-based (baseline) 54.33% 40.35% 46.31% 
Supervised chunking 91.03% 41.76% 57.26% 
Rule-based system basically labels as CSIGNAL 
all causal connectors listed in the annotation guidelines 
and those appearing in specific syntactic construction 
14 / 22
Inside Casey’s brain 
on identifying CLINK 
• Classification task: an ordered pair of events (e1, e2) is classified into 
CLINK (e1 as source, e2 as target), CLINK-R (reversed order of source 
and target) and NO-REL 
• Candidate pairs: 
– Every possible combination of events in the same sentence in a forward 
manner, e.g. ”The e1 and e2 are e3”, event pairs are (e1, e2), (e1, e3), (e2, e3) 
– Combination of each event in a sentence with events in the following 
sentence (only consider linking events in two consecutive sentences) 
• Pre-processing: 
– Stanford CoreNLP tool to get lemma, PoS tags and dependency relations 
between tokens 
15 / 22
Inside Casey’s brain 
on identifying CLINK (continued) 
• Classifier: 
– Built using SVM algorithm provided by YamCha 
– Features vectors: 
• String and grammatical: token, lemma and PoS tags of e1 and e2, and a 
binary feature (e1 and e2 have the same PoS tags) 
• Textual context: sentence distance and event distance of e1 and e2 
• Event attributes: class, tense, aspect and polarity of e1 and e2 as specified 
in TimeML 
16 / 22
Inside Casey’s brain 
on identifying CLINK (continued) 
• Classifier: 
– Built using SVM algorithm provided by YamCha 
– Features vectors: 
• Dependency information: dependency path between e1 and e2 (if any), 
type of causative verbs connecting them (if any), and a binary feature 
(e1/e2 is the root of the sentence) 
• Causal signals: causal signals around e1 and e2, position of the signal 
(between e1 and e2, or before e1), dependency path between e1/e2 and 
the signal 
• Temporal relations (TLINKs): temporal relation type of TLINK connecting 
e1/e2 (if any), taken from gold annotated corpus 
17 / 22
Casey’s note 
on identifying CLINK 
System Precision Recall F-score 
Rule-based (baseline) 36.79% 12.26% 18.40% 
Supervised classification 
(with gold CSIGNALs) 
74.67% 35.22% 47.86% 
- without dependency feature 65.77% 30.82% 41.97% 
- without CSIGNAL feature 57.53% 13.21% 21.48% 
- without TLINK feature 61.59% 29.25% 39.66% 
Supervised classification 
67.29% 22.64% 33.88% 
(with automatic CSIGNALs) 
Rule-based system basically looks for specific dependency constructions 
where an affect verb, a link verb, a causative verb (basic and periphrastic 
constructions) or a causal signal is connected to two events 18 / 22
Just another meeting at McRels Inc. 
Casey reports some findings from his learning activity… 
On labeling CSIGNAL, the low recall is most probably due to data sparseness. 
Well, that’s expected, only 47% of documents in the corpus contain CSIGNAL. 
We should enrich the learning data, maybe with Penn Discourse Treebank (PDTB)? 
Yeah, maybe. Furthermore, false negatives are mostly because of ambiguous 
causal signals, such as by and and. 
For conjunction and, perhaps the corpus by Bethard et al. (2008) can help? 
19 / 22
Just another meeting at McRels Inc. 
Casey reports some findings from his learning activity… 
Hmm.. right. Meanwhile, on identifying CLINK, most mistakes are caused by 
dependency parser errors. 
Try to use another dependency parser. For example… C&C tool (Curran et al., 2007) 
since it has a better coverage of long-range dependencies. 
Okay, worth to try. And again, data sparseness is an issue. Could you provide 
me with more learning data? 
One option is to hire interns again to annotate AQUAINT corpus from TempEval-3. 
Or, using causality information in PDTB, but pre-processing is needed because 
the causality is not between events. Let’s see what I can do…20 / 22
One evening at an Irish pub 
While Tim and Casey are enjoying their Guinness… 
So, how’s your work going? 
It’s going well. 
There are some future directions to improve my learning ability. 
By the way, the temporal information helps me a lot! 
Especially to decide the causality direction, because you know, 
cause should happen before the effect. 
Wow, cool! Perhaps the causal information can also help me too? 
21 / 22
One evening at an Irish pub 
While Tim and Casey are enjoying their Guinness… 
Well, the number of TLINKs that have underlying CLINKs will be much lower. 
So… maybe the causal information won’t help that much. 
Besides, look at this sentence… 
Hmmm… interesting, the cause is after the effect. We should discuss 
more about it. But now… let’s celebrate our future collaboration. 
Cheers! 
Cheers! 
“But some analysts questioned T how much of an impact the retirement package 
will have, because few jobs will end S up being eliminated.” 
…and their story continues, in the next paper ;) 22 / 22
Cast 
Casey 
Tim 
Eve McRel 
Minion 1 
Minion 2 
Causal Relation Extraction System 
Temporal Relation Extraction System 
Event Relation Repository 
Paramita Mirza 
Sara Tonelli 
Thank You!
Interns’ additional reports
Casey’s note 
on dependency parser errors 
“StatesWest Airlines withdrew T its offer to acquire Mesa Airlines because 
the Farmington carrier did not respond S to its offer” 
According to Stanford dependency parser, 
because is a marker of acquire instead of withdrew

Weitere ähnliche Inhalte

Ähnlich wie An Analysis of Causality between Events and its Relation to Temporal Information

Temporal Relations with Signals: the case of Italian Temporal Prepositions
Temporal Relations with Signals: the case of Italian Temporal PrepositionsTemporal Relations with Signals: the case of Italian Temporal Prepositions
Temporal Relations with Signals: the case of Italian Temporal PrepositionsTommasoC_80
 
Determining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in DiscourseDetermining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in DiscourseLeon Derczynski
 
Event templates for Question answering
Event templates for Question answeringEvent templates for Question answering
Event templates for Question answeringBarbara Starr
 
2015-02-25 research seminal, Paul Seitlinger
2015-02-25 research seminal, Paul Seitlinger2015-02-25 research seminal, Paul Seitlinger
2015-02-25 research seminal, Paul Seitlingerifi8106tlu
 
Event detection and summarization based on social networks and semantic query...
Event detection and summarization based on social networks and semantic query...Event detection and summarization based on social networks and semantic query...
Event detection and summarization based on social networks and semantic query...ijnlc
 
Event templatesfor qa2
Event templatesfor qa2Event templatesfor qa2
Event templatesfor qa2Barbara Starr
 
Event templates for improved narrative understanding in Question Answering sy...
Event templates for improved narrative understanding in Question Answering sy...Event templates for improved narrative understanding in Question Answering sy...
Event templates for improved narrative understanding in Question Answering sy...Barbara Starr
 
Determining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in DiscourseDetermining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in DiscourseLeon Derczynski
 
A ROBUST JOINT-TRAINING GRAPHNEURALNETWORKS MODEL FOR EVENT DETECTIONWITHSYMM...
A ROBUST JOINT-TRAINING GRAPHNEURALNETWORKS MODEL FOR EVENT DETECTIONWITHSYMM...A ROBUST JOINT-TRAINING GRAPHNEURALNETWORKS MODEL FOR EVENT DETECTIONWITHSYMM...
A ROBUST JOINT-TRAINING GRAPHNEURALNETWORKS MODEL FOR EVENT DETECTIONWITHSYMM...kevig
 
A Robust Joint-Training Graph Neural Networks Model for Event Detection with ...
A Robust Joint-Training Graph Neural Networks Model for Event Detection with ...A Robust Joint-Training Graph Neural Networks Model for Event Detection with ...
A Robust Joint-Training Graph Neural Networks Model for Event Detection with ...kevig
 
Proposed Event Processing Definitions ,September 20, 2006
Proposed Event Processing Definitions,September 20, 2006Proposed Event Processing Definitions,September 20, 2006
Proposed Event Processing Definitions ,September 20, 2006Tim Bass
 
INFLUENCE OF PRIORS OVER MULTITYPED OBJECT IN EVOLUTIONARY CLUSTERING
INFLUENCE OF PRIORS OVER MULTITYPED OBJECT IN EVOLUTIONARY CLUSTERINGINFLUENCE OF PRIORS OVER MULTITYPED OBJECT IN EVOLUTIONARY CLUSTERING
INFLUENCE OF PRIORS OVER MULTITYPED OBJECT IN EVOLUTIONARY CLUSTERINGcscpconf
 
Influence of priors over multityped object in evolutionary clustering
Influence of priors over multityped object in evolutionary clusteringInfluence of priors over multityped object in evolutionary clustering
Influence of priors over multityped object in evolutionary clusteringcsandit
 
AI2 day.pptx
AI2 day.pptxAI2 day.pptx
AI2 day.pptxSan Kim
 
The Role Of Ontology In Modern Expert Systems Dallas 2008
The Role Of Ontology In Modern Expert Systems   Dallas   2008The Role Of Ontology In Modern Expert Systems   Dallas   2008
The Role Of Ontology In Modern Expert Systems Dallas 2008Jason Morris
 
NAISTビッグデータシンポジウム - 情報 松本先生
NAISTビッグデータシンポジウム - 情報 松本先生NAISTビッグデータシンポジウム - 情報 松本先生
NAISTビッグデータシンポジウム - 情報 松本先生ysuzuki-naist
 
Criminal and Civil Identification with DNA Databases Using Bayesian Networks
Criminal and Civil Identification with DNA Databases Using Bayesian NetworksCriminal and Civil Identification with DNA Databases Using Bayesian Networks
Criminal and Civil Identification with DNA Databases Using Bayesian NetworksCSCJournals
 

Ähnlich wie An Analysis of Causality between Events and its Relation to Temporal Information (20)

Temporal Relations with Signals: the case of Italian Temporal Prepositions
Temporal Relations with Signals: the case of Italian Temporal PrepositionsTemporal Relations with Signals: the case of Italian Temporal Prepositions
Temporal Relations with Signals: the case of Italian Temporal Prepositions
 
Determining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in DiscourseDetermining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in Discourse
 
Event templates for Question answering
Event templates for Question answeringEvent templates for Question answering
Event templates for Question answering
 
2015-02-25 research seminal, Paul Seitlinger
2015-02-25 research seminal, Paul Seitlinger2015-02-25 research seminal, Paul Seitlinger
2015-02-25 research seminal, Paul Seitlinger
 
Event detection and summarization based on social networks and semantic query...
Event detection and summarization based on social networks and semantic query...Event detection and summarization based on social networks and semantic query...
Event detection and summarization based on social networks and semantic query...
 
Event templatesfor qa2
Event templatesfor qa2Event templatesfor qa2
Event templatesfor qa2
 
Event templates for improved narrative understanding in Question Answering sy...
Event templates for improved narrative understanding in Question Answering sy...Event templates for improved narrative understanding in Question Answering sy...
Event templates for improved narrative understanding in Question Answering sy...
 
Determining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in DiscourseDetermining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in Discourse
 
A ROBUST JOINT-TRAINING GRAPHNEURALNETWORKS MODEL FOR EVENT DETECTIONWITHSYMM...
A ROBUST JOINT-TRAINING GRAPHNEURALNETWORKS MODEL FOR EVENT DETECTIONWITHSYMM...A ROBUST JOINT-TRAINING GRAPHNEURALNETWORKS MODEL FOR EVENT DETECTIONWITHSYMM...
A ROBUST JOINT-TRAINING GRAPHNEURALNETWORKS MODEL FOR EVENT DETECTIONWITHSYMM...
 
A Robust Joint-Training Graph Neural Networks Model for Event Detection with ...
A Robust Joint-Training Graph Neural Networks Model for Event Detection with ...A Robust Joint-Training Graph Neural Networks Model for Event Detection with ...
A Robust Joint-Training Graph Neural Networks Model for Event Detection with ...
 
Proposed Event Processing Definitions ,September 20, 2006
Proposed Event Processing Definitions,September 20, 2006Proposed Event Processing Definitions,September 20, 2006
Proposed Event Processing Definitions ,September 20, 2006
 
An Ontology Design Pattern to Define Explanations
An Ontology Design Pattern to Define ExplanationsAn Ontology Design Pattern to Define Explanations
An Ontology Design Pattern to Define Explanations
 
ppt
pptppt
ppt
 
INFLUENCE OF PRIORS OVER MULTITYPED OBJECT IN EVOLUTIONARY CLUSTERING
INFLUENCE OF PRIORS OVER MULTITYPED OBJECT IN EVOLUTIONARY CLUSTERINGINFLUENCE OF PRIORS OVER MULTITYPED OBJECT IN EVOLUTIONARY CLUSTERING
INFLUENCE OF PRIORS OVER MULTITYPED OBJECT IN EVOLUTIONARY CLUSTERING
 
Influence of priors over multityped object in evolutionary clustering
Influence of priors over multityped object in evolutionary clusteringInfluence of priors over multityped object in evolutionary clustering
Influence of priors over multityped object in evolutionary clustering
 
AI2 day.pptx
AI2 day.pptxAI2 day.pptx
AI2 day.pptx
 
The Role Of Ontology In Modern Expert Systems Dallas 2008
The Role Of Ontology In Modern Expert Systems   Dallas   2008The Role Of Ontology In Modern Expert Systems   Dallas   2008
The Role Of Ontology In Modern Expert Systems Dallas 2008
 
PowerMagpie
PowerMagpiePowerMagpie
PowerMagpie
 
NAISTビッグデータシンポジウム - 情報 松本先生
NAISTビッグデータシンポジウム - 情報 松本先生NAISTビッグデータシンポジウム - 情報 松本先生
NAISTビッグデータシンポジウム - 情報 松本先生
 
Criminal and Civil Identification with DNA Databases Using Bayesian Networks
Criminal and Civil Identification with DNA Databases Using Bayesian NetworksCriminal and Civil Identification with DNA Databases Using Bayesian Networks
Criminal and Civil Identification with DNA Databases Using Bayesian Networks
 

Kürzlich hochgeladen

trihybrid cross , test cross chi squares
trihybrid cross , test cross chi squarestrihybrid cross , test cross chi squares
trihybrid cross , test cross chi squaresusmanzain586
 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomyDrAnita Sharma
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxSTOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxMurugaveni B
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024innovationoecd
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPirithiRaju
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...D. B. S. College Kanpur
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxmaryFF1
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubaikojalkojal131
 
Servosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by PetrovicServosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by PetrovicAditi Jain
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxJorenAcuavera1
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxNandakishor Bhaurao Deshmukh
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPirithiRaju
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...Universidade Federal de Sergipe - UFS
 

Kürzlich hochgeladen (20)

trihybrid cross , test cross chi squares
trihybrid cross , test cross chi squarestrihybrid cross , test cross chi squares
trihybrid cross , test cross chi squares
 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomy
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxSTOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
 
Servosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by PetrovicServosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by Petrovic
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptx
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdf
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
 
Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
 

An Analysis of Causality between Events and its Relation to Temporal Information

  • 1. An Analysis of Causality between Events and its Relation to Temporal Information The Story about Causelity and his friendship with Time Paramita Mirza Sara Tonelli paramita@fbk.eu satonelli@fbk.eu COLING 2014
  • 2. One day at McRel Inc. Eve McRel, the head of Event department, is introducing the team’s new member… Let me introduce Casey, who will be responsible for extracting causality from text. Casey, Tim is the one responsible for temporal relations, you two should work together. Hello! I have some questions for you, Tim. Hi! Feel free to ask. We should have a drink sometimes. 2 / 22
  • 3. About their jobs Typhoon Haiyan struck the eastern Philippines on Friday, killing thousands of people. BEFORE CAUSE IS_INCLUDED So, when given a piece of text like that, my job is to tell that struck happened before killing, or that struck happened on Friday. I see. And my job is to determine that struck is the cause of killing. How do you learn to identify the temporal relations? Oh, I have this annotated corpus from TempEval-3. I learn a lot from that. 3 / 22
  • 4. Casey is not as lucky as Tim, the TempEval-3 corpus that Tim has doesn’t have causal information. He asked Eve to provide him with a causality corpus so that he can learn from it. Eve ordered someone to investigate about resources on causality… Rink et al. (2010) Resources on causality Use Bethard’s corpus and Bethard et al. (2008) Causality between events under conjunction and shows that temporal information helps in identifying causal relations Girju et al. (2007) Causality between nominals (SemEval-2007 Task 4) Do et al. (2011) Causality between verb-verb, verb-noun, and noun-noun (20 news articles from CNN) Riaz and Girju (2013) Causality between verbal events with markers because and but (knowledge base of causal associations of verbs) 4 / 22
  • 5. Let’s create a causality corpus! Since the available resources on causality are not really what they wanted, Eve decided to create a new one, so she hired two interns… So guys, I want you to add causal information on top of the TempEval-3 corpus. Can you do it? Banana? Argh, this won’t do. I need annotation guidelines for them. Eve then consulted some philosophers: Lewis, Cheng, Wolff and Talmy; and she decided to lean the guidelines on the Dynamics Model (Wolff), based on Talmy’s force dynamic account of causality. 5 / 22
  • 6. The Annotator’s Guide to the Causality (a trilogy in two parts) Part 1: CSIGNAL A textual element indicating the presence of a causal relation. Parallel to SIGNAL to mark the presence of a temporal relation in TimeML. • Prepositions because of, as a result of, due to, … • Conjunctions because, since, so that, … • Adverbial connectors as a result, so, therefore, … • Clause-integrated expressions the result is, that’s why, … 6 / 22
  • 7. The Annotator’s Guide to the Causality (a trilogy in two parts) Part 2: CLINK A directional one-to-one relation where source = causing event and target = caused event, (optional) c-signalID = ID of related CSIGNAL. Parallel to TLINK for temporal relations in TimeML. • Expressions containing affect verbs affect, influence, determine, change – Ogun CAN crisisS affects the launchT of the All Progressives Congress • Expressions containing link verbs link, lead, depend (on) – An earthquakeT in North America was linked to a tsunamiS in Japan • Basic construction involving causative verbs of CAUSE, ENABLE, PREVENT type – The purchaseS caused the creationT of the current building 7 / 22
  • 8. The Annotator’s Guide to the Causality (a trilogy in two parts) Part 2: CLINK A directional one-to-one relation where source = causing event and target = caused event, (optional) c-signalID = ID of related C-SIGNAL. Parallel to TLINK for temporal relations in TimeML. • Periphrastic causatives involving causative verbs of CAUSE, ENABLE, PREVENT type – The blastS prompts the boat to heelT violently • Expressions containing CSIGNALs – Iraq said it invadedT Kuwait because of disputesS over oil and money 8 / 22
  • 9. The interns’ discussion Hmmm… wowee~ Evo in kalarel no anotatata! (Hmmm… weird. Some events involved in causal relations were not annotated.) Real?? May para temporel awali jengajenga Anotatata wuliloo! (Really? Maybe because it was originally built for temporal relations. Let’s annotate them!) 9 / 22
  • 10. The interns’ reports http://hlt.fbk.eu/technologies/causal-timebank 10 / 22
  • 11. How to learn causality? Casey decided to divide the job into two tasks: 1. Labeling CSIGNAL: given a text (annotated with events and time expressions), decide whether a token is part of causal signals or not 2. Identifying CLINK: given a pair of events, decide whether the events are connected by an explicit causal link Both tasks are basically classification tasks. I will use the created causality corpus to learn from. To evaluate my learning ability, I will use the 5-fold cross-validation scheme. 11 / 22
  • 12. Inside Casey’s brain on labeling CSIGNAL • Text chunking task: a token is classified into B-CSIGNAL, I-CSIGNAL and O (for other) • Pre-processing: – TextPro tool (Pianta et al., 2008) to get NP chunking and named entity information – Stanford CoreNLP tool to get lemma, PoS tags and dependency relations between tokens – addDiscourse tool (Pitler and Nenkova, 2009) to get discourse connective type 12 / 22
  • 13. Inside Casey’s brain on labeling CSIGNAL • Classifier: – Built using SVM algorithm provided by YamCha – Features vectors: token, lemma, PoS tags, NP chunking, dependency relations, and several binary features indicating whether a token is: • part of an event or a temporal expression • part of a named entity • part of a specific discourse connective type 13 / 22
  • 14. Casey’s note on labeling CSIGNAL System Precision Recall F-score Rule-based (baseline) 54.33% 40.35% 46.31% Supervised chunking 91.03% 41.76% 57.26% Rule-based system basically labels as CSIGNAL all causal connectors listed in the annotation guidelines and those appearing in specific syntactic construction 14 / 22
  • 15. Inside Casey’s brain on identifying CLINK • Classification task: an ordered pair of events (e1, e2) is classified into CLINK (e1 as source, e2 as target), CLINK-R (reversed order of source and target) and NO-REL • Candidate pairs: – Every possible combination of events in the same sentence in a forward manner, e.g. ”The e1 and e2 are e3”, event pairs are (e1, e2), (e1, e3), (e2, e3) – Combination of each event in a sentence with events in the following sentence (only consider linking events in two consecutive sentences) • Pre-processing: – Stanford CoreNLP tool to get lemma, PoS tags and dependency relations between tokens 15 / 22
  • 16. Inside Casey’s brain on identifying CLINK (continued) • Classifier: – Built using SVM algorithm provided by YamCha – Features vectors: • String and grammatical: token, lemma and PoS tags of e1 and e2, and a binary feature (e1 and e2 have the same PoS tags) • Textual context: sentence distance and event distance of e1 and e2 • Event attributes: class, tense, aspect and polarity of e1 and e2 as specified in TimeML 16 / 22
  • 17. Inside Casey’s brain on identifying CLINK (continued) • Classifier: – Built using SVM algorithm provided by YamCha – Features vectors: • Dependency information: dependency path between e1 and e2 (if any), type of causative verbs connecting them (if any), and a binary feature (e1/e2 is the root of the sentence) • Causal signals: causal signals around e1 and e2, position of the signal (between e1 and e2, or before e1), dependency path between e1/e2 and the signal • Temporal relations (TLINKs): temporal relation type of TLINK connecting e1/e2 (if any), taken from gold annotated corpus 17 / 22
  • 18. Casey’s note on identifying CLINK System Precision Recall F-score Rule-based (baseline) 36.79% 12.26% 18.40% Supervised classification (with gold CSIGNALs) 74.67% 35.22% 47.86% - without dependency feature 65.77% 30.82% 41.97% - without CSIGNAL feature 57.53% 13.21% 21.48% - without TLINK feature 61.59% 29.25% 39.66% Supervised classification 67.29% 22.64% 33.88% (with automatic CSIGNALs) Rule-based system basically looks for specific dependency constructions where an affect verb, a link verb, a causative verb (basic and periphrastic constructions) or a causal signal is connected to two events 18 / 22
  • 19. Just another meeting at McRels Inc. Casey reports some findings from his learning activity… On labeling CSIGNAL, the low recall is most probably due to data sparseness. Well, that’s expected, only 47% of documents in the corpus contain CSIGNAL. We should enrich the learning data, maybe with Penn Discourse Treebank (PDTB)? Yeah, maybe. Furthermore, false negatives are mostly because of ambiguous causal signals, such as by and and. For conjunction and, perhaps the corpus by Bethard et al. (2008) can help? 19 / 22
  • 20. Just another meeting at McRels Inc. Casey reports some findings from his learning activity… Hmm.. right. Meanwhile, on identifying CLINK, most mistakes are caused by dependency parser errors. Try to use another dependency parser. For example… C&C tool (Curran et al., 2007) since it has a better coverage of long-range dependencies. Okay, worth to try. And again, data sparseness is an issue. Could you provide me with more learning data? One option is to hire interns again to annotate AQUAINT corpus from TempEval-3. Or, using causality information in PDTB, but pre-processing is needed because the causality is not between events. Let’s see what I can do…20 / 22
  • 21. One evening at an Irish pub While Tim and Casey are enjoying their Guinness… So, how’s your work going? It’s going well. There are some future directions to improve my learning ability. By the way, the temporal information helps me a lot! Especially to decide the causality direction, because you know, cause should happen before the effect. Wow, cool! Perhaps the causal information can also help me too? 21 / 22
  • 22. One evening at an Irish pub While Tim and Casey are enjoying their Guinness… Well, the number of TLINKs that have underlying CLINKs will be much lower. So… maybe the causal information won’t help that much. Besides, look at this sentence… Hmmm… interesting, the cause is after the effect. We should discuss more about it. But now… let’s celebrate our future collaboration. Cheers! Cheers! “But some analysts questioned T how much of an impact the retirement package will have, because few jobs will end S up being eliminated.” …and their story continues, in the next paper ;) 22 / 22
  • 23. Cast Casey Tim Eve McRel Minion 1 Minion 2 Causal Relation Extraction System Temporal Relation Extraction System Event Relation Repository Paramita Mirza Sara Tonelli Thank You!
  • 25. Casey’s note on dependency parser errors “StatesWest Airlines withdrew T its offer to acquire Mesa Airlines because the Farmington carrier did not respond S to its offer” According to Stanford dependency parser, because is a marker of acquire instead of withdrew