SlideShare a Scribd company logo
1 of 18
Download to read offline
Evaluating Text Coherence Based on
Semantic Similarity Graph
Jan Wira Gotama Putra and Takenobu Tokunaga
Tokyo Institute of Technology, Japan
wiragotama.github.io | gotama.w.aa@m.titech.ac.jp
Semantic Similarity Graph | wiragotama.github.io 1TextGraph-11,ACL 2017
Motivation
• Modeling coherence in linguistics theory into computational task (Barzilay & Lapata,
2008; Guinaudeau & Strube, 2013; Feng et al., 2014; Li and Hovy, 2014; Petersen et
al., 2015, Nguyen and Joty, 2017)
• Approaches
• Supervised – mostly
• Unsupervised – infrequent
2TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
Coherence
• Coherent text is integrated as a whole, rather than a series
of independent sentences (Bamberg, 1983; Garing, 2014)
• Every sentence in a coherent text has relation(s) to each
other (Halliday and Hasan, 1976; Mann and Thompson,
1988; Grosz et al., 1995;Wolf and Gibson, 2005)
• Lexical and semantic (meaning) continuity are indispensable
for coherent text (Feng et al., 2014)
3
Graph structure
Evaluate
coherence
through cohesion
Semantic
similarity
TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
Related Work: Entity Graph (1)
4
• Entity graph was introduced by Guinaudeau &
Strube (2013)
• Text -> Bipartite Graph -> Projection Graphs
• Coherence is achieved by cohesion: considers
repeated mention of entities and their syntactical
role (weight)
TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
• Graph data structure can represent the structure of text and relations among
sentences
• Coherence is achieved through lexical cohesion: repeated mention of entities.
• Disadvantage: cannot capture the relation between related-yet-not identical entities (Li and Hovy,
2014; Petersen et al., 2015)
• Solution: use distributed representation of words/sentences
• Relation between vertices in projection graph has to satisfy surface sequential
ordering
• Proposal: allows two directions (omit the constraint)
Related Work: Entity Graph (2)
5TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
• Formally, text is a graph 𝐺 𝑉, 𝐸 , where
• 𝑉 is a set of vertices, 𝑣& represents i-th sentence.
• 𝐸 is a set of edges, 𝑒&( represents relation (cohesion) from i-th to j-th sentence (weighted &
directed).
• Evaluate the coherence through cohesion
• Sentences are encoded into their meaning form
Average of summation of word vectors (distributed representation of words)
• An edge represents cohesion among sentences
Establishment of edge is decided as the operation of vectors representation of sentences
Proposed Method (1)
6TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
Proposed Method (2)
• Preceding AdjacentVertex (PAV)
• Single SimilarVertex (SSV)
• Multiple SimilarVertex (MSV)
7TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
# outgoing edges of vertex vi
• An edge is established from the sentence vertex in question to the other vertex
with the weight calculated by
• Text coherence measure (higher is better) is calculated by averaging the averaged
weight of outgoing edges from every vertex in the graph as
Semantic Similarity Graph 8
Proposed Method (3)
normalization
# vertices
TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
• Task 1: Discrimination (Barzilay and Lapata, 2008)
• Task 2: Insertion (Eisner and Charniak, 2011)
• Both tasks evaluate how well the methods in comparing coherence between texts
9
Evaluation
TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
S4• The goal is to compare original vs. permutated text
• Program is considered successful when giving greater score to
the more coherent (original) text
• Dataset: 683 WSJ (LDC) texts, 13586 permutations (avg. 24
sentences, 521 tokens)
Evaluation: Discrimination Task
10TextGraph-11,ACL 2017
S1
S2
S3
S4
S1
S2
S3
original permutated
Semantic Similarity Graph | wiragotama.github.io
• Difference of performance is statistically significant at
p < 0.05
• PAV > MSV > Entity Graph
Cohesion is not only about repeating mention of
entities
• PAV – MSV pair shares 88.3% same judgement
(largest).
Local (adjacent) cohesion is possibly more important
than long-distance cohesion
Result: Discrimination Task
11
Method Accuracy
PAV 0.774
SSV 0.676
MSV 0.741
Entity Graph 0.725
TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
• Insertion task is more important than discrimination task
• It was proposed by Eisner and Charniak (2011):
• Given a text, take out a sentence (randomly), then place it into other positions
• Program is considered successful if it prefers to insert take-out-sentence at its original position
rather than arbitrary (distorted) positions
• Our Proposal: useTOEFL iBT insertion-type questions
Evaluation: Insertion Task
12TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
?
• A text is coherent even without the
insertion sentence
• Preservation of coherence is achieved
when the question-sentence is inserted
in the correct place but disrupt
coherence otherwise
• 104 questions
(avg. 7 sentences, 139 tokens)
TOEFL iBT Insertion-type Question
13
(A)	The	raising	of	livestock	is	a	major	economic	
activity	in	semiarid	lands,	where	grasses	are	
generally	the	dominant	type	of	natural	vegetation.	
(B)	The	consequences	of	an	excessive	number	of	
livestock	grazing	in	an	area	are	the	reduction	of	the	
vegetation	cover	and	trampling	and	pulverization	
of	the	soil.	(C) This	is	usually	followed	by	the	
drying	of	the	soil	and	accelerated	erosion.	(D)
Question:	Insert	the	following	sentence	into	one	of	
(A)-(D)
question	sentence	=	"This	economic	reliance	on	
livestock	in	certain	regions	makes	large	tracts	of	
land	susceptible	to	overgrazing.”
correct	answer	=	B
TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
• Difference in every pair of methods is not statistically significant at p < 0.05
• 14 questions are answered incorrectly by PAV, but correctly by SSV.
• In these questions, SSV tends to establish the relationship between distance
sentences (dist = 2.8). For example, exemplification text
Result: Insertion Task
14
Method Accuracy
PAV 0.356
SSV 0.346
MSV 0.327
Entity Graph 0.260
TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
• Coherence can be achieved through cohesion (lexical and semantic continuity)
• Local cohesion is more important than long-distance cohesion in evaluating
coherence, but long-distance cohesion can also contribute as well
• (PAV > {SSV, MSV})
• We need to introduce a more refined mechanism for incorporating distant sentence relations.
• The representation of sentences and method to establish edges would be direct
targets of the refinement
Conclusion and Future Work
15TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
Appendix
16
Discrimination Task
Method Accuracy
PAV 0.774
SSV 0.676
MSV 0.741
Entity Grid 0.845
Entity Graph 0.725
Insertion Task
Method Accuracy
PAV 0.356
SSV 0.346
MSV 0.327
Entity Grid 0.346
Entity Graph 0.260
TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
References	(1)
1. Alphie G. Garing. 2014. Coherence in argumentative essays of first year college of liberal arts students at de la
salle university. DLSU Research Congress.
2. Barbara J. Grosz, Scott Weinstein, and Aravind K. Joshi. 1995. Centering: A framework for modeling the local
coherence of discourse. Computational Linguistics 21(2):203-225.
3. Betty Bamberg. 1983.What makes a text coherent. College Composition and Communication 34(4):417-429.
4. Camille Guinaudeau and Michael Strube. 2013. Graph-based local coherence modeling . In Proceedings of the
51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for
Computational Linguistics, Sofia, Bulgaria, pages 93–103.
5. Casper Petersen, Christina Lioma, Jakob Grue Simonsen, and Birger Larsen. 2015. Entropy and graph based
modelling of document coherence using discourse entities: An application to IR. In Proceedings of the 2015
International Conference on The Theory of Information Retrieval, pages 191-200.
6. Dat Tien Nguyen and Shafiq Joty. 2017. A neural local coherence model. In Proceedings of Annual meeting for
association for computational linguistics., pages 1320-1330.
7. Florian Wolf and Edward Gibson. 2005. Representing discourse coherence: A corpus-based study.
Computational Linguistics 31(2):249–288.
17TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
References	(2)
8. Jiwei Li and Eduard Hovy. 2014. A model of coherence based on distributed sentence representation. In
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Association for
Computational Linguistics, Doha, Qatar, pages 2039–2048.
9. Micha Eisner and Eugene Charniak. 2011. Extending the entity grid with entity-specific features. In Proceedings
of the 49th Annual Meeting of Association for Computational Linguistics: HLT short papers., pages 125-129.
10. M.A.K Halliday and Ruqaiya Hasan. 1976. Cohesion in English. Longman, Singapore.
11. Regina Barzilay and Mirela Lapata. 2008. Modeling local coherence: Entity based approach. Computational
Linguistics 34(1):1–34.
12. Vanessa Wei Feng, Ziheng Lin, and Graeme Hirst. 2014. The impact of deep hierarchical discourse structures in
the evaluation of text coherence . In Proceedings of COLING 2014, the 25th International Conference on
Computational Linguistics: Technical Papers. Dublin City University and Association for Computational Linguistics,
Dublin, Ireland, pages 940–949.
13. William C. Mann and Sandra A. Thompson. 1988. Rhetorical structure theory: Toward a functional theory of
text organization. Text 8(3):243–281.
18TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io

More Related Content

What's hot

Dynamic Question Answer Generator An Enhanced Approach to Question Generation
Dynamic Question Answer Generator An Enhanced Approach to Question GenerationDynamic Question Answer Generator An Enhanced Approach to Question Generation
Dynamic Question Answer Generator An Enhanced Approach to Question Generationijtsrd
 
An Improved Similarity Matching based Clustering Framework for Short and Sent...
An Improved Similarity Matching based Clustering Framework for Short and Sent...An Improved Similarity Matching based Clustering Framework for Short and Sent...
An Improved Similarity Matching based Clustering Framework for Short and Sent...IJECEIAES
 
Lifelong Topic Modelling presentation
Lifelong Topic Modelling presentation Lifelong Topic Modelling presentation
Lifelong Topic Modelling presentation Daniele Di Mitri
 
Biomedical Word Sense Disambiguation presentation [Autosaved]
Biomedical Word Sense Disambiguation presentation [Autosaved]Biomedical Word Sense Disambiguation presentation [Autosaved]
Biomedical Word Sense Disambiguation presentation [Autosaved]akm sabbir
 
Discovering Novel Information with sentence Level clustering From Multi-docu...
Discovering Novel Information with sentence Level clustering  From Multi-docu...Discovering Novel Information with sentence Level clustering  From Multi-docu...
Discovering Novel Information with sentence Level clustering From Multi-docu...irjes
 
An introduction to compositional models in distributional semantics
An introduction to compositional models in distributional semanticsAn introduction to compositional models in distributional semantics
An introduction to compositional models in distributional semanticsAndre Freitas
 
An Approach to Automated Learning of Conceptual Graphs from Text
An Approach to Automated Learning of Conceptual Graphs from TextAn Approach to Automated Learning of Conceptual Graphs from Text
An Approach to Automated Learning of Conceptual Graphs from TextFulvio Rotella
 
Evaluation of subjective answers using glsa enhanced with contextual synonymy
Evaluation of subjective answers using glsa enhanced with contextual synonymyEvaluation of subjective answers using glsa enhanced with contextual synonymy
Evaluation of subjective answers using glsa enhanced with contextual synonymyijnlc
 
Possibility of interdisciplinary research software engineering andnatural lan...
Possibility of interdisciplinary research software engineering andnatural lan...Possibility of interdisciplinary research software engineering andnatural lan...
Possibility of interdisciplinary research software engineering andnatural lan...Nakul Sharma
 
Rules for inducing hierarchies from social tagging data
Rules for inducing hierarchies from social tagging dataRules for inducing hierarchies from social tagging data
Rules for inducing hierarchies from social tagging dataHang Dong
 

What's hot (14)

Dynamic Question Answer Generator An Enhanced Approach to Question Generation
Dynamic Question Answer Generator An Enhanced Approach to Question GenerationDynamic Question Answer Generator An Enhanced Approach to Question Generation
Dynamic Question Answer Generator An Enhanced Approach to Question Generation
 
New word analogy corpus
New word analogy corpusNew word analogy corpus
New word analogy corpus
 
An Improved Similarity Matching based Clustering Framework for Short and Sent...
An Improved Similarity Matching based Clustering Framework for Short and Sent...An Improved Similarity Matching based Clustering Framework for Short and Sent...
An Improved Similarity Matching based Clustering Framework for Short and Sent...
 
Lifelong Topic Modelling presentation
Lifelong Topic Modelling presentation Lifelong Topic Modelling presentation
Lifelong Topic Modelling presentation
 
Biomedical Word Sense Disambiguation presentation [Autosaved]
Biomedical Word Sense Disambiguation presentation [Autosaved]Biomedical Word Sense Disambiguation presentation [Autosaved]
Biomedical Word Sense Disambiguation presentation [Autosaved]
 
Discovering Novel Information with sentence Level clustering From Multi-docu...
Discovering Novel Information with sentence Level clustering  From Multi-docu...Discovering Novel Information with sentence Level clustering  From Multi-docu...
Discovering Novel Information with sentence Level clustering From Multi-docu...
 
Collaborative DL
Collaborative DLCollaborative DL
Collaborative DL
 
An introduction to compositional models in distributional semantics
An introduction to compositional models in distributional semanticsAn introduction to compositional models in distributional semantics
An introduction to compositional models in distributional semantics
 
J045055560
J045055560J045055560
J045055560
 
An Approach to Automated Learning of Conceptual Graphs from Text
An Approach to Automated Learning of Conceptual Graphs from TextAn Approach to Automated Learning of Conceptual Graphs from Text
An Approach to Automated Learning of Conceptual Graphs from Text
 
Evaluation of subjective answers using glsa enhanced with contextual synonymy
Evaluation of subjective answers using glsa enhanced with contextual synonymyEvaluation of subjective answers using glsa enhanced with contextual synonymy
Evaluation of subjective answers using glsa enhanced with contextual synonymy
 
Possibility of interdisciplinary research software engineering andnatural lan...
Possibility of interdisciplinary research software engineering andnatural lan...Possibility of interdisciplinary research software engineering andnatural lan...
Possibility of interdisciplinary research software engineering andnatural lan...
 
Em algorithm
Em algorithmEm algorithm
Em algorithm
 
Rules for inducing hierarchies from social tagging data
Rules for inducing hierarchies from social tagging dataRules for inducing hierarchies from social tagging data
Rules for inducing hierarchies from social tagging data
 

Similar to Jan Wira Gotama Putra - 2017 - Evaluating Text Coherence Based on Semantic Similarity Graph

THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESkevig
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESkevig
 
Dissertation defense slides on "Semantic Analysis for Improved Multi-document...
Dissertation defense slides on "Semantic Analysis for Improved Multi-document...Dissertation defense slides on "Semantic Analysis for Improved Multi-document...
Dissertation defense slides on "Semantic Analysis for Improved Multi-document...Quinsulon Israel
 
Interactive Analysis of Word Vector Embeddings
Interactive Analysis of Word Vector EmbeddingsInteractive Analysis of Word Vector Embeddings
Interactive Analysis of Word Vector Embeddingsgleicher
 
Deep Neural Methods for Retrieval
Deep Neural Methods for RetrievalDeep Neural Methods for Retrieval
Deep Neural Methods for RetrievalBhaskar Mitra
 
IRJET-Semantic Similarity Between Sentences
IRJET-Semantic Similarity Between SentencesIRJET-Semantic Similarity Between Sentences
IRJET-Semantic Similarity Between SentencesIRJET Journal
 
Semantic Similarity Between Sentences
Semantic Similarity Between SentencesSemantic Similarity Between Sentences
Semantic Similarity Between SentencesIRJET Journal
 
Barzilay & Lapata 2008 presentation
Barzilay & Lapata 2008 presentationBarzilay & Lapata 2008 presentation
Barzilay & Lapata 2008 presentationRichard Littauer
 
Semantic Relatedness of Web Resources by XESA - Philipp Scholl
Semantic Relatedness of Web Resources by XESA - Philipp SchollSemantic Relatedness of Web Resources by XESA - Philipp Scholl
Semantic Relatedness of Web Resources by XESA - Philipp SchollCROKODIl consortium
 
Optimizing Near-Synonym System
Optimizing Near-Synonym SystemOptimizing Near-Synonym System
Optimizing Near-Synonym SystemSiyuan Zhou
 
bồn tắm jacuzzi.docx
bồn tắm jacuzzi.docxbồn tắm jacuzzi.docx
bồn tắm jacuzzi.docxDarosVitNam
 
Deep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & OpportunitiesDeep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & OpportunitiesMatthew Lease
 
An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...
An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...
An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...iosrjce
 
G04124041046
G04124041046G04124041046
G04124041046IOSR-JEN
 
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...Hiroki Shimanaka
 

Similar to Jan Wira Gotama Putra - 2017 - Evaluating Text Coherence Based on Semantic Similarity Graph (20)

semeval2016
semeval2016semeval2016
semeval2016
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
 
Dissertation defense slides on "Semantic Analysis for Improved Multi-document...
Dissertation defense slides on "Semantic Analysis for Improved Multi-document...Dissertation defense slides on "Semantic Analysis for Improved Multi-document...
Dissertation defense slides on "Semantic Analysis for Improved Multi-document...
 
Interactive Analysis of Word Vector Embeddings
Interactive Analysis of Word Vector EmbeddingsInteractive Analysis of Word Vector Embeddings
Interactive Analysis of Word Vector Embeddings
 
Networks and Natural Language Processing
Networks and Natural Language ProcessingNetworks and Natural Language Processing
Networks and Natural Language Processing
 
Deep Neural Methods for Retrieval
Deep Neural Methods for RetrievalDeep Neural Methods for Retrieval
Deep Neural Methods for Retrieval
 
IRJET-Semantic Similarity Between Sentences
IRJET-Semantic Similarity Between SentencesIRJET-Semantic Similarity Between Sentences
IRJET-Semantic Similarity Between Sentences
 
Semantic Similarity Between Sentences
Semantic Similarity Between SentencesSemantic Similarity Between Sentences
Semantic Similarity Between Sentences
 
P13 corley
P13 corleyP13 corley
P13 corley
 
Barzilay & Lapata 2008 presentation
Barzilay & Lapata 2008 presentationBarzilay & Lapata 2008 presentation
Barzilay & Lapata 2008 presentation
 
The Duet model
The Duet modelThe Duet model
The Duet model
 
Semantic Relatedness of Web Resources by XESA - Philipp Scholl
Semantic Relatedness of Web Resources by XESA - Philipp SchollSemantic Relatedness of Web Resources by XESA - Philipp Scholl
Semantic Relatedness of Web Resources by XESA - Philipp Scholl
 
Optimizing Near-Synonym System
Optimizing Near-Synonym SystemOptimizing Near-Synonym System
Optimizing Near-Synonym System
 
bồn tắm jacuzzi.docx
bồn tắm jacuzzi.docxbồn tắm jacuzzi.docx
bồn tắm jacuzzi.docx
 
Deep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & OpportunitiesDeep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & Opportunities
 
F017243241
F017243241F017243241
F017243241
 
An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...
An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...
An Enhanced Suffix Tree Approach to Measure Semantic Similarity between Multi...
 
G04124041046
G04124041046G04124041046
G04124041046
 
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
 

More from Association for Computational Linguistics

Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Association for Computational Linguistics
 
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Association for Computational Linguistics
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsAssociation for Computational Linguistics
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsAssociation for Computational Linguistics
 
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Association for Computational Linguistics
 
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Association for Computational Linguistics
 
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Association for Computational Linguistics
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopAssociation for Computational Linguistics
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...Association for Computational Linguistics
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...Association for Computational Linguistics
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Association for Computational Linguistics
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Association for Computational Linguistics
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopAssociation for Computational Linguistics
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Association for Computational Linguistics
 

More from Association for Computational Linguistics (20)

Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal TextMuis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
 
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
 
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour AnalysisCastro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
 
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
 
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Elior Sulem - 2018 - Semantic Structural Evaluation for Text SimplificationElior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
 
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
 
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
 
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
 
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
 

Recently uploaded

PSYPACT- Practicing Over State Lines May 2024.pptx
PSYPACT- Practicing Over State Lines May 2024.pptxPSYPACT- Practicing Over State Lines May 2024.pptx
PSYPACT- Practicing Over State Lines May 2024.pptxMarlene Maheu
 
Graduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptxGraduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptxneillewis46
 
MichaelStarkes_UncutGemsProjectSummary.pdf
MichaelStarkes_UncutGemsProjectSummary.pdfMichaelStarkes_UncutGemsProjectSummary.pdf
MichaelStarkes_UncutGemsProjectSummary.pdfmstarkes24
 
Spring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community PartnershipsSpring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community Partnershipsexpandedwebsite
 
Financial Accounting IFRS, 3rd Edition-dikompresi.pdf
Financial Accounting IFRS, 3rd Edition-dikompresi.pdfFinancial Accounting IFRS, 3rd Edition-dikompresi.pdf
Financial Accounting IFRS, 3rd Edition-dikompresi.pdfMinawBelay
 
ANTI PARKISON DRUGS.pptx
ANTI         PARKISON          DRUGS.pptxANTI         PARKISON          DRUGS.pptx
ANTI PARKISON DRUGS.pptxPoojaSen20
 
Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...
Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...
Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...Denish Jangid
 
Software testing for project report .pdf
Software testing for project report .pdfSoftware testing for project report .pdf
Software testing for project report .pdfKamal Acharya
 
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...Nguyen Thanh Tu Collection
 
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...Nguyen Thanh Tu Collection
 
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjjStl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjjMohammed Sikander
 
Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatment
 Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatment Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatment
Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatmentsaipooja36
 
Navigating the Misinformation Minefield: The Role of Higher Education in the ...
Navigating the Misinformation Minefield: The Role of Higher Education in the ...Navigating the Misinformation Minefield: The Role of Higher Education in the ...
Navigating the Misinformation Minefield: The Role of Higher Education in the ...Mark Carrigan
 
....................Muslim-Law notes.pdf
....................Muslim-Law notes.pdf....................Muslim-Law notes.pdf
....................Muslim-Law notes.pdfVikramadityaRaj
 
demyelinated disorder: multiple sclerosis.pptx
demyelinated disorder: multiple sclerosis.pptxdemyelinated disorder: multiple sclerosis.pptx
demyelinated disorder: multiple sclerosis.pptxMohamed Rizk Khodair
 
Dementia (Alzheimer & vasular dementia).
Dementia (Alzheimer & vasular dementia).Dementia (Alzheimer & vasular dementia).
Dementia (Alzheimer & vasular dementia).Mohamed Rizk Khodair
 

Recently uploaded (20)

PSYPACT- Practicing Over State Lines May 2024.pptx
PSYPACT- Practicing Over State Lines May 2024.pptxPSYPACT- Practicing Over State Lines May 2024.pptx
PSYPACT- Practicing Over State Lines May 2024.pptx
 
Graduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptxGraduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptx
 
MichaelStarkes_UncutGemsProjectSummary.pdf
MichaelStarkes_UncutGemsProjectSummary.pdfMichaelStarkes_UncutGemsProjectSummary.pdf
MichaelStarkes_UncutGemsProjectSummary.pdf
 
Spring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community PartnershipsSpring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community Partnerships
 
“O BEIJO” EM ARTE .
“O BEIJO” EM ARTE                       .“O BEIJO” EM ARTE                       .
“O BEIJO” EM ARTE .
 
Financial Accounting IFRS, 3rd Edition-dikompresi.pdf
Financial Accounting IFRS, 3rd Edition-dikompresi.pdfFinancial Accounting IFRS, 3rd Edition-dikompresi.pdf
Financial Accounting IFRS, 3rd Edition-dikompresi.pdf
 
ANTI PARKISON DRUGS.pptx
ANTI         PARKISON          DRUGS.pptxANTI         PARKISON          DRUGS.pptx
ANTI PARKISON DRUGS.pptx
 
Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...
Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...
Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...
 
Software testing for project report .pdf
Software testing for project report .pdfSoftware testing for project report .pdf
Software testing for project report .pdf
 
Word Stress rules esl .pptx
Word Stress rules esl               .pptxWord Stress rules esl               .pptx
Word Stress rules esl .pptx
 
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
 
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...
 
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjjStl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
 
Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatment
 Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatment Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatment
Envelope of Discrepancy in Orthodontics: Enhancing Precision in Treatment
 
Navigating the Misinformation Minefield: The Role of Higher Education in the ...
Navigating the Misinformation Minefield: The Role of Higher Education in the ...Navigating the Misinformation Minefield: The Role of Higher Education in the ...
Navigating the Misinformation Minefield: The Role of Higher Education in the ...
 
....................Muslim-Law notes.pdf
....................Muslim-Law notes.pdf....................Muslim-Law notes.pdf
....................Muslim-Law notes.pdf
 
demyelinated disorder: multiple sclerosis.pptx
demyelinated disorder: multiple sclerosis.pptxdemyelinated disorder: multiple sclerosis.pptx
demyelinated disorder: multiple sclerosis.pptx
 
Dementia (Alzheimer & vasular dementia).
Dementia (Alzheimer & vasular dementia).Dementia (Alzheimer & vasular dementia).
Dementia (Alzheimer & vasular dementia).
 
IPL Online Quiz by Pragya; Question Set.
IPL Online Quiz by Pragya; Question Set.IPL Online Quiz by Pragya; Question Set.
IPL Online Quiz by Pragya; Question Set.
 
Operations Management - Book1.p - Dr. Abdulfatah A. Salem
Operations Management - Book1.p  - Dr. Abdulfatah A. SalemOperations Management - Book1.p  - Dr. Abdulfatah A. Salem
Operations Management - Book1.p - Dr. Abdulfatah A. Salem
 

Jan Wira Gotama Putra - 2017 - Evaluating Text Coherence Based on Semantic Similarity Graph

  • 1. Evaluating Text Coherence Based on Semantic Similarity Graph Jan Wira Gotama Putra and Takenobu Tokunaga Tokyo Institute of Technology, Japan wiragotama.github.io | gotama.w.aa@m.titech.ac.jp Semantic Similarity Graph | wiragotama.github.io 1TextGraph-11,ACL 2017
  • 2. Motivation • Modeling coherence in linguistics theory into computational task (Barzilay & Lapata, 2008; Guinaudeau & Strube, 2013; Feng et al., 2014; Li and Hovy, 2014; Petersen et al., 2015, Nguyen and Joty, 2017) • Approaches • Supervised – mostly • Unsupervised – infrequent 2TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
  • 3. Coherence • Coherent text is integrated as a whole, rather than a series of independent sentences (Bamberg, 1983; Garing, 2014) • Every sentence in a coherent text has relation(s) to each other (Halliday and Hasan, 1976; Mann and Thompson, 1988; Grosz et al., 1995;Wolf and Gibson, 2005) • Lexical and semantic (meaning) continuity are indispensable for coherent text (Feng et al., 2014) 3 Graph structure Evaluate coherence through cohesion Semantic similarity TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
  • 4. Related Work: Entity Graph (1) 4 • Entity graph was introduced by Guinaudeau & Strube (2013) • Text -> Bipartite Graph -> Projection Graphs • Coherence is achieved by cohesion: considers repeated mention of entities and their syntactical role (weight) TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
  • 5. • Graph data structure can represent the structure of text and relations among sentences • Coherence is achieved through lexical cohesion: repeated mention of entities. • Disadvantage: cannot capture the relation between related-yet-not identical entities (Li and Hovy, 2014; Petersen et al., 2015) • Solution: use distributed representation of words/sentences • Relation between vertices in projection graph has to satisfy surface sequential ordering • Proposal: allows two directions (omit the constraint) Related Work: Entity Graph (2) 5TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
  • 6. • Formally, text is a graph 𝐺 𝑉, 𝐸 , where • 𝑉 is a set of vertices, 𝑣& represents i-th sentence. • 𝐸 is a set of edges, 𝑒&( represents relation (cohesion) from i-th to j-th sentence (weighted & directed). • Evaluate the coherence through cohesion • Sentences are encoded into their meaning form Average of summation of word vectors (distributed representation of words) • An edge represents cohesion among sentences Establishment of edge is decided as the operation of vectors representation of sentences Proposed Method (1) 6TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
  • 7. Proposed Method (2) • Preceding AdjacentVertex (PAV) • Single SimilarVertex (SSV) • Multiple SimilarVertex (MSV) 7TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
  • 8. # outgoing edges of vertex vi • An edge is established from the sentence vertex in question to the other vertex with the weight calculated by • Text coherence measure (higher is better) is calculated by averaging the averaged weight of outgoing edges from every vertex in the graph as Semantic Similarity Graph 8 Proposed Method (3) normalization # vertices TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
  • 9. • Task 1: Discrimination (Barzilay and Lapata, 2008) • Task 2: Insertion (Eisner and Charniak, 2011) • Both tasks evaluate how well the methods in comparing coherence between texts 9 Evaluation TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
  • 10. S4• The goal is to compare original vs. permutated text • Program is considered successful when giving greater score to the more coherent (original) text • Dataset: 683 WSJ (LDC) texts, 13586 permutations (avg. 24 sentences, 521 tokens) Evaluation: Discrimination Task 10TextGraph-11,ACL 2017 S1 S2 S3 S4 S1 S2 S3 original permutated Semantic Similarity Graph | wiragotama.github.io
  • 11. • Difference of performance is statistically significant at p < 0.05 • PAV > MSV > Entity Graph Cohesion is not only about repeating mention of entities • PAV – MSV pair shares 88.3% same judgement (largest). Local (adjacent) cohesion is possibly more important than long-distance cohesion Result: Discrimination Task 11 Method Accuracy PAV 0.774 SSV 0.676 MSV 0.741 Entity Graph 0.725 TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
  • 12. • Insertion task is more important than discrimination task • It was proposed by Eisner and Charniak (2011): • Given a text, take out a sentence (randomly), then place it into other positions • Program is considered successful if it prefers to insert take-out-sentence at its original position rather than arbitrary (distorted) positions • Our Proposal: useTOEFL iBT insertion-type questions Evaluation: Insertion Task 12TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io ?
  • 13. • A text is coherent even without the insertion sentence • Preservation of coherence is achieved when the question-sentence is inserted in the correct place but disrupt coherence otherwise • 104 questions (avg. 7 sentences, 139 tokens) TOEFL iBT Insertion-type Question 13 (A) The raising of livestock is a major economic activity in semiarid lands, where grasses are generally the dominant type of natural vegetation. (B) The consequences of an excessive number of livestock grazing in an area are the reduction of the vegetation cover and trampling and pulverization of the soil. (C) This is usually followed by the drying of the soil and accelerated erosion. (D) Question: Insert the following sentence into one of (A)-(D) question sentence = "This economic reliance on livestock in certain regions makes large tracts of land susceptible to overgrazing.” correct answer = B TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
  • 14. • Difference in every pair of methods is not statistically significant at p < 0.05 • 14 questions are answered incorrectly by PAV, but correctly by SSV. • In these questions, SSV tends to establish the relationship between distance sentences (dist = 2.8). For example, exemplification text Result: Insertion Task 14 Method Accuracy PAV 0.356 SSV 0.346 MSV 0.327 Entity Graph 0.260 TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
  • 15. • Coherence can be achieved through cohesion (lexical and semantic continuity) • Local cohesion is more important than long-distance cohesion in evaluating coherence, but long-distance cohesion can also contribute as well • (PAV > {SSV, MSV}) • We need to introduce a more refined mechanism for incorporating distant sentence relations. • The representation of sentences and method to establish edges would be direct targets of the refinement Conclusion and Future Work 15TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
  • 16. Appendix 16 Discrimination Task Method Accuracy PAV 0.774 SSV 0.676 MSV 0.741 Entity Grid 0.845 Entity Graph 0.725 Insertion Task Method Accuracy PAV 0.356 SSV 0.346 MSV 0.327 Entity Grid 0.346 Entity Graph 0.260 TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
  • 17. References (1) 1. Alphie G. Garing. 2014. Coherence in argumentative essays of first year college of liberal arts students at de la salle university. DLSU Research Congress. 2. Barbara J. Grosz, Scott Weinstein, and Aravind K. Joshi. 1995. Centering: A framework for modeling the local coherence of discourse. Computational Linguistics 21(2):203-225. 3. Betty Bamberg. 1983.What makes a text coherent. College Composition and Communication 34(4):417-429. 4. Camille Guinaudeau and Michael Strube. 2013. Graph-based local coherence modeling . In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Sofia, Bulgaria, pages 93–103. 5. Casper Petersen, Christina Lioma, Jakob Grue Simonsen, and Birger Larsen. 2015. Entropy and graph based modelling of document coherence using discourse entities: An application to IR. In Proceedings of the 2015 International Conference on The Theory of Information Retrieval, pages 191-200. 6. Dat Tien Nguyen and Shafiq Joty. 2017. A neural local coherence model. In Proceedings of Annual meeting for association for computational linguistics., pages 1320-1330. 7. Florian Wolf and Edward Gibson. 2005. Representing discourse coherence: A corpus-based study. Computational Linguistics 31(2):249–288. 17TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io
  • 18. References (2) 8. Jiwei Li and Eduard Hovy. 2014. A model of coherence based on distributed sentence representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Doha, Qatar, pages 2039–2048. 9. Micha Eisner and Eugene Charniak. 2011. Extending the entity grid with entity-specific features. In Proceedings of the 49th Annual Meeting of Association for Computational Linguistics: HLT short papers., pages 125-129. 10. M.A.K Halliday and Ruqaiya Hasan. 1976. Cohesion in English. Longman, Singapore. 11. Regina Barzilay and Mirela Lapata. 2008. Modeling local coherence: Entity based approach. Computational Linguistics 34(1):1–34. 12. Vanessa Wei Feng, Ziheng Lin, and Graeme Hirst. 2014. The impact of deep hierarchical discourse structures in the evaluation of text coherence . In Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers. Dublin City University and Association for Computational Linguistics, Dublin, Ireland, pages 940–949. 13. William C. Mann and Sandra A. Thompson. 1988. Rhetorical structure theory: Toward a functional theory of text organization. Text 8(3):243–281. 18TextGraph-11,ACL 2017Semantic Similarity Graph | wiragotama.github.io