SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
Joint Opinion Relation Detection 
Using One-Class Deep Neural Network 
COLING 2014 reading @ Komachi Lab 
Komachi Lab. M1 Peinan ZHANG
Abstract & Introduction 
有効な opinion relation には3つ必要な条件がある。 
1. 感情極性を含んだ opinion word 
2. 現在のドメインに関連した opinion target 
3. opinion word が opinion target を修飾している 
これをタプルとして表現すると 
Komachi Lab. M1 Peinan ZHANG 
o = (s, t, r) 
となり、それぞれ 
s = opinion word 
t = opinion target 
r = linking relation between s and t 
2
Assumption 1 
Terms are likely to have linking relation with the seed 
terms are believed to be opinion words or opinion 
targets. 
seed terms とのリンクを持つ単語は opinion words 
もしくは opinion targets である。 
Komachi Lab. M1 Peinan ZHANG 
3
Abstract & Introduction 
Example 1. 
This mp3 has a clear screen. 
s = {clear}, t = {screen}, r = {clear, screen} 
Example 2. 
This mp3 has many good things. 
s = {good}, t = {things}, r = {good, things} 
o = (s, t, r) 
s = opinion word 
t = opinion target 
r = linking relation 
between s and t 
Is the word ‘things’ related to the domain ‘mp3’ ? 
Komachi Lab. M1 Peinan ZHANG 
4 
NO!!
The Problems 
前述した3つの条件 
1. 感情極性を含んだ opinion word 
2. 現在のドメインに関連した opinion target 
3. opinion word が opinion target を修飾している 
のうち、2つにのみ焦点を当てていた。 
そのため先行研究は雑多なノイズ単語を抽出してしま 
い、その影響を受けることになった。 
Minging Hu and Bing Liu., 2004., Mining and summarizing 
customer reviews, ACM SIGKDD 
Komachi Lab. M1 Peinan ZHANG 
5
Assumption 2 
The three requirements: the opinion word, the 
opinion target and the linking relation between them, 
shall be all verified during opinion relation detection. 
3つの必要な条件である opinion word と opinion 
target 、それらの linking relation は同時に使わなけ 
ればならない。 
Komachi Lab. M1 Peinan ZHANG 
6
Approach 
a novel Joint Opinion Relation Detection Method 
opinion words, opinion targets and linking relations are simultaneously 
considered in a classification scenario. 
HOW TO 
1. provide a small set of seeds for supervision, which are regarded 
as positive labeled examples. 
n small set of seeds: opinion words, opinion targets 
n negative examples (i.e. noise terms) are hard to acquire, because we 
do not know which term is not an opinion word or target. 
2. This leads to One-Class Classification (OCC) problem. 
n the key to OCC is semantic similarity measuring between terms. 
n Deep Neural Network (DNN) with word embeddings is a powerful 
tool to handle this problem. 
Komachi Lab. M1 Peinan ZHANG 
7
Approach 
a novel Joint Opinion Relation Detection Method 
opinion words, opinion targets and linking relations are simultaneously 
considered in a classification scenario. 
HOW TO 
1. provide a small set of seeds for supervision, which are regarded 
as positive labeled examples. 
n small set: opinion words, opinion targets 
n negative examples (i.e. noise terms) are hard to acquire, because we 
do not know which term is not an opinion word or target. 
2. This leads to One-Class Classification (OCC) problem. 
n the key to OCC is semantic similarity measuring between terms. 
n Deep Neural Network (DNN) with word embeddings is a powerful 
tool to handle this problem. 
Komachi Lab. M1 Peinan ZHANG 
8
The Architecture of OCDNN 
Komachi Lab. M1 Peinan ZHANG 
9 
Consists of two levels. 
Lower Level: Learn features 
n Left 
uses words embedding to 
represent opinion words/ 
targets. 
n Right 
maps linking relations to 
embedding vectors by a 
recursive auto-encoder 
Higher Level: use the learnt 
feature to perform one-class 
classification
Outline 
1. Abstract & Introduction 
2. Approach 
3. The Architecture of OCDNN 
1. Generate Opinion Seeds 
2. Generate Opinion Relation Candidates 
3. Represent Words 
4. Represent Linking Relation 
5. One-Class Classification 
4. Datasets & Experiments 
5. Conclusion 
Komachi Lab. M1 Peinan ZHANG 
10
Outline 
1. Abstract & Introduction 
2. Approach 
3. The Architecture of OCDNN 
1. Generate Opinion Seeds 
2. Generate Opinion Relation Candidates 
3. Represent Words 
4. Represent Linking Relation 
5. One-Class Classification 
4. Datasets & Experiments 
5. Conclusion 
Komachi Lab. M1 Peinan ZHANG 
11
Opinion Seed Generation 
Opinion Word Seeds 
We manually pick 186 domain independent opinion words from 
SentiWordNet as the opinion seed set SS. 
Opinion Target Seeds 
We measure Likelihood Ratio Tests (LRT) between domain name 
and all opinion target candidates. Then highest N terms with 
highest LRT scores are added into the opinion target seed set TS. 
Linking Relation Seeds 
We employ an automatic syntactic opinion pattern learning method 
called Sentiment Graph Walking and get 12 opinion patterns with 
highest confidence as the linking relation seed set RS. 
Komachi Lab. M1 Peinan ZHANG 
12
Opinion Relation Candidate Generation 
Opinion Word Candidates: 
p adjectives or verbs 
Opinion Target Candidates: 
p noun or noun phrases 
Opinion Relation Candidates: 
p get dependency tree of a sentence using Stanford Parser 
p the shortest dependency path between a c_s and a c_t is 
taken as a c_r 
p to avoid introducing too many noise candidates, we 
constrain that there are at most 4 terms in a c_r 
Komachi Lab. M1 Peinan ZHANG 
13
Word Representation by 
Word Embedding Learning 
Words are embedded into a hyperspace, where 2 words 
that are more semantically similar to each other are 
located closer. (something like word2vec) 
See more in the paper below, 
Ronan Collobert et al., 2011., Natural language processing 
(almost) from scratch, Journal of Machine Learning Research 
Komachi Lab. M1 Peinan ZHANG 
14
Linking Relation Representation by 
Using Recursive Auto-encoder 
Goal: represent the linking relation between an opinion 
word and an opinion target by a n-element vector as we do 
during word representation. 
We combined embedding vectors of words in a linking 
relation by a recursive auto-encoder according to syntactic 
dependency structure. 
Komachi Lab. M1 Peinan ZHANG 
15
Linking Relation Representation by 
Using Recursive Auto-encoder 
Komachi Lab. M1 Peinan ZHANG 
1. c_s と c_t との間の係 
り受け関係を取ってくる。 
2. c_s と c_t をそれぞれ 
[SC] と [ST] に置き換え 
る。 
3. 点線内を3層からなる 
auto-encoder とし、2つ 
の n-element vector を 
中間層で1つの n-element 
vector に圧縮す 
る(下式)。 
4. W を入力と出力のユー 
クリッド距離が最小にな 
るまで更新していく。 
16 
Example. too loud to listen to the player
One-Class Classification for 
Opinion Relation Detection 
We represent an opinion relation candidate c_o by a vector 
v_o=[v_s; v_t; v_r], and this vector v_o is to feed to 
upper level auto-encoder. 
For opinion relation detection, error scores that are smaller 
than a threshold theta are classified as positive. 
To estimate theta, we need to introduce a positive 
proportion (pp) score as follows, 
Komachi Lab. M1 Peinan ZHANG 
17
Opinion Target Expansion 
We apply bootstrapping to iteratively expand opinion 
targets seeds. 
p because the vocabulary of seed set is limited, which cannot fully 
represent the distribution of opinion targets. 
After training OCDNN, all opinion relation candidates are 
classified, and opinion targets are ranked in descent order 
by, 
Then, top M candidates are added into the target seed set 
TS for the next training iteration. 
Komachi Lab. M1 Peinan ZHANG 
18
Datasets 
Datasets 
p Customer Review Dataset (CRD) 
n contains review on five products (denoted by D1 to D5) 
p benchmark dataset on MP3 and Hotel 
p crawled from www.amazon.com, which involves Mattress and 
Phone 
Annotation 
p 10000 sentences are randomly selected from reviews and 
annotators are required to judge whether each term is an opinion 
word or an opinion target. 
p 5000 sentences are annotated for MP3 and Hotel. Annotators are 
required to carefully read through each sentence and find out 
every opinion relation detection. 
Komachi Lab. M1 Peinan ZHANG 
19
Evaluation Settings 
AdjRule 
extract opinion words/targets by using adjacency rules 
LRTBOOT 
bootstrapping algorithm which employs Likelihood Ration Test as the 
co-occurrence statistical measure 
DP 
denotes the Double Propagation algorithm 
DP-HITS 
enhanced version of DP by using HITS algorithm 
OCDNN 
proposed method. The target seed size N=40, the opinion targets 
expanded in each iteration M=20, and the max bootstrapping 
iteration number is X=10. 
Komachi Lab. M1 Peinan ZHANG 
20 
statistical co-occurrence-based 
method 
syntax-based method
Experiments 
DP-HITS does not extract opinion words so their results for 
opinion words are not taken into account. 
Komachi Lab. M1 Peinan ZHANG 
21
Experiments 
Komachi Lab. M1 Peinan ZHANG 
22
Experiments 
Komachi Lab. M1 Peinan ZHANG 
23 
n our method outperforms co-occurrence-based methods AdjRule 
and LRTBOOT 
n but achieves comparable or a little worse results than syntax-based 
methods DP and DP-HITS 
n because CRD is quite small, which only contains several 
hundred sentences for each product review set. In this case, 
methods based on careful-designed syntax rules have 
superiority over those based on statistics. 
n our method outperforms all of the competitors 
n OCDNN vs. DP-HITS: those two use similar term ranking metrics, 
but OCDNN significantly outperforms DP-HITS. Therefore, 
positive proportion is more effective than the importance score. 
n OCDNN vs. LRTBOOT: LRTBOOT is better recall but lower 
precision. This is because LRTBOOT follows Assumption 1, which 
suffers a lot from error propagation, while our joint classification 
approach effectively alleviates this issue.
Assumption 1 vs. Assumption 2 
Komachi Lab. M1 Peinan ZHANG 
24
Komachi Lab. M1 Peinan ZHANG 
25 
Assumption 1 vs. Assumption 2 
n OCDNN significantly outperforms all competitors. The average 
improvement of F-measure over the best competitor is 6% on 
CRD and 9% on Hotel and MP3. 
n As Assumption 1 only verifies 2 of the requirements, it would 
inevitably introduce noise terms. 
n For syntax-based method DP, it extracts many false opinion 
relations such as good thing and nice one or objective expressions 
like another mp3 and every mp3. 
n For co-occurrence statistical methods AdjRule and LRTBOOT, it is 
very hard to deal with ambiguous linking relations. For example, in 
phrase this mp3 is very good except the size, co-occurrence 
statistical methods could hardly tell which opinion target does 
good modify (mp3 or size).
Conclusion 
p この論文では joint opinion relation detection を One- 
Class Deep Neural Network に適応させて分類を行った。 
p 特徴的な点は、 opinion words/targets/relations を同時 
に参照して分類することにある。 
p そして実験では、条件の2つしか適応させていなかった 
手法よりも良い結果を示すことが出来た。 
Komachi Lab. M1 Peinan ZHANG 
26
Conclusion 
1. Abstract & Introduction 
2. Approach 
3. The Architecture of OCDNN 
1. Generate Opinion Seeds 
2. Generate Opinion Relation Candidates 
3. Represent Words 
4. Represent Linking Relation 
5. One-Class Classification 
4. Datasets & Experiments 
5. Conclusion 
Komachi Lab. M1 Peinan ZHANG 
27

Weitere ähnliche Inhalte

Was ist angesagt?

Cross lingual similarity discrimination with translation characteristics
Cross lingual similarity discrimination with translation characteristicsCross lingual similarity discrimination with translation characteristics
Cross lingual similarity discrimination with translation characteristicsijaia
 
Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...
Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...
Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...Lifeng (Aaron) Han
 
ENHANCING THE PERFORMANCE OF SENTIMENT ANALYSIS SUPERVISED LEARNING USING SEN...
ENHANCING THE PERFORMANCE OF SENTIMENT ANALYSIS SUPERVISED LEARNING USING SEN...ENHANCING THE PERFORMANCE OF SENTIMENT ANALYSIS SUPERVISED LEARNING USING SEN...
ENHANCING THE PERFORMANCE OF SENTIMENT ANALYSIS SUPERVISED LEARNING USING SEN...csandit
 
IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...
IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...
IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...IRJET Journal
 
Aspect mining and sentiment association
Aspect mining and sentiment associationAspect mining and sentiment association
Aspect mining and sentiment associationKoushik Ramachandra
 
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...Lifeng (Aaron) Han
 
Evaluation of hindi english mt systems, challenges and solutions
Evaluation of hindi english mt systems, challenges and solutionsEvaluation of hindi english mt systems, challenges and solutions
Evaluation of hindi english mt systems, challenges and solutionsSajeed Mahaboob
 
SemEval - Aspect Based Sentiment Analysis
SemEval - Aspect Based Sentiment AnalysisSemEval - Aspect Based Sentiment Analysis
SemEval - Aspect Based Sentiment AnalysisAditya Joshi
 
Natural Language Processing - Research and Application Trends
Natural Language Processing - Research and Application TrendsNatural Language Processing - Research and Application Trends
Natural Language Processing - Research and Application TrendsShreyas Suresh Rao
 
Compression-Based Parts-of-Speech Tagger for The Arabic Language
Compression-Based Parts-of-Speech Tagger for The Arabic LanguageCompression-Based Parts-of-Speech Tagger for The Arabic Language
Compression-Based Parts-of-Speech Tagger for The Arabic LanguageCSCJournals
 
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT Lifeng (Aaron) Han
 
A FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTE
A FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTEA FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTE
A FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTEijnlc
 
A FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTE
A FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTEA FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTE
A FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTEkevig
 
ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...
ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...
ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...Lifeng (Aaron) Han
 
Benchmarking transfer learning approaches for NLP
Benchmarking transfer learning approaches for NLPBenchmarking transfer learning approaches for NLP
Benchmarking transfer learning approaches for NLPYury Kashnitsky
 
IRJET- Vernacular Language Spell Checker & Autocorrection
IRJET- Vernacular Language Spell Checker & AutocorrectionIRJET- Vernacular Language Spell Checker & Autocorrection
IRJET- Vernacular Language Spell Checker & AutocorrectionIRJET Journal
 

Was ist angesagt? (20)

Cross lingual similarity discrimination with translation characteristics
Cross lingual similarity discrimination with translation characteristicsCross lingual similarity discrimination with translation characteristics
Cross lingual similarity discrimination with translation characteristics
 
Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...
Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...
Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...
 
ENHANCING THE PERFORMANCE OF SENTIMENT ANALYSIS SUPERVISED LEARNING USING SEN...
ENHANCING THE PERFORMANCE OF SENTIMENT ANALYSIS SUPERVISED LEARNING USING SEN...ENHANCING THE PERFORMANCE OF SENTIMENT ANALYSIS SUPERVISED LEARNING USING SEN...
ENHANCING THE PERFORMANCE OF SENTIMENT ANALYSIS SUPERVISED LEARNING USING SEN...
 
IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...
IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...
IRJET- Survey on Deep Learning Approaches for Phrase Structure Identification...
 
Aspect mining and sentiment association
Aspect mining and sentiment associationAspect mining and sentiment association
Aspect mining and sentiment association
 
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
 
Evaluation of hindi english mt systems, challenges and solutions
Evaluation of hindi english mt systems, challenges and solutionsEvaluation of hindi english mt systems, challenges and solutions
Evaluation of hindi english mt systems, challenges and solutions
 
SemEval - Aspect Based Sentiment Analysis
SemEval - Aspect Based Sentiment AnalysisSemEval - Aspect Based Sentiment Analysis
SemEval - Aspect Based Sentiment Analysis
 
Natural Language Processing - Research and Application Trends
Natural Language Processing - Research and Application TrendsNatural Language Processing - Research and Application Trends
Natural Language Processing - Research and Application Trends
 
Presentation1
Presentation1Presentation1
Presentation1
 
Compression-Based Parts-of-Speech Tagger for The Arabic Language
Compression-Based Parts-of-Speech Tagger for The Arabic LanguageCompression-Based Parts-of-Speech Tagger for The Arabic Language
Compression-Based Parts-of-Speech Tagger for The Arabic Language
 
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
 
A FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTE
A FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTEA FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTE
A FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTE
 
A FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTE
A FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTEA FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTE
A FILM SYNOPSIS GENRE CLASSIFIER BASED ON MAJORITY VOTE
 
ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...
ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...
ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...
 
Assignment 2
Assignment 2Assignment 2
Assignment 2
 
Decision tables
Decision tablesDecision tables
Decision tables
 
Benchmarking transfer learning approaches for NLP
Benchmarking transfer learning approaches for NLPBenchmarking transfer learning approaches for NLP
Benchmarking transfer learning approaches for NLP
 
IRJET- Vernacular Language Spell Checker & Autocorrection
IRJET- Vernacular Language Spell Checker & AutocorrectionIRJET- Vernacular Language Spell Checker & Autocorrection
IRJET- Vernacular Language Spell Checker & Autocorrection
 
Lac presentation
Lac presentationLac presentation
Lac presentation
 

Ähnlich wie COLING 2014: Joint Opinion Relation Detection Using One-Class Deep Neural Network

A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...ijiert bestjournal
 
Sentence level sentiment polarity calculation for customer reviews by conside...
Sentence level sentiment polarity calculation for customer reviews by conside...Sentence level sentiment polarity calculation for customer reviews by conside...
Sentence level sentiment polarity calculation for customer reviews by conside...eSAT Publishing House
 
Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion   Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion dannyijwest
 
Sentiment+Analysis.ppt
Sentiment+Analysis.pptSentiment+Analysis.ppt
Sentiment+Analysis.pptvisheshs4
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESkevig
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESkevig
 
INFORMATION RETRIEVAL FROM TEXT
INFORMATION RETRIEVAL FROM TEXTINFORMATION RETRIEVAL FROM TEXT
INFORMATION RETRIEVAL FROM TEXTijcseit
 
Resume_Clasification.pptx
Resume_Clasification.pptxResume_Clasification.pptx
Resume_Clasification.pptxMOINDALVS
 
Intrinsic and Extrinsic Evaluations of Word Embeddings
Intrinsic and Extrinsic Evaluations of Word EmbeddingsIntrinsic and Extrinsic Evaluations of Word Embeddings
Intrinsic and Extrinsic Evaluations of Word EmbeddingsJinho Choi
 
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...Deren Lei
 
Resume_Clasification.pptx
Resume_Clasification.pptxResume_Clasification.pptx
Resume_Clasification.pptxMOINDALVS
 
IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...
IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...
IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...IRJET Journal
 
Conversational transfer learning for emotion recognition
Conversational transfer learning for emotion recognitionConversational transfer learning for emotion recognition
Conversational transfer learning for emotion recognitionTakato Hayashi
 
A novel meta-embedding technique for drug reviews sentiment analysis
A novel meta-embedding technique for drug reviews sentiment analysisA novel meta-embedding technique for drug reviews sentiment analysis
A novel meta-embedding technique for drug reviews sentiment analysisIAESIJAI
 
Aspects&opinions identification_opinion mining complete ppt
Aspects&opinions identification_opinion mining complete pptAspects&opinions identification_opinion mining complete ppt
Aspects&opinions identification_opinion mining complete ppttanvikadam76
 
EMNLP 2014: Opinion Mining with Deep Recurrent Neural Network
EMNLP 2014: Opinion Mining with Deep Recurrent Neural NetworkEMNLP 2014: Opinion Mining with Deep Recurrent Neural Network
EMNLP 2014: Opinion Mining with Deep Recurrent Neural NetworkPeinan ZHANG
 
Aspect Extraction Performance With Common Pattern of Dependency Relation in ...
Aspect Extraction Performance With Common Pattern of  Dependency Relation in ...Aspect Extraction Performance With Common Pattern of  Dependency Relation in ...
Aspect Extraction Performance With Common Pattern of Dependency Relation in ...Nurfadhlina Mohd Sharef
 

Ähnlich wie COLING 2014: Joint Opinion Relation Detection Using One-Class Deep Neural Network (20)

A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
A SURVEY PAPER ON EXTRACTION OF OPINION WORD AND OPINION TARGET FROM ONLINE R...
 
Sentence level sentiment polarity calculation for customer reviews by conside...
Sentence level sentiment polarity calculation for customer reviews by conside...Sentence level sentiment polarity calculation for customer reviews by conside...
Sentence level sentiment polarity calculation for customer reviews by conside...
 
Icdm2013 slides
Icdm2013 slidesIcdm2013 slides
Icdm2013 slides
 
Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion   Lexical Analysis to Effectively Detect User's Opinion
Lexical Analysis to Effectively Detect User's Opinion
 
Sentiment+Analysis.ppt
Sentiment+Analysis.pptSentiment+Analysis.ppt
Sentiment+Analysis.ppt
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
 
INFORMATION RETRIEVAL FROM TEXT
INFORMATION RETRIEVAL FROM TEXTINFORMATION RETRIEVAL FROM TEXT
INFORMATION RETRIEVAL FROM TEXT
 
Resume_Clasification.pptx
Resume_Clasification.pptxResume_Clasification.pptx
Resume_Clasification.pptx
 
Intrinsic and Extrinsic Evaluations of Word Embeddings
Intrinsic and Extrinsic Evaluations of Word EmbeddingsIntrinsic and Extrinsic Evaluations of Word Embeddings
Intrinsic and Extrinsic Evaluations of Word Embeddings
 
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...
 
Resume_Clasification.pptx
Resume_Clasification.pptxResume_Clasification.pptx
Resume_Clasification.pptx
 
IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...
IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...
IRJET- Sentimental Analysis for Students’ Feedback using Machine Learning App...
 
Conversational transfer learning for emotion recognition
Conversational transfer learning for emotion recognitionConversational transfer learning for emotion recognition
Conversational transfer learning for emotion recognition
 
columbia-gwu
columbia-gwucolumbia-gwu
columbia-gwu
 
A novel meta-embedding technique for drug reviews sentiment analysis
A novel meta-embedding technique for drug reviews sentiment analysisA novel meta-embedding technique for drug reviews sentiment analysis
A novel meta-embedding technique for drug reviews sentiment analysis
 
Aspects&opinions identification_opinion mining complete ppt
Aspects&opinions identification_opinion mining complete pptAspects&opinions identification_opinion mining complete ppt
Aspects&opinions identification_opinion mining complete ppt
 
Estimating the overall sentiment score by inferring modus ponens law
Estimating the overall sentiment score by inferring modus ponens lawEstimating the overall sentiment score by inferring modus ponens law
Estimating the overall sentiment score by inferring modus ponens law
 
EMNLP 2014: Opinion Mining with Deep Recurrent Neural Network
EMNLP 2014: Opinion Mining with Deep Recurrent Neural NetworkEMNLP 2014: Opinion Mining with Deep Recurrent Neural Network
EMNLP 2014: Opinion Mining with Deep Recurrent Neural Network
 
Aspect Extraction Performance With Common Pattern of Dependency Relation in ...
Aspect Extraction Performance With Common Pattern of  Dependency Relation in ...Aspect Extraction Performance With Common Pattern of  Dependency Relation in ...
Aspect Extraction Performance With Common Pattern of Dependency Relation in ...
 

Mehr von Peinan ZHANG

単語分散表現を用いた多層 Denoising Auto-Encoder による評価極性分類
単語分散表現を用いた多層 Denoising Auto-Encoder による評価極性分類単語分散表現を用いた多層 Denoising Auto-Encoder による評価極性分類
単語分散表現を用いた多層 Denoising Auto-Encoder による評価極性分類Peinan ZHANG
 
20140801ACL2014読み会
20140801ACL2014読み会20140801ACL2014読み会
20140801ACL2014読み会Peinan ZHANG
 
研究会20140702:進捗報告
研究会20140702:進捗報告研究会20140702:進捗報告
研究会20140702:進捗報告Peinan ZHANG
 
研究会20140618:進捗と闇Pythonistaのワンライナーテクニックを少々
研究会20140618:進捗と闇Pythonistaのワンライナーテクニックを少々研究会20140618:進捗と闇Pythonistaのワンライナーテクニックを少々
研究会20140618:進捗と闇Pythonistaのワンライナーテクニックを少々Peinan ZHANG
 
研究会20140604:進捗とWWDCとコードの書き方を少々
研究会20140604:進捗とWWDCとコードの書き方を少々研究会20140604:進捗とWWDCとコードの書き方を少々
研究会20140604:進捗とWWDCとコードの書き方を少々Peinan ZHANG
 
研究会20140512:研究テーマ決めとvimを少々
研究会20140512:研究テーマ決めとvimを少々研究会20140512:研究テーマ決めとvimを少々
研究会20140512:研究テーマ決めとvimを少々Peinan ZHANG
 

Mehr von Peinan ZHANG (6)

単語分散表現を用いた多層 Denoising Auto-Encoder による評価極性分類
単語分散表現を用いた多層 Denoising Auto-Encoder による評価極性分類単語分散表現を用いた多層 Denoising Auto-Encoder による評価極性分類
単語分散表現を用いた多層 Denoising Auto-Encoder による評価極性分類
 
20140801ACL2014読み会
20140801ACL2014読み会20140801ACL2014読み会
20140801ACL2014読み会
 
研究会20140702:進捗報告
研究会20140702:進捗報告研究会20140702:進捗報告
研究会20140702:進捗報告
 
研究会20140618:進捗と闇Pythonistaのワンライナーテクニックを少々
研究会20140618:進捗と闇Pythonistaのワンライナーテクニックを少々研究会20140618:進捗と闇Pythonistaのワンライナーテクニックを少々
研究会20140618:進捗と闇Pythonistaのワンライナーテクニックを少々
 
研究会20140604:進捗とWWDCとコードの書き方を少々
研究会20140604:進捗とWWDCとコードの書き方を少々研究会20140604:進捗とWWDCとコードの書き方を少々
研究会20140604:進捗とWWDCとコードの書き方を少々
 
研究会20140512:研究テーマ決めとvimを少々
研究会20140512:研究テーマ決めとvimを少々研究会20140512:研究テーマ決めとvimを少々
研究会20140512:研究テーマ決めとvimを少々
 

Kürzlich hochgeladen

GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 

Kürzlich hochgeladen (20)

GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

COLING 2014: Joint Opinion Relation Detection Using One-Class Deep Neural Network

  • 1. Joint Opinion Relation Detection Using One-Class Deep Neural Network COLING 2014 reading @ Komachi Lab Komachi Lab. M1 Peinan ZHANG
  • 2. Abstract & Introduction 有効な opinion relation には3つ必要な条件がある。 1. 感情極性を含んだ opinion word 2. 現在のドメインに関連した opinion target 3. opinion word が opinion target を修飾している これをタプルとして表現すると Komachi Lab. M1 Peinan ZHANG o = (s, t, r) となり、それぞれ s = opinion word t = opinion target r = linking relation between s and t 2
  • 3. Assumption 1 Terms are likely to have linking relation with the seed terms are believed to be opinion words or opinion targets. seed terms とのリンクを持つ単語は opinion words もしくは opinion targets である。 Komachi Lab. M1 Peinan ZHANG 3
  • 4. Abstract & Introduction Example 1. This mp3 has a clear screen. s = {clear}, t = {screen}, r = {clear, screen} Example 2. This mp3 has many good things. s = {good}, t = {things}, r = {good, things} o = (s, t, r) s = opinion word t = opinion target r = linking relation between s and t Is the word ‘things’ related to the domain ‘mp3’ ? Komachi Lab. M1 Peinan ZHANG 4 NO!!
  • 5. The Problems 前述した3つの条件 1. 感情極性を含んだ opinion word 2. 現在のドメインに関連した opinion target 3. opinion word が opinion target を修飾している のうち、2つにのみ焦点を当てていた。 そのため先行研究は雑多なノイズ単語を抽出してしま い、その影響を受けることになった。 Minging Hu and Bing Liu., 2004., Mining and summarizing customer reviews, ACM SIGKDD Komachi Lab. M1 Peinan ZHANG 5
  • 6. Assumption 2 The three requirements: the opinion word, the opinion target and the linking relation between them, shall be all verified during opinion relation detection. 3つの必要な条件である opinion word と opinion target 、それらの linking relation は同時に使わなけ ればならない。 Komachi Lab. M1 Peinan ZHANG 6
  • 7. Approach a novel Joint Opinion Relation Detection Method opinion words, opinion targets and linking relations are simultaneously considered in a classification scenario. HOW TO 1. provide a small set of seeds for supervision, which are regarded as positive labeled examples. n small set of seeds: opinion words, opinion targets n negative examples (i.e. noise terms) are hard to acquire, because we do not know which term is not an opinion word or target. 2. This leads to One-Class Classification (OCC) problem. n the key to OCC is semantic similarity measuring between terms. n Deep Neural Network (DNN) with word embeddings is a powerful tool to handle this problem. Komachi Lab. M1 Peinan ZHANG 7
  • 8. Approach a novel Joint Opinion Relation Detection Method opinion words, opinion targets and linking relations are simultaneously considered in a classification scenario. HOW TO 1. provide a small set of seeds for supervision, which are regarded as positive labeled examples. n small set: opinion words, opinion targets n negative examples (i.e. noise terms) are hard to acquire, because we do not know which term is not an opinion word or target. 2. This leads to One-Class Classification (OCC) problem. n the key to OCC is semantic similarity measuring between terms. n Deep Neural Network (DNN) with word embeddings is a powerful tool to handle this problem. Komachi Lab. M1 Peinan ZHANG 8
  • 9. The Architecture of OCDNN Komachi Lab. M1 Peinan ZHANG 9 Consists of two levels. Lower Level: Learn features n Left uses words embedding to represent opinion words/ targets. n Right maps linking relations to embedding vectors by a recursive auto-encoder Higher Level: use the learnt feature to perform one-class classification
  • 10. Outline 1. Abstract & Introduction 2. Approach 3. The Architecture of OCDNN 1. Generate Opinion Seeds 2. Generate Opinion Relation Candidates 3. Represent Words 4. Represent Linking Relation 5. One-Class Classification 4. Datasets & Experiments 5. Conclusion Komachi Lab. M1 Peinan ZHANG 10
  • 11. Outline 1. Abstract & Introduction 2. Approach 3. The Architecture of OCDNN 1. Generate Opinion Seeds 2. Generate Opinion Relation Candidates 3. Represent Words 4. Represent Linking Relation 5. One-Class Classification 4. Datasets & Experiments 5. Conclusion Komachi Lab. M1 Peinan ZHANG 11
  • 12. Opinion Seed Generation Opinion Word Seeds We manually pick 186 domain independent opinion words from SentiWordNet as the opinion seed set SS. Opinion Target Seeds We measure Likelihood Ratio Tests (LRT) between domain name and all opinion target candidates. Then highest N terms with highest LRT scores are added into the opinion target seed set TS. Linking Relation Seeds We employ an automatic syntactic opinion pattern learning method called Sentiment Graph Walking and get 12 opinion patterns with highest confidence as the linking relation seed set RS. Komachi Lab. M1 Peinan ZHANG 12
  • 13. Opinion Relation Candidate Generation Opinion Word Candidates: p adjectives or verbs Opinion Target Candidates: p noun or noun phrases Opinion Relation Candidates: p get dependency tree of a sentence using Stanford Parser p the shortest dependency path between a c_s and a c_t is taken as a c_r p to avoid introducing too many noise candidates, we constrain that there are at most 4 terms in a c_r Komachi Lab. M1 Peinan ZHANG 13
  • 14. Word Representation by Word Embedding Learning Words are embedded into a hyperspace, where 2 words that are more semantically similar to each other are located closer. (something like word2vec) See more in the paper below, Ronan Collobert et al., 2011., Natural language processing (almost) from scratch, Journal of Machine Learning Research Komachi Lab. M1 Peinan ZHANG 14
  • 15. Linking Relation Representation by Using Recursive Auto-encoder Goal: represent the linking relation between an opinion word and an opinion target by a n-element vector as we do during word representation. We combined embedding vectors of words in a linking relation by a recursive auto-encoder according to syntactic dependency structure. Komachi Lab. M1 Peinan ZHANG 15
  • 16. Linking Relation Representation by Using Recursive Auto-encoder Komachi Lab. M1 Peinan ZHANG 1. c_s と c_t との間の係 り受け関係を取ってくる。 2. c_s と c_t をそれぞれ [SC] と [ST] に置き換え る。 3. 点線内を3層からなる auto-encoder とし、2つ の n-element vector を 中間層で1つの n-element vector に圧縮す る(下式)。 4. W を入力と出力のユー クリッド距離が最小にな るまで更新していく。 16 Example. too loud to listen to the player
  • 17. One-Class Classification for Opinion Relation Detection We represent an opinion relation candidate c_o by a vector v_o=[v_s; v_t; v_r], and this vector v_o is to feed to upper level auto-encoder. For opinion relation detection, error scores that are smaller than a threshold theta are classified as positive. To estimate theta, we need to introduce a positive proportion (pp) score as follows, Komachi Lab. M1 Peinan ZHANG 17
  • 18. Opinion Target Expansion We apply bootstrapping to iteratively expand opinion targets seeds. p because the vocabulary of seed set is limited, which cannot fully represent the distribution of opinion targets. After training OCDNN, all opinion relation candidates are classified, and opinion targets are ranked in descent order by, Then, top M candidates are added into the target seed set TS for the next training iteration. Komachi Lab. M1 Peinan ZHANG 18
  • 19. Datasets Datasets p Customer Review Dataset (CRD) n contains review on five products (denoted by D1 to D5) p benchmark dataset on MP3 and Hotel p crawled from www.amazon.com, which involves Mattress and Phone Annotation p 10000 sentences are randomly selected from reviews and annotators are required to judge whether each term is an opinion word or an opinion target. p 5000 sentences are annotated for MP3 and Hotel. Annotators are required to carefully read through each sentence and find out every opinion relation detection. Komachi Lab. M1 Peinan ZHANG 19
  • 20. Evaluation Settings AdjRule extract opinion words/targets by using adjacency rules LRTBOOT bootstrapping algorithm which employs Likelihood Ration Test as the co-occurrence statistical measure DP denotes the Double Propagation algorithm DP-HITS enhanced version of DP by using HITS algorithm OCDNN proposed method. The target seed size N=40, the opinion targets expanded in each iteration M=20, and the max bootstrapping iteration number is X=10. Komachi Lab. M1 Peinan ZHANG 20 statistical co-occurrence-based method syntax-based method
  • 21. Experiments DP-HITS does not extract opinion words so their results for opinion words are not taken into account. Komachi Lab. M1 Peinan ZHANG 21
  • 22. Experiments Komachi Lab. M1 Peinan ZHANG 22
  • 23. Experiments Komachi Lab. M1 Peinan ZHANG 23 n our method outperforms co-occurrence-based methods AdjRule and LRTBOOT n but achieves comparable or a little worse results than syntax-based methods DP and DP-HITS n because CRD is quite small, which only contains several hundred sentences for each product review set. In this case, methods based on careful-designed syntax rules have superiority over those based on statistics. n our method outperforms all of the competitors n OCDNN vs. DP-HITS: those two use similar term ranking metrics, but OCDNN significantly outperforms DP-HITS. Therefore, positive proportion is more effective than the importance score. n OCDNN vs. LRTBOOT: LRTBOOT is better recall but lower precision. This is because LRTBOOT follows Assumption 1, which suffers a lot from error propagation, while our joint classification approach effectively alleviates this issue.
  • 24. Assumption 1 vs. Assumption 2 Komachi Lab. M1 Peinan ZHANG 24
  • 25. Komachi Lab. M1 Peinan ZHANG 25 Assumption 1 vs. Assumption 2 n OCDNN significantly outperforms all competitors. The average improvement of F-measure over the best competitor is 6% on CRD and 9% on Hotel and MP3. n As Assumption 1 only verifies 2 of the requirements, it would inevitably introduce noise terms. n For syntax-based method DP, it extracts many false opinion relations such as good thing and nice one or objective expressions like another mp3 and every mp3. n For co-occurrence statistical methods AdjRule and LRTBOOT, it is very hard to deal with ambiguous linking relations. For example, in phrase this mp3 is very good except the size, co-occurrence statistical methods could hardly tell which opinion target does good modify (mp3 or size).
  • 26. Conclusion p この論文では joint opinion relation detection を One- Class Deep Neural Network に適応させて分類を行った。 p 特徴的な点は、 opinion words/targets/relations を同時 に参照して分類することにある。 p そして実験では、条件の2つしか適応させていなかった 手法よりも良い結果を示すことが出来た。 Komachi Lab. M1 Peinan ZHANG 26
  • 27. Conclusion 1. Abstract & Introduction 2. Approach 3. The Architecture of OCDNN 1. Generate Opinion Seeds 2. Generate Opinion Relation Candidates 3. Represent Words 4. Represent Linking Relation 5. One-Class Classification 4. Datasets & Experiments 5. Conclusion Komachi Lab. M1 Peinan ZHANG 27