4. Intro & motivation
What is aspect-based sentiment analysis (ABSA)?
[Definition from SemEval 2015]: mining and summarizing opinions from text about specific entities and their aspects.
[Figure: the sentence "I like the food but the waiters were rude" analyzed two ways: plain sentiment yields a single score, +1 (+0.65), while ABSA yields one score per aspect (Food, Service).]
5. Intro & motivation
Why do we need it?
- Higher resolution analysis
https://www.researchgate.net/publication/301408174_Twitter_sentiment_analysis
http://159.89.224.205/wp-content/uploads/2016/05/tumblr_inline_o72ropcTfR1u37g00_540.png
6. Intro & motivation
Why do we need it?
- Higher resolution analysis
- It is a more principled way to perform sentiment analysis ... what would you do as a human? (Especially since most reviews carry more than one sentiment.)
7. Intro & motivation
What do these aspects look like?
They come in two flavors:
1- Dictionaries of aspect terms
https://www.researchgate.net/figure/Most-likely-words-from-4-topics-in-LDA-from-the-AP-corpus-the-topic-titles-in-quotes-are_fig1_220766449
8. Intro & motivation
What do these aspects look like?
They come in two flavors:
1- Dictionaries of aspect terms
2- Highlighting in the text
I like the [food] but the [waiters] were rude (aspect terms highlighted in place)
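To make the two flavors concrete, a toy sketch (the data structures are illustrative, not from the talk):

```python
# Flavor 1: a dictionary of aspect terms per aspect
aspect_terms = {
    "Food":    ["food", "yogurt", "salad", "vinaigrette"],
    "Service": ["waiter", "staff", "lady at the door"],
}

# Flavor 2: highlighted spans inside the text itself
review = "I like the food but the waiters were rude"
spans = [(11, 15, "Food"), (24, 31, "Service")]  # (start, end, aspect)
for start, end, aspect in spans:
    print(review[start:end], "->", aspect)  # food -> Food, waiters -> Service
```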
9. Intro & motivation
What are some of the issues we need to take into consideration?
- Ambiguity
Topic words like:
- time/experience: can be used without meaning to express an opinion
- kids: family-friendliness or noise?
- kind, noisy: sentiment 'and' topic at the same time
10. Intro & motivation
What are some of the issues we need to take into consideration?
- Ambiguity
- Multi-word terms
greek yogurt with cucumber dill and garlic taste
crab salad with passion fruit vinaigrette
store events
the lady at the door (staff)
11. Intro & motivation
What are some of the issues we need to take into consideration?
- Ambiguity
- Multi-word terms
- Granularity
17. LDA
graphical model
- α and η are the parameters of the (Dirichlet) priors over θ and β
- θd is the distribution over topics for document d (a real vector of length K, the number of topics)
- βk is the distribution over words for topic k (a real vector of length V, the vocabulary size)
- zd,n is the topic of the nth word of the dth document
- wd,n is the nth word of the dth document
18. LDA
graphical model
- α and η are the parameters of the (Dirichlet) priors over θ and β
- θd is the distribution over topics for document d (a real vector of length K, the number of topics)
- βk is the distribution over words for topic k (a real vector of length V, the vocabulary size)
- zd,n is the topic of the nth word of the dth document
- wd,n is the nth word of the dth document
Only the words wd,n are observed (the shaded node); everything else is latent.
https://www.utdallas.edu/~nrr150130/cs6347/2015sp/lects/Lecture_17_LDA.pdf
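In equations, the generative story these variables encode is the standard LDA one:

```latex
\theta_d \sim \operatorname{Dirichlet}(\alpha), \qquad
\beta_k \sim \operatorname{Dirichlet}(\eta), \qquad
z_{d,n} \sim \operatorname{Categorical}(\theta_d), \qquad
w_{d,n} \sim \operatorname{Categorical}(\beta_{z_{d,n}})
```

Each document draws a topic mixture θd, each topic draws a word distribution βk, and each word position first draws a topic zd,n and then a word wd,n from that topic.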
19. LDA
training
How do we train this?
Gibbs sampling ... what does that mean?
https://www.utdallas.edu/~nrr150130/cs6347/2015sp/lects/Lecture_17_LDA.pdf
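To make "Gibbs sampling" concrete, here is a toy collapsed Gibbs sampler for LDA, a sketch under the notation above (not from the slides):

```python
import numpy as np

def gibbs_lda(docs, K, V, alpha=0.1, eta=0.01, iters=200, seed=0):
    """Toy collapsed Gibbs sampler for LDA.
    docs: list of lists of word ids in [0, V).
    Returns (topic-word counts, doc-topic counts)."""
    rng = np.random.default_rng(seed)
    D = len(docs)
    ndk = np.zeros((D, K))                        # doc-topic counts
    nkw = np.zeros((K, V))                        # topic-word counts
    nk = np.zeros(K)                              # total words per topic
    z = [rng.integers(K, size=len(doc)) for doc in docs]  # random init
    for d, doc in enumerate(docs):
        for n, w in enumerate(doc):
            k = z[d][n]
            ndk[d, k] += 1; nkw[k, w] += 1; nk[k] += 1
    for _ in range(iters):
        for d, doc in enumerate(docs):
            for n, w in enumerate(doc):
                k = z[d][n]
                ndk[d, k] -= 1; nkw[k, w] -= 1; nk[k] -= 1  # forget this word
                # P(z=k | rest) ∝ (ndk + α) · (nkw + η) / (nk + V·η)
                p = (ndk[d] + alpha) * (nkw[:, w] + eta) / (nk + V * eta)
                k = rng.choice(K, p=p / p.sum())            # resample topic
                z[d][n] = k
                ndk[d, k] += 1; nkw[k, w] += 1; nk[k] += 1
    return nkw, ndk
```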
20. LDA
training
Why does it work at all?
Topic words tend to co-occur across documents ... let's try to remember that.
https://www.utdallas.edu/~nrr150130/cs6347/2015sp/lects/Lecture_17_LDA.pdf
21. LDA
outcome
This is what we keep at the end: the topic-word distributions β (and, if needed, the document-topic mixtures θ).
What does it look like?
https://www.utdallas.edu/~nrr150130/cs6347/2015sp/lects/Lecture_17_LDA.pdf
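Continuing the toy sampler sketched above, "what we keep" is the topic-word matrix; the familiar top-words view of each topic falls out of it:

```python
import numpy as np

def top_words(nkw, vocab, topn=10, eta=0.01):
    """nkw: topic-word counts from the sampler; vocab: id -> word."""
    beta = (nkw + eta) / (nkw + eta).sum(axis=1, keepdims=True)  # smoothed beta
    return [[vocab[i] for i in np.argsort(-beta[k])[:topn]]
            for k in range(beta.shape[0])]
```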
24. LDA
recap
https://www.utdallas.edu/~nrr150130/cs6347/2015sp/lects/Lecture_17_LDA.pdf
Pros:
- Strong mathematical basis -> consistent results
- Many existing off-the-shelf tools (see the sketch after this list)
- No domain dependence
- Unsupervised-ish ... why not say fully unsupervised? Let's look at the cons
Cons:
- Requires a list of stop words and preprocessed text
- Why is that an issue?
- Unigrams (can be worked around in a limited way)
- Outcome: a list of words out of context
- Doesn't work well for short text
- Why?
- What do people do to use LDA for short text?
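As an illustration of the off-the-shelf point above (and of the preprocessing con), a minimal gensim run on toy, already-tokenized text:

```python
from gensim.corpora import Dictionary
from gensim.models import LdaModel

# Stop-word removal and tokenization must happen before LDA sees the text.
texts = [["food", "great", "service", "slow"],
         ["waiters", "rude", "food", "tasty"],
         ["beer", "selection", "amazing", "beer"]]
dictionary = Dictionary(texts)
corpus = [dictionary.doc2bow(t) for t in texts]
lda = LdaModel(corpus=corpus, id2word=dictionary, num_topics=2, passes=20)
for topic_id, words in lda.print_topics():
    print(topic_id, words)  # the outcome: lists of words out of context
```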
38. ABAE
experimental setup
z: an aspect
Sz: the set of top words of z
D1(w): document frequency of w
D2(w1,w2): co-document frequency of w1 and w2
Higher is more semantically coherent.
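The coherence score built from these quantities (ABAE reports the Mimno et al. 2011 metric; reconstructed here from that definition, so treat the exact form as an assumption):

```latex
C(z; S_z) = \sum_{n=2}^{N} \sum_{l=1}^{n-1}
            \log \frac{D_2(w^{z}_{n}, w^{z}_{l}) + 1}{D_1(w^{z}_{l})},
\qquad S_z = \{ w^{z}_{1}, \dots, w^{z}_{N} \}
```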
Data:
- Citysearch: 50k+ restaurant reviews, of which 3,400 are manually labeled (6 aspects)
- BeerAdvocate: 1.5M reviews, of which 1k are manually labeled (5 aspects)
Performance measures:
(Precision, Recall, F1) & coherence score
44. ABAE
second look
The authors initialized this with k-means, but it could be anything you want. For instance ...
https://aclweb.org/anthology/D18-1403
Summarizing Opinions: Aspect Extraction Meets Sentiment Prediction and They Are Both Weakly Supervised.
Angelidis and Lapata
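A minimal sketch of that k-means initialization (scikit-learn, assuming pre-trained word embeddings in a numpy array; variable names are mine):

```python
import numpy as np
from sklearn.cluster import KMeans

def init_aspect_matrix(word_vectors, n_aspects=14, seed=0):
    """Initialize the aspect matrix T with k-means centroids over the
    L2-normalized word embedding space (ABAE-style; names illustrative)."""
    vecs = word_vectors / np.linalg.norm(word_vectors, axis=1, keepdims=True)
    km = KMeans(n_clusters=n_aspects, random_state=seed).fit(vecs)
    T = km.cluster_centers_
    return T / np.linalg.norm(T, axis=1, keepdims=True)  # one row per aspect
```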
46. ABAE
second look
pt is used for a linear combination of T's entries.
The only non-linearity in town.
47. ABAE
second look
The only non-linearity in town.
pt is used for a linear combination of T's entries.
The topic words are obtained through nearest neighbors (NN) over T's entries.
What the algorithm really does is search for 'n' points in the embedding space that are representative of the topics, plus provide a mechanism to mask stop words in the text.
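Putting the two "second look" slides together, a minimal sketch of ABAE's forward pass in PyTorch (my own naming and shapes, not the authors' code; the max-margin reconstruction training loss is omitted):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ABAECore(nn.Module):
    """Sketch: attention over word embeddings -> sentence embedding z_s ->
    softmax over aspects p_t -> reconstruction r_s from the aspect matrix T."""
    def __init__(self, emb_dim, n_aspects, aspect_init=None):
        super().__init__()
        self.M = nn.Parameter(torch.eye(emb_dim))      # attention bilinear map
        self.W = nn.Linear(emb_dim, n_aspects)         # z_s -> aspect logits
        T = aspect_init if aspect_init is not None else torch.randn(n_aspects, emb_dim)
        self.T = nn.Parameter(T)                       # aspect embedding matrix

    def forward(self, E):                 # E: (seq_len, emb_dim) word embeddings
        y = E.mean(dim=0)                 # average word vector of the sentence
        a = F.softmax(E @ self.M @ y, dim=0)   # attention masks e.g. stop words
        z_s = a @ E                       # attended sentence embedding
        p_t = F.softmax(self.W(z_s), dim=0)    # "the only non-linearity in town"
        r_s = p_t @ self.T                # linear combination of T's rows
        return r_s, z_s, p_t
```

Topic words per aspect are then the nearest-neighbor words to each row of T in the embedding space.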
So how can we best use this for topic detection?
49. In conclusion
- Topic detection is difficult because it is domain- and use-case-specific. We need it, however, for a proper inference of a "brand" profile.
- Existing approaches fail to consider both the inference of dictionaries and their use in a specific context.
- LDA provides a strong approach to language-independent(-ish) and unsupervised(-ish) topic modelling.
- ... However, extensions and variations of ABAE are more likely to take over in the future, given the rich mechanisms they offer.