SlideShare ist ein Scribd-Unternehmen logo
1 von 34
Downloaden Sie, um offline zu lesen
Can Deep Learning solve the Sentiment Analysis Problem? 
Mark CieliebakZurichUniversity ofApplied Sciences 
Annual Meeting ofSGAICO –Swiss Group forArtificialIntelligenceandCognitiveScience 
18.11.2014
Outline 
1.What is sentiment analysis? 
2.How good are "classical" approaches? 
3.Does deep learning solve the problem? 
18.11.2014 Mark Cieliebak 2
About Me 
18.11.2014 Mark Cieliebak 3 
Mark Cieliebak 
Institute of Applied Information Technology (InIT) 
ZHAW, Winterthur 
Email: ciel@zhaw.ch, Website: www.zhaw.ch/~ciel 
Text 
Analytics 
Open 
Data 
Automated 
Test 
Generation 
Research 
Interests 
Software 
Engineering
WhatisSentiment Analysis 
"… WiFiAnalytics isa freeAndroid appthatI find veryhandywhenitcomestotroubleshootingandmonitoringa homenetwork. "[1] 
18.11.2014 Mark Cieliebak 4
Sample Application: SocialMedia Monitoring 
Text AnalyticsComponents: 
•Find relevant documents 
•Hot topicAnalysis 
•Sentiment analysis 
18.11.2014 Mark Cieliebak 5 
[7]
FlavoursofSentiment Analysis 
•DocumentBased 
•SentenceBased 
•Target-Specific 
•Rating Prediction 
18.11.2014 Mark Cieliebak 6
Classic ApproachestoSentiment Analysis 
Rule-Based 
Corpus-Based 
18.11.2014 Mark Cieliebak 7 
Predicted 
Label 
[3] 
[4]
Simple Sentiment Analysis 
Idea: Count numberofpositive andnegative words 
"This cameraisgreat[+1]." 
+1 (pos) 
"I find itbeautiful[+1]andgood[+1]." 
+2 (pos) 
"Itlooksterrible[-1]." 
-1 (neg) 
"This carhasa bluecolor." 
0 (neu) 
POSITIVE: 
great 
love 
nice 
... 
NEUTRAL: 
hello 
see 
I 
… 
NEGATIVE: 
bad 
hate 
ugly 
... 
UseSentiment-Dictionary: 
18.11.2014 Mark Cieliebak 8
Sample Rules 
18.11.2014 Mark Cieliebak 9 
•DetectBooster Words: "The carisreallyveryexpensive[-1 -1 -2]." 
•New Category"Mixed": "This carhasan appealing[+1]design andcomfortable[+1]seats, but itisexpensive[-1]." 
•Negation: Invertonlyscore ofwordsoccuringafter thenegation: "The carisappealing[+3]andI do not[*-1]find itexpensive[-2]" 
•I do notfind thecarexpensiveanditisappealing. 
Need to“understand” thesentence
Linguistic Analysis 
-> RULE: Invertscoresofwordsbeingin thesame phrasesasnegation. 
“I do not find thecarexpensive[+2] 
anditisappealing[+3].” → +5 (pos) 
Sentence 
Sentence 
Conj. 
Sentence 
NounPhrase 
Verb Phrase 
Verb 
Adverb 
Verb 
Noun Phrase 
Adj. 
Noun Phrase 
Verb Phrase 
Det. 
Det 
Noun 
Det. 
Verb 
Participle 
I 
do 
not 
find 
the 
car 
expensive 
and 
it 
is 
appealing 
18.11.2014 Mark Cieliebak 10
Rule-BasedSentiment Analysis 
Most ImportantIssues: 
-Requiresgoodhand-craftedrules 
-Hard totransfertonewtasksorlanguages 
-Doesnot workwellfortextswithbadgrammer(Twitter) 
18.11.2014 Mark Cieliebak 11 
[5]
Classic ApproachestoSentiment Analysis 
Rule-Based 
Corpus-Based 
18.11.2014 Mark Cieliebak 12 
Predicted 
Label 
[3] 
[4]
Corpus-BasedSentiment Analysis 
18.11.2014 Mark Cieliebak 13 
Predicted 
Label 
[4]
Corpus-BasedSentiment Analysis 
AnnotatedCorpus 
Sentence 
Polarity 
This analysis is good. 
Pos 
It looks awful. 
Neg 
This car has a blue color. 
Neu 
This car has an appealing design, comfortable seats, but it is expensive. 
Mix 
This carhasa veryappealingdesign, comfortableseats, but itisreallyexpensive. 
Mix 
This analysis is not good. 
Neg 
This car has an appealing design, comfortable seats and it is not expensive. 
Mix 
This movie was like a horror event. 
Neg 
This carisappealingandisnot expensive. 
Mix 
... 
... 
18.11.2014 Mark Cieliebak 14
Sample Features forTweets 
•Word ngrams:presence or absence of contiguous sequences of 1, 2, 3, and 4 tokens; noncontiguous ngrams 
•POS: the number of occurrences of each part-of-speechtag 
•SentimentLexica: eachwordannotatedwithtonalityscore (-1..0..+1) 
•Negation: the number of negated contexts 
•Punctuation: the number of contiguous sequences of exclamation marks, question marks, and both exclamation and question marks 
•Emoticons: presenceorabsence, last token is a positive or negative emoticon; 
•Hashtags: the number of hashtags; 
•Elongatedwords: the number of words with one character repeated (e.g. ‘soooo’) 
from: Mohammad et al., SemEval2013 
18.11.2014 Mark Cieliebak 15
Corpus-BasedSentiment Analysis 
Most ImportantIssues: 
-Requireslarge annotatedcorpora 
-Dependson goodfeatures 
18.11.2014 Mark Cieliebak 16 
[6]
HowgoodareSentiment Analysis Tools? 
18.11.2014 Mark Cieliebak 17
Quick Poll 
•Short texts: 1-2 sentencesfromTwitter, news, reviewsetc. 
•Three-classclassification: positive, negative, other 
•Accuracy= #푐표푟푟푒푐푡푑표푐푠 #푑표푐푠 
Mark Cieliebak 21 
Accuracy 
Votes 
<50% 
50-60% 
60-70% 
70-80% 
80-90% 
>90% 
"Howgoodarestate-of-the-art sentimentanalysistools?" 
18.11.2014
Tool Accuracy 
0,2 
0,3 
0,4 
0,5 
0,6 
0,7 
0,8 
Accuracy 
Best Tool per Corpus 
Worst Tool per Corpus 
22 
61% 
40% 
Avg. 
18.11.2014 Mark Cieliebak 
[14]
Tool Accuracy 
0,2 
0,3 
0,4 
0,5 
0,6 
0,7 
0,8 
Accuracy 
Best Tool per Corpus 
Worst Tool per Corpus 
Overall Best Tool 
23 
61% 
40% 
59% 
Avg. 
18.11.2014 Mark Cieliebak
Take-Home Lesson 
Accuracyofbestcommercialtoolon 
arbitraryshorttextsis59% 
18.11.2014 Mark Cieliebak 24
ApproachestoSentiment Analysis 
Rule-Based 
Corpus-Based 
18.11.2014 Mark Cieliebak 25 
Predicted 
Label 
[9] 
DeepLearning 
[8]
DeepLearning on Text 
It'sall aboutWord Vectors! 
18.11.2014 Mark Cieliebak 26
Word2Vec 
•Hugesetoftextsamples(billionsofwords) 
•Extractdictionary 
•Word-Matrix: k-dimensional vectorforeachword(k typically50-500) 
•Word vectorinitializedrandomly 
•Train wordvectorstopredictnextwords, givena sequenceofwordsfromsample text 
18.11.2014 Mark Cieliebak 27 
Major contributionsbyBengioet al. 2003, Collobert&Weston2008, Socher et al. 2011, Mikolovet al. 2013 
[9]
The Magic ofWord Vectors 
18.11.2014 Mark Cieliebak 28 
King -Man + Woman≈ Queen 
Live Demo on 100b wordsfromGoogle News dataset: http://radimrehurek.com/2014/02/word2vec-tutorial/ 
[10]
Relations LearnedbyWord2Vec 
18.11.2014 Mark Cieliebak 29 
[11]
UsingWord Vectorsin NLP 
18.11.2014 Mark Cieliebak 30 
Collobertet al., 2011: 
•SENNA: GenericNLP System basedon wordvectors 
•Nomanualfeatureengineering 
•SolvesmanyNLP-Tasks asgoodasbenchmarksystems 
[12]
DeepLearning andSentiment 
Maas et al., 2011 
•Enrichwordvectorswithsentimentcontext 
•Capture semanticofwords(unsupervised) andsentiment(supervised) in parallel, usingmultiple learningtasks 
wonderful 
amazing 
terrible 
awful 
18.11.2014 Mark Cieliebak 31
DeepLearning andSentiment 
Socher et al. 2013: 
•Word Vectorsdo not helpforSentiment Analysis 
•RecursiveNeuralTensor Networks 
•Representingsentencestructuresastreeswhileaddingsentimentannotationsat same time 
•Restrictedtosingle, well-structuredsentences 
• 
18.11.2014 Mark Cieliebak 32 
[13]
DeepLearning andSentiment 
QuocandMikolov, 2014: 
•"Paragraph Vectors" 
•Add context(sentence, paragraph, document) towordvectorsduringtraining 
•Improvesmanyexistingapproaches 
18.11.2014 Mark Cieliebak 33 
[9]
DoesDeepLearning solvethe 
Sentiment Analysis Problem? 
18.11.2014 Mark Cieliebak 34
Conclusion: DeepLearning forSentiment 
•Small improvements, not revolution 
•Veryrecentresearch, not yet"end ofthestory" 
•SemEval2015 will bebenchmark 
18.11.2014 Mark Cieliebak 35
Talk in Short! 
1.Classic approachesarerule-basedorcorpus-based 
2.State-of-the-art toolsclassify4 out of10 docswrong 
3.DeepLearning doesnot needhand-craftedfeatures 
4.DeepLearning improvesexistingbenchmarks 
18.11.2014 Mark Cieliebak 36
ThankYou! 
Mark Cieliebak 
ZurichUniversity ofApplied Sciences(ZHAW) 
Winterthur, Switzerland 
Email: ciel@zhaw.ch, Website: www.zhaw.ch/~ciel 
18.11.2014 Mark Cieliebak 37 
[15]

Weitere ähnliche Inhalte

Was ist angesagt?

Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
harit66
 

Was ist angesagt? (20)

Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
Lexicon-Based Sentiment Analysis at GHC 2014
Lexicon-Based Sentiment Analysis at GHC 2014Lexicon-Based Sentiment Analysis at GHC 2014
Lexicon-Based Sentiment Analysis at GHC 2014
 
MTech Seminar Presentation [IIT-Bombay]
MTech Seminar Presentation [IIT-Bombay]MTech Seminar Presentation [IIT-Bombay]
MTech Seminar Presentation [IIT-Bombay]
 
Twitter sentiment analysis
Twitter sentiment analysisTwitter sentiment analysis
Twitter sentiment analysis
 
Sentiment analyzer and opinion mining
Sentiment analyzer and opinion miningSentiment analyzer and opinion mining
Sentiment analyzer and opinion mining
 
RCOMM 2011 - Sentiment Classification with RapidMiner
RCOMM 2011 - Sentiment Classification with RapidMinerRCOMM 2011 - Sentiment Classification with RapidMiner
RCOMM 2011 - Sentiment Classification with RapidMiner
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
2 13
2 132 13
2 13
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Adapting Sentiment Lexicons using Contextual Semantics for Sentiment Analysis...
Adapting Sentiment Lexicons using Contextual Semantics for Sentiment Analysis...Adapting Sentiment Lexicons using Contextual Semantics for Sentiment Analysis...
Adapting Sentiment Lexicons using Contextual Semantics for Sentiment Analysis...
 
Sentiment tool Project presentaion
Sentiment tool Project presentaionSentiment tool Project presentaion
Sentiment tool Project presentaion
 
Ml ppt
Ml pptMl ppt
Ml ppt
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter Data
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier
 
Sentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big DataSentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big Data
 
sentiment analysis
sentiment analysis sentiment analysis
sentiment analysis
 
SentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and TweetsSentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and Tweets
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
IRE2014-Sentiment Analysis
IRE2014-Sentiment AnalysisIRE2014-Sentiment Analysis
IRE2014-Sentiment Analysis
 

Andere mochten auch

Sentiment analysis of tweets using Neural Networks
Sentiment analysis of tweets using Neural NetworksSentiment analysis of tweets using Neural Networks
Sentiment analysis of tweets using Neural Networks
Adrián Palacios Corella
 
Sentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and WhySentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and Why
Davide Feltoni Gurini
 
Sentiment analysis of tweets
Sentiment analysis of tweetsSentiment analysis of tweets
Sentiment analysis of tweets
Vasu Jain
 
Sentiment Analysis via R Programming
Sentiment Analysis via R ProgrammingSentiment Analysis via R Programming
Sentiment Analysis via R Programming
Skillspeed
 
connected_issue_49_summer_2013
connected_issue_49_summer_2013connected_issue_49_summer_2013
connected_issue_49_summer_2013
Mary Stephanou
 
SNLI_presentation_2
SNLI_presentation_2SNLI_presentation_2
SNLI_presentation_2
Viral Gupta
 

Andere mochten auch (20)

Sentiment analysis of tweets using Neural Networks
Sentiment analysis of tweets using Neural NetworksSentiment analysis of tweets using Neural Networks
Sentiment analysis of tweets using Neural Networks
 
Sentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and WhySentiment Analysis and Social Media: How and Why
Sentiment Analysis and Social Media: How and Why
 
Sentiment analysis of tweets
Sentiment analysis of tweetsSentiment analysis of tweets
Sentiment analysis of tweets
 
CNN for Sentiment Analysis on Italian Tweets
CNN for Sentiment Analysis on Italian TweetsCNN for Sentiment Analysis on Italian Tweets
CNN for Sentiment Analysis on Italian Tweets
 
Negative Sentiment (or "Sentiment Analysis is Sh*te")
Negative Sentiment (or "Sentiment Analysis is Sh*te")Negative Sentiment (or "Sentiment Analysis is Sh*te")
Negative Sentiment (or "Sentiment Analysis is Sh*te")
 
Sentiment analysis in healthcare
Sentiment analysis in healthcareSentiment analysis in healthcare
Sentiment analysis in healthcare
 
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning AlgorithmsSentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
 
Social media & sentiment analysis splunk conf2012
Social media & sentiment analysis   splunk conf2012Social media & sentiment analysis   splunk conf2012
Social media & sentiment analysis splunk conf2012
 
Practical Sentiment Analysis
Practical Sentiment AnalysisPractical Sentiment Analysis
Practical Sentiment Analysis
 
Emotions Affect Markets in Predictable Ways: Behavioral Finance and Sentiment...
Emotions Affect Markets in Predictable Ways: Behavioral Finance and Sentiment...Emotions Affect Markets in Predictable Ways: Behavioral Finance and Sentiment...
Emotions Affect Markets in Predictable Ways: Behavioral Finance and Sentiment...
 
Sentiment Analysis via R Programming
Sentiment Analysis via R ProgrammingSentiment Analysis via R Programming
Sentiment Analysis via R Programming
 
CNN for Text Classification
CNN for Text ClassificationCNN for Text Classification
CNN for Text Classification
 
How Sentiment Analysis works
How Sentiment Analysis worksHow Sentiment Analysis works
How Sentiment Analysis works
 
Introduction to Sentiment Analysis
Introduction to Sentiment AnalysisIntroduction to Sentiment Analysis
Introduction to Sentiment Analysis
 
Big Data & Sentiment Analysis
Big Data & Sentiment AnalysisBig Data & Sentiment Analysis
Big Data & Sentiment Analysis
 
Continuous Sentiment Intensity Prediction based on Deep Learning
Continuous Sentiment Intensity Prediction based on Deep LearningContinuous Sentiment Intensity Prediction based on Deep Learning
Continuous Sentiment Intensity Prediction based on Deep Learning
 
connected_issue_49_summer_2013
connected_issue_49_summer_2013connected_issue_49_summer_2013
connected_issue_49_summer_2013
 
SNLI_presentation_2
SNLI_presentation_2SNLI_presentation_2
SNLI_presentation_2
 
Lecture 3: Structuring Unstructured Texts Through Sentiment Analysis
Lecture 3: Structuring Unstructured Texts Through Sentiment AnalysisLecture 3: Structuring Unstructured Texts Through Sentiment Analysis
Lecture 3: Structuring Unstructured Texts Through Sentiment Analysis
 
P3
P3P3
P3
 

Ähnlich wie Can Deep Learning solve the Sentiment Analysis Problem

Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
Lucidworks
 

Ähnlich wie Can Deep Learning solve the Sentiment Analysis Problem (20)

#like or #fail - How Can Computers Tell the Difference?
#like or #fail - How Can Computers Tell the Difference? #like or #fail - How Can Computers Tell the Difference?
#like or #fail - How Can Computers Tell the Difference?
 
Ire major project
Ire major projectIre major project
Ire major project
 
Agile Engineering for Managers Workshop
Agile Engineering for Managers WorkshopAgile Engineering for Managers Workshop
Agile Engineering for Managers Workshop
 
Tomáš Mikolov - Distributed Representations for NLP
Tomáš Mikolov - Distributed Representations for NLPTomáš Mikolov - Distributed Representations for NLP
Tomáš Mikolov - Distributed Representations for NLP
 
Vector Search for Data Scientists.pdf
Vector Search for Data Scientists.pdfVector Search for Data Scientists.pdf
Vector Search for Data Scientists.pdf
 
Storytelling for research software engineers
Storytelling for research software engineersStorytelling for research software engineers
Storytelling for research software engineers
 
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
 
16-nlp (2).ppt
16-nlp (2).ppt16-nlp (2).ppt
16-nlp (2).ppt
 
Case Study: We're Watching You: How and Why Researchers Study Open Source And...
Case Study: We're Watching You: How and Why Researchers Study Open Source And...Case Study: We're Watching You: How and Why Researchers Study Open Source And...
Case Study: We're Watching You: How and Why Researchers Study Open Source And...
 
H2O World - Clustering & Feature Extraction on Text - Seth Redmore
H2O World - Clustering & Feature Extraction on Text - Seth RedmoreH2O World - Clustering & Feature Extraction on Text - Seth Redmore
H2O World - Clustering & Feature Extraction on Text - Seth Redmore
 
Dice.com Bay Area Search - Beyond Learning to Rank Talk
Dice.com Bay Area Search - Beyond Learning to Rank TalkDice.com Bay Area Search - Beyond Learning to Rank Talk
Dice.com Bay Area Search - Beyond Learning to Rank Talk
 
To Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
To Label or Not? Advances and Open Challenges in SE-specific Sentiment AnalysisTo Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
To Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
 
The Actionable Guide to Doing Better Semantic Keyword Research #BrightonSEO (...
The Actionable Guide to Doing Better Semantic Keyword Research #BrightonSEO (...The Actionable Guide to Doing Better Semantic Keyword Research #BrightonSEO (...
The Actionable Guide to Doing Better Semantic Keyword Research #BrightonSEO (...
 
devsecops-reference-architectures-2018.pdf
devsecops-reference-architectures-2018.pdfdevsecops-reference-architectures-2018.pdf
devsecops-reference-architectures-2018.pdf
 
NLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLPNLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLP
 
Deep Learning @ ZHAW Datalab (with Mark Cieliebak & Yves Pauchard)
Deep Learning @ ZHAW Datalab (with Mark Cieliebak & Yves Pauchard)Deep Learning @ ZHAW Datalab (with Mark Cieliebak & Yves Pauchard)
Deep Learning @ ZHAW Datalab (with Mark Cieliebak & Yves Pauchard)
 
Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)
 
Social Media Crawling & Mining Seminar
Social Media Crawling & Mining Seminar Social Media Crawling & Mining Seminar
Social Media Crawling & Mining Seminar
 
REVIEW PPT.pptx
REVIEW PPT.pptxREVIEW PPT.pptx
REVIEW PPT.pptx
 
Natural Language Processing with Graphs
Natural Language Processing with GraphsNatural Language Processing with Graphs
Natural Language Processing with Graphs
 

Kürzlich hochgeladen

Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
AroojKhan71
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
amitlee9823
 

Kürzlich hochgeladen (20)

Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 

Can Deep Learning solve the Sentiment Analysis Problem

  • 1. Can Deep Learning solve the Sentiment Analysis Problem? Mark CieliebakZurichUniversity ofApplied Sciences Annual Meeting ofSGAICO –Swiss Group forArtificialIntelligenceandCognitiveScience 18.11.2014
  • 2. Outline 1.What is sentiment analysis? 2.How good are "classical" approaches? 3.Does deep learning solve the problem? 18.11.2014 Mark Cieliebak 2
  • 3. About Me 18.11.2014 Mark Cieliebak 3 Mark Cieliebak Institute of Applied Information Technology (InIT) ZHAW, Winterthur Email: ciel@zhaw.ch, Website: www.zhaw.ch/~ciel Text Analytics Open Data Automated Test Generation Research Interests Software Engineering
  • 4. WhatisSentiment Analysis "… WiFiAnalytics isa freeAndroid appthatI find veryhandywhenitcomestotroubleshootingandmonitoringa homenetwork. "[1] 18.11.2014 Mark Cieliebak 4
  • 5. Sample Application: SocialMedia Monitoring Text AnalyticsComponents: •Find relevant documents •Hot topicAnalysis •Sentiment analysis 18.11.2014 Mark Cieliebak 5 [7]
  • 6. FlavoursofSentiment Analysis •DocumentBased •SentenceBased •Target-Specific •Rating Prediction 18.11.2014 Mark Cieliebak 6
  • 7. Classic ApproachestoSentiment Analysis Rule-Based Corpus-Based 18.11.2014 Mark Cieliebak 7 Predicted Label [3] [4]
  • 8. Simple Sentiment Analysis Idea: Count numberofpositive andnegative words "This cameraisgreat[+1]." +1 (pos) "I find itbeautiful[+1]andgood[+1]." +2 (pos) "Itlooksterrible[-1]." -1 (neg) "This carhasa bluecolor." 0 (neu) POSITIVE: great love nice ... NEUTRAL: hello see I … NEGATIVE: bad hate ugly ... UseSentiment-Dictionary: 18.11.2014 Mark Cieliebak 8
  • 9. Sample Rules 18.11.2014 Mark Cieliebak 9 •DetectBooster Words: "The carisreallyveryexpensive[-1 -1 -2]." •New Category"Mixed": "This carhasan appealing[+1]design andcomfortable[+1]seats, but itisexpensive[-1]." •Negation: Invertonlyscore ofwordsoccuringafter thenegation: "The carisappealing[+3]andI do not[*-1]find itexpensive[-2]" •I do notfind thecarexpensiveanditisappealing. Need to“understand” thesentence
  • 10. Linguistic Analysis -> RULE: Invertscoresofwordsbeingin thesame phrasesasnegation. “I do not find thecarexpensive[+2] anditisappealing[+3].” → +5 (pos) Sentence Sentence Conj. Sentence NounPhrase Verb Phrase Verb Adverb Verb Noun Phrase Adj. Noun Phrase Verb Phrase Det. Det Noun Det. Verb Participle I do not find the car expensive and it is appealing 18.11.2014 Mark Cieliebak 10
  • 11. Rule-BasedSentiment Analysis Most ImportantIssues: -Requiresgoodhand-craftedrules -Hard totransfertonewtasksorlanguages -Doesnot workwellfortextswithbadgrammer(Twitter) 18.11.2014 Mark Cieliebak 11 [5]
  • 12. Classic ApproachestoSentiment Analysis Rule-Based Corpus-Based 18.11.2014 Mark Cieliebak 12 Predicted Label [3] [4]
  • 13. Corpus-BasedSentiment Analysis 18.11.2014 Mark Cieliebak 13 Predicted Label [4]
  • 14. Corpus-BasedSentiment Analysis AnnotatedCorpus Sentence Polarity This analysis is good. Pos It looks awful. Neg This car has a blue color. Neu This car has an appealing design, comfortable seats, but it is expensive. Mix This carhasa veryappealingdesign, comfortableseats, but itisreallyexpensive. Mix This analysis is not good. Neg This car has an appealing design, comfortable seats and it is not expensive. Mix This movie was like a horror event. Neg This carisappealingandisnot expensive. Mix ... ... 18.11.2014 Mark Cieliebak 14
  • 15. Sample Features forTweets •Word ngrams:presence or absence of contiguous sequences of 1, 2, 3, and 4 tokens; noncontiguous ngrams •POS: the number of occurrences of each part-of-speechtag •SentimentLexica: eachwordannotatedwithtonalityscore (-1..0..+1) •Negation: the number of negated contexts •Punctuation: the number of contiguous sequences of exclamation marks, question marks, and both exclamation and question marks •Emoticons: presenceorabsence, last token is a positive or negative emoticon; •Hashtags: the number of hashtags; •Elongatedwords: the number of words with one character repeated (e.g. ‘soooo’) from: Mohammad et al., SemEval2013 18.11.2014 Mark Cieliebak 15
  • 16. Corpus-BasedSentiment Analysis Most ImportantIssues: -Requireslarge annotatedcorpora -Dependson goodfeatures 18.11.2014 Mark Cieliebak 16 [6]
  • 17. HowgoodareSentiment Analysis Tools? 18.11.2014 Mark Cieliebak 17
  • 18. Quick Poll •Short texts: 1-2 sentencesfromTwitter, news, reviewsetc. •Three-classclassification: positive, negative, other •Accuracy= #푐표푟푟푒푐푡푑표푐푠 #푑표푐푠 Mark Cieliebak 21 Accuracy Votes <50% 50-60% 60-70% 70-80% 80-90% >90% "Howgoodarestate-of-the-art sentimentanalysistools?" 18.11.2014
  • 19. Tool Accuracy 0,2 0,3 0,4 0,5 0,6 0,7 0,8 Accuracy Best Tool per Corpus Worst Tool per Corpus 22 61% 40% Avg. 18.11.2014 Mark Cieliebak [14]
  • 20. Tool Accuracy 0,2 0,3 0,4 0,5 0,6 0,7 0,8 Accuracy Best Tool per Corpus Worst Tool per Corpus Overall Best Tool 23 61% 40% 59% Avg. 18.11.2014 Mark Cieliebak
  • 21. Take-Home Lesson Accuracyofbestcommercialtoolon arbitraryshorttextsis59% 18.11.2014 Mark Cieliebak 24
  • 22. ApproachestoSentiment Analysis Rule-Based Corpus-Based 18.11.2014 Mark Cieliebak 25 Predicted Label [9] DeepLearning [8]
  • 23. DeepLearning on Text It'sall aboutWord Vectors! 18.11.2014 Mark Cieliebak 26
  • 24. Word2Vec •Hugesetoftextsamples(billionsofwords) •Extractdictionary •Word-Matrix: k-dimensional vectorforeachword(k typically50-500) •Word vectorinitializedrandomly •Train wordvectorstopredictnextwords, givena sequenceofwordsfromsample text 18.11.2014 Mark Cieliebak 27 Major contributionsbyBengioet al. 2003, Collobert&Weston2008, Socher et al. 2011, Mikolovet al. 2013 [9]
  • 25. The Magic ofWord Vectors 18.11.2014 Mark Cieliebak 28 King -Man + Woman≈ Queen Live Demo on 100b wordsfromGoogle News dataset: http://radimrehurek.com/2014/02/word2vec-tutorial/ [10]
  • 26. Relations LearnedbyWord2Vec 18.11.2014 Mark Cieliebak 29 [11]
  • 27. UsingWord Vectorsin NLP 18.11.2014 Mark Cieliebak 30 Collobertet al., 2011: •SENNA: GenericNLP System basedon wordvectors •Nomanualfeatureengineering •SolvesmanyNLP-Tasks asgoodasbenchmarksystems [12]
  • 28. DeepLearning andSentiment Maas et al., 2011 •Enrichwordvectorswithsentimentcontext •Capture semanticofwords(unsupervised) andsentiment(supervised) in parallel, usingmultiple learningtasks wonderful amazing terrible awful 18.11.2014 Mark Cieliebak 31
  • 29. DeepLearning andSentiment Socher et al. 2013: •Word Vectorsdo not helpforSentiment Analysis •RecursiveNeuralTensor Networks •Representingsentencestructuresastreeswhileaddingsentimentannotationsat same time •Restrictedtosingle, well-structuredsentences • 18.11.2014 Mark Cieliebak 32 [13]
  • 30. DeepLearning andSentiment QuocandMikolov, 2014: •"Paragraph Vectors" •Add context(sentence, paragraph, document) towordvectorsduringtraining •Improvesmanyexistingapproaches 18.11.2014 Mark Cieliebak 33 [9]
  • 31. DoesDeepLearning solvethe Sentiment Analysis Problem? 18.11.2014 Mark Cieliebak 34
  • 32. Conclusion: DeepLearning forSentiment •Small improvements, not revolution •Veryrecentresearch, not yet"end ofthestory" •SemEval2015 will bebenchmark 18.11.2014 Mark Cieliebak 35
  • 33. Talk in Short! 1.Classic approachesarerule-basedorcorpus-based 2.State-of-the-art toolsclassify4 out of10 docswrong 3.DeepLearning doesnot needhand-craftedfeatures 4.DeepLearning improvesexistingbenchmarks 18.11.2014 Mark Cieliebak 36
  • 34. ThankYou! Mark Cieliebak ZurichUniversity ofApplied Sciences(ZHAW) Winterthur, Switzerland Email: ciel@zhaw.ch, Website: www.zhaw.ch/~ciel 18.11.2014 Mark Cieliebak 37 [15]