SlideShare ist ein Scribd-Unternehmen logo
1 von 17
“Information about people’s opinions plays important role for more accurate
decision making in a number of domains. There are number of opinion-rich
resources available on the internet e.g. personal blogs, reviews, specialized
websites which helps in understanding the opinions of others.”
Yenkar, Pranali Sawarkar, SD - A conceptual study of Sentiment Mining, 2012
1
2
“The term opinion mining appears in a paper by Dave et al. that was published in the
proceedings of the 2003 WWW conference; the publication venue may explain the popularity
of the term within communities strongly associated with Web search or information
retrieval. According to Dave et al., the ideal opinion-mining tool would “process a set of
search results for a given item, generating a list of product attributes (quality, features, etc.)
and aggregating opinions about each of them (poor, mixed, good)”. Much of the subsequent
research self-identified as opinion mining fits this description in its emphasis on extracting
and analyzing judgments on various aspects of given items. However, the term has recently
also been interpreted more broadly to include many different types of analysis of evaluative
text.
The history of the phrase sentiment analysis parallels that of “opinion mining” in certain
respects. The term “sentiment” used in reference to the automatic analysis of evaluative text
and tracking of the predictive judgments therein appears in 2001 papers by Das and Chen
[66] and Tong [297], due to these authors’ interest in analyzing market sentiment. It
subsequently occurred within 2002 papers by Turney [299] and Pang et al. [235], which were
published in the proceedings of the annual meeting of the Association for Computational
Linguistics (ACL) and the annual conference on Empirical Methods in Natural Language
Processing (EMNLP). Moreover, Nasukawa and Yi [221] entitled their 2003 paper, “Sentiment
analysis: Capturing favorability using natural language processing”, and a paper in the same
year byYi et al. [324]was named “Sentiment Analyzer: Extracting sentiments about a given
topic using natural language processing techniques”. These events together may explain the
popularity of “sentiment analysis” among communities self-identified as focused on NLP. A
sizeable number of papers mentioning “sentiment analysis” focus on the specific application
of classifying reviews as to their polarity (either positive or negative), a fact that appears to
have caused some authors to suggest that the phrase refers specifically to this narrowly
defined task. However, nowadays many construe the term more broadly to mean the
computational treatment of opinion, sentiment, and subjectivity in text.”
B.Bang, L. Lee - Opinion Mining and Sentiment Analysis 2008
3
References:
•2001 Sanjiv Das and Mike Chen. Yahoo! for Amazon: Extracting market sentiment
from stock message boards. In Proceedings of the Asia Pacific Finance Association
Annual Conference (APFA), 2001.
•Richard M. Tong. An operational system for detecting and tracking opinions in on-
line discussion. In Proceedings of the Workshop on Operational Text Classification
(OTC), 2001.
•Satoshi Morinaga, Kenji Yamanishi, Kenji Tateishi, and Toshikazu Fukushima. Mining
product reputations on the web. In Proceedings of the ACM SIGKDD Conference on
Knowledge Discovery and Data Mining (KDD), pages 341–349, 2002. Industry track.
•Kushal Dave, Steve Lawrence, and David M. Pennock. Mining the peanut gallery:
Opinion extraction and semantic classification of product reviews. In Proceedings of
WWW, pages 519–528, 2003.
•Tetsuya Nasukawa and Jeonghee Yi. Sentiment analysis: Capturing favorability using
natural language processing. In Proceedings of the Conference on Knowledge Capture
(K-CAP), 2003.
4
Themoodmap.uk
•real time
•uses ANEW in a Python implementation
•green is happy, red is sad tweet
5
Twittermood.org
This is a quite advertised application that shows a real time Twitter mood map, this
time for US. As the previous Twitter UK mood map, this application is also based on
the ANEW word list. The algorithm does not use simple average calculations over the
sentiments of a sentence but works with relative fractions. An example of how this
algorithm works is found in the research paper of O’Connor and Balasubramanyan.
To determine the emotion of a sentence they take the number of positive words in
that sentence and divide it with the total number of words in the sentence. The real
time mood map displays dark blue and orange circles. Dark blue indicate negative
posts and orange circles indicate positive posts. The diameter of the circles
emphasize the number of posts of the area. On this mood map one cannot read the
actually posted tweets.
• B. O’Connor and R. Balasubramanyan. From Tweets to Polls: Linking Text Sentiment
to Public Opinion Time Series. ICWSM, 11:122–129, 2010.
6
7
Machine Learning is everywhere now in our life:
• simplest example is Amazon and the product
recommendations it shows you - it learns from your previous
bought products and recommends new items
• moreover - everything which has to do with lots and lots of
data, a machine can try to make sense of it based on what
answers we need: learn about disease in medical records,
customer behavior in web click data and so on…
• a machine learning can be used to learn to recognize your
handwriting on an envelope
• text classification
8
ANEW has around 2500 English rated words
9
1- 4 negative , 4-6, neutral, 6-9 positive
10
Themoodmap.uk
This real time mood map for UK is based on Twitter posts. You can watch how
Tweeter users feel at this right moment and also see what they post. Taking a look in
the background the algorithm is based on ANEW word list and is implemented in
Python. Taken as example the following sentence:
I have an example sentence have.
We can separate the sentence into words like this [i, have, an, example, sentence,
have]
such that its tokens list would contain [(I,1),(have,2),
(an,1),(example,1),(sentence,1))].
The actual rating of the sentence is done according to the following equation applying
this for all tokens in a repetitive cycle:
emotion = emotion + valence(of word) * number of times it appears
count = count +number of times the words appears sentence
emotion = emotion / count
From a range of red to dark green it displays on the UK map the negative or positive
tweets. Through clicking a point one can also see the live tweet just posted.
• P. Patel and D. Saul. Mood map python code. URL https://github.com/
priyeshpatel/moodmap/blob/master/moodmap/sentiment.py. Accessed:
2014-02-16.
11
What does a bad road mean?
How can we rate the sentence/text?
12
Davidov, Dmitry, Oren Tsur, and Ari Rappoport. "Semi-supervised recognition of
sarcastic sentences in twitter and amazon." Proceedings of the Fourteenth
Conference on Computational Natural Language Learning. Association for
Computational Linguistics, 2010.
http://aclweb.org/anthology//W/W10/W10-2914.pdf
1. “thank you Janet Jackson for yet another year of Super Bowl classic rock!” (Twitter)
2. “He’s with his other woman: XBox 360. It’s 4:30 fool. Sure I can sleep through the
gunfire” (Twitter)
3. “Wow GPRS data speeds are blazing fast.” (Twitter)
4. “[I] Love The Cover” (book, amazon)
5. “Defective by design” (music player, amazon)
13
14
• Precision - how right are we?
• Recall - did we find all relevant solutions?
• Accuracy - % of true or correct responses from the total.
Wikipedia : http://en.wikipedia.org/wiki/Precision_and_recall
15
16
17

Weitere ähnliche Inhalte

Was ist angesagt?

Sentiment Analysis Using Twitter
Sentiment Analysis Using TwitterSentiment Analysis Using Twitter
Sentiment Analysis Using Twitter
piya chauhan
 
SENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATASENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATA
Parvathy Devaraj
 

Was ist angesagt? (20)

Social media & sentiment analysis splunk conf2012
Social media & sentiment analysis   splunk conf2012Social media & sentiment analysis   splunk conf2012
Social media & sentiment analysis splunk conf2012
 
Sentiment Analysis Using Twitter
Sentiment Analysis Using TwitterSentiment Analysis Using Twitter
Sentiment Analysis Using Twitter
 
SentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and TweetsSentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and Tweets
 
IRE2014-Sentiment Analysis
IRE2014-Sentiment AnalysisIRE2014-Sentiment Analysis
IRE2014-Sentiment Analysis
 
These slides cover the final defense presentation for my Doctorate degree. Th...
These slides cover the final defense presentation for my Doctorate degree. Th...These slides cover the final defense presentation for my Doctorate degree. Th...
These slides cover the final defense presentation for my Doctorate degree. Th...
 
Twitter Analytics
Twitter AnalyticsTwitter Analytics
Twitter Analytics
 
Best Practices for Sentiment Analysis Webinar
Best Practices for Sentiment Analysis Webinar Best Practices for Sentiment Analysis Webinar
Best Practices for Sentiment Analysis Webinar
 
Twitter Sentiment & Investing - modeling stock price movements with twitter s...
Twitter Sentiment & Investing - modeling stock price movements with twitter s...Twitter Sentiment & Investing - modeling stock price movements with twitter s...
Twitter Sentiment & Investing - modeling stock price movements with twitter s...
 
757
757757
757
 
A scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysisA scalable, lexicon based technique for sentiment analysis
A scalable, lexicon based technique for sentiment analysis
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
SENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATASENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATA
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter Data
 
Pydata Taipei 2020
Pydata Taipei 2020Pydata Taipei 2020
Pydata Taipei 2020
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse AnalysisSentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
 
Sentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use casesSentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use cases
 
Who gives a tweet
Who gives a tweetWho gives a tweet
Who gives a tweet
 
A Survey Of Collaborative Filtering Techniques
A Survey Of Collaborative Filtering TechniquesA Survey Of Collaborative Filtering Techniques
A Survey Of Collaborative Filtering Techniques
 
FAKE NEWS DETECTION WITH SEMANTIC FEATURES AND TEXT MINING
FAKE NEWS DETECTION WITH SEMANTIC FEATURES AND TEXT MININGFAKE NEWS DETECTION WITH SEMANTIC FEATURES AND TEXT MINING
FAKE NEWS DETECTION WITH SEMANTIC FEATURES AND TEXT MINING
 

Ähnlich wie Intro to sentiment analysis

Sentiment Analysis on Twitter Dataset using R Language
Sentiment Analysis on Twitter Dataset using R LanguageSentiment Analysis on Twitter Dataset using R Language
Sentiment Analysis on Twitter Dataset using R Language
ijtsrd
 
Dictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A ReviewDictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A Review
INFOGAIN PUBLICATION
 
Sentiment of Sentence in Tweets: A Review
Sentiment of Sentence in Tweets: A ReviewSentiment of Sentence in Tweets: A Review
Sentiment of Sentence in Tweets: A Review
iosrjce
 

Ähnlich wie Intro to sentiment analysis (20)

Sentiment Analysis on Twitter Dataset using R Language
Sentiment Analysis on Twitter Dataset using R LanguageSentiment Analysis on Twitter Dataset using R Language
Sentiment Analysis on Twitter Dataset using R Language
 
Dictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A ReviewDictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A Review
 
Comparative Study on Lexicon-based sentiment analysers over Negative sentiment
Comparative Study on Lexicon-based sentiment analysers over Negative sentimentComparative Study on Lexicon-based sentiment analysers over Negative sentiment
Comparative Study on Lexicon-based sentiment analysers over Negative sentiment
 
Automatic Movie Rating By Using Twitter Sentiment Analysis And Monitoring Tool
Automatic Movie Rating By Using Twitter Sentiment Analysis And Monitoring ToolAutomatic Movie Rating By Using Twitter Sentiment Analysis And Monitoring Tool
Automatic Movie Rating By Using Twitter Sentiment Analysis And Monitoring Tool
 
Sub1557
Sub1557Sub1557
Sub1557
 
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEWSENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
 
Aspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel ReviewsAspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel Reviews
 
A NOVEL APPROACH FOR TWITTER SENTIMENT ANALYSIS USING HYBRID CLASSIFIER
A NOVEL APPROACH FOR TWITTER SENTIMENT ANALYSIS USING HYBRID CLASSIFIERA NOVEL APPROACH FOR TWITTER SENTIMENT ANALYSIS USING HYBRID CLASSIFIER
A NOVEL APPROACH FOR TWITTER SENTIMENT ANALYSIS USING HYBRID CLASSIFIER
 
Ijetcas14 580
Ijetcas14 580Ijetcas14 580
Ijetcas14 580
 
Sentiment of Sentence in Tweets: A Review
Sentiment of Sentence in Tweets: A ReviewSentiment of Sentence in Tweets: A Review
Sentiment of Sentence in Tweets: A Review
 
W01761157162
W01761157162W01761157162
W01761157162
 
A Novel Voice Based Sentimental Analysis Technique to Mine the User Driven Re...
A Novel Voice Based Sentimental Analysis Technique to Mine the User Driven Re...A Novel Voice Based Sentimental Analysis Technique to Mine the User Driven Re...
A Novel Voice Based Sentimental Analysis Technique to Mine the User Driven Re...
 
Measuring human and Vader performance on sentiment analysis
Measuring human and Vader performance on sentiment analysisMeasuring human and Vader performance on sentiment analysis
Measuring human and Vader performance on sentiment analysis
 
Opinion Mining Techniques for Non-English Languages: An Overview
Opinion Mining Techniques for Non-English Languages: An OverviewOpinion Mining Techniques for Non-English Languages: An Overview
Opinion Mining Techniques for Non-English Languages: An Overview
 
Web Opinion Mining
Web Opinion MiningWeb Opinion Mining
Web Opinion Mining
 
Ijmet 10 01_094
Ijmet 10 01_094Ijmet 10 01_094
Ijmet 10 01_094
 
Ijcatr04061001
Ijcatr04061001Ijcatr04061001
Ijcatr04061001
 
An Approach To Sentiment Analysis
An Approach To Sentiment AnalysisAn Approach To Sentiment Analysis
An Approach To Sentiment Analysis
 
Sentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data MiningSentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data Mining
 
Monitoring opinion on esop through social media and clustering its polarity
Monitoring opinion on esop through social media and clustering its polarityMonitoring opinion on esop through social media and clustering its polarity
Monitoring opinion on esop through social media and clustering its polarity
 

Kürzlich hochgeladen

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
MateoGardella
 

Kürzlich hochgeladen (20)

Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 

Intro to sentiment analysis

  • 1. “Information about people’s opinions plays important role for more accurate decision making in a number of domains. There are number of opinion-rich resources available on the internet e.g. personal blogs, reviews, specialized websites which helps in understanding the opinions of others.” Yenkar, Pranali Sawarkar, SD - A conceptual study of Sentiment Mining, 2012 1
  • 2. 2
  • 3. “The term opinion mining appears in a paper by Dave et al. that was published in the proceedings of the 2003 WWW conference; the publication venue may explain the popularity of the term within communities strongly associated with Web search or information retrieval. According to Dave et al., the ideal opinion-mining tool would “process a set of search results for a given item, generating a list of product attributes (quality, features, etc.) and aggregating opinions about each of them (poor, mixed, good)”. Much of the subsequent research self-identified as opinion mining fits this description in its emphasis on extracting and analyzing judgments on various aspects of given items. However, the term has recently also been interpreted more broadly to include many different types of analysis of evaluative text. The history of the phrase sentiment analysis parallels that of “opinion mining” in certain respects. The term “sentiment” used in reference to the automatic analysis of evaluative text and tracking of the predictive judgments therein appears in 2001 papers by Das and Chen [66] and Tong [297], due to these authors’ interest in analyzing market sentiment. It subsequently occurred within 2002 papers by Turney [299] and Pang et al. [235], which were published in the proceedings of the annual meeting of the Association for Computational Linguistics (ACL) and the annual conference on Empirical Methods in Natural Language Processing (EMNLP). Moreover, Nasukawa and Yi [221] entitled their 2003 paper, “Sentiment analysis: Capturing favorability using natural language processing”, and a paper in the same year byYi et al. [324]was named “Sentiment Analyzer: Extracting sentiments about a given topic using natural language processing techniques”. These events together may explain the popularity of “sentiment analysis” among communities self-identified as focused on NLP. A sizeable number of papers mentioning “sentiment analysis” focus on the specific application of classifying reviews as to their polarity (either positive or negative), a fact that appears to have caused some authors to suggest that the phrase refers specifically to this narrowly defined task. However, nowadays many construe the term more broadly to mean the computational treatment of opinion, sentiment, and subjectivity in text.” B.Bang, L. Lee - Opinion Mining and Sentiment Analysis 2008 3
  • 4. References: •2001 Sanjiv Das and Mike Chen. Yahoo! for Amazon: Extracting market sentiment from stock message boards. In Proceedings of the Asia Pacific Finance Association Annual Conference (APFA), 2001. •Richard M. Tong. An operational system for detecting and tracking opinions in on- line discussion. In Proceedings of the Workshop on Operational Text Classification (OTC), 2001. •Satoshi Morinaga, Kenji Yamanishi, Kenji Tateishi, and Toshikazu Fukushima. Mining product reputations on the web. In Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), pages 341–349, 2002. Industry track. •Kushal Dave, Steve Lawrence, and David M. Pennock. Mining the peanut gallery: Opinion extraction and semantic classification of product reviews. In Proceedings of WWW, pages 519–528, 2003. •Tetsuya Nasukawa and Jeonghee Yi. Sentiment analysis: Capturing favorability using natural language processing. In Proceedings of the Conference on Knowledge Capture (K-CAP), 2003. 4
  • 5. Themoodmap.uk •real time •uses ANEW in a Python implementation •green is happy, red is sad tweet 5
  • 6. Twittermood.org This is a quite advertised application that shows a real time Twitter mood map, this time for US. As the previous Twitter UK mood map, this application is also based on the ANEW word list. The algorithm does not use simple average calculations over the sentiments of a sentence but works with relative fractions. An example of how this algorithm works is found in the research paper of O’Connor and Balasubramanyan. To determine the emotion of a sentence they take the number of positive words in that sentence and divide it with the total number of words in the sentence. The real time mood map displays dark blue and orange circles. Dark blue indicate negative posts and orange circles indicate positive posts. The diameter of the circles emphasize the number of posts of the area. On this mood map one cannot read the actually posted tweets. • B. O’Connor and R. Balasubramanyan. From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series. ICWSM, 11:122–129, 2010. 6
  • 7. 7
  • 8. Machine Learning is everywhere now in our life: • simplest example is Amazon and the product recommendations it shows you - it learns from your previous bought products and recommends new items • moreover - everything which has to do with lots and lots of data, a machine can try to make sense of it based on what answers we need: learn about disease in medical records, customer behavior in web click data and so on… • a machine learning can be used to learn to recognize your handwriting on an envelope • text classification 8
  • 9. ANEW has around 2500 English rated words 9
  • 10. 1- 4 negative , 4-6, neutral, 6-9 positive 10
  • 11. Themoodmap.uk This real time mood map for UK is based on Twitter posts. You can watch how Tweeter users feel at this right moment and also see what they post. Taking a look in the background the algorithm is based on ANEW word list and is implemented in Python. Taken as example the following sentence: I have an example sentence have. We can separate the sentence into words like this [i, have, an, example, sentence, have] such that its tokens list would contain [(I,1),(have,2), (an,1),(example,1),(sentence,1))]. The actual rating of the sentence is done according to the following equation applying this for all tokens in a repetitive cycle: emotion = emotion + valence(of word) * number of times it appears count = count +number of times the words appears sentence emotion = emotion / count From a range of red to dark green it displays on the UK map the negative or positive tweets. Through clicking a point one can also see the live tweet just posted. • P. Patel and D. Saul. Mood map python code. URL https://github.com/ priyeshpatel/moodmap/blob/master/moodmap/sentiment.py. Accessed: 2014-02-16. 11
  • 12. What does a bad road mean? How can we rate the sentence/text? 12
  • 13. Davidov, Dmitry, Oren Tsur, and Ari Rappoport. "Semi-supervised recognition of sarcastic sentences in twitter and amazon." Proceedings of the Fourteenth Conference on Computational Natural Language Learning. Association for Computational Linguistics, 2010. http://aclweb.org/anthology//W/W10/W10-2914.pdf 1. “thank you Janet Jackson for yet another year of Super Bowl classic rock!” (Twitter) 2. “He’s with his other woman: XBox 360. It’s 4:30 fool. Sure I can sleep through the gunfire” (Twitter) 3. “Wow GPRS data speeds are blazing fast.” (Twitter) 4. “[I] Love The Cover” (book, amazon) 5. “Defective by design” (music player, amazon) 13
  • 14. 14
  • 15. • Precision - how right are we? • Recall - did we find all relevant solutions? • Accuracy - % of true or correct responses from the total. Wikipedia : http://en.wikipedia.org/wiki/Precision_and_recall 15
  • 16. 16
  • 17. 17