SlideShare ist ein Scribd-Unternehmen logo
1 von 28
Downloaden Sie, um offline zu lesen
Taras Zagibalov
             T.Zagibalov@sussex.ac.uk


    PhD candidate at University of Sussex
                Brighton, UK
Ford Foundation International Fellowship fellow
Natural languages: Russian, English, Mandarin
          Programming: Java, Prolog


                  Taras Zagibalov© 2009
Unsupervised Sentiment Analysis


     Listening to the Word of Mouth



                                          What is it?
                                  How does it work?
                                 How can it be used?


               Taras Zagibalov© 2009
Outline

   What is Sentiment Analysis
   Application of Sentiment Analysis
   Who's in the business?
   Unsolved Problems
   Why unsupervised?
   Is it effective?



                       Taras Zagibalov© 2009
Sentiment Analysis



  Sentiment Analysis (or Opinion Mining) is a
  relatively new research area in Information
 Retrieval and Natural Language Processing,
which is concerned not with a document's topic,
      but with what opinion it expresses



                  Taras Zagibalov© 2009
What is Sentiment Analysis

     Subjectivity Classification
     Orientation Detection
     Opinion Holder and Target Extraction
     Feature-Based Opinion Mining




                 Taras Zagibalov© 2009
What is Sentiment Analysis

     Subjectivity Classification
     Orientation Detection
     Opinion Holder and Target Extraction
     quot;Feature-Based Opinion Miningquot;


               A car has four wheels.
                                 vs
                    It's a good car.

                Taras Zagibalov© 2009
What is Sentiment Analysis

     Subjectivity Classification
     Orientation Detection
     Opinion Holder and Target Extraction
     quot;Feature-Based Opinion Miningquot;


                     It's a good car.
                                  vs
                       It's a bad car.

                 Taras Zagibalov© 2009
What is Sentiment Analysis

     Subjectivity Classification
     Orientation Detection
     Opinion Holder and Target Extraction
     quot;Feature-Based Opinion Miningquot;


               Ian says it's a good car.




                 Taras Zagibalov© 2009
What is Sentiment Analysis

     Subjectivity Classification
     Orientation Detection
     Opinion Holder and Target Extraction
     quot;Feature-Based Opinion Miningquot;


      The wheels are good, but all the rest is just
       unusable.


                 Taras Zagibalov© 2009
Application of Sentiment
         Analysis

     Where opinions can be found?

    News feeds (Google, Yahoo, Reuters etc)
    Blogs (LJ, Technorati etc)
    Social Networks (Twitter, Facebook...)
    Customer review sites (Amazon, eBay...)




               Taras Zagibalov© 2009
Application of Sentiment
         Analysis

    Marketing Research
            Product Reviews Analysis
            Brand Tracking
            Influence Analysis
    Public Opinion Tracking
    Customer correspondence analysis




                 Taras Zagibalov© 2009
Application of Sentiment
         Analysis

     What questions can be answered by
      Sentiment analysis system?
    What do customers think about our product?
    Which of our customers are unsatisfied?
    What features of our product are the worst?
    Who and how influences our image?
    What is public reaction to (some event or
      some person)?
    and so on...
               Taras Zagibalov© 2009
Example 1

On-line (blogs, mass-media) monitoring of a product
promotion campaigns
   10

    9

    8

    7

    6

    5

    4

    3

    2

    1

    0
                    A                           B


Promotional campaign A is successful as most of on-line
reviews are positive.
Promotional campaign B needs immediate actions as most of
on-line reviews are negative.
                        Taras Zagibalov© 2009
Example 2

New product release as it mirrored in customer on-line
reviews
   8

   7

   6

   5

   4

   3

   2

   1

   0
                    A                           B




(A) Product release and add campaign is quite effective as
public opinion is mostly positive. But the sentiment changes as
sales grow (B), more people are unsatisfied and it needs to be
analysed (probably some quality-related issues)
                        Taras Zagibalov© 2009
Example 3

Influence analysis by tracking blogs
   9

   8

   7

   6

   5

   4

   3

   2

   1

   0
                    A                           B



(A) Negative review in a newspaper does not affect a generally
positive sentiment towards a product, although a positive
review in a magazine (B) is quite effective.


                        Taras Zagibalov© 2009
Who's in the business?

    BrandWatch
    Istrategy Labs
    Cataphora
    Scoutlabs
    Lexalytics
    Infonic
    Attensity
    Open Dover
    ...          Taras Zagibalov© 2009
What's the technology?

   Machine Learning
       Manually tagged training data sets
       User-tagged training data sets (“thumbs up” and the
        “ five stars”)
   Knowledge-based Approaches
       Manually created word-lists
       Generic word-lists (like SentiWordNet or sentiment
        vocabularies)
   Manual Processing
                         Taras Zagibalov© 2009
Unsolved Problems

   Domain-dependency
   Unpredictable evaluation language
   Language-dependency




                     Taras Zagibalov© 2009
Unsolved Problems

   Domain-dependency
   Unpredictable evaluation language
   Language-dependency


      quot;The plot was unpredictablequot;
      vs
      quot;the steering was unpredictablequot;


                       Taras Zagibalov© 2009
Unsolved Problems

   Domain-dependency
   Unpredictable evaluation language
   Language-dependency


    “good” == “bad” in eBay
    “3G” (technology for mobile phones) == “good”



                        Taras Zagibalov© 2009
Unsolved Problems

   Domain-dependency
   Unpredictable evaluation language
   Language-dependency


    Culture-related issues (“good” <> “ 好” )
    Language-related issues (SVO vs SOV)



                      Taras Zagibalov© 2009
Why unsupervised?

   Cross-Domain applicability
   Multi-Lingual applicability
   Cheap Start




               Taras Zagibalov© 2009
Why unsupervised?

   Cross-Domain applicability
   Multi-Lingual applicability
   Cheap Start

     No expensive human annotation needed:
     all information is found in the documents
     which needed to be processed.
     All extracted information is domain-
     specific and free from noise produced by
     “generic” word lists and wordnets.
               Taras Zagibalov© 2009
Why unsupervised?

   Cross-Domain applicability
   Multi-Lingual applicability
   Cheap Start

     Unsupervised systems, being data-
     independent, can be easily ported to
     almost any language.




              Taras Zagibalov© 2009
Why unsupervised?

   Cross-Domain applicability
   Multi-Lingual applicability
   Cheap Start

     Once an unsupervised system is
     developed it can be applied to new data
     almost immediately saving costs of data
     labelling and/or rules (word-lists) writing
     up.

               Taras Zagibalov© 2009
Is it effective?

   The unsupervised approach was tested on
    different language corpora (English, Simplified
    Chinese, Traditional Chinese, Japanese) and in
    many cases compared reasonably well with
    supervised methods.
   Results were presented on some major
    international scientific conferences (ACL,
    IJCNLP, COLING, NTCIR).


                      Taras Zagibalov© 2009
Is it effective?

    The approach can be easily combined with
    supervised techniques:
   Unsupervised system can provide initial data
    for in-depth research of the data (building up
    word-lists and rule-sets)
   Automatically extracted information can be
    used for training machine learning systems.



                      Taras Zagibalov© 2009
Conclusion

   Unsupervised Sentiment Analysis is an efficient
    instument of keeping track of public opinion in
    different domains and languages.
   It can be used as an entry point to a new
    domain or language.
   It can be combined with supervised methods to
    increase accuracy.



                      Taras Zagibalov© 2009

Weitere ähnliche Inhalte

Andere mochten auch

Sentiment Analysis via R Programming
Sentiment Analysis via R ProgrammingSentiment Analysis via R Programming
Sentiment Analysis via R Programming
Skillspeed
 
Sentiment analysis-by-nltk
Sentiment analysis-by-nltkSentiment analysis-by-nltk
Sentiment analysis-by-nltk
Wei-Ting Kuo
 
Sentiment analysis of tweets
Sentiment analysis of tweetsSentiment analysis of tweets
Sentiment analysis of tweets
Vasu Jain
 

Andere mochten auch (19)

SentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and TweetsSentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and Tweets
 
Best Practices for Sentiment Analysis Webinar
Best Practices for Sentiment Analysis Webinar Best Practices for Sentiment Analysis Webinar
Best Practices for Sentiment Analysis Webinar
 
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning AlgorithmsSentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
 
Sentiment Analysis with NVivo 11 Plus
Sentiment Analysis with NVivo 11 PlusSentiment Analysis with NVivo 11 Plus
Sentiment Analysis with NVivo 11 Plus
 
Practical Sentiment Analysis
Practical Sentiment AnalysisPractical Sentiment Analysis
Practical Sentiment Analysis
 
Can Deep Learning solve the Sentiment Analysis Problem
Can Deep Learning solve the Sentiment Analysis ProblemCan Deep Learning solve the Sentiment Analysis Problem
Can Deep Learning solve the Sentiment Analysis Problem
 
Sentiment Analysis via R Programming
Sentiment Analysis via R ProgrammingSentiment Analysis via R Programming
Sentiment Analysis via R Programming
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier
 
Sentiment analysis-by-nltk
Sentiment analysis-by-nltkSentiment analysis-by-nltk
Sentiment analysis-by-nltk
 
MTech Seminar Presentation [IIT-Bombay]
MTech Seminar Presentation [IIT-Bombay]MTech Seminar Presentation [IIT-Bombay]
MTech Seminar Presentation [IIT-Bombay]
 
Blockchain for Business
Blockchain for BusinessBlockchain for Business
Blockchain for Business
 
Blockchain: the solution for transparency in product supply chains
Blockchain: the solution for transparency in product supply chainsBlockchain: the solution for transparency in product supply chains
Blockchain: the solution for transparency in product supply chains
 
Blockchain in Supply Chain Management: Fad or Potential?
Blockchain in Supply Chain Management: Fad or Potential?Blockchain in Supply Chain Management: Fad or Potential?
Blockchain in Supply Chain Management: Fad or Potential?
 
Smart Cities are the Internet of Things
Smart Cities are the Internet of ThingsSmart Cities are the Internet of Things
Smart Cities are the Internet of Things
 
Internet of Things for Smart Cities
Internet of Things for Smart CitiesInternet of Things for Smart Cities
Internet of Things for Smart Cities
 
Green Computing
Green ComputingGreen Computing
Green Computing
 
Social media analysis in R using twitter API
Social media analysis in R using twitter API Social media analysis in R using twitter API
Social media analysis in R using twitter API
 
Sentiment analysis of tweets
Sentiment analysis of tweetsSentiment analysis of tweets
Sentiment analysis of tweets
 
Deep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingDeep Learning for Natural Language Processing
Deep Learning for Natural Language Processing
 

Ähnlich wie Unsupervised Sentiment Analysis

Competitive Intelligence
Competitive  IntelligenceCompetitive  Intelligence
Competitive Intelligence
Pallav L
 
Mind Of An Analyst- Jennifer Vessenmeyer
Mind Of An Analyst- Jennifer VessenmeyerMind Of An Analyst- Jennifer Vessenmeyer
Mind Of An Analyst- Jennifer Vessenmeyer
Online Marketing Summit
 
Laptop brand awareness
Laptop brand awarenessLaptop brand awareness
Laptop brand awareness
Prolay Ray
 
Online Community Best Practices Final
Online Community Best Practices FinalOnline Community Best Practices Final
Online Community Best Practices Final
Roger Pilney
 

Ähnlich wie Unsupervised Sentiment Analysis (20)

How to manage test automation complexity
How to manage test automation complexityHow to manage test automation complexity
How to manage test automation complexity
 
Competitive Intelligence
Competitive  IntelligenceCompetitive  Intelligence
Competitive Intelligence
 
Usability
UsabilityUsability
Usability
 
Michael Kowalski
Michael KowalskiMichael Kowalski
Michael Kowalski
 
State of Automatic Speech Recognition
State of Automatic Speech RecognitionState of Automatic Speech Recognition
State of Automatic Speech Recognition
 
User Testing For The Rest Of Us
User Testing For The Rest Of UsUser Testing For The Rest Of Us
User Testing For The Rest Of Us
 
Mind Of An Analyst- Jennifer Vessenmeyer
Mind Of An Analyst- Jennifer VessenmeyerMind Of An Analyst- Jennifer Vessenmeyer
Mind Of An Analyst- Jennifer Vessenmeyer
 
Site Search Analytics
Site Search Analytics Site Search Analytics
Site Search Analytics
 
Sprinkle on Just Enough Process
Sprinkle on Just Enough ProcessSprinkle on Just Enough Process
Sprinkle on Just Enough Process
 
What’s Your Leadership IQ?
What’s Your Leadership IQ?What’s Your Leadership IQ?
What’s Your Leadership IQ?
 
Programming trends
Programming trendsProgramming trends
Programming trends
 
Laptop brand awareness
Laptop brand awarenessLaptop brand awareness
Laptop brand awareness
 
IRJET- Fake Review Detection using Opinion Mining
IRJET- Fake Review Detection using Opinion MiningIRJET- Fake Review Detection using Opinion Mining
IRJET- Fake Review Detection using Opinion Mining
 
Localization of iPhone Apps
Localization of iPhone AppsLocalization of iPhone Apps
Localization of iPhone Apps
 
Online Community Best Practices Final
Online Community Best Practices FinalOnline Community Best Practices Final
Online Community Best Practices Final
 
Weak Supervision.pdf
Weak Supervision.pdfWeak Supervision.pdf
Weak Supervision.pdf
 
Why the h# should I use Appium with React Native
Why the h# should I use Appium with React NativeWhy the h# should I use Appium with React Native
Why the h# should I use Appium with React Native
 
Crawford Upa09 Consumer Health Information Seeking
Crawford Upa09 Consumer Health Information SeekingCrawford Upa09 Consumer Health Information Seeking
Crawford Upa09 Consumer Health Information Seeking
 
Software Development Quantified
Software Development QuantifiedSoftware Development Quantified
Software Development Quantified
 
7 Factors to Consider When Building Your Mobile Strategy
7 Factors to Consider When Building Your Mobile Strategy7 Factors to Consider When Building Your Mobile Strategy
7 Factors to Consider When Building Your Mobile Strategy
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Kürzlich hochgeladen (20)

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 

Unsupervised Sentiment Analysis

  • 1. Taras Zagibalov T.Zagibalov@sussex.ac.uk PhD candidate at University of Sussex Brighton, UK Ford Foundation International Fellowship fellow Natural languages: Russian, English, Mandarin Programming: Java, Prolog Taras Zagibalov© 2009
  • 2. Unsupervised Sentiment Analysis Listening to the Word of Mouth What is it? How does it work? How can it be used? Taras Zagibalov© 2009
  • 3. Outline  What is Sentiment Analysis  Application of Sentiment Analysis  Who's in the business?  Unsolved Problems  Why unsupervised?  Is it effective? Taras Zagibalov© 2009
  • 4. Sentiment Analysis Sentiment Analysis (or Opinion Mining) is a relatively new research area in Information Retrieval and Natural Language Processing, which is concerned not with a document's topic, but with what opinion it expresses Taras Zagibalov© 2009
  • 5. What is Sentiment Analysis  Subjectivity Classification  Orientation Detection  Opinion Holder and Target Extraction  Feature-Based Opinion Mining Taras Zagibalov© 2009
  • 6. What is Sentiment Analysis  Subjectivity Classification  Orientation Detection  Opinion Holder and Target Extraction  quot;Feature-Based Opinion Miningquot; A car has four wheels. vs It's a good car. Taras Zagibalov© 2009
  • 7. What is Sentiment Analysis  Subjectivity Classification  Orientation Detection  Opinion Holder and Target Extraction  quot;Feature-Based Opinion Miningquot; It's a good car. vs It's a bad car. Taras Zagibalov© 2009
  • 8. What is Sentiment Analysis  Subjectivity Classification  Orientation Detection  Opinion Holder and Target Extraction  quot;Feature-Based Opinion Miningquot; Ian says it's a good car. Taras Zagibalov© 2009
  • 9. What is Sentiment Analysis  Subjectivity Classification  Orientation Detection  Opinion Holder and Target Extraction  quot;Feature-Based Opinion Miningquot; The wheels are good, but all the rest is just unusable. Taras Zagibalov© 2009
  • 10. Application of Sentiment Analysis Where opinions can be found?  News feeds (Google, Yahoo, Reuters etc)  Blogs (LJ, Technorati etc)  Social Networks (Twitter, Facebook...)  Customer review sites (Amazon, eBay...) Taras Zagibalov© 2009
  • 11. Application of Sentiment Analysis  Marketing Research  Product Reviews Analysis  Brand Tracking  Influence Analysis  Public Opinion Tracking  Customer correspondence analysis Taras Zagibalov© 2009
  • 12. Application of Sentiment Analysis What questions can be answered by Sentiment analysis system?  What do customers think about our product?  Which of our customers are unsatisfied?  What features of our product are the worst?  Who and how influences our image?  What is public reaction to (some event or some person)?  and so on... Taras Zagibalov© 2009
  • 13. Example 1 On-line (blogs, mass-media) monitoring of a product promotion campaigns 10 9 8 7 6 5 4 3 2 1 0 A B Promotional campaign A is successful as most of on-line reviews are positive. Promotional campaign B needs immediate actions as most of on-line reviews are negative. Taras Zagibalov© 2009
  • 14. Example 2 New product release as it mirrored in customer on-line reviews 8 7 6 5 4 3 2 1 0 A B (A) Product release and add campaign is quite effective as public opinion is mostly positive. But the sentiment changes as sales grow (B), more people are unsatisfied and it needs to be analysed (probably some quality-related issues) Taras Zagibalov© 2009
  • 15. Example 3 Influence analysis by tracking blogs 9 8 7 6 5 4 3 2 1 0 A B (A) Negative review in a newspaper does not affect a generally positive sentiment towards a product, although a positive review in a magazine (B) is quite effective. Taras Zagibalov© 2009
  • 16. Who's in the business?  BrandWatch  Istrategy Labs  Cataphora  Scoutlabs  Lexalytics  Infonic  Attensity  Open Dover  ... Taras Zagibalov© 2009
  • 17. What's the technology?  Machine Learning  Manually tagged training data sets  User-tagged training data sets (“thumbs up” and the “ five stars”)  Knowledge-based Approaches  Manually created word-lists  Generic word-lists (like SentiWordNet or sentiment vocabularies)  Manual Processing Taras Zagibalov© 2009
  • 18. Unsolved Problems  Domain-dependency  Unpredictable evaluation language  Language-dependency Taras Zagibalov© 2009
  • 19. Unsolved Problems  Domain-dependency  Unpredictable evaluation language  Language-dependency quot;The plot was unpredictablequot; vs quot;the steering was unpredictablequot; Taras Zagibalov© 2009
  • 20. Unsolved Problems  Domain-dependency  Unpredictable evaluation language  Language-dependency “good” == “bad” in eBay “3G” (technology for mobile phones) == “good” Taras Zagibalov© 2009
  • 21. Unsolved Problems  Domain-dependency  Unpredictable evaluation language  Language-dependency Culture-related issues (“good” <> “ 好” ) Language-related issues (SVO vs SOV) Taras Zagibalov© 2009
  • 22. Why unsupervised?  Cross-Domain applicability  Multi-Lingual applicability  Cheap Start Taras Zagibalov© 2009
  • 23. Why unsupervised?  Cross-Domain applicability  Multi-Lingual applicability  Cheap Start No expensive human annotation needed: all information is found in the documents which needed to be processed. All extracted information is domain- specific and free from noise produced by “generic” word lists and wordnets. Taras Zagibalov© 2009
  • 24. Why unsupervised?  Cross-Domain applicability  Multi-Lingual applicability  Cheap Start Unsupervised systems, being data- independent, can be easily ported to almost any language. Taras Zagibalov© 2009
  • 25. Why unsupervised?  Cross-Domain applicability  Multi-Lingual applicability  Cheap Start Once an unsupervised system is developed it can be applied to new data almost immediately saving costs of data labelling and/or rules (word-lists) writing up. Taras Zagibalov© 2009
  • 26. Is it effective?  The unsupervised approach was tested on different language corpora (English, Simplified Chinese, Traditional Chinese, Japanese) and in many cases compared reasonably well with supervised methods.  Results were presented on some major international scientific conferences (ACL, IJCNLP, COLING, NTCIR). Taras Zagibalov© 2009
  • 27. Is it effective? The approach can be easily combined with supervised techniques:  Unsupervised system can provide initial data for in-depth research of the data (building up word-lists and rule-sets)  Automatically extracted information can be used for training machine learning systems. Taras Zagibalov© 2009
  • 28. Conclusion  Unsupervised Sentiment Analysis is an efficient instument of keeping track of public opinion in different domains and languages.  It can be used as an entry point to a new domain or language.  It can be combined with supervised methods to increase accuracy. Taras Zagibalov© 2009