SlideShare ist ein Scribd-Unternehmen logo
1 von 30
CLASSIFICATION OF USER REVIEW
FOR ELECTRONIC GADGETS
Name: Geetika Gautam(12303008)
Supervisor: Dr.Divakar Yadav
Presentation Outline
 Introduction
 Problem Statement
 Work Done
 Result Analysis
 Conclusion and Future Work
 References
 Publications
Introduction
 With the increase of contents on the Web covering lots of
areas which are growing exponentially in numbers as well as
in volumes as sites are dedicated to specific types of
products and they specialize in collecting users’ reviews
from various sites such as Amazon etc.
 Even Twitter is an area where the tweets convey opinions,
but trying to obtain the overall understanding of these
unstructured data (opinions) can be very time consuming.
 These unstructured data (opinions) on a particular site are
seen by the users and thus creating an image about the
products or services and hence finally generating a certain
judgment. These opinions are then being generalized to
gather feedbacks for different purposes to provide useful
opinions where we use sentiment analysis.
INTRODUCTION
 Sentiment analysis is a process where the dataset
consists of emotions, attitudes or assessment which
takes into account the way a human thinks
 In a sentence, trying to understand the positive and
the negative aspect is a very difficult task.
 These contents are even written in different
approaches which are not easily deduced by the
users or the firms making it difficult to classify them.
PROBLEM STATEMENT
 To provide a Sentiment analysis system for customers’
review classification, that may be helpful to analyze the
information in the form of the number of tweets where
opinions are highly unstructured and are either positive
or negative.
APPROACH TO THE PROBLEM
DATASET AND PRE-PROCESSING
 The tweets contain a lot of opinions about the data which are
expressed in different ways by individuals .
 The twitters dataset used in this work is already labeled. Labeled
dataset has a negative and positive polarity and thus the analysis of
the data becomes easy.
 The raw data having polarity is highly susceptible to inconsistency
and redundancy.
 It deals with the preparation that removes the repeated words and
punctuations and improves efficiency of the data.
 Examples :
that painting is Beauuuutifull #” after pre-processing
converts to “painting Beautiful.” Similarly,
“@Geet is Noww Hardworkingg” converts to “Geet now
hardworking”.
APPROACH TO THE PROBLEM
FEATURE EXTRACTION
 The improved dataset after pre- processing has a lot of
distinctive properties.
 The feature extraction method, extracts the aspect
(adjective) from the dataset.
 Later this adjective is used to show the positive and
negative polarity in a sentence.
 Unigram model extracts the adjective and segregates it. It
discards the preceding and successive word occurring
with the adjective in the sentences.
 Example “painting Beautiful” through unigram model,
only Beautiful is extracted from the sentence.
APPROACH TO THE PROBLEM
TRAINING AND CLASSIFICATION
 Supervised learning is an important technique for
solving classification problems.
 Three supervised techniques i.e.
 Naive Bayes,
 Maximum Entropy
 Support Vector Machine
 semantic analysis is used along with all three
techniques to compute the similarity.
NAIVE BAYES
 It has been used because of its simplicity in both
during training and classifying stage.
 It is a probabilistic classifier and can learn the
pattern of examining a set of documents that has
been categorized.
 It compare the contents with the list of words to
classify the documents to their right category
C* = argmaccPNB(c|d)
Class c* is assigned to tweet d, where, f represents a
feature and ni (d) represents the count of feature fi found
in tweet d. There are a total of m features. Parameters
P(c) and P (f|c) are obtained through maximum likelihood
estimates which are incremented by one for smoothing.
MAXIMUM ENTROPY
 Maximum entropy maximizes the entropy defined on
the conditional probability distribution.
 It even handles overlap feature and is same as
logistic regression which finds distribution over
classes. It also follows certain feature exception
constraints
Where, c is the class, d is the tweet, and ƛ is a weight vector.
The weight vectors decide the significance of a feature in
classification.
SUPPORT VECTOR MACHINE
 Support vector machine analyzes the data, define
the decision boundaries and uses the kernels for
computation which are performed in input space.
The input data are two sets of vectors of size m
each.
 Then every data represented as a vector is
classified in a particular class. Now the task is to
find a margin between two classes that is far from
any document.
 The distance defines the margin of the classifier,
maximizing the margin reduces indecisive
decisions.
APPROACH TO THE PROBLEM
SEMANTIC ANALYSIS
 Semantic analysis is derived from the WordNet database
where each term is associated with each other.
 This database is of English words which are linked
together. If two words are close to each other, they are
semantically similar. More specifically, we are able to
determine synonym like similarity.
 The key task is to use the stored documents that contain
terms and then check the similarity with the words that the
user uses in their sentences.
 For example in the sentence ”I am happy” the word
‘’happy’’ being an adjective gets selected and is compared
with the stored feature vector for synonyms. Let us assume
2 words; ‘glad’ and ‘satisfied’ tend to be very similar to the
word ‘happy’. Now after the semantic analysis, ‘glad’
replaces ‘happy’ which gives a positive polarity.
PSEUDO CODE OF THE PROCESS
Input: Labeled Dataset
Output: positive and negative polarity with synonym
of words and similarity between words
Step-1 Pre-Processing the tweets:
Pre-processing ()
Remove URL:
Remove special symbols
Convert to lower:
Step-2 Get the Feature Vector List:
For w in words:
Replace two or more words
Strip:
If (w in stopwords)
Continue
Else:
Append the file
Return feature vector
Step-3 Extract Features from Feature Vector List:
For word in feature list
Features=word in tweets_words
Return features
Step-4 Combine Pre-Processing Dataset and
Feature Vector List
Pre-processed file=path name of the file
Stopwords=file path name
Feature Vector List=file path of feature
vector list
Step-5 Training the step 4
Apply classifiers classes
Step-6 Find Synonym and Similarity of the
Feature Vector
For every sentences in feature list
Extract feature vector in the tweets
()
For each Feature Vector: x
For each Feature Vector: y
Find the similarity(x, y)
If (similarity>threshold)
Match found
Feature Vector: x= Feature
Vector: y
Classify (x, y)
Print: sentiment polarity with similar feature
words
SNAPSHOTS
NAÏVE BAYES OUTPUT
MAXIMUM ENTROPY OUTPUT
SUPPORT VECTOR MACHINE OUTPUT
WORD NET WITH NAIVE BAYES OUTPUT
VISUALIZATION OF USER REVIEW
 Graph
 Show Comparison Between two
reviews for particular gadgets
Positive
Negative
Nokia Sentiment
Analysis
RESULTS ANALYSIS:
Methods Accuracy
Naive Bayes 88.2
Maximum Entropy 83.8
Support Vector machine 85.5
Semantic Analysis
(WordNet)
89.9
Accuracy Comparison Between the Techniques
Performance Measures (%)
Positive Recall 91.2
Negative Recall 85.4
Positive
Precision
49.3
Negative
Precision
39.3
Performance Measures (%)
Positive Recall 86.1
Negative Recall 80.0
Positive Precision 40.4
Negative Precision 33.6
Performance Measures (%)
Positive Recall 88.3
Negative Recall 83.5
Positive Precision 43.8
Negative Precision 35.7
Naive Bayesian classification
measurements
Maximum entropy
measurements
support vector machine measurements
RECALL MEASUREMENTS OF THE
TECHNIQUES
PRECISION MEASUREMENTS OF THE TECHNIQUES
CONCLUSION AND FUTURE WORK
 The key aim is to analyze a large amount of reviews by using twitter
dataset which are already labeled.
 The naïve byes technique which gives us a better result than the
maximum entropy and SVM is being subjected to unigram model which
gives a better result than using it alone.
 Further the accuracy is again improved when the semantic analysis
WordNet is followed up by the above procedure taking it to 89.9% from
88.2%.
 The training data set can be increased to improve the feature vector
related sentence identification process and can also extend WordNet for
the summarization of the reviews.
 It may give better visualization of the content in better manner that will be
helpful for the users.
 Some failures when using LSA .
REFERENCES
 R. Feldman, ” Techniques and Applications for Sentiment Analysis,”
Communications of the ACM, Vol. 56 No. 4, pp. 82-89, 2013.
 Y. Singh, P. K. Bhatia, and O.P. Sangwan, ”A Review of Studies on Machine
Learning Techniques,” International Journal of Computer Science and
Security, Volume (1) : Issue (1), pp. 70-84, 2007.
 P.D. Turney,” Thumbs Up or Thumbs Down? Semantic Orientation Applied to
Unsupervised Classification of Reviews,” Proceedings of the 40th Annual
Meeting of the Association for Computational Linguistics (ACL), Philadelphia,
pp. 417-424, July 2002.
 B.Ren,L.Cheng,” Research of Classification System based on Naive Bayes
and MetaClass,” Second International Conference on Information and
Computing Science, ICIC ’09, Vol(3), pp. 154 – 156, 2009.
 C.I.Tsatsoulis,M.Hofmann,”Focusing on Maximum Entropy Classification of
Lyrics by Tom Waits,” IEEE International on Advance Computing Conference
(IACC), pp. 664 – 667, 2014.
 M.A. Hearst,”Support vector machines,”IEEE Intelligent Systems, pp. 18-28,
1998.
PUBLICATIONS FORM THE PROJECT
 Geetika and Divakar yadav,” Sentiment Analysis of
Twitter Data Using Machine Learning Approaches
and Semantic Analysis”, Seventh International
Conference on Contemporary Computing (IC3),
JIIT, Noida, India, 2014 (Submitted).
 survey paper on sentiment analysis in process

Weitere ähnliche Inhalte

Was ist angesagt?

Tweets Classification using Naive Bayes and SVM
Tweets Classification using Naive Bayes and SVMTweets Classification using Naive Bayes and SVM
Tweets Classification using Naive Bayes and SVMTrilok Sharma
 
Sentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big DataSentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big DataIswarya M
 
Sentiment Analaysis on Twitter
Sentiment Analaysis on TwitterSentiment Analaysis on Twitter
Sentiment Analaysis on TwitterNitish J Prabhu
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSumit Raj
 
Sentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using pythonSentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using pythonHetu Bhavsar
 
Sentiment analysis of twitter data
Sentiment analysis of twitter dataSentiment analysis of twitter data
Sentiment analysis of twitter dataBhagyashree Deokar
 
New sentiment analysis of tweets using python by Ravi kumar
New sentiment analysis of tweets using python by Ravi kumarNew sentiment analysis of tweets using python by Ravi kumar
New sentiment analysis of tweets using python by Ravi kumarRavi Kumar
 
Twitter sentiment analysis
Twitter sentiment analysisTwitter sentiment analysis
Twitter sentiment analysisSunil Kandari
 
Twitter sentiment analysis ppt
Twitter sentiment analysis pptTwitter sentiment analysis ppt
Twitter sentiment analysis pptSonuCreation
 
Sentiment Analysis
Sentiment Analysis Sentiment Analysis
Sentiment Analysis prnk08
 
Sentiment Analysis Using Product Review
Sentiment Analysis Using Product ReviewSentiment Analysis Using Product Review
Sentiment Analysis Using Product ReviewAbdullah Moin
 
Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment AnalysisAyush Khandelwal
 
IRE2014-Sentiment Analysis
IRE2014-Sentiment AnalysisIRE2014-Sentiment Analysis
IRE2014-Sentiment AnalysisGangasagar Patil
 
Sentiment Analysis in Twitter
Sentiment Analysis in TwitterSentiment Analysis in Twitter
Sentiment Analysis in TwitterAyushi Dalmia
 
Sentiment Analysis on Twitter
Sentiment Analysis on TwitterSentiment Analysis on Twitter
Sentiment Analysis on TwitterSmritiAgarwal26
 
social network analysis project twitter sentimental analysis
social network analysis project twitter sentimental analysissocial network analysis project twitter sentimental analysis
social network analysis project twitter sentimental analysisAshish Mundra
 
Sentiment analysis using ml
Sentiment analysis using mlSentiment analysis using ml
Sentiment analysis using mlPravin Katiyar
 

Was ist angesagt? (20)

Tweets Classification using Naive Bayes and SVM
Tweets Classification using Naive Bayes and SVMTweets Classification using Naive Bayes and SVM
Tweets Classification using Naive Bayes and SVM
 
Sentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big DataSentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big Data
 
Sentiment Analaysis on Twitter
Sentiment Analaysis on TwitterSentiment Analaysis on Twitter
Sentiment Analaysis on Twitter
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter Data
 
Sentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using pythonSentiment analysis of Twitter data using python
Sentiment analysis of Twitter data using python
 
Sentiment analysis of twitter data
Sentiment analysis of twitter dataSentiment analysis of twitter data
Sentiment analysis of twitter data
 
New sentiment analysis of tweets using python by Ravi kumar
New sentiment analysis of tweets using python by Ravi kumarNew sentiment analysis of tweets using python by Ravi kumar
New sentiment analysis of tweets using python by Ravi kumar
 
Twitter sentiment analysis
Twitter sentiment analysisTwitter sentiment analysis
Twitter sentiment analysis
 
Twitter sentiment analysis ppt
Twitter sentiment analysis pptTwitter sentiment analysis ppt
Twitter sentiment analysis ppt
 
Twitter sentiment analysis ppt
Twitter sentiment analysis pptTwitter sentiment analysis ppt
Twitter sentiment analysis ppt
 
Amazon seniment
Amazon senimentAmazon seniment
Amazon seniment
 
Sentiment Analysis
Sentiment Analysis Sentiment Analysis
Sentiment Analysis
 
Sentiment Analysis Using Product Review
Sentiment Analysis Using Product ReviewSentiment Analysis Using Product Review
Sentiment Analysis Using Product Review
 
Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment Analysis
 
Ml ppt
Ml pptMl ppt
Ml ppt
 
IRE2014-Sentiment Analysis
IRE2014-Sentiment AnalysisIRE2014-Sentiment Analysis
IRE2014-Sentiment Analysis
 
Sentiment Analysis in Twitter
Sentiment Analysis in TwitterSentiment Analysis in Twitter
Sentiment Analysis in Twitter
 
Sentiment Analysis on Twitter
Sentiment Analysis on TwitterSentiment Analysis on Twitter
Sentiment Analysis on Twitter
 
social network analysis project twitter sentimental analysis
social network analysis project twitter sentimental analysissocial network analysis project twitter sentimental analysis
social network analysis project twitter sentimental analysis
 
Sentiment analysis using ml
Sentiment analysis using mlSentiment analysis using ml
Sentiment analysis using ml
 

Andere mochten auch

Machine Learning - Object Detection and Classification
Machine Learning - Object Detection and ClassificationMachine Learning - Object Detection and Classification
Machine Learning - Object Detection and ClassificationVikas Jain
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Dev Sahu
 
Sentiment mining- The Design and Implementation of an Internet Public Opinion...
Sentiment mining- The Design and Implementation of an Internet PublicOpinion...Sentiment mining- The Design and Implementation of an Internet PublicOpinion...
Sentiment mining- The Design and Implementation of an Internet Public Opinion...Prateek Singh
 
Sentiment Analysis in Twitter
Sentiment Analysis in TwitterSentiment Analysis in Twitter
Sentiment Analysis in Twitterprnk08
 
Optimizing Usage Analysis During Implementation Of Social Media Systems
Optimizing Usage Analysis During Implementation Of Social Media SystemsOptimizing Usage Analysis During Implementation Of Social Media Systems
Optimizing Usage Analysis During Implementation Of Social Media SystemsMartin Rückert
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysisharit66
 
A Fuzzy Approach For Multi-Domain Sentiment Analysis
A Fuzzy Approach For Multi-Domain Sentiment AnalysisA Fuzzy Approach For Multi-Domain Sentiment Analysis
A Fuzzy Approach For Multi-Domain Sentiment AnalysisMauro Dragoni
 
Mike davies sentiment_analysis_presentation_backup
Mike davies sentiment_analysis_presentation_backupMike davies sentiment_analysis_presentation_backup
Mike davies sentiment_analysis_presentation_backupm1ked
 
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse AnalysisSentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse AnalysisSubhabrata Mukherjee
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment AnalysisAnkur Tyagi
 
Arabic Text mining Classification
Arabic Text mining Classification Arabic Text mining Classification
Arabic Text mining Classification Zakaria Zubi
 
Sentiment tool Project presentaion
Sentiment tool Project presentaionSentiment tool Project presentaion
Sentiment tool Project presentaionRavindra Chaudhary
 
Introduction to Sentiment Analysis
Introduction to Sentiment AnalysisIntroduction to Sentiment Analysis
Introduction to Sentiment AnalysisMakrand Patil
 
Sentiment Analysis Using Machine Learning
Sentiment Analysis Using Machine LearningSentiment Analysis Using Machine Learning
Sentiment Analysis Using Machine LearningNihar Suryawanshi
 
Sentiment analysis of arabic,a survey
Sentiment analysis of arabic,a surveySentiment analysis of arabic,a survey
Sentiment analysis of arabic,a surveyArabic_NLP_ImamU2013
 
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning AlgorithmsSentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning AlgorithmsSangeeth Nagarajan
 

Andere mochten auch (20)

Machine Learning - Object Detection and Classification
Machine Learning - Object Detection and ClassificationMachine Learning - Object Detection and Classification
Machine Learning - Object Detection and Classification
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier
 
Sentiment mining- The Design and Implementation of an Internet Public Opinion...
Sentiment mining- The Design and Implementation of an Internet PublicOpinion...Sentiment mining- The Design and Implementation of an Internet PublicOpinion...
Sentiment mining- The Design and Implementation of an Internet Public Opinion...
 
Sentiment Analysis in Twitter
Sentiment Analysis in TwitterSentiment Analysis in Twitter
Sentiment Analysis in Twitter
 
Optimizing Usage Analysis During Implementation Of Social Media Systems
Optimizing Usage Analysis During Implementation Of Social Media SystemsOptimizing Usage Analysis During Implementation Of Social Media Systems
Optimizing Usage Analysis During Implementation Of Social Media Systems
 
Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...
Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...
Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
A Fuzzy Approach For Multi-Domain Sentiment Analysis
A Fuzzy Approach For Multi-Domain Sentiment AnalysisA Fuzzy Approach For Multi-Domain Sentiment Analysis
A Fuzzy Approach For Multi-Domain Sentiment Analysis
 
Data mining project
Data mining projectData mining project
Data mining project
 
Mike davies sentiment_analysis_presentation_backup
Mike davies sentiment_analysis_presentation_backupMike davies sentiment_analysis_presentation_backup
Mike davies sentiment_analysis_presentation_backup
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse AnalysisSentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Arabic Text mining Classification
Arabic Text mining Classification Arabic Text mining Classification
Arabic Text mining Classification
 
Arabic tokenization and stemming
Arabic tokenization and  stemmingArabic tokenization and  stemming
Arabic tokenization and stemming
 
Sentiment tool Project presentaion
Sentiment tool Project presentaionSentiment tool Project presentaion
Sentiment tool Project presentaion
 
Introduction to Sentiment Analysis
Introduction to Sentiment AnalysisIntroduction to Sentiment Analysis
Introduction to Sentiment Analysis
 
Sentiment Analysis Using Machine Learning
Sentiment Analysis Using Machine LearningSentiment Analysis Using Machine Learning
Sentiment Analysis Using Machine Learning
 
Sentiment analysis of arabic,a survey
Sentiment analysis of arabic,a surveySentiment analysis of arabic,a survey
Sentiment analysis of arabic,a survey
 
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning AlgorithmsSentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
 

Ähnlich wie Project prSentiment Analysis of Twitter Data Using Machine Learning Approaches and Semantic Analysisesentation geetika gautam

Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...
Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...
Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...Nirav Raje
 
A survey of modified support vector machine using particle of swarm optimizat...
A survey of modified support vector machine using particle of swarm optimizat...A survey of modified support vector machine using particle of swarm optimizat...
A survey of modified support vector machine using particle of swarm optimizat...Editor Jacotech
 
Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5
Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5
Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5ssuser33da69
 
Query Aware Determinization of Uncertain Objects
Query Aware Determinization of Uncertain ObjectsQuery Aware Determinization of Uncertain Objects
Query Aware Determinization of Uncertain Objectsnexgentechnology
 
QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTS
 QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTS QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTS
QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTSNexgen Technology
 
Query aware determinization of uncertain
Query aware determinization of uncertainQuery aware determinization of uncertain
Query aware determinization of uncertainnexgentech15
 
Query aware determinization of uncertain
Query aware determinization of uncertainQuery aware determinization of uncertain
Query aware determinization of uncertainShakas Technologies
 
Profile Analysis of Users in Data Analytics Domain
Profile Analysis of   Users in Data Analytics DomainProfile Analysis of   Users in Data Analytics Domain
Profile Analysis of Users in Data Analytics DomainDrjabez
 
Multi label classification of
Multi label classification ofMulti label classification of
Multi label classification ofijaia
 
Svm and maximum entropy model for sentiment analysis of tweets
Svm and maximum entropy model for sentiment analysis of tweetsSvm and maximum entropy model for sentiment analysis of tweets
Svm and maximum entropy model for sentiment analysis of tweetsS M Raju
 
Automated Essay Grading using Features Selection
Automated Essay Grading using Features SelectionAutomated Essay Grading using Features Selection
Automated Essay Grading using Features SelectionIRJET Journal
 
Sentiment Analysis Using Hybrid Approach: A Survey
Sentiment Analysis Using Hybrid Approach: A SurveySentiment Analysis Using Hybrid Approach: A Survey
Sentiment Analysis Using Hybrid Approach: A SurveyIJERA Editor
 
Automated Question Paper Generator And Answer Checker Using Information Retri...
Automated Question Paper Generator And Answer Checker Using Information Retri...Automated Question Paper Generator And Answer Checker Using Information Retri...
Automated Question Paper Generator And Answer Checker Using Information Retri...Sheila Sinclair
 
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILES
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILESAN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILES
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILEScscpconf
 
An efficient feature selection in
An efficient feature selection inAn efficient feature selection in
An efficient feature selection incsandit
 
Analysis of Textual Data Classification with a Reddit Comments Dataset
Analysis of Textual Data Classification with a Reddit Comments DatasetAnalysis of Textual Data Classification with a Reddit Comments Dataset
Analysis of Textual Data Classification with a Reddit Comments DatasetAdamBab
 
Twitter Sentiment Analysis: An Unsupervised Approach
Twitter Sentiment Analysis: An Unsupervised ApproachTwitter Sentiment Analysis: An Unsupervised Approach
Twitter Sentiment Analysis: An Unsupervised ApproachIRJET Journal
 
Camera ready sentiment analysis : quantification of real time brand advocacy ...
Camera ready sentiment analysis : quantification of real time brand advocacy ...Camera ready sentiment analysis : quantification of real time brand advocacy ...
Camera ready sentiment analysis : quantification of real time brand advocacy ...Absolutdata Analytics
 
Neural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment AnalysisNeural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment AnalysisEditor IJCATR
 
Feature extraction for classifying students based on theirac ademic performance
Feature extraction for classifying students based on theirac ademic performanceFeature extraction for classifying students based on theirac ademic performance
Feature extraction for classifying students based on theirac ademic performanceVenkat Projects
 

Ähnlich wie Project prSentiment Analysis of Twitter Data Using Machine Learning Approaches and Semantic Analysisesentation geetika gautam (20)

Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...
Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...
Analyzing Text Preprocessing and Feature Selection Methods for Sentiment Anal...
 
A survey of modified support vector machine using particle of swarm optimizat...
A survey of modified support vector machine using particle of swarm optimizat...A survey of modified support vector machine using particle of swarm optimizat...
A survey of modified support vector machine using particle of swarm optimizat...
 
Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5
Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5
Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5
 
Query Aware Determinization of Uncertain Objects
Query Aware Determinization of Uncertain ObjectsQuery Aware Determinization of Uncertain Objects
Query Aware Determinization of Uncertain Objects
 
QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTS
 QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTS QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTS
QUERY AWARE DETERMINIZATION OF UNCERTAIN OBJECTS
 
Query aware determinization of uncertain
Query aware determinization of uncertainQuery aware determinization of uncertain
Query aware determinization of uncertain
 
Query aware determinization of uncertain
Query aware determinization of uncertainQuery aware determinization of uncertain
Query aware determinization of uncertain
 
Profile Analysis of Users in Data Analytics Domain
Profile Analysis of   Users in Data Analytics DomainProfile Analysis of   Users in Data Analytics Domain
Profile Analysis of Users in Data Analytics Domain
 
Multi label classification of
Multi label classification ofMulti label classification of
Multi label classification of
 
Svm and maximum entropy model for sentiment analysis of tweets
Svm and maximum entropy model for sentiment analysis of tweetsSvm and maximum entropy model for sentiment analysis of tweets
Svm and maximum entropy model for sentiment analysis of tweets
 
Automated Essay Grading using Features Selection
Automated Essay Grading using Features SelectionAutomated Essay Grading using Features Selection
Automated Essay Grading using Features Selection
 
Sentiment Analysis Using Hybrid Approach: A Survey
Sentiment Analysis Using Hybrid Approach: A SurveySentiment Analysis Using Hybrid Approach: A Survey
Sentiment Analysis Using Hybrid Approach: A Survey
 
Automated Question Paper Generator And Answer Checker Using Information Retri...
Automated Question Paper Generator And Answer Checker Using Information Retri...Automated Question Paper Generator And Answer Checker Using Information Retri...
Automated Question Paper Generator And Answer Checker Using Information Retri...
 
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILES
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILESAN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILES
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILES
 
An efficient feature selection in
An efficient feature selection inAn efficient feature selection in
An efficient feature selection in
 
Analysis of Textual Data Classification with a Reddit Comments Dataset
Analysis of Textual Data Classification with a Reddit Comments DatasetAnalysis of Textual Data Classification with a Reddit Comments Dataset
Analysis of Textual Data Classification with a Reddit Comments Dataset
 
Twitter Sentiment Analysis: An Unsupervised Approach
Twitter Sentiment Analysis: An Unsupervised ApproachTwitter Sentiment Analysis: An Unsupervised Approach
Twitter Sentiment Analysis: An Unsupervised Approach
 
Camera ready sentiment analysis : quantification of real time brand advocacy ...
Camera ready sentiment analysis : quantification of real time brand advocacy ...Camera ready sentiment analysis : quantification of real time brand advocacy ...
Camera ready sentiment analysis : quantification of real time brand advocacy ...
 
Neural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment AnalysisNeural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment Analysis
 
Feature extraction for classifying students based on theirac ademic performance
Feature extraction for classifying students based on theirac ademic performanceFeature extraction for classifying students based on theirac ademic performance
Feature extraction for classifying students based on theirac ademic performance
 

Kürzlich hochgeladen

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 

Kürzlich hochgeladen (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

Project prSentiment Analysis of Twitter Data Using Machine Learning Approaches and Semantic Analysisesentation geetika gautam

  • 1. CLASSIFICATION OF USER REVIEW FOR ELECTRONIC GADGETS Name: Geetika Gautam(12303008) Supervisor: Dr.Divakar Yadav
  • 2. Presentation Outline  Introduction  Problem Statement  Work Done  Result Analysis  Conclusion and Future Work  References  Publications
  • 3. Introduction  With the increase of contents on the Web covering lots of areas which are growing exponentially in numbers as well as in volumes as sites are dedicated to specific types of products and they specialize in collecting users’ reviews from various sites such as Amazon etc.  Even Twitter is an area where the tweets convey opinions, but trying to obtain the overall understanding of these unstructured data (opinions) can be very time consuming.  These unstructured data (opinions) on a particular site are seen by the users and thus creating an image about the products or services and hence finally generating a certain judgment. These opinions are then being generalized to gather feedbacks for different purposes to provide useful opinions where we use sentiment analysis.
  • 4. INTRODUCTION  Sentiment analysis is a process where the dataset consists of emotions, attitudes or assessment which takes into account the way a human thinks  In a sentence, trying to understand the positive and the negative aspect is a very difficult task.  These contents are even written in different approaches which are not easily deduced by the users or the firms making it difficult to classify them.
  • 5. PROBLEM STATEMENT  To provide a Sentiment analysis system for customers’ review classification, that may be helpful to analyze the information in the form of the number of tweets where opinions are highly unstructured and are either positive or negative.
  • 6. APPROACH TO THE PROBLEM
  • 7. DATASET AND PRE-PROCESSING  The tweets contain a lot of opinions about the data which are expressed in different ways by individuals .  The twitters dataset used in this work is already labeled. Labeled dataset has a negative and positive polarity and thus the analysis of the data becomes easy.  The raw data having polarity is highly susceptible to inconsistency and redundancy.  It deals with the preparation that removes the repeated words and punctuations and improves efficiency of the data.  Examples : that painting is Beauuuutifull #” after pre-processing converts to “painting Beautiful.” Similarly, “@Geet is Noww Hardworkingg” converts to “Geet now hardworking”.
  • 8. APPROACH TO THE PROBLEM
  • 9. FEATURE EXTRACTION  The improved dataset after pre- processing has a lot of distinctive properties.  The feature extraction method, extracts the aspect (adjective) from the dataset.  Later this adjective is used to show the positive and negative polarity in a sentence.  Unigram model extracts the adjective and segregates it. It discards the preceding and successive word occurring with the adjective in the sentences.  Example “painting Beautiful” through unigram model, only Beautiful is extracted from the sentence.
  • 10. APPROACH TO THE PROBLEM
  • 11. TRAINING AND CLASSIFICATION  Supervised learning is an important technique for solving classification problems.  Three supervised techniques i.e.  Naive Bayes,  Maximum Entropy  Support Vector Machine  semantic analysis is used along with all three techniques to compute the similarity.
  • 12. NAIVE BAYES  It has been used because of its simplicity in both during training and classifying stage.  It is a probabilistic classifier and can learn the pattern of examining a set of documents that has been categorized.  It compare the contents with the list of words to classify the documents to their right category C* = argmaccPNB(c|d)
  • 13. Class c* is assigned to tweet d, where, f represents a feature and ni (d) represents the count of feature fi found in tweet d. There are a total of m features. Parameters P(c) and P (f|c) are obtained through maximum likelihood estimates which are incremented by one for smoothing.
  • 14. MAXIMUM ENTROPY  Maximum entropy maximizes the entropy defined on the conditional probability distribution.  It even handles overlap feature and is same as logistic regression which finds distribution over classes. It also follows certain feature exception constraints Where, c is the class, d is the tweet, and ƛ is a weight vector. The weight vectors decide the significance of a feature in classification.
  • 15. SUPPORT VECTOR MACHINE  Support vector machine analyzes the data, define the decision boundaries and uses the kernels for computation which are performed in input space. The input data are two sets of vectors of size m each.  Then every data represented as a vector is classified in a particular class. Now the task is to find a margin between two classes that is far from any document.  The distance defines the margin of the classifier, maximizing the margin reduces indecisive decisions.
  • 16. APPROACH TO THE PROBLEM
  • 17. SEMANTIC ANALYSIS  Semantic analysis is derived from the WordNet database where each term is associated with each other.  This database is of English words which are linked together. If two words are close to each other, they are semantically similar. More specifically, we are able to determine synonym like similarity.  The key task is to use the stored documents that contain terms and then check the similarity with the words that the user uses in their sentences.  For example in the sentence ”I am happy” the word ‘’happy’’ being an adjective gets selected and is compared with the stored feature vector for synonyms. Let us assume 2 words; ‘glad’ and ‘satisfied’ tend to be very similar to the word ‘happy’. Now after the semantic analysis, ‘glad’ replaces ‘happy’ which gives a positive polarity.
  • 18. PSEUDO CODE OF THE PROCESS Input: Labeled Dataset Output: positive and negative polarity with synonym of words and similarity between words Step-1 Pre-Processing the tweets: Pre-processing () Remove URL: Remove special symbols Convert to lower: Step-2 Get the Feature Vector List: For w in words: Replace two or more words Strip: If (w in stopwords) Continue Else: Append the file Return feature vector Step-3 Extract Features from Feature Vector List: For word in feature list Features=word in tweets_words Return features Step-4 Combine Pre-Processing Dataset and Feature Vector List Pre-processed file=path name of the file Stopwords=file path name Feature Vector List=file path of feature vector list Step-5 Training the step 4 Apply classifiers classes Step-6 Find Synonym and Similarity of the Feature Vector For every sentences in feature list Extract feature vector in the tweets () For each Feature Vector: x For each Feature Vector: y Find the similarity(x, y) If (similarity>threshold) Match found Feature Vector: x= Feature Vector: y Classify (x, y) Print: sentiment polarity with similar feature words
  • 22. WORD NET WITH NAIVE BAYES OUTPUT
  • 23. VISUALIZATION OF USER REVIEW  Graph  Show Comparison Between two reviews for particular gadgets Positive Negative Nokia Sentiment Analysis
  • 24. RESULTS ANALYSIS: Methods Accuracy Naive Bayes 88.2 Maximum Entropy 83.8 Support Vector machine 85.5 Semantic Analysis (WordNet) 89.9 Accuracy Comparison Between the Techniques
  • 25. Performance Measures (%) Positive Recall 91.2 Negative Recall 85.4 Positive Precision 49.3 Negative Precision 39.3 Performance Measures (%) Positive Recall 86.1 Negative Recall 80.0 Positive Precision 40.4 Negative Precision 33.6 Performance Measures (%) Positive Recall 88.3 Negative Recall 83.5 Positive Precision 43.8 Negative Precision 35.7 Naive Bayesian classification measurements Maximum entropy measurements support vector machine measurements
  • 26. RECALL MEASUREMENTS OF THE TECHNIQUES
  • 27. PRECISION MEASUREMENTS OF THE TECHNIQUES
  • 28. CONCLUSION AND FUTURE WORK  The key aim is to analyze a large amount of reviews by using twitter dataset which are already labeled.  The naïve byes technique which gives us a better result than the maximum entropy and SVM is being subjected to unigram model which gives a better result than using it alone.  Further the accuracy is again improved when the semantic analysis WordNet is followed up by the above procedure taking it to 89.9% from 88.2%.  The training data set can be increased to improve the feature vector related sentence identification process and can also extend WordNet for the summarization of the reviews.  It may give better visualization of the content in better manner that will be helpful for the users.  Some failures when using LSA .
  • 29. REFERENCES  R. Feldman, ” Techniques and Applications for Sentiment Analysis,” Communications of the ACM, Vol. 56 No. 4, pp. 82-89, 2013.  Y. Singh, P. K. Bhatia, and O.P. Sangwan, ”A Review of Studies on Machine Learning Techniques,” International Journal of Computer Science and Security, Volume (1) : Issue (1), pp. 70-84, 2007.  P.D. Turney,” Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews,” Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL), Philadelphia, pp. 417-424, July 2002.  B.Ren,L.Cheng,” Research of Classification System based on Naive Bayes and MetaClass,” Second International Conference on Information and Computing Science, ICIC ’09, Vol(3), pp. 154 – 156, 2009.  C.I.Tsatsoulis,M.Hofmann,”Focusing on Maximum Entropy Classification of Lyrics by Tom Waits,” IEEE International on Advance Computing Conference (IACC), pp. 664 – 667, 2014.  M.A. Hearst,”Support vector machines,”IEEE Intelligent Systems, pp. 18-28, 1998.
  • 30. PUBLICATIONS FORM THE PROJECT  Geetika and Divakar yadav,” Sentiment Analysis of Twitter Data Using Machine Learning Approaches and Semantic Analysis”, Seventh International Conference on Contemporary Computing (IC3), JIIT, Noida, India, 2014 (Submitted).  survey paper on sentiment analysis in process