International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 12 | Dec 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1327
A Survey on Trend Analysis on Twitter for Predicting Public Opinion on
Ongoing Events
V. D. Punjabi1, Snehal More2, Deepika Patil3, Vinay Bafna4, Shrushti Shah5, Harshada Bachhav6
1Assistant Professor, Dept. of Information Technology, R. C. Patel Institute of Technology, Maharashtra, India
2,3,4,5,6Students, Dept. of Information Technology, R. C. Patel Institute of Technology, Maharashtra, India
---------------------------------------------------------------------***----------------------------------------------------------------------
Abstract - Twitter is one of the most popular social media platforms, allowing its users to spread and share information. It
monitors user postings and detects the most discussed topics of the moment, which it publishes in a list called “Trending
Topics”. The list shows what is happening in the world and what people's opinions are about it. Twitter uses a method that
provides an efficient way to immediately and accurately categorize trending topics without the need for external data.
Different techniques and many algorithms can be used to disambiguate the meaning of trending topics and to classify each
topic into a category. Among these methods there is no standard one that guarantees accuracy, so a comparative analysis is
needed to understand which performs better and yields higher-quality detected topics.
Key Words: Information Retrieval, social media, Twitter, Twitter Trending Topic, Topic Detection, Text mining, Trend
analysis, Real time.
1. INTRODUCTION
Twitter has become a huge social media service where millions of users contribute on a daily basis. Two features have
been fundamental in its success:
(1) The shortness of tweets, which cannot exceed 140 characters, facilitates creation and sharing of messages in a few
seconds.
(2) The ease of spreading those messages to a large number of users in very little time. Over time, the
community of users on Twitter has established a syntax for interacting with one another, which became the standard
syntax and was later officially adopted by its developers [1].
Most major Twitter clients have implemented this standard syntax as well. The standards in the interaction syntax include:
• User mentions: when a user mentions another user in their tweet, an at-sign is placed before the corresponding
username, e.g., you should all follow @username, she is always abreast of breaking news and interesting stuff.
• Replies: when a user wants to direct a message to another user, or reply to an earlier tweet, they place the @username
mention at the beginning of the tweet, e.g., @username I agree with you.
• Retweets: a retweet is considered a re-share of a tweet posted by another user, i.e., a retweet means the user considers
that the message in the tweet might be of interest to others. When a user retweets, the new tweet copies the original one in
it. Furthermore, the retweet attaches an RT and the @username of the user who posted the original tweet at the beginning
of the retweet. For instance: if the user @username posted the tweet .
• Hashtags: similar to tags on social tagging systems or other social networking systems, hashtags included in a tweet
tend to group tweets into conversations or represent the main terms of the tweet, usually referring to topics or common
interests of a community. A hashtag is differentiated from the rest of the terms in the tweet in that it has a leading hash, e.g.,
#hashtag [2].
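The conventions listed above can be recognised mechanically. The following is a minimal sketch, assuming plain-text tweets and simple regular expressions; the function name `parse_tweet` and the patterns are illustrative assumptions, not part of any Twitter API.

```python
import re

# Hypothetical patterns for the conventions described above.
MENTION_RE = re.compile(r"@(\w+)")
HASHTAG_RE = re.compile(r"#(\w+)")
RETWEET_RE = re.compile(r"^RT @(\w+)")

def parse_tweet(text):
    """Extract mentions, hashtags, and reply/retweet markers from a tweet."""
    text = text.strip()
    retweet = RETWEET_RE.match(text)
    return {
        "mentions": MENTION_RE.findall(text),
        "hashtags": HASHTAG_RE.findall(text),
        # A reply places the @username mention at the very beginning.
        "is_reply": text.startswith("@"),
        # A retweet starts with RT followed by the original author's @username.
        "retweet_of": retweet.group(1) if retweet else None,
    }

print(parse_tweet("RT @alice: big news today #election"))
print(parse_tweet("@bob I agree with you"))
```

Real tweets contain e-mail addresses, URLs and punctuation that these simple patterns would mis-handle, so a production parser would need stricter rules.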
1.1 Trending Topic:
One of the main features on the homepage of Twitter shows a list of top terms, the so-called trending topics, at all times.
These terms reflect the topics that are being discussed most at that very moment on the site’s fast-flowing stream of tweets.
In order to avoid topics that are popular regularly (e.g., good morning or good night at certain times of the day), Twitter
focuses on topics that are being discussed much more than usual, i.e., topics that recently experienced an increase in use, so
that they trended for some reason.
Trending topics have attracted big interest not only among the users themselves but also among other information
consumers such as journalists, real-time application developers, and social media researchers. Being able to know the top
conversations being discussed at a given time helps users stay updated about current affairs and discover the main concerns
of the community. Twitter defines trending topics as “topics that are immediately popular, rather than topics that have been
popular for a while or on a daily basis”. However, no further evidence is known about the algorithm that extracts trending
topics. It is assumed that the list is made up of terms that appear more frequently in the most recent stream of tweets than
usually expected [3].
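Since the actual algorithm is not public, the assumption stated above can only be illustrated. The sketch below scores each term by how much more frequent it is in a recent window than in a baseline sample; the function name `trending_terms`, the whitespace tokenisation and the additive smoothing are assumptions made for illustration and are not Twitter's method.

```python
from collections import Counter

def trending_terms(recent_tweets, baseline_tweets, top_k=3, smoothing=1.0):
    """Rank terms by recent relative frequency over usual relative frequency."""
    recent = Counter(w for t in recent_tweets for w in t.lower().split())
    baseline = Counter(w for t in baseline_tweets for w in t.lower().split())
    recent_total = sum(recent.values()) or 1
    baseline_total = sum(baseline.values()) or 1
    scores = {}
    for term, count in recent.items():
        recent_rate = count / recent_total
        # Additive smoothing keeps terms unseen in the baseline from dividing by zero.
        usual_rate = (baseline[term] + smoothing) / (baseline_total + smoothing)
        scores[term] = recent_rate / usual_rate
    return sorted(scores, key=scores.get, reverse=True)[:top_k]

baseline = ["good morning everyone", "good morning world", "good night all"]
recent = ["good morning", "earthquake now", "earthquake downtown", "earthquake felt"]
print(trending_terms(recent, baseline))
```

Regularly popular terms such as "good" and "morning" score low here because their usual rate is already high, which mirrors the distinction between popular and trending drawn in the text.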
1.2 Modules:
The system comprises four major modules, described as follows:
1. Login:
- The user must first log in with valid credentials to access the system.
2. Search for Latest Trends:
- After a successful login, the user can search for the latest trending tweets by entering a keyword in the search field.
3. View Latest Trending Tweet:
- Based on the keyword entered by the user, the search results are displayed as trending tweets.
4. View Tweets:
- The user can click on a trending tweet to view the messages tweeted by other users.
2. LITERATURE SURVEY
Trend analysis, and the prediction of public opinion based on it, plays an important role, and many researchers are working
on automatic techniques for extracting and analysing huge amounts of Twitter data. Luca Maria Aiello et al. compare six trend
detection methods and find that standard natural language processing techniques perform well for social streams on a
particular topic. They conclude that n-grams give the best performance compared with other state-of-the-art techniques [4].
Altawaier, M. M., & Tiun, S. (2016) used three different machine learning algorithms, Naïve Bayes, Decision
Trees and Support Vector Machine, for sentiment classification of an Arabic dataset obtained from Twitter. This
research followed a framework for Arabic tweet classification in which two special sub-tasks were performed in
pre-processing: Term Frequency-Inverse Document Frequency (TF-IDF) weighting and Arabic stemming. They used one
dataset with three algorithms, and performance was evaluated on the basis of three information retrieval metrics:
precision, recall, and F-measure [5].
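For reference, the three evaluation metrics named above can be computed from predicted and true labels as in the following sketch. The function name and the label strings are illustrative assumptions; the formulas are the standard definitions for a single positive class.

```python
def precision_recall_f1(y_true, y_pred, positive="positive"):
    """Compute precision, recall and F-measure for one target class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    # F-measure is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1

y_true = ["positive", "positive", "positive", "negative", "negative"]
y_pred = ["positive", "positive", "negative", "positive", "negative"]
print(precision_recall_f1(y_true, y_pred))
```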
Kathy Lee et al. proposed supervised learning techniques to classify Twitter trending topics, using text-based and
network-based classifiers and reporting which performs best. A. Hernandez-Suarez et al. proposed a model that predicts
public opinion on political events by applying different classifiers to predict whether the mood is positive or negative.
Kathy Lee et al. also proposed a way to obtain pre-labelled data from Twitter that can be used to train an SVM classifier.
They used Twitter hashtags to judge the polarity of a tweet. To assess the accuracy of the proposed technique, a test study
on the classifier was conducted, which showed an accuracy of 85% [6].
Go, A., Bhayani, R., & Huang, L. (2009) introduced a new technique to classify the sentiment of tweets as positive or
negative. They presented and discussed the results of machine learning algorithms for Twitter sentiment analysis using
distant supervision [7].
The training data used by the authors consisted of tweets with emoticons, which served as noisy labels. According to the
authors, machine learning algorithms such as Naive Bayes, Maximum Entropy and SVM can achieve an accuracy of more
than 80% when trained on emoticon tweets. The study also highlighted the pre-processing steps needed for high
classification accuracy. In another study, sentiment analysis is performed using SVM on two pre-classified datasets of
tweets, followed by a comparative analysis using the measures precision, recall and F-measure [8].
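The noisy-labelling idea behind the distant supervision of Go et al. [7] can be sketched as follows, assuming that emoticons such as ":)" and ":(" act as the label signal. The emoticon lists and the function name `noisy_label` are illustrative assumptions and do not reproduce the authors' exact setup.

```python
# Hypothetical emoticon lists used as a weak sentiment signal.
POSITIVE_EMOTICONS = (":)", ":-)", ":D", "=)")
NEGATIVE_EMOTICONS = (":(", ":-(")

def noisy_label(tweet):
    """Return (label, text without emoticons), or None if the signal is absent or mixed."""
    has_pos = any(e in tweet for e in POSITIVE_EMOTICONS)
    has_neg = any(e in tweet for e in NEGATIVE_EMOTICONS)
    if has_pos == has_neg:
        # Either no emoticon at all or conflicting ones: skip the tweet.
        return None
    cleaned = tweet
    for e in POSITIVE_EMOTICONS + NEGATIVE_EMOTICONS:
        cleaned = cleaned.replace(e, " ")
    label = "positive" if has_pos else "negative"
    return label, " ".join(cleaned.split())

print(noisy_label("Loved the match :)"))
print(noisy_label("stuck in traffic :("))
```

Removing the emoticon from the text is a design choice of this sketch: a classifier trained on the result has to rely on the remaining words rather than on the label signal itself.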
3. PROPOSED METHODOLOGY
3.1 Latent Dirichlet Allocation (LDA):
Topic extraction in textual corpora can be addressed through probabilistic topic models. In general, a topic model is a
Bayesian model that associates with each document a probability distribution over topics, which are in turn distributions
over words. According to LDA, every document is considered a bag of terms, which are the only observed variables in the
model. The topic distribution per document and the term distribution per topic are instead hidden and have to be estimated
through Bayesian inference. We use the Collapsed Variational Bayesian inference algorithm [9], an LDA variant that is
computationally efficient, more accurate than standard variational Bayesian inference for LDA, and has parallel
implementations already available in Apache Mahout. LDA requires the expected number of topics as input, and in our
evaluation we explore the quality of the topics for different values of this number. Estimating the optimal number of topics
is possible through the use of non-parametric methods [10].
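The generative assumptions behind LDA can be written compactly as below. The symbols are conventional notation and do not come from the text above: K is the number of topics given as input, D the number of documents, θ_d the topic distribution of document d, φ_k the term distribution of topic k, z_{d,n} the hidden topic of the n-th term in document d, w_{d,n} the observed term, and α, β the Dirichlet hyperparameters.

```latex
\begin{align*}
\theta_d &\sim \mathrm{Dirichlet}(\alpha), & d &= 1,\dots,D \\
\phi_k &\sim \mathrm{Dirichlet}(\beta), & k &= 1,\dots,K \\
z_{d,n} \mid \theta_d &\sim \mathrm{Categorical}(\theta_d) \\
w_{d,n} \mid z_{d,n}, \phi &\sim \mathrm{Categorical}(\phi_{z_{d,n}})
\end{align*}
```

Only the terms w_{d,n} are observed; inference estimates the posterior over θ, φ and z given those terms.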
3.2 Document-Pivot Topic Detection (Doc-p):
It is a Topic Detection and Tracking method that uses a document-pivot approach. It uses locality-sensitive hashing (LSH)
to rapidly retrieve the nearest neighbour of a document and accelerate the clustering task. The principle behind this method
is the same as that used for near-duplicate detection in the similarity-based aggregation step of the pre-processing phase.
It works as follows:
Perform online clustering of posts:
Compute the cosine similarity of the tf-idf representation of an incoming post to all other posts processed so far. If the
similarity to the best matching post is above some threshold θ_tf-idf, assign the item to the same cluster as its best match;
otherwise create a new cluster with the new post as its only item. The best matching tweet is efficiently retrieved by
LSH [11].
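The online clustering step can be sketched as follows. This is a minimal illustration under stated assumptions: the nearest neighbour is found by brute force rather than by LSH, the idf weights are recomputed from the posts seen so far, and the function names and the threshold value are hypothetical.

```python
import math
from collections import Counter

def cosine(a, b):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def cluster_online(posts, threshold=0.5):
    """Assign each post to its best match's cluster, or open a new cluster."""
    doc_freq = Counter()  # number of posts seen so far that contain each term
    seen = []             # term counts of every post processed so far
    labels = []           # cluster id of every post processed so far
    for post in posts:
        counts = Counter(post.lower().split())
        doc_freq.update(counts.keys())
        n_docs = len(seen) + 1

        def tfidf(c):
            # Smoothed idf computed from the posts observed so far.
            return {t: tf * (math.log((1 + n_docs) / (1 + doc_freq[t])) + 1.0)
                    for t, tf in c.items()}

        vec = tfidf(counts)
        best_sim, best_idx = 0.0, None
        # Brute-force nearest neighbour; the method in the text uses LSH here.
        for idx, other in enumerate(seen):
            sim = cosine(vec, tfidf(other))
            if sim > best_sim:
                best_sim, best_idx = sim, idx
        if best_idx is not None and best_sim > threshold:
            labels.append(labels[best_idx])
        else:
            labels.append(max(labels, default=-1) + 1)
        seen.append(counts)
    return labels

posts = ["earthquake hits city centre",
         "earthquake hits city centre today",
         "new phone released this week"]
print(cluster_online(posts))
```

The brute-force loop compares each incoming post with every earlier one, which is exactly the cost that LSH is meant to avoid on a real stream.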
3.3 Graph-Based Feature-Pivot Topic Detection:
A distinctive feature of this method is that, for the feature clustering step, it uses the Structural Clustering Algorithm for
Networks (SCAN). A property of SCAN is that, apart from detecting communities of nodes, it provides a list of hubs, each of
which may be connected to a set of communities. In a feature-pivot approach for topic detection, the nodes of the graph
would correspond to terms and the communities would correspond to topics [12].
The detected hubs would then ideally be considered terms that are related to more than one topic, something that
would not be possible to achieve with a common partitional clustering algorithm, and would effectively provide an explicit
link between topics. We select the terms to be clustered, out of the set of terms present in the corpus, using the approach in
[13], which uses an independent reference corpus consisting of randomly collected tweets.
The terms with the highest ratio of likelihoods between the corpus and the reference corpus will be the ones with
significantly higher than usual frequency of appearance, and it is expected that they are related to the most actively
discussed topics in the corpus. Once the high-ranking terms are selected, a term graph is constructed and the SCAN
graph-based clustering algorithm is applied to extract groups of terms, each of which is considered to be a distinct topic.
The algorithm steps are the following:
• Selection: The top terms are selected using the ratio of likelihoods, and a node is created for each of them.
• Linking: The nodes of the graph are connected using a term linking strategy. First, a similarity measure for pairs of
terms is selected and then all pairwise similarities are computed. Various options for the similarity measure are
explored: the number of documents in which the terms co-occur, the number of co-occurrences divided by the
larger or smaller document frequency of the two terms, and Jaccard similarity.
• Clustering: The SCAN algorithm is applied to the graph; a topic is generated for each of the detected communities.
• Cluster enrichment: The connectivity of each of the hubs detected by SCAN to each of the communities is
checked and, if it exceeds some threshold, the hub is linked to the community. A hub may be linked to more than one
topic [13].
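The linking step and the similarity that SCAN builds on can be sketched as follows. This is an illustration under stated assumptions: the terms are taken as already selected, Jaccard similarity is the chosen linking measure, the threshold `min_jaccard` and the function names are hypothetical, and only SCAN's structural similarity is shown rather than the full clustering and hub detection.

```python
import math
from itertools import combinations

def build_term_graph(docs, terms, min_jaccard=0.3):
    """Link two terms when the Jaccard similarity of their document sets is high enough."""
    doc_sets = {t: {i for i, d in enumerate(docs) if t in d.lower().split()}
                for t in terms}
    graph = {t: set() for t in terms}
    for a, b in combinations(terms, 2):
        union = doc_sets[a] | doc_sets[b]
        jaccard = len(doc_sets[a] & doc_sets[b]) / len(union) if union else 0.0
        if jaccard >= min_jaccard:
            graph[a].add(b)
            graph[b].add(a)
    return graph

def structural_similarity(graph, u, v):
    """SCAN's similarity: shared closed neighbourhood over the geometric mean of sizes."""
    nu = graph[u] | {u}
    nv = graph[v] | {v}
    return len(nu & nv) / math.sqrt(len(nu) * len(nv))

docs = ["goal messi barcelona", "messi goal", "election vote", "vote election results"]
terms = ["goal", "messi", "election", "vote"]
graph = build_term_graph(docs, terms)
print(graph)
print(structural_similarity(graph, "goal", "messi"))
```

SCAN groups nodes whose structural similarity to enough neighbours exceeds a threshold; nodes that bridge several such groups without belonging to one are the hubs used in the cluster enrichment step.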
4. CONCLUSION
Because tweets are short messages, they can be used for predicting public opinion on sports, the economy, ongoing events,
etc. Keywords are found in each tweet, and machine learning algorithms predict whether its weightage is positive or
negative. Multiple classification algorithms can be applied, such as SVM, Naïve Bayes, logistic regression, KNN and decision
trees, and evaluated with information retrieval measures such as precision, recall and F-measure. In future, these can be
tested in Python to find the best classifier. Many tools are available for finding Twitter trends. No standard topic detection
technique has been established yet, so a comparative analysis is needed to understand to what extent these dimensions
determine the quality of the detected topics.
REFERENCES
[1] Rong Lu and Qing Yang, “Trend Analysis of News Topics on Twitter,” International Journal of Machine Learning and
Computing, Vol. 2, No. 3, June 2012.
[2] Soyeon Caren Han, Hyunsuk Chung, Do Hyeong Kim, Sungyoung Lee, and Byeong Ho Kang, “Twitter Trending
Topics Meaning Disambiguation,” Springer International Publishing Switzerland, 2014.
[3] http://cucis.ece.northwestern.edu/publications/pdf/LeePal11.pdf
[4] Luca Maria Aiello, Georgios Petkos, Carlos Martin, David Corney, Symeon Papadopoulos, Ryan Skraba, Ayse Göker,
Ioannis Kompatsiaris, and Alejandro Jaimes, “Sensing Trending Topics in Twitter,” IEEE Transactions on Multimedia,
Vol. 15, No. 6, October 2013.
[5] Altawaier, M. M., & Tiun, S. (2016), “Comparison of Machine Learning Approaches on Arabic Twitter Sentiment
Analysis,” International Journal on Advanced Science, Engineering and Information Technology, 6(6), 1067-1073.
[6] Kathy Lee, Diana Palsetia, Ramanathan Narayanan, Md. Mostofa Ali Patwary, Ankit Agrawal, Alok Choudhary,
“Twitter Trending Topic Classification,” 2011 11th IEEE International Conference on Data Mining.
[7] Go, A., Bhayani, R., & Huang, L. (2009), “Twitter sentiment classification using distant supervision,” CS224N Project
Report, Stanford, 1(2009), 12.
[8] Munir Ahmad, Shabib Aftab, Iftikhar Ali, “Sentiment Analysis of Tweets using SVM,” International Journal of
Computer Applications, November 2017.
[9] Y. W. Teh, D. Newman, and M. Welling, “A collapsed variational Bayesian inference algorithm for latent Dirichlet
allocation,” in Adv. Neural Inf. Process. Syst., 2007, vol. 19.
[10] Y. W. Teh, M. I. Jordan, M. J. Beal, and D. M. Blei, “Hierarchical Dirichlet processes,” J. Amer. Statist. Assoc., vol. 101,
no. 476, pp. 1566–1581, 2006.
[11] G. Salton and M. J. McGill, Introduction to Modern Information Retrieval. New York, NY, USA: McGraw-Hill, 1986.
[12] X. Xu, N. Yuruk, Z. Feng, and T. A. J. Schweiger, “SCAN: A structural clustering algorithm for networks,” in Proc. KDD:
13th ACM Int. Conf. Knowledge Discovery and Data Mining, New York, NY, USA, 2007, pp. 824–833.
[13] B. O’Connor, M. Krieger, and D. Ahn, “TweetMotif: Exploratory search and topic summarization for Twitter,” in
ICWSM, W. W. Cohen and S. Gosling, Eds. Palo Alto, CA, USA: AAAI Press, 2010.

Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICSUNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
NFPA 5000 2024 standard .
NFPA 5000 2024 standard                                  .NFPA 5000 2024 standard                                  .
NFPA 5000 2024 standard .
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 

IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on Ongoing Events

International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 12 | Dec 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1327
A Survey on Trend Analysis on Twitter for Predicting Public Opinion on Ongoing Events
V. D. Punjabi1, Snehal More2, Deepika Patil3, Vinay Bafna4, Shrushti Shah5, Harshada Bachhav6
1Assistant Professor, Dept. of Information Technology, R. C. Patel Institute of Technology, Maharashtra, India
2,3,4,5,6Students, Dept. of Information Technology, R. C. Patel Institute of Technology, Maharashtra, India
Abstract - Twitter is a highly popular social media service that allows its users to spread and share information. It monitors user postings and detects the most discussed topics of the moment, publishing them in a list called “Trending Topics”. This list shows what is happening in the world and what people's opinions are about it. Twitter uses a method that provides an efficient way to categorize trending topics immediately and accurately without the need for external data. Different techniques and many algorithms can be used to disambiguate the meaning of trending topics and to classify each topic into a category. Among the many methods, none is established as a standard in terms of accuracy, so a comparative analysis is needed to understand which one performs better and yields higher-quality detected topics.
Key Words: Information Retrieval, social media, Twitter, Twitter Trending Topic, Topic Detection, Text mining, Trend analysis, Real time.
1. INTRODUCTION
Twitter has become a huge social media service where millions of users contribute on a daily basis.
Two features have been fundamental in its success:
(1) The shortness of tweets, which cannot exceed 140 characters, facilitates creation and sharing of messages in a few seconds.
(2) The ease of spreading those messages to a large number of users in very little time.
Over time, the community of users on Twitter has established a syntax for interacting with one another, which became the standard syntax later officially adopted by its developers[1]. Most major Twitter clients have implemented this standard syntax as well. The standards in the interaction syntax include:
• User mentions: when a user mentions another user in their tweet, an at-sign is placed before the corresponding username, e.g., you should all follow @username, she is always abreast of breaking news and interesting stuff.
• Replies: when a user wants to direct a message to another user, or reply to an earlier tweet, they place the @username mention at the beginning of the tweet, e.g., @username I agree with you.
• Retweets: a retweet is considered a re-share of a tweet posted by another user, i.e., a retweet means the user considers that the message in the tweet might be of interest to others. When a user retweets, the new tweet copies the original one in it. Furthermore, the retweet attaches an RT and the @username of the user who posted the original tweet at the beginning of the retweet. For instance: if the user @username posted the tweet .
• Hashtags: similar to tags on social tagging systems or other social networking systems, hashtags included in a tweet tend to group tweets in conversations or represent the main terms of the tweet, usually referring to topics or common interests of a community. A hashtag is differentiated from the rest of the terms in the tweet in that it has a leading hash, e.g., #hashtag[2].
1.1 Trending Topic:
One of the main features on the homepage of Twitter is a list of top terms, the so-called trending topics, shown at all times.
These terms reflect the topics that are being discussed most at the very moment on the site’s fast-flowing stream of tweets. In order to avoid topics that are popular regularly (e.g., good morning or good night at certain times of the day), Twitter focuses on topics that are being discussed much more than usual, i.e., topics that recently experienced an increase in use, so that they trended for some reason. Trending topics have attracted great interest not only among the users themselves but also among other information consumers such as journalists, real-time application developers, and social media researchers. Knowing the top conversations being discussed at a given time helps people keep up to date with current affairs and discover the main concerns of the community. Twitter defines trending topics as “topics that are immediately popular, rather than topics that have been popular for a while or on a daily basis”. However, nothing further is known about the algorithm that extracts trending topics. It is assumed that the list is made up of terms that appear more frequently in the most recent stream of tweets than usually expected[3].
1.2 Modules:
The system comprises 4 major modules, described as follows:
1. Login: The user needs to log in first using valid credentials to access the system.
2. Search for Latest Trends: After a successful login, the user can search for the latest trending tweets by entering a keyword in the search column.
3. View Latest Trending Tweet: Based on the keyword entered by the user, the search results are displayed in the form of trending tweets.
4. View Tweets: The user can click on a trending tweet to view the messages tweeted by other users.
2. LITERATURE SURVEY
Trend analysis, and the prediction of public opinion based on it, plays an important role, and many researchers are working on automatic techniques for extracting and analysing huge amounts of Twitter data.
Luca Maria Aiello et al. compare six trend detection methods and find that standard natural language processing techniques perform well for social streams on a particular topic. They conclude that n-grams give the best performance compared with other state-of-the-art techniques[4].
Altawaier, M. M., & Tiun, S. (2016) used three different machine learning algorithms, Naïve Bayes, Decision Trees and Support Vector Machine, for sentiment classification of an Arabic dataset obtained from Twitter. This research followed a framework for Arabic tweet classification in which two special sub-tasks were performed in pre-processing: Term Frequency-Inverse Document Frequency (TF-IDF) and Arabic stemming. They used one dataset with three algorithms, and performance was evaluated on the basis of three information retrieval metrics: precision, recall, and F-measure[5].
Kathy Lee et al. proposed supervised learning techniques to classify Twitter trending topics, using a text-based and a network-based classifier and identifying the best-performing one. A. Hernandez-Suarez proposes a model that predicts public opinion on political events by applying different classifiers that predict whether the mood is positive or negative. Kathy Lee proposed a way to obtain pre-labeled data from Twitter which can be used to train an SVM classifier. They used Twitter hashtags to judge the polarity of a tweet. To analyze the accuracy of the proposed technique, a test study on the classifier was conducted, which showed an accuracy of 85%[6].
Go, A., Bhayani, R., & Huang, L. (2009) introduced a new technique to classify the sentiment of tweets as positive or negative. They presented and discussed the results of machine learning algorithms for Twitter sentiment analysis using distant supervision[7]. The training data used by the authors consisted of tweets with emoticons, which served as noisy labels. According to the authors, machine learning algorithms such as Naive Bayes, Maximum Entropy and SVM, when trained with emoticon tweets, can achieve accuracy of more than 80%. The study also highlighted the steps used in the preprocessing stage of classification for high accuracy.
In [8], sentiment analysis is performed using SVM on two pre-classified datasets of tweets, followed by a comparative analysis using the measures precision, recall and F-measure.
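To make the evaluation measures used in the surveyed studies concrete, the following minimal Python sketch computes precision, recall and F-measure for a binary sentiment classifier. The function name, the label strings and the toy data are assumptions made for illustration; they are not taken from any of the cited papers.

```python
def precision_recall_f1(y_true, y_pred, positive="positive"):
    """Return (precision, recall, F-measure) for the given positive label."""
    pairs = list(zip(y_true, y_pred))
    # True positives: predicted positive and actually positive.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    # False positives: predicted positive but actually not.
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    # False negatives: actually positive but predicted otherwise.
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    # F-measure is the harmonic mean of precision and recall.
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical gold labels and classifier output for five tweets.
gold = ["positive", "positive", "negative", "negative", "positive"]
pred = ["positive", "negative", "positive", "negative", "positive"]
print(precision_recall_f1(gold, pred))
```

In this toy case there are two true positives, one false positive and one false negative, so all three measures come out at two thirds.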
3. PROPOSED METHODOLOGY
3.1 Latent Dirichlet Allocation (LDA):
Topic extraction in textual corpora can be addressed through probabilistic topic models. In general, a topic model is a Bayesian model that associates with each document a probability distribution over topics, which are in turn distributions over words. According to LDA, every document is considered as a bag of terms, which are the only observed variables in the model. The topic distribution per document and the term distribution per topic are instead hidden and have to be estimated through Bayesian inference. We use the Collapsed Variational Bayesian inference algorithm [9], an LDA variant that is computationally efficient, more accurate than standard variational Bayesian inference for LDA, and has parallel implementations already available in Apache Mahout. LDA requires the expected number of topics as input, and in our evaluation we explore the quality of the topics for different values of this number. Estimating the optimal number of topics is possible through the use of non-parametric methods [10].
3.2 Document-Pivot Topic Detection (Doc-p):
This is a Topic Detection and Tracking method that uses a document-pivot approach. It uses locality-sensitive hashing (LSH) to rapidly retrieve the nearest neighbour of a document and accelerate the clustering task. The principle behind this method is the same as that used for near-duplicate detection in the similarity-based aggregation step of the pre-processing phase. It works as follows:
Perform online clustering of posts: compute the cosine similarity of the tf-idf representation of an incoming post to all other posts processed so far. If the similarity to the best matching post is above some threshold θ_tf-idf, assign the item to the same cluster as its best match; otherwise create a new cluster with the new post as its only item. The best matching tweet is efficiently retrieved by LSH[11].
3.3 Graph-Based Feature-Pivot Topic Detection:
The distinguishing feature of this method is that, for the feature clustering step, it uses the Structural Clustering Algorithm for Networks (SCAN). A property of SCAN is that, apart from detecting communities of nodes, it provides a list of hubs, each of which may be connected to a set of communities. In a feature-pivot approach for topic detection, the nodes of the graph correspond to terms and the communities correspond to topics[12]. The detected hubs would then ideally be terms that are related to more than one topic, something that would not be possible to achieve with a common partitional clustering algorithm, and would effectively provide an explicit link between topics.
We select the terms to be clustered, out of the set of terms present in the corpus, using the approach in [13], which uses an independent reference corpus consisting of randomly collected tweets. The terms with the highest ratio of likelihoods between the corpus and the reference corpus will be the ones with significantly higher than usual frequency of appearance, and it is expected that they are related to the most actively discussed topics in the corpus. Once the high-ranking terms are selected, a term graph is constructed and the SCAN graph-based clustering algorithm is applied to extract groups of terms, each of which is considered to be a distinct topic. The algorithm steps are the following:
• Selection: The top terms are selected using the ratio of likelihoods, and a node is created in the graph for each of them.
• Linking: The nodes of the graph are connected using a term linking strategy. First, a similarity measure for pairs of terms is selected and then all pairwise similarities are computed. Various options for the similarity measure are explored: the number of documents in which the terms co-occur, the number of co-occurrences divided by the larger or smaller document frequency of the two terms, and Jaccard similarity.
• Clustering: The SCAN algorithm is applied to the graph; a topic is generated for each of the detected communities.
• Cluster enrichment: The connectivity of each of the hubs detected by SCAN to each of the communities is checked and, if it exceeds some threshold, the hub is linked to the community. A hub may be linked to more than one topic[13].
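As an illustration of the Linking step in Section 3.3, the sketch below builds a term graph using Jaccard similarity, one of the similarity options listed above. It is a minimal sketch under stated assumptions: the function name, the whitespace tokenisation, the toy posts and the min_jaccard threshold are hypothetical, and the SCAN clustering that would follow is not implemented here.

```python
from itertools import combinations


def build_term_graph(docs, terms, min_jaccard=0.3):
    """Link pairs of terms whose Jaccard similarity over documents is high enough.

    docs        -- list of post texts (tokenised naively on whitespace here)
    terms       -- the already selected high-ranking terms (graph nodes)
    min_jaccard -- hypothetical linking threshold, not taken from the paper
    Returns a dict mapping (term_a, term_b) to their similarity.
    """
    # For every term, the set of document indices in which it appears.
    postings = {
        term: {i for i, doc in enumerate(docs) if term in doc.lower().split()}
        for term in terms
    }
    edges = {}
    for a, b in combinations(sorted(terms), 2):
        union = postings[a] | postings[b]
        if not union:
            continue  # neither term occurs in any document
        # Jaccard similarity: co-occurrence count over combined occurrence count.
        similarity = len(postings[a] & postings[b]) / len(union)
        if similarity >= min_jaccard:
            edges[(a, b)] = similarity
    return edges


# Toy corpus of four posts about two unrelated events.
posts = [
    "earthquake hits city",
    "earthquake rescue city",
    "election results tonight",
    "election debate results",
]
selected = ["earthquake", "city", "election", "results", "rescue"]
for pair, sim in build_term_graph(posts, selected).items():
    print(pair, round(sim, 2))
```

On this toy corpus the edges fall into two disconnected groups of terms, one per event, which is the kind of structure a community detection algorithm such as SCAN would then turn into topics.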
4. CONCLUSION
Because tweets are short messages, they can be used for predicting public opinion on sports, the economy, ongoing events, etc. The approach finds keywords in a tweet and predicts whether its weight is positive or negative by applying machine learning algorithms. Multiple classification algorithms can be applied, such as SVM, Naïve Bayes, logistic classification, KNN and decision trees, and evaluated with information retrieval measures such as precision, recall and F-measure. In future, these can be tested with Python code to find the best classifier. Many tools are available for finding Twitter trends. Since no standard topic detection technique has been established yet, comparative analysis is needed to understand to what extent these dimensions determine the quality of the detected topics.
REFERENCES
[1] Rong Lu and Qing Yang, “Trend Analysis of News Topics on Twitter,” International Journal of Machine Learning and Computing, Vol. 2, No. 3, June 2012.
[2] Soyeon Caren Han, Hyunsuk Chung, Do Hyeong Kim, Sungyoung Lee, and Byeong Ho Kang, “Twitter Trending Topics Meaning Disambiguation,” Springer International Publishing Switzerland, 2014.
[3] https://www.google.co.in/url?sa=t&rct=j&q=&esrc= s&source=web&cd=6&cad=rja&uact=8&ved=0ahUK EwiR2bbL_I_TAhVGu48KHUQ0BzEQFggyMAU&url=h ttp%3A%2F%2Fcucis.ece.northwestern.edu%2Fpub lications%2Fpdf%2FLeePal11.pdf&usg=AFQjCNFo8_ u6YiJFhnwOKV1RDN_yVIDT Q&sig2=Fq3_jJ34fq6iDkQW3PVbNw&bvm=bv.15217 4688,d.c2I.
[4] Luca Maria Aiello, Georgios Petkos, Carlos Martin, David Corney, Symeon Papadopoulos, Ryan Skraba, Ayse Göker, Ioannis Kompatsiaris, and Alejandro Jaimes, “Sensing Trending Topics in Twitter,” IEEE Transactions on Multimedia, Vol. 15, No. 6, October 2013.
[5] Altawaier, M. M., & Tiun, S. (2016), “Comparison of Machine Learning Approaches on Arabic Twitter Sentiment Analysis,” International Journal on Advanced Science, Engineering and Information Technology, 6(6), 1067-1073.
[6] Kathy Lee, Diana Palsetia, Ramanathan Narayanan, Md. Mostofa Ali Patwary, Ankit Agrawal, Alok Choudhary, “Twitter Trending Topic Classification,” 2011 11th IEEE International Conference on Data Mining.
[7] Go, A., Bhayani, R., & Huang, L. (2009). Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, 1(2009), 12.
[8] Munir Ahmad, Shabib Aftab, Iftikhar Ali, “Sentiment Analysis of Tweets using SVM,” International Journal of Computer Applications, November 2017.
[9] Y. W. Teh, D. Newman, and M. Welling, “A collapsed variational Bayesian inference algorithm for latent Dirichlet allocation,” in Adv. Neural Inf. Process. Syst., 2007, vol. 19.
[10] Y. W. Teh, M. I. Jordan, M. J. Beal, and D. M. Blei, “Hierarchical Dirichlet processes,” J. Amer. Statist. Assoc., vol. 101, no. 476, pp. 1566–1581, 2006.
[11] G. Salton and M. J. McGill, Introduction to Modern Information Retrieval. New York, NY, USA: McGraw-Hill, 1986.
[12] X. Xu, N. Yuruk, Z. Feng, and T. A. J. Schweiger, “SCAN: A structural clustering algorithm for networks,” in Proc. KDD: 13th ACM Int. Conf. Knowledge Discovery and Data Mining, New York, NY, USA, 2007, pp. 824–833.
[13] B. O’Connor, M. Krieger, and D. Ahn, “TweetMotif: Exploratory search and topic summarization for Twitter,” in ICWSM, W. W. Cohen and S. Gosling, Eds. Palo Alto, CA, USA: AAAI Press, 2010.