SlideShare ist ein Scribd-Unternehmen logo
1 von 23
Downloaden Sie, um offline zu lesen
Twitter as a Personalizable
Information Service
Part 2
2015/11/9 John
Outline
● Review Part 1
● Related work
○ Aggregation, Propagation, and Recommendation Through Social Network
○ Event Identification in Social Network
○ Content Personalization
● Implementation
○ Real-time Vectorization of Tweets
○ Analyzing the Stream of Information by Taking into Account Temporal Conditions
○ PageRank
● Summary
● Next
Review: Abstract
Twitter, making this information system one of the fastest in the world. This
chapter introduces a novel topic-detection method about the most emerging
arguments expressed by the Twitter network around his/her domain interests. It
propose an innovative term aging model, based on a biological metaphor, to
retrieve the freshest arguments of discussion.
Review: What’s Data Mining
The process of analyzing data from different perspectives and summarizing it into
useful information.
“You really understand your client or customer?”
Review: Introduction
Simplified notion:
Topics
Extracting
emerging
topics
Personalizing
Related work
In the last decade, the enormous amount of content generated by Web users
created new challenges and new research questions within the data mining
community, those works are related to the research of this chapter.
Survey fields:
1. Aggregation, Propagatoin, and Recommendation information from large-scale social network.
2. Automatic detection of events within user-generated environment
3. Current personalization and current user context analysis
Aggregation, Propagation, and Recommendation Through Social Network
● First issue: Aggregation of content through fliter and merging. There are two
main approaches:
Collaborate filtering: Selecting and proposing content by looking at what
similar users have already selected. (i.e., filter what users select)
Content-based techniques: Analyze the semantics of the content without
considering its origin. (i.e., from only text content)
● Second issue: Analysis of how much information spreads. In general, analyze
from as like with the spread of a disease in a social envrionment.
● Trendistic and Twopular repersent two examples which it is possible to
analyze the trends of keywords along a timeline specified by the user.
Event Identification in Social Network
● Identifying events in real-time on Twitter is a challenging problem, due to the
heterogeneity(diverse) and immense scale of the data.
● A research defines a typology of five generic classes of tweets: News, Events,
Opinions, Deals, Private messages.
● The real time social content can also be seen as a sensor that captures what
is happening in the world: similarly to the recommendation task, this can be
exploited for a zero-delay information broadcasting system that detects
emerging cocepts.
● All the techniques rely on some measure of importance of the keywords. TF-
IDF to avoid the collapse of important terms when they appear in many text
documents.
Content Personalization
● Also cover this survey as relevant work since the system of this chapter
includes a module for the personalization of the emerging topics to be
retrieved from Twitter.
● There obviously exists several approaches to facing such as a task.
● Depending on the domain, one may be interested in ‘re-rank’ the results
based on their relevance, rather than ‘diversify’ them.
● Brief overview of the system in this chapter, it can be classified as a ‘re-rank’
approach.
Implementation
This section illustrates the method for analyzing, in real-time, the dynamic stream
of information expressed by the Twitter community and retrieve the most emerging
topics within the user’s interests.
1. First, a set of tweets is generated within a specific time interval, is
represented as a set of keyword vectors. (term vectors)
2. A term aging model monitors the usage of each keyword over the time.
3. Moreover, the social reputation of the Twitter users is leveraged to balance
the importance of the information expressed by the community.
4. Finally, the user context is taken into account, provided by the generated set
of tweets, to highlight the most emerging topics within user’s interests.
Real-time Vectorization of Tweets
● In most Information Retrieval(IR) system, the first step is extraction of the
relevant keywords. (called ‘term’ in this chapter)
● Considered a interval, I, which is at tth and given a time range r:
● The corpus is extracted, with text tweets extracted during
the time interval.
● Each component of the vector represents a weighted term extracted
from the related tweet vector . And weight of the xth vocabulary
term in jth tweet by using the argmented normal frequency:
TF-IDF
Analyzing the Stream of Information by Taking into Account Temporal Conditions
● Generally speaking, a term can be viewed as a semantic unit which can
potentially link to a new event.
● This section uses a content aging theory to automatically identify coherent
discussions through a life cycle-based content model.
● Many conventional clustering and classification strategies can not be applied
to this problem due to the fact that tend to ignored the temporal relationships
(about time aspect) amoung documents(tweets).
● A keyword of the life cycle can be considered as analogous(like) to the one of
a living being(living thing) with abundant nourishment (i.e., related tweets).
● However, a keyword or a live form dies when nourishment(food) becomes
insufficient.
Analyzing the Stream of Information by Taking into Account Temporal Conditions
● Relaying on this live analogy.
● It is possible to evaluate the usage of a keyword by its burstiness.
● Burstiness indicates the vitality status of the keyword and can qualify the
keyword’s usage.
● High burstiness or low burstiness implies that the term is becoming important
or not.
● Therefore, the system uses the concept of authority to define the quality of
the nutrition that each tweet gives to every contained keyword.
● Different tweets containing the same keyword generate different amount of
nutrition(i.e., calorie) in the community:
Analyzing the Stream of Information by Taking into Account Temporal Conditions
Reputation of the users
● In Twitter, the social model enables to define an author-based graph,
● Reputation can be extended by taking into account the fact that the
importance of a user. It is also related to the degree of importance of its
followers.
● We can refer to the well-known PageRank algorithm for this task that
calculates the reputation as follow:
Note: Later, it is explained by a simple example of PageRank.
Analyzing the Stream of Information by Taking into Account Temporal Conditions
Computing term Burstiness values
● Once the nutrition of a term is calculated, the aim is to map into a value of
burstiness.
● The burstiness value indicates a term’s actual contribution(i.e., how much it is
emergent) in the corpus of tweets.
● A keyword is defined as emergent if it results to be hot in the considered time
interval.
● We analyze the keyword life cycles by comparing their nutrition values
obtained on the considered time frame withe the usage of the same terms in
the past time interval. Namely, the current nourishment is analyzed in
comparison to the ones built in the previous time internals.
Analyzing the Stream of Information by Taking into Account Temporal Conditions
Computing term Burstiness values
● If its nutrition value stays constant during closer time intervals, it means that
community is probably still referring to the same news event.
● Event if the keyword can be considered as hot, it can not be referred as
emergent due to the temporal discrimination. (Temporal parameter influences
the emerging keyword retrieved by the system)
● A parameter s, where , that limits the number of previous time slots
considered by the system to study the keywords life cycles and defines the
history worthiness of the resulting emerging keywords.
PageRank
PageRank works by counting the number and quality of links to a page to
determine a rough estimate of how important the website is. The underlying
assumption is that more important websites are likely to receive more links from
other websites.
Random surfer, where d = 0.85 (usually)
Note: It can be seen a Markov Chain.
Keyword: PageRank, Google search, Markov,
WebCrawler, SEO
PageRank
Example in notion (random traverse method)
Suppose it is all pages in the world.
Compute PR(A), PR(B), PR(C), PR(D)
1. Traverse pages until
reached 1000 times.
2. Traverse pages until
reached 1 million times.
A
D
C
B
PageRank
Simply Example:
Traverse 1000 times
PR(A): 118 times
PR(B): 109 times
PR(C): 13 times
PR(D): 144 times
A
D
C
B
PageRank
Simply Example:
Traverse 1 million times
PR(A): 13%
PR(B): 10%
PR(C): 2%
PR(D): 15%
7%
4%
4%
4%
A
14%
10%
2%
2%
2%
D
C
2%
B
5%
4%
Summary
● Currently, a lot of researches work on Aggregation, Propagatoin, and
Recommendation information from Large scale social network.
● Automatic detection and Personalization on User context become intesesting
and important.
● From implementation, first is extracting the keywords and its weights from
tweets in the interval.
● Computing the user’s reputation and term’s burtiness values.
● Burtiness values of the emerging keyword is influenced with temporal
parameter. (interval and its past interval)
● Compute reputation likes to do PageRank, they are on a direct graph network.
Next
Continue
Implementation:
● Selection of Emerging Terms
● Leveraging User’ Context for Persionalization Purposes
● From Emerging Terms to Emergin Topics
● Topic Detection, Labeling, and Ranking
Experiments
● Case and User Studies
References
● Wiki of TF-IDF - https://en.wikipedia.org/wiki/Tf%E2%80%93idf
● Wiki of Markov Chain - https://en.wikipedia.org/wiki/Markov_chain
● Wiki of PageRank - https://en.wikipedia.org/wiki/PageRank
● PageRank how it work - http://goo.gl/bbShFd
● Nine Algorithms That Changed the Future - http://goo.gl/Y9BFmO

Weitere ähnliche Inhalte

Was ist angesagt?

Final Poster for Engineering Showcase
Final Poster for Engineering ShowcaseFinal Poster for Engineering Showcase
Final Poster for Engineering ShowcaseTucker Truesdale
 
Sensing Trending Topics in Twitter for Greater Jakarta Area
Sensing Trending Topics in Twitter for Greater Jakarta Area Sensing Trending Topics in Twitter for Greater Jakarta Area
Sensing Trending Topics in Twitter for Greater Jakarta Area IJECEIAES
 
Topic Lifecycle on Social Networks
Topic Lifecycle on Social NetworksTopic Lifecycle on Social Networks
Topic Lifecycle on Social NetworksKritika Garg
 
Wikipedia as an Ontology for Describing Documents
Wikipedia as an Ontology for Describing DocumentsWikipedia as an Ontology for Describing Documents
Wikipedia as an Ontology for Describing DocumentsZareen Syed
 
On Summarization and Timeline Generation for Evolutionary Tweet Streams
On Summarization and Timeline Generation for Evolutionary Tweet StreamsOn Summarization and Timeline Generation for Evolutionary Tweet Streams
On Summarization and Timeline Generation for Evolutionary Tweet Streams1crore projects
 
Sentiment Analysis of Twitter tweets using supervised classification technique
Sentiment Analysis of Twitter tweets using supervised classification technique Sentiment Analysis of Twitter tweets using supervised classification technique
Sentiment Analysis of Twitter tweets using supervised classification technique IJERA Editor
 
Stock prediction using social network
Stock prediction using social networkStock prediction using social network
Stock prediction using social networkChanon Hongsirikulkit
 
Link prediction 방법의 개념 및 활용
Link prediction 방법의 개념 및 활용Link prediction 방법의 개념 및 활용
Link prediction 방법의 개념 및 활용Kyunghoon Kim
 
Text mining on Twitter information based on R platform
Text mining on Twitter information based on R platformText mining on Twitter information based on R platform
Text mining on Twitter information based on R platformFayan TAO
 
IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
 IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
IRJET - Implementation of Twitter Sentimental Analysis According to Hash TagIRJET Journal
 
Stop thinking, start tagging - Tag Semantics emerge from Collaborative Verbosity
Stop thinking, start tagging - Tag Semantics emerge from Collaborative VerbosityStop thinking, start tagging - Tag Semantics emerge from Collaborative Verbosity
Stop thinking, start tagging - Tag Semantics emerge from Collaborative VerbosityInovex GmbH
 
How Anonymous Can Someone be on Twitter?
How Anonymous Can Someone be on Twitter?How Anonymous Can Someone be on Twitter?
How Anonymous Can Someone be on Twitter?George Sam
 
Twitter sentiment analysis project report
Twitter sentiment analysis project reportTwitter sentiment analysis project report
Twitter sentiment analysis project reportBharat Khanna
 
Trend detection and analysis on Twitter
Trend detection and analysis on TwitterTrend detection and analysis on Twitter
Trend detection and analysis on TwitterLukas Masuch
 
WWW2010_Earthquake Shakes Twitter User: Analyzing Tweets for Real-Time Event...
WWW2010_Earthquake Shakes Twitter User: Analyzing Tweets for Real-Time Event...WWW2010_Earthquake Shakes Twitter User: Analyzing Tweets for Real-Time Event...
WWW2010_Earthquake Shakes Twitter User: Analyzing Tweets for Real-Time Event...tksakaki
 
Earthquake shakes twitter users
Earthquake shakes twitter usersEarthquake shakes twitter users
Earthquake shakes twitter usersEshan Mudwel
 

Was ist angesagt? (20)

Final Poster for Engineering Showcase
Final Poster for Engineering ShowcaseFinal Poster for Engineering Showcase
Final Poster for Engineering Showcase
 
Sensing Trending Topics in Twitter for Greater Jakarta Area
Sensing Trending Topics in Twitter for Greater Jakarta Area Sensing Trending Topics in Twitter for Greater Jakarta Area
Sensing Trending Topics in Twitter for Greater Jakarta Area
 
Topic Lifecycle on Social Networks
Topic Lifecycle on Social NetworksTopic Lifecycle on Social Networks
Topic Lifecycle on Social Networks
 
Wikipedia as an Ontology for Describing Documents
Wikipedia as an Ontology for Describing DocumentsWikipedia as an Ontology for Describing Documents
Wikipedia as an Ontology for Describing Documents
 
Mcs 021 solve assignment
Mcs 021 solve assignmentMcs 021 solve assignment
Mcs 021 solve assignment
 
Mcs 021
Mcs 021Mcs 021
Mcs 021
 
On Summarization and Timeline Generation for Evolutionary Tweet Streams
On Summarization and Timeline Generation for Evolutionary Tweet StreamsOn Summarization and Timeline Generation for Evolutionary Tweet Streams
On Summarization and Timeline Generation for Evolutionary Tweet Streams
 
wendi_ppt
wendi_pptwendi_ppt
wendi_ppt
 
Sentiment Analysis of Twitter tweets using supervised classification technique
Sentiment Analysis of Twitter tweets using supervised classification technique Sentiment Analysis of Twitter tweets using supervised classification technique
Sentiment Analysis of Twitter tweets using supervised classification technique
 
Stock prediction using social network
Stock prediction using social networkStock prediction using social network
Stock prediction using social network
 
Link prediction 방법의 개념 및 활용
Link prediction 방법의 개념 및 활용Link prediction 방법의 개념 및 활용
Link prediction 방법의 개념 및 활용
 
Text mining on Twitter information based on R platform
Text mining on Twitter information based on R platformText mining on Twitter information based on R platform
Text mining on Twitter information based on R platform
 
IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
 IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
 
Stop thinking, start tagging - Tag Semantics emerge from Collaborative Verbosity
Stop thinking, start tagging - Tag Semantics emerge from Collaborative VerbosityStop thinking, start tagging - Tag Semantics emerge from Collaborative Verbosity
Stop thinking, start tagging - Tag Semantics emerge from Collaborative Verbosity
 
How Anonymous Can Someone be on Twitter?
How Anonymous Can Someone be on Twitter?How Anonymous Can Someone be on Twitter?
How Anonymous Can Someone be on Twitter?
 
Who gives a tweet
Who gives a tweetWho gives a tweet
Who gives a tweet
 
Twitter sentiment analysis project report
Twitter sentiment analysis project reportTwitter sentiment analysis project report
Twitter sentiment analysis project report
 
Trend detection and analysis on Twitter
Trend detection and analysis on TwitterTrend detection and analysis on Twitter
Trend detection and analysis on Twitter
 
WWW2010_Earthquake Shakes Twitter User: Analyzing Tweets for Real-Time Event...
WWW2010_Earthquake Shakes Twitter User: Analyzing Tweets for Real-Time Event...WWW2010_Earthquake Shakes Twitter User: Analyzing Tweets for Real-Time Event...
WWW2010_Earthquake Shakes Twitter User: Analyzing Tweets for Real-Time Event...
 
Earthquake shakes twitter users
Earthquake shakes twitter usersEarthquake shakes twitter users
Earthquake shakes twitter users
 

Andere mochten auch

Andere mochten auch (12)

Acetato cap4
Acetato cap4Acetato cap4
Acetato cap4
 
Guia unidad-1-comercio-electronico
Guia unidad-1-comercio-electronicoGuia unidad-1-comercio-electronico
Guia unidad-1-comercio-electronico
 
Poland 2
Poland 2Poland 2
Poland 2
 
Pintura de ricardo monteiro 3-2015
Pintura de ricardo monteiro  3-2015Pintura de ricardo monteiro  3-2015
Pintura de ricardo monteiro 3-2015
 
Benjamin d
Benjamin dBenjamin d
Benjamin d
 
ซักซ้อมแผนสี่ปี
ซักซ้อมแผนสี่ปีซักซ้อมแผนสี่ปี
ซักซ้อมแผนสี่ปี
 
Improving the contribution of livestock to human health and nutrition and wel...
Improving the contribution of livestock to human health and nutrition and wel...Improving the contribution of livestock to human health and nutrition and wel...
Improving the contribution of livestock to human health and nutrition and wel...
 
LPA ด้าน 4 การบริการสาธารณะ
LPA ด้าน 4 การบริการสาธารณะLPA ด้าน 4 การบริการสาธารณะ
LPA ด้าน 4 การบริการสาธารณะ
 
National parks of india
National parks of indiaNational parks of india
National parks of india
 
Solid State Drives - Seminar Report for Semester 6 Computer Engineering - VIT...
Solid State Drives - Seminar Report for Semester 6 Computer Engineering - VIT...Solid State Drives - Seminar Report for Semester 6 Computer Engineering - VIT...
Solid State Drives - Seminar Report for Semester 6 Computer Engineering - VIT...
 
Law+Of+Partnership
Law+Of+PartnershipLaw+Of+Partnership
Law+Of+Partnership
 
CULTURA DE PAZ
CULTURA DE PAZCULTURA DE PAZ
CULTURA DE PAZ
 

Ähnlich wie Twitter as a personalizable information service ii

IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...
IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...
IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...IEEEFINALYEARSTUDENTPROJECTS
 
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...IEEEFINALYEARSTUDENTPROJECT
 
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...IEEEMEMTECHSTUDENTSPROJECTS
 
Twitter: Social Network Or News Medium?
Twitter: Social Network Or News Medium?Twitter: Social Network Or News Medium?
Twitter: Social Network Or News Medium?Serge Beckers
 
Marketing analysis
Marketing analysisMarketing analysis
Marketing analysisGaurav Dubey
 
IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...
IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...
IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...IRJET Journal
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSumit Raj
 
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptxSampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx20211a05p7
 
Detection and Analysis of Twitter Trending Topics via Link-Anomaly Detection
Detection and Analysis of Twitter Trending Topics via Link-Anomaly DetectionDetection and Analysis of Twitter Trending Topics via Link-Anomaly Detection
Detection and Analysis of Twitter Trending Topics via Link-Anomaly DetectionIJERA Editor
 
Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment Analysisijtsrd
 
Topic Evolutionary Tweet Stream Clustering Algorithm and TCV Rank Summarization
Topic Evolutionary Tweet Stream Clustering Algorithm and TCV Rank SummarizationTopic Evolutionary Tweet Stream Clustering Algorithm and TCV Rank Summarization
Topic Evolutionary Tweet Stream Clustering Algorithm and TCV Rank SummarizationIJERA Editor
 
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...IRJET Journal
 
Groundhog day: near duplicate detection on twitter
Groundhog day: near duplicate detection on twitterGroundhog day: near duplicate detection on twitter
Groundhog day: near duplicate detection on twitterDan Nguyen
 
Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...
Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...
Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...Pavan Kapanipathi
 
IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Re...
IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Re...IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Re...
IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Re...IRJET Journal
 
IRJET- Tweet Segmentation and its Application to Named Entity Recognition
IRJET- Tweet Segmentation and its Application to Named Entity RecognitionIRJET- Tweet Segmentation and its Application to Named Entity Recognition
IRJET- Tweet Segmentation and its Application to Named Entity RecognitionIRJET Journal
 
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITY
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITYFRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITY
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITYcscpconf
 

Ähnlich wie Twitter as a personalizable information service ii (20)

IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...
IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...
IEEE 2014 JAVA DATA MINING PROJECTS Discovering emerging topics in social str...
 
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
 
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
2014 IEEE JAVA DATA MINING PROJECT Discovering emerging topics in social stre...
 
Twitter: Social Network Or News Medium?
Twitter: Social Network Or News Medium?Twitter: Social Network Or News Medium?
Twitter: Social Network Or News Medium?
 
Marketing analysis
Marketing analysisMarketing analysis
Marketing analysis
 
IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...
IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...
IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter Data
 
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptxSampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
 
Automatic Summarizaton Tutorial
Automatic Summarizaton TutorialAutomatic Summarizaton Tutorial
Automatic Summarizaton Tutorial
 
Detection and Analysis of Twitter Trending Topics via Link-Anomaly Detection
Detection and Analysis of Twitter Trending Topics via Link-Anomaly DetectionDetection and Analysis of Twitter Trending Topics via Link-Anomaly Detection
Detection and Analysis of Twitter Trending Topics via Link-Anomaly Detection
 
Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment Analysis
 
Topic Evolutionary Tweet Stream Clustering Algorithm and TCV Rank Summarization
Topic Evolutionary Tweet Stream Clustering Algorithm and TCV Rank SummarizationTopic Evolutionary Tweet Stream Clustering Algorithm and TCV Rank Summarization
Topic Evolutionary Tweet Stream Clustering Algorithm and TCV Rank Summarization
 
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
 
F017433947
F017433947F017433947
F017433947
 
Groundhog day: near duplicate detection on twitter
Groundhog day: near duplicate detection on twitterGroundhog day: near duplicate detection on twitter
Groundhog day: near duplicate detection on twitter
 
Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...
Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...
Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...
 
IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Re...
IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Re...IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Re...
IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Re...
 
IRJET- Tweet Segmentation and its Application to Named Entity Recognition
IRJET- Tweet Segmentation and its Application to Named Entity RecognitionIRJET- Tweet Segmentation and its Application to Named Entity Recognition
IRJET- Tweet Segmentation and its Application to Named Entity Recognition
 
E017433538
E017433538E017433538
E017433538
 
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITY
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITYFRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITY
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITY
 

Mehr von Kan-Han (John) Lu

Working process and git branch strategy
Working process and git branch strategyWorking process and git branch strategy
Working process and git branch strategyKan-Han (John) Lu
 
Deep neural network for youtube recommendations
Deep neural network for youtube recommendationsDeep neural network for youtube recommendations
Deep neural network for youtube recommendationsKan-Han (John) Lu
 
Multimedia data minig and analytics sentiment analysis using social multimedia
Multimedia data minig and analytics sentiment analysis using social multimediaMultimedia data minig and analytics sentiment analysis using social multimedia
Multimedia data minig and analytics sentiment analysis using social multimediaKan-Han (John) Lu
 
ARM: Trusted Zone on Android
ARM: Trusted Zone on AndroidARM: Trusted Zone on Android
ARM: Trusted Zone on AndroidKan-Han (John) Lu
 
Android Training - Card Style
Android Training - Card StyleAndroid Training - Card Style
Android Training - Card StyleKan-Han (John) Lu
 
Android Training - View Pager
Android Training - View PagerAndroid Training - View Pager
Android Training - View PagerKan-Han (John) Lu
 
Android Training - Sliding Menu
Android Training - Sliding MenuAndroid Training - Sliding Menu
Android Training - Sliding MenuKan-Han (John) Lu
 
Android Training - Pull to Refresh
Android Training - Pull to RefreshAndroid Training - Pull to Refresh
Android Training - Pull to RefreshKan-Han (John) Lu
 
Code analyzer: FindBugs and PMD
Code analyzer: FindBugs and PMDCode analyzer: FindBugs and PMD
Code analyzer: FindBugs and PMDKan-Han (John) Lu
 
Android Training - Content Sharing
Android Training - Content SharingAndroid Training - Content Sharing
Android Training - Content SharingKan-Han (John) Lu
 
Android Training - Action Bar
Android Training - Action BarAndroid Training - Action Bar
Android Training - Action BarKan-Han (John) Lu
 

Mehr von Kan-Han (John) Lu (20)

Dagger for android
Dagger for androidDagger for android
Dagger for android
 
Android develop guideline
Android develop guidelineAndroid develop guideline
Android develop guideline
 
Working process and git branch strategy
Working process and git branch strategyWorking process and git branch strategy
Working process and git branch strategy
 
Deep neural network for youtube recommendations
Deep neural network for youtube recommendationsDeep neural network for youtube recommendations
Deep neural network for youtube recommendations
 
Android testing part i
Android testing part iAndroid testing part i
Android testing part i
 
Cuda project paper
Cuda project paperCuda project paper
Cuda project paper
 
Multimedia data minig and analytics sentiment analysis using social multimedia
Multimedia data minig and analytics sentiment analysis using social multimediaMultimedia data minig and analytics sentiment analysis using social multimedia
Multimedia data minig and analytics sentiment analysis using social multimedia
 
Android IPC: Binder
Android IPC: BinderAndroid IPC: Binder
Android IPC: Binder
 
ARM: Trusted Zone on Android
ARM: Trusted Zone on AndroidARM: Trusted Zone on Android
ARM: Trusted Zone on Android
 
Android Training - Card Style
Android Training - Card StyleAndroid Training - Card Style
Android Training - Card Style
 
Android Training - View Pager
Android Training - View PagerAndroid Training - View Pager
Android Training - View Pager
 
Android Training - Sliding Menu
Android Training - Sliding MenuAndroid Training - Sliding Menu
Android Training - Sliding Menu
 
Android Training - Pull to Refresh
Android Training - Pull to RefreshAndroid Training - Pull to Refresh
Android Training - Pull to Refresh
 
Java: Exception Handling
Java: Exception HandlingJava: Exception Handling
Java: Exception Handling
 
Dynamic Proxy by Java
Dynamic Proxy by JavaDynamic Proxy by Java
Dynamic Proxy by Java
 
Code analyzer: FindBugs and PMD
Code analyzer: FindBugs and PMDCode analyzer: FindBugs and PMD
Code analyzer: FindBugs and PMD
 
Android UI System
Android UI SystemAndroid UI System
Android UI System
 
Android Fragment
Android FragmentAndroid Fragment
Android Fragment
 
Android Training - Content Sharing
Android Training - Content SharingAndroid Training - Content Sharing
Android Training - Content Sharing
 
Android Training - Action Bar
Android Training - Action BarAndroid Training - Action Bar
Android Training - Action Bar
 

Kürzlich hochgeladen

KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...tanu pandey
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfRagavanV2
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01KreezheaRecto
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoordharasingh5698
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfJiananWang21
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...SUHANI PANDEY
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueBhangaleSonal
 

Kürzlich hochgeladen (20)

KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 
NFPA 5000 2024 standard .
NFPA 5000 2024 standard                                  .NFPA 5000 2024 standard                                  .
NFPA 5000 2024 standard .
 
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdf
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 

Twitter as a personalizable information service ii

  • 1. Twitter as a Personalizable Information Service Part 2 2015/11/9 John
  • 2. Outline ● Review Part 1 ● Related work ○ Aggregation, Propagation, and Recommendation Through Social Network ○ Event Identification in Social Network ○ Content Personalization ● Implementation ○ Real-time Vectorization of Tweets ○ Analyzing the Stream of Information by Taking into Account Temporal Conditions ○ PageRank ● Summary ● Next
  • 3. Review: Abstract Twitter, making this information system one of the fastest in the world. This chapter introduces a novel topic-detection method about the most emerging arguments expressed by the Twitter network around his/her domain interests. It propose an innovative term aging model, based on a biological metaphor, to retrieve the freshest arguments of discussion.
  • 4. Review: What’s Data Mining The process of analyzing data from different perspectives and summarizing it into useful information. “You really understand your client or customer?”
  • 6. Related work In the last decade, the enormous amount of content generated by Web users created new challenges and new research questions within the data mining community, those works are related to the research of this chapter. Survey fields: 1. Aggregation, Propagatoin, and Recommendation information from large-scale social network. 2. Automatic detection of events within user-generated environment 3. Current personalization and current user context analysis
  • 7. Aggregation, Propagation, and Recommendation Through Social Network ● First issue: Aggregation of content through fliter and merging. There are two main approaches: Collaborate filtering: Selecting and proposing content by looking at what similar users have already selected. (i.e., filter what users select) Content-based techniques: Analyze the semantics of the content without considering its origin. (i.e., from only text content) ● Second issue: Analysis of how much information spreads. In general, analyze from as like with the spread of a disease in a social envrionment. ● Trendistic and Twopular repersent two examples which it is possible to analyze the trends of keywords along a timeline specified by the user.
  • 8. Event Identification in Social Network ● Identifying events in real-time on Twitter is a challenging problem, due to the heterogeneity(diverse) and immense scale of the data. ● A research defines a typology of five generic classes of tweets: News, Events, Opinions, Deals, Private messages. ● The real time social content can also be seen as a sensor that captures what is happening in the world: similarly to the recommendation task, this can be exploited for a zero-delay information broadcasting system that detects emerging cocepts. ● All the techniques rely on some measure of importance of the keywords. TF- IDF to avoid the collapse of important terms when they appear in many text documents.
  • 9. Content Personalization ● Also cover this survey as relevant work since the system of this chapter includes a module for the personalization of the emerging topics to be retrieved from Twitter. ● There obviously exists several approaches to facing such as a task. ● Depending on the domain, one may be interested in ‘re-rank’ the results based on their relevance, rather than ‘diversify’ them. ● Brief overview of the system in this chapter, it can be classified as a ‘re-rank’ approach.
  • 10. Implementation This section illustrates the method for analyzing, in real-time, the dynamic stream of information expressed by the Twitter community and retrieve the most emerging topics within the user’s interests. 1. First, a set of tweets is generated within a specific time interval, is represented as a set of keyword vectors. (term vectors) 2. A term aging model monitors the usage of each keyword over the time. 3. Moreover, the social reputation of the Twitter users is leveraged to balance the importance of the information expressed by the community. 4. Finally, the user context is taken into account, provided by the generated set of tweets, to highlight the most emerging topics within user’s interests.
  • 11. Real-time Vectorization of Tweets ● In most Information Retrieval(IR) system, the first step is extraction of the relevant keywords. (called ‘term’ in this chapter) ● Considered a interval, I, which is at tth and given a time range r: ● The corpus is extracted, with text tweets extracted during the time interval. ● Each component of the vector represents a weighted term extracted from the related tweet vector . And weight of the xth vocabulary term in jth tweet by using the argmented normal frequency: TF-IDF
  • 12. Analyzing the Stream of Information by Taking into Account Temporal Conditions ● Generally speaking, a term can be viewed as a semantic unit which can potentially link to a new event. ● This section uses a content aging theory to automatically identify coherent discussions through a life cycle-based content model. ● Many conventional clustering and classification strategies can not be applied to this problem due to the fact that tend to ignored the temporal relationships (about time aspect) amoung documents(tweets). ● A keyword of the life cycle can be considered as analogous(like) to the one of a living being(living thing) with abundant nourishment (i.e., related tweets). ● However, a keyword or a live form dies when nourishment(food) becomes insufficient.
  • 13. Analyzing the Stream of Information by Taking into Account Temporal Conditions ● Relaying on this live analogy. ● It is possible to evaluate the usage of a keyword by its burstiness. ● Burstiness indicates the vitality status of the keyword and can qualify the keyword’s usage. ● High burstiness or low burstiness implies that the term is becoming important or not. ● Therefore, the system uses the concept of authority to define the quality of the nutrition that each tweet gives to every contained keyword. ● Different tweets containing the same keyword generate different amount of nutrition(i.e., calorie) in the community:
  • 14. Analyzing the Stream of Information by Taking into Account Temporal Conditions Reputation of the users ● In Twitter, the social model enables to define an author-based graph, ● Reputation can be extended by taking into account the fact that the importance of a user. It is also related to the degree of importance of its followers. ● We can refer to the well-known PageRank algorithm for this task that calculates the reputation as follow: Note: Later, it is explained by a simple example of PageRank.
  • 15. Analyzing the Stream of Information by Taking into Account Temporal Conditions Computing term Burstiness values ● Once the nutrition of a term is calculated, the aim is to map into a value of burstiness. ● The burstiness value indicates a term’s actual contribution(i.e., how much it is emergent) in the corpus of tweets. ● A keyword is defined as emergent if it results to be hot in the considered time interval. ● We analyze the keyword life cycles by comparing their nutrition values obtained on the considered time frame withe the usage of the same terms in the past time interval. Namely, the current nourishment is analyzed in comparison to the ones built in the previous time internals.
  • 16. Analyzing the Stream of Information by Taking into Account Temporal Conditions Computing term Burstiness values ● If its nutrition value stays constant during closer time intervals, it means that community is probably still referring to the same news event. ● Event if the keyword can be considered as hot, it can not be referred as emergent due to the temporal discrimination. (Temporal parameter influences the emerging keyword retrieved by the system) ● A parameter s, where , that limits the number of previous time slots considered by the system to study the keywords life cycles and defines the history worthiness of the resulting emerging keywords.
  • 17. PageRank PageRank works by counting the number and quality of links to a page to determine a rough estimate of how important the website is. The underlying assumption is that more important websites are likely to receive more links from other websites. Random surfer, where d = 0.85 (usually) Note: It can be seen a Markov Chain. Keyword: PageRank, Google search, Markov, WebCrawler, SEO
  • 18. PageRank Example in notion (random traverse method) Suppose it is all pages in the world. Compute PR(A), PR(B), PR(C), PR(D) 1. Traverse pages until reached 1000 times. 2. Traverse pages until reached 1 million times. A D C B
  • 19. PageRank Simply Example: Traverse 1000 times PR(A): 118 times PR(B): 109 times PR(C): 13 times PR(D): 144 times A D C B
  • 20. PageRank Simply Example: Traverse 1 million times PR(A): 13% PR(B): 10% PR(C): 2% PR(D): 15% 7% 4% 4% 4% A 14% 10% 2% 2% 2% D C 2% B 5% 4%
  • 21. Summary ● Currently, a lot of researches work on Aggregation, Propagatoin, and Recommendation information from Large scale social network. ● Automatic detection and Personalization on User context become intesesting and important. ● From implementation, first is extracting the keywords and its weights from tweets in the interval. ● Computing the user’s reputation and term’s burtiness values. ● Burtiness values of the emerging keyword is influenced with temporal parameter. (interval and its past interval) ● Compute reputation likes to do PageRank, they are on a direct graph network.
  • 22. Next Continue Implementation: ● Selection of Emerging Terms ● Leveraging User’ Context for Persionalization Purposes ● From Emerging Terms to Emergin Topics ● Topic Detection, Labeling, and Ranking Experiments ● Case and User Studies
  • 23. References ● Wiki of TF-IDF - https://en.wikipedia.org/wiki/Tf%E2%80%93idf ● Wiki of Markov Chain - https://en.wikipedia.org/wiki/Markov_chain ● Wiki of PageRank - https://en.wikipedia.org/wiki/PageRank ● PageRank how it work - http://goo.gl/bbShFd ● Nine Algorithms That Changed the Future - http://goo.gl/Y9BFmO