SlideShare ist ein Scribd-Unternehmen logo
1 von 28
Downloaden Sie, um offline zu lesen
Characterizing the Life Cycle
of Online News Stories
Using Social Media Reactions
Carlos Castillo, Mohammed El-Haddad, Matt Stempeck, Jürgen Pfeffer
Twitter: @ChaToX
2
Carlos Castillo – @chatox
http://www.chato.cl/research/
Outline
• Determining classes of news articles
• Predicting traffic using social media
3
Carlos Castillo – @chatox
http://www.chato.cl/research/
Usage analysis in online news
• Aikat (1998)
– Short dwell times, weekday+, weekend-,
bursty traffic.
• Crane and Sornette (2008), Yang and
Leskovec (2011), Lehmann et al. (2012)
– Behavioral classes of attention online
4
Carlos Castillo – @chatox
http://www.chato.cl/research/
Analysis of social media responses
• SocialFlow whitepaper (Lotan, Gaffney,
and Meyer 2011)
– Al Jazeera, BBC News, CNN, The Economist,
Fox News and The New York Times
• Hu et al. (2011)
– Tweets during speech of US president
5
Carlos Castillo – @chatox
http://www.chato.cl/research/
Predictive Web Analytics (references)
6
Carlos Castillo – @chatox
http://www.chato.cl/research/
Data collection
• Three weeks in October 2012
• “Beacon” embedded in Al Jazeera pages
– Real-time data processing
– Apache S4 application for online processing
– Cassandra (NoSQL database) for storage
≈ 3M visits
≈ 200K social media reactions
7
Carlos Castillo – @chatox
http://www.chato.cl/research/
Summary of dataset
8
Carlos Castillo – @chatox
http://www.chato.cl/research/
News In-Depth
Examples:
• US state of Maryland
abolishes death penalty
(May 2nd, 2013)
• Hundreds arrested in
China over 'fake' meat
(May 3rd, 2013)
Examples:
• Spirits of Japan shrine
haunt Asian relations
(May 2nd, 2013)
• Interactive: Powering
the Gulf (May 2nd,
2013)
9
Carlos Castillo – @chatox
http://www.chato.cl/research/
News (322) In-Depth (139)
Tag clouds extracted from titles of articles
Average News profile
Average In-Depth profile
In-Depth items have a slower growth
In-Depth items have a longer shelf-life
In-Depth items are shared on Facebook
News items are shared on Twitter
15
Carlos Castillo – @chatox
http://www.chato.cl/research/
Typical visitation profiles (12 hours)
Decreasing (78%)
Steady (9%)
Increasing (3%)
Rebounding (10%)
Examples
Decreasing
(78%):
● Almost all
breaking news
● Sometimes
delayed due to
timezone
differences, e.g.
Hurricane Sandy
Steady or
Increasing (12%):
● Ongoing news:
Obama/Romney,
Worker strikes in
SA, Syrian unrest
● Articles updated
with supporting
content
Rebounding
(10%):
● Articles picked up
by external
sources or social
media (typically
single source of
traffic)
● Background
articles to new
developments
17
Carlos Castillo – @chatox
http://www.chato.cl/research/
Prediction of visits
• Short-term traffic is to a large extent
correlated with long-term traffic
• Social media signals are correlated with
traffic and shelf-life
More reactions → more traffic
More discussion → longer shelf-life
• Can we predict 7 days after 30 minutes?
18
Carlos Castillo – @chatox
http://www.chato.cl/research/
Predicting traffic and shelf-life online
has a long history
• Predicting long-term behavior and
half-life from short-term observations
– Observations = comments, visits, votes, …
– Behavior = total comments, total visits, …
– 10+ papers specifically on web traffic
• Bit.ly (2011, 2012)
– Studies half-life per topic and platform
Results (traffic predictions)
Results (traffic predictions)
Extrapolate
visits
News are more
predictable than
In-Depth
Results (traffic predictions)
Improved
predictions
Using social
media variables
22
Carlos Castillo – @chatox
http://www.chato.cl/research/
Selected variables, traffic prediction
Results (shelf-life prediction)
Larger
improvements
for In-Depth
articles
Still, this is a 12 hours
error in predicting
something with an
average of 48-72 hours
24
Carlos Castillo – @chatox
http://www.chato.cl/research/
http://fast.qcri.org/
25
Carlos Castillo – @chatox
http://www.chato.cl/research/
What did we learn?
• Decrease, Stay or Increase. Rebound
– Roughly 80:10:10 ratio
• News vs In-Depth: different behavior
• Social media signals are useful to
understand and predict visits
26
Carlos Castillo – @chatox
http://www.chato.cl/research/
Invitation:
ECML/PKDD Discovery Challenge 2014
• Open competition
on predictive Web
Analytics
• Data provided by
Chartbeat Inc.
Thank you!
Carlos Castillo · chato@acm.org
http://www.chato.cl/research/
Characterizing the Life Cycle of Online News Stories Using Social Media Reactions

Weitere ähnliche Inhalte

Andere mochten auch

Social Media News Communities: Gatekeeping, Coverage, and Statement Bias
 Social Media News Communities: Gatekeeping, Coverage, and Statement Bias Social Media News Communities: Gatekeeping, Coverage, and Statement Bias
Social Media News Communities: Gatekeeping, Coverage, and Statement BiasMounia Lalmas-Roelleke
 
Keynote talk: Big Crisis Data, an Open Invitation
Keynote talk: Big Crisis Data, an Open InvitationKeynote talk: Big Crisis Data, an Open Invitation
Keynote talk: Big Crisis Data, an Open InvitationCarlos Castillo (ChaTo)
 
Kdd12 tutorial-inf-part-ii
Kdd12 tutorial-inf-part-iiKdd12 tutorial-inf-part-ii
Kdd12 tutorial-inf-part-iiLaks Lakshmanan
 
Kdd12 tutorial-inf-part-iv
Kdd12 tutorial-inf-part-ivKdd12 tutorial-inf-part-iv
Kdd12 tutorial-inf-part-ivLaks Lakshmanan
 
TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...
TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...
TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...IIIT Hyderabad
 
Kdd12 tutorial-inf-part-i
Kdd12 tutorial-inf-part-iKdd12 tutorial-inf-part-i
Kdd12 tutorial-inf-part-iLaks Lakshmanan
 
Extracting Information Nuggets from Disaster-Related Messages in Social Media
Extracting Information Nuggets from Disaster-Related Messages in Social MediaExtracting Information Nuggets from Disaster-Related Messages in Social Media
Extracting Information Nuggets from Disaster-Related Messages in Social MediaMuhammad Imran
 
What to Expect When the Unexpected Happens: Social Media Communications Acros...
What to Expect When the Unexpected Happens: Social Media Communications Acros...What to Expect When the Unexpected Happens: Social Media Communications Acros...
What to Expect When the Unexpected Happens: Social Media Communications Acros...Carlos Castillo (ChaTo)
 
Emotions and dialogue in a peer-production community: the case of Wikipedia
Emotions and dialogue in a peer-production community: the case of WikipediaEmotions and dialogue in a peer-production community: the case of Wikipedia
Emotions and dialogue in a peer-production community: the case of WikipediaDavid Laniado
 
Kdd12 tutorial-inf-part-iii
Kdd12 tutorial-inf-part-iiiKdd12 tutorial-inf-part-iii
Kdd12 tutorial-inf-part-iiiLaks Lakshmanan
 
SIAM SDM2014 tutorial - Social Media and Web of Data to Assist Crisis Respons...
SIAM SDM2014 tutorial - Social Media and Web of Data to Assist Crisis Respons...SIAM SDM2014 tutorial - Social Media and Web of Data to Assist Crisis Respons...
SIAM SDM2014 tutorial - Social Media and Web of Data to Assist Crisis Respons...Artificial Intelligence Institute at UofSC
 

Andere mochten auch (15)

Social Media News Communities: Gatekeeping, Coverage, and Statement Bias
 Social Media News Communities: Gatekeeping, Coverage, and Statement Bias Social Media News Communities: Gatekeeping, Coverage, and Statement Bias
Social Media News Communities: Gatekeeping, Coverage, and Statement Bias
 
Keynote talk: Big Crisis Data, an Open Invitation
Keynote talk: Big Crisis Data, an Open InvitationKeynote talk: Big Crisis Data, an Open Invitation
Keynote talk: Big Crisis Data, an Open Invitation
 
Kdd12 tutorial-inf-part-ii
Kdd12 tutorial-inf-part-iiKdd12 tutorial-inf-part-ii
Kdd12 tutorial-inf-part-ii
 
Kdd12 tutorial-inf-part-iv
Kdd12 tutorial-inf-part-ivKdd12 tutorial-inf-part-iv
Kdd12 tutorial-inf-part-iv
 
TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...
TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...
TweetCred: Real-Time Credibility Assessment of 
 Content on Twitter @ Socinfo...
 
Kdd12 tutorial-inf-part-i
Kdd12 tutorial-inf-part-iKdd12 tutorial-inf-part-i
Kdd12 tutorial-inf-part-i
 
Extracting Information Nuggets from Disaster-Related Messages in Social Media
Extracting Information Nuggets from Disaster-Related Messages in Social MediaExtracting Information Nuggets from Disaster-Related Messages in Social Media
Extracting Information Nuggets from Disaster-Related Messages in Social Media
 
What to Expect When the Unexpected Happens: Social Media Communications Acros...
What to Expect When the Unexpected Happens: Social Media Communications Acros...What to Expect When the Unexpected Happens: Social Media Communications Acros...
What to Expect When the Unexpected Happens: Social Media Communications Acros...
 
Fairness-Aware Data Mining
Fairness-Aware Data MiningFairness-Aware Data Mining
Fairness-Aware Data Mining
 
Crisis Computing
Crisis ComputingCrisis Computing
Crisis Computing
 
Emotions and dialogue in a peer-production community: the case of Wikipedia
Emotions and dialogue in a peer-production community: the case of WikipediaEmotions and dialogue in a peer-production community: the case of Wikipedia
Emotions and dialogue in a peer-production community: the case of Wikipedia
 
Kdd12 tutorial-inf-part-iii
Kdd12 tutorial-inf-part-iiiKdd12 tutorial-inf-part-iii
Kdd12 tutorial-inf-part-iii
 
SIAM SDM2014 tutorial - Social Media and Web of Data to Assist Crisis Respons...
SIAM SDM2014 tutorial - Social Media and Web of Data to Assist Crisis Respons...SIAM SDM2014 tutorial - Social Media and Web of Data to Assist Crisis Respons...
SIAM SDM2014 tutorial - Social Media and Web of Data to Assist Crisis Respons...
 
Social Media Mining and Retrieval
Social Media Mining and RetrievalSocial Media Mining and Retrieval
Social Media Mining and Retrieval
 
Discrimination Discovery
Discrimination DiscoveryDiscrimination Discovery
Discrimination Discovery
 

Ähnlich wie Characterizing the Life Cycle of Online News Stories Using Social Media Reactions

Ausvotes
AusvotesAusvotes
Ausvoteslchu125
 
Prime "Social" Ministers - François Hollande Analysis
Prime "Social" Ministers - François Hollande AnalysisPrime "Social" Ministers - François Hollande Analysis
Prime "Social" Ministers - François Hollande AnalysisDOING
 
Prime "Social" Ministers - Alexis Tsipras Analysis
Prime "Social" Ministers - Alexis Tsipras AnalysisPrime "Social" Ministers - Alexis Tsipras Analysis
Prime "Social" Ministers - Alexis Tsipras AnalysisDOING
 
Icwsm Politics Panel
Icwsm Politics PanelIcwsm Politics Panel
Icwsm Politics PanelKathy Gill
 
Prime social ministers - David Cameron Analysis
Prime social ministers - David Cameron AnalysisPrime social ministers - David Cameron Analysis
Prime social ministers - David Cameron AnalysisDOING
 
Prime "Social" Ministers - Matteo Renzi Analysis
Prime "Social" Ministers - Matteo Renzi AnalysisPrime "Social" Ministers - Matteo Renzi Analysis
Prime "Social" Ministers - Matteo Renzi AnalysisDOING
 
Filter Bubbles in the Australian Twittersphere?
Filter Bubbles in the Australian Twittersphere?Filter Bubbles in the Australian Twittersphere?
Filter Bubbles in the Australian Twittersphere?Axel Bruns
 
Pizza Talk IV: Fighting Back Shitstorms With An Army of Superfans
Pizza Talk IV: Fighting Back Shitstorms With An Army of SuperfansPizza Talk IV: Fighting Back Shitstorms With An Army of Superfans
Pizza Talk IV: Fighting Back Shitstorms With An Army of Superfansvm-people GmbH
 
AI in between online and offline discourse - and what has ChatGPT to do with ...
AI in between online and offline discourse - and what has ChatGPT to do with ...AI in between online and offline discourse - and what has ChatGPT to do with ...
AI in between online and offline discourse - and what has ChatGPT to do with ...Stefan Dietze
 
WBS OutlineWork Breakdown Structure OutlineProject Initiation1.1De.docx
WBS OutlineWork Breakdown Structure OutlineProject Initiation1.1De.docxWBS OutlineWork Breakdown Structure OutlineProject Initiation1.1De.docx
WBS OutlineWork Breakdown Structure OutlineProject Initiation1.1De.docxcelenarouzie
 
News Diffusion on Twitter: Comparing the Dissemination Careers for Mainstream...
News Diffusion on Twitter: Comparing the Dissemination Careers for Mainstream...News Diffusion on Twitter: Comparing the Dissemination Careers for Mainstream...
News Diffusion on Twitter: Comparing the Dissemination Careers for Mainstream...Axel Bruns
 
Distinguere grano e loglio segnali, rumore e altre storie in un big (data) wo...
Distinguere grano e loglio segnali, rumore e altre storie in un big (data) wo...Distinguere grano e loglio segnali, rumore e altre storie in un big (data) wo...
Distinguere grano e loglio segnali, rumore e altre storie in un big (data) wo...SocialMediaDayMI
 
Prime "Social" Ministers - Mariano Rajoy Analysis
Prime "Social" Ministers - Mariano Rajoy AnalysisPrime "Social" Ministers - Mariano Rajoy Analysis
Prime "Social" Ministers - Mariano Rajoy AnalysisDOING
 
Vladimir Alexiev | Semantic Enrichment of Twitter Microposts Helps Understand...
Vladimir Alexiev | Semantic Enrichment of Twitter Microposts Helps Understand...Vladimir Alexiev | Semantic Enrichment of Twitter Microposts Helps Understand...
Vladimir Alexiev | Semantic Enrichment of Twitter Microposts Helps Understand...semanticsconference
 
Ausvotes
AusvotesAusvotes
Ausvoteslchu125
 
Presentation ISCRAM 2012
Presentation ISCRAM 2012Presentation ISCRAM 2012
Presentation ISCRAM 2012Twittercrisis
 

Ähnlich wie Characterizing the Life Cycle of Online News Stories Using Social Media Reactions (20)

Ausvotes
AusvotesAusvotes
Ausvotes
 
Prime "Social" Ministers - François Hollande Analysis
Prime "Social" Ministers - François Hollande AnalysisPrime "Social" Ministers - François Hollande Analysis
Prime "Social" Ministers - François Hollande Analysis
 
Prime "Social" Ministers - Alexis Tsipras Analysis
Prime "Social" Ministers - Alexis Tsipras AnalysisPrime "Social" Ministers - Alexis Tsipras Analysis
Prime "Social" Ministers - Alexis Tsipras Analysis
 
Icwsm Politics Panel
Icwsm Politics PanelIcwsm Politics Panel
Icwsm Politics Panel
 
Prime social ministers - David Cameron Analysis
Prime social ministers - David Cameron AnalysisPrime social ministers - David Cameron Analysis
Prime social ministers - David Cameron Analysis
 
Prime "Social" Ministers - Matteo Renzi Analysis
Prime "Social" Ministers - Matteo Renzi AnalysisPrime "Social" Ministers - Matteo Renzi Analysis
Prime "Social" Ministers - Matteo Renzi Analysis
 
Filter Bubbles in the Australian Twittersphere?
Filter Bubbles in the Australian Twittersphere?Filter Bubbles in the Australian Twittersphere?
Filter Bubbles in the Australian Twittersphere?
 
New tools twitter
New tools twitterNew tools twitter
New tools twitter
 
Pizza Talk IV: Fighting Back Shitstorms With An Army of Superfans
Pizza Talk IV: Fighting Back Shitstorms With An Army of SuperfansPizza Talk IV: Fighting Back Shitstorms With An Army of Superfans
Pizza Talk IV: Fighting Back Shitstorms With An Army of Superfans
 
Document(2)
Document(2)Document(2)
Document(2)
 
AI in between online and offline discourse - and what has ChatGPT to do with ...
AI in between online and offline discourse - and what has ChatGPT to do with ...AI in between online and offline discourse - and what has ChatGPT to do with ...
AI in between online and offline discourse - and what has ChatGPT to do with ...
 
Broker Bots: Analyzing automated activity during High Impact Events on Twitter
Broker Bots: Analyzing automated activity during High Impact Events on TwitterBroker Bots: Analyzing automated activity during High Impact Events on Twitter
Broker Bots: Analyzing automated activity during High Impact Events on Twitter
 
WBS OutlineWork Breakdown Structure OutlineProject Initiation1.1De.docx
WBS OutlineWork Breakdown Structure OutlineProject Initiation1.1De.docxWBS OutlineWork Breakdown Structure OutlineProject Initiation1.1De.docx
WBS OutlineWork Breakdown Structure OutlineProject Initiation1.1De.docx
 
News Diffusion on Twitter: Comparing the Dissemination Careers for Mainstream...
News Diffusion on Twitter: Comparing the Dissemination Careers for Mainstream...News Diffusion on Twitter: Comparing the Dissemination Careers for Mainstream...
News Diffusion on Twitter: Comparing the Dissemination Careers for Mainstream...
 
Distinguere grano e loglio segnali, rumore e altre storie in un big (data) wo...
Distinguere grano e loglio segnali, rumore e altre storie in un big (data) wo...Distinguere grano e loglio segnali, rumore e altre storie in un big (data) wo...
Distinguere grano e loglio segnali, rumore e altre storie in un big (data) wo...
 
Prime "Social" Ministers - Mariano Rajoy Analysis
Prime "Social" Ministers - Mariano Rajoy AnalysisPrime "Social" Ministers - Mariano Rajoy Analysis
Prime "Social" Ministers - Mariano Rajoy Analysis
 
Vladimir Alexiev | Semantic Enrichment of Twitter Microposts Helps Understand...
Vladimir Alexiev | Semantic Enrichment of Twitter Microposts Helps Understand...Vladimir Alexiev | Semantic Enrichment of Twitter Microposts Helps Understand...
Vladimir Alexiev | Semantic Enrichment of Twitter Microposts Helps Understand...
 
Twitter 101
Twitter 101Twitter 101
Twitter 101
 
Ausvotes
AusvotesAusvotes
Ausvotes
 
Presentation ISCRAM 2012
Presentation ISCRAM 2012Presentation ISCRAM 2012
Presentation ISCRAM 2012
 

Mehr von Carlos Castillo (ChaTo)

Finding High Quality Content in Social Media
Finding High Quality Content in Social MediaFinding High Quality Content in Social Media
Finding High Quality Content in Social MediaCarlos Castillo (ChaTo)
 
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017Carlos Castillo (ChaTo)
 
Detecting Algorithmic Bias (keynote at DIR 2016)
Detecting Algorithmic Bias (keynote at DIR 2016)Detecting Algorithmic Bias (keynote at DIR 2016)
Detecting Algorithmic Bias (keynote at DIR 2016)Carlos Castillo (ChaTo)
 

Mehr von Carlos Castillo (ChaTo) (20)

Finding High Quality Content in Social Media
Finding High Quality Content in Social MediaFinding High Quality Content in Social Media
Finding High Quality Content in Social Media
 
When no clicks are good news
When no clicks are good newsWhen no clicks are good news
When no clicks are good news
 
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
Socia Media and Digital Volunteering in Disaster Management @ DSEM 2017
 
Detecting Algorithmic Bias (keynote at DIR 2016)
Detecting Algorithmic Bias (keynote at DIR 2016)Detecting Algorithmic Bias (keynote at DIR 2016)
Detecting Algorithmic Bias (keynote at DIR 2016)
 
Big Crisis Data for ISPC
Big Crisis Data for ISPCBig Crisis Data for ISPC
Big Crisis Data for ISPC
 
Databeers: Big Crisis Data
Databeers: Big Crisis DataDatabeers: Big Crisis Data
Databeers: Big Crisis Data
 
Observational studies in social media
Observational studies in social mediaObservational studies in social media
Observational studies in social media
 
Natural experiments
Natural experimentsNatural experiments
Natural experiments
 
Content-based link prediction
Content-based link predictionContent-based link prediction
Content-based link prediction
 
Link prediction
Link predictionLink prediction
Link prediction
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Graph Partitioning and Spectral Methods
Graph Partitioning and Spectral MethodsGraph Partitioning and Spectral Methods
Graph Partitioning and Spectral Methods
 
Finding Dense Subgraphs
Finding Dense SubgraphsFinding Dense Subgraphs
Finding Dense Subgraphs
 
Graph Evolution Models
Graph Evolution ModelsGraph Evolution Models
Graph Evolution Models
 
Link-Based Ranking
Link-Based RankingLink-Based Ranking
Link-Based Ranking
 
Text Indexing / Inverted Indices
Text Indexing / Inverted IndicesText Indexing / Inverted Indices
Text Indexing / Inverted Indices
 
Indexing
IndexingIndexing
Indexing
 
Text Summarization
Text SummarizationText Summarization
Text Summarization
 
Hierarchical Clustering
Hierarchical ClusteringHierarchical Clustering
Hierarchical Clustering
 
K-Means Algorithm
K-Means AlgorithmK-Means Algorithm
K-Means Algorithm
 

Kürzlich hochgeladen

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 

Kürzlich hochgeladen (20)

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Characterizing the Life Cycle of Online News Stories Using Social Media Reactions

  • 1. Characterizing the Life Cycle of Online News Stories Using Social Media Reactions Carlos Castillo, Mohammed El-Haddad, Matt Stempeck, Jürgen Pfeffer Twitter: @ChaToX
  • 2. 2 Carlos Castillo – @chatox http://www.chato.cl/research/ Outline • Determining classes of news articles • Predicting traffic using social media
  • 3. 3 Carlos Castillo – @chatox http://www.chato.cl/research/ Usage analysis in online news • Aikat (1998) – Short dwell times, weekday+, weekend-, bursty traffic. • Crane and Sornette (2008), Yang and Leskovec (2011), Lehmann et al. (2012) – Behavioral classes of attention online
  • 4. 4 Carlos Castillo – @chatox http://www.chato.cl/research/ Analysis of social media responses • SocialFlow whitepaper (Lotan, Gaffney, and Meyer 2011) – Al Jazeera, BBC News, CNN, The Economist, Fox News and The New York Times • Hu et al. (2011) – Tweets during speech of US president
  • 5. 5 Carlos Castillo – @chatox http://www.chato.cl/research/ Predictive Web Analytics (references)
  • 6. 6 Carlos Castillo – @chatox http://www.chato.cl/research/ Data collection • Three weeks in October 2012 • “Beacon” embedded in Al Jazeera pages – Real-time data processing – Apache S4 application for online processing – Cassandra (NoSQL database) for storage ≈ 3M visits ≈ 200K social media reactions
  • 7. 7 Carlos Castillo – @chatox http://www.chato.cl/research/ Summary of dataset
  • 8. 8 Carlos Castillo – @chatox http://www.chato.cl/research/ News In-Depth Examples: • US state of Maryland abolishes death penalty (May 2nd, 2013) • Hundreds arrested in China over 'fake' meat (May 3rd, 2013) Examples: • Spirits of Japan shrine haunt Asian relations (May 2nd, 2013) • Interactive: Powering the Gulf (May 2nd, 2013)
  • 9. 9 Carlos Castillo – @chatox http://www.chato.cl/research/ News (322) In-Depth (139) Tag clouds extracted from titles of articles
  • 12. In-Depth items have a slower growth
  • 13. In-Depth items have a longer shelf-life
  • 14. In-Depth items are shared on Facebook News items are shared on Twitter
  • 15. 15 Carlos Castillo – @chatox http://www.chato.cl/research/ Typical visitation profiles (12 hours) Decreasing (78%) Steady (9%) Increasing (3%) Rebounding (10%)
  • 16. Examples Decreasing (78%): ● Almost all breaking news ● Sometimes delayed due to timezone differences, e.g. Hurricane Sandy Steady or Increasing (12%): ● Ongoing news: Obama/Romney, Worker strikes in SA, Syrian unrest ● Articles updated with supporting content Rebounding (10%): ● Articles picked up by external sources or social media (typically single source of traffic) ● Background articles to new developments
  • 17. 17 Carlos Castillo – @chatox http://www.chato.cl/research/ Prediction of visits • Short-term traffic is to a large extent correlated with long-term traffic • Social media signals are correlated with traffic and shelf-life More reactions → more traffic More discussion → longer shelf-life • Can we predict 7 days after 30 minutes?
  • 18. 18 Carlos Castillo – @chatox http://www.chato.cl/research/ Predicting traffic and shelf-life online has a long history • Predicting long-term behavior and half-life from short-term observations – Observations = comments, visits, votes, … – Behavior = total comments, total visits, … – 10+ papers specifically on web traffic • Bit.ly (2011, 2012) – Studies half-life per topic and platform
  • 20. Results (traffic predictions) Extrapolate visits News are more predictable than In-Depth
  • 22. 22 Carlos Castillo – @chatox http://www.chato.cl/research/ Selected variables, traffic prediction
  • 23. Results (shelf-life prediction) Larger improvements for In-Depth articles Still, this is a 12 hours error in predicting something with an average of 48-72 hours
  • 24. 24 Carlos Castillo – @chatox http://www.chato.cl/research/ http://fast.qcri.org/
  • 25. 25 Carlos Castillo – @chatox http://www.chato.cl/research/ What did we learn? • Decrease, Stay or Increase. Rebound – Roughly 80:10:10 ratio • News vs In-Depth: different behavior • Social media signals are useful to understand and predict visits
  • 26. 26 Carlos Castillo – @chatox http://www.chato.cl/research/ Invitation: ECML/PKDD Discovery Challenge 2014 • Open competition on predictive Web Analytics • Data provided by Chartbeat Inc.
  • 27. Thank you! Carlos Castillo · chato@acm.org http://www.chato.cl/research/