REVEAL Project: Co-funded by the EU FP7 Programme Nr.: 610928 www.revealproject.eu © 2016 REVEAL consortium
Stefanie Wiegand & Stuart E. Middleton
University of Southampton IT Innovation Centre
{sw,sem}@it-innovation.soton.ac.uk
Veracity & Velocity of Social Media Content during Breaking News:
Analysis of November 2015 Paris Shootings
Overview
• Introduction
• Experiment
• Results
• Discussion
• Future work
Introduction
What's this all about?
• Problems:
    - Journalists verifying breaking user-generated content (UGC) must trade speed against accuracy
    - The echo chamber can make false rumours go viral
    - Automate information gathering; journalists make the final decision
• Ideas:
    - In the first 60 mins of a UGC post, filter by attribution to trusted sources
    - Visualise traffic patterns for posts attributed to trusted and untrusted sources
    - Can traffic analysis help to verify / debunk content?
    - In the first 5 mins, rank UGC not seen before by mention count
    - Provide a ranked list of likely eyewitness UGC every 5 mins
    - Can we produce a high-quality eyewitness UGC feed?
Experiment
Experiment setup
• Data
    - 5 viral UGC posts (3 eyewitness, 2 debunked), manually identified
    - 38 GB of serialised data covering the first 6 h after the first attack
    - 5.9M posts, ~40k attributed sources, ~418k unique URLs
    - ~160k to 1.8M posts in the first hour per UGC test case
• Technology
    - Target UGC image/video → TinEye → duplicate images/videos
    - Posts → text extraction → sources → PostgreSQL
    - PostgreSQL → triple store → trust knowledge model → trusted posts
Experiment method
• Verification (Experiment 1)
    - Filter trusted and untrusted content in the first 60 mins of the 5 target UGC posts
    - Examine the velocity of trusted and untrusted sources mentioning the target UGC
    - When is the target UGC attributed to trusted sources?
• Identification (Experiment 2)
    - Temporally segment the first 5 mins of posts for the 5 target event times
    - Filter old URLs (including alternative URLs)
    - Rank by mention frequency
    - Does the target UGC appear high in the ranked list?
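The verification step (Experiment 1) can be sketched roughly as follows. This is a minimal illustration, not the REVEAL implementation: the `Post` type, the `TRUSTED` and `UNTRUSTED` source sets, the 10-minute bin size, and the rule that a trusted attribution wins over an untrusted one are all assumptions made for the example. The deck's actual trust knowledge model lives in a triple store and is not shown.

```python
from collections import Counter
from dataclasses import dataclass

# Hypothetical trust lists standing in for the trust knowledge model.
TRUSTED = {"trusted_agency"}
UNTRUSTED = {"known_hoaxer"}


@dataclass
class Post:
    minutes_after_target: float    # time since the target UGC first appeared
    attributed_sources: frozenset  # sources the post text attributes content to


def trust_label(post):
    """Label a post 'trusted', 'untrusted' or 'unknown' from its attributions.

    Assumption: a trusted attribution takes precedence over an untrusted one.
    """
    if post.attributed_sources & TRUSTED:
        return "trusted"
    if post.attributed_sources & UNTRUSTED:
        return "untrusted"
    return "unknown"


def velocity(posts, window=60, bin_size=10):
    """Count posts per trust label in each bin_size-minute bin of the window."""
    bins = {start: Counter() for start in range(0, window, bin_size)}
    for post in posts:
        if 0 <= post.minutes_after_target < window:
            start = int(post.minutes_after_target // bin_size) * bin_size
            bins[start][trust_label(post)] += 1
    return bins
```

Plotting the per-bin counts for each label over the 60-minute window would give curves of the same kind as the trusted / unknown / untrusted / total series in the results charts.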
Results
Experiment 1 - Case P1
[Figure: P1. Content items (#, axis 0 to 400) over time (min, first 60 mins); series: trusted, unknown, untrusted, total.]
Experiment 1 - Case P2
[Figure: P2. Content items (#, axis 0 to 1200) over time (min, first 60 mins); series: trusted, unknown, untrusted, total.]
Experiment 1 - Case P3
[Figure: P3. Content items (#, axis 0 to 250) over time (min, first 60 mins); series: trusted, unknown, untrusted, total.]
Experiment 1 - Case P3 (trusted vs. untrusted)
[Figure: trusted/untrusted P3. Content items (#, axis 0 to 5) over time (min, first 60 mins); series: trusted, untrusted.]
Experiment 1 - Case D1
[Figure: D1. Content items (#, axis 0 to 3500) over time (min, first 60 mins); series: trusted, unknown, untrusted, total.]
Experiment 1 - Case D2
[Figure: D2. Content items (#, axis 0 to 3500) over time (min, first 60 mins); series: trusted, unknown, untrusted, total.]
Experiment 2

Target image ID | P1 | P2 | P3 | D1 | D2
Number of followers of author | 335 | 1.4k | 218 | 2.8k | 151k
Content likes | 11 | 408 | 35 | 17k | 29k
Content retweets | 83 | 3.3k | 194 | 22k | 30k
Total # of tweets in 60-minute window | 483918 | 162111 | 811079 | 1501000 | 1837173
Total # of unique mentioned URLs in 60-minute window | 785 | 4331 | 535 | 7907 | 13252
Ranking of target image set in total for 5-minute segment (top x percent) | 9 / 653 (2%) | 1 / 603 (1%) | 61 / 1097 (6%) | 427 / 11605 (4%) | 1 / 11337 (1%)
Total number of eyewitness content items in 5-minute segment | 25 | 2 | 12 | 29 | 30
Unique number of eyewitness content items in 5-minute segment | 4 | 1 | 4 | 13 | 14
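
The identification step (Experiment 2) and the "top x percent" figures can be sketched as below. This is only an illustration under stated assumptions: `rank_new_urls`, `top_percent`, and the `aliases` mapping from alternative URLs to a canonical URL are hypothetical names, and the tie-breaking rule is arbitrary. Rounding the percentage up reproduces the figures shown in the table (for example 9 / 653 gives 2%), but that the authors rounded this way is an assumption.

```python
import math
from collections import Counter


def rank_new_urls(segment_urls, seen_before, aliases=None):
    """Rank URLs first seen in this segment by how often they are mentioned.

    segment_urls: one URL per mention within the 5-minute segment.
    seen_before: canonical URLs already mentioned in earlier segments.
    aliases: hypothetical map from alternative URLs to a canonical URL.
    """
    aliases = aliases or {}
    # Fold alternative URLs into their canonical form before counting.
    counts = Counter(aliases.get(url, url) for url in segment_urls)
    # Drop "old" URLs that were already circulating before this segment.
    fresh = {url: n for url, n in counts.items() if url not in seen_before}
    # Highest mention count first; ties broken alphabetically for determinism.
    return sorted(fresh, key=lambda url: (-fresh[url], url))


def top_percent(rank, total):
    """Express a 1-based rank out of total as a percentage, rounded up."""
    return math.ceil(100 * rank / total)
```

A journalist-facing feed would then show only the first few entries of the ranked list for each 5-minute segment.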
Discussion
How is this useful to journalists?
• Posts by trusted sources matter for verification
    - Wisdom of the crowds is not always wisdom at all
    - The Twitter "echo chamber" is less useful than a post by a trusted source
• Easier/faster to spot new eyewitness UGC
    - Filter feeds down to tens of posts, not thousands
    - Reduce information overload for journalists in the first 5 mins
• Additional analysis can improve the eyewitness UGC feed further
    - Eyewitness classification
    - Image analysis (e.g. Exif metadata)
    - Author profile pages
Future work
Where to go from here
• Cross-check known facts
    - Extend the knowledge model to support this
    - e.g. image classification of weather/lighting ↔ time & location of event
    - e.g. mentions of known event actors
• Use linked open data to visualise source bias
    - This can include political, religious or other bias
• Observational study of journalists verifying UGC
    - Journalist experts demonstrate best-practice verification on specific examples
    - We train our algorithms on observed best practice
    - We check our algorithms' results against the journalists' ground truth
Any questions?
Stefanie Wiegand & Stuart E. Middleton
University of Southampton IT Innovation Centre
email: {sw|sem}@it-innovation.soton.ac.uk
web: www.it-innovation.soton.ac.uk
twitter: @RevealEU, @IT_Innov, @stuart_e_middle
Many thanks for your attention!

Weitere ähnliche Inhalte

Ähnlich wie Veracity & Velocity of Social Media Content during Breaking News

TRIDEC and REVEAL projects: Geoparsing and Geosemantic knowledge model for tr...
TRIDEC and REVEAL projects: Geoparsing and Geosemantic knowledge model for tr...TRIDEC and REVEAL projects: Geoparsing and Geosemantic knowledge model for tr...
TRIDEC and REVEAL projects: Geoparsing and Geosemantic knowledge model for tr...REVEAL - Social Media Verification
 
20151019 webinar Open Access in Horizon 2020
20151019 webinar  Open Access in Horizon 202020151019 webinar  Open Access in Horizon 2020
20151019 webinar Open Access in Horizon 2020OpenAccessBelgium
 
CloudTeams - Boosting Collaboration of Developers and End Users Together for ...
CloudTeams - Boosting Collaboration of Developers and End Users Together for ...CloudTeams - Boosting Collaboration of Developers and End Users Together for ...
CloudTeams - Boosting Collaboration of Developers and End Users Together for ...CloudTeams
 
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...OpenAIRE services and tools for researchers/authors and projects (FOSTER work...
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...Pedro Príncipe
 
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...OpenAIRE services and tools for researchers/authors and projects (FOSTER work...
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...OpenAIRE
 
Nieuwerburgh - Open science e-infrastructure for research analysis and impact...
Nieuwerburgh - Open science e-infrastructure for research analysis and impact...Nieuwerburgh - Open science e-infrastructure for research analysis and impact...
Nieuwerburgh - Open science e-infrastructure for research analysis and impact...innovationoecd
 
Horizon 2020 Open Access mandate - OpenAIRE webinar by Inge Van Nieuwerburgh
Horizon 2020 Open Access mandate - OpenAIRE webinar by Inge Van NieuwerburghHorizon 2020 Open Access mandate - OpenAIRE webinar by Inge Van Nieuwerburgh
Horizon 2020 Open Access mandate - OpenAIRE webinar by Inge Van NieuwerburghOpenAIRE
 
TTO Keynote 08 10 2021
TTO Keynote 08 10 2021TTO Keynote 08 10 2021
TTO Keynote 08 10 2021Weverify
 
LinkedTV results at the end of the 3rd year
LinkedTV results at the end of the 3rd yearLinkedTV results at the end of the 3rd year
LinkedTV results at the end of the 3rd yearLinkedTV
 
MICO — Towards Contextual Media Analysis
MICO — Towards Contextual Media AnalysisMICO — Towards Contextual Media Analysis
MICO — Towards Contextual Media AnalysisThomas Kurz
 
Fraunhofer IAO Research Landscaping
Fraunhofer IAO Research LandscapingFraunhofer IAO Research Landscaping
Fraunhofer IAO Research LandscapingEd Morrison
 
2nd WeGov Workshop Agenda
2nd WeGov Workshop Agenda2nd WeGov Workshop Agenda
2nd WeGov Workshop AgendaWeGov project
 
WeVerify at NILC - May 2019.pptx
WeVerify at NILC - May 2019.pptxWeVerify at NILC - May 2019.pptx
WeVerify at NILC - May 2019.pptxWeverify
 
Finodex- New fund for open data entrepreneurs in Europe
Finodex- New fund for open data entrepreneurs in EuropeFinodex- New fund for open data entrepreneurs in Europe
Finodex- New fund for open data entrepreneurs in EuropeliberTIC
 
Open Innovation in der öffentlichen Verwaltung
Open Innovation in der öffentlichen VerwaltungOpen Innovation in der öffentlichen Verwaltung
Open Innovation in der öffentlichen VerwaltungFrank Piller
 
OpenAIRE @ OECD Blue Sky III
OpenAIRE @ OECD Blue Sky IIIOpenAIRE @ OECD Blue Sky III
OpenAIRE @ OECD Blue Sky IIIOpenAIRE
 
SLOPE Final Conference - novel planning tool
SLOPE Final Conference - novel planning toolSLOPE Final Conference - novel planning tool
SLOPE Final Conference - novel planning toolSLOPE Project
 
Session 1 - Cluster Analysis - Academia
Session 1 - Cluster Analysis - AcademiaSession 1 - Cluster Analysis - Academia
Session 1 - Cluster Analysis - AcademiaPhilip O'Reilly
 
SC7 Hangout 2: Community Building activities for Big Data in Secure Societies
SC7 Hangout 2: Community Building activities for Big Data in Secure SocietiesSC7 Hangout 2: Community Building activities for Big Data in Secure Societies
SC7 Hangout 2: Community Building activities for Big Data in Secure SocietiesBigData_Europe
 

Ähnlich wie Veracity & Velocity of Social Media Content during Breaking News (20)

TRIDEC and REVEAL projects: Geoparsing and Geosemantic knowledge model for tr...
TRIDEC and REVEAL projects: Geoparsing and Geosemantic knowledge model for tr...TRIDEC and REVEAL projects: Geoparsing and Geosemantic knowledge model for tr...
TRIDEC and REVEAL projects: Geoparsing and Geosemantic knowledge model for tr...
 
20151019 webinar Open Access in Horizon 2020
20151019 webinar  Open Access in Horizon 202020151019 webinar  Open Access in Horizon 2020
20151019 webinar Open Access in Horizon 2020
 
CloudTeams - Boosting Collaboration of Developers and End Users Together for ...
CloudTeams - Boosting Collaboration of Developers and End Users Together for ...CloudTeams - Boosting Collaboration of Developers and End Users Together for ...
CloudTeams - Boosting Collaboration of Developers and End Users Together for ...
 
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...OpenAIRE services and tools for researchers/authors and projects (FOSTER work...
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...
 
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...OpenAIRE services and tools for researchers/authors and projects (FOSTER work...
OpenAIRE services and tools for researchers/authors and projects (FOSTER work...
 
Nieuwerburgh - Open science e-infrastructure for research analysis and impact...
Nieuwerburgh - Open science e-infrastructure for research analysis and impact...Nieuwerburgh - Open science e-infrastructure for research analysis and impact...
Nieuwerburgh - Open science e-infrastructure for research analysis and impact...
 
Horizon 2020 Open Access mandate - OpenAIRE webinar by Inge Van Nieuwerburgh
Horizon 2020 Open Access mandate - OpenAIRE webinar by Inge Van NieuwerburghHorizon 2020 Open Access mandate - OpenAIRE webinar by Inge Van Nieuwerburgh
Horizon 2020 Open Access mandate - OpenAIRE webinar by Inge Van Nieuwerburgh
 
TTO Keynote 08 10 2021
TTO Keynote 08 10 2021TTO Keynote 08 10 2021
TTO Keynote 08 10 2021
 
The European Open Science Cloud
The European Open Science CloudThe European Open Science Cloud
The European Open Science Cloud
 
LinkedTV results at the end of the 3rd year
LinkedTV results at the end of the 3rd yearLinkedTV results at the end of the 3rd year
LinkedTV results at the end of the 3rd year
 
MICO — Towards Contextual Media Analysis
MICO — Towards Contextual Media AnalysisMICO — Towards Contextual Media Analysis
MICO — Towards Contextual Media Analysis
 
Fraunhofer IAO Research Landscaping
Fraunhofer IAO Research LandscapingFraunhofer IAO Research Landscaping
Fraunhofer IAO Research Landscaping
 
2nd WeGov Workshop Agenda
2nd WeGov Workshop Agenda2nd WeGov Workshop Agenda
2nd WeGov Workshop Agenda
 
WeVerify at NILC - May 2019.pptx
WeVerify at NILC - May 2019.pptxWeVerify at NILC - May 2019.pptx
WeVerify at NILC - May 2019.pptx
 
Finodex- New fund for open data entrepreneurs in Europe
Finodex- New fund for open data entrepreneurs in EuropeFinodex- New fund for open data entrepreneurs in Europe
Finodex- New fund for open data entrepreneurs in Europe
 
Open Innovation in der öffentlichen Verwaltung
Open Innovation in der öffentlichen VerwaltungOpen Innovation in der öffentlichen Verwaltung
Open Innovation in der öffentlichen Verwaltung
 
OpenAIRE @ OECD Blue Sky III
OpenAIRE @ OECD Blue Sky IIIOpenAIRE @ OECD Blue Sky III
OpenAIRE @ OECD Blue Sky III
 
SLOPE Final Conference - novel planning tool
SLOPE Final Conference - novel planning toolSLOPE Final Conference - novel planning tool
SLOPE Final Conference - novel planning tool
 
Session 1 - Cluster Analysis - Academia
Session 1 - Cluster Analysis - AcademiaSession 1 - Cluster Analysis - Academia
Session 1 - Cluster Analysis - Academia
 
SC7 Hangout 2: Community Building activities for Big Data in Secure Societies
SC7 Hangout 2: Community Building activities for Big Data in Secure SocietiesSC7 Hangout 2: Community Building activities for Big Data in Secure Societies
SC7 Hangout 2: Community Building activities for Big Data in Secure Societies
 

Mehr von REVEAL - Social Media Verification

Geoparsing and Real-time Social Media Analytics - technical and social challe...
Geoparsing and Real-time Social Media Analytics - technical and social challe...Geoparsing and Real-time Social Media Analytics - technical and social challe...
Geoparsing and Real-time Social Media Analytics - technical and social challe...REVEAL - Social Media Verification
 
Verification of UGC/Eyewitness Media: Challenges and Approaches
Verification of UGC/Eyewitness Media: Challenges and Approaches Verification of UGC/Eyewitness Media: Challenges and Approaches
Verification of UGC/Eyewitness Media: Challenges and Approaches REVEAL - Social Media Verification
 
Web image size prediction for efficient focused image crawling
Web image size prediction for efficient focused image crawlingWeb image size prediction for efficient focused image crawling
Web image size prediction for efficient focused image crawlingREVEAL - Social Media Verification
 
Geotagging Social Media Content with a Refined Language Modelling Approach
Geotagging Social Media Content with a Refined Language Modelling ApproachGeotagging Social Media Content with a Refined Language Modelling Approach
Geotagging Social Media Content with a Refined Language Modelling ApproachREVEAL - Social Media Verification
 
Mediarevealr: A social multimedia monitoring and intelligence system for Web ...
Mediarevealr: A social multimedia monitoring and intelligence system for Web ...Mediarevealr: A social multimedia monitoring and intelligence system for Web ...
Mediarevealr: A social multimedia monitoring and intelligence system for Web ...REVEAL - Social Media Verification
 
Cross-Media Konferenz "Think Cross - Change Media" in Magdeburg, Germany
 Cross-Media Konferenz "Think Cross - Change Media" in Magdeburg, Germany Cross-Media Konferenz "Think Cross - Change Media" in Magdeburg, Germany
Cross-Media Konferenz "Think Cross - Change Media" in Magdeburg, GermanyREVEAL - Social Media Verification
 

Mehr von REVEAL - Social Media Verification (11)

Geoparsing and Real-time Social Media Analytics - technical and social challe...
Geoparsing and Real-time Social Media Analytics - technical and social challe...Geoparsing and Real-time Social Media Analytics - technical and social challe...
Geoparsing and Real-time Social Media Analytics - technical and social challe...
 
Prix Italia 2015 - Verification in Social Newsgathering
Prix Italia 2015 - Verification in Social NewsgatheringPrix Italia 2015 - Verification in Social Newsgathering
Prix Italia 2015 - Verification in Social Newsgathering
 
Verification of UGC/Eyewitness Media: Challenges and Approaches
Verification of UGC/Eyewitness Media: Challenges and Approaches Verification of UGC/Eyewitness Media: Challenges and Approaches
Verification of UGC/Eyewitness Media: Challenges and Approaches
 
Web image size prediction for efficient focused image crawling
Web image size prediction for efficient focused image crawlingWeb image size prediction for efficient focused image crawling
Web image size prediction for efficient focused image crawling
 
News-oriented multimedia search over multiple social networks
News-oriented multimedia search over multiple social networksNews-oriented multimedia search over multiple social networks
News-oriented multimedia search over multiple social networks
 
Geotagging Social Media Content with a Refined Language Modelling Approach
Geotagging Social Media Content with a Refined Language Modelling ApproachGeotagging Social Media Content with a Refined Language Modelling Approach
Geotagging Social Media Content with a Refined Language Modelling Approach
 
Mediarevealr: A social multimedia monitoring and intelligence system for Web ...
Mediarevealr: A social multimedia monitoring and intelligence system for Web ...Mediarevealr: A social multimedia monitoring and intelligence system for Web ...
Mediarevealr: A social multimedia monitoring and intelligence system for Web ...
 
Cross-Media Konferenz "Think Cross - Change Media" in Magdeburg, Germany
 Cross-Media Konferenz "Think Cross - Change Media" in Magdeburg, Germany Cross-Media Konferenz "Think Cross - Change Media" in Magdeburg, Germany
Cross-Media Konferenz "Think Cross - Change Media" in Magdeburg, Germany
 
Reveal - Social Media Verification - poster
Reveal - Social Media Verification - posterReveal - Social Media Verification - poster
Reveal - Social Media Verification - poster
 
Focused Exploration of Geospatial Context on Linked Open Data
Focused Exploration of Geospatial Context on Linked Open DataFocused Exploration of Geospatial Context on Linked Open Data
Focused Exploration of Geospatial Context on Linked Open Data
 
REVEAL - Social Media Verification - brochure
REVEAL - Social Media Verification - brochureREVEAL - Social Media Verification - brochure
REVEAL - Social Media Verification - brochure
 

Kürzlich hochgeladen

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 

Kürzlich hochgeladen (20)

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 

Veracity & Velocity of Social Media Content during Breaking News

  • 1. REVEAL Project: Co-funded by the EU FP7 Programme Nr.: 610928 www.revealproject.eu © 2016 REVEAL consortium Stefanie Wiegand & Stuart E. Middleton University of Southampton IT Innovation Centre {sw,sem}@it-innovation.soton.ac.uk Veracity & Velocity of Social Media Content during Breaking News: Analysis of November 2015 Paris Shootings
  • 2. REVEAL Project: Co-funded by the EU FP7 Programme Nr.: 610928 www.revealproject.eu © 2016 REVEAL consortium 1  Introduction  Experiment  Results  Discussion  Future work Overview
  • 3. REVEAL Project: Co-funded by the EU FP7 Programme Nr.: 610928 www.revealproject.eu © 2016 REVEAL consortium What's this all about? 2  Problems:  Journalists doing breaking UGC verification – speed vs. accuracy  Echo chamber can make false rumours go viral  Automate information gathering – Journalists make the final decision  Ideas:  First 60 mins of a UGC post filter by attribution to trusted sources  Visualise traffic patterns for posts attributed to trusted and untrusted sources  Can traffic analysis help to verify / debunk content?  First 5 mins rank UGC not seen before by mention count  Provide a ranked list of likely eyewitness UGC every 5 mins  Can we produce a high quality eyewitness UGC feed? Introduction
  • 4. REVEAL Project: Co-funded by the EU FP7 Programme Nr.: 610928 www.revealproject.eu © 2016 REVEAL consortium Experiment setup 3  Data  5 viral UGC posts (3 eyewitness, 2 debunked) - manually identified  38GB of serialised data covering the first 6h after the first attack  5.9M posts, ~40k attributed sources, ~418k unique URLs  ~160k - 1.8M posts in the first hour per UGC test case  Technology  Target UGC Image/Video → TinEye → Duplicate Images/Videos  Posts → Text extraction → Sources → PostgreSQL  PostgreSQL → Triple store → Trust knowledge model → Trusted posts Experiment
  • 5. Experiment: Experiment method
       ◦ Verification (Experiment 1)
         - Filter (un-)trusted content in first 60 mins of 5 target UGC posts
         - Examine velocity of trusted and untrusted sources mentioning target UGC
         - When is target UGC attributed to trusted sources?
       ◦ Identification (Experiment 2)
         - Temporally segment first 5 mins of posts for 5 target event times
         - Filter old URLs (including alternative URLs)
         - Rank by mention frequency
         - Does target UGC appear highly in ranked list?
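The identification steps of Experiment 2 (segment the first 5 minutes, filter old URLs, rank by mention frequency) can be sketched roughly as follows. This is only an illustration, not the REVEAL implementation: the function name `rank_new_urls`, the `(timestamp, url)` input format, the example URLs and the event time are all assumptions, and alternative URLs are assumed to have been mapped to one canonical URL beforehand.

```python
from collections import Counter
from datetime import datetime, timedelta


def rank_new_urls(posts, event_time, window=timedelta(minutes=5)):
    """Rank URLs first seen in [event_time, event_time + window) by mention count.

    posts: list of (timestamp, url) pairs (hypothetical input format).
    """
    window_end = event_time + window
    # URLs already mentioned before the event are treated as old and filtered out.
    seen_before = {url for ts, url in posts if ts < event_time}
    counts = Counter(
        url
        for ts, url in posts
        if event_time <= ts < window_end and url not in seen_before
    )
    # most_common() returns (url, count) pairs sorted by descending count.
    return counts.most_common()


# Illustrative data only; the event time and URLs are made up.
t0 = datetime(2015, 11, 13, 21, 0)
posts = [
    (t0 - timedelta(minutes=10), "http://example.org/old"),
    (t0 + timedelta(minutes=1), "http://example.org/old"),
    (t0 + timedelta(minutes=1), "http://example.org/new-a"),
    (t0 + timedelta(minutes=2), "http://example.org/new-a"),
    (t0 + timedelta(minutes=3), "http://example.org/new-b"),
    (t0 + timedelta(minutes=7), "http://example.org/new-b"),
]
ranking = rank_new_urls(posts, t0)
```

In this toy input the old URL is dropped even though it is mentioned again inside the window, and the mention at minute 7 falls outside the 5-minute segment.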
  • 6. Results: Experiment 1 - Case P1. [Chart: content items [#] (0 to 400) against time [min] (10 to 60); series: trusted, unknown, untrusted, total]
  • 7. Results: Experiment 1 - Case P2. [Chart: content items [#] (0 to 1200) against time [min] (10 to 60); series: trusted, unknown, untrusted, total]
  • 8. Results: Experiment 1 - Case P3. [Chart: content items [#] (0 to 250) against time [min] (10 to 60); series: trusted, unknown, untrusted, total]
  • 9. Results: Experiment 1 - Case P3, trusted/untrusted only. [Chart: content items [#] (0 to 5) against time [min] (10 to 60); series: trusted, untrusted]
  • 10. Results: Experiment 1 - Case D1. [Chart: content items [#] (0 to 3500) against time [min] (10 to 60); series: trusted, unknown, untrusted, total]
  • 11. Results: Experiment 1 - Case D2. [Chart: content items [#] (0 to 3500) against time [min] (10 to 60); series: trusted, unknown, untrusted, total]
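The series plotted for Experiment 1 (trusted, unknown, untrusted, total per time step) could be produced by a count along these lines. This is a minimal sketch under stated assumptions: plain Python sets stand in for the trust knowledge model, the source names are invented, and the transcript does not say whether the charts show per-bin or cumulative counts, so per-bin counts are used here.

```python
from collections import Counter


def classify(source, trusted, untrusted):
    """Label a source as trusted, untrusted or unknown (sets are a stand-in
    for the trust knowledge model, which is not described in detail)."""
    if source in trusted:
        return "trusted"
    if source in untrusted:
        return "untrusted"
    return "unknown"


def velocity(posts, trusted, untrusted, bin_minutes=10, horizon_minutes=60):
    """Count posts per time bin and trust class.

    posts: iterable of (minutes_since_first_post, attributed_source) pairs
    (hypothetical input format). Returns {bin_end_minute: Counter}.
    """
    bins = {
        end: Counter()
        for end in range(bin_minutes, horizon_minutes + 1, bin_minutes)
    }
    for minute, source in posts:
        if not 0 <= minute < horizon_minutes:
            continue  # outside the first-hour window
        bin_end = (int(minute) // bin_minutes + 1) * bin_minutes
        bins[bin_end][classify(source, trusted, untrusted)] += 1
        bins[bin_end]["total"] += 1
    return bins


# Illustrative data only; source names are made up.
trusted = {"news_agency_a"}
untrusted = {"rumour_site_b"}
posts = [
    (2, "news_agency_a"),
    (4, "anon_1"),
    (12, "rumour_site_b"),
    (15, "news_agency_a"),
    (59.5, "anon_2"),
    (61, "news_agency_a"),
]
result = velocity(posts, trusted, untrusted)
```

Each bin is keyed by its end minute (10, 20, ..., 60), matching the tick labels on the time axis; the post at minute 61 is ignored because it falls outside the 60-minute window.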
  • 12. Results: Experiment 2
       Target image ID                                                       | P1           | P2          | P3             | D1                | D2
       Number of followers of author                                         | 335          | 1.4k        | 218            | 2.8k              | 151k
       Content likes                                                         | 11           | 408         | 35             | 17k               | 29k
       Content retweets                                                      | 83           | 3.3k        | 194            | 22k               | 30k
       Total # of tweets in 60 minute window                                 | 483918       | 162111      | 811079         | 1501000           | 1837173
       Total # of unique mentioned URLs in 60 minute window                  | 785          | 4331        | 535            | 7907              | 13252
       Ranking of target image set in total for 5 minute segment (top x %)   | 9 / 653 (2%) | 1 / 603 (1%) | 61 / 1097 (6%) | 427 / 11605 (4%) | 1 / 11337 (1%)
       Total number of eyewitness content in 5 minute segment                | 25           | 2           | 12             | 29                | 30
       Unique number of eyewitness content in 5 minute segment               | 4            | 1           | 4              | 13                | 14
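The "top x percent" figures in the ranking row are consistent with rank divided by total, rounded up to the next whole percent. That rounding rule is inferred from the numbers, not stated on the slide; the helper name `top_percent` is hypothetical.

```python
import math


def top_percent(rank, total):
    """Position of `rank` among `total` items as a whole percentage, rounded up
    (assumed rounding rule, inferred from the reported figures)."""
    return math.ceil(100 * rank / total)


# (rank, total) pairs as reported for the 5 minute segment of each target case.
rankings = {
    "P1": (9, 653),
    "P2": (1, 603),
    "P3": (61, 1097),
    "D1": (427, 11605),
    "D2": (1, 11337),
}
percentages = {case: top_percent(r, n) for case, (r, n) in rankings.items()}
```

Under this assumption the computed values reproduce the reported 2%, 1%, 6%, 4% and 1% for P1, P2, P3, D1 and D2.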
  • 13. Discussion: How is this useful to journalists?
       ◦ Posts by trusted sources matter for verification
         - Wisdom of the crowds is not always wisdom at all
         - Twitter "echo chamber" is less useful than a post by a trusted source
       ◦ Easier/faster to spot new eyewitness UGC
         - Filter feeds to 10s of posts, not 1000s of posts
         - Reduce information overload for journalists in first 5 mins
       ◦ Additional analysis can improve the eyewitness UGC feed further
         - Eyewitness classification
         - Image analysis (e.g. Exif metadata)
         - Author profile pages
  • 14. Future work: Where to go from here
       ◦ Cross check known facts
         - Extend knowledge model to support this
         - e.g. image classification of weather/lighting ↔ time & location of event
         - e.g. mentions of known event actors
       ◦ Use linked open data to visualise source bias
         - This can include political, religious or other bias
       ◦ Observational study of journalists verifying UGC
         - Journalist experts show best practice verification on specific examples
         - We train our algorithms on observed best practice
         - We check our algorithms' results against journalists' ground truth
  • 15. Any questions? Many thanks for your attention!
       Stefanie Wiegand & Stuart E. Middleton, University of Southampton IT Innovation Centre
       email: {sw|sem}@it-innovation.soton.ac.uk
       web: www.it-innovation.soton.ac.uk
       twitter: @RevealEU, @IT_Innov, @stuart_e_middle