Extracting Information Nuggets from
Disaster-Related Messages in Social Media
Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz, Patrick Meier
Outline
• Social Media response to disaster
• Finding tactical and actionable information
• Disaster ontologies
• Filtering, classification and extraction
• Ongoing work
• Discussion
Disaster and Social Media
2.3 million tweets containing the words “Haiti”
or “Red Cross” from Jan 12 to Jan 14, 2010
http://www.sysomos.com
Disaster and Social Media
Why Social Media?
• Virtual Collaboration, Information Sharing
• Highly valuable information
• Contribute to situational awareness
• Highly useful if analyzed promptly and
effectively
Sandy Tweets
@NYGovCuomo orders closing of NYC bridges. Only Staten Island
bridges unaffected at this time. Bridges must close by 7pm. #Sandy
#NYC.
rt @911buff: public help needed: 2 boys 2 & 4 missing nearly 24 hours
after they got separated from their mom when car submerged in si.
#sandy #911buff
freaking out. home alone. will just watch tv #Sandy #NYC.
400 Volunteers are needed for areas that #Sandy destroyed.
Personal
Informative
Caution and Advice
Casualties and Damage
Donations
Finding Tactical & Actionable Information
• Personal
• Informative (Direct & Indirect)
– Caution and advice: siren heard, warning issued/lifted, etc.
– Casualties and damage: people dead, injured, damage, etc.
– Donations: money, shelter, blood, goods, or services
– People missing, found, or seen
– Information source: webpages, photos, videos, information sources
– …
• Other
Our Approach
1. Filtering
2. Classification
3. Extraction
Our Datasets
Joplin Dataset
• 206,764 tweets collected during the tornado
that hit Joplin, Missouri on May 22, 2011
• Collected by researchers at the University of
Colorado at Boulder
• Collected through the Twitter API by monitoring
tweets with the hashtags #joplin or #tornado
Our Datasets
Sandy Dataset
• 140,000 tweets collected during Hurricane Sandy,
which hit the northeastern USA on Oct 29, 2012
• Collected through the Twitter API by monitoring
tweets with the hashtags #sandy or #nyc
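The hashtag-based collection rule described for both datasets can be sketched as a simple text predicate. This is a minimal illustration, not the collection code used for the datasets: the function name `matches_hashtags` is hypothetical, and no Twitter API call is made.

```python
import re

def matches_hashtags(text, hashtags):
    """Return True if the tweet text contains any of the given hashtags (case-insensitive)."""
    # Extract hashtag tokens such as "#Sandy" and normalise them to lower case.
    found = {tag.lower() for tag in re.findall(r"#\w+", text)}
    wanted = {h.lower() for h in hashtags}
    return bool(found & wanted)

SANDY_TAGS = ["#sandy", "#nyc"]        # hashtags named on the Sandy slide
JOPLIN_TAGS = ["#joplin", "#tornado"]  # hashtags named on the Joplin slide

print(matches_hashtags("400 Volunteers are needed for areas that #Sandy destroyed.", SANDY_TAGS))
```

Matching on extracted hashtag tokens rather than raw substrings avoids treating a longer tag such as #sandyhook as a match for #sandy.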
1. Filtering
• Is disaster-related? (Yes / No)
• Contributes to situational awareness? (Yes / No)
1. Filtering: Training Data
• 4,406 tweets sampled uniformly from the Joplin dataset
• Annotated using CrowdFlower
• Pie chart of label shares: 32%, 60%, 8% (legend: Personal, Informative, Other)
2. Classification
Filtered tweets are classified into types:
• Caution & Advice
• Information Sources
• Damage & Casualties
• Donations
• ...
Finer-grained labels shown in the diagram: Health, Shelter, Food, Water, Logistics, ...
Distribution of Tweet Types
Joplin Tornado (2011)
Caution/Advice       50%
Info Source          18%
Donations            16%
Casualties/Damage    10%
Unknown               6%
Automatic Classification
Class                 Prec   Rec    F-Measure   AUC
Caution and advice    0.85   0.76   0.80        0.91
Information source    0.54   0.58   0.56        0.76
Donations             0.72   0.71   0.72        0.89
Casualties/damage     0.52   0.65   0.58        0.87
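The F-Measure column is consistent with the harmonic mean of precision and recall (the usual F1 definition, assumed here because the slide does not spell it out). A small check, with the function name `f_measure` chosen for illustration:

```python
def f_measure(precision, recall):
    """Harmonic mean of precision and recall (F1)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Precision and recall as reported in the table above.
reported = {
    "Caution and advice": (0.85, 0.76),
    "Information source": (0.54, 0.58),
    "Donations": (0.72, 0.71),
    "Casualties/damage": (0.52, 0.65),
}

for name, (p, r) in reported.items():
    print(f"{name}: F = {f_measure(p, r):.3f}")
```

Because the reported precision and recall are themselves rounded to two decimals, the recomputed values should only be expected to agree with the table to within rounding.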
Features:
• Binary (hashtags, URL, emotion etc.)
• Scalar (tweet length)
• Text features (unigram, bigram, POS tags, VerbNet etc.)
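The binary, scalar, and n-gram feature groups listed for the classifier could be computed along these lines. This is an illustrative sketch only: the feature names, the whitespace tokenizer, and the URL pattern are assumptions, and POS-tag and VerbNet features are left out because they need external resources.

```python
import re

def extract_features(text):
    """Build a small feature dictionary for one tweet."""
    tokens = text.lower().split()  # naive whitespace tokenisation (assumption)
    features = {
        # Binary features
        "has_hashtag": any(t.startswith("#") for t in tokens),
        "has_url": bool(re.search(r"https?://\S+", text)),
        # Scalar feature
        "length": len(text),
    }
    # Text features: unigram and bigram presence indicators
    for tok in tokens:
        features[f"uni={tok}"] = True
    for first, second in zip(tokens, tokens[1:]):
        features[f"bi={first}_{second}"] = True
    return features

feats = extract_features("400 Volunteers are needed for areas that #Sandy destroyed.")
print(feats["has_hashtag"], feats["has_url"], feats["length"])
```

A dictionary of named features keeps the three groups in one sparse representation, which most classifier toolkits can consume after vectorization.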
3. Extraction
Input: classified tweets
Example:
@JimFreund: Apparently we have no choice. There is a tornado watch in effect tonight.
Labels for Extraction: Training Data
• Type-dependent instruction
• Ask evaluators to copy-paste a word/phrase
from each tweet
Tool
• CMU ARK Twitter NLP
– Tokenization
– Feature extraction
– CRF learning
• Very easy to use: simply replace the part-of-speech
tags in the training set with any other labels, and
retrain
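To retrain a sequence tagger on the copy-pasted phrases, each phrase has to be projected onto the tweet's tokens as per-token labels. The sketch below shows one way to do this. The label names (`NUGGET`, `O`), the whitespace tokenizer, the tab-separated token-per-line output, and the annotated phrase itself are assumptions for illustration, not the format documented by the CMU ARK tool or data from the study.

```python
def label_tokens(tweet, phrase, positive="NUGGET", negative="O"):
    """Label each token of the tweet by whether it belongs to the annotated phrase."""
    tokens = tweet.split()
    target = phrase.split()
    labels = [negative] * len(tokens)
    # Find the first position where the phrase tokens occur contiguously.
    for start in range(len(tokens) - len(target) + 1):
        if target and tokens[start:start + len(target)] == target:
            for i in range(start, start + len(target)):
                labels[i] = positive
            break
    return list(zip(tokens, labels))

tweet = "There is a tornado watch in effect tonight."
# Hypothetical annotation: the phrase an evaluator might have copy-pasted.
for token, label in label_tokens(tweet, "tornado watch in effect"):
    print(f"{token}\t{label}")
```

Tokens outside the phrase receive the negative label, so the tagger sees every tweet as a full labeled sequence.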
Extraction Evaluation
Setting                                     Rec   Prec
Train 2/3 Joplin, Test 1/3 Joplin           78%   90%
Train 2/3 Sandy, Test 1/3 Sandy             41%   79%
Train Joplin, Test Sandy                    11%   78%
Train Joplin + 10% Sandy, Test 90% Sandy    21%   81%
• Precision counts an extraction as correct if it has one word or
more in common with what humans extracted (Imran et al., 2013)
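The lenient matching rule stated for precision (one word or more in common) can be written as a short scoring function. This is a sketch of that rule only: lower-casing, whitespace tokenization, and the sample phrase pairs are assumptions, and recall is not shown because the slide does not define how it is computed.

```python
def shares_word(predicted, human):
    """True if the two phrases have at least one word in common."""
    return bool(set(predicted.lower().split()) & set(human.lower().split()))

def lenient_precision(pairs):
    """Fraction of non-empty predictions that overlap with the human extraction."""
    made = [(pred, gold) for pred, gold in pairs if pred]
    if not made:
        return 0.0
    correct = sum(shares_word(pred, gold) for pred, gold in made)
    return correct / len(made)

# Hypothetical (system output, human annotation) pairs.
pairs = [
    ("tornado watch", "tornado watch in effect tonight"),
    ("bridges", "closing of NYC bridges"),
    ("home alone", "400 volunteers"),
    ("", "2 boys missing"),  # no prediction: ignored by precision
]
print(lenient_precision(pairs))
```

Under this rule a partial extraction still counts as a hit, which is worth keeping in mind when comparing the precision figures with stricter exact-match evaluations.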
Ongoing work
Self-service for crisis-related classification
• Machine learning software can be provided as
a service
– e.g. Google Prediction API
• Can we provide crisis-related tweet
classification as a service?
– Automatic collection of tweets
– Re-usable ontologies / default training sets
– Active learning
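One common way to realise the active-learning item is uncertainty sampling: ask annotators to label the tweets the current classifier is least sure about. The slide does not say which strategy is intended, so the following is only an assumed illustration; `select_for_labeling`, the sample texts, and the probability values are hypothetical.

```python
def select_for_labeling(scored_tweets, batch_size):
    """Pick the tweets whose predicted probability is closest to 0.5.

    scored_tweets: list of (tweet_text, probability_of_informative) pairs.
    """
    by_uncertainty = sorted(scored_tweets, key=lambda item: abs(item[1] - 0.5))
    return [text for text, _ in by_uncertainty[:batch_size]]

# Hypothetical classifier scores for unlabeled tweets.
scored = [
    ("Bridges must close by 7pm. #Sandy", 0.95),
    ("will just watch tv #Sandy", 0.10),
    ("power out on our block? #Sandy", 0.55),
    ("thinking of everyone tonight #Sandy", 0.40),
]
print(select_for_labeling(scored, 2))
```

Sending only the most ambiguous tweets to annotators is intended to reduce the labeling effort needed when a new crisis begins.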
Request Labeled / Unlabeled Datasets
Contact us at: mimran@qf.org.qa
References
• K. Starbird, L. Palen, A. Hughes, and S. Vieweg (2010). Chatter on the red: what hazards
threat reveals about the social life of microblogged information. In Proceedings of the 2010
ACM Conference on Computer Supported Cooperative Work, pages 241–250. ACM.
• M. Latonero and I. Shklovski (2010). “Respectfully Yours in Safety and Service”: Emergency
Management & Social Media Evangelism. In Proceedings of the 7th International ISCRAM
Conference, Seattle, Vol. 1.
• M. Imran, S. Elbassuoni, C. Castillo, F. Diaz, and P. Meier (2013). Practical Extraction of
Disaster-Relevant Information from Social Media. WWW-2013 SWDM, May 2013.
Thank you!
Muhammad Imran
mimran@qf.org.qa
With thanks to Carlos Castillo for several slides

Sample pptx for embedding into website for demo
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 

ISCRAM 2013: Extracting Information Nuggets from Disaster-Related Messages in Social Media

  • 1. Extracting Information Nuggets from Disaster-Related Messages in Social Media Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz, Patrick Meier
  • 2. Outline • Social Media response to disaster • Finding tactical and actionable information • Disaster ontologies • Filtering, classification and extraction • Ongoing work • Discussion
  • 3. Disaster and Social Media 2.3 million tweets reflecting the words “Haiti” or “Red Cross” from Jan 12 to Jan 14, 2010 http://www.sysomos.com
  • 5. Why Social Media? • Virtual Collaboration, Information Sharing • Highly valuable information • Contribute to situational awareness • Highly useful, if analyzed timely and effectively
  • 6. Sandy Tweets
      – @NYGovCuomo orders closing of NYC bridges. Only Staten Island bridges unaffected at this time. Bridges must close by 7pm. #Sandy #NYC.
      – rt @911buff: public help needed: 2 boys 2 & 4 missing nearly 24 hours after they got separated from their mom when car submerged in si. #sandy #911buff
      – freaking out. home alone. will just watch tv #Sandy #NYC.
      – 400 Volunteers are needed for areas that #Sandy destroyed.
  • 7. Sandy Tweets (same four tweets), labeled: Personal, Informative
  • 8–9. Sandy Tweets (same four tweets), labeled: Personal, Informative, Caution and Advice, Casualties and Damage, Donations
  • 10. Finding Tactical & Actionable Information
      – Message types: Personal; Informative (Direct & Indirect); Other
      – Informative categories:
          – Caution and advice: siren heard, warning issued/lifted, etc.
          – Casualties and damage: people dead, injured, damage, etc.
          – Donations: money, shelter, blood, goods, or services
          – People missing, found, or seen
          – Information source: webpages, photos, videos, information sources
          – …
  • 12. Our Datasets: Joplin Dataset
      – 206,764 tweets collected during the tornado that hit Joplin, Missouri on May 22, 2011
      – Collected by researchers at the University of Colorado at Boulder
      – Collected through the Twitter API by monitoring tweets with the hashtags #joplin or #tornado
  • 13. Our Datasets: Sandy Dataset
      – 140,000 tweets collected during Hurricane Sandy, which hit the northeastern USA on Oct 29, 2012
      – Collected through the Twitter API by monitoring tweets with the hashtags #sandy or #nyc
  • 14. 1. Filtering
      – Is the tweet disaster-related? (Yes / No)
      – Does it contribute to situational awareness? (Yes / No)
  • 15. 1. Filtering: Training Data
      – 4,406 tweets sampled uniformly from the Joplin dataset
      – Annotated using CrowdFlower
      – Pie chart classes (in slide order): Personal, Informative, Other
      – Pie chart shares (in slide order): 32%, 60%, 8%
  • 16. 2. Classification
      – Input: filtered tweets
      – Classes: Caution & Advice, Information Sources, Damage & Casualties, Donations, ...
      – Finer-grained classes shown: Health, Shelter, Food, Water, Logistics, ...
  • 17. Distribution of Tweet Types: Joplin Tornado (2011)
      – Categories (in slide order): Caution/Advice, Info Source, Donations, Casualties/Damage, Unknown
      – Shares (in slide order): 50%, 18%, 16%, 10%, 6%
  • 18. Automatic Classification
      Class                 Prec   Rec    F-Measure   AUC
      Caution and advice    0.85   0.76   0.80        0.91
      Information source    0.54   0.58   0.56        0.76
      Donations             0.72   0.71   0.72        0.89
      Casualties/damage     0.52   0.65   0.58        0.87
      Features:
      – Binary (hashtags, URL, emotion etc.)
      – Scalar (tweet length)
      – Text features (unigram, bigram, POS tags, VerbNet etc.)
  • 19. 3. Extraction
      – Input: classified tweets
      – Example: @JimFreund: Apparently we have no choice. There is a tornado watch in effect tonight.
  • 20. Labels for Extraction: Training Data • Type-dependent instruction • Ask evaluators to copy-paste a word/phrase from each tweet
  • 21. Tool
      – CMU ARK Twitter NLP: tokenization, feature extraction, CRF learning
      – Very easy to use: simply change the training set (part-of-speech tags) into anything, and re-train
  • 22. Extraction Evaluation
      Setting                                      Rec    Prec
      Train 2/3 Joplin, Test 1/3 Joplin            78%    90%
      Train 2/3 Sandy, Test 1/3 Sandy              41%    79%
      Train Joplin, Test Sandy                     11%    78%
      Train Joplin + 10% Sandy, Test 90% Sandy     21%    81%
      – Precision is: one word or more in common with what humans extracted (Imran et al., 2013)
  • 24. Self-service for crisis-related classification
      – Machine learning software can be provided as a service, e.g. Google Prediction API
      – Can we provide crisis-related tweet classification as a service?
          – Automatic collection of tweets
          – Re-usable ontologies / default training sets
          – Active learning
  • 25. Request Labeled / Unlabeled Datasets Contact us at: mimran@qf.org.qa
  • 26. References
      – K. Starbird, L. Palen, A. Hughes, and S. Vieweg (2010). Chatter on the red: what hazards threat reveals about the social life of microblogged information. In Proceedings of the 2010 ACM Conference on Computer Supported Cooperative Work, pages 241–250. ACM.
      – M. Latonero and I. Shklovski (2010). "Respectfully Yours in Safety and Service": Emergency Management & Social Media Evangelism. In Proceedings of the 7th International ISCRAM Conference, Seattle. Vol. 1.
      – M. Imran, S. Elbassuoni, C. Castillo, F. Diaz, and P. Meier (2013). Practical Extraction of Disaster-Relevant Information from Social Media. WWW-2013 SWDM, May 2013.
  • 27. Thank you! Muhammad Imran mimran@qf.org.qa With thanks to Carlos Castillo for several slides
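
The three feature groups listed on slide 18 (binary, scalar, text) can be illustrated with a minimal sketch. This is not the authors' feature extractor: the function name `tweet_features`, the regular expressions, and the feature key names are all hypothetical, the slide's "emotion" feature is assumed here to be a simple emoticon check, and the POS-tag and VerbNet features are left out because they need external resources.

```python
import re
from collections import Counter

# All patterns below are illustrative assumptions, not taken from the slides.
URL_RE = re.compile(r"https?://\S+")
HASHTAG_RE = re.compile(r"#\w+")
EMOTICON_RE = re.compile(r"[:;=][-']?[)(DPp]")  # assumed stand-in for "emotion"
TOKEN_RE = re.compile(r"[#@]?\w+")


def tweet_features(text):
    """Return binary, scalar, and n-gram count features for one tweet."""
    # Drop URLs before tokenizing so they do not pollute the n-grams.
    tokens = [t.lower() for t in TOKEN_RE.findall(URL_RE.sub(" ", text))]

    features = {
        # Binary features
        "has_hashtag": bool(HASHTAG_RE.search(text)),
        "has_url": bool(URL_RE.search(text)),
        "has_emoticon": bool(EMOTICON_RE.search(text)),
        # Scalar feature: tweet length in characters
        "length": len(text),
    }

    # Text features: unigram and bigram counts
    ngrams = Counter("uni=" + t for t in tokens)
    ngrams.update("bi=" + a + "_" + b for a, b in zip(tokens, tokens[1:]))
    features.update(ngrams)
    return features


if __name__ == "__main__":
    example = "400 Volunteers are needed for areas that #Sandy destroyed."
    for name, value in sorted(tweet_features(example).items()):
        print(name, value)
```

A dictionary like this could be fed to any classifier that accepts sparse named features; the slides do not say which learner was used.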

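Slide 22 defines precision as the system's extraction having one word or more in common with what humans extracted. A minimal sketch of that overlap criterion follows, assuming whitespace tokenization and case-insensitive matching (neither is specified on the slide). The names `words` and `overlap_precision` are hypothetical, and recall is omitted because the slide does not define it.

```python
def words(phrase):
    """Lower-cased set of whitespace-separated words; empty for None or ''."""
    return set((phrase or "").lower().split())


def overlap_precision(pairs):
    """Share of system extractions that overlap the human extraction.

    pairs: list of (system_phrase, human_phrase) tuples, one per tweet.
    A system_phrase of None or '' means nothing was extracted for that
    tweet, so the tweet is excluded from the denominator.
    """
    attempted = [(s, h) for s, h in pairs if s]
    if not attempted:
        return 0.0
    # A hit is any extraction sharing at least one word with the human label.
    hits = sum(1 for s, h in attempted if words(s) & words(h))
    return hits / len(attempted)


if __name__ == "__main__":
    sample = [
        ("tornado watch", "a tornado watch in effect"),  # overlaps
        ("tonight", "tornado watch"),                    # no overlap
        (None, "bridges must close"),                    # nothing extracted
    ]
    print(overlap_precision(sample))
```

Because a single shared word counts as a hit, this is a lenient criterion; a stricter variant would require exact or near-exact span matches.
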
Editor's notes

  1. Social media empowers individuals, giving them a platform from which to share opinions, experiences and information from anywhere at any time. The shared information can be highly useful, provided it is analyzed in a timely and effective manner. That is what I am going to present in this session.
  2. Finding tactical and actionable information among the millions of messages that people post on social media is a complex and challenging task. For disasters specifically, we devised an ontology with three main stages. Each stage refines a piece of information so that it can contribute to disaster management. To get to actionable information, we must first assign each incoming message to a predefined, disaster-specific category.
  3. This step identifies the named entities, caution/advice, temporal information, and other details in a message.
  4. The inter-annotator agreement value shows the level of agreement among workers on an assessable unit (in our case, a tweet). High agreement indicates that different workers frequently gave the same response for the same tweet message.
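
One simple way to quantify the agreement described in note 4 is the fraction of worker pairs that gave the same label to a tweet, averaged over tweets. The sketch below assumes plain pairwise percent agreement. It is not necessarily the measure CrowdFlower reports, and the function names and sample labels are hypothetical.

```python
from itertools import combinations


def pairwise_agreement(labels):
    """Fraction of worker pairs that gave the same label to one tweet."""
    pairs = list(combinations(labels, 2))
    if not pairs:
        # Zero or one judgment: no pair can disagree.
        return 1.0
    return sum(a == b for a, b in pairs) / len(pairs)


def mean_agreement(judgments):
    """Average pairwise agreement over all tweets.

    judgments: dict mapping a tweet id to the list of labels it received.
    """
    scores = [pairwise_agreement(labels) for labels in judgments.values()]
    return sum(scores) / len(scores) if scores else 0.0


if __name__ == "__main__":
    judgments = {
        "tweet-1": ["Informative", "Informative", "Informative"],
        "tweet-2": ["Informative", "Personal"],
    }
    print(mean_agreement(judgments))
```

Percent agreement does not correct for agreement expected by chance; chance-corrected statistics exist but are not mentioned in the notes.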