SlideShare ist ein Scribd-Unternehmen logo
1 von 21
Social media and it's use
    in disease surveillance


March 2010
✤   How do we improve disease surveillance?

✤   Can social media (e.g. twitter) be effectively
    used to monitor disease outbreaks?
Tweets: disease reports

✤
    Omg.. The never-ending flu+sore throat.. ☹ bleh.. ☹

✤   Stomach flu. Urgh.

✤   i love puking... f@#k you flu

✤   Having a sore throat,sucks.Having flu,sucks even
    MORE.DAMMIT!

✤   Feeling dizzy/ feverish ever since that class at the gym!
    overexertion or the flu??
Tweets: non disease reports

✤   Study finds H1N1 flu in pregnancy is critical
    risk - Reuters - http://bit.ly/bLiLnz
✤   This March Madness turns out to be the flu!
✤   Smiling is infectious, You can catch it like the
    flu. Someone smiled at me today, And I
    started smiling too.
We need Natural Language
Processing (NLP)


✤   We need a NLP engine in order to process
    tweets:
✤   Tweet → NLP Engine → It's the flu!
Maybe we need NLP + Ontologies


✤   Do we just search for simple keywords?
✤   An ontology can provide us with organized
    concepts relevant to a domain (i.e. health,
    biomedicine)
✤   How about processing natural language to match
    concepts organized in an ontology?
Ontologies help answer these
questions

✤   How do we know if a user is referring to a
    symptom or a disease?
✤   We seem to need a set of keywords. Where do get
    this set of symptoms and disease names?
✤   How do we link references to one or more
    symptom to a specific disease?
The UMLS Ontology

✤   A comprehensive thesaurus and ontology of
    biomedical concepts
✤   Facilitates development of computer systems that
    behave as if they "understand" the meaning of the
    language of biomedicine and health.
✤   Integrates 2+ million names for ~900k concepts
    from 60+ families of biomedical vocabularies, and
    12 million relations among these concepts.
UMLS & MetaMap


✤   MetaMap is a tool that given an arbitrary
    piece of text, finds and returns the relevant
    concepts available in the UMLS Ontology
✤   MetaMap is a software interface to query
    the “MetaThesaurus” and the “Semantic
    Network”, both a component of UMLS
Concept mapping with MetaMap
✤   Using MetaMap to query the
    MetaThesaurus, we can map the following
    text strings to the concept "Atrial
    Fibrillation"
     ✤ Atrial fibrillation!
     ✤ AF!

     ✤ AFib!


     ✤ Atrial fibrillation (disorder)
✤   But who actually tweets “atrial
           fibrillation” ??
“Having a sore throat, sucks.
Having flu, sucks even MORE”

✤   Matches:
    ✤   SORETHROAT (Sore Throat) [Sign or
        Symptom]
    ✤   Flu (Influenza) [Disease or Syndrome]
    ✤   Sucking [Physiologic Function]
“i love puking... damn you flu”


✤   Matches:
    ✤   I (Iodides) [Inorganic Chemical]
    ✤   Love [Mental Process]
    ✤   Flu (Influenza) [Disease or Syndrome]
“Feeling dizzy/ feverish ever since that class at
the gym! overexertion or the flu??”

✤   Matches:
    ✤   Feeling dizzy [Sign or Symptom]
    ✤   Feverish (Fever) [Finding]
    ✤   Overexertion (Exhaustion due to excessive
        exertion) [Injury or Poisoning]
    ✤   Flu (Influenza) [Disease or Syndrome]
“Smiling is infectious, u can catch it like the
flu; someone smiled at me today, and I started
smiling too”
✤   Matches:

    ✤   Smiling [Social Behavior]

    ✤   Infection [Disease or Syndrome]

    ✤   Catch (Catch - Finding of sensory dimension of pain)
        [Sign or Symptom]

    ✤   Flu (Influenza) [Disease or Syndrome]

    ✤   Today [Temporal Concept]
✤   Not the best results but it’s a
                start...
Using MetaMap

✤   Free of Charge!

✤   MetaMap Transfer (MMTx) is a java-based distributable
    version of the MetaMap program

✤   Requires 7GB disk space (uncompressed) and at least 1GB
    of RAM (2GB recommended)

✤   “MetaMap is not an end user product. Users will need a
    moderate amount of programming knowledge to use
    MMTx effectively.” - from UMLS website
We identified tweets that mention
a concept...SO WHAT?


✤   We can't assume its a case report!
✤   How the we go around this?
✤   Are we done here?
Supervised learning to improve
the results?


✤   What if we use machine learning?
✤   Supervised learning is a machine learning
    technique for deducing a function from
    training data
Is it feasible?

✤   Weka is a collection of machine learning algorithms for data
    mining tasks.

✤   Algorithms can be applied directly to a dataset or called from
    your own Java code.

✤   Input: dataset of concept matches; Output: Classifier Java
    Class

✤   This automatically generated java class can be easily be used
    to answer if a tweet matching X and Y medical concepts is or is
    not a disease report
Processing a tweet overview

✤   Get Tweet
✤   Process tweet using MetaMap
✤   Get matching concepts from MetaMap
✤   Feed the matches to the Classifier Java Class
✤   Get a True or False answer indicator “it's a disease
    report”

Weitere ähnliche Inhalte

Ähnlich wie Social media and it's use in disease surveillance

On Norman doors, sign design, and why you need to think like a designer!
On Norman doors, sign design, and why you need to think like a designer!On Norman doors, sign design, and why you need to think like a designer!
On Norman doors, sign design, and why you need to think like a designer!Joyce Lee
 
Homework Is Necessary Essay. Online assignment writing service.
Homework Is Necessary Essay. Online assignment writing service.Homework Is Necessary Essay. Online assignment writing service.
Homework Is Necessary Essay. Online assignment writing service.Lisa Richardson
 
Module 5: Decision Trees
Module 5: Decision TreesModule 5: Decision Trees
Module 5: Decision TreesSara Hooker
 
Automated health responses
Automated health responses Automated health responses
Automated health responses Austin Powell
 
Bibliographic Essay Guidelines. Online assignment writing service.
Bibliographic Essay Guidelines. Online assignment writing service.Bibliographic Essay Guidelines. Online assignment writing service.
Bibliographic Essay Guidelines. Online assignment writing service.Diana Hole
 
Mental Health and AI.pptx
Mental Health and AI.pptxMental Health and AI.pptx
Mental Health and AI.pptxSPMujuni
 
Explore the Effects of Emoticons on Twitter Sentiment Analysis
Explore the Effects of Emoticons on Twitter Sentiment Analysis Explore the Effects of Emoticons on Twitter Sentiment Analysis
Explore the Effects of Emoticons on Twitter Sentiment Analysis csandit
 
Psychology Research Paper Examples
Psychology Research Paper ExamplesPsychology Research Paper Examples
Psychology Research Paper ExamplesErin Ross
 
Contoh Soal Essay Tentang Fotosintesis
Contoh Soal Essay Tentang FotosintesisContoh Soal Essay Tentang Fotosintesis
Contoh Soal Essay Tentang FotosintesisAndrea Lawson
 
Online communities as a vehicle to improving interactions between patients an...
Online communities as a vehicle to improving interactions between patients an...Online communities as a vehicle to improving interactions between patients an...
Online communities as a vehicle to improving interactions between patients an...Manuel Armayones
 
PPT - Urgent Essay Writing Help PowerPoint Presentation, Free Dow
PPT - Urgent Essay Writing Help PowerPoint Presentation, Free DowPPT - Urgent Essay Writing Help PowerPoint Presentation, Free Dow
PPT - Urgent Essay Writing Help PowerPoint Presentation, Free DowNat Rice
 
Pairing Analytics With Qualitative Methods to Understand the WHY
Pairing Analytics With Qualitative Methods to Understand the WHYPairing Analytics With Qualitative Methods to Understand the WHY
Pairing Analytics With Qualitative Methods to Understand the WHYMichele Kiss
 
Uchealth Character Analysis
Uchealth Character AnalysisUchealth Character Analysis
Uchealth Character AnalysisJennifer Baker
 
ROJoson PEP Talk: Clinical Diagnostic Algorithm for Common Surface Lumps
ROJoson PEP Talk: Clinical Diagnostic Algorithm for Common Surface LumpsROJoson PEP Talk: Clinical Diagnostic Algorithm for Common Surface Lumps
ROJoson PEP Talk: Clinical Diagnostic Algorithm for Common Surface LumpsReynaldo Joson
 
David papini escape emotional intelligence traps
David papini   escape emotional intelligence trapsDavid papini   escape emotional intelligence traps
David papini escape emotional intelligence trapsDavid Papini
 
The JoyPop Resilience App for Youth
The JoyPop Resilience App for YouthThe JoyPop Resilience App for Youth
The JoyPop Resilience App for YouthChristine Wekerle
 
Critique Paper Example - How To Write Critique Pap
Critique Paper Example - How To Write Critique PapCritique Paper Example - How To Write Critique Pap
Critique Paper Example - How To Write Critique PapLisa Graves
 

Ähnlich wie Social media and it's use in disease surveillance (20)

Meet Empifany
Meet EmpifanyMeet Empifany
Meet Empifany
 
On Norman doors, sign design, and why you need to think like a designer!
On Norman doors, sign design, and why you need to think like a designer!On Norman doors, sign design, and why you need to think like a designer!
On Norman doors, sign design, and why you need to think like a designer!
 
Homework Is Necessary Essay. Online assignment writing service.
Homework Is Necessary Essay. Online assignment writing service.Homework Is Necessary Essay. Online assignment writing service.
Homework Is Necessary Essay. Online assignment writing service.
 
Module 5: Decision Trees
Module 5: Decision TreesModule 5: Decision Trees
Module 5: Decision Trees
 
Automated health responses
Automated health responses Automated health responses
Automated health responses
 
Bibliographic Essay Guidelines. Online assignment writing service.
Bibliographic Essay Guidelines. Online assignment writing service.Bibliographic Essay Guidelines. Online assignment writing service.
Bibliographic Essay Guidelines. Online assignment writing service.
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
Mental Health and AI.pptx
Mental Health and AI.pptxMental Health and AI.pptx
Mental Health and AI.pptx
 
Explore the Effects of Emoticons on Twitter Sentiment Analysis
Explore the Effects of Emoticons on Twitter Sentiment Analysis Explore the Effects of Emoticons on Twitter Sentiment Analysis
Explore the Effects of Emoticons on Twitter Sentiment Analysis
 
Psychology Research Paper Examples
Psychology Research Paper ExamplesPsychology Research Paper Examples
Psychology Research Paper Examples
 
Contoh Soal Essay Tentang Fotosintesis
Contoh Soal Essay Tentang FotosintesisContoh Soal Essay Tentang Fotosintesis
Contoh Soal Essay Tentang Fotosintesis
 
Online communities as a vehicle to improving interactions between patients an...
Online communities as a vehicle to improving interactions between patients an...Online communities as a vehicle to improving interactions between patients an...
Online communities as a vehicle to improving interactions between patients an...
 
The Cold Extractor
The Cold ExtractorThe Cold Extractor
The Cold Extractor
 
PPT - Urgent Essay Writing Help PowerPoint Presentation, Free Dow
PPT - Urgent Essay Writing Help PowerPoint Presentation, Free DowPPT - Urgent Essay Writing Help PowerPoint Presentation, Free Dow
PPT - Urgent Essay Writing Help PowerPoint Presentation, Free Dow
 
Pairing Analytics With Qualitative Methods to Understand the WHY
Pairing Analytics With Qualitative Methods to Understand the WHYPairing Analytics With Qualitative Methods to Understand the WHY
Pairing Analytics With Qualitative Methods to Understand the WHY
 
Uchealth Character Analysis
Uchealth Character AnalysisUchealth Character Analysis
Uchealth Character Analysis
 
ROJoson PEP Talk: Clinical Diagnostic Algorithm for Common Surface Lumps
ROJoson PEP Talk: Clinical Diagnostic Algorithm for Common Surface LumpsROJoson PEP Talk: Clinical Diagnostic Algorithm for Common Surface Lumps
ROJoson PEP Talk: Clinical Diagnostic Algorithm for Common Surface Lumps
 
David papini escape emotional intelligence traps
David papini   escape emotional intelligence trapsDavid papini   escape emotional intelligence traps
David papini escape emotional intelligence traps
 
The JoyPop Resilience App for Youth
The JoyPop Resilience App for YouthThe JoyPop Resilience App for Youth
The JoyPop Resilience App for Youth
 
Critique Paper Example - How To Write Critique Pap
Critique Paper Example - How To Write Critique PapCritique Paper Example - How To Write Critique Pap
Critique Paper Example - How To Write Critique Pap
 

Kürzlich hochgeladen

What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 

Kürzlich hochgeladen (20)

What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 

Social media and it's use in disease surveillance

  • 1. Social media and it's use in disease surveillance March 2010
  • 2. How do we improve disease surveillance? ✤ Can social media (e.g. twitter) be effectively used to monitor disease outbreaks?
  • 3. Tweets: disease reports ✤ Omg.. The never-ending flu+sore throat.. ☹ bleh.. ☹ ✤ Stomach flu. Urgh. ✤ i love puking... f@#k you flu ✤ Having a sore throat,sucks.Having flu,sucks even MORE.DAMMIT! ✤ Feeling dizzy/ feverish ever since that class at the gym! overexertion or the flu??
  • 4. Tweets: non disease reports ✤ Study finds H1N1 flu in pregnancy is critical risk - Reuters - http://bit.ly/bLiLnz ✤ This March Madness turns out to be the flu! ✤ Smiling is infectious, You can catch it like the flu. Someone smiled at me today, And I started smiling too.
  • 5. We need Natural Language Processing (NLP) ✤ We need a NLP engine in order to process tweets: ✤ Tweet → NLP Engine → It's the flu!
  • 6. Maybe we need NLP + Ontologies ✤ Do we just search for simple keywords? ✤ An ontology can provide us with organized concepts relevant to a domain (i.e. health, biomedicine) ✤ How about processing natural language to match concepts organized in an ontology?
  • 7. Ontologies help answer these questions ✤ How do we know if a user is referring to a symptom or a disease? ✤ We seem to need a set of keywords. Where do get this set of symptoms and disease names? ✤ How do we link references to one or more symptom to a specific disease?
  • 8. The UMLS Ontology ✤ A comprehensive thesaurus and ontology of biomedical concepts ✤ Facilitates development of computer systems that behave as if they "understand" the meaning of the language of biomedicine and health. ✤ Integrates 2+ million names for ~900k concepts from 60+ families of biomedical vocabularies, and 12 million relations among these concepts.
  • 9. UMLS & MetaMap ✤ MetaMap is a tool that given an arbitrary piece of text, finds and returns the relevant concepts available in the UMLS Ontology ✤ MetaMap is a software interface to query the “MetaThesaurus” and the “Semantic Network”, both a component of UMLS
  • 10. Concept mapping with MetaMap ✤ Using MetaMap to query the MetaThesaurus, we can map the following text strings to the concept "Atrial Fibrillation" ✤ Atrial fibrillation! ✤ AF! ✤ AFib! ✤ Atrial fibrillation (disorder)
  • 11. But who actually tweets “atrial fibrillation” ??
  • 12. “Having a sore throat, sucks. Having flu, sucks even MORE” ✤ Matches: ✤ SORETHROAT (Sore Throat) [Sign or Symptom] ✤ Flu (Influenza) [Disease or Syndrome] ✤ Sucking [Physiologic Function]
  • 13. “i love puking... damn you flu” ✤ Matches: ✤ I (Iodides) [Inorganic Chemical] ✤ Love [Mental Process] ✤ Flu (Influenza) [Disease or Syndrome]
  • 14. “Feeling dizzy/ feverish ever since that class at the gym! overexertion or the flu??” ✤ Matches: ✤ Feeling dizzy [Sign or Symptom] ✤ Feverish (Fever) [Finding] ✤ Overexertion (Exhaustion due to excessive exertion) [Injury or Poisoning] ✤ Flu (Influenza) [Disease or Syndrome]
  • 15. “Smiling is infectious, u can catch it like the flu; someone smiled at me today, and I started smiling too” ✤ Matches: ✤ Smiling [Social Behavior] ✤ Infection [Disease or Syndrome] ✤ Catch (Catch - Finding of sensory dimension of pain) [Sign or Symptom] ✤ Flu (Influenza) [Disease or Syndrome] ✤ Today [Temporal Concept]
  • 16. Not the best results but it’s a start...
  • 17. Using MetaMap ✤ Free of Charge! ✤ MetaMap Transfer (MMTx) is a java-based distributable version of the MetaMap program ✤ Requires 7GB disk space (uncompressed) and at least 1GB of RAM (2GB recommended) ✤ “MetaMap is not an end user product. Users will need a moderate amount of programming knowledge to use MMTx effectively.” - from UMLS website
  • 18. We identified tweets that mention a concept...SO WHAT? ✤ We can't assume its a case report! ✤ How the we go around this? ✤ Are we done here?
  • 19. Supervised learning to improve the results? ✤ What if we use machine learning? ✤ Supervised learning is a machine learning technique for deducing a function from training data
  • 20. Is it feasible? ✤ Weka is a collection of machine learning algorithms for data mining tasks. ✤ Algorithms can be applied directly to a dataset or called from your own Java code. ✤ Input: dataset of concept matches; Output: Classifier Java Class ✤ This automatically generated java class can be easily be used to answer if a tweet matching X and Y medical concepts is or is not a disease report
  • 21. Processing a tweet overview ✤ Get Tweet ✤ Process tweet using MetaMap ✤ Get matching concepts from MetaMap ✤ Feed the matches to the Classifier Java Class ✤ Get a True or False answer indicator “it's a disease report”

Hinweis der Redaktion

  1. \n
  2. \n
  3. \n
  4. \n
  5. \n
  6. \n
  7. \n
  8. \n
  9. \n
  10. \n
  11. \n
  12. \n
  13. \n
  14. \n
  15. \n
  16. \n
  17. \n
  18. \n
  19. \n
  20. \n
  21. \n