SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Elsevier Health Sciences
Smart Content Drives Smart Applications
Linked Data in HCLS for Commercial Applications


  Semantic Web for Health Care and Life           Iker Huerga
  Sciences Summer School, W3C@MIT                 Sr. Semantic Software Engineer
  August 29, 2012                                 i.huerga@elsevier.com
                                                  @ihuerga
Elsevier Proprietary and Confidential
The Challenge




Elsevier Proprietary and Confidential
The Challenge




     Providing doctors/researchers with
       the right information in the right
      moment to make the best decisions




Elsevier Health Sciences | Proprietary and Confidential
How to solve it?




Elsevier Proprietary and Confidential
How to Solve it

•Step 1
 Making Elsevier’s authoritative Health Care and Life
 Sciences content “smarter”

•Step 2
 Enriching Elsevier’s content by integration with third party
 data

•Step 3
 Creating interfaces to provide fast discoverability of the
 most relevant answers and more intuitive searching.

                                       We Need Semantic Web for all This
Elsevier Health Sciences | Proprietary and Confidential
Introducing
                                        Smart Content




Elsevier Proprietary and Confidential
Taxonomy-Powered Content = Smart Content
Content with applied taxonomy




Content today with
   structured XML




                                                                     Copyright 2011 Outsell Gilbane Services, Inc.
 Elsevier Proprietary and Confidential                                                  http://www.outsellinc.com
                                         http://gilbane.com/xml/2009/11/what-is-smart-content.html#ixzz0hnuRhaBc
Smart Content At Elsevier

                                                                Smart Content Applications
                                                                           Better discovery through
                                                                           semantic search & navigation
               Linked data from
                                                                           •Faceted search & browse
               partners and the Web
                                                                           •Ontology-driven navigation
                                                                           •Task-specific results
                                                                           •Personalized/localized results
                                                                           •Question answering
                                                                           •Link to evidenced-based content

                                         Text                              Better understanding
                                                                           through analysis and
                                                Entities,                  visualization
Elsevier                                        concepts                   •Tag clouds
content                              Tables     and                        •Heatmaps
                                                                           •Streamgraphs
                                                relationships              •Scatterplots
                                                                           •Time series
                                     Images                                •Animations
                                                                           New knowledge through
                                                                           aggregation and synthesis
                                                                           •Topic pages
                      Elsevier                                             •Social network maps
                      knowledge                                            •Geolocation maps
                      organization                                         •Data mashups
                      systems                                              •Text mining reports

                                                                                                       8
Elsevier Proprietary and Confidentiall
Introducing EMMeT (Elsevier Merged Medical Taxonomy)
Parent Terms
•      Breast Disorders                      2
•      Cancer of the Thorax
•      Mammary Neoplasms
•      More….
                                                                              Symptoms            Breast Lump, Nipple Retraction, …..


    Medical Name
                                                                              Diagnostic
          Malignant Neoplasm of the Breast                                                        Mammography, Breast Biopsy, …..
                                                                              Procedures
    Consumer Friendly Name
          Breast Cancer
    Synonyms                                     1                                            4
          Malignant Tumor of Breast                                           Treatment
                                                                                                  Chemotherapy, Mastectomy, ….
          Malignant Breast Neoplasm                                           Procedures
                                                     Semantic Relationships
          Breast Ca
    Codes
          ICD9 – 174.9
          MeSH – D001943                                                      Medications         Tamoxifen, Doxorubicin, …..
          SNOMED-CT – 190121004
    Semantic Type/Group
         Neoplastic Process/Disease
                                                                              Risk Factors        Family History, Genetics, Predisposition, ….


Children Terms
•       Breast Sarcoma
                                             3                                Prevention          Screening, Preemptive Mastectomy, ….
•       Familial Breast Cancer
•       Malignant lymphoma of the Breast
•       Malignant Neoplasm of the breast outer
        quadrant                                                              Complications       Metastatic Cancer, ….
•       More…

    Elsevier Proprietary and Confidential
Automated Indexing: Weighted Tags for Better Search
                                                                                   Article-level SMART Content tags help
                                                                                   confirm relevance and provide a topical
                                                                                   overview about a piece of content.




                                         Paragraph-level SMART Content tags
                                         uncover highly-relevant information not
                                         necessarily evident from the title or
                                         abstract alone.




 Elsevier Proprietary and Confidential
Standards
                                        The Key Piece




Elsevier Proprietary and Confidential
The Satellite: a Linked Data Compliant Data Format


• Motivations:
    –Help answer research questions
    –Direct material to interested readers
    –Extract disparate facts from the
     literature to create knowledge bases      Satellite Specification First Version
                                               •Use RDF/XML serialization
• Technical Requirements:
                                               •Use XML Schemas to validate the syntax
    –Use of open standards based               so that document which validate will
     metadata frameworks: SKOS, DCMI
                                               produce correct RDF
     and SWAN
    –Need of a common model to represent       •Use the extensive XML-capable
     ontological annotations                   infrastructure, QA tools, etc.
    –Data will be transferred from suppliers
     to Elsevier and back
    –QA of tags (aka Provenance)
    –Some people have RDF knowledge,
     but very limited in proportion


Elsevier Proprietary and Confidential
The Satellite Format: a Linked Data Compliant Data




Elsevier Health Sciences | Proprietary and Confidential
The Satellite Format: a Linked Data Compliant Data


•What we have learned so far
    –RDF/XML has some limitations
            • Not all RDF graphs can be serialized in XML (QNames, Unicode characters)

            • There is no support for RDF Graphs in RDF/XML, at the moment one satellite is one RDF Graph
              in the LDR

            • Complexity of RDF/XML abbreviation rules

            • Can’t put attributes on the predicates

    –An XML Capable infrastructure does not necessarily entail an RDF/XML Capable
     infrastructure
            • Many XML tools can’t be used with RDF/XML

            • Multiple different serializations for the same RDF Graph exist

            • XML Schema validation makes the specification less flexible

                                It’s time to move towards a more “RDF friendly” serialization




Elsevier Health Sciences | Proprietary and Confidential
The Satellite Format: a Linked Data Compliant Data


•Turtle as the RDF serialization format
    –It is becoming the de facto serialization for RDF
    –It makes RDF much more ‘human friendly’
    –Gives us the flexibility we need for the next satellite generation
    –All the Libraries we are currently using support Turtle
    –It follows the triple pattern syntax of SPARQL, more convenient for querying


•Steps to the transition
    –Both serializations will coexist for a period of time
    –Internal tools, Validation, QA, etc., need to be adapted to ‘understand’ Turtle
    –Tools for transforming RDF/XML into Turtle needs to be provided to the suppliers




Elsevier Health Sciences | Proprietary and Confidential
How is all this transformed
                             into Commercial applications




Elsevier Proprietary and Confidential
The Linked Data Repository


•    The LDR stores metadata describing Non Information Resources [httpRange14]

•    The LDR provides a rich semantic layer on top of IR and enables search and discovery of
     metadata

•    Extends Elsevier extracted knowledge by interlinking data with other related sources of
     content from partners and the Web, using the Web as its API


•    Optimized for high-volume of RDF I/O operations

•    Provide service layer APIs for ease of integration

•    Opens up discovery and utility of content beyond searchable documents




Elsevier Health Sciences | Proprietary and Confidential
Represent Enhancements and Vocabularies In RDF
Satellites
                                        •Creation of Satellite Standards
                                         –Linked data compliant RDF representing metadata objects
                                         –Leverage common namespaces from dct, pav, rdf, skos
                                         –Taxonomies in SKOS to enhance portability in the linked data
                                          world                               LDR

                                         –Subject tagging against a vocabulary representing extracted
                                          knowledge
                                         –Concept URIs that can be equated to URIs in linked data

                                        •Example RDF Statements
                                         –Tags from a taxonomy for a given document
                                         –Document sections relevant to a given concept
                                         –Document sections providing answers to a given question
                                         –Genes mentioned in a given document
                                         –Documents supporting or disputing conclusions of a given
                                          document
Elsevier Proprietary and Confidential    –Concepts in the areas of expertise for a given author
LDR Semantic Infrastructure


                                                               Linked Data                                             Linked Data Loader (REST)

                                                                                                                           Data Space Services
                                                                Vocab &




                                                                                             Annotation




                                                                                                                                                                                         Linked Data
                                                                Annotation




                                                                                              Satellites



                                                                                                                        Satellites



                                                                                                                                                 Satellites




                                                                                                                                                                             3rd Party
                                                                                                                                                  Vocab
                                                                                                                          Asset
                                                                RDF




                                                                                                                                                                               Data
                                                                Satellites
                  Smart Content Indexing Pipeline

                                                                                             Linked Data Pipeline Services (Hadoop)




                                                                                                                                                                                                                        AWS Cloud Management
        EMMeT                            Vocabulary SKOS
        Semantic




                                                                                                                                         RDF Validation

                                                                                                                                                          Ontology Svcs
                                           Generation




                                                                                                                          Interlinking
                                                                                                           Reasoning
                                                                                 Transform
        Network




                                                                                 N-Quads
                                                                                  Extract
                                                                                   JSON




                                                                                                                                                                            …
                                        Tagging and Indexing
                                         Services (Concepts,
   Content
   Elsevier




                                         Chapters, Articles,
                                           Guidelines,etc)
                                          RDF Generation                                    Discovery Services (Semantic Knowledgebase)
     3rd Party




                    Content
     Content


                     Instit.




                                                                               Amazon                            MongoDB                                      SOLR/SIRE                                 Virtuoso
                                                                               S3                                 NoSQL                                           n                                    Triplestore
                                           Product-specific
                                            Smart Content                     Access &                                                                                     Admin &
                                                                                                                   Atom Feed                                                                               Analytics
                                            Search Index                     Entitlements                                                                                 Monitoring
                                                                             Discovery Svc                             Ontology
                                                                                                                                                                          SPARQL                             Alerts
                                                                              API (REST)                               Service


                                                                                                                                                                                                                       19
Elsevier Proprietary and Confidential
Clinical Key - the most clinically relevant answers




 Elsevier Proprietary and Confidential
Clinical Key - the most clinically relevant answers




 Elsevier Proprietary and Confidential
Comprehensive Drug Research




                                        • Moving world-class content online to Point of Care.
                                        • Extracted knowledge is linked for further enrichment.
                                        • Information is condensed, immediate and actionable.




Elsevier Proprietary and Confidential
Linking Patient Data To Evidence-Based Research




                                        - Discover knowledge from research relevant to a
                                        patient profile
                                        - Alerts on FDA Announcements.




Elsevier Proprietary and Confidential
SciVerse Widgets Powered by Smart Content




                        Article search on ScienceDirect results in related
                        specialty content recommendations available from
                        The Lancet Journal.




Elsevier Proprietary and Confidential
Questions




  Iker Huerga
  i.huerga@elsevier.com


Elsevier Proprietary and Confidential

Weitere ähnliche Inhalte

Ähnlich wie W3C HCLS

Scio12 sem web_final
Scio12 sem web_finalScio12 sem web_final
Scio12 sem web_finalKristi Holmes
 
Experiences in building an ontology driven image database for ...
Experiences in building an ontology driven image database for ...Experiences in building an ontology driven image database for ...
Experiences in building an ontology driven image database for ...Carla Lima
 
Semantics empowered Physical-Cyber-Social Systems for EarthCube
Semantics empowered Physical-Cyber-Social Systems for EarthCubeSemantics empowered Physical-Cyber-Social Systems for EarthCube
Semantics empowered Physical-Cyber-Social Systems for EarthCubeAmit Sheth
 
Women's Engineering Society, UK; 11 September 2009
Women's Engineering Society, UK; 11 September 2009Women's Engineering Society, UK; 11 September 2009
Women's Engineering Society, UK; 11 September 2009Wendy Schultz
 
Industry Transformation via Health Analytics
Industry Transformation via Health AnalyticsIndustry Transformation via Health Analytics
Industry Transformation via Health AnalyticsJason Burke
 
Found in Space: Creating and Visualizing IEEE Abstract Space for Publication ...
Found in Space: Creating and Visualizing IEEE Abstract Space for Publication ...Found in Space: Creating and Visualizing IEEE Abstract Space for Publication ...
Found in Space: Creating and Visualizing IEEE Abstract Space for Publication ...TSoholt
 
Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...
Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...
Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...ASIS&T
 
Cs1050
Cs1050Cs1050
Cs1050nmjb
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...William Gunn
 
Program of Academic Excellence
Program of Academic ExcellenceProgram of Academic Excellence
Program of Academic ExcellenceDarrell W. Gunter
 
If we build it will they come?
If we build it will they come?If we build it will they come?
If we build it will they come?myGrid team
 
Qualitative Methods Workshop Day 1
Qualitative Methods Workshop Day 1Qualitative Methods Workshop Day 1
Qualitative Methods Workshop Day 1Jason Rutter
 
Knowledge management for integrative omics data analysis
Knowledge management for integrative omics data analysisKnowledge management for integrative omics data analysis
Knowledge management for integrative omics data analysisCOST action BM1006
 
Smart Content Conference How Semantic Tech Helps Scientific Research
Smart Content Conference How Semantic Tech Helps Scientific ResearchSmart Content Conference How Semantic Tech Helps Scientific Research
Smart Content Conference How Semantic Tech Helps Scientific ResearchDarrell W. Gunter
 
Jonathan Izant AAAS Annual Meeting 2012-02-18
Jonathan Izant AAAS Annual Meeting 2012-02-18Jonathan Izant AAAS Annual Meeting 2012-02-18
Jonathan Izant AAAS Annual Meeting 2012-02-18Sage Base
 
If we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleIf we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleCarole Goble
 

Ähnlich wie W3C HCLS (20)

Scio12 sem web_final
Scio12 sem web_finalScio12 sem web_final
Scio12 sem web_final
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
 
Experiences in building an ontology driven image database for ...
Experiences in building an ontology driven image database for ...Experiences in building an ontology driven image database for ...
Experiences in building an ontology driven image database for ...
 
Semantics empowered Physical-Cyber-Social Systems for EarthCube
Semantics empowered Physical-Cyber-Social Systems for EarthCubeSemantics empowered Physical-Cyber-Social Systems for EarthCube
Semantics empowered Physical-Cyber-Social Systems for EarthCube
 
Women's Engineering Society, UK; 11 September 2009
Women's Engineering Society, UK; 11 September 2009Women's Engineering Society, UK; 11 September 2009
Women's Engineering Society, UK; 11 September 2009
 
Industry Transformation via Health Analytics
Industry Transformation via Health AnalyticsIndustry Transformation via Health Analytics
Industry Transformation via Health Analytics
 
Found in Space: Creating and Visualizing IEEE Abstract Space for Publication ...
Found in Space: Creating and Visualizing IEEE Abstract Space for Publication ...Found in Space: Creating and Visualizing IEEE Abstract Space for Publication ...
Found in Space: Creating and Visualizing IEEE Abstract Space for Publication ...
 
Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...
Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...
Curriculum Development at the Tetherless World Constellation - Peter Fox - RD...
 
Cs1050
Cs1050Cs1050
Cs1050
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
 
Program of Academic Excellence
Program of Academic ExcellenceProgram of Academic Excellence
Program of Academic Excellence
 
If we build it will they come?
If we build it will they come?If we build it will they come?
If we build it will they come?
 
Qualitative Methods Workshop Day 1
Qualitative Methods Workshop Day 1Qualitative Methods Workshop Day 1
Qualitative Methods Workshop Day 1
 
Knowledge management for integrative omics data analysis
Knowledge management for integrative omics data analysisKnowledge management for integrative omics data analysis
Knowledge management for integrative omics data analysis
 
Smart Content Conference How Semantic Tech Helps Scientific Research
Smart Content Conference How Semantic Tech Helps Scientific ResearchSmart Content Conference How Semantic Tech Helps Scientific Research
Smart Content Conference How Semantic Tech Helps Scientific Research
 
Jonathan Izant AAAS Annual Meeting 2012-02-18
Jonathan Izant AAAS Annual Meeting 2012-02-18Jonathan Izant AAAS Annual Meeting 2012-02-18
Jonathan Izant AAAS Annual Meeting 2012-02-18
 
If we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleIf we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote Goble
 
Fypca4
Fypca4Fypca4
Fypca4
 
Fypca4
Fypca4Fypca4
Fypca4
 
R vaughan
R vaughanR vaughan
R vaughan
 

Kürzlich hochgeladen

PULMONARY EMBOLISM AND ITS MANAGEMENTS.pdf
PULMONARY EMBOLISM AND ITS MANAGEMENTS.pdfPULMONARY EMBOLISM AND ITS MANAGEMENTS.pdf
PULMONARY EMBOLISM AND ITS MANAGEMENTS.pdfDolisha Warbi
 
VarSeq 2.6.0: Advancing Pharmacogenomics and Genomic Analysis
VarSeq 2.6.0: Advancing Pharmacogenomics and Genomic AnalysisVarSeq 2.6.0: Advancing Pharmacogenomics and Genomic Analysis
VarSeq 2.6.0: Advancing Pharmacogenomics and Genomic AnalysisGolden Helix
 
Lippincott Microcards_ Microbiology Flash Cards-LWW (2015).pdf
Lippincott Microcards_ Microbiology Flash Cards-LWW (2015).pdfLippincott Microcards_ Microbiology Flash Cards-LWW (2015).pdf
Lippincott Microcards_ Microbiology Flash Cards-LWW (2015).pdfSreeja Cherukuru
 
The next social challenge to public health: the information environment.pptx
The next social challenge to public health:  the information environment.pptxThe next social challenge to public health:  the information environment.pptx
The next social challenge to public health: the information environment.pptxTina Purnat
 
COVID-19 (NOVEL CORONA VIRUS DISEASE PANDEMIC ).pptx
COVID-19  (NOVEL CORONA  VIRUS DISEASE PANDEMIC ).pptxCOVID-19  (NOVEL CORONA  VIRUS DISEASE PANDEMIC ).pptx
COVID-19 (NOVEL CORONA VIRUS DISEASE PANDEMIC ).pptxBibekananda shah
 
world health day presentation ppt download
world health day presentation ppt downloadworld health day presentation ppt download
world health day presentation ppt downloadAnkitKumar311566
 
Presentation on General Anesthetics pdf.
Presentation on General Anesthetics pdf.Presentation on General Anesthetics pdf.
Presentation on General Anesthetics pdf.Prerana Jadhav
 
Case Report Peripartum Cardiomyopathy.pptx
Case Report Peripartum Cardiomyopathy.pptxCase Report Peripartum Cardiomyopathy.pptx
Case Report Peripartum Cardiomyopathy.pptxNiranjan Chavan
 
History and Development of Pharmacovigilence.pdf
History and Development of Pharmacovigilence.pdfHistory and Development of Pharmacovigilence.pdf
History and Development of Pharmacovigilence.pdfSasikiranMarri
 
April 2024 ONCOLOGY CARTOON by DR KANHU CHARAN PATRO
April 2024 ONCOLOGY CARTOON by  DR KANHU CHARAN PATROApril 2024 ONCOLOGY CARTOON by  DR KANHU CHARAN PATRO
April 2024 ONCOLOGY CARTOON by DR KANHU CHARAN PATROKanhu Charan
 
SWD (Short wave diathermy)- Physiotherapy.ppt
SWD (Short wave diathermy)- Physiotherapy.pptSWD (Short wave diathermy)- Physiotherapy.ppt
SWD (Short wave diathermy)- Physiotherapy.pptMumux Mirani
 
Biomechanics- Shoulder Joint!!!!!!!!!!!!
Biomechanics- Shoulder Joint!!!!!!!!!!!!Biomechanics- Shoulder Joint!!!!!!!!!!!!
Biomechanics- Shoulder Joint!!!!!!!!!!!!ibtesaam huma
 
Tans femoral Amputee : Prosthetics Knee Joints.pptx
Tans femoral Amputee : Prosthetics Knee Joints.pptxTans femoral Amputee : Prosthetics Knee Joints.pptx
Tans femoral Amputee : Prosthetics Knee Joints.pptxKezaiah S
 
Music Therapy's Impact in Palliative Care| IAPCON2024| Dr. Tara Rajendran
Music Therapy's Impact in Palliative Care| IAPCON2024| Dr. Tara RajendranMusic Therapy's Impact in Palliative Care| IAPCON2024| Dr. Tara Rajendran
Music Therapy's Impact in Palliative Care| IAPCON2024| Dr. Tara RajendranTara Rajendran
 
epilepsy and status epilepticus for undergraduate.pptx
epilepsy and status epilepticus  for undergraduate.pptxepilepsy and status epilepticus  for undergraduate.pptx
epilepsy and status epilepticus for undergraduate.pptxMohamed Rizk Khodair
 
Apiculture Chapter 1. Introduction 2.ppt
Apiculture Chapter 1. Introduction 2.pptApiculture Chapter 1. Introduction 2.ppt
Apiculture Chapter 1. Introduction 2.pptkedirjemalharun
 
MedDRA-A-Comprehensive-Guide-to-Standardized-Medical-Terminology.pdf
MedDRA-A-Comprehensive-Guide-to-Standardized-Medical-Terminology.pdfMedDRA-A-Comprehensive-Guide-to-Standardized-Medical-Terminology.pdf
MedDRA-A-Comprehensive-Guide-to-Standardized-Medical-Terminology.pdfSasikiranMarri
 
Study on the Impact of FOCUS-PDCA Management Model on the Disinfection Qualit...
Study on the Impact of FOCUS-PDCA Management Model on the Disinfection Qualit...Study on the Impact of FOCUS-PDCA Management Model on the Disinfection Qualit...
Study on the Impact of FOCUS-PDCA Management Model on the Disinfection Qualit...MehranMouzam
 
LUNG TUMORS AND ITS CLASSIFICATIONS.pdf
LUNG TUMORS AND ITS  CLASSIFICATIONS.pdfLUNG TUMORS AND ITS  CLASSIFICATIONS.pdf
LUNG TUMORS AND ITS CLASSIFICATIONS.pdfDolisha Warbi
 

Kürzlich hochgeladen (20)

PULMONARY EMBOLISM AND ITS MANAGEMENTS.pdf
PULMONARY EMBOLISM AND ITS MANAGEMENTS.pdfPULMONARY EMBOLISM AND ITS MANAGEMENTS.pdf
PULMONARY EMBOLISM AND ITS MANAGEMENTS.pdf
 
Epilepsy
EpilepsyEpilepsy
Epilepsy
 
VarSeq 2.6.0: Advancing Pharmacogenomics and Genomic Analysis
VarSeq 2.6.0: Advancing Pharmacogenomics and Genomic AnalysisVarSeq 2.6.0: Advancing Pharmacogenomics and Genomic Analysis
VarSeq 2.6.0: Advancing Pharmacogenomics and Genomic Analysis
 
Lippincott Microcards_ Microbiology Flash Cards-LWW (2015).pdf
Lippincott Microcards_ Microbiology Flash Cards-LWW (2015).pdfLippincott Microcards_ Microbiology Flash Cards-LWW (2015).pdf
Lippincott Microcards_ Microbiology Flash Cards-LWW (2015).pdf
 
The next social challenge to public health: the information environment.pptx
The next social challenge to public health:  the information environment.pptxThe next social challenge to public health:  the information environment.pptx
The next social challenge to public health: the information environment.pptx
 
COVID-19 (NOVEL CORONA VIRUS DISEASE PANDEMIC ).pptx
COVID-19  (NOVEL CORONA  VIRUS DISEASE PANDEMIC ).pptxCOVID-19  (NOVEL CORONA  VIRUS DISEASE PANDEMIC ).pptx
COVID-19 (NOVEL CORONA VIRUS DISEASE PANDEMIC ).pptx
 
world health day presentation ppt download
world health day presentation ppt downloadworld health day presentation ppt download
world health day presentation ppt download
 
Presentation on General Anesthetics pdf.
Presentation on General Anesthetics pdf.Presentation on General Anesthetics pdf.
Presentation on General Anesthetics pdf.
 
Case Report Peripartum Cardiomyopathy.pptx
Case Report Peripartum Cardiomyopathy.pptxCase Report Peripartum Cardiomyopathy.pptx
Case Report Peripartum Cardiomyopathy.pptx
 
History and Development of Pharmacovigilence.pdf
History and Development of Pharmacovigilence.pdfHistory and Development of Pharmacovigilence.pdf
History and Development of Pharmacovigilence.pdf
 
April 2024 ONCOLOGY CARTOON by DR KANHU CHARAN PATRO
April 2024 ONCOLOGY CARTOON by  DR KANHU CHARAN PATROApril 2024 ONCOLOGY CARTOON by  DR KANHU CHARAN PATRO
April 2024 ONCOLOGY CARTOON by DR KANHU CHARAN PATRO
 
SWD (Short wave diathermy)- Physiotherapy.ppt
SWD (Short wave diathermy)- Physiotherapy.pptSWD (Short wave diathermy)- Physiotherapy.ppt
SWD (Short wave diathermy)- Physiotherapy.ppt
 
Biomechanics- Shoulder Joint!!!!!!!!!!!!
Biomechanics- Shoulder Joint!!!!!!!!!!!!Biomechanics- Shoulder Joint!!!!!!!!!!!!
Biomechanics- Shoulder Joint!!!!!!!!!!!!
 
Tans femoral Amputee : Prosthetics Knee Joints.pptx
Tans femoral Amputee : Prosthetics Knee Joints.pptxTans femoral Amputee : Prosthetics Knee Joints.pptx
Tans femoral Amputee : Prosthetics Knee Joints.pptx
 
Music Therapy's Impact in Palliative Care| IAPCON2024| Dr. Tara Rajendran
Music Therapy's Impact in Palliative Care| IAPCON2024| Dr. Tara RajendranMusic Therapy's Impact in Palliative Care| IAPCON2024| Dr. Tara Rajendran
Music Therapy's Impact in Palliative Care| IAPCON2024| Dr. Tara Rajendran
 
epilepsy and status epilepticus for undergraduate.pptx
epilepsy and status epilepticus  for undergraduate.pptxepilepsy and status epilepticus  for undergraduate.pptx
epilepsy and status epilepticus for undergraduate.pptx
 
Apiculture Chapter 1. Introduction 2.ppt
Apiculture Chapter 1. Introduction 2.pptApiculture Chapter 1. Introduction 2.ppt
Apiculture Chapter 1. Introduction 2.ppt
 
MedDRA-A-Comprehensive-Guide-to-Standardized-Medical-Terminology.pdf
MedDRA-A-Comprehensive-Guide-to-Standardized-Medical-Terminology.pdfMedDRA-A-Comprehensive-Guide-to-Standardized-Medical-Terminology.pdf
MedDRA-A-Comprehensive-Guide-to-Standardized-Medical-Terminology.pdf
 
Study on the Impact of FOCUS-PDCA Management Model on the Disinfection Qualit...
Study on the Impact of FOCUS-PDCA Management Model on the Disinfection Qualit...Study on the Impact of FOCUS-PDCA Management Model on the Disinfection Qualit...
Study on the Impact of FOCUS-PDCA Management Model on the Disinfection Qualit...
 
LUNG TUMORS AND ITS CLASSIFICATIONS.pdf
LUNG TUMORS AND ITS  CLASSIFICATIONS.pdfLUNG TUMORS AND ITS  CLASSIFICATIONS.pdf
LUNG TUMORS AND ITS CLASSIFICATIONS.pdf
 

W3C HCLS

  • 1. Elsevier Health Sciences Smart Content Drives Smart Applications Linked Data in HCLS for Commercial Applications Semantic Web for Health Care and Life Iker Huerga Sciences Summer School, W3C@MIT Sr. Semantic Software Engineer August 29, 2012 i.huerga@elsevier.com @ihuerga Elsevier Proprietary and Confidential
  • 3. The Challenge Providing doctors/researchers with the right information in the right moment to make the best decisions Elsevier Health Sciences | Proprietary and Confidential
  • 4. How to solve it? Elsevier Proprietary and Confidential
  • 5. How to Solve it •Step 1 Making Elsevier’s authoritative Health Care and Life Sciences content “smarter” •Step 2 Enriching Elsevier’s content by integration with third party data •Step 3 Creating interfaces to provide fast discoverability of the most relevant answers and more intuitive searching. We Need Semantic Web for all This Elsevier Health Sciences | Proprietary and Confidential
  • 6. Introducing Smart Content Elsevier Proprietary and Confidential
  • 7. Taxonomy-Powered Content = Smart Content Content with applied taxonomy Content today with structured XML Copyright 2011 Outsell Gilbane Services, Inc. Elsevier Proprietary and Confidential http://www.outsellinc.com http://gilbane.com/xml/2009/11/what-is-smart-content.html#ixzz0hnuRhaBc
  • 8. Smart Content At Elsevier Smart Content Applications Better discovery through semantic search & navigation Linked data from •Faceted search & browse partners and the Web •Ontology-driven navigation •Task-specific results •Personalized/localized results •Question answering •Link to evidenced-based content Text Better understanding through analysis and Entities, visualization Elsevier concepts •Tag clouds content Tables and •Heatmaps •Streamgraphs relationships •Scatterplots •Time series Images •Animations New knowledge through aggregation and synthesis •Topic pages Elsevier •Social network maps knowledge •Geolocation maps organization •Data mashups systems •Text mining reports 8 Elsevier Proprietary and Confidentiall
  • 9. Introducing EMMeT (Elsevier Merged Medical Taxonomy) Parent Terms • Breast Disorders 2 • Cancer of the Thorax • Mammary Neoplasms • More…. Symptoms Breast Lump, Nipple Retraction, ….. Medical Name Diagnostic Malignant Neoplasm of the Breast Mammography, Breast Biopsy, ….. Procedures Consumer Friendly Name Breast Cancer Synonyms 1 4 Malignant Tumor of Breast Treatment Chemotherapy, Mastectomy, …. Malignant Breast Neoplasm Procedures Semantic Relationships Breast Ca Codes ICD9 – 174.9 MeSH – D001943 Medications Tamoxifen, Doxorubicin, ….. SNOMED-CT – 190121004 Semantic Type/Group Neoplastic Process/Disease Risk Factors Family History, Genetics, Predisposition, …. Children Terms • Breast Sarcoma 3 Prevention Screening, Preemptive Mastectomy, …. • Familial Breast Cancer • Malignant lymphoma of the Breast • Malignant Neoplasm of the breast outer quadrant Complications Metastatic Cancer, …. • More… Elsevier Proprietary and Confidential
  • 10. Automated Indexing: Weighted Tags for Better Search Article-level SMART Content tags help confirm relevance and provide a topical overview about a piece of content. Paragraph-level SMART Content tags uncover highly-relevant information not necessarily evident from the title or abstract alone. Elsevier Proprietary and Confidential
  • 11. Standards The Key Piece Elsevier Proprietary and Confidential
  • 12. The Satellite: a Linked Data Compliant Data Format • Motivations: –Help answer research questions –Direct material to interested readers –Extract disparate facts from the literature to create knowledge bases Satellite Specification First Version •Use RDF/XML serialization • Technical Requirements: •Use XML Schemas to validate the syntax –Use of open standards based so that document which validate will metadata frameworks: SKOS, DCMI produce correct RDF and SWAN –Need of a common model to represent •Use the extensive XML-capable ontological annotations infrastructure, QA tools, etc. –Data will be transferred from suppliers to Elsevier and back –QA of tags (aka Provenance) –Some people have RDF knowledge, but very limited in proportion Elsevier Proprietary and Confidential
  • 13. The Satellite Format: a Linked Data Compliant Data Elsevier Health Sciences | Proprietary and Confidential
  • 14. The Satellite Format: a Linked Data Compliant Data •What we have learned so far –RDF/XML has some limitations • Not all RDF graphs can be serialized in XML (QNames, Unicode characters) • There is no support for RDF Graphs in RDF/XML, at the moment one satellite is one RDF Graph in the LDR • Complexity of RDF/XML abbreviation rules • Can’t put attributes on the predicates –An XML Capable infrastructure does not necessarily entail an RDF/XML Capable infrastructure • Many XML tools can’t be used with RDF/XML • Multiple different serializations for the same RDF Graph exist • XML Schema validation makes the specification less flexible It’s time to move towards a more “RDF friendly” serialization Elsevier Health Sciences | Proprietary and Confidential
  • 15. The Satellite Format: a Linked Data Compliant Data •Turtle as the RDF serialization format –It is becoming the de facto serialization for RDF –It makes RDF much more ‘human friendly’ –Gives us the flexibility we need for the next satellite generation –All the Libraries we are currently using support Turtle –It follows the triple pattern syntax of SPARQL, more convenient for querying •Steps to the transition –Both serializations will coexist for a period of time –Internal tools, Validation, QA, etc., need to be adapted to ‘understand’ Turtle –Tools for transforming RDF/XML into Turtle needs to be provided to the suppliers Elsevier Health Sciences | Proprietary and Confidential
  • 16. How is all this transformed into Commercial applications Elsevier Proprietary and Confidential
  • 17. The Linked Data Repository • The LDR stores metadata describing Non Information Resources [httpRange14] • The LDR provides a rich semantic layer on top of IR and enables search and discovery of metadata • Extends Elsevier extracted knowledge by interlinking data with other related sources of content from partners and the Web, using the Web as its API • Optimized for high-volume of RDF I/O operations • Provide service layer APIs for ease of integration • Opens up discovery and utility of content beyond searchable documents Elsevier Health Sciences | Proprietary and Confidential
  • 18. Represent Enhancements and Vocabularies In RDF Satellites •Creation of Satellite Standards –Linked data compliant RDF representing metadata objects –Leverage common namespaces from dct, pav, rdf, skos –Taxonomies in SKOS to enhance portability in the linked data world LDR –Subject tagging against a vocabulary representing extracted knowledge –Concept URIs that can be equated to URIs in linked data •Example RDF Statements –Tags from a taxonomy for a given document –Document sections relevant to a given concept –Document sections providing answers to a given question –Genes mentioned in a given document –Documents supporting or disputing conclusions of a given document Elsevier Proprietary and Confidential –Concepts in the areas of expertise for a given author
  • 19. LDR Semantic Infrastructure Linked Data Linked Data Loader (REST) Data Space Services Vocab & Annotation Linked Data Annotation Satellites Satellites Satellites 3rd Party Vocab Asset RDF Data Satellites Smart Content Indexing Pipeline Linked Data Pipeline Services (Hadoop) AWS Cloud Management EMMeT Vocabulary SKOS Semantic RDF Validation Ontology Svcs Generation Interlinking Reasoning Transform Network N-Quads Extract JSON … Tagging and Indexing Services (Concepts, Content Elsevier Chapters, Articles, Guidelines,etc) RDF Generation Discovery Services (Semantic Knowledgebase) 3rd Party Content Content Instit. Amazon MongoDB SOLR/SIRE Virtuoso S3 NoSQL n Triplestore Product-specific Smart Content Access & Admin & Atom Feed Analytics Search Index Entitlements Monitoring Discovery Svc Ontology SPARQL Alerts API (REST) Service 19 Elsevier Proprietary and Confidential
  • 20. Clinical Key - the most clinically relevant answers Elsevier Proprietary and Confidential
  • 21. Clinical Key - the most clinically relevant answers Elsevier Proprietary and Confidential
  • 22. Comprehensive Drug Research • Moving world-class content online to Point of Care. • Extracted knowledge is linked for further enrichment. • Information is condensed, immediate and actionable. Elsevier Proprietary and Confidential
  • 23. Linking Patient Data To Evidence-Based Research - Discover knowledge from research relevant to a patient profile - Alerts on FDA Announcements. Elsevier Proprietary and Confidential
  • 24. SciVerse Widgets Powered by Smart Content Article search on ScienceDirect results in related specialty content recommendations available from The Lancet Journal. Elsevier Proprietary and Confidential
  • 25. Questions Iker Huerga i.huerga@elsevier.com Elsevier Proprietary and Confidential