SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Microdata and ontologies
Simon Jupp and Tony Burdett
Samples, Phenotypes and Ontologies Team
The European Bioinformatics Institute
Ontologies in the life sciences
• Life sciences been quick to adopt ontologies for
annotation of data
• Over 200 biomedical ontologies in active use
• Large amount of data at EMBL-EBI annotated to
ontologies
• A lot of it still hidden in backend databases
• Rarely exposed in a structured way
<a href=http://www.ebi.ac.uk/efo/EFO_0000400>diabetic</a>
Ontology concept
Ontology terms sometimes hyperlinked
Ontology Lookup Service
• http://www.ebi.ac.uk/ols/beta/
• 140 biomedical ontologies
• 3.4 million terms
• 11 million relationships
Connecting ontologies to the data
Data
Terms Ontology
Defined byisAbout
Connecting ontologies to the data
Biosamples entry
(Diabetic mouse strain)
Diabetes term
EFO_0000400
Experimental
Factor
Ontology
Defined byisAbout
Schema.org properties for ontology terms
Ontology term = vocab:MedicalCode
Ontology term = vocab:MedicalCode
Ontology = vocab:CreativeWork
Ontology = vocab:CreativeWork
Biosamples page = vocab:MedicalEntity
Biosamples page = vocab:MedicalEntity
Using Schema.org to connect these
resources
Organization
- name
MedicalEntity
- name
- description
MedicalCode
- codeValue
…
MedicalCode
- name
- url
- alternateName
- description
- codeValue
- codingSystem
…
CreativeWork
- about
- name
- description
- url
- datePublished
…
Data Term Ontology
Using Schema.org to connect these
resources
What ontologies are used in <my data>?
Using Schema.org to connect these
resources
Organization
- name
MedicalEntity
- name
- description
MedicalCode
- codeValue
MedicalCode
- name
- url
- alternateName
- description
- codeValue
- codingSystem
…
CreativeWork
- about
- name
- description
- url
- datePublished
…
Data Term Ontology
Using Schema.org to connect these
resources
What is <my data> broadly about?
What is the biosamples page about?
Organization
- name
MedicalEntity
- name
- description
MedicalCode
- codeValue
MedicalCode
- name
- url
- alternateName
- description
- codeValue
- codingSystem
…
CreativeWork
- about (disease)
- name
- description
- url
- datePublished
…
Data Term Ontology
Using Schema.org to connect these
resources
Which databases are using <my ontology>?
Where is an ontology/term being used?
Organization
- name
MedicalEntity
- name
- description
MedicalCode
- codeValue
- codingSystem
MedicalCode
- name
- url
- alternateName
- description
- codeValue
- codingSystem
…
CreativeWork
- about
- name
- description
- url
- datePublished
…
Data Term Ontology
Using Schema.org to connect these
resources
Can I use an ontology to enrich the search over <my data>?
Enriching content
Organization
- name
MedicalEntity
- name
- description
MedicalCode
- codeValue
- codingSystem
MedicalCode
- name
- url
- alternateName
- description
- codeValue
- codingSystem
…
CreativeWork
- about
- name
- description
- url
- datePublished
…
Data Term Ontology
Google custom search engine over
vocab:MedicalCode
Schema.org questions
• MedicalEntity / MedicalCode too narrow
• We have plants and other non-medical entities
• Ontology/Terminology as a CreativeWork?
• Where does schema.org stop?
• AnatomicalStructure > Bone, Nerve, Muscle seem very
specific
What next
• Develop patterns and best practice for schema.org
markup for data + ontology
• Pilot to add markup to Biosamples and GWAS website
• Develop more use cases
• How to exploit Google CSE
• What would a rich snippets for data + ontology look like?

Weitere ähnliche Inhalte

Was ist angesagt?

The MIAPA ontology: An annotation ontology for validating minimum metadata re...
The MIAPA ontology: An annotation ontology for validating minimum metadata re...The MIAPA ontology: An annotation ontology for validating minimum metadata re...
The MIAPA ontology: An annotation ontology for validating minimum metadata re...
Hilmar Lapp
 
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
CEDAR: Center for Expanded Data Annotation and Retrieval
 
Serving the medicinal chemistry community with Royal Society of Chemistry che...
Serving the medicinal chemistry community with Royal Society of Chemistry che...Serving the medicinal chemistry community with Royal Society of Chemistry che...
Serving the medicinal chemistry community with Royal Society of Chemistry che...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Acs collaborative computational technologies for biomedical research an enabl...
Acs collaborative computational technologies for biomedical research an enabl...Acs collaborative computational technologies for biomedical research an enabl...
Acs collaborative computational technologies for biomedical research an enabl...
Sean Ekins
 
Graph DB + Bioinformatics: Bio4j, recent applications and future directions
Graph DB + Bioinformatics:  Bio4j, recent applications and future directions Graph DB + Bioinformatics:  Bio4j, recent applications and future directions
Graph DB + Bioinformatics: Bio4j, recent applications and future directions
Pablo Pareja Tobes
 
20130622 okfn hackathon t2
20130622 okfn hackathon t220130622 okfn hackathon t2
20130622 okfn hackathon t2
Seonho Kim
 

Was ist angesagt? (20)

Bh14 ogo
Bh14 ogoBh14 ogo
Bh14 ogo
 
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
OBOPedia: An Encyclopaedia of Biology Using OBO OntologiesObopedia swat4ls-20...
 
Connected Data for Machine Learning | Paul Groth
Connected Data for Machine Learning | Paul GrothConnected Data for Machine Learning | Paul Groth
Connected Data for Machine Learning | Paul Groth
 
The MIAPA ontology: An annotation ontology for validating minimum metadata re...
The MIAPA ontology: An annotation ontology for validating minimum metadata re...The MIAPA ontology: An annotation ontology for validating minimum metadata re...
The MIAPA ontology: An annotation ontology for validating minimum metadata re...
 
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
 
Bio4j
Bio4jBio4j
Bio4j
 
ACS 248th Paper 71 ChAMP Project
ACS 248th Paper 71 ChAMP ProjectACS 248th Paper 71 ChAMP Project
ACS 248th Paper 71 ChAMP Project
 
Ontologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficientOntologies: Necessary, but not sufficient
Ontologies: Necessary, but not sufficient
 
Serving the medicinal chemistry community with Royal Society of Chemistry che...
Serving the medicinal chemistry community with Royal Society of Chemistry che...Serving the medicinal chemistry community with Royal Society of Chemistry che...
Serving the medicinal chemistry community with Royal Society of Chemistry che...
 
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
 
The FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyThe FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems Biology
 
An Open Repository Model for Acquiring Knowledge About Scientific Experiments
An Open Repository Model for Acquiring Knowledge About Scientific ExperimentsAn Open Repository Model for Acquiring Knowledge About Scientific Experiments
An Open Repository Model for Acquiring Knowledge About Scientific Experiments
 
Acs collaborative computational technologies for biomedical research an enabl...
Acs collaborative computational technologies for biomedical research an enabl...Acs collaborative computational technologies for biomedical research an enabl...
Acs collaborative computational technologies for biomedical research an enabl...
 
Graph DB + Bioinformatics: Bio4j, recent applications and future directions
Graph DB + Bioinformatics:  Bio4j, recent applications and future directions Graph DB + Bioinformatics:  Bio4j, recent applications and future directions
Graph DB + Bioinformatics: Bio4j, recent applications and future directions
 
Bio solr building a better search for bioinformatics
Bio solr   building a better search for bioinformaticsBio solr   building a better search for bioinformatics
Bio solr building a better search for bioinformatics
 
The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata ...
The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata ...The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata ...
The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata ...
 
20130622 okfn hackathon t2
20130622 okfn hackathon t220130622 okfn hackathon t2
20130622 okfn hackathon t2
 
Citing data in research articles: principles, implementation, challenges - an...
Citing data in research articles: principles, implementation, challenges - an...Citing data in research articles: principles, implementation, challenges - an...
Citing data in research articles: principles, implementation, challenges - an...
 
Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.
 
FAIR data and model management for systems biology.
FAIR data and model management for systems biology.FAIR data and model management for systems biology.
FAIR data and model management for systems biology.
 

Ähnlich wie schema.org and biomedical ontologies

Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical Sciences
Connected Data World
 
The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960
mare34
 

Ähnlich wie schema.org and biomedical ontologies (20)

NCBO haendel talk 2013
NCBO haendel talk 2013NCBO haendel talk 2013
NCBO haendel talk 2013
 
Computing on the shoulders of giants
Computing on the shoulders of giantsComputing on the shoulders of giants
Computing on the shoulders of giants
 
Improving online chemistry one structure at a time
Improving online chemistry one structure at a timeImproving online chemistry one structure at a time
Improving online chemistry one structure at a time
 
Whitney Symposium Lecture June 2008
Whitney Symposium Lecture June 2008Whitney Symposium Lecture June 2008
Whitney Symposium Lecture June 2008
 
Ontologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyOntologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontology
 
Ibn Sina
Ibn SinaIbn Sina
Ibn Sina
 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical Sciences
 
Online Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery SystemsOnline Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery Systems
 
Using the NCBO Annotator to Develop an Ontology-Based Index of Biomedical Res...
Using the NCBO Annotator to Develop an Ontology-Based Index of Biomedical Res...Using the NCBO Annotator to Develop an Ontology-Based Index of Biomedical Res...
Using the NCBO Annotator to Develop an Ontology-Based Index of Biomedical Res...
 
Searching the Literature: Search Techniques and Construction
Searching the Literature: Search Techniques and ConstructionSearching the Literature: Search Techniques and Construction
Searching the Literature: Search Techniques and Construction
 
Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003
 
The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960
 
The Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsThe Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of Bioinformatics
 
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
 
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
 
Search /certified fixed orthodontic courses by Indian dental academy
Search /certified fixed orthodontic courses by Indian dental academy Search /certified fixed orthodontic courses by Indian dental academy
Search /certified fixed orthodontic courses by Indian dental academy
 
Semantic Web for Health Care and Biomedical Informatics
Semantic Web for Health Care and Biomedical InformaticsSemantic Web for Health Care and Biomedical Informatics
Semantic Web for Health Care and Biomedical Informatics
 
bioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics databioinformatics enabling knowledge generation from agricultural omics data
bioinformatics enabling knowledge generation from agricultural omics data
 
OSFair2017 Workshop | OmicsDI: Omics discovery index
OSFair2017 Workshop | OmicsDI: Omics discovery indexOSFair2017 Workshop | OmicsDI: Omics discovery index
OSFair2017 Workshop | OmicsDI: Omics discovery index
 
Chibucos annot go_final
Chibucos annot go_finalChibucos annot go_final
Chibucos annot go_final
 

Kürzlich hochgeladen

Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
Areesha Ahmad
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
1301aanya
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
Silpa
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
MohamedFarag457087
 

Kürzlich hochgeladen (20)

CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
An introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingAn introduction on sequence tagged site mapping
An introduction on sequence tagged site mapping
 
Velocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.pptVelocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.ppt
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 

schema.org and biomedical ontologies

  • 1. Microdata and ontologies Simon Jupp and Tony Burdett Samples, Phenotypes and Ontologies Team The European Bioinformatics Institute
  • 2. Ontologies in the life sciences • Life sciences been quick to adopt ontologies for annotation of data • Over 200 biomedical ontologies in active use • Large amount of data at EMBL-EBI annotated to ontologies • A lot of it still hidden in backend databases • Rarely exposed in a structured way
  • 4. Ontology Lookup Service • http://www.ebi.ac.uk/ols/beta/ • 140 biomedical ontologies • 3.4 million terms • 11 million relationships
  • 5. Connecting ontologies to the data Data Terms Ontology Defined byisAbout
  • 6. Connecting ontologies to the data Biosamples entry (Diabetic mouse strain) Diabetes term EFO_0000400 Experimental Factor Ontology Defined byisAbout
  • 7. Schema.org properties for ontology terms
  • 8. Ontology term = vocab:MedicalCode
  • 9. Ontology term = vocab:MedicalCode
  • 12. Biosamples page = vocab:MedicalEntity
  • 13. Biosamples page = vocab:MedicalEntity
  • 14. Using Schema.org to connect these resources Organization - name MedicalEntity - name - description MedicalCode - codeValue … MedicalCode - name - url - alternateName - description - codeValue - codingSystem … CreativeWork - about - name - description - url - datePublished … Data Term Ontology
  • 15. Using Schema.org to connect these resources What ontologies are used in <my data>?
  • 16. Using Schema.org to connect these resources Organization - name MedicalEntity - name - description MedicalCode - codeValue MedicalCode - name - url - alternateName - description - codeValue - codingSystem … CreativeWork - about - name - description - url - datePublished … Data Term Ontology
  • 17. Using Schema.org to connect these resources What is <my data> broadly about?
  • 18. What is the biosamples page about? Organization - name MedicalEntity - name - description MedicalCode - codeValue MedicalCode - name - url - alternateName - description - codeValue - codingSystem … CreativeWork - about (disease) - name - description - url - datePublished … Data Term Ontology
  • 19. Using Schema.org to connect these resources Which databases are using <my ontology>?
  • 20. Where is an ontology/term being used? Organization - name MedicalEntity - name - description MedicalCode - codeValue - codingSystem MedicalCode - name - url - alternateName - description - codeValue - codingSystem … CreativeWork - about - name - description - url - datePublished … Data Term Ontology
  • 21. Using Schema.org to connect these resources Can I use an ontology to enrich the search over <my data>?
  • 22. Enriching content Organization - name MedicalEntity - name - description MedicalCode - codeValue - codingSystem MedicalCode - name - url - alternateName - description - codeValue - codingSystem … CreativeWork - about - name - description - url - datePublished … Data Term Ontology
  • 23. Google custom search engine over vocab:MedicalCode
  • 24. Schema.org questions • MedicalEntity / MedicalCode too narrow • We have plants and other non-medical entities • Ontology/Terminology as a CreativeWork? • Where does schema.org stop? • AnatomicalStructure > Bone, Nerve, Muscle seem very specific
  • 25. What next • Develop patterns and best practice for schema.org markup for data + ontology • Pilot to add markup to Biosamples and GWAS website • Develop more use cases • How to exploit Google CSE • What would a rich snippets for data + ontology look like?