1. Microdata and ontologies
Simon Jupp and Tony Burdett
Samples, Phenotypes and Ontologies Team
The European Bioinformatics Institute
2. Ontologies in the life sciences
• Life sciences been quick to adopt ontologies for
annotation of data
• Over 200 biomedical ontologies in active use
• Large amount of data at EMBL-EBI annotated to
ontologies
• A lot of it still hidden in backend databases
• Rarely exposed in a structured way
14. Using Schema.org to connect these
resources
Organization
- name
MedicalEntity
- name
- description
MedicalCode
- codeValue
…
MedicalCode
- name
- url
- alternateName
- description
- codeValue
- codingSystem
…
CreativeWork
- about
- name
- description
- url
- datePublished
…
Data Term Ontology
15. Using Schema.org to connect these
resources
What ontologies are used in <my data>?
16. Using Schema.org to connect these
resources
Organization
- name
MedicalEntity
- name
- description
MedicalCode
- codeValue
MedicalCode
- name
- url
- alternateName
- description
- codeValue
- codingSystem
…
CreativeWork
- about
- name
- description
- url
- datePublished
…
Data Term Ontology
17. Using Schema.org to connect these
resources
What is <my data> broadly about?
18. What is the biosamples page about?
Organization
- name
MedicalEntity
- name
- description
MedicalCode
- codeValue
MedicalCode
- name
- url
- alternateName
- description
- codeValue
- codingSystem
…
CreativeWork
- about (disease)
- name
- description
- url
- datePublished
…
Data Term Ontology
19. Using Schema.org to connect these
resources
Which databases are using <my ontology>?
20. Where is an ontology/term being used?
Organization
- name
MedicalEntity
- name
- description
MedicalCode
- codeValue
- codingSystem
MedicalCode
- name
- url
- alternateName
- description
- codeValue
- codingSystem
…
CreativeWork
- about
- name
- description
- url
- datePublished
…
Data Term Ontology
21. Using Schema.org to connect these
resources
Can I use an ontology to enrich the search over <my data>?
22. Enriching content
Organization
- name
MedicalEntity
- name
- description
MedicalCode
- codeValue
- codingSystem
MedicalCode
- name
- url
- alternateName
- description
- codeValue
- codingSystem
…
CreativeWork
- about
- name
- description
- url
- datePublished
…
Data Term Ontology
24. Schema.org questions
• MedicalEntity / MedicalCode too narrow
• We have plants and other non-medical entities
• Ontology/Terminology as a CreativeWork?
• Where does schema.org stop?
• AnatomicalStructure > Bone, Nerve, Muscle seem very
specific
25. What next
• Develop patterns and best practice for schema.org
markup for data + ontology
• Pilot to add markup to Biosamples and GWAS website
• Develop more use cases
• How to exploit Google CSE
• What would a rich snippets for data + ontology look like?