SlideShare a Scribd company logo
1 of 38
Use of Uberon in the Bgee database:
How to deal with a complex, large, dynamic
ontology?



Frederic Bastian
Biocuration 2013
A biocurator nightmare?


Ontologies now regularly include thousands of terms.

Complex relations are used, e.g., “transitively proximally
connected to”.

Curators are expected to provide complex annotations, e.g.:
post-composition of terms.



=> How can we simplify the use of complex ontologies?

© 2013 SIB
The Bgee database




                     http://bgee.unil.ch




© 2013 SIB
The Bgee database




                       http://bgee.unil.ch


              Description of anatomy
                and development




© 2013 SIB
The Bgee database




                           http://bgee.unil.ch


Expression data   Description of anatomy
                    and development




 © 2013 SIB
The Bgee database




                           http://bgee.unil.ch


Expression data   Description of anatomy         Homology
                    and development




 © 2013 SIB
The Bgee database




                                   …
             http://tinyurl.com/bgee12-hoxa5a


© 2013 SIB
The Bgee database




                                   …
             http://tinyurl.com/bgee12-hoxa5a


© 2013 SIB
The Bgee database




                                   …
             http://tinyurl.com/bgee12-hoxa5a


© 2013 SIB
Use of anatomical ontologies in Bgee

Several species-specific ontologies were used:
•            ZFA
•            XAO
•            FBbt
•            EMAPA, MA
•            EHDAA, EV




© 2013 SIB
Use of anatomical ontologies in Bgee

Several species-specific ontologies were used:
•            ZFA
•            XAO
•            FBbt
•            EMAPA, MA
•            EHDAA, EV


=> Limitation to add new species
=> Inconsistent anatomical descriptions, different formalisms
adopted, etc.
© 2013 SIB
Homology relations between anatomical ontologies

 To perform automated comparisons:
 • We built groups of homologous organs
 • We organized these groups into an ontology

                   VHOG:0000157 brain
             EHDAA:2629    brain
             EHDAA:300     brain
             EHDAA:830     future brain
             EMAPA:16089   future brain
             EMAPA:16894   brain
             EV:0100164    brain
             MA:0000168    brain
             XAO:0000010   brain
             ZFA:0000008   brain
             ZFA:0000146   presumptive brain

© 2013 SIB
Homology relations between anatomical ontologies

 To perform automated comparisons:
 • We built groups of homologous organs
 • We organized these groups into an ontology

 => vHOG ontology
 vHOG, a multispecies vertebrate ontology of homologous organs groups
 Bioinformatics (2012) 28(7): 1017-1020, 2012.




© 2013 SIB
Homology relations between anatomical ontologies

 To perform automated comparisons:
 • We built groups of homologous organs
 • We organized these groups into an ontology

 => vHOG ontology

 To add a species:
 • All groups need to be re-evaluated
 • The graph structure needs to be updated

 => Not maintainable on the long run

© 2013 SIB
And then came Uberon …
                                                       only_in_taxon
                                      UBERON: bone                      Vertebrata


                                            is_a                                 is_a


Drosophila melanogaster               UBERON: tibia                    Homo sapiens
                           is_a                          is_a
             part_of                                                   part_of




             Fruit fly FBbt ‘tibia’                   Human FMA ‘tibia’

© 2013 SIB
And then came Uberon …
                                                       only_in_taxon
                                      UBERON: bone                      Vertebrata


                                            is_a                                 is_a


Drosophila melanogaster               UBERON: tibia                    Homo sapiens
                           is_a                          is_a
             part_of                                                   part_of




             Fruit fly FBbt ‘tibia’                   Human FMA ‘tibia’

© 2013 SIB
And then came Uberon …


 Uberon also provides a composite ontology:

 Merges terms from species-specific ontologies, when term
 not present in Uberon.
      ....
             is_a UBERON:0003059 ! presomitic mesoderm
                devf UBERON:0002329 ! somite
                  is_a ZFA:0000073 ! somite 5 (zebrafish)
                  is_a ZFA:0000982 ! somite 6 (zebrafish)
                  is_a EHDAA2:0001853 ! somite 05 (embryonic human)
                  is_a EHDAA2:0001854 ! somite 06 (embryonic human)


 => Allow to import data from Model Organism Databases.

© 2013 SIB
And then came Uberon … BUT

 Uberon is complex:

 • About 22 000 terms in the composite ontology




© 2013 SIB
And then came Uberon … BUT

 Uberon is complex:

 • About 22 000 terms in the composite ontology

 • Use of advanced constructs, supported only in OWL
 • Use of high level abstract terms for interoperability




© 2013 SIB
And then came Uberon … BUT

 Uberon is complex:

 • About 22 000 terms in the composite ontology

 • Use of advanced constructs, supported only in OWL
 • Use of high level abstract terms for interoperability

 • Frequently updated, highly responsive
 • Structure changes when any imported species-specific
   ontology changes => even more updated



© 2013 SIB
Uberon cannot be easily browsed




© 2013 SIB
First step: ontology simplification




© 2013 SIB
First step: ontology simplification

 1. Simplification of the relations

 Keep only is_a, part_of, develops_from.

 Map all relations to their ancestors, e.g.:
   develops_directly_from => develops_from




© 2013 SIB
First step: ontology simplification

 2. Removal of redundant relations

            A is_a B; B is_a C;
         => A is_a C is redundant.




© 2013 SIB
First step: ontology simplification

 2. Removal of redundant relations

            A is_a B; B is_a C;
         => A is_a C is redundant.

 But, we consider part_of and is_a relations as equivalent.
       A part_of B; B is_a C
    => A part_of C and A is_a C are considered redundant

 This removes almost all “is_a anatomical entity”


© 2013 SIB
First step: ontology simplification

 3. Removal of relations to upper_level terms
 upper_level subset: "abstract upper-level terms not directly
 useful for analysis”

 Terms useful for analysis are almost all present under
 “upper_level” terms, thus being confusing.

 => remove relations to “upper_level” terms if non-orphan




© 2013 SIB
First step: ontology simplification

 3. Removal of relations to upper_level terms
 upper_level subset: "abstract upper-level terms not directly
 useful for analysis”

 Terms useful for analysis are almost all present under
 “upper_level” terms, thus being confusing.

 => remove relations to “upper_level” terms if non-orphan
      [Term]
      id: MA:0000747
      name: lymph organ (mouse)
      is_a: UBERON:0001062 ! anatomical entity
      relationship: part_of UBERON:0002465 ! lymphoid system


© 2013 SIB
First step: ontology simplification

 3. Removal of relations to upper_level terms
 upper_level subset: "abstract upper-level terms not directly
 useful for analysis”

 Terms useful for analysis are almost all present under
 “upper_level” terms, thus being confusing.

 => remove relations to “upper_level” terms if non-orphan
      [Term]
      id: MA:0000747
      name: lymph organ (mouse)
      is_a: UBERON:0001062 ! anatomical entity
      relationship: part_of UBERON:0002465 ! lymphoid system


© 2013 SIB
First step: ontology simplification

 3. Removal of relations to upper_level terms
 upper_level subset: "abstract upper-level terms not directly
 useful for analysis”

 Terms useful for analysis are almost all present under
 “upper_level” terms, thus being confusing.

 => remove relations to “upper_level” terms if non-orphan
      [Term]
      id: UBERON:0007502
      name: epithelial plexus
      is_a: UBERON:0000480 ! anatomical group



© 2013 SIB
First step: ontology simplification

 4. Generate species-specific versions

 To simplify even more the “composite-metazoan”
 ontology, generate a version for each species used in
 Bgee.




© 2013 SIB
First step: ontology simplification




© 2013 SIB
Second step: track ontology changes


 1. Store annotation status

 - “Perfect” annotation: would not need to be refined as
   long as the term used is not obsoleted.

 - “Missing granularity” annotation: a term is missing in the
   ontology, e.g., vastus lateralis.
   If a new child was added to the term, refine annotation




© 2013 SIB
Second step: track ontology changes


 2. Track ontology changes

 - Compare the versions used between two annotation
   cycles.

 - If a term used in a “missing granularity” annotation has
   new children, refine the annotation.




© 2013 SIB
Conclusion 1/2


 To manage complex, frequently updated ontology:

 1. Provide a formal version for the reasoning, and a
    simplified view for the end-user.

 2. Store annotation status, to focus only on annotations
    which need to be updated.




© 2013 SIB
Conclusion 2/2
 Major update of Bgee incoming for fall 2013:

 - All expression data annotations are being transferred to
   Uberon.
 - All homology information are being transferred from
   vHOG to Uberon, using an external file.




© 2013 SIB
Conclusion 2/2
 Major update of Bgee incoming for fall 2013:

 - All expression data annotations are being transferred to
   Uberon.
 - All homology information are being transferred from
   vHOG to Uberon, using an external file.

 And also:

 - Besides present/absent calls, Bgee will include:
   overexpression calls; biologically significant expression.
 - Revamped interfaces, webservices, APIs, …



© 2013 SIB
Advertisement! Other Bgee-related work


Poster 145:
Average rank IQR: a new improved method for
Affymetrix microarray quality control for meta-
analyses and database curation.
                                                     Marta Rosikiewicz

Database biocuration virtual issue:
Uncovering hidden duplicated content in public transcriptomics
data
Marta Rosikiewicz, Aurélie Comte, Anne Niknejad, Marc Robinson-
Rechavi, and Frederic B. Bastian
Database Vol. 2013, bat010; doi:10.1093/database/bat010

 © 2013 SIB
Thank You




Aurélie Comte     Sébastien Moretti   Anne Niknejad   Marta Rosikiewicz




     Marc Robinson-Rechavi    Komal Sanjeev    Mathieu Seppey

And also:

• Melissa Haendel

• Chris Mungall

More Related Content

Recently uploaded

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 

Recently uploaded (20)

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 

Featured

Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 

Featured (20)

Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 

Use of Uberon in the Bgee database: How to deal with a complex, large, dynamic ontology?

  • 1. Use of Uberon in the Bgee database: How to deal with a complex, large, dynamic ontology? Frederic Bastian Biocuration 2013
  • 2. A biocurator nightmare? Ontologies now regularly include thousands of terms. Complex relations are used, e.g., “transitively proximally connected to”. Curators are expected to provide complex annotations, e.g.: post-composition of terms. => How can we simplify the use of complex ontologies? © 2013 SIB
  • 3. The Bgee database http://bgee.unil.ch © 2013 SIB
  • 4. The Bgee database http://bgee.unil.ch Description of anatomy and development © 2013 SIB
  • 5. The Bgee database http://bgee.unil.ch Expression data Description of anatomy and development © 2013 SIB
  • 6. The Bgee database http://bgee.unil.ch Expression data Description of anatomy Homology and development © 2013 SIB
  • 7. The Bgee database … http://tinyurl.com/bgee12-hoxa5a © 2013 SIB
  • 8. The Bgee database … http://tinyurl.com/bgee12-hoxa5a © 2013 SIB
  • 9. The Bgee database … http://tinyurl.com/bgee12-hoxa5a © 2013 SIB
  • 10. Use of anatomical ontologies in Bgee Several species-specific ontologies were used: • ZFA • XAO • FBbt • EMAPA, MA • EHDAA, EV © 2013 SIB
  • 11. Use of anatomical ontologies in Bgee Several species-specific ontologies were used: • ZFA • XAO • FBbt • EMAPA, MA • EHDAA, EV => Limitation to add new species => Inconsistent anatomical descriptions, different formalisms adopted, etc. © 2013 SIB
  • 12. Homology relations between anatomical ontologies To perform automated comparisons: • We built groups of homologous organs • We organized these groups into an ontology VHOG:0000157 brain EHDAA:2629 brain EHDAA:300 brain EHDAA:830 future brain EMAPA:16089 future brain EMAPA:16894 brain EV:0100164 brain MA:0000168 brain XAO:0000010 brain ZFA:0000008 brain ZFA:0000146 presumptive brain © 2013 SIB
  • 13. Homology relations between anatomical ontologies To perform automated comparisons: • We built groups of homologous organs • We organized these groups into an ontology => vHOG ontology vHOG, a multispecies vertebrate ontology of homologous organs groups Bioinformatics (2012) 28(7): 1017-1020, 2012. © 2013 SIB
  • 14. Homology relations between anatomical ontologies To perform automated comparisons: • We built groups of homologous organs • We organized these groups into an ontology => vHOG ontology To add a species: • All groups need to be re-evaluated • The graph structure needs to be updated => Not maintainable on the long run © 2013 SIB
  • 15. And then came Uberon … only_in_taxon UBERON: bone Vertebrata is_a is_a Drosophila melanogaster UBERON: tibia Homo sapiens is_a is_a part_of part_of Fruit fly FBbt ‘tibia’ Human FMA ‘tibia’ © 2013 SIB
  • 16. And then came Uberon … only_in_taxon UBERON: bone Vertebrata is_a is_a Drosophila melanogaster UBERON: tibia Homo sapiens is_a is_a part_of part_of Fruit fly FBbt ‘tibia’ Human FMA ‘tibia’ © 2013 SIB
  • 17. And then came Uberon … Uberon also provides a composite ontology: Merges terms from species-specific ontologies, when term not present in Uberon. .... is_a UBERON:0003059 ! presomitic mesoderm devf UBERON:0002329 ! somite is_a ZFA:0000073 ! somite 5 (zebrafish) is_a ZFA:0000982 ! somite 6 (zebrafish) is_a EHDAA2:0001853 ! somite 05 (embryonic human) is_a EHDAA2:0001854 ! somite 06 (embryonic human) => Allow to import data from Model Organism Databases. © 2013 SIB
  • 18. And then came Uberon … BUT Uberon is complex: • About 22 000 terms in the composite ontology © 2013 SIB
  • 19. And then came Uberon … BUT Uberon is complex: • About 22 000 terms in the composite ontology • Use of advanced constructs, supported only in OWL • Use of high level abstract terms for interoperability © 2013 SIB
  • 20. And then came Uberon … BUT Uberon is complex: • About 22 000 terms in the composite ontology • Use of advanced constructs, supported only in OWL • Use of high level abstract terms for interoperability • Frequently updated, highly responsive • Structure changes when any imported species-specific ontology changes => even more updated © 2013 SIB
  • 21. Uberon cannot be easily browsed © 2013 SIB
  • 22. First step: ontology simplification © 2013 SIB
  • 23. First step: ontology simplification 1. Simplification of the relations Keep only is_a, part_of, develops_from. Map all relations to their ancestors, e.g.: develops_directly_from => develops_from © 2013 SIB
  • 24. First step: ontology simplification 2. Removal of redundant relations A is_a B; B is_a C; => A is_a C is redundant. © 2013 SIB
  • 25. First step: ontology simplification 2. Removal of redundant relations A is_a B; B is_a C; => A is_a C is redundant. But, we consider part_of and is_a relations as equivalent. A part_of B; B is_a C => A part_of C and A is_a C are considered redundant This removes almost all “is_a anatomical entity” © 2013 SIB
  • 26. First step: ontology simplification 3. Removal of relations to upper_level terms upper_level subset: "abstract upper-level terms not directly useful for analysis” Terms useful for analysis are almost all present under “upper_level” terms, thus being confusing. => remove relations to “upper_level” terms if non-orphan © 2013 SIB
  • 27. First step: ontology simplification 3. Removal of relations to upper_level terms upper_level subset: "abstract upper-level terms not directly useful for analysis” Terms useful for analysis are almost all present under “upper_level” terms, thus being confusing. => remove relations to “upper_level” terms if non-orphan [Term] id: MA:0000747 name: lymph organ (mouse) is_a: UBERON:0001062 ! anatomical entity relationship: part_of UBERON:0002465 ! lymphoid system © 2013 SIB
  • 28. First step: ontology simplification 3. Removal of relations to upper_level terms upper_level subset: "abstract upper-level terms not directly useful for analysis” Terms useful for analysis are almost all present under “upper_level” terms, thus being confusing. => remove relations to “upper_level” terms if non-orphan [Term] id: MA:0000747 name: lymph organ (mouse) is_a: UBERON:0001062 ! anatomical entity relationship: part_of UBERON:0002465 ! lymphoid system © 2013 SIB
  • 29. First step: ontology simplification 3. Removal of relations to upper_level terms upper_level subset: "abstract upper-level terms not directly useful for analysis” Terms useful for analysis are almost all present under “upper_level” terms, thus being confusing. => remove relations to “upper_level” terms if non-orphan [Term] id: UBERON:0007502 name: epithelial plexus is_a: UBERON:0000480 ! anatomical group © 2013 SIB
  • 30. First step: ontology simplification 4. Generate species-specific versions To simplify even more the “composite-metazoan” ontology, generate a version for each species used in Bgee. © 2013 SIB
  • 31. First step: ontology simplification © 2013 SIB
  • 32. Second step: track ontology changes 1. Store annotation status - “Perfect” annotation: would not need to be refined as long as the term used is not obsoleted. - “Missing granularity” annotation: a term is missing in the ontology, e.g., vastus lateralis. If a new child was added to the term, refine annotation © 2013 SIB
  • 33. Second step: track ontology changes 2. Track ontology changes - Compare the versions used between two annotation cycles. - If a term used in a “missing granularity” annotation has new children, refine the annotation. © 2013 SIB
  • 34. Conclusion 1/2 To manage complex, frequently updated ontology: 1. Provide a formal version for the reasoning, and a simplified view for the end-user. 2. Store annotation status, to focus only on annotations which need to be updated. © 2013 SIB
  • 35. Conclusion 2/2 Major update of Bgee incoming for fall 2013: - All expression data annotations are being transferred to Uberon. - All homology information are being transferred from vHOG to Uberon, using an external file. © 2013 SIB
  • 36. Conclusion 2/2 Major update of Bgee incoming for fall 2013: - All expression data annotations are being transferred to Uberon. - All homology information are being transferred from vHOG to Uberon, using an external file. And also: - Besides present/absent calls, Bgee will include: overexpression calls; biologically significant expression. - Revamped interfaces, webservices, APIs, … © 2013 SIB
  • 37. Advertisement! Other Bgee-related work Poster 145: Average rank IQR: a new improved method for Affymetrix microarray quality control for meta- analyses and database curation. Marta Rosikiewicz Database biocuration virtual issue: Uncovering hidden duplicated content in public transcriptomics data Marta Rosikiewicz, Aurélie Comte, Anne Niknejad, Marc Robinson- Rechavi, and Frederic B. Bastian Database Vol. 2013, bat010; doi:10.1093/database/bat010 © 2013 SIB
  • 38. Thank You Aurélie Comte Sébastien Moretti Anne Niknejad Marta Rosikiewicz Marc Robinson-Rechavi Komal Sanjeev Mathieu Seppey And also: • Melissa Haendel • Chris Mungall