SlideShare ist ein Scribd-Unternehmen logo
1 von 21
Downloaden Sie, um offline zu lesen
Graph based storage and retrieval of
           computational models

      Ron Henkel, Martin Scharm, Dagmar Waltemath, Olaf Wolkenhauer


                        Department of Systems Biology and Bioinformatics
                                      University of Rostock


                                      www.sbi.uni-rostock.de
29.11.2012   © 2009 UNIVERSITÄT ROSTOCK
Motivation

                   1000                                                                                                                              120000

                    900
                                                                                                                                                     100000
                    800

                    700
                                                                                                                                                     80000




                                                                                                                                                              Number of Annotations
Number of Models




                    600

                    500                                                                                                                              60000

                    400
                                                                                                                                                     40000
                    300

                    200
                                                                                                                                                     20000
                    100

                      0                                                                                                                              0                  Models
                          Apr Jul Okt Jan Apr Jul Okt Jan Apr Jul Okt Jan Apr Jul Okt Jan Apr Jul Okt Jan Apr Jul Okt Jan Apr Jul Okt Jan Apr Jul
                                                                                                                                                                        Annotation
                          05 05 05 06 06 06 06 07 07 07 07 08 08 08 08 09 09 09 09 10 10 10 10 11 11 11 11 12 12 12

                                                                                                                                                    Data from BioModels Database
                   11.12.2012          © 2009 UNIVERSITÄT ROSTOCK                                                                                                                     2
Motivation



       • Models:

                  Grow in number and complexity
                  Are provided with supplementary material
                  Evolve over time




11.12.2012       © 2009 UNIVERSITÄT ROSTOCK                   3
State of the Art

       • Storage:
                  Relational Databases
                  Model files on Hard Disk Drive (HDD)
                  Additional files (images, result sets, paper)


       • Search:
                  SQL statements
                  Facetted search
                  Data browsing
29.11.2012       © 2009 UNIVERSITÄT ROSTOCK                        4
State of the Art - Demo




11.12.2012   © 2009 UNIVERSITÄT ROSTOCK   5
Available Data for Ranked Retrieval

        Model file                          Annotation & Ontologies       A model‘s network




• Constituent names                  • Biochemical background         • Model structure
• Model code                         • Synonyms                       • Aggregate values




  29.01.2012   © 2009 UNIVERSITÄT ROSTOCK                                                     6
Available Data for Ranked Retrieval
#       aspect                      importance   contained features
1       Administrative              none         ids, file name, version, formalism…
2       Person                      medium       creator, encoder, submitter, publication author
3       Dates                       low          creation and modification date
4       Publication                 high         title, abstract, full-text, journal
5       Constituents                very high    compartment, species, reaction
6       User content                very high    keywords, tags, remarks, changes

    •     The concept is abstract and can be applied to different model formalisms.
    •     Depending on the formalism the aspects can be refined into features.
    •     The model constituents also contain the annotations.
                                                                             Henkel et al. (2010) BMC Bioinf
11.12.2012       © 2009 UNIVERSITÄT ROSTOCK                                                              7
Biomodels Database – A Test Case

       • Apache Lucene Framework

       • Model Index
           425 models, 140.977 terms

       • Semantic Index
           2261 URIs, 409.124 terms




                     http://www.ebi.ac.uk/biomodels-demo/
11.12.2012   © 2009 UNIVERSITÄT ROSTOCK                     8
Demo




11.12.2012   © 2009 UNIVERSITÄT ROSTOCK   9
Improvements

       • Ranking
       • Enhanced query possibilities
                  Required, optional and excluded criteria
                  Allow full-text and Ontology queries
       • Example: “Find cell cycle models”

                     Query           BiomodelsDB    Using IR   Gold Standard
                   cell cycle                 135     173           n/a
                  “cell cycle”                14      26            28

11.12.2012       © 2009 UNIVERSITÄT ROSTOCK                                    10
Available Data for Ranked Retrieval


        Model based                         Annotation & Ontologies          A model‘s network




• Model name                         • Biochemical background             • Include model structure
• Model code                         • Allows to identify e.g. synonyms   • Aggregate values




  29.11,2012   © 2009 UNIVERSITÄT ROSTOCK                                                             11
Mapping a Model to a Database

     A model‘s network




• Include model structure
• Aggregate values




   29.11.2012   © 2009 UNIVERSITÄT ROSTOCK   12
Advantages of Graph Databases

       •     Easy mapping of model structure
       •     Fast browsing through models
       •     Flexible and schema-free storage
       •     Easy linking to models, simulation setups or results,
             and external resources




11.12.2012    © 2009 UNIVERSITÄT ROSTOCK                             13
Document




                                                                                Model




                         R                                  P                        S                       E          C


                               asProduct
                                                                asReactant




                                                                asModifier
                                                                                               isEncodedBy
                                              isVersionOf
                    is




                                                                                is




                                                                                                                   is
                                           uniprot:P0710                     uniprot:Q0339
                 SBO:0000268                                                                 HGNC:8582           GO:0005737
                                                 1                                 3



11.12.2012   © 2009 UNIVERSITÄT ROSTOCK                                                                                       14
11.12.2012   © 2009 UNIVERSITÄT ROSTOCK   15
Preliminary Results

       • All models stored in Biomodels DB were stored into the
         graph database
       • Implemented storage and search in Jummp
                  official demo release upcoming
       • Added 140.811 models from path2models project
                  done, but including annotation blows the memory
                  database scales well and is reasonably fast


11.12.2012       © 2009 UNIVERSITÄT ROSTOCK                          16
Demo




11.12.2012   © 2009 UNIVERSITÄT ROSTOCK          17
Future Work: Relate model versions

                                          • Link successor and predecessor
                                          • Relate changed entities
                                          • Store the diff

                                                     • Enable version control for multi-
                                                       document models
                                                     • Propagate changes for imported models




11.12.2012   © 2009 UNIVERSITÄT ROSTOCK                                                        18
SEMS: Methods for Model & Simulation Management

  Model Version control                      Model Storage                     Model Search

• XML version control         • Relational databases                • Ranked model retrieval
                                Waltemath et al., 2011 (DBSpektrum)   Henkel et al., 2010 (BMC Bioinf)
• Difference detection in XML
  Waltemath et al., submitted • Graph-based storage                 • Structure- and
                                Henkel et al., 2012 (INFORMATIK)      ontology-based search


       Simulation VC                       Simulation Storage                SimulationSearch

• Standardized encoding of simulation setups Waltemath et al., 2011 (BMC SysBiol)
• Linking models and simulation descriptions Henkel et al., 2012 (INFORMATIK)
 11.12.2012   © 2009 UNIVERSITÄT ROSTOCK                                                             19
Take Home Message

       • Ranked retrieval is a necessary feature for model
         databases.
       • The model’s inherent structure should be queryable.
       • Graph based storage reflects well a model‘s encoding
         and evolution.




11.12.2012   © 2009 UNIVERSITÄT ROSTOCK                         20
Thanks for your attention.



                                          Questions?




                                ron.henkel@uni-rostock.de
11.12.2012   © 2009 UNIVERSITÄT ROSTOCK                     21

Weitere ähnliche Inhalte

Ähnlich wie SEMS: Model search and ranked Retrieval (Ron Henkel)

FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)
Carole Goble
 

Ähnlich wie SEMS: Model search and ranked Retrieval (Ron Henkel) (20)

Supporting the Research Data Life Cycle
Supporting the Research Data Life CycleSupporting the Research Data Life Cycle
Supporting the Research Data Life Cycle
 
Automated Extraction of Reactions from the Patent Literature
Automated Extraction of Reactions from the Patent LiteratureAutomated Extraction of Reactions from the Patent Literature
Automated Extraction of Reactions from the Patent Literature
 
Meta-Information for Bio-Models
Meta-Information for Bio-ModelsMeta-Information for Bio-Models
Meta-Information for Bio-Models
 
Bio-Model Meta-Information and SED-ML
Bio-Model Meta-Information and SED-MLBio-Model Meta-Information and SED-ML
Bio-Model Meta-Information and SED-ML
 
Reproducibility of model-based results: standards, infrastructure, and recogn...
Reproducibility of model-based results: standards, infrastructure, and recogn...Reproducibility of model-based results: standards, infrastructure, and recogn...
Reproducibility of model-based results: standards, infrastructure, and recogn...
 
FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)
 
Competitive data science: A tale of two web services
Competitive data science: A tale of two web servicesCompetitive data science: A tale of two web services
Competitive data science: A tale of two web services
 
Simulation experiment descriptions and management
Simulation experiment descriptions and managementSimulation experiment descriptions and management
Simulation experiment descriptions and management
 
OER for repository managers
OER for repository managersOER for repository managers
OER for repository managers
 
International Battery: Applying Lean Manufacturing Processes In The Productio...
International Battery: Applying Lean Manufacturing Processes In The Productio...International Battery: Applying Lean Manufacturing Processes In The Productio...
International Battery: Applying Lean Manufacturing Processes In The Productio...
 
JLibSEDML update April 2011
JLibSEDML update April 2011JLibSEDML update April 2011
JLibSEDML update April 2011
 
Experience with MarkLogic at Elsevier
Experience with MarkLogic at ElsevierExperience with MarkLogic at Elsevier
Experience with MarkLogic at Elsevier
 
Ape 2013 23012013
Ape 2013 23012013Ape 2013 23012013
Ape 2013 23012013
 
Where have all the print journals gone? Adapting Print Collections to an E-ce...
Where have all the print journals gone? Adapting Print Collections to an E-ce...Where have all the print journals gone? Adapting Print Collections to an E-ce...
Where have all the print journals gone? Adapting Print Collections to an E-ce...
 
NoSQL databases and managing big data
NoSQL databases and managing big dataNoSQL databases and managing big data
NoSQL databases and managing big data
 
Standards and software: practical aids for reproducibility of computational r...
Standards and software: practical aids for reproducibility of computational r...Standards and software: practical aids for reproducibility of computational r...
Standards and software: practical aids for reproducibility of computational r...
 
Chem draW
Chem draWChem draW
Chem draW
 
MOST: exploring differences between versions of models in BioModels and in th...
MOST: exploring differences between versions of models in BioModels and in th...MOST: exploring differences between versions of models in BioModels and in th...
MOST: exploring differences between versions of models in BioModels and in th...
 
Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...
 

Mehr von University Medicine Greifswald

Possibilities for integrating model-related data in computational biology (DI...
Possibilities for integrating model-related data in computational biology (DI...Possibilities for integrating model-related data in computational biology (DI...
Possibilities for integrating model-related data in computational biology (DI...
University Medicine Greifswald
 

Mehr von University Medicine Greifswald (19)

A guide to the COMBINE: Navigating through specifications, mailing lists and ...
A guide to the COMBINE: Navigating through specifications, mailing lists and ...A guide to the COMBINE: Navigating through specifications, mailing lists and ...
A guide to the COMBINE: Navigating through specifications, mailing lists and ...
 
When is a model FAIR – and why should we care?
When is a model FAIR – and why should we care?When is a model FAIR – and why should we care?
When is a model FAIR – and why should we care?
 
COMBINE standards & tools: Getting model management right
COMBINE standards & tools: Getting model management rightCOMBINE standards & tools: Getting model management right
COMBINE standards & tools: Getting model management right
 
FAIR data management in biomedicine
FAIR data management  in biomedicineFAIR data management  in biomedicine
FAIR data management in biomedicine
 
Model repositories and standard formats for model reusability
Model repositories and standard formats for model reusabilityModel repositories and standard formats for model reusability
Model repositories and standard formats for model reusability
 
2019 07-04-model reuse-bonn
2019 07-04-model reuse-bonn2019 07-04-model reuse-bonn
2019 07-04-model reuse-bonn
 
Mehr Medizininformatik am Meer
Mehr Medizininformatik am MeerMehr Medizininformatik am Meer
Mehr Medizininformatik am Meer
 
Implementierung Graph-basierter Ansätze für das Management systembiologischer...
Implementierung Graph-basierter Ansätze für das Management systembiologischer...Implementierung Graph-basierter Ansätze für das Management systembiologischer...
Implementierung Graph-basierter Ansätze für das Management systembiologischer...
 
Using Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology modelsUsing Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology models
 
Identifying pattern in reaction networks of computational models
Identifying pattern in reaction networks of computational modelsIdentifying pattern in reaction networks of computational models
Identifying pattern in reaction networks of computational models
 
Management of simulation studies in computational biology
Management of simulation studies in computational biologyManagement of simulation studies in computational biology
Management of simulation studies in computational biology
 
Extended support for standard graphical notations of biological networks in s...
Extended support for standard graphical notations of biological networks in s...Extended support for standard graphical notations of biological networks in s...
Extended support for standard graphical notations of biological networks in s...
 
Modelling sample at SEMS from a graph perspective
Modelling sample at SEMS from a graph perspectiveModelling sample at SEMS from a graph perspective
Modelling sample at SEMS from a graph perspective
 
Coming Soon: de.NBI and SBGN-ED @ SEMS
Coming Soon: de.NBI and SBGN-ED @ SEMSComing Soon: de.NBI and SBGN-ED @ SEMS
Coming Soon: de.NBI and SBGN-ED @ SEMS
 
Masymos: Finding hidden treasures in model repositories
Masymos: Finding hidden treasures in model repositoriesMasymos: Finding hidden treasures in model repositories
Masymos: Finding hidden treasures in model repositories
 
Reproducibility, dissemination, and management of modeling results
Reproducibility, dissemination,  and management of modeling resultsReproducibility, dissemination,  and management of modeling results
Reproducibility, dissemination, and management of modeling results
 
e:Bio Kick-Off Meeting, SEMS
e:Bio Kick-Off Meeting, SEMSe:Bio Kick-Off Meeting, SEMS
e:Bio Kick-Off Meeting, SEMS
 
Possibilities for integrating model-related data in computational biology (DI...
Possibilities for integrating model-related data in computational biology (DI...Possibilities for integrating model-related data in computational biology (DI...
Possibilities for integrating model-related data in computational biology (DI...
 
Sems project overview
Sems project overviewSems project overview
Sems project overview
 

Kürzlich hochgeladen

Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
fonyou31
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 

Kürzlich hochgeladen (20)

Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 

SEMS: Model search and ranked Retrieval (Ron Henkel)

  • 1. Graph based storage and retrieval of computational models Ron Henkel, Martin Scharm, Dagmar Waltemath, Olaf Wolkenhauer Department of Systems Biology and Bioinformatics University of Rostock www.sbi.uni-rostock.de 29.11.2012 © 2009 UNIVERSITÄT ROSTOCK
  • 2. Motivation 1000 120000 900 100000 800 700 80000 Number of Annotations Number of Models 600 500 60000 400 40000 300 200 20000 100 0 0 Models Apr Jul Okt Jan Apr Jul Okt Jan Apr Jul Okt Jan Apr Jul Okt Jan Apr Jul Okt Jan Apr Jul Okt Jan Apr Jul Okt Jan Apr Jul Annotation 05 05 05 06 06 06 06 07 07 07 07 08 08 08 08 09 09 09 09 10 10 10 10 11 11 11 11 12 12 12 Data from BioModels Database 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 2
  • 3. Motivation • Models:  Grow in number and complexity  Are provided with supplementary material  Evolve over time 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 3
  • 4. State of the Art • Storage:  Relational Databases  Model files on Hard Disk Drive (HDD)  Additional files (images, result sets, paper) • Search:  SQL statements  Facetted search  Data browsing 29.11.2012 © 2009 UNIVERSITÄT ROSTOCK 4
  • 5. State of the Art - Demo 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 5
  • 6. Available Data for Ranked Retrieval Model file Annotation & Ontologies A model‘s network • Constituent names • Biochemical background • Model structure • Model code • Synonyms • Aggregate values 29.01.2012 © 2009 UNIVERSITÄT ROSTOCK 6
  • 7. Available Data for Ranked Retrieval # aspect importance contained features 1 Administrative none ids, file name, version, formalism… 2 Person medium creator, encoder, submitter, publication author 3 Dates low creation and modification date 4 Publication high title, abstract, full-text, journal 5 Constituents very high compartment, species, reaction 6 User content very high keywords, tags, remarks, changes • The concept is abstract and can be applied to different model formalisms. • Depending on the formalism the aspects can be refined into features. • The model constituents also contain the annotations. Henkel et al. (2010) BMC Bioinf 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 7
  • 8. Biomodels Database – A Test Case • Apache Lucene Framework • Model Index  425 models, 140.977 terms • Semantic Index  2261 URIs, 409.124 terms http://www.ebi.ac.uk/biomodels-demo/ 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 8
  • 9. Demo 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 9
  • 10. Improvements • Ranking • Enhanced query possibilities  Required, optional and excluded criteria  Allow full-text and Ontology queries • Example: “Find cell cycle models” Query BiomodelsDB Using IR Gold Standard cell cycle 135 173 n/a “cell cycle” 14 26 28 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 10
  • 11. Available Data for Ranked Retrieval Model based Annotation & Ontologies A model‘s network • Model name • Biochemical background • Include model structure • Model code • Allows to identify e.g. synonyms • Aggregate values 29.11,2012 © 2009 UNIVERSITÄT ROSTOCK 11
  • 12. Mapping a Model to a Database A model‘s network • Include model structure • Aggregate values 29.11.2012 © 2009 UNIVERSITÄT ROSTOCK 12
  • 13. Advantages of Graph Databases • Easy mapping of model structure • Fast browsing through models • Flexible and schema-free storage • Easy linking to models, simulation setups or results, and external resources 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 13
  • 14. Document Model R P S E C asProduct asReactant asModifier isEncodedBy isVersionOf is is is uniprot:P0710 uniprot:Q0339 SBO:0000268 HGNC:8582 GO:0005737 1 3 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 14
  • 15. 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 15
  • 16. Preliminary Results • All models stored in Biomodels DB were stored into the graph database • Implemented storage and search in Jummp  official demo release upcoming • Added 140.811 models from path2models project  done, but including annotation blows the memory  database scales well and is reasonably fast 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 16
  • 17. Demo 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 17
  • 18. Future Work: Relate model versions • Link successor and predecessor • Relate changed entities • Store the diff • Enable version control for multi- document models • Propagate changes for imported models 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 18
  • 19. SEMS: Methods for Model & Simulation Management Model Version control Model Storage Model Search • XML version control • Relational databases • Ranked model retrieval Waltemath et al., 2011 (DBSpektrum) Henkel et al., 2010 (BMC Bioinf) • Difference detection in XML Waltemath et al., submitted • Graph-based storage • Structure- and Henkel et al., 2012 (INFORMATIK) ontology-based search Simulation VC Simulation Storage SimulationSearch • Standardized encoding of simulation setups Waltemath et al., 2011 (BMC SysBiol) • Linking models and simulation descriptions Henkel et al., 2012 (INFORMATIK) 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 19
  • 20. Take Home Message • Ranked retrieval is a necessary feature for model databases. • The model’s inherent structure should be queryable. • Graph based storage reflects well a model‘s encoding and evolution. 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 20
  • 21. Thanks for your attention. Questions? ron.henkel@uni-rostock.de 11.12.2012 © 2009 UNIVERSITÄT ROSTOCK 21