SlideShare a Scribd company logo
1 of 16
Download to read offline
SYSTEMS BIOLOGY
BIOINFORMATICS
ROSTOCK
S E Ssimulation experiment management system
BiVeS & BudHat
Difference Detection for
Computational Models
MARTIN SCHARM
Department of Systems Biology & Bioinformatics
Faculty of Computer Science & Electrical Engineering
University of Rostock
http://sems.uni-rostock.de
COMBINE 2013
Sep 19, 2013 Bives & Budhat | Martin Scharm 1
SYSTEMS BIOLOGY
BIOINFORMATICS
ROSTOCK
S E Ssimulation experiment management system
track development
store retrieve
rank
Management
Δ
Δ
Version 1
Version 2
latest
Format-independent,
graph-based storage
Information Retrieval-based
search and ranking
Difference detection
and provenance
http://sems.uni-rostock.de/
Sep 19, 2013 Bives & Budhat | Martin Scharm 2
Provenance
Provenance represents the seven W’s (Who, What, Where, Why, When, Which, (W)how).
– Goble 2002
G ˇ
ĽĽ
ˇ ˇ
ŤŤŤŤ
ˇ ˇ
šššš
ˇ 2
ˇ ˇ
ŁŁ
ˇ
Music
and Arts
BiVeS library. M.S. extended BiVeS and implemented the online
application BudHat.
Funding: This work has been funded by the German Federal
Ministry of Education and Research (e:Bio program).
Conflict of Interest: none declared.
REFERENCES
Beard,D.A. et al. (2009) CellML metadata standards, associated tools and reposi-
tories. Philos. Transact. A Math. Phys. Eng. Sci., 367, 1845–1867.
Chawathe,S. et al. (1996) Change detection in hierarchically structured information.
ACM SIGMOD Rec, 25, 493–504.
Cooper,J. et al. (2011) High-throughput functional curation of cellular electrophysi-
ology models. Prog. Biophys. Mol. Biol, 107, 11–20.
Cuellar,A. et al. (2006) The CellML metadata 1.0 specification. http://www.cellml.
org/specifications/metadata/cellml_metadata_1.0 (30 January 2013, date last
accessed).
Cuellar,A.A. et al. (2003) An overview of CellML 1.1, a biological model descrip-
tion language. Simulation, 79, 740–747.
Finkelstein,A. et al. (2004) Computational challenges of systems biology. IEEE
Comp., 37, 26–33.
Gleeson,P. et al. (2010) NeuroML: a language for describing data driven models of
neurons and networks with a high degree of biological detail. PLoS Comput.
Biol., 6. e1000815þ.
Henkel,R. et al. (2010) Ranked retrieval of computational biology models. BMC
Bioinformatics, 11. 423þ.
Henkel,R. et al. (2012) Considerations of graph-based concepts to manage compu-
tational biology models and associated simulations. In: Goltz,U. et al. (ed.)
Workshop on data in the life sciences. ‘‘INFORMATIK 2012’’, Lecture Notes
in Informatics 208, 1545–1551.
Herrga˚ rd,M. et al. (2008) A consensus yeast metabolic network reconstruction ob-
tained from a community approach to systems biology. Nat. Biotechnol., 26,
1155–1160.
Hines,M.L. et al. (2004) ModelDB: a database to support computational neurosci-
ence. J. Comput. Neurosci., 17, 7–11–11.
Hirschberg,D. (1977) Algorithms for the longest common subsequence problem. J.
ACM, 24, 664–675.
Hoehndorf,R. et al. (2011) Integrating systems biology models and biomedical
ontologies. BMC Syst. Biol., 5, 124.
Hucka,M. et al. (2003) The systems biology markup language (sbml): a medium for
representation and exchange of biochemical network models. Bioinformatics, 19,
524–531.
Hucka,M. et al. (2010) The Systems Biology Markup Language (SBML): Language
specification for level 3 version 1 core. http://iweb.dl.sourceforge.net/project/
sbml/specifications/Level%203%20Ver.%201/sbml-level-3-version-1-core-rel-1.
pdf (30 January 2013, date last accessed).
Juty,N. et al. (2012) Identifiers.org and MIRIAM Registry: community resources to
provide persistent identification. Nucleic Acids Res., 40 (D1), D580–D586.
Krause,F. et al. (2010) Annotation and merging of SBML models with
semanticSBML. Bioinformatics, 2, 421.
Lassila,O. et al. (1998) Resource description framework (RDF) model and syntax
specification. W3C Recommendation. http://www.w3.org/TR/REC-rdf-syntax/
(30 January 2013, date last accessed).
Le Nove` re et al. (2005) Minimum Information Requested In the Annotation of
biochemical Models (MIRIAM). Nature Biotechnol., 23, 1509–1515.
Le Nove` re,N. et al. (2009) The systems biology graphical notation. Nature
Biotechnol., 27, 735–741.
Li,C. et al. (2010) Biomodels database: an enhanced, curated and annotated re-
source for published quantitative kinetic models. BMC Syst. Biol., 4, 92.
Miller,A. et al. (2010) An overview of the cellml api and its implementation. BMC
Bioinformatics, 11, 178.
Miller,A. et al. (2011) Revision history aware repositories of computational models
of biological systems. BMC Bioinformatics., 12, 22.
Ro¨ nnau,S. et al. (2005) Towards xml version control of office documents. In:
Proceedings of the 2005 ACM symposium on Document engineering. ACM,
New York, NY, USA, pp. 10–19.
Ro¨ nnau,S. et al. (2009) Efficient and reliable merging of xml documents. In:
Proceedings of the 18th ACM conference on Information and knowledge manage-
ment (CIKM 09). ACM, New York, NY, pp. 2105–2106.
Rosado,A.L. et al. (2009) An XQuery-based version extension of an XML Native
Database. In: Proceedings of the 2009 EDBT/ICDT Workshops. ACM, New
York, NY, USA, pp. 99–106.
Saffrey,P. and Owen,R. (2009) Version control of pathway models using xml
patches. BMC Syst. Biol., 3, 34.
Tyson,J.J. (1991) Modeling the cell division cycle: cdc2 and cyclin interactions. Proc.
Natl Acad. Sci. USA, 88, 7328–7332.
Waltemath,D. et al. (2011) Reproducible computational biology experiments with
SED-ML—the simulation experiment description markup language. BMC Syst.
Biol., 5, 198.
Wittig,U. et al. (2006) Sabio-rk: integration and curation of reaction kinetics data.
In: Data Integration in the Life Sciences. Springer, Berlin Heidelberg,
pp. 94–103.
Yu,T. et al. (2011) The physiome model repository 2. Bioinformatics, 27, 743–744.
D.Waltemath et al.
atUniversitaetsbibliothekRostockonJuly18,2013http://bioinformatics.oxfordjournals.org/Downloadedfrom
Literature
Code
Sep 19, 2013 Bives & Budhat | Martin Scharm 3
Provenance
Requirements
X1 X2
Y1
Y2
Y3
Availability & Identity
I O
-vs-
I
M
O
Difference
Detection
I
M
O
updated
reaction
introduced
new modifier
Interpretation
Sep 19, 2013 Bives & Budhat | Martin Scharm 4
Provenance
Requirements
X1 X2
Y1
Y2
Y3
Availability
& Identity
I O
-vs-
I
M
O
Difference Detection
I
M
O
updated
reaction
introduced
new modifier
Interpretation
Sep 19, 2013 Bives & Budhat | Martin Scharm 4
Provenance
Requirements
X1 X2
Y1
Y2
Y3
Availability
& Identity
I O
-vs-
I
M
O
Difference
Detection
I
M
O
updated
reaction
introduced
new modifier
Interpretation
Sep 19, 2013 Bives & Budhat | Martin Scharm 4
Provenance
Requirements
X1 X2
Y1
Y2
Y3
Availability
& Identity
I O
-vs-
I
M
O
Difference
Detection
I
M
O
updated
reaction
introduced
new modifier
Interpretation
  
Sep 19, 2013 Bives  Budhat | Martin Scharm 4
Version Control
good news
A r C
B
D
cycE/cdk2
RB/E2F
RB-Hypo
free E2F
A r
B
C
D
E s
RB/E2F
RB-Hypo
free E2F
cycE/cdk2
RB-Phos
new insights
Waltemath et al.: Improving the reuse of computational models through version
control. Bioinformatics (2013) 29(6): 742-728;
Sep 19, 2013 Bives  Budhat | Martin Scharm 5
BiVeS
Difference Detection
A r C
B
D
cycE/cdk2
RB/E2F
RB-Hypo
free E2F
A r
B
C
D
E s
RB/E2F
RB-Hypo
free E2F
cycE/cdk2
RB-Phos
A
r
B
C
D
A
r
B
C
D
E
s
Biochemical Model Version Control System
• compares models encoded in standadized
formats (currently: and )
• maps hierarchically structured content
• constructs a diff (in XML format)
• is able to interprete this diff
XML
Diff
moves
product of r: C
deletes
product of r: B
inserts
species: E
product of r: E
reaction s
/XML
mapping
di construction
Sep 19, 2013 Bives  Budhat | Martin Scharm 6
BudHat
Diff Visualization
A r C
B
D
cycE/cdk2
RB/E2F
RB-Hypo
free E2F
A r
B
C
D
E s
RB/E2F
RB-Hypo
free E2F
cycE/cdk2
RB-Phos
A
r
B
C
D
A
r
B
C
D
E
s
XML
Diff
moves
product of r: C
deletes
product of r: B
inserts
species: E
product of r: E
reaction s
/XML
• calls BiVeS to construct the diff
• displays the result in various formats
• the XML diff
• a reaction network highlighting the
changes using
• a human readable report
A r B
C
D
E s
Sep 19, 2013 Bives  Budhat | Martin Scharm 7
BiVeS  BudHat
DEMO
BudHat in action!
http://budhat.sems.uni-rostock.de
Sep 19, 2013 Bives  Budhat | Martin Scharm 8
BiVeS
Integration
jvm network cmd
import de.unirostock.sems.bives.api.SBMLDiff;
[...]
SBMLDiff differ = new SBMLDiff (sbmlFileA, sbmlFileB);
differ.mapTrees ();
String graph = differ.getCRNGraphML ();
[...]
Sep 19, 2013 Bives  Budhat | Martin Scharm 9
BiVeS
Integration
jvm network cmd
curl -d ’{
get:
[
documentType,
xmlDiff
],
files:
{
versionA:http://your.db/path/to/versionA.sbml,
versionB:http://your.db/path/to/versionA.sbml
}
}’ http://bives.server.tld
Sep 19, 2013 Bives  Budhat | Martin Scharm 9
BiVeS
Integration
jvm network cmd
java -jar BiVeS.jar path/to/versionA path/to/versionB
git diff 88fea1cddf b64477d742 model.file
Sep 19, 2013 Bives  Budhat | Martin Scharm 9
BiVeS  BudHat
Summary
• BiVeS = Difference detection for hierarchically structured content
• BudHat = Prototypic web interface to demonstrate how BiVeS can be used to
communicate the differences
Sep 19, 2013 Bives  Budhat | Martin Scharm 10
SYSTEMS BIOLOGY
BIOINFORMATICS
ROSTOCK
S E Ssimulation experiment management system
That’s it! Stay tuned ;-)
@SemsProject
http://sems.uni-rostock.de
http://budhat.sems.uni-rostock.de
Questions? Suggestions? Recommendations? Drop me an email:
martin.scharm@uni-rostock.de
Sep 19, 2013 Bives  Budhat | Martin Scharm 11

More Related Content

Similar to BiVeS & BudHat @ Combine2013 in Paris

Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...University Medicine Greifswald
 
The Research Object Initiative: Frameworks and Use Cases
The Research Object Initiative:Frameworks and Use CasesThe Research Object Initiative:Frameworks and Use Cases
The Research Object Initiative: Frameworks and Use CasesCarole Goble
 
Syst biol 2012-burguiere-sysbio sys069
Syst biol 2012-burguiere-sysbio sys069Syst biol 2012-burguiere-sysbio sys069
Syst biol 2012-burguiere-sysbio sys069Thomas Burguiere
 
BiVeS & BudHat @ Harmony2013
BiVeS & BudHat @ Harmony2013BiVeS & BudHat @ Harmony2013
BiVeS & BudHat @ Harmony2013Martin Scharm
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Carole Goble
 
Eclipse Meets Systems Biology
Eclipse Meets Systems BiologyEclipse Meets Systems Biology
Eclipse Meets Systems BiologyRichard Adams
 
Reproducible research in molecular biophysics and structural biology
Reproducible research in molecular biophysics and structural biologyReproducible research in molecular biophysics and structural biology
Reproducible research in molecular biophysics and structural biologykhinsen
 
COBieOWL An OWL ontology based on COBie standard
COBieOWL An OWL ontology based on COBie standardCOBieOWL An OWL ontology based on COBie standard
COBieOWL An OWL ontology based on COBie standardAna Roxin
 
20090219 The case for another systems biology modelling environment
20090219 The case for another systems biology modelling environment20090219 The case for another systems biology modelling environment
20090219 The case for another systems biology modelling environmentJonathan Blakes
 
Parma 2016-05-17 - JGrass-NewAGE - Some About The State of Art
Parma 2016-05-17 - JGrass-NewAGE - Some About The State of ArtParma 2016-05-17 - JGrass-NewAGE - Some About The State of Art
Parma 2016-05-17 - JGrass-NewAGE - Some About The State of ArtRiccardo Rigon
 
Web services for sharing germplasm data sets, at FAO in Rome (2006)
Web services for sharing germplasm data sets, at FAO in Rome (2006)Web services for sharing germplasm data sets, at FAO in Rome (2006)
Web services for sharing germplasm data sets, at FAO in Rome (2006)Dag Endresen
 
FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIRDOM
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows Carole Goble
 
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS cscpconf
 
LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...
LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...
LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...Pieter Pauwels
 
Management of simulation studies in computational biology
Management of simulation studies in computational biologyManagement of simulation studies in computational biology
Management of simulation studies in computational biologyUniversity Medicine Greifswald
 
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...Marko Rodriguez
 

Similar to BiVeS & BudHat @ Combine2013 in Paris (20)

Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...
 
The Research Object Initiative: Frameworks and Use Cases
The Research Object Initiative:Frameworks and Use CasesThe Research Object Initiative:Frameworks and Use Cases
The Research Object Initiative: Frameworks and Use Cases
 
Syst biol 2012-burguiere-sysbio sys069
Syst biol 2012-burguiere-sysbio sys069Syst biol 2012-burguiere-sysbio sys069
Syst biol 2012-burguiere-sysbio sys069
 
BiVeS & BudHat @ Harmony2013
BiVeS & BudHat @ Harmony2013BiVeS & BudHat @ Harmony2013
BiVeS & BudHat @ Harmony2013
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017
 
Eclipse Meets Systems Biology
Eclipse Meets Systems BiologyEclipse Meets Systems Biology
Eclipse Meets Systems Biology
 
Reproducible research in molecular biophysics and structural biology
Reproducible research in molecular biophysics and structural biologyReproducible research in molecular biophysics and structural biology
Reproducible research in molecular biophysics and structural biology
 
COBieOWL An OWL ontology based on COBie standard
COBieOWL An OWL ontology based on COBie standardCOBieOWL An OWL ontology based on COBie standard
COBieOWL An OWL ontology based on COBie standard
 
20090219 The case for another systems biology modelling environment
20090219 The case for another systems biology modelling environment20090219 The case for another systems biology modelling environment
20090219 The case for another systems biology modelling environment
 
Parma 2016-05-17 - JGrass-NewAGE - Some About The State of Art
Parma 2016-05-17 - JGrass-NewAGE - Some About The State of ArtParma 2016-05-17 - JGrass-NewAGE - Some About The State of Art
Parma 2016-05-17 - JGrass-NewAGE - Some About The State of Art
 
Web services for sharing germplasm data sets, at FAO in Rome (2006)
Web services for sharing germplasm data sets, at FAO in Rome (2006)Web services for sharing germplasm data sets, at FAO in Rome (2006)
Web services for sharing germplasm data sets, at FAO in Rome (2006)
 
FAIR data management in biomedicine
FAIR data management  in biomedicineFAIR data management  in biomedicine
FAIR data management in biomedicine
 
FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS
 
LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...
LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...
LOA seminar 2017 - Product and 3D geometry ontologies at action in constructi...
 
Management of simulation studies in computational biology
Management of simulation studies in computational biologyManagement of simulation studies in computational biology
Management of simulation studies in computational biology
 
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
 

More from Martin Scharm

Vertrauen oder Kontrolle? Meine Gesundheitsdaten und ich
Vertrauen oder Kontrolle? Meine Gesundheitsdaten und ichVertrauen oder Kontrolle? Meine Gesundheitsdaten und ich
Vertrauen oder Kontrolle? Meine Gesundheitsdaten und ichMartin Scharm
 
Improving Reproducibility and Reuse of Modelling Results in the Life Sciences
Improving Reproducibility and Reuse of Modelling Results in the Life SciencesImproving Reproducibility and Reuse of Modelling Results in the Life Sciences
Improving Reproducibility and Reuse of Modelling Results in the Life SciencesMartin Scharm
 
Model Management in Systems Biology: Challenges – Approaches – Solutions
Model Management in Systems Biology: Challenges – Approaches – SolutionsModel Management in Systems Biology: Challenges – Approaches – Solutions
Model Management in Systems Biology: Challenges – Approaches – SolutionsMartin Scharm
 
Docker Demo @ IuK Seminar
Docker Demo @ IuK SeminarDocker Demo @ IuK Seminar
Docker Demo @ IuK SeminarMartin Scharm
 
COMBINE (archive) meta data
COMBINE (archive) meta dataCOMBINE (archive) meta data
COMBINE (archive) meta dataMartin Scharm
 
Characterising differences between model versions
Characterising differences between model versionsCharacterising differences between model versions
Characterising differences between model versionsMartin Scharm
 
Group meeting in Manchester.
Group meeting in Manchester.Group meeting in Manchester.
Group meeting in Manchester.Martin Scharm
 
M2CAT: Extracting reproducible simulation studies from model repositories usi...
M2CAT: Extracting reproducible simulation studies from model repositories usi...M2CAT: Extracting reproducible simulation studies from model repositories usi...
M2CAT: Extracting reproducible simulation studies from model repositories usi...Martin Scharm
 
M2CAT: Extracting reproducible simulation studies from model repositories usi...
M2CAT: Extracting reproducible simulation studies from model repositories usi...M2CAT: Extracting reproducible simulation studies from model repositories usi...
M2CAT: Extracting reproducible simulation studies from model repositories usi...Martin Scharm
 
HandsOn: git (or version control in general...)
HandsOn: git (or version control in general...)HandsOn: git (or version control in general...)
HandsOn: git (or version control in general...)Martin Scharm
 
The CellML models’ walk through the repository
The CellML models’ walk through the repositoryThe CellML models’ walk through the repository
The CellML models’ walk through the repositoryMartin Scharm
 
CombineArchiveWeb -- web based tool to handle files associated with modelling...
CombineArchiveWeb -- web based tool to handle files associated with modelling...CombineArchiveWeb -- web based tool to handle files associated with modelling...
CombineArchiveWeb -- web based tool to handle files associated with modelling...Martin Scharm
 
Improving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBIImproving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBIMartin Scharm
 
BiVeS & BudHat: Invited Talk @ IPK Gatersleben
BiVeS & BudHat: Invited Talk @ IPK GaterslebenBiVeS & BudHat: Invited Talk @ IPK Gatersleben
BiVeS & BudHat: Invited Talk @ IPK GaterslebenMartin Scharm
 
BiVeS & BudHat -- Version Control for Computational Models @ All hands PALs M...
BiVeS & BudHat -- Version Control for Computational Models @ All hands PALs M...BiVeS & BudHat -- Version Control for Computational Models @ All hands PALs M...
BiVeS & BudHat -- Version Control for Computational Models @ All hands PALs M...Martin Scharm
 

More from Martin Scharm (15)

Vertrauen oder Kontrolle? Meine Gesundheitsdaten und ich
Vertrauen oder Kontrolle? Meine Gesundheitsdaten und ichVertrauen oder Kontrolle? Meine Gesundheitsdaten und ich
Vertrauen oder Kontrolle? Meine Gesundheitsdaten und ich
 
Improving Reproducibility and Reuse of Modelling Results in the Life Sciences
Improving Reproducibility and Reuse of Modelling Results in the Life SciencesImproving Reproducibility and Reuse of Modelling Results in the Life Sciences
Improving Reproducibility and Reuse of Modelling Results in the Life Sciences
 
Model Management in Systems Biology: Challenges – Approaches – Solutions
Model Management in Systems Biology: Challenges – Approaches – SolutionsModel Management in Systems Biology: Challenges – Approaches – Solutions
Model Management in Systems Biology: Challenges – Approaches – Solutions
 
Docker Demo @ IuK Seminar
Docker Demo @ IuK SeminarDocker Demo @ IuK Seminar
Docker Demo @ IuK Seminar
 
COMBINE (archive) meta data
COMBINE (archive) meta dataCOMBINE (archive) meta data
COMBINE (archive) meta data
 
Characterising differences between model versions
Characterising differences between model versionsCharacterising differences between model versions
Characterising differences between model versions
 
Group meeting in Manchester.
Group meeting in Manchester.Group meeting in Manchester.
Group meeting in Manchester.
 
M2CAT: Extracting reproducible simulation studies from model repositories usi...
M2CAT: Extracting reproducible simulation studies from model repositories usi...M2CAT: Extracting reproducible simulation studies from model repositories usi...
M2CAT: Extracting reproducible simulation studies from model repositories usi...
 
M2CAT: Extracting reproducible simulation studies from model repositories usi...
M2CAT: Extracting reproducible simulation studies from model repositories usi...M2CAT: Extracting reproducible simulation studies from model repositories usi...
M2CAT: Extracting reproducible simulation studies from model repositories usi...
 
HandsOn: git (or version control in general...)
HandsOn: git (or version control in general...)HandsOn: git (or version control in general...)
HandsOn: git (or version control in general...)
 
The CellML models’ walk through the repository
The CellML models’ walk through the repositoryThe CellML models’ walk through the repository
The CellML models’ walk through the repository
 
CombineArchiveWeb -- web based tool to handle files associated with modelling...
CombineArchiveWeb -- web based tool to handle files associated with modelling...CombineArchiveWeb -- web based tool to handle files associated with modelling...
CombineArchiveWeb -- web based tool to handle files associated with modelling...
 
Improving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBIImproving the Management of Computational Models -- Invited talk at the EBI
Improving the Management of Computational Models -- Invited talk at the EBI
 
BiVeS & BudHat: Invited Talk @ IPK Gatersleben
BiVeS & BudHat: Invited Talk @ IPK GaterslebenBiVeS & BudHat: Invited Talk @ IPK Gatersleben
BiVeS & BudHat: Invited Talk @ IPK Gatersleben
 
BiVeS & BudHat -- Version Control for Computational Models @ All hands PALs M...
BiVeS & BudHat -- Version Control for Computational Models @ All hands PALs M...BiVeS & BudHat -- Version Control for Computational Models @ All hands PALs M...
BiVeS & BudHat -- Version Control for Computational Models @ All hands PALs M...
 

Recently uploaded

Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 

Recently uploaded (20)

Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 

BiVeS & BudHat @ Combine2013 in Paris

  • 1. SYSTEMS BIOLOGY BIOINFORMATICS ROSTOCK S E Ssimulation experiment management system BiVeS & BudHat Difference Detection for Computational Models MARTIN SCHARM Department of Systems Biology & Bioinformatics Faculty of Computer Science & Electrical Engineering University of Rostock http://sems.uni-rostock.de COMBINE 2013 Sep 19, 2013 Bives & Budhat | Martin Scharm 1
  • 2. SYSTEMS BIOLOGY BIOINFORMATICS ROSTOCK S E Ssimulation experiment management system track development store retrieve rank Management Δ Δ Version 1 Version 2 latest Format-independent, graph-based storage Information Retrieval-based search and ranking Difference detection and provenance http://sems.uni-rostock.de/ Sep 19, 2013 Bives & Budhat | Martin Scharm 2
  • 3. Provenance Provenance represents the seven W’s (Who, What, Where, Why, When, Which, (W)how). – Goble 2002 G ˇ ĽĽ ˇ ˇ ŤŤŤŤ ˇ ˇ šššš ˇ 2 ˇ ˇ ŁŁ ˇ Music and Arts BiVeS library. M.S. extended BiVeS and implemented the online application BudHat. Funding: This work has been funded by the German Federal Ministry of Education and Research (e:Bio program). Conflict of Interest: none declared. REFERENCES Beard,D.A. et al. (2009) CellML metadata standards, associated tools and reposi- tories. Philos. Transact. A Math. Phys. Eng. Sci., 367, 1845–1867. Chawathe,S. et al. (1996) Change detection in hierarchically structured information. ACM SIGMOD Rec, 25, 493–504. Cooper,J. et al. (2011) High-throughput functional curation of cellular electrophysi- ology models. Prog. Biophys. Mol. Biol, 107, 11–20. Cuellar,A. et al. (2006) The CellML metadata 1.0 specification. http://www.cellml. org/specifications/metadata/cellml_metadata_1.0 (30 January 2013, date last accessed). Cuellar,A.A. et al. (2003) An overview of CellML 1.1, a biological model descrip- tion language. Simulation, 79, 740–747. Finkelstein,A. et al. (2004) Computational challenges of systems biology. IEEE Comp., 37, 26–33. Gleeson,P. et al. (2010) NeuroML: a language for describing data driven models of neurons and networks with a high degree of biological detail. PLoS Comput. Biol., 6. e1000815þ. Henkel,R. et al. (2010) Ranked retrieval of computational biology models. BMC Bioinformatics, 11. 423þ. Henkel,R. et al. (2012) Considerations of graph-based concepts to manage compu- tational biology models and associated simulations. In: Goltz,U. et al. (ed.) Workshop on data in the life sciences. ‘‘INFORMATIK 2012’’, Lecture Notes in Informatics 208, 1545–1551. Herrga˚ rd,M. et al. (2008) A consensus yeast metabolic network reconstruction ob- tained from a community approach to systems biology. Nat. Biotechnol., 26, 1155–1160. Hines,M.L. et al. (2004) ModelDB: a database to support computational neurosci- ence. J. Comput. Neurosci., 17, 7–11–11. Hirschberg,D. (1977) Algorithms for the longest common subsequence problem. J. ACM, 24, 664–675. Hoehndorf,R. et al. (2011) Integrating systems biology models and biomedical ontologies. BMC Syst. Biol., 5, 124. Hucka,M. et al. (2003) The systems biology markup language (sbml): a medium for representation and exchange of biochemical network models. Bioinformatics, 19, 524–531. Hucka,M. et al. (2010) The Systems Biology Markup Language (SBML): Language specification for level 3 version 1 core. http://iweb.dl.sourceforge.net/project/ sbml/specifications/Level%203%20Ver.%201/sbml-level-3-version-1-core-rel-1. pdf (30 January 2013, date last accessed). Juty,N. et al. (2012) Identifiers.org and MIRIAM Registry: community resources to provide persistent identification. Nucleic Acids Res., 40 (D1), D580–D586. Krause,F. et al. (2010) Annotation and merging of SBML models with semanticSBML. Bioinformatics, 2, 421. Lassila,O. et al. (1998) Resource description framework (RDF) model and syntax specification. W3C Recommendation. http://www.w3.org/TR/REC-rdf-syntax/ (30 January 2013, date last accessed). Le Nove` re et al. (2005) Minimum Information Requested In the Annotation of biochemical Models (MIRIAM). Nature Biotechnol., 23, 1509–1515. Le Nove` re,N. et al. (2009) The systems biology graphical notation. Nature Biotechnol., 27, 735–741. Li,C. et al. (2010) Biomodels database: an enhanced, curated and annotated re- source for published quantitative kinetic models. BMC Syst. Biol., 4, 92. Miller,A. et al. (2010) An overview of the cellml api and its implementation. BMC Bioinformatics, 11, 178. Miller,A. et al. (2011) Revision history aware repositories of computational models of biological systems. BMC Bioinformatics., 12, 22. Ro¨ nnau,S. et al. (2005) Towards xml version control of office documents. In: Proceedings of the 2005 ACM symposium on Document engineering. ACM, New York, NY, USA, pp. 10–19. Ro¨ nnau,S. et al. (2009) Efficient and reliable merging of xml documents. In: Proceedings of the 18th ACM conference on Information and knowledge manage- ment (CIKM 09). ACM, New York, NY, pp. 2105–2106. Rosado,A.L. et al. (2009) An XQuery-based version extension of an XML Native Database. In: Proceedings of the 2009 EDBT/ICDT Workshops. ACM, New York, NY, USA, pp. 99–106. Saffrey,P. and Owen,R. (2009) Version control of pathway models using xml patches. BMC Syst. Biol., 3, 34. Tyson,J.J. (1991) Modeling the cell division cycle: cdc2 and cyclin interactions. Proc. Natl Acad. Sci. USA, 88, 7328–7332. Waltemath,D. et al. (2011) Reproducible computational biology experiments with SED-ML—the simulation experiment description markup language. BMC Syst. Biol., 5, 198. Wittig,U. et al. (2006) Sabio-rk: integration and curation of reaction kinetics data. In: Data Integration in the Life Sciences. Springer, Berlin Heidelberg, pp. 94–103. Yu,T. et al. (2011) The physiome model repository 2. Bioinformatics, 27, 743–744. D.Waltemath et al. atUniversitaetsbibliothekRostockonJuly18,2013http://bioinformatics.oxfordjournals.org/Downloadedfrom Literature Code Sep 19, 2013 Bives & Budhat | Martin Scharm 3
  • 4. Provenance Requirements X1 X2 Y1 Y2 Y3 Availability & Identity I O -vs- I M O Difference Detection I M O updated reaction introduced new modifier Interpretation Sep 19, 2013 Bives & Budhat | Martin Scharm 4
  • 5. Provenance Requirements X1 X2 Y1 Y2 Y3 Availability & Identity I O -vs- I M O Difference Detection I M O updated reaction introduced new modifier Interpretation Sep 19, 2013 Bives & Budhat | Martin Scharm 4
  • 6. Provenance Requirements X1 X2 Y1 Y2 Y3 Availability & Identity I O -vs- I M O Difference Detection I M O updated reaction introduced new modifier Interpretation Sep 19, 2013 Bives & Budhat | Martin Scharm 4
  • 7. Provenance Requirements X1 X2 Y1 Y2 Y3 Availability & Identity I O -vs- I M O Difference Detection I M O updated reaction introduced new modifier Interpretation Sep 19, 2013 Bives Budhat | Martin Scharm 4
  • 8. Version Control good news A r C B D cycE/cdk2 RB/E2F RB-Hypo free E2F A r B C D E s RB/E2F RB-Hypo free E2F cycE/cdk2 RB-Phos new insights Waltemath et al.: Improving the reuse of computational models through version control. Bioinformatics (2013) 29(6): 742-728; Sep 19, 2013 Bives Budhat | Martin Scharm 5
  • 9. BiVeS Difference Detection A r C B D cycE/cdk2 RB/E2F RB-Hypo free E2F A r B C D E s RB/E2F RB-Hypo free E2F cycE/cdk2 RB-Phos A r B C D A r B C D E s Biochemical Model Version Control System • compares models encoded in standadized formats (currently: and ) • maps hierarchically structured content • constructs a diff (in XML format) • is able to interprete this diff XML Diff moves product of r: C deletes product of r: B inserts species: E product of r: E reaction s /XML mapping di construction Sep 19, 2013 Bives Budhat | Martin Scharm 6
  • 10. BudHat Diff Visualization A r C B D cycE/cdk2 RB/E2F RB-Hypo free E2F A r B C D E s RB/E2F RB-Hypo free E2F cycE/cdk2 RB-Phos A r B C D A r B C D E s XML Diff moves product of r: C deletes product of r: B inserts species: E product of r: E reaction s /XML • calls BiVeS to construct the diff • displays the result in various formats • the XML diff • a reaction network highlighting the changes using • a human readable report A r B C D E s Sep 19, 2013 Bives Budhat | Martin Scharm 7
  • 11. BiVeS BudHat DEMO BudHat in action! http://budhat.sems.uni-rostock.de Sep 19, 2013 Bives Budhat | Martin Scharm 8
  • 12. BiVeS Integration jvm network cmd import de.unirostock.sems.bives.api.SBMLDiff; [...] SBMLDiff differ = new SBMLDiff (sbmlFileA, sbmlFileB); differ.mapTrees (); String graph = differ.getCRNGraphML (); [...] Sep 19, 2013 Bives Budhat | Martin Scharm 9
  • 13. BiVeS Integration jvm network cmd curl -d ’{ get: [ documentType, xmlDiff ], files: { versionA:http://your.db/path/to/versionA.sbml, versionB:http://your.db/path/to/versionA.sbml } }’ http://bives.server.tld Sep 19, 2013 Bives Budhat | Martin Scharm 9
  • 14. BiVeS Integration jvm network cmd java -jar BiVeS.jar path/to/versionA path/to/versionB git diff 88fea1cddf b64477d742 model.file Sep 19, 2013 Bives Budhat | Martin Scharm 9
  • 15. BiVeS BudHat Summary • BiVeS = Difference detection for hierarchically structured content • BudHat = Prototypic web interface to demonstrate how BiVeS can be used to communicate the differences Sep 19, 2013 Bives Budhat | Martin Scharm 10
  • 16. SYSTEMS BIOLOGY BIOINFORMATICS ROSTOCK S E Ssimulation experiment management system That’s it! Stay tuned ;-) @SemsProject http://sems.uni-rostock.de http://budhat.sems.uni-rostock.de Questions? Suggestions? Recommendations? Drop me an email: martin.scharm@uni-rostock.de Sep 19, 2013 Bives Budhat | Martin Scharm 11