SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Downloaden Sie, um offline zu lesen
Possibilities for Integrating
Model-related Data in Computational
Biology
Databases in Life Sciences, Montreal, July 2013
Dagmar Waltemath, University of Rostock, Germany
Nicolas Le Novère, Babraham Institute, UK
Michel Dumontier, Carleton University, Canada
Archive
Introduction
13-07-12 Integrating model-related data 2
Fig.: DOI: 10.1038/35002125
Introduction
13-07-12 Integrating model-related data 3
Fig.: DOI: 10.1038/35002125
Introduction
13-07-12 Integrating model-related data 4
No and size
of models
time
Fig.: DOI: 10.1038/35002125
Introduction
13-07-12 Integrating model-related data 5
Fig.: DOI: 10.1038/35002125
model reuse – result reproducibility
Introduction
13-07-12 Integrating model-related data 6
Introduction
1. How can we distribute models with all
information necessary to reuse them (MIRIAM)?
2. How can we effectively manage different types
of model-related data?
3. How can we link model-related data to the rest
of the world?
13-07-12 Integrating model-related data 7
1. Distributing models
Archive
13-07-12 Integrating model-related data 8
Frank Bergmann
Nicolas Le Novère
1. Distributing models
The COMBINE archive v0.1
• single “.zip” file
• bundles models and model-related data
• single file
http://co.mbine.org/documents/archive
13-07-12 Integrating model-related data 9
1. A manifest file,
"manifest.xml“,
2. all described files,
3. a metadata file,
"metadata.*“,
4. remaining files.
• All documents necessary
for the description of a
model and all associated
data and procedures.
• In the future: also
references to documents
1. Distributing models
<?xml version="1.0" encoding="utf-8"?>
<omexManifest
xmlns="http://identifiers.org/combine.specifications/omex-manifest">
<content location="./manifest.xml"
format="http://identifiers.org/combine.specifications/omex-
manifest"/>
<content location="./model/model.xml"
format="http://identifiers.org/combine.specifications/sbml"/>
<content location="./simulation.xml"
format="http://identifiers.org/combine.specifications/sedml"/>
<content location="./article.pdf"
format="application/pdf"/>
<content location="./metadata.rdf"
format="http://identifiers.org/combine.specifications/omex-
metadata"/>
</omexManifest>
13-07-12 Integrating model-related data 10
2. Managing models
13-07-12 11Integrating model-related data
Ron Henkel
2. Managing models
• Neo4J database
• Model2graph mapping ( , )
• Rich relations
http://biomodels.net/qualifiers
• Links to annotations
13-07-12 Integrating model-related data 12
“Which models are annotated with
‚Adenosine tri-phosphate‘?“
“Which models contain reactions with ATP
as reactant and ADP as product?”
Document
Model
P E CR S
SBO:0000268 uniprot:P07101 uniprot:Q03393 GO:0005737HGNC:8582
is
isVersionOf
is
isEncodedBy
is
asProduct
asReactant
asModifier
Fig.: Henkel et al. (2012) INFORMATIK 2012, Braunschweig
Document
Model
P E CR S
SBO:0000268 uniprot:P07101 uniprot:Q03393 GO:0005737HGNC:8582
is
isVersionOf
is
isEncodedBy
is
asProduct
asReactant
asModifier
2. Managing models
• Lucene-based ranked retrieval
13-07-12 Integrating model-related data 13
“Give me the best matching model published
about the Cell Cycle and covering forms of cdc.“
Lucene query "cdc*" AND "Cell Cycle"
http://www.ebi.ac.uk/biomodels-demo/
Henkel et al. (2010), Bioinformatics
Fig.: Henkel et al. (2012) INFORMATIK 2012, Braunschweig
2. Managing models
• Representing simulation descriptions
• ... and other types of model-related data
13-07-12 Integrating model-related data 14
“Give me all possible simulations that
show the dependency of the Cell Cycle
on the concentration of cdc25.“
Fig.: Henkel et al. (2012) INFORMATIK 2012, Braunschweig
3. Integrating model data
13-07-12 Integrating model-related data 15
3. Integrating model data
13-07-12 Integrating model-related data 1616
At the heart of Linked Data for the Life Sciences
• Free and open source
• Based on Semantic Web standards
• Billions of interlinked statements from dozens
of conventional and high value datasets
• Partnerships with EBI, NCBI, DBCLS, NCBO,
OpenPHACTS, and commercial tool providers
chemicals/drugs/formulations,
genomes/genes/proteins, domains
Interactions, complexes & pathways
BioModels
animal models and phenotypes
Disease, genetic markers, treatments
Terminologies & publications
3. Integrating model data
# get all biochemical reactions in biomodels that are kinds of "protein
catabolic process“, as defined by the gene ontology (in bioportal endpoint)
SELECT ?go ?label count(distinct ?x)
WHERE {
?go rdfs:label ?label .
?go rdfs:subClassOf ?tgo OPTION (TRANSITIVE) .
?tgo rdfs:label ?tlabel .
FILTER regex(?tlabel, "^protein catabolic process")
service <http://biomodels.bio2rdf.org/sparql> {
?x <http://bio2rdf.org/biopax_vocabulary:identical-to> ?go .
?x a <http://www.biopax.org/release/biopax-level3.owl#BiochemicalReaction> .
}
13-07-12 Integrating model-related data 17
Gene Ontology Annotation Number of Reactions
protein catabolic process [go:0030163] 51
cellular protein catabolic process [go:0044257] 26
modification-dependent protein catabolic process [go:0019941] 1
beta-amyloid formation [go:0034205] 1
“Give me all reactions in BioModels
Database that represent protein
catabolic processes. “
Summary
Approach Features Purpose
COMBINE archive File bundle;
• Easy access to all model-related
data through one single file
Shipping files
Graph-DB (MORRE) Network of interrelated nodes
• IR techniques easily applicable
• No schema
• Link models and simulations
Managing
existing
model data
BIO2RDF Semantic integration of knowledge
• Automated reasoning
• No schema
• Linking into LOD
Full
integration
13-07-12 Integrating model-related data 18
Thank you.
13-07-12 Integrating model-related data 19
http://co.mbine.org/events/COMBINE_2013

Weitere ähnliche Inhalte

Was ist angesagt?

Students academic performance using clustering technique
Students academic performance using clustering techniqueStudents academic performance using clustering technique
Students academic performance using clustering techniquesaniacorreya
 
Managing and Analyzing Health Data (VLDB Conference)
Managing and Analyzing Health Data (VLDB Conference)Managing and Analyzing Health Data (VLDB Conference)
Managing and Analyzing Health Data (VLDB Conference)Peter Speyer
 
Machine Learning Powered Metabolomic Network Analysis
Machine Learning Powered Metabolomic Network AnalysisMachine Learning Powered Metabolomic Network Analysis
Machine Learning Powered Metabolomic Network AnalysisDmitry Grapov
 
An Open Source Tool for Game Theoretic Health Data De-Identification
An Open Source Tool for Game Theoretic Health Data De-IdentificationAn Open Source Tool for Game Theoretic Health Data De-Identification
An Open Source Tool for Game Theoretic Health Data De-Identificationarx-deidentifier
 
IRJET- A Review of Data Cleaning and its Current Approaches
IRJET- A Review of Data Cleaning and its Current ApproachesIRJET- A Review of Data Cleaning and its Current Approaches
IRJET- A Review of Data Cleaning and its Current ApproachesIRJET Journal
 
Niso usage data forum 2007
Niso usage data forum 2007Niso usage data forum 2007
Niso usage data forum 2007John McDonald
 
Predicting students performance in final examination
Predicting students performance in final examinationPredicting students performance in final examination
Predicting students performance in final examinationRashid Ansari
 
Complex Systems Biology Informed Data Analysis and Machine Learning
Complex Systems Biology Informed Data Analysis and Machine LearningComplex Systems Biology Informed Data Analysis and Machine Learning
Complex Systems Biology Informed Data Analysis and Machine LearningDmitry Grapov
 
Analysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease DatasetAnalysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease DatasetIRJET Journal
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessingsuganmca14
 
Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...nalini manogaran
 
Effects of Highly Agreed Documents in Relevancy Prediction
Effects of Highly Agreed Documents in Relevancy PredictionEffects of Highly Agreed Documents in Relevancy Prediction
Effects of Highly Agreed Documents in Relevancy PredictionNTNU
 
Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.
Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.
Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.Charleston Conference
 
Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13DataDryad
 
Mapping to the Metabolomic Manifold
Mapping to the Metabolomic ManifoldMapping to the Metabolomic Manifold
Mapping to the Metabolomic ManifoldDmitry Grapov
 
API-Centric Data Integration for Human Genomics Reference Databases: Achieve...
 API-Centric Data Integration for Human Genomics Reference Databases: Achieve... API-Centric Data Integration for Human Genomics Reference Databases: Achieve...
API-Centric Data Integration for Human Genomics Reference Databases: Achieve...Genomika Diagnósticos
 
Connecting and synchronizing scientific knowledge
Connecting and synchronizing scientific knowledgeConnecting and synchronizing scientific knowledge
Connecting and synchronizing scientific knowledgePrashant Gupta
 

Was ist angesagt? (20)

Students academic performance using clustering technique
Students academic performance using clustering techniqueStudents academic performance using clustering technique
Students academic performance using clustering technique
 
Managing and Analyzing Health Data (VLDB Conference)
Managing and Analyzing Health Data (VLDB Conference)Managing and Analyzing Health Data (VLDB Conference)
Managing and Analyzing Health Data (VLDB Conference)
 
Machine Learning Powered Metabolomic Network Analysis
Machine Learning Powered Metabolomic Network AnalysisMachine Learning Powered Metabolomic Network Analysis
Machine Learning Powered Metabolomic Network Analysis
 
An Open Source Tool for Game Theoretic Health Data De-Identification
An Open Source Tool for Game Theoretic Health Data De-IdentificationAn Open Source Tool for Game Theoretic Health Data De-Identification
An Open Source Tool for Game Theoretic Health Data De-Identification
 
IRJET- A Review of Data Cleaning and its Current Approaches
IRJET- A Review of Data Cleaning and its Current ApproachesIRJET- A Review of Data Cleaning and its Current Approaches
IRJET- A Review of Data Cleaning and its Current Approaches
 
Niso usage data forum 2007
Niso usage data forum 2007Niso usage data forum 2007
Niso usage data forum 2007
 
Predicting students performance in final examination
Predicting students performance in final examinationPredicting students performance in final examination
Predicting students performance in final examination
 
Complex Systems Biology Informed Data Analysis and Machine Learning
Complex Systems Biology Informed Data Analysis and Machine LearningComplex Systems Biology Informed Data Analysis and Machine Learning
Complex Systems Biology Informed Data Analysis and Machine Learning
 
Bi4101343346
Bi4101343346Bi4101343346
Bi4101343346
 
Student Performance Data Mining Project Report
Student Performance Data Mining Project ReportStudent Performance Data Mining Project Report
Student Performance Data Mining Project Report
 
Analysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease DatasetAnalysis on Data Mining Techniques for Heart Disease Dataset
Analysis on Data Mining Techniques for Heart Disease Dataset
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...
 
Effects of Highly Agreed Documents in Relevancy Prediction
Effects of Highly Agreed Documents in Relevancy PredictionEffects of Highly Agreed Documents in Relevancy Prediction
Effects of Highly Agreed Documents in Relevancy Prediction
 
Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.
Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.
Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.
 
Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13
 
Mapping to the Metabolomic Manifold
Mapping to the Metabolomic ManifoldMapping to the Metabolomic Manifold
Mapping to the Metabolomic Manifold
 
API-Centric Data Integration for Human Genomics Reference Databases: Achieve...
 API-Centric Data Integration for Human Genomics Reference Databases: Achieve... API-Centric Data Integration for Human Genomics Reference Databases: Achieve...
API-Centric Data Integration for Human Genomics Reference Databases: Achieve...
 
Connecting and synchronizing scientific knowledge
Connecting and synchronizing scientific knowledgeConnecting and synchronizing scientific knowledge
Connecting and synchronizing scientific knowledge
 
Pedro-Combining rapid data modelling and ontology services
Pedro-Combining rapid data modelling and ontology servicesPedro-Combining rapid data modelling and ontology services
Pedro-Combining rapid data modelling and ontology services
 

Andere mochten auch

Management of simulation studies in computational biology
Management of simulation studies in computational biologyManagement of simulation studies in computational biology
Management of simulation studies in computational biologyUniversity Medicine Greifswald
 
Reproducibility, dissemination, and management of modeling results
Reproducibility, dissemination,  and management of modeling resultsReproducibility, dissemination,  and management of modeling results
Reproducibility, dissemination, and management of modeling resultsUniversity Medicine Greifswald
 
Grayling wcf davos visual social media
Grayling wcf davos   visual social mediaGrayling wcf davos   visual social media
Grayling wcf davos visual social mediaCatalana Occidente
 
Web analytics for Marketing & Communications - Best Practice Methodologies fo...
Web analytics for Marketing & Communications - Best Practice Methodologies fo...Web analytics for Marketing & Communications - Best Practice Methodologies fo...
Web analytics for Marketing & Communications - Best Practice Methodologies fo...Glide Technologies
 

Andere mochten auch (8)

Meta-Information for Bio-Models
Meta-Information for Bio-ModelsMeta-Information for Bio-Models
Meta-Information for Bio-Models
 
Management of simulation studies in computational biology
Management of simulation studies in computational biologyManagement of simulation studies in computational biology
Management of simulation studies in computational biology
 
Reproducibility, dissemination, and management of modeling results
Reproducibility, dissemination,  and management of modeling resultsReproducibility, dissemination,  and management of modeling results
Reproducibility, dissemination, and management of modeling results
 
Senior Project
Senior ProjectSenior Project
Senior Project
 
Grayling wcf davos visual social media
Grayling wcf davos   visual social mediaGrayling wcf davos   visual social media
Grayling wcf davos visual social media
 
Modelling sample at SEMS from a graph perspective
Modelling sample at SEMS from a graph perspectiveModelling sample at SEMS from a graph perspective
Modelling sample at SEMS from a graph perspective
 
Web analytics for Marketing & Communications - Best Practice Methodologies fo...
Web analytics for Marketing & Communications - Best Practice Methodologies fo...Web analytics for Marketing & Communications - Best Practice Methodologies fo...
Web analytics for Marketing & Communications - Best Practice Methodologies fo...
 
e:Bio Kick-Off Meeting, SEMS
e:Bio Kick-Off Meeting, SEMSe:Bio Kick-Off Meeting, SEMS
e:Bio Kick-Off Meeting, SEMS
 

Ähnlich wie Possibilities for integrating model-related data in computational biology (DILS 2013)

Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...University Medicine Greifswald
 
COMBINE standards & tools: Getting model management right
COMBINE standards & tools: Getting model management rightCOMBINE standards & tools: Getting model management right
COMBINE standards & tools: Getting model management rightUniversity Medicine Greifswald
 
Large Graph Mining
Large Graph MiningLarge Graph Mining
Large Graph MiningSabri Skhiri
 
PhD dissertation Luis Marco Ruiz
PhD dissertation Luis Marco RuizPhD dissertation Luis Marco Ruiz
PhD dissertation Luis Marco RuizLuis Marco Ruiz
 
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTIONMULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTIONIJDKP
 
Poster: Very Open Data Project
Poster: Very Open Data ProjectPoster: Very Open Data Project
Poster: Very Open Data ProjectEdward Blurock
 
Standards and tools for model management in biomedical research
Standards and tools for model management in biomedical researchStandards and tools for model management in biomedical research
Standards and tools for model management in biomedical researchUniversity Medicine Greifswald
 
Minimal viable data reuse
Minimal viable data reuseMinimal viable data reuse
Minimal viable data reusevoginip
 
Big Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectiveBig Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectivePhilip Bourne
 
Model repositories and standard formats for model reusability
Model repositories and standard formats for model reusabilityModel repositories and standard formats for model reusability
Model repositories and standard formats for model reusabilityUniversity Medicine Greifswald
 
FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIRDOM
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)Carole Goble
 
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...Frederik van den Broek
 
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUESPREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUEScscpconf
 
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...Devansh16
 

Ähnlich wie Possibilities for integrating model-related data in computational biology (DILS 2013) (20)

Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...Adding value to scientific results: COMBINE standards & guidelines for system...
Adding value to scientific results: COMBINE standards & guidelines for system...
 
Model management for systems biology projects
Model management for systems biology projectsModel management for systems biology projects
Model management for systems biology projects
 
FAIR data management in biomedicine
FAIR data management  in biomedicineFAIR data management  in biomedicine
FAIR data management in biomedicine
 
COMBINE standards & tools: Getting model management right
COMBINE standards & tools: Getting model management rightCOMBINE standards & tools: Getting model management right
COMBINE standards & tools: Getting model management right
 
Large Graph Mining
Large Graph MiningLarge Graph Mining
Large Graph Mining
 
PhD dissertation Luis Marco Ruiz
PhD dissertation Luis Marco RuizPhD dissertation Luis Marco Ruiz
PhD dissertation Luis Marco Ruiz
 
Data and model management in Systems Biology
Data and model management in Systems BiologyData and model management in Systems Biology
Data and model management in Systems Biology
 
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTIONMULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
MULTI MODEL DATA MINING APPROACH FOR HEART FAILURE PREDICTION
 
Poster: Very Open Data Project
Poster: Very Open Data ProjectPoster: Very Open Data Project
Poster: Very Open Data Project
 
Standards and tools for model management in biomedical research
Standards and tools for model management in biomedical researchStandards and tools for model management in biomedical research
Standards and tools for model management in biomedical research
 
OGD Metadata standards – The ENGAGE metadata architecture
OGD Metadata standards – The ENGAGE metadata architectureOGD Metadata standards – The ENGAGE metadata architecture
OGD Metadata standards – The ENGAGE metadata architecture
 
Minimal viable data reuse
Minimal viable data reuseMinimal viable data reuse
Minimal viable data reuse
 
Poster CBIS 2012
Poster CBIS 2012Poster CBIS 2012
Poster CBIS 2012
 
Big Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectiveBig Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH Perspective
 
Model repositories and standard formats for model reusability
Model repositories and standard formats for model reusabilityModel repositories and standard formats for model reusability
Model repositories and standard formats for model reusability
 
FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)
 
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
 
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUESPREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
 
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
 

Mehr von University Medicine Greifswald

A guide to the COMBINE: Navigating through specifications, mailing lists and ...
A guide to the COMBINE: Navigating through specifications, mailing lists and ...A guide to the COMBINE: Navigating through specifications, mailing lists and ...
A guide to the COMBINE: Navigating through specifications, mailing lists and ...University Medicine Greifswald
 
Implementierung Graph-basierter Ansätze für das Management systembiologischer...
Implementierung Graph-basierter Ansätze für das Management systembiologischer...Implementierung Graph-basierter Ansätze für das Management systembiologischer...
Implementierung Graph-basierter Ansätze für das Management systembiologischer...University Medicine Greifswald
 
Using Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology modelsUsing Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology modelsUniversity Medicine Greifswald
 
Model management tools for improved reproducibility in systems biology
Model management tools for improved reproducibility in systems biologyModel management tools for improved reproducibility in systems biology
Model management tools for improved reproducibility in systems biologyUniversity Medicine Greifswald
 
Identifying pattern in reaction networks of computational models
Identifying pattern in reaction networks of computational modelsIdentifying pattern in reaction networks of computational models
Identifying pattern in reaction networks of computational modelsUniversity Medicine Greifswald
 
Extended support for standard graphical notations of biological networks in s...
Extended support for standard graphical notations of biological networks in s...Extended support for standard graphical notations of biological networks in s...
Extended support for standard graphical notations of biological networks in s...University Medicine Greifswald
 
Masymos: Finding hidden treasures in model repositories
Masymos: Finding hidden treasures in model repositoriesMasymos: Finding hidden treasures in model repositories
Masymos: Finding hidden treasures in model repositoriesUniversity Medicine Greifswald
 

Mehr von University Medicine Greifswald (17)

A guide to the COMBINE: Navigating through specifications, mailing lists and ...
A guide to the COMBINE: Navigating through specifications, mailing lists and ...A guide to the COMBINE: Navigating through specifications, mailing lists and ...
A guide to the COMBINE: Navigating through specifications, mailing lists and ...
 
When is a model FAIR – and why should we care?
When is a model FAIR – and why should we care?When is a model FAIR – and why should we care?
When is a model FAIR – and why should we care?
 
2019 07-04-model reuse-bonn
2019 07-04-model reuse-bonn2019 07-04-model reuse-bonn
2019 07-04-model reuse-bonn
 
Mehr Medizininformatik am Meer
Mehr Medizininformatik am MeerMehr Medizininformatik am Meer
Mehr Medizininformatik am Meer
 
Implementierung Graph-basierter Ansätze für das Management systembiologischer...
Implementierung Graph-basierter Ansätze für das Management systembiologischer...Implementierung Graph-basierter Ansätze für das Management systembiologischer...
Implementierung Graph-basierter Ansätze für das Management systembiologischer...
 
Using Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology modelsUsing Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology models
 
Model management tools for improved reproducibility in systems biology
Model management tools for improved reproducibility in systems biologyModel management tools for improved reproducibility in systems biology
Model management tools for improved reproducibility in systems biology
 
Short introduction to SED-ML
Short introduction to SED-MLShort introduction to SED-ML
Short introduction to SED-ML
 
Data and Model Management for Systems Biology
Data and Model Management  for Systems BiologyData and Model Management  for Systems Biology
Data and Model Management for Systems Biology
 
Identifying pattern in reaction networks of computational models
Identifying pattern in reaction networks of computational modelsIdentifying pattern in reaction networks of computational models
Identifying pattern in reaction networks of computational models
 
Extended support for standard graphical notations of biological networks in s...
Extended support for standard graphical notations of biological networks in s...Extended support for standard graphical notations of biological networks in s...
Extended support for standard graphical notations of biological networks in s...
 
Coming Soon: de.NBI and SBGN-ED @ SEMS
Coming Soon: de.NBI and SBGN-ED @ SEMSComing Soon: de.NBI and SBGN-ED @ SEMS
Coming Soon: de.NBI and SBGN-ED @ SEMS
 
Masymos: Finding hidden treasures in model repositories
Masymos: Finding hidden treasures in model repositoriesMasymos: Finding hidden treasures in model repositories
Masymos: Finding hidden treasures in model repositories
 
SEMS: Model search and ranked Retrieval (Ron Henkel)
SEMS: Model search and ranked Retrieval (Ron Henkel)SEMS: Model search and ranked Retrieval (Ron Henkel)
SEMS: Model search and ranked Retrieval (Ron Henkel)
 
Simulation experiment descriptions and management
Simulation experiment descriptions and managementSimulation experiment descriptions and management
Simulation experiment descriptions and management
 
Sems project overview
Sems project overviewSems project overview
Sems project overview
 
Bio-Model Meta-Information and SED-ML
Bio-Model Meta-Information and SED-MLBio-Model Meta-Information and SED-ML
Bio-Model Meta-Information and SED-ML
 

Kürzlich hochgeladen

mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 

Kürzlich hochgeladen (20)

mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 

Possibilities for integrating model-related data in computational biology (DILS 2013)

  • 1. Possibilities for Integrating Model-related Data in Computational Biology Databases in Life Sciences, Montreal, July 2013 Dagmar Waltemath, University of Rostock, Germany Nicolas Le Novère, Babraham Institute, UK Michel Dumontier, Carleton University, Canada Archive
  • 2. Introduction 13-07-12 Integrating model-related data 2 Fig.: DOI: 10.1038/35002125
  • 3. Introduction 13-07-12 Integrating model-related data 3 Fig.: DOI: 10.1038/35002125
  • 4. Introduction 13-07-12 Integrating model-related data 4 No and size of models time Fig.: DOI: 10.1038/35002125
  • 5. Introduction 13-07-12 Integrating model-related data 5 Fig.: DOI: 10.1038/35002125 model reuse – result reproducibility
  • 7. Introduction 1. How can we distribute models with all information necessary to reuse them (MIRIAM)? 2. How can we effectively manage different types of model-related data? 3. How can we link model-related data to the rest of the world? 13-07-12 Integrating model-related data 7
  • 8. 1. Distributing models Archive 13-07-12 Integrating model-related data 8 Frank Bergmann Nicolas Le Novère
  • 9. 1. Distributing models The COMBINE archive v0.1 • single “.zip” file • bundles models and model-related data • single file http://co.mbine.org/documents/archive 13-07-12 Integrating model-related data 9
  • 10. 1. A manifest file, "manifest.xml“, 2. all described files, 3. a metadata file, "metadata.*“, 4. remaining files. • All documents necessary for the description of a model and all associated data and procedures. • In the future: also references to documents 1. Distributing models <?xml version="1.0" encoding="utf-8"?> <omexManifest xmlns="http://identifiers.org/combine.specifications/omex-manifest"> <content location="./manifest.xml" format="http://identifiers.org/combine.specifications/omex- manifest"/> <content location="./model/model.xml" format="http://identifiers.org/combine.specifications/sbml"/> <content location="./simulation.xml" format="http://identifiers.org/combine.specifications/sedml"/> <content location="./article.pdf" format="application/pdf"/> <content location="./metadata.rdf" format="http://identifiers.org/combine.specifications/omex- metadata"/> </omexManifest> 13-07-12 Integrating model-related data 10
  • 11. 2. Managing models 13-07-12 11Integrating model-related data Ron Henkel
  • 12. 2. Managing models • Neo4J database • Model2graph mapping ( , ) • Rich relations http://biomodels.net/qualifiers • Links to annotations 13-07-12 Integrating model-related data 12 “Which models are annotated with ‚Adenosine tri-phosphate‘?“ “Which models contain reactions with ATP as reactant and ADP as product?” Document Model P E CR S SBO:0000268 uniprot:P07101 uniprot:Q03393 GO:0005737HGNC:8582 is isVersionOf is isEncodedBy is asProduct asReactant asModifier Fig.: Henkel et al. (2012) INFORMATIK 2012, Braunschweig
  • 13. Document Model P E CR S SBO:0000268 uniprot:P07101 uniprot:Q03393 GO:0005737HGNC:8582 is isVersionOf is isEncodedBy is asProduct asReactant asModifier 2. Managing models • Lucene-based ranked retrieval 13-07-12 Integrating model-related data 13 “Give me the best matching model published about the Cell Cycle and covering forms of cdc.“ Lucene query "cdc*" AND "Cell Cycle" http://www.ebi.ac.uk/biomodels-demo/ Henkel et al. (2010), Bioinformatics Fig.: Henkel et al. (2012) INFORMATIK 2012, Braunschweig
  • 14. 2. Managing models • Representing simulation descriptions • ... and other types of model-related data 13-07-12 Integrating model-related data 14 “Give me all possible simulations that show the dependency of the Cell Cycle on the concentration of cdc25.“ Fig.: Henkel et al. (2012) INFORMATIK 2012, Braunschweig
  • 15. 3. Integrating model data 13-07-12 Integrating model-related data 15
  • 16. 3. Integrating model data 13-07-12 Integrating model-related data 1616 At the heart of Linked Data for the Life Sciences • Free and open source • Based on Semantic Web standards • Billions of interlinked statements from dozens of conventional and high value datasets • Partnerships with EBI, NCBI, DBCLS, NCBO, OpenPHACTS, and commercial tool providers chemicals/drugs/formulations, genomes/genes/proteins, domains Interactions, complexes & pathways BioModels animal models and phenotypes Disease, genetic markers, treatments Terminologies & publications
  • 17. 3. Integrating model data # get all biochemical reactions in biomodels that are kinds of "protein catabolic process“, as defined by the gene ontology (in bioportal endpoint) SELECT ?go ?label count(distinct ?x) WHERE { ?go rdfs:label ?label . ?go rdfs:subClassOf ?tgo OPTION (TRANSITIVE) . ?tgo rdfs:label ?tlabel . FILTER regex(?tlabel, "^protein catabolic process") service <http://biomodels.bio2rdf.org/sparql> { ?x <http://bio2rdf.org/biopax_vocabulary:identical-to> ?go . ?x a <http://www.biopax.org/release/biopax-level3.owl#BiochemicalReaction> . } 13-07-12 Integrating model-related data 17 Gene Ontology Annotation Number of Reactions protein catabolic process [go:0030163] 51 cellular protein catabolic process [go:0044257] 26 modification-dependent protein catabolic process [go:0019941] 1 beta-amyloid formation [go:0034205] 1 “Give me all reactions in BioModels Database that represent protein catabolic processes. “
  • 18. Summary Approach Features Purpose COMBINE archive File bundle; • Easy access to all model-related data through one single file Shipping files Graph-DB (MORRE) Network of interrelated nodes • IR techniques easily applicable • No schema • Link models and simulations Managing existing model data BIO2RDF Semantic integration of knowledge • Automated reasoning • No schema • Linking into LOD Full integration 13-07-12 Integrating model-related data 18
  • 19. Thank you. 13-07-12 Integrating model-related data 19 http://co.mbine.org/events/COMBINE_2013