SlideShare ist ein Scribd-Unternehmen logo
1 von 28
BioPAX A Data Exchange Format for Biological Pathways BioPAX Workgroup www.biopax.org
Introduction ,[object Object],[object Object],[object Object]
Exchange Formats in the Pathway Data Space BioPAX Molecular Interactions Pro:Pro  All:All PSI Biochemical Reactions SBML, CellML Regulatory Pathways Low Detail  High Detail Genetic Interactions Interaction  Networks Molecular  Non-molecular Pro:Pro  TF:Gene  Genetic Metabolic  Pathways Low Detail  High Detail Database Exchange Formats Simulation Model Exchange Formats Small Molecules (CML) Rate Formulas
High Throughput Experimental Methods Expression, Interaction Data, Function, Protein modifications PubMed Existing Literature Multiple Pathway Databases Integration Nightmare! Microarray Two-Hybrid Mass Spectrometry Genetics
Goals ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Goals ,[object Object],[object Object],[object Object],[object Object]
Ontology  Syntax ,[object Object],[object Object],[object Object],[object Object],[object Object]
Syntax Languages ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Data stream organization ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
BioPAX Ontology : Root ,[object Object],[object Object],[object Object],[object Object]
BioPAX Ontology : Root ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],IS A HAS A
BioPAX Ontology: Parts ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
BioPAX Ontology: Interactions ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
BioPAX Ontology ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Use Cases ,[object Object],[object Object],[object Object],[object Object]
Representing Metabolic Data in BioPAX EcoCyc: Reaction BioPAX Class: Reaction Glucose-6-p to fructose-6-p Name 5.3.1.9 EC 0.4 kcal/mole Delta G <cml>fructose-6-phosphate</cml> Product <cml>glucose-6-phosphate</cml> Substrate 1 ID Reaction
Representing Metabolic Data in BioPAX (cont 1) EcoCyc: Enzyme Catalysis BioPAX Class: Catalysis 2 ID Low pH Inhibitors BioPAX ID=1 Reaction glucose-6-phosphate isomerase Enzyme Catalysis of glucose-6-p to fructose-6-p Name Catalysis
Representing Metabolic Data in BioPAX (cont 2) EcoCyc: Pathway BioPAX Class: Pathway 10 ID 1. BioPAX ID=2 2. BioPAX ID=4 3. BioPAX ID=6 etc. Interactions Glycolysis Name Pathway
Converting PSI Data into BioPAX PSI XML BioPAX Class: Molecular Association PDB:3HHR DB Source PMID = 1549776 Reference X-ray Crystallography Experiment Description hGHR binds to hGH Name hGRH; hGH Participants 1 ID Molecular Association
Signal Transduction DBs ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],CSNDB Pathway
BioPAX Subgroups ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
BioPAX Subgroups: Small Molecule ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
BioPAX Subgroups: States ,[object Object],[object Object]
BioPAX Subgroups: Examples ,[object Object]
Current Status ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Next Steps ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
How to Contribute ,[object Object],[object Object],[object Object],[object Object]
BioPAX Supporting Groups ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Weitere ähnliche Inhalte

Was ist angesagt?

PROTEIN STRUCTURE DATABANK
PROTEIN STRUCTURE DATABANKPROTEIN STRUCTURE DATABANK
PROTEIN STRUCTURE DATABANK
Malvika Bansal
 
Areejit Samal Emergence Alaska 2013
Areejit Samal Emergence Alaska 2013Areejit Samal Emergence Alaska 2013
Areejit Samal Emergence Alaska 2013
Areejit Samal
 
UniProt and the Semantic Web
UniProt and the Semantic WebUniProt and the Semantic Web
UniProt and the Semantic Web
Chimezie Ogbuji
 

Was ist angesagt? (20)

PIR & MINT
PIR & MINTPIR & MINT
PIR & MINT
 
PROTEIN STRUCTURE DATABANK
PROTEIN STRUCTURE DATABANKPROTEIN STRUCTURE DATABANK
PROTEIN STRUCTURE DATABANK
 
Protein Data Bank
Protein Data BankProtein Data Bank
Protein Data Bank
 
Cytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networksCytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networks
 
Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein function
 
Areejit Samal Emergence Alaska 2013
Areejit Samal Emergence Alaska 2013Areejit Samal Emergence Alaska 2013
Areejit Samal Emergence Alaska 2013
 
Scop database
Scop databaseScop database
Scop database
 
Protein database
Protein databaseProtein database
Protein database
 
Protein function prediction
Protein function predictionProtein function prediction
Protein function prediction
 
Literature Based Framework for Semantic Descriptions of e-Science resources
Literature Based Framework for Semantic Descriptions of e-Science resourcesLiterature Based Framework for Semantic Descriptions of e-Science resources
Literature Based Framework for Semantic Descriptions of e-Science resources
 
protein data bank
protein data bankprotein data bank
protein data bank
 
protein databases
 protein databases protein databases
protein databases
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
InterPro and InterProScan 5.0
InterPro and InterProScan 5.0InterPro and InterProScan 5.0
InterPro and InterProScan 5.0
 
Chibucos annot go_final
Chibucos annot go_finalChibucos annot go_final
Chibucos annot go_final
 
Gutell 122.chapter comparative analy_russell_2013
Gutell 122.chapter comparative analy_russell_2013Gutell 122.chapter comparative analy_russell_2013
Gutell 122.chapter comparative analy_russell_2013
 
UniProt and the Semantic Web
UniProt and the Semantic WebUniProt and the Semantic Web
UniProt and the Semantic Web
 
BITS: Basics of sequence analysis
BITS: Basics of sequence analysisBITS: Basics of sequence analysis
BITS: Basics of sequence analysis
 
Proteomics and protein-protein interaction
Proteomics  and protein-protein interactionProteomics  and protein-protein interaction
Proteomics and protein-protein interaction
 

Andere mochten auch

2013 dec 26_bgu_israel_seminar_l_luciano
2013 dec 26_bgu_israel_seminar_l_luciano2013 dec 26_bgu_israel_seminar_l_luciano
2013 dec 26_bgu_israel_seminar_l_luciano
Joanne Luciano
 
2013 dec bgu_israel_luciano_dec_22
2013 dec bgu_israel_luciano_dec_222013 dec bgu_israel_luciano_dec_22
2013 dec bgu_israel_luciano_dec_22
Joanne Luciano
 
Pal gov.tutorial4.session1 1.needforsharedsemantics
Pal gov.tutorial4.session1 1.needforsharedsemanticsPal gov.tutorial4.session1 1.needforsharedsemantics
Pal gov.tutorial4.session1 1.needforsharedsemantics
Mustafa Jarrar
 
2013 dec bgu_israel_luciano_day_1_dec_22
2013 dec bgu_israel_luciano_day_1_dec_222013 dec bgu_israel_luciano_day_1_dec_22
2013 dec bgu_israel_luciano_day_1_dec_22
Joanne Luciano
 
How Bio Ontologies Enable Open Science
How Bio Ontologies Enable Open ScienceHow Bio Ontologies Enable Open Science
How Bio Ontologies Enable Open Science
drnigam
 
2013 dec bgu_israel_luciano_day_3_dec_25
2013 dec bgu_israel_luciano_day_3_dec_252013 dec bgu_israel_luciano_day_3_dec_25
2013 dec bgu_israel_luciano_day_3_dec_25
Joanne Luciano
 

Andere mochten auch (12)

2013 dec 26_bgu_israel_seminar_l_luciano
2013 dec 26_bgu_israel_seminar_l_luciano2013 dec 26_bgu_israel_seminar_l_luciano
2013 dec 26_bgu_israel_seminar_l_luciano
 
Luciano pr 08-849_ontology_evaluation_methods_metrics
Luciano pr 08-849_ontology_evaluation_methods_metricsLuciano pr 08-849_ontology_evaluation_methods_metrics
Luciano pr 08-849_ontology_evaluation_methods_metrics
 
2013 dec bgu_israel_luciano_dec_22
2013 dec bgu_israel_luciano_dec_222013 dec bgu_israel_luciano_dec_22
2013 dec bgu_israel_luciano_dec_22
 
06063028 face page
06063028 face page06063028 face page
06063028 face page
 
Ontology development and use for efficient information input and retrieval
Ontology development and use for efficient information input and retrievalOntology development and use for efficient information input and retrieval
Ontology development and use for efficient information input and retrieval
 
Pal gov.tutorial4.session1 1.needforsharedsemantics
Pal gov.tutorial4.session1 1.needforsharedsemanticsPal gov.tutorial4.session1 1.needforsharedsemantics
Pal gov.tutorial4.session1 1.needforsharedsemantics
 
Ontology Support for Influenza and Surveillance
Ontology Support for Influenza and Surveillance Ontology Support for Influenza and Surveillance
Ontology Support for Influenza and Surveillance
 
2013 dec bgu_israel_luciano_day_1_dec_22
2013 dec bgu_israel_luciano_day_1_dec_222013 dec bgu_israel_luciano_day_1_dec_22
2013 dec bgu_israel_luciano_day_1_dec_22
 
Bussolon, Betti: Conceptualize once, design anywhere
Bussolon, Betti: Conceptualize once, design anywhereBussolon, Betti: Conceptualize once, design anywhere
Bussolon, Betti: Conceptualize once, design anywhere
 
How Bio Ontologies Enable Open Science
How Bio Ontologies Enable Open ScienceHow Bio Ontologies Enable Open Science
How Bio Ontologies Enable Open Science
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
2013 dec bgu_israel_luciano_day_3_dec_25
2013 dec bgu_israel_luciano_day_3_dec_252013 dec bgu_israel_luciano_day_3_dec_25
2013 dec bgu_israel_luciano_day_3_dec_25
 

Ähnlich wie Bio onttalk 30minutes-june2003[1]

Communications
CommunicationsCommunications
Communications
somasushma
 
100505 koenig biological_databases
100505 koenig biological_databases100505 koenig biological_databases
100505 koenig biological_databases
Meetika Gupta
 
Proteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomicsProteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomics
Claudine83
 

Ähnlich wie Bio onttalk 30minutes-june2003[1] (20)

Experiences with logic programming in bioinformatics
Experiences with logic programming in bioinformaticsExperiences with logic programming in bioinformatics
Experiences with logic programming in bioinformatics
 
Communications
CommunicationsCommunications
Communications
 
Ondex: Data integration and visualisation
Ondex: Data integration and visualisationOndex: Data integration and visualisation
Ondex: Data integration and visualisation
 
100505 koenig biological_databases
100505 koenig biological_databases100505 koenig biological_databases
100505 koenig biological_databases
 
BITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequences
 
Gen bank
Gen bankGen bank
Gen bank
 
Major biological nucleotide databases
Major biological nucleotide databasesMajor biological nucleotide databases
Major biological nucleotide databases
 
Protein database
Protein databaseProtein database
Protein database
 
Ismb2009
Ismb2009Ismb2009
Ismb2009
 
Algal Functional Annotation Tool
Algal Functional Annotation ToolAlgal Functional Annotation Tool
Algal Functional Annotation Tool
 
Dynamic Semantic Metadata in Biomedical Communications
Dynamic Semantic Metadata in Biomedical CommunicationsDynamic Semantic Metadata in Biomedical Communications
Dynamic Semantic Metadata in Biomedical Communications
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Bioinformatics final
Bioinformatics finalBioinformatics final
Bioinformatics final
 
Modularity and evolvability
Modularity and evolvabilityModularity and evolvability
Modularity and evolvability
 
Analysis with biological pathways:
Analysis with biological pathways: Analysis with biological pathways:
Analysis with biological pathways:
 
Systems biology and biotechnology of Streptomyces species for the production ...
Systems biology and biotechnology of Streptomyces species for the production ...Systems biology and biotechnology of Streptomyces species for the production ...
Systems biology and biotechnology of Streptomyces species for the production ...
 
Proteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomicsProteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomics
 
Huwang-2-7.ppt
Huwang-2-7.pptHuwang-2-7.ppt
Huwang-2-7.ppt
 
Connecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics InstituteConnecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics Institute
 
Interactomics, Integromics to Systems Biology: Next Animal Biotechnology Fron...
Interactomics, Integromics to Systems Biology: Next Animal Biotechnology Fron...Interactomics, Integromics to Systems Biology: Next Animal Biotechnology Fron...
Interactomics, Integromics to Systems Biology: Next Animal Biotechnology Fron...
 

Mehr von Joanne Luciano

06317731 Patent page 1
06317731 Patent page 106317731 Patent page 1
06317731 Patent page 1
Joanne Luciano
 
The Translational Medicine
The Translational MedicineThe Translational Medicine
The Translational Medicine
Joanne Luciano
 

Mehr von Joanne Luciano (12)

Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020
 
Indiana University 2018 SICE summer camp slides
Indiana University 2018 SICE summer camp slidesIndiana University 2018 SICE summer camp slides
Indiana University 2018 SICE summer camp slides
 
Why are some websites successful (at behavioral change) Informs International...
Why are some websites successful (at behavioral change) Informs International...Why are some websites successful (at behavioral change) Informs International...
Why are some websites successful (at behavioral change) Informs International...
 
The General Ontology Evaluation Framework (GOEF) & the I-Choose Use Case A ...
The General Ontology Evaluation Framework (GOEF) & the I-Choose Use CaseA ...The General Ontology Evaluation Framework (GOEF) & the I-Choose Use CaseA ...
The General Ontology Evaluation Framework (GOEF) & the I-Choose Use Case A ...
 
Luciano informs healthcare_2015 Nashville, TN USA July 30 2015
Luciano informs healthcare_2015 Nashville, TN USA July 30 2015Luciano informs healthcare_2015 Nashville, TN USA July 30 2015
Luciano informs healthcare_2015 Nashville, TN USA July 30 2015
 
Translational Medicine: Patterns of Response to Antidepressant Treatment and ...
Translational Medicine: Patterns of Response to Antidepressant Treatment and ...Translational Medicine: Patterns of Response to Antidepressant Treatment and ...
Translational Medicine: Patterns of Response to Antidepressant Treatment and ...
 
Amia tbi 2010_pmi_luciano.ppt
Amia tbi 2010_pmi_luciano.pptAmia tbi 2010_pmi_luciano.ppt
Amia tbi 2010_pmi_luciano.ppt
 
06317731 Patent page 1
06317731 Patent page 106317731 Patent page 1
06317731 Patent page 1
 
Bio it 2005_rdf_workshop05
Bio it 2005_rdf_workshop05Bio it 2005_rdf_workshop05
Bio it 2005_rdf_workshop05
 
Luciano phddefense
Luciano phddefenseLuciano phddefense
Luciano phddefense
 
Luciano pr 08-849_ontology_evaluation_methods_metrics
Luciano pr 08-849_ontology_evaluation_methods_metricsLuciano pr 08-849_ontology_evaluation_methods_metrics
Luciano pr 08-849_ontology_evaluation_methods_metrics
 
The Translational Medicine
The Translational MedicineThe Translational Medicine
The Translational Medicine
 

Bio onttalk 30minutes-june2003[1]

  • 1. BioPAX A Data Exchange Format for Biological Pathways BioPAX Workgroup www.biopax.org
  • 2.
  • 3. Exchange Formats in the Pathway Data Space BioPAX Molecular Interactions Pro:Pro All:All PSI Biochemical Reactions SBML, CellML Regulatory Pathways Low Detail High Detail Genetic Interactions Interaction Networks Molecular Non-molecular Pro:Pro TF:Gene Genetic Metabolic Pathways Low Detail High Detail Database Exchange Formats Simulation Model Exchange Formats Small Molecules (CML) Rate Formulas
  • 4. High Throughput Experimental Methods Expression, Interaction Data, Function, Protein modifications PubMed Existing Literature Multiple Pathway Databases Integration Nightmare! Microarray Two-Hybrid Mass Spectrometry Genetics
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16. Representing Metabolic Data in BioPAX EcoCyc: Reaction BioPAX Class: Reaction Glucose-6-p to fructose-6-p Name 5.3.1.9 EC 0.4 kcal/mole Delta G <cml>fructose-6-phosphate</cml> Product <cml>glucose-6-phosphate</cml> Substrate 1 ID Reaction
  • 17. Representing Metabolic Data in BioPAX (cont 1) EcoCyc: Enzyme Catalysis BioPAX Class: Catalysis 2 ID Low pH Inhibitors BioPAX ID=1 Reaction glucose-6-phosphate isomerase Enzyme Catalysis of glucose-6-p to fructose-6-p Name Catalysis
  • 18. Representing Metabolic Data in BioPAX (cont 2) EcoCyc: Pathway BioPAX Class: Pathway 10 ID 1. BioPAX ID=2 2. BioPAX ID=4 3. BioPAX ID=6 etc. Interactions Glycolysis Name Pathway
  • 19. Converting PSI Data into BioPAX PSI XML BioPAX Class: Molecular Association PDB:3HHR DB Source PMID = 1549776 Reference X-ray Crystallography Experiment Description hGHR binds to hGH Name hGRH; hGH Participants 1 ID Molecular Association
  • 20.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.

Hinweis der Redaktion

  1. This slide shows the difference between exchange formats in the pathway data space. Pathway databases and tools are not considered here, although each one is important and addresses specific use cases. BioPAX level 1 is broad and shallow allowing us to quickly build a format that can represent most the existing data types in databases today. Details will be added in subsequent levels. This is a practical approach.
  2. Why create a pathway exchange standard? There is a dire need in the community for the existing pathway databases to standardize on a common exchange format!!! Existing formats for pathways, such as SBML, CellML do not cater to databases needs. E.g. databases need to store extensive information is a structured way about database accession numbers, experimental description, copyright statements, etc. No current format provides this. Also, lots of data is being produced, as shown here that needs to be integrated. Power of analysis software increases with the size, quality of the data set it is run on
  3. Also many other databases and tools have been published e.g. Patika We would like to learn from these as well. Importantly: An exchange format needs to be a compatible superset of the databases, thus BioPAX will be copying certain structures from the databases in order to support them. A good reaction to BioPAX from a group that has developed their own ontology is that BioPAX looks like their specification. Designed by the databases for themselves (DB-DB exchange) and for users.
  4. Encapsulation: All proteins, small molecules, etc. should be able to be stored in a BioPAX record so that people don’t have to fish around for a Swiss-Prot entry when they want information about a protein. This is not space efficient, so is optional – you can choose at download time whether you want the explicit form or not. Compatible: We specifically want to be file format compatible with PSI. We want to use the same constructs as SBML where possible to maintain compatibility there. CellML can be converted to SBML, so there is already some aspect of compatibility there, Flexible: this is extremely important for community acceptance because people will not change their representation methods to enable data exchange. Thus, BioPAX will not solve the problem of semantic matching between ontologies, but it is a very good first step towards that goal, towards the goal of having everyone speak the same language (this may not be possible, though, to encompass all cases because one language may not be able to solve everyone’s problems).
  5. One way to translate between XML based languages is to use XSLT, which is a standard method of transforming one XML document into another. (XSLT = XML Stylesheet Language Transformation) XML Schema is much more widely used. OWL will likely become the standard method for ontology development used in the knowledge representation community and the basis for the proposed semantic web. Some members of BioPAX require the types of ontology features only present in OWL. Problems we have with OWL: current lack of support, not being able to constrain slots so that they have more specific names or aliases going down the class hierarchy
  6. Like the PSI format, BioPAX will likely have a choice of representation to cater to different needs. For instance, the user can choose a compact representation or a full representation at download time. BioPAX will be able to support both representations.
  7. We may end up calling parts ‘Interactor’ instead of ‘Part’ to be compatible with the PSI format Pathway is a name for a set of interactions. The cell is really a large network, but we tend to organize the network into pathways for our own understanding of the very complex network. For this reason, it is important to be able to have a set of interactions and give it a name (and other description)
  8. This list is extensible, so we can add more later. We have defined our scope to be molecular and cellular biology. For instance, you could add ‘organism’ to allow encoding of food webs or host-pathogen interaction networks on a high level of spatial scale or you could add atoms and subatomic particles to describe chemical bonds at a low level of spatial scale, but these are currently out of our scope.
  9. This list is extensible if we choose to add other classes of interactions. This list was the most difficult to develop. We initially broke all interactions down to ‘directed’ and ‘undirected’, which is a classification by direction and is nice and simple. This did not capture enough biology for our liking so we moved to this list, which captures good parts of BIND, BioCyc, WIT, aMAZE and Patika ontologies, thus we should be able to encode and exchange data between these and other resources. Genetic interactions are in a separate branch of interactions because they are fundamentally different than physical interactions. They have no direct basis in the physical world, only in relation to the genetic concepts of genotype and phenotype. Combinations of genetic interactions can be used to infer physical pathways and interactions.
  10. This slide puts it all together. Also shows how we will implement. XML Schema is also simpler to understand. Good editors and tools available. OWL provides a more in-depth class structure, constraints, etc. – allows you to develop a real ontology in the AI/knowledge representation field of computer science sense. DBs that support OWL have powerful data query ability that makes use of the ontology to make logical inferences to improve query results. Problems with OWL: not yet standardized, not many tools available, not many people using it or related languages.
  11. A scientist studies particular Pathway. She is interested if newly discovered compound or its analogs can be connected to the Pathway. This requires identification of possible structural or functional analogs and cross-database search for potential connection to the Pathway. This is one of the goals of ChemBank --- siRNA, knock-outs   A scientist is interested in knocking out a particular physiological function. He knows several genes reported to be involved. He constructs a total metabolic/signal transduction/gene activation network of an organism. His task is to identify if these genes 1. Can be isolated as functional blocks, modules or cluster in the network topology; 2. What is the minimum number of compounds, which need to be removed to isolate this block of genes; 3. If function is still undisrupted – go back to experiment and try to identify compensatory pathways.
  12. Most of the data present in the BioCyc and WIT databases is metabolic pathway information. This and the following examples show that there are multiple ways to represent this data and BioPAX must be flexible to allow each representation. Remember, this is an exchange format, so we want to optimize data exchange. It seems that mandating that everyone agree to change their representations to a single model can not work. The reason for this is that each model is optimized to its specific task and people will not want to use an inefficient data model for their day to day work. This example is the second step in glycolysis and can be represented as a BioPAX reaction. Metabolic pathway data is important because it is well known for a variety of species and is present in the best developed pathway resources. Note that the enzyme is not represented here. Since multiple enzymes can catalyze the same reaction in different or even the same contexts, enzyme catalysis must be decoupled from biochemical reaction.
  13. This example shows the enzyme catalysis step. The catalysis and the reaction classes shown here together represent this glycolysis step. This is very similar to how this data is represented in BioCyc, aMAZE and other databases.
  14. The entire glycolysis pathway can be represented as a set of catalysis and reaction interactions. The pathway can then be given the name ‘Glycolysis’
  15. This example shows how protein-protein interactions are represented in BioPAX. This data is important because it is being generated at a very large rate in the Proteomics field and is providing much insight into how signal transduction is organized in the cell – the basis of diseases such as cancer, diabetes and Alzheimer&apos;s. The PSI format already exists to exchange protein interaction information. It has been ratified and will be supported by the major protein interaction databases. It makes a lot of sense for BioPAX to be compatible with this existing format, so our representation of protein interactions is identical to PSI.
  16. This example shows one way how signal transduction pathways can be represented. It is similar to how metabolic pathways are represented, although signal transduction can be represented in other ways, such as just a list of protein interactions, without any reference to the underlying biochemical mechanisms. This is an accepted high level representation because it is efficient and people in the signal transduction field can easily understand this representation because they share a significant amount of common knowledge about these patwhays.
  17. Having subgroups allows us to have a good balance between representation and size in BioPAX. Large size slows discussion progress, but can be very representative, so subgroups involve many, but groups are small.
  18. No comprehensive small molecule database exists, but ChemBank looks like it will become the de facto standard if it continues its current rate of development. The small molecule subgroup is also in contact with ChemBank to raise issues related to BioPAX, such as having unique and stable accession numbers for small molecules in ChemBank.
  19. The states subgroup currently has reached a recommendation stage for BioPAX, which now needs to be turned into a document and a part of the ontology.
  20. Examples are important to make sure we are not missing being able to represent valuable data. We are currently optimizing our development time to represent abundant data first. Although we must think about all types of data so that we can remain extensible.
  21. This is mostly a volunteer effort, so no one is working on this full-time. We are satisfied with progress so far, although it could be faster.
  22. Implementing the ontology in OWL will be easy when OWL support is available in GKB. Otherwise it will be very difficult because there is currently no OWL editor available. XML Schema implementation involves augmenting the PSI format. This has been done once, but needs to be updated.
  23. Since BioPAX will be compatible with PSI, if you have only protein-protein interactions to make available, you should use that format: more details at http://psidev.sf.net
  24. Many other people have contributed in discussions in person and on the mailing list and we would like to thank them as well.