SlideShare ist ein Scribd-Unternehmen logo
1 von 67
Downloaden Sie, um offline zu lesen
Res Potentia as a route to
understanding function
and evolution of cellular networks
Adam Pah
NetSci
June 21, 2012
1
Where do we stand and how can we do better?
2
We are generating
biological data faster
than ever
Where do we stand and how can we do better?
But generating is only
one part, we still have
to convert that to
actual usable knowledge
2
Knowledge
We are generating
biological data faster
than ever
Where do we stand and how can we do better?
But generating is only
one part, we still have
to convert that to
actual usable knowledge
2
KnowledgeData
We are generating
biological data faster
than ever
Where do we stand and how can we do better?
But generating is only
one part, we still have
to convert that to
actual usable knowledge
2
KnowledgeData
Knowledge
We are generating
biological data faster
than ever
Why study metabolism?
3
• My goal is to create a generalizable framework for
understanding cellular networks
• I use metabolism because:
Why study metabolism?
3
• My goal is to create a generalizable framework for
understanding cellular networks
• I use metabolism because:
• The data fidelity, while not perfect, is far better
Why study metabolism?
3
• My goal is to create a generalizable framework for
understanding cellular networks
• I use metabolism because:
• The data fidelity, while not perfect, is far better
• We can use metabolism as a test case to help
develop an understanding of cellular networks
Why study metabolism?
3
• My goal is to create a generalizable framework for
understanding cellular networks
• I use metabolism because:
• The data fidelity, while not perfect, is far better
• We can use metabolism as a test case to help
develop an understanding of cellular networks
• There is also the ability to produce metabolites
or chemicals that are of interest
Why study metabolism?
3
• My goal is to create a generalizable framework for
understanding cellular networks
Metabolic networks are constructed from the Kyoto
Encyclopedia of Genes and Genomes database for
each organism where:
How do we construct a metabolic network
• Metabolites are connected if they are a part of
the main reaction pair
Metabolic networks are constructed from the Kyoto
Encyclopedia of Genes and Genomes database for
each organism where:
How do we construct a metabolic network
• Metabolites are connected if they are a part of
the main reaction pair
• Substrates are connected to Products only.
Metabolic networks are constructed from the Kyoto
Encyclopedia of Genes and Genomes database for
each organism where:
How do we construct a metabolic network
• Metabolites are connected if they are a part of
the main reaction pair
• Substrates are connected to Products only.
Metabolic networks are constructed from the Kyoto
Encyclopedia of Genes and Genomes database for
each organism where:
How do we construct a metabolic network
UDP-Glucose + H2
O + 2 NAD+
UDP-Glucuronate + 2 NADH + 2 H+
• Metabolites are connected if they are a part of
the main reaction pair
• Substrates are connected to Products only.
Metabolic networks are constructed from the Kyoto
Encyclopedia of Genes and Genomes database for
each organism where:
How do we construct a metabolic network
UDP-Glucose + H2
O + 2 NAD+
UDP-Glucuronate + 2 NADH + 2 H+
UDP-Glucose + H2
O + 2 NAD+
UDP-Glucuronate + 2 NADH + 2 H+
• Metabolites are connected if they are a part of
the main reaction pair
• Substrates are connected to Products only.
Metabolic networks are constructed from the Kyoto
Encyclopedia of Genes and Genomes database for
each organism where:
How do we construct a metabolic network
UDP-Glucose + H2
O + 2 NAD+
UDP-Glucuronate + 2 NADH + 2 H+
UDP-Glucose + H2
O + 2 NAD+
UDP-Glucuronate + 2 NADH + 2 H+
UDP-Glucose UDP-Glucuronate
2 NAD+ 2 NADH
Looking at one organism
5
Methanococcus maripaludis
Looking at one organism
5
Methanococcus maripaludis
How do we construct a framework
6
Methanococcus maripaludis
Escherichia coli Homo sapiensArabidopsis thaliana
How do we construct a framework
Current knowledge
of Realm of actuals
‘Res Extenta’
6
Methanococcus maripaludis
Escherichia coli Homo sapiensArabidopsis thaliana
How do we construct a framework
Current knowledge
of Realm of actuals
‘Res Extenta’
Realm of Possibles
‘Res Potentia’
6
Methanococcus maripaludis
It can identify new features
7
It can identify new features
7
Increased emphasis
on metabolite roles
It can identify new features
7
Increased emphasis
on metabolite roles
It can identify new features
7
Increased emphasis
on metabolite roles
Putative metabolic
‘devices’
We can use this network to revise our knowledge
8
Methanococcus
maripaludis
We can use this network to revise our knowledge
8
Methanococcus
maripaludis
We can use this network to revise our knowledge
8
Methanococcus
maripaludis
Helping to sort out the bigger picture
9
How much of a need exists to correct databases?
10
In the course of 1 year for 979 organisms in the
Kyoto Encyclopedia of Genes and Genomes
Database:
• 88,000 metabolites have been added as
annotations
How much of a need exists to correct databases?
10
In the course of 1 year for 979 organisms in the
Kyoto Encyclopedia of Genes and Genomes
Database:
• 88,000 metabolites have been added as
annotations
• 31,000 metabolites that were annotated have
been removed
How much of a need exists to correct databases?
10
In the course of 1 year for 979 organisms in the
Kyoto Encyclopedia of Genes and Genomes
Database:
• 88,000 metabolites have been added as
annotations
• 31,000 metabolites that were annotated have
been removed
• Resulting in over 100 changes per organism
How much of a need exists to correct databases?
10
In the course of 1 year for 979 organisms in the
Kyoto Encyclopedia of Genes and Genomes
Database:
How can we make predictions?
11
For every reaction there is a set of enzyme sequences
that we can compare to each organismal set of proteins
to see how well that reaction ‘fits’
How can we make predictions?
11
For every reaction there is a set of enzyme sequences
that we can compare to each organismal set of proteins
to see how well that reaction ‘fits’
How can we make predictions?
11
For every reaction there is a set of enzyme sequences
that we can compare to each organismal set of proteins
to see how well that reaction ‘fits’
Protein1
Organism1
Protein2
Organism1
Protein3
Organism1
Protein4
Organism1
Organism1
proteins
How can we make predictions?
11
For every reaction there is a set of enzyme sequences
that we can compare to each organismal set of proteins
to see how well that reaction ‘fits’
Reaction1
(Annotated)
Protein1
Organism1
Protein2
Organism1
Protein3
Organism1
Protein4
Organism1
Organism1
proteins
Enzyme1
Organism1
Enzyme1
Organism2
Enzyme1
Organism3
Enzyme1
Organism4
Reaction1
enzymes
How can we make predictions?
12
Protein1
Organism1
Protein2
Organism1
Protein3
Organism1
Protein4
Organism1
Organism1
proteins
Enzyme1
Organism1
Enzyme1
Organism2
Enzyme1
Organism3
Enzyme1
Organism4
Reaction1
enzymes
How can we make predictions?
12
Protein1
Organism1
Protein2
Organism1
Protein3
Organism1
Protein4
Organism1
Organism1
proteins
Enzyme1
Organism1
Enzyme1
Organism2
Enzyme1
Organism3
Enzyme1
Organism4
Reaction1
enzymes
How can we make predictions?
12
Protein1
Organism1
Protein2
Organism1
Protein3
Organism1
Protein4
Organism1
Organism1
proteins
Enzyme1
Organism1
Enzyme1
Organism2
Enzyme1
Organism3
Enzyme1
Organism4
Reaction1
enzymes
Protein BLAST
for Enzyme Sequences
How can we make predictions?
13
For every reaction there is a set of enzyme sequences
that we can compare to each organismal set of proteins
to see how well that reaction ‘fits’
Reaction1
(Annotated)
Protein1
Organism1
Protein2
Organism1
Protein3
Organism1
Protein4
Organism1
Organism1
proteins
Enzyme1
Organism1
Enzyme1
Organism2
Enzyme1
Organism3
Enzyme1
Organism4
Reaction1
enzymes
0.0
Match
E-values
10-3
10-4
5.0
10-2
How can we make predictions?
14
For every reaction there is a set of enzyme sequences
that we can compare to each organismal set of proteins
to see how well that reaction ‘fits’
Protein1
Organism1
Protein2
Organism1
Protein3
Organism1
Protein4
Organism1
Organism1
proteins
Enzyme1
Organism1
Enzyme1
Organism2
Enzyme1
Organism3
Enzyme1
Organism4
Reaction1
enzymes
0.0
Match
E-values
10-3
10-4
5.0
10-2
0.0
0.2
0.4
0.6
0.8
1.0
Excellent
Matches
FractionofMatches
Poor
Matches
How can we make predictions?
14
For every reaction there is a set of enzyme sequences
that we can compare to each organismal set of proteins
to see how well that reaction ‘fits’
Reaction1
(Annotated)
Reaction2
(Unannotated)
0.0
0.2
0.4
0.6
0.8
1.0
Excellent
Matches
FractionofMatches
Poor
Matches
How can we make predictions?
14
For every reaction there is a set of enzyme sequences
that we can compare to each organismal set of proteins
to see how well that reaction ‘fits’
Reaction1
(Annotated)
Reaction2
(Unannotated)
0.0
0.2
0.4
0.6
0.8
1.0
Excellent
Matches
FractionofMatches
Poor
Matches
How can we make predictions?
15
For every reaction there is a set of enzyme sequences
that we can compare to each organismal set of proteins
to see how well that reaction ‘fits’
Repeat this for all 3328
reactions using 5.94 million
enzyme sequences in 873
organisms
0.0
0.2
0.4
0.6
0.8
1.0
Excellent
Matches
FractionofMatches
Poor
Matches
Picking an optimal threshold
16
0.0
0.2
0.4
0.6
0.8
1.0
Excellent
Matches
FractionofMatches
Poor
Matches
Picking an optimal threshold
16
0.0
0.2
0.4
0.6
0.8
1.0
Excellent
Matches
FractionofMatches
Poor
Matches
Picking an optimal threshold
16
0.0
0.2
0.4
0.6
0.8
1.0
Excellent
Matches
FractionofMatches
Poor
Matches
• We have one starting dataset, metabolic networks
from KEGG 2009
How do we validate our results?
17
• We have our predicted networks and its changes to
this dataset (Predicted Changes)
• We have one starting dataset, metabolic networks
from KEGG 2009
How do we validate our results?
17
• We have our predicted networks and its changes to
this dataset (Predicted Changes)
• I also have the entire KEGG dataset for 2 years
following that date (KEGG Changes)
• We have one starting dataset, metabolic networks
from KEGG 2009
How do we validate our results?
17
• We have our predicted networks and its changes to
this dataset (Predicted Changes)
• I also have the entire KEGG dataset for 2 years
following that date (KEGG Changes)
• We can then compare how well each set of changes
does in correcting the networks
• We have one starting dataset, metabolic networks
from KEGG 2009
How do we validate our results?
17
• We have our predicted networks and its changes to
this dataset (Predicted Changes)
• I also have the entire KEGG dataset for 2 years
following that date (KEGG Changes)
• We can then compare how well each set of changes
does in correcting the networks
• Ideally the networks should make sense and be as
connected as reasonably possible
• We have one starting dataset, metabolic networks
from KEGG 2009
How do we validate our results?
17
Validate by promoting connectedness
18
We can test and see how the actual changes in
the database do at completing and filling in gaps
in the networks
Validate by promoting connectedness
18
We can test and see how the actual changes in
the database do at completing and filling in gaps
in the networks
Validate by promoting connectedness
18
Gap Size
0.00
0.02
0.04
0.06
0.08
0.10
0.12
FractionofGapsFilled
KEGG Changes
Random
1 2 3 4 5
Predicted Changes
We can test and see how the actual changes in
the database do at completing and filling in gaps
in the networks
Validate by promoting connectedness
18
Gap Size
0.00
0.02
0.04
0.06
0.08
0.10
0.12
FractionofGapsFilled
KEGG Changes
Random
1 2 3 4 5
Predicted Changes
We can test and see how the actual changes in
the database do at completing and filling in gaps
in the networks
Validate by promoting connectedness
18
Gap Size
0.00
0.02
0.04
0.06
0.08
0.10
0.12
FractionofGapsFilled
KEGG Changes
Random
1 2 3 4 5
Predicted Changes
We can test and see how the actual changes in
the database do at completing and filling in gaps
in the networks
Validate by promoting connectedness
18
Gap Size
0.00
0.02
0.04
0.06
0.08
0.10
0.12
FractionofGapsFilled
KEGG Changes
Random
1 2 3 4 5
Predicted Changes
We can test and see how the actual changes in
the database do at completing and filling in gaps
in the networks
Validate by promoting connectedness
19
We can test and see how the actual changes in
the database create gaps
Validate by promoting connectedness
19
We can test and see how the actual changes in
the database create gaps
Validate by promoting connectedness
19
We can test and see how the actual changes in
the database create gaps
Validate by promoting connectedness
19
We can test and see how the actual changes in
the database create gaps
-0.1 -0.06 -0.02 0.02 0.06 0.1
RPF Predicted
Deletions
KEGG 2011
Deletions
Relative fraction of removed reactions
that create additional components
Validate by promoting connectedness
19
We can test and see how the actual changes in
the database create gaps
-0.1 -0.06 -0.02 0.02 0.06 0.1
RPF Predicted
Deletions
KEGG 2011
Deletions
Relative fraction of removed reactions
that create additional components
Considering reactions in the context of the Res
Potentia enhances the ability to correct and close
gaps in organismal networks
What did we learn
20
Considering reactions in the context of the Res
Potentia enhances the ability to correct and close
gaps in organismal networks
What did we learn
20
Now we can begin to
analyze and understand
more complex features
of these networks
Acknowledgements
• Luis Amaral
• Irmak Sirer, Pat McMullen, Sam Seaver, Erin
Sawardecker
With financial support from:
• Northwestern/NIH Biotechnology Training Grant
• Chicago Biomedical Consortium

Weitere ähnliche Inhalte

Ähnlich wie Pah res-potentia-netsci emailable-stagebuild

IGCSE Biology Edexcel (9-1) Biological molecules
IGCSE Biology Edexcel (9-1) Biological moleculesIGCSE Biology Edexcel (9-1) Biological molecules
IGCSE Biology Edexcel (9-1) Biological moleculesssnm88
 
The Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in BiologyThe Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in Biologyrobertstevens65
 
Biology Review
Biology ReviewBiology Review
Biology ReviewErin Mucci
 
Biology review
Biology reviewBiology review
Biology reviewErin Mucci
 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesConnected Data World
 
1biologyeocreviewnotes all
1biologyeocreviewnotes all1biologyeocreviewnotes all
1biologyeocreviewnotes allkonnerhouse
 
1PhylogeneticAnalysisHomeworkassignmentThisa.docx
1PhylogeneticAnalysisHomeworkassignmentThisa.docx1PhylogeneticAnalysisHomeworkassignmentThisa.docx
1PhylogeneticAnalysisHomeworkassignmentThisa.docxfelicidaddinwoodie
 
Proteomics a search tool for vaccines
Proteomics a search tool for vaccinesProteomics a search tool for vaccines
Proteomics a search tool for vaccinesLawrence Okoror
 
Cell Structure And Function
Cell Structure And FunctionCell Structure And Function
Cell Structure And Functionjrt004
 
Waksman Student Scholars Program Poster
Waksman Student Scholars Program PosterWaksman Student Scholars Program Poster
Waksman Student Scholars Program PosterWesley Kwong
 
Hsc biology 3.1: Gene expression, maintenance and repair of body tissue
Hsc biology 3.1:  Gene expression, maintenance and repair of body tissueHsc biology 3.1:  Gene expression, maintenance and repair of body tissue
Hsc biology 3.1: Gene expression, maintenance and repair of body tissueTemuge Namjilsuren
 
yeast two hybrid system
yeast two hybrid systemyeast two hybrid system
yeast two hybrid systemSheetal Mehla
 
Directed evolution
Directed evolutionDirected evolution
Directed evolutionIfrah Ishaq
 

Ähnlich wie Pah res-potentia-netsci emailable-stagebuild (20)

TAKS Objective 2
TAKS Objective 2TAKS Objective 2
TAKS Objective 2
 
soutenance
soutenancesoutenance
soutenance
 
IGCSE Biology Edexcel (9-1) Biological molecules
IGCSE Biology Edexcel (9-1) Biological moleculesIGCSE Biology Edexcel (9-1) Biological molecules
IGCSE Biology Edexcel (9-1) Biological molecules
 
The Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in BiologyThe Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in Biology
 
Biology Review
Biology ReviewBiology Review
Biology Review
 
Biology review
Biology reviewBiology review
Biology review
 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical Sciences
 
1biologyeocreviewnotes all
1biologyeocreviewnotes all1biologyeocreviewnotes all
1biologyeocreviewnotes all
 
1PhylogeneticAnalysisHomeworkassignmentThisa.docx
1PhylogeneticAnalysisHomeworkassignmentThisa.docx1PhylogeneticAnalysisHomeworkassignmentThisa.docx
1PhylogeneticAnalysisHomeworkassignmentThisa.docx
 
2014
20142014
2014
 
Biology eoc review 1
Biology eoc review 1Biology eoc review 1
Biology eoc review 1
 
Proteomics ppt
Proteomics pptProteomics ppt
Proteomics ppt
 
Proteomics a search tool for vaccines
Proteomics a search tool for vaccinesProteomics a search tool for vaccines
Proteomics a search tool for vaccines
 
Cell Structure And Function
Cell Structure And FunctionCell Structure And Function
Cell Structure And Function
 
Kegg
KeggKegg
Kegg
 
Waksman Student Scholars Program Poster
Waksman Student Scholars Program PosterWaksman Student Scholars Program Poster
Waksman Student Scholars Program Poster
 
Hsc biology 3.1: Gene expression, maintenance and repair of body tissue
Hsc biology 3.1:  Gene expression, maintenance and repair of body tissueHsc biology 3.1:  Gene expression, maintenance and repair of body tissue
Hsc biology 3.1: Gene expression, maintenance and repair of body tissue
 
yeast two hybrid system
yeast two hybrid systemyeast two hybrid system
yeast two hybrid system
 
Directed evolution
Directed evolutionDirected evolution
Directed evolution
 
Yeast two hybrid method
Yeast two hybrid methodYeast two hybrid method
Yeast two hybrid method
 

Mehr von Adam Pah

Why Python?
Why Python?Why Python?
Why Python?Adam Pah
 
Quest overview
Quest overviewQuest overview
Quest overviewAdam Pah
 
A quick overview of why to use and how to set up iPython notebooks for research
A quick overview of why to use and how to set up iPython notebooks for researchA quick overview of why to use and how to set up iPython notebooks for research
A quick overview of why to use and how to set up iPython notebooks for researchAdam Pah
 
Kaggle "Give me some credit" challenge overview
Kaggle "Give me some credit" challenge overviewKaggle "Give me some credit" challenge overview
Kaggle "Give me some credit" challenge overviewAdam Pah
 
D3 interactivity Linegraph basic example
D3 interactivity Linegraph basic exampleD3 interactivity Linegraph basic example
D3 interactivity Linegraph basic exampleAdam Pah
 
Mercurial Tutorial
Mercurial TutorialMercurial Tutorial
Mercurial TutorialAdam Pah
 

Mehr von Adam Pah (6)

Why Python?
Why Python?Why Python?
Why Python?
 
Quest overview
Quest overviewQuest overview
Quest overview
 
A quick overview of why to use and how to set up iPython notebooks for research
A quick overview of why to use and how to set up iPython notebooks for researchA quick overview of why to use and how to set up iPython notebooks for research
A quick overview of why to use and how to set up iPython notebooks for research
 
Kaggle "Give me some credit" challenge overview
Kaggle "Give me some credit" challenge overviewKaggle "Give me some credit" challenge overview
Kaggle "Give me some credit" challenge overview
 
D3 interactivity Linegraph basic example
D3 interactivity Linegraph basic exampleD3 interactivity Linegraph basic example
D3 interactivity Linegraph basic example
 
Mercurial Tutorial
Mercurial TutorialMercurial Tutorial
Mercurial Tutorial
 

Kürzlich hochgeladen

GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 

Kürzlich hochgeladen (20)

GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 

Pah res-potentia-netsci emailable-stagebuild

  • 1. Res Potentia as a route to understanding function and evolution of cellular networks Adam Pah NetSci June 21, 2012 1
  • 2. Where do we stand and how can we do better? 2 We are generating biological data faster than ever
  • 3. Where do we stand and how can we do better? But generating is only one part, we still have to convert that to actual usable knowledge 2 Knowledge We are generating biological data faster than ever
  • 4. Where do we stand and how can we do better? But generating is only one part, we still have to convert that to actual usable knowledge 2 KnowledgeData We are generating biological data faster than ever
  • 5. Where do we stand and how can we do better? But generating is only one part, we still have to convert that to actual usable knowledge 2 KnowledgeData Knowledge We are generating biological data faster than ever
  • 6. Why study metabolism? 3 • My goal is to create a generalizable framework for understanding cellular networks
  • 7. • I use metabolism because: Why study metabolism? 3 • My goal is to create a generalizable framework for understanding cellular networks
  • 8. • I use metabolism because: • The data fidelity, while not perfect, is far better Why study metabolism? 3 • My goal is to create a generalizable framework for understanding cellular networks
  • 9. • I use metabolism because: • The data fidelity, while not perfect, is far better • We can use metabolism as a test case to help develop an understanding of cellular networks Why study metabolism? 3 • My goal is to create a generalizable framework for understanding cellular networks
  • 10. • I use metabolism because: • The data fidelity, while not perfect, is far better • We can use metabolism as a test case to help develop an understanding of cellular networks • There is also the ability to produce metabolites or chemicals that are of interest Why study metabolism? 3 • My goal is to create a generalizable framework for understanding cellular networks
  • 11. Metabolic networks are constructed from the Kyoto Encyclopedia of Genes and Genomes database for each organism where: How do we construct a metabolic network
  • 12. • Metabolites are connected if they are a part of the main reaction pair Metabolic networks are constructed from the Kyoto Encyclopedia of Genes and Genomes database for each organism where: How do we construct a metabolic network
  • 13. • Metabolites are connected if they are a part of the main reaction pair • Substrates are connected to Products only. Metabolic networks are constructed from the Kyoto Encyclopedia of Genes and Genomes database for each organism where: How do we construct a metabolic network
  • 14. • Metabolites are connected if they are a part of the main reaction pair • Substrates are connected to Products only. Metabolic networks are constructed from the Kyoto Encyclopedia of Genes and Genomes database for each organism where: How do we construct a metabolic network UDP-Glucose + H2 O + 2 NAD+ UDP-Glucuronate + 2 NADH + 2 H+
  • 15. • Metabolites are connected if they are a part of the main reaction pair • Substrates are connected to Products only. Metabolic networks are constructed from the Kyoto Encyclopedia of Genes and Genomes database for each organism where: How do we construct a metabolic network UDP-Glucose + H2 O + 2 NAD+ UDP-Glucuronate + 2 NADH + 2 H+ UDP-Glucose + H2 O + 2 NAD+ UDP-Glucuronate + 2 NADH + 2 H+
  • 16. • Metabolites are connected if they are a part of the main reaction pair • Substrates are connected to Products only. Metabolic networks are constructed from the Kyoto Encyclopedia of Genes and Genomes database for each organism where: How do we construct a metabolic network UDP-Glucose + H2 O + 2 NAD+ UDP-Glucuronate + 2 NADH + 2 H+ UDP-Glucose + H2 O + 2 NAD+ UDP-Glucuronate + 2 NADH + 2 H+ UDP-Glucose UDP-Glucuronate 2 NAD+ 2 NADH
  • 17. Looking at one organism 5 Methanococcus maripaludis
  • 18. Looking at one organism 5 Methanococcus maripaludis
  • 19. How do we construct a framework 6 Methanococcus maripaludis
  • 20. Escherichia coli Homo sapiensArabidopsis thaliana How do we construct a framework Current knowledge of Realm of actuals ‘Res Extenta’ 6 Methanococcus maripaludis
  • 21. Escherichia coli Homo sapiensArabidopsis thaliana How do we construct a framework Current knowledge of Realm of actuals ‘Res Extenta’ Realm of Possibles ‘Res Potentia’ 6 Methanococcus maripaludis
  • 22. It can identify new features 7
  • 23. It can identify new features 7 Increased emphasis on metabolite roles
  • 24. It can identify new features 7 Increased emphasis on metabolite roles
  • 25. It can identify new features 7 Increased emphasis on metabolite roles Putative metabolic ‘devices’
  • 26. We can use this network to revise our knowledge 8 Methanococcus maripaludis
  • 27. We can use this network to revise our knowledge 8 Methanococcus maripaludis
  • 28. We can use this network to revise our knowledge 8 Methanococcus maripaludis
  • 29. Helping to sort out the bigger picture 9
  • 30. How much of a need exists to correct databases? 10 In the course of 1 year for 979 organisms in the Kyoto Encyclopedia of Genes and Genomes Database:
  • 31. • 88,000 metabolites have been added as annotations How much of a need exists to correct databases? 10 In the course of 1 year for 979 organisms in the Kyoto Encyclopedia of Genes and Genomes Database:
  • 32. • 88,000 metabolites have been added as annotations • 31,000 metabolites that were annotated have been removed How much of a need exists to correct databases? 10 In the course of 1 year for 979 organisms in the Kyoto Encyclopedia of Genes and Genomes Database:
  • 33. • 88,000 metabolites have been added as annotations • 31,000 metabolites that were annotated have been removed • Resulting in over 100 changes per organism How much of a need exists to correct databases? 10 In the course of 1 year for 979 organisms in the Kyoto Encyclopedia of Genes and Genomes Database:
  • 34. How can we make predictions? 11 For every reaction there is a set of enzyme sequences that we can compare to each organismal set of proteins to see how well that reaction ‘fits’
  • 35. How can we make predictions? 11 For every reaction there is a set of enzyme sequences that we can compare to each organismal set of proteins to see how well that reaction ‘fits’
  • 36. How can we make predictions? 11 For every reaction there is a set of enzyme sequences that we can compare to each organismal set of proteins to see how well that reaction ‘fits’ Protein1 Organism1 Protein2 Organism1 Protein3 Organism1 Protein4 Organism1 Organism1 proteins
  • 37. How can we make predictions? 11 For every reaction there is a set of enzyme sequences that we can compare to each organismal set of proteins to see how well that reaction ‘fits’ Reaction1 (Annotated) Protein1 Organism1 Protein2 Organism1 Protein3 Organism1 Protein4 Organism1 Organism1 proteins Enzyme1 Organism1 Enzyme1 Organism2 Enzyme1 Organism3 Enzyme1 Organism4 Reaction1 enzymes
  • 38. How can we make predictions? 12 Protein1 Organism1 Protein2 Organism1 Protein3 Organism1 Protein4 Organism1 Organism1 proteins Enzyme1 Organism1 Enzyme1 Organism2 Enzyme1 Organism3 Enzyme1 Organism4 Reaction1 enzymes
  • 39. How can we make predictions? 12 Protein1 Organism1 Protein2 Organism1 Protein3 Organism1 Protein4 Organism1 Organism1 proteins Enzyme1 Organism1 Enzyme1 Organism2 Enzyme1 Organism3 Enzyme1 Organism4 Reaction1 enzymes
  • 40. How can we make predictions? 12 Protein1 Organism1 Protein2 Organism1 Protein3 Organism1 Protein4 Organism1 Organism1 proteins Enzyme1 Organism1 Enzyme1 Organism2 Enzyme1 Organism3 Enzyme1 Organism4 Reaction1 enzymes Protein BLAST for Enzyme Sequences
  • 41. How can we make predictions? 13 For every reaction there is a set of enzyme sequences that we can compare to each organismal set of proteins to see how well that reaction ‘fits’ Reaction1 (Annotated) Protein1 Organism1 Protein2 Organism1 Protein3 Organism1 Protein4 Organism1 Organism1 proteins Enzyme1 Organism1 Enzyme1 Organism2 Enzyme1 Organism3 Enzyme1 Organism4 Reaction1 enzymes 0.0 Match E-values 10-3 10-4 5.0 10-2
  • 42. How can we make predictions? 14 For every reaction there is a set of enzyme sequences that we can compare to each organismal set of proteins to see how well that reaction ‘fits’ Protein1 Organism1 Protein2 Organism1 Protein3 Organism1 Protein4 Organism1 Organism1 proteins Enzyme1 Organism1 Enzyme1 Organism2 Enzyme1 Organism3 Enzyme1 Organism4 Reaction1 enzymes 0.0 Match E-values 10-3 10-4 5.0 10-2 0.0 0.2 0.4 0.6 0.8 1.0 Excellent Matches FractionofMatches Poor Matches
  • 43. How can we make predictions? 14 For every reaction there is a set of enzyme sequences that we can compare to each organismal set of proteins to see how well that reaction ‘fits’ Reaction1 (Annotated) Reaction2 (Unannotated) 0.0 0.2 0.4 0.6 0.8 1.0 Excellent Matches FractionofMatches Poor Matches
  • 44. How can we make predictions? 14 For every reaction there is a set of enzyme sequences that we can compare to each organismal set of proteins to see how well that reaction ‘fits’ Reaction1 (Annotated) Reaction2 (Unannotated) 0.0 0.2 0.4 0.6 0.8 1.0 Excellent Matches FractionofMatches Poor Matches
  • 45. How can we make predictions? 15 For every reaction there is a set of enzyme sequences that we can compare to each organismal set of proteins to see how well that reaction ‘fits’ Repeat this for all 3328 reactions using 5.94 million enzyme sequences in 873 organisms 0.0 0.2 0.4 0.6 0.8 1.0 Excellent Matches FractionofMatches Poor Matches
  • 46. Picking an optimal threshold 16 0.0 0.2 0.4 0.6 0.8 1.0 Excellent Matches FractionofMatches Poor Matches
  • 47. Picking an optimal threshold 16 0.0 0.2 0.4 0.6 0.8 1.0 Excellent Matches FractionofMatches Poor Matches
  • 48. Picking an optimal threshold 16 0.0 0.2 0.4 0.6 0.8 1.0 Excellent Matches FractionofMatches Poor Matches
  • 49. • We have one starting dataset, metabolic networks from KEGG 2009 How do we validate our results? 17
  • 50. • We have our predicted networks and its changes to this dataset (Predicted Changes) • We have one starting dataset, metabolic networks from KEGG 2009 How do we validate our results? 17
  • 51. • We have our predicted networks and its changes to this dataset (Predicted Changes) • I also have the entire KEGG dataset for 2 years following that date (KEGG Changes) • We have one starting dataset, metabolic networks from KEGG 2009 How do we validate our results? 17
  • 52. • We have our predicted networks and its changes to this dataset (Predicted Changes) • I also have the entire KEGG dataset for 2 years following that date (KEGG Changes) • We can then compare how well each set of changes does in correcting the networks • We have one starting dataset, metabolic networks from KEGG 2009 How do we validate our results? 17
  • 53. • We have our predicted networks and its changes to this dataset (Predicted Changes) • I also have the entire KEGG dataset for 2 years following that date (KEGG Changes) • We can then compare how well each set of changes does in correcting the networks • Ideally the networks should make sense and be as connected as reasonably possible • We have one starting dataset, metabolic networks from KEGG 2009 How do we validate our results? 17
  • 54. Validate by promoting connectedness 18 We can test and see how the actual changes in the database do at completing and filling in gaps in the networks
  • 55. Validate by promoting connectedness 18 We can test and see how the actual changes in the database do at completing and filling in gaps in the networks
  • 56. Validate by promoting connectedness 18 Gap Size 0.00 0.02 0.04 0.06 0.08 0.10 0.12 FractionofGapsFilled KEGG Changes Random 1 2 3 4 5 Predicted Changes We can test and see how the actual changes in the database do at completing and filling in gaps in the networks
  • 57. Validate by promoting connectedness 18 Gap Size 0.00 0.02 0.04 0.06 0.08 0.10 0.12 FractionofGapsFilled KEGG Changes Random 1 2 3 4 5 Predicted Changes We can test and see how the actual changes in the database do at completing and filling in gaps in the networks
  • 58. Validate by promoting connectedness 18 Gap Size 0.00 0.02 0.04 0.06 0.08 0.10 0.12 FractionofGapsFilled KEGG Changes Random 1 2 3 4 5 Predicted Changes We can test and see how the actual changes in the database do at completing and filling in gaps in the networks
  • 59. Validate by promoting connectedness 18 Gap Size 0.00 0.02 0.04 0.06 0.08 0.10 0.12 FractionofGapsFilled KEGG Changes Random 1 2 3 4 5 Predicted Changes We can test and see how the actual changes in the database do at completing and filling in gaps in the networks
  • 60. Validate by promoting connectedness 19 We can test and see how the actual changes in the database create gaps
  • 61. Validate by promoting connectedness 19 We can test and see how the actual changes in the database create gaps
  • 62. Validate by promoting connectedness 19 We can test and see how the actual changes in the database create gaps
  • 63. Validate by promoting connectedness 19 We can test and see how the actual changes in the database create gaps -0.1 -0.06 -0.02 0.02 0.06 0.1 RPF Predicted Deletions KEGG 2011 Deletions Relative fraction of removed reactions that create additional components
  • 64. Validate by promoting connectedness 19 We can test and see how the actual changes in the database create gaps -0.1 -0.06 -0.02 0.02 0.06 0.1 RPF Predicted Deletions KEGG 2011 Deletions Relative fraction of removed reactions that create additional components
  • 65. Considering reactions in the context of the Res Potentia enhances the ability to correct and close gaps in organismal networks What did we learn 20
  • 66. Considering reactions in the context of the Res Potentia enhances the ability to correct and close gaps in organismal networks What did we learn 20 Now we can begin to analyze and understand more complex features of these networks
  • 67. Acknowledgements • Luis Amaral • Irmak Sirer, Pat McMullen, Sam Seaver, Erin Sawardecker With financial support from: • Northwestern/NIH Biotechnology Training Grant • Chicago Biomedical Consortium