Pah res-potentia-netsci emailable-stagebuild

Res Potentia as a route to
understanding function
and evolution of cellular networks
Adam Pah
NetSci
June 21, 2012
1

Where do we stand and how can we do better?
2
We are generating
biological data faster
than ever

But generating is only
one part, we still have
to convert that to
actual usable knowledge
2
Knowledge
We are generating
than ever

to convert that to
2
KnowledgeData
We are generating
than ever

to convert that to
2
KnowledgeData
Knowledge
We are generating
than ever

Why study metabolism?
3
• My goal is to create a generalizable framework for
understanding cellular networks

• I use metabolism because:
3

• The data ﬁdelity, while not perfect, is far better
3

• We can use metabolism as a test case to help
develop an understanding of cellular networks
3

• We can use metabolism as a test case to help
develop an understanding of cellular networks
• There is also the ability to produce metabolites
or chemicals that are of interest
3

Metabolic networks are constructed from the Kyoto
Encyclopedia of Genes and Genomes database for
each organism where:
How do we construct a metabolic network

• Metabolites are connected if they are a part of
the main reaction pair

• Substrates are connected to Products only.

UDP-Glucose + H2
O + 2 NAD+
UDP-Glucuronate + 2 NADH + 2 H+

UDP-Glucose + H2
O + 2 NAD+
UDP-Glucose + H2
O + 2 NAD+

UDP-Glucose + H2
O + 2 NAD+
UDP-Glucose + H2
O + 2 NAD+
UDP-Glucose UDP-Glucuronate
2 NAD+ 2 NADH

Looking at one organism
5
Methanococcus maripaludis

How do we construct a framework
6

Escherichia coli Homo sapiensArabidopsis thaliana
Current knowledge
of Realm of actuals
‘Res Extenta’
6

Escherichia coli Homo sapiensArabidopsis thaliana
Current knowledge
of Realm of actuals
‘Res Extenta’
Realm of Possibles
‘Res Potentia’
6

It can identify new features
7

7
Increased emphasis
on metabolite roles

7
Increased emphasis
on metabolite roles
Putative metabolic
‘devices’

We can use this network to revise our knowledge
8
Methanococcus
maripaludis

Helping to sort out the bigger picture
9

How much of a need exists to correct databases?
10
In the course of 1 year for 979 organisms in the
Kyoto Encyclopedia of Genes and Genomes
Database:

• 88,000 metabolites have been added as
annotations
10
Database:

annotations
• 31,000 metabolites that were annotated have
been removed
10
Database:

annotations
• 31,000 metabolites that were annotated have
been removed
• Resulting in over 100 changes per organism
10
Database:

How can we make predictions?
11
For every reaction there is a set of enzyme sequences
that we can compare to each organismal set of proteins
to see how well that reaction ‘ﬁts’

11
Protein1
Organism1
Protein2
Organism1
Protein3
Organism1
Protein4
Organism1
Organism1
proteins

11
Reaction1
(Annotated)
Protein1
Organism1
Protein2
Organism1
Protein3
Organism1
Protein4
Organism1
Organism1
proteins
Enzyme1
Organism1
Enzyme1
Organism2
Enzyme1
Organism3
Enzyme1
Organism4
Reaction1
enzymes

12
Protein1
Organism1
Protein2
Organism1
Protein3
Organism1
Protein4
Organism1
Organism1
proteins
Enzyme1
Organism1
Enzyme1
Organism2
Enzyme1
Organism3
Enzyme1
Organism4
Reaction1
enzymes

12
Protein1
Organism1
Protein2
Organism1
Protein3
Organism1
Protein4
Organism1
Organism1
proteins
Enzyme1
Organism1
Enzyme1
Organism2
Enzyme1
Organism3
Enzyme1
Organism4
Reaction1
enzymes
Protein BLAST
for Enzyme Sequences

13
Reaction1
(Annotated)
Protein1
Organism1
Protein2
Organism1
Protein3
Organism1
Protein4
Organism1
Organism1
proteins
Enzyme1
Organism1
Enzyme1
Organism2
Enzyme1
Organism3
Enzyme1
Organism4
Reaction1
enzymes
0.0
Match
E-values
10-3
10-4
5.0
10-2

14
Protein1
Organism1
Protein2
Organism1
Protein3
Organism1
Protein4
Organism1
Organism1
proteins
Enzyme1
Organism1
Enzyme1
Organism2
Enzyme1
Organism3
Enzyme1
Organism4
Reaction1
enzymes
0.0
Match
E-values
10-3
10-4
5.0
10-2
0.0
0.2
0.4
0.6
0.8
1.0
Excellent
Matches
FractionofMatches
Poor
Matches

14
Reaction1
(Annotated)
Reaction2
(Unannotated)
0.0
0.2
0.4
0.6
0.8
1.0
Excellent
Matches
FractionofMatches
Poor
Matches

15
Repeat this for all 3328
reactions using 5.94 million
enzyme sequences in 873
organisms
0.0
0.2
0.4
0.6
0.8
1.0
Excellent
Matches
FractionofMatches
Poor
Matches

Picking an optimal threshold
16
0.0
0.2
0.4
0.6
0.8
1.0
Excellent
Matches
FractionofMatches
Poor
Matches

• We have one starting dataset, metabolic networks
from KEGG 2009
How do we validate our results?
17

• We have our predicted networks and its changes to
this dataset (Predicted Changes)
from KEGG 2009
17

• I also have the entire KEGG dataset for 2 years
following that date (KEGG Changes)
from KEGG 2009
17

• We can then compare how well each set of changes
does in correcting the networks
from KEGG 2009
17

• We can then compare how well each set of changes
does in correcting the networks
• Ideally the networks should make sense and be as
connected as reasonably possible
from KEGG 2009
17

Validate by promoting connectedness
18
We can test and see how the actual changes in
the database do at completing and ﬁlling in gaps
in the networks

18
Gap Size
0.00
0.02
0.04
0.06
0.08
0.10
0.12
FractionofGapsFilled
KEGG Changes
Random
1 2 3 4 5
Predicted Changes
the database do at completing and ﬁlling in gaps
in the networks

19
the database create gaps

19
the database create gaps
-0.1 -0.06 -0.02 0.02 0.06 0.1
RPF Predicted
Deletions
KEGG 2011
Deletions
Relative fraction of removed reactions
that create additional components

Considering reactions in the context of the Res
Potentia enhances the ability to correct and close
gaps in organismal networks
What did we learn
20

Considering reactions in the context of the Res
Potentia enhances the ability to correct and close
gaps in organismal networks
What did we learn
20
Now we can begin to
analyze and understand
more complex features
of these networks

Acknowledgements
• Luis Amaral
• Irmak Sirer, Pat McMullen, Sam Seaver, Erin
Sawardecker
With ﬁnancial support from:
• Northwestern/NIH Biotechnology Training Grant
• Chicago Biomedical Consortium

Pah res-potentia-netsci emailable-stagebuild

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Ähnlich wie Pah res-potentia-netsci emailable-stagebuild

Ähnlich wie Pah res-potentia-netsci emailable-stagebuild (20)

Mehr von Adam Pah

Mehr von Adam Pah (6)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Pah res-potentia-netsci emailable-stagebuild