This document discusses systems biology approaches to studying cancer. It defines systems biology as studying organisms as interacting networks of genes, proteins, and reactions. Biological networks are constructed from different types of data and relationships. Integrating multiple data types into networks can provide a more complete understanding of cancer than single data types in isolation. Networks can be used to identify cancer driver genes, dysregulated pathways, and biomarkers for disease classification, understanding mechanisms, and drug development. While current biological networks are incomplete, systems approaches have already provided insights and are expected to be more powerful as networks become more comprehensive.
3. What is systems biology?
• Systems biology is the study of an
organism, viewed as an
integrated and interacting
network of genes, proteins and
biochemical reactions which give
rise to life.
• Networks organize and integrate
information at different levels to
create biologically meaningful
models.
• Networks formulate hypotheses
about biological function and
provide temporal and spatial
insights into dynamical changes.
Oltvai and Barabási, Science. 2002Hood and Tian, Genomics, Proteomics & Bioinformatics, 2012
3
4. How is a network constructed ?
Wang and Marcotte, J Proteomics. 2010
4
6. Moral of the Story
(from previous slide)
• Biological networks should not be used blindly
• Even a single organism can have multiple
types of networks
• The meaning or the edges in the network
(relationships) must be kept in mind while
analyzing the data
6
7. Characteristics of a Biological Network
Elgoyhen et al., Front. Syst. Neurosci. 2012
7
11. Why data integration is required in for
cancer studies ?
Ding et al. Hum. Mol. Genet. 2010
Studying cancer dataset in isolation
will produce an incomplete story
11
15. Cause Effect
Somatic Mutations
Structural Variations
Copy Number
Aberrations
Prioritize Candidate Driver Genes of Cancer
Gene Fusions
Alternative Splicing
DNA Methylation
? ?
Interaction Network
Gene Expression
miRNA Expression
Model
Strategies of Data Integration: Few Examples
Hypothesis:
Thus a perturbation in one
gene can be propagated
through the interactions,
and affect other genes in
the network.
15
17. Gene Marker Sets
• Examine genome-wide expression profiles
– Score individual genes for how well they discriminate
between different classes of disease
• Establish gene expression signature
– Problem: # genes >> # patients
17
18. Pathway Expression vs. PPI
Subnetwork as Marker
• Score known pathways for
coherence of gene
expression changes?
– Majority of human genes not
yet assigned to a definitive
pathway
18
• Large Protein-Protein
Interaction networks
recently became available
– Extract subnetworks from PPI
networks as markers
19. Chuang et al. Mol Syst Biol. 2007
• Subnetwork markers
correspond to the hallmarks
of cancer
• Subnetwork markers have
increased reproducibility
across data sets
• Subnetwork markers
increase the classification
accuracy of metastasis
• Subnetwork markers are
informative of non-
discriminative disease genes
Cho et al. PLoS Comput Biol. 2012
19
25. Conclusion
• Present knowledge of the cellular
map (interaction network) :: tip
of an iceberg
• Still with the incomplete map
system biology has been able to
produce a lot of success stories.
• System biology techniques &
methods will even be more
efficient, robust and more
reliable in the future.
• Maps will be just as important to
biological discoveries as they
were to the discoveries in the era
of Columbus
25
“Following the light of the sun, we left the
Old World.” –Christopher ColumbusFriend and Norman, Nat. Biotech. April 2013
Although the individual components are unique to a given organism, the topologic properties of cellular networks share surprising similarities with those of natural and social networks
A yeast transcription factor-binding network, composed of known transcription factor-binding data collected with large-scale ChIP–chip and small-scale experiments.(B) A yeast protein–protein interaction network, containing protein–protein interactions identified by yeast two-hybrid and protein complexes identified by affinity purification and mass spectrometry(C) A yeast phosphorylation network comprised primarily of in vitro phosphorylation events identified using protein microarrays(D) An E. coli metabolic network with 574 reactions and 473 metabolites colored according to their modules(E) A yeast genetic network constructed with synthetic lethal interactions using SGA analysis on eight yeast genes