Bioinformatics A Biased Overview

A {Biased} Overview of Bioinformatics with Examples Drawn from Our Own Work Philip E. Bourne Professor of Pharmacology UCSD [email_address] Bioinformatics - Overview

There Are Multiple Types of Informatics in the Life Sciences Bioinformatics - Overview Pharmacy Informatics Biomedical Informatics Bioinformatics Note: These are only representative examples Drug dosing Pharmacokinetics Pharmacy Information Systems EHR Decision support systems Hospital Information Systems Algorithms Genomics Proteomics Biological networks Systems Biology

There Are Multiple Types of Informatics in the Life Sciences Bioinformatics - Overview Pharmacy Informatics Biomedical Informatics Bioinformatics Controlled vocabularies Ontologies Literature searching Data management Pharmacogenomics Personalized medicine Note: These are only representative examples

Bioinformatics In One Slide Biological Experiment Data Information Knowledge Discovery Collect Characterize Compare Model Infer Sequence Structure Assembly Sub-cellular Cellular Organ Higher-life 90 05 Computing Power Sequencing Data 1 10 100 1000 10 5 95 00 Human Genome Project E.Coli Genome C.Elegans Genome 1 Small Genome/Mo. ESTs Yeast Genome Gene Chips Virus Structure Ribosome Model Metaboloic Pathway of E.coli Complexity Technology Brain Mapping Genetic Circuits Neuronal Modeling Cardiac Modeling Human Genome # People /Web Site 10 6 10 2 1 Virtual Communities 10 6 Blogs Facebook 1000 ’s GWAS The Omics Revolution Bioinformatics - Overview

Bioinformatics – One Definition ,[object Object],Bioinformatics - Overview

Biological Scales (Complexity) Bioinformatics - Overview Genomics Proteomics Protein-protein interactions Biological Networks Systems Biology We will look at an example of how bioinformatics is used at each scale

Some Thoughts on Genomic Data ,[object Object],[object Object],[object Object],[object Object],On the Future of Genomic Data Science 11 February 2011: vol. 331 no. 6018 728-729

Bioinformatics & Metagenomics ,[object Object],[object Object],[object Object],[object Object],Bioinformatics at Different Scales - Genomics Bioinformatics - Overview

Metagenomics: Early Results ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Bioinformatics at Different Scales - Genomics Bioinformatics - Overview

Metagenomics New Discoveries Environmental (red) vs. Currently Known PTPases (blue) Higher eukaryotes 1 2 3 4 Bioinformatics at Different Scales - Genomics Bioinformatics - Overview

Proteomics Bioinformatics - Overview

Its Not Just About Numbers its About Complexity Number of released entries Year Courtesy of the RCSB Protein Data Bank Bioinformatics at Different Scales - Proteomics Bioinformatics - Overview

Determining 3D Structures – The Impact of Bioinformatics Structural biology moves from being functionally driven to genomically driven Fill in protein fold space Robotics -ve data Software engineering Functional prediction Not necessarily Bioinformatics at Different Scales - Proteomics Bioinformatics - Overview Basic Steps Target Selection ,[object Object],[object Object],[object Object],[object Object],[object Object],Data Collection Structure Solution Structure Refinement Functional Annotation Publish

Bioinformatics at Different Scales - Proteomics Bioinformatics - Overview

Nature ’s Reductionism There are ~ 20 300 possible proteins >>>> all the atoms in the Universe ~20M protein sequences from UniProt/TrEMBL ~75,000 protein structures Yield ~1500 folds, ~2000 superfamilies, ~4000 families (SCOP 1.75) Using Protein Structure to Study Evolution

Structure Provides an Evolutionary Fingerprint Distribution among the three kingdoms as taken from SUPERFAMILY ,[object Object],1 153/14 9/1 21/2 310/0 645/49 29/0 68/0 Any genome / All genomes Using Protein Structure to Study Evolution

Method – Distance Determination Presence/Absence Data Matrix Distance Matrix Using Protein Structure to Study Evolution (FSF) SCOP SUPERFAMILY organisms C. intestinalis C. briggsae F. rubripes a.1.1 1 1 1 a.1.2 1 1 1 a.10.1 0 0 1 a.100.1 1 1 1 a.101.1 0 0 0 a.102.1 0 1 1 a.102.2 1 1 1 C. intestinalis C. briggsae F. rubripes C. intestinalis 0 101 109 C. briggsae 0 144 F. rubripes 0

If Structure is so Conserved is it a Useful Tool in the Study of Evolution? The Answer Would Appear to be Yes ,[object Object],Using Protein Structure to Study Evolution Yang, Doolittle & Bourne (2005) PNAS 102(2) 373-8

The Influence of Environment on Life Chris Dupont Scripps Institute of Oceanography UCSD DuPont, Yang, Palenik, Bourne. 2006 PNAS 103(47) 17822-17827 Using Protein Structure to Study Evolution

Consider the Distribution of Disulfide B onds among Folds ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],1 Using Protein Structure to Study Evolution

Evolution of the Earth ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Using Protein Structure to Study Evolution

[object Object],[object Object],Theoretical Levels of Trace Metals and Oxygen in the Deep Ocean Through Earth ’s History Replotted from Saito et al, 2003 Inorganica Chimica Acta 356: 308-318 Using Protein Structure to Study Evolution

The Gaia Hypothesis ,[object Object],James Lovelock Gaia (pronounced /'geɪ.ə/ or /'gaɪ.ə/) "land" or "earth", from the Greek Γαῖα ; is a Greek goddess personifying the Earth Using Protein Structure to Study Evolution

The Question ,[object Object],[object Object],[object Object],[object Object],Using Protein Structure to Study Evolution

Making the Metallome of Each Species – Can Only be Done from Structure and Requires Human Effort ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Using Protein Structure to Study Evolution

Levels of Ambiguity ,[object Object],[object Object],[object Object],[object Object],Using Protein Structure to Study Evolution

Superfamily Distribution As Well As Overall Content Has Changed Using Protein Structure to Study Evolution

Metal Binding Proteins are Not Consistent Across Superkingdoms Since these data are derived from current species they are independent of evolutionary events such as duplication, gene loss, horizontal transfer and endosymbiosis Using Protein Structure to Study Evolution

Power Laws: Fundamental Constants in the Evolution of Proteomes ,[object Object],van Nimwegen E (2006) in: Koonin EV, Wolf YI, Karev GP, (Ed.). Power laws, scale-free networks, and genome biology Using Protein Structure to Study Evolution

Why are the Power Laws Different for Each Superkingdom? ,[object Object],[object Object],[object Object],Using Protein Structure to Study Evolution

Do the Metallomes Contain Further Support for this Hypothesis? Using Protein Structure to Study Evolution

e - Transfer Proteins Same Broad Function, Same Metal, Different Chemistry Induced by the Environment? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Using Protein Structure to Study Evolution

Hypothesis ,[object Object],[object Object],[object Object],[object Object],[object Object],Using Protein Structure to Study Evolution

Bioinformatics in the Context of Drug Discovery Bioinformatics - Overview

Our Motivation ,[object Object],[object Object],[object Object],[object Object],Collins and Workman 2006 Nature Chemical Biology 2 689-700 Motivators

A Reverse Engineering Approach to Drug Discovery Across Gene Families Characterize ligand binding site of primary target (Geometric Potential) Identify off-targets by ligand binding site similarity (Sequence order independent profile-profile alignment) Extract known drugs or inhibitors of the primary and/or off-targets Search for similar small molecules Dock molecules to both primary and off-targets Statistics analysis of docking score correlations … Computational Methodology Xie and Bourne 2009 Bioinformatics 25(12) 305-312

The Problem with Tuberculosis ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Repositioning - The TB Story

The TB-Drugome ,[object Object],[object Object],[object Object],[object Object],A Multi-target/drug Strategy Kinnings et al 2010 PLoS Comp Biol 6(11): e1000976

1. Determine the TB Structural Proteome ,[object Object],284 1, 446 3, 996 2, 266 TB proteome homology models solved structures A Multi-target/drug Strategy Kinnings et al 2010 PLoS Comp Biol 6(11): e1000976

2. Determine all Known Drug Binding Sites in the PDB ,[object Object],[object Object],No. of drug binding sites Methotrexate Chenodiol Alitretinoin Conjugated estrogens Darunavir Acarbose A Multi-target/drug Strategy Kinnings et al 2010 PLoS Comp Biol 6(11): e1000976

Map 2 onto 1 – The TB-Drugome http://funsite.sdsc.edu/drugome/TB/ Similarities between the binding sites of M.tb proteins (blue), and binding sites containing approved drugs (red).

From a Drug Repositioning Perspective ,[object Object],[object Object],No. of potential TB targets raloxifene alitretinoin conjugated estrogens & methotrexate ritonavir testosterone levothyroxine chenodiol A Multi-target/drug Strategy Kinnings et al 2010 PLoS Comp Biol 6(11): e1000976

Top 5 Most Highly Connected Drugs Drug Intended targets Indications No. of connections TB proteins levothyroxine transthyretin, thyroid hormone receptor α & β -1, thyroxine-binding globulin, mu-crystallin homolog, serum albumin hypothyroidism, goiter, chronic lymphocytic thyroiditis, myxedema coma, stupor 14 adenylyl cyclase, argR , bioD, CRP/FNR trans. reg ., ethR , glbN , glbO, kasB , lrpA , nusA , prrA , secA1 , thyX , trans. reg. protein alitretinoin retinoic acid receptor RXR- α , β & γ , retinoic acid receptor α , β & γ -1&2, cellular retinoic acid-binding protein 1&2 cutaneous lesions in patients with Kaposi's sarcoma 13 adenylyl cyclase, aroG , bioD, bpoC, CRP/FNR trans. reg. , cyp125 , embR , glbN , inhA , lppX , nusA , pknE , purN conjugated estrogens estrogen receptor menopausal vasomotor symptoms, osteoporosis, hypoestrogenism, primary ovarian failure 10 acetylglutamate kinase, adenylyl cyclase, bphD , CRP/FNR trans. reg. , cyp121 , cysM, inhA , mscL , pknB , sigC methotrexate dihydrofolate reductase, serum albumin gestational choriocarcinoma, chorioadenoma destruens, hydatidiform mole, severe psoriasis, rheumatoid arthritis 10 acetylglutamate kinase, aroF , cmaA2 , CRP/FNR trans. reg. , cyp121 , cyp51 , lpd , mmaA4 , panC , usp raloxifene estrogen receptor, estrogen receptor β osteoporosis in post-menopausal women 9 adenylyl cyclase, CRP/FNR trans. reg., deoD, inhA, pknB , pknE , Rv1347c , secA1, sigC

Systems Biology & Drug Discovery Chang et al. 2010 Plos Comp. Biol. 6(9): e1000938 Bioinformatics - Overview

Bioinformatics & Patient Care Bioinformatics - Overview

7. Social Change Josh Sommer and Chordoma Disease http://fora.tv/2010/04/23/Sage_Commons_Josh_Sommer_Chordoma_Foundation#fullprogram

5. Personalized Medicine http://pharmacogenomics.ucsd.edu/

Additional Reading ,[object Object],Bioinformatics - Overview

Questions? [email_address] Bioinformatics - Overview

Bioinformatics A Biased Overview

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (12)

Similar to Bioinformatics A Biased Overview

Similar to Bioinformatics A Biased Overview (20)

More from Philip Bourne

More from Philip Bourne (20)

Recently uploaded

Recently uploaded (20)

Bioinformatics A Biased Overview

Editor's Notes