SlideShare ist ein Scribd-Unternehmen logo
1 von 42
Downloaden Sie, um offline zu lesen
Issues for metabolomics and
       systems biology
              Douglas Kell
   School of Chemistry, University of Manchester,
           MANCHESTER M60 1QD, U.K.
        dbk@manchester.ac.uk
         http://dbkgroup.org/
    http://www.mib.ac.uk www.mcisb.org
What I can’t do now and would
            like to
Some facts I ‘know’ (i.e. think I can
           remember…)
• Epidemiologically, statins enhance
  longevity
• Cholesterol is barely a risk factor when
  within the normal range of 120-240 mg%
• Statins supposedly act (only) via HMG-
  CoA reductase to lower cholesterol
• Actually many have (and from the above
  logically must have) off-target effects
More ‘facts’
• Although originating as natural products,
  many/most statins can bear comparatively little
  structural relationships to them or to each other
• Are there QSAR-type relations between the
  various off-target effects and the drugs that cause
  them?




      lovastatin           atorvastatin
The software tool I want would
 integrate all of those questions by:
• Finding the facts from the literature (and the
  Web) by reading the articles ‘intelligently’
• Displaying and setting out the facts sensibly
• Allowing the QSARs directly from the papers
  as the structures and substructures would be
  ‘known’ (or knowlable via PubChem,
  DrugBank etc)
• Classify/cluster the off-target effects and the
  papers that described them (via TM and ML)
• Without me having to write any actual code
Westerhoff & Palsson NBT 22, 1249-52 (2004)




But despite everything science is in some ways
becoming LESS effective in an applied context
Declining numbers of drug launches




    Leeson & Springthorpe, NRDD 6, 881-890 (2007)
Drug Discovery/Development Pipeline
• Multifaceted, complicated, lengthy process


                                                                                                                                                                                                                                                   gyy
                                                                                                                                                                                                                                              o llo g
                                                                                                                                                                                                                                                  o
                                                                                                                                                                                                                 yy                          co
                                                                                                                                                                                                                                            ac
                                                                                                                                                                                       g y
                                                                                                                                                                                         y                     et
                                                                                                                                                                                                             ffe t                      rrm
                                                                                                                                                                                                                                          m
                                                                                                                                                                                                                                            a
                                                                                                                                                                                 ll llo g
                                                                                                                                                                                a o   o                   Sa
                                                                                                                                                                                                       ll S
                                                                                                                                                                                                            a
                                                                                                                                                                                                                                     haa
                                                                                                                                                                              c a co                                               P hy
                                                                                                                                                                           niic a c
                                                                                                                                                                         lii n ma                   ca
                                                                                                                                                                                                  iic a                            P y
                                                                                                                                                                                                                               all fett
                                                                                                                                                                        cl a
                                                                                                                                                                      --c arrm                 inn                          iic a a fe
                                                                                                                                                                                             clli
                                                                                                                                                                                           --c
                                                                                                                                                                                                                              c
                                                                                                                                                                                                                         iin S a
                                                   N



                                                                                                                                                                     e h
                                                                                                                                                                     e                                                     n S
                                                                                                                                                                  Prr P h                 e                             l
                                          HO                       OH



                                                                                                                                                                                          e                            Cl &
                                                       OH            HO          N




                                                                                                                                                                                       Prr
                                                                                                                          OH        OH    O



                                                                                                                                                                  P P
                                                   O               O



                                                                                                                                                                                                                       C &
                                                                                                       O
                                                                           O                                      N                           O-


                                                                                                                                                                                       P
                                               O               O
                                                                                                   N
                                                           O                                       H




                                                                                                                                                                                                                                             Products
                                                                                                                                F
                                                               OH
                                                                       O
                                                                                           O
                                                                                                                                                                  NH2
                                      OH
                                N N                                                                                                                NHCH3
                                               N N                               O    HN       N
                                                                                                   N
                                                                                                           H 2N       O
                                                                                                                                                                  C O 2H
                                                                                           N
                                N                      N                                                          S
                                               F                                                             O
                                                                               O 2S
                                                                                      N
                                                                                                                          N N
                                                                                           N                                        CF3
                                      F
                                                                                                                                                        Cl
                                                                                                                                                   Cl




                                           Discovery                                                                                                         Exploratory Development
                                                                                                                                                             Exploratory Development                                          Full Development
         Cl




                                           Discovery                                                                                                                                                                          Full Development
          O       O

 CH 3O                O

              N
              H   O

                          NH2




                                                                                                                                                                   Phase II
                                                                                                                                                                   Phase                Phase II
                                                                                                                                                                                        Phase II                           Phase III
                                                                                                                                                                                                                           Phase III


         0                                                                                                                                                    5                                                       10                                   15
Idea                                                                                                                                                                       12 -15 Years                                                                  Drug
                                                                                                                                                   Peter S. Dragovich, Pfizer
Attrition




Kola & Landis, NRDD 3, 711-5 (2004)
Issues of attrition

• PK/PD less of an issue in last decade
• Now mostly due to (i) lack of efficacy, (ii) toxicity
• Both problems are underpinned by the fact that
  drugs are typically first developed on the basis of
  molecular assays before being tested in the intact
  system
• These failures turn drug discovery – if it was not
  already – into a problem of systems biology
Nature Rev Drug Disc 7, 205-220
        (March 2008)
Poor correlation between different artificial
   membrane (Corti & PAMPA) assays




        Corti et al EJ Pharm Sci 7, 354-362 (2006)
Poor correlation between Caco-2 cells and
  artificial membrane (PAMPA) assays

                             Note axis scales




         Balimane et al., AAPS J 8, e1-e13 (2006)
Poor relationship between PAMPA
    permeability and log Ko/w




    Corti et al. EJ Pharm Sci 7, 354-362 (2006)
Poor relationship between Caco-2
    permeability and log Ko/w

                                      r2 = 0.097


THESE THEORIES OF DRUG UPTAKE
WERE BIOPHYSICAL, ‘LIPID-ONLY’
          THEORIES


     Corti et al. EJ Pharm Sci 7, 354-362 (2006)
Narcotics (‘general anaesthetics’)
• Potency also correlates with log P (up to a cut-
  off) (Meyer & Overton)
• Negligible structure-activity relationships
• Was assumed that they also act by a
  ‘biophysical’ mechanism by partitioning
  ‘nonspecifically’ into membrane and e.g.
  ‘squeezing’ nerve channels
• This too was a ‘lipid-only’ theory
• None of this now stands up
Anaesthetic potency does largely correlate with
 partitioning into membrane, suggesting (to
       many) a ‘lipid-only’ mechanism




          P. Seeman, Pharmacol Rev 24, 583-655 (1972)
But…narcotics inhibit luciferase, a soluble
protein, with the same potency with which
  they anaesthetise animals, over 5 logs!


              No lipid involved!




       Franks & Lieb, Nature 310, 599-601 (1984)
The structural basis is known



                                             Binding of
                                             bromoform to
                                             luciferase




Franks et al, Biophys J 75, 2205-11 (1998)
Halothane affects narcosis in part via a
        TREK-1 K+ channel




        Heurteaux et al. EMBO J 23, 2684-95 (2004)
How to integrate all this
  information with biological and
      physiological networks?
• One strategy is Integrative Systems
  Biology
One view of systems biology


           Experiment
               Theory
Computation/            Technology
Modelling
Bringing together metabolomics
   and systems biology models




Drug Discovery Today 11, 1085-1092 (2006)
There is a convergence between systems
   biology models from whole-genome
    reconstruction and the number of
experimental metabolome peaks (ca 3000
            for human serum)
The human metabolic network (1)



•   8 cellular compartments
•   2,712 compartment-specific metabolites
•   ~ 1,500 different chemical entities
•   1,496 genes
•   2,233 metabolic reactions (1,795 unique)
•   1,078 transport reactions (32.6%)
            PNAS 104, 1777-1782 (2007)
The human metabolic network (2)




• Not yet compartmentalised
• 2,823 reactions (incl 300 ‘orphans’), of which 2,215
  have disease associations, plus 1189 transport
  reactions and 457 exchange reactions
• 2,322 genes (1069 common with Palsson model)
        Molecular Systems Biology 3, 135 (2007)
Systems biology and modelling are
     all about representation
The main representation for systems
     biology models is SBML




                         www.sbml.org
VISUALISE                                     edit            Literature mining
                              create
  Layouts and views
        SBGN                                                              Store in dB
                            BIOCHEMICAL
 Overlays, dynamics
                          MODEL (assumed to
Model merging: (not)                                                     Compare with
  LEGO blocks                be in SBML)                                 other models

   Cheminformatic
        THERE
     analyses
                  ARE MANYRun, analyse THINGS THAT ONEfit to real
                            POSSIBLE          Compare with and
           MIGHT DO WITH THIS REPRESENTATION, AND and
                         (sensitivities, etc)   data (parameters
                                              variables) with constraints
             THESE ACTIONS CAN BE SEEN AS MODULES
Integrate various
     levels                 Store results of
                            manipulations
                                                           How to deal with fitting,
                                                            including as f(global
    LINK WORKFLOWS            Network Motif
                                                            parameters like pH)
                               discovery
     Soaplab, Taverna,
                                                          Automatic characterisation
     Web services, etc.      Optimal DoE for               of parameter space and
                            Sys Identification,               constraint checking
                            incl identifiability
BIOCHEMICAL
                MODEL (assumed to
                   be in SBML)
FEBS J 274, 5576-5585        Compare with and fit to real
                               data (parameters and
                             variables) with constraints



                                  4, 74-97
The Data Management Infrastructure of
 the Manchester Centre for Integrated
          Systems Biology

Norman Paton
University of Manchester
Capabilities
• We require software to support:
  – Data capture: Pedro.
  – Data access: Pierre.
  – Integration of data and analyses: Taverna.
Pipeline Pilot workflow




                      etc…
LITERATURE                                STORE NEW
             MINING               ANNOTATE          MODEL IN DB

 CREATE                   VISUALISE
 MODEL                                                  COMPARE WITH
                                                         ‘REAL’ DATA
           METABOLIC
          MODEL IN SBML

                                                   SCAN
                    RUN BASE MODEL              PARAMETER
   STORE                                          SPACE
  MODEL IN
     DB                               SENSITIVITY
                                       ANALYSES

                                          COMPARE
     DIFFERENT
                                           MODELS
     METABOLIC
    MODEL IN SBML                                            STORE
                                                        DIFFERENCES AS
                                                         NEW MODEL IN
SYSTEMS BIOLOGY WORKFLOWS                                      DB
Scientists                             Decoupled
                                       suppliers &
                                       consumers
                                tion
                             ora
                       o llab
                   C




           dge t
        wle en
     n o em
    K ag
    M an
              Science
‘Warehouse’ vs distributed workflows
• Different ‘modules’ developed in different labs can reside
  on different computers anywhere, and expose themselves as
  Web Services
• Labs can then specialise in what they are best at
• All that is then needed is an environment for enacting
  bioinformatic workflows by coupling together these service-
  oriented architectures
• One such is Taverna
• This is arguably the best way to combine metabolomic
  SBML models with metabolomic data, and is what are
  using at MCISB
Overall Architecture
                                                         Workflow
                                                        Repository

   Data                                    Analysis1
Integration
                                                          Model
   Using
Workflows                                               Repository
                                           Analysisn



              Consistent Web Service Interfaces



              Repository1   …       Repositoryn



              Experiment1   …      Experimentn
                                                       Consistent Web
                                                         Interfaces
The Taverna API consumer along with
    libSBML allows many of these
  transformations to be performed




Details: http://www.mcisb.org/software/taverna/libsbml/index.html
Relating Models to Expression
                                  Read gene
                                  names of
                                  enzymes from
                                  SBML model

Query maxd
transcriptome
database using
gene names


                                   Create new
                 Compute colour    SBMLmodel
                 for expression
                 readings
Visualise Models Using Cell Designer




    JC_C-0.07-1_Measurement   JC_N-0.07-1_Measurement
Potential Solutions
• Semantic annotation
• Chemical and bio-text mining
• RDF annotations – that can also be included
  within the SBML
• Integrated reasoning engine
• Allowing literature-based discovery
• But we still lack a proper and useful
  (bio)chemical ontology integrating roles,
  pathways, diseases, chemical (sub)structures,
  targets, etc.
• This last is probably the most damaging lack
  and thus most important need
Issues for metabolomics and
       systems biology
              Douglas Kell
   School of Chemistry, University of Manchester,
           MANCHESTER M60 1QD, U.K.
        dbk@manchester.ac.uk
         http://dbkgroup.org/
    http://www.mib.ac.uk www.mcisb.org

Weitere ähnliche Inhalte

Was ist angesagt?

SFE Program in Sri Lanka
SFE Program in Sri LankaSFE Program in Sri Lanka
SFE Program in Sri LankaAnup Soans
 
VocaLight Infrared Classroom Amplification Brochure
VocaLight Infrared Classroom Amplification BrochureVocaLight Infrared Classroom Amplification Brochure
VocaLight Infrared Classroom Amplification BrochureGailMaynard
 
Transition Sporting Equipment Report(Web)
Transition Sporting Equipment Report(Web)Transition Sporting Equipment Report(Web)
Transition Sporting Equipment Report(Web)Locus Research
 
CSU ShortCourse Biomass And Bioenergy Jan 2008 Mpm
CSU ShortCourse Biomass And Bioenergy Jan 2008 MpmCSU ShortCourse Biomass And Bioenergy Jan 2008 Mpm
CSU ShortCourse Biomass And Bioenergy Jan 2008 Mpmmpattonmallory
 
Implementing Advanced Access
Implementing Advanced AccessImplementing Advanced Access
Implementing Advanced Accessprimary
 
Media – audience profile
Media – audience profileMedia – audience profile
Media – audience profilelennylavers
 
Media – data analysis
Media – data analysisMedia – data analysis
Media – data analysislennylavers
 
Ifhp price report2010_comparativepricereport29112010
Ifhp price report2010_comparativepricereport29112010Ifhp price report2010_comparativepricereport29112010
Ifhp price report2010_comparativepricereport29112010Óscar Miranda
 

Was ist angesagt? (12)

SFE Program in Sri Lanka
SFE Program in Sri LankaSFE Program in Sri Lanka
SFE Program in Sri Lanka
 
VocaLight Infrared Classroom Amplification Brochure
VocaLight Infrared Classroom Amplification BrochureVocaLight Infrared Classroom Amplification Brochure
VocaLight Infrared Classroom Amplification Brochure
 
Transition Sporting Equipment Report(Web)
Transition Sporting Equipment Report(Web)Transition Sporting Equipment Report(Web)
Transition Sporting Equipment Report(Web)
 
CSU ShortCourse Biomass And Bioenergy Jan 2008 Mpm
CSU ShortCourse Biomass And Bioenergy Jan 2008 MpmCSU ShortCourse Biomass And Bioenergy Jan 2008 Mpm
CSU ShortCourse Biomass And Bioenergy Jan 2008 Mpm
 
Implementing Advanced Access
Implementing Advanced AccessImplementing Advanced Access
Implementing Advanced Access
 
The Singles Lifecycle
The Singles LifecycleThe Singles Lifecycle
The Singles Lifecycle
 
Media – audience profile
Media – audience profileMedia – audience profile
Media – audience profile
 
Media – data analysis
Media – data analysisMedia – data analysis
Media – data analysis
 
Diagonal paper
Diagonal paperDiagonal paper
Diagonal paper
 
Ifhp price report2010_comparativepricereport29112010
Ifhp price report2010_comparativepricereport29112010Ifhp price report2010_comparativepricereport29112010
Ifhp price report2010_comparativepricereport29112010
 
Ivesco feline vaccine comparison chart 2011
Ivesco feline vaccine comparison chart 2011Ivesco feline vaccine comparison chart 2011
Ivesco feline vaccine comparison chart 2011
 
Studio93 postcard[1]
Studio93 postcard[1]Studio93 postcard[1]
Studio93 postcard[1]
 

Ähnlich wie Issues for metabolomics and

art of presentation Map of Jamies Yam
art of presentation Map of Jamies Yamart of presentation Map of Jamies Yam
art of presentation Map of Jamies YamJamies Yam
 
High stakes world of Mobile Payments
High stakes world of Mobile PaymentsHigh stakes world of Mobile Payments
High stakes world of Mobile PaymentstxtNation
 
High stakes-world-of-mobile-payments-infographic
High stakes-world-of-mobile-payments-infographicHigh stakes-world-of-mobile-payments-infographic
High stakes-world-of-mobile-payments-infographicTyson Hackwood
 
Seattle Magazine 2012 Media Kit
Seattle Magazine 2012 Media KitSeattle Magazine 2012 Media Kit
Seattle Magazine 2012 Media Kitjkueber
 
International Trade Compliance Strategy Responsibility Matrix
International Trade Compliance Strategy Responsibility MatrixInternational Trade Compliance Strategy Responsibility Matrix
International Trade Compliance Strategy Responsibility MatrixGHY International
 
if you forget me
if you forget meif you forget me
if you forget menitish
 
U.S. Renewable Energy Market And Growth
U.S. Renewable Energy Market And GrowthU.S. Renewable Energy Market And Growth
U.S. Renewable Energy Market And GrowthBrookeHeaton
 
Technology use and educational performance
Technology use and educational performanceTechnology use and educational performance
Technology use and educational performanceFrancesc Pedró
 
Programme Assurance Questionnaire V1.4
Programme Assurance Questionnaire V1.4Programme Assurance Questionnaire V1.4
Programme Assurance Questionnaire V1.4stuart1403
 
Xero Advertisment - NZ Accountants Journal November 2007
Xero Advertisment - NZ Accountants Journal  November 2007Xero Advertisment - NZ Accountants Journal  November 2007
Xero Advertisment - NZ Accountants Journal November 2007Xero
 
Dental amalgam
Dental amalgamDental amalgam
Dental amalgamZirgi Rana
 
Expanded Game Experience
Expanded Game ExperienceExpanded Game Experience
Expanded Game Experienceaakoo
 
VMware vCloud Director and Nexus 1000V / Workload Mobility
VMware vCloud Director and Nexus 1000V / Workload MobilityVMware vCloud Director and Nexus 1000V / Workload Mobility
VMware vCloud Director and Nexus 1000V / Workload MobilitySal Lopez
 
M&A Integration Planning The Integration Of An Acquired Companys Legal De...
M&A Integration Planning The Integration Of An Acquired Companys Legal De...M&A Integration Planning The Integration Of An Acquired Companys Legal De...
M&A Integration Planning The Integration Of An Acquired Companys Legal De...Frank Fletcher
 

Ähnlich wie Issues for metabolomics and (20)

art of presentation Map of Jamies Yam
art of presentation Map of Jamies Yamart of presentation Map of Jamies Yam
art of presentation Map of Jamies Yam
 
High stakes world of Mobile Payments
High stakes world of Mobile PaymentsHigh stakes world of Mobile Payments
High stakes world of Mobile Payments
 
High stakes-world-of-mobile-payments-infographic
High stakes-world-of-mobile-payments-infographicHigh stakes-world-of-mobile-payments-infographic
High stakes-world-of-mobile-payments-infographic
 
Seattle Magazine 2012 Media Kit
Seattle Magazine 2012 Media KitSeattle Magazine 2012 Media Kit
Seattle Magazine 2012 Media Kit
 
International Trade Compliance Strategy Responsibility Matrix
International Trade Compliance Strategy Responsibility MatrixInternational Trade Compliance Strategy Responsibility Matrix
International Trade Compliance Strategy Responsibility Matrix
 
if you forget me
if you forget meif you forget me
if you forget me
 
U.S. Renewable Energy Market And Growth
U.S. Renewable Energy Market And GrowthU.S. Renewable Energy Market And Growth
U.S. Renewable Energy Market And Growth
 
Map A4ol
Map A4olMap A4ol
Map A4ol
 
Map A4ol
Map A4olMap A4ol
Map A4ol
 
5
55
5
 
Spiral Of Knowledge - 1967
Spiral Of Knowledge - 1967Spiral Of Knowledge - 1967
Spiral Of Knowledge - 1967
 
Social Media Branding by Dawn Jensen
Social Media Branding by Dawn JensenSocial Media Branding by Dawn Jensen
Social Media Branding by Dawn Jensen
 
Technology use and educational performance
Technology use and educational performanceTechnology use and educational performance
Technology use and educational performance
 
Programme Assurance Questionnaire V1.4
Programme Assurance Questionnaire V1.4Programme Assurance Questionnaire V1.4
Programme Assurance Questionnaire V1.4
 
Xero Advertisment - NZ Accountants Journal November 2007
Xero Advertisment - NZ Accountants Journal  November 2007Xero Advertisment - NZ Accountants Journal  November 2007
Xero Advertisment - NZ Accountants Journal November 2007
 
Are you paying attention
Are you paying attentionAre you paying attention
Are you paying attention
 
Dental amalgam
Dental amalgamDental amalgam
Dental amalgam
 
Expanded Game Experience
Expanded Game ExperienceExpanded Game Experience
Expanded Game Experience
 
VMware vCloud Director and Nexus 1000V / Workload Mobility
VMware vCloud Director and Nexus 1000V / Workload MobilityVMware vCloud Director and Nexus 1000V / Workload Mobility
VMware vCloud Director and Nexus 1000V / Workload Mobility
 
M&A Integration Planning The Integration Of An Acquired Companys Legal De...
M&A Integration Planning The Integration Of An Acquired Companys Legal De...M&A Integration Planning The Integration Of An Acquired Companys Legal De...
M&A Integration Planning The Integration Of An Acquired Companys Legal De...
 

Mehr von Duncan Hull

Why study plants?
Why study plants?Why study plants?
Why study plants?Duncan Hull
 
Embedding employability in the Computer Science curriculum
Embedding employability in the Computer Science curriculumEmbedding employability in the Computer Science curriculum
Embedding employability in the Computer Science curriculumDuncan Hull
 
Wikipedia at the Royal Society: The Good, the Bad and the Ugly
Wikipedia at the Royal Society: The Good, the Bad and the UglyWikipedia at the Royal Society: The Good, the Bad and the Ugly
Wikipedia at the Royal Society: The Good, the Bad and the UglyDuncan Hull
 
Improving the troubled relationship between Scientists and Wikipedia
Improving the troubled relationship between Scientists and Wikipedia Improving the troubled relationship between Scientists and Wikipedia
Improving the troubled relationship between Scientists and Wikipedia Duncan Hull
 
Bibliography 2.0: A citeulike case study from the Wellcome Trust Genome Campus
Bibliography 2.0: A citeulike case study from the Wellcome Trust Genome CampusBibliography 2.0: A citeulike case study from the Wellcome Trust Genome Campus
Bibliography 2.0: A citeulike case study from the Wellcome Trust Genome CampusDuncan Hull
 
Accessing small molecule data using ChEBI
Accessing small molecule data using ChEBIAccessing small molecule data using ChEBI
Accessing small molecule data using ChEBIDuncan Hull
 
OWL-XML-Summer-School-09
OWL-XML-Summer-School-09OWL-XML-Summer-School-09
OWL-XML-Summer-School-09Duncan Hull
 
Authenticating Scientists with OpenID
Authenticating Scientists with OpenIDAuthenticating Scientists with OpenID
Authenticating Scientists with OpenIDDuncan Hull
 
The Invisible Scientist
The Invisible ScientistThe Invisible Scientist
The Invisible ScientistDuncan Hull
 
myExperiment @ Nettab
myExperiment @ NettabmyExperiment @ Nettab
myExperiment @ NettabDuncan Hull
 
The Year of Blogging Dangerously
The Year of Blogging DangerouslyThe Year of Blogging Dangerously
The Year of Blogging DangerouslyDuncan Hull
 
eScience: A Transformed Scientific Method
eScience: A Transformed Scientific MethodeScience: A Transformed Scientific Method
eScience: A Transformed Scientific MethodDuncan Hull
 
Defrosting the Digital Library: A survey of bibliographic tools for the next ...
Defrosting the Digital Library: A survey of bibliographic tools for the next ...Defrosting the Digital Library: A survey of bibliographic tools for the next ...
Defrosting the Digital Library: A survey of bibliographic tools for the next ...Duncan Hull
 
The Future of Research (Science and Technology)
The Future of Research (Science and Technology)The Future of Research (Science and Technology)
The Future of Research (Science and Technology)Duncan Hull
 
Chemical named entity recognition and literature mark-up
Chemical named entity recognition and literature mark-upChemical named entity recognition and literature mark-up
Chemical named entity recognition and literature mark-upDuncan Hull
 
Chemoinformatics and information management
Chemoinformatics and information managementChemoinformatics and information management
Chemoinformatics and information managementDuncan Hull
 
Text mining tools for semantically enriching scientific literature
Text mining tools for semantically enriching scientific literatureText mining tools for semantically enriching scientific literature
Text mining tools for semantically enriching scientific literatureDuncan Hull
 
Adding Meaning To Your Data
Adding Meaning To Your DataAdding Meaning To Your Data
Adding Meaning To Your DataDuncan Hull
 

Mehr von Duncan Hull (20)

Why study plants?
Why study plants?Why study plants?
Why study plants?
 
Embedding employability in the Computer Science curriculum
Embedding employability in the Computer Science curriculumEmbedding employability in the Computer Science curriculum
Embedding employability in the Computer Science curriculum
 
Wikipedia at the Royal Society: The Good, the Bad and the Ugly
Wikipedia at the Royal Society: The Good, the Bad and the UglyWikipedia at the Royal Society: The Good, the Bad and the Ugly
Wikipedia at the Royal Society: The Good, the Bad and the Ugly
 
Improving the troubled relationship between Scientists and Wikipedia
Improving the troubled relationship between Scientists and Wikipedia Improving the troubled relationship between Scientists and Wikipedia
Improving the troubled relationship between Scientists and Wikipedia
 
Bibliography 2.0: A citeulike case study from the Wellcome Trust Genome Campus
Bibliography 2.0: A citeulike case study from the Wellcome Trust Genome CampusBibliography 2.0: A citeulike case study from the Wellcome Trust Genome Campus
Bibliography 2.0: A citeulike case study from the Wellcome Trust Genome Campus
 
OWL and OBO
OWL and OBOOWL and OBO
OWL and OBO
 
Accessing small molecule data using ChEBI
Accessing small molecule data using ChEBIAccessing small molecule data using ChEBI
Accessing small molecule data using ChEBI
 
How to Blog
How to BlogHow to Blog
How to Blog
 
OWL-XML-Summer-School-09
OWL-XML-Summer-School-09OWL-XML-Summer-School-09
OWL-XML-Summer-School-09
 
Authenticating Scientists with OpenID
Authenticating Scientists with OpenIDAuthenticating Scientists with OpenID
Authenticating Scientists with OpenID
 
The Invisible Scientist
The Invisible ScientistThe Invisible Scientist
The Invisible Scientist
 
myExperiment @ Nettab
myExperiment @ NettabmyExperiment @ Nettab
myExperiment @ Nettab
 
The Year of Blogging Dangerously
The Year of Blogging DangerouslyThe Year of Blogging Dangerously
The Year of Blogging Dangerously
 
eScience: A Transformed Scientific Method
eScience: A Transformed Scientific MethodeScience: A Transformed Scientific Method
eScience: A Transformed Scientific Method
 
Defrosting the Digital Library: A survey of bibliographic tools for the next ...
Defrosting the Digital Library: A survey of bibliographic tools for the next ...Defrosting the Digital Library: A survey of bibliographic tools for the next ...
Defrosting the Digital Library: A survey of bibliographic tools for the next ...
 
The Future of Research (Science and Technology)
The Future of Research (Science and Technology)The Future of Research (Science and Technology)
The Future of Research (Science and Technology)
 
Chemical named entity recognition and literature mark-up
Chemical named entity recognition and literature mark-upChemical named entity recognition and literature mark-up
Chemical named entity recognition and literature mark-up
 
Chemoinformatics and information management
Chemoinformatics and information managementChemoinformatics and information management
Chemoinformatics and information management
 
Text mining tools for semantically enriching scientific literature
Text mining tools for semantically enriching scientific literatureText mining tools for semantically enriching scientific literature
Text mining tools for semantically enriching scientific literature
 
Adding Meaning To Your Data
Adding Meaning To Your DataAdding Meaning To Your Data
Adding Meaning To Your Data
 

Kürzlich hochgeladen

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 

Kürzlich hochgeladen (20)

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 

Issues for metabolomics and

  • 1. Issues for metabolomics and systems biology Douglas Kell School of Chemistry, University of Manchester, MANCHESTER M60 1QD, U.K. dbk@manchester.ac.uk http://dbkgroup.org/ http://www.mib.ac.uk www.mcisb.org
  • 2. What I can’t do now and would like to
  • 3. Some facts I ‘know’ (i.e. think I can remember…) • Epidemiologically, statins enhance longevity • Cholesterol is barely a risk factor when within the normal range of 120-240 mg% • Statins supposedly act (only) via HMG- CoA reductase to lower cholesterol • Actually many have (and from the above logically must have) off-target effects
  • 4. More ‘facts’ • Although originating as natural products, many/most statins can bear comparatively little structural relationships to them or to each other • Are there QSAR-type relations between the various off-target effects and the drugs that cause them? lovastatin atorvastatin
  • 5. The software tool I want would integrate all of those questions by: • Finding the facts from the literature (and the Web) by reading the articles ‘intelligently’ • Displaying and setting out the facts sensibly • Allowing the QSARs directly from the papers as the structures and substructures would be ‘known’ (or knowlable via PubChem, DrugBank etc) • Classify/cluster the off-target effects and the papers that described them (via TM and ML) • Without me having to write any actual code
  • 6. Westerhoff & Palsson NBT 22, 1249-52 (2004) But despite everything science is in some ways becoming LESS effective in an applied context
  • 7. Declining numbers of drug launches Leeson & Springthorpe, NRDD 6, 881-890 (2007)
  • 8. Drug Discovery/Development Pipeline • Multifaceted, complicated, lengthy process gyy o llo g o yy co ac g y y et ffe t rrm m a ll llo g a o o Sa ll S a haa c a co P hy niic a c lii n ma ca iic a P y all fett cl a --c arrm inn iic a a fe clli --c c iin S a N e h e n S Prr P h e l HO OH e Cl & OH HO N Prr OH OH O P P O O C & O O N O- P O O N O H Products F OH O O NH2 OH N N NHCH3 N N O HN N N H 2N O C O 2H N N N S F O O 2S N N N N CF3 F Cl Cl Discovery Exploratory Development Exploratory Development Full Development Cl Discovery Full Development O O CH 3O O N H O NH2 Phase II Phase Phase II Phase II Phase III Phase III 0 5 10 15 Idea 12 -15 Years Drug Peter S. Dragovich, Pfizer
  • 9. Attrition Kola & Landis, NRDD 3, 711-5 (2004)
  • 10. Issues of attrition • PK/PD less of an issue in last decade • Now mostly due to (i) lack of efficacy, (ii) toxicity • Both problems are underpinned by the fact that drugs are typically first developed on the basis of molecular assays before being tested in the intact system • These failures turn drug discovery – if it was not already – into a problem of systems biology
  • 11. Nature Rev Drug Disc 7, 205-220 (March 2008)
  • 12. Poor correlation between different artificial membrane (Corti & PAMPA) assays Corti et al EJ Pharm Sci 7, 354-362 (2006)
  • 13. Poor correlation between Caco-2 cells and artificial membrane (PAMPA) assays Note axis scales Balimane et al., AAPS J 8, e1-e13 (2006)
  • 14. Poor relationship between PAMPA permeability and log Ko/w Corti et al. EJ Pharm Sci 7, 354-362 (2006)
  • 15. Poor relationship between Caco-2 permeability and log Ko/w r2 = 0.097 THESE THEORIES OF DRUG UPTAKE WERE BIOPHYSICAL, ‘LIPID-ONLY’ THEORIES Corti et al. EJ Pharm Sci 7, 354-362 (2006)
  • 16. Narcotics (‘general anaesthetics’) • Potency also correlates with log P (up to a cut- off) (Meyer & Overton) • Negligible structure-activity relationships • Was assumed that they also act by a ‘biophysical’ mechanism by partitioning ‘nonspecifically’ into membrane and e.g. ‘squeezing’ nerve channels • This too was a ‘lipid-only’ theory • None of this now stands up
  • 17. Anaesthetic potency does largely correlate with partitioning into membrane, suggesting (to many) a ‘lipid-only’ mechanism P. Seeman, Pharmacol Rev 24, 583-655 (1972)
  • 18. But…narcotics inhibit luciferase, a soluble protein, with the same potency with which they anaesthetise animals, over 5 logs! No lipid involved! Franks & Lieb, Nature 310, 599-601 (1984)
  • 19. The structural basis is known Binding of bromoform to luciferase Franks et al, Biophys J 75, 2205-11 (1998)
  • 20. Halothane affects narcosis in part via a TREK-1 K+ channel Heurteaux et al. EMBO J 23, 2684-95 (2004)
  • 21. How to integrate all this information with biological and physiological networks? • One strategy is Integrative Systems Biology
  • 22. One view of systems biology Experiment Theory Computation/ Technology Modelling
  • 23. Bringing together metabolomics and systems biology models Drug Discovery Today 11, 1085-1092 (2006)
  • 24. There is a convergence between systems biology models from whole-genome reconstruction and the number of experimental metabolome peaks (ca 3000 for human serum)
  • 25. The human metabolic network (1) • 8 cellular compartments • 2,712 compartment-specific metabolites • ~ 1,500 different chemical entities • 1,496 genes • 2,233 metabolic reactions (1,795 unique) • 1,078 transport reactions (32.6%) PNAS 104, 1777-1782 (2007)
  • 26. The human metabolic network (2) • Not yet compartmentalised • 2,823 reactions (incl 300 ‘orphans’), of which 2,215 have disease associations, plus 1189 transport reactions and 457 exchange reactions • 2,322 genes (1069 common with Palsson model) Molecular Systems Biology 3, 135 (2007)
  • 27. Systems biology and modelling are all about representation
  • 28. The main representation for systems biology models is SBML www.sbml.org
  • 29. VISUALISE edit Literature mining create Layouts and views SBGN Store in dB BIOCHEMICAL Overlays, dynamics MODEL (assumed to Model merging: (not) Compare with LEGO blocks be in SBML) other models Cheminformatic THERE analyses ARE MANYRun, analyse THINGS THAT ONEfit to real POSSIBLE Compare with and MIGHT DO WITH THIS REPRESENTATION, AND and (sensitivities, etc) data (parameters variables) with constraints THESE ACTIONS CAN BE SEEN AS MODULES Integrate various levels Store results of manipulations How to deal with fitting, including as f(global LINK WORKFLOWS Network Motif parameters like pH) discovery Soaplab, Taverna, Automatic characterisation Web services, etc. Optimal DoE for of parameter space and Sys Identification, constraint checking incl identifiability
  • 30. BIOCHEMICAL MODEL (assumed to be in SBML) FEBS J 274, 5576-5585 Compare with and fit to real data (parameters and variables) with constraints 4, 74-97
  • 31. The Data Management Infrastructure of the Manchester Centre for Integrated Systems Biology Norman Paton University of Manchester
  • 32. Capabilities • We require software to support: – Data capture: Pedro. – Data access: Pierre. – Integration of data and analyses: Taverna.
  • 34. LITERATURE STORE NEW MINING ANNOTATE MODEL IN DB CREATE VISUALISE MODEL COMPARE WITH ‘REAL’ DATA METABOLIC MODEL IN SBML SCAN RUN BASE MODEL PARAMETER STORE SPACE MODEL IN DB SENSITIVITY ANALYSES COMPARE DIFFERENT MODELS METABOLIC MODEL IN SBML STORE DIFFERENCES AS NEW MODEL IN SYSTEMS BIOLOGY WORKFLOWS DB
  • 35. Scientists Decoupled suppliers & consumers tion ora o llab C dge t wle en n o em K ag M an Science
  • 36. ‘Warehouse’ vs distributed workflows • Different ‘modules’ developed in different labs can reside on different computers anywhere, and expose themselves as Web Services • Labs can then specialise in what they are best at • All that is then needed is an environment for enacting bioinformatic workflows by coupling together these service- oriented architectures • One such is Taverna • This is arguably the best way to combine metabolomic SBML models with metabolomic data, and is what are using at MCISB
  • 37. Overall Architecture Workflow Repository Data Analysis1 Integration Model Using Workflows Repository Analysisn Consistent Web Service Interfaces Repository1 … Repositoryn Experiment1 … Experimentn Consistent Web Interfaces
  • 38. The Taverna API consumer along with libSBML allows many of these transformations to be performed Details: http://www.mcisb.org/software/taverna/libsbml/index.html
  • 39. Relating Models to Expression Read gene names of enzymes from SBML model Query maxd transcriptome database using gene names Create new Compute colour SBMLmodel for expression readings
  • 40. Visualise Models Using Cell Designer JC_C-0.07-1_Measurement JC_N-0.07-1_Measurement
  • 41. Potential Solutions • Semantic annotation • Chemical and bio-text mining • RDF annotations – that can also be included within the SBML • Integrated reasoning engine • Allowing literature-based discovery • But we still lack a proper and useful (bio)chemical ontology integrating roles, pathways, diseases, chemical (sub)structures, targets, etc. • This last is probably the most damaging lack and thus most important need
  • 42. Issues for metabolomics and systems biology Douglas Kell School of Chemistry, University of Manchester, MANCHESTER M60 1QD, U.K. dbk@manchester.ac.uk http://dbkgroup.org/ http://www.mib.ac.uk www.mcisb.org