SlideShare ist ein Scribd-Unternehmen logo
1 von 42
The Tetherless World
                    Constellation


 Representing Microarray
Experiment Metadata Using
   Provenance Models
Jim McCusker and Deborah L. McGuinness
     Tetherless World Constellation,
     Rensselaer Polytechnic Institute
            http://tw.rpi.edu
Observation

• experimental metadata (n):
   The information about an experiment that
   describes it in sufficient detail for someone to
   understand and reproduce the experiment.
• provenance in science (n):
   The information about an experiment that
   describes it in sufficient detail for someone to
   understand and reproduce the experiment.
Experimental
 metadata
     is
provenance.
Examples: Provenance
   Representations




            All of these are
          representations of
             provenance.
Examples: Provenance
          Representations
• Computational
  Workflows




                    All of these are
                  representations of
                     provenance.
Examples: Provenance
          Representations
• Computational
  Workflows
• Experimental
  Workflows




                    All of these are
                  representations of
                     provenance.
Examples: Provenance
          Representations
• Computational
  Workflows
• Experimental
  Workflows
• Biospecimen
  Management


                    All of these are
                  representations of
                     provenance.
Examples: Provenance
          Representations
• Computational
  Workflows
• Experimental
  Workflows
• Biospecimen
  Management
• Data Sources,
  Ownership &
  Attribution       All of these are
                  representations of
                     provenance.
Hypothesis
• If experimental metadata is provenance:
 • Then a good model of provenance can be
   used to encode experimental metadata as
   well as a domain-specific metadata.
 • Then a general model of provenance can
   model the entirety of experimental metadata.
• We try to evaluate this by mapping
  microarray metadata onto provenance
  graphs.
Why a generic
provenance model?
An Integrated View of Provenance
An Integrated View of Provenance
Transformation


Normalization


    Scan


Hybridization

   Labeled                 Cell
                 Extract
   Extract                 Line
An Integrated View of Provenance
Transformation


Normalization


    Scan


Hybridization

   Labeled                 Cell
                 Extract          Section   Biopsy   Participant
   Extract                 Line
An Integrated View of Provenance
Transformation


Normalization

                                                      Clinical
    Scan                                             Diagnosis


Hybridization

   Labeled                 Cell
                 Extract          Section   Biopsy   Participant
   Extract                 Line
An Integrated View of Provenance
Transformation


Normalization

                                                      Clinical
    Scan                                             Diagnosis


Hybridization                                         DOB


   Labeled                 Cell
                 Extract          Section   Biopsy   Participant
   Extract                 Line
An Integrated View of Provenance
Transformation


Normalization

                                  Histology             Clinical
    Scan                           Image               Diagnosis


Hybridization                      Slide                DOB


   Labeled                 Cell
                 Extract          Section     Biopsy   Participant
   Extract                 Line
An Integrated View of Provenance
Transformation
                                               Lab
                                                Lab
                                                 Lab
                                                  Lab
                                              Results
                                              Results
Normalization                                  Results
                                                Results

                                  Histology                Clinical
    Scan                           Image                  Diagnosis


                                                Blood
Hybridization                      Slide                   DOB
                                               Sample

   Labeled                 Cell
                 Extract          Section      Biopsy     Participant
   Extract                 Line
An Integrated View of Provenance
Transformation
                                               Lab
                                                Lab          Adverse
                                                             Adverse
                                                 Lab
                                                  Lab         Adverse
                                                               Adverse
                                              Results
                                              Results        Events
                                                              Events
Normalization                                  Results
                                                Results        Events
                                                               Events

                                  Histology                Clinical
    Scan                           Image                  Diagnosis


                                                Blood
Hybridization                      Slide                   DOB
                                               Sample

   Labeled                 Cell
                 Extract          Section      Biopsy     Participant
   Extract                 Line
An Integrated View of Provenance
Transformation
                                                Lab
                                                 Lab          Adverse
                                                              Adverse
                                                  Lab
                                                   Lab         Adverse
                                                                Adverse
                                               Results
                                               Results        Events
                                                               Events
Normalization                                   Results
                                                 Results        Events
                                                                Events

                                   Histology                Clinical
    Scan                            Image                  Diagnosis


                                                 Blood
Hybridization                       Slide                   DOB
                                                Sample

   Labeled                 Cell
                 Extract           Section      Biopsy     Participant
   Extract                 Line
                                  Biopsy Event Date
Related Work, ABC Soup
•MAGE (MicroArray and Gene Expression
 Object Model
•MAGE-TAB: Tab-delimited spreadsheet of
 MAGE.
  •IDF (Investigation Design Format)
  •SDRF (Sample and Data Relationship Format)
•MGED (Microarray Gene Expression
  Data)Ontology and MGED Society
•OPM (Open Provenance Model)
•PML (Proof Markup Language)
General Models of Provenance:
  Open Provenance Model
General Models of Provenance:
   Proof Markup Language
Methods Overview
MAGE-TAB      MAGE-TAB
  IDF           SDRF
Methods Overview
MAGE-TAB              MAGE-TAB
  IDF                   SDRF

    Used(IDF)   Used(SDRF)

MAGE-TAB
   to
MAGE-RDF
    WasGeneratedBy(RDF)



MAGE-RDF
Methods Overview
MAGE-TAB              MAGE-TAB
  IDF                   SDRF

    Used(IDF)   Used(SDRF)

MAGE-TAB
   to            mageprov.rules
MAGE-RDF
                                           mageprovenance.owl
                             Used(Rules)
    WasGeneratedBy(RDF)


                             Jena Rule      Used(OWL)
MAGE-RDF         Used(RDF)    Engine
Methods Overview
MAGE-TAB              MAGE-TAB
  IDF                   SDRF
                                            PML & OPM
    Used(IDF)   Used(SDRF)
                                            WasGeneratedBy(RDF)
MAGE-TAB
   to            mageprov.rules
MAGE-RDF
                                           mageprovenance.owl
                             Used(Rules)
    WasGeneratedBy(RDF)


                             Jena Rule      Used(OWL)
MAGE-RDF         Used(RDF)    Engine
MAGE-TAB 2 MAGE-RDF:
    an RDF Version of MAGE
• Used Limpopo to parse MAGE-TAB
• Big thanks to HCLS SIG for input
• (Small) issues with MGED Ontology:
  • Missing ProtocolApplication, some key
    properties.
  • Cannot perform inferencing without ignoring
    owl:someValuesFrom.
     • Contains 234 owl:Classes with
       193 owl:someValuesFrom restrictions.
     • Should be using integrity constraint instead.
• http://magetab2rdf.googlecode.com
Methods: MAGE-RDF to Open
        Provenance Model
• 16 rules in Jena     Example Rule:
  Inference Language   [opm.generated.sample:
                          (?exp rdf:type mged:Experiment),

• 15 subClass             (?exp mage:has_protocol_application ?pa),
                          makeTemp(?gen),

  mappings                (?pa mged:has_protocol ?protocol),
                          (?pa mage:has_derivative ?sample)
                       ->
• 10 individuals          (?gen rdf:type opm:WasGeneratedBy),
                          (?gen opm:cause ?protocol),
  (Roles)                 (?gen opm:account ?exp),
                          (?gen opm:effect ?sample),
                          (?sample rdf:type opm:Artifact)
• 2 subProperty        ]

  mappings
Methods: MAGE-RDF to Proof
        Markup Language
• 4 rules in Jena   Example Rule:
  Inference         [pml.protocolApplication:
                      (?pa rdf:type mage:ProtocolApplication),

  Language            (?pa mage:has_derivation_source ?derivation),
                      (?pa mged:has_protocol ?protocol),
                      (?pa mage:has_derivative ?derived),

• 14 subClass
                      (?pa mged:has_protocol ?protocol),
                      (?derivedNodeSet pmlj:hasConclusion ?derived),
                      (?derivationNodeSet pmlj:hasConclusion ?derivation),

  mappings          ->
                      makeTemp(?antecedentList)

                      (?pa rdf:type pmlj:InferenceStep),

• 2 subProperty       (?pa pmlj:hasInferenceRule ?protocol),
                      (?pa pmlj:hasIndex 1),

  mappings
                      (?derivedNodeSet pmlj:isConsequentOf ?pa),
                      (?pa pmlj:hasAntecedentList ?antecedentList),
                      (?antecedentList rdf:type pmlj:NodeSetList),
                      (?antecedentList ds:first ?derivationNodeSet)
                    ]
Evaluation: Declarative Mapping

Pros:                     Cons:
• Easy to understand.     • “Unclean”
• Easy to validate.         representations.
• Little programming      • More difficult to
  involved.                 debug.
• Many languages.         • Many languages.
• Easy integration with   • Logic programming
  existing ontologies.      not everyone's
                            cup of tea.
• Powerful rule
  languages available.
Evaluation:
Expressiveness & Coverage
Evaluation:
    Expressiveness & Coverage
Missing in OPM:
• See below.
Evaluation:
    Expressiveness & Coverage
Missing in OPM:   Missing in PML:
• See below.      • Parameters
                  • Parameter Values
                    (could have been
                    variable mappings)
Evaluation:
    Expressiveness & Coverage
Missing in OPM:    Missing in PML:
• See below.       • Parameters
                   • Parameter Values
Could have added     (could have been
to OPM and PML:      variable mappings)
• Publications
• Time
• Comments
Evaluation:
     Expressiveness & Coverage
Missing in OPM:        Missing in PML:
• See below.           • Parameters
                       • Parameter Values
Could have added         (could have been
to OPM and PML:          variable mappings)
• Publications       MAGE comments are
• Time           structured name-value pairs.
• Comments          These would have to be
                     represented as PML
                Information and OPM Artifacts.
Evaluation:
PML Visualization
Evaluation:
PML Visualization
Evaluation:
OPM Visualization
Evaluation:
OPM Visualization
Future Work

• Integration with:
 • Tissue management tools
 • Clinical data sources
 • Workflow engines (Taverna already
   exports OPM)
• Use case-based evaluation of
  sufficiency.
Conclusions
• Experimental metadata is provenance.
• All experimental provenance can benefit
  from a common model.
• MAGE-provenance means 9,975 assays
  ready for conversion to a common model.
• OPM provides slightly better modeling
  coverage and workflow integration.
• PML provides better reasoning coverage
  and theorem prover integration.
Acknowledgements
               and References
• Tetherless World (Deborah McGuinness et al.)
• Krauthammer Lab (Michael Krauthammer et al.)
• W3C Health Care and Life Sciences SIG.
• LIMPOPO: limpopo.sourceforge.net
• MAGE-TAB: magetab.sourceforge.net
• MGED Society: www.mged.org
• PML: www.inference-web.org
• OPM: www.openprovenance.org
• Jiao Tao, Li Ding, Deborah L. McGuinness, Instance Data
  Evaluation for Semantic Web-Based Knowledge Management
  Systems, 42th Hawaii International Conference on System
  Sciences (HICSS-42), January 2009.

Weitere ähnliche Inhalte

Was ist angesagt?

Target selection lead discovery medicinal drug discovery process style design...
Target selection lead discovery medicinal drug discovery process style design...Target selection lead discovery medicinal drug discovery process style design...
Target selection lead discovery medicinal drug discovery process style design...SlideTeam.net
 
Target selection lead discovery medicinal drug discovery strategy design 5 po...
Target selection lead discovery medicinal drug discovery strategy design 5 po...Target selection lead discovery medicinal drug discovery strategy design 5 po...
Target selection lead discovery medicinal drug discovery strategy design 5 po...SlideTeam.net
 
Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...SlideTeam.net
 
Target selection lead discovery medicinal drug discovery process style design...
Target selection lead discovery medicinal drug discovery process style design...Target selection lead discovery medicinal drug discovery process style design...
Target selection lead discovery medicinal drug discovery process style design...SlideTeam.net
 
Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...SlideTeam.net
 
Target selection lead discovery medicinal drug discovery process style design...
Target selection lead discovery medicinal drug discovery process style design...Target selection lead discovery medicinal drug discovery process style design...
Target selection lead discovery medicinal drug discovery process style design...SlideTeam.net
 
Target selection lead discovery medicinal drug discovery strategy design 5 po...
Target selection lead discovery medicinal drug discovery strategy design 5 po...Target selection lead discovery medicinal drug discovery strategy design 5 po...
Target selection lead discovery medicinal drug discovery strategy design 5 po...SlideTeam.net
 
Target selection lead discovery medicinal drug discovery strategy style desig...
Target selection lead discovery medicinal drug discovery strategy style desig...Target selection lead discovery medicinal drug discovery strategy style desig...
Target selection lead discovery medicinal drug discovery strategy style desig...SlideTeam.net
 
Target selection lead discovery medicinal drug discovery strategy style desig...
Target selection lead discovery medicinal drug discovery strategy style desig...Target selection lead discovery medicinal drug discovery strategy style desig...
Target selection lead discovery medicinal drug discovery strategy style desig...SlideTeam.net
 
Target selection lead discovery medicinal drug discovery strategy style desig...
Target selection lead discovery medicinal drug discovery strategy style desig...Target selection lead discovery medicinal drug discovery strategy style desig...
Target selection lead discovery medicinal drug discovery strategy style desig...SlideTeam.net
 
Clinicalgenet.slideshare
Clinicalgenet.slideshareClinicalgenet.slideshare
Clinicalgenet.slideshareMasakazu KOHDA
 
NIST program to develop genomic reference materials
NIST program to develop genomic reference materialsNIST program to develop genomic reference materials
NIST program to develop genomic reference materialsGenomeInABottle
 
How to make create genomic research target molecules drug discovery strategy ...
How to make create genomic research target molecules drug discovery strategy ...How to make create genomic research target molecules drug discovery strategy ...
How to make create genomic research target molecules drug discovery strategy ...SlideTeam.net
 
How to make create genomic research target molecules drug discovery strategy ...
How to make create genomic research target molecules drug discovery strategy ...How to make create genomic research target molecules drug discovery strategy ...
How to make create genomic research target molecules drug discovery strategy ...SlideTeam.net
 

Was ist angesagt? (14)

Target selection lead discovery medicinal drug discovery process style design...
Target selection lead discovery medicinal drug discovery process style design...Target selection lead discovery medicinal drug discovery process style design...
Target selection lead discovery medicinal drug discovery process style design...
 
Target selection lead discovery medicinal drug discovery strategy design 5 po...
Target selection lead discovery medicinal drug discovery strategy design 5 po...Target selection lead discovery medicinal drug discovery strategy design 5 po...
Target selection lead discovery medicinal drug discovery strategy design 5 po...
 
Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...
 
Target selection lead discovery medicinal drug discovery process style design...
Target selection lead discovery medicinal drug discovery process style design...Target selection lead discovery medicinal drug discovery process style design...
Target selection lead discovery medicinal drug discovery process style design...
 
Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...
 
Target selection lead discovery medicinal drug discovery process style design...
Target selection lead discovery medicinal drug discovery process style design...Target selection lead discovery medicinal drug discovery process style design...
Target selection lead discovery medicinal drug discovery process style design...
 
Target selection lead discovery medicinal drug discovery strategy design 5 po...
Target selection lead discovery medicinal drug discovery strategy design 5 po...Target selection lead discovery medicinal drug discovery strategy design 5 po...
Target selection lead discovery medicinal drug discovery strategy design 5 po...
 
Target selection lead discovery medicinal drug discovery strategy style desig...
Target selection lead discovery medicinal drug discovery strategy style desig...Target selection lead discovery medicinal drug discovery strategy style desig...
Target selection lead discovery medicinal drug discovery strategy style desig...
 
Target selection lead discovery medicinal drug discovery strategy style desig...
Target selection lead discovery medicinal drug discovery strategy style desig...Target selection lead discovery medicinal drug discovery strategy style desig...
Target selection lead discovery medicinal drug discovery strategy style desig...
 
Target selection lead discovery medicinal drug discovery strategy style desig...
Target selection lead discovery medicinal drug discovery strategy style desig...Target selection lead discovery medicinal drug discovery strategy style desig...
Target selection lead discovery medicinal drug discovery strategy style desig...
 
Clinicalgenet.slideshare
Clinicalgenet.slideshareClinicalgenet.slideshare
Clinicalgenet.slideshare
 
NIST program to develop genomic reference materials
NIST program to develop genomic reference materialsNIST program to develop genomic reference materials
NIST program to develop genomic reference materials
 
How to make create genomic research target molecules drug discovery strategy ...
How to make create genomic research target molecules drug discovery strategy ...How to make create genomic research target molecules drug discovery strategy ...
How to make create genomic research target molecules drug discovery strategy ...
 
How to make create genomic research target molecules drug discovery strategy ...
How to make create genomic research target molecules drug discovery strategy ...How to make create genomic research target molecules drug discovery strategy ...
How to make create genomic research target molecules drug discovery strategy ...
 

Andere mochten auch

Computational approaches to cell cycle analysis: Cell-cycle microarray analysis
Computational approaches to cell cycle analysis: Cell-cycle microarray analysisComputational approaches to cell cycle analysis: Cell-cycle microarray analysis
Computational approaches to cell cycle analysis: Cell-cycle microarray analysisLars Juhl Jensen
 
20100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_020100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_0Computer Science Club
 
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...VHIR Vall d’Hebron Institut de Recerca
 
Microarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome studyMicroarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome studyBia Khan
 
DNA microarray
DNA microarrayDNA microarray
DNA microarrayS Rasouli
 
Dna microarray (dna chips)
Dna microarray (dna chips)Dna microarray (dna chips)
Dna microarray (dna chips)Rachana Tiwari
 
Masters Thesis Defense Presentation
Masters Thesis Defense PresentationMasters Thesis Defense Presentation
Masters Thesis Defense Presentationprajon
 
Microarray Statistics
Microarray StatisticsMicroarray Statistics
Microarray StatisticsA Roy
 
Powerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis DefencePowerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis DefenceCatie Chase
 
Dissertation oral defense presentation
Dissertation   oral defense presentationDissertation   oral defense presentation
Dissertation oral defense presentationDr. Naomi Mangatu
 

Andere mochten auch (11)

Computational approaches to cell cycle analysis: Cell-cycle microarray analysis
Computational approaches to cell cycle analysis: Cell-cycle microarray analysisComputational approaches to cell cycle analysis: Cell-cycle microarray analysis
Computational approaches to cell cycle analysis: Cell-cycle microarray analysis
 
20100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_020100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_0
 
Microarray
MicroarrayMicroarray
Microarray
 
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
 
Microarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome studyMicroarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome study
 
DNA microarray
DNA microarrayDNA microarray
DNA microarray
 
Dna microarray (dna chips)
Dna microarray (dna chips)Dna microarray (dna chips)
Dna microarray (dna chips)
 
Masters Thesis Defense Presentation
Masters Thesis Defense PresentationMasters Thesis Defense Presentation
Masters Thesis Defense Presentation
 
Microarray Statistics
Microarray StatisticsMicroarray Statistics
Microarray Statistics
 
Powerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis DefencePowerpoint presentation M.A. Thesis Defence
Powerpoint presentation M.A. Thesis Defence
 
Dissertation oral defense presentation
Dissertation   oral defense presentationDissertation   oral defense presentation
Dissertation oral defense presentation
 

Ähnlich wie Representing Microarray Experiment Metadata Using Provenance Models

IntOGen, Integrative Oncogenomics for Personal Cancer Genomes
IntOGen, Integrative Oncogenomics for Personal Cancer GenomesIntOGen, Integrative Oncogenomics for Personal Cancer Genomes
IntOGen, Integrative Oncogenomics for Personal Cancer Genomeschristian.perez
 
Drug discovery process style 5 powerpoint presentation slides db ppt templates
Drug discovery process style 5 powerpoint presentation slides db ppt templatesDrug discovery process style 5 powerpoint presentation slides db ppt templates
Drug discovery process style 5 powerpoint presentation slides db ppt templatesSlideTeam.net
 
Supporting the virtual physiological human with semantics and services e scie...
Supporting the virtual physiological human with semantics and services e scie...Supporting the virtual physiological human with semantics and services e scie...
Supporting the virtual physiological human with semantics and services e scie...Carlos Pedrinaci
 
How to make create target selection lead discovery medicinal drug discovery p...
How to make create target selection lead discovery medicinal drug discovery p...How to make create target selection lead discovery medicinal drug discovery p...
How to make create target selection lead discovery medicinal drug discovery p...SlideTeam.net
 
Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...SlideTeam.net
 
Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...SlideTeam.net
 
Drug discovery process style 5 powerpoint presentation templates
Drug discovery process style 5 powerpoint presentation templatesDrug discovery process style 5 powerpoint presentation templates
Drug discovery process style 5 powerpoint presentation templatesSlideTeam.net
 
High throughput screening (HTS) at iNovacia
High throughput screening (HTS) at iNovaciaHigh throughput screening (HTS) at iNovacia
High throughput screening (HTS) at iNovaciaiNovacia AB
 
Drug discovery process style 3 powerpoint presentation templates
Drug discovery process style 3 powerpoint presentation templatesDrug discovery process style 3 powerpoint presentation templates
Drug discovery process style 3 powerpoint presentation templatesSlideTeam.net
 
Legionella Laboratory Testing | Biosan Laboratories
Legionella Laboratory Testing | Biosan LaboratoriesLegionella Laboratory Testing | Biosan Laboratories
Legionella Laboratory Testing | Biosan LaboratoriesAnthony Lewis
 
Biological variation update_ed
Biological variation update_edBiological variation update_ed
Biological variation update_edmarufkhan056
 
Fragment based screening at inovacia
Fragment based screening at inovaciaFragment based screening at inovacia
Fragment based screening at inovaciaiNovacia AB
 
Stephen Friend Food & Drug Administration 2011-07-18
Stephen Friend Food & Drug Administration 2011-07-18Stephen Friend Food & Drug Administration 2011-07-18
Stephen Friend Food & Drug Administration 2011-07-18Sage Base
 
Validation virus removal 092600 kw
Validation virus removal 092600 kwValidation virus removal 092600 kw
Validation virus removal 092600 kwMMLOKA
 
Pv H Europroteome BBMRI
Pv H Europroteome BBMRIPv H Europroteome BBMRI
Pv H Europroteome BBMRIPvonHoegen
 
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...Eastern Pennsylvania Branch ASM
 
Dancey Clinical Trials Vancouver Dancey 20110302 Final.Ppt [Compatibility Mode]
Dancey Clinical Trials Vancouver Dancey 20110302 Final.Ppt [Compatibility Mode]Dancey Clinical Trials Vancouver Dancey 20110302 Final.Ppt [Compatibility Mode]
Dancey Clinical Trials Vancouver Dancey 20110302 Final.Ppt [Compatibility Mode]Warren Hamilton
 

Ähnlich wie Representing Microarray Experiment Metadata Using Provenance Models (20)

IntOGen, Integrative Oncogenomics for Personal Cancer Genomes
IntOGen, Integrative Oncogenomics for Personal Cancer GenomesIntOGen, Integrative Oncogenomics for Personal Cancer Genomes
IntOGen, Integrative Oncogenomics for Personal Cancer Genomes
 
Drug discovery process style 5 powerpoint presentation slides db ppt templates
Drug discovery process style 5 powerpoint presentation slides db ppt templatesDrug discovery process style 5 powerpoint presentation slides db ppt templates
Drug discovery process style 5 powerpoint presentation slides db ppt templates
 
Supporting the virtual physiological human with semantics and services e scie...
Supporting the virtual physiological human with semantics and services e scie...Supporting the virtual physiological human with semantics and services e scie...
Supporting the virtual physiological human with semantics and services e scie...
 
How to make create target selection lead discovery medicinal drug discovery p...
How to make create target selection lead discovery medicinal drug discovery p...How to make create target selection lead discovery medicinal drug discovery p...
How to make create target selection lead discovery medicinal drug discovery p...
 
Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...
 
Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...Target selection lead discovery medicinal drug discovery process design 5 pow...
Target selection lead discovery medicinal drug discovery process design 5 pow...
 
Drug discovery process style 5 powerpoint presentation templates
Drug discovery process style 5 powerpoint presentation templatesDrug discovery process style 5 powerpoint presentation templates
Drug discovery process style 5 powerpoint presentation templates
 
15 arrays
15 arrays15 arrays
15 arrays
 
High throughput screening (HTS) at iNovacia
High throughput screening (HTS) at iNovaciaHigh throughput screening (HTS) at iNovacia
High throughput screening (HTS) at iNovacia
 
Drug discovery process style 3 powerpoint presentation templates
Drug discovery process style 3 powerpoint presentation templatesDrug discovery process style 3 powerpoint presentation templates
Drug discovery process style 3 powerpoint presentation templates
 
Legionella Laboratory Testing | Biosan Laboratories
Legionella Laboratory Testing | Biosan LaboratoriesLegionella Laboratory Testing | Biosan Laboratories
Legionella Laboratory Testing | Biosan Laboratories
 
Biological variation update_ed
Biological variation update_edBiological variation update_ed
Biological variation update_ed
 
Fragment based screening at inovacia
Fragment based screening at inovaciaFragment based screening at inovacia
Fragment based screening at inovacia
 
Stephen Friend Food & Drug Administration 2011-07-18
Stephen Friend Food & Drug Administration 2011-07-18Stephen Friend Food & Drug Administration 2011-07-18
Stephen Friend Food & Drug Administration 2011-07-18
 
Validation virus removal 092600 kw
Validation virus removal 092600 kwValidation virus removal 092600 kw
Validation virus removal 092600 kw
 
Pv H Europroteome BBMRI
Pv H Europroteome BBMRIPv H Europroteome BBMRI
Pv H Europroteome BBMRI
 
Setting Biological Process Specifications
Setting Biological Process SpecificationsSetting Biological Process Specifications
Setting Biological Process Specifications
 
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
 
Novio gendix - jack schalken 25032010
Novio gendix - jack schalken 25032010Novio gendix - jack schalken 25032010
Novio gendix - jack schalken 25032010
 
Dancey Clinical Trials Vancouver Dancey 20110302 Final.Ppt [Compatibility Mode]
Dancey Clinical Trials Vancouver Dancey 20110302 Final.Ppt [Compatibility Mode]Dancey Clinical Trials Vancouver Dancey 20110302 Final.Ppt [Compatibility Mode]
Dancey Clinical Trials Vancouver Dancey 20110302 Final.Ppt [Compatibility Mode]
 

Kürzlich hochgeladen

Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 

Kürzlich hochgeladen (20)

Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 

Representing Microarray Experiment Metadata Using Provenance Models

  • 1. The Tetherless World Constellation Representing Microarray Experiment Metadata Using Provenance Models Jim McCusker and Deborah L. McGuinness Tetherless World Constellation, Rensselaer Polytechnic Institute http://tw.rpi.edu
  • 2. Observation • experimental metadata (n): The information about an experiment that describes it in sufficient detail for someone to understand and reproduce the experiment. • provenance in science (n): The information about an experiment that describes it in sufficient detail for someone to understand and reproduce the experiment.
  • 3. Experimental metadata is provenance.
  • 4. Examples: Provenance Representations All of these are representations of provenance.
  • 5. Examples: Provenance Representations • Computational Workflows All of these are representations of provenance.
  • 6. Examples: Provenance Representations • Computational Workflows • Experimental Workflows All of these are representations of provenance.
  • 7. Examples: Provenance Representations • Computational Workflows • Experimental Workflows • Biospecimen Management All of these are representations of provenance.
  • 8. Examples: Provenance Representations • Computational Workflows • Experimental Workflows • Biospecimen Management • Data Sources, Ownership & Attribution All of these are representations of provenance.
  • 9. Hypothesis • If experimental metadata is provenance: • Then a good model of provenance can be used to encode experimental metadata as well as a domain-specific metadata. • Then a general model of provenance can model the entirety of experimental metadata. • We try to evaluate this by mapping microarray metadata onto provenance graphs.
  • 11. An Integrated View of Provenance
  • 12. An Integrated View of Provenance Transformation Normalization Scan Hybridization Labeled Cell Extract Extract Line
  • 13. An Integrated View of Provenance Transformation Normalization Scan Hybridization Labeled Cell Extract Section Biopsy Participant Extract Line
  • 14. An Integrated View of Provenance Transformation Normalization Clinical Scan Diagnosis Hybridization Labeled Cell Extract Section Biopsy Participant Extract Line
  • 15. An Integrated View of Provenance Transformation Normalization Clinical Scan Diagnosis Hybridization DOB Labeled Cell Extract Section Biopsy Participant Extract Line
  • 16. An Integrated View of Provenance Transformation Normalization Histology Clinical Scan Image Diagnosis Hybridization Slide DOB Labeled Cell Extract Section Biopsy Participant Extract Line
  • 17. An Integrated View of Provenance Transformation Lab Lab Lab Lab Results Results Normalization Results Results Histology Clinical Scan Image Diagnosis Blood Hybridization Slide DOB Sample Labeled Cell Extract Section Biopsy Participant Extract Line
  • 18. An Integrated View of Provenance Transformation Lab Lab Adverse Adverse Lab Lab Adverse Adverse Results Results Events Events Normalization Results Results Events Events Histology Clinical Scan Image Diagnosis Blood Hybridization Slide DOB Sample Labeled Cell Extract Section Biopsy Participant Extract Line
  • 19. An Integrated View of Provenance Transformation Lab Lab Adverse Adverse Lab Lab Adverse Adverse Results Results Events Events Normalization Results Results Events Events Histology Clinical Scan Image Diagnosis Blood Hybridization Slide DOB Sample Labeled Cell Extract Section Biopsy Participant Extract Line Biopsy Event Date
  • 20. Related Work, ABC Soup •MAGE (MicroArray and Gene Expression Object Model •MAGE-TAB: Tab-delimited spreadsheet of MAGE. •IDF (Investigation Design Format) •SDRF (Sample and Data Relationship Format) •MGED (Microarray Gene Expression Data)Ontology and MGED Society •OPM (Open Provenance Model) •PML (Proof Markup Language)
  • 21. General Models of Provenance: Open Provenance Model
  • 22. General Models of Provenance: Proof Markup Language
  • 23. Methods Overview MAGE-TAB MAGE-TAB IDF SDRF
  • 24. Methods Overview MAGE-TAB MAGE-TAB IDF SDRF Used(IDF) Used(SDRF) MAGE-TAB to MAGE-RDF WasGeneratedBy(RDF) MAGE-RDF
  • 25. Methods Overview MAGE-TAB MAGE-TAB IDF SDRF Used(IDF) Used(SDRF) MAGE-TAB to mageprov.rules MAGE-RDF mageprovenance.owl Used(Rules) WasGeneratedBy(RDF) Jena Rule Used(OWL) MAGE-RDF Used(RDF) Engine
  • 26. Methods Overview MAGE-TAB MAGE-TAB IDF SDRF PML & OPM Used(IDF) Used(SDRF) WasGeneratedBy(RDF) MAGE-TAB to mageprov.rules MAGE-RDF mageprovenance.owl Used(Rules) WasGeneratedBy(RDF) Jena Rule Used(OWL) MAGE-RDF Used(RDF) Engine
  • 27. MAGE-TAB 2 MAGE-RDF: an RDF Version of MAGE • Used Limpopo to parse MAGE-TAB • Big thanks to HCLS SIG for input • (Small) issues with MGED Ontology: • Missing ProtocolApplication, some key properties. • Cannot perform inferencing without ignoring owl:someValuesFrom. • Contains 234 owl:Classes with 193 owl:someValuesFrom restrictions. • Should be using integrity constraint instead. • http://magetab2rdf.googlecode.com
  • 28. Methods: MAGE-RDF to Open Provenance Model • 16 rules in Jena Example Rule: Inference Language [opm.generated.sample: (?exp rdf:type mged:Experiment), • 15 subClass (?exp mage:has_protocol_application ?pa), makeTemp(?gen), mappings (?pa mged:has_protocol ?protocol), (?pa mage:has_derivative ?sample) -> • 10 individuals (?gen rdf:type opm:WasGeneratedBy), (?gen opm:cause ?protocol), (Roles) (?gen opm:account ?exp), (?gen opm:effect ?sample), (?sample rdf:type opm:Artifact) • 2 subProperty ] mappings
  • 29. Methods: MAGE-RDF to Proof Markup Language • 4 rules in Jena Example Rule: Inference [pml.protocolApplication: (?pa rdf:type mage:ProtocolApplication), Language (?pa mage:has_derivation_source ?derivation), (?pa mged:has_protocol ?protocol), (?pa mage:has_derivative ?derived), • 14 subClass (?pa mged:has_protocol ?protocol), (?derivedNodeSet pmlj:hasConclusion ?derived), (?derivationNodeSet pmlj:hasConclusion ?derivation), mappings -> makeTemp(?antecedentList) (?pa rdf:type pmlj:InferenceStep), • 2 subProperty (?pa pmlj:hasInferenceRule ?protocol), (?pa pmlj:hasIndex 1), mappings (?derivedNodeSet pmlj:isConsequentOf ?pa), (?pa pmlj:hasAntecedentList ?antecedentList), (?antecedentList rdf:type pmlj:NodeSetList), (?antecedentList ds:first ?derivationNodeSet) ]
  • 30. Evaluation: Declarative Mapping Pros: Cons: • Easy to understand. • “Unclean” • Easy to validate. representations. • Little programming • More difficult to involved. debug. • Many languages. • Many languages. • Easy integration with • Logic programming existing ontologies. not everyone's cup of tea. • Powerful rule languages available.
  • 32. Evaluation: Expressiveness & Coverage Missing in OPM: • See below.
  • 33. Evaluation: Expressiveness & Coverage Missing in OPM: Missing in PML: • See below. • Parameters • Parameter Values (could have been variable mappings)
  • 34. Evaluation: Expressiveness & Coverage Missing in OPM: Missing in PML: • See below. • Parameters • Parameter Values Could have added (could have been to OPM and PML: variable mappings) • Publications • Time • Comments
  • 35. Evaluation: Expressiveness & Coverage Missing in OPM: Missing in PML: • See below. • Parameters • Parameter Values Could have added (could have been to OPM and PML: variable mappings) • Publications MAGE comments are • Time structured name-value pairs. • Comments These would have to be represented as PML Information and OPM Artifacts.
  • 40. Future Work • Integration with: • Tissue management tools • Clinical data sources • Workflow engines (Taverna already exports OPM) • Use case-based evaluation of sufficiency.
  • 41. Conclusions • Experimental metadata is provenance. • All experimental provenance can benefit from a common model. • MAGE-provenance means 9,975 assays ready for conversion to a common model. • OPM provides slightly better modeling coverage and workflow integration. • PML provides better reasoning coverage and theorem prover integration.
  • 42. Acknowledgements and References • Tetherless World (Deborah McGuinness et al.) • Krauthammer Lab (Michael Krauthammer et al.) • W3C Health Care and Life Sciences SIG. • LIMPOPO: limpopo.sourceforge.net • MAGE-TAB: magetab.sourceforge.net • MGED Society: www.mged.org • PML: www.inference-web.org • OPM: www.openprovenance.org • Jiao Tao, Li Ding, Deborah L. McGuinness, Instance Data Evaluation for Semantic Web-Based Knowledge Management Systems, 42th Hawaii International Conference on System Sciences (HICSS-42), January 2009.

Hinweis der Redaktion