SlideShare ist ein Scribd-Unternehmen logo
Web Science 2.0

Conducting in silico research in the Web
    from hypothesis to publication
                      Mark Wilkinson




  Isaac Peral Senior Researcher in Biological Informatics
  Centro de Biotecnología y Genómica de Plantas, UPM, Madrid, Spain
  Adjunct Professor of Medical Genetics, University of British Columbia
  Vancouver, BC, Canada.
Context


Multiple recent surveys of high-throughput biology

reveal that upwards of 50% of published studies

              are not reproducible


                                     - Baggerly, 2009
                                     - Ioannidis, 2009
Context


“the most common errors are simple,

the most simple errors are common”




                         - Baggerly, 2009
Context


        These errors pass peer review

     The researcher is unaware of the error

 The process that led to the error is not recorded

Therefore it cannot be detected during peer-review
Context


Discovery of such errors have resulted in retractions

         and even shut-down clinical trials
Context


In March, 2012, the US Institute of Medicine said

              “Enough is enough!”
Context
   Institute of Medicine Recommendations
  For Conduct of High-Throughput Research:


1. Rigorously-described, -annotated, and -followed data
   management procedures

2. “Lock down” the computational analysis pipeline once it
   has been selected

3. Publish the workflow in a formal manner, together with the
   full starting and result datasets




                              Evolution of Translational Omics Lessons Learned and the Path Forward. The
                              Institute of Medicine of the National Academies, Report Brief, March 2012.
Achieving these recommendations

requires integration of existing technologies

        and invention of new ones
Context
“While it took 2,300 years after the first report of angina for the condition to be commonly taught in medical
            curricula, modern discoveries are being disseminated at an increasingly rapid pace.”




                                                                                                   The Healthcare
                                                                                                   Singularity and the
                                                                                                   Age of Semantic
                                                                                                   Medicine, Michael
                                                                                                   Gillam, et al, The
                                                                                                   Fourth Paradigm:
                                                                                                   Data-Intensive
                                                                                                   Scientific Discovery
                                                                                                   Tony Hey (Editor),
                                                                                                   2009

                                                                                                   Slide adapted with
                                                                                                   permission from
                                                                                                   Joanne Luciano,
                                                                                                   Presentation at
                                                                                                   Health Web
                                                                                                   Science Workshop
                                                                                                   2012, Evanston IL,
                                                                                                   USA
                                                                                                   June 22, 2012.
“The Singularity”




   The X-intercept is where, the moment a discovery is made,
                it is immediately put into practice

     (not only medical practice, but any research endeavour...)

The Healthcare Singularity and the Age of Semantic Medicine, Michael Gillam, et al, The Fourth Paradigm: Data-Intensive Scientific Discovery Tony Hey (Editor), 2009
Slide Borrowed with Permission from Joanne Luciano, Presentation at Health Web Science Workshop 2012, Evanston IL, USA
June 22, 2012.
The technology required to achieve this
          does not yet exist
You
                                      Are
                                      Here




Scientific research would have to be conducted
              within a medium that
 immediately interpreted and disseminated
                  the results...
You
                                           Are
                                           Here




...in a form that immediately (actively!) affected the
                  research of others...
You
                                  Are
                                  Here




...without requiring them to be aware
       of these new discoveries.
I‟d like to show you how close
   we now are to this vision




   and how we got there
Web Science 2.0
We wanted to duplicate
a real, peer-reviewed, bioinformatics analysis


   simply by building a model in the Web
       describing what the answer
                  (if one existed)
               would look like
...the machine had to make
     every other decision
         on it‟s own
Brief Digression




 “in” the Web??
How we use
The Web today
By clicking here you cause this incredibly
powerful computational tool called The Web
 to retrieve a chunk of text and images that
    can only be understood by a human...
The Web is not a pigeon!
To achieve this vision


 We must learn how to
do research IN the Web


 Not OVER the Web
Resume Speed
We wanted to duplicate
a real, peer-reviewed, bioinformatics analysis


   simply by building a model in the Web
       describing what the answer
                  (if one existed)
               would look like
...the machine had to make
     every other decision
         on it‟s own
This is the study we chose:
Gordon, P.M.K., Soliman, M.A., Bose, P., Trinh, Q., Sensen, C.W., Riabowol, K.: Interspecies
data mining to predict novel ING-protein interactions in human. BMC genomics. 9, 426 (2008).
Original Study Simplified




Using what is known about interactions in fly & yeast


         predict new interactions with your
             human protein of interest
“Pseudo-code” Abstracted Workflow

Given a protein P in Species X

   Find proteins similar to P in Species Y
   Retrieve interactors in Species Y
   Sequence-compare Y-interactors with Species X genome
        (1)  Keep only those with homologue in X


   Find proteins similar to P in Species Z
   Retrieve interactors in Species Z
   Sequence-compare Z-interactors with (1)



              Putative interactors in Species X
Modeling the answer...




                                           OWL




               Web Ontology Language (OWL) is the
                   language approved by the W3C
               for representing knowledge in the Web
Modeling the answer...

                    Note that every word in this
                    diagram is, in reality, a URL
                    (because it is OWL)

                    The model of the answer is
                    published in The Web
                    and borrows ideas from other
                    models published in The Web
Modeling the answer...


               ProbableInteractor
                   is homologous to (
                       Potential Interactor from ModelOrganism1…)
                       and
                      Potential Interactor from ModelOrganism2…)




Probable Interactor is defined in OWL as a subclass of Potential Interactor
   that requires homologous pairs of interacting proteins to exist in both
                       comparator model organisms.

                        (Effectively, an intersection)
Publish our OWL model of a Probable Interactor


                  in the Web
Running the Web Science Experiment


                       In a local data-file

           provide the protein we are interested in

    and the two species we wish to use in our comparison




  taxon:9606       a      i:OrganismOfInterest . # human
  uniprot:Q9UK53   a      i:ProteinOfInterest . # ING1
  taxon:4932       a      i:ModelOrganism1 . # yeast
  taxon:7227       a      i:ModelOrganism2 . # fly
The tricky bit is...

  In the abstract, the
search for homology is
    “generic” – ANY
 Protein, ANY model
        system

But when the machine
does the experiment, it
  must use specific of
resources because the
    answer requires       taxon:4932   a   i:ModelOrganism1 . # yeast
 information from two     taxon:7227   a   i:ModelOrganism2 . # fly
   declared species
This is the question we ask:
                  (the query language here is SPARQL)




PREFIX i: <http://sadiframework.org/ontologies/InteractingProteins.owl#>

SELECT ?protein
FROM <file:/local/workflow.input.n3>
WHERE {

     ?protein        a        i:ProbableInteractor .


}



                         The reference (URL) to our OWL model of the answer
Our system then derives (and executes) the following workflow automatically




                                                  These are different
                                                  Web services!

                                                  ...selected at run-time
                                                  based on the same model
There are four very cool things about what you just saw...
There are four very cool things about what you just saw...




              The system was able to
            create a workflow based on
             an OWL model (ontology)
There are four very cool things about what you just saw...




          The system was able to create a
            COMPUTATIONAL workflow
          based on a BIOLOGICAL model
There are four very cool things about what you just saw...




               The workflow it created
              (i.e. the services chosen)
                 differed depending on
                         context
There are four very cool things about what you just saw...




     The choice of tool-selection was guided
  by the encoded knowledge of domain-experts
                    worldwide
We got the answer

“simply” by designing a model of the answer!
How did we do that?
A “Smart” Biomedical Resource Representation System
A Web application that answers
    SPARQL-DL queries

      Query-answering
     Enhanced by SADI
Demo #1
Imagine a “virtual database”


      all of the data
   from all databases
              +
          result of
 every conceivable analysis
How can we query that database?
What is the phenotype of every allele of the
          Antirrhinum majus DEFICIENS gene




SELECT ?allele    ?image     ?desc

WHERE {
      locus:DEF            genetics:hasVariant      ?allele .
        ?allele            info:visualizedByImage   ?image .
         ?image            info:hasDescription      ?desc
}
What is the phenotype of every allele of the
          Antirrhinum majus DEFICIENS gene




SELECT ?allele    ?image     ?desc

WHERE {
      locus:DEF            genetics:hasVariant            ?allele .
        ?allele            info:visualizedByImage         ?image .
         ?image            info:hasDescription            ?desc
}


                       Note that there is no “FROM” clause!
                       We don‟t tell it where it should get the information,
                       The machine has to figure that out by itself...
Enter that query into
      SHARE
Click “Submit”...
...and in a few seconds you get your answer.
The query results are live hyperlinks
to the respective Database or images
Neither SADI nor SHARE

  know anything about

plant biology or genetics
What pathways does UniProt protein P47989 belong to?

PREFIX pred: <http://sadiframework.org/ontologies/predicates.owl#>
PREFIX ont: <http://ontology.dumontierlab.com/>
PREFIX uniprot: <http://lsrn.org/UniProt:>
SELECT ?gene ?pathway
WHERE {
        uniprot:P47989   pred:isEncodedBy    ?gene .
        ?gene            ont:isParticipantIn ?pathway .
}
What pathways does UniProt protein P47989 belong to?

PREFIX pred: <http://sadiframework.org/ontologies/predicates.owl#>
PREFIX ont: <http://ontology.dumontierlab.com/>
PREFIX uniprot: <http://lsrn.org/UniProt:>
SELECT ?gene ?pathway
WHERE {
        uniprot:P47989   pred:isEncodedBy    ?gene .
        ?gene            ont:isParticipantIn ?pathway .
}
What pathways does UniProt protein P47989 belong to?

PREFIX pred: <http://sadiframework.org/ontologies/predicates.owl#>
PREFIX ont: <http://ontology.dumontierlab.com/>
PREFIX uniprot: <http://lsrn.org/UniProt:>
SELECT ?gene ?pathway
WHERE {
        uniprot:P47989   pred:isEncodedBy    ?gene .
        ?gene            ont:isParticipantIn ?pathway .
}



               Note again that there is no “From” clause…

              I have not told SHARE where to look for the
                 answer, I am simply asking my question
Enter that query into
      SHARE
Two different
Two different   providers of
providers of    pathway
gene            information
information     (KEGG and
(KEGG &         GO);
NCBI);          were found &
were found &    accessed
accessed
The results are all links to the original data
Neither SADI nor SHARE

      know anything about

proteins or biochemical pathways
Recap
what we just saw

   We posed, and answered
~complex multi-database queries

WITHOUT A DATA WAREHOUSE
Demo #2
An example from the Clinical domain
Show me the latest Blood Urea Nitrogen and Creatinine levels
    of patients who appear to be rejecting their transplants



PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX patient: <http://sadiframework.org/ontologies/patients.owl#>
PREFIX l: <http://sadiframework.org/ontologies/predicates.owl#>
SELECT ?patient ?bun ?creat
FROM <http://sadiframework.org/ontologies/patients.rdf>
WHERE {
        ?patient rdf:type           patient:LikelyRejecter .
        ?patient l:latestBUN        ?bun .
        ?patient l:latestCreatinine ?creat .
}
Show me the latest Blood Urea Nitrogen (BUN) and
        Creatinine levels of patients who appear to be
                  rejecting their transplants


PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX patient: <http://sadiframework.org/ontologies/patients.owl#>
PREFIX l: <http://sadiframework.org/ontologies/predicates.owl#>
SELECT ?patient ?bun ?creat
FROM <http://sadiframework.org/ontologies/patients.rdf>
WHERE {
        ?patient rdf:type           patient:LikelyRejecter .
        ?patient l:latestBUN        ?bun .
        ?patient l:latestCreatinine ?creat .
}
Likely Rejecter:

A patient who has creatinine levels
   that are increasing over time


                 - - Mark D Wilkinson‟s definition
Likely Rejecter:

  …but there is no “likely rejecter”
 column or table in our database…
only blood chemistry measurements
        at various time-points
Likely Rejecter:

So the data required to answer this question
             DOESN‟T EXIST!
?
Enter that query into
      SHARE
Now…

Two “magical” events occur…
The machine decides

              by itself

         that it needs to do a
      Linear Regression analysis
on the blood creatinine measurements
   in order to answer your question
The machine decides

          by itself

how and where that analysis
       can be done

and does it automatically!
http://www.impactlab.net/2009/03/22/improve-your-brain-power/
The SHARE system utilizes SADI to discover
analytical services on the Web that do linear regression analysis
                and sends the data to be analysed
VOILA!
Neither SADI nor SHARE

     know anything about

blood chemistry, or mathematics
So how does the machine know
        what to do??
Ontologies
Ontologies explicitly define the kinds of
        things that (can) exist…

    …and what those things are “like”



                 i.e. what properties they have
     (color, weight, shape, texture, temperature, “state”)
       and what relationships they have to one another
  (inside-of, adjacent-to, part-of, binds-to, controls, inhibits,
                        degrades, etc.)
So we create ………….
  ontologies about biology
         and health

We* publish them on the Web




                   * We… or anybody! Anybody can publish an ontology!
My definition of a Likely Rejecter is encoded in
       a machine-readable document written in the OWL Ontology language

Basically:

  “the regression line over creatinine measurements should have an increasing slope”
Our ontology refers to other ontologies (possibly published by other people)
          to learn about what the properties of “regression models” are
              e.g. that regression models have slopes and intercepts
                and that slopes and intercepts have decimal values
SHARE examines the query


Looks on the Web for ontologies that describe the
  problem it is trying to solve, and “reads” them


 then uses that “knowledge” to figure out which
   data-sources and analytical tools it needs
              to answer the query
The way SHARE “interprets” data varies
        depending on the context of the query
  (i.e. which ontologies it reads – Mine? Yours?)


              and on what part of the query
     it is trying to answer at any given moment
(which ontological concept is relevant to that clause)
Data exhibits “late binding”
Late binding:

     “purpose and meaning”
           of the data is
       not determined until
    the moment it is required

a.k.a The “semantics” of the data
Benefit
  of late binding


  Data is amenable to
constant re-interpretation
Example?

Blood Creatinine measurements

 were not dictated to be (only)

Blood Creatinine measurements
Example?

   The data had the „qualities/properties‟ that

       allowed one machine to interpret

that they were Blood Creatinine measurements

(e.g. to determine which patients were rejecting)
Example?

But the data also had the „qualities/properties‟ that

 allowed another machine to interpret them as

            Simple X/Y coordinate data

   (e.g. the Linear Regression calculation tool)
Benefit
  of late binding


  Data is amenable to
constant re-interpretation
http://www.flickr.com/people/faernworks/
And that brings us to...
Gordon, P.M.K., Soliman, M.A., Bose, P., Trinh, Q., Sensen, C.W., Riabowol, K.: Interspecies
data mining to predict novel ING-protein interactions in human. BMC genomics. 9, 426 (2008).
We built a model of the proposed answer
Our system converted the model into the experiment
The analytical tools chosen for that
  experiment changed depending on

                context

even though the biological model driving
      their selection was the same
i.e.

The published model is re-usable
i.e.

      The published model is re-usable

In different contexts... By different researchers
and because the model IS the experiment


  the published EXPERIMENT is re-usable!!




simply point the same query at your own dataset...
The publication is an
executable document!
Every component of the model

      Every component of the input data

     Every component of the output data

                   is a URL


Therefore the question, the experiment, and the
answer, are immediately published IN the Web
Every component of the model

      Every component of the input data

     Every component of the output data

                  is a URL


The answer, and the knowledge derived from it,
  is immediately available to search engines
and moreover, can affect the outcome of other
         Web Science experiments
You
Are Now
 Here!!!
Final thoughts
An experiment... based on a hypothesis
An experiment... based on a hypothesis

        now modeled in OWL
Does this OWL Class represent the Hypothesis?

                I think it does!
We modeled the answer...
...but the answer was hypothetical
Change the way we think of “hypotheses”
In Web Science 2.0


Model what the world would “look like”
    if your hypothesis were true


   Then ask “is there any data that
          fits that model?”
Like the blind men examining an elephant

    Seemingly different aspects of research
when viewed from the perspective of Web Science
           become the same “thing”

                  The Model
Our vision of Web Science 2.0

Hypothesis                                                     Query




Workflow



  Ontology                                                  Result
                      Materials &
                       Methods
                                    These can be automatically derived through
                                    provenance information during workflow execution
Please join us!

SADI and SHARE are Open-Source projects

       http://sadiframework.org
My New Home!
University of British Columbia


Luke McCarthy – Lead Dev.                  Edward Kawas
Everything...                              SADI Service auto-generator



Benjamin VanderValk                        Ian Wood
SHARE & SADI & Experimental modeling &     Experimental modeling project
myHeath Button




Soroush Samadian
Cardiovascular data modeling and queries
C-BRASS Collaborators at other sites

U of New Brunswick      Carleton University

Dr. Chris Baker         Dr. Michel Dumontier
Alexandre Riazanov      Marc-Alexandre Nolin
                        Leonid Chepelev
                        Steve Etlinger
                        Nichaella Kieth
                        Jose Cruz
Microsoft Research

Weitere ähnliche Inhalte

Andere mochten auch

LA FARMACIA DE DIOS
LA FARMACIA DE DIOSLA FARMACIA DE DIOS
LA FARMACIA DE DIOS
pipis397
 
Projectplan lerarentekort Rotterdam 2008
Projectplan lerarentekort Rotterdam 2008Projectplan lerarentekort Rotterdam 2008
Projectplan lerarentekort Rotterdam 2008Luc Sluijsmans
 
How SADI & SHARE help restore the Scientific Method to in silico science
How SADI & SHARE help restore the Scientific Method to in silico scienceHow SADI & SHARE help restore the Scientific Method to in silico science
How SADI & SHARE help restore the Scientific Method to in silico science
Mark Wilkinson
 
HOJA DE VIDA "LEODAN MARTINEZ "
HOJA DE VIDA "LEODAN MARTINEZ "HOJA DE VIDA "LEODAN MARTINEZ "
HOJA DE VIDA "LEODAN MARTINEZ "
leodanxxg
 
Building a community around your blog v3
Building a community around your blog v3Building a community around your blog v3
Building a community around your blog v3
Brendan Sera-Shriar
 
Tutorial 1.4 - Explore results in a heatmap
Tutorial 1.4 - Explore results in a heatmapTutorial 1.4 - Explore results in a heatmap
Tutorial 1.4 - Explore results in a heatmap
Biomedical Genomics (GRIB)
 
ISoLA 2010: SADI Taverna plug-in
ISoLA 2010:  SADI Taverna plug-inISoLA 2010:  SADI Taverna plug-in
ISoLA 2010: SADI Taverna plug-in
Mark Wilkinson
 
The Evolution of Live Preview Environment Design
The Evolution of Live Preview Environment DesignThe Evolution of Live Preview Environment Design
The Evolution of Live Preview Environment Design
Brendan Sera-Shriar
 
Using Semantics to personalize medical research
Using Semantics to personalize medical researchUsing Semantics to personalize medical research
Using Semantics to personalize medical research
Mark Wilkinson
 
It’s a WIN, WIN: ‘WordPress On Windows’
It’s a WIN, WIN: ‘WordPress On Windows’It’s a WIN, WIN: ‘WordPress On Windows’
It’s a WIN, WIN: ‘WordPress On Windows’
Brendan Sera-Shriar
 
T I P S P A R A D I S F R U T A R T U V I D A Mafalda
T I P S  P A R A  D I S F R U T A R  T U  V I D A     MafaldaT I P S  P A R A  D I S F R U T A R  T U  V I D A     Mafalda
T I P S P A R A D I S F R U T A R T U V I D A Mafalda
pipis397
 
Artikel opleiden in de school in Rotterdam
Artikel opleiden in de school in RotterdamArtikel opleiden in de school in Rotterdam
Artikel opleiden in de school in RotterdamLuc Sluijsmans
 
Ata 2010 Steve Brubaker Data Analytics
Ata 2010   Steve Brubaker Data AnalyticsAta 2010   Steve Brubaker Data Analytics
Ata 2010 Steve Brubaker Data Analytics
stevebrubaker
 
SmartBrief Portfolio
SmartBrief PortfolioSmartBrief Portfolio
SmartBrief Portfolio
SmartBrief
 
Parade of books!
Parade of books!Parade of books!
Parade of books!
Paola Padilla
 

Andere mochten auch (16)

LA FARMACIA DE DIOS
LA FARMACIA DE DIOSLA FARMACIA DE DIOS
LA FARMACIA DE DIOS
 
Projectplan lerarentekort Rotterdam 2008
Projectplan lerarentekort Rotterdam 2008Projectplan lerarentekort Rotterdam 2008
Projectplan lerarentekort Rotterdam 2008
 
How SADI & SHARE help restore the Scientific Method to in silico science
How SADI & SHARE help restore the Scientific Method to in silico scienceHow SADI & SHARE help restore the Scientific Method to in silico science
How SADI & SHARE help restore the Scientific Method to in silico science
 
HOJA DE VIDA "LEODAN MARTINEZ "
HOJA DE VIDA "LEODAN MARTINEZ "HOJA DE VIDA "LEODAN MARTINEZ "
HOJA DE VIDA "LEODAN MARTINEZ "
 
Building a community around your blog v3
Building a community around your blog v3Building a community around your blog v3
Building a community around your blog v3
 
Tutorial 1.4 - Explore results in a heatmap
Tutorial 1.4 - Explore results in a heatmapTutorial 1.4 - Explore results in a heatmap
Tutorial 1.4 - Explore results in a heatmap
 
ISoLA 2010: SADI Taverna plug-in
ISoLA 2010:  SADI Taverna plug-inISoLA 2010:  SADI Taverna plug-in
ISoLA 2010: SADI Taverna plug-in
 
PAPER 2 2010
PAPER 2 2010PAPER 2 2010
PAPER 2 2010
 
The Evolution of Live Preview Environment Design
The Evolution of Live Preview Environment DesignThe Evolution of Live Preview Environment Design
The Evolution of Live Preview Environment Design
 
Using Semantics to personalize medical research
Using Semantics to personalize medical researchUsing Semantics to personalize medical research
Using Semantics to personalize medical research
 
It’s a WIN, WIN: ‘WordPress On Windows’
It’s a WIN, WIN: ‘WordPress On Windows’It’s a WIN, WIN: ‘WordPress On Windows’
It’s a WIN, WIN: ‘WordPress On Windows’
 
T I P S P A R A D I S F R U T A R T U V I D A Mafalda
T I P S  P A R A  D I S F R U T A R  T U  V I D A     MafaldaT I P S  P A R A  D I S F R U T A R  T U  V I D A     Mafalda
T I P S P A R A D I S F R U T A R T U V I D A Mafalda
 
Artikel opleiden in de school in Rotterdam
Artikel opleiden in de school in RotterdamArtikel opleiden in de school in Rotterdam
Artikel opleiden in de school in Rotterdam
 
Ata 2010 Steve Brubaker Data Analytics
Ata 2010   Steve Brubaker Data AnalyticsAta 2010   Steve Brubaker Data Analytics
Ata 2010 Steve Brubaker Data Analytics
 
SmartBrief Portfolio
SmartBrief PortfolioSmartBrief Portfolio
SmartBrief Portfolio
 
Parade of books!
Parade of books!Parade of books!
Parade of books!
 

Ähnlich wie Web Science 2.0 - in silico science

Presentation to the J. Craig Venter Institute, Dec. 2014
Presentation to the J. Craig Venter Institute, Dec. 2014Presentation to the J. Craig Venter Institute, Dec. 2014
Presentation to the J. Craig Venter Institute, Dec. 2014
Mark Wilkinson
 
Big Data and the Future by Sherri Rose
Big Data and the Future by Sherri RoseBig Data and the Future by Sherri Rose
Big Data and the Future by Sherri Rose
Lewis Lin 🦊
 
298-1678-2-PB
298-1678-2-PB298-1678-2-PB
298-1678-2-PB
Leila Jameel
 
Disrupting drug discovery one disease at a time
Disrupting drug discovery one disease at a timeDisrupting drug discovery one disease at a time
Disrupting drug discovery one disease at a time
Sean Ekins
 
Jason KnottBritain on ViewPhotolibrarychapter 1Psych.docx
Jason KnottBritain on ViewPhotolibrarychapter 1Psych.docxJason KnottBritain on ViewPhotolibrarychapter 1Psych.docx
Jason KnottBritain on ViewPhotolibrarychapter 1Psych.docx
christiandean12115
 
Why Research?
Why Research?Why Research?
Why Research?
Bhaswat Chakraborty
 
11 am sim
11 am sim11 am sim
Ecc2012 13 4
Ecc2012 13 4Ecc2012 13 4
Ecc2012 13 4
elena.pasquinelli
 
Sources of innovation
Sources of innovationSources of innovation
Sources of innovation
RohanaDaulay
 
final year project write up
final year project write upfinal year project write up
final year project write up
Daniel Little
 
Research Ethics for JPCFM (Jan2011)
Research Ethics for JPCFM (Jan2011)Research Ethics for JPCFM (Jan2011)
Research Ethics for JPCFM (Jan2011)
Dr Ghaiath Hussein
 
IT and me reflections Kim Solez
IT and me reflections Kim SolezIT and me reflections Kim Solez
IT and me reflections Kim Solez
Kim Solez ,
 
Biochemistry Career Progression
Biochemistry Career ProgressionBiochemistry Career Progression
Biochemistry Career Progression
Jennifer Perry
 
Kim Solez How Will The Technological Singularity Express Itself in Nepal?
Kim Solez How Will The Technological Singularity Express Itself in Nepal?Kim Solez How Will The Technological Singularity Express Itself in Nepal?
Kim Solez How Will The Technological Singularity Express Itself in Nepal?
Kim Solez ,
 
Research Essay Conclusion
Research Essay ConclusionResearch Essay Conclusion
Research Essay Conclusion
Paper Writing Service Cheap
 
Growing Stronger Research Fund Overview Jan 2012 For Donors
Growing Stronger Research Fund Overview Jan 2012 For DonorsGrowing Stronger Research Fund Overview Jan 2012 For Donors
Growing Stronger Research Fund Overview Jan 2012 For Donors
Amer Haider
 
13b Data analysis and causal inference – 2
13b Data analysis and causal inference – 213b Data analysis and causal inference – 2
13b Data analysis and causal inference – 2
Abdiwali Abdullahi Abdiwali
 
Advancing-OSHMS High-Performance WS in OHM
Advancing-OSHMS High-Performance WS in OHMAdvancing-OSHMS High-Performance WS in OHM
Advancing-OSHMS High-Performance WS in OHM
Christopher Bell, MSHI, CAPM, CHTS-IM
 
Nano med july 24 25 2017 tbagchi
Nano med july 24 25 2017 tbagchiNano med july 24 25 2017 tbagchi
Nano med july 24 25 2017 tbagchi
Tista Bagchi
 
Web Science, SADI, and the Singularity
Web Science, SADI, and the SingularityWeb Science, SADI, and the Singularity
Web Science, SADI, and the Singularity
Mark Wilkinson
 

Ähnlich wie Web Science 2.0 - in silico science (20)

Presentation to the J. Craig Venter Institute, Dec. 2014
Presentation to the J. Craig Venter Institute, Dec. 2014Presentation to the J. Craig Venter Institute, Dec. 2014
Presentation to the J. Craig Venter Institute, Dec. 2014
 
Big Data and the Future by Sherri Rose
Big Data and the Future by Sherri RoseBig Data and the Future by Sherri Rose
Big Data and the Future by Sherri Rose
 
298-1678-2-PB
298-1678-2-PB298-1678-2-PB
298-1678-2-PB
 
Disrupting drug discovery one disease at a time
Disrupting drug discovery one disease at a timeDisrupting drug discovery one disease at a time
Disrupting drug discovery one disease at a time
 
Jason KnottBritain on ViewPhotolibrarychapter 1Psych.docx
Jason KnottBritain on ViewPhotolibrarychapter 1Psych.docxJason KnottBritain on ViewPhotolibrarychapter 1Psych.docx
Jason KnottBritain on ViewPhotolibrarychapter 1Psych.docx
 
Why Research?
Why Research?Why Research?
Why Research?
 
11 am sim
11 am sim11 am sim
11 am sim
 
Ecc2012 13 4
Ecc2012 13 4Ecc2012 13 4
Ecc2012 13 4
 
Sources of innovation
Sources of innovationSources of innovation
Sources of innovation
 
final year project write up
final year project write upfinal year project write up
final year project write up
 
Research Ethics for JPCFM (Jan2011)
Research Ethics for JPCFM (Jan2011)Research Ethics for JPCFM (Jan2011)
Research Ethics for JPCFM (Jan2011)
 
IT and me reflections Kim Solez
IT and me reflections Kim SolezIT and me reflections Kim Solez
IT and me reflections Kim Solez
 
Biochemistry Career Progression
Biochemistry Career ProgressionBiochemistry Career Progression
Biochemistry Career Progression
 
Kim Solez How Will The Technological Singularity Express Itself in Nepal?
Kim Solez How Will The Technological Singularity Express Itself in Nepal?Kim Solez How Will The Technological Singularity Express Itself in Nepal?
Kim Solez How Will The Technological Singularity Express Itself in Nepal?
 
Research Essay Conclusion
Research Essay ConclusionResearch Essay Conclusion
Research Essay Conclusion
 
Growing Stronger Research Fund Overview Jan 2012 For Donors
Growing Stronger Research Fund Overview Jan 2012 For DonorsGrowing Stronger Research Fund Overview Jan 2012 For Donors
Growing Stronger Research Fund Overview Jan 2012 For Donors
 
13b Data analysis and causal inference – 2
13b Data analysis and causal inference – 213b Data analysis and causal inference – 2
13b Data analysis and causal inference – 2
 
Advancing-OSHMS High-Performance WS in OHM
Advancing-OSHMS High-Performance WS in OHMAdvancing-OSHMS High-Performance WS in OHM
Advancing-OSHMS High-Performance WS in OHM
 
Nano med july 24 25 2017 tbagchi
Nano med july 24 25 2017 tbagchiNano med july 24 25 2017 tbagchi
Nano med july 24 25 2017 tbagchi
 
Web Science, SADI, and the Singularity
Web Science, SADI, and the SingularityWeb Science, SADI, and the Singularity
Web Science, SADI, and the Singularity
 

Mehr von Mark Wilkinson

FAIR Metrics - Presentation to NIH KC1
FAIR Metrics - Presentation to NIH KC1FAIR Metrics - Presentation to NIH KC1
FAIR Metrics - Presentation to NIH KC1
Mark Wilkinson
 
Introducing the fair evaluator
Introducing the fair evaluatorIntroducing the fair evaluator
Introducing the fair evaluator
Mark Wilkinson
 
FAIR Projector Builder
FAIR Projector BuilderFAIR Projector Builder
FAIR Projector Builder
Mark Wilkinson
 
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
Mark Wilkinson
 
smartAPIs: EUDAT Semantic Working Group Presentation @ RDA 9th Plenary
smartAPIs:  EUDAT Semantic Working Group Presentation @ RDA 9th PlenarysmartAPIs:  EUDAT Semantic Working Group Presentation @ RDA 9th Plenary
smartAPIs: EUDAT Semantic Working Group Presentation @ RDA 9th Plenary
Mark Wilkinson
 
IBC FAIR Data Prototype Implementation slideshow
IBC FAIR Data Prototype Implementation   slideshowIBC FAIR Data Prototype Implementation   slideshow
IBC FAIR Data Prototype Implementation slideshow
Mark Wilkinson
 
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
Mark Wilkinson
 
Building SADI Services Tutorial - SIB Workshop, Geneva, December 2015
Building SADI Services Tutorial - SIB Workshop, Geneva, December 2015Building SADI Services Tutorial - SIB Workshop, Geneva, December 2015
Building SADI Services Tutorial - SIB Workshop, Geneva, December 2015
Mark Wilkinson
 
Sample data and other ur ls
Sample data and other ur lsSample data and other ur ls
Sample data and other ur ls
Mark Wilkinson
 
Example code for the SADI BMI Calculator Web Service
Example code for the SADI BMI Calculator Web ServiceExample code for the SADI BMI Calculator Web Service
Example code for the SADI BMI Calculator Web Service
Mark Wilkinson
 
Sadi service
Sadi serviceSadi service
Sadi service
Mark Wilkinson
 
Tutorial - Creating SADI semantic-web-services
Tutorial - Creating SADI semantic-web-servicesTutorial - Creating SADI semantic-web-services
Tutorial - Creating SADI semantic-web-services
Mark Wilkinson
 
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
Mark Wilkinson
 
Force11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, OxfordForce11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, Oxford
Mark Wilkinson
 
Enhancing Reproducibility and Transparency in Clinical Research through Seman...
Enhancing Reproducibility and Transparency in Clinical Research through Seman...Enhancing Reproducibility and Transparency in Clinical Research through Seman...
Enhancing Reproducibility and Transparency in Clinical Research through Seman...
Mark Wilkinson
 
SADI CSHALS 2013
SADI CSHALS 2013SADI CSHALS 2013
SADI CSHALS 2013
Mark Wilkinson
 
Web Science - ISoLA 2012
Web Science - ISoLA 2012Web Science - ISoLA 2012
Web Science - ISoLA 2012
Mark Wilkinson
 
SWAT4LS 2011: SADI Knowledge Explorer Plug-in
SWAT4LS 2011: SADI Knowledge Explorer Plug-inSWAT4LS 2011: SADI Knowledge Explorer Plug-in
SWAT4LS 2011: SADI Knowledge Explorer Plug-in
Mark Wilkinson
 
SADI in Perl - Protege Plugin Tutorial (fixed Aug 24, 2011)
SADI in Perl - Protege Plugin Tutorial (fixed Aug 24, 2011)SADI in Perl - Protege Plugin Tutorial (fixed Aug 24, 2011)
SADI in Perl - Protege Plugin Tutorial (fixed Aug 24, 2011)
Mark Wilkinson
 
Technologies, methods and challenges to data sharing and aggrigation
Technologies, methods and challenges to data sharing and aggrigationTechnologies, methods and challenges to data sharing and aggrigation
Technologies, methods and challenges to data sharing and aggrigation
Mark Wilkinson
 

Mehr von Mark Wilkinson (20)

FAIR Metrics - Presentation to NIH KC1
FAIR Metrics - Presentation to NIH KC1FAIR Metrics - Presentation to NIH KC1
FAIR Metrics - Presentation to NIH KC1
 
Introducing the fair evaluator
Introducing the fair evaluatorIntroducing the fair evaluator
Introducing the fair evaluator
 
FAIR Projector Builder
FAIR Projector BuilderFAIR Projector Builder
FAIR Projector Builder
 
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
 
smartAPIs: EUDAT Semantic Working Group Presentation @ RDA 9th Plenary
smartAPIs:  EUDAT Semantic Working Group Presentation @ RDA 9th PlenarysmartAPIs:  EUDAT Semantic Working Group Presentation @ RDA 9th Plenary
smartAPIs: EUDAT Semantic Working Group Presentation @ RDA 9th Plenary
 
IBC FAIR Data Prototype Implementation slideshow
IBC FAIR Data Prototype Implementation   slideshowIBC FAIR Data Prototype Implementation   slideshow
IBC FAIR Data Prototype Implementation slideshow
 
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
 
Building SADI Services Tutorial - SIB Workshop, Geneva, December 2015
Building SADI Services Tutorial - SIB Workshop, Geneva, December 2015Building SADI Services Tutorial - SIB Workshop, Geneva, December 2015
Building SADI Services Tutorial - SIB Workshop, Geneva, December 2015
 
Sample data and other ur ls
Sample data and other ur lsSample data and other ur ls
Sample data and other ur ls
 
Example code for the SADI BMI Calculator Web Service
Example code for the SADI BMI Calculator Web ServiceExample code for the SADI BMI Calculator Web Service
Example code for the SADI BMI Calculator Web Service
 
Sadi service
Sadi serviceSadi service
Sadi service
 
Tutorial - Creating SADI semantic-web-services
Tutorial - Creating SADI semantic-web-servicesTutorial - Creating SADI semantic-web-services
Tutorial - Creating SADI semantic-web-services
 
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
 
Force11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, OxfordForce11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, Oxford
 
Enhancing Reproducibility and Transparency in Clinical Research through Seman...
Enhancing Reproducibility and Transparency in Clinical Research through Seman...Enhancing Reproducibility and Transparency in Clinical Research through Seman...
Enhancing Reproducibility and Transparency in Clinical Research through Seman...
 
SADI CSHALS 2013
SADI CSHALS 2013SADI CSHALS 2013
SADI CSHALS 2013
 
Web Science - ISoLA 2012
Web Science - ISoLA 2012Web Science - ISoLA 2012
Web Science - ISoLA 2012
 
SWAT4LS 2011: SADI Knowledge Explorer Plug-in
SWAT4LS 2011: SADI Knowledge Explorer Plug-inSWAT4LS 2011: SADI Knowledge Explorer Plug-in
SWAT4LS 2011: SADI Knowledge Explorer Plug-in
 
SADI in Perl - Protege Plugin Tutorial (fixed Aug 24, 2011)
SADI in Perl - Protege Plugin Tutorial (fixed Aug 24, 2011)SADI in Perl - Protege Plugin Tutorial (fixed Aug 24, 2011)
SADI in Perl - Protege Plugin Tutorial (fixed Aug 24, 2011)
 
Technologies, methods and challenges to data sharing and aggrigation
Technologies, methods and challenges to data sharing and aggrigationTechnologies, methods and challenges to data sharing and aggrigation
Technologies, methods and challenges to data sharing and aggrigation
 

Kürzlich hochgeladen

Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
Zilliz
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
Things to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUUThings to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUU
FODUU
 

Kürzlich hochgeladen (20)

Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
Things to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUUThings to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUU
 

Web Science 2.0 - in silico science

  • 1. Web Science 2.0 Conducting in silico research in the Web from hypothesis to publication Mark Wilkinson Isaac Peral Senior Researcher in Biological Informatics Centro de Biotecnología y Genómica de Plantas, UPM, Madrid, Spain Adjunct Professor of Medical Genetics, University of British Columbia Vancouver, BC, Canada.
  • 2. Context Multiple recent surveys of high-throughput biology reveal that upwards of 50% of published studies are not reproducible - Baggerly, 2009 - Ioannidis, 2009
  • 3. Context “the most common errors are simple, the most simple errors are common” - Baggerly, 2009
  • 4. Context These errors pass peer review The researcher is unaware of the error The process that led to the error is not recorded Therefore it cannot be detected during peer-review
  • 5. Context Discovery of such errors have resulted in retractions and even shut-down clinical trials
  • 6. Context In March, 2012, the US Institute of Medicine said “Enough is enough!”
  • 7. Context Institute of Medicine Recommendations For Conduct of High-Throughput Research: 1. Rigorously-described, -annotated, and -followed data management procedures 2. “Lock down” the computational analysis pipeline once it has been selected 3. Publish the workflow in a formal manner, together with the full starting and result datasets Evolution of Translational Omics Lessons Learned and the Path Forward. The Institute of Medicine of the National Academies, Report Brief, March 2012.
  • 8. Achieving these recommendations requires integration of existing technologies and invention of new ones
  • 9. Context “While it took 2,300 years after the first report of angina for the condition to be commonly taught in medical curricula, modern discoveries are being disseminated at an increasingly rapid pace.” The Healthcare Singularity and the Age of Semantic Medicine, Michael Gillam, et al, The Fourth Paradigm: Data-Intensive Scientific Discovery Tony Hey (Editor), 2009 Slide adapted with permission from Joanne Luciano, Presentation at Health Web Science Workshop 2012, Evanston IL, USA June 22, 2012.
  • 10. “The Singularity” The X-intercept is where, the moment a discovery is made, it is immediately put into practice (not only medical practice, but any research endeavour...) The Healthcare Singularity and the Age of Semantic Medicine, Michael Gillam, et al, The Fourth Paradigm: Data-Intensive Scientific Discovery Tony Hey (Editor), 2009 Slide Borrowed with Permission from Joanne Luciano, Presentation at Health Web Science Workshop 2012, Evanston IL, USA June 22, 2012.
  • 11. The technology required to achieve this does not yet exist
  • 12. You Are Here Scientific research would have to be conducted within a medium that immediately interpreted and disseminated the results...
  • 13. You Are Here ...in a form that immediately (actively!) affected the research of others...
  • 14. You Are Here ...without requiring them to be aware of these new discoveries.
  • 15. I‟d like to show you how close we now are to this vision and how we got there
  • 17. We wanted to duplicate a real, peer-reviewed, bioinformatics analysis simply by building a model in the Web describing what the answer (if one existed) would look like
  • 18. ...the machine had to make every other decision on it‟s own
  • 20. How we use The Web today
  • 21.
  • 22. By clicking here you cause this incredibly powerful computational tool called The Web to retrieve a chunk of text and images that can only be understood by a human...
  • 23. The Web is not a pigeon!
  • 24. To achieve this vision We must learn how to do research IN the Web Not OVER the Web
  • 26. We wanted to duplicate a real, peer-reviewed, bioinformatics analysis simply by building a model in the Web describing what the answer (if one existed) would look like
  • 27. ...the machine had to make every other decision on it‟s own
  • 28. This is the study we chose:
  • 29. Gordon, P.M.K., Soliman, M.A., Bose, P., Trinh, Q., Sensen, C.W., Riabowol, K.: Interspecies data mining to predict novel ING-protein interactions in human. BMC genomics. 9, 426 (2008).
  • 30. Original Study Simplified Using what is known about interactions in fly & yeast predict new interactions with your human protein of interest
  • 31. “Pseudo-code” Abstracted Workflow Given a protein P in Species X Find proteins similar to P in Species Y Retrieve interactors in Species Y Sequence-compare Y-interactors with Species X genome (1)  Keep only those with homologue in X Find proteins similar to P in Species Z Retrieve interactors in Species Z Sequence-compare Z-interactors with (1)  Putative interactors in Species X
  • 32. Modeling the answer... OWL Web Ontology Language (OWL) is the language approved by the W3C for representing knowledge in the Web
  • 33. Modeling the answer... Note that every word in this diagram is, in reality, a URL (because it is OWL) The model of the answer is published in The Web and borrows ideas from other models published in The Web
  • 34. Modeling the answer... ProbableInteractor is homologous to ( Potential Interactor from ModelOrganism1…) and Potential Interactor from ModelOrganism2…) Probable Interactor is defined in OWL as a subclass of Potential Interactor that requires homologous pairs of interacting proteins to exist in both comparator model organisms. (Effectively, an intersection)
  • 35. Publish our OWL model of a Probable Interactor in the Web
  • 36. Running the Web Science Experiment In a local data-file provide the protein we are interested in and the two species we wish to use in our comparison taxon:9606 a i:OrganismOfInterest . # human uniprot:Q9UK53 a i:ProteinOfInterest . # ING1 taxon:4932 a i:ModelOrganism1 . # yeast taxon:7227 a i:ModelOrganism2 . # fly
  • 37. The tricky bit is... In the abstract, the search for homology is “generic” – ANY Protein, ANY model system But when the machine does the experiment, it must use specific of resources because the answer requires taxon:4932 a i:ModelOrganism1 . # yeast information from two taxon:7227 a i:ModelOrganism2 . # fly declared species
  • 38. This is the question we ask: (the query language here is SPARQL) PREFIX i: <http://sadiframework.org/ontologies/InteractingProteins.owl#> SELECT ?protein FROM <file:/local/workflow.input.n3> WHERE { ?protein a i:ProbableInteractor . } The reference (URL) to our OWL model of the answer
  • 39. Our system then derives (and executes) the following workflow automatically These are different Web services! ...selected at run-time based on the same model
  • 40.
  • 41. There are four very cool things about what you just saw...
  • 42. There are four very cool things about what you just saw... The system was able to create a workflow based on an OWL model (ontology)
  • 43. There are four very cool things about what you just saw... The system was able to create a COMPUTATIONAL workflow based on a BIOLOGICAL model
  • 44. There are four very cool things about what you just saw... The workflow it created (i.e. the services chosen) differed depending on context
  • 45. There are four very cool things about what you just saw... The choice of tool-selection was guided by the encoded knowledge of domain-experts worldwide
  • 46. We got the answer “simply” by designing a model of the answer!
  • 47. How did we do that?
  • 48. A “Smart” Biomedical Resource Representation System
  • 49. A Web application that answers SPARQL-DL queries Query-answering Enhanced by SADI
  • 51. Imagine a “virtual database” all of the data from all databases + result of every conceivable analysis
  • 52. How can we query that database?
  • 53.
  • 54. What is the phenotype of every allele of the Antirrhinum majus DEFICIENS gene SELECT ?allele ?image ?desc WHERE { locus:DEF genetics:hasVariant ?allele . ?allele info:visualizedByImage ?image . ?image info:hasDescription ?desc }
  • 55. What is the phenotype of every allele of the Antirrhinum majus DEFICIENS gene SELECT ?allele ?image ?desc WHERE { locus:DEF genetics:hasVariant ?allele . ?allele info:visualizedByImage ?image . ?image info:hasDescription ?desc } Note that there is no “FROM” clause! We don‟t tell it where it should get the information, The machine has to figure that out by itself...
  • 56. Enter that query into SHARE
  • 58. ...and in a few seconds you get your answer.
  • 59. The query results are live hyperlinks to the respective Database or images
  • 60. Neither SADI nor SHARE know anything about plant biology or genetics
  • 61. What pathways does UniProt protein P47989 belong to? PREFIX pred: <http://sadiframework.org/ontologies/predicates.owl#> PREFIX ont: <http://ontology.dumontierlab.com/> PREFIX uniprot: <http://lsrn.org/UniProt:> SELECT ?gene ?pathway WHERE { uniprot:P47989 pred:isEncodedBy ?gene . ?gene ont:isParticipantIn ?pathway . }
  • 62. What pathways does UniProt protein P47989 belong to? PREFIX pred: <http://sadiframework.org/ontologies/predicates.owl#> PREFIX ont: <http://ontology.dumontierlab.com/> PREFIX uniprot: <http://lsrn.org/UniProt:> SELECT ?gene ?pathway WHERE { uniprot:P47989 pred:isEncodedBy ?gene . ?gene ont:isParticipantIn ?pathway . }
  • 63. What pathways does UniProt protein P47989 belong to? PREFIX pred: <http://sadiframework.org/ontologies/predicates.owl#> PREFIX ont: <http://ontology.dumontierlab.com/> PREFIX uniprot: <http://lsrn.org/UniProt:> SELECT ?gene ?pathway WHERE { uniprot:P47989 pred:isEncodedBy ?gene . ?gene ont:isParticipantIn ?pathway . } Note again that there is no “From” clause… I have not told SHARE where to look for the answer, I am simply asking my question
  • 64. Enter that query into SHARE
  • 65.
  • 66.
  • 67. Two different Two different providers of providers of pathway gene information information (KEGG and (KEGG & GO); NCBI); were found & were found & accessed accessed
  • 68. The results are all links to the original data
  • 69. Neither SADI nor SHARE know anything about proteins or biochemical pathways
  • 70. Recap what we just saw We posed, and answered ~complex multi-database queries WITHOUT A DATA WAREHOUSE
  • 71. Demo #2 An example from the Clinical domain
  • 72. Show me the latest Blood Urea Nitrogen and Creatinine levels of patients who appear to be rejecting their transplants PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX patient: <http://sadiframework.org/ontologies/patients.owl#> PREFIX l: <http://sadiframework.org/ontologies/predicates.owl#> SELECT ?patient ?bun ?creat FROM <http://sadiframework.org/ontologies/patients.rdf> WHERE { ?patient rdf:type patient:LikelyRejecter . ?patient l:latestBUN ?bun . ?patient l:latestCreatinine ?creat . }
  • 73. Show me the latest Blood Urea Nitrogen (BUN) and Creatinine levels of patients who appear to be rejecting their transplants PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX patient: <http://sadiframework.org/ontologies/patients.owl#> PREFIX l: <http://sadiframework.org/ontologies/predicates.owl#> SELECT ?patient ?bun ?creat FROM <http://sadiframework.org/ontologies/patients.rdf> WHERE { ?patient rdf:type patient:LikelyRejecter . ?patient l:latestBUN ?bun . ?patient l:latestCreatinine ?creat . }
  • 74. Likely Rejecter: A patient who has creatinine levels that are increasing over time - - Mark D Wilkinson‟s definition
  • 75. Likely Rejecter: …but there is no “likely rejecter” column or table in our database… only blood chemistry measurements at various time-points
  • 76. Likely Rejecter: So the data required to answer this question DOESN‟T EXIST!
  • 77. ?
  • 78. Enter that query into SHARE
  • 80. The machine decides by itself that it needs to do a Linear Regression analysis on the blood creatinine measurements in order to answer your question
  • 81. The machine decides by itself how and where that analysis can be done and does it automatically!
  • 83. The SHARE system utilizes SADI to discover analytical services on the Web that do linear regression analysis and sends the data to be analysed
  • 85. Neither SADI nor SHARE know anything about blood chemistry, or mathematics
  • 86. So how does the machine know what to do??
  • 88. Ontologies explicitly define the kinds of things that (can) exist… …and what those things are “like” i.e. what properties they have (color, weight, shape, texture, temperature, “state”) and what relationships they have to one another (inside-of, adjacent-to, part-of, binds-to, controls, inhibits, degrades, etc.)
  • 89. So we create …………. ontologies about biology and health We* publish them on the Web * We… or anybody! Anybody can publish an ontology!
  • 90. My definition of a Likely Rejecter is encoded in a machine-readable document written in the OWL Ontology language Basically: “the regression line over creatinine measurements should have an increasing slope”
  • 91. Our ontology refers to other ontologies (possibly published by other people) to learn about what the properties of “regression models” are e.g. that regression models have slopes and intercepts and that slopes and intercepts have decimal values
  • 92. SHARE examines the query Looks on the Web for ontologies that describe the problem it is trying to solve, and “reads” them then uses that “knowledge” to figure out which data-sources and analytical tools it needs to answer the query
  • 93. The way SHARE “interprets” data varies depending on the context of the query (i.e. which ontologies it reads – Mine? Yours?) and on what part of the query it is trying to answer at any given moment (which ontological concept is relevant to that clause)
  • 94. Data exhibits “late binding”
  • 95. Late binding: “purpose and meaning” of the data is not determined until the moment it is required a.k.a The “semantics” of the data
  • 96. Benefit of late binding Data is amenable to constant re-interpretation
  • 97. Example? Blood Creatinine measurements were not dictated to be (only) Blood Creatinine measurements
  • 98. Example? The data had the „qualities/properties‟ that allowed one machine to interpret that they were Blood Creatinine measurements (e.g. to determine which patients were rejecting)
  • 99. Example? But the data also had the „qualities/properties‟ that allowed another machine to interpret them as Simple X/Y coordinate data (e.g. the Linear Regression calculation tool)
  • 100. Benefit of late binding Data is amenable to constant re-interpretation
  • 102. And that brings us to...
  • 103. Gordon, P.M.K., Soliman, M.A., Bose, P., Trinh, Q., Sensen, C.W., Riabowol, K.: Interspecies data mining to predict novel ING-protein interactions in human. BMC genomics. 9, 426 (2008).
  • 104. We built a model of the proposed answer
  • 105. Our system converted the model into the experiment
  • 106. The analytical tools chosen for that experiment changed depending on context even though the biological model driving their selection was the same
  • 107. i.e. The published model is re-usable
  • 108. i.e. The published model is re-usable In different contexts... By different researchers
  • 109. and because the model IS the experiment the published EXPERIMENT is re-usable!! simply point the same query at your own dataset...
  • 110. The publication is an executable document!
  • 111. Every component of the model Every component of the input data Every component of the output data is a URL Therefore the question, the experiment, and the answer, are immediately published IN the Web
  • 112. Every component of the model Every component of the input data Every component of the output data is a URL The answer, and the knowledge derived from it, is immediately available to search engines and moreover, can affect the outcome of other Web Science experiments
  • 113.
  • 116. An experiment... based on a hypothesis
  • 117. An experiment... based on a hypothesis now modeled in OWL
  • 118. Does this OWL Class represent the Hypothesis? I think it does!
  • 119. We modeled the answer... ...but the answer was hypothetical
  • 120. Change the way we think of “hypotheses”
  • 121. In Web Science 2.0 Model what the world would “look like” if your hypothesis were true Then ask “is there any data that fits that model?”
  • 122. Like the blind men examining an elephant Seemingly different aspects of research when viewed from the perspective of Web Science become the same “thing” The Model
  • 123. Our vision of Web Science 2.0 Hypothesis Query Workflow Ontology Result Materials & Methods These can be automatically derived through provenance information during workflow execution
  • 124.
  • 125. Please join us! SADI and SHARE are Open-Source projects http://sadiframework.org
  • 127. University of British Columbia Luke McCarthy – Lead Dev. Edward Kawas Everything... SADI Service auto-generator Benjamin VanderValk Ian Wood SHARE & SADI & Experimental modeling & Experimental modeling project myHeath Button Soroush Samadian Cardiovascular data modeling and queries
  • 128. C-BRASS Collaborators at other sites U of New Brunswick Carleton University Dr. Chris Baker Dr. Michel Dumontier Alexandre Riazanov Marc-Alexandre Nolin Leonid Chepelev Steve Etlinger Nichaella Kieth Jose Cruz