SlideShare a Scribd company logo
1 of 93
Research… this time it’s Personal! Mark Wilkinson, PI Bioinformatics, Heart + Lung Institute @ St. Paul’s Hospital Vancouver, BC, Canada
DEMO...to give you incentive to listen to the rest of the presentation ;-)
Show me patients with elevated creatinine along with their latest BUN and creatinine levels PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>  PREFIX patients: <http://sadiframework.org/ontologies/patients.owl#>  PREFIX pred: <http://sadiframework.org/ontologies/predicates.owl#>  SELECT ?patient ?bun ?creat FROM <http://sadiframework.org/ontologies/patients.rdf> WHERE { 	?patient rdf:typepatients:ElevatedCreatininePatient . 	?patient pred:latestBUN ?bun .  	?patient pred:latestCreatinine ?creat .  }
VOILA!
There was no database...There was no patient data anywhere annotated as “Elevated Creatinine Patient”
How did I answer a question where the required data didn’t exist? (...no, I didn’t just make it up!  LOL!)
The story begins...
This is going to hurt...
Web Servicesvs.Semantic Web
Web Servicesare not “connected” to the Semantic WebWhy?
Web ServicesXML + XML SchemaSemantic WebRDF + OWL
Web ServicesPOST of SOAP-XMLSemantic WebGET of RDF-XML
Web ServicesNo (rigorous) semanticsSemantic WebRich, flexible semantics
Web Services&Semantic WebFundamentally and deeply different Web technologies!
>1000 X more data in the “Deep Web” than in Web pages
Accessing these databases and analytical algorithms “transparently”, based on an individual researcher’s  ideas, beliefs, and preferenceswill help us personalize medical research
Mark Butler (2003) Is the semantic web hype? Hewlett Packard laboratories presentation at MMU, 2003-03-12
Semantic Web?(my definition) An information system where machines can receive information from one source, re-interpret it, and correctly use it for a purpose that the source had not anticipated.
Re-interpretation Correct re-use Both are critical to the personalization of research
Building a personalized Semantic Web… Step-by-step…
Semantic Automated Discovery and Integrationhttp://sadiframework.org(open source) MicrosoftResearch Founding partner
SADI“best-practices” for Semantic Web Service provision
standards-compliant
Lightweight(only 2 “rules”)
Rules come from observations:
SADI Observation #1: Web Services in Bioinformatics create implicitbiologicalrelationships between their input and output
SADI Observation #1:
SADI Best Practice #1 Make the implicit explicit… A Web Service should create “triples” linking the input data to the output data, thus explicitly describing the semantic relationship between them
SADI Best Practice #1 This is what bioinformatics Web Services implicitly do anyway! Easy to implement this as a best-practice
SADI Observation #2:HTTP GET and POST GET guarantees the response relates to the request URI in a very precise and predictable way POST does not…
SADI Observation #2:GET and POST That’s why Web Services have a fundamentally different behaviour than the Semantic Web
SADI Observation #2:GET and POST We can fix that! (without breaking any existing rules or standards!)
SADI Best Practice #2 SUBJECT URI of the output graph (triples)  is the sameas the SUBJECT URI of the input graph (triples) (the output is “about” the input... Now explicitly!)
Consequence The “Semantics” of our interaction with the Web Service are now explicitandidentical to the “Semantics” of GET
SADI Web Service Interfaces Service Interfaces defined by two OWL classes:
SADI Web Service Interfaces OWL Class #1:  My Input Class
SADI Web Service Interfaces OWL Class #2:  My Output Class
SADI Web Service Interfaces My Service consumes OWL Individuals of Class #1and returns OWL Individuals of Class #2 …but the URI of those two individuals is the same!(see best practice #2)
How do we discover services? Since input and output are about the same “thing”, we can automatically determine what a service doesby comparing the Input and Output OWL classes
How do we discover services? Automatically index services in a registry based on what properties (predicates) Services add to their respective input data
EXAMPLE   Input Data:  BRCA1   rdf:type    Gene ID   Output Data:     BRCA1    hasDNASequence    AGCTTAGCCA…   Registry Index:  Service provides “hasDNASequence” property to Gene IDs
Now we can answer questions like  “what is the DNA sequence of BRCA1?”  Discover a SADI Web Service that generates the DNA Sequence property for gene identifiers
Okay, enough tech gobbledygookWhat will this do for ME?
Demo #1
Imagine there is a “virtual database” containing all of the data from all of the databases,together with the output ofevery conceivable analysis
How do we query that database?
“SHARE”Semantic Health And Research EnvironmentSADI client application
What pathways does UniProt protein P47989 belong to? PREFIX pred: <http://sadiframework.org/ontologies/predicates.owl#> PREFIX ont: <http://ontology.dumontierlab.com/> PREFIX uniprot: <http://lsrn.org/UniProt:> SELECT ?gene ?pathway  WHERE {  	uniprot:P47989 pred:isEncodedBy ?gene .  	?gene ont:isParticipantIn ?pathway .  }
Recapwhat we just saw A standard SPARQL query was entered into SHARE, a SADI-aware query engine
Recapwhat we just saw The query was interpreted to extract the properties being queried and these were passed to SADI for Web Service discovery
Recapwhat we just saw SADIsearched-for, found, and accessed all databases and/or analytical tools capable of generating those properties
Recapwhat we just saw We posed, and answered a complex database query  WITHOUT A DATABASE (in fact, the data didn’t even have to exist...)
Cool!
 …but I’m supposed to be personalizing research… Let’s make this a little more personal by bringing in Ontologies
My Definition of Ontology (for this talk) Ontologies explicitly define the things that exist in “the world” based on what propertieseach kind of thing must have
 Ontology Spectrum Frames (Properties) Thesauri “narrower term” relation Selected Logical 	Constraints (disjointness,  inverse, …)  Catalog/ ID Formal is-a Informal is-a Formal instance General Logical constraints Terms/ glossary Value Restrs.
Demo #2 Discover instances of OWL classes  from data that doesn’t exist…
Show me patients whose creatinine level is increasing over time, along with their latest BUN and creatinine levels. PREFIX regress: <http://sadiframework.org/examples/regression.owl#>  PREFIX patients: <http://sadiframework.org/ontologies/patients.owl#>  PREFIX pred: <http://sadiframework.org/ontologies/predicates.owl#>  SELECT ?patient ?bun ?creat FROM <http://sadiframework.org/ontologies/patients.rdf> WHERE { 	?patient patients:creatinineLevels ?collection .  	?collection regress:hasRegressionModel ?model .  	?model regress:slope ?slope 		FILTER (?slope > 0) . 	?patient pred:latestBUN ?bun .  	?patient pred:latestCreatinine ?creat .  }
Show me patients with elevated creatinine along with their latest BUN and creatinine levels PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>  PREFIX patients: <http://sadiframework.org/ontologies/patients.owl#>  PREFIX pred: <http://sadiframework.org/ontologies/predicates.owl#>  SELECT ?patient ?bun ?creat FROM <http://sadiframework.org/ontologies/patients.rdf> WHERE { 	?patient rdf:typepatients:ElevatedCreatininePatient . 	?patient pred:latestBUN ?bun .  	?patient pred:latestCreatinine ?creat .  }
Start burrowing through the OWL class  find that we need aregression model OWL class
Regression models have features like slopes and intercepts… and so onThe class is completely decomposed until a set of required Services are discoveredcapable of creating all these necessary properties
Successful decomposition of the OWL class to discover  the need for a LinearRegression Web Service, and so on
VOILA!
OWL Class restrictions converted into workflows SPARQL queries converted into workflows Reasoning happening in parallel with query executionData fulfilling OWL models is discovered, or generated through running analytical tools SADI and CardioSHARE
I still don’t seewhy this is“Personal”??
Show me patients whose creatinine level is increasing over time, along with their latest BUN and creatinine SELECT ?patient ?bun ?creat FROM <http://sadiframework.org/ontologies/patients.rdf> WHERE { 	?patient rdf:typepatients:ElevatedCreatininePatient . 	?patient pred:latestBUN ?bun .  	?patient pred:latestCreatinine ?creat .  }
I created a small ontologydescribing my definition ofan Elevated Creatinine Patient
… it was MY ontology!
I can re-use it
I can modify it as I change myworld-view
I can publish it for others to use
Others can modify it to fit THEIRworld-view
My personal world-view is being dynamically resolved againstglobal data and knowledge
…but it’s bigger than that…
“Elevated Creatinine Patient”
I made that up!  It came out of my head!
What’s a fancy word for a world-view that you make-up? Hypothesis
Current Research We believe that ontologies and hypothesesare, in some ways, the same “thing”……simply assertions about individuals that may or may not existFuture SADI client applications will supportdata-driven hypothesis generation and resolution
Recap SADI Semantic Web Services generate triples; the predicates of those triples are indexed... Period. For a given query, determine which properties are available, and which need to be discovered/generated Find services that generate the properties we need
Semantic Web An information system where machines can receive information from one source, re-interpret it, and correctly use it for a purpose that the source had not anticipated. My Purpose!!
What SADI + SHARE supports Re-interpretation We constantly compare the collection of properties, gathered from third-parties worldwide, to whatever world-model (query/ontology) we wish to view it through. MY world model
What SADI + SHARE supports Novel re-use There is no way for the provider to dictate how their data should be used, or how it should be interpreted.  They simply add their properties into the “data cloud” and those properties are used in whatever way is appropriate forME.
And all this because SADI simply requires that the input URI  is the same  as the output URI
Semi-automated  SADI service writing and deployment Taverna Semantically-guided  SADI service discovery and pipelining SADI Plug-ins

More Related Content

Viewers also liked

Juc paris olivier lamy talk
Juc paris olivier lamy talkJuc paris olivier lamy talk
Juc paris olivier lamy talk
Olivier Lamy
 
Sustainability Assembly1
Sustainability Assembly1Sustainability Assembly1
Sustainability Assembly1
Paola Padilla
 
SmartBrief Portfolio
SmartBrief PortfolioSmartBrief Portfolio
SmartBrief Portfolio
SmartBrief
 
Smart brief content marketing trifecta
Smart brief content marketing trifectaSmart brief content marketing trifecta
Smart brief content marketing trifecta
SmartBrief
 
Teamworks Campaign Blueprint
Teamworks Campaign BlueprintTeamworks Campaign Blueprint
Teamworks Campaign Blueprint
Judy Lane
 
T I P S P A R A D I S F R U T A R T U V I D A Mafalda
T I P S  P A R A  D I S F R U T A R  T U  V I D A     MafaldaT I P S  P A R A  D I S F R U T A R  T U  V I D A     Mafalda
T I P S P A R A D I S F R U T A R T U V I D A Mafalda
pipis397
 
Curriculum specification F4
Curriculum specification F4Curriculum specification F4
Curriculum specification F4
hajahrokiah
 
Teamworks Customer Relationship Marketing
Teamworks Customer Relationship MarketingTeamworks Customer Relationship Marketing
Teamworks Customer Relationship Marketing
Judy Lane
 

Viewers also liked (19)

Juc paris olivier lamy talk
Juc paris olivier lamy talkJuc paris olivier lamy talk
Juc paris olivier lamy talk
 
The BioMoby Semantic Annotation Experiment
The BioMoby Semantic Annotation ExperimentThe BioMoby Semantic Annotation Experiment
The BioMoby Semantic Annotation Experiment
 
Web Science, SADI, and the Singularity
Web Science, SADI, and the SingularityWeb Science, SADI, and the Singularity
Web Science, SADI, and the Singularity
 
Sustainability Assembly1
Sustainability Assembly1Sustainability Assembly1
Sustainability Assembly1
 
Test King Virtual Test--Network+
Test King Virtual Test--Network+ Test King Virtual Test--Network+
Test King Virtual Test--Network+
 
The Apache Way olamy
The Apache Way olamyThe Apache Way olamy
The Apache Way olamy
 
SmartBrief Portfolio
SmartBrief PortfolioSmartBrief Portfolio
SmartBrief Portfolio
 
PAPER 2 2010
PAPER 2 2010PAPER 2 2010
PAPER 2 2010
 
Smart brief content marketing trifecta
Smart brief content marketing trifectaSmart brief content marketing trifecta
Smart brief content marketing trifecta
 
Bambu Communication Group Credential
Bambu Communication Group CredentialBambu Communication Group Credential
Bambu Communication Group Credential
 
WPAN
WPANWPAN
WPAN
 
Teamworks Campaign Blueprint
Teamworks Campaign BlueprintTeamworks Campaign Blueprint
Teamworks Campaign Blueprint
 
Opleiden in de School in Rotterdam
Opleiden in de School in RotterdamOpleiden in de School in Rotterdam
Opleiden in de School in Rotterdam
 
Grupo de carlos
Grupo de carlosGrupo de carlos
Grupo de carlos
 
UX Munich2015
UX Munich2015UX Munich2015
UX Munich2015
 
Internet Protocol
Internet ProtocolInternet Protocol
Internet Protocol
 
T I P S P A R A D I S F R U T A R T U V I D A Mafalda
T I P S  P A R A  D I S F R U T A R  T U  V I D A     MafaldaT I P S  P A R A  D I S F R U T A R  T U  V I D A     Mafalda
T I P S P A R A D I S F R U T A R T U V I D A Mafalda
 
Curriculum specification F4
Curriculum specification F4Curriculum specification F4
Curriculum specification F4
 
Teamworks Customer Relationship Marketing
Teamworks Customer Relationship MarketingTeamworks Customer Relationship Marketing
Teamworks Customer Relationship Marketing
 

Similar to Research - this time it's personal

Wilkinson bosc2010 moby-to-sadi
Wilkinson bosc2010 moby-to-sadiWilkinson bosc2010 moby-to-sadi
Wilkinson bosc2010 moby-to-sadi
BOSC 2010
 
BioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataBioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadata
Philip Cheung
 
Services For Science April 2009
Services For Science April 2009Services For Science April 2009
Services For Science April 2009
Ian Foster
 
The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960
mare34
 

Similar to Research - this time it's personal (20)

How SADI & SHARE help restore the Scientific Method to in silico science
How SADI & SHARE help restore the Scientific Method to in silico scienceHow SADI & SHARE help restore the Scientific Method to in silico science
How SADI & SHARE help restore the Scientific Method to in silico science
 
Wilkinson bosc2010 moby-to-sadi
Wilkinson bosc2010 moby-to-sadiWilkinson bosc2010 moby-to-sadi
Wilkinson bosc2010 moby-to-sadi
 
Building SADI Services Tutorial - SIB Workshop, Geneva, December 2015
Building SADI Services Tutorial - SIB Workshop, Geneva, December 2015Building SADI Services Tutorial - SIB Workshop, Geneva, December 2015
Building SADI Services Tutorial - SIB Workshop, Geneva, December 2015
 
Using Semantics to personalize medical research
Using Semantics to personalize medical researchUsing Semantics to personalize medical research
Using Semantics to personalize medical research
 
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
 
The Scientific Method on the Semantic Web
The Scientific Method on the Semantic WebThe Scientific Method on the Semantic Web
The Scientific Method on the Semantic Web
 
CDISC Presentation
CDISC PresentationCDISC Presentation
CDISC Presentation
 
Presentation to the J. Craig Venter Institute, Dec. 2014
Presentation to the J. Craig Venter Institute, Dec. 2014Presentation to the J. Craig Venter Institute, Dec. 2014
Presentation to the J. Craig Venter Institute, Dec. 2014
 
Data Citation Made Easy
Data Citation Made EasyData Citation Made Easy
Data Citation Made Easy
 
Getting to Know Your Data with R
Getting to Know Your Data with RGetting to Know Your Data with R
Getting to Know Your Data with R
 
BioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataBioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadata
 
DNA analysis on your laptop: Spot the differences
DNA analysis on your laptop: Spot the differencesDNA analysis on your laptop: Spot the differences
DNA analysis on your laptop: Spot the differences
 
Improving online chemistry one structure at a time
Improving online chemistry one structure at a timeImproving online chemistry one structure at a time
Improving online chemistry one structure at a time
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomics
 
Services For Science April 2009
Services For Science April 2009Services For Science April 2009
Services For Science April 2009
 
Ontologies Ontop Databases
Ontologies Ontop DatabasesOntologies Ontop Databases
Ontologies Ontop Databases
 
ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...
ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...
ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...
 
The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960
 
The Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsThe Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of Bioinformatics
 
schema.org and biomedical ontologies
schema.org and biomedical ontologies schema.org and biomedical ontologies
schema.org and biomedical ontologies
 

More from Mark Wilkinson

Sample data and other ur ls
Sample data and other ur lsSample data and other ur ls
Sample data and other ur ls
Mark Wilkinson
 

More from Mark Wilkinson (20)

FAIR Metrics - Presentation to NIH KC1
FAIR Metrics - Presentation to NIH KC1FAIR Metrics - Presentation to NIH KC1
FAIR Metrics - Presentation to NIH KC1
 
Introducing the fair evaluator
Introducing the fair evaluatorIntroducing the fair evaluator
Introducing the fair evaluator
 
FAIR Projector Builder
FAIR Projector BuilderFAIR Projector Builder
FAIR Projector Builder
 
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...
 
smartAPIs: EUDAT Semantic Working Group Presentation @ RDA 9th Plenary
smartAPIs:  EUDAT Semantic Working Group Presentation @ RDA 9th PlenarysmartAPIs:  EUDAT Semantic Working Group Presentation @ RDA 9th Plenary
smartAPIs: EUDAT Semantic Working Group Presentation @ RDA 9th Plenary
 
IBC FAIR Data Prototype Implementation slideshow
IBC FAIR Data Prototype Implementation   slideshowIBC FAIR Data Prototype Implementation   slideshow
IBC FAIR Data Prototype Implementation slideshow
 
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...
 
Sample data and other ur ls
Sample data and other ur lsSample data and other ur ls
Sample data and other ur ls
 
Example code for the SADI BMI Calculator Web Service
Example code for the SADI BMI Calculator Web ServiceExample code for the SADI BMI Calculator Web Service
Example code for the SADI BMI Calculator Web Service
 
Sadi service
Sadi serviceSadi service
Sadi service
 
Tutorial - Creating SADI semantic-web-services
Tutorial - Creating SADI semantic-web-servicesTutorial - Creating SADI semantic-web-services
Tutorial - Creating SADI semantic-web-services
 
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
 
Force11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, OxfordForce11 JDDCP workshop presentation, @ Force2015, Oxford
Force11 JDDCP workshop presentation, @ Force2015, Oxford
 
Enhancing Reproducibility and Transparency in Clinical Research through Seman...
Enhancing Reproducibility and Transparency in Clinical Research through Seman...Enhancing Reproducibility and Transparency in Clinical Research through Seman...
Enhancing Reproducibility and Transparency in Clinical Research through Seman...
 
SADI CSHALS 2013
SADI CSHALS 2013SADI CSHALS 2013
SADI CSHALS 2013
 
Web Science 2.0 - in silico science
Web Science 2.0 - in silico scienceWeb Science 2.0 - in silico science
Web Science 2.0 - in silico science
 
Web Science - ISoLA 2012
Web Science - ISoLA 2012Web Science - ISoLA 2012
Web Science - ISoLA 2012
 
Evaluating Hypotheses using SPARQL-DL as an abstract workflow language to cho...
Evaluating Hypotheses using SPARQL-DL as an abstract workflow language to cho...Evaluating Hypotheses using SPARQL-DL as an abstract workflow language to cho...
Evaluating Hypotheses using SPARQL-DL as an abstract workflow language to cho...
 
SWAT4LS 2011: SADI Knowledge Explorer Plug-in
SWAT4LS 2011: SADI Knowledge Explorer Plug-inSWAT4LS 2011: SADI Knowledge Explorer Plug-in
SWAT4LS 2011: SADI Knowledge Explorer Plug-in
 
SADI in Perl - Protege Plugin Tutorial (fixed Aug 24, 2011)
SADI in Perl - Protege Plugin Tutorial (fixed Aug 24, 2011)SADI in Perl - Protege Plugin Tutorial (fixed Aug 24, 2011)
SADI in Perl - Protege Plugin Tutorial (fixed Aug 24, 2011)
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 

Research - this time it's personal

  • 1. Research… this time it’s Personal! Mark Wilkinson, PI Bioinformatics, Heart + Lung Institute @ St. Paul’s Hospital Vancouver, BC, Canada
  • 2. DEMO...to give you incentive to listen to the rest of the presentation ;-)
  • 3. Show me patients with elevated creatinine along with their latest BUN and creatinine levels PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX patients: <http://sadiframework.org/ontologies/patients.owl#> PREFIX pred: <http://sadiframework.org/ontologies/predicates.owl#> SELECT ?patient ?bun ?creat FROM <http://sadiframework.org/ontologies/patients.rdf> WHERE { ?patient rdf:typepatients:ElevatedCreatininePatient . ?patient pred:latestBUN ?bun . ?patient pred:latestCreatinine ?creat . }
  • 5. There was no database...There was no patient data anywhere annotated as “Elevated Creatinine Patient”
  • 6. How did I answer a question where the required data didn’t exist? (...no, I didn’t just make it up! LOL!)
  • 8. This is going to hurt...
  • 10. Web Servicesare not “connected” to the Semantic WebWhy?
  • 11. Web ServicesXML + XML SchemaSemantic WebRDF + OWL
  • 12. Web ServicesPOST of SOAP-XMLSemantic WebGET of RDF-XML
  • 13. Web ServicesNo (rigorous) semanticsSemantic WebRich, flexible semantics
  • 14. Web Services&Semantic WebFundamentally and deeply different Web technologies!
  • 15.
  • 16. >1000 X more data in the “Deep Web” than in Web pages
  • 17. Accessing these databases and analytical algorithms “transparently”, based on an individual researcher’s ideas, beliefs, and preferenceswill help us personalize medical research
  • 18. Mark Butler (2003) Is the semantic web hype? Hewlett Packard laboratories presentation at MMU, 2003-03-12
  • 19. Semantic Web?(my definition) An information system where machines can receive information from one source, re-interpret it, and correctly use it for a purpose that the source had not anticipated.
  • 20. Re-interpretation Correct re-use Both are critical to the personalization of research
  • 21. Building a personalized Semantic Web… Step-by-step…
  • 22. Semantic Automated Discovery and Integrationhttp://sadiframework.org(open source) MicrosoftResearch Founding partner
  • 23. SADI“best-practices” for Semantic Web Service provision
  • 26. Rules come from observations:
  • 27. SADI Observation #1: Web Services in Bioinformatics create implicitbiologicalrelationships between their input and output
  • 29. SADI Best Practice #1 Make the implicit explicit… A Web Service should create “triples” linking the input data to the output data, thus explicitly describing the semantic relationship between them
  • 30. SADI Best Practice #1 This is what bioinformatics Web Services implicitly do anyway! Easy to implement this as a best-practice
  • 31. SADI Observation #2:HTTP GET and POST GET guarantees the response relates to the request URI in a very precise and predictable way POST does not…
  • 32. SADI Observation #2:GET and POST That’s why Web Services have a fundamentally different behaviour than the Semantic Web
  • 33. SADI Observation #2:GET and POST We can fix that! (without breaking any existing rules or standards!)
  • 34. SADI Best Practice #2 SUBJECT URI of the output graph (triples) is the sameas the SUBJECT URI of the input graph (triples) (the output is “about” the input... Now explicitly!)
  • 35. Consequence The “Semantics” of our interaction with the Web Service are now explicitandidentical to the “Semantics” of GET
  • 36. SADI Web Service Interfaces Service Interfaces defined by two OWL classes:
  • 37. SADI Web Service Interfaces OWL Class #1: My Input Class
  • 38. SADI Web Service Interfaces OWL Class #2: My Output Class
  • 39. SADI Web Service Interfaces My Service consumes OWL Individuals of Class #1and returns OWL Individuals of Class #2 …but the URI of those two individuals is the same!(see best practice #2)
  • 40. How do we discover services? Since input and output are about the same “thing”, we can automatically determine what a service doesby comparing the Input and Output OWL classes
  • 41. How do we discover services? Automatically index services in a registry based on what properties (predicates) Services add to their respective input data
  • 42. EXAMPLE Input Data: BRCA1 rdf:type Gene ID Output Data: BRCA1 hasDNASequence AGCTTAGCCA… Registry Index: Service provides “hasDNASequence” property to Gene IDs
  • 43. Now we can answer questions like “what is the DNA sequence of BRCA1?”  Discover a SADI Web Service that generates the DNA Sequence property for gene identifiers
  • 44. Okay, enough tech gobbledygookWhat will this do for ME?
  • 46. Imagine there is a “virtual database” containing all of the data from all of the databases,together with the output ofevery conceivable analysis
  • 47. How do we query that database?
  • 48. “SHARE”Semantic Health And Research EnvironmentSADI client application
  • 49.
  • 50. What pathways does UniProt protein P47989 belong to? PREFIX pred: <http://sadiframework.org/ontologies/predicates.owl#> PREFIX ont: <http://ontology.dumontierlab.com/> PREFIX uniprot: <http://lsrn.org/UniProt:> SELECT ?gene ?pathway WHERE { uniprot:P47989 pred:isEncodedBy ?gene . ?gene ont:isParticipantIn ?pathway . }
  • 51.
  • 52.
  • 53.
  • 54. Recapwhat we just saw A standard SPARQL query was entered into SHARE, a SADI-aware query engine
  • 55. Recapwhat we just saw The query was interpreted to extract the properties being queried and these were passed to SADI for Web Service discovery
  • 56. Recapwhat we just saw SADIsearched-for, found, and accessed all databases and/or analytical tools capable of generating those properties
  • 57. Recapwhat we just saw We posed, and answered a complex database query WITHOUT A DATABASE (in fact, the data didn’t even have to exist...)
  • 58. Cool!
  • 59. …but I’m supposed to be personalizing research… Let’s make this a little more personal by bringing in Ontologies
  • 60. My Definition of Ontology (for this talk) Ontologies explicitly define the things that exist in “the world” based on what propertieseach kind of thing must have
  • 61. Ontology Spectrum Frames (Properties) Thesauri “narrower term” relation Selected Logical Constraints (disjointness, inverse, …) Catalog/ ID Formal is-a Informal is-a Formal instance General Logical constraints Terms/ glossary Value Restrs.
  • 62. Demo #2 Discover instances of OWL classes from data that doesn’t exist…
  • 63. Show me patients whose creatinine level is increasing over time, along with their latest BUN and creatinine levels. PREFIX regress: <http://sadiframework.org/examples/regression.owl#> PREFIX patients: <http://sadiframework.org/ontologies/patients.owl#> PREFIX pred: <http://sadiframework.org/ontologies/predicates.owl#> SELECT ?patient ?bun ?creat FROM <http://sadiframework.org/ontologies/patients.rdf> WHERE { ?patient patients:creatinineLevels ?collection . ?collection regress:hasRegressionModel ?model . ?model regress:slope ?slope FILTER (?slope > 0) . ?patient pred:latestBUN ?bun . ?patient pred:latestCreatinine ?creat . }
  • 64.
  • 65.
  • 66.
  • 67. Show me patients with elevated creatinine along with their latest BUN and creatinine levels PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX patients: <http://sadiframework.org/ontologies/patients.owl#> PREFIX pred: <http://sadiframework.org/ontologies/predicates.owl#> SELECT ?patient ?bun ?creat FROM <http://sadiframework.org/ontologies/patients.rdf> WHERE { ?patient rdf:typepatients:ElevatedCreatininePatient . ?patient pred:latestBUN ?bun . ?patient pred:latestCreatinine ?creat . }
  • 68. Start burrowing through the OWL class  find that we need aregression model OWL class
  • 69. Regression models have features like slopes and intercepts… and so onThe class is completely decomposed until a set of required Services are discoveredcapable of creating all these necessary properties
  • 70. Successful decomposition of the OWL class to discover the need for a LinearRegression Web Service, and so on
  • 72. OWL Class restrictions converted into workflows SPARQL queries converted into workflows Reasoning happening in parallel with query executionData fulfilling OWL models is discovered, or generated through running analytical tools SADI and CardioSHARE
  • 73. I still don’t seewhy this is“Personal”??
  • 74. Show me patients whose creatinine level is increasing over time, along with their latest BUN and creatinine SELECT ?patient ?bun ?creat FROM <http://sadiframework.org/ontologies/patients.rdf> WHERE { ?patient rdf:typepatients:ElevatedCreatininePatient . ?patient pred:latestBUN ?bun . ?patient pred:latestCreatinine ?creat . }
  • 75. I created a small ontologydescribing my definition ofan Elevated Creatinine Patient
  • 76. … it was MY ontology!
  • 78. I can modify it as I change myworld-view
  • 79. I can publish it for others to use
  • 80. Others can modify it to fit THEIRworld-view
  • 81.
  • 82. My personal world-view is being dynamically resolved againstglobal data and knowledge
  • 83. …but it’s bigger than that…
  • 85. I made that up! It came out of my head!
  • 86. What’s a fancy word for a world-view that you make-up? Hypothesis
  • 87. Current Research We believe that ontologies and hypothesesare, in some ways, the same “thing”……simply assertions about individuals that may or may not existFuture SADI client applications will supportdata-driven hypothesis generation and resolution
  • 88. Recap SADI Semantic Web Services generate triples; the predicates of those triples are indexed... Period. For a given query, determine which properties are available, and which need to be discovered/generated Find services that generate the properties we need
  • 89. Semantic Web An information system where machines can receive information from one source, re-interpret it, and correctly use it for a purpose that the source had not anticipated. My Purpose!!
  • 90. What SADI + SHARE supports Re-interpretation We constantly compare the collection of properties, gathered from third-parties worldwide, to whatever world-model (query/ontology) we wish to view it through. MY world model
  • 91. What SADI + SHARE supports Novel re-use There is no way for the provider to dictate how their data should be used, or how it should be interpreted. They simply add their properties into the “data cloud” and those properties are used in whatever way is appropriate forME.
  • 92. And all this because SADI simply requires that the input URI is the same as the output URI
  • 93. Semi-automated SADI service writing and deployment Taverna Semantically-guided SADI service discovery and pipelining SADI Plug-ins
  • 94. Simple and Open WINS! Join us! We have recently received funding from CANARIEto assist and train service providersin deploying their own SADI Semantic Web Services Come join us – we’re having a lot of fun!! http://sadiframework.org http://twitter.com/sadiframework
  • 95. Credits Benjamin VanderValk (SADI & CardioSHARE) Luke McCarthy (SADI & CardioSHARE) SoroushSamadian (CardioSHARE) Microsoft Research Fin This presentation available on SlideShare: keywords ‘wilkinson’ ‘BioIT-2010’
  • 96. Credits Benjamin VanderValk(SADI & CardioSHARE) Luke McCarthy (SADI & CardioSHARE) SoroushSamadian(CardioSHARE)