SlideShare ist ein Scribd-Unternehmen logo
1 von 62
Downloaden Sie, um offline zu lesen
Powering Scientific Discovery with the Semantic Web
1
Michel Dumontier, Ph.D.
Associate Professor of Medicine (Biomedical Informatics)
Stanford University
@micheldumontier::VanBUG:13-03-2014
2 @micheldumontier::VanBUG:13-03-2014HTTP://XKCD.COM/242/
Science
@micheldumontier::VanBUG:13-03-20143
We use the biomedical literature to gather evidence
4 @micheldumontier::VanBUG:13-03-2014
we query literature-curated databases
(problems: access, format, identifiers & linking)
5 @micheldumontier::VanBUG:13-03-2014
we use software to analyze, predict and evaluate
(problems: OS, versioning, input/output formats)
6 @micheldumontier::VanBUG:13-03-2014
we develop fairly sophisticated workflows to
gather support for our hypotheses
7 @micheldumontier::VanBUG:13-03-2014
Wouldn’t it be great if we could just find the evidence
required to support or dispute a scientific hypothesis using
the most up-to-date and relevant data, tools and scientific
knowledge?
8 @micheldumontier::VanBUG:13-03-2014
madness!
1. Build a massive network of interconnected data
and software using web standards
– Bio2RDF + SADI
2. Generate and test scientific hypotheses
– prepare data (tactical formalization)
– discover associations (ontology-based enrichment)
– gather evidence (HyQue)
3. Contribute back to the global knowledge graph
@micheldumontier::VanBUG:13-03-20149
The Semantic Web
is the new global web of knowledge
10 @micheldumontier::VanBUG:13-03-2014
standards for publishing, sharing and querying
facts, expert knowledge and services
scalable approach for the discovery
of independently formulated
and distributed knowledge
Resource Description Framework
• It’s a language to represent knowledge
– Logic-based formalism -> automated reasoning
– graph-like properties -> data analysis
• Good for
– Describing in terms of type, attributes, relations
– Integrating data from different sources
– Sharing the data (W3C standard)
– Reusing what is available, developing what you need,
and contributing back to the web of data.
@micheldumontier::VanBUG:13-03-201411
we’re building an incredible network of linked data
12 @micheldumontier::VanBUG:13-03-2014Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch
@micheldumontier::VanBUG:13-03-2014
Linked Data for the Life Sciences
13
Bio2RDF is an open source project to unify the
representation and interlinking of biological data using RDF.
chemicals/drugs/formulations,
genomes/genes/proteins, domains
Interactions, complexes & pathways
animal models and phenotypes
Disease, genetic markers, treatments
Terminologies & publications
• Release 2 (Jan 2013): 1B+ interlinked statements
from 19 high value datasets
• dataset description, provenance & statistics
• Partnerships with EBI, NCBI, DBCLS, NCBO,
OpenPHACTS, and commercial tool providers
@micheldumontier::VanBUG:13-03-2014
drugbank:DB00586
drugbank_vocabulary:Drug
rdf:type
drugbank_target:290
drugbank_vocabulary:Target
rdf:type
drugbank_vocabulary:targets
rdfs:label
Prostaglandin G/H synthase 2
[drugbank_target:290]
rdfs:label
Diclofenac [drugbank:DB00586]
14
The linked data network expands
with every reference
@micheldumontier::VanBUG:13-03-2014
drugbank:DB00586
pharmgkb_vocabulary:Drug
rdf:type
rdfs:label
diclofenac [drugbank:DB00586]
pharmgkb:PA449293
drugbank_vocabulary:Drug
pharmgkb_vocabulary:xref
diclofenac [pharmgkb:PA449293]
rdfs:label
DrugBank
PharmGKB
15
@micheldumontier::VanBUG:13-03-201416
@micheldumontier::VanBUG:13-03-201417
@micheldumontier::VanBUG:13-03-201418
Bio2RDF R2 connectivity
@micheldumontier::VanBUG:13-03-201419
Graph summaries in query formulation
@micheldumontier::VanBUG:13-03-2014
PREFIX drugbank_vocabulary: <http://bio2rdf.org/drugbank_vocabulary:>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
SELECT ?ddi ?d1name ?d2name
WHERE {
?ddi a drugbank_vocabulary:Drug-Drug-Interaction .
?d1 drugbank_vocabulary:ddi-interactor-in ?ddi .
?d1 rdfs:label ?d1name .
?d2 drugbank_vocabulary:ddi-interactor-in ?ddi .
?d2 rdfs:label ?d2name.
FILTER (?d1 != ?d2)
}
20
You can use the SPARQLed query
assistant with updated endpoints
@micheldumontier::VanBUG:13-03-2014
http://sindicetech.com/sindice-suite/sparqled/
graph: http://sindicetech.com/analytics21
Use virtuoso’s built in faceted
browser to construct increasingly
complex queries with little effort
@micheldumontier::VanBUG:13-03-201422
Federated Queries over Independent
SPARQL EndPoints
Get all protein catabolic processes (and more specific) in biomodels
SELECT ?go ?label count(distinct ?x)
WHERE {
service <http://bioportal.bio2rdf.org/sparql> {
?go rdfs:label ?label .
?go rdfs:subClassOf ?tgo
?tgo rdfs:label ?tlabel .
FILTER regex(?tlabel, "^protein catabolic process")
}
service <http://biomodels.bio2rdf.org/sparql> {
?x <http://bio2rdf.org/biopax_vocabulary:identical-to> ?go .
?x a <http://www.biopax.org/release/biopax-level3.owl#BiochemicalReaction> .
}
}
@micheldumontier::VanBUG:13-03-201423
24 @micheldumontier::VanBUG:13-03-2014
Bio2RDF: 1M+ SPARQL queries per month
UniProt: 6.4M queries per month
EBI: 3.5M queries (Oct 2013)
Despite all the data, it’s still hard to find answers to questions
Because there are many ways to represent the same data
and each dataset represents it differently
@micheldumontier::VanBUG:13-03-201425
multiple formalizations of the same kind of
data do emerge, each with their own merit
@micheldumontier::VanBUG:13-03-201426
Massive Proliferation of Ontologies / Vocabularies
@micheldumontier::VanBUG:13-03-201427
Multi-Stakeholder Efforts to Standardize
Representations are Reasonable,
Long Term Strategies for Data Integration
@micheldumontier::VanBUG:13-03-201428
uniprot:P05067
uniprot:Protein
is a
sio:gene
is a is a
Semantic data integration, consistency checking and
query answering over Bio2RDF with the
Semanticscience Integrated Ontology (SIO)
dataset
ontology
Knowledge Base
@micheldumontier::VanBUG:13-03-2014
pharmgkb:PA30917
refseq:Protein
is a
is a
omim:189931
omim:Gene pharmgkb:Gene
Querying Bio2RDF Linked Open Data with a Global Schema. Alison Callahan, José Cruz-Toledo and
Michel Dumontier. Bio-ontologies 2012.
29
@micheldumontier::VanBUG:13-03-201430
SRIQ(D)
10700+ axioms
1300+ classes
201 object properties (inc. inverses)
1 datatype property
Bio2RDF and SIO powered SPARQL 1.1 federated query:
Find chemicals (from CTD) and proteins (from SGD) that
participate in the same process (from GOA)
SELECT ?chem, ?prot, ?proc
FROM <http://bio2rdf.org/ctd>
WHERE {
SERVICE <http://ctd.bio2rdf.org/sparql> {
?chemical a sio:chemical-entity.
?chemical rdfs:label ?chem.
?chemical sio:is-participant-in ?process.
?process rdfs:label ?proc.
FILTER regex (?process, "http://bio2rdf.org/go:")
}
SERVICE <http://sgd.bio2rdf.org/sparql> {
?protein a sio:protein .
?protein sio:is-participant-in ?process.
?protein rdfs:label ?prot .
}
} @micheldumontier::VanBUG:13-03-201431
tactical formalization
@micheldumontier::VanBUG:13-03-201432
Take what you need
and represent it in a way that directly serves your objective
STANDARDSUSER DRIVEN REPRESENTATION
identifying aberrant and pharmacological pathways
predicting drug targets using organism phenotypes
Biopax-pathway exploration
SPARQL-powered genome navigation
aberrant and pharmacological pathways
Q1. Can we identify pathways that are associated
with a particular disease or class of diseases?
Q2. Can we identify pathways are associated with
a particular drug or class of drugs?
drug
pathway
disease
@micheldumontier::VanBUG:13-03-201433
Identification of
drug and disease enriched pathways
• Approach
– Integrate 3 datasets
• DrugBank, PharmGKB and CTD
– Integrate 7 terminologies
• MeSH, ATC, ChEBI, UMLS, SNOMED, ICD, DO
– Formalize selected elements
– Identify significant associations using enrichment
analysis over the fully inferred knowledge base
Identifying aberrant pathways through integrated analysis of knowledge in pharmacogenomics.
Bioinformatics. 2012. @micheldumontier::VanBUG:13-03-201434
Formal knowledge
representation
as a strategy for
data integration
@micheldumontier::VanBUG:13-03-201435
Have you heard of OWL?
@micheldumontier::VanBUG:13-03-201436
mercaptopurine
[pharmgkb:PA450379]
mercaptopurine
[drugbank:DB01033]
purine-6-thiol
[CHEBI:2208]
Class Equivalence
mercaptopurine
[ATC:L01BB02]
Top Level Classes
(disjointness)
drug diseasegenepathway
property chains
Class subsumption
mercaptopurine
[mesh:D015122]
Reciprocal
Existentials
drug disease
pathway gene
Formalized as an OWL-EL ontology
650,000+ classes, 3.2M subClassOf axioms, 75,000
equivalentClass axioms
@micheldumontier::VanBUG:13-03-201437
Benefits: Enhanced Query Capability
– Use any mapped terminology to query a target resource.
– Use knowledge in target ontologies to formulate more
precise questions
• ask for drugs that are associated with diseases of the joint:
‘Chikungunya’ (do:0050012) is defined as a viral infectious disease
located in the ‘joint’ (fma:7490) and caused by a ‘Chikungunya
virus’ (taxon:37124).
– Learn relationships that are inferred by automated
reasoning.
• alcohol (ChEBI:30879) is associated with alcoholism (PA443309)
since alcoholism is directly associated with ethanol (CHEBI:16236)
• ‘parasitic infectious disease’ (do:0001398) associated with 129
drugs, 15 more than are directly linked.
@micheldumontier::VanBUG:13-03-201438
Knowledge Discovery through Data
Integration and Enrichment Analysis
• OntoFunc: Tool to discover significant associations between sets of objects
and ontology categories. enrichment of attribute among a selected set of
input items as compared to a reference set. hypergeometric or the
binomial distribution, Fisher's exact test, or a chi-square test.
• We found 22,653 disease-pathway associations, where for each pathway
we find genes that are linked to disease.
– Mood disorder (do:3324) associated with Zidovudine Pathway
(pharmgkb:PA165859361). Zidovudine is for treating HIV/AIDS. Side
effects include fatigue, headache, myalgia, malaise and anorexia
• We found 13,826 pathway-chemical associations
– Clopidogrel (chebi:37941) associated with Endothelin signaling
pathway (pharmgkb:PA164728163). Endothelins are proteins that
constrict blood vessels and raise blood pressure. Clopidogrel inhibits
platelet aggregation and prolongs bleeding time.
@micheldumontier::VanBUG:13-03-201439
Tactical Formalization + Automated Reasoning
Offers Compelling Value for Certain Problems
We need to be smart about the goal, and how best to
achieve it. Tactical formalization is another tool in the
toolbox.
We’ve formalized data as OWL ontologies to verify, fix and
exploit Linked Data through expressive OWL reasoning
• To identify mistakes in human curated knowledge
• To identify conflicting meaning in terms
• To identify mistakes in the representation of RDF data
o incorrect use of relations
o incorrect assertion of identity (owl:sameAs)
Many other applications can be envisioned.
@micheldumontier::VanBUG:13-03-201440
PhenomeDrug
A computational approach to predict drug
targets, drug effects, and drug indications using
phenotypes
@micheldumontier::VanBUG:13-03-201441
Mouse model phenotypes provide information about human drug targets.
Hoehndorf R, Hiebert T, Hardy NW, Schofield PN, Gkoutos GV, Dumontier M.
Bioinformatics. 2013.
animal models provide insight for on target effects
• In the majority of 100 best selling drugs ($148B in US
alone), there is a direct correlation between knockout
phenotype and drug effect
• Gastroesophageal reflux
– Proton pump inhibitors (e.g Prilosec - $5B)
– KO of alpha or beta unit of H/K ATPase are achlorhydric (normal
pH) in stomach compared to wild type (pH 3-5) when exposed
to acidic solution
• Immunological Indications
– Anti-histamines (Claritin, Allegra, Zyrtec)
– KO of histamine H1 receptor leads to decreased responsiveness
of immune system
– Predicts on target effects : drowsiness, reduced anxiety
@micheldumontier::VanBUG:13-03-201442
Zambrowicz and Sands. Nat Rev Drug Disc. 2003.
Identifying drug targets
from mouse knock-out phenotypes
@micheldumontier::VanBUG:13-03-201443
drug
gene
phenotypes effects
human gene
non-functional
gene model
ortholog
similar
inhibits
Main idea: if a drug’s phenotypes matches the phenotypes of a
null model, this suggests that the drug is an inhibitor of the gene
Terminological Interoperability
(we must compare apples with apples)
Mouse
Phenotypes
Drug effects
(mappings from UMLS to DO, NBO, MP)
Mammalian
Phenotype
OntologyPhenomeNet
PhenomeDrug
@micheldumontier::VanBUG:13-03-2014
Semantic Similarity
@micheldumontier::VanBUG:13-03-201445
Given a drug effect profile D and a mouse model M, we
compute the semantic similarity as an information weighted
Jaccard metric.
The similarity measure used is non-symmetrical and
determines the amount of information about a drug effect
profile D that is covered by a set of mouse model
phenotypes M.
Loss of function models predict
targets of inhibitor drugs
• 14,682 drugs; 7,255 mouse genotypes
• Validation against known and predicted inhibitor-target pairs
– 0.76 ROC AUC for human targets (DrugBank)
– 0.81 ROC AUC for mouse targets (STITCH)
• diclofenac (STITCH:000003032)
– NSAID used to treat pain, osteoarthritis and rheumatoid arthritis
– Drug effects include liver inflammation (hepatitis), swelling of liver
(hepatomegaly), redness of skin (erythema)
– 49% explained by PPARg knockout
• peroxisome proliferator activated receptor gamma (PPARg) regulates metabolism,
proliferation, inflammation and differentiation,
• Diclofenac is a known inhibitor
– 46% explained by COX-2 knockout
• Diclofenac is a known inhibitor
@micheldumontier::VanBUG:13-03-2014
Using the Semantic Web to Gather
Evidence for Scientific Hypotheses
What evidence supports or disputes that TKIs are cardiotoxic?
@micheldumontier::VanBUG:13-03-201447
• Tyrosine Kinase Inhibitors (TKI)
– Imatinib, Sorafenib, Sunitinib, Dasatinib, Nilotinib, Lapatinib
– Used to treat cancer
– Linked to cardiotoxicity.
• FDA launched drug safety program to detect toxicity
– Need to integrate data and ontologies (Abernethy, CPT 2011)
– Abernethy (2013) suggest using public data in genetics,
pharmacology, toxicology, systems biology, to
predict/validate adverse events
• What evidence could we gather to give credence that
TKI’s causes non-QT cardiotoxicity?
@micheldumontier::VanBUG:13-03-201448
FDA Use Case:
TKI non-QT Cardiotoxicity
@micheldumontier::VanBUG:13-03-201449
Jane P.F. Bai and Darrell R. Abernethy. Systems Pharmacology to Predict Drug Toxicity: Integration Across Levels of
Biological Organization. Annu. Rev. Pharmacol. Toxicol. 2013.53:451-473
• The goal of HyQue is retrieve and evaluate evidence
that supports/disputes a hypothesis
– hypotheses are described as a set of events
• e.g. binding, inhibition, phenotypic effect
– events are associated with types of evidence
• a query is written to retrieve data
• a weight is assigned to provide significance
• Hypotheses are written by people who seek answers
• data retrieval rules are written by people who know the
data and how it should be interpreted 
@micheldumontier::VanBUG:13-03-201450
HyQue
1. HyQue: Evaluating hypotheses using Semantic Web technologies. J Biomed Semantics. 2011 May 17;2 Suppl 2:S3.
2. Evaluating scientific hypotheses using the SPARQL Inferencing Notation. Extended Semantic Web Conference (ESWC
2012). Heraklion, Crete. May 27-31, 2012.
HyQue: A Semantic Web Application
@micheldumontier::VanBUG:1
51
Software
OntologiesData
Hypothesis Evaluation
What evidence might we gather?
• clinical: Are there cardiotoxic effects associated with the drug?
– Literature (studies) [curated db]
– Product labels (studies) [r3:sider]
– Clinical trials (studies) [r3:clinicaltrials]
– Adverse event reports [r2:pharmgkb/onesides]
– Electronic health records (observations)
• pre-clinical associations:
– genotype-phenotype (null/disease models) [r2:mgi, r2:sgd; r3:wormbase]
– in vitro assays (IC50) [r3:chembl]
– drug targets [r2:drugbank; r2:ctd; r3:stitch]
– drug-gene expression [r3:gxa]
– pathways [r2:kegg; r3:reactome]
– Drug-pathway, disease-pathway enrichments [aberrant pathways]
– Chemical properties [r2:pubchem; r2.drugbank]
– Toxicology [r1.toxkb/cebs]
@micheldumontier::VanBUG:13-03-201452
Data retrieval is done with SPARQL
@micheldumontier::VanBUG:13-03-201453
Data Evaluation is done with SPIN
rules
@micheldumontier::VanBUG:13-03-201454
@micheldumontier::VanBUG:13-03-201455
@micheldumontier::VanBUG:13-03-201456
http://bio2rdf.org/drugbank:DB01268
@micheldumontier::VanBUG:13-03-201457
@micheldumontier::VanBUG:13-03-201458
@micheldumontier::VanBUG:13-03-201459
In Summary
• This talk was about making sense of the
structured data we already have
• RDF-based Linked Open Data acts as a substrate
for query answering and task-based formalization
in OWL
• Discovery through the generation of testable
hypotheses in the target domain.
• Using Linked Data to gather evidence regarding
scientific hypotheses
• The big challenge lies in establishing discovery
pipelines with user-generated scientific data.
@micheldumontier::VanBUG:13-03-201460
Acknowledgements
Bio2RDF Release 2:
Allison Callahan, Jose Cruz-Toledo, Peter Ansell
Aberrant Pathways: Robert Hoehndorf, Georgios Gkoutos
PhenomeDrug: Tanya Hiebert, Robert Hoehndorf, Georgios
Gkoutos, Paul Schofield
TKI Cardiotoxicity: Alison Callahan, Tania Hiebert, Beatriz
Lujan, Sira Sarntivijai (FDA)
@micheldumontier::VanBUG:13-03-201461
dumontierlab.com
michel.dumontier@stanford.edu
Website: http://dumontierlab.com
Presentations: http://slideshare.com/micheldumontier
62 @micheldumontier::VanBUG:13-03-2014

Weitere ähnliche Inhalte

Was ist angesagt?

Howe et al. - 2015 - BioAssay Research Database (BARD) chemical biolog
Howe et al. - 2015 - BioAssay Research Database (BARD) chemical biologHowe et al. - 2015 - BioAssay Research Database (BARD) chemical biolog
Howe et al. - 2015 - BioAssay Research Database (BARD) chemical biologEleanor Howe
 
Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked DataMichel Dumontier
 
Natalya Csatari_Library Database and Website Search
Natalya Csatari_Library Database and Website SearchNatalya Csatari_Library Database and Website Search
Natalya Csatari_Library Database and Website SearchNatalya Csatari, RN, BSN,CM
 
Open Access and Property Rights on a Collision Course with Scholars
Open Access and Property Rights on a Collision Course with ScholarsOpen Access and Property Rights on a Collision Course with Scholars
Open Access and Property Rights on a Collision Course with ScholarsKimberly Yang
 
PubChem and Its Applications for Drug Discovery
PubChem and Its Applications for Drug DiscoveryPubChem and Its Applications for Drug Discovery
PubChem and Its Applications for Drug DiscoverySunghwan Kim
 
PubChem: a public chemical information resource for big data chemistry
PubChem: a public chemical information resource for big data chemistryPubChem: a public chemical information resource for big data chemistry
PubChem: a public chemical information resource for big data chemistrySunghwan Kim
 
PubChem as a resource for chemical information training
PubChem as a resource for chemical information trainingPubChem as a resource for chemical information training
PubChem as a resource for chemical information trainingSunghwan Kim
 
Toxicological information in PubChem
Toxicological information in PubChemToxicological information in PubChem
Toxicological information in PubChemSunghwan Kim
 
PubChem for chemical information literacy training
PubChem for chemical information literacy trainingPubChem for chemical information literacy training
PubChem for chemical information literacy trainingSunghwan Kim
 
Mashing Up Drug Discovery
Mashing Up Drug DiscoveryMashing Up Drug Discovery
Mashing Up Drug DiscoverySciBite Limited
 
Searching for chemical information using PubChem
Searching for chemical information using PubChemSearching for chemical information using PubChem
Searching for chemical information using PubChemSunghwan Kim
 
Guide to Malaria Pharmacology, GEMM 2019
Guide to Malaria Pharmacology, GEMM 2019Guide to Malaria Pharmacology, GEMM 2019
Guide to Malaria Pharmacology, GEMM 2019Guide to PHARMACOLOGY
 
GuideToImmunopharmacology_SIF_Nov2019
GuideToImmunopharmacology_SIF_Nov2019GuideToImmunopharmacology_SIF_Nov2019
GuideToImmunopharmacology_SIF_Nov2019Guide to PHARMACOLOGY
 
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019GtoPDB_ELIXIR_UK_AllHands_update_Dec2019
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019Guide to PHARMACOLOGY
 
PubChem for drug discovery in the age of big data and artificial intelligence
PubChem for drug discovery in the age of big data and artificial intelligencePubChem for drug discovery in the age of big data and artificial intelligence
PubChem for drug discovery in the age of big data and artificial intelligenceSunghwan Kim
 
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...Frederik van den Broek
 
dkNET Poster Experimental Biology 2019
dkNET Poster Experimental Biology 2019dkNET Poster Experimental Biology 2019
dkNET Poster Experimental Biology 2019dkNET
 
DisGeNET: a discovery platform to support translational research and drug dis...
DisGeNET: a discovery platform to support translational research and drug dis...DisGeNET: a discovery platform to support translational research and drug dis...
DisGeNET: a discovery platform to support translational research and drug dis...Núria Queralt Rosinach
 

Was ist angesagt? (20)

Howe et al. - 2015 - BioAssay Research Database (BARD) chemical biolog
Howe et al. - 2015 - BioAssay Research Database (BARD) chemical biologHowe et al. - 2015 - BioAssay Research Database (BARD) chemical biolog
Howe et al. - 2015 - BioAssay Research Database (BARD) chemical biolog
 
Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked Data
 
MLA CE Course: Third-Party PubMed Tools
MLA CE Course: Third-Party PubMed ToolsMLA CE Course: Third-Party PubMed Tools
MLA CE Course: Third-Party PubMed Tools
 
Third-Party PubMed Tools
Third-Party PubMed ToolsThird-Party PubMed Tools
Third-Party PubMed Tools
 
Natalya Csatari_Library Database and Website Search
Natalya Csatari_Library Database and Website SearchNatalya Csatari_Library Database and Website Search
Natalya Csatari_Library Database and Website Search
 
Open Access and Property Rights on a Collision Course with Scholars
Open Access and Property Rights on a Collision Course with ScholarsOpen Access and Property Rights on a Collision Course with Scholars
Open Access and Property Rights on a Collision Course with Scholars
 
PubChem and Its Applications for Drug Discovery
PubChem and Its Applications for Drug DiscoveryPubChem and Its Applications for Drug Discovery
PubChem and Its Applications for Drug Discovery
 
PubChem: a public chemical information resource for big data chemistry
PubChem: a public chemical information resource for big data chemistryPubChem: a public chemical information resource for big data chemistry
PubChem: a public chemical information resource for big data chemistry
 
PubChem as a resource for chemical information training
PubChem as a resource for chemical information trainingPubChem as a resource for chemical information training
PubChem as a resource for chemical information training
 
Toxicological information in PubChem
Toxicological information in PubChemToxicological information in PubChem
Toxicological information in PubChem
 
PubChem for chemical information literacy training
PubChem for chemical information literacy trainingPubChem for chemical information literacy training
PubChem for chemical information literacy training
 
Mashing Up Drug Discovery
Mashing Up Drug DiscoveryMashing Up Drug Discovery
Mashing Up Drug Discovery
 
Searching for chemical information using PubChem
Searching for chemical information using PubChemSearching for chemical information using PubChem
Searching for chemical information using PubChem
 
Guide to Malaria Pharmacology, GEMM 2019
Guide to Malaria Pharmacology, GEMM 2019Guide to Malaria Pharmacology, GEMM 2019
Guide to Malaria Pharmacology, GEMM 2019
 
GuideToImmunopharmacology_SIF_Nov2019
GuideToImmunopharmacology_SIF_Nov2019GuideToImmunopharmacology_SIF_Nov2019
GuideToImmunopharmacology_SIF_Nov2019
 
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019GtoPDB_ELIXIR_UK_AllHands_update_Dec2019
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019
 
PubChem for drug discovery in the age of big data and artificial intelligence
PubChem for drug discovery in the age of big data and artificial intelligencePubChem for drug discovery in the age of big data and artificial intelligence
PubChem for drug discovery in the age of big data and artificial intelligence
 
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
 
dkNET Poster Experimental Biology 2019
dkNET Poster Experimental Biology 2019dkNET Poster Experimental Biology 2019
dkNET Poster Experimental Biology 2019
 
DisGeNET: a discovery platform to support translational research and drug dis...
DisGeNET: a discovery platform to support translational research and drug dis...DisGeNET: a discovery platform to support translational research and drug dis...
DisGeNET: a discovery platform to support translational research and drug dis...
 

Andere mochten auch

Blerta Interpreter 1
Blerta Interpreter 1Blerta Interpreter 1
Blerta Interpreter 1Blerta
 
IT for Nursing - 5
IT for Nursing - 5IT for Nursing - 5
IT for Nursing - 5Sascha Funk
 
Innovation Benefits Realization for Industrial Research (Part-3)
Innovation Benefits Realization for Industrial Research (Part-3)Innovation Benefits Realization for Industrial Research (Part-3)
Innovation Benefits Realization for Industrial Research (Part-3)Iain Sanders
 
Web 2.0, Dutch Railways & Teststation Leiden
Web 2.0, Dutch Railways & Teststation LeidenWeb 2.0, Dutch Railways & Teststation Leiden
Web 2.0, Dutch Railways & Teststation LeidenCoenDirkx
 
Survival of the Quickest
Survival of the QuickestSurvival of the Quickest
Survival of the QuickestChristina_W
 
Web Configurator
Web ConfiguratorWeb Configurator
Web Configuratormikuzz
 
Technical Communication Lab Projects
Technical Communication Lab ProjectsTechnical Communication Lab Projects
Technical Communication Lab ProjectsDebopriyo Roy
 
Innovation Benefits Realization for Industrial Research (Part-2)
Innovation Benefits Realization for Industrial Research (Part-2)Innovation Benefits Realization for Industrial Research (Part-2)
Innovation Benefits Realization for Industrial Research (Part-2)Iain Sanders
 
Guerrilla Readers - Portfolio
Guerrilla Readers - PortfolioGuerrilla Readers - Portfolio
Guerrilla Readers - PortfolioGuerrilla Readers
 
Titian Presentation
Titian PresentationTitian Presentation
Titian Presentationkatsen
 
Android Bootcamp Santa Fe GTUG
Android Bootcamp Santa Fe GTUGAndroid Bootcamp Santa Fe GTUG
Android Bootcamp Santa Fe GTUGmatiasmolinas
 

Andere mochten auch (20)

Lourenza
LourenzaLourenza
Lourenza
 
Blerta Interpreter 1
Blerta Interpreter 1Blerta Interpreter 1
Blerta Interpreter 1
 
Sentimenduak
SentimenduakSentimenduak
Sentimenduak
 
Squizz presentation
Squizz presentationSquizz presentation
Squizz presentation
 
Classical Series
Classical SeriesClassical Series
Classical Series
 
IT for Nursing - 5
IT for Nursing - 5IT for Nursing - 5
IT for Nursing - 5
 
Innovation Benefits Realization for Industrial Research (Part-3)
Innovation Benefits Realization for Industrial Research (Part-3)Innovation Benefits Realization for Industrial Research (Part-3)
Innovation Benefits Realization for Industrial Research (Part-3)
 
Web 2.0, Dutch Railways & Teststation Leiden
Web 2.0, Dutch Railways & Teststation LeidenWeb 2.0, Dutch Railways & Teststation Leiden
Web 2.0, Dutch Railways & Teststation Leiden
 
Survival of the Quickest
Survival of the QuickestSurvival of the Quickest
Survival of the Quickest
 
Web Configurator
Web ConfiguratorWeb Configurator
Web Configurator
 
Ishii presentation
Ishii presentationIshii presentation
Ishii presentation
 
CloudCoffee
CloudCoffeeCloudCoffee
CloudCoffee
 
Technical Communication Lab Projects
Technical Communication Lab ProjectsTechnical Communication Lab Projects
Technical Communication Lab Projects
 
Innovation Benefits Realization for Industrial Research (Part-2)
Innovation Benefits Realization for Industrial Research (Part-2)Innovation Benefits Realization for Industrial Research (Part-2)
Innovation Benefits Realization for Industrial Research (Part-2)
 
BC#1
BC#1BC#1
BC#1
 
Force Majeure: a Time Bomb
Force Majeure: a Time BombForce Majeure: a Time Bomb
Force Majeure: a Time Bomb
 
Guerrilla Readers - Portfolio
Guerrilla Readers - PortfolioGuerrilla Readers - Portfolio
Guerrilla Readers - Portfolio
 
Titian Presentation
Titian PresentationTitian Presentation
Titian Presentation
 
Android Bootcamp Santa Fe GTUG
Android Bootcamp Santa Fe GTUGAndroid Bootcamp Santa Fe GTUG
Android Bootcamp Santa Fe GTUG
 
Dfi Portfolio
Dfi PortfolioDfi Portfolio
Dfi Portfolio
 

Ähnlich wie Powering Scientific Discovery with Semantic Web Integration

Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Michel Dumontier
 
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.caGenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.cafionabrinkman
 
TB Mobile: Appifying data on antituberculosis molecule targets
TB Mobile: Appifying data on antituberculosis molecule targetsTB Mobile: Appifying data on antituberculosis molecule targets
TB Mobile: Appifying data on antituberculosis molecule targetsSean Ekins
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance
 
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...MseqDR consortium: a grass-roots effort to establish a global resource aimed ...
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...Human Variome Project
 
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...ExternalEvents
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Philip Bourne
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!adcobb
 
Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.Elena Sügis
 
Open science and medical evidence generation - Kees van Bochove - The Hyve
Open science and medical evidence generation - Kees van Bochove - The HyveOpen science and medical evidence generation - Kees van Bochove - The Hyve
Open science and medical evidence generation - Kees van Bochove - The HyveKees van Bochove
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europeopen_phacts
 
Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Ankur Khanna
 
5. BIOINFORMATICS.pptx B.Pharm sem 2 Computer Applications in Pharmacy
5. BIOINFORMATICS.pptx B.Pharm sem 2 Computer Applications in Pharmacy5. BIOINFORMATICS.pptx B.Pharm sem 2 Computer Applications in Pharmacy
5. BIOINFORMATICS.pptx B.Pharm sem 2 Computer Applications in PharmacyVedika Narvekar
 
Tactical Formalization of Linked Open Data (Ontology Summit 2014)
Tactical Formalization of Linked Open Data (Ontology Summit 2014)Tactical Formalization of Linked Open Data (Ontology Summit 2014)
Tactical Formalization of Linked Open Data (Ontology Summit 2014)Michel Dumontier
 
Big Data, AI, and Pharma
Big Data, AI, and PharmaBig Data, AI, and Pharma
Big Data, AI, and PharmaAmit Sheth
 
Pharmacogenomics annotation in drug structured product labeling for clinical ...
Pharmacogenomics annotation in drug structured product labeling for clinical ...Pharmacogenomics annotation in drug structured product labeling for clinical ...
Pharmacogenomics annotation in drug structured product labeling for clinical ...Richard Boyce, PhD
 
20160811 Big Data for Health and Medicine
20160811 Big Data for Health and Medicine20160811 Big Data for Health and Medicine
20160811 Big Data for Health and MedicineBrian Bot
 
CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...CINECAProject
 

Ähnlich wie Powering Scientific Discovery with Semantic Web Integration (20)

Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
 
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.caGenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
 
TB Mobile: Appifying data on antituberculosis molecule targets
TB Mobile: Appifying data on antituberculosis molecule targetsTB Mobile: Appifying data on antituberculosis molecule targets
TB Mobile: Appifying data on antituberculosis molecule targets
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016
 
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...MseqDR consortium: a grass-roots effort to establish a global resource aimed ...
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
 
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!
 
Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.
 
Open science and medical evidence generation - Kees van Bochove - The Hyve
Open science and medical evidence generation - Kees van Bochove - The HyveOpen science and medical evidence generation - Kees van Bochove - The Hyve
Open science and medical evidence generation - Kees van Bochove - The Hyve
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
Online Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery SystemsOnline Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery Systems
 
Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma
 
5. BIOINFORMATICS.pptx B.Pharm sem 2 Computer Applications in Pharmacy
5. BIOINFORMATICS.pptx B.Pharm sem 2 Computer Applications in Pharmacy5. BIOINFORMATICS.pptx B.Pharm sem 2 Computer Applications in Pharmacy
5. BIOINFORMATICS.pptx B.Pharm sem 2 Computer Applications in Pharmacy
 
Tactical Formalization of Linked Open Data (Ontology Summit 2014)
Tactical Formalization of Linked Open Data (Ontology Summit 2014)Tactical Formalization of Linked Open Data (Ontology Summit 2014)
Tactical Formalization of Linked Open Data (Ontology Summit 2014)
 
Big Data, AI, and Pharma
Big Data, AI, and PharmaBig Data, AI, and Pharma
Big Data, AI, and Pharma
 
Pharmacogenomics annotation in drug structured product labeling for clinical ...
Pharmacogenomics annotation in drug structured product labeling for clinical ...Pharmacogenomics annotation in drug structured product labeling for clinical ...
Pharmacogenomics annotation in drug structured product labeling for clinical ...
 
20160811 Big Data for Health and Medicine
20160811 Big Data for Health and Medicine20160811 Big Data for Health and Medicine
20160811 Big Data for Health and Medicine
 
CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...
 

Mehr von Michel Dumontier

A metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsA metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsMichel Dumontier
 
Data-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsData-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsMichel Dumontier
 
The Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health SystemThe Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health SystemMichel Dumontier
 
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...Michel Dumontier
 
The role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health SystemThe role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health SystemMichel Dumontier
 
Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...Michel Dumontier
 
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...Michel Dumontier
 
Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Michel Dumontier
 
The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...Michel Dumontier
 
Keynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerKeynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerMichel Dumontier
 
The future of science and business - a UM Star Lecture
The future of science and business - a UM Star LectureThe future of science and business - a UM Star Lecture
The future of science and business - a UM Star LectureMichel Dumontier
 
Developing and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesDeveloping and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesMichel Dumontier
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRMichel Dumontier
 
A Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsA Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsMichel Dumontier
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationMichel Dumontier
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessMichel Dumontier
 

Mehr von Michel Dumontier (20)

A metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsA metadata standard for Knowledge Graphs
A metadata standard for Knowledge Graphs
 
Data-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsData-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge Graphs
 
Evaluating FAIRness
Evaluating FAIRnessEvaluating FAIRness
Evaluating FAIRness
 
The Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health SystemThe Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health System
 
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
 
The role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health SystemThe role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health System
 
Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...
 
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
 
Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?
 
The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...
 
Keynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerKeynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University Dinner
 
The future of science and business - a UM Star Lecture
The future of science and business - a UM Star LectureThe future of science and business - a UM Star Lecture
The future of science and business - a UM Star Lecture
 
Are we FAIR yet?
Are we FAIR yet?Are we FAIR yet?
Are we FAIR yet?
 
Developing and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesDeveloping and assessing FAIR digital resources
Developing and assessing FAIR digital resources
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIR
 
A Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsA Framework to develop the FAIR Metrics
A Framework to develop the FAIR Metrics
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluation
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRness
 
Data Science for the Win
Data Science for the WinData Science for the Win
Data Science for the Win
 
Ontologies
OntologiesOntologies
Ontologies
 

Kürzlich hochgeladen

Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 

Kürzlich hochgeladen (20)

Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 

Powering Scientific Discovery with Semantic Web Integration

  • 1. Powering Scientific Discovery with the Semantic Web 1 Michel Dumontier, Ph.D. Associate Professor of Medicine (Biomedical Informatics) Stanford University @micheldumontier::VanBUG:13-03-2014
  • 4. We use the biomedical literature to gather evidence 4 @micheldumontier::VanBUG:13-03-2014
  • 5. we query literature-curated databases (problems: access, format, identifiers & linking) 5 @micheldumontier::VanBUG:13-03-2014
  • 6. we use software to analyze, predict and evaluate (problems: OS, versioning, input/output formats) 6 @micheldumontier::VanBUG:13-03-2014
  • 7. we develop fairly sophisticated workflows to gather support for our hypotheses 7 @micheldumontier::VanBUG:13-03-2014
  • 8. Wouldn’t it be great if we could just find the evidence required to support or dispute a scientific hypothesis using the most up-to-date and relevant data, tools and scientific knowledge? 8 @micheldumontier::VanBUG:13-03-2014
  • 9. madness! 1. Build a massive network of interconnected data and software using web standards – Bio2RDF + SADI 2. Generate and test scientific hypotheses – prepare data (tactical formalization) – discover associations (ontology-based enrichment) – gather evidence (HyQue) 3. Contribute back to the global knowledge graph @micheldumontier::VanBUG:13-03-20149
  • 10. The Semantic Web is the new global web of knowledge 10 @micheldumontier::VanBUG:13-03-2014 standards for publishing, sharing and querying facts, expert knowledge and services scalable approach for the discovery of independently formulated and distributed knowledge
  • 11. Resource Description Framework • It’s a language to represent knowledge – Logic-based formalism -> automated reasoning – graph-like properties -> data analysis • Good for – Describing in terms of type, attributes, relations – Integrating data from different sources – Sharing the data (W3C standard) – Reusing what is available, developing what you need, and contributing back to the web of data. @micheldumontier::VanBUG:13-03-201411
  • 12. we’re building an incredible network of linked data 12 @micheldumontier::VanBUG:13-03-2014Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch
  • 13. @micheldumontier::VanBUG:13-03-2014 Linked Data for the Life Sciences 13 Bio2RDF is an open source project to unify the representation and interlinking of biological data using RDF. chemicals/drugs/formulations, genomes/genes/proteins, domains Interactions, complexes & pathways animal models and phenotypes Disease, genetic markers, treatments Terminologies & publications • Release 2 (Jan 2013): 1B+ interlinked statements from 19 high value datasets • dataset description, provenance & statistics • Partnerships with EBI, NCBI, DBCLS, NCBO, OpenPHACTS, and commercial tool providers
  • 15. The linked data network expands with every reference @micheldumontier::VanBUG:13-03-2014 drugbank:DB00586 pharmgkb_vocabulary:Drug rdf:type rdfs:label diclofenac [drugbank:DB00586] pharmgkb:PA449293 drugbank_vocabulary:Drug pharmgkb_vocabulary:xref diclofenac [pharmgkb:PA449293] rdfs:label DrugBank PharmGKB 15
  • 20. Graph summaries in query formulation @micheldumontier::VanBUG:13-03-2014 PREFIX drugbank_vocabulary: <http://bio2rdf.org/drugbank_vocabulary:> PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#> SELECT ?ddi ?d1name ?d2name WHERE { ?ddi a drugbank_vocabulary:Drug-Drug-Interaction . ?d1 drugbank_vocabulary:ddi-interactor-in ?ddi . ?d1 rdfs:label ?d1name . ?d2 drugbank_vocabulary:ddi-interactor-in ?ddi . ?d2 rdfs:label ?d2name. FILTER (?d1 != ?d2) } 20
  • 21. You can use the SPARQLed query assistant with updated endpoints @micheldumontier::VanBUG:13-03-2014 http://sindicetech.com/sindice-suite/sparqled/ graph: http://sindicetech.com/analytics21
  • 22. Use virtuoso’s built in faceted browser to construct increasingly complex queries with little effort @micheldumontier::VanBUG:13-03-201422
  • 23. Federated Queries over Independent SPARQL EndPoints Get all protein catabolic processes (and more specific) in biomodels SELECT ?go ?label count(distinct ?x) WHERE { service <http://bioportal.bio2rdf.org/sparql> { ?go rdfs:label ?label . ?go rdfs:subClassOf ?tgo ?tgo rdfs:label ?tlabel . FILTER regex(?tlabel, "^protein catabolic process") } service <http://biomodels.bio2rdf.org/sparql> { ?x <http://bio2rdf.org/biopax_vocabulary:identical-to> ?go . ?x a <http://www.biopax.org/release/biopax-level3.owl#BiochemicalReaction> . } } @micheldumontier::VanBUG:13-03-201423
  • 24. 24 @micheldumontier::VanBUG:13-03-2014 Bio2RDF: 1M+ SPARQL queries per month UniProt: 6.4M queries per month EBI: 3.5M queries (Oct 2013)
  • 25. Despite all the data, it’s still hard to find answers to questions Because there are many ways to represent the same data and each dataset represents it differently @micheldumontier::VanBUG:13-03-201425
  • 26. multiple formalizations of the same kind of data do emerge, each with their own merit @micheldumontier::VanBUG:13-03-201426
  • 27. Massive Proliferation of Ontologies / Vocabularies @micheldumontier::VanBUG:13-03-201427
  • 28. Multi-Stakeholder Efforts to Standardize Representations are Reasonable, Long Term Strategies for Data Integration @micheldumontier::VanBUG:13-03-201428
  • 29. uniprot:P05067 uniprot:Protein is a sio:gene is a is a Semantic data integration, consistency checking and query answering over Bio2RDF with the Semanticscience Integrated Ontology (SIO) dataset ontology Knowledge Base @micheldumontier::VanBUG:13-03-2014 pharmgkb:PA30917 refseq:Protein is a is a omim:189931 omim:Gene pharmgkb:Gene Querying Bio2RDF Linked Open Data with a Global Schema. Alison Callahan, José Cruz-Toledo and Michel Dumontier. Bio-ontologies 2012. 29
  • 30. @micheldumontier::VanBUG:13-03-201430 SRIQ(D) 10700+ axioms 1300+ classes 201 object properties (inc. inverses) 1 datatype property
  • 31. Bio2RDF and SIO powered SPARQL 1.1 federated query: Find chemicals (from CTD) and proteins (from SGD) that participate in the same process (from GOA) SELECT ?chem, ?prot, ?proc FROM <http://bio2rdf.org/ctd> WHERE { SERVICE <http://ctd.bio2rdf.org/sparql> { ?chemical a sio:chemical-entity. ?chemical rdfs:label ?chem. ?chemical sio:is-participant-in ?process. ?process rdfs:label ?proc. FILTER regex (?process, "http://bio2rdf.org/go:") } SERVICE <http://sgd.bio2rdf.org/sparql> { ?protein a sio:protein . ?protein sio:is-participant-in ?process. ?protein rdfs:label ?prot . } } @micheldumontier::VanBUG:13-03-201431
  • 32. tactical formalization @micheldumontier::VanBUG:13-03-201432 Take what you need and represent it in a way that directly serves your objective STANDARDSUSER DRIVEN REPRESENTATION identifying aberrant and pharmacological pathways predicting drug targets using organism phenotypes Biopax-pathway exploration SPARQL-powered genome navigation
  • 33. aberrant and pharmacological pathways Q1. Can we identify pathways that are associated with a particular disease or class of diseases? Q2. Can we identify pathways are associated with a particular drug or class of drugs? drug pathway disease @micheldumontier::VanBUG:13-03-201433
  • 34. Identification of drug and disease enriched pathways • Approach – Integrate 3 datasets • DrugBank, PharmGKB and CTD – Integrate 7 terminologies • MeSH, ATC, ChEBI, UMLS, SNOMED, ICD, DO – Formalize selected elements – Identify significant associations using enrichment analysis over the fully inferred knowledge base Identifying aberrant pathways through integrated analysis of knowledge in pharmacogenomics. Bioinformatics. 2012. @micheldumontier::VanBUG:13-03-201434
  • 35. Formal knowledge representation as a strategy for data integration @micheldumontier::VanBUG:13-03-201435
  • 36. Have you heard of OWL? @micheldumontier::VanBUG:13-03-201436
  • 37. mercaptopurine [pharmgkb:PA450379] mercaptopurine [drugbank:DB01033] purine-6-thiol [CHEBI:2208] Class Equivalence mercaptopurine [ATC:L01BB02] Top Level Classes (disjointness) drug diseasegenepathway property chains Class subsumption mercaptopurine [mesh:D015122] Reciprocal Existentials drug disease pathway gene Formalized as an OWL-EL ontology 650,000+ classes, 3.2M subClassOf axioms, 75,000 equivalentClass axioms @micheldumontier::VanBUG:13-03-201437
  • 38. Benefits: Enhanced Query Capability – Use any mapped terminology to query a target resource. – Use knowledge in target ontologies to formulate more precise questions • ask for drugs that are associated with diseases of the joint: ‘Chikungunya’ (do:0050012) is defined as a viral infectious disease located in the ‘joint’ (fma:7490) and caused by a ‘Chikungunya virus’ (taxon:37124). – Learn relationships that are inferred by automated reasoning. • alcohol (ChEBI:30879) is associated with alcoholism (PA443309) since alcoholism is directly associated with ethanol (CHEBI:16236) • ‘parasitic infectious disease’ (do:0001398) associated with 129 drugs, 15 more than are directly linked. @micheldumontier::VanBUG:13-03-201438
  • 39. Knowledge Discovery through Data Integration and Enrichment Analysis • OntoFunc: Tool to discover significant associations between sets of objects and ontology categories. enrichment of attribute among a selected set of input items as compared to a reference set. hypergeometric or the binomial distribution, Fisher's exact test, or a chi-square test. • We found 22,653 disease-pathway associations, where for each pathway we find genes that are linked to disease. – Mood disorder (do:3324) associated with Zidovudine Pathway (pharmgkb:PA165859361). Zidovudine is for treating HIV/AIDS. Side effects include fatigue, headache, myalgia, malaise and anorexia • We found 13,826 pathway-chemical associations – Clopidogrel (chebi:37941) associated with Endothelin signaling pathway (pharmgkb:PA164728163). Endothelins are proteins that constrict blood vessels and raise blood pressure. Clopidogrel inhibits platelet aggregation and prolongs bleeding time. @micheldumontier::VanBUG:13-03-201439
  • 40. Tactical Formalization + Automated Reasoning Offers Compelling Value for Certain Problems We need to be smart about the goal, and how best to achieve it. Tactical formalization is another tool in the toolbox. We’ve formalized data as OWL ontologies to verify, fix and exploit Linked Data through expressive OWL reasoning • To identify mistakes in human curated knowledge • To identify conflicting meaning in terms • To identify mistakes in the representation of RDF data o incorrect use of relations o incorrect assertion of identity (owl:sameAs) Many other applications can be envisioned. @micheldumontier::VanBUG:13-03-201440
  • 41. PhenomeDrug A computational approach to predict drug targets, drug effects, and drug indications using phenotypes @micheldumontier::VanBUG:13-03-201441 Mouse model phenotypes provide information about human drug targets. Hoehndorf R, Hiebert T, Hardy NW, Schofield PN, Gkoutos GV, Dumontier M. Bioinformatics. 2013.
  • 42. animal models provide insight for on target effects • In the majority of 100 best selling drugs ($148B in US alone), there is a direct correlation between knockout phenotype and drug effect • Gastroesophageal reflux – Proton pump inhibitors (e.g Prilosec - $5B) – KO of alpha or beta unit of H/K ATPase are achlorhydric (normal pH) in stomach compared to wild type (pH 3-5) when exposed to acidic solution • Immunological Indications – Anti-histamines (Claritin, Allegra, Zyrtec) – KO of histamine H1 receptor leads to decreased responsiveness of immune system – Predicts on target effects : drowsiness, reduced anxiety @micheldumontier::VanBUG:13-03-201442 Zambrowicz and Sands. Nat Rev Drug Disc. 2003.
  • 43. Identifying drug targets from mouse knock-out phenotypes @micheldumontier::VanBUG:13-03-201443 drug gene phenotypes effects human gene non-functional gene model ortholog similar inhibits Main idea: if a drug’s phenotypes matches the phenotypes of a null model, this suggests that the drug is an inhibitor of the gene
  • 44. Terminological Interoperability (we must compare apples with apples) Mouse Phenotypes Drug effects (mappings from UMLS to DO, NBO, MP) Mammalian Phenotype OntologyPhenomeNet PhenomeDrug @micheldumontier::VanBUG:13-03-2014
  • 45. Semantic Similarity @micheldumontier::VanBUG:13-03-201445 Given a drug effect profile D and a mouse model M, we compute the semantic similarity as an information weighted Jaccard metric. The similarity measure used is non-symmetrical and determines the amount of information about a drug effect profile D that is covered by a set of mouse model phenotypes M.
  • 46. Loss of function models predict targets of inhibitor drugs • 14,682 drugs; 7,255 mouse genotypes • Validation against known and predicted inhibitor-target pairs – 0.76 ROC AUC for human targets (DrugBank) – 0.81 ROC AUC for mouse targets (STITCH) • diclofenac (STITCH:000003032) – NSAID used to treat pain, osteoarthritis and rheumatoid arthritis – Drug effects include liver inflammation (hepatitis), swelling of liver (hepatomegaly), redness of skin (erythema) – 49% explained by PPARg knockout • peroxisome proliferator activated receptor gamma (PPARg) regulates metabolism, proliferation, inflammation and differentiation, • Diclofenac is a known inhibitor – 46% explained by COX-2 knockout • Diclofenac is a known inhibitor @micheldumontier::VanBUG:13-03-2014
  • 47. Using the Semantic Web to Gather Evidence for Scientific Hypotheses What evidence supports or disputes that TKIs are cardiotoxic? @micheldumontier::VanBUG:13-03-201447
  • 48. • Tyrosine Kinase Inhibitors (TKI) – Imatinib, Sorafenib, Sunitinib, Dasatinib, Nilotinib, Lapatinib – Used to treat cancer – Linked to cardiotoxicity. • FDA launched drug safety program to detect toxicity – Need to integrate data and ontologies (Abernethy, CPT 2011) – Abernethy (2013) suggest using public data in genetics, pharmacology, toxicology, systems biology, to predict/validate adverse events • What evidence could we gather to give credence that TKI’s causes non-QT cardiotoxicity? @micheldumontier::VanBUG:13-03-201448 FDA Use Case: TKI non-QT Cardiotoxicity
  • 49. @micheldumontier::VanBUG:13-03-201449 Jane P.F. Bai and Darrell R. Abernethy. Systems Pharmacology to Predict Drug Toxicity: Integration Across Levels of Biological Organization. Annu. Rev. Pharmacol. Toxicol. 2013.53:451-473
  • 50. • The goal of HyQue is retrieve and evaluate evidence that supports/disputes a hypothesis – hypotheses are described as a set of events • e.g. binding, inhibition, phenotypic effect – events are associated with types of evidence • a query is written to retrieve data • a weight is assigned to provide significance • Hypotheses are written by people who seek answers • data retrieval rules are written by people who know the data and how it should be interpreted  @micheldumontier::VanBUG:13-03-201450 HyQue 1. HyQue: Evaluating hypotheses using Semantic Web technologies. J Biomed Semantics. 2011 May 17;2 Suppl 2:S3. 2. Evaluating scientific hypotheses using the SPARQL Inferencing Notation. Extended Semantic Web Conference (ESWC 2012). Heraklion, Crete. May 27-31, 2012.
  • 51. HyQue: A Semantic Web Application @micheldumontier::VanBUG:1 51 Software OntologiesData Hypothesis Evaluation
  • 52. What evidence might we gather? • clinical: Are there cardiotoxic effects associated with the drug? – Literature (studies) [curated db] – Product labels (studies) [r3:sider] – Clinical trials (studies) [r3:clinicaltrials] – Adverse event reports [r2:pharmgkb/onesides] – Electronic health records (observations) • pre-clinical associations: – genotype-phenotype (null/disease models) [r2:mgi, r2:sgd; r3:wormbase] – in vitro assays (IC50) [r3:chembl] – drug targets [r2:drugbank; r2:ctd; r3:stitch] – drug-gene expression [r3:gxa] – pathways [r2:kegg; r3:reactome] – Drug-pathway, disease-pathway enrichments [aberrant pathways] – Chemical properties [r2:pubchem; r2.drugbank] – Toxicology [r1.toxkb/cebs] @micheldumontier::VanBUG:13-03-201452
  • 53. Data retrieval is done with SPARQL @micheldumontier::VanBUG:13-03-201453
  • 54. Data Evaluation is done with SPIN rules @micheldumontier::VanBUG:13-03-201454
  • 60. In Summary • This talk was about making sense of the structured data we already have • RDF-based Linked Open Data acts as a substrate for query answering and task-based formalization in OWL • Discovery through the generation of testable hypotheses in the target domain. • Using Linked Data to gather evidence regarding scientific hypotheses • The big challenge lies in establishing discovery pipelines with user-generated scientific data. @micheldumontier::VanBUG:13-03-201460
  • 61. Acknowledgements Bio2RDF Release 2: Allison Callahan, Jose Cruz-Toledo, Peter Ansell Aberrant Pathways: Robert Hoehndorf, Georgios Gkoutos PhenomeDrug: Tanya Hiebert, Robert Hoehndorf, Georgios Gkoutos, Paul Schofield TKI Cardiotoxicity: Alison Callahan, Tania Hiebert, Beatriz Lujan, Sira Sarntivijai (FDA) @micheldumontier::VanBUG:13-03-201461