SlideShare ist ein Scribd-Unternehmen logo
1 von 74
Open Targets: integrating
genetics and genomics for
disease biology and
translational medicine
What is the Open Targets
Partnership?
How to navigate the
Open Targets Platform?
Aims
Are there other Open
Targets tools?
Where do I get
help?
Source: PhRMA adaptation based on Tufts CSDD & School of Medicine, and FDA
Lengthy, costly, low success rate, HIGH ATTRITION RATES
Drug discovery: some challenges
Public databases for drug discovery
• EMBL-EBI (European Bioinformatics Institute)
• Elsewhere
Fit everything together
• Time consuming
• Possible lack of resources or expertise
• …
I wish I did not have to go to all
those different places to get the
information I’m after.
I know. If we only had a one-stop
shop with as much data as possible,
plus new analyses and links to the
original source for my own
assessment.
A resource that is
comprehensive, trustworthy,
up-to-date, sustainable,
easy-to-use and free.
Open Targets is all you need!
Our Vision
A partnership to transform drug discovery
through the systematic identification and
prioritisation of targets
https://www.opentargets.org
2014 2016 2017 2018
Data
generation
Therapeutic
hypothesis
Public data
Data
integration
targetvalidation.org
Experimental projects Bioinformatic projects
Virtuous cycle in Open Targets
www.opentargets.org/projects
Concurrent
Open Targets generates data
www.opentargets.org/science/
• EMBL-EBI and Sanger Institute
• > 1,000 cancer cell lines + drug sensitivity data
• RNASeq, CRISPR screens
• Sanger Institute and GSK
• Genome wide knockouts in gut epithelium
• Organoids, metagenomics
• Alzheimer’s and Parkinson’s
• CRISPR screens, iPS cells
• Sanger Institute, Biogen, Gurdon
Open Targets integrates data*
• EMBL-EBI, Biogen, GSK
• Associations between targets and diseases
• Germline variants
• Somatic mutations
• Drug information
• RNA expression
• Animal models
• Text mining
* Publicly available resources
In addition to upcoming Open Targets experimental data
Open Targets Platform
https://www.targetvalidation.org
• Associations between targets and diseases…
• Ensembl Gene IDs e.g. ENSGXXXXXXXXXXX
• UniProt IDs e.g P15056
• HGNC names e.g. DMD
• Also non-coding RNA genes
Our targets  genes or proteins
• Modified version of Experimental Factor Ontology (EFO)
• Controlled vocabulary (Alzheimers versus Alzheimer’s)
• Hierarchy (relationships)
Our diseases
• Promotes consistency
• Increases the richness of annotation
• Allow for easier and automatic integration
Evidence for our T-D associations
https://docs.targetvalidation.org/data-sources/data-sources
Data sources grouped into data types
Genetic
Associations
Somatic
Mutations
Drugs
Affected
Pathways
Differentia
l RNA
expression
Animal
Models
Text
Mining
EVA
GWAS
Catalog
PheWA
S
Cancer Gene
Census
EVA
Expression Atlas PhenoDigm
Europe
PMC
G2P
How the data* flows
JSON
summary
document
Validator Association
score
calculation
Target Profile
Disease profile
* e.g. genetic variants from NHGRI-GWAS catalog
JSON summary document
* IDs (gene, disease, papers) + curation (e.g. manual) + evidence + source + stats for the score
JSON
Association score
Which targets have more
evidence for an association?
What is the relative weight of the
evidence for different targets?
Statistical integration, aggregation and scoring
Four-tier scoring framework
https://docs.targetvalidation.org/getting-started/scoring
A) per evidence (e.g. one SNP from a GWAS paper)
B) per data source (e.g. SNPs from the GWAS catalog)
C) per data type (e.g. Genetic associations)
D) overall
EVA
UniProt
Gene2Phenotype
GWAS catalog
Cancer Gene Census
EVA (somatic)
IntOGen
ChEMBL
Reactome
Expression Atlas
Europe PMC
PhenoDigm
Genetic associations
Somatic mutations
RNA expression
Animal models
Affected pathways
Text mining
Drugs
*1.0
*1.0
*1.0
*1.0
*1.0
*1.0
*1.0
*1.0
*1.0
*0.5
*0.2
*0.2
Association
S1 + S2/22
+ S3/32
+ S4/42
+ Si/i2
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
ΣH
Genomics England
PhEWAS catalog
*1.0
*1.0
ΣH
ΣH
Calculated at 4
levels:
•Evidence
•Data source
•Data type
•Overall
Score: 0 to 1 (max)
weight factor
Aggregation with
(harmonic sum)
ΣH
Note: Each data set has
its own scoring and
ranking scheme
Aggregating data  harmonic sum
f = sample size (cases versus controls)
s = predicted functional consequence (VEP)
c = p value reported in the paper
Factors affecting the relative strength of an evidence
e.g. GWAS Catalog
S = f * s * c
f, relative occurrence of a target-disease evidence
s, strength of the effect described by the evidence
c, confidence of the observation for the target-disease evidence
https://docs.targetvalidation.org/getting-started/scoring
Ranking target-disease association
Association score: the overall score across all data types
• Based on the data sources
• Different weight applied:
genetic association = drugs = mutations = pathways > RNA expression > animal models = text mining
https://www.targetvalidation.org/
Demo 1: Disease centric workflow
What is the evidence for the
association between a target
and a disease?
Which targets are
associated with a disease?
Pages 7 - 30
In addition to T-D associations
• Everything you wanted to know about…
… but were afraid to ask.
Disease
profile page
Target profile
page
Target profile page*
Protein Drugs
Pathways
interactions
RNA and
protein
baseline
expression
Variants,
isoforms
and
genomic
context
Mouse
phenotypes
Bibliography
Description
Synonyms
Gene Ontology
Protein Structure
Protein Interactions
Similar
Targets
Expression
Atlas
Library/LINK
Extra, extra, extra!
Cancer hallmarks and cancer biomarkers
Gene tree
* e.g. http://www.targetvalidation.org/target/ENSG00000141510
Classification Drugs
Similar
diseases
Bibliography
Open Targets
Library/LINK
Disease profile page*
* e.g. http://www.targetvalidation.org/disease/Orphanet_262
Modes of data access
We have a list of 26 possible
targets for inflammatory
bowel disease?
https://tinyurl.com/batch-video
Demo 2: Batch search
Are these targets represented
in other diseases?
Which pathways are
represented in this set of
targets?
https://api.opentargets.io/v3/platform/public/evidence/filter?
target=ENSG00000141867&disease=EFO_0000565&datatype=expression_atla
s&size=100&format=json
REST API calls: some examples*
https://api.opentargets.io/v3/platform/public/search?q=EFO_0003767
https://api.opentargets.io/v3/platform/public/association/filter?
target=ENSG00000110324&direct=false&fields=is_direct&fields=disease.efo_info.la
bel&size=100
* blog.opentargets.org/tag/api/
https://api.opentargets.io/v3/platform/public/search?q=asthma
Breaking down the URLs
https://api.opentargets.io/v3/platform/ public/association/filter
?target=ENSG00000163914&size=10000&fields=target.id&fields=disease.id
Server
Endpoint parameters
Parameters
https://api.opentargets.io/v3/platform/public/association/filter?
target=ENSG00000163914&size=10000&fields=target.id&fields=disease.id
http://api.opentargets.io/v3/platform/docs#
The documentation
Private: methods used by the UI to serve external data.
Subject to change without notice
http://api.opentargets.io/v3/platform/docs#
How to search
http://api.opentargets.io/v3/platform/docs
REST API: some use cases
How to get all diseases
associated with a target
How to get the association score
for a target – disease pair
How to get the evidence for a
target – disease association
How to search
https://api.opentargets.io/v3/platform/public/search?q=PTEN
http://api.opentargets.io/v3/platform
/public/association/filter?
target=ENSG00000171862
&direct=true
How to get all diseases associated with a target
http://api.opentargets.io/v3/platform/
public/association?id=
ENSG00000171862-EFO_0000616
How to get the score for a target – disease pair
How to get the evidence for an association
http://api.opentargets.io/v3/platform/
public/evidence/filter
?target=ENSG00000171862
http://api.opentargets.io/v3/platform
/public/evidence/filter?
target=ENSG00000171862
&disease=Orphanet_2563
Introduction: REST API webinar
https://youtu.be/KQbfhwpeEvc
How to run our REST endpoints
* http://opentargets.readthedocs.io/en/stable/index.html
• Paste the URL in the location bar in a browser
• Use the terminal window (e.g. with CURL command)
• Use our free clients (i.e. Python* and R**)
• Call them from your own application/workflow
** no longer supported by Open Targets; feel free to make PRs
Paste the URL in the a location bar
Command line e.g. CURL –X GET
Python and R clients for the REST API
http://opentargets.readthedocs.iohttp://opentargets.readthedocs.io
No longer supported by
Open Targets
No longer supported by
Open Targets
Can we change the way the
associations are scored? Perhaps
to increase the weight on text
mining data?
Yes, you can with the Open Targets Python client!
Data downloads
Open Targets toolkit: LINK
• LINK: Literature coNcept Knowledgebase
• Subject / predicate / object structured relations
From PubMed abstracts
Proof of Concept
Further developement
http://link.opentargets.io/
Addressing text mining shortcomings
• Entities: genes, diseases, drugs
• Concepts extracted via NLP
(Natural Language Processing)
• 28 M documents, 500 M relations
• http://blog.opentargets.org/link/
Open Targets toolkit: DoRothEA
• Candidate TF-drug interactions in cancer
• 1000 cancer cell lines
• 265 anti-cancer compounds
• 127 transcription factors
http://cancerres.aacrjournals.org/content/early/2017/12/09/0008-5472.CAN-17-1679
dorothea.opentargets.io
Example: Rapamycin
• ~ 1000 cancer cell lines
• 265 anti-cancer compounds
• 127 transcription factors
• Resource of integrated multiomics data
• Added value (e.g. score) and links to original sources
• Graphical web interface: easy to use
April 2018 release
Open Targets Platform
21K
targets
9.7K
diseases
2.3 M
associations
6.1 M
evidence
Data
generation
Therapeutic
hypothesis
Public data
Data
integration
targetvalidation.org
Experimental projects Bioinformatic projects
Virtuous cycle in Open Targets
www.opentargets.org/projects
Concurrent
We support decision-making
Which targets are
associated with a
disease?
Are there FDA drugs
for this association?
…
Can I find out about the
mechanisms of the
disease?
How to access the Platform
Core bioinformatics
pipelines
www.opentargets.org/projects
Experimental
projects
Generate new evidence
CRISPR/Cas9
Organoids and IPS cells
(cellular models for disease)
Integration of available data
Web interface
Batch search tool
REST API
Data dumps
Main data store
Elasticsearch
Angular JS
Web App*
Public
Access
REST
API**
* UI: first released in December 2015
** API first release in April 2016
https://www.targetvalidation.org
https://api.opentargets.io
Our breakthrough paper
http://nar.oxfordjournals.org/content/45/D1/D985.long
http://www.narbreakthrough.com/
blog.opentargets.org/
@targetvalidate
support@targetvalidation.org
http://tinyurl.com/opentargets-in
Help!
https://tinyurl.com/opentargets-youtube
https://docs.targetvalidation.org/
Acknowledgements
Data sources: GWAS catalog
• Genome Wide Association Studies
• Array-based chips  genotyping 100,000 SNPs genomewide
Details on data sources to associate
targets and diseases
Extra slides
Data sources: UniProt
• Protein: sequence, annotation, function
• Manual curation of coding variants in patients
EMBL-EBI train online
• Variants, genes, phenotypes in rare diseases
• Literature curation  consultant clinical geneticists in the UK
Data sources: Gene2Phenotype
Data sources: UniProt
• Protein: sequence, annotation, function
• Manual curation of coding variants in patients
EMBL-EBI train online
Data sources: PheWAS
• Phenome Wide Association Studies
• A variant associated with multiple phenotypes
• Clinical phenotypes derived from EMR-linked biobank BioVU
• ICD9 codes mapped to EFO
Data sources: GE PanelApp
• Aid clinical interpretation of genomes for the 100K project
• We include ‘green genes’ from version 1+ and phenotypes
Data sources: EVA
• With ClinVar information for rare diseases
• Clinical significance: pathogenic, protective
EMBL-EBI train online
Data sources: The Cancer Gene Census
• Genes with mutations causally implicated in cancer
• Gene associated with a cancer plus other cancers associated
with that gene
Data sources: IntOGen
• Genes and somatic (driver) mutations, 28 cancer types
• Involvement in cancer biology
• Rubio-Perez et al. 2015
Data sources: ChEMBL
• Known drugs linked to a disease and a known target
• FDA approved for clinical trials or marketing
EMBL-EBI train online
Data sources: Reactome
• Biochemical reactions and pathways
• Manual curation of pathways affected by mutations
EMBL-EBI train online
Data sources: SLAPenrich
• 374 pathways curated and mapped to cancer hallmarks
• Divergence of the total number of cancer samples with
genomic alterations
• Mutational burden and total exonic block length of genes
Data sources: PROGENy
• Comparison of pathway activities between normal and primary
samples from The Cancer Genome Atlas
• Inferred from RNA-seq: 9,250 tumour and 741 normal samples
• EGFR, hypoxia, JAK.STAT, MAPK, NFkB, PI3K, TGFb, TNFa,
Trail, VEGF, and p53
Data sources: Expression Atlas
• Baseline expression for human genes
- target profile page
• Differential mRNA expression (healthy versus diseased):
- target-disease associations
EMBL-EBI train online
Data sources: Europe PMC
• Mining titles, abstracts, full text in research articles
• Target and disease co-occurrence in the same sentence
• Dictionary (not NLP)
EMBL-EBI train online
Data sources: PhenoDigm
• Semantic approach to associate mouse models with diseases
Aggregating scores across the data
• Using a mathematical function, the harmonic sum*
where S1,S2,...,Si are the individual sorted evidence scores in descending order
* PMID: 19107201, PMID: 20118918
• Advantages:
A) account for replication
B) deflate the effect of large amounts of data e.g. text mining
Target-Disease Association Score
EuropePMC
(Text Mining)
UniProt
(Manual Curation)
ChEMBL
(Manual Curation)
Overall
VERY simplified diagram

Weitere ähnliche Inhalte

Was ist angesagt?

1. Introduction to epidemiological study designs
1. Introduction to epidemiological study designs1. Introduction to epidemiological study designs
1. Introduction to epidemiological study designsRazif Shahril
 
A Graduate Critical Appraisal Assignment for Athletic Training
A Graduate Critical Appraisal Assignment for Athletic TrainingA Graduate Critical Appraisal Assignment for Athletic Training
A Graduate Critical Appraisal Assignment for Athletic TrainingJohn Parsons
 
PICO in systematic literature review
PICO in systematic literature reviewPICO in systematic literature review
PICO in systematic literature reviewKhalid Mahmood
 
Chapter 3 epidemiologic study design2008
Chapter 3 epidemiologic study design2008Chapter 3 epidemiologic study design2008
Chapter 3 epidemiologic study design2008drtaa62
 
From Big Data to Precision Medicine
From Big Data to Precision Medicine From Big Data to Precision Medicine
From Big Data to Precision Medicine Year of the X
 
Safety Monitoring and Reporting in Clinical Trials DIA Poster 2015
Safety Monitoring and Reporting in Clinical Trials DIA Poster 2015Safety Monitoring and Reporting in Clinical Trials DIA Poster 2015
Safety Monitoring and Reporting in Clinical Trials DIA Poster 2015KCR
 
Data type source presentation im
Data type source presentation imData type source presentation im
Data type source presentation imMohmmedirfan Momin
 
Clinical data management web based data capture edc & rdc
Clinical data management web based data capture edc & rdcClinical data management web based data capture edc & rdc
Clinical data management web based data capture edc & rdcPristyn Research Solutions
 
MT115 Precision Medicine: Integrating genomics to enable better patient outcomes
MT115 Precision Medicine: Integrating genomics to enable better patient outcomesMT115 Precision Medicine: Integrating genomics to enable better patient outcomes
MT115 Precision Medicine: Integrating genomics to enable better patient outcomesDell EMC World
 
Clinical trial design
Clinical trial designClinical trial design
Clinical trial designSandhya Talla
 
Presentation on bias and confouinding
Presentation on bias and confouindingPresentation on bias and confouinding
Presentation on bias and confouindingAashish Deoju
 
Epidemological studies
Epidemological studies Epidemological studies
Epidemological studies bhuvanesh4668
 
Clinical trial study team
Clinical trial study teamClinical trial study team
Clinical trial study teamBharatPatil42
 
Effective PhD research Progress
Effective PhD research ProgressEffective PhD research Progress
Effective PhD research ProgressOmer Mahfoodh
 
Updated - What makes systematic review systematic - Anna Sidorchuk
Updated - What makes systematic review systematic - Anna SidorchukUpdated - What makes systematic review systematic - Anna Sidorchuk
Updated - What makes systematic review systematic - Anna Sidorchukadinafeldman
 

Was ist angesagt? (20)

1. Introduction to epidemiological study designs
1. Introduction to epidemiological study designs1. Introduction to epidemiological study designs
1. Introduction to epidemiological study designs
 
Informed consent process in India
Informed consent process in IndiaInformed consent process in India
Informed consent process in India
 
A Graduate Critical Appraisal Assignment for Athletic Training
A Graduate Critical Appraisal Assignment for Athletic TrainingA Graduate Critical Appraisal Assignment for Athletic Training
A Graduate Critical Appraisal Assignment for Athletic Training
 
Clinical trial design
Clinical trial designClinical trial design
Clinical trial design
 
PICO in systematic literature review
PICO in systematic literature reviewPICO in systematic literature review
PICO in systematic literature review
 
Chapter 3 epidemiologic study design2008
Chapter 3 epidemiologic study design2008Chapter 3 epidemiologic study design2008
Chapter 3 epidemiologic study design2008
 
From Big Data to Precision Medicine
From Big Data to Precision Medicine From Big Data to Precision Medicine
From Big Data to Precision Medicine
 
Safety Monitoring and Reporting in Clinical Trials DIA Poster 2015
Safety Monitoring and Reporting in Clinical Trials DIA Poster 2015Safety Monitoring and Reporting in Clinical Trials DIA Poster 2015
Safety Monitoring and Reporting in Clinical Trials DIA Poster 2015
 
Data type source presentation im
Data type source presentation imData type source presentation im
Data type source presentation im
 
Clinical data management web based data capture edc & rdc
Clinical data management web based data capture edc & rdcClinical data management web based data capture edc & rdc
Clinical data management web based data capture edc & rdc
 
MT115 Precision Medicine: Integrating genomics to enable better patient outcomes
MT115 Precision Medicine: Integrating genomics to enable better patient outcomesMT115 Precision Medicine: Integrating genomics to enable better patient outcomes
MT115 Precision Medicine: Integrating genomics to enable better patient outcomes
 
Clinical trial design
Clinical trial designClinical trial design
Clinical trial design
 
Presentation on bias and confouinding
Presentation on bias and confouindingPresentation on bias and confouinding
Presentation on bias and confouinding
 
Data Extraction
Data ExtractionData Extraction
Data Extraction
 
Responsible Conduct of Research: A Framework for Research Administrators and ...
Responsible Conduct of Research: A Framework for Research Administrators and ...Responsible Conduct of Research: A Framework for Research Administrators and ...
Responsible Conduct of Research: A Framework for Research Administrators and ...
 
Epidemological studies
Epidemological studies Epidemological studies
Epidemological studies
 
Clinical trial study team
Clinical trial study teamClinical trial study team
Clinical trial study team
 
Effective PhD research Progress
Effective PhD research ProgressEffective PhD research Progress
Effective PhD research Progress
 
Epidemiology Study Design
Epidemiology Study DesignEpidemiology Study Design
Epidemiology Study Design
 
Updated - What makes systematic review systematic - Anna Sidorchuk
Updated - What makes systematic review systematic - Anna SidorchukUpdated - What makes systematic review systematic - Anna Sidorchuk
Updated - What makes systematic review systematic - Anna Sidorchuk
 

Ähnlich wie Open Targets, identifying targets for drug development in the treatment of diseases.

Equivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholderEquivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholdermhaendel
 
Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1BigData_Europe
 
Opening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiOpening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiChris Evelo
 
provenance of microarray experiments
provenance of microarray experimentsprovenance of microarray experiments
provenance of microarray experimentsHelena Deus
 
Text Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious DiseasesText Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious DiseasesDan Sullivan, Ph.D.
 
Wikidata for biomedical knowledge integration and curation
Wikidata for biomedical knowledge integration and curationWikidata for biomedical knowledge integration and curation
Wikidata for biomedical knowledge integration and curationGregory Stupp
 
Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1Ann-Marie Roche
 
Usage of open source software for Real World Data Analysis in pharmaceutical ...
Usage of open source software for Real World Data Analysis in pharmaceutical ...Usage of open source software for Real World Data Analysis in pharmaceutical ...
Usage of open source software for Real World Data Analysis in pharmaceutical ...Kees van Bochove
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europeopen_phacts
 
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...Frederik van den Broek
 
NSF Northeast Hub Big Data Workshop
NSF Northeast Hub Big Data WorkshopNSF Northeast Hub Big Data Workshop
NSF Northeast Hub Big Data WorkshopChirag Patel
 
Machine learning in biology
Machine learning in biologyMachine learning in biology
Machine learning in biologyPranavathiyani G
 
Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)Ben Gardner
 
Research Statement Chien-Wei Lin
Research Statement Chien-Wei LinResearch Statement Chien-Wei Lin
Research Statement Chien-Wei LinChien-Wei Lin
 
The Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision MedicineThe Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision Medicinemhaendel
 
Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...
Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...
Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...Perficient, Inc.
 
Unleash transcriptomics to gain insights in disease mechanisms: integration i...
Unleash transcriptomics to gain insights in disease mechanisms: integration i...Unleash transcriptomics to gain insights in disease mechanisms: integration i...
Unleash transcriptomics to gain insights in disease mechanisms: integration i...Maté Ongenaert
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...David Peyruc
 

Ähnlich wie Open Targets, identifying targets for drug development in the treatment of diseases. (20)

Open Targets workshop at C4X in 2019
Open Targets workshop at C4X in 2019Open Targets workshop at C4X in 2019
Open Targets workshop at C4X in 2019
 
Online Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery SystemsOnline Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery Systems
 
Equivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholderEquivalence is in the (ID) of the beholder
Equivalence is in the (ID) of the beholder
 
Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1
 
Opening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiOpening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs api
 
provenance of microarray experiments
provenance of microarray experimentsprovenance of microarray experiments
provenance of microarray experiments
 
Text Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious DiseasesText Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious Diseases
 
Wikidata for biomedical knowledge integration and curation
Wikidata for biomedical knowledge integration and curationWikidata for biomedical knowledge integration and curation
Wikidata for biomedical knowledge integration and curation
 
Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1
 
Usage of open source software for Real World Data Analysis in pharmaceutical ...
Usage of open source software for Real World Data Analysis in pharmaceutical ...Usage of open source software for Real World Data Analysis in pharmaceutical ...
Usage of open source software for Real World Data Analysis in pharmaceutical ...
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
 
NSF Northeast Hub Big Data Workshop
NSF Northeast Hub Big Data WorkshopNSF Northeast Hub Big Data Workshop
NSF Northeast Hub Big Data Workshop
 
Machine learning in biology
Machine learning in biologyMachine learning in biology
Machine learning in biology
 
Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)
 
Research Statement Chien-Wei Lin
Research Statement Chien-Wei LinResearch Statement Chien-Wei Lin
Research Statement Chien-Wei Lin
 
The Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision MedicineThe Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision Medicine
 
Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...
Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...
Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...
 
Unleash transcriptomics to gain insights in disease mechanisms: integration i...
Unleash transcriptomics to gain insights in disease mechanisms: integration i...Unleash transcriptomics to gain insights in disease mechanisms: integration i...
Unleash transcriptomics to gain insights in disease mechanisms: integration i...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
 

Mehr von Denise Carvalho-Silva, PhD

EMBL-EBI at Plant and Animal Genome conference
EMBL-EBI at Plant and Animal Genome conference EMBL-EBI at Plant and Animal Genome conference
EMBL-EBI at Plant and Animal Genome conference Denise Carvalho-Silva, PhD
 
Genes and Transcripts: Ensembl Online Webinar series
Genes and Transcripts: Ensembl Online Webinar series Genes and Transcripts: Ensembl Online Webinar series
Genes and Transcripts: Ensembl Online Webinar series Denise Carvalho-Silva, PhD
 
Variation and the VEP: Ensembl Online Webinar series
Variation and the VEP: Ensembl Online Webinar seriesVariation and the VEP: Ensembl Online Webinar series
Variation and the VEP: Ensembl Online Webinar seriesDenise Carvalho-Silva, PhD
 
Browsing Genes, Variation and Regulation data with Ensembl
Browsing Genes, Variation and Regulation data with EnsemblBrowsing Genes, Variation and Regulation data with Ensembl
Browsing Genes, Variation and Regulation data with EnsemblDenise Carvalho-Silva, PhD
 

Mehr von Denise Carvalho-Silva, PhD (6)

Open Targets talk at the BSPR 2019 meeting
Open Targets talk at the BSPR 2019 meetingOpen Targets talk at the BSPR 2019 meeting
Open Targets talk at the BSPR 2019 meeting
 
Ensembl Browser Workshop
Ensembl Browser WorkshopEnsembl Browser Workshop
Ensembl Browser Workshop
 
EMBL-EBI at Plant and Animal Genome conference
EMBL-EBI at Plant and Animal Genome conference EMBL-EBI at Plant and Animal Genome conference
EMBL-EBI at Plant and Animal Genome conference
 
Genes and Transcripts: Ensembl Online Webinar series
Genes and Transcripts: Ensembl Online Webinar series Genes and Transcripts: Ensembl Online Webinar series
Genes and Transcripts: Ensembl Online Webinar series
 
Variation and the VEP: Ensembl Online Webinar series
Variation and the VEP: Ensembl Online Webinar seriesVariation and the VEP: Ensembl Online Webinar series
Variation and the VEP: Ensembl Online Webinar series
 
Browsing Genes, Variation and Regulation data with Ensembl
Browsing Genes, Variation and Regulation data with EnsemblBrowsing Genes, Variation and Regulation data with Ensembl
Browsing Genes, Variation and Regulation data with Ensembl
 

Kürzlich hochgeladen

Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》rnrncn29
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPirithiRaju
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptxpallavirawat456
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...lizamodels9
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxmaryFF1
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxJorenAcuavera1
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
 
Servosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by PetrovicServosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by PetrovicAditi Jain
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPirithiRaju
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubaikojalkojal131
 
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXDole Philippines School
 
Thermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptxThermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptxuniversity
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxBerniceCayabyab1
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...D. B. S. College Kanpur
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentationtahreemzahra82
 
Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...navyadasi1992
 

Kürzlich hochgeladen (20)

Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptx
 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -I
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by na
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptx
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
 
Servosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by PetrovicServosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by Petrovic
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
 
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
 
Thermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptxThermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptx
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentation
 
Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...
 

Open Targets, identifying targets for drug development in the treatment of diseases.

  • 1. Open Targets: integrating genetics and genomics for disease biology and translational medicine
  • 2. What is the Open Targets Partnership? How to navigate the Open Targets Platform? Aims Are there other Open Targets tools? Where do I get help?
  • 3. Source: PhRMA adaptation based on Tufts CSDD & School of Medicine, and FDA Lengthy, costly, low success rate, HIGH ATTRITION RATES Drug discovery: some challenges
  • 4. Public databases for drug discovery • EMBL-EBI (European Bioinformatics Institute) • Elsewhere
  • 5. Fit everything together • Time consuming • Possible lack of resources or expertise • …
  • 6. I wish I did not have to go to all those different places to get the information I’m after. I know. If we only had a one-stop shop with as much data as possible, plus new analyses and links to the original source for my own assessment. A resource that is comprehensive, trustworthy, up-to-date, sustainable, easy-to-use and free. Open Targets is all you need!
  • 7. Our Vision A partnership to transform drug discovery through the systematic identification and prioritisation of targets https://www.opentargets.org 2014 2016 2017 2018
  • 8. Data generation Therapeutic hypothesis Public data Data integration targetvalidation.org Experimental projects Bioinformatic projects Virtuous cycle in Open Targets www.opentargets.org/projects Concurrent
  • 9. Open Targets generates data www.opentargets.org/science/ • EMBL-EBI and Sanger Institute • > 1,000 cancer cell lines + drug sensitivity data • RNASeq, CRISPR screens • Sanger Institute and GSK • Genome wide knockouts in gut epithelium • Organoids, metagenomics • Alzheimer’s and Parkinson’s • CRISPR screens, iPS cells • Sanger Institute, Biogen, Gurdon
  • 10. Open Targets integrates data* • EMBL-EBI, Biogen, GSK • Associations between targets and diseases • Germline variants • Somatic mutations • Drug information • RNA expression • Animal models • Text mining * Publicly available resources In addition to upcoming Open Targets experimental data
  • 11. Open Targets Platform https://www.targetvalidation.org • Associations between targets and diseases…
  • 12. • Ensembl Gene IDs e.g. ENSGXXXXXXXXXXX • UniProt IDs e.g P15056 • HGNC names e.g. DMD • Also non-coding RNA genes Our targets  genes or proteins
  • 13. • Modified version of Experimental Factor Ontology (EFO) • Controlled vocabulary (Alzheimers versus Alzheimer’s) • Hierarchy (relationships) Our diseases • Promotes consistency • Increases the richness of annotation • Allow for easier and automatic integration
  • 14. Evidence for our T-D associations https://docs.targetvalidation.org/data-sources/data-sources
  • 15. Data sources grouped into data types Genetic Associations Somatic Mutations Drugs Affected Pathways Differentia l RNA expression Animal Models Text Mining EVA GWAS Catalog PheWA S Cancer Gene Census EVA Expression Atlas PhenoDigm Europe PMC G2P
  • 16. How the data* flows JSON summary document Validator Association score calculation Target Profile Disease profile * e.g. genetic variants from NHGRI-GWAS catalog
  • 17. JSON summary document * IDs (gene, disease, papers) + curation (e.g. manual) + evidence + source + stats for the score JSON
  • 18. Association score Which targets have more evidence for an association? What is the relative weight of the evidence for different targets?
  • 19. Statistical integration, aggregation and scoring Four-tier scoring framework https://docs.targetvalidation.org/getting-started/scoring A) per evidence (e.g. one SNP from a GWAS paper) B) per data source (e.g. SNPs from the GWAS catalog) C) per data type (e.g. Genetic associations) D) overall
  • 20. EVA UniProt Gene2Phenotype GWAS catalog Cancer Gene Census EVA (somatic) IntOGen ChEMBL Reactome Expression Atlas Europe PMC PhenoDigm Genetic associations Somatic mutations RNA expression Animal models Affected pathways Text mining Drugs *1.0 *1.0 *1.0 *1.0 *1.0 *1.0 *1.0 *1.0 *1.0 *0.5 *0.2 *0.2 Association S1 + S2/22 + S3/32 + S4/42 + Si/i2 ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH ΣH Genomics England PhEWAS catalog *1.0 *1.0 ΣH ΣH Calculated at 4 levels: •Evidence •Data source •Data type •Overall Score: 0 to 1 (max) weight factor Aggregation with (harmonic sum) ΣH Note: Each data set has its own scoring and ranking scheme Aggregating data  harmonic sum
  • 21. f = sample size (cases versus controls) s = predicted functional consequence (VEP) c = p value reported in the paper Factors affecting the relative strength of an evidence e.g. GWAS Catalog S = f * s * c f, relative occurrence of a target-disease evidence s, strength of the effect described by the evidence c, confidence of the observation for the target-disease evidence https://docs.targetvalidation.org/getting-started/scoring
  • 22. Ranking target-disease association Association score: the overall score across all data types • Based on the data sources • Different weight applied: genetic association = drugs = mutations = pathways > RNA expression > animal models = text mining
  • 23. https://www.targetvalidation.org/ Demo 1: Disease centric workflow What is the evidence for the association between a target and a disease? Which targets are associated with a disease? Pages 7 - 30
  • 24. In addition to T-D associations • Everything you wanted to know about… … but were afraid to ask. Disease profile page Target profile page
  • 25. Target profile page* Protein Drugs Pathways interactions RNA and protein baseline expression Variants, isoforms and genomic context Mouse phenotypes Bibliography Description Synonyms Gene Ontology Protein Structure Protein Interactions Similar Targets Expression Atlas Library/LINK Extra, extra, extra! Cancer hallmarks and cancer biomarkers Gene tree * e.g. http://www.targetvalidation.org/target/ENSG00000141510
  • 26. Classification Drugs Similar diseases Bibliography Open Targets Library/LINK Disease profile page* * e.g. http://www.targetvalidation.org/disease/Orphanet_262
  • 27. Modes of data access
  • 28. We have a list of 26 possible targets for inflammatory bowel disease? https://tinyurl.com/batch-video Demo 2: Batch search Are these targets represented in other diseases? Which pathways are represented in this set of targets?
  • 29. https://api.opentargets.io/v3/platform/public/evidence/filter? target=ENSG00000141867&disease=EFO_0000565&datatype=expression_atla s&size=100&format=json REST API calls: some examples* https://api.opentargets.io/v3/platform/public/search?q=EFO_0003767 https://api.opentargets.io/v3/platform/public/association/filter? target=ENSG00000110324&direct=false&fields=is_direct&fields=disease.efo_info.la bel&size=100 * blog.opentargets.org/tag/api/ https://api.opentargets.io/v3/platform/public/search?q=asthma
  • 30. Breaking down the URLs https://api.opentargets.io/v3/platform/ public/association/filter ?target=ENSG00000163914&size=10000&fields=target.id&fields=disease.id Server Endpoint parameters Parameters https://api.opentargets.io/v3/platform/public/association/filter? target=ENSG00000163914&size=10000&fields=target.id&fields=disease.id
  • 31. http://api.opentargets.io/v3/platform/docs# The documentation Private: methods used by the UI to serve external data. Subject to change without notice http://api.opentargets.io/v3/platform/docs#
  • 32. How to search http://api.opentargets.io/v3/platform/docs REST API: some use cases How to get all diseases associated with a target How to get the association score for a target – disease pair How to get the evidence for a target – disease association
  • 36. How to get the evidence for an association http://api.opentargets.io/v3/platform/ public/evidence/filter ?target=ENSG00000171862 http://api.opentargets.io/v3/platform /public/evidence/filter? target=ENSG00000171862 &disease=Orphanet_2563
  • 37. Introduction: REST API webinar https://youtu.be/KQbfhwpeEvc
  • 38. How to run our REST endpoints * http://opentargets.readthedocs.io/en/stable/index.html • Paste the URL in the location bar in a browser • Use the terminal window (e.g. with CURL command) • Use our free clients (i.e. Python* and R**) • Call them from your own application/workflow ** no longer supported by Open Targets; feel free to make PRs
  • 39. Paste the URL in the a location bar
  • 40.
  • 41. Command line e.g. CURL –X GET
  • 42. Python and R clients for the REST API http://opentargets.readthedocs.iohttp://opentargets.readthedocs.io No longer supported by Open Targets No longer supported by Open Targets
  • 43. Can we change the way the associations are scored? Perhaps to increase the weight on text mining data? Yes, you can with the Open Targets Python client!
  • 45. Open Targets toolkit: LINK • LINK: Literature coNcept Knowledgebase • Subject / predicate / object structured relations From PubMed abstracts Proof of Concept Further developement http://link.opentargets.io/
  • 46. Addressing text mining shortcomings • Entities: genes, diseases, drugs • Concepts extracted via NLP (Natural Language Processing) • 28 M documents, 500 M relations • http://blog.opentargets.org/link/
  • 47. Open Targets toolkit: DoRothEA • Candidate TF-drug interactions in cancer • 1000 cancer cell lines • 265 anti-cancer compounds • 127 transcription factors http://cancerres.aacrjournals.org/content/early/2017/12/09/0008-5472.CAN-17-1679 dorothea.opentargets.io
  • 48. Example: Rapamycin • ~ 1000 cancer cell lines • 265 anti-cancer compounds • 127 transcription factors
  • 49. • Resource of integrated multiomics data • Added value (e.g. score) and links to original sources • Graphical web interface: easy to use April 2018 release Open Targets Platform 21K targets 9.7K diseases 2.3 M associations 6.1 M evidence
  • 50. Data generation Therapeutic hypothesis Public data Data integration targetvalidation.org Experimental projects Bioinformatic projects Virtuous cycle in Open Targets www.opentargets.org/projects Concurrent
  • 51. We support decision-making Which targets are associated with a disease? Are there FDA drugs for this association? … Can I find out about the mechanisms of the disease?
  • 52. How to access the Platform Core bioinformatics pipelines www.opentargets.org/projects Experimental projects Generate new evidence CRISPR/Cas9 Organoids and IPS cells (cellular models for disease) Integration of available data Web interface Batch search tool REST API Data dumps Main data store Elasticsearch Angular JS Web App* Public Access REST API** * UI: first released in December 2015 ** API first release in April 2016 https://www.targetvalidation.org https://api.opentargets.io
  • 56. Data sources: GWAS catalog • Genome Wide Association Studies • Array-based chips  genotyping 100,000 SNPs genomewide
  • 57. Details on data sources to associate targets and diseases Extra slides
  • 58. Data sources: UniProt • Protein: sequence, annotation, function • Manual curation of coding variants in patients EMBL-EBI train online
  • 59. • Variants, genes, phenotypes in rare diseases • Literature curation  consultant clinical geneticists in the UK Data sources: Gene2Phenotype
  • 60. Data sources: UniProt • Protein: sequence, annotation, function • Manual curation of coding variants in patients EMBL-EBI train online
  • 61. Data sources: PheWAS • Phenome Wide Association Studies • A variant associated with multiple phenotypes • Clinical phenotypes derived from EMR-linked biobank BioVU • ICD9 codes mapped to EFO
  • 62. Data sources: GE PanelApp • Aid clinical interpretation of genomes for the 100K project • We include ‘green genes’ from version 1+ and phenotypes
  • 63. Data sources: EVA • With ClinVar information for rare diseases • Clinical significance: pathogenic, protective EMBL-EBI train online
  • 64. Data sources: The Cancer Gene Census • Genes with mutations causally implicated in cancer • Gene associated with a cancer plus other cancers associated with that gene
  • 65. Data sources: IntOGen • Genes and somatic (driver) mutations, 28 cancer types • Involvement in cancer biology • Rubio-Perez et al. 2015
  • 66. Data sources: ChEMBL • Known drugs linked to a disease and a known target • FDA approved for clinical trials or marketing EMBL-EBI train online
  • 67. Data sources: Reactome • Biochemical reactions and pathways • Manual curation of pathways affected by mutations EMBL-EBI train online
  • 68. Data sources: SLAPenrich • 374 pathways curated and mapped to cancer hallmarks • Divergence of the total number of cancer samples with genomic alterations • Mutational burden and total exonic block length of genes
  • 69. Data sources: PROGENy • Comparison of pathway activities between normal and primary samples from The Cancer Genome Atlas • Inferred from RNA-seq: 9,250 tumour and 741 normal samples • EGFR, hypoxia, JAK.STAT, MAPK, NFkB, PI3K, TGFb, TNFa, Trail, VEGF, and p53
  • 70. Data sources: Expression Atlas • Baseline expression for human genes - target profile page • Differential mRNA expression (healthy versus diseased): - target-disease associations EMBL-EBI train online
  • 71. Data sources: Europe PMC • Mining titles, abstracts, full text in research articles • Target and disease co-occurrence in the same sentence • Dictionary (not NLP) EMBL-EBI train online
  • 72. Data sources: PhenoDigm • Semantic approach to associate mouse models with diseases
  • 73. Aggregating scores across the data • Using a mathematical function, the harmonic sum* where S1,S2,...,Si are the individual sorted evidence scores in descending order * PMID: 19107201, PMID: 20118918 • Advantages: A) account for replication B) deflate the effect of large amounts of data e.g. text mining
  • 74. Target-Disease Association Score EuropePMC (Text Mining) UniProt (Manual Curation) ChEMBL (Manual Curation) Overall VERY simplified diagram

Hinweis der Redaktion

  1. PhRMA ( Lengthy: Drug discovery from start (idea) to finish (market) can take up to 20 years. Costly: specially from clinical trials when human subjects are tested (volunteers or suffering from the condition) Compounds drop out along the drug discovery journey  low success, high attrition
  2. If you work in early stages of drug discovery, which databases or resources do you rely on? Some of the DBs out there.
  3. Fit everything to generate therapeutic hypothesis. Going through those resources takes time, my group may not have allocated resources (computing) or expertise (bioinformatics, ML, NLP), we are working on our own (in one specific lab) so it’s not multidisciplinary. Can workshop attendees list others?
  4. A shared vision to create a partnership to transform drug discovery through systematics target ID and prioritisation. From that idea Open Targets (formerly CTTV) was started to combine the world class functional genomics expertise at Sanger with the EBI‘s excellence in bioinformatics and computational biology and the industry expertise of GSK and now Biogen as well with the aim to find out all that‘s important about a target before starting the drug development pipeline. Thereby the pipeline would be filled with validated and valid targets with a much improved odds ratio for success.
  5. Target ID and prioritisation knowledge cycle. Our research focusses on data generation, data integration, and enabled data access Our research is connected as we provide the tools to generate hypothesis and explore them experimentally We focus primarily on oncology, immunology (including IBD) neurodegeneration with some experiments in other areas or across diseases
  6. The choice of our disease ontology was EFO. It is cross-referenced against DO, OMIM, HP, MP, MONDO as well
  7. Evidence: sequence (SNPs, mutations, germline and somatic curated at the DNA or protein level), differential mRNA expression, sentences from research papers, drug information, pathways affected by pathogenic mutation, animal models (https://github.com/opentargets/json_schema/blob/master/doc/instructions.md)
  8. Blueprint data included in baseline expression
  9. Highlight the REST
  10. http://api.opentargets.io/v3/platform/public/association?id=ENSG00000171862-EFO_0000616
  11. http://api.opentargets.io/v3/platform/public/association?id=ENSG00000171862-EFO_0000616 http://api.opentargets.io/v3/platform/public/association/filter?target=ENSG00000171862
  12. http://api.opentargets.io/v3/platform /public/evidece/filter? target=ENSG00000171862 &disease=Orphanet_2563 http://api.opentargets.io/v3/platform/public/association?id=ENSG00000171862-EFO_0000616 http://api.opentargets.io/v3/platform/public/association/filter?target=ENSG00000171862
  13. Target ID and prioritisation knowledge cycle. Our research focusses on data generation, data integration, and enabled data access Our research is connected as we provide the tools to generate hypothesis and explore them experimentally We focus primarily on oncology, immunology (including IBD) neurodegeneration with some experiments in other areas or across diseases
  14. Somatic mutations from cancer drivers genes across 28 tumor types. We have imported data (2014.12) from 6,792 samples spanning 28 cancer types (Rubio-Perez et al)."
  15. Linear regression model from 14 tumour types.
  16. Back up