Patient-led deep phenotyping using a lay-friendly version of the Human Phenotype Ontology

Patient-led deep phenotyping using a lay-friendly
version of the Human Phenotype Ontology
Melissa Haendel@ontowonka
@monarchinit

The Human Phenotype Ontology
 11,813
phenotype
terms
 127,125 rare
disease -
phenotype
annotations
 136,268
common
disease -
phenotype
annotations
bit.ly/hpo-paper
Peter Robinson, Sebastian Koehler, Chris Mungall

Exomiser: Free, secure, and better than ever
Validated for
the most
difficult
diagnoses; top
candidate
correct in 67%
of cases and
executes in
under 1 minute.
Exomiser
V 10.0
March 2018
bit.ly/exomiser-10

Fuzzy phenotype matching
DOI: 10.1126/scitranslmed.3009262

How much phenotyping is enough?
Enlarged ears (2)Dark hair (6) Female (4)
Male (4)
Blue skin (1)
Pointy ears (1)
Hair absent on head (1)
Horns present (1)
Hair present
on head (7)
Enlarged lip (2)
Increased skin
pigmentation (3)
bit.ly/annotationsufficiency

Patients should be empowered to help.

Example Lay-Synonyms
doi:10.5281/zenodo.1168737
Nature Genetics In press

Layperson disease coverage
4,555 terms in the HPO are annotated with at least
one lay person synonym (35.4% coverage)
7,607 number of lay person synonyms total
60% of all disease annotations (73,932 of 122,120)
are referring to HPO terms with lay translations

Simulated patient self-
reported HPO profile
HPO reference profile
Comparison
Determine diagnostic capability of GenomeConnect survey
and layperson app
Disease 1
Disease 2
Disease 3
Disease 4
Disease 7700
Simulated
patient

Survey
Mapping
Clinical
HPO
Diagnosticcapability
Phenotype Profile Granularity
Layperson
HPO
How diagnostically useful are the HPO terms
generated by patients?

Groups
Analysis
How well can we describe a disease using plain language
synonyms only?
• 4555 HPO terms with a lay person synonym
• 216 terms in Genome Connect Survey
Analysis
• Semantic similarity to 7667 rare diseases
• What is the drop in annotation sufficiency?
• What diseases are enriched?

Example layperson HPO and GC
comparison
doi:10.6084/m9.figshare.5513356.v2

Similarity of Lay person and GC Subsets to Gold
Standard Annotations
0
500
1000
1500
2000
2500
3000
3500
4000 Layperson
Genome
Connect
NumberofDiseases
Phenotypic similarity range
(in deciles)
6944
402
321
HP outperforms GC
HP and GC equivalent
GC outperforms HP
HPO layperson subset
outperforms the Genome
Connect subset in more than 95%
of diseases.
Why the gaps?
1) existing HPO terms that should be layperson eligible (like
schizophrenia) but are not designated in this way yet
2) GC terms which aren't really layperson or borderline, such as
"abnormal EEG”
3) GC phenotypic descriptions that don’t distill into short lay-friendly
labels
Number of diseases by
performance category

Disease Enrichment
Analysis of Terms with Lay
Synonyms
2.35E-36 rare developmental defect during embryogenesis
1.29E-35 disorder of development or morphogenesis
2.84E-35 rare genetic developmental defect during embryogenesis
1.43E-25 bone disease
9.06E-25 connective tissue disease
4.47E-20 multiple congenital anomalies/dysmorphic syndrome
4.14E-19 dysostosis
1.72E-18 musculoskeletal system disease
4.09E-18 skeletal system disease
6.11E-18 congenital limb malformation
*2,917 of 4,555 phenotypes with direct
Disease annotations

Layperson-HPO driven phenotyping tool
https://www.pcori.org/research-results/2017/realization-standard-care-rare-
diseases-using-patient-engaged-phenotyping

Matchmaker Exchange:
for patients, diseases, and model organisms
Patients can assist in matching themselves to other n-of-1 patients globally
bit.ly/mme-matchbox
patientarchive.org
bit.ly/exomiser-2017
bit.ly/exomiser-10

Conclusions
For over 95% of the 7,000+ diseases, layperson
phenotype subsets were more diagnostically
useful than Genome Connect phenotype subsets
Patients are a key source of phenotypic information
and both patients and clinicians need to participate
(we think!) to maximize diagnostic capabilities
Some diseases have low coverage and require
diagnosis by a physician. (eg. glutathione synthetase
deficiency, beta-thalassemia, epithelial basement
membrane dystrophy)
<
Next steps: Test with a disease search/classifier
o How confident are we in the diagnosis?
o Is the disease the top hit
o What is the confidence of the follow up hits
o Add noise

www.monarchinitiative.org
Funding:
PCORI: PCORI ME-1511-33184
NIH Office of Director: 2R24OD011883
Nicole Vasilevsky
Kent Shefchek
Erin Foster
Mark Englestad
Peter Robinson
Sebastian Koeller
Chris Mungall
Jim Balhoff
Ingrid Holm
Catherine Brownstein
Kayli Rageth
Julie McMurry

Patient-led deep phenotyping using a lay-friendly version of the Human Phenotype Ontology

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Patient-led deep phenotyping using a lay-friendly version of the Human Phenotype Ontology

Similar to Patient-led deep phenotyping using a lay-friendly version of the Human Phenotype Ontology (20)

More from mhaendel

More from mhaendel (20)

Recently uploaded

Recently uploaded (20)

Patient-led deep phenotyping using a lay-friendly version of the Human Phenotype Ontology

Editor's Notes