SlideShare ist ein Scribd-Unternehmen logo
1 von 1
Ontologies for Semantic Normalization of Immunological Data
Yannick Pouliot1, Atul J. Butte1,2
1. Division of Systems Medicine, Stanford University School of Medicine
2. Center for Pediatric Bioinformatics, Lucile Packard Children’s Hospital, Palo Alto, California

The Problem
Data from experiments probing the immune system are inherently
complex because of the diversity of data types, assay types and the
number of biological agents involved. This complexity is further
increased by the multi-center nature of data generated by HIPC.
One of the goals of HIPC is to deliver a database able to support
broad community access to these complex data sets. Critical to the
success of this database will be its ability to provide conceptual
characterizations of experiments and their results (“data and
metadata encoding”). Such encodings identify data sets according to
experimental properties so that users can quickly narrow their
searches to the most pertinent results.
To this end, conceptual encoding that rely on “industry-standard”
ontologies are preferred is the best way to achieve this. We
determined the extent to which existing ontologies can be used to
encode HIPC data, and ImmPort’s ability to support the application of
these concepts.
Since ImmPort will be the repository of HIPC data, we evaluated its
use of ontologies. Upon determining that ImmPort is not ontologycompliant, we analyzed the universe of ontologies to determine the
extent to which existing ontologies can be used to encode HIPC data,
and ImmPort’s ability to support the application of these concepts.

Ontologies provided by BioPortal1 were first selected
based on their domain of application:
Domain

Description

Example Ontology

Analysis
Anatomy

Process of data analysis
Anatomical structures at all level of
resolution except molecular
Disease states manifested by organisms at
anatomical, spatial, temporal and
functional levels
Conditions/specifications associated with
a scientific or clinical protocol
Process/properties/data types of
modeling, computational or otherwise
Aspects of biomolecules : structure,
sequence, function
Biochemical, signaling pathways used by
organisms
States manifested by organisms at
anatomical, spatial, temporal and
functional levels. “Anatomy” and
“Disease” are components of
“Phenotype” but treated distinctly

Ontology of Data Mining
Cell Ontology

Experimental
conditions
Modeling
Molecule
Pathways
Phenotype

Criterium

Parameter

Design

Must be integrative (ability to draw on similar/identical
concepts from other ontologies)
Must minimize overlap with other ontologies, consistent with
providing terms that can be inter-related across ontologies
Should be “relatable” to clinical applications
Must be applicable to humans, and perhaps one animal model
Must be an ontology, not just a controlled vocabulary (with a
few exceptions)
Evidence of ongoing development and maintenance

Developmental
state

Usage
Content

Developed with, accepted by standards or professional
organizations
Adheres to standards such as the Basic Formal Ontology
Must be released (post beta)
Reasonably widely adopted
Must exhibit a good balance between expressiveness and
usability/understandability:
Conceptual clarity (e.g., no ambiguous classifications for dualuse organs such as reproductive/urinary organs)
Limited redundancy of synonymous concepts
Usable definitions of concepts describing HIPC data or
metadata, including experiment design
Completeness (frequency of missing concepts)
Correctness (how accurately is a concept is expressed)

These ontologies were then analyzed for their ability to
recognize terms from text obtained from Build 1 datasets,
protocols, and metadata, as well as from Stanford’s DataMt
database2 (which stores many Stanford HIPC datasets). An
automated pipeline that relies on the National Center for
Biomedical Ontology’s Annotator3 was written that relies on
BioPortal’s Web services to parse the text and attempt to
map to the reference ontologies.

Methods

Disease

57 Ontologies within these domains were then
screened for preliminary suitability to HIPC data/metadata
according to four broad criteria:

Infectious Disease Ontology (IDO)

Ontology of Clinical Research
(OCRe)
Interaction Network Ontology
(INO)
ONTIE - Ontology of Immune
Epitopes
Pathway Ontology
Phenotypic Quality

Results & Discussion
Ontology
NCI Thesaurus
Medical Subject Headings
Molecule role
SNOMED Clinical Terms
PRotein Ontology (PRO)
Cell Cycle Ontology
Ontology for Biomedical Investigations
Experimental Factor Ontology
SemanticScience Integrated Ontology
Units of measurement
Phenotypic quality
EDAM
Foundational Model of Anatomy
Vaccine Ontology
ICPC-2 PLUS
MGED Ontology
Measurement Method Ontology
Gene Ontology
Ontology of Clinical Research (OCRe)
Protein-protein interaction
Mammalian phenotype

Build 1
(%)
16.7%
7.1%
14.3%
2.4%
16.7%
7.1%
2.4%

n
7
3
6
1
7
3
1

ImmPort
CV
(%)
33.3%
21.4%
0.0%
11.9%
0.0%
0.0%
14.3%
7.1%
2.4%
2.4%
7.1%
2.4%
2.4%
2.4%
2.4%

n
14
9

5

6
3

1
1
3

1

Data Mt
(%)
52.6%
22.9%
24.6%
17.7%
10.3%
12.6%
5.7%
7.4%
2.9%
2.3%
2.3%
0.6%
2.3%
1.1%
1.1%

n
92
40
43
31
18
22
10
13
5
4
4
1
4
2
2

1
1
1

0.6%
0.6%
0.6%

1
1
1

• Many mapping failures attributable to lack of definition for commercial objects
within the reference ontologies (e.g., “Anti-CD27” antibody from BD)
Solution: Contacting ontology owners to have them add commercial terms
to their ontologies
• Many mapping failures are easily correctable
Example: Adding a pre-processor able to recognized instances of the “anti-“
problem (e.g., “anti-CD20” not recognized even though “CD20” is known)
We conclude that ImmPort should be able to migrate toward ontologically-based
encodings.

References
1. Noy et al., (2009) “BioPortal: Ontologies and Integrated Data resources at the Click of a
Mouse”, Nucl. Acids Res., 37:W170-W173.
2. Siebert, J., Munsil, D. & Maecker, H. (2011) "A Novel Approach for Integrating and Exploring
Heterogeneous Translational Data", manuscript in preparation.
3. Jonquet et al., (2009) “The Open Biomedical Annotator”, Summit on Transla. Bioinfo., 56-60.

Acknowledgments
NIAID, Hewlett Packard Foundation, Butte Lab

Weitere ähnliche Inhalte

Was ist angesagt?

Chapter 1. Introduction
Chapter 1. IntroductionChapter 1. Introduction
Chapter 1. Introduction
butest
 
I NTRODUCTION.doc
I NTRODUCTION.docI NTRODUCTION.doc
I NTRODUCTION.doc
butest
 
Колкер Е. An introduction to MOPED: Multi-Omics Profiling Expression Database
Колкер Е. An introduction to MOPED: Multi-Omics Profiling Expression DatabaseКолкер Е. An introduction to MOPED: Multi-Omics Profiling Expression Database
Колкер Е. An introduction to MOPED: Multi-Omics Profiling Expression Database
bigdatabm
 
IEEE AP-MTT OSU Seminar
IEEE AP-MTT OSU SeminarIEEE AP-MTT OSU Seminar
IEEE AP-MTT OSU Seminar
Ogan Gurel MD
 

Was ist angesagt? (20)

Text and data mining
Text and data miningText and data mining
Text and data mining
 
Machine learning in biology
Machine learning in biologyMachine learning in biology
Machine learning in biology
 
X-Meeting Poster 2015 - Vallys A Coverage tool
X-Meeting Poster 2015 - Vallys A Coverage toolX-Meeting Poster 2015 - Vallys A Coverage tool
X-Meeting Poster 2015 - Vallys A Coverage tool
 
Ai and biology
Ai and biologyAi and biology
Ai and biology
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Uses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in BioinformaticsUses of Artificial Intelligence in Bioinformatics
Uses of Artificial Intelligence in Bioinformatics
 
GENOMICS
GENOMICSGENOMICS
GENOMICS
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Ontology-Driven Clinical Intelligence: Removing Data Barriers for Cross-Disci...
Ontology-Driven Clinical Intelligence: Removing Data Barriers for Cross-Disci...Ontology-Driven Clinical Intelligence: Removing Data Barriers for Cross-Disci...
Ontology-Driven Clinical Intelligence: Removing Data Barriers for Cross-Disci...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Chapter 1. Introduction
Chapter 1. IntroductionChapter 1. Introduction
Chapter 1. Introduction
 
I NTRODUCTION.doc
I NTRODUCTION.docI NTRODUCTION.doc
I NTRODUCTION.doc
 
Application of blockchain technology in healthcare and biomedicine
Application of blockchain technology in healthcare and biomedicineApplication of blockchain technology in healthcare and biomedicine
Application of blockchain technology in healthcare and biomedicine
 
Колкер Е. An introduction to MOPED: Multi-Omics Profiling Expression Database
Колкер Е. An introduction to MOPED: Multi-Omics Profiling Expression DatabaseКолкер Е. An introduction to MOPED: Multi-Omics Profiling Expression Database
Колкер Е. An introduction to MOPED: Multi-Omics Profiling Expression Database
 
03 Guerra, Rudy
03 Guerra, Rudy03 Guerra, Rudy
03 Guerra, Rudy
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
 
IEEE AP-MTT OSU Seminar
IEEE AP-MTT OSU SeminarIEEE AP-MTT OSU Seminar
IEEE AP-MTT OSU Seminar
 
NetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David AmarNetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David Amar
 

Ähnlich wie Ontologies for Semantic Normalization of Immunological Data

Introducción a la bioinformatica
Introducción a la bioinformaticaIntroducción a la bioinformatica
Introducción a la bioinformatica
Martín Arrieta
 
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Adam Ford
 
Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14
mhaendel
 
My ontology is better than yours! Building and evaluating ontologies for inte...
My ontology is better than yours! Building and evaluating ontologies for inte...My ontology is better than yours! Building and evaluating ontologies for inte...
My ontology is better than yours! Building and evaluating ontologies for inte...
Robert Hoehndorf
 
BRN Seminar 13/06/16 12 Revolutionizing healthcare management
BRN Seminar 13/06/16 12 Revolutionizing healthcare managementBRN Seminar 13/06/16 12 Revolutionizing healthcare management
BRN Seminar 13/06/16 12 Revolutionizing healthcare management
brnmomentum
 
Metrological experiments in biomarker development (mass spectrometry statisti...
Metrological experiments in biomarker development (mass spectrometry statisti...Metrological experiments in biomarker development (mass spectrometry statisti...
Metrological experiments in biomarker development (mass spectrometry statisti...
National Institute of Biologics
 

Ähnlich wie Ontologies for Semantic Normalization of Immunological Data (20)

2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...
 
IMMUNOINFORMATICS , MICROARRAY and Machine Learning - All about Immunology, I...
IMMUNOINFORMATICS , MICROARRAY and Machine Learning - All about Immunology, I...IMMUNOINFORMATICS , MICROARRAY and Machine Learning - All about Immunology, I...
IMMUNOINFORMATICS , MICROARRAY and Machine Learning - All about Immunology, I...
 
Online Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery SystemsOnline Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery Systems
 
Bioinformatics: Building the cornerstones of Sequence Homology and its use fo...
Bioinformatics: Building the cornerstones of Sequence Homology and its use fo...Bioinformatics: Building the cornerstones of Sequence Homology and its use fo...
Bioinformatics: Building the cornerstones of Sequence Homology and its use fo...
 
Introducción a la bioinformatica
Introducción a la bioinformaticaIntroducción a la bioinformatica
Introducción a la bioinformatica
 
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
 
Role of bioinformatics of drug designing
Role of bioinformatics of drug designingRole of bioinformatics of drug designing
Role of bioinformatics of drug designing
 
Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14
 
Bioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST ToolBioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST Tool
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
 
Bioinformatics .pptx
Bioinformatics .pptxBioinformatics .pptx
Bioinformatics .pptx
 
My ontology is better than yours! Building and evaluating ontologies for inte...
My ontology is better than yours! Building and evaluating ontologies for inte...My ontology is better than yours! Building and evaluating ontologies for inte...
My ontology is better than yours! Building and evaluating ontologies for inte...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Gellibolian 2010 Audio Visual2
Gellibolian 2010 Audio Visual2Gellibolian 2010 Audio Visual2
Gellibolian 2010 Audio Visual2
 
BRN Seminar 13/06/16 12 Revolutionizing healthcare management
BRN Seminar 13/06/16 12 Revolutionizing healthcare managementBRN Seminar 13/06/16 12 Revolutionizing healthcare management
BRN Seminar 13/06/16 12 Revolutionizing healthcare management
 
Revolutionizing Heathcare and Wellness Management through Systems P4 Medicine
Revolutionizing Heathcare and Wellness Management through Systems P4 MedicineRevolutionizing Heathcare and Wellness Management through Systems P4 Medicine
Revolutionizing Heathcare and Wellness Management through Systems P4 Medicine
 
Metrological experiments in biomarker development (mass spectrometry statisti...
Metrological experiments in biomarker development (mass spectrometry statisti...Metrological experiments in biomarker development (mass spectrometry statisti...
Metrological experiments in biomarker development (mass spectrometry statisti...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...
Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...
Assignment Of Protein Function And Discovery Of Novel Nucleolar Proteins Base...
 

Mehr von Yannick Pouliot

Predicting Adverse Drug Reactions Using PubChem Screening Data
Predicting Adverse Drug Reactions Using PubChem Screening DataPredicting Adverse Drug Reactions Using PubChem Screening Data
Predicting Adverse Drug Reactions Using PubChem Screening Data
Yannick Pouliot
 

Mehr von Yannick Pouliot (11)

Survey of Spark for Data Pre-Processing and Analytics
Survey of Spark for Data Pre-Processing and AnalyticsSurvey of Spark for Data Pre-Processing and Analytics
Survey of Spark for Data Pre-Processing and Analytics
 
Systems Immunology -- 2014
Systems Immunology -- 2014Systems Immunology -- 2014
Systems Immunology -- 2014
 
Managing experiment data using Excel and Friends
Managing experiment data using Excel and FriendsManaging experiment data using Excel and Friends
Managing experiment data using Excel and Friends
 
Essential UNIX skills for biologists
Essential UNIX skills for biologistsEssential UNIX skills for biologists
Essential UNIX skills for biologists
 
A guided SQL tour of bioinformatics databases
A guided SQL tour of bioinformatics databasesA guided SQL tour of bioinformatics databases
A guided SQL tour of bioinformatics databases
 
Ontologically-Aware Automated Gating
Ontologically-Aware Automated GatingOntologically-Aware Automated Gating
Ontologically-Aware Automated Gating
 
Why The Cloud Is A Computational Biologist's Best Friend
Why The Cloud Is A Computational Biologist's Best FriendWhy The Cloud Is A Computational Biologist's Best Friend
Why The Cloud Is A Computational Biologist's Best Friend
 
There’s No Avoiding It: Programming Skills You’ll Need
There’s No Avoiding It:  Programming Skills You’ll NeedThere’s No Avoiding It:  Programming Skills You’ll Need
There’s No Avoiding It: Programming Skills You’ll Need
 
Predicting Adverse Drug Reactions Using PubChem Screening Data
Predicting Adverse Drug Reactions Using PubChem Screening DataPredicting Adverse Drug Reactions Using PubChem Screening Data
Predicting Adverse Drug Reactions Using PubChem Screening Data
 
Repositioning Old Drugs For New Indications Using Computational Approaches
Repositioning Old Drugs For New Indications Using Computational ApproachesRepositioning Old Drugs For New Indications Using Computational Approaches
Repositioning Old Drugs For New Indications Using Computational Approaches
 
Databases, Web Services and Tools For Systems Immunology
Databases, Web Services and Tools For Systems ImmunologyDatabases, Web Services and Tools For Systems Immunology
Databases, Web Services and Tools For Systems Immunology
 

Kürzlich hochgeladen

Call Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service Available
Dipal Arora
 
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Kürzlich hochgeladen (20)

Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Bangalore Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Bangalore Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Bangalore Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Bangalore Just Call 8250077686 Top Class Call Girl Service Available
 
Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...
Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...
Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...
 
Call Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 8250077686 Top Class Call Girl Service Available
 
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
 
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟ 9332606886 ⟟ Call Me For G...
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟  9332606886 ⟟ Call Me For G...Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟  9332606886 ⟟ Call Me For G...
Top Rated Bangalore Call Girls Ramamurthy Nagar ⟟ 9332606886 ⟟ Call Me For G...
 
Call Girls Jabalpur Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Jabalpur Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 8250077686 Top Class Call Girl Service Available
 
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
 
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
 
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
 
Call Girls Ooty Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Ooty Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Ooty Just Call 8250077686 Top Class Call Girl Service Available
 
Call Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Varanasi Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Varanasi Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 8250077686 Top Class Call Girl Service Available
 
Top Rated Bangalore Call Girls Mg Road ⟟ 9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Mg Road ⟟   9332606886 ⟟ Call Me For Genuine S...Top Rated Bangalore Call Girls Mg Road ⟟   9332606886 ⟟ Call Me For Genuine S...
Top Rated Bangalore Call Girls Mg Road ⟟ 9332606886 ⟟ Call Me For Genuine S...
 
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
 
Call Girls Tirupati Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Tirupati Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 8250077686 Top Class Call Girl Service Available
 
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort ServicePremium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
 
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
 
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
 
Call Girls Agra Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Agra Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Agra Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Agra Just Call 8250077686 Top Class Call Girl Service Available
 

Ontologies for Semantic Normalization of Immunological Data

  • 1. Ontologies for Semantic Normalization of Immunological Data Yannick Pouliot1, Atul J. Butte1,2 1. Division of Systems Medicine, Stanford University School of Medicine 2. Center for Pediatric Bioinformatics, Lucile Packard Children’s Hospital, Palo Alto, California The Problem Data from experiments probing the immune system are inherently complex because of the diversity of data types, assay types and the number of biological agents involved. This complexity is further increased by the multi-center nature of data generated by HIPC. One of the goals of HIPC is to deliver a database able to support broad community access to these complex data sets. Critical to the success of this database will be its ability to provide conceptual characterizations of experiments and their results (“data and metadata encoding”). Such encodings identify data sets according to experimental properties so that users can quickly narrow their searches to the most pertinent results. To this end, conceptual encoding that rely on “industry-standard” ontologies are preferred is the best way to achieve this. We determined the extent to which existing ontologies can be used to encode HIPC data, and ImmPort’s ability to support the application of these concepts. Since ImmPort will be the repository of HIPC data, we evaluated its use of ontologies. Upon determining that ImmPort is not ontologycompliant, we analyzed the universe of ontologies to determine the extent to which existing ontologies can be used to encode HIPC data, and ImmPort’s ability to support the application of these concepts. Ontologies provided by BioPortal1 were first selected based on their domain of application: Domain Description Example Ontology Analysis Anatomy Process of data analysis Anatomical structures at all level of resolution except molecular Disease states manifested by organisms at anatomical, spatial, temporal and functional levels Conditions/specifications associated with a scientific or clinical protocol Process/properties/data types of modeling, computational or otherwise Aspects of biomolecules : structure, sequence, function Biochemical, signaling pathways used by organisms States manifested by organisms at anatomical, spatial, temporal and functional levels. “Anatomy” and “Disease” are components of “Phenotype” but treated distinctly Ontology of Data Mining Cell Ontology Experimental conditions Modeling Molecule Pathways Phenotype Criterium Parameter Design Must be integrative (ability to draw on similar/identical concepts from other ontologies) Must minimize overlap with other ontologies, consistent with providing terms that can be inter-related across ontologies Should be “relatable” to clinical applications Must be applicable to humans, and perhaps one animal model Must be an ontology, not just a controlled vocabulary (with a few exceptions) Evidence of ongoing development and maintenance Developmental state Usage Content Developed with, accepted by standards or professional organizations Adheres to standards such as the Basic Formal Ontology Must be released (post beta) Reasonably widely adopted Must exhibit a good balance between expressiveness and usability/understandability: Conceptual clarity (e.g., no ambiguous classifications for dualuse organs such as reproductive/urinary organs) Limited redundancy of synonymous concepts Usable definitions of concepts describing HIPC data or metadata, including experiment design Completeness (frequency of missing concepts) Correctness (how accurately is a concept is expressed) These ontologies were then analyzed for their ability to recognize terms from text obtained from Build 1 datasets, protocols, and metadata, as well as from Stanford’s DataMt database2 (which stores many Stanford HIPC datasets). An automated pipeline that relies on the National Center for Biomedical Ontology’s Annotator3 was written that relies on BioPortal’s Web services to parse the text and attempt to map to the reference ontologies. Methods Disease 57 Ontologies within these domains were then screened for preliminary suitability to HIPC data/metadata according to four broad criteria: Infectious Disease Ontology (IDO) Ontology of Clinical Research (OCRe) Interaction Network Ontology (INO) ONTIE - Ontology of Immune Epitopes Pathway Ontology Phenotypic Quality Results & Discussion Ontology NCI Thesaurus Medical Subject Headings Molecule role SNOMED Clinical Terms PRotein Ontology (PRO) Cell Cycle Ontology Ontology for Biomedical Investigations Experimental Factor Ontology SemanticScience Integrated Ontology Units of measurement Phenotypic quality EDAM Foundational Model of Anatomy Vaccine Ontology ICPC-2 PLUS MGED Ontology Measurement Method Ontology Gene Ontology Ontology of Clinical Research (OCRe) Protein-protein interaction Mammalian phenotype Build 1 (%) 16.7% 7.1% 14.3% 2.4% 16.7% 7.1% 2.4% n 7 3 6 1 7 3 1 ImmPort CV (%) 33.3% 21.4% 0.0% 11.9% 0.0% 0.0% 14.3% 7.1% 2.4% 2.4% 7.1% 2.4% 2.4% 2.4% 2.4% n 14 9 5 6 3 1 1 3 1 Data Mt (%) 52.6% 22.9% 24.6% 17.7% 10.3% 12.6% 5.7% 7.4% 2.9% 2.3% 2.3% 0.6% 2.3% 1.1% 1.1% n 92 40 43 31 18 22 10 13 5 4 4 1 4 2 2 1 1 1 0.6% 0.6% 0.6% 1 1 1 • Many mapping failures attributable to lack of definition for commercial objects within the reference ontologies (e.g., “Anti-CD27” antibody from BD) Solution: Contacting ontology owners to have them add commercial terms to their ontologies • Many mapping failures are easily correctable Example: Adding a pre-processor able to recognized instances of the “anti-“ problem (e.g., “anti-CD20” not recognized even though “CD20” is known) We conclude that ImmPort should be able to migrate toward ontologically-based encodings. References 1. Noy et al., (2009) “BioPortal: Ontologies and Integrated Data resources at the Click of a Mouse”, Nucl. Acids Res., 37:W170-W173. 2. Siebert, J., Munsil, D. & Maecker, H. (2011) "A Novel Approach for Integrating and Exploring Heterogeneous Translational Data", manuscript in preparation. 3. Jonquet et al., (2009) “The Open Biomedical Annotator”, Summit on Transla. Bioinfo., 56-60. Acknowledgments NIAID, Hewlett Packard Foundation, Butte Lab