SlideShare ist ein Scribd-Unternehmen logo
1 von 1
Downloaden Sie, um offline zu lesen
© VANDERBILT UNIVERSITY 2015
BIOMEDICAL INFORMATICS
Implementing PheWAS
in i2b2
Huan Mo, MD, MS1
Jacqueline Kirby1, Zhiao Shi1
Robert Carroll, PhD1, Lisa Bastarache1
Richard Kiefer5, Guoqian Jiang, MD, PhD5
Luke Rasmussen3, Jennifer Pacheco4
William Thompson, PhD2, Firas Wehbe, MD, PhD1
Jyotishman Pathak, PhD5, Joshua Denny, MD, MS1
1Dept. Biomed. Inform., Vanderbilt Univ., TN;
2Cntr. Biomed. Res. Inform., NorthShore Univ. HS, IL;
3Dept. Preventive Med. & 4Cntr Genet. Med.,
Feinberg SOM, Northwestern Univ., IL;
5Dept. Health Sci. Res., Mayo Clinic, MN
OBJECTIVE:
1. Transform and import PheWAS ontology (ICD-9
based) to i2b2 ontology cell.
2. Connect PheWAS R package to i2b2 CRC cell.
3. (Next step) Create PheWAS cell that runs on i2b2
framework. Seek/establish i2b2 solution for
genotype data.
BACKGROUND
Phenome-wide association studies (PheWAS)
look for associations between many
phenotypes and a single nucleotide
polymorphism (SNP), or other predictors (e.g.,
use of a medication, disease subtypes). In a
standard PheWAS, a phenotype is defined by
the presence of certain ICD-9 codes in a
patient’s electronic health record (EHR); a
control for the phenotype is defined as the
absence of these ICD-9 codes as well as other,
more broadly related ICD-9 codes. In our
previous efforts, we have grouped ICD-9 codes
to ~1,600 PheWAS codes hierarchically (as a
PheWAS ontology) according to the similarity
of their underlying pathophysiological
processes.
METHODS
PheWAS Ontology for i2b2: We used a KNIME
workflow to transform the mapping table of
PheWAS codes to an i2b2 metadata table (for
the ontology cell). In metadata SELECT SQL
query construction fields, we adopted an “IN”
clause (instead of “LIKE”) with a list of ICD-9
codes that mapped to a PheWAS concept unit.
No changes in the CRC cell or its database are
needed.
PheWAS Ontology for i2b2
i2b2-PheWAS services
Connect PheWAS to CRC cell: We developed an R
package (i2b2-PheWAS) to generate phenotype files
from CRC cells that can be used in Carroll’s 2014
PheWAS R package. For each phenotype, i2b2-
PheWAS issues two XML queries (for cases and
controls) to CRC cells, and transforms the response
XML forms to a phenotype column in R. (A PhEMA
effort.)
Availability of Software
PheWAS ontology for i2b2 can be downloaded from
ProjectPhEMA.org. I2b2-PheWAS R package is
available upon request, and will be publicly
available in the near future.
FUTURE DIRECTIONS
1. We will test the i2b2-PheWAS package with i2b2
instances for production with real clinical data.
2. We will integrate i2b2-PheWAS and PheWAS R
package into an i2b2 cell with a graphical user
interface for “Analysis Tools” on the Web Client.
3. As an iPGx effort, we will seek or build a
solution to allow i2b2 to manage genotype data
(similar to PLINK), so that we can perform GWAS
within the i2b2 framework.
FUNDING
PheWAS: R01 LM010685; PhEMA: R01 GM105688;
iPGx: R01 GM 103859
REFERENCES
Denny JC, Bastarache L, Ritchie MD, et al. (2013). Systematic comparison of
phenome-wide association study of electronic medical record data and
genome-wide association study data. Nature Biotechnology, 31(12), 1102–10.
Carroll RJ, Bastarache L, & Denny JC (2014). R PheWAS: Data analysis and
plotting tools for phenome-wide association studies in the R environment.
Bioinformatics, 30(16), 2375-6.
C_TOOLTIP =
PheWAS  case  circulatory system  (411.X) Ischemic Heart Disease  (411.3) Angina pectoris	

C_COLUMNNAME = concept_cd; C_OPERATOR = IN	

C_DIMCODE = 'ICD9:413','ICD9:413.0','ICD9:413.1','ICD9:413.9'	

CRC
ONT
with
PheWAS
/request
/pdorequest
Case
item_key:
PHEWAS...matched...
(411-3)…case inclusion
item_occurrences: 2
invert: 0
result_output: patientset
Control
item_key:
PHEWAS...matched...
(411-3)…control exclusion
item_occurrences: 1
invert: 1
result_output: patientset
Header:
username
passkey
query_result_access:
case = 123456
control = 123457
R: httr (RESTful)
R:XML (authoring)
Case List
Patient 1
Control List
Patient 3
PheWAS Code 411.3:
Angina pectoris
Case List
Patient 1 (Pt 1)
Control List
Patient 3 (Pt 3)
411.3 SNP
Pt 1 TRUE 2
Pt 2 NA 0
Pt 3 FALSE 1
R PheWAS pipeline (Carroll 2014) for science success

Weitere ähnliche Inhalte

Was ist angesagt?

Gene Editing - Challenges and Future of CRISPR in Clinical Development
Gene Editing - Challenges and Future of CRISPR in Clinical DevelopmentGene Editing - Challenges and Future of CRISPR in Clinical Development
Gene Editing - Challenges and Future of CRISPR in Clinical DevelopmentMedpace
 
Crispr future prospects in public health
Crispr  future prospects in public healthCrispr  future prospects in public health
Crispr future prospects in public healthRashmiSharma304
 
CRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and HowCRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and HowHorizonDiscovery
 
Transhumanismo y Mejoramiento Genético mediante CRISPR
Transhumanismo y Mejoramiento Genético mediante CRISPRTranshumanismo y Mejoramiento Genético mediante CRISPR
Transhumanismo y Mejoramiento Genético mediante CRISPRBioeticared
 
Research project
Research project Research project
Research project Dingquan Yu
 
The relative ease of use and reproducibility of
The relative ease of use and reproducibility ofThe relative ease of use and reproducibility of
The relative ease of use and reproducibility ofShiv Nadar University
 
CRISPR: Opportunities and Challenges Webinar
CRISPR: Opportunities and Challenges WebinarCRISPR: Opportunities and Challenges Webinar
CRISPR: Opportunities and Challenges WebinarPreScouter
 
CRISPR-Revolutionary Genome editing tools for Plants.....
CRISPR-Revolutionary Genome editing tools for Plants.....CRISPR-Revolutionary Genome editing tools for Plants.....
CRISPR-Revolutionary Genome editing tools for Plants.....BHU,Varanasi, INDIA
 
Automated data pipelines at the rat genome database
Automated data pipelines at the rat genome databaseAutomated data pipelines at the rat genome database
Automated data pipelines at the rat genome databaseJennifer Smith
 
Crispr handbook 2015
Crispr handbook 2015Crispr handbook 2015
Crispr handbook 2015rochonf
 
Crispr cas:an advance and efficient tool for genome modification
Crispr cas:an advance and efficient tool for genome modificationCrispr cas:an advance and efficient tool for genome modification
Crispr cas:an advance and efficient tool for genome modificationavinash tiwari
 
Genome Editing with CRISPR-Cas9
Genome Editing with CRISPR-Cas9Genome Editing with CRISPR-Cas9
Genome Editing with CRISPR-Cas9Lopamudra Nayak
 
CRISPR theory mechanism and applications || كرسبر النظريه وطريقه العمل والتطب...
CRISPR theory mechanism and applications || كرسبر النظريه وطريقه العمل والتطب...CRISPR theory mechanism and applications || كرسبر النظريه وطريقه العمل والتطب...
CRISPR theory mechanism and applications || كرسبر النظريه وطريقه العمل والتطب...Mohemmad Osama
 
UCSBigemPosterSmaller
UCSBigemPosterSmallerUCSBigemPosterSmaller
UCSBigemPosterSmallerJohn Errico
 
Aug2015 steve lincoln analytical validation
Aug2015 steve lincoln analytical validationAug2015 steve lincoln analytical validation
Aug2015 steve lincoln analytical validationGenomeInABottle
 
Literature mining: what is it, and should I care?
Literature mining: what is it, and should I care?Literature mining: what is it, and should I care?
Literature mining: what is it, and should I care?Lars Juhl Jensen
 
Biomedical literature mining (and why we really need open access)
Biomedical literature mining (and why we really need open access)Biomedical literature mining (and why we really need open access)
Biomedical literature mining (and why we really need open access)Lars Juhl Jensen
 

Was ist angesagt? (20)

Gene Editing - Challenges and Future of CRISPR in Clinical Development
Gene Editing - Challenges and Future of CRISPR in Clinical DevelopmentGene Editing - Challenges and Future of CRISPR in Clinical Development
Gene Editing - Challenges and Future of CRISPR in Clinical Development
 
Crispr future prospects in public health
Crispr  future prospects in public healthCrispr  future prospects in public health
Crispr future prospects in public health
 
CRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and HowCRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and How
 
Transhumanismo y Mejoramiento Genético mediante CRISPR
Transhumanismo y Mejoramiento Genético mediante CRISPRTranshumanismo y Mejoramiento Genético mediante CRISPR
Transhumanismo y Mejoramiento Genético mediante CRISPR
 
Research project
Research project Research project
Research project
 
The relative ease of use and reproducibility of
The relative ease of use and reproducibility ofThe relative ease of use and reproducibility of
The relative ease of use and reproducibility of
 
The CRISPR/Cas9 Toolbox
The CRISPR/Cas9 ToolboxThe CRISPR/Cas9 Toolbox
The CRISPR/Cas9 Toolbox
 
Editing our genes
Editing our genesEditing our genes
Editing our genes
 
CRISPR: Opportunities and Challenges Webinar
CRISPR: Opportunities and Challenges WebinarCRISPR: Opportunities and Challenges Webinar
CRISPR: Opportunities and Challenges Webinar
 
CRISPR-Revolutionary Genome editing tools for Plants.....
CRISPR-Revolutionary Genome editing tools for Plants.....CRISPR-Revolutionary Genome editing tools for Plants.....
CRISPR-Revolutionary Genome editing tools for Plants.....
 
Automated data pipelines at the rat genome database
Automated data pipelines at the rat genome databaseAutomated data pipelines at the rat genome database
Automated data pipelines at the rat genome database
 
Crispr handbook 2015
Crispr handbook 2015Crispr handbook 2015
Crispr handbook 2015
 
Crispr cas:an advance and efficient tool for genome modification
Crispr cas:an advance and efficient tool for genome modificationCrispr cas:an advance and efficient tool for genome modification
Crispr cas:an advance and efficient tool for genome modification
 
Genome Editing with CRISPR-Cas9
Genome Editing with CRISPR-Cas9Genome Editing with CRISPR-Cas9
Genome Editing with CRISPR-Cas9
 
CRISPR theory mechanism and applications || كرسبر النظريه وطريقه العمل والتطب...
CRISPR theory mechanism and applications || كرسبر النظريه وطريقه العمل والتطب...CRISPR theory mechanism and applications || كرسبر النظريه وطريقه العمل والتطب...
CRISPR theory mechanism and applications || كرسبر النظريه وطريقه العمل والتطب...
 
CRISPR /Cas9
CRISPR /Cas9CRISPR /Cas9
CRISPR /Cas9
 
UCSBigemPosterSmaller
UCSBigemPosterSmallerUCSBigemPosterSmaller
UCSBigemPosterSmaller
 
Aug2015 steve lincoln analytical validation
Aug2015 steve lincoln analytical validationAug2015 steve lincoln analytical validation
Aug2015 steve lincoln analytical validation
 
Literature mining: what is it, and should I care?
Literature mining: what is it, and should I care?Literature mining: what is it, and should I care?
Literature mining: what is it, and should I care?
 
Biomedical literature mining (and why we really need open access)
Biomedical literature mining (and why we really need open access)Biomedical literature mining (and why we really need open access)
Biomedical literature mining (and why we really need open access)
 

Andere mochten auch

La gestion de los sistemas educativos
La gestion de los sistemas educativosLa gestion de los sistemas educativos
La gestion de los sistemas educativossecundaria10
 
Les cèl·lules
Les cèl·lulesLes cèl·lules
Les cèl·lulesalbinsqui
 
University of St. Thomas MBA Viewbook
University of St. Thomas MBA ViewbookUniversity of St. Thomas MBA Viewbook
University of St. Thomas MBA Viewbookfutureandnow
 
Semantic Publishing and Nanopublications
Semantic Publishing and NanopublicationsSemantic Publishing and Nanopublications
Semantic Publishing and NanopublicationsTobias Kuhn
 
Msafara Day 1 - Mombasa
Msafara Day 1 - MombasaMsafara Day 1 - Mombasa
Msafara Day 1 - Mombasadanflec
 
Diagrama de-equilibrio-de-las-aleaciones-hierro-2
Diagrama de-equilibrio-de-las-aleaciones-hierro-2Diagrama de-equilibrio-de-las-aleaciones-hierro-2
Diagrama de-equilibrio-de-las-aleaciones-hierro-2Blanca Rozas
 
Andresa rodrigues turma7º3 nº3
Andresa rodrigues turma7º3 nº3Andresa rodrigues turma7º3 nº3
Andresa rodrigues turma7º3 nº3Mayjö .
 
Eguberrietako lana Paula Berrade
Eguberrietako lana Paula BerradeEguberrietako lana Paula Berrade
Eguberrietako lana Paula BerradePaulaBerrade
 

Andere mochten auch (11)

9 formas de usar el video en clase
9 formas de usar el video en clase9 formas de usar el video en clase
9 formas de usar el video en clase
 
Las tendencias para 2016 en e learning
Las tendencias para 2016 en e learningLas tendencias para 2016 en e learning
Las tendencias para 2016 en e learning
 
La gestion de los sistemas educativos
La gestion de los sistemas educativosLa gestion de los sistemas educativos
La gestion de los sistemas educativos
 
Les cèl·lules
Les cèl·lulesLes cèl·lules
Les cèl·lules
 
University of St. Thomas MBA Viewbook
University of St. Thomas MBA ViewbookUniversity of St. Thomas MBA Viewbook
University of St. Thomas MBA Viewbook
 
Semantic Publishing and Nanopublications
Semantic Publishing and NanopublicationsSemantic Publishing and Nanopublications
Semantic Publishing and Nanopublications
 
Msafara Day 1 - Mombasa
Msafara Day 1 - MombasaMsafara Day 1 - Mombasa
Msafara Day 1 - Mombasa
 
Diagrama de-equilibrio-de-las-aleaciones-hierro-2
Diagrama de-equilibrio-de-las-aleaciones-hierro-2Diagrama de-equilibrio-de-las-aleaciones-hierro-2
Diagrama de-equilibrio-de-las-aleaciones-hierro-2
 
Andresa rodrigues turma7º3 nº3
Andresa rodrigues turma7º3 nº3Andresa rodrigues turma7º3 nº3
Andresa rodrigues turma7º3 nº3
 
стимульный материал 7 класс
стимульный материал 7 классстимульный материал 7 класс
стимульный материал 7 класс
 
Eguberrietako lana Paula Berrade
Eguberrietako lana Paula BerradeEguberrietako lana Paula Berrade
Eguberrietako lana Paula Berrade
 

Ähnlich wie PheWAS_i2b2_v1.0

Cutting Edge Conversations: Addressing COVID-19 During Uncertainty
Cutting Edge Conversations: Addressing COVID-19 During UncertaintyCutting Edge Conversations: Addressing COVID-19 During Uncertainty
Cutting Edge Conversations: Addressing COVID-19 During UncertaintyInsideScientific
 
From Advanced Queries to Algorithms and Graph-Based ML: Tackling Diabetes wit...
From Advanced Queries to Algorithms and Graph-Based ML: Tackling Diabetes wit...From Advanced Queries to Algorithms and Graph-Based ML: Tackling Diabetes wit...
From Advanced Queries to Algorithms and Graph-Based ML: Tackling Diabetes wit...Neo4j
 
FinalMethodology.docx
FinalMethodology.docxFinalMethodology.docx
FinalMethodology.docxInfoEric33
 
COMPUTER AIDED PERSPECTIVE OF SELECTION OF PLANTS AGAINST VIRUSES
COMPUTER AIDED PERSPECTIVE OF SELECTION OF PLANTS AGAINST VIRUSESCOMPUTER AIDED PERSPECTIVE OF SELECTION OF PLANTS AGAINST VIRUSES
COMPUTER AIDED PERSPECTIVE OF SELECTION OF PLANTS AGAINST VIRUSESManik Ghosh
 
UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...UKSG: connecting the knowledge community
 
mingcv_2016
mingcv_2016mingcv_2016
mingcv_2016Ming Li
 
Open zika presentation
Open zika presentation Open zika presentation
Open zika presentation Sean Ekins
 
'Stories that persuade with data' - talk at CENDI meeting January 9 2014
'Stories that persuade with data' - talk at CENDI meeting January 9 2014'Stories that persuade with data' - talk at CENDI meeting January 9 2014
'Stories that persuade with data' - talk at CENDI meeting January 9 2014Anita de Waard
 
EpiVax FastVax 23Feb2010
EpiVax FastVax 23Feb2010EpiVax FastVax 23Feb2010
EpiVax FastVax 23Feb2010Annie De Groot
 
EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795Michael Williams
 
New Target Prediction and Visualization Tools Incorporating Open Source Molec...
New Target Prediction and Visualization Tools Incorporating Open Source Molec...New Target Prediction and Visualization Tools Incorporating Open Source Molec...
New Target Prediction and Visualization Tools Incorporating Open Source Molec...Sean Ekins
 
Prasanth Kumar Nadh Dehydrogenase Subunit 1
Prasanth Kumar Nadh Dehydrogenase Subunit 1Prasanth Kumar Nadh Dehydrogenase Subunit 1
Prasanth Kumar Nadh Dehydrogenase Subunit 1Prasanthperceptron
 
Human Studies Database Project (demo)
Human Studies Database Project (demo)Human Studies Database Project (demo)
Human Studies Database Project (demo)Ida Sim
 
Mutation Prediction for Coronaviruses using Genome Sequence and Recurrent Neu...
Mutation Prediction for Coronaviruses using Genome Sequence and Recurrent Neu...Mutation Prediction for Coronaviruses using Genome Sequence and Recurrent Neu...
Mutation Prediction for Coronaviruses using Genome Sequence and Recurrent Neu...Christo Ananth
 
DF-HMe-melanobase2002
DF-HMe-melanobase2002DF-HMe-melanobase2002
DF-HMe-melanobase2002David Fung
 
Monkeypox Drug and Vaccine Discovery - Creative Biolabs
Monkeypox Drug and Vaccine Discovery - Creative BiolabsMonkeypox Drug and Vaccine Discovery - Creative Biolabs
Monkeypox Drug and Vaccine Discovery - Creative BiolabsCreative-Biolabs
 

Ähnlich wie PheWAS_i2b2_v1.0 (20)

Cutting Edge Conversations: Addressing COVID-19 During Uncertainty
Cutting Edge Conversations: Addressing COVID-19 During UncertaintyCutting Edge Conversations: Addressing COVID-19 During Uncertainty
Cutting Edge Conversations: Addressing COVID-19 During Uncertainty
 
From Advanced Queries to Algorithms and Graph-Based ML: Tackling Diabetes wit...
From Advanced Queries to Algorithms and Graph-Based ML: Tackling Diabetes wit...From Advanced Queries to Algorithms and Graph-Based ML: Tackling Diabetes wit...
From Advanced Queries to Algorithms and Graph-Based ML: Tackling Diabetes wit...
 
FinalMethodology.docx
FinalMethodology.docxFinalMethodology.docx
FinalMethodology.docx
 
COMPUTER AIDED PERSPECTIVE OF SELECTION OF PLANTS AGAINST VIRUSES
COMPUTER AIDED PERSPECTIVE OF SELECTION OF PLANTS AGAINST VIRUSESCOMPUTER AIDED PERSPECTIVE OF SELECTION OF PLANTS AGAINST VIRUSES
COMPUTER AIDED PERSPECTIVE OF SELECTION OF PLANTS AGAINST VIRUSES
 
UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...
 
mingcv_2016
mingcv_2016mingcv_2016
mingcv_2016
 
Open zika presentation
Open zika presentation Open zika presentation
Open zika presentation
 
Ismb2009
Ismb2009Ismb2009
Ismb2009
 
'Stories that persuade with data' - talk at CENDI meeting January 9 2014
'Stories that persuade with data' - talk at CENDI meeting January 9 2014'Stories that persuade with data' - talk at CENDI meeting January 9 2014
'Stories that persuade with data' - talk at CENDI meeting January 9 2014
 
EpiVax FastVax 23Feb2010
EpiVax FastVax 23Feb2010EpiVax FastVax 23Feb2010
EpiVax FastVax 23Feb2010
 
EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795
 
New Target Prediction and Visualization Tools Incorporating Open Source Molec...
New Target Prediction and Visualization Tools Incorporating Open Source Molec...New Target Prediction and Visualization Tools Incorporating Open Source Molec...
New Target Prediction and Visualization Tools Incorporating Open Source Molec...
 
Prasanth Kumar Nadh Dehydrogenase Subunit 1
Prasanth Kumar Nadh Dehydrogenase Subunit 1Prasanth Kumar Nadh Dehydrogenase Subunit 1
Prasanth Kumar Nadh Dehydrogenase Subunit 1
 
Dynamics of developmental fate decisions - Luís A. Nunes Amaral
Dynamics of developmental fate decisions - Luís A. Nunes AmaralDynamics of developmental fate decisions - Luís A. Nunes Amaral
Dynamics of developmental fate decisions - Luís A. Nunes Amaral
 
Human Studies Database Project (demo)
Human Studies Database Project (demo)Human Studies Database Project (demo)
Human Studies Database Project (demo)
 
mRNA VACCINES.pptx
mRNA VACCINES.pptxmRNA VACCINES.pptx
mRNA VACCINES.pptx
 
Mutation Prediction for Coronaviruses using Genome Sequence and Recurrent Neu...
Mutation Prediction for Coronaviruses using Genome Sequence and Recurrent Neu...Mutation Prediction for Coronaviruses using Genome Sequence and Recurrent Neu...
Mutation Prediction for Coronaviruses using Genome Sequence and Recurrent Neu...
 
DF-HMe-melanobase2002
DF-HMe-melanobase2002DF-HMe-melanobase2002
DF-HMe-melanobase2002
 
FINAL PUBLISHED PAPER SEP_2015
FINAL PUBLISHED PAPER SEP_2015FINAL PUBLISHED PAPER SEP_2015
FINAL PUBLISHED PAPER SEP_2015
 
Monkeypox Drug and Vaccine Discovery - Creative Biolabs
Monkeypox Drug and Vaccine Discovery - Creative BiolabsMonkeypox Drug and Vaccine Discovery - Creative Biolabs
Monkeypox Drug and Vaccine Discovery - Creative Biolabs
 

PheWAS_i2b2_v1.0

  • 1. © VANDERBILT UNIVERSITY 2015 BIOMEDICAL INFORMATICS Implementing PheWAS in i2b2 Huan Mo, MD, MS1 Jacqueline Kirby1, Zhiao Shi1 Robert Carroll, PhD1, Lisa Bastarache1 Richard Kiefer5, Guoqian Jiang, MD, PhD5 Luke Rasmussen3, Jennifer Pacheco4 William Thompson, PhD2, Firas Wehbe, MD, PhD1 Jyotishman Pathak, PhD5, Joshua Denny, MD, MS1 1Dept. Biomed. Inform., Vanderbilt Univ., TN; 2Cntr. Biomed. Res. Inform., NorthShore Univ. HS, IL; 3Dept. Preventive Med. & 4Cntr Genet. Med., Feinberg SOM, Northwestern Univ., IL; 5Dept. Health Sci. Res., Mayo Clinic, MN OBJECTIVE: 1. Transform and import PheWAS ontology (ICD-9 based) to i2b2 ontology cell. 2. Connect PheWAS R package to i2b2 CRC cell. 3. (Next step) Create PheWAS cell that runs on i2b2 framework. Seek/establish i2b2 solution for genotype data. BACKGROUND Phenome-wide association studies (PheWAS) look for associations between many phenotypes and a single nucleotide polymorphism (SNP), or other predictors (e.g., use of a medication, disease subtypes). In a standard PheWAS, a phenotype is defined by the presence of certain ICD-9 codes in a patient’s electronic health record (EHR); a control for the phenotype is defined as the absence of these ICD-9 codes as well as other, more broadly related ICD-9 codes. In our previous efforts, we have grouped ICD-9 codes to ~1,600 PheWAS codes hierarchically (as a PheWAS ontology) according to the similarity of their underlying pathophysiological processes. METHODS PheWAS Ontology for i2b2: We used a KNIME workflow to transform the mapping table of PheWAS codes to an i2b2 metadata table (for the ontology cell). In metadata SELECT SQL query construction fields, we adopted an “IN” clause (instead of “LIKE”) with a list of ICD-9 codes that mapped to a PheWAS concept unit. No changes in the CRC cell or its database are needed. PheWAS Ontology for i2b2 i2b2-PheWAS services Connect PheWAS to CRC cell: We developed an R package (i2b2-PheWAS) to generate phenotype files from CRC cells that can be used in Carroll’s 2014 PheWAS R package. For each phenotype, i2b2- PheWAS issues two XML queries (for cases and controls) to CRC cells, and transforms the response XML forms to a phenotype column in R. (A PhEMA effort.) Availability of Software PheWAS ontology for i2b2 can be downloaded from ProjectPhEMA.org. I2b2-PheWAS R package is available upon request, and will be publicly available in the near future. FUTURE DIRECTIONS 1. We will test the i2b2-PheWAS package with i2b2 instances for production with real clinical data. 2. We will integrate i2b2-PheWAS and PheWAS R package into an i2b2 cell with a graphical user interface for “Analysis Tools” on the Web Client. 3. As an iPGx effort, we will seek or build a solution to allow i2b2 to manage genotype data (similar to PLINK), so that we can perform GWAS within the i2b2 framework. FUNDING PheWAS: R01 LM010685; PhEMA: R01 GM105688; iPGx: R01 GM 103859 REFERENCES Denny JC, Bastarache L, Ritchie MD, et al. (2013). Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Nature Biotechnology, 31(12), 1102–10. Carroll RJ, Bastarache L, & Denny JC (2014). R PheWAS: Data analysis and plotting tools for phenome-wide association studies in the R environment. Bioinformatics, 30(16), 2375-6. C_TOOLTIP = PheWAS case circulatory system (411.X) Ischemic Heart Disease (411.3) Angina pectoris C_COLUMNNAME = concept_cd; C_OPERATOR = IN C_DIMCODE = 'ICD9:413','ICD9:413.0','ICD9:413.1','ICD9:413.9' CRC ONT with PheWAS /request /pdorequest Case item_key: PHEWAS...matched... (411-3)…case inclusion item_occurrences: 2 invert: 0 result_output: patientset Control item_key: PHEWAS...matched... (411-3)…control exclusion item_occurrences: 1 invert: 1 result_output: patientset Header: username passkey query_result_access: case = 123456 control = 123457 R: httr (RESTful) R:XML (authoring) Case List Patient 1 Control List Patient 3 PheWAS Code 411.3: Angina pectoris Case List Patient 1 (Pt 1) Control List Patient 3 (Pt 3) 411.3 SNP Pt 1 TRUE 2 Pt 2 NA 0 Pt 3 FALSE 1 R PheWAS pipeline (Carroll 2014) for science success