SlideShare ist ein Scribd-Unternehmen logo
1 von 33
Downloaden Sie, um offline zu lesen
Joaquín Dopazo 
Computational Genomics Department, 
Centro de Investigación Príncipe Felipe (CIPF), 
Functional Genomics Node, (INB), 
Bioinformatics Group (CIBERER) and 
Medical Genome Project, 
Spain. 
http://bioinfo.cipf.es 
http://www.medicalgenomeproject.com 
http://www.babelomics.org 
http://www.hpc4g.org 
@xdopazo 
Forum on Personalized Medicine, 25 September 2014 
Bioinformatics and Big Data in the era of Personalized Medicine
Allison, 2008. Is personalized medicine finally arriving? Nature. 
Personalized medicine: just about a better understanding of the relationship phenotype-genotype 
Personalized medicine through precision medicine 
•Precision medicine requires of better ways of defining diseases by introducing genomic technologies into the diagnostic procedures. 
•A more precise diagnostic of diseases, based on the description of their molecular mechanisms, is critical for creating innovative diagnostic, prognostic, and therapeutic strategies properly tailored to each patient’s necessities
The future of personalized medicine is strongly based on genomics 
•Personalized medicine is based on the availability of diagnostic biomarkers 
•Genome sequencing offers ALL this information (if properly analyzed) 
•Genome sequence prices are in free fall (exome price expected < 300€ in 2-3 years) 
•Over 30-40 % of budget (>500 B $) per year, is spent on costs associated with “overuse, underuse, misuse, ...”
While the cost falls down, the amount of data to manage and its complexity raise exponentially. 
Costs are already almost competitive enough to be used in clinic 
The problem is… are we ready to deal with this data? 
Exome sequencing successfully used. NGS prices will be soon affordable. 
http://www.genome.gov/sequencingcosts/
http://www.nih.gov/news/health/jun2014/nhgri-18.htm 
http://www.nejm.org/doi/full/10.1056/NEJMra1312543 
More than 10,000 exomes will be ordered for diagnostic purposes 
Clinical application of exomes
Personalized Genomic Medicine. Phase I: generating the knowledge database 
----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 
sequencing 
Patient 
List of variants 
Database. Query: variant/pathway 
Therapy 
Outcome 
System feedback 
Genetic variants are linked to therapies through the knowledge of their functional effects (systems biology) 
Initially the system will need much feedback: Knowledge generation phase. 
Growing knowledge database 
Genomic medicine 
Knowledge database
Personalized genomic medicine. Phase II: applying the knowledge database 
Patient 
1)Genomic sequencing 
2)Database of markers 
3)Therapy prediction 
Genomic core facility phase II 
Clinician receives hints on possible prescriptions and therapeutic interventions 
+ 
Other factors (risk, cost, etc.) 
Prescription 
Pre-symptomatic: 
• Genetic predisposition of acquired diseases (>6000. some treatable) 
• Early diagnosis of genetic diseases Symptomatic analysis 
• Diagnostic of acquired diseases 
• Early cancer detection 
• Cancer treatment recommendation
From genetics to genomic medicine 
Test 1 
Test 2 
Therapy 1 
Therapy 2 
Therapy 3 
? 
Genetic medicine 
Test 
Therapy 1 
Therapy 2 
Therapy 3 
? 
Genomic medicine 
+ 
Genomic analysis allows associating patients to therapies from the very beginning, saving time and costs and increasing the success of treatments. 
feedback
Some examples 
Conventional sequencing 
NGS (with capture) 
Marfan syndrome 
1300€ 
2 genes, 75 exons 
900€ 
3 genes, 237 exons 
Hereditary deafness 
12500€ 
36 genes 1500 exons 
1100€ 
38 genes > 1500 exons 
• Low initial investment 
• Already existent infrastructure 
• Quick implementation 
• Easily implementation as a cloud service that guarantees sustainability
Preparing the scenario for the introduction of genome in the clinics 
Patient 
Treatment 
eHR 
Decision support techniques: algorithms that relate biomarkers to treatments, outcomes, etc. (gene prioritization and predictors) 
Integration of the data in the eHR 
Visualization and data presentation. Ready for the clinical interpretation 
Acceleration of algorithms for data pre- processing. Data strorage optimization 
feedback 
Corporative systems 
Orion clinic Abucasis, Gaia, etc.
Preparing the scenario for the introduction of genome in the clinics 
Patient 
Treatment 
eHR 
feedback 
Corporative systems 
Orion clinic Abucasis, Gaia, etc. 
Decision support techniques: algorithms that relate biomarkers to treatments, outcomes, etc. (gene prioritization and predictors) 
Visualization and data presentation. 
Ready for the clinical interpretation 
Integration of the data in the eHR 
Acceleration of algorithms for data pre- processing. Data strorage optimization
New Big Data storage strategies 
Automatic QC Sequence cleansing 
Variant calling + QC 
Mapping 
+ QC 
8-10 hours 8-12 hours 8-12 hours 
CLOUD 
FASTQ (10GB) 
BAM (7GB) 
VCF (200MB) 
Data sizes for exomes. In case of whole genomes sizes are >20x 
Remote visualization of big data. Data production phase 
e-health record 
Final human supervision of data QC
Tools developed to improve the pipeline Genome Maps, a HTML5+SVG data visualization of VCF and BAM 
oGenome scale data visualization plays an important role in the data analysis process. It is a big data management problem. 
oFeatures of Genome Maps (Medina, 2013, NAR; ICGC data analysis portal) 
●First 100% HTML5 web based: HTML5+SVG (inspired in Google Maps) 
●Always updated, no browser plugins or installation 
●Data taken from CellBase, remote NGS data, local files and DAS servers: genes, transcripts, exons, SNPs, TFBS, miRNA targets, etc. 
●Other features: Multi species, API oriented, easy integration, plugin framework, etc. 
BAM viewer 
VCF viewer 
ICGC genomic viewer 
www.genomemaps.org
Patient 
Treatment 
eHR 
feedback 
Corporative systems 
Orion clinic Abucasis, Gaia, etc. 
Acceleration of algorithms for data pre- processing. Data strorage optimization 
Integration of the data in the eHR 
Visualization and data presentation. Ready for the clinical interpretation 
Decision support techniques: algorithms that relate biomarkers to treatments, outcomes, etc. (gene prioritization and predictors) 
Preparing the scenario for the introduction of genome in the clinics
Finding new biomarkers 
Test 
Therapy 1 
Therapy 2 
Therapy 3 
? 
feedback 
Feedback: treatment failures are reanalyzed to search for: 
1)Biomarkers (of failure) 
2)Subgroups (to search for new personalized and rational therapeutic interventions 
Treatables 
Failure treatment biomarkers 
Group A biomarkers 
Group A biomarkers 
Irrelevant 
Non treatables 
Signaling 
Protein interaction 
Regulation 
Variants are used as biomarkers to distinguish between responders and non-responders and to sub-classify non-responders 
Rationale design of therapies rely on Systems Biology concepts. Pathways are complex and must be understood with the proper bioinformatic tools
Patient 
Treatment 
eHR 
feedback 
Corporative systems Orion clinic Abucasis, Gaia, etc. 
Decision support techniques: algorithms that relate biomarkers to treatments, outcomes, etc. (gene prioritization and predictors) 
Acceleration of algorithms for data pre- processing. Data strorage optimization 
Visualization and data presentation. 
Ready for the clinical interpretation 
Integration of the data in the eHR 
Preparing the scenario for the introduction of the genome in clinics
BiERapp: interactive web-based tool for easy candidate prioritization by successive filtering 
SEQUENCING CENTER 
Data preprocessing 
VCF 
FASTQ 
Genome Maps 
BAM 
BiERapp filters 
No-SQL (Mongo) VCF indexing 
Population frequencies 
Consequence types 
Experimental 
design 
BAM viewer and Genomic context 
? 
Easy 
scale up
NA19660 NA19661 
NA19600 NA19685 
BiERapp: the interactive filtering tool for easy candidate prioritization 
http://bierapp.babelomics.org 
Aleman et al., 2014 NAR
3-Methylglutaconic aciduria (3- MGA-uria) is a heterogeneous group of syndromes characterized by an increased excretion of 3-methylglutaconic and 3-methylglutaric acids. 
WES with a consecutive filter approach is enough to detect the new mutation in this case. 
Successive Filtering approach An example with 3-Methylglutaconic aciduria syndrome
Use known variants and their population frequencies to filter out irrelevant polymorphisms. 
•Typically dbSNP, 1000 genomes and the 6515 exomes from the ESP are used as sources of population frequencies. 
•We sequenced 300 healthy controls (rigorously phenotyped) to add and extra filtering step to the analysis pipeline 
Novembre et al., 2008. Genes mirror geography within Europe. Nature 
Comparison of MGP controls to 1000g 
How important do you think local information is to detect disease genes?
Filtering with or without local variants 
Number of genes as a function of individuals in the study of a dominant disease Retinitis Pigmentosa autosomal dominant 
The use of local variants makes an enormous difference
New variants and disease genes found with WES and successive filtering 
WES 
IRDs 
arRP (EYS) 
BBS 
arRP 
arRP (USH2) 
3-MGA- uria (SERAC1) 
NBD (BCKDK )
Knowledge DB 
Freq. popul. 
MySeq IonTorrent IonProton 
Illumina 
NO 
Diagnostic 
Therapeutic decision 
New variants 
Disease 
All 
Candidate Prioritization 
Data preprocessing 
Sequence DB 
Sequences 
Freqs. 
Future 
technologies 
New knowledge for future diagnostic 
The final schema: diagnostic and discovery
Diagnostic by targeted sequencing (panels of genes) 
Tool for defining panels 
New filter based on local population variant frequencies 
If no diagnostic variants appear, then secondary findings are studied 
Diagnostic mutations 
http://team.babelomics.org
Implementation of tools in the IT4I Supercomputing Center (Czech Republic) 
The pipelines of primary and secondary analysis developed by the Computational Genomics Department of the CIPF in close collaboration with the Bull Chair has proven its efficiency in the analysis of more than 1000 exomes in a joint collaborative project of the CIBERER and the MGP 
A first pilot implementation has been done in the IT4I supercomputing center, which aims to centralize the analysis of genomics data in the country.
Implementation in the AVS 
….. 
1PB DB 
We have taken advantage of the already operative corporative medical image system using a quite similar philosophy. 
eHR 
gateway 
Upload image 
Retrieve (by patient ID) 
Genomic gateway 
Pilot project with 20 leukemias
Knowledge DB 
Freq. popul. 
MySeq 
IonTorrent 
IonProton 
Illumina 
NO 
Diagnostic 
Therapeutic decision 
New variants 
Disease 
All 
Candidate Prioritization 
Data preprocessing 
Sequence DB 
Sequences 
Freqs. 
Future 
technologies 
New knowledge for future diagnostic 
Gene discovery and diagnostic implemented 
But… what about personalized treatments? 

Patient’s omic data Biological 
knowledge 
Systems 
biology 
computational 
models 
Epigenomics Regulation 
Interaction 
Function 
Proteomics 
Genomics and 
transcriptomics 
Patient 
Metabolomics 
Diagnostic biomarkers 
Personalized medicine 
Therapeutic 
targets 
Cell culture 
Best 
combination 
Xenograft model 
Drug treatment 
Network drugs 
Personalized 
therapy 
Are individualized treatments a realistic option? 
Dopazo, 2003, Drug Discovery Today
Modeling pathways The effect of gene expression over signaling can be estimated. Virtual KOs (or over-expressions) can be simulated 
Colorectal cancer activates a signaling circuit of VEGF pathway that produces PGI2. 
Virtual KO of COX2 interrupts the circuit (known therapeutic inhibitor in CGR 
COX2 gene 
KO
Future prospects Exome vs complete genome
The ENCODE project suggests a functional 
role for a large fraction of the genome 
Which percentage of the genome is 
occupied by: 
Coding genes: 2.4% 
TFBSs 8.1% 
Open chromatin regions 15.2% 
Different RNA types 62.0% 
Total annotated elements: 80.4% 
Exomes are only covering a small fraction of the potential functionality of the genome 
(2.4%). 
Is the missing heritability hidden in the remaining 78%? 
If so, what type of variant should be expect to discover? SNVs? SVs?
Future prospects 
We need to efficiently query all the information contained in the genome, including all the epigenomic signatures as well as the structural variation. 
This involves data integration and “epistatic” queries. 
We need to prepare our health systems to deal with all the genomic data flood 
Information about variations Processed Raw 
Genome variant information (VCF) 150 MB 250 GB 
Epigenome 150 MB 250 GB 
Each transcriptome 20 MB 80 GB 
Individual complete variability 400 MB 525 GB 
Hospital (100.000 patients) 40 TB 50 PB 
We are only starting to realize the dimension of the daunting challenges posed by genomic big data 
There are technical (data size) and conceptual problems (data analysis) in the way genomic information is managed that must be addressed.
The Computational Genomics Department at the Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain, and… 
...the INB, National Institute of Bioinformatics (Functional Genomics Node) and the CIBERER Network of Centers for Rare Diseases. 
@xdopazo 
@bioinfocipf

Weitere ähnliche Inhalte

Was ist angesagt?

Bioinformatics in dermato-oncology
Bioinformatics in dermato-oncologyBioinformatics in dermato-oncology
Bioinformatics in dermato-oncologyJoaquin Dopazo
 
Big Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey NislowBig Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey NislowKnome_Inc
 
Computational challenges in precision medicine and genomics
Computational challenges in precision medicine and genomicsComputational challenges in precision medicine and genomics
Computational challenges in precision medicine and genomicsGary Bader
 
Genomics: The coming challenge to the health system
Genomics: The coming challenge to the health systemGenomics: The coming challenge to the health system
Genomics: The coming challenge to the health systemPrivate Healthcare Australia
 
Multigenic (mechanistic) biomarkers
Multigenic (mechanistic) biomarkersMultigenic (mechanistic) biomarkers
Multigenic (mechanistic) biomarkersJoaquin Dopazo
 
2016 ngs health_lecture
2016 ngs health_lecture2016 ngs health_lecture
2016 ngs health_lectureDan Gaston
 
0301 ostrer
0301   ostrer0301   ostrer
0301 ostrertczucker
 
Integrative analysis and visualization of clinical and molecular data for can...
Integrative analysis and visualization of clinical and molecular data for can...Integrative analysis and visualization of clinical and molecular data for can...
Integrative analysis and visualization of clinical and molecular data for can...Lake Como School of Advanced Studies
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision
 
Reg Sci Lecture Dec 2016
Reg Sci Lecture Dec 2016Reg Sci Lecture Dec 2016
Reg Sci Lecture Dec 2016Rick Silva
 
Biomarkers: Next Generation Sequencing and Updates on NTRK and ctDNA
Biomarkers: Next Generation Sequencing and Updates on NTRK and ctDNABiomarkers: Next Generation Sequencing and Updates on NTRK and ctDNA
Biomarkers: Next Generation Sequencing and Updates on NTRK and ctDNAFight Colorectal Cancer
 
Gardner and Song_2015_Genetics in Medicine
Gardner and Song_2015_Genetics in MedicineGardner and Song_2015_Genetics in Medicine
Gardner and Song_2015_Genetics in MedicineAmanda Natalizio
 
Monarch Initiative Poster - Rare Disease Symposium 2015
Monarch Initiative Poster - Rare Disease Symposium 2015Monarch Initiative Poster - Rare Disease Symposium 2015
Monarch Initiative Poster - Rare Disease Symposium 2015Nicole Vasilevsky
 
NGS in cancer treatment
NGS in cancer treatmentNGS in cancer treatment
NGS in cancer treatmentNur Suhaida
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsIncedo
 
Personalized medicine through wes and big data analytics
Personalized medicine through wes and big data analyticsPersonalized medicine through wes and big data analytics
Personalized medicine through wes and big data analyticsJunaidAKG
 
Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...Rick Silva
 
BE Retreat 2015 Poster
BE Retreat 2015 PosterBE Retreat 2015 Poster
BE Retreat 2015 PosterEric Ma
 

Was ist angesagt? (20)

Bioinformatics in dermato-oncology
Bioinformatics in dermato-oncologyBioinformatics in dermato-oncology
Bioinformatics in dermato-oncology
 
Big Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey NislowBig Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey Nislow
 
Computational challenges in precision medicine and genomics
Computational challenges in precision medicine and genomicsComputational challenges in precision medicine and genomics
Computational challenges in precision medicine and genomics
 
JALANov2000
JALANov2000JALANov2000
JALANov2000
 
Genomics: The coming challenge to the health system
Genomics: The coming challenge to the health systemGenomics: The coming challenge to the health system
Genomics: The coming challenge to the health system
 
Multigenic (mechanistic) biomarkers
Multigenic (mechanistic) biomarkersMultigenic (mechanistic) biomarkers
Multigenic (mechanistic) biomarkers
 
2016 ngs health_lecture
2016 ngs health_lecture2016 ngs health_lecture
2016 ngs health_lecture
 
0301 ostrer
0301   ostrer0301   ostrer
0301 ostrer
 
Integrative analysis and visualization of clinical and molecular data for can...
Integrative analysis and visualization of clinical and molecular data for can...Integrative analysis and visualization of clinical and molecular data for can...
Integrative analysis and visualization of clinical and molecular data for can...
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
 
Reg Sci Lecture Dec 2016
Reg Sci Lecture Dec 2016Reg Sci Lecture Dec 2016
Reg Sci Lecture Dec 2016
 
Biomarkers: Next Generation Sequencing and Updates on NTRK and ctDNA
Biomarkers: Next Generation Sequencing and Updates on NTRK and ctDNABiomarkers: Next Generation Sequencing and Updates on NTRK and ctDNA
Biomarkers: Next Generation Sequencing and Updates on NTRK and ctDNA
 
Gardner and Song_2015_Genetics in Medicine
Gardner and Song_2015_Genetics in MedicineGardner and Song_2015_Genetics in Medicine
Gardner and Song_2015_Genetics in Medicine
 
Dr. Leroy Hood Lecuture on P4 Medicine
Dr. Leroy Hood Lecuture on P4 MedicineDr. Leroy Hood Lecuture on P4 Medicine
Dr. Leroy Hood Lecuture on P4 Medicine
 
Monarch Initiative Poster - Rare Disease Symposium 2015
Monarch Initiative Poster - Rare Disease Symposium 2015Monarch Initiative Poster - Rare Disease Symposium 2015
Monarch Initiative Poster - Rare Disease Symposium 2015
 
NGS in cancer treatment
NGS in cancer treatmentNGS in cancer treatment
NGS in cancer treatment
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Personalized medicine through wes and big data analytics
Personalized medicine through wes and big data analyticsPersonalized medicine through wes and big data analytics
Personalized medicine through wes and big data analytics
 
Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...Emerging collaboration models for academic medical centers _ our place in the...
Emerging collaboration models for academic medical centers _ our place in the...
 
BE Retreat 2015 Poster
BE Retreat 2015 PosterBE Retreat 2015 Poster
BE Retreat 2015 Poster
 

Ähnlich wie Forum on Personalized Medicine: Challenges for the next decade

Dr. Nanyingi Technology Keynote
Dr. Nanyingi Technology KeynoteDr. Nanyingi Technology Keynote
Dr. Nanyingi Technology KeynoteNanyingi Mark
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Ian Foster
 
NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016Warren Kibbe
 
인공지능 논문작성과 심사에관한요령
인공지능 논문작성과 심사에관한요령인공지능 논문작성과 심사에관한요령
인공지능 논문작성과 심사에관한요령Namkug Kim
 
LIMS in Modern Molecular Pathology by Dr. Perry Maxwell
LIMS in Modern Molecular Pathology by Dr. Perry MaxwellLIMS in Modern Molecular Pathology by Dr. Perry Maxwell
LIMS in Modern Molecular Pathology by Dr. Perry MaxwellCirdan
 
Analytics in healthcare bhuvaneashwar 11th_march
Analytics in healthcare  bhuvaneashwar  11th_marchAnalytics in healthcare  bhuvaneashwar  11th_march
Analytics in healthcare bhuvaneashwar 11th_marchBhuvaneashwar Subramanian
 
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Remedy Informatics
 
Open Source Networking Solving Molecular Analysis of Cancer
Open Source Networking Solving Molecular Analysis of CancerOpen Source Networking Solving Molecular Analysis of Cancer
Open Source Networking Solving Molecular Analysis of CancerOpen Networking Summit
 
The Clinical Genome Conference 2014
The Clinical Genome Conference 2014The Clinical Genome Conference 2014
The Clinical Genome Conference 2014Nicole Proulx
 
G. Poste. Big Data and the Evolution of Precision Medicine, Cambridge 2nd Ann...
G. Poste. Big Data and the Evolution of Precision Medicine, Cambridge 2nd Ann...G. Poste. Big Data and the Evolution of Precision Medicine, Cambridge 2nd Ann...
G. Poste. Big Data and the Evolution of Precision Medicine, Cambridge 2nd Ann...CASI, Arizona State University
 
Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Joel Saltz
 
PadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptxPadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptxDESMONDEZIEKE1
 
Semantic Web for Health Care and Biomedical Informatics
Semantic Web for Health Care and Biomedical InformaticsSemantic Web for Health Care and Biomedical Informatics
Semantic Web for Health Care and Biomedical InformaticsAmit Sheth
 
Enabling Translational Medicine with e-Science
Enabling Translational Medicine with e-ScienceEnabling Translational Medicine with e-Science
Enabling Translational Medicine with e-ScienceOla Spjuth
 
Next Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problemNext Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problemSubhendu Dey
 
The state of the art in behavioral machine learning for healthcare
The state of the art in behavioral machine learning for healthcareThe state of the art in behavioral machine learning for healthcare
The state of the art in behavioral machine learning for healthcareAfrica Perianez
 

Ähnlich wie Forum on Personalized Medicine: Challenges for the next decade (20)

Dr. Nanyingi Technology Keynote
Dr. Nanyingi Technology KeynoteDr. Nanyingi Technology Keynote
Dr. Nanyingi Technology Keynote
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016
 
인공지능 논문작성과 심사에관한요령
인공지능 논문작성과 심사에관한요령인공지능 논문작성과 심사에관한요령
인공지능 논문작성과 심사에관한요령
 
LIMS in Modern Molecular Pathology by Dr. Perry Maxwell
LIMS in Modern Molecular Pathology by Dr. Perry MaxwellLIMS in Modern Molecular Pathology by Dr. Perry Maxwell
LIMS in Modern Molecular Pathology by Dr. Perry Maxwell
 
Analytics in healthcare bhuvaneashwar 11th_march
Analytics in healthcare  bhuvaneashwar  11th_marchAnalytics in healthcare  bhuvaneashwar  11th_march
Analytics in healthcare bhuvaneashwar 11th_march
 
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
 
Data sharing and analysis
Data sharing and analysisData sharing and analysis
Data sharing and analysis
 
Open Source Networking Solving Molecular Analysis of Cancer
Open Source Networking Solving Molecular Analysis of CancerOpen Source Networking Solving Molecular Analysis of Cancer
Open Source Networking Solving Molecular Analysis of Cancer
 
The Clinical Genome Conference 2014
The Clinical Genome Conference 2014The Clinical Genome Conference 2014
The Clinical Genome Conference 2014
 
G. Poste. Big Data and the Evolution of Precision Medicine, Cambridge 2nd Ann...
G. Poste. Big Data and the Evolution of Precision Medicine, Cambridge 2nd Ann...G. Poste. Big Data and the Evolution of Precision Medicine, Cambridge 2nd Ann...
G. Poste. Big Data and the Evolution of Precision Medicine, Cambridge 2nd Ann...
 
NGS and the molecular basis of disease: a practical view
NGS and the molecular basis of disease: a practical viewNGS and the molecular basis of disease: a practical view
NGS and the molecular basis of disease: a practical view
 
Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014
 
PadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptxPadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptx
 
Semantic Web for Health Care and Biomedical Informatics
Semantic Web for Health Care and Biomedical InformaticsSemantic Web for Health Care and Biomedical Informatics
Semantic Web for Health Care and Biomedical Informatics
 
Enabling Translational Medicine with e-Science
Enabling Translational Medicine with e-ScienceEnabling Translational Medicine with e-Science
Enabling Translational Medicine with e-Science
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Next Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problemNext Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problem
 
ASHG_2014_AP
ASHG_2014_APASHG_2014_AP
ASHG_2014_AP
 
The state of the art in behavioral machine learning for healthcare
The state of the art in behavioral machine learning for healthcareThe state of the art in behavioral machine learning for healthcare
The state of the art in behavioral machine learning for healthcare
 

Mehr von Joaquin Dopazo

Accelerating the benefits of genomics worldwide
Accelerating the benefits of genomics worldwideAccelerating the benefits of genomics worldwide
Accelerating the benefits of genomics worldwideJoaquin Dopazo
 
Navigating through disease maps
Navigating through disease mapsNavigating through disease maps
Navigating through disease mapsJoaquin Dopazo
 
Differential metabolic activity and discovery of therapeutic targets using su...
Differential metabolic activity and discovery of therapeutic targets using su...Differential metabolic activity and discovery of therapeutic targets using su...
Differential metabolic activity and discovery of therapeutic targets using su...Joaquin Dopazo
 
Taller Genómica y cáncer Introducción
Taller Genómica y cáncer IntroducciónTaller Genómica y cáncer Introducción
Taller Genómica y cáncer IntroducciónJoaquin Dopazo
 
From empirical biomarkers to models of disease mechanisms in the transition t...
From empirical biomarkers to models of disease mechanisms in the transition t...From empirical biomarkers to models of disease mechanisms in the transition t...
From empirical biomarkers to models of disease mechanisms in the transition t...Joaquin Dopazo
 
A look at the human mutational load from the systems biology perspective
A look at the human mutational load from the systems biology perspectiveA look at the human mutational load from the systems biology perspective
A look at the human mutational load from the systems biology perspectiveJoaquin Dopazo
 
Functional profile of the pre- to post-mortem transition in blood
Functional profile of the pre- to post-mortem transition in bloodFunctional profile of the pre- to post-mortem transition in blood
Functional profile of the pre- to post-mortem transition in bloodJoaquin Dopazo
 

Mehr von Joaquin Dopazo (8)

Accelerating the benefits of genomics worldwide
Accelerating the benefits of genomics worldwideAccelerating the benefits of genomics worldwide
Accelerating the benefits of genomics worldwide
 
Navigating through disease maps
Navigating through disease mapsNavigating through disease maps
Navigating through disease maps
 
Differential metabolic activity and discovery of therapeutic targets using su...
Differential metabolic activity and discovery of therapeutic targets using su...Differential metabolic activity and discovery of therapeutic targets using su...
Differential metabolic activity and discovery of therapeutic targets using su...
 
Taller Genómica y cáncer Introducción
Taller Genómica y cáncer IntroducciónTaller Genómica y cáncer Introducción
Taller Genómica y cáncer Introducción
 
From empirical biomarkers to models of disease mechanisms in the transition t...
From empirical biomarkers to models of disease mechanisms in the transition t...From empirical biomarkers to models of disease mechanisms in the transition t...
From empirical biomarkers to models of disease mechanisms in the transition t...
 
A look at the human mutational load from the systems biology perspective
A look at the human mutational load from the systems biology perspectiveA look at the human mutational load from the systems biology perspective
A look at the human mutational load from the systems biology perspective
 
Functional profile of the pre- to post-mortem transition in blood
Functional profile of the pre- to post-mortem transition in bloodFunctional profile of the pre- to post-mortem transition in blood
Functional profile of the pre- to post-mortem transition in blood
 
Big data genomico
Big data genomicoBig data genomico
Big data genomico
 

Kürzlich hochgeladen

NEONATAL RESPIRATORY CARE FROM A PHYSIO POV.pptx
NEONATAL RESPIRATORY CARE FROM A PHYSIO POV.pptxNEONATAL RESPIRATORY CARE FROM A PHYSIO POV.pptx
NEONATAL RESPIRATORY CARE FROM A PHYSIO POV.pptxHanineHassan2
 
Basics of Giant Cell Tumor of bone (GCTB)
Basics of Giant Cell Tumor of bone (GCTB)Basics of Giant Cell Tumor of bone (GCTB)
Basics of Giant Cell Tumor of bone (GCTB)bishwabandhuniraula
 
Anatomy Shelf Notevbhhhhhhhhhhhhhhhs.pdf
Anatomy Shelf Notevbhhhhhhhhhhhhhhhs.pdfAnatomy Shelf Notevbhhhhhhhhhhhhhhhs.pdf
Anatomy Shelf Notevbhhhhhhhhhhhhhhhs.pdfhezamzaki1
 
Diseases of the Respiratory System (J00-J99),.pptx
Diseases of the Respiratory System (J00-J99),.pptxDiseases of the Respiratory System (J00-J99),.pptx
Diseases of the Respiratory System (J00-J99),.pptxEMADABATHINI PRABHU TEJA
 
EYE CANCER.pptx prepared by Neha kewat digital learning
EYE CANCER.pptx prepared by  Neha kewat digital learningEYE CANCER.pptx prepared by  Neha kewat digital learning
EYE CANCER.pptx prepared by Neha kewat digital learningNehaKewat
 
Pharmacovigilance audits inspections.pptx
Pharmacovigilance audits inspections.pptxPharmacovigilance audits inspections.pptx
Pharmacovigilance audits inspections.pptxCliniminds India
 
Health literacies in marginalised communities LILAC 24.pptx
Health literacies in marginalised communities LILAC 24.pptxHealth literacies in marginalised communities LILAC 24.pptx
Health literacies in marginalised communities LILAC 24.pptxPamela McKinney
 
Understanding Warts and Moles: Differences, Types, and Common Locations
Understanding Warts and Moles: Differences, Types, and Common LocationsUnderstanding Warts and Moles: Differences, Types, and Common Locations
Understanding Warts and Moles: Differences, Types, and Common LocationsNeha Sharma
 
21 NEMT Trends & Statistics to Know in 2024
21 NEMT Trends & Statistics to Know in 202421 NEMT Trends & Statistics to Know in 2024
21 NEMT Trends & Statistics to Know in 2024Traumasoft LLC
 
Living Well Every Day: Lyons Wellness Practice | Nurtures Your Complete Health
Living Well Every Day: Lyons Wellness Practice | Nurtures Your Complete HealthLiving Well Every Day: Lyons Wellness Practice | Nurtures Your Complete Health
Living Well Every Day: Lyons Wellness Practice | Nurtures Your Complete HealthLyons Health
 
"ANATOMY AND PHYSIOLOGY OF THE SKIN".pdf
"ANATOMY AND PHYSIOLOGY OF THE SKIN".pdf"ANATOMY AND PHYSIOLOGY OF THE SKIN".pdf
"ANATOMY AND PHYSIOLOGY OF THE SKIN".pdfDolisha Warbi
 
CECT NECK NECK ANGIOGRAPHY CAROTID ANGIOGRAPHY
CECT NECK NECK ANGIOGRAPHY CAROTID ANGIOGRAPHYCECT NECK NECK ANGIOGRAPHY CAROTID ANGIOGRAPHY
CECT NECK NECK ANGIOGRAPHY CAROTID ANGIOGRAPHYRMC
 
Identifying Signs of Mental Health Presentation (1).pptx
Identifying Signs of Mental Health Presentation (1).pptxIdentifying Signs of Mental Health Presentation (1).pptx
Identifying Signs of Mental Health Presentation (1).pptxsandhulove46637
 
ACCA Version of AI & Healthcare: An Overview for the Curious
ACCA Version of AI & Healthcare: An Overview for the CuriousACCA Version of AI & Healthcare: An Overview for the Curious
ACCA Version of AI & Healthcare: An Overview for the CuriousKR_Barker
 
ARTHRITIS.pptx Prepared by monika gopal Tutor
ARTHRITIS.pptx Prepared  by monika gopal TutorARTHRITIS.pptx Prepared  by monika gopal Tutor
ARTHRITIS.pptx Prepared by monika gopal TutorNehaKewat
 
Eating Disorders in Athletes I Sports Psychology
Eating Disorders in Athletes I Sports PsychologyEating Disorders in Athletes I Sports Psychology
Eating Disorders in Athletes I Sports Psychologyshantisphysio
 

Kürzlich hochgeladen (20)

Painting Rats White Angers Them to No End
Painting Rats White Angers Them to No EndPainting Rats White Angers Them to No End
Painting Rats White Angers Them to No End
 
Annual Training
Annual TrainingAnnual Training
Annual Training
 
NEONATAL RESPIRATORY CARE FROM A PHYSIO POV.pptx
NEONATAL RESPIRATORY CARE FROM A PHYSIO POV.pptxNEONATAL RESPIRATORY CARE FROM A PHYSIO POV.pptx
NEONATAL RESPIRATORY CARE FROM A PHYSIO POV.pptx
 
Basics of Giant Cell Tumor of bone (GCTB)
Basics of Giant Cell Tumor of bone (GCTB)Basics of Giant Cell Tumor of bone (GCTB)
Basics of Giant Cell Tumor of bone (GCTB)
 
Anatomy Shelf Notevbhhhhhhhhhhhhhhhs.pdf
Anatomy Shelf Notevbhhhhhhhhhhhhhhhs.pdfAnatomy Shelf Notevbhhhhhhhhhhhhhhhs.pdf
Anatomy Shelf Notevbhhhhhhhhhhhhhhhs.pdf
 
Diseases of the Respiratory System (J00-J99),.pptx
Diseases of the Respiratory System (J00-J99),.pptxDiseases of the Respiratory System (J00-J99),.pptx
Diseases of the Respiratory System (J00-J99),.pptx
 
EYE CANCER.pptx prepared by Neha kewat digital learning
EYE CANCER.pptx prepared by  Neha kewat digital learningEYE CANCER.pptx prepared by  Neha kewat digital learning
EYE CANCER.pptx prepared by Neha kewat digital learning
 
Pharmacovigilance audits inspections.pptx
Pharmacovigilance audits inspections.pptxPharmacovigilance audits inspections.pptx
Pharmacovigilance audits inspections.pptx
 
SCOPE OF CRITICAL CARE ORGANIZATION
SCOPE OF CRITICAL CARE ORGANIZATIONSCOPE OF CRITICAL CARE ORGANIZATION
SCOPE OF CRITICAL CARE ORGANIZATION
 
Health literacies in marginalised communities LILAC 24.pptx
Health literacies in marginalised communities LILAC 24.pptxHealth literacies in marginalised communities LILAC 24.pptx
Health literacies in marginalised communities LILAC 24.pptx
 
Understanding Warts and Moles: Differences, Types, and Common Locations
Understanding Warts and Moles: Differences, Types, and Common LocationsUnderstanding Warts and Moles: Differences, Types, and Common Locations
Understanding Warts and Moles: Differences, Types, and Common Locations
 
21 NEMT Trends & Statistics to Know in 2024
21 NEMT Trends & Statistics to Know in 202421 NEMT Trends & Statistics to Know in 2024
21 NEMT Trends & Statistics to Know in 2024
 
Living Well Every Day: Lyons Wellness Practice | Nurtures Your Complete Health
Living Well Every Day: Lyons Wellness Practice | Nurtures Your Complete HealthLiving Well Every Day: Lyons Wellness Practice | Nurtures Your Complete Health
Living Well Every Day: Lyons Wellness Practice | Nurtures Your Complete Health
 
"ANATOMY AND PHYSIOLOGY OF THE SKIN".pdf
"ANATOMY AND PHYSIOLOGY OF THE SKIN".pdf"ANATOMY AND PHYSIOLOGY OF THE SKIN".pdf
"ANATOMY AND PHYSIOLOGY OF THE SKIN".pdf
 
The Power of Active listening - Tool in effective communication.pdf
The Power of Active listening - Tool in effective communication.pdfThe Power of Active listening - Tool in effective communication.pdf
The Power of Active listening - Tool in effective communication.pdf
 
CECT NECK NECK ANGIOGRAPHY CAROTID ANGIOGRAPHY
CECT NECK NECK ANGIOGRAPHY CAROTID ANGIOGRAPHYCECT NECK NECK ANGIOGRAPHY CAROTID ANGIOGRAPHY
CECT NECK NECK ANGIOGRAPHY CAROTID ANGIOGRAPHY
 
Identifying Signs of Mental Health Presentation (1).pptx
Identifying Signs of Mental Health Presentation (1).pptxIdentifying Signs of Mental Health Presentation (1).pptx
Identifying Signs of Mental Health Presentation (1).pptx
 
ACCA Version of AI & Healthcare: An Overview for the Curious
ACCA Version of AI & Healthcare: An Overview for the CuriousACCA Version of AI & Healthcare: An Overview for the Curious
ACCA Version of AI & Healthcare: An Overview for the Curious
 
ARTHRITIS.pptx Prepared by monika gopal Tutor
ARTHRITIS.pptx Prepared  by monika gopal TutorARTHRITIS.pptx Prepared  by monika gopal Tutor
ARTHRITIS.pptx Prepared by monika gopal Tutor
 
Eating Disorders in Athletes I Sports Psychology
Eating Disorders in Athletes I Sports PsychologyEating Disorders in Athletes I Sports Psychology
Eating Disorders in Athletes I Sports Psychology
 

Forum on Personalized Medicine: Challenges for the next decade

  • 1. Joaquín Dopazo Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Functional Genomics Node, (INB), Bioinformatics Group (CIBERER) and Medical Genome Project, Spain. http://bioinfo.cipf.es http://www.medicalgenomeproject.com http://www.babelomics.org http://www.hpc4g.org @xdopazo Forum on Personalized Medicine, 25 September 2014 Bioinformatics and Big Data in the era of Personalized Medicine
  • 2. Allison, 2008. Is personalized medicine finally arriving? Nature. Personalized medicine: just about a better understanding of the relationship phenotype-genotype Personalized medicine through precision medicine •Precision medicine requires of better ways of defining diseases by introducing genomic technologies into the diagnostic procedures. •A more precise diagnostic of diseases, based on the description of their molecular mechanisms, is critical for creating innovative diagnostic, prognostic, and therapeutic strategies properly tailored to each patient’s necessities
  • 3. The future of personalized medicine is strongly based on genomics •Personalized medicine is based on the availability of diagnostic biomarkers •Genome sequencing offers ALL this information (if properly analyzed) •Genome sequence prices are in free fall (exome price expected < 300€ in 2-3 years) •Over 30-40 % of budget (>500 B $) per year, is spent on costs associated with “overuse, underuse, misuse, ...”
  • 4. While the cost falls down, the amount of data to manage and its complexity raise exponentially. Costs are already almost competitive enough to be used in clinic The problem is… are we ready to deal with this data? Exome sequencing successfully used. NGS prices will be soon affordable. http://www.genome.gov/sequencingcosts/
  • 5. http://www.nih.gov/news/health/jun2014/nhgri-18.htm http://www.nejm.org/doi/full/10.1056/NEJMra1312543 More than 10,000 exomes will be ordered for diagnostic purposes Clinical application of exomes
  • 6. Personalized Genomic Medicine. Phase I: generating the knowledge database ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- sequencing Patient List of variants Database. Query: variant/pathway Therapy Outcome System feedback Genetic variants are linked to therapies through the knowledge of their functional effects (systems biology) Initially the system will need much feedback: Knowledge generation phase. Growing knowledge database Genomic medicine Knowledge database
  • 7. Personalized genomic medicine. Phase II: applying the knowledge database Patient 1)Genomic sequencing 2)Database of markers 3)Therapy prediction Genomic core facility phase II Clinician receives hints on possible prescriptions and therapeutic interventions + Other factors (risk, cost, etc.) Prescription Pre-symptomatic: • Genetic predisposition of acquired diseases (>6000. some treatable) • Early diagnosis of genetic diseases Symptomatic analysis • Diagnostic of acquired diseases • Early cancer detection • Cancer treatment recommendation
  • 8. From genetics to genomic medicine Test 1 Test 2 Therapy 1 Therapy 2 Therapy 3 ? Genetic medicine Test Therapy 1 Therapy 2 Therapy 3 ? Genomic medicine + Genomic analysis allows associating patients to therapies from the very beginning, saving time and costs and increasing the success of treatments. feedback
  • 9. Some examples Conventional sequencing NGS (with capture) Marfan syndrome 1300€ 2 genes, 75 exons 900€ 3 genes, 237 exons Hereditary deafness 12500€ 36 genes 1500 exons 1100€ 38 genes > 1500 exons • Low initial investment • Already existent infrastructure • Quick implementation • Easily implementation as a cloud service that guarantees sustainability
  • 10. Preparing the scenario for the introduction of genome in the clinics Patient Treatment eHR Decision support techniques: algorithms that relate biomarkers to treatments, outcomes, etc. (gene prioritization and predictors) Integration of the data in the eHR Visualization and data presentation. Ready for the clinical interpretation Acceleration of algorithms for data pre- processing. Data strorage optimization feedback Corporative systems Orion clinic Abucasis, Gaia, etc.
  • 11. Preparing the scenario for the introduction of genome in the clinics Patient Treatment eHR feedback Corporative systems Orion clinic Abucasis, Gaia, etc. Decision support techniques: algorithms that relate biomarkers to treatments, outcomes, etc. (gene prioritization and predictors) Visualization and data presentation. Ready for the clinical interpretation Integration of the data in the eHR Acceleration of algorithms for data pre- processing. Data strorage optimization
  • 12. New Big Data storage strategies Automatic QC Sequence cleansing Variant calling + QC Mapping + QC 8-10 hours 8-12 hours 8-12 hours CLOUD FASTQ (10GB) BAM (7GB) VCF (200MB) Data sizes for exomes. In case of whole genomes sizes are >20x Remote visualization of big data. Data production phase e-health record Final human supervision of data QC
  • 13. Tools developed to improve the pipeline Genome Maps, a HTML5+SVG data visualization of VCF and BAM oGenome scale data visualization plays an important role in the data analysis process. It is a big data management problem. oFeatures of Genome Maps (Medina, 2013, NAR; ICGC data analysis portal) ●First 100% HTML5 web based: HTML5+SVG (inspired in Google Maps) ●Always updated, no browser plugins or installation ●Data taken from CellBase, remote NGS data, local files and DAS servers: genes, transcripts, exons, SNPs, TFBS, miRNA targets, etc. ●Other features: Multi species, API oriented, easy integration, plugin framework, etc. BAM viewer VCF viewer ICGC genomic viewer www.genomemaps.org
  • 14. Patient Treatment eHR feedback Corporative systems Orion clinic Abucasis, Gaia, etc. Acceleration of algorithms for data pre- processing. Data strorage optimization Integration of the data in the eHR Visualization and data presentation. Ready for the clinical interpretation Decision support techniques: algorithms that relate biomarkers to treatments, outcomes, etc. (gene prioritization and predictors) Preparing the scenario for the introduction of genome in the clinics
  • 15. Finding new biomarkers Test Therapy 1 Therapy 2 Therapy 3 ? feedback Feedback: treatment failures are reanalyzed to search for: 1)Biomarkers (of failure) 2)Subgroups (to search for new personalized and rational therapeutic interventions Treatables Failure treatment biomarkers Group A biomarkers Group A biomarkers Irrelevant Non treatables Signaling Protein interaction Regulation Variants are used as biomarkers to distinguish between responders and non-responders and to sub-classify non-responders Rationale design of therapies rely on Systems Biology concepts. Pathways are complex and must be understood with the proper bioinformatic tools
  • 16. Patient Treatment eHR feedback Corporative systems Orion clinic Abucasis, Gaia, etc. Decision support techniques: algorithms that relate biomarkers to treatments, outcomes, etc. (gene prioritization and predictors) Acceleration of algorithms for data pre- processing. Data strorage optimization Visualization and data presentation. Ready for the clinical interpretation Integration of the data in the eHR Preparing the scenario for the introduction of the genome in clinics
  • 17. BiERapp: interactive web-based tool for easy candidate prioritization by successive filtering SEQUENCING CENTER Data preprocessing VCF FASTQ Genome Maps BAM BiERapp filters No-SQL (Mongo) VCF indexing Population frequencies Consequence types Experimental design BAM viewer and Genomic context ? Easy scale up
  • 18. NA19660 NA19661 NA19600 NA19685 BiERapp: the interactive filtering tool for easy candidate prioritization http://bierapp.babelomics.org Aleman et al., 2014 NAR
  • 19. 3-Methylglutaconic aciduria (3- MGA-uria) is a heterogeneous group of syndromes characterized by an increased excretion of 3-methylglutaconic and 3-methylglutaric acids. WES with a consecutive filter approach is enough to detect the new mutation in this case. Successive Filtering approach An example with 3-Methylglutaconic aciduria syndrome
  • 20. Use known variants and their population frequencies to filter out irrelevant polymorphisms. •Typically dbSNP, 1000 genomes and the 6515 exomes from the ESP are used as sources of population frequencies. •We sequenced 300 healthy controls (rigorously phenotyped) to add and extra filtering step to the analysis pipeline Novembre et al., 2008. Genes mirror geography within Europe. Nature Comparison of MGP controls to 1000g How important do you think local information is to detect disease genes?
  • 21. Filtering with or without local variants Number of genes as a function of individuals in the study of a dominant disease Retinitis Pigmentosa autosomal dominant The use of local variants makes an enormous difference
  • 22. New variants and disease genes found with WES and successive filtering WES IRDs arRP (EYS) BBS arRP arRP (USH2) 3-MGA- uria (SERAC1) NBD (BCKDK )
  • 23. Knowledge DB Freq. popul. MySeq IonTorrent IonProton Illumina NO Diagnostic Therapeutic decision New variants Disease All Candidate Prioritization Data preprocessing Sequence DB Sequences Freqs. Future technologies New knowledge for future diagnostic The final schema: diagnostic and discovery
  • 24. Diagnostic by targeted sequencing (panels of genes) Tool for defining panels New filter based on local population variant frequencies If no diagnostic variants appear, then secondary findings are studied Diagnostic mutations http://team.babelomics.org
  • 25. Implementation of tools in the IT4I Supercomputing Center (Czech Republic) The pipelines of primary and secondary analysis developed by the Computational Genomics Department of the CIPF in close collaboration with the Bull Chair has proven its efficiency in the analysis of more than 1000 exomes in a joint collaborative project of the CIBERER and the MGP A first pilot implementation has been done in the IT4I supercomputing center, which aims to centralize the analysis of genomics data in the country.
  • 26. Implementation in the AVS ….. 1PB DB We have taken advantage of the already operative corporative medical image system using a quite similar philosophy. eHR gateway Upload image Retrieve (by patient ID) Genomic gateway Pilot project with 20 leukemias
  • 27. Knowledge DB Freq. popul. MySeq IonTorrent IonProton Illumina NO Diagnostic Therapeutic decision New variants Disease All Candidate Prioritization Data preprocessing Sequence DB Sequences Freqs. Future technologies New knowledge for future diagnostic Gene discovery and diagnostic implemented But… what about personalized treatments? 
  • 28. Patient’s omic data Biological knowledge Systems biology computational models Epigenomics Regulation Interaction Function Proteomics Genomics and transcriptomics Patient Metabolomics Diagnostic biomarkers Personalized medicine Therapeutic targets Cell culture Best combination Xenograft model Drug treatment Network drugs Personalized therapy Are individualized treatments a realistic option? Dopazo, 2003, Drug Discovery Today
  • 29. Modeling pathways The effect of gene expression over signaling can be estimated. Virtual KOs (or over-expressions) can be simulated Colorectal cancer activates a signaling circuit of VEGF pathway that produces PGI2. Virtual KO of COX2 interrupts the circuit (known therapeutic inhibitor in CGR COX2 gene KO
  • 30. Future prospects Exome vs complete genome
  • 31. The ENCODE project suggests a functional role for a large fraction of the genome Which percentage of the genome is occupied by: Coding genes: 2.4% TFBSs 8.1% Open chromatin regions 15.2% Different RNA types 62.0% Total annotated elements: 80.4% Exomes are only covering a small fraction of the potential functionality of the genome (2.4%). Is the missing heritability hidden in the remaining 78%? If so, what type of variant should be expect to discover? SNVs? SVs?
  • 32. Future prospects We need to efficiently query all the information contained in the genome, including all the epigenomic signatures as well as the structural variation. This involves data integration and “epistatic” queries. We need to prepare our health systems to deal with all the genomic data flood Information about variations Processed Raw Genome variant information (VCF) 150 MB 250 GB Epigenome 150 MB 250 GB Each transcriptome 20 MB 80 GB Individual complete variability 400 MB 525 GB Hospital (100.000 patients) 40 TB 50 PB We are only starting to realize the dimension of the daunting challenges posed by genomic big data There are technical (data size) and conceptual problems (data analysis) in the way genomic information is managed that must be addressed.
  • 33. The Computational Genomics Department at the Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain, and… ...the INB, National Institute of Bioinformatics (Functional Genomics Node) and the CIBERER Network of Centers for Rare Diseases. @xdopazo @bioinfocipf