SlideShare a Scribd company logo
1 of 22
Download to read offline
Big Data in Genomics: Opportunities and Challenges
Dr. Matthieu-P. Schapranow
Bio Data World Congress, Cambridge, UK
Oct 22, 2015
■  Online: Visit we.analyzegenomes.com for latest research results, tools, and news
■  Offline: Read more about it, e.g. High-Performance In-Memory Genome Data Analysis:
How In-Memory Database Technology Accelerates Personalized Medicine, In-Memory
Data Management Research, Springer, ISBN: 978-3-319-03034-0, 2014
■  In Person: Join us for “Festival of Genomics” Jan 19-21, 2016 in London, UK
Important things first:
Where do you find additional information?
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
Big Data in Genomics:
Opportunities and
Challenges
2
What is the Hasso Plattner Institute, Potsdam, Germany?
Schapranow, HPI, Oct
13, 2015
Analyze Genomes: A
Federated In-Memory
Database Computing
Platform
3
Prof. Dr. h.c. Hasso Plattner
■ Research focuses on the technical aspects of enterprise
software and design of complex applications
□  In-Memory Data Management for Enterprise
Applications
□  Enterprise Application Programming Model
□  Scientific Data Management
□  Human-Centered Software Design and Engineering
■ Industry cooperations, e.g. SAP, Siemens, Audi, and EADS
■ Research cooperations, e.g. Stanford, MIT, and Berkeley
Hasso Plattner Institute
Enterprise Platform and Integration Concepts Group
Schapranow, HPI, Oct
13, 2015
Analyze Genomes: A
Federated In-Memory
Database Computing
Platform
4
Partner of Stanford
Center for Design
Research
Partner of MIT in
Supply Chain
Innovation and
CSAIL
Partner at
UC Berkeley
RAD / AMP Lab
Partner of SAP
AG
■  Since 2009 Program Manager E-Health & Life Sciences
■  2006-2014 Strategic Projects SAP HANA
■  Visiting Scientist at V.A., Boston, MA and Charité, Berlin
■  Software Engineer by training (PhD, M.Sc., B.Sc.)
With whom are you dealing?
Schapranow, HPI, Oct
13, 2015
Analyze Genomes: A
Federated In-Memory
Database Computing
Platform
5
■  Patients
□  Individual anamnesis, family history, and background
□  Require fast access to individualized therapy
■  Clinicians
□  Identify root and extent of disease using laboratory tests
□  Evaluate therapy alternatives, adapt existing therapy
■  Researchers
□  Conduct laboratory work, e.g. analyze patient samples
□  Create new research findings and come-up with treatment alternatives
The Setting
Actors in Oncology
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
6
Big Data in Genomics:
Opportunities and
Challenges
IT Challenges
Distributed Heterogeneous Data Sources
7
Human genome/biological data
600GB per full genome
15PB+ in databases of leading institutes
Prescription data
1.5B records from 10,000 doctors and
10M Patients (100 GB)
Clinical trials
Currently more than 30k
recruiting on ClinicalTrials.gov
Human proteome
160M data points (2.4GB) per sample
>3TB raw proteome data in ProteomicsDB
PubMed database
>23M articles
Hospital information systems
Often more than 50GB
Medical sensor data
Scan of a single organ in 1s
creates 10GB of raw dataCancer patient records
>160k records at NCT
Big Data in Genomics:
Opportunities and
Challenges
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
Schapranow, HPI, Oct
13, 2015
Our Approach
Analyze Genomes: Real-time Analysis of Big Medical Data
8
In-Memory Database
Extensions for Life Sciences
Data Exchange,
App Store
Access Control,
Data Protection
Fair Use
Statistical
Tools
Real-time
Analysis
App-spanning
User Profiles
Combined and Linked Data
Genome
Data
Cellular
Pathways
Genome
Metadata
Research
Publications
Pipeline and
Analysis Models
Drugs and
Interactions
Analyze Genomes: A
Federated In-Memory
Database Computing
Platform
Drug Response
Analysis
Pathway Topology
Analysis
Medical
Knowledge CockpitOncolyzer
Clinical Trial
Recruitment
Cohort
Analysis
...
Indexed
Sources
Case Vignette
■  Patient: 48 years, female, non-smoker, smoke-free environment
■  Diagnosis: Non-Small Cell Lung Cancer (NSCLC), stage IV
■  Markers: KRAS, EGFR, BRAF, NRAS, (ERBB2)
■  Initial treatment: Surgery
■  Therapy: Palliative chemotherapy
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
Big Data in Genomics:
Opportunities and
Challenges
9
Cloud-based Services for Processing of DNA Data
■  Control center for processing of raw DNA data, such as
FASTQ, SAM, and VCF
■  Personal user profile guarantees privacy of uploaded
and processed data
■  Supports reproducible research process by storing all
relevant process parameters
■  Implements prioritized data processing and fair use, e.g.
per department or per institute
■  Supports additional service, such as data annotations,
billing, and sharing for all Analyze Genomes services
■  Honored by the 2014 European Life Science Award
Big Data in Genomics:
Opportunities and
Challenges
Standardized Modeling and
runtime environment for
analysis pipelines
10
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
■  Query-oriented search interface
■  Seamless integration of patient specifics, e.g. from EMR
■  Parallel search in international knowledge bases, e.g. for biomarkers, literature,
cellular pathway, and clinical trials
Medical Knowledge Cockpit for Patients and Clinicians
Linking Patient Specifics with International Knowledge
Big Data in Genomics:
Opportunities and
Challenges
11
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
Medical Knowledge Cockpit for Patients and Clinicians
■  Search for affected genes in distributed and
heterogeneous data sources
■  Immediate exploration of relevant information, such as
□  Gene descriptions,
□  Molecular impact and related pathways,
□  Scientific publications, and
□  Suitable clinical trials.
■  No manual searching for hours or days:
In-memory technology translates searching into
interactive finding!
Big Data in Genomics:
Opportunities and
Challenges
Automatic clinical trial
matching build on text
analysis features
Unified access to structured
and un-structured data
sources
12
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
Medical Knowledge Cockpit for Patients and Clinicians
Pathway Topology Analysis
■  Search in pathways is limited to “is a certain
element contained” today
■  Integrated >1,5k pathways from international
sources, e.g. KEGG, HumanCyc, and WikiPathways,
into HANA
■  Implemented graph-based topology exploration and
ranking based on patient specifics
■  Enables interactive identification of possible
dysfunctions affecting the course of a therapy
before its start
Big Data in Genomics:
Opportunities and
Challenges
Unified access to multiple formerly
disjoint data sources
Pathway analysis of genetic
variants with graph engine
13
Real-time Data Analysis and
Interactive Exploration
Drug Response Analysis
Data Sources
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
Big Data in Genomics:
Opportunities and
Challenges
Smoking status,
tumor classification
and age
(1MB - 100MB)
Raw DNA data
and genetic variants
(100MB - 1TB)
Medication efficiency
and wet lab results
(10MB - 1GB)
14
Patient-specific
Data
Tumor-specific
Data
Compound
Interaction Data
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
Big Data in Genomics:
Opportunities and
Challenges
15
Showcase
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
Big Data in Genomics:
Opportunities and
Challenges
16
Calculating Drug Response…Predict Drug Response
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
Big Data in Genomics:
Opportunities and
Challenges
17
cetuximab might be more
beneficial for the current case
Our Methodology
Design Thinking
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
Big Data in Genomics:
Opportunities and
Challenges
18
Our Methodology
Design Thinking
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
Big Data in Genomics:
Opportunities and
Challenges
19
Desirability
■  Portfolio of integrated services for clinicians, researchers, and patients
■  Include latest treatment option, e.g. most effective therapies
Viability
■  Enable precision medicine also in far-off
regions and developing countries
■  Involve word-wide experts (cost-saving)
■  Combine latest international data
(publications, annotations, genome data)
Feasibility
■  HiSeq 2500 enables high-coverage
whole genome sequencing in 20h
■  IMDB enables allele frequency
determination of 12B records within <1s
■  Cloud-based data processing services
reduce TCO
Combined column
and row store
Map/Reduce Single and
multi-tenancy
Lightweight
compression
Insert only
for time travel
Real-time
replication
Working on
integers
SQL interface on
columns and rows
Active/passive
data store
Minimal
projections
Group key Reduction of
software layers
Dynamic multi-
threading
Bulk load
of data
Object-
relational
mapping
Text retrieval
and extraction engine
No aggregate
tables
Data partitioning Any attribute
as index
No disk
On-the-fly
extensibility
Analytics on
historical data
Multi-core/
parallelization
Our Technology
In-Memory Database Technology
+
++
+
+
P
v
+++
t
SQL
x
x
T
disk
20
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
Big Data in Genomics:
Opportunities and
Challenges
■  For patients
□  Identify relevant clinical trials and medical experts
□  Become an informed patient
■  For clinicians
□  Identify pharmacokinetic correlations
□  Scan for similar patient cases, e.g. to evaluate therapy efficiency
■  For researchers
□  Enable real-time analysis of medical data, e.g. assess pathways
to identify impact of detected variants
□  Combined mining in structured and unstructured data, e.g. publications,
diagnosis, and EMR data
What to Take Home?
Test it Yourself: AnalyzeGenomes.com
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
21
Big Data in Genomics:
Opportunities and
Challenges
Keep in contact with us!
Hasso Plattner Institute
Enterprise Platform & Integration Concepts (EPIC)
Program Manager E-Health
Dr. Matthieu-P. Schapranow
August-Bebel-Str. 88
14482 Potsdam, Germany
Dr. Matthieu-P. Schapranow
schapranow@hpi.de
http://we.analyzegenomes.com/
Schapranow, Bio Data,
Cambridge, UK, Oct 22,
2015
Big Data in Genomics:
Opportunities and
Challenges
22

More Related Content

What's hot

Analyze Genomes Services for Precision Medicine
Analyze Genomes Services for Precision MedicineAnalyze Genomes Services for Precision Medicine
Analyze Genomes Services for Precision MedicineMatthieu Schapranow
 
Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...
Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...
Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...Matthieu Schapranow
 
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...Matthieu Schapranow
 
In-Memory Data Management for Systems Medicine
In-Memory Data Management for Systems MedicineIn-Memory Data Management for Systems Medicine
In-Memory Data Management for Systems MedicineMatthieu Schapranow
 
Analyze Genomes Services for Precision Medicine
Analyze Genomes Services for Precision MedicineAnalyze Genomes Services for Precision Medicine
Analyze Genomes Services for Precision MedicineMatthieu Schapranow
 
Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...
Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...
Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...Matthieu Schapranow
 
Big Medical Data – Challenge or Potential?
Big Medical Data – Challenge or Potential?Big Medical Data – Challenge or Potential?
Big Medical Data – Challenge or Potential?Matthieu Schapranow
 
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...Matthieu Schapranow
 
Analyze Genomes: Drug Response Analysis
Analyze Genomes: Drug Response AnalysisAnalyze Genomes: Drug Response Analysis
Analyze Genomes: Drug Response AnalysisMatthieu Schapranow
 
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital HealthAnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital HealthMatthieu Schapranow
 
ICT Platform to Enable Consortium Work for Systems Medicine of Heart Failure
ICT Platform to Enable Consortium Work for Systems Medicine of Heart FailureICT Platform to Enable Consortium Work for Systems Medicine of Heart Failure
ICT Platform to Enable Consortium Work for Systems Medicine of Heart FailureMatthieu Schapranow
 
Festival of Genomics 2016 London: What to take home?
Festival of Genomics 2016 London: What to take home?Festival of Genomics 2016 London: What to take home?
Festival of Genomics 2016 London: What to take home?Matthieu Schapranow
 
Analyze Genomes: In-memory Apps for Next-generation Life Sciences Research
Analyze Genomes: In-memory Apps for Next-generation Life Sciences ResearchAnalyze Genomes: In-memory Apps for Next-generation Life Sciences Research
Analyze Genomes: In-memory Apps for Next-generation Life Sciences ResearchMatthieu Schapranow
 
Patient Journey in Oncology 2025: Molecular Tumour Boards in Practice
Patient Journey in Oncology 2025: Molecular Tumour Boards in PracticePatient Journey in Oncology 2025: Molecular Tumour Boards in Practice
Patient Journey in Oncology 2025: Molecular Tumour Boards in PracticeMatthieu Schapranow
 
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...Matthieu Schapranow
 
In-Memory Apps for Precision Medicine
In-Memory Apps for Precision MedicineIn-Memory Apps for Precision Medicine
In-Memory Apps for Precision MedicineMatthieu Schapranow
 
How will AI affect the patient journey of the future?
How will AI affect the patient journey of the future?How will AI affect the patient journey of the future?
How will AI affect the patient journey of the future?Matthieu Schapranow
 
Festival of Genomics 2016 London: Analyze Genomes: Real-world Examples
Festival of Genomics 2016 London: Analyze Genomes: Real-world ExamplesFestival of Genomics 2016 London: Analyze Genomes: Real-world Examples
Festival of Genomics 2016 London: Analyze Genomes: Real-world ExamplesMatthieu Schapranow
 
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...Matthieu Schapranow
 

What's hot (20)

Analyze Genomes Services for Precision Medicine
Analyze Genomes Services for Precision MedicineAnalyze Genomes Services for Precision Medicine
Analyze Genomes Services for Precision Medicine
 
Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...
Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...
Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...
 
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
 
In-Memory Data Management for Systems Medicine
In-Memory Data Management for Systems MedicineIn-Memory Data Management for Systems Medicine
In-Memory Data Management for Systems Medicine
 
"When time matters..."
"When time matters...""When time matters..."
"When time matters..."
 
Analyze Genomes Services for Precision Medicine
Analyze Genomes Services for Precision MedicineAnalyze Genomes Services for Precision Medicine
Analyze Genomes Services for Precision Medicine
 
Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...
Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...
Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...
 
Big Medical Data – Challenge or Potential?
Big Medical Data – Challenge or Potential?Big Medical Data – Challenge or Potential?
Big Medical Data – Challenge or Potential?
 
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
 
Analyze Genomes: Drug Response Analysis
Analyze Genomes: Drug Response AnalysisAnalyze Genomes: Drug Response Analysis
Analyze Genomes: Drug Response Analysis
 
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital HealthAnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
 
ICT Platform to Enable Consortium Work for Systems Medicine of Heart Failure
ICT Platform to Enable Consortium Work for Systems Medicine of Heart FailureICT Platform to Enable Consortium Work for Systems Medicine of Heart Failure
ICT Platform to Enable Consortium Work for Systems Medicine of Heart Failure
 
Festival of Genomics 2016 London: What to take home?
Festival of Genomics 2016 London: What to take home?Festival of Genomics 2016 London: What to take home?
Festival of Genomics 2016 London: What to take home?
 
Analyze Genomes: In-memory Apps for Next-generation Life Sciences Research
Analyze Genomes: In-memory Apps for Next-generation Life Sciences ResearchAnalyze Genomes: In-memory Apps for Next-generation Life Sciences Research
Analyze Genomes: In-memory Apps for Next-generation Life Sciences Research
 
Patient Journey in Oncology 2025: Molecular Tumour Boards in Practice
Patient Journey in Oncology 2025: Molecular Tumour Boards in PracticePatient Journey in Oncology 2025: Molecular Tumour Boards in Practice
Patient Journey in Oncology 2025: Molecular Tumour Boards in Practice
 
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
 
In-Memory Apps for Precision Medicine
In-Memory Apps for Precision MedicineIn-Memory Apps for Precision Medicine
In-Memory Apps for Precision Medicine
 
How will AI affect the patient journey of the future?
How will AI affect the patient journey of the future?How will AI affect the patient journey of the future?
How will AI affect the patient journey of the future?
 
Festival of Genomics 2016 London: Analyze Genomes: Real-world Examples
Festival of Genomics 2016 London: Analyze Genomes: Real-world ExamplesFestival of Genomics 2016 London: Analyze Genomes: Real-world Examples
Festival of Genomics 2016 London: Analyze Genomes: Real-world Examples
 
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
 

Similar to Big Data in Genomics: Opportunities and Challenges

Festival of Genomics 2016 London: Challenges of Big Medical Data?
Festival of Genomics 2016 London: Challenges of Big Medical Data?Festival of Genomics 2016 London: Challenges of Big Medical Data?
Festival of Genomics 2016 London: Challenges of Big Medical Data?Matthieu Schapranow
 
Turning Big Data into Precision Medicine
Turning Big Data into Precision MedicineTurning Big Data into Precision Medicine
Turning Big Data into Precision MedicineMatthieu Schapranow
 
Introduction to High-performance In-memory Genome Project at HPI
Introduction to High-performance In-memory Genome Project at HPI Introduction to High-performance In-memory Genome Project at HPI
Introduction to High-performance In-memory Genome Project at HPI Matthieu Schapranow
 
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...D3 Consutling
 
Cancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data CommonsCancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data CommonsWarren Kibbe
 
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...Matthieu Schapranow
 
cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)Pistoia Alliance
 
Enabling Real-time Genome Data Research with In-memory Database Technology (S...
Enabling Real-time Genome Data Research with In-memory Database Technology (S...Enabling Real-time Genome Data Research with In-memory Database Technology (S...
Enabling Real-time Genome Data Research with In-memory Database Technology (S...Matthieu Schapranow
 
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...Juan Antonio Vizcaino
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Ian Foster
 
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016Kees van Bochove
 
Cancer moonshot and data sharing
Cancer moonshot and data sharingCancer moonshot and data sharing
Cancer moonshot and data sharingWarren Kibbe
 
In-memory Applications for Oncology
In-memory Applications for OncologyIn-memory Applications for Oncology
In-memory Applications for OncologyMatthieu Schapranow
 
Next Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problemNext Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problemSubhendu Dey
 
Big Data Analytics in the Health Domain
Big Data Analytics in the Health DomainBig Data Analytics in the Health Domain
Big Data Analytics in the Health DomainBigData_Europe
 
How Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision MedicineHow Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision MedicineMatthieu Schapranow
 
Big Data in Biomedicine: Where is the NIH Headed
Big Data in Biomedicine: Where is the NIH HeadedBig Data in Biomedicine: Where is the NIH Headed
Big Data in Biomedicine: Where is the NIH HeadedPhilip Bourne
 
Data Commons & Data Science Workshop
Data Commons & Data Science WorkshopData Commons & Data Science Workshop
Data Commons & Data Science WorkshopWarren Kibbe
 

Similar to Big Data in Genomics: Opportunities and Challenges (20)

Festival of Genomics 2016 London: Challenges of Big Medical Data?
Festival of Genomics 2016 London: Challenges of Big Medical Data?Festival of Genomics 2016 London: Challenges of Big Medical Data?
Festival of Genomics 2016 London: Challenges of Big Medical Data?
 
Turning Big Data into Precision Medicine
Turning Big Data into Precision MedicineTurning Big Data into Precision Medicine
Turning Big Data into Precision Medicine
 
Introduction to High-performance In-memory Genome Project at HPI
Introduction to High-performance In-memory Genome Project at HPI Introduction to High-performance In-memory Genome Project at HPI
Introduction to High-performance In-memory Genome Project at HPI
 
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
Healthcare Conference 2013 : Toekomstvisie op ICT in de gezondheidszorg - pro...
 
Cancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data CommonsCancer Moonshot, Data sharing and the Genomic Data Commons
Cancer Moonshot, Data sharing and the Genomic Data Commons
 
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
 
cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)
 
Enabling Real-time Genome Data Research with In-memory Database Technology (S...
Enabling Real-time Genome Data Research with In-memory Database Technology (S...Enabling Real-time Genome Data Research with In-memory Database Technology (S...
Enabling Real-time Genome Data Research with In-memory Database Technology (S...
 
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
Developing open data analysis pipelines in the cloud: Enabling the ‘big data’...
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
Julie Corfield
Julie CorfieldJulie Corfield
Julie Corfield
 
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016Using Healthcare Data for Research @ The Hyve - Campus Party 2016
Using Healthcare Data for Research @ The Hyve - Campus Party 2016
 
Cancer moonshot and data sharing
Cancer moonshot and data sharingCancer moonshot and data sharing
Cancer moonshot and data sharing
 
Open data genomics_palermo_2017_ver03
Open data genomics_palermo_2017_ver03Open data genomics_palermo_2017_ver03
Open data genomics_palermo_2017_ver03
 
In-memory Applications for Oncology
In-memory Applications for OncologyIn-memory Applications for Oncology
In-memory Applications for Oncology
 
Next Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problemNext Gen Sequencing and Associated Big Data / AI problem
Next Gen Sequencing and Associated Big Data / AI problem
 
Big Data Analytics in the Health Domain
Big Data Analytics in the Health DomainBig Data Analytics in the Health Domain
Big Data Analytics in the Health Domain
 
How Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision MedicineHow Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision Medicine
 
Big Data in Biomedicine: Where is the NIH Headed
Big Data in Biomedicine: Where is the NIH HeadedBig Data in Biomedicine: Where is the NIH Headed
Big Data in Biomedicine: Where is the NIH Headed
 
Data Commons & Data Science Workshop
Data Commons & Data Science WorkshopData Commons & Data Science Workshop
Data Commons & Data Science Workshop
 

Recently uploaded

AI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarAI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarPrecisely
 
Nanopower In Semiconductor Industry.pdf
Nanopower  In Semiconductor Industry.pdfNanopower  In Semiconductor Industry.pdf
Nanopower In Semiconductor Industry.pdfPedro Manuel
 
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdf
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdf
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdfJamie (Taka) Wang
 
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...DianaGray10
 
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostKubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostMatt Ray
 
How Accurate are Carbon Emissions Projections?
How Accurate are Carbon Emissions Projections?How Accurate are Carbon Emissions Projections?
How Accurate are Carbon Emissions Projections?IES VE
 
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...Aggregage
 
COMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online CollaborationCOMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online Collaborationbruanjhuli
 
Meet the new FSP 3000 M-Flex800™
Meet the new FSP 3000 M-Flex800™Meet the new FSP 3000 M-Flex800™
Meet the new FSP 3000 M-Flex800™Adtran
 
Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024SkyPlanner
 
Cybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxCybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxGDSC PJATK
 
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Will Schroeder
 
Comparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and IstioComparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and IstioChristian Posta
 
20230202 - Introduction to tis-py
20230202 - Introduction to tis-py20230202 - Introduction to tis-py
20230202 - Introduction to tis-pyJamie (Taka) Wang
 
Empowering Africa's Next Generation: The AI Leadership Blueprint
Empowering Africa's Next Generation: The AI Leadership BlueprintEmpowering Africa's Next Generation: The AI Leadership Blueprint
Empowering Africa's Next Generation: The AI Leadership BlueprintMahmoud Rabie
 
Videogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdfVideogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdfinfogdgmi
 
COMPUTER 10 Lesson 8 - Building a Website
COMPUTER 10 Lesson 8 - Building a WebsiteCOMPUTER 10 Lesson 8 - Building a Website
COMPUTER 10 Lesson 8 - Building a Websitedgelyza
 
Introduction to Matsuo Laboratory (ENG).pptx
Introduction to Matsuo Laboratory (ENG).pptxIntroduction to Matsuo Laboratory (ENG).pptx
Introduction to Matsuo Laboratory (ENG).pptxMatsuo Lab
 

Recently uploaded (20)

AI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarAI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity Webinar
 
Nanopower In Semiconductor Industry.pdf
Nanopower  In Semiconductor Industry.pdfNanopower  In Semiconductor Industry.pdf
Nanopower In Semiconductor Industry.pdf
 
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdf
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdf
activity_diagram_combine_v4_20190827.pdfactivity_diagram_combine_v4_20190827.pdf
 
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
 
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostKubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
 
How Accurate are Carbon Emissions Projections?
How Accurate are Carbon Emissions Projections?How Accurate are Carbon Emissions Projections?
How Accurate are Carbon Emissions Projections?
 
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
 
COMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online CollaborationCOMPUTER 10: Lesson 7 - File Storage and Online Collaboration
COMPUTER 10: Lesson 7 - File Storage and Online Collaboration
 
201610817 - edge part1
201610817 - edge part1201610817 - edge part1
201610817 - edge part1
 
Meet the new FSP 3000 M-Flex800™
Meet the new FSP 3000 M-Flex800™Meet the new FSP 3000 M-Flex800™
Meet the new FSP 3000 M-Flex800™
 
Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024
 
Cybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxCybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptx
 
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
Apres-Cyber - The Data Dilemma: Bridging Offensive Operations and Machine Lea...
 
Comparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and IstioComparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and Istio
 
20230202 - Introduction to tis-py
20230202 - Introduction to tis-py20230202 - Introduction to tis-py
20230202 - Introduction to tis-py
 
Empowering Africa's Next Generation: The AI Leadership Blueprint
Empowering Africa's Next Generation: The AI Leadership BlueprintEmpowering Africa's Next Generation: The AI Leadership Blueprint
Empowering Africa's Next Generation: The AI Leadership Blueprint
 
Videogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdfVideogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdf
 
20150722 - AGV
20150722 - AGV20150722 - AGV
20150722 - AGV
 
COMPUTER 10 Lesson 8 - Building a Website
COMPUTER 10 Lesson 8 - Building a WebsiteCOMPUTER 10 Lesson 8 - Building a Website
COMPUTER 10 Lesson 8 - Building a Website
 
Introduction to Matsuo Laboratory (ENG).pptx
Introduction to Matsuo Laboratory (ENG).pptxIntroduction to Matsuo Laboratory (ENG).pptx
Introduction to Matsuo Laboratory (ENG).pptx
 

Big Data in Genomics: Opportunities and Challenges

  • 1. Big Data in Genomics: Opportunities and Challenges Dr. Matthieu-P. Schapranow Bio Data World Congress, Cambridge, UK Oct 22, 2015
  • 2. ■  Online: Visit we.analyzegenomes.com for latest research results, tools, and news ■  Offline: Read more about it, e.g. High-Performance In-Memory Genome Data Analysis: How In-Memory Database Technology Accelerates Personalized Medicine, In-Memory Data Management Research, Springer, ISBN: 978-3-319-03034-0, 2014 ■  In Person: Join us for “Festival of Genomics” Jan 19-21, 2016 in London, UK Important things first: Where do you find additional information? Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015 Big Data in Genomics: Opportunities and Challenges 2
  • 3. What is the Hasso Plattner Institute, Potsdam, Germany? Schapranow, HPI, Oct 13, 2015 Analyze Genomes: A Federated In-Memory Database Computing Platform 3
  • 4. Prof. Dr. h.c. Hasso Plattner ■ Research focuses on the technical aspects of enterprise software and design of complex applications □  In-Memory Data Management for Enterprise Applications □  Enterprise Application Programming Model □  Scientific Data Management □  Human-Centered Software Design and Engineering ■ Industry cooperations, e.g. SAP, Siemens, Audi, and EADS ■ Research cooperations, e.g. Stanford, MIT, and Berkeley Hasso Plattner Institute Enterprise Platform and Integration Concepts Group Schapranow, HPI, Oct 13, 2015 Analyze Genomes: A Federated In-Memory Database Computing Platform 4 Partner of Stanford Center for Design Research Partner of MIT in Supply Chain Innovation and CSAIL Partner at UC Berkeley RAD / AMP Lab Partner of SAP AG
  • 5. ■  Since 2009 Program Manager E-Health & Life Sciences ■  2006-2014 Strategic Projects SAP HANA ■  Visiting Scientist at V.A., Boston, MA and Charité, Berlin ■  Software Engineer by training (PhD, M.Sc., B.Sc.) With whom are you dealing? Schapranow, HPI, Oct 13, 2015 Analyze Genomes: A Federated In-Memory Database Computing Platform 5
  • 6. ■  Patients □  Individual anamnesis, family history, and background □  Require fast access to individualized therapy ■  Clinicians □  Identify root and extent of disease using laboratory tests □  Evaluate therapy alternatives, adapt existing therapy ■  Researchers □  Conduct laboratory work, e.g. analyze patient samples □  Create new research findings and come-up with treatment alternatives The Setting Actors in Oncology Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015 6 Big Data in Genomics: Opportunities and Challenges
  • 7. IT Challenges Distributed Heterogeneous Data Sources 7 Human genome/biological data 600GB per full genome 15PB+ in databases of leading institutes Prescription data 1.5B records from 10,000 doctors and 10M Patients (100 GB) Clinical trials Currently more than 30k recruiting on ClinicalTrials.gov Human proteome 160M data points (2.4GB) per sample >3TB raw proteome data in ProteomicsDB PubMed database >23M articles Hospital information systems Often more than 50GB Medical sensor data Scan of a single organ in 1s creates 10GB of raw dataCancer patient records >160k records at NCT Big Data in Genomics: Opportunities and Challenges Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015
  • 8. Schapranow, HPI, Oct 13, 2015 Our Approach Analyze Genomes: Real-time Analysis of Big Medical Data 8 In-Memory Database Extensions for Life Sciences Data Exchange, App Store Access Control, Data Protection Fair Use Statistical Tools Real-time Analysis App-spanning User Profiles Combined and Linked Data Genome Data Cellular Pathways Genome Metadata Research Publications Pipeline and Analysis Models Drugs and Interactions Analyze Genomes: A Federated In-Memory Database Computing Platform Drug Response Analysis Pathway Topology Analysis Medical Knowledge CockpitOncolyzer Clinical Trial Recruitment Cohort Analysis ... Indexed Sources
  • 9. Case Vignette ■  Patient: 48 years, female, non-smoker, smoke-free environment ■  Diagnosis: Non-Small Cell Lung Cancer (NSCLC), stage IV ■  Markers: KRAS, EGFR, BRAF, NRAS, (ERBB2) ■  Initial treatment: Surgery ■  Therapy: Palliative chemotherapy Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015 Big Data in Genomics: Opportunities and Challenges 9
  • 10. Cloud-based Services for Processing of DNA Data ■  Control center for processing of raw DNA data, such as FASTQ, SAM, and VCF ■  Personal user profile guarantees privacy of uploaded and processed data ■  Supports reproducible research process by storing all relevant process parameters ■  Implements prioritized data processing and fair use, e.g. per department or per institute ■  Supports additional service, such as data annotations, billing, and sharing for all Analyze Genomes services ■  Honored by the 2014 European Life Science Award Big Data in Genomics: Opportunities and Challenges Standardized Modeling and runtime environment for analysis pipelines 10 Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015
  • 11. ■  Query-oriented search interface ■  Seamless integration of patient specifics, e.g. from EMR ■  Parallel search in international knowledge bases, e.g. for biomarkers, literature, cellular pathway, and clinical trials Medical Knowledge Cockpit for Patients and Clinicians Linking Patient Specifics with International Knowledge Big Data in Genomics: Opportunities and Challenges 11 Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015
  • 12. Medical Knowledge Cockpit for Patients and Clinicians ■  Search for affected genes in distributed and heterogeneous data sources ■  Immediate exploration of relevant information, such as □  Gene descriptions, □  Molecular impact and related pathways, □  Scientific publications, and □  Suitable clinical trials. ■  No manual searching for hours or days: In-memory technology translates searching into interactive finding! Big Data in Genomics: Opportunities and Challenges Automatic clinical trial matching build on text analysis features Unified access to structured and un-structured data sources 12 Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015
  • 13. Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015 Medical Knowledge Cockpit for Patients and Clinicians Pathway Topology Analysis ■  Search in pathways is limited to “is a certain element contained” today ■  Integrated >1,5k pathways from international sources, e.g. KEGG, HumanCyc, and WikiPathways, into HANA ■  Implemented graph-based topology exploration and ranking based on patient specifics ■  Enables interactive identification of possible dysfunctions affecting the course of a therapy before its start Big Data in Genomics: Opportunities and Challenges Unified access to multiple formerly disjoint data sources Pathway analysis of genetic variants with graph engine 13
  • 14. Real-time Data Analysis and Interactive Exploration Drug Response Analysis Data Sources Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015 Big Data in Genomics: Opportunities and Challenges Smoking status, tumor classification and age (1MB - 100MB) Raw DNA data and genetic variants (100MB - 1TB) Medication efficiency and wet lab results (10MB - 1GB) 14 Patient-specific Data Tumor-specific Data Compound Interaction Data
  • 15. Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015 Big Data in Genomics: Opportunities and Challenges 15
  • 16. Showcase Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015 Big Data in Genomics: Opportunities and Challenges 16 Calculating Drug Response…Predict Drug Response
  • 17. Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015 Big Data in Genomics: Opportunities and Challenges 17 cetuximab might be more beneficial for the current case
  • 18. Our Methodology Design Thinking Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015 Big Data in Genomics: Opportunities and Challenges 18
  • 19. Our Methodology Design Thinking Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015 Big Data in Genomics: Opportunities and Challenges 19 Desirability ■  Portfolio of integrated services for clinicians, researchers, and patients ■  Include latest treatment option, e.g. most effective therapies Viability ■  Enable precision medicine also in far-off regions and developing countries ■  Involve word-wide experts (cost-saving) ■  Combine latest international data (publications, annotations, genome data) Feasibility ■  HiSeq 2500 enables high-coverage whole genome sequencing in 20h ■  IMDB enables allele frequency determination of 12B records within <1s ■  Cloud-based data processing services reduce TCO
  • 20. Combined column and row store Map/Reduce Single and multi-tenancy Lightweight compression Insert only for time travel Real-time replication Working on integers SQL interface on columns and rows Active/passive data store Minimal projections Group key Reduction of software layers Dynamic multi- threading Bulk load of data Object- relational mapping Text retrieval and extraction engine No aggregate tables Data partitioning Any attribute as index No disk On-the-fly extensibility Analytics on historical data Multi-core/ parallelization Our Technology In-Memory Database Technology + ++ + + P v +++ t SQL x x T disk 20 Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015 Big Data in Genomics: Opportunities and Challenges
  • 21. ■  For patients □  Identify relevant clinical trials and medical experts □  Become an informed patient ■  For clinicians □  Identify pharmacokinetic correlations □  Scan for similar patient cases, e.g. to evaluate therapy efficiency ■  For researchers □  Enable real-time analysis of medical data, e.g. assess pathways to identify impact of detected variants □  Combined mining in structured and unstructured data, e.g. publications, diagnosis, and EMR data What to Take Home? Test it Yourself: AnalyzeGenomes.com Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015 21 Big Data in Genomics: Opportunities and Challenges
  • 22. Keep in contact with us! Hasso Plattner Institute Enterprise Platform & Integration Concepts (EPIC) Program Manager E-Health Dr. Matthieu-P. Schapranow August-Bebel-Str. 88 14482 Potsdam, Germany Dr. Matthieu-P. Schapranow schapranow@hpi.de http://we.analyzegenomes.com/ Schapranow, Bio Data, Cambridge, UK, Oct 22, 2015 Big Data in Genomics: Opportunities and Challenges 22