SlideShare ist ein Scribd-Unternehmen logo
1 von 23
Downloaden Sie, um offline zu lesen
A Platform for Integrated
Genome Data Analysis
All About Data Congress, UMC Groningen

January 30th, 2014

Dr. Matthieu Schapranow
Hasso Plattner Institute
Hasso Plattner Institute
Key Facts
2

■  Founded as a public-private partnership
in 1998 in Potsdam near Berlin, Germany
■  Institute belongs to the
University of Potsdam
■  Ranked 1st in CHE since 2009
■  500 B.Sc. and M.Sc. students
■  10 professors, 150 PhD students
■  Course of study: IT Systems Engineering

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
Hasso Plattner Institute
Programs
3

■  Full university curriculum
■  Bachelor (6 semesters)
■  Master (4 semesters)

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
Hasso Plattner Institute
Enterprise Platform and Integration Concepts Group
4

Prof. Dr. h.c. Hasso Plattner
■  Research focuses on the technical aspects of enterprise
software and design of complex applications
□  In-Memory Data Management for
Enterprise Applications
□  Enterprise Application Programming Model
□  Scientific Data Management
□  Human-Centered Software Design and Engineering
■  Industry cooperations, e.g. SAP, Siemens, Audi, and EADS
■  Research cooperations, e.g. Stanford, MIT, and Berkeley

Partner of Stanford Center
for Design Research

Partner of MIT in Supply
Chain Innovation and CSAIL

Partner at UC Berkeley

RAD / AMP Lab

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
Our Motivation
Personalized Medicine
5

■  Motivation: Can we analyze the entire data of a patient, incl.
Electronic Health Records (EHR) and genome data, during a
doctor’s visit?
■  Genome data analysis may sum up to weeks,
i.e. biopsy, biological preparation, sequencing,
alignment, variant calling, full analysis, and evaluation
■  Issue: Complex and time-consuming data processing tasks
■  In-memory technology accelerates genome data processing
□  Highly parallel alignment / variant calling
□  Real-time analysis of individual patient or cohort data
□  Combined search in structured / unstructured data
A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
Our Challenge
Distributed Big Data Sources
6

Human genome/biological data
600GB per full genome
15PB+ in databases of leading institutes

Hospital information systems
Often more than 50GB

Cancer patient records
>160k records at NCT

Prescription data
1.5B records from 10,000 doctors
and 10M Patients (100 GB)

Human proteome
160M data points (2.4GB) per sample
>3TB raw proteome data in ProteomicsDB

PubMed database
>23M articles

Medical sensor data
Scan of a single organ in 1s
creates 10GB of raw data

Clinical trials
Currently more than 30k
recruiting on ClinicalTrials.gov

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
Our Approach
In-Memory Technology
●
●
●
●

Read Event
Read Event
Repositories
Repositories

Combined
column
and row store
Insert only
for time travel

+

+
+
++
A

up to 2.000
up to 2.000
requests
requests
per second
per second

Minimal
projections

Any attribute
as index

Bulk load

Discovery Service
Discovery Service

Multi-core/
parallelization

SAP HANA
SAP HANA

P

Active/passive
data store
Dynamic
multithreading
within nodes
No aggregate
tables

+++

up to 8.000 read
up to 8.000 read
event notifications
event notifications
per second
per second

On-the-fly
extensibility
Map
reduce

P
P

t

A
A

Lightweight
Compression

Partitioning

SQL

Analytics on
historical
data
Single and
multi-tenancy
Object to
relational
mapping
Group Key

SQL interface
on columns &
rows
Reduction of
layers

x
x

7

Verification
Verification
Services
Services

T

Text Retrieval
and Extraction
No disk

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
Our Vision
Personalized Medicine
8

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
Our Vision
Personalized Medicine
9

Desirability
■  Leveraging directed customer services
■  Portfolio of integrated services for
clinicians, researchers, and patients
■  Include latest research results, e.g. most
effective therapies
Viability
Feasibility
■  Enable personalized medicine also in faroff regions and developing countries to
exceed customer base

■  HiSeq 2500 enables highcoverage whole genome
sequencing in ≈1d

■  Share data via the Internet to get
feedback from word-wide experts (costsaving)

■  IMDB enables allele frequency
determination of 12B records
within <1s

■  Combine research data (publications,
annotations, genome data) from
international databases in a single
knowledge base

■  Identification of relevant
annotations out of 80M <1s
■  Data preparation as a service
reduces TCO

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
High-Performance In-Memory Genome Project

Integration of Genomic Data
10

■  Preprocessing of DNA
(alignment, variant
calling) can be
modeled and is
executed as integrated
process
■  Results are stored in
in-memory database
■  Real-time analysis of
genome data enables
completely new way of
research and therapies

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
High-Performance In-Memory Genome Project

Architectural Overview
11

Information and Feedback within the Window of Opportunity

Patients

Doctors

Insurers

Researchers

Real-Time Capturing
and Data Analysis
In-Memory Database

*omics

Electronic
Service Line
Medical
Items
Records
All Relevant Medical Information

...

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
High-Performance In-Memory Genome Project

Selected Research Topics
12

Improving Analyses:
■  Medical Knowledge Cockpit, Oncolyzer
■  Genome Browser enables deep dive into the genome
■  Cohort Analysis, e.g. clustering of patient cohorts
■  Combined Search, e.g. in clinical trials and side-effect databases
■  Pathways Topology Analysis, e.g. to identify cause/effect
Improving Data Preparations:
■  Graphical modeling of Genome Data Processing (GDP) pipelines
■  Scheduling and execution of multiple GPD pipelines in parallel
■  App store for medical knowledge (bring algorithms to data)
■  Exchange of sensitive data, e.g. history-based access control
■  Billing processes for intellectual property and services
A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
High-Performance In-Memory Genome Project

HANA Oncolyzer
13

■  Research initiative for
exchanging relevant tumor
data to improve treatment
■  Awarded with the 2012
Innovation Award of the
German Capitol Region
■  In-Memory Technology as
key-enabler for real-time
analysis of tumor data in
seconds instead of hours
■  Information available at your fingertips: In-Memory Technology on
mobile devices (iPad)
■  Interdisciplinary cooperation between medical doctors, researchers,
and software engineers
A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
HANA Oncolyzer
Patient Search and Details
14

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
HANA Oncolyzer
Analysis of Patient Cohorts
15

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
High-Performance In-Memory Genome Project

Medical Knowledge Cockpit

■  Search for affected genes in
distributed and heterogeneous data
sources

16

■  Immediate exploration of relevant
information, such as
□  Gene descriptions,
□  Molecular impact and related
pathways,
Unified access to
structured and
unstructured data
sources

Automatic
clinical trial
matching using
HANA text analysis
features

□  Scientific publications, and
□  Suitable clinical trials.
■  No manual searching for hours or
days – In-memory technology
translates it into interactive finding

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
High-Performance In-Memory Genome Project
Search in Structured and Unstructured Medical Data

■  Extended text analysis feature by
medical terminology

17

□  Genes (122,975 + 186,771
synonyms)
□  Medical terms and categories
(98,886 diseases + 48,561
synonyms, 47 categories)
Unified access to
structured and
unstructured data
sources

Clinical trial
matching using
text analysis features

□  Pharmaceutical ingredients
(7,099 + 5,561 synonyms)
■  Imported clinicaltrials.gov database
(145k trials/30,138 recruiting)
■  Extracted, e.g., 320k genes, 161k
ingredients, 30k periods
■  Select all studies based on multiple
filters in <500ms

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
High-Performance In-Memory Genome Project

Analysis of Patient Cohorts

■  In a patient cohort, a subset does
not respond to therapy – why?

18

■  Clustering using various statistical
algorithms, such as k-means or
hierarchical clustering
■  Calculation of all locus combinations
in which at least 5% of all TCGA
participants have mutations: 200ms
for top 20 combinations

Fast clustering
directly inside HANA

■  Individual clusters are calculated in
parallel directly within the database
■  K-means algorithm: 50ms (PAL) vs.
500ms (R)

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
High-Performance In-Memory Genome Project

Genome Browser

■  Genome Browser: Comparison of
multiple genomes with reference

19

■  Combined knowledge base: latest
relevant annotations and literature,
e.g. NCBI, dbSNP, and UCSC
■  Detailed exploration of genome
locations and existing associations
Unified access
to multiple formerly
disjoint data sources

Matching of
variants with relevant
international annotations

■  Ranked variants, e.g. accordingly to
known diseases
■  Links to more detailed sources
enable fast identification of relevant
information while eliminating longlasting searches.

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
High-Performance In-Memory Genome Project

Pathway Analysis

■  Search in pathways is limited to “is
a certain element contained” today

20

■  Integrated >1,5k pathways from
international sources, e.g. KEGG,
HumanCyc, and WikiPathways, into
HANA

Unified access
to multiple formerly
disjoint data sources

■  Implemented graph-based topology
exploration and ranking based on
patient specifics
■  Enables interactive identification of
possible dysfunctions affecting the
course of a therapy before its start

Pathway analysis
of genetic variations with
Graph Engine

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
High-Performance In-Memory Genome Project

Drug Response Testing

■  Drug response depends on
individual genetic variants of tumors

21

■  Challenge: Identification of relevant
genetic variants and their impact on
drug response is a ongoing research
activity, e.g. Xenograft models
■  Exploration of experiment results is
time-consuming and Excel-driven
Interactive analysis
of correlations between drugs
and genetic variants

■  In-memory technology enables
interactive exploration of
experiment data to leverage new
scientific insights

A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
What to take home?

Test it: http://we.AnalyzeGenomes.com
22

For researchers
■  Enable real-time analysis of genome data
■  Automatic scan of pathways to identify cellular
impact of mutations
■  Free-text search in publications, diagnosis, and EMR
data (structured and unstructured data)
For clinicians
■  Preventive diagnostics to identify risk patients
■  Indicate pharmacokinetic correlations
■  Scan for comparable patient cases
For patients
■  Identify relevant clinical trials / experts
■  Start most appropriate therapy early based on all
evidences and latest knowledge
A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
Thank you for your interest!
Keep in contact with us.
23

Dr. Matthieu-P. Schapranow
schapranow@hpi.uni-potsdam.de
http://we.analyzegenomes.com/

Hasso Plattner Institute
Enterprise Platform & Integration Concepts
Dr. Matthieu-P. Schapranow
August-Bebel-Str. 88
14482 Potsdam, Germany
A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014

Weitere ähnliche Inhalte

Was ist angesagt?

Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...
Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...
Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...Matthieu Schapranow
 
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...Matthieu Schapranow
 
In-Memory Data Management for Systems Medicine
In-Memory Data Management for Systems MedicineIn-Memory Data Management for Systems Medicine
In-Memory Data Management for Systems MedicineMatthieu Schapranow
 
A Federated In-Memory Database System for Life Sciences
A Federated In-Memory Database System for Life SciencesA Federated In-Memory Database System for Life Sciences
A Federated In-Memory Database System for Life SciencesMatthieu Schapranow
 
Big Medical Data – Challenge or Potential?
Big Medical Data – Challenge or Potential?Big Medical Data – Challenge or Potential?
Big Medical Data – Challenge or Potential?Matthieu Schapranow
 
Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...
Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...
Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...Matthieu Schapranow
 
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital HealthAnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital HealthMatthieu Schapranow
 
ICT Platform to Enable Consortium Work for Systems Medicine of Heart Failure
ICT Platform to Enable Consortium Work for Systems Medicine of Heart FailureICT Platform to Enable Consortium Work for Systems Medicine of Heart Failure
ICT Platform to Enable Consortium Work for Systems Medicine of Heart FailureMatthieu Schapranow
 
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...Matthieu Schapranow
 
Analyze Genomes: In-memory Apps for Next-generation Life Sciences Research
Analyze Genomes: In-memory Apps for Next-generation Life Sciences ResearchAnalyze Genomes: In-memory Apps for Next-generation Life Sciences Research
Analyze Genomes: In-memory Apps for Next-generation Life Sciences ResearchMatthieu Schapranow
 
In-Memory Apps for Precision Medicine
In-Memory Apps for Precision MedicineIn-Memory Apps for Precision Medicine
In-Memory Apps for Precision MedicineMatthieu Schapranow
 
Patient Journey in Oncology 2025: Molecular Tumour Boards in Practice
Patient Journey in Oncology 2025: Molecular Tumour Boards in PracticePatient Journey in Oncology 2025: Molecular Tumour Boards in Practice
Patient Journey in Oncology 2025: Molecular Tumour Boards in PracticeMatthieu Schapranow
 
A Federated In-Memory Database Computing Platform Enabling Real-Time Analysis...
A Federated In-Memory Database Computing Platform Enabling Real-Time Analysis...A Federated In-Memory Database Computing Platform Enabling Real-Time Analysis...
A Federated In-Memory Database Computing Platform Enabling Real-Time Analysis...Matthieu Schapranow
 
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...Matthieu Schapranow
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data ChallengesPhilip Bourne
 
How will AI affect the patient journey of the future?
How will AI affect the patient journey of the future?How will AI affect the patient journey of the future?
How will AI affect the patient journey of the future?Matthieu Schapranow
 
Analyze Genomes: Drug Response Analysis
Analyze Genomes: Drug Response AnalysisAnalyze Genomes: Drug Response Analysis
Analyze Genomes: Drug Response AnalysisMatthieu Schapranow
 
Festival of Genomics 2016 London: What to take home?
Festival of Genomics 2016 London: What to take home?Festival of Genomics 2016 London: What to take home?
Festival of Genomics 2016 London: What to take home?Matthieu Schapranow
 
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...Matthieu Schapranow
 

Was ist angesagt? (20)

Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...
Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...
Festival of Genomics 2016 London: Analyze Genomes: A Federated In-Memory Comp...
 
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
Gesundheit geht uns alle an: Smart Data ermöglicht passendere Entscheidungen...
 
In-Memory Data Management for Systems Medicine
In-Memory Data Management for Systems MedicineIn-Memory Data Management for Systems Medicine
In-Memory Data Management for Systems Medicine
 
A Federated In-Memory Database System for Life Sciences
A Federated In-Memory Database System for Life SciencesA Federated In-Memory Database System for Life Sciences
A Federated In-Memory Database System for Life Sciences
 
Big Medical Data – Challenge or Potential?
Big Medical Data – Challenge or Potential?Big Medical Data – Challenge or Potential?
Big Medical Data – Challenge or Potential?
 
Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...
Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...
Analyze Genomes: A Federated In-memory Database Computing Platform enabling r...
 
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital HealthAnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
AnalyzeGenomes.com: A Federated In-Memory Database Platform for Digital Health
 
ICT Platform to Enable Consortium Work for Systems Medicine of Heart Failure
ICT Platform to Enable Consortium Work for Systems Medicine of Heart FailureICT Platform to Enable Consortium Work for Systems Medicine of Heart Failure
ICT Platform to Enable Consortium Work for Systems Medicine of Heart Failure
 
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
The Driver of the Healthcare System in the 21st Century: Real-world Applicati...
 
Analyze Genomes: In-memory Apps for Next-generation Life Sciences Research
Analyze Genomes: In-memory Apps for Next-generation Life Sciences ResearchAnalyze Genomes: In-memory Apps for Next-generation Life Sciences Research
Analyze Genomes: In-memory Apps for Next-generation Life Sciences Research
 
Big Data in Life Sciences
Big Data in Life SciencesBig Data in Life Sciences
Big Data in Life Sciences
 
In-Memory Apps for Precision Medicine
In-Memory Apps for Precision MedicineIn-Memory Apps for Precision Medicine
In-Memory Apps for Precision Medicine
 
Patient Journey in Oncology 2025: Molecular Tumour Boards in Practice
Patient Journey in Oncology 2025: Molecular Tumour Boards in PracticePatient Journey in Oncology 2025: Molecular Tumour Boards in Practice
Patient Journey in Oncology 2025: Molecular Tumour Boards in Practice
 
A Federated In-Memory Database Computing Platform Enabling Real-Time Analysis...
A Federated In-Memory Database Computing Platform Enabling Real-Time Analysis...A Federated In-Memory Database Computing Platform Enabling Real-Time Analysis...
A Federated In-Memory Database Computing Platform Enabling Real-Time Analysis...
 
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
Festival of Genomics 2016 London: Analyze Genomes: Modeling and Executing Gen...
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data Challenges
 
How will AI affect the patient journey of the future?
How will AI affect the patient journey of the future?How will AI affect the patient journey of the future?
How will AI affect the patient journey of the future?
 
Analyze Genomes: Drug Response Analysis
Analyze Genomes: Drug Response AnalysisAnalyze Genomes: Drug Response Analysis
Analyze Genomes: Drug Response Analysis
 
Festival of Genomics 2016 London: What to take home?
Festival of Genomics 2016 London: What to take home?Festival of Genomics 2016 London: What to take home?
Festival of Genomics 2016 London: What to take home?
 
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
Festival of Genomics 2016 London: Mining and Processing of Unstructured Medic...
 

Andere mochten auch

Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...
Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...
Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...Maté Ongenaert
 
Workshop NGS data analysis - 3
Workshop NGS data analysis - 3Workshop NGS data analysis - 3
Workshop NGS data analysis - 3Maté Ongenaert
 
Genome voyager-beta-brochure
Genome voyager-beta-brochureGenome voyager-beta-brochure
Genome voyager-beta-brochureXing Xu
 
Easygenomics ISCB Cloud section 2012
Easygenomics ISCB Cloud section 2012Easygenomics ISCB Cloud section 2012
Easygenomics ISCB Cloud section 2012Xing Xu
 
Ecobouwers opendeur passiefhuis Lokeren
Ecobouwers opendeur passiefhuis LokerenEcobouwers opendeur passiefhuis Lokeren
Ecobouwers opendeur passiefhuis LokerenMaté Ongenaert
 
ENCODE project: brief summary of main findings
ENCODE project: brief summary of main findingsENCODE project: brief summary of main findings
ENCODE project: brief summary of main findingsMaté Ongenaert
 
Best pratices at BGI for the Challenges in the Era of Big Genomics Data
Best pratices at BGI for the Challenges in the Era of Big Genomics DataBest pratices at BGI for the Challenges in the Era of Big Genomics Data
Best pratices at BGI for the Challenges in the Era of Big Genomics DataXing Xu
 
Workshop NGS data analysis - 2
Workshop NGS data analysis - 2Workshop NGS data analysis - 2
Workshop NGS data analysis - 2Maté Ongenaert
 
Las nuevas tecnologías
Las nuevas tecnologíasLas nuevas tecnologías
Las nuevas tecnologíasMayra Canela
 
Buying or Selling an Investment Advisory Firm: A Lawyer\'s Perspective
Buying or Selling an Investment Advisory Firm: A Lawyer\'s PerspectiveBuying or Selling an Investment Advisory Firm: A Lawyer\'s Perspective
Buying or Selling an Investment Advisory Firm: A Lawyer\'s Perspectivejimeccleston
 
WSPDC June 2013: Working with Large Lists in SharePoint 2010 by Zewdi Solomon
WSPDC June 2013: Working with Large Lists in SharePoint 2010 by Zewdi SolomonWSPDC June 2013: Working with Large Lists in SharePoint 2010 by Zewdi Solomon
WSPDC June 2013: Working with Large Lists in SharePoint 2010 by Zewdi SolomonWSPDC & FEDSPUG
 
#GetsmART: Lessons from the Artists BLC15 Minikeynote
#GetsmART: Lessons from the Artists BLC15 Minikeynote#GetsmART: Lessons from the Artists BLC15 Minikeynote
#GetsmART: Lessons from the Artists BLC15 MinikeynoteAmy Burvall
 
How effective is the combination of my main product and ancillary
How effective is the combination of my main product and ancillaryHow effective is the combination of my main product and ancillary
How effective is the combination of my main product and ancillaryCraig Dennett
 
9.2.12 The Missional Church - Matthew 16:13-27
9.2.12 The Missional Church - Matthew 16:13-279.2.12 The Missional Church - Matthew 16:13-27
9.2.12 The Missional Church - Matthew 16:13-27Cody Nazarene Church
 
Seven most important metrics for video email marketing.
Seven most important metrics for video email marketing. Seven most important metrics for video email marketing.
Seven most important metrics for video email marketing. Vidoomail
 
Open Badges: A fit for purpose credit mechanism
Open Badges: A fit for purpose credit mechanismOpen Badges: A fit for purpose credit mechanism
Open Badges: A fit for purpose credit mechanismAmye Kenall
 
WeChangeMakers: Dmajor's TFT keynote
WeChangeMakers: Dmajor's TFT keynoteWeChangeMakers: Dmajor's TFT keynote
WeChangeMakers: Dmajor's TFT keynote경만 고
 

Andere mochten auch (20)

Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...
Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...
Strong reversal of the lung fibrosis disease signature by autotaxin inhibitor...
 
Workshop NGS data analysis - 3
Workshop NGS data analysis - 3Workshop NGS data analysis - 3
Workshop NGS data analysis - 3
 
Genome voyager-beta-brochure
Genome voyager-beta-brochureGenome voyager-beta-brochure
Genome voyager-beta-brochure
 
Easygenomics ISCB Cloud section 2012
Easygenomics ISCB Cloud section 2012Easygenomics ISCB Cloud section 2012
Easygenomics ISCB Cloud section 2012
 
Ecobouwers opendeur passiefhuis Lokeren
Ecobouwers opendeur passiefhuis LokerenEcobouwers opendeur passiefhuis Lokeren
Ecobouwers opendeur passiefhuis Lokeren
 
ENCODE project: brief summary of main findings
ENCODE project: brief summary of main findingsENCODE project: brief summary of main findings
ENCODE project: brief summary of main findings
 
Best pratices at BGI for the Challenges in the Era of Big Genomics Data
Best pratices at BGI for the Challenges in the Era of Big Genomics DataBest pratices at BGI for the Challenges in the Era of Big Genomics Data
Best pratices at BGI for the Challenges in the Era of Big Genomics Data
 
Workshop NGS data analysis - 2
Workshop NGS data analysis - 2Workshop NGS data analysis - 2
Workshop NGS data analysis - 2
 
Open human genome data
Open human genome dataOpen human genome data
Open human genome data
 
Economía Popular y Solidaria
Economía Popular y SolidariaEconomía Popular y Solidaria
Economía Popular y Solidaria
 
Las nuevas tecnologías
Las nuevas tecnologíasLas nuevas tecnologías
Las nuevas tecnologías
 
Buying or Selling an Investment Advisory Firm: A Lawyer\'s Perspective
Buying or Selling an Investment Advisory Firm: A Lawyer\'s PerspectiveBuying or Selling an Investment Advisory Firm: A Lawyer\'s Perspective
Buying or Selling an Investment Advisory Firm: A Lawyer\'s Perspective
 
WSPDC June 2013: Working with Large Lists in SharePoint 2010 by Zewdi Solomon
WSPDC June 2013: Working with Large Lists in SharePoint 2010 by Zewdi SolomonWSPDC June 2013: Working with Large Lists in SharePoint 2010 by Zewdi Solomon
WSPDC June 2013: Working with Large Lists in SharePoint 2010 by Zewdi Solomon
 
#GetsmART: Lessons from the Artists BLC15 Minikeynote
#GetsmART: Lessons from the Artists BLC15 Minikeynote#GetsmART: Lessons from the Artists BLC15 Minikeynote
#GetsmART: Lessons from the Artists BLC15 Minikeynote
 
How effective is the combination of my main product and ancillary
How effective is the combination of my main product and ancillaryHow effective is the combination of my main product and ancillary
How effective is the combination of my main product and ancillary
 
9.2.12 The Missional Church - Matthew 16:13-27
9.2.12 The Missional Church - Matthew 16:13-279.2.12 The Missional Church - Matthew 16:13-27
9.2.12 The Missional Church - Matthew 16:13-27
 
Seven most important metrics for video email marketing.
Seven most important metrics for video email marketing. Seven most important metrics for video email marketing.
Seven most important metrics for video email marketing.
 
Open Badges: A fit for purpose credit mechanism
Open Badges: A fit for purpose credit mechanismOpen Badges: A fit for purpose credit mechanism
Open Badges: A fit for purpose credit mechanism
 
WeChangeMakers: Dmajor's TFT keynote
WeChangeMakers: Dmajor's TFT keynoteWeChangeMakers: Dmajor's TFT keynote
WeChangeMakers: Dmajor's TFT keynote
 
Die....
Die....Die....
Die....
 

Ähnlich wie A Platform for Integrated Genome Data Analysis

In-memory Applications for Oncology
In-memory Applications for OncologyIn-memory Applications for Oncology
In-memory Applications for OncologyMatthieu Schapranow
 
Enabling Real-time Genome Data Research with In-memory Database Technology (S...
Enabling Real-time Genome Data Research with In-memory Database Technology (S...Enabling Real-time Genome Data Research with In-memory Database Technology (S...
Enabling Real-time Genome Data Research with In-memory Database Technology (S...Matthieu Schapranow
 
In-memory Applications for Informed Patients
In-memory Applications for Informed PatientsIn-memory Applications for Informed Patients
In-memory Applications for Informed PatientsMatthieu Schapranow
 
Introduction to High-performance In-memory Genome Project at HPI
Introduction to High-performance In-memory Genome Project at HPI Introduction to High-performance In-memory Genome Project at HPI
Introduction to High-performance In-memory Genome Project at HPI Matthieu Schapranow
 
How Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision MedicineHow Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision MedicineMatthieu Schapranow
 
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...Matthieu Schapranow
 
Turning Big Data into Precision Medicine
Turning Big Data into Precision MedicineTurning Big Data into Precision Medicine
Turning Big Data into Precision MedicineMatthieu Schapranow
 
Gaining Time – Real-time Analysis of Big Medical Data
Gaining Time – Real-time Analysis of Big Medical Data Gaining Time – Real-time Analysis of Big Medical Data
Gaining Time – Real-time Analysis of Big Medical Data SAP Technology
 
Festival of Genomics 2016 London: Analyze Genomes: Real-world Examples
Festival of Genomics 2016 London: Analyze Genomes: Real-world ExamplesFestival of Genomics 2016 London: Analyze Genomes: Real-world Examples
Festival of Genomics 2016 London: Analyze Genomes: Real-world ExamplesMatthieu Schapranow
 
A Tool for Optimizing De-Identified Health Data for Use in Statistical Classi...
A Tool for Optimizing De-Identified Health Data for Use in Statistical Classi...A Tool for Optimizing De-Identified Health Data for Use in Statistical Classi...
A Tool for Optimizing De-Identified Health Data for Use in Statistical Classi...arx-deidentifier
 
Festival of Genomics 2016 London: Challenges of Big Medical Data?
Festival of Genomics 2016 London: Challenges of Big Medical Data?Festival of Genomics 2016 London: Challenges of Big Medical Data?
Festival of Genomics 2016 London: Challenges of Big Medical Data?Matthieu Schapranow
 
Supporting a Collaborative R&D Organization with a Dynamic Big Data Solution
Supporting a Collaborative R&D Organization with a Dynamic Big Data SolutionSupporting a Collaborative R&D Organization with a Dynamic Big Data Solution
Supporting a Collaborative R&D Organization with a Dynamic Big Data SolutionSaama
 
Pistoia alliance debates analytics 15-09-2015 16.00
Pistoia alliance debates   analytics 15-09-2015 16.00Pistoia alliance debates   analytics 15-09-2015 16.00
Pistoia alliance debates analytics 15-09-2015 16.00Pistoia Alliance
 
Introduction to CTIM - the Clinical Trial Information Mediator
Introduction to CTIM - the Clinical Trial Information MediatorIntroduction to CTIM - the Clinical Trial Information Mediator
Introduction to CTIM - the Clinical Trial Information MediatorWolfgang Kuchinke
 
Gaining Time -- Real-time Analysis of Big Medical Data
Gaining Time -- Real-time Analysis of Big Medical DataGaining Time -- Real-time Analysis of Big Medical Data
Gaining Time -- Real-time Analysis of Big Medical DataMatthieu Schapranow
 
2014 11-26 EATRIS biomarkers platform meeting, Amsterdam, Organising technolo...
2014 11-26 EATRIS biomarkers platform meeting, Amsterdam, Organising technolo...2014 11-26 EATRIS biomarkers platform meeting, Amsterdam, Organising technolo...
2014 11-26 EATRIS biomarkers platform meeting, Amsterdam, Organising technolo...Alain van Gool
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...David Peyruc
 

Ähnlich wie A Platform for Integrated Genome Data Analysis (19)

In-memory Applications for Oncology
In-memory Applications for OncologyIn-memory Applications for Oncology
In-memory Applications for Oncology
 
Enabling Real-time Genome Data Research with In-memory Database Technology (S...
Enabling Real-time Genome Data Research with In-memory Database Technology (S...Enabling Real-time Genome Data Research with In-memory Database Technology (S...
Enabling Real-time Genome Data Research with In-memory Database Technology (S...
 
In-memory Applications for Informed Patients
In-memory Applications for Informed PatientsIn-memory Applications for Informed Patients
In-memory Applications for Informed Patients
 
Introduction to High-performance In-memory Genome Project at HPI
Introduction to High-performance In-memory Genome Project at HPI Introduction to High-performance In-memory Genome Project at HPI
Introduction to High-performance In-memory Genome Project at HPI
 
How Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision MedicineHow Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision Medicine
 
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
Enabling Real-Time Genome Data Research with In-Memory Database Technology (I...
 
Turning Big Data into Precision Medicine
Turning Big Data into Precision MedicineTurning Big Data into Precision Medicine
Turning Big Data into Precision Medicine
 
Gaining Time – Real-time Analysis of Big Medical Data
Gaining Time – Real-time Analysis of Big Medical Data Gaining Time – Real-time Analysis of Big Medical Data
Gaining Time – Real-time Analysis of Big Medical Data
 
Festival of Genomics 2016 London: Analyze Genomes: Real-world Examples
Festival of Genomics 2016 London: Analyze Genomes: Real-world ExamplesFestival of Genomics 2016 London: Analyze Genomes: Real-world Examples
Festival of Genomics 2016 London: Analyze Genomes: Real-world Examples
 
A Tool for Optimizing De-Identified Health Data for Use in Statistical Classi...
A Tool for Optimizing De-Identified Health Data for Use in Statistical Classi...A Tool for Optimizing De-Identified Health Data for Use in Statistical Classi...
A Tool for Optimizing De-Identified Health Data for Use in Statistical Classi...
 
Festival of Genomics 2016 London: Challenges of Big Medical Data?
Festival of Genomics 2016 London: Challenges of Big Medical Data?Festival of Genomics 2016 London: Challenges of Big Medical Data?
Festival of Genomics 2016 London: Challenges of Big Medical Data?
 
Supporting a Collaborative R&D Organization with a Dynamic Big Data Solution
Supporting a Collaborative R&D Organization with a Dynamic Big Data SolutionSupporting a Collaborative R&D Organization with a Dynamic Big Data Solution
Supporting a Collaborative R&D Organization with a Dynamic Big Data Solution
 
Markham2009
Markham2009Markham2009
Markham2009
 
Cri big data
Cri big dataCri big data
Cri big data
 
Pistoia alliance debates analytics 15-09-2015 16.00
Pistoia alliance debates   analytics 15-09-2015 16.00Pistoia alliance debates   analytics 15-09-2015 16.00
Pistoia alliance debates analytics 15-09-2015 16.00
 
Introduction to CTIM - the Clinical Trial Information Mediator
Introduction to CTIM - the Clinical Trial Information MediatorIntroduction to CTIM - the Clinical Trial Information Mediator
Introduction to CTIM - the Clinical Trial Information Mediator
 
Gaining Time -- Real-time Analysis of Big Medical Data
Gaining Time -- Real-time Analysis of Big Medical DataGaining Time -- Real-time Analysis of Big Medical Data
Gaining Time -- Real-time Analysis of Big Medical Data
 
2014 11-26 EATRIS biomarkers platform meeting, Amsterdam, Organising technolo...
2014 11-26 EATRIS biomarkers platform meeting, Amsterdam, Organising technolo...2014 11-26 EATRIS biomarkers platform meeting, Amsterdam, Organising technolo...
2014 11-26 EATRIS biomarkers platform meeting, Amsterdam, Organising technolo...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
 

Kürzlich hochgeladen

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 

Kürzlich hochgeladen (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 

A Platform for Integrated Genome Data Analysis

  • 1. A Platform for Integrated Genome Data Analysis All About Data Congress, UMC Groningen January 30th, 2014 Dr. Matthieu Schapranow Hasso Plattner Institute
  • 2. Hasso Plattner Institute Key Facts 2 ■  Founded as a public-private partnership in 1998 in Potsdam near Berlin, Germany ■  Institute belongs to the University of Potsdam ■  Ranked 1st in CHE since 2009 ■  500 B.Sc. and M.Sc. students ■  10 professors, 150 PhD students ■  Course of study: IT Systems Engineering A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 3. Hasso Plattner Institute Programs 3 ■  Full university curriculum ■  Bachelor (6 semesters) ■  Master (4 semesters) A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 4. Hasso Plattner Institute Enterprise Platform and Integration Concepts Group 4 Prof. Dr. h.c. Hasso Plattner ■  Research focuses on the technical aspects of enterprise software and design of complex applications □  In-Memory Data Management for Enterprise Applications □  Enterprise Application Programming Model □  Scientific Data Management □  Human-Centered Software Design and Engineering ■  Industry cooperations, e.g. SAP, Siemens, Audi, and EADS ■  Research cooperations, e.g. Stanford, MIT, and Berkeley Partner of Stanford Center for Design Research Partner of MIT in Supply Chain Innovation and CSAIL Partner at UC Berkeley
 RAD / AMP Lab A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 5. Our Motivation Personalized Medicine 5 ■  Motivation: Can we analyze the entire data of a patient, incl. Electronic Health Records (EHR) and genome data, during a doctor’s visit? ■  Genome data analysis may sum up to weeks, i.e. biopsy, biological preparation, sequencing, alignment, variant calling, full analysis, and evaluation ■  Issue: Complex and time-consuming data processing tasks ■  In-memory technology accelerates genome data processing □  Highly parallel alignment / variant calling □  Real-time analysis of individual patient or cohort data □  Combined search in structured / unstructured data A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 6. Our Challenge Distributed Big Data Sources 6 Human genome/biological data 600GB per full genome 15PB+ in databases of leading institutes Hospital information systems Often more than 50GB Cancer patient records >160k records at NCT Prescription data 1.5B records from 10,000 doctors and 10M Patients (100 GB) Human proteome 160M data points (2.4GB) per sample >3TB raw proteome data in ProteomicsDB PubMed database >23M articles Medical sensor data Scan of a single organ in 1s creates 10GB of raw data Clinical trials Currently more than 30k recruiting on ClinicalTrials.gov A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 7. Our Approach In-Memory Technology ● ● ● ● Read Event Read Event Repositories Repositories Combined column and row store Insert only for time travel + + + ++ A up to 2.000 up to 2.000 requests requests per second per second Minimal projections Any attribute as index Bulk load Discovery Service Discovery Service Multi-core/ parallelization SAP HANA SAP HANA P Active/passive data store Dynamic multithreading within nodes No aggregate tables +++ up to 8.000 read up to 8.000 read event notifications event notifications per second per second On-the-fly extensibility Map reduce P P t A A Lightweight Compression Partitioning SQL Analytics on historical data Single and multi-tenancy Object to relational mapping Group Key SQL interface on columns & rows Reduction of layers x x 7 Verification Verification Services Services T Text Retrieval and Extraction No disk A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 8. Our Vision Personalized Medicine 8 A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 9. Our Vision Personalized Medicine 9 Desirability ■  Leveraging directed customer services ■  Portfolio of integrated services for clinicians, researchers, and patients ■  Include latest research results, e.g. most effective therapies Viability Feasibility ■  Enable personalized medicine also in faroff regions and developing countries to exceed customer base ■  HiSeq 2500 enables highcoverage whole genome sequencing in ≈1d ■  Share data via the Internet to get feedback from word-wide experts (costsaving) ■  IMDB enables allele frequency determination of 12B records within <1s ■  Combine research data (publications, annotations, genome data) from international databases in a single knowledge base ■  Identification of relevant annotations out of 80M <1s ■  Data preparation as a service reduces TCO A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 10. High-Performance In-Memory Genome Project Integration of Genomic Data 10 ■  Preprocessing of DNA (alignment, variant calling) can be modeled and is executed as integrated process ■  Results are stored in in-memory database ■  Real-time analysis of genome data enables completely new way of research and therapies A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 11. High-Performance In-Memory Genome Project Architectural Overview 11 Information and Feedback within the Window of Opportunity Patients Doctors Insurers Researchers Real-Time Capturing and Data Analysis In-Memory Database *omics Electronic Service Line Medical Items Records All Relevant Medical Information ... A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 12. High-Performance In-Memory Genome Project Selected Research Topics 12 Improving Analyses: ■  Medical Knowledge Cockpit, Oncolyzer ■  Genome Browser enables deep dive into the genome ■  Cohort Analysis, e.g. clustering of patient cohorts ■  Combined Search, e.g. in clinical trials and side-effect databases ■  Pathways Topology Analysis, e.g. to identify cause/effect Improving Data Preparations: ■  Graphical modeling of Genome Data Processing (GDP) pipelines ■  Scheduling and execution of multiple GPD pipelines in parallel ■  App store for medical knowledge (bring algorithms to data) ■  Exchange of sensitive data, e.g. history-based access control ■  Billing processes for intellectual property and services A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 13. High-Performance In-Memory Genome Project HANA Oncolyzer 13 ■  Research initiative for exchanging relevant tumor data to improve treatment ■  Awarded with the 2012 Innovation Award of the German Capitol Region ■  In-Memory Technology as key-enabler for real-time analysis of tumor data in seconds instead of hours ■  Information available at your fingertips: In-Memory Technology on mobile devices (iPad) ■  Interdisciplinary cooperation between medical doctors, researchers, and software engineers A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 14. HANA Oncolyzer Patient Search and Details 14 A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 15. HANA Oncolyzer Analysis of Patient Cohorts 15 A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 16. High-Performance In-Memory Genome Project Medical Knowledge Cockpit ■  Search for affected genes in distributed and heterogeneous data sources 16 ■  Immediate exploration of relevant information, such as □  Gene descriptions, □  Molecular impact and related pathways, Unified access to structured and unstructured data sources Automatic clinical trial matching using HANA text analysis features □  Scientific publications, and □  Suitable clinical trials. ■  No manual searching for hours or days – In-memory technology translates it into interactive finding A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 17. High-Performance In-Memory Genome Project Search in Structured and Unstructured Medical Data ■  Extended text analysis feature by medical terminology 17 □  Genes (122,975 + 186,771 synonyms) □  Medical terms and categories (98,886 diseases + 48,561 synonyms, 47 categories) Unified access to structured and unstructured data sources Clinical trial matching using text analysis features □  Pharmaceutical ingredients (7,099 + 5,561 synonyms) ■  Imported clinicaltrials.gov database (145k trials/30,138 recruiting) ■  Extracted, e.g., 320k genes, 161k ingredients, 30k periods ■  Select all studies based on multiple filters in <500ms A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 18. High-Performance In-Memory Genome Project Analysis of Patient Cohorts ■  In a patient cohort, a subset does not respond to therapy – why? 18 ■  Clustering using various statistical algorithms, such as k-means or hierarchical clustering ■  Calculation of all locus combinations in which at least 5% of all TCGA participants have mutations: 200ms for top 20 combinations Fast clustering directly inside HANA ■  Individual clusters are calculated in parallel directly within the database ■  K-means algorithm: 50ms (PAL) vs. 500ms (R) A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 19. High-Performance In-Memory Genome Project Genome Browser ■  Genome Browser: Comparison of multiple genomes with reference 19 ■  Combined knowledge base: latest relevant annotations and literature, e.g. NCBI, dbSNP, and UCSC ■  Detailed exploration of genome locations and existing associations Unified access to multiple formerly disjoint data sources Matching of variants with relevant international annotations ■  Ranked variants, e.g. accordingly to known diseases ■  Links to more detailed sources enable fast identification of relevant information while eliminating longlasting searches. A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 20. High-Performance In-Memory Genome Project Pathway Analysis ■  Search in pathways is limited to “is a certain element contained” today 20 ■  Integrated >1,5k pathways from international sources, e.g. KEGG, HumanCyc, and WikiPathways, into HANA Unified access to multiple formerly disjoint data sources ■  Implemented graph-based topology exploration and ranking based on patient specifics ■  Enables interactive identification of possible dysfunctions affecting the course of a therapy before its start Pathway analysis of genetic variations with Graph Engine A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 21. High-Performance In-Memory Genome Project Drug Response Testing ■  Drug response depends on individual genetic variants of tumors 21 ■  Challenge: Identification of relevant genetic variants and their impact on drug response is a ongoing research activity, e.g. Xenograft models ■  Exploration of experiment results is time-consuming and Excel-driven Interactive analysis of correlations between drugs and genetic variants ■  In-memory technology enables interactive exploration of experiment data to leverage new scientific insights A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 22. What to take home? Test it: http://we.AnalyzeGenomes.com 22 For researchers ■  Enable real-time analysis of genome data ■  Automatic scan of pathways to identify cellular impact of mutations ■  Free-text search in publications, diagnosis, and EMR data (structured and unstructured data) For clinicians ■  Preventive diagnostics to identify risk patients ■  Indicate pharmacokinetic correlations ■  Scan for comparable patient cases For patients ■  Identify relevant clinical trials / experts ■  Start most appropriate therapy early based on all evidences and latest knowledge A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014
  • 23. Thank you for your interest! Keep in contact with us. 23 Dr. Matthieu-P. Schapranow schapranow@hpi.uni-potsdam.de http://we.analyzegenomes.com/ Hasso Plattner Institute Enterprise Platform & Integration Concepts Dr. Matthieu-P. Schapranow August-Bebel-Str. 88 14482 Potsdam, Germany A Platform for Integrated Genome Data Analysis, All About Data Congress, Schapranow, Jan 30, 2014