SlideShare ist ein Scribd-Unternehmen logo
1 von 76
Canceromatic III - Session I: Pan-Cancer analysis
- Changing landscape of data and tools available
for reproducible cancer genomics workflows: report
from the ICGC trenches.
Nov 14th 2016
B.F. Francis Ouellette francis@oicr.on.ca
• Senior Scientists & Associate Director,
Informatics and Biocomputing, Ontario Institute for
Cancer Research, Toronto, ON
• Associate Professor, Department of Cell and Systems Biology,
University of Toronto, Toronto, ON.
2Module #: Title of Module
Module 2 bioinformatics.ca
ONTARIO INSTITUTE FOR CANCER RESEARC
You are free to:
Copy, share, adapt, or re-mix;
Photograph, film, or broadcast;
Blog, live-blog, or post video of;
This presentation. Provided that:
You attribute the work to its author and respect the rights
and licenses associated with its components.
Slide Concept by Cameron Neylon, who has waived all copyright and related or neighbouring rights. This slide only ccZero.
Social Media Icons adapted with permission from originals by Christopher Ross. Original images are available under GPL at;
http://www.thisismyurl.com/free-downloads/15-free-speech-bubble-icons-for-popular-websites
ONTARIO INSTITUTE FOR CANCER RESEARC
@bffo
francis@oicr.on.caE-mail
ONTARIO INSTITUTE FOR CANCER RESEARC
6
Cancer-om-atics Jul 6-9 2009
Cancer-om-atics II Mar 28-30 2011
Canceromatics III Nov 13 -16 2016
ONTARIO INSTITUTE FOR CANCER RESEARC
Disclaimers
I do not (and will not) profit in any way, shape or form, from
any of the brands, products or companies I may mention.
I am a big proponent of Open Access, Open Source, Opent
Data and Open Courseware
I am on the SAB of many NIH funded projects (SGD, Galaxy,
GenomeSpace, H3ABionet, and HMP2), as well as Elixir and
Genome Canada’s SIAC, and the NRC’s KMAC.
This comes with a bias on how science should be done!
ONTARIO INSTITUTE FOR CANCER RESEARC
Outline
8
Introduction
ICGC
PCAWG
Closing remarks
ONTARIO INSTITUTE FOR CANCER RESEARC
9
adapted from https://goo.gl/fQJAz1
ICGC PCAWG
Docker
Testing
ONTARIO INSTITUTE FOR CANCER RESEARC
Cancer is a Disease
of the Genome
Challenge in Treating Cancer:
 Every tumour is different
 Every cancer patient is different
Adapted from Tom Hudsonhttps://www.cancer.gov/research/areas/genomics
ONTARIO INSTITUTE FOR CANCER RESEARC
 Johns Hopkins
> 18,000 genes analyzed for mutations
11 breast and 11 colon tumors
L.D. Wood et al, Science, Oct. 2007
 Wellcome Trust Sanger Institute
518 genes analyzed for mutations
210 tumors of various types
C. Greenman et al, Nature, Mar. 2007
 TCGA (NIH)
Multiple technologies
brain (glioblastoma multiforme), lung (squamous carcinoma),
and ovarian (serous cystadenocarcinoma).
F.S. Collins & A.D. Barker, Sci. Am, Mar. 2007
Large-Scale Studies of Cancer Genomes
ONTARIO INSTITUTE FOR CANCER RESEARC
 Heterogeneity within and across tumor types
 High rate of abnormalities (driver vs passenger)
 Sample quality matters
 Consent and controlled data access is complicated
Lessons learned from early studies
MR Stratton et al. Nature 458, 719-724 (2009) doi:10.1038/nature07943
ONTARIO INSTITUTE FOR CANCER RESEARC
Analysis Data Types
Simple Somatic Mutations (SSM or SNV)
Copy Number Alterations (CAN or CNV)
Structural Variants (SV)
Germline variants (SNPs)
Gene Expression (micro-arrays and RNASeq)
miRNA Expression (RNASeq)
Epigenomics (Arrays and Methylation)
Splicing Variation (RNASeq)
Protein Expression (Arrays)
ONTARIO INSTITUTE FOR CANCER RESEARC
Rationale for the ICGC:
Scope is huge
Reduce duplication of effort
Standardization and uniform quality
measures
Merging of datasets
Spectrum of many cancers varies
across the world
Accelerate the dissemination of
genomic and analytical methods
ONTARIO INSTITUTE FOR CANCER RESEARC
International Cancer Genome Consortium
Collect ~500 tumour/normal pairs from each of 50 different
major cancer types; 25,000 T/N pairs!
Comprehensive genome analysis of each T/N pair:
Genome
Transcriptome
Methylome
Clinical data
Make the data available to the research community & public.
Identify
genome
changes
…GATTATTCCAGGTAT… …GATTATTGCAGGTAT… …GATTATTGCAGGTAT…
Adapted from Tom Hudson
ONTARIO INSTITUTE FOR CANCER RESEARC
16
ONTARIO INSTITUTE FOR CANCER RESEARC
International Cancer Genome Consortium: http:/icgc.org
ONTARIO INSTITUTE FOR CANCER RESEARC
Data
Submission
Validation
ValidationValidation
(dictionary)
Validation
(across
fields)
Validation
(across
fields)
Validation
(across
fields)
indexing
Happy
Users
http://goo.gl/1EcyR
ONTARIO INSTITUTE FOR CANCER RESEARC
ICGC needs to deal with different
kinds of users!
19
Biologists/Clinicians:
Web interface to processed data, providing:
Affected gene lists with consequences
Impact on pathways
Power users:
Application Programing Interface (API) to get to data
Availability and Integration with cloud resources
ONTARIO INSTITUTE FOR CANCER RESEARC
ICGC Data Coordinating Centre:
dcc.icgc.org
20
ONTARIO INSTITUTE FOR CANCER RESEARC
BRAF missense mutations in colorectal cancer
21
ONTARIO INSTITUTE FOR CANCER RESEARC
https://dcc.icgc.org/
22
ONTARIO INSTITUTE FOR CANCER RESEARC
23
https://dcc.icgc.org/icgc-in-the-cloud
ONTARIO INSTITUTE FOR CANCER RESEARC
24
http://www.cancercollaboratory.org/
ONTARIO INSTITUTE FOR CANCER RESEARC
http://docs.icgc.org/
User and submitter documentation
ONTARIO INSTITUTE FOR CANCER RESEARC
Software development discussions
26
https://discuss.icgc.org/
ONTARIO INSTITUTE FOR CANCER RESEARC
Some challenges:
27
So, we have lots of data, is
it generated the same way?
ONTARIO INSTITUTE FOR CANCER RESEARC
Every country/group has basically been
submitting:
28
Simple Somatic Mutations (SSM or SNV)
Copy Number Alterations (CAN or CNV)
Structural Variants (SV)
Germline variants (SNPs)
Gene Expression (micro-arrays and RNASeq)
miRNA Expression (RNASeq)
Epigenomics (Arrays and Methylation)
Splicing Variation (RNASeq)
Protein Expression (Arrays)
ONTARIO INSTITUTE FOR CANCER RESEARC
Are they all using the same pipelines?
29
No
ONTARIO INSTITUTE FOR CANCER RESEARC
http://goo.gl/CekF6y
Missing Clinical Data?
ONTARIO INSTITUTE FOR CANCER RESEARC
31
http://goo.gl/CekF6y
ONTARIO INSTITUTE FOR CANCER RESEARC
Are we all using the same definition for
controlled access data?
32
No
ONTARIO INSTITUTE FOR CANCER RESEARC
ICGC
BAM/FASTQ
TCGA
BAM/FASTQ
ICGC
Open
Data
(includes
TCGA
Open Data)
ONTARIO INSTITUTE FOR CANCER RESEARC
• Detailed Phenotype and Outcome data
Region of residence
Risk factors
Examination
Surgery
Radiation
Sample
Slide
Specific histological features
Analyte
Aliquot
Donor notes
• Gene Expression (probe-level data)
• Raw genotype calls
• Gene-sample identifier links
• Genome sequence files
ICGC Controlled
Access Datasets
• Cancer Pathology
Histologic type or subtype
Histologic nuclear grade
• Patient/Person
Gender, Age range,
Vital status, Survival time
Relapse type, Status at follow-up
• Gene Expression (normalized)
• DNA methylation
•Computed Copy Number and
Loss of Heterozygosity
• Newly discovered somatic variants
ICGC OA
Datasets
http://goo.gl/w4mrV
ONTARIO INSTITUTE FOR CANCER RESEARC
ICGC
TCGA
ONTARIO INSTITUTE FOR CANCER RESEARC
ICG
C
TCGA
Differences between ICGC & TCGA
• Different tumour types
• Different geographic rules
• Many countries vs one jurisdiction
• Different definitions of what is controlled
• Different data access rules
ONTARIO INSTITUTE FOR CANCER RESEARC
• Detailed Phenotype and Outcome data
• Gene Expression (probe-level data)
• Raw genotype calls
• Gene-sample identifier links
• Genome sequence files
• Germ line variants
ICGC Controlled
Access Datasets
• Cancer Pathology
Histologic type or subtype
Histologic nuclear grade
• Patient/Person
Gender, Age range,
Vital status, Survival time
Relapse type, Status at follow-up
• Gene Expression (normalized)
• DNA methylation
•Computed Copy Number and
Loss of Heterozygosity
• Somatic variants from Exome or WGS
ICGC Open
Access Datasets
http://goo.gl/w4mrV
ONTARIO INSTITUTE FOR CANCER RESEARC
• Primary sequence data
(BAM and FASTQ files)
• SNP6 array level 1 and level 2 data
• Exon array level 1 and level 2 data
• Somatic variants from whole
genome sequencing
• Certain information in MAFs
• A full list of controlled-access
data types can be found at:
http://goo.gl/K1h7zu
TCGA Controlled
Access Datasets
• De-identified clinical and
demographic data
• Gene expression data
• Copy number alterations in regions
of the genome
• Epigenetic data
• Summaries of data compiled across
individuals
• Anonymized single amplicon DNA
sequence data
• Somatic variants from scrubbed
exome sequencing
TCGA Open
Access Datasets
http://goo.gl/A1rMRB
ONTARIO INSTITUTE FOR CANCER RESEARC
Can we do better?
39
ONTARIO INSTITUTE FOR CANCER RESEARC
From ICGC/TCGA
40
Each groups have been free to decide on their own if
they wanted to sequence Exomes or Whole Genomes.
A bit more than 10% of all genomes done were done
with Whole Genome Sequencing
A steering comitte was formed and we decided to
alnalyze these WG in a robust way with the primary
question of figuring out what was hidden in the genomic
sequence of cancer patients!
ONTARIO INSTITUTE FOR CANCER RESEARC
41
ONTARIO INSTITUTE FOR CANCER RESEARC
Steering Committee of PCAWG
42
Peter Campbell, Sanger Inst.
Gady Getz, Broad
Jan Korbel, EMBL
Lincoln Stein, OICR
Josh Stuart, UCSC
ONTARIO INSTITUTE FOR CANCER RESEARC
PanCancer Analysis of Whole Genomes
(PCAWG)
> 2,800 T/N pairs with clinical data from 20
tumour type of whole genome analysis.
Aligned with one standard pipeline.
Genomic Variants determined with 3
pipelines
17 working groups
Start writing papers now
ONTARIO INSTITUTE FOR CANCER RESEARC
Deliverable for PCAWG will include:
44
1st PANCANCER analysis on > 2,800 cancer tumours
from a WGS perspective
RNA, SSM, CNV, Methylation analysis & germline
Published (executable) pipelines
Docker / Dockstore
Mutiple cloud access to data
Multiple portal access to data
ONTARIO INSTITUTE FOR CANCER RESEARC
https://dcc.icgc.org/pcawg
45
ONTARIO INSTITUTE FOR CANCER RESEARC
Working Groups (1/2)
46
1 Novel somatic mutation calling methods
2 Analysis of mutations in regulatory regions
3 Integration of transcriptome and genome
4 Integration of epigenome and genome
5 Consequences of somatic mutations on pathway
and network activity
6 Patterns of structural variations, signatures, genomic
correlations, retrotransposons, mobile elements
7 Mutation signatures and processes
8 Germline cancer genome
ONTARIO INSTITUTE FOR CANCER RESEARC
Working Groups (1/2)
47
9 Inferring driver mutations and identifying cancer genes
and pathways
10 Translating cancer genomes to the clinic
11 Evolution and heterogeneity
12 Exploratory: portals, visualization and software
infrastructure
13 Molecular subtypes and classification
14 Analysis of mutations in non-coding RNA
15 Exploratory: mitochondrial
16 Exploratory: pathogens
Tech Technical working group
ONTARIO INSTITUTE FOR CANCER RESEARC
http://dockstore.org
48
ONTARIO INSTITUTE FOR CANCER RESEARC
49
PCAWG pipelines now on Dockstore
ONTARIO INSTITUTE FOR CANCER RESEARC
DOCKSTORE testing group
50
Andrew Duncan, OICR
Christina Yung, OICR
Denis Yuen, OICR
Zhibin Lu, OICR
Brian O’Connor, UCSC
Alex Buchanan, OHSU
Kyle Ellrott, OHSU
Francis Ouellette, OICR
Gordon Saksena, Broad
Junjun Zhang, OICR
Miguel Vazquez, CNIO
Oliver Hofmann, Australia
Solomon Shorser, OICR
Adam Strucka, OHSU
ONTARIO INSTITUTE FOR CANCER RESEARC
Challenges:
51
Too many conference calls!
Too many clouds
Even though we learned from what not to do with ICGC,
we had to learn what not to do in the clouds.
TCGA and ICGC have different authorization protocols
Not all data can exist everywhere
Dockstore testing is taking too long!
ONTARIO INSTITUTE FOR CANCER RESEARC
Other projects in planning
ICGC to finish in Spring of 2018
Planning for ICGCmed
ICGC 1: 25,000 tumours (DNA, RNA, Epigenome,
Clinical data)
ICGCmed: 200,000 Tumours (DNA, RNA,
Epigenome, Clinical trial)
ICGC1 was the picture, ICGCmed will be the movie
(before and after treatment).
Submission system with one place for data and
metadata
Tools/links directory portal
ONTARIO INSTITUTE FOR CANCER RESEARC
53
29,647
ONTARIO INSTITUTE FOR CANCER RESEARC
54
29,647
ONTARIO INSTITUTE FOR CANCER RESEARC
55
2,834
ONTARIO INSTITUTE FOR CANCER RESEARC
56
2,834
ONTARIO INSTITUTE FOR CANCER RESEARC
57
1477
ONTARIO INSTITUTE FOR CANCER RESEARC
58
1477
ONTARIO INSTITUTE FOR CANCER RESEARC
59
915
ONTARIO INSTITUTE FOR CANCER RESEARC
60
915
ONTARIO INSTITUTE FOR CANCER RESEARC
61
20
ONTARIO INSTITUTE FOR CANCER RESEARC
62
20
ONTARIO INSTITUTE FOR CANCER RESEARC
17
ONTARIO INSTITUTE FOR CANCER RESEARC
http://bioinformatics.ca/
17
ONTARIO INSTITUTE FOR CANCER RESEARC
65
12
ONTARIO INSTITUTE FOR CANCER RESEARC
66
0-Toronto1-Bethesda2-Hinxton
4-Queensland 3-Madrid5-Kyoto
7-Hidelberg 6-Cannes8-Toronto
9-Beijing
10-Mumbai11- Boston
12
ONTARIO INSTITUTE FOR CANCER RESEARC
67
10
ONTARIO INSTITUTE FOR CANCER RESEARC
Informatics & BioComputing @ OICR
68
10
ONTARIO INSTITUTE FOR CANCER RESEARC
9
ONTARIO INSTITUTE FOR CANCER RESEARC
9
ONTARIO INSTITUTE FOR CANCER RESEARC
71
1
ONTARIO INSTITUTE FOR CANCER RESEARC
Bioinformatics.ca workshops Content
72
http://bioinformatics-ca.github.io/
https://goo.gl/CGu13q
1
ONTARIO INSTITUTE FOR CANCER RESEARC
DCC Software
Developer
Vincent Ferretti
Dusan Andric
Phuong-My Do
Francois Gerthoffert
Terry Lin
Michael Moncada
Vitalii Slobodianyk
Bob Tiernay
Douglas Wong
Linda Xiang
Junjun Zhang
Acknowledgments
ICGC/OICR
Project leaders:
Tom Hudson
John McPherson
Lincoln Stein
Jared Simpson
Paul Boutros
Vincent Ferretti
Francis Ouellette
Jennifer Jennings
Christine Yung
Ouellette Lab
Alysha Moncrieffe
Ann Meyer
Zhibin Lu
Web Dev
Joseph Yamada
Kaman Wu
Kim Cullion
Koji Miyauchi
Miyuki Fukuma
ICGC DCC Biocuration
Hardeep Nahal
Marc Perry
http://oicr.on.ca http://icgc.org
… and all the patients and their
families that that are putting
their hopes into our work!
Research
IT/Systems
David Sutton,
Bob Gibson
David Magda
Rob Naccarato
Brian Ott
Gino Yearwood
EGA
Jordi Rambla De
Argila
Arcadi Navarro
Audald Iloret
Mauricio Moldes
ONTARIO INSTITUTE FOR CANCER RESEARC
http://icgc.org
http://dcc.icgc.org
http://docs.icgc.org
info@icgc.org
http://bioinformatics.ca
ONTARIO INSTITUTE FOR CANCER RESEARC
We are hiring:
• OICR Director
• Genome Technology Director
• Junior Faculty in Informatics
& Biocomputing
• PDFs
Interested? Ask Paul Boutros or I
ONTARIO INSTITUTE FOR CANCER RESEARC
76
Muchas gracias!

Weitere ähnliche Inhalte

Was ist angesagt?

2015 bioinformatics personal_genomics_wim_vancriekinge
2015 bioinformatics personal_genomics_wim_vancriekinge2015 bioinformatics personal_genomics_wim_vancriekinge
2015 bioinformatics personal_genomics_wim_vancriekingeProf. Wim Van Criekinge
 
Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14mhaendel
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...David Peyruc
 
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...VHIR Vall d’Hebron Institut de Recerca
 
cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)Pistoia Alliance
 
Trans disciplinary research is a must for excellence in science by Prof. Moha...
Trans disciplinary research is a must for excellence in science by Prof. Moha...Trans disciplinary research is a must for excellence in science by Prof. Moha...
Trans disciplinary research is a must for excellence in science by Prof. Moha...Prof. Mohamed Labib Salem
 
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...Joel Saltz
 
Dubina Michael biomedical technologies at Skolkovo
Dubina Michael biomedical technologies at SkolkovoDubina Michael biomedical technologies at Skolkovo
Dubina Michael biomedical technologies at Skolkovoigorod
 
Common languages in genomic epidemiology: from ontologies to algorithms
Common languages in genomic epidemiology: from ontologies to algorithmsCommon languages in genomic epidemiology: from ontologies to algorithms
Common languages in genomic epidemiology: from ontologies to algorithmsJoão André Carriço
 
Pathomics Based Biomarkers and Precision Medicine
Pathomics Based Biomarkers and Precision MedicinePathomics Based Biomarkers and Precision Medicine
Pathomics Based Biomarkers and Precision MedicineJoel Saltz
 
Software Pipelines: The Good, The Bad and The Ugly
Software Pipelines: The Good, The Bad and The UglySoftware Pipelines: The Good, The Bad and The Ugly
Software Pipelines: The Good, The Bad and The UglyJoão André Carriço
 
Learning, Training,  Classification,  Common Sense and Exascale Computing
Learning, Training,  Classification,  Common Sense and Exascale ComputingLearning, Training,  Classification,  Common Sense and Exascale Computing
Learning, Training,  Classification,  Common Sense and Exascale ComputingJoel Saltz
 
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...VHIR Vall d’Hebron Institut de Recerca
 
Personal Genomes: what can I do with my data?
Personal Genomes: what can I do with my data?Personal Genomes: what can I do with my data?
Personal Genomes: what can I do with my data?Melanie Swan
 

Was ist angesagt? (20)

2015 bioinformatics personal_genomics_wim_vancriekinge
2015 bioinformatics personal_genomics_wim_vancriekinge2015 bioinformatics personal_genomics_wim_vancriekinge
2015 bioinformatics personal_genomics_wim_vancriekinge
 
Ouellette icgc toronto_oct2012_fged_ver02
Ouellette icgc toronto_oct2012_fged_ver02Ouellette icgc toronto_oct2012_fged_ver02
Ouellette icgc toronto_oct2012_fged_ver02
 
Enriching Scholarship Personal Genomics presentation
Enriching Scholarship Personal Genomics presentationEnriching Scholarship Personal Genomics presentation
Enriching Scholarship Personal Genomics presentation
 
Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
 
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
Introduction to Metagenomics. Applications, Approaches and Tools (Bioinformat...
 
UNMSymposium2014
UNMSymposium2014UNMSymposium2014
UNMSymposium2014
 
cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)
 
Final cecr presenation
Final cecr presenationFinal cecr presenation
Final cecr presenation
 
Trans disciplinary research is a must for excellence in science by Prof. Moha...
Trans disciplinary research is a must for excellence in science by Prof. Moha...Trans disciplinary research is a must for excellence in science by Prof. Moha...
Trans disciplinary research is a must for excellence in science by Prof. Moha...
 
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
 
Dubina Michael biomedical technologies at Skolkovo
Dubina Michael biomedical technologies at SkolkovoDubina Michael biomedical technologies at Skolkovo
Dubina Michael biomedical technologies at Skolkovo
 
Common languages in genomic epidemiology: from ontologies to algorithms
Common languages in genomic epidemiology: from ontologies to algorithmsCommon languages in genomic epidemiology: from ontologies to algorithms
Common languages in genomic epidemiology: from ontologies to algorithms
 
Brief introduction to Bioinformatics
Brief introduction to BioinformaticsBrief introduction to Bioinformatics
Brief introduction to Bioinformatics
 
Pathomics Based Biomarkers and Precision Medicine
Pathomics Based Biomarkers and Precision MedicinePathomics Based Biomarkers and Precision Medicine
Pathomics Based Biomarkers and Precision Medicine
 
Prof. Mohamed Labib Salem's students
Prof. Mohamed Labib Salem's studentsProf. Mohamed Labib Salem's students
Prof. Mohamed Labib Salem's students
 
Software Pipelines: The Good, The Bad and The Ugly
Software Pipelines: The Good, The Bad and The UglySoftware Pipelines: The Good, The Bad and The Ugly
Software Pipelines: The Good, The Bad and The Ugly
 
Learning, Training,  Classification,  Common Sense and Exascale Computing
Learning, Training,  Classification,  Common Sense and Exascale ComputingLearning, Training,  Classification,  Common Sense and Exascale Computing
Learning, Training,  Classification,  Common Sense and Exascale Computing
 
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
 
Personal Genomes: what can I do with my data?
Personal Genomes: what can I do with my data?Personal Genomes: what can I do with my data?
Personal Genomes: what can I do with my data?
 

Ähnlich wie Madrid icgc pcawg_2016_slideshare

Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017Warren Kibbe
 
Web cast cancer gene panels march 11 2015
Web cast cancer gene panels march 11 2015Web cast cancer gene panels march 11 2015
Web cast cancer gene panels march 11 2015Andreas Scherer
 
Cancer Gene Panels
Cancer Gene PanelsCancer Gene Panels
Cancer Gene PanelsGolden Helix
 
Integrative Everything, Deep Learning and Streaming Data
Integrative Everything, Deep Learning and Streaming DataIntegrative Everything, Deep Learning and Streaming Data
Integrative Everything, Deep Learning and Streaming DataJoel Saltz
 
Pistoia Alliance US Conference 2015 - 1.5.4 New data - Nikolaus Schultz
Pistoia Alliance US Conference 2015 - 1.5.4 New data - Nikolaus SchultzPistoia Alliance US Conference 2015 - 1.5.4 New data - Nikolaus Schultz
Pistoia Alliance US Conference 2015 - 1.5.4 New data - Nikolaus SchultzPistoia Alliance
 
Rozen 2016-10-05-ieee-cibcb-big-genome-data-to-share
Rozen 2016-10-05-ieee-cibcb-big-genome-data-to-shareRozen 2016-10-05-ieee-cibcb-big-genome-data-to-share
Rozen 2016-10-05-ieee-cibcb-big-genome-data-to-shareSteve Rozen
 
Cancer moonshot and data sharing
Cancer moonshot and data sharingCancer moonshot and data sharing
Cancer moonshot and data sharingWarren Kibbe
 
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...Jerry Lee
 
Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016Warren Kibbe
 
Morphologomics - Challenges for Surgical Pathology in the Genomic Age by Dr. ...
Morphologomics - Challenges for Surgical Pathology in the Genomic Age by Dr. ...Morphologomics - Challenges for Surgical Pathology in the Genomic Age by Dr. ...
Morphologomics - Challenges for Surgical Pathology in the Genomic Age by Dr. ...Cirdan
 
Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...
Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...
Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...Warren Kibbe
 
Paneles genómicos en tumores sólidos
Paneles genómicos en tumores sólidosPaneles genómicos en tumores sólidos
Paneles genómicos en tumores sólidosMauricio Lema
 
Introduction to cancer bioinformatics
Introduction to cancer bioinformaticsIntroduction to cancer bioinformatics
Introduction to cancer bioinformaticscreativebiolabs11
 
Oncoset symposium kibbe 201704
Oncoset symposium kibbe 201704Oncoset symposium kibbe 201704
Oncoset symposium kibbe 201704Warren Kibbe
 
Big Data Training for Cancer Research, Purdue, May 2023
Big Data Training for Cancer Research, Purdue, May 2023Big Data Training for Cancer Research, Purdue, May 2023
Big Data Training for Cancer Research, Purdue, May 2023Warren Kibbe
 
ICBO 2014, October 8, 2014
ICBO 2014, October 8, 2014ICBO 2014, October 8, 2014
ICBO 2014, October 8, 2014Warren Kibbe
 
Bioinformatics tools for NGS data analysis
Bioinformatics tools for NGS data analysisBioinformatics tools for NGS data analysis
Bioinformatics tools for NGS data analysisDespoina Kalfakakou
 
Genomics 2015 Keynote - Utilizing cancer sequencing in the clinic
Genomics 2015 Keynote - Utilizing cancer sequencing in the clinicGenomics 2015 Keynote - Utilizing cancer sequencing in the clinic
Genomics 2015 Keynote - Utilizing cancer sequencing in the clinicAndreas Scherer
 

Ähnlich wie Madrid icgc pcawg_2016_slideshare (20)

Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017Genomics and Computation in Precision Medicine March 2017
Genomics and Computation in Precision Medicine March 2017
 
Web cast cancer gene panels march 11 2015
Web cast cancer gene panels march 11 2015Web cast cancer gene panels march 11 2015
Web cast cancer gene panels march 11 2015
 
Cancer Gene Panels
Cancer Gene PanelsCancer Gene Panels
Cancer Gene Panels
 
Integrative Everything, Deep Learning and Streaming Data
Integrative Everything, Deep Learning and Streaming DataIntegrative Everything, Deep Learning and Streaming Data
Integrative Everything, Deep Learning and Streaming Data
 
Pistoia Alliance US Conference 2015 - 1.5.4 New data - Nikolaus Schultz
Pistoia Alliance US Conference 2015 - 1.5.4 New data - Nikolaus SchultzPistoia Alliance US Conference 2015 - 1.5.4 New data - Nikolaus Schultz
Pistoia Alliance US Conference 2015 - 1.5.4 New data - Nikolaus Schultz
 
Rozen 2016-10-05-ieee-cibcb-big-genome-data-to-share
Rozen 2016-10-05-ieee-cibcb-big-genome-data-to-shareRozen 2016-10-05-ieee-cibcb-big-genome-data-to-share
Rozen 2016-10-05-ieee-cibcb-big-genome-data-to-share
 
Cancer moonshot and data sharing
Cancer moonshot and data sharingCancer moonshot and data sharing
Cancer moonshot and data sharing
 
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
 
Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016
 
Open data genomics_palermo_2017_ver03
Open data genomics_palermo_2017_ver03Open data genomics_palermo_2017_ver03
Open data genomics_palermo_2017_ver03
 
Morphologomics - Challenges for Surgical Pathology in the Genomic Age by Dr. ...
Morphologomics - Challenges for Surgical Pathology in the Genomic Age by Dr. ...Morphologomics - Challenges for Surgical Pathology in the Genomic Age by Dr. ...
Morphologomics - Challenges for Surgical Pathology in the Genomic Age by Dr. ...
 
Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...
Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...
Precision Oncology - using Genomics, Proteomics and Imaging to inform biology...
 
Paneles genómicos en tumores sólidos
Paneles genómicos en tumores sólidosPaneles genómicos en tumores sólidos
Paneles genómicos en tumores sólidos
 
Introduction to cancer bioinformatics
Introduction to cancer bioinformaticsIntroduction to cancer bioinformatics
Introduction to cancer bioinformatics
 
Oncoset symposium kibbe 201704
Oncoset symposium kibbe 201704Oncoset symposium kibbe 201704
Oncoset symposium kibbe 201704
 
Big Data Training for Cancer Research, Purdue, May 2023
Big Data Training for Cancer Research, Purdue, May 2023Big Data Training for Cancer Research, Purdue, May 2023
Big Data Training for Cancer Research, Purdue, May 2023
 
ICBO 2014, October 8, 2014
ICBO 2014, October 8, 2014ICBO 2014, October 8, 2014
ICBO 2014, October 8, 2014
 
Bioinformatics tools for NGS data analysis
Bioinformatics tools for NGS data analysisBioinformatics tools for NGS data analysis
Bioinformatics tools for NGS data analysis
 
Van criekinge 2017_11_13_rodebiotech
Van criekinge 2017_11_13_rodebiotechVan criekinge 2017_11_13_rodebiotech
Van criekinge 2017_11_13_rodebiotech
 
Genomics 2015 Keynote - Utilizing cancer sequencing in the clinic
Genomics 2015 Keynote - Utilizing cancer sequencing in the clinicGenomics 2015 Keynote - Utilizing cancer sequencing in the clinic
Genomics 2015 Keynote - Utilizing cancer sequencing in the clinic
 

Kürzlich hochgeladen

The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIShubhangi Sonawane
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxNikitaBankoti2
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesShubhangi Sonawane
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docxPoojaSen20
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701bronxfugly43
 

Kürzlich hochgeladen (20)

The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 

Madrid icgc pcawg_2016_slideshare

  • 1. Canceromatic III - Session I: Pan-Cancer analysis - Changing landscape of data and tools available for reproducible cancer genomics workflows: report from the ICGC trenches. Nov 14th 2016 B.F. Francis Ouellette francis@oicr.on.ca • Senior Scientists & Associate Director, Informatics and Biocomputing, Ontario Institute for Cancer Research, Toronto, ON • Associate Professor, Department of Cell and Systems Biology, University of Toronto, Toronto, ON.
  • 2. 2Module #: Title of Module
  • 4. ONTARIO INSTITUTE FOR CANCER RESEARC You are free to: Copy, share, adapt, or re-mix; Photograph, film, or broadcast; Blog, live-blog, or post video of; This presentation. Provided that: You attribute the work to its author and respect the rights and licenses associated with its components. Slide Concept by Cameron Neylon, who has waived all copyright and related or neighbouring rights. This slide only ccZero. Social Media Icons adapted with permission from originals by Christopher Ross. Original images are available under GPL at; http://www.thisismyurl.com/free-downloads/15-free-speech-bubble-icons-for-popular-websites
  • 5. ONTARIO INSTITUTE FOR CANCER RESEARC @bffo francis@oicr.on.caE-mail
  • 6. ONTARIO INSTITUTE FOR CANCER RESEARC 6 Cancer-om-atics Jul 6-9 2009 Cancer-om-atics II Mar 28-30 2011 Canceromatics III Nov 13 -16 2016
  • 7. ONTARIO INSTITUTE FOR CANCER RESEARC Disclaimers I do not (and will not) profit in any way, shape or form, from any of the brands, products or companies I may mention. I am a big proponent of Open Access, Open Source, Opent Data and Open Courseware I am on the SAB of many NIH funded projects (SGD, Galaxy, GenomeSpace, H3ABionet, and HMP2), as well as Elixir and Genome Canada’s SIAC, and the NRC’s KMAC. This comes with a bias on how science should be done!
  • 8. ONTARIO INSTITUTE FOR CANCER RESEARC Outline 8 Introduction ICGC PCAWG Closing remarks
  • 9. ONTARIO INSTITUTE FOR CANCER RESEARC 9 adapted from https://goo.gl/fQJAz1 ICGC PCAWG Docker Testing
  • 10. ONTARIO INSTITUTE FOR CANCER RESEARC Cancer is a Disease of the Genome Challenge in Treating Cancer:  Every tumour is different  Every cancer patient is different Adapted from Tom Hudsonhttps://www.cancer.gov/research/areas/genomics
  • 11. ONTARIO INSTITUTE FOR CANCER RESEARC  Johns Hopkins > 18,000 genes analyzed for mutations 11 breast and 11 colon tumors L.D. Wood et al, Science, Oct. 2007  Wellcome Trust Sanger Institute 518 genes analyzed for mutations 210 tumors of various types C. Greenman et al, Nature, Mar. 2007  TCGA (NIH) Multiple technologies brain (glioblastoma multiforme), lung (squamous carcinoma), and ovarian (serous cystadenocarcinoma). F.S. Collins & A.D. Barker, Sci. Am, Mar. 2007 Large-Scale Studies of Cancer Genomes
  • 12. ONTARIO INSTITUTE FOR CANCER RESEARC  Heterogeneity within and across tumor types  High rate of abnormalities (driver vs passenger)  Sample quality matters  Consent and controlled data access is complicated Lessons learned from early studies MR Stratton et al. Nature 458, 719-724 (2009) doi:10.1038/nature07943
  • 13. ONTARIO INSTITUTE FOR CANCER RESEARC Analysis Data Types Simple Somatic Mutations (SSM or SNV) Copy Number Alterations (CAN or CNV) Structural Variants (SV) Germline variants (SNPs) Gene Expression (micro-arrays and RNASeq) miRNA Expression (RNASeq) Epigenomics (Arrays and Methylation) Splicing Variation (RNASeq) Protein Expression (Arrays)
  • 14. ONTARIO INSTITUTE FOR CANCER RESEARC Rationale for the ICGC: Scope is huge Reduce duplication of effort Standardization and uniform quality measures Merging of datasets Spectrum of many cancers varies across the world Accelerate the dissemination of genomic and analytical methods
  • 15. ONTARIO INSTITUTE FOR CANCER RESEARC International Cancer Genome Consortium Collect ~500 tumour/normal pairs from each of 50 different major cancer types; 25,000 T/N pairs! Comprehensive genome analysis of each T/N pair: Genome Transcriptome Methylome Clinical data Make the data available to the research community & public. Identify genome changes …GATTATTCCAGGTAT… …GATTATTGCAGGTAT… …GATTATTGCAGGTAT… Adapted from Tom Hudson
  • 16. ONTARIO INSTITUTE FOR CANCER RESEARC 16
  • 17. ONTARIO INSTITUTE FOR CANCER RESEARC International Cancer Genome Consortium: http:/icgc.org
  • 18. ONTARIO INSTITUTE FOR CANCER RESEARC Data Submission Validation ValidationValidation (dictionary) Validation (across fields) Validation (across fields) Validation (across fields) indexing Happy Users http://goo.gl/1EcyR
  • 19. ONTARIO INSTITUTE FOR CANCER RESEARC ICGC needs to deal with different kinds of users! 19 Biologists/Clinicians: Web interface to processed data, providing: Affected gene lists with consequences Impact on pathways Power users: Application Programing Interface (API) to get to data Availability and Integration with cloud resources
  • 20. ONTARIO INSTITUTE FOR CANCER RESEARC ICGC Data Coordinating Centre: dcc.icgc.org 20
  • 21. ONTARIO INSTITUTE FOR CANCER RESEARC BRAF missense mutations in colorectal cancer 21
  • 22. ONTARIO INSTITUTE FOR CANCER RESEARC https://dcc.icgc.org/ 22
  • 23. ONTARIO INSTITUTE FOR CANCER RESEARC 23 https://dcc.icgc.org/icgc-in-the-cloud
  • 24. ONTARIO INSTITUTE FOR CANCER RESEARC 24 http://www.cancercollaboratory.org/
  • 25. ONTARIO INSTITUTE FOR CANCER RESEARC http://docs.icgc.org/ User and submitter documentation
  • 26. ONTARIO INSTITUTE FOR CANCER RESEARC Software development discussions 26 https://discuss.icgc.org/
  • 27. ONTARIO INSTITUTE FOR CANCER RESEARC Some challenges: 27 So, we have lots of data, is it generated the same way?
  • 28. ONTARIO INSTITUTE FOR CANCER RESEARC Every country/group has basically been submitting: 28 Simple Somatic Mutations (SSM or SNV) Copy Number Alterations (CAN or CNV) Structural Variants (SV) Germline variants (SNPs) Gene Expression (micro-arrays and RNASeq) miRNA Expression (RNASeq) Epigenomics (Arrays and Methylation) Splicing Variation (RNASeq) Protein Expression (Arrays)
  • 29. ONTARIO INSTITUTE FOR CANCER RESEARC Are they all using the same pipelines? 29 No
  • 30. ONTARIO INSTITUTE FOR CANCER RESEARC http://goo.gl/CekF6y Missing Clinical Data?
  • 31. ONTARIO INSTITUTE FOR CANCER RESEARC 31 http://goo.gl/CekF6y
  • 32. ONTARIO INSTITUTE FOR CANCER RESEARC Are we all using the same definition for controlled access data? 32 No
  • 33. ONTARIO INSTITUTE FOR CANCER RESEARC ICGC BAM/FASTQ TCGA BAM/FASTQ ICGC Open Data (includes TCGA Open Data)
  • 34. ONTARIO INSTITUTE FOR CANCER RESEARC • Detailed Phenotype and Outcome data Region of residence Risk factors Examination Surgery Radiation Sample Slide Specific histological features Analyte Aliquot Donor notes • Gene Expression (probe-level data) • Raw genotype calls • Gene-sample identifier links • Genome sequence files ICGC Controlled Access Datasets • Cancer Pathology Histologic type or subtype Histologic nuclear grade • Patient/Person Gender, Age range, Vital status, Survival time Relapse type, Status at follow-up • Gene Expression (normalized) • DNA methylation •Computed Copy Number and Loss of Heterozygosity • Newly discovered somatic variants ICGC OA Datasets http://goo.gl/w4mrV
  • 35. ONTARIO INSTITUTE FOR CANCER RESEARC ICGC TCGA
  • 36. ONTARIO INSTITUTE FOR CANCER RESEARC ICG C TCGA Differences between ICGC & TCGA • Different tumour types • Different geographic rules • Many countries vs one jurisdiction • Different definitions of what is controlled • Different data access rules
  • 37. ONTARIO INSTITUTE FOR CANCER RESEARC • Detailed Phenotype and Outcome data • Gene Expression (probe-level data) • Raw genotype calls • Gene-sample identifier links • Genome sequence files • Germ line variants ICGC Controlled Access Datasets • Cancer Pathology Histologic type or subtype Histologic nuclear grade • Patient/Person Gender, Age range, Vital status, Survival time Relapse type, Status at follow-up • Gene Expression (normalized) • DNA methylation •Computed Copy Number and Loss of Heterozygosity • Somatic variants from Exome or WGS ICGC Open Access Datasets http://goo.gl/w4mrV
  • 38. ONTARIO INSTITUTE FOR CANCER RESEARC • Primary sequence data (BAM and FASTQ files) • SNP6 array level 1 and level 2 data • Exon array level 1 and level 2 data • Somatic variants from whole genome sequencing • Certain information in MAFs • A full list of controlled-access data types can be found at: http://goo.gl/K1h7zu TCGA Controlled Access Datasets • De-identified clinical and demographic data • Gene expression data • Copy number alterations in regions of the genome • Epigenetic data • Summaries of data compiled across individuals • Anonymized single amplicon DNA sequence data • Somatic variants from scrubbed exome sequencing TCGA Open Access Datasets http://goo.gl/A1rMRB
  • 39. ONTARIO INSTITUTE FOR CANCER RESEARC Can we do better? 39
  • 40. ONTARIO INSTITUTE FOR CANCER RESEARC From ICGC/TCGA 40 Each groups have been free to decide on their own if they wanted to sequence Exomes or Whole Genomes. A bit more than 10% of all genomes done were done with Whole Genome Sequencing A steering comitte was formed and we decided to alnalyze these WG in a robust way with the primary question of figuring out what was hidden in the genomic sequence of cancer patients!
  • 41. ONTARIO INSTITUTE FOR CANCER RESEARC 41
  • 42. ONTARIO INSTITUTE FOR CANCER RESEARC Steering Committee of PCAWG 42 Peter Campbell, Sanger Inst. Gady Getz, Broad Jan Korbel, EMBL Lincoln Stein, OICR Josh Stuart, UCSC
  • 43. ONTARIO INSTITUTE FOR CANCER RESEARC PanCancer Analysis of Whole Genomes (PCAWG) > 2,800 T/N pairs with clinical data from 20 tumour type of whole genome analysis. Aligned with one standard pipeline. Genomic Variants determined with 3 pipelines 17 working groups Start writing papers now
  • 44. ONTARIO INSTITUTE FOR CANCER RESEARC Deliverable for PCAWG will include: 44 1st PANCANCER analysis on > 2,800 cancer tumours from a WGS perspective RNA, SSM, CNV, Methylation analysis & germline Published (executable) pipelines Docker / Dockstore Mutiple cloud access to data Multiple portal access to data
  • 45. ONTARIO INSTITUTE FOR CANCER RESEARC https://dcc.icgc.org/pcawg 45
  • 46. ONTARIO INSTITUTE FOR CANCER RESEARC Working Groups (1/2) 46 1 Novel somatic mutation calling methods 2 Analysis of mutations in regulatory regions 3 Integration of transcriptome and genome 4 Integration of epigenome and genome 5 Consequences of somatic mutations on pathway and network activity 6 Patterns of structural variations, signatures, genomic correlations, retrotransposons, mobile elements 7 Mutation signatures and processes 8 Germline cancer genome
  • 47. ONTARIO INSTITUTE FOR CANCER RESEARC Working Groups (1/2) 47 9 Inferring driver mutations and identifying cancer genes and pathways 10 Translating cancer genomes to the clinic 11 Evolution and heterogeneity 12 Exploratory: portals, visualization and software infrastructure 13 Molecular subtypes and classification 14 Analysis of mutations in non-coding RNA 15 Exploratory: mitochondrial 16 Exploratory: pathogens Tech Technical working group
  • 48. ONTARIO INSTITUTE FOR CANCER RESEARC http://dockstore.org 48
  • 49. ONTARIO INSTITUTE FOR CANCER RESEARC 49 PCAWG pipelines now on Dockstore
  • 50. ONTARIO INSTITUTE FOR CANCER RESEARC DOCKSTORE testing group 50 Andrew Duncan, OICR Christina Yung, OICR Denis Yuen, OICR Zhibin Lu, OICR Brian O’Connor, UCSC Alex Buchanan, OHSU Kyle Ellrott, OHSU Francis Ouellette, OICR Gordon Saksena, Broad Junjun Zhang, OICR Miguel Vazquez, CNIO Oliver Hofmann, Australia Solomon Shorser, OICR Adam Strucka, OHSU
  • 51. ONTARIO INSTITUTE FOR CANCER RESEARC Challenges: 51 Too many conference calls! Too many clouds Even though we learned from what not to do with ICGC, we had to learn what not to do in the clouds. TCGA and ICGC have different authorization protocols Not all data can exist everywhere Dockstore testing is taking too long!
  • 52. ONTARIO INSTITUTE FOR CANCER RESEARC Other projects in planning ICGC to finish in Spring of 2018 Planning for ICGCmed ICGC 1: 25,000 tumours (DNA, RNA, Epigenome, Clinical data) ICGCmed: 200,000 Tumours (DNA, RNA, Epigenome, Clinical trial) ICGC1 was the picture, ICGCmed will be the movie (before and after treatment). Submission system with one place for data and metadata Tools/links directory portal
  • 53. ONTARIO INSTITUTE FOR CANCER RESEARC 53 29,647
  • 54. ONTARIO INSTITUTE FOR CANCER RESEARC 54 29,647
  • 55. ONTARIO INSTITUTE FOR CANCER RESEARC 55 2,834
  • 56. ONTARIO INSTITUTE FOR CANCER RESEARC 56 2,834
  • 57. ONTARIO INSTITUTE FOR CANCER RESEARC 57 1477
  • 58. ONTARIO INSTITUTE FOR CANCER RESEARC 58 1477
  • 59. ONTARIO INSTITUTE FOR CANCER RESEARC 59 915
  • 60. ONTARIO INSTITUTE FOR CANCER RESEARC 60 915
  • 61. ONTARIO INSTITUTE FOR CANCER RESEARC 61 20
  • 62. ONTARIO INSTITUTE FOR CANCER RESEARC 62 20
  • 63. ONTARIO INSTITUTE FOR CANCER RESEARC 17
  • 64. ONTARIO INSTITUTE FOR CANCER RESEARC http://bioinformatics.ca/ 17
  • 65. ONTARIO INSTITUTE FOR CANCER RESEARC 65 12
  • 66. ONTARIO INSTITUTE FOR CANCER RESEARC 66 0-Toronto1-Bethesda2-Hinxton 4-Queensland 3-Madrid5-Kyoto 7-Hidelberg 6-Cannes8-Toronto 9-Beijing 10-Mumbai11- Boston 12
  • 67. ONTARIO INSTITUTE FOR CANCER RESEARC 67 10
  • 68. ONTARIO INSTITUTE FOR CANCER RESEARC Informatics & BioComputing @ OICR 68 10
  • 69. ONTARIO INSTITUTE FOR CANCER RESEARC 9
  • 70. ONTARIO INSTITUTE FOR CANCER RESEARC 9
  • 71. ONTARIO INSTITUTE FOR CANCER RESEARC 71 1
  • 72. ONTARIO INSTITUTE FOR CANCER RESEARC Bioinformatics.ca workshops Content 72 http://bioinformatics-ca.github.io/ https://goo.gl/CGu13q 1
  • 73. ONTARIO INSTITUTE FOR CANCER RESEARC DCC Software Developer Vincent Ferretti Dusan Andric Phuong-My Do Francois Gerthoffert Terry Lin Michael Moncada Vitalii Slobodianyk Bob Tiernay Douglas Wong Linda Xiang Junjun Zhang Acknowledgments ICGC/OICR Project leaders: Tom Hudson John McPherson Lincoln Stein Jared Simpson Paul Boutros Vincent Ferretti Francis Ouellette Jennifer Jennings Christine Yung Ouellette Lab Alysha Moncrieffe Ann Meyer Zhibin Lu Web Dev Joseph Yamada Kaman Wu Kim Cullion Koji Miyauchi Miyuki Fukuma ICGC DCC Biocuration Hardeep Nahal Marc Perry http://oicr.on.ca http://icgc.org … and all the patients and their families that that are putting their hopes into our work! Research IT/Systems David Sutton, Bob Gibson David Magda Rob Naccarato Brian Ott Gino Yearwood EGA Jordi Rambla De Argila Arcadi Navarro Audald Iloret Mauricio Moldes
  • 74. ONTARIO INSTITUTE FOR CANCER RESEARC http://icgc.org http://dcc.icgc.org http://docs.icgc.org info@icgc.org http://bioinformatics.ca
  • 75. ONTARIO INSTITUTE FOR CANCER RESEARC We are hiring: • OICR Director • Genome Technology Director • Junior Faculty in Informatics & Biocomputing • PDFs Interested? Ask Paul Boutros or I
  • 76. ONTARIO INSTITUTE FOR CANCER RESEARC 76 Muchas gracias!