SlideShare ist ein Scribd-Unternehmen logo
1 von 72
Provided to you by the
Canadian Bioinformatics
Workshop series
www.bioinformatics.ca
NCRI Cancer Conference:
Cancer data and its analysis
practical workshop
November 1, 2015
2Module #: Title of Module
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
You are free to:
Copy, share, adapt, or re-mix;
Photograph, film, or broadcast;
Blog, live-blog, or post video of;
This presentation. Provided that:
You attribute the work to its author and
respect the rights and licenses associated
with its components.
Slide Concept by Cameron Neylon, who has waived all copyright and related or neighbouring rights. This slide only ccZero.
Social Media Icons adapted with permission from originals by Christopher Ross. Original images are available under GPL at;
http://www.thisismyurl.com/free-downloads/15-free-speech-bubble-icons-for-popular-websites
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
Slides are on slideshare.net
• http://www.slideshare.net/bffo/cancer-uk-2015module1ouellettever02
Module 1
Cancer genomic databases
B.F. Francis Ouellette
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
@bffo
francis@oicr.on.caE-mail
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
Schedule for Module 1:
Cancer Genomic Databases
• Introduction to the Canadian Bioinformatics
Workshop series.
• The Databases:
– The Cancer Genome Atlas (TCGA)
– The International Cancer Genome Consortium (ICGC)
• Data Access: human genomes and security and
privacy issues:
Open Data vs. Controlled Access data
• Another Database:
– The Catalogue of Somatic Mutations in Cancer (COSMIC)
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
http://bioinformatics.ca/
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
http://bioinformatics.ca/workshops/2015/bioinformatics-cancer-genomics-2015
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
Workshops planned for 2016:
http://bioinformatics.ca/workshops
1. Bioinformatics for Cancer Genomics
2. High-throughput Biology: From Sequence to Networks (2017 - CSHL)
3. Introduction to R
4. Exploratory Analysis of Biological Data using R
5. Informatics for RNA-sequence Analysis
6. Informatics on High Throughput Sequencing Data
7. Pathway and Network Analysis of -omics Data
8. Informatics and Statistics for Metabolomics
9. Analysis of Metagenomic Data
10. How to Work in the Cloud: Computing on Human Genome Data
11. Epigenomic Data Analysis
12. Big Data in Precision Genomics
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
http://bioinformatics.ca/workshops/2015
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
E-mail: course_info@bioinformatics.ca
Web: http://bioinformatics.ca
Workshop announcement mailing list:
http://bioinformatics.ca/mailman/listinfo/announce
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
Soap-Box time!
• Open Access, Open Data and Open Source are essential for good
Science.
• Openness is a responsibility, an obligation, and something that comes
with the privilege of doing publicly funded work.
Open Access
Open Source
Open Data
Opencourseware
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
Cancer therapy is like
beating the dog with
a stick to get rid of
his fleas.
- Anna Deavere Smith,
Let me down easy
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
http://goo.gl/Yhbsj
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
The revolution in cancer
research can summed up
in a single sentence:
cancer is in essence,
a genetic disease.
- Bert Vogelstein
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
Cancer: a Disease of the Genome
Challenge in Treating Cancer:
 Every tumour is different
 Every cancer patient is different
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
https://en.wikipedia.org/wiki/List_of_databases_for_oncogenomic_research
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
Papers (PMID)
– TCGA: 24071849 21720365 23000897
22960745 22810696 24476821
– ICGC: 20393554
– COSMIC: 25355519
– Data Access: 22807659
http://www.ncbi.nlm.nih.gov/pubmed/[PMID]
NCRI Workshop 2015 – Module 1 bioinformatics.ca
TCGA
The Cancer Genome Atlas is a
comprehensive and coordinated
effort to accelerate our
understanding of the molecular
basis of cancer through the
application of genome analysis
technologies, including large-
scale genome sequencing.
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
About the TCGA
• National Cancer Institute (NCI)
• National Human Genome Research Institute
(NHGRI)
• Phased Structure:
– Three-year pilot in 2006 with an investment of $50 million
from each
– TCGA will collect and characterize more than 20 additional
tumour types
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
Where to start with the TCGA?
Wiki: https://wiki.nci.nih.gov/display/TCGA/About+TCGA
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
Division of Labour
• Biospecimen Core Resource (BCR)
– centre where samples are carefully catalogued, processed, qualitychecked
and stored along with participant clinical information
• Genome Sequencing Centre (GSC)
– uses high-throughput methods to identify changes to DNA sequences that are
associated with specific cancer types
• Genome Characterization Centre (GCC)
– uses high-throughput technologies to analyze genomic changes involved in cancer
• Genome Data Analysis Centre (GDAC)
– provides novel informatics tools to the research community
– provides analysis results using TCGA data.
• Data Coordinating Centre (DCC)
– Central provider of TCGA data.
– Standardizes data formats and validates submitted data.
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
TCGA Data
• Sequence reads from newer sequencing
technologies are available at the Cancer Genome
Hub: https://cghub.ucsc.edu/
• Higher level sequence data (variation calls and
abundance measures) are available at the TCGA
Portal: http://cancergenome.nih.gov/
• Also integrated with ICGC data (more on this later)
NCRI Workshop 2015 – Module 1 bioinformatics.ca
TCGA data flow
http://goo.gl/b5nojx
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
Data Coordinating Centre
• Play a central role
– Receiving data from BCR, GSC and GCC sites
– Providing access to users
– Performing analysis of data
• Responsibilities:
– Protecting participant privacy and confidentiality
– Developing data standards and controlled vocabularies
– Establishing informatics pipelines for data flow
– Developing new analytical and visualization technologies
to facilitate data analysis, for all audiences
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
TCGA DCC Data Portal
• Provides a platform to search, download and
analyze TCGA data sets
• Two data access tiers: Open and Controlled
• Analytic tools include: Cancer Molecular Analysis
and Cancer Genome Workbench (NCBIB),
Integrative Genomics Viewer (Broad) and
CancerGenomics Analysis (MSKCC).
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
TCGA Data Browser
https://tcga-data.nci.nih.gov/tcga/
Query TCGA
data online
using the
TCGA Data
Browser
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
The International Cancer Genome Consortium (ICGC)
• http://www.icgc.org/
• “ICGC was launched
to coordinate large-
scale cancer genome
studies in tumours
from 50 different
cancer types and/or
subtypes that are of
clinical and societal
importance across
the globe”
NCRI Workshop 2015 – Module 1 bioinformatics.ca
ICGC Map – February 2015
85 projects launched
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
ICGC datasets to date:
https://dcc.icgc.org/projects/history
NCRI Workshop 2015 – Module 1 bioinformatics.ca
NCRI Workshop 2015 – Module 1 bioinformatics.ca
NCRI Workshop 2015 – Module 1 bioinformatics.ca
Select “Pancreatic cancer – Canada”
NCRI Workshop 2015 – Module 1 bioinformatics.ca
… But where is the data?
NCRI Workshop 2015 – Module 1 bioinformatics.ca
NCRI Workshop 2015 – Module 1 bioinformatics.ca
http://dcc.icgc.org/
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
DACO
ICGC
dbGaP
EGA
TCGA
BA
M
Open
Open
ERA
BA
M
Germ
Line
+ EGA id
BA
M
BA
M
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
ICGC
BAM/FASTQ
TCGA
BAM/FASTQ
ICGC
Open
Data
(includes
TCGA
Open Data)
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
ICGC
TCGA
NCRI Workshop 2015 – Module 1 bioinformatics.ca
ICG
C
TCGA
Differences between ICGC & TCGA
• Different tumour types
• Different geographic rules
• Many countries vs one jurisdiction
• Different definitions of what is controlled
• Different data access rules
NCRI Workshop 2015 – Module 1 bioinformatics.ca
• Detailed Phenotype and Outcome data
• Gene Expression (probe-level data)
• Raw genotype calls
• Gene-sample identifier links
• Genome sequence files
• Germ line variants
ICGC Controlled
Access Datasets
• Cancer Pathology
Histologic type or subtype
Histologic nuclear grade
• Patient/Person
Gender, Age range,
Vital status, Survival time
Relapse type, Status at follow-up
• Gene Expression (normalized)
• DNA methylation
•Computed Copy Number and
Loss of Heterozygosity
• Somatic variants from Exome or WGS
ICGC Open
Access Datasets
http://goo.gl/w4mrV
NCRI Workshop 2015 – Module 1 bioinformatics.ca
• Primary sequence data
(BAM and FASTQ files)
• SNP6 array level 1 and level 2 data
• Exon array level 1 and level 2 data
• Somatic variants from whole
genome sequencing
• Certain information in MAFs
• A full list of controlled-access
data types can be found at:
http://goo.gl/K1h7zu
TCGA Controlled
Access Datasets
• De-identified clinical and
demographic data
• Gene expression data
• Copy number alterations in regions
of the genome
• Epigenetic data
• Summaries of data compiled across
individuals
• Anonymized single amplicon DNA
sequence data
• Somatic variants from scrubbed
exome sequencing
TCGA Open
Access Datasets
http://goo.gl/A1rMRB
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
TCGA/ICGC users agreed:
• … to keep all computer systems on which controlled
access data reside, or which provide access to such
data, up to date with respect to software and
security patches.
• … to protect Controlled Access Data against
disclosure to unauthorized individuals.
• … to monitor and control which individuals have
access to Controlled Access Data.
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
TCGA/ICGC users agreed:
• … to destroy all copies of controlled access data
after controlled access privileges expires.
• ... to only use secure transfer protocols:
e.g. https and sftp
• … to encrypt Controlled Access data in transfers
and storage
NCRI Workshop 2015 – Module 1 bioinformatics.ca
What does it mean for this file?
simple_somatic_mutation.aggregated.vcf.gz
https://dcc.icgc.org/repository/icgc/release_19/Summary
NCRI Workshop 2015 – Module 1 bioinformatics.ca
NCRI Workshop 2015 – Module 1 bioinformatics.ca
NCRI Workshop 2015 – Module 1 bioinformatics.ca
Identify
yourself
Fill out detail form which
includes:
• Contact and Project
Information
•Information Technology
details and procedures
for keeping data secure
•Data Access Agreement
All of these
documents are
put into a PDF
file that you
print and get your
institution to sign
off on your behalf
NCRI Workshop 2015 – Module 1 bioinformatics.ca
http://icgc.org/daco
NCRI Workshop 2015 – Module 1 bioinformatics.ca
NCRI Workshop 2015 – Module 1 bioinformatics.ca
NCRI Workshop 2015 – Module 1 bioinformatics.ca
NCRI Workshop 2015 – Module 1 bioinformatics.ca
• Name
• Institution
• Title of Project
• Collaborators
• Research Summary
• Lay Summary
• Ethics
• IT Security
• Cloud Storage
• Agreement
• Appendices
NCRI Workshop 2015 – Module 1 bioinformatics.ca
NCRI Workshop 2015 – Module 1 bioinformatics.ca
http://goo.gl/2UVLDJ
NCRI Workshop 2015 – Module 1 bioinformatics.ca
NCRI Workshop 2015 – Module 1 bioinformatics.ca
NCRI Workshop 2015 – Module 1 bioinformatics.ca
NCRI Workshop 2015 – Module 1 bioinformatics.ca
DACO approved projects
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
DACO/DCC User Data Access Process
• Users approved through DACO are now automatically granted access to
ICGC controlled access datasets available through the ICGC Data Portal and
the EBI’s EGA repository
DACO Web
Application
DCC User
Registry
DCC Data
Portal
EBI EGA
application
approved
by DACO
user
accounts
activated
NCRI Workshop 2015 – Module 1 bioinformatics.ca
Catalogue of Somatic Mutations in Cancer
(COSMIC) • http://cancer.sanger.ac.uk/cancerg
enome/projects/cosmic/
• COSMIC is designed
to store and display
somatic mutation
information and
related details and
contains information
relating to human
cancers.
ICGC
BAM/FASTQ
TCGA
BAM/FASTQ
ICGC
Open
Data
(includes
TCGA
Open Data)
COSMIC
Open
Data
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
COSMIC
• Somatic Mutations Only
• Diverse sources
– Literature (Arrays, Next-Gen, PCR...)
– TCGA
– ICGC
• Diverse ways to look at data
– Gene
– Variation
– Tumour type
– Cell line
– Experiment
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
FAQ
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
Looking up your favorite gene
1 2 3
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
bioinformatics.ca
NCRI Workshop 2015
NCRI Workshop 2015 – Module 1
In closing
• Remember all these sites have great amounts of
documentation
• The field is changing quickly, and so are the portals.
• New features are planned as we speak, and so you
need to use the sites, and keep coming back.
• Don’t be afraid to explore
• Interested in learning more after today? Consider
one of the bioinformatics.ca workshops!
NCRI Workshop 2015 – Module 1 bioinformatics.ca
Acknowledgements:
the CBW gang
Michelle Brazas
Michael
Stromberg
Marc
Fiume
Michael
Brudno

Weitere ähnliche Inhalte

Was ist angesagt?

GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.caGenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.cafionabrinkman
 
Cross-Disciplinary Biomedical Research at Calit2
Cross-Disciplinary Biomedical Research at Calit2Cross-Disciplinary Biomedical Research at Calit2
Cross-Disciplinary Biomedical Research at Calit2Larry Smarr
 
Free webinar-introduction to bioinformatics - biologist-1
Free webinar-introduction to bioinformatics - biologist-1Free webinar-introduction to bioinformatics - biologist-1
Free webinar-introduction to bioinformatics - biologist-1Elia Brodsky
 
Sigma Xi 2021 Andrew Gao Presentation
Sigma Xi 2021 Andrew Gao PresentationSigma Xi 2021 Andrew Gao Presentation
Sigma Xi 2021 Andrew Gao PresentationAndrewGao12
 
cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)Pistoia Alliance
 
The Global Micorbial Identifier (GMI) initiative - and its working groups
The Global Micorbial Identifier (GMI) initiative - and its working groupsThe Global Micorbial Identifier (GMI) initiative - and its working groups
The Global Micorbial Identifier (GMI) initiative - and its working groupsExternalEvents
 
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...Elia Brodsky
 
Personal Genomes: what can I do with my data?
Personal Genomes: what can I do with my data?Personal Genomes: what can I do with my data?
Personal Genomes: what can I do with my data?Melanie Swan
 
Bioinformatics, its application main
Bioinformatics, its application mainBioinformatics, its application main
Bioinformatics, its application mainKAUSHAL SAHU
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to BioinformaticsLeighton Pritchard
 
DNA Testing: Living Longer Via Personal Genomics
DNA Testing: Living Longer Via Personal GenomicsDNA Testing: Living Longer Via Personal Genomics
DNA Testing: Living Longer Via Personal GenomicsMelanie Swan
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformaticsphilmaweb
 
AI in Bioinformatics
AI in BioinformaticsAI in Bioinformatics
AI in BioinformaticsAli Kishk
 
Multi-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsMulti-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsChristoph Steinbeck
 
Application of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticultureApplication of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticultureDr.Hetalkumar Panchal
 

Was ist angesagt? (20)

Enriching Scholarship Personal Genomics presentation
Enriching Scholarship Personal Genomics presentationEnriching Scholarship Personal Genomics presentation
Enriching Scholarship Personal Genomics presentation
 
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.caGenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
 
Cross-Disciplinary Biomedical Research at Calit2
Cross-Disciplinary Biomedical Research at Calit2Cross-Disciplinary Biomedical Research at Calit2
Cross-Disciplinary Biomedical Research at Calit2
 
Free webinar-introduction to bioinformatics - biologist-1
Free webinar-introduction to bioinformatics - biologist-1Free webinar-introduction to bioinformatics - biologist-1
Free webinar-introduction to bioinformatics - biologist-1
 
JALANov2000
JALANov2000JALANov2000
JALANov2000
 
Sigma Xi 2021 Andrew Gao Presentation
Sigma Xi 2021 Andrew Gao PresentationSigma Xi 2021 Andrew Gao Presentation
Sigma Xi 2021 Andrew Gao Presentation
 
cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)cBioPortal Webinar Slides (2/3)
cBioPortal Webinar Slides (2/3)
 
The Global Micorbial Identifier (GMI) initiative - and its working groups
The Global Micorbial Identifier (GMI) initiative - and its working groupsThe Global Micorbial Identifier (GMI) initiative - and its working groups
The Global Micorbial Identifier (GMI) initiative - and its working groups
 
Open data genomics_palermo_2017_ver03
Open data genomics_palermo_2017_ver03Open data genomics_palermo_2017_ver03
Open data genomics_palermo_2017_ver03
 
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
 
Personal Genomes: what can I do with my data?
Personal Genomes: what can I do with my data?Personal Genomes: what can I do with my data?
Personal Genomes: what can I do with my data?
 
Brief introduction to Bioinformatics
Brief introduction to BioinformaticsBrief introduction to Bioinformatics
Brief introduction to Bioinformatics
 
Bioinformatics, its application main
Bioinformatics, its application mainBioinformatics, its application main
Bioinformatics, its application main
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
DNA Testing: Living Longer Via Personal Genomics
DNA Testing: Living Longer Via Personal GenomicsDNA Testing: Living Longer Via Personal Genomics
DNA Testing: Living Longer Via Personal Genomics
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
AI in Bioinformatics
AI in BioinformaticsAI in Bioinformatics
AI in Bioinformatics
 
Bioinformatics: What, Why and Where?
Bioinformatics: What, Why and Where?Bioinformatics: What, Why and Where?
Bioinformatics: What, Why and Where?
 
Multi-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsMulti-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application Domains
 
Application of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticultureApplication of bioinformatics in climate smart horticulture
Application of bioinformatics in climate smart horticulture
 

Ähnlich wie Cancer uk 2015_module1_ouellette_ver02

CORBEL BBMRI-ERIC QM webinar slides
CORBEL BBMRI-ERIC QM webinar slidesCORBEL BBMRI-ERIC QM webinar slides
CORBEL BBMRI-ERIC QM webinar slidesCORBEL
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECAProject
 
cBioPortal Webinar Slides (3/3)
cBioPortal Webinar Slides (3/3)cBioPortal Webinar Slides (3/3)
cBioPortal Webinar Slides (3/3)Pistoia Alliance
 
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiaoIRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiaoIRIDA_community
 
GCAT Update June 2013 @ The Clinical Genome Conference
GCAT Update June 2013 @ The Clinical Genome ConferenceGCAT Update June 2013 @ The Clinical Genome Conference
GCAT Update June 2013 @ The Clinical Genome ConferenceDavid Mittelman
 
ELIXIR Pilot Actions launched in 2014: Integration of BILS-ProteomeXchange us...
ELIXIR Pilot Actions launched in 2014: Integration of BILS-ProteomeXchange us...ELIXIR Pilot Actions launched in 2014: Integration of BILS-ProteomeXchange us...
ELIXIR Pilot Actions launched in 2014: Integration of BILS-ProteomeXchange us...Juan Antonio Vizcaino
 
Whole slide imaging: beyond pathology (Pittsburgh Computational Pathology Lec...
Whole slide imaging: beyond pathology (Pittsburgh Computational Pathology Lec...Whole slide imaging: beyond pathology (Pittsburgh Computational Pathology Lec...
Whole slide imaging: beyond pathology (Pittsburgh Computational Pathology Lec...Yves Sucaet
 
IRIDA: Canada’s federated platform for genomic epidemiology
IRIDA: Canada’s federated platform for genomic epidemiology IRIDA: Canada’s federated platform for genomic epidemiology
IRIDA: Canada’s federated platform for genomic epidemiology William Hsiao
 
Digital pathology and its importance as an omics data layer
Digital pathology and its importance as an omics data layerDigital pathology and its importance as an omics data layer
Digital pathology and its importance as an omics data layerYves Sucaet
 
GRIN GLOBAL implementation - CIP 2017
GRIN GLOBAL implementation - CIP 2017GRIN GLOBAL implementation - CIP 2017
GRIN GLOBAL implementation - CIP 2017Edwin Rojas
 
A global integrative ecosystem for digital pathology: how can we get there?
A global integrative ecosystem for digital pathology: how can we get there?A global integrative ecosystem for digital pathology: how can we get there?
A global integrative ecosystem for digital pathology: how can we get there?Yves Sucaet
 

Ähnlich wie Cancer uk 2015_module1_ouellette_ver02 (20)

Omprn 2018 module1_final
Omprn 2018 module1_finalOmprn 2018 module1_final
Omprn 2018 module1_final
 
CORBEL BBMRI-ERIC QM webinar slides
CORBEL BBMRI-ERIC QM webinar slidesCORBEL BBMRI-ERIC QM webinar slides
CORBEL BBMRI-ERIC QM webinar slides
 
GBIF Work Programme 2016 Update
GBIF Work Programme 2016 UpdateGBIF Work Programme 2016 Update
GBIF Work Programme 2016 Update
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
 
cBioPortal Webinar Slides (3/3)
cBioPortal Webinar Slides (3/3)cBioPortal Webinar Slides (3/3)
cBioPortal Webinar Slides (3/3)
 
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiaoIRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
 
GCAT Update June 2013 @ The Clinical Genome Conference
GCAT Update June 2013 @ The Clinical Genome ConferenceGCAT Update June 2013 @ The Clinical Genome Conference
GCAT Update June 2013 @ The Clinical Genome Conference
 
ELIXIR Pilot Actions launched in 2014: Integration of BILS-ProteomeXchange us...
ELIXIR Pilot Actions launched in 2014: Integration of BILS-ProteomeXchange us...ELIXIR Pilot Actions launched in 2014: Integration of BILS-ProteomeXchange us...
ELIXIR Pilot Actions launched in 2014: Integration of BILS-ProteomeXchange us...
 
Whole slide imaging: beyond pathology (Pittsburgh Computational Pathology Lec...
Whole slide imaging: beyond pathology (Pittsburgh Computational Pathology Lec...Whole slide imaging: beyond pathology (Pittsburgh Computational Pathology Lec...
Whole slide imaging: beyond pathology (Pittsburgh Computational Pathology Lec...
 
IRIDA: Canada’s federated platform for genomic epidemiology
IRIDA: Canada’s federated platform for genomic epidemiology IRIDA: Canada’s federated platform for genomic epidemiology
IRIDA: Canada’s federated platform for genomic epidemiology
 
Digital pathology and its importance as an omics data layer
Digital pathology and its importance as an omics data layerDigital pathology and its importance as an omics data layer
Digital pathology and its importance as an omics data layer
 
Human microbiome project
Human microbiome projectHuman microbiome project
Human microbiome project
 
Three trends in cybersecurity
Three trends in cybersecurityThree trends in cybersecurity
Three trends in cybersecurity
 
GRIN GLOBAL implementation - CIP 2017
GRIN GLOBAL implementation - CIP 2017GRIN GLOBAL implementation - CIP 2017
GRIN GLOBAL implementation - CIP 2017
 
HZ Health IT Cluster Collaborative Project Update
HZ Health IT Cluster Collaborative Project UpdateHZ Health IT Cluster Collaborative Project Update
HZ Health IT Cluster Collaborative Project Update
 
Ilik - Beyond the Manuscript: Using IRs for Non Traditional Content Types
Ilik - Beyond the Manuscript: Using IRs for Non Traditional Content TypesIlik - Beyond the Manuscript: Using IRs for Non Traditional Content Types
Ilik - Beyond the Manuscript: Using IRs for Non Traditional Content Types
 
Linked data in industry
Linked data in industryLinked data in industry
Linked data in industry
 
Overview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data AnalysisOverview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data Analysis
 
2019 Triangle Machine Learning Day - Biomedical Image Understanding and EHRs ...
2019 Triangle Machine Learning Day - Biomedical Image Understanding and EHRs ...2019 Triangle Machine Learning Day - Biomedical Image Understanding and EHRs ...
2019 Triangle Machine Learning Day - Biomedical Image Understanding and EHRs ...
 
A global integrative ecosystem for digital pathology: how can we get there?
A global integrative ecosystem for digital pathology: how can we get there?A global integrative ecosystem for digital pathology: how can we get there?
A global integrative ecosystem for digital pathology: how can we get there?
 

Kürzlich hochgeladen

Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Silpa
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Silpa
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Serviceshivanisharma5244
 
Velocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.pptVelocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.pptRakeshMohan42
 
Introduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptxIntroduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptxrohankumarsinghrore1
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxSuji236384
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)AkefAfaneh2
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY1301aanya
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxDiariAli
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)Areesha Ahmad
 
Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfrohankumarsinghrore1
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Silpa
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsOrtegaSyrineMay
 

Kürzlich hochgeladen (20)

Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
Velocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.pptVelocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.ppt
 
Introduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptxIntroduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptx
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdf
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 

Cancer uk 2015_module1_ouellette_ver02

  • 1. Provided to you by the Canadian Bioinformatics Workshop series www.bioinformatics.ca NCRI Cancer Conference: Cancer data and its analysis practical workshop November 1, 2015
  • 2. 2Module #: Title of Module
  • 3. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 You are free to: Copy, share, adapt, or re-mix; Photograph, film, or broadcast; Blog, live-blog, or post video of; This presentation. Provided that: You attribute the work to its author and respect the rights and licenses associated with its components. Slide Concept by Cameron Neylon, who has waived all copyright and related or neighbouring rights. This slide only ccZero. Social Media Icons adapted with permission from originals by Christopher Ross. Original images are available under GPL at; http://www.thisismyurl.com/free-downloads/15-free-speech-bubble-icons-for-popular-websites
  • 4. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 Slides are on slideshare.net • http://www.slideshare.net/bffo/cancer-uk-2015module1ouellettever02
  • 5. Module 1 Cancer genomic databases B.F. Francis Ouellette
  • 6. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 @bffo francis@oicr.on.caE-mail
  • 7. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 Schedule for Module 1: Cancer Genomic Databases • Introduction to the Canadian Bioinformatics Workshop series. • The Databases: – The Cancer Genome Atlas (TCGA) – The International Cancer Genome Consortium (ICGC) • Data Access: human genomes and security and privacy issues: Open Data vs. Controlled Access data • Another Database: – The Catalogue of Somatic Mutations in Cancer (COSMIC)
  • 8. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 http://bioinformatics.ca/
  • 9. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 http://bioinformatics.ca/workshops/2015/bioinformatics-cancer-genomics-2015
  • 10. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 Workshops planned for 2016: http://bioinformatics.ca/workshops 1. Bioinformatics for Cancer Genomics 2. High-throughput Biology: From Sequence to Networks (2017 - CSHL) 3. Introduction to R 4. Exploratory Analysis of Biological Data using R 5. Informatics for RNA-sequence Analysis 6. Informatics on High Throughput Sequencing Data 7. Pathway and Network Analysis of -omics Data 8. Informatics and Statistics for Metabolomics 9. Analysis of Metagenomic Data 10. How to Work in the Cloud: Computing on Human Genome Data 11. Epigenomic Data Analysis 12. Big Data in Precision Genomics
  • 11. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 http://bioinformatics.ca/workshops/2015
  • 12. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 E-mail: course_info@bioinformatics.ca Web: http://bioinformatics.ca Workshop announcement mailing list: http://bioinformatics.ca/mailman/listinfo/announce
  • 13. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 Soap-Box time! • Open Access, Open Data and Open Source are essential for good Science. • Openness is a responsibility, an obligation, and something that comes with the privilege of doing publicly funded work. Open Access Open Source Open Data Opencourseware
  • 14. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1
  • 15. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 Cancer therapy is like beating the dog with a stick to get rid of his fleas. - Anna Deavere Smith, Let me down easy
  • 16. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 http://goo.gl/Yhbsj
  • 17. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 The revolution in cancer research can summed up in a single sentence: cancer is in essence, a genetic disease. - Bert Vogelstein
  • 18. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 Cancer: a Disease of the Genome Challenge in Treating Cancer:  Every tumour is different  Every cancer patient is different
  • 19. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 https://en.wikipedia.org/wiki/List_of_databases_for_oncogenomic_research
  • 20. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1
  • 21. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 Papers (PMID) – TCGA: 24071849 21720365 23000897 22960745 22810696 24476821 – ICGC: 20393554 – COSMIC: 25355519 – Data Access: 22807659 http://www.ncbi.nlm.nih.gov/pubmed/[PMID]
  • 22. NCRI Workshop 2015 – Module 1 bioinformatics.ca TCGA The Cancer Genome Atlas is a comprehensive and coordinated effort to accelerate our understanding of the molecular basis of cancer through the application of genome analysis technologies, including large- scale genome sequencing.
  • 23. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 About the TCGA • National Cancer Institute (NCI) • National Human Genome Research Institute (NHGRI) • Phased Structure: – Three-year pilot in 2006 with an investment of $50 million from each – TCGA will collect and characterize more than 20 additional tumour types
  • 24. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 Where to start with the TCGA? Wiki: https://wiki.nci.nih.gov/display/TCGA/About+TCGA
  • 25. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 Division of Labour • Biospecimen Core Resource (BCR) – centre where samples are carefully catalogued, processed, qualitychecked and stored along with participant clinical information • Genome Sequencing Centre (GSC) – uses high-throughput methods to identify changes to DNA sequences that are associated with specific cancer types • Genome Characterization Centre (GCC) – uses high-throughput technologies to analyze genomic changes involved in cancer • Genome Data Analysis Centre (GDAC) – provides novel informatics tools to the research community – provides analysis results using TCGA data. • Data Coordinating Centre (DCC) – Central provider of TCGA data. – Standardizes data formats and validates submitted data.
  • 26. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 TCGA Data • Sequence reads from newer sequencing technologies are available at the Cancer Genome Hub: https://cghub.ucsc.edu/ • Higher level sequence data (variation calls and abundance measures) are available at the TCGA Portal: http://cancergenome.nih.gov/ • Also integrated with ICGC data (more on this later)
  • 27. NCRI Workshop 2015 – Module 1 bioinformatics.ca TCGA data flow http://goo.gl/b5nojx
  • 28. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 Data Coordinating Centre • Play a central role – Receiving data from BCR, GSC and GCC sites – Providing access to users – Performing analysis of data • Responsibilities: – Protecting participant privacy and confidentiality – Developing data standards and controlled vocabularies – Establishing informatics pipelines for data flow – Developing new analytical and visualization technologies to facilitate data analysis, for all audiences
  • 29. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 TCGA DCC Data Portal • Provides a platform to search, download and analyze TCGA data sets • Two data access tiers: Open and Controlled • Analytic tools include: Cancer Molecular Analysis and Cancer Genome Workbench (NCBIB), Integrative Genomics Viewer (Broad) and CancerGenomics Analysis (MSKCC).
  • 30. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 TCGA Data Browser https://tcga-data.nci.nih.gov/tcga/ Query TCGA data online using the TCGA Data Browser
  • 31. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 The International Cancer Genome Consortium (ICGC) • http://www.icgc.org/ • “ICGC was launched to coordinate large- scale cancer genome studies in tumours from 50 different cancer types and/or subtypes that are of clinical and societal importance across the globe”
  • 32. NCRI Workshop 2015 – Module 1 bioinformatics.ca ICGC Map – February 2015 85 projects launched
  • 33. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 ICGC datasets to date: https://dcc.icgc.org/projects/history
  • 34. NCRI Workshop 2015 – Module 1 bioinformatics.ca
  • 35. NCRI Workshop 2015 – Module 1 bioinformatics.ca
  • 36. NCRI Workshop 2015 – Module 1 bioinformatics.ca Select “Pancreatic cancer – Canada”
  • 37. NCRI Workshop 2015 – Module 1 bioinformatics.ca … But where is the data?
  • 38. NCRI Workshop 2015 – Module 1 bioinformatics.ca
  • 39. NCRI Workshop 2015 – Module 1 bioinformatics.ca http://dcc.icgc.org/
  • 40. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 DACO ICGC dbGaP EGA TCGA BA M Open Open ERA BA M Germ Line + EGA id BA M BA M
  • 41. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 ICGC BAM/FASTQ TCGA BAM/FASTQ ICGC Open Data (includes TCGA Open Data)
  • 42. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 ICGC TCGA
  • 43. NCRI Workshop 2015 – Module 1 bioinformatics.ca ICG C TCGA Differences between ICGC & TCGA • Different tumour types • Different geographic rules • Many countries vs one jurisdiction • Different definitions of what is controlled • Different data access rules
  • 44. NCRI Workshop 2015 – Module 1 bioinformatics.ca • Detailed Phenotype and Outcome data • Gene Expression (probe-level data) • Raw genotype calls • Gene-sample identifier links • Genome sequence files • Germ line variants ICGC Controlled Access Datasets • Cancer Pathology Histologic type or subtype Histologic nuclear grade • Patient/Person Gender, Age range, Vital status, Survival time Relapse type, Status at follow-up • Gene Expression (normalized) • DNA methylation •Computed Copy Number and Loss of Heterozygosity • Somatic variants from Exome or WGS ICGC Open Access Datasets http://goo.gl/w4mrV
  • 45. NCRI Workshop 2015 – Module 1 bioinformatics.ca • Primary sequence data (BAM and FASTQ files) • SNP6 array level 1 and level 2 data • Exon array level 1 and level 2 data • Somatic variants from whole genome sequencing • Certain information in MAFs • A full list of controlled-access data types can be found at: http://goo.gl/K1h7zu TCGA Controlled Access Datasets • De-identified clinical and demographic data • Gene expression data • Copy number alterations in regions of the genome • Epigenetic data • Summaries of data compiled across individuals • Anonymized single amplicon DNA sequence data • Somatic variants from scrubbed exome sequencing TCGA Open Access Datasets http://goo.gl/A1rMRB
  • 46. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 TCGA/ICGC users agreed: • … to keep all computer systems on which controlled access data reside, or which provide access to such data, up to date with respect to software and security patches. • … to protect Controlled Access Data against disclosure to unauthorized individuals. • … to monitor and control which individuals have access to Controlled Access Data.
  • 47. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 TCGA/ICGC users agreed: • … to destroy all copies of controlled access data after controlled access privileges expires. • ... to only use secure transfer protocols: e.g. https and sftp • … to encrypt Controlled Access data in transfers and storage
  • 48. NCRI Workshop 2015 – Module 1 bioinformatics.ca What does it mean for this file? simple_somatic_mutation.aggregated.vcf.gz https://dcc.icgc.org/repository/icgc/release_19/Summary
  • 49. NCRI Workshop 2015 – Module 1 bioinformatics.ca
  • 50. NCRI Workshop 2015 – Module 1 bioinformatics.ca
  • 51. NCRI Workshop 2015 – Module 1 bioinformatics.ca Identify yourself Fill out detail form which includes: • Contact and Project Information •Information Technology details and procedures for keeping data secure •Data Access Agreement All of these documents are put into a PDF file that you print and get your institution to sign off on your behalf
  • 52. NCRI Workshop 2015 – Module 1 bioinformatics.ca http://icgc.org/daco
  • 53. NCRI Workshop 2015 – Module 1 bioinformatics.ca
  • 54. NCRI Workshop 2015 – Module 1 bioinformatics.ca
  • 55. NCRI Workshop 2015 – Module 1 bioinformatics.ca
  • 56. NCRI Workshop 2015 – Module 1 bioinformatics.ca • Name • Institution • Title of Project • Collaborators • Research Summary • Lay Summary • Ethics • IT Security • Cloud Storage • Agreement • Appendices
  • 57. NCRI Workshop 2015 – Module 1 bioinformatics.ca
  • 58. NCRI Workshop 2015 – Module 1 bioinformatics.ca http://goo.gl/2UVLDJ
  • 59. NCRI Workshop 2015 – Module 1 bioinformatics.ca
  • 60. NCRI Workshop 2015 – Module 1 bioinformatics.ca
  • 61. NCRI Workshop 2015 – Module 1 bioinformatics.ca
  • 62. NCRI Workshop 2015 – Module 1 bioinformatics.ca DACO approved projects
  • 63. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 DACO/DCC User Data Access Process • Users approved through DACO are now automatically granted access to ICGC controlled access datasets available through the ICGC Data Portal and the EBI’s EGA repository DACO Web Application DCC User Registry DCC Data Portal EBI EGA application approved by DACO user accounts activated
  • 64. NCRI Workshop 2015 – Module 1 bioinformatics.ca Catalogue of Somatic Mutations in Cancer (COSMIC) • http://cancer.sanger.ac.uk/cancerg enome/projects/cosmic/ • COSMIC is designed to store and display somatic mutation information and related details and contains information relating to human cancers.
  • 66. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 COSMIC • Somatic Mutations Only • Diverse sources – Literature (Arrays, Next-Gen, PCR...) – TCGA – ICGC • Diverse ways to look at data – Gene – Variation – Tumour type – Cell line – Experiment
  • 67. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 FAQ
  • 68. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 Looking up your favorite gene 1 2 3
  • 69. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1
  • 70. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1
  • 71. bioinformatics.ca NCRI Workshop 2015 NCRI Workshop 2015 – Module 1 In closing • Remember all these sites have great amounts of documentation • The field is changing quickly, and so are the portals. • New features are planned as we speak, and so you need to use the sites, and keep coming back. • Don’t be afraid to explore • Interested in learning more after today? Consider one of the bioinformatics.ca workshops!
  • 72. NCRI Workshop 2015 – Module 1 bioinformatics.ca Acknowledgements: the CBW gang Michelle Brazas Michael Stromberg Marc Fiume Michael Brudno