SlideShare a Scribd company logo
1 of 26
Download to read offline
Path-OS: The Curation of
Cancer Samples
Ken Doig – Bioinformatics Research Core
Peter MacCallum Cancer Centre
ken.doig@petermac.org
Agenda
•  Context
•  System overview
•  Amplicon Panels
•  Filtering
•  Futures
22 May 2014 HVP5 Path-OS 2
The Context
What we do
•  Peter MacCallum Cancer Centre
–  Molecular Pathology Department
•  Provide pathology services to the hospital and ext. labs.
•  Blood and tumour tissue samples
•  Targeted genetic sequencing using amplicon panels
•  Between 4-50 cancer specific genes
•  Looking for needles in haystacks
•  Very sensitive assays
...
...
AAAAGCAGGT TATATAGGCT AAATAGAACT AATCATTGTT TTAGACATAC TTATTGACTC TAAGAGGAAA
TCATAATGCT TGCTCTGATA GGAAAATGAG ATCTACTGTT TTCCTTTACT TACTACACCT CAGATATATT
TCTTCATGAA GACCTCACAG TAAAAATAGG TGATGTTGGT AGCTAGGAGT GAAATCTCGA TGGAGTGGGT
CCCATCAGTT TGAACAGTTG TCTGGATCCA TTTTGTGGAT GGTAAGAATT GAGGCTATTT TTCCACTGAT
TAGTTCCCAG TATTCACAAA AATCAGTGTT CTTATTTTTT ATGTAAATAG ATTTTTTAAC TTTTTTCTTT
...
...
22 May 2014 HVP5 Path-OS 4
Peter Mac Curation Scope
•  Automate the processing from sequencer
to draft report
•  Automate curation evidence collection
•  Sanitise data from external sources
•  Automated reporting
•  Best practice software engineering
22 May 2014 HVP5 Path-OS 5
The System
Patient
Sample
Genologics
wet lab LIMS
External
Variant DBs
•  COSMIC
•  Ensembl
•  Annovar
•  UCSC
•  Clinvar
etc
Loader
Pipeline data repository
FASTQ
BAM
VCF
VEP
Pipeline PipeCleaner
PathOS
Web Server
Pipeline
Validation QC
Reporting
Pipeline
configuration
Sequencers
ETL
configuration
Periodic DB download
and integration
Sequencing QC
Clinical Reporting
Read QC
Synthtetic Reads
Known samples
Filtering
configuration
Users
•  molecular scientists
•  clinicians
•  researchers
Export curated variants
to global repositories
Hospital Records
22 May 2014 HVP5 Path-OS 7
Path-OS Overview
Run QC
This run in the context of
past runs of the same panel
Per sample read yield
highlighting below average
Amplicon performance
read distribution
22 May 2014 HVP5 Path-OS 8
Classification
Page
22 May 2014 HVP5 Path-OS 9
Automatically
generated
classification
Justification
free text field
Check boxes for
variant evidence
Evidence type
tool tip
Classifying variants for the clinic
22 May 2014 HVP5 Path-OS 10
C5: Pathogenic
C4: Likely pathogenic
C3: Unknown
pathogenicity
C2: Unlikely pathogenic
C1: Not pathogenic
5 Level Classification
Stand alone
Strong
Supporting
Criteria
or or
Pathogenic
evidence
Stand alone
Strong
Supporting
Benign
evidence
=
or =
or =or
or =
All other combinations =
Software Components
Role Package Overview
Language Groovy Java on steroids, powerful JVM language
Web Framework Grails Rich Groovy based high productivity framework
Code repository GitLab Private GitHub instance
Database MySQL Widely adopted RDB, good performance
User interactivity Javascript plugins Leverage best available js e.g. Jquery, Google Charts
Object Persistence Hibernate Java standard for mapping POJOs to RDB
Searching Lucene Full-featured text search engine
IoC Layer Spring Java standard for inversion of control
IDE IntelliJ Comprehensive developers environment for Java etc
Build Management Gradle Groovy based DSL leverages Ant and CoC
DB Migration Mgmt LiquiBase DSL based data migration tool for schema versioning
Issue Management Jira Best of breed issue management tracker
LIMS GenoLogics User friendly LIMS for NGS
Aligner Primal Peter Mac in-house amplicon aligner, tuned for amplicons
Variant Caller VarScan 2 Suitable for somatic and germline (for now)
Annotation Ensembl, Annovar Rich set of annotations for multiple transcripts
22 May 2014 HVP5 Path-OS 11
The Panels
Somatic Panel
22 May 2014 HVP5 Path-OS 13
Oncogenes
Tumour suppressors
Consequence type
Other
Missense
Frame shift
Splice site
Stop gained
Gene type
6.2$%$
25.5$%$
0$
20$
40$
60$
80$
100$
Single$ Duplicate$
Variant'Allele'Frequency'(%)'
Variant'Allele'Frequency'for'Soma7c'Panel'Replicates'
Somatic
Replicates
22 May 2014 HVP5 Path-OS 14
0"
5"
10"
15"
20"
25"
30"
0'"<10" 10'<20" 20'"<30" 30'"<40" 40'"<50" 50'"<60" 60'"<70" 70'"<80" 80'"<90" 90'100"
Mean%difference%in%variant%frequency%between%replicates%(%)%
Variant%Read%Frequency%%%(Error:%S.E.M.)%
Replicate%Variant%Frequency%Differences%
72% 28% n=14,771
Amplicon artifacts
22 May 2014 HVP5 Path-OS 15
The Filtering
The Curation of Molecular Pathology Cancer Samples - Kenneth Doig
The Curation of Molecular Pathology Cancer Samples - Kenneth Doig
The Curation of Molecular Pathology Cancer Samples - Kenneth Doig
The Curation of Molecular Pathology Cancer Samples - Kenneth Doig
The Curation of Molecular Pathology Cancer Samples - Kenneth Doig
The Curation of Molecular Pathology Cancer Samples - Kenneth Doig
The Curation of Molecular Pathology Cancer Samples - Kenneth Doig
The Curation of Molecular Pathology Cancer Samples - Kenneth Doig
The Curation of Molecular Pathology Cancer Samples - Kenneth Doig
The Curation of Molecular Pathology Cancer Samples - Kenneth Doig

More Related Content

More from Human Variome Project

ClinVar: Aggregating Data to Improve Variant Interpretation - Melissa Landrum
ClinVar: Aggregating Data to Improve Variant Interpretation - Melissa LandrumClinVar: Aggregating Data to Improve Variant Interpretation - Melissa Landrum
ClinVar: Aggregating Data to Improve Variant Interpretation - Melissa LandrumHuman Variome Project
 
Establishing validity, reproducibility, and utility of highly scalable geneti...
Establishing validity, reproducibility, and utility of highly scalable geneti...Establishing validity, reproducibility, and utility of highly scalable geneti...
Establishing validity, reproducibility, and utility of highly scalable geneti...Human Variome Project
 
The PhenX Toolkit: Standard Measures for Collaborative Research - Wayne Huggins
The PhenX Toolkit: Standard Measures for  Collaborative Research - Wayne HugginsThe PhenX Toolkit: Standard Measures for  Collaborative Research - Wayne Huggins
The PhenX Toolkit: Standard Measures for Collaborative Research - Wayne HugginsHuman Variome Project
 
Report from the International Confederation of Countries Advisory Council - M...
Report from the International Confederation of Countries Advisory Council - M...Report from the International Confederation of Countries Advisory Council - M...
Report from the International Confederation of Countries Advisory Council - M...Human Variome Project
 
Human variome project quality assessment criteria for variation databases - M...
Human variome project quality assessment criteria for variation databases - M...Human variome project quality assessment criteria for variation databases - M...
Human variome project quality assessment criteria for variation databases - M...Human Variome Project
 
HVP Country Node: Venezuela - Aida Falcon de Vargas
HVP Country Node: Venezuela - Aida Falcon de VargasHVP Country Node: Venezuela - Aida Falcon de Vargas
HVP Country Node: Venezuela - Aida Falcon de VargasHuman Variome Project
 
Human Genetics of Infectious Diseases - Laurent Abel
Human Genetics of Infectious Diseases - Laurent AbelHuman Genetics of Infectious Diseases - Laurent Abel
Human Genetics of Infectious Diseases - Laurent AbelHuman Variome Project
 
HVP Country Node: Malaysia - Zilfalil bin Alwi
HVP Country Node: Malaysia - Zilfalil bin AlwiHVP Country Node: Malaysia - Zilfalil bin Alwi
HVP Country Node: Malaysia - Zilfalil bin AlwiHuman Variome Project
 
GENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès Rötig
GENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès RötigGENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès Rötig
GENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès RötigHuman Variome Project
 
The BRCA Challenge & Exchange: Progress and Plans - Gunnar Rätsch
The BRCA Challenge & Exchange: Progress and Plans - Gunnar RätschThe BRCA Challenge & Exchange: Progress and Plans - Gunnar Rätsch
The BRCA Challenge & Exchange: Progress and Plans - Gunnar RätschHuman Variome Project
 
Richard GH Cotton: He may have been a bit before his time - Michael Watson
Richard GH Cotton: He may have been a bit before his time - Michael WatsonRichard GH Cotton: He may have been a bit before his time - Michael Watson
Richard GH Cotton: He may have been a bit before his time - Michael WatsonHuman Variome Project
 
Professor Richard Cotton - Finlay Macrae
Professor Richard Cotton - Finlay MacraeProfessor Richard Cotton - Finlay Macrae
Professor Richard Cotton - Finlay MacraeHuman Variome Project
 
HVP Country Node: Canada - Matthew Lebo
HVP Country Node: Canada - Matthew LeboHVP Country Node: Canada - Matthew Lebo
HVP Country Node: Canada - Matthew LeboHuman Variome Project
 
Use of open, curated variant databases: ethics? Liability? - Bartha Knoppers
Use of open, curated variant databases: ethics? Liability? - Bartha KnoppersUse of open, curated variant databases: ethics? Liability? - Bartha Knoppers
Use of open, curated variant databases: ethics? Liability? - Bartha KnoppersHuman Variome Project
 
HVP6: Final Thoughts - John Burn & Raj Ramesar
HVP6: Final Thoughts - John Burn & Raj RamesarHVP6: Final Thoughts - John Burn & Raj Ramesar
HVP6: Final Thoughts - John Burn & Raj RamesarHuman Variome Project
 
Report from the International Scientific Advisory Committee - John Burn
Report from the International Scientific Advisory Committee - John BurnReport from the International Scientific Advisory Committee - John Burn
Report from the International Scientific Advisory Committee - John BurnHuman Variome Project
 
HVP Country Node: Italy - Domenico Coviello
HVP Country Node: Italy - Domenico CovielloHVP Country Node: Italy - Domenico Coviello
HVP Country Node: Italy - Domenico CovielloHuman Variome Project
 
Rare and common variants contribute to the complex inheritance of Hirschsprun...
Rare and common variants contribute to the complex inheritance of Hirschsprun...Rare and common variants contribute to the complex inheritance of Hirschsprun...
Rare and common variants contribute to the complex inheritance of Hirschsprun...Human Variome Project
 
Report from the Gene & Disease Specific Database Advisory Council - Peter Ta...
Report from the  Gene & Disease Specific Database Advisory Council - Peter Ta...Report from the  Gene & Disease Specific Database Advisory Council - Peter Ta...
Report from the Gene & Disease Specific Database Advisory Council - Peter Ta...Human Variome Project
 
Checking the experts: compliance with author instructions regarding HGVS nome...
Checking the experts: compliance with author instructions regarding HGVS nome...Checking the experts: compliance with author instructions regarding HGVS nome...
Checking the experts: compliance with author instructions regarding HGVS nome...Human Variome Project
 

More from Human Variome Project (20)

ClinVar: Aggregating Data to Improve Variant Interpretation - Melissa Landrum
ClinVar: Aggregating Data to Improve Variant Interpretation - Melissa LandrumClinVar: Aggregating Data to Improve Variant Interpretation - Melissa Landrum
ClinVar: Aggregating Data to Improve Variant Interpretation - Melissa Landrum
 
Establishing validity, reproducibility, and utility of highly scalable geneti...
Establishing validity, reproducibility, and utility of highly scalable geneti...Establishing validity, reproducibility, and utility of highly scalable geneti...
Establishing validity, reproducibility, and utility of highly scalable geneti...
 
The PhenX Toolkit: Standard Measures for Collaborative Research - Wayne Huggins
The PhenX Toolkit: Standard Measures for  Collaborative Research - Wayne HugginsThe PhenX Toolkit: Standard Measures for  Collaborative Research - Wayne Huggins
The PhenX Toolkit: Standard Measures for Collaborative Research - Wayne Huggins
 
Report from the International Confederation of Countries Advisory Council - M...
Report from the International Confederation of Countries Advisory Council - M...Report from the International Confederation of Countries Advisory Council - M...
Report from the International Confederation of Countries Advisory Council - M...
 
Human variome project quality assessment criteria for variation databases - M...
Human variome project quality assessment criteria for variation databases - M...Human variome project quality assessment criteria for variation databases - M...
Human variome project quality assessment criteria for variation databases - M...
 
HVP Country Node: Venezuela - Aida Falcon de Vargas
HVP Country Node: Venezuela - Aida Falcon de VargasHVP Country Node: Venezuela - Aida Falcon de Vargas
HVP Country Node: Venezuela - Aida Falcon de Vargas
 
Human Genetics of Infectious Diseases - Laurent Abel
Human Genetics of Infectious Diseases - Laurent AbelHuman Genetics of Infectious Diseases - Laurent Abel
Human Genetics of Infectious Diseases - Laurent Abel
 
HVP Country Node: Malaysia - Zilfalil bin Alwi
HVP Country Node: Malaysia - Zilfalil bin AlwiHVP Country Node: Malaysia - Zilfalil bin Alwi
HVP Country Node: Malaysia - Zilfalil bin Alwi
 
GENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès Rötig
GENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès RötigGENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès Rötig
GENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès Rötig
 
The BRCA Challenge & Exchange: Progress and Plans - Gunnar Rätsch
The BRCA Challenge & Exchange: Progress and Plans - Gunnar RätschThe BRCA Challenge & Exchange: Progress and Plans - Gunnar Rätsch
The BRCA Challenge & Exchange: Progress and Plans - Gunnar Rätsch
 
Richard GH Cotton: He may have been a bit before his time - Michael Watson
Richard GH Cotton: He may have been a bit before his time - Michael WatsonRichard GH Cotton: He may have been a bit before his time - Michael Watson
Richard GH Cotton: He may have been a bit before his time - Michael Watson
 
Professor Richard Cotton - Finlay Macrae
Professor Richard Cotton - Finlay MacraeProfessor Richard Cotton - Finlay Macrae
Professor Richard Cotton - Finlay Macrae
 
HVP Country Node: Canada - Matthew Lebo
HVP Country Node: Canada - Matthew LeboHVP Country Node: Canada - Matthew Lebo
HVP Country Node: Canada - Matthew Lebo
 
Use of open, curated variant databases: ethics? Liability? - Bartha Knoppers
Use of open, curated variant databases: ethics? Liability? - Bartha KnoppersUse of open, curated variant databases: ethics? Liability? - Bartha Knoppers
Use of open, curated variant databases: ethics? Liability? - Bartha Knoppers
 
HVP6: Final Thoughts - John Burn & Raj Ramesar
HVP6: Final Thoughts - John Burn & Raj RamesarHVP6: Final Thoughts - John Burn & Raj Ramesar
HVP6: Final Thoughts - John Burn & Raj Ramesar
 
Report from the International Scientific Advisory Committee - John Burn
Report from the International Scientific Advisory Committee - John BurnReport from the International Scientific Advisory Committee - John Burn
Report from the International Scientific Advisory Committee - John Burn
 
HVP Country Node: Italy - Domenico Coviello
HVP Country Node: Italy - Domenico CovielloHVP Country Node: Italy - Domenico Coviello
HVP Country Node: Italy - Domenico Coviello
 
Rare and common variants contribute to the complex inheritance of Hirschsprun...
Rare and common variants contribute to the complex inheritance of Hirschsprun...Rare and common variants contribute to the complex inheritance of Hirschsprun...
Rare and common variants contribute to the complex inheritance of Hirschsprun...
 
Report from the Gene & Disease Specific Database Advisory Council - Peter Ta...
Report from the  Gene & Disease Specific Database Advisory Council - Peter Ta...Report from the  Gene & Disease Specific Database Advisory Council - Peter Ta...
Report from the Gene & Disease Specific Database Advisory Council - Peter Ta...
 
Checking the experts: compliance with author instructions regarding HGVS nome...
Checking the experts: compliance with author instructions regarding HGVS nome...Checking the experts: compliance with author instructions regarding HGVS nome...
Checking the experts: compliance with author instructions regarding HGVS nome...
 

Recently uploaded

TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 

Recently uploaded (20)

TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 

The Curation of Molecular Pathology Cancer Samples - Kenneth Doig

  • 1. Path-OS: The Curation of Cancer Samples Ken Doig – Bioinformatics Research Core Peter MacCallum Cancer Centre ken.doig@petermac.org
  • 2. Agenda •  Context •  System overview •  Amplicon Panels •  Filtering •  Futures 22 May 2014 HVP5 Path-OS 2
  • 4. What we do •  Peter MacCallum Cancer Centre –  Molecular Pathology Department •  Provide pathology services to the hospital and ext. labs. •  Blood and tumour tissue samples •  Targeted genetic sequencing using amplicon panels •  Between 4-50 cancer specific genes •  Looking for needles in haystacks •  Very sensitive assays ... ... AAAAGCAGGT TATATAGGCT AAATAGAACT AATCATTGTT TTAGACATAC TTATTGACTC TAAGAGGAAA TCATAATGCT TGCTCTGATA GGAAAATGAG ATCTACTGTT TTCCTTTACT TACTACACCT CAGATATATT TCTTCATGAA GACCTCACAG TAAAAATAGG TGATGTTGGT AGCTAGGAGT GAAATCTCGA TGGAGTGGGT CCCATCAGTT TGAACAGTTG TCTGGATCCA TTTTGTGGAT GGTAAGAATT GAGGCTATTT TTCCACTGAT TAGTTCCCAG TATTCACAAA AATCAGTGTT CTTATTTTTT ATGTAAATAG ATTTTTTAAC TTTTTTCTTT ... ... 22 May 2014 HVP5 Path-OS 4
  • 5. Peter Mac Curation Scope •  Automate the processing from sequencer to draft report •  Automate curation evidence collection •  Sanitise data from external sources •  Automated reporting •  Best practice software engineering 22 May 2014 HVP5 Path-OS 5
  • 7. Patient Sample Genologics wet lab LIMS External Variant DBs •  COSMIC •  Ensembl •  Annovar •  UCSC •  Clinvar etc Loader Pipeline data repository FASTQ BAM VCF VEP Pipeline PipeCleaner PathOS Web Server Pipeline Validation QC Reporting Pipeline configuration Sequencers ETL configuration Periodic DB download and integration Sequencing QC Clinical Reporting Read QC Synthtetic Reads Known samples Filtering configuration Users •  molecular scientists •  clinicians •  researchers Export curated variants to global repositories Hospital Records 22 May 2014 HVP5 Path-OS 7 Path-OS Overview
  • 8. Run QC This run in the context of past runs of the same panel Per sample read yield highlighting below average Amplicon performance read distribution 22 May 2014 HVP5 Path-OS 8
  • 9. Classification Page 22 May 2014 HVP5 Path-OS 9 Automatically generated classification Justification free text field Check boxes for variant evidence Evidence type tool tip
  • 10. Classifying variants for the clinic 22 May 2014 HVP5 Path-OS 10 C5: Pathogenic C4: Likely pathogenic C3: Unknown pathogenicity C2: Unlikely pathogenic C1: Not pathogenic 5 Level Classification Stand alone Strong Supporting Criteria or or Pathogenic evidence Stand alone Strong Supporting Benign evidence = or = or =or or = All other combinations =
  • 11. Software Components Role Package Overview Language Groovy Java on steroids, powerful JVM language Web Framework Grails Rich Groovy based high productivity framework Code repository GitLab Private GitHub instance Database MySQL Widely adopted RDB, good performance User interactivity Javascript plugins Leverage best available js e.g. Jquery, Google Charts Object Persistence Hibernate Java standard for mapping POJOs to RDB Searching Lucene Full-featured text search engine IoC Layer Spring Java standard for inversion of control IDE IntelliJ Comprehensive developers environment for Java etc Build Management Gradle Groovy based DSL leverages Ant and CoC DB Migration Mgmt LiquiBase DSL based data migration tool for schema versioning Issue Management Jira Best of breed issue management tracker LIMS GenoLogics User friendly LIMS for NGS Aligner Primal Peter Mac in-house amplicon aligner, tuned for amplicons Variant Caller VarScan 2 Suitable for somatic and germline (for now) Annotation Ensembl, Annovar Rich set of annotations for multiple transcripts 22 May 2014 HVP5 Path-OS 11
  • 13. Somatic Panel 22 May 2014 HVP5 Path-OS 13 Oncogenes Tumour suppressors Consequence type Other Missense Frame shift Splice site Stop gained Gene type
  • 14. 6.2$%$ 25.5$%$ 0$ 20$ 40$ 60$ 80$ 100$ Single$ Duplicate$ Variant'Allele'Frequency'(%)' Variant'Allele'Frequency'for'Soma7c'Panel'Replicates' Somatic Replicates 22 May 2014 HVP5 Path-OS 14 0" 5" 10" 15" 20" 25" 30" 0'"<10" 10'<20" 20'"<30" 30'"<40" 40'"<50" 50'"<60" 60'"<70" 70'"<80" 80'"<90" 90'100" Mean%difference%in%variant%frequency%between%replicates%(%)% Variant%Read%Frequency%%%(Error:%S.E.M.)% Replicate%Variant%Frequency%Differences% 72% 28% n=14,771
  • 15. Amplicon artifacts 22 May 2014 HVP5 Path-OS 15