© 2019 DNAnexus, Inc. All Rights Reserved.
GETTING “PERFECT” HAPLOTIGS FOR
MHC
MHC Team, Pangenomics Analysis Hackathon
Jason Chin, Sr. Director, Machine Learning and Genomics, DNAnexus
GIAB/GRC WORKSHOP, ASHG, OCT 11, 2019
Major Histocompatibility Complex Region
One of the most diverse regions of the genome
HLA matching is important for the success of organ transplants
Antigen processing and presentation
Kulski, et al., Immunological Reviews, 2003
GWAS Example
[Figure: two panels, "Number of Associated Phenotypes per Variant" and "Number of Associated Variants per 5 Mbp" (y-axis: number of variants), with the MHC region labeled]
Unusually high number of “significant” associations in the MHC across 918 phenotype labels
(UK Biobank, ~337,000 samples, Neale Lab Rapid GWAS results on 41202: ICD10 code, 20002: self-reported diseases, https://www.nealelab.is/blog/2017/7/19/rapid-gwas-of-thousands-of-phenotypes-for-337000-samples-in-the-uk-biobank)
(Instead of a typical Manhattan plot, we have a “Taipei” plot.)
Strategy to Get “Perfect” Haplotigs
Trio sequencing data available:
• k-mer binning + long reads -> haplotype-separated read piles, e.g., TrioCanu
Trio data not available:
• Long reads, FALCON-Unzip
• More accurate long reads
• Super long reads
• Linked reads
• Hi-C data
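The k-mer binning idea in the trio case can be sketched in a few lines of Python. This is only a toy illustration under simplifying assumptions, not how TrioCanu is implemented: the function names are hypothetical, parental k-mers are taken from exact forward-strand sequences, and sequencing errors and reverse complements are ignored.

```python
def kmers(seq, k):
    """Return the set of all k-mers found in one sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def kmer_set(reads, k):
    """Union of the k-mers over a collection of reads."""
    out = set()
    for read in reads:
        out |= kmers(read, k)
    return out


def bin_read(read, maternal_only, paternal_only, k):
    """Assign a child read to the parent whose private k-mers it shares most."""
    read_kmers = kmers(read, k)
    m = len(read_kmers & maternal_only)
    p = len(read_kmers & paternal_only)
    if m > p:
        return "maternal"
    if p > m:
        return "paternal"
    return "unassigned"


# Toy parental data (hypothetical sequences).
K = 4
maternal = kmer_set(["ACGTACGTTT"], K)
paternal = kmer_set(["ACGTACGAAA"], K)
maternal_only = maternal - paternal   # k-mers seen only in the mother
paternal_only = paternal - maternal   # k-mers seen only in the father

for child_read in ["TACGTTT", "TACGAAA", "ACGTAC"]:
    print(child_read, bin_read(child_read, maternal_only, paternal_only, K))
```

Reads that carry no parent-specific k-mer stay unassigned; in a real workflow they would typically be shared between both haplotype read piles or handled separately.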
WhatsHap + Peregrine
Martin, et al., 2016, bioRxiv 085050 (WhatsHap)
Chin and Khalak, 2019, bioRxiv 705616 (Peregrine)
Platform Talk: Friday: 9:00AM, Grand Ballroom B
Session F: Fast Methods for Genome Analysis
Assembling a de novo human genome in 100 minutes
Workflow / Pipeline
Quick Prototype and Reproducible Environment
DNAnexus cloud workspace for genomics development and analysis work
Better control of data, code and computing environment
Whole MHC Haplotig Large Scale View
Two haplotigs (no gaps) span the whole MHC region
Phased HLA Genes Confirmed by Trio Typing
Assembly | contigID | locus | utilized Ref | Contig start | Contig stop | Called Genotypes | Edit Distance | Called Genotypes Assembly | minEditDistance assembly truth whichAlleles | minEditDistance calledGenotype_Truth whichAlleles | Haplotype
H1 | 000000F | HLA-A | pgf | 1436987 | 1440489 | A*01:01:01G | 0 | A*01:01:01G | A*01:01:01G_v/s_A*01:01:01G | HLA-A | Maternal
H1 | 000000F | HLA-B | pgf | 2848251 | 2851577 | B*35:08:01G | 0 | B*35:08:01G | B*35:08:01G_v/s_B*35:08:01G | HLA-B | Maternal
H1 | 000000F | HLA-C | pgf | 2763854 | 2767202 | C*04:01:01G | 0 | C*04:01:01G | C*04:01:01G_v/s_C*04:01:01G | HLA-C | Maternal
H1 | 000000F | HLA-DQA1 | pgf | 4086777 | 4093261 | DQA1*01:01:01G | 0 | DQA1*01:01:01G | DQA1*01:01:01G_v/s_DQA1*01:01:01G | HLA-DQA1 | Maternal
H1 | 000000F | HLA-DQB1 | pgf | 4110116 | 4117205 | DQB1*05:01:01G | 0 | DQB1*05:01:01G | DQB1*05:01:01G_v/s_DQB1*05:01:01G | HLA-DQB1 | Maternal
H1 | 000000F | HLA-DRB1 | pgf | 4029789 | 4043078 | DRB1*10:01:01G | 0 | DRB1*10:01:01G | DRB1*10:01:01G_v/s_DRB1*10:01:01G | HLA-DRB1 | Maternal
H2 | 000000F | HLA-A | pgf | 1437427 | 1440943 | A*26:01:01G | 0 | A*26:01:01G | A*26:01:01G_v/s_A*26:01:01G | HLA-A | Paternal
H2 | 000000F | HLA-B | pgf | 2843682 | 2846993 | B*38:01:01G | 0 | B*38:01:01G | B*38:01:01G_v/s_B*38:01:01G | HLA-B | Paternal
H2 | 000000F | HLA-C | pgf | 2768829 | 2772177 | C*12:03:01G | 0 | C*12:03:01G | C*12:03:01G_v/s_C*12:03:01G | HLA-C | Paternal
H2 | 000000F | HLA-DQA1 | pgf | 4182456 | 4188892 | DQA1*03:01:01G | 0 | DQA1*03:01:01G | DQA1*03:01:01G_v/s_DQA1*03:01:01G | HLA-DQA1 | Paternal
H2 | 000000F | HLA-DQB1 | pgf | 4201076 | 4208201 | DQB1*03:02:01G | 0 | DQB1*03:02:01G | DQB1*03:02:01G_v/s_DQB1*03:02:01G | HLA-DQB1 | Paternal
H2 | 000000F | HLA-DRB1 | cox | 4122938 | 4138189 | DRB1*04:02:01 | 0 | DRB1*04:02:01 | DRB1*04:02:01_v/s_DRB1*04:02:01 | HLA-DRB1 | Paternal
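Every row above reports an edit distance of 0, meaning the assembled sequence and the called allele agree exactly. The slides do not say how the distance was computed; as an assumption, the sketch below uses the standard Levenshtein distance (substitutions, insertions and deletions each cost 1), and the sequences are made-up toy strings.

```python
def edit_distance(a, b):
    """Levenshtein distance between two strings, using one rolling DP row."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # cost of deleting the first i characters of a
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # deletion from a
                curr[j - 1] + 1,     # insertion into a
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


# Toy sequences (hypothetical, not real HLA alleles).
assembled = "ACGTTGCAAC"
allele_same = "ACGTTGCAAC"
allele_snp = "ACGTAGCAAC"

print(edit_distance(assembled, allele_same))  # identical sequences
print(edit_distance(assembled, allele_snp))   # one substitution
```

For gene-sized sequences a quadratic DP like this is workable; for whole haplotigs one would normally rely on an aligner instead.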
What Is Still Wrong and What Can We Do With It?
Assembly Graph
Due to the read length limit, some manual work is still needed to resolve CYP21A2 / TNXB.
[Figure: assembly graph with a 35 kb repeat; panel axes labeled Assembly and Reference]
Unroll The Loop With An ONT Read
Getting a “perfect” assembly needs multi-scale approaches for both phasing and contig construction.
We can spike in this 150 kb ONT read to “unroll” the loop in the assembly graph.
[Figure: self-self dot plot of a >150 kb ONT read, with Repeat 1 and Repeat 2 marked]
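A self-self dot plot can be approximated by indexing the k-mers of a read and reporting every pair of positions that share a k-mer; a tandem or nearby repeat then shows up as a run of dots at a constant offset from the main diagonal. The sketch below is a minimal forward-strand illustration with hypothetical function names and a toy sequence. The tool used for the figure is not named in the slides.

```python
from collections import Counter, defaultdict


def self_dotplot(seq, k):
    """Return sorted (i, j) pairs, i < j, where the k-mers at i and j are equal."""
    positions = defaultdict(list)
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append(i)
    dots = []
    for pos in positions.values():
        for a in range(len(pos)):
            for b in range(a + 1, len(pos)):
                dots.append((pos[a], pos[b]))
    return sorted(dots)


def repeat_offsets(dots):
    """Count dots per off-diagonal offset; a dominant offset suggests a repeat."""
    return Counter(j - i for i, j in dots)


# Toy read: the same 8 bp unit appears twice, separated by an 8 bp spacer.
unit = "ACGTTGCA"
read = unit + "GGATCCTA" + unit

dots = self_dotplot(read, 5)
print(dots)
print(repeat_offsets(dots).most_common(1))
```

On a real >150 kb read one would use a larger k (or minimizers) and plot the dots rather than print them.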
2 Haplotypes and 2 Copies of CYP21A2 Repeats
Detecting loops is easy. (Perhaps we should annotate the assembly output for that.)
However, when the read length is shorter than the repeats, we need to resolve 2x2 haplotypes.
[Figure: variant co-occurrence pattern]
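To illustrate why loop detection is the easy part, here is a minimal sketch that reports the back edges found by a depth-first search over a directed graph; each back edge closes a cycle. It assumes the assembly graph has been simplified to a plain adjacency dictionary with hypothetical node names, whereas real assembly graphs are bidirected and carry sequence and orientation.

```python
def find_back_edges(graph):
    """Return the (from, to) edges that close a cycle, via iterative DFS."""
    WHITE, GRAY, BLACK = 0, 1, 2  # unvisited, on the DFS stack, finished
    nodes = set(graph) | {v for targets in graph.values() for v in targets}
    color = {n: WHITE for n in nodes}
    back_edges = []
    for root in sorted(nodes):
        if color[root] != WHITE:
            continue
        color[root] = GRAY
        stack = [(root, iter(graph.get(root, ())))]
        while stack:
            node, neighbors = stack[-1]
            for nxt in neighbors:
                if color[nxt] == GRAY:
                    # nxt is an ancestor on the current path: a loop.
                    back_edges.append((node, nxt))
                elif color[nxt] == WHITE:
                    color[nxt] = GRAY
                    stack.append((nxt, iter(graph.get(nxt, ()))))
                    break
            else:
                # All neighbors explored: this node is finished.
                color[node] = BLACK
                stack.pop()
    return back_edges


# Toy graph: a repeat that the path can traverse more than once.
toy_graph = {
    "flank_left": ["repeat"],
    "repeat": ["middle"],
    "middle": ["repeat", "flank_right"],
    "flank_right": [],
}
print(find_back_edges(toy_graph))
```

Finding the loop says nothing about how many times each haplotype traverses it, which is the harder 2x2 problem described above.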
Long Nanopore Reads can be Phased Better
Thursday Afternoon
Poster 1582/T: The portrait of fully phased assembled diploid human genome, Arkarachai Fungtammasan, et al.
Any Other Challenges?
• Reads are missed when recruitment uses a single reference
• Assembly will not be complete without an initial de novo assembly
• One can’t describe the difference with small variant calls
Takeaway: 179 reads are mapped only to the HG002 de novo contig
“Perfect” is Still Elusive
Residual Errors Analysis:
Reads <-> Assembly Contig Consistency Check (Minimap2 + FreeBayes Variant Calling)
Not surprisingly, the major inconsistencies are from homopolymers
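Because homopolymers dominate the residual errors, it is useful to be able to list them. The sketch below finds homopolymer runs in a contig and returns them as 0-based, half-open intervals, which is the coordinate convention of BED files. The function name is hypothetical, and the threshold of 11 is chosen here to match the "Homopolymers >10bp" exclusion used for the benchmark beds; it is not taken from any pipeline code.

```python
from itertools import groupby


def homopolymer_runs(seq, min_len):
    """Return (start, end, base) for every single-base run of at least min_len."""
    runs = []
    pos = 0
    for base, group in groupby(seq.upper()):
        length = sum(1 for _ in group)
        if length >= min_len:
            runs.append((pos, pos + length, base))  # 0-based, half-open
        pos += length
    return runs


# Toy contig: one 12 bp A run (reported) and one 10 bp T run (too short).
contig = "ACGT" + "A" * 12 + "CG" + "T" * 10 + "G"
for start, end, base in homopolymer_runs(contig, min_len=11):
    print(f"contig\t{start}\t{end}\t{base}x{end - start}")
```

Note that runs of N would also be reported by this simple version; filter on the base if that matters.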
Integrating assembly- and mapping-based calls gives best MHC benchmark
• MHC assembly-based bed includes 23187 variants in the MHC region, excluding:
  • CYP21A2 and pseudogene
  • Homopolymers >10bp
  • SVs in assembly
  • Very dense variants
• v4.0 mapping-based bed includes 13964 variants in the MHC region, excluding:
  • Short read callsets
  • Conflicts between callers
  • SVs from all methods
  • Homopolymers >10bp
  • Many clusters of variants, including some HLA genes
• Only 11 differences between assembly- and mapping-based calls in both beds
  • 2 genotyping errors in assembly-based
  • 1 inaccurate complex allele and cluster of 8 missed variants in mapping-based
• Merged benchmark includes 23229 variants in the MHC region
  • Covers most HLA genes and CYP21A2/TNXA/TNXB
Threshold True-pos-baseline True-pos-call False-pos False-neg Precision Sensitivity F-measure
----------------------------------------------------------------------------------------------------
None 13899 13549 10 4 0.9993 0.9997 0.9995
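The summary row can be re-derived from its counts. The sketch below assumes the common convention for comparison tools that report separate baseline and call true-positive counts: precision is computed from the call-side true positives, sensitivity from the baseline-side true positives, and the F-measure is their harmonic mean. The function name is hypothetical.

```python
def benchmark_metrics(tp_baseline, tp_call, fp, fn):
    """Return (precision, sensitivity, f_measure) from comparison counts."""
    precision = tp_call / (tp_call + fp)
    sensitivity = tp_baseline / (tp_baseline + fn)
    f_measure = 2 * precision * sensitivity / (precision + sensitivity)
    return precision, sensitivity, f_measure


# Counts taken from the table row above.
precision, sensitivity, f_measure = benchmark_metrics(
    tp_baseline=13899, tp_call=13549, fp=10, fn=4
)
print(f"{precision:.4f} {sensitivity:.4f} {f_measure:.4f}")
# prints "0.9993 0.9997 0.9995"
```

Under that assumption the computed values agree with the Precision, Sensitivity and F-measure columns to four decimal places.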
These variants are fully phased through the MHC region too!!
9265 new variants over the MHC region (23229 merged - 13964 in the v4.0 mapping-based bed).
More MHC in Haplotype Resolved Genome Assemblies
[Figure: MHC haplotypes from NA12878 H1, NA12878 H2, PGP1 H1, PGP1 H2, HG002 H1, HG002 H2]
221/4:30 A robust and production-level approach to haplotype-resolved assembly of single individuals. S. Garg, C. Fungtammasan, A. Carroll, R. Hall, E. Hatas, M. Mahmoud, F. Sedlazeck, M. Chou, J. Aach, J. Zook, J. Chin, H. Lee, G. Church.
We can already see 6 different haplotypes at this scale
Next Generation MHC Database?
[Figure: Number of Associated Variants per 5 Mbp (y-axis: number of variants)]
[Figure: Class I and Class II HLA alleles, http://hla.alleles.org/inc/images/graph_hires.png]
Is it worth solving this puzzle with long-read technologies at scale?
Acknowledgement
Thank You For Your Attention!!
The MHC team for the Pan-genomics in the Cloud hackathon 2019:
A. Dilthey
A. Fungtammasan
S. Garg
E. Garrison
M. Rautiainen
M. Tobias
J. Wanger
Q. Zeng
J. Zook
Peregrine Assembler Co-developer
Asif Khalak, Foundation of Bio-Data Sciences
----
B. Busby and B. Paten for hosting the hackathon
https://github.com/NCBI-Hackathons/TheHumanPangenome/tree/master/MHC

 

Kürzlich hochgeladen (20)

Call Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 9907093804 Top Class Call Girl Service Available
 
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
 
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
 
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
 
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
 
Russian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls Jaipur
Russian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls JaipurRussian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls Jaipur
Russian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls Jaipur
 
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
 
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
 
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore EscortsVIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
 
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
 
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
 
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Tirupati Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Tirupati Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 9907093804 Top Class Call Girl Service Available
 
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
 
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
 
Chandrapur Call girls 8617370543 Provides all area service COD available
Chandrapur Call girls 8617370543 Provides all area service COD availableChandrapur Call girls 8617370543 Provides all area service COD available
Chandrapur Call girls 8617370543 Provides all area service COD available
 
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore EscortsCall Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
 
College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...
College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...
College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...
 

Getting Perfect Haplotigs for MHC

  • 1. © 2019 DNAnexus, Inc. All Rights Reserved. GETTING “PERFECT” HAPLOTIGS FOR MHC. MHC Team, Pangenomics Analysis Hackathon. Jason Chin, Sr. Director, Machine Learning and Genomics, DNAnexus. GIAB/GRC Workshop, ASHG, Oct 11, 2019.
  • 2. Major Histocompatibility Complex Region. One of the most diverse regions of the human genome. HLA matching is important for the success of organ transplants. Antigen processing and presentation. (Kulski, et al., Immunological Reviews, 2003)
  • 3. GWAS Example. Unusually high number of “significant” associations in the MHC over 918 phenotype labels (UK Biobank, ~337,000 samples; Neale Lab Rapid GWAS results on 41202: ICD10 code and 20002: self-reported diseases; https://www.nealelab.is/blog/2017/7/19/rapid-gwas-of-thousands-of-phenotypes-for-337000-samples-in-the-uk-biobank). Instead of a typical Manhattan plot, we have a “Taipei” plot. [Figure panels: Number of Associated Phenotypes per Variant; Number of Associated Variants per 5 Mbp (y-axis: number of variants); the MHC is labeled.]
  • 6. Strategy to Get “Perfect” Haplotigs. Trio sequencing data available: k-mer binning + long reads -> haplotype-separated read piles, e.g., TrioCanu. Trio data not available: long reads with FALCON-Unzip; more accurate long reads; super long reads; linked reads; Hi-C data.
  • 7. WhatsHap + Peregrine. Martin, et al., 2016, bioRxiv 085050. Chin and Khalak, 2019, bioRxiv 705616. Platform talk: Friday, 9:00 AM, Grand Ballroom B, Session F: Fast Methods for Genome Analysis, “Assembling a de novo human genome in 100 minutes”.
  • 8. Workflow / Pipeline
  • 9. Quick Prototype and Reproducible Environment. DNAnexus cloud workspace for genomics development and analysis work. Better control of data, code, and computing environment.
  • 10. Whole MHC Haplotig Large Scale View. Two haplotigs (no gaps) span the whole MHC region.
  • 11. Phased HLA Genes Confirmed by Trio Typing

      Asm  ContigID  Locus     Ref  Start    Stop     Called genotype  MinEditDist  Truth genotype   whichAlleles                       Haplotype
      H1   000000F   HLA-A     pgf  1436987  1440489  A*01:01:01G      0            A*01:01:01G      A*01:01:01G_v/s_A*01:01:01G        Maternal
      H1   000000F   HLA-B     pgf  2848251  2851577  B*35:08:01G      0            B*35:08:01G      B*35:08:01G_v/s_B*35:08:01G        Maternal
      H1   000000F   HLA-C     pgf  2763854  2767202  C*04:01:01G      0            C*04:01:01G      C*04:01:01G_v/s_C*04:01:01G        Maternal
      H1   000000F   HLA-DQA1  pgf  4086777  4093261  DQA1*01:01:01G   0            DQA1*01:01:01G   DQA1*01:01:01G_v/s_DQA1*01:01:01G  Maternal
      H1   000000F   HLA-DQB1  pgf  4110116  4117205  DQB1*05:01:01G   0            DQB1*05:01:01G   DQB1*05:01:01G_v/s_DQB1*05:01:01G  Maternal
      H1   000000F   HLA-DRB1  pgf  4029789  4043078  DRB1*10:01:01G   0            DRB1*10:01:01G   DRB1*10:01:01G_v/s_DRB1*10:01:01G  Maternal
      H2   000000F   HLA-A     pgf  1437427  1440943  A*26:01:01G      0            A*26:01:01G      A*26:01:01G_v/s_A*26:01:01G        Paternal
      H2   000000F   HLA-B     pgf  2843682  2846993  B*38:01:01G      0            B*38:01:01G      B*38:01:01G_v/s_B*38:01:01G        Paternal
      H2   000000F   HLA-C     pgf  2768829  2772177  C*12:03:01G      0            C*12:03:01G      C*12:03:01G_v/s_C*12:03:01G        Paternal
      H2   000000F   HLA-DQA1  pgf  4182456  4188892  DQA1*03:01:01G   0            DQA1*03:01:01G   DQA1*03:01:01G_v/s_DQA1*03:01:01G  Paternal
      H2   000000F   HLA-DQB1  pgf  4201076  4208201  DQB1*03:02:01G   0            DQB1*03:02:01G   DQB1*03:02:01G_v/s_DQB1*03:02:01G  Paternal
      H2   000000F   HLA-DRB1  cox  4122938  4138189  DRB1*04:02:01    0            DRB1*04:02:01    DRB1*04:02:01_v/s_DRB1*04:02:01    Paternal
  • 12. What Is Still Wrong and What Can We Do With It? Due to the read length limit, some manual work is still needed to resolve CYP21A2 / TNXB. [Figure: assembly graph; assembly vs. reference alignments showing a 35 kb repeat.]
  • 13. Unroll The Loop With An ONT Read. Getting a “perfect” assembly needs multi-scale approaches for both phasing and contig construction. We can spike in this 150 kb ONT read to “unroll” the loop in the assembly graph. [Figure: self-self dot plot of a >150 kb ONT read, showing Repeat 1 and Repeat 2.]
  • 14. 2 Haplotypes and 2 Copies of CYP21A2 Repeats. Detecting loops is easy (perhaps we should annotate assembly output for that). However, when the read length is shorter than the repeats, we need to resolve 2x2 haplotypes. [Figure: variant co-occurring pattern.]
  • 15. Long Nanopore Reads Can Be Phased Better. Thursday afternoon poster 1582/T: “The portrait of fully phased assembled diploid human genome”, Arkarachai Fungtammasan, et al.
  • 17. Any Other Challenges?
      - Missing read recruitment when using a single reference
      - Assembly will not be complete without an initial de novo assembly
      - One can’t describe the difference with small variant calls
      [Figure label: Take away 179 reads that are only mapped to the HG002 de novo contig.]
  • 18. “Perfect” is Still Elusive. Residual error analysis: reads <-> assembly contig consistency check (Minimap2 + FreeBayes variant calling). Not surprisingly, the major inconsistencies are from homopolymers.
  • 19. Integrating assembly- and mapping-based calls gives best MHC benchmark
      - MHC assembly-based bed includes 23187 variants in the MHC region, excluding:
          - CYP21A2 and pseudogene
          - Homopolymers >10bp
          - SVs in assembly
          - Very dense variants
      - v4.0 mapping-based bed includes 13964 variants in the MHC region, excluding:
          - Short read callsets
          - Conflicts between callers
          - SVs from all methods
          - Homopolymers >10bp
          - Many clusters of variants, including some HLA genes
      - Only 11 differences between assembly- and mapping-based calls in both beds
          - 2 genotyping errors in assembly-based
          - 1 inaccurate complex allele and cluster of 8 missed variants in mapping-based
      - Merged benchmark includes 23229 variants in the MHC region
          - Covers most HLA genes and CYP21A2/TNXA/TNXB

      Threshold  True-pos-baseline  True-pos-call  False-pos  False-neg  Precision  Sensitivity  F-measure
      ----------------------------------------------------------------------------------------------------
      None       13899              13549          10         4          0.9993     0.9997       0.9995

      These variants are fully phased through the MHC region too! 9265 new variants over the MHC region. [Figure axis label: Mbp]
  • 20. More MHC in Haplotype Resolved Genome Assemblies. [Figure panels: NA12878 H1, NA12878 H2, PGP1 H1, PGP1 H2, HG002 H1, HG002 H2.] 221/4:30: “A robust and production-level approach to haplotype-resolved assembly of single individuals”, S. Garg, C. Fungtammasan, A. Carroll, R. Hall, E. Hatas, M. Mahmoud, F. Sedlazeck, M. Chou, J. Aach, J. Zook, J. Chin, H. Lee, G. Church. We can already see 6 different haplotypes at this scale.
  • 21. Next Generation MHC Database? Is it worth solving this puzzle with long read technologies at scale? [Figure: Number of Associated Variants per 5 Mbp (y-axis: number of variants); Class I and Class II HLA alleles, http://hla.alleles.org/inc/images/graph_hires.png]
  • 22. Acknowledgement. Thank you for your attention! The MHC team for the Pan-genomics in the Cloud hackathon 2019: A. Dilthey, A. Fungtammasan, S. Garg, E. Garrison, M. Rautiainen, M. Tobias, J. Wanger, Q. Zeng, J. Zook. Peregrine Assembler co-developer: Asif Khalak, Foundation of Bio-Data Sciences. B. Busby and B. Paten for hosting the hackathon. https://github.com/NCBI-Hackathons/TheHumanPangenome/tree/master/MHC
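
The summary row on slide 19 and its count of new variants can be checked with a few lines of arithmetic. The sketch below assumes the usual definitions: precision = true-pos-call / (true-pos-call + false-pos), sensitivity = true-pos-baseline / (true-pos-baseline + false-neg), and F-measure as their harmonic mean. The slides do not say which tool produced the row, so these formulas are an assumption, and the function name `benchmark_metrics` is purely illustrative. It also assumes that the "9265 new variants" figure is the merged benchmark count minus the v4.0 mapping-based count.

```python
def benchmark_metrics(tp_baseline, tp_call, false_pos, false_neg):
    """Return (precision, sensitivity, f_measure) from comparison counts.

    Assumed definitions (not stated on the slides):
      precision   = tp_call / (tp_call + false_pos)
      sensitivity = tp_baseline / (tp_baseline + false_neg)
      f_measure   = harmonic mean of precision and sensitivity
    """
    precision = tp_call / (tp_call + false_pos)
    sensitivity = tp_baseline / (tp_baseline + false_neg)
    f_measure = 2 * precision * sensitivity / (precision + sensitivity)
    return precision, sensitivity, f_measure


# Counts copied from the summary row on slide 19.
precision, sensitivity, f_measure = benchmark_metrics(
    tp_baseline=13899, tp_call=13549, false_pos=10, false_neg=4
)
print(f"{precision:.4f} {sensitivity:.4f} {f_measure:.4f}")

# Variant counts from slide 19: merged benchmark vs. v4.0 mapping-based bed.
merged_variants = 23229
mapping_based_variants = 13964
new_variants = merged_variants - mapping_based_variants
print(new_variants)
```

With the slide's counts, the rounded values match the reported 0.9993, 0.9997 and 0.9995, and the difference matches the reported 9265 new variants.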

Editor's notes

  1. https://www.biorxiv.org/content/biorxiv/early/2016/11/14/085050.full.pdf Fast Assembly / Fast Iteration