This document summarizes work using PacBio long reads to improve the Atlantic cod genome assembly. Error-corrected and raw PacBio reads were used with different assembly programs. Both helped increase contig and scaffold lengths over the previous assembly, with raw reads performing best. Bridgemapper validation found misassemblies corrected by PacBio. The improved assembly met goals of <5% gaps and scaffold N50 over 1 Mbp. Lessons included developing programs to handle cod's heterozygosity and structural variation better. The new assembly version aims to have 23 pseudochromosomes and improved annotation.
Flores de Mayo-history and origin we need to understand
Improving and validating the Atlantic Cod genome assembly using PacBio
1. Improving and validating the Atlantic Cod
genome assembly using error-corrected
as well as raw PacBio reads
Lex Nederbragt, NSC and CEES
lex.nederbragt@ibv.uio.no
@lexnederbragt
OK
42. Newbler plus Celera
Celera: Long contigs, short scaffolds
Scaffold
gap
contig
Slide courtesy of Ole Kristian Tøressen
43. Newbler plus Celera
Celera: Long contigs, short scaffolds
Scaffold
gap
contig
Newbler: Short contigs, long scaffolds
Scaffold
contig
gap
Slide courtesy of Ole Kristian Tøressen
44. Newbler plus Celera
Celera: Long contigs, short scaffolds
Scaffold
gap
contig
Newbler: Short contigs, long scaffolds
Scaffold
contig
gap
Combined: Long contigs, long scaffolds
gap
Scaffold
contig
Slide courtesy of Ole Kristian Tøressen