An assessment with CEGMA showed that 97% and 98% of a conserved
set of eukaryotic genes were at least partially covered in the
pseudochromosome assemblies of two Bayer rice lines, compared to 98% in
both the 93-11 and Nipponbare public genomes. Furthermore, 99% of over
66k rice transcripts could be mapped to the assemblies, indicating high
coverage of the gene space. Finally, repeat analysis revealed that ~9% of
repetitive sequences were missing from the two Bayer assemblies,
accounting for their smaller sizes in comparison with the public genomes.
BCS 1 (y axis) pseudochromosomes vs. 93-11 chromosomes (x axis)
Whole genome de novo assembly of two Bayer elite lines was performed using data from Illumina sequencing of paired-end, mate pair, and
fosmid libraries and PacBio long reads. The assemblies were further improved by the use of a genetic map and alignment to the Nipponbare
genome. The construction of reference genomes for these elite lines provides a valuable resource for marker and gene discovery in our rice
breeding program, as well as for reference-based assemblies of additional Bayer indica lines.
Whole Genome De Novo Assembly of Two Bayer Elite Rice Lines
Joan W. Wong¹, Pieter B. F. Ouwerkerk¹, Christian Dreischer², Bjoern Geigle², and Sebastian J. Schultheiss²
¹ Bayer CropScience NV, Innovation Center, Technologiepark 38, 9052 Ghent, Belgium
² Computomics GmbH & Co. KG, Christophstr. 32, 72072 Tuebingen, Germany
Computational Life Sciences
CONCLUSION
ABSTRACT
We performed genome sequencing and de novo assembly for two elite
indica rice lines that are parents of a Bayer commercial hybrid. Initial
assemblies were performed using ALLPATHS-LG on Illumina reads from
paired-end and mate pair libraries. Fosmid-end sequences and PacBio long
reads were then used for further scaffolding and gap filling. A genetic map
constructed from sequencing data of 2,000 F2 individuals was used to order
and orient >300 scaffolds, comprising around 90% of the sequence length of
each assembly. Finally, remaining scaffolds were placed using the public
Nipponbare genome as a reference. The final assemblies comprised 1,244
and 1,522 scaffolds with N50 scaffold sizes of 3.0 and 2.1 Mb and total sizes
of 401 and 404 Mb, respectively. The iterative assembly enabled us to track
the progress with each added dataset and demonstrated the value of the
mate pairs, long reads, and genetic map.
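The N50 scaffold size quoted in the abstract is a standard contiguity metric: the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least half of the assembly. The sketch below shows one common way to compute it. The function name `n50` and the toy lengths are illustrative assumptions, not data or code from the poster.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sequences are visited from longest to shortest; the N50 is the length
    at which the running total first reaches half of the assembly size.
    """
    if not lengths:
        raise ValueError("at least one sequence length is required")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy example (hypothetical lengths, not the Bayer scaffolds).
toy_lengths = [10, 8, 6, 4, 2]
toy_n50 = n50(toy_lengths)
```

With the toy lengths the total is 30, so the running sum reaches half (15) at the second-longest sequence. Recomputing this value after each added dataset is one simple way to track progress through an iterative assembly.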
BACKGROUND
ALIGNMENT WITH INDICA REFERENCE GENOME
ASSEMBLY EVALUATION
ALIGNMENT WITH JAPONICA REFERENCE GENOME
ASSEMBLY PROCESS
1. ALLPATHS-LG: de novo assemble paired-end, mate-pair, and fosmid reads
2. PBJelly2 and SOAP GapCloser: scaffold and fill gaps with PacBio reads
3. Custom algorithm (Computomics): orient and place scaffolds using genetic map
4. RepARK: generate repeat libraries
5. ABACAS: assemble scaffolds + repeats based on japonica
6. PBJelly2 & GapCloser: fill remaining gaps
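The poster does not describe the custom Computomics algorithm used to orient and place scaffolds with the genetic map. The sketch below is therefore not that algorithm. It only illustrates the general idea under simple assumptions: each marker is a hypothetical tuple of scaffold name, position on the scaffold, linkage group, and genetic position in cM. Scaffolds are ordered by median cM, and orientation is taken from the cM trend between the first and last marker on the scaffold.

```python
from collections import defaultdict
from statistics import median


def order_and_orient(markers):
    """Order and orient scaffolds from (scaffold, pos_bp, group, cM) markers.

    Returns {linkage_group: [(scaffold, orientation), ...]} sorted by median cM.
    Orientation is "+", "-", or "?" when the markers cannot decide it.
    Illustrative only; not the method used on the poster.
    """
    by_scaffold = defaultdict(list)
    for scaffold, pos, group, cm in markers:
        by_scaffold[scaffold].append((pos, group, cm))

    placed = defaultdict(list)
    for scaffold, hits in by_scaffold.items():
        # Assign the scaffold to the linkage group holding most of its markers.
        counts = defaultdict(int)
        for _, group, _ in hits:
            counts[group] += 1
        best_group = max(counts, key=counts.get)

        # Keep markers of that group, sorted by physical position.
        kept = sorted((pos, cm) for pos, group, cm in hits if group == best_group)
        cms = [cm for _, cm in kept]

        # Orientation: does cM rise or fall along the scaffold?
        if len(kept) < 2 or cms[0] == cms[-1]:
            orientation = "?"
        elif cms[-1] > cms[0]:
            orientation = "+"
        else:
            orientation = "-"
        placed[best_group].append((median(cms), scaffold, orientation))

    return {
        group: [(scaffold, orient) for _, scaffold, orient in sorted(items)]
        for group, items in placed.items()
    }


# Hypothetical markers for three scaffolds on one linkage group.
example_markers = [
    ("scafA", 1000, "chr1", 5.0),
    ("scafA", 50000, "chr1", 7.5),
    ("scafB", 2000, "chr1", 3.0),
    ("scafB", 90000, "chr1", 1.0),
    ("scafC", 500, "chr1", 12.0),
]
layout = order_and_orient(example_markers)
```

Here scafB is placed first and flagged for reverse orientation because its genetic position falls along the scaffold. scafC carries a single marker, so it can be placed but not oriented. Unoriented or unplaced scaffolds are the kind that would fall to a later reference-guided step.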
ASSEMBLY SIZES
[Figure: bar chart of assembly size (Mb, axis 0 to 450) at the contig and scaffold level for BCS 1 and BCS 2. Legend: De novo, Reference-guided.]
[Figure: % conserved genes found, stacked bars (axis 0 to 100)]
Assembly             Complete  Partial
BCS 1                92        5
BCS 2                92        6
BGI indica           93        5
IRGSP japonica v5    93        5
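The bar labels in the conserved-gene chart can be reconciled with the "at least partially covered" percentages in the assembly evaluation text by adding the complete and partial fractions. The short check below assumes that the larger series (92 to 93) is "Complete", that the smaller series (5 to 6) is "Partial", and that the values are listed in the same left-to-right order as the assembly names. The variable names are illustrative.

```python
# (complete %, partial %) per assembly, read from the chart labels.
# Assumption: values are listed in the same order as the assembly names.
cegma = {
    "BCS 1": (92, 5),
    "BCS 2": (92, 6),
    "BGI indica": (93, 5),
    "IRGSP japonica v5": (93, 5),
}

# "At least partially covered" = complete + partial.
at_least_partial = {
    name: complete + partial for name, (complete, partial) in cegma.items()
}

for name, pct in at_least_partial.items():
    print(f"{name}: {pct}% at least partially covered")
```

The sums come to 97% and 98% for the two Bayer lines and 98% for both public genomes, which agrees with the figures quoted in the text.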

PAG2015_Rice_genome_poster_final_hi-res
