SlideShare ist ein Scribd-Unternehmen logo
1 von 21
Downloaden Sie, um offline zu lesen
Assembly tools and Visualisation
Matthias Haimel




                          EBI is an Outstation of the European Molecular Biology Laboratory.
Overview
    • Assemblers
               • ABySS
               • SOAPdenovo
    • Visualisation
               • Tablet
               • AbySS-Explorer
    • Read mapping
               • Sam / Bam
    • Visualisation
               • Artemis
               • IGV - Integrative Genomics Viewer



2   25.04.11             Assemblers
ABySS                                 Assembly By Short Sequences

    • Genome Sciences Centre, Vancouver
               • http://www.bcgsc.ca/platform/bioinfo/software/abyss
               • Open source, BCCA Licence
    • de Bruijn graph
               • Trimming (tip clipping), bubble popping
               • Use paired-end information: resolve ambiguities between contigs
               • parallel (use cluster)
    • Files
               • Fasta / Fastq
               • Sam/Bam
               • colour-space



3   25.04.11             Assemblers
ABySS
    • ABYSS (singe end)
               • e.g. ABYSS -k27 single.fastq -o contigs.fa
    • abyss-pe (paired end)
               • e.g. abyss-pe k=27 n=10 in='read_1.fastq read_2.fastq' name=ecli
    • Multiple libraries
               •   ... lib=’read1 read2’ read1=’read1_1.fa read1_2.fa’ read2=’read2_1.fa read1_2.fa’




4   25.04.11                 Assemblers
SOAPdenovo
    • Beijing Genomics Institute (BGI), China
               • http://soap.genomics.org.cn/soapdenovo.html
               • Panda genome
               • Source available
    • de Bruijn graph
               • pre-set Kmer frequency threshold
               • Bubble removing
    • Build scaffold
               • mapping reads to contigs
               • gap filling




5   25.04.11             Assemblers
SOAPdenovo
    • Full run
               • e.g. SOAPdenovo all -s read.config -K 27 -o contigs.fa
    • Run sub steps
               •   pregraph    = velveth
               •   contig      = velvetg
               •   map         map reads to contigs
               •   scaff      scaffolding
    • Configuration
               • Config file input instead of read files
               • Specify rank, usage (assembly/scaffolding), insert size




6   25.04.11              Assemblers
Visualisation                     http://bioinf.scri.ac.uk/tablet/

    • Tablet
               • Lightweight
               • Easy to use
    • Formats
               •   ACE
               •   AFG
               •   BAM
               •   BANK (AMOS)




7   25.04.11              Assemblers
Visualisation - Velvet
    • Tablet
               • Velvetg ... -amos_file yes
    • GraphViz
               •   Transform velvet graph into GraphViz format
               •   Contributed by Paul Harrison
               •   <velvet>/contrib/layout/
               •   Velvet -> .dot file (Python script)
               •   .dot -> png (graphviz)




8   25.04.11               Assemblers
Visualisation                   http://www.bcgsc.ca/platform/bioinfo/software/abyss-explorer


    • ABySS-Explorer
               • Visualizes ABySS assemblies
               • Interactive graph structure
               • Filter contigs




9   25.04.11            Assemblers
Assembler - Practical
     • Assemblers
                • ABySS
                • SOAPdenovo
     • Visualisation
                • Tablet
                • ABySS-Explorer




10   25.04.11            Assemblers
Read mapping                             http://samtools.sourceforge.net/SAM1.pdf

     • SAM / BAM
                •   Sequence Alignment / Map format (SAM)
                •   Binary form of SAM (BAM)
                •   generic format
                •   Flexible and simple
                •   Compact (BAM)
                •   Allow indexing
                •   Load regions
                •   Support streaming




11   25.04.11              Assemblers
SAM
     • Header
                •   File format version information
                •   Sequence dictionary (name/length/..)
                •   Read group (platform/library/...)
                •   Program info
     • Body
                • Alignment information




12   25.04.11               Assemblers
SAM Header
     • '@' followed by record type (two characters)
                @HD   VN:1.0
                @SQ   SN:chr20 LN:62435964
                @RG   ID:L1 PU:SC_1_10 LB:SC_1 SM:NA12891
                @RG   ID:L2 PU:SC_2_12 LB:SC_2 SM:NA12891




13   25.04.11         Assemblers
SAM Alignment
     • Tab delimited lines




14   25.04.11    Assemblers
SAM Alignment
     • Tab delimited lines

      Read_28833_29006_6945 99 chr20 28833 20 10M1D25M = 28993 195 
      AGCT... <<<<... NM:i:1 RG:Z:L1
      read_28701_28881_323b 147 chr20 28834 30 35M = 28701 -168 
      ACCT... <<7;:... MF:i:18 RG:Z:L2




15   25.04.11     Assemblers
Tools
     • Mapping Reads
                • BWA
                • Bowtie
                • SSAHA2
     • Manipulate SAM/BAM
                • SAM Tools package
                • Picard




16   25.04.11            Assemblers
BWA
     • Burrows-Wheeler Alignment Tool
                • Map (singe/paired-end/long) reads to a sequence
     • Index database
                • bwa index -a bwtsw database.fasta
     • Align reads
                • bwa aln database.fasta short_read.fastq > aln_sa.sai
     • Generate alignments
                •   bwa sampe database.fasta aln_sa1.sai aln_sa2.sai read1.fq read2.fq > aln.sam
     • Long reads
                • bwa bwasw database.fasta long_read.fastq > aln.sam




17   25.04.11                 Assemblers
SAM tools
     • Utilities for SAM format
                • samtools <command> ...
     • Commands:
                •   view: SAM <-> BAM
                •   sort: sort BAM file
                •   index: build BAM file index
                •   merge: merges x BAM files
                •   pileup: alignment in the pileup format
                •   tview: integrated Text alignment viewer




18   25.04.11               Assemblers
Visualisation                    Integrative Genomics Viewer
                                       http://www.broadinstitute.org/igv/
     • IGV
                • Good integration
     • Formats
                •   DAS
                •   BAM
                •   GFF
                •   ...
     • Tools
                • Run scripts
                • Export region
                • ...



19   25.04.11             Assemblers
Visualisation
                                   http://www.sanger.ac.uk/resources/software/artemis/

     • Artemis
          • Sequence Viewer
          • Annotation tool
     • Formats
          •     EMBL
          •     GENBANK
          •     GFF
          •     FASTA
          •     BAM




20   25.04.11         Assemblers
Mapping - Practical
     • Mapping reads + prepare for visalization
                • BWA
                • samtools
     • Visualisation
                • IGV




21   25.04.11            Assemblers

Weitere ähnliche Inhalte

Ähnlich wie 2011-04-26_various-assemblers-presentation

CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...
CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...
CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...StampedeCon
 
Zero to the Cloud with @NetflixOSS
Zero to the Cloud with @NetflixOSSZero to the Cloud with @NetflixOSS
Zero to the Cloud with @NetflixOSSMike McGarr
 
Australian OpenStack User Group August 2012: Chef for OpenStack
Australian OpenStack User Group August 2012: Chef for OpenStackAustralian OpenStack User Group August 2012: Chef for OpenStack
Australian OpenStack User Group August 2012: Chef for OpenStackMatt Ray
 
PLAT-16 Using Enterprise Content in Grails
PLAT-16 Using Enterprise Content in GrailsPLAT-16 Using Enterprise Content in Grails
PLAT-16 Using Enterprise Content in GrailsAlfresco Software
 
Chef for OpenStack - OpenStack Fall 2012 Summit
Chef for OpenStack  - OpenStack Fall 2012 SummitChef for OpenStack  - OpenStack Fall 2012 Summit
Chef for OpenStack - OpenStack Fall 2012 SummitMatt Ray
 
Sutol How To Be A Lion Tamer
Sutol How To Be A Lion TamerSutol How To Be A Lion Tamer
Sutol How To Be A Lion TamerSharon James
 
Rails - getting started
Rails - getting startedRails - getting started
Rails - getting startedTrue North
 
JBoss AS 7 from a user perspective
JBoss AS 7 from a user perspectiveJBoss AS 7 from a user perspective
JBoss AS 7 from a user perspectiveMax Andersen
 
Java Night 2010 SteamCannon
Java Night 2010 SteamCannonJava Night 2010 SteamCannon
Java Night 2010 SteamCannonmarekgoldmann
 
Achieving Infrastructure Portability with Chef
Achieving Infrastructure Portability with ChefAchieving Infrastructure Portability with Chef
Achieving Infrastructure Portability with ChefMatt Ray
 
How to be a lion tamer
How to be a lion tamerHow to be a lion tamer
How to be a lion tamerWannes Rams
 
How to be a lion tamer
How to be a lion tamerHow to be a lion tamer
How to be a lion tamerSharon James
 
Implementing FaaS on Kubernetes using Kubeless
Implementing FaaS on Kubernetes using KubelessImplementing FaaS on Kubernetes using Kubeless
Implementing FaaS on Kubernetes using KubelessAhmed Misbah
 
OpenStack Deployments with Chef
OpenStack Deployments with ChefOpenStack Deployments with Chef
OpenStack Deployments with ChefMatt Ray
 
Play Framework and Activator
Play Framework and ActivatorPlay Framework and Activator
Play Framework and ActivatorKevin Webber
 
Optaros Surf Code Camp Introduction
Optaros Surf Code Camp IntroductionOptaros Surf Code Camp Introduction
Optaros Surf Code Camp IntroductionJeff Potts
 

Ähnlich wie 2011-04-26_various-assemblers-presentation (20)

CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...
CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...
CouchDB at its Core: Global Data Storage and Rich Incremental Indexing at Clo...
 
Zero to the Cloud with @NetflixOSS
Zero to the Cloud with @NetflixOSSZero to the Cloud with @NetflixOSS
Zero to the Cloud with @NetflixOSS
 
Australian OpenStack User Group August 2012: Chef for OpenStack
Australian OpenStack User Group August 2012: Chef for OpenStackAustralian OpenStack User Group August 2012: Chef for OpenStack
Australian OpenStack User Group August 2012: Chef for OpenStack
 
PLAT-16 Using Enterprise Content in Grails
PLAT-16 Using Enterprise Content in GrailsPLAT-16 Using Enterprise Content in Grails
PLAT-16 Using Enterprise Content in Grails
 
Chef for OpenStack - OpenStack Fall 2012 Summit
Chef for OpenStack  - OpenStack Fall 2012 SummitChef for OpenStack  - OpenStack Fall 2012 Summit
Chef for OpenStack - OpenStack Fall 2012 Summit
 
Chef for OpenStack- Fall 2012.pdf
Chef for OpenStack- Fall 2012.pdfChef for OpenStack- Fall 2012.pdf
Chef for OpenStack- Fall 2012.pdf
 
Sutol How To Be A Lion Tamer
Sutol How To Be A Lion TamerSutol How To Be A Lion Tamer
Sutol How To Be A Lion Tamer
 
Rails - getting started
Rails - getting startedRails - getting started
Rails - getting started
 
JBoss AS 7 from a user perspective
JBoss AS 7 from a user perspectiveJBoss AS 7 from a user perspective
JBoss AS 7 from a user perspective
 
spring-cloud.pptx
spring-cloud.pptxspring-cloud.pptx
spring-cloud.pptx
 
Java Night 2010 SteamCannon
Java Night 2010 SteamCannonJava Night 2010 SteamCannon
Java Night 2010 SteamCannon
 
Achieving Infrastructure Portability with Chef
Achieving Infrastructure Portability with ChefAchieving Infrastructure Portability with Chef
Achieving Infrastructure Portability with Chef
 
How to be a lion tamer
How to be a lion tamerHow to be a lion tamer
How to be a lion tamer
 
How to be a lion tamer
How to be a lion tamerHow to be a lion tamer
How to be a lion tamer
 
Implementing FaaS on Kubernetes using Kubeless
Implementing FaaS on Kubernetes using KubelessImplementing FaaS on Kubernetes using Kubeless
Implementing FaaS on Kubernetes using Kubeless
 
OpenStack Deployments with Chef
OpenStack Deployments with ChefOpenStack Deployments with Chef
OpenStack Deployments with Chef
 
Inexpensive storage
Inexpensive storageInexpensive storage
Inexpensive storage
 
Play Framework and Activator
Play Framework and ActivatorPlay Framework and Activator
Play Framework and Activator
 
Django In The Real World
Django In The Real WorldDjango In The Real World
Django In The Real World
 
Optaros Surf Code Camp Introduction
Optaros Surf Code Camp IntroductionOptaros Surf Code Camp Introduction
Optaros Surf Code Camp Introduction
 

Kürzlich hochgeladen

SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxAmanpreet Kaur
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxDr. Sarita Anand
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxJisc
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the ClassroomPooky Knightsmith
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 

Kürzlich hochgeladen (20)

Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 

2011-04-26_various-assemblers-presentation

  • 1. Assembly tools and Visualisation Matthias Haimel EBI is an Outstation of the European Molecular Biology Laboratory.
  • 2. Overview • Assemblers • ABySS • SOAPdenovo • Visualisation • Tablet • AbySS-Explorer • Read mapping • Sam / Bam • Visualisation • Artemis • IGV - Integrative Genomics Viewer 2 25.04.11 Assemblers
  • 3. ABySS Assembly By Short Sequences • Genome Sciences Centre, Vancouver • http://www.bcgsc.ca/platform/bioinfo/software/abyss • Open source, BCCA Licence • de Bruijn graph • Trimming (tip clipping), bubble popping • Use paired-end information: resolve ambiguities between contigs • parallel (use cluster) • Files • Fasta / Fastq • Sam/Bam • colour-space 3 25.04.11 Assemblers
  • 4. ABySS • ABYSS (singe end) • e.g. ABYSS -k27 single.fastq -o contigs.fa • abyss-pe (paired end) • e.g. abyss-pe k=27 n=10 in='read_1.fastq read_2.fastq' name=ecli • Multiple libraries • ... lib=’read1 read2’ read1=’read1_1.fa read1_2.fa’ read2=’read2_1.fa read1_2.fa’ 4 25.04.11 Assemblers
  • 5. SOAPdenovo • Beijing Genomics Institute (BGI), China • http://soap.genomics.org.cn/soapdenovo.html • Panda genome • Source available • de Bruijn graph • pre-set Kmer frequency threshold • Bubble removing • Build scaffold • mapping reads to contigs • gap filling 5 25.04.11 Assemblers
  • 6. SOAPdenovo • Full run • e.g. SOAPdenovo all -s read.config -K 27 -o contigs.fa • Run sub steps • pregraph = velveth • contig = velvetg • map map reads to contigs • scaff scaffolding • Configuration • Config file input instead of read files • Specify rank, usage (assembly/scaffolding), insert size 6 25.04.11 Assemblers
  • 7. Visualisation http://bioinf.scri.ac.uk/tablet/ • Tablet • Lightweight • Easy to use • Formats • ACE • AFG • BAM • BANK (AMOS) 7 25.04.11 Assemblers
  • 8. Visualisation - Velvet • Tablet • Velvetg ... -amos_file yes • GraphViz • Transform velvet graph into GraphViz format • Contributed by Paul Harrison • <velvet>/contrib/layout/ • Velvet -> .dot file (Python script) • .dot -> png (graphviz) 8 25.04.11 Assemblers
  • 9. Visualisation http://www.bcgsc.ca/platform/bioinfo/software/abyss-explorer • ABySS-Explorer • Visualizes ABySS assemblies • Interactive graph structure • Filter contigs 9 25.04.11 Assemblers
  • 10. Assembler - Practical • Assemblers • ABySS • SOAPdenovo • Visualisation • Tablet • ABySS-Explorer 10 25.04.11 Assemblers
  • 11. Read mapping http://samtools.sourceforge.net/SAM1.pdf • SAM / BAM • Sequence Alignment / Map format (SAM) • Binary form of SAM (BAM) • generic format • Flexible and simple • Compact (BAM) • Allow indexing • Load regions • Support streaming 11 25.04.11 Assemblers
  • 12. SAM • Header • File format version information • Sequence dictionary (name/length/..) • Read group (platform/library/...) • Program info • Body • Alignment information 12 25.04.11 Assemblers
  • 13. SAM Header • '@' followed by record type (two characters) @HD VN:1.0 @SQ SN:chr20 LN:62435964 @RG ID:L1 PU:SC_1_10 LB:SC_1 SM:NA12891 @RG ID:L2 PU:SC_2_12 LB:SC_2 SM:NA12891 13 25.04.11 Assemblers
  • 14. SAM Alignment • Tab delimited lines 14 25.04.11 Assemblers
  • 15. SAM Alignment • Tab delimited lines Read_28833_29006_6945 99 chr20 28833 20 10M1D25M = 28993 195 AGCT... <<<<... NM:i:1 RG:Z:L1 read_28701_28881_323b 147 chr20 28834 30 35M = 28701 -168 ACCT... <<7;:... MF:i:18 RG:Z:L2 15 25.04.11 Assemblers
  • 16. Tools • Mapping Reads • BWA • Bowtie • SSAHA2 • Manipulate SAM/BAM • SAM Tools package • Picard 16 25.04.11 Assemblers
  • 17. BWA • Burrows-Wheeler Alignment Tool • Map (singe/paired-end/long) reads to a sequence • Index database • bwa index -a bwtsw database.fasta • Align reads • bwa aln database.fasta short_read.fastq > aln_sa.sai • Generate alignments • bwa sampe database.fasta aln_sa1.sai aln_sa2.sai read1.fq read2.fq > aln.sam • Long reads • bwa bwasw database.fasta long_read.fastq > aln.sam 17 25.04.11 Assemblers
  • 18. SAM tools • Utilities for SAM format • samtools <command> ... • Commands: • view: SAM <-> BAM • sort: sort BAM file • index: build BAM file index • merge: merges x BAM files • pileup: alignment in the pileup format • tview: integrated Text alignment viewer 18 25.04.11 Assemblers
  • 19. Visualisation Integrative Genomics Viewer http://www.broadinstitute.org/igv/ • IGV • Good integration • Formats • DAS • BAM • GFF • ... • Tools • Run scripts • Export region • ... 19 25.04.11 Assemblers
  • 20. Visualisation http://www.sanger.ac.uk/resources/software/artemis/ • Artemis • Sequence Viewer • Annotation tool • Formats • EMBL • GENBANK • GFF • FASTA • BAM 20 25.04.11 Assemblers
  • 21. Mapping - Practical • Mapping reads + prepare for visalization • BWA • samtools • Visualisation • IGV 21 25.04.11 Assemblers