A look at Genome Assembly Visualization with ABySS-Explorer, as well as complementing genome browsing
(Using clustering and interactive data exploration)
Boost Fertility New Invention Ups Success Rates.pdf
Complementing Computation with Visualization in Genomics
1. British Columbia Cancer Agency Genome Sciences Centre Vancouver . British Columbia . Canada Complementing Computation with Visualization in Genomics March 11, 2010 EBI Interfaces Interest Forum Cydney Nielsen
25. Genome Sequencing cell population extracted DNA read pair information read sheared DNA dsDNA fragment (known size) sequencing reads (typically produce millions) AGCGGATTGCATGACAGT read GTACAGCCTGACAGAAGC GCGCTACGATCAGATCAA CATGACAGTCCGAGTACA TTCAGAATGGTACAGCAG
26. Capture read pair information After building the initial single-end (SE) contigs from k-mer sequences, ABySS uses paired-end reads to resolve ambiguities.
27. Capture read pair information Paired end read information is used the construct paired end (PE) contigs … 13+ 44- 46+ 4+ 79+ 70+ … blue gradient = paired end contig orange = selected single end contig
39. This representation is particularly powerful for revealing high-level genome assembly structure, not readily viewable in any other interactive tool
44. Genome Sequencing cell population extracted DNA sheared DNA sequencing reads (typically produce millions) AGCGGATTGCATGACAGT GTACAGCCTGACAGAAGC GCGCTACGATCAGATCAA CATGACAGTCCGAGTACA TTCAGAATGGTACAGCAG
45. Genome Sequencing cell population extracted DNA sheared DNA sequencing reads (typically produce millions) AGCGGATTGCATGACAGT GTACAGCCTGACAGAAGC GCGCTACGATCAGATCAA CATGACAGTCCGAGTACA TTCAGAATGGTACAGCAG
46. Genome Sequencing cell population Chromatin Immunoprecipitationand Sequencing (ChIP-Seq) extracted DNA selection sheared DNA sequencing reads (typically produce millions) AGCGGATTGCATGACAGT GTACAGCCTGACAGAAGC GCGCTACGATCAGATCAA GTACAGCCTGACAGAAGC CATGACAGTCCGAGTACA TTCAGAATGGTACAGCAG TTCAGAATGGTACAGCAG
47. Align sequences to the genome CCGAGTACAGCCTGACAGA GCATGACAGTCCGAGTAC TTGCATGACAGTCCGAGT AGCGGATTGCATGACAGT AGCGGATTGCATGACAGT AGCGGATTGCATGACAGT Reference Genome AGCGGATTGCATGACAGTCCGAGTACAGCCTGACAGA Read coverage Genomic coordinate
48. Genome browser can reveal local patterns H3K4me3 H3K36me3 H3K27me3 H3K9me3 H3K9Ac MRE
50. Focus on regions of interest 1. For example, transcriptional start sites (TSS +/- 3000 nt) H3K4me3 H3K9Ac H3K4me1 H3K36me3 MeDIP MRE 2. Extract data matrices Normalization for bin i, sample h: 3. Cluster matrices (k-means clustering with Euclidean distance)
51. Focus on regions of interest 1. For example, transcriptional start sites (TSS +/- 3000 nt) H3K4me3 H3K9Ac H3K4me1 H3K36me3 MeDIP MRE 2. Extract data matrices Normalization for bin i, sample h: 3. Cluster matrices (k-means clustering with Euclidean distance)
52. Focus on regions of interest 1. For example, transcriptional start sites (TSS +/- 3000 nt) H3K4me3 H3K9Ac H3K4me1 H3K36me3 MeDIP MRE 2. Extract data matrices Normalization for bin i, sample h: 3. Cluster matrices (k-means clustering with Euclidean distance)