Whole Genome Bisulfite Sequencing
(feasibility trial)
FISH 546
Mackenzie Gavery
Introduction
   QUESTION:
   Is whole genome bisulfite sequencing (WGBS) a viable option
   for discovering methylated cytosines in non-model species
   with limited genomic resources?
   HYPOTHESIS:
   With limited reference sequence available, it will be very
   difficult to annotate methylated regions of DNA.
   WHO CARES:
   DNA methylation is an epigenetic mechanism with important
   regulatory functions. There is evidence for a regulatory role
   in oysters; the goal is to explore it across different
   populations / generations, but first we need to know where to look.
Background: bisulfite sequencing

                        m
        CATGTTACGATCGGCTCG
               | bisulfite
                        m
        UATGTTAUGATCGGUTCG
               | PCR
        TATGTTATGATCGGTTCG
        ATACAATACTAGCCAAGC
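The conversion in the diagram can be reproduced in a few lines. This is only a minimal sketch: the function names are made up for illustration, and the methylated positions (0-based 11 and 16) are an assumption read off from which cytosines stay C in the diagram.

```python
def bisulfite_convert(seq, methylated=()):
    """Turn every unmethylated C into U; 0-based positions in `methylated` are protected."""
    protected = set(methylated)
    return "".join(
        "U" if base == "C" and i not in protected else base
        for i, base in enumerate(seq)
    )


def pcr_amplify(converted):
    """After PCR, U is read as T. Returns the top strand and its base-by-base complement
    (written left to right under the top strand, as in the diagram)."""
    top = converted.replace("U", "T")
    complement = top.translate(str.maketrans("ACGT", "TGCA"))
    return top, complement


original = "CATGTTACGATCGGCTCG"
# Assumption: the two cytosines that survive in the diagram are the methylated ones.
converted = bisulfite_convert(original, methylated=[11, 16])
top, bottom = pcr_amplify(converted)
print(converted)  # UATGTTAUGATCGGUTCG
print(top)        # TATGTTATGATCGGTTCG
```

After PCR, only the protected cytosines still read as C, which is what makes methylated positions detectable by sequencing.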
Bisulfite-PCR
  previous work: design primers to amplify
  specific regions of interest

  [figure: Kismeth]

  challenging to design primers with specificity,
  limited to known sequences
WGBS Challenges:
    sequencing issues: sequencers can have problems
    with low-complexity sequence

    genomic resources for non-model species are limited
      C. gigas
        Most resources are ESTs (coding sequences only)

    bioinformatics
      assemblies/alignments need to recognize C/T
      conversion

      bisulfite treatment results in 4 unique strands after PCR
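To see where the 4 strands come from: the two original strands are converted independently, so they are no longer complementary, and PCR then adds a complement for each. The sketch below assumes fully unmethylated DNA (every C converts); the strand labels are illustrative names, not terms from the slides.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    return seq.translate(COMPLEMENT)[::-1]


def convert(seq):
    """Assumption: no methylation, so every C reads as T after bisulfite + PCR."""
    return seq.replace("C", "T")


def four_strands(top_strand):
    """Return the four distinct sequences present after bisulfite treatment and PCR."""
    bottom_strand = reverse_complement(top_strand)
    bs_top = convert(top_strand)
    bs_bottom = convert(bottom_strand)
    return {
        "converted top": bs_top,
        "PCR copy of converted top": reverse_complement(bs_top),
        "converted bottom": bs_bottom,
        "PCR copy of converted bottom": reverse_complement(bs_bottom),
    }


strands = four_strands("CATGTTACGATCGGCTCG")
for name, seq in strands.items():
    print(f"{name:30s} {seq}")
```

A read can originate from any of the four, so an aligner has to consider both C/T and G/A differences against the reference.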
Approach:
  generate mock bisulfite-seq reads using Atlantic
  salmon GSS sequences as a surrogate for C. gigas

  use CLC to assemble mock bisulfite-treated reads
  back to non-treated mock sequences
Approach:
  1. Atlantic salmon GSS: 203,387 sequences
  2. after de novo assembly: 128,337 contigs
  3. generate 1 million random, ~40 bp fragments
  4. convert all C to T, with the exception of 'ACG' sequences
     (259,750 C's remain)
  5. create a similar fragment library that is not converted,
     to use as reference sequence
  6. use the non-treated library to assemble the bisulfite-treated reads
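The mock-read generation described here could look roughly like the following. It is a sketch under stated assumptions rather than the script actually used: fragments are exactly 40 bp, contigs are sampled uniformly, input is uppercase ACGT, and the function names are hypothetical.

```python
import random
import re


def random_fragments(contigs, n, length=40, seed=0):
    """Draw n fragments of fixed length from randomly chosen contigs."""
    rng = random.Random(seed)
    usable = [c for c in contigs if len(c) >= length]  # skip contigs that are too short
    fragments = []
    for _ in range(n):
        contig = rng.choice(usable)
        start = rng.randrange(len(contig) - length + 1)
        fragments.append(contig[start:start + length])
    return fragments


def mock_bisulfite(fragment):
    """Convert every C to T except the C inside an 'ACG' triplet."""
    # Mark the protected C with a placeholder, convert the rest, then restore it.
    marked = re.sub("ACG", "AmG", fragment)
    return marked.replace("C", "T").replace("m", "C")


# Toy stand-in for the assembled contigs.
contigs = ["ACGTTCCGATCGGCTCGACGTTAGCCATGCACGTTACGGATCCATGCAAT" * 2]
untreated = random_fragments(contigs, n=5)
treated = [mock_bisulfite(f) for f in untreated]
remaining_c = sum(read.count("C") for read in treated)
print(mock_bisulfite("CACGTC"))  # TACGTT
```

Keeping the untreated fragments alongside the treated ones gives a matched reference and read set, so every surviving C can be traced back to an 'ACG' site.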
Assembly 1st try:
  1. non-treated fragments:
     1 million short reads (40 million bp)
  2. de novo assembly of non-treated fragments:
     459 contigs (~300 bp)
  3. assemble bisulfite reads to the de novo non-treated contigs:
     42 contigs (~46 bp), 1940 bp
  4. BLAST non-treated contigs with matches for ID:
     found hits, but many not annotated
Analysis summary:

                                 non-treated     non-treated     non-treated
                                 reference A*    reference B     reference converted
  assembly settings
  ('global alignment', 'allow mismatch'):
    limit                        8               8               8
    mismatch cost                2               3               3
    score limit                  8               15              15
  contigs generated              42              71              11,213
    (total bp)                   (1,940)         (21,487)        (508,799)
  total SNPs                     42              42              473
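One way to read the jump in the "reference converted" column: once the reference is also C-to-T converted, treated reads no longer look like mismatches. The sketch below illustrates that idea with exact string matching. It is not how CLC works, and treating "converted" as a full C-to-T conversion of the reference is an assumption. Function names are hypothetical.

```python
def c_to_t(seq):
    return seq.replace("C", "T")


def find_read(read, reference):
    """Locate a bisulfite read by exact match in C->T converted space (first hit only)."""
    return c_to_t(reference).find(c_to_t(read))


def call_methylation(read, reference):
    """For each reference C covered by the read, report True if the read still has C."""
    pos = find_read(read, reference)
    if pos < 0:
        return None  # read not found
    calls = {}
    for offset, base in enumerate(read):
        if reference[pos + offset] == "C":
            calls[pos + offset] = (base == "C")
    return calls


reference = "TTGACGTCAGG"
read = "GACGTTAG"  # reference[2:10] with the C at position 7 converted to T
print(call_methylation(read, reference))  # {4: True, 7: False}
```

Matching in converted space finds the position; comparing against the unconverted reference afterwards recovers which cytosines were protected.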
Other tools:
  see Nature Reviews Genetics 11, 191-203 | doi:10.1038/nrg2732
Conclusions:
  QUESTION:
  Is whole genome bisulfite sequencing (WGBS) a viable
  option for discovering methylated cytosines in non-model
  species with limited genomic resources?

  HYPOTHESIS:
  With limited reference sequence available, it will be very
  difficult to map methylated regions of DNA.

  ANSWER:
  Yup
Next Steps
   Find a tool to do 'customizable' assembly
     e.g. only allow C/T (or G/A) mismatches

   new protocol using SOLiD that will only sequence
   1 strand (this will make analysis easier)

   reduced representation
     digest with restriction enzymes and size-select DNA
     prior to making library
     DNA methylation enrichment kit: fractionate DNA by
     binding to methyl-binding domain proteins (only
     sequence heavily methylated regions)
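As an illustration of the 'customizable' matching idea in the first item, the sketch below accepts a read at a reference position only if every difference is the permitted conversion (reference C read as T, or reference G read as A for the opposite-strand products). It is a brute-force, ungapped scan with hypothetical function names, meant to show the rule and not to stand in for a real aligner.

```python
def conversion_compatible(read, window, ref_base="C", read_base="T"):
    """True if `read` equals `window` except where ref_base appears as read_base."""
    if len(read) != len(window):
        return False
    for ref, obs in zip(window, read):
        if ref == obs:
            continue
        if ref == ref_base and obs == read_base:
            continue  # permitted bisulfite-type mismatch
        return False  # any other mismatch rejects this position
    return True


def find_compatible(read, reference, ref_base="C", read_base="T"):
    """Return every start position where the read fits under the mismatch rule."""
    hits = []
    for start in range(len(reference) - len(read) + 1):
        window = reference[start:start + len(read)]
        if conversion_compatible(read, window, ref_base, read_base):
            hits.append(start)
    return hits


reference = "TTGACGTCAGG"
print(find_compatible("GACGTTAG", reference))            # [2]  (C->T allowed)
print(find_compatible("GACGATAG", reference))            # []   (T->A is not allowed)
print(find_compatible("AACATCAA", reference, "G", "A"))  # [2]  (G->A allowed)
```

Restricting mismatches this way keeps true sequence differences from being absorbed as if they were conversion events, which a generic mismatch cost cannot do.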
Thank you
