SlideShare ist ein Scribd-Unternehmen logo
1 von 12
Genome Research 19:1124-1132, 2009



Speaker: Eric C.Y., LEE
Aim

• They want to developed a SNP calling
  method for Illumina platform.
• Consider the data quality, alignment and
  experimental error common to this
  platform.
Applications of NGS

• From whole genome sequence to know the
  gene variations between individuals.
 • Disease
 • Drug
 • Environment
Workflow
     Sequencing
       reads




   Map reads onto
  reference genome
                                          Prior probability
                                              of each
                                             genotype



Recalibrate sequencing
    quality score




Calculate likelihood of
   each genotype




                             Inferred
                           genotype via
                          Bayes theorem
Traditional Method

• Phred score is a universal standard.
• Compare the sample sequence with
  reference genome and filter low score
  mismatch.
• A method to detect heterozygous
  polymorphisms.
Prior Probability
• According to existing researches
 • The estimated SNP rate between two
    human haploid chromosome is about
    0.001. (Sachidanandam et al. 2001).
 • Human reference genome sequence has
    an error rate of 0.00001. (Collins et al.
    2004)

  Set the homozygous SNP at 0.0005, and the
  hetrozygous rate is 0.001.
Prior Probability
• According to a previous study on dbSNP,
  transitions are four times more frequent
  than transversions among the substitution
  mutations. (Zhao and Boerwinkle 2002)
Alignment


• Indels is the error source.
• Using SOAP for alignment.
Recalibration

• 3’ -end of reads have a much higher error
  rate than earlier cycles.
• Original quality score can’t represent the
  true error rate.
• Check the mismatch in dbSNP.
Recalibration
• Illumina uses two lasers.
 • A and C use the same laser, G and T use
    another.
 • A-C and G-T substitution were 58%-72%
    overestimated.
• Duplicate reads
 • Penalty for these reads.
Likelihood Calculation

• Observed allele type
• Quality score
• Sequencing cycle
• Observation of the same allele from
  reads with the same mapping location.
Evaluation

• Comparison of the consensus sequence
  with Illumina human 1M BeadChip
  genotyped alleles from the same DNA
  sample showed genotyped alleles on the X
  chromosome and autosomes were
  covered at 99.97% and 99.84% consistency,
  respectively.

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Gene mapping
Gene mappingGene mapping
Gene mapping
 
dna fingerprinting powerpoint
dna fingerprinting powerpointdna fingerprinting powerpoint
dna fingerprinting powerpoint
 
Genomics
Genomics Genomics
Genomics
 
Concept of genome mapping
Concept of genome mappingConcept of genome mapping
Concept of genome mapping
 
Gene mapping methods
Gene mapping methodsGene mapping methods
Gene mapping methods
 
Association mapping
Association mappingAssociation mapping
Association mapping
 
Gene mapping / Genetic map vs Physical Map | determination of map distance a...
Gene mapping / Genetic map vs Physical Map |  determination of map distance a...Gene mapping / Genetic map vs Physical Map |  determination of map distance a...
Gene mapping / Genetic map vs Physical Map | determination of map distance a...
 
Snps and microarray
Snps and microarraySnps and microarray
Snps and microarray
 
Gene mapping tools
Gene mapping toolsGene mapping tools
Gene mapping tools
 
Mapping population ppt
Mapping population pptMapping population ppt
Mapping population ppt
 
Genome mapping
Genome mappingGenome mapping
Genome mapping
 
Genomic mapping, genetic mapping
Genomic mapping, genetic mappingGenomic mapping, genetic mapping
Genomic mapping, genetic mapping
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
 
genome mapping
genome mappinggenome mapping
genome mapping
 
Gene mapping
Gene mappingGene mapping
Gene mapping
 
Gene expression profiling i
Gene expression profiling  iGene expression profiling  i
Gene expression profiling i
 
Map based cloning
Map based cloning Map based cloning
Map based cloning
 
Genetic mapping
Genetic mappingGenetic mapping
Genetic mapping
 
Mapping the genome of bacteria
Mapping the genome of bacteriaMapping the genome of bacteria
Mapping the genome of bacteria
 
Molecular marker and gene mapping
Molecular marker and gene  mappingMolecular marker and gene  mapping
Molecular marker and gene mapping
 

Andere mochten auch

Neo4j: JDBC Connection Case Using LibreOffice
Neo4j: JDBC Connection Case Using LibreOfficeNeo4j: JDBC Connection Case Using LibreOffice
Neo4j: JDBC Connection Case Using LibreOfficeEric Lee
 
Introduction to 3rd sequencing
Introduction to 3rd sequencing Introduction to 3rd sequencing
Introduction to 3rd sequencing Eric Lee
 
Python and Neo4j
Python and Neo4jPython and Neo4j
Python and Neo4jEric Lee
 
Introduction to Graph Database
Introduction to Graph DatabaseIntroduction to Graph Database
Introduction to Graph DatabaseEric Lee
 
Google MAP API
Google MAP APIGoogle MAP API
Google MAP APIEric Lee
 
R3 Corda Simple Tutorial
R3 Corda Simple TutorialR3 Corda Simple Tutorial
R3 Corda Simple TutorialEric Lee
 
COSCUP 2016 Workshop : 快快樂樂學Neo4j
COSCUP 2016 Workshop : 快快樂樂學Neo4jCOSCUP 2016 Workshop : 快快樂樂學Neo4j
COSCUP 2016 Workshop : 快快樂樂學Neo4jEric Lee
 

Andere mochten auch (7)

Neo4j: JDBC Connection Case Using LibreOffice
Neo4j: JDBC Connection Case Using LibreOfficeNeo4j: JDBC Connection Case Using LibreOffice
Neo4j: JDBC Connection Case Using LibreOffice
 
Introduction to 3rd sequencing
Introduction to 3rd sequencing Introduction to 3rd sequencing
Introduction to 3rd sequencing
 
Python and Neo4j
Python and Neo4jPython and Neo4j
Python and Neo4j
 
Introduction to Graph Database
Introduction to Graph DatabaseIntroduction to Graph Database
Introduction to Graph Database
 
Google MAP API
Google MAP APIGoogle MAP API
Google MAP API
 
R3 Corda Simple Tutorial
R3 Corda Simple TutorialR3 Corda Simple Tutorial
R3 Corda Simple Tutorial
 
COSCUP 2016 Workshop : 快快樂樂學Neo4j
COSCUP 2016 Workshop : 快快樂樂學Neo4jCOSCUP 2016 Workshop : 快快樂樂學Neo4j
COSCUP 2016 Workshop : 快快樂樂學Neo4j
 

Ähnlich wie Genome Research 19:1124-1132, 2009

Nida ws neale_seq_data_gen
Nida ws neale_seq_data_genNida ws neale_seq_data_gen
Nida ws neale_seq_data_genFonareerat
 
2007. stephen chanock. technologic issues in gwas and follow up studies
2007. stephen chanock. technologic issues in gwas and follow up studies2007. stephen chanock. technologic issues in gwas and follow up studies
2007. stephen chanock. technologic issues in gwas and follow up studiesFOODCROPS
 
Introduction to haplotype blocks .pptx
Introduction to haplotype blocks .pptxIntroduction to haplotype blocks .pptx
Introduction to haplotype blocks .pptxFatma Sayed Ibrahim
 
paternity testing pptx.
paternity testing pptx.paternity testing pptx.
paternity testing pptx.kareem
 
Applications in forensics PCR and DNA fingerprinting.pptx
Applications in forensics PCR and DNA fingerprinting.pptxApplications in forensics PCR and DNA fingerprinting.pptx
Applications in forensics PCR and DNA fingerprinting.pptxVenkateswaraPrasad7
 
molecular marker RFLP, and application
molecular marker RFLP, and applicationmolecular marker RFLP, and application
molecular marker RFLP, and applicationKAUSHAL SAHU
 
SNPs and ESTs.pptx
SNPs and ESTs.pptxSNPs and ESTs.pptx
SNPs and ESTs.pptxRohithK65
 
GENE gene marker blood typing , abo blood typing vntr
GENE gene marker blood typing , abo blood typing vntrGENE gene marker blood typing , abo blood typing vntr
GENE gene marker blood typing , abo blood typing vntrAparnaAjayan8
 
GENE marker typing , abo blood grouping ,karl lamdsteiner
GENE marker typing , abo blood grouping ,karl lamdsteinerGENE marker typing , abo blood grouping ,karl lamdsteiner
GENE marker typing , abo blood grouping ,karl lamdsteinerAparnaAjayan8
 
A practical approach to Southern, Northern and Western Blot analyses
A practical approach to Southern, Northern and Western Blot analysesA practical approach to Southern, Northern and Western Blot analyses
A practical approach to Southern, Northern and Western Blot analysesRiddhi Datta
 
Virulence gene typing and its Applications in Genetic Engineering
Virulence gene typing and its Applications in Genetic EngineeringVirulence gene typing and its Applications in Genetic Engineering
Virulence gene typing and its Applications in Genetic EngineeringBita Rahmani
 
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...Eastern Pennsylvania Branch ASM
 
SNPs Presentation Cavalcanti Lab
SNPs Presentation Cavalcanti LabSNPs Presentation Cavalcanti Lab
SNPs Presentation Cavalcanti Labjsrep91
 
SNPs analysis methods
SNPs analysis methodsSNPs analysis methods
SNPs analysis methodshad89
 
Gene expression introduction
Gene expression introductionGene expression introduction
Gene expression introductionSetia Pramana
 
Single Nucleotide Polymorphisms (2)-1.pptx
Single Nucleotide Polymorphisms (2)-1.pptxSingle Nucleotide Polymorphisms (2)-1.pptx
Single Nucleotide Polymorphisms (2)-1.pptxkingmaxton8
 
Importance of Genetic Markers in Forensics
Importance of Genetic Markers in ForensicsImportance of Genetic Markers in Forensics
Importance of Genetic Markers in ForensicsMayank Raiborde
 
DNA fingerprinting- criminology and paternal identification
DNA fingerprinting- criminology and paternal identification DNA fingerprinting- criminology and paternal identification
DNA fingerprinting- criminology and paternal identification PrekshaJain113
 

Ähnlich wie Genome Research 19:1124-1132, 2009 (20)

Nida ws neale_seq_data_gen
Nida ws neale_seq_data_genNida ws neale_seq_data_gen
Nida ws neale_seq_data_gen
 
2007. stephen chanock. technologic issues in gwas and follow up studies
2007. stephen chanock. technologic issues in gwas and follow up studies2007. stephen chanock. technologic issues in gwas and follow up studies
2007. stephen chanock. technologic issues in gwas and follow up studies
 
Introduction to haplotype blocks .pptx
Introduction to haplotype blocks .pptxIntroduction to haplotype blocks .pptx
Introduction to haplotype blocks .pptx
 
paternity testing pptx.
paternity testing pptx.paternity testing pptx.
paternity testing pptx.
 
Applications in forensics PCR and DNA fingerprinting.pptx
Applications in forensics PCR and DNA fingerprinting.pptxApplications in forensics PCR and DNA fingerprinting.pptx
Applications in forensics PCR and DNA fingerprinting.pptx
 
molecular marker RFLP, and application
molecular marker RFLP, and applicationmolecular marker RFLP, and application
molecular marker RFLP, and application
 
SNPs and ESTs.pptx
SNPs and ESTs.pptxSNPs and ESTs.pptx
SNPs and ESTs.pptx
 
GENE gene marker blood typing , abo blood typing vntr
GENE gene marker blood typing , abo blood typing vntrGENE gene marker blood typing , abo blood typing vntr
GENE gene marker blood typing , abo blood typing vntr
 
GENE marker typing , abo blood grouping ,karl lamdsteiner
GENE marker typing , abo blood grouping ,karl lamdsteinerGENE marker typing , abo blood grouping ,karl lamdsteiner
GENE marker typing , abo blood grouping ,karl lamdsteiner
 
A practical approach to Southern, Northern and Western Blot analyses
A practical approach to Southern, Northern and Western Blot analysesA practical approach to Southern, Northern and Western Blot analyses
A practical approach to Southern, Northern and Western Blot analyses
 
Virulence gene typing and its Applications in Genetic Engineering
Virulence gene typing and its Applications in Genetic EngineeringVirulence gene typing and its Applications in Genetic Engineering
Virulence gene typing and its Applications in Genetic Engineering
 
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
New Molecular Approaches to Identify 21st Century Microbes - Dr Melissa Mille...
 
Ppt snp detection
Ppt snp detectionPpt snp detection
Ppt snp detection
 
SNPs Presentation Cavalcanti Lab
SNPs Presentation Cavalcanti LabSNPs Presentation Cavalcanti Lab
SNPs Presentation Cavalcanti Lab
 
SNPs analysis methods
SNPs analysis methodsSNPs analysis methods
SNPs analysis methods
 
Gene expression introduction
Gene expression introductionGene expression introduction
Gene expression introduction
 
Single Nucleotide Polymorphisms (2)-1.pptx
Single Nucleotide Polymorphisms (2)-1.pptxSingle Nucleotide Polymorphisms (2)-1.pptx
Single Nucleotide Polymorphisms (2)-1.pptx
 
Importance of Genetic Markers in Forensics
Importance of Genetic Markers in ForensicsImportance of Genetic Markers in Forensics
Importance of Genetic Markers in Forensics
 
Dna fingerprinting
Dna fingerprintingDna fingerprinting
Dna fingerprinting
 
DNA fingerprinting- criminology and paternal identification
DNA fingerprinting- criminology and paternal identification DNA fingerprinting- criminology and paternal identification
DNA fingerprinting- criminology and paternal identification
 

Genome Research 19:1124-1132, 2009

  • 1. Genome Research 19:1124-1132, 2009 Speaker: Eric C.Y., LEE
  • 2. Aim • They want to developed a SNP calling method for Illumina platform. • Consider the data quality, alignment and experimental error common to this platform.
  • 3. Applications of NGS • From whole genome sequence to know the gene variations between individuals. • Disease • Drug • Environment
  • 4. Workflow Sequencing reads Map reads onto reference genome Prior probability of each genotype Recalibrate sequencing quality score Calculate likelihood of each genotype Inferred genotype via Bayes theorem
  • 5. Traditional Method • Phred score is a universal standard. • Compare the sample sequence with reference genome and filter low score mismatch. • A method to detect heterozygous polymorphisms.
  • 6. Prior Probability • According to existing researches • The estimated SNP rate between two human haploid chromosome is about 0.001. (Sachidanandam et al. 2001). • Human reference genome sequence has an error rate of 0.00001. (Collins et al. 2004) Set the homozygous SNP at 0.0005, and the hetrozygous rate is 0.001.
  • 7. Prior Probability • According to a previous study on dbSNP, transitions are four times more frequent than transversions among the substitution mutations. (Zhao and Boerwinkle 2002)
  • 8. Alignment • Indels is the error source. • Using SOAP for alignment.
  • 9. Recalibration • 3’ -end of reads have a much higher error rate than earlier cycles. • Original quality score can’t represent the true error rate. • Check the mismatch in dbSNP.
  • 10. Recalibration • Illumina uses two lasers. • A and C use the same laser, G and T use another. • A-C and G-T substitution were 58%-72% overestimated. • Duplicate reads • Penalty for these reads.
  • 11. Likelihood Calculation • Observed allele type • Quality score • Sequencing cycle • Observation of the same allele from reads with the same mapping location.
  • 12. Evaluation • Comparison of the consensus sequence with Illumina human 1M BeadChip genotyped alleles from the same DNA sample showed genotyped alleles on the X chromosome and autosomes were covered at 99.97% and 99.84% consistency, respectively.

Hinweis der Redaktion

  1. \n
  2. \n
  3. \n
  4. \n
  5. \n
  6. \n
  7. \n
  8. \n
  9. \n
  10. \n
  11. \n
  12. \n