THE NEXT DIMENSION IN STR SEQUENCING:
Polymorphisms in Flanking Regions
& their Allelic Associations
Katherine Butler Gettings
Rachel Aponte
Kevin Kiesler
Peter Vallone
September 2, 2015
NIST Applied Genetics Group
Disclaimer
I will mention commercial STR kit names and information, but I am not
attempting to endorse any specific products.
NIST Disclaimer: Certain commercial equipment, instruments and materials
are identified in order to specify experimental procedures as completely as
possible. In no case does such identification imply a recommendation, nor does it
imply that any of the materials, instruments or equipment identified are
necessarily the best available for the purpose.
Information presented does not necessarily represent the official position of
the National Institute of Standards and Technology or the U.S. Department of
Justice.
Our group receives or has received funding from the FBI Laboratory and the
National Institute of Justice.
Data Generation
NIST Population Samples
• N=183
• Caucasian - 70
• Hispanic - 45
• African American - 68
Amplification & Library Prep
• PowerSeq Auto System (Beta)
• PPFusion Loci
• Illumina TruSeq HS PCR-Free
Sequencing
• MiSeq
Bioinformatics
• STRait Razor
• ExactID (Battelle)
• CE concordance
Analysis
• Repeat Region
• Flanking Region
• Stutter
[Figure: N=183; panel label: Length]
Forensic STR Sequence Diversity
[Figure: N=183; panel label: Simple Repeats]
[Figure: panel labels: 150 bp flanks; 110 bp flanks]
Categories of Flanking Regions
1. No polymorphisms
2. Polymorphisms Associated with a Sequence Variant
3. Rare polymorphisms
4. Population or Allele Specific polymorphisms
5. “Old” polymorphisms (not population or allele specific)
6. Multiple polymorphisms in Haplotype
Flanking region SNPs by category
MAF = Minor Allele Frequency; Assoc. = Number of Associated STR Alleles; Gained = Alleles Gained with Flanking SNPs
AA = African American; Cau = Caucasian; His = Hispanic

                            MAF                  Assoc.
Locus     rs Number         AA     Cau    His    AA   Cau  His  Gained

No SNPs
D3S1358, FGA, D12S391, Penta E, D19S433, D21S11

SNPs Associated with Repeat Region Sequence Variant
D1S1656   rs4847015         0.191  0.336  0.311  5    5    4    0
vWA       rs11063971        0.044  0.086  0.078  1    1    1    0
          rs11063970        0.044  0.086  0.078  1    1    1
          rs11063969        0.044  0.086  0.078  1    1    1
          rs75219269        0.044  0.086  0.078  1    1    1
          rs199970098       0.029  -      0.011  2    -    1

Rare SNPs
D2S441    rs74640515        0.015  0.014  0.067  1    1    2    2
CSF1PO    C-T 36bp 5'       -      -      0.011  -    -    1    1
D8S1179   rs138862078       0.007  -      -      1    -    -    1
D10S1248  T-G 2bp 3'        -      0.007  -      -    1    -    1
D18S51    rs535833682       0.029  -      -      3    -    -    1
Penta D   rs7279663         0.022  0.014  0.011  1    2    1    2
D22S1045  rs190864081       0.007  -      -      1    -    -    1

Population / Allele Specific SNPs
TPOX      rs13422969        0.135  -      0.011  2    -    1    4
          rs115644759       0.022  -      -      2    -    -
          rs149212737       -      0.007  -      -    1    -
TH01      rs79373318        0.148  -      -      3    -    -    3

Single "Old" SNP (not population or allele specific)
D16S539   rs1728369         0.404  0.171  0.278  4    4    4    5
          G-A 94bp 5'       -      -      0.011  -    -    1
D2S1338   rs6736691         0.221  0.264  0.300  8    6    4    3

Multiple SNPs in Haplotype
D5S818    G-T 4bp 3'        0.316  0.257  0.189  7    5    6    17
          rs25768           0.228  0.257  0.156  5    5    5
          rs146841551       0.029  -      -      1    -    -
          rs541272009       0.007  -      -      1    -    -
D13S317   rs9546005         0.309  0.286  0.333  5    4    7    16
          rs73525369        0.066  -      -      3    -    -
          rs73250432        -      0.014  0.011  -    2    1
          rs146621667       0.015  -      -      2    -    -
          4bp del 8bp 3'    -      -      0.022  -    -    2
          4bp del 21bp 3'   0.007  -      0.022  1    -    1
D7S820    rs16887642        0.177  0.050  0.022  4    2    2    15
          rs7789995         0.015  0.164  0.122  2    4    4
          rs7786079         0.176  0.021  0.011  5    2    1
          1bp del 21bp 3'   -      -      0.011  -    -    1
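The smallest non-zero frequencies in the table (0.007, 0.007 and 0.011) are consistent with a single observed copy of the minor allele among 2N chromosomes, given the sample sizes on the Data Generation slide. The Python sketch below illustrates that arithmetic. It assumes diploid autosomal loci and that every sample was typed at every locus; the function names are hypothetical and are not taken from the presentation.

```python
# Sample sizes from the Data Generation slide (individuals per population).
SAMPLE_SIZES = {"African American": 68, "Caucasian": 70, "Hispanic": 45}


def minor_allele_frequency(minor_count: int, n_individuals: int) -> float:
    """Minor allele count divided by 2N chromosomes (assumes diploid, autosomal loci)."""
    return minor_count / (2 * n_individuals)


def single_observation_mafs() -> dict:
    """MAF each population would show if the minor allele were observed exactly once."""
    return {
        pop: round(minor_allele_frequency(1, n), 3)
        for pop, n in SAMPLE_SIZES.items()
    }


if __name__ == "__main__":
    for pop, maf in single_observation_mafs().items():
        print(f"{pop}: {maf}")
```

Under the same assumption, a reported value such as 0.029 in the African American samples would correspond to 4 copies in 136 chromosomes.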
No SNPs were identified in the sequenced flanking regions of:
D3S1358, FGA, D12S391, Penta E, D19S433, D21S11
D1S1656 has one SNP, rs4847015
• Well distributed across populations
• Well distributed across STR alleles
• Does NOT increase the number of alleles…
D1S1656 has one SNP, rs4847015, and it is always associated with x.3 alleles:
15.3, 16.3, 17.3, 18.3, 19.3 (rs4847015 = T)
Rare SNPs were identified in the sequenced flanking regions of:
D2S441, CSF1PO, D8S1179, D10S1248, D18S51, Penta D, D22S1045
TPOX: rs13422969 and two additional rare SNPs
• Well distributed across Africa, rare elsewhere
• Associated with “9” allele
D16S539: rs1728369 and one additional rare SNP
• Well distributed across populations and alleles
D5S818:
• G→T 4bp 3’
• rs25768
• two additional rare SNPs
Well distributed across populations and alleles
*[AGAT]n[ACAT][AGAT]
Conclusions
• Ideal SNPs for increasing allelic diversity are neither population nor allele specific
• Multiple SNPs in haplotype may substantially increase allelic diversity
• Population studies are needed to associate flanking region polymorphisms with STR alleles
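One way to read the "Alleles Gained with Flanking SNPs" column is as the number of distinct alleles seen when the flank is included, minus the number seen from the repeat region alone. The Python sketch below makes that reading concrete. It is an assumption about how such a count could be computed, not the authors' stated method, and the function name and example observations are hypothetical.

```python
def alleles_gained(observations):
    """Count extra distinct alleles that appear once the flank is considered.

    observations: iterable of (repeat_allele, flank_haplotype) pairs,
    one pair per observed chromosome.
    """
    observed = list(observations)
    repeat_only = {repeat for repeat, _flank in observed}
    repeat_plus_flank = set(observed)
    return len(repeat_plus_flank) - len(repeat_only)


# Hypothetical allele-specific SNP: the T variant only ever occurs on x.3 alleles,
# so the flank never splits an existing repeat allele into two.
allele_specific = [("15", "C"), ("16", "C"), ("15.3", "T"), ("16.3", "T")]

# Hypothetical SNP that is independent of the repeat allele: each repeat allele
# is seen with both flank variants.
independent = [("11", "C"), ("11", "T"), ("12", "C"), ("12", "T")]

if __name__ == "__main__":
    print(alleles_gained(allele_specific))
    print(alleles_gained(independent))
```

In this toy data the allele-specific SNP adds no alleles, while the independent SNP doubles the allele count, which is the pattern the first conclusion describes.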
Flanking Region Polymorphisms in Practice
Problem is ambiguity
Full string is unambiguous, but intractable for reporting
Report differences from a reference genome
• Delineate length of amplicons
• Agree upon a reference genome
[Figure: “My Wife and My Mother-in-Law”, William Ely Hill, 1915]
Sample    …CCAGTTACGACGCTAACTACTCTGAG…
Reference …CCAGTTACGACGCTAACTACTTTGAG…
Lab 1
Sample    …CCAGTTACGACGCTAACT
Reference …CCAGTTACGACGCTAACTACTTTGAG…
Lab 3
Sample    …CCAGTTACGACGCTAACTACTCTGAG…
Reference …CCAGTTACGACGCTAACTACTCTGAG…
Lab 2
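The three Sample/Reference pairs can be compared programmatically to show why amplicon length and reference choice both matter. The Python sketch below is illustrative only: it assumes the strings are already aligned at the same start coordinate with no insertions or deletions, and the function name is hypothetical.

```python
def flank_differences(sample: str, reference: str):
    """Return (1-based position, reference base, sample base) for each mismatch.

    Only positions covered by both strings are compared, so a short amplicon
    cannot report a difference that lies beyond its end.
    """
    covered = min(len(sample), len(reference))
    return [
        (i + 1, reference[i], sample[i])
        for i in range(covered)
        if sample[i] != reference[i]
    ]


# Sequences copied from the slide (ellipses dropped).
FULL_SAMPLE = "CCAGTTACGACGCTAACTACTCTGAG"
SHORT_SAMPLE = "CCAGTTACGACGCTAACT"
REFERENCE_T = "CCAGTTACGACGCTAACTACTTTGAG"
REFERENCE_C = "CCAGTTACGACGCTAACTACTCTGAG"

if __name__ == "__main__":
    print(flank_differences(FULL_SAMPLE, REFERENCE_T))
    print(flank_differences(SHORT_SAMPLE, REFERENCE_T))
    print(flank_differences(FULL_SAMPLE, REFERENCE_C))
```

Only the first comparison reports a difference (T in the reference, C in the sample). The shorter amplicon ends before the variable base, and the alternative reference already carries C, so the same sample would be reported three different ways.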
Flanking Region Polymorphisms in Practice
Problem is ambiguity
Full string is unambiguous, but intractable for reporting
Report differences from a reference genome
• Delineate length of amplicons
• Agree upon a reference genome
• Report rs number (not always available)
• Report position and base differences, e.g. Chr6:1004589:T
  – No allowance for new genome versions
• InDel alignment parameters
Looking Forward
Central repository for population sequence data at forensic STR loci
Labs generating worldwide population data will upload full strings
Database converts strings to tractable notation
• Bracketed repeat region
• Flank polymorphisms with length delineated
Database is searchable by repeat motif or string
Gatekeeper for unusual sequences: consistency and back-compatible allele calling
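As an illustration of converting a full string into bracketed repeat notation, the Python sketch below collapses consecutive identical motifs into `[MOTIF]n` units. It assumes a fixed four-base motif length and a repeat region that starts in frame. The function name and output format are hypothetical and do not describe the proposed database.

```python
from itertools import groupby


def bracket_repeats(repeat_region: str, motif_len: int = 4) -> str:
    """Collapse runs of identical motifs into bracketed notation.

    The sequence is cut into consecutive units of motif_len bases; a trailing
    partial unit (if any) is kept as its own bracketed unit.
    """
    units = [
        repeat_region[i:i + motif_len]
        for i in range(0, len(repeat_region), motif_len)
    ]
    return " ".join(f"[{unit}]{len(list(run))}" for unit, run in groupby(units))


if __name__ == "__main__":
    # Hypothetical repeat region: three AGAT units, one ACAT, one AGAT.
    print(bracket_repeats("AGATAGATAGATACATAGAT"))
```

A real converter would also need locus-specific rules for where the repeat region begins and ends, which is the consistency role the gatekeeper note describes.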
Acknowledgements
NIST
Pete Vallone
Kevin Kiesler
Nate Olson
Jo Lynne Harenza
Mike Coble
Becky Hill
Margaret Kline
Student Interns
Rachel Aponte - George Washington University
Harish Swaminathan - Rutgers University
Anna Blendermann - Montgomery College
Promega
Doug Storts
Jay Patel
Contact Information
katherine.gettings@nist.gov

The Next Dimension in STR Sequencing: Polymorphisms in Flanking Regions and Their Allelic Associations

  • 1. THE NEXT DIMENSION IN STR SEQUENCING: Polymorphisms in Flanking Regions & their Allelic Associations Katherine Butler Gettings Rachel Aponte Kevin Kiesler Peter Vallone September 2, 2015 NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 2. Disclaimer I will mention commercial STR kit names and information, but I am not attempting to endorse any specific products. NIST Disclaimer: Certain commercial equipment, instruments and materials are identified in order to specify experimental procedures as completely as possible. In no case does such identification imply a recommendation or it imply that any of the materials, instruments or equipment identified are necessarily the best available for the purpose. Information presented does not necessarily represent the official position of the National Institute of Standards and Technology or the U.S. Department of Justice. Our group receives or has received funding from the FBI Laboratory and the National Institute of Justice. NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 3. NIST Population Samples • N=183 •Caucasian - 70 •Hispanic - 45 •African American - 68 Amplification & Library Prep •PowerSeq Auto System (Beta) •PPFusion Loci •Illumina TruSeq HS PCR-Free Sequencing •MiSeq Bioinformatics •STRait Razor •ExactID (Battelle) •CE concordance Analysis •Repeat Region •Flanking Region •Stutter Data GenerationNIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 4. N=183 Length NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 5. Forensic STR Sequence Diversity N=183 Simple Repeats NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 6. 150 bp flanks 110 bp flanks NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 7. Categories of Flanking Regions 1. No polymorphisms 2. Polymorphisms Associated with a Sequence Variant 3. Rare polymorphisms 4. Population or Allele Specific polymorphisms 5. “Old” polymorphisms (not population or allele specific) 6. Multiple polymorphisms in Haplotype NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 8. Category Locus rs Number African American Caucasian Hispanic African American Caucasian Hispanic D3S1358 FGA D12S391 Penta E D19S433 D21S11 D1S1656 rs4847015 0.191 0.336 0.311 5 5 4 0 rs11063971 0.044 0.086 0.078 1 1 1 rs11063970 0.044 0.086 0.078 1 1 1 rs11063969 0.044 0.086 0.078 1 1 1 rs75219269 0.044 0.086 0.078 1 1 1 rs199970098 0.029 - 0.011 2 - 1 D2S441 rs74640515 0.015 0.014 0.067 1 1 2 2 CSF1PO C-T 36bp 5' - - 0.011 - - 1 1 D8S1179 rs138862078 0.007 - - 1 - - 1 D10S1248 T-G 2bp 3' - 0.007 - - 1 - 1 D18S51 rs535833682 0.029 - - 3 - - 1 Penta D rs7279663 0.022 0.014 0.011 1 2 1 2 D22S1045 rs190864081 0.007 - - 1 - - 1 rs13422969 0.135 - 0.011 2 - 1 rs115644759 0.022 - - 2 - - rs149212737 - 0.007 - - 1 - TH01 rs79373318 0.148 - - 3 - - 3 rs1728369 0.404 0.171 0.278 4 4 4 G-A 94bp 5' - - 0.011 - - 1 D2S1338 rs6736691 0.221 0.264 0.300 8 6 4 3 G-T 4bp 3' 0.316 0.257 0.189 7 5 6 rs25768 0.228 0.257 0.156 5 5 5 rs146841551 0.029 - - 1 - - rs541272009 0.007 - - 1 - - rs9546005 0.309 0.286 0.333 5 4 7 rs73525369 0.066 - - 3 - - rs73250432 - 0.014 0.011 - 2 1 rs146621667 0.015 - - 2 - - 4bp del 8bp 3' - - 0.022 - - 2 4bp del 21bp 3' 0.007 - 0.022 1 - 1 rs16887642 0.177 0.050 0.022 4 2 2 rs7789995 0.015 0.164 0.122 2 4 4 rs7786079 0.176 0.021 0.011 5 2 1 1bp del 21bp 3' - - 0.011 - - 1 Minor Allele Frequency Number of Associated STR Alleles Alleles Gained with Flanking SNPs No SNPs 0vWA SNPs Associated with Repeat Region Sequence Variant 5D16S539 Single "Old" SNP (not population or allele specific) TPOXPopulation / Allele Specific SNPs Rare SNPs 4 16D13S317 D5S818 17 D7S820 15 Multiple SNPs in Haplotype NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 9. No SNPs were identified in the sequenced flanking regions of: D3S1358, Penta E, FGA, D19S433, D12S391, D21S11
  • 10. D1S1656 has one SNP, rs4847015
      • Well distributed across populations…
      Table excerpt: SNPs Associated with Repeat Region Sequence Variant (MAF and number of associated STR alleles as African American / Caucasian / Hispanic)
        D1S1656 (alleles gained with flanking SNPs: 0)
          rs4847015: MAF 0.191 / 0.336 / 0.311; STR alleles 5 / 5 / 4
        vWA (alleles gained with flanking SNPs: 0)
          rs11063971: MAF 0.044 / 0.086 / 0.078; STR alleles 1 / 1 / 1
          rs11063970: MAF 0.044 / 0.086 / 0.078; STR alleles 1 / 1 / 1
          rs11063969: MAF 0.044 / 0.086 / 0.078; STR alleles 1 / 1 / 1
          rs75219269: MAF 0.044 / 0.086 / 0.078; STR alleles 1 / 1 / 1
          rs199970098: MAF 0.029 / - / 0.011; STR alleles 2 / - / 1
  • 11. D1S1656 has one SNP, rs4847015
      • Well distributed across populations
      • Well distributed across STR alleles
      • Does NOT increase the number of alleles…
      Table excerpt: SNPs Associated with Repeat Region Sequence Variant (MAF and number of associated STR alleles as African American / Caucasian / Hispanic)
        D1S1656 (alleles gained with flanking SNPs: 0)
          rs4847015: MAF 0.191 / 0.336 / 0.311; STR alleles 5 / 5 / 4
        vWA (alleles gained with flanking SNPs: 0)
          rs11063971: MAF 0.044 / 0.086 / 0.078; STR alleles 1 / 1 / 1
          rs11063970: MAF 0.044 / 0.086 / 0.078; STR alleles 1 / 1 / 1
          rs11063969: MAF 0.044 / 0.086 / 0.078; STR alleles 1 / 1 / 1
          rs75219269: MAF 0.044 / 0.086 / 0.078; STR alleles 1 / 1 / 1
          rs199970098: MAF 0.029 / - / 0.011; STR alleles 2 / - / 1
  • 12. D1S1656 has one SNP, rs4847015, and it is always associated with x.3 alleles:
      15.3, 16.3, 17.3, 18.3, 19.3: rs4847015 = T
      Table excerpt: SNPs Associated with Repeat Region Sequence Variant (MAF and number of associated STR alleles as African American / Caucasian / Hispanic)
        D1S1656 (alleles gained with flanking SNPs: 0)
          rs4847015: MAF 0.191 / 0.336 / 0.311; STR alleles 5 / 5 / 4
        vWA (alleles gained with flanking SNPs: 0)
          rs11063971: MAF 0.044 / 0.086 / 0.078; STR alleles 1 / 1 / 1
          rs11063970: MAF 0.044 / 0.086 / 0.078; STR alleles 1 / 1 / 1
          rs11063969: MAF 0.044 / 0.086 / 0.078; STR alleles 1 / 1 / 1
          rs75219269: MAF 0.044 / 0.086 / 0.078; STR alleles 1 / 1 / 1
          rs199970098: MAF 0.029 / - / 0.011; STR alleles 2 / - / 1
  • 13. Rare SNPs were identified in the sequenced flanking regions of: D2S441, D18S51, CSF1PO, Penta D, D8S1179, D22S1045, D10S1248
      Table excerpt: Rare SNPs (MAF and number of associated STR alleles as African American / Caucasian / Hispanic)
        D2S441 (alleles gained with flanking SNPs: 2)
          rs74640515: MAF 0.015 / 0.014 / 0.067; STR alleles 1 / 1 / 2
        CSF1PO (alleles gained with flanking SNPs: 1)
          C-T 36bp 5': MAF - / - / 0.011; STR alleles - / - / 1
        D8S1179 (alleles gained with flanking SNPs: 1)
          rs138862078: MAF 0.007 / - / -; STR alleles 1 / - / -
        D10S1248 (alleles gained with flanking SNPs: 1)
          T-G 2bp 3': MAF - / 0.007 / -; STR alleles - / 1 / -
        D18S51 (alleles gained with flanking SNPs: 1)
          rs535833682: MAF 0.029 / - / -; STR alleles 3 / - / -
        Penta D (alleles gained with flanking SNPs: 2)
          rs7279663: MAF 0.022 / 0.014 / 0.011; STR alleles 1 / 2 / 1
        D22S1045 (alleles gained with flanking SNPs: 1)
          rs190864081: MAF 0.007 / - / -; STR alleles 1 / - / -
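The smallest frequencies in the table (0.007 and 0.011) are consistent with a single observed copy of the minor allele. The sketch below checks that arithmetic, assuming the standard autosomal definition MAF = minor-allele copies / (2 × number of individuals) and the sample sizes given earlier in the deck (African American 68, Caucasian 70, Hispanic 45). The slides do not report allele counts, so the function name and the counts used here are illustrative assumptions.

```python
# Sample sizes from the NIST population set described earlier in the deck.
SAMPLES = {"African American": 68, "Caucasian": 70, "Hispanic": 45}


def minor_allele_frequency(minor_count: int, n_individuals: int) -> float:
    """MAF for an autosomal SNP: minor-allele copies / total chromosomes (2N)."""
    return minor_count / (2 * n_individuals)


# Frequency implied by exactly one observed copy of the minor allele
# (an assumed count; the slides report only the frequencies).
singleton_maf = {
    pop: round(minor_allele_frequency(1, n), 3) for pop, n in SAMPLES.items()
}
print(singleton_maf)
```

Under the same assumption, a value such as 0.029 in the African American column would correspond to four copies in 136 chromosomes.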
  • 14. TPOX: rs13422969 and two additional rare SNPs
      • Well distributed across Africa, rare elsewhere
      Table excerpt: Population / Allele Specific SNPs (MAF and number of associated STR alleles as African American / Caucasian / Hispanic)
        TPOX (alleles gained with flanking SNPs: 4)
          rs13422969: MAF 0.135 / - / 0.011; STR alleles 2 / - / 1
          rs115644759: MAF 0.022 / - / -; STR alleles 2 / - / -
          rs149212737: MAF - / 0.007 / -; STR alleles - / 1 / -
        TH01 (alleles gained with flanking SNPs: 3)
          rs79373318: MAF 0.148 / - / -; STR alleles 3 / - / -
  • 15. TPOX: rs13422969 and two additional rare SNPs
      • Well distributed across Africa, rare elsewhere
      • Associated with “9” allele
      Table excerpt: Population / Allele Specific SNPs (MAF and number of associated STR alleles as African American / Caucasian / Hispanic)
        TPOX (alleles gained with flanking SNPs: 4)
          rs13422969: MAF 0.135 / - / 0.011; STR alleles 2 / - / 1
          rs115644759: MAF 0.022 / - / -; STR alleles 2 / - / -
          rs149212737: MAF - / 0.007 / -; STR alleles - / 1 / -
        TH01 (alleles gained with flanking SNPs: 3)
          rs79373318: MAF 0.148 / - / -; STR alleles 3 / - / -
  • 16. D16S539: rs1728369 and one additional rare SNP
      • Well distributed across populations and alleles
      Table excerpt: Single "Old" SNP (not population or allele specific) (MAF and number of associated STR alleles as African American / Caucasian / Hispanic)
        D16S539 (alleles gained with flanking SNPs: 5)
          rs1728369: MAF 0.404 / 0.171 / 0.278; STR alleles 4 / 4 / 4
          G-A 94bp 5': MAF - / - / 0.011; STR alleles - / - / 1
        D2S1338 (alleles gained with flanking SNPs: 3)
          rs6736691: MAF 0.221 / 0.264 / 0.300; STR alleles 8 / 6 / 4
  • 17. D5S818:
      • G→T 4bp 3’
      • rs25768
      • two additional rare SNPs
      Well distributed across populations and alleles
      Table excerpt: Multiple SNPs in Haplotype (MAF and number of associated STR alleles as African American / Caucasian / Hispanic)
        D5S818 (alleles gained with flanking SNPs: 17)
          G-T 4bp 3': MAF 0.316 / 0.257 / 0.189; STR alleles 7 / 5 / 6
          rs25768: MAF 0.228 / 0.257 / 0.156; STR alleles 5 / 5 / 5
          rs146841551: MAF 0.029 / - / -; STR alleles 1 / - / -
          rs541272009: MAF 0.007 / - / -; STR alleles 1 / - / -
        D13S317 (alleles gained with flanking SNPs: 16)
          rs9546005: MAF 0.309 / 0.286 / 0.333; STR alleles 5 / 4 / 7
          rs73525369: MAF 0.066 / - / -; STR alleles 3 / - / -
          rs73250432: MAF - / 0.014 / 0.011; STR alleles - / 2 / 1
          rs146621667: MAF 0.015 / - / -; STR alleles 2 / - / -
          4bp del 8bp 3': MAF - / - / 0.022; STR alleles - / - / 2
          4bp del 21bp 3': MAF 0.007 / - / 0.022; STR alleles 1 / - / 1
        D7S820 (alleles gained with flanking SNPs: 15)
          rs16887642: MAF 0.177 / 0.050 / 0.022; STR alleles 4 / 2 / 2
          rs7789995: MAF 0.015 / 0.164 / 0.122; STR alleles 2 / 4 / 4
          rs7786079: MAF 0.176 / 0.021 / 0.011; STR alleles 5 / 2 / 1
          1bp del 21bp 3': MAF - / - / 0.011; STR alleles - / - / 1
  • 19. *[AGAT]n[ACAT][AGAT]
  • 20. Conclusions
      • Ideal SNPs for increasing allelic diversity are neither population nor allele specific
      • Multiple SNPs in haplotype may substantially increase allelic diversity
      • Population studies are needed to associate flanking region polymorphisms with STR alleles
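One way to picture how flanking haplotypes increase allelic diversity is to count distinct alleles at three levels of resolution: length only, repeat-region sequence, and repeat region plus flank. The sketch below does this on invented observations. The `Allele` type, the `snpA`/`snpB` labels, and the reading of "alleles gained" as the difference between the last two counts are all assumptions for illustration, not the authors' data or method.

```python
from typing import NamedTuple


class Allele(NamedTuple):
    length: str  # length-based call, e.g. "12"
    repeat: str  # bracketed repeat-region sequence
    flank: str   # flanking haplotype; "" means reference at every flank site


# Hypothetical observations, invented for illustration only.
observed = [
    Allele("11", "[AGAT]11", ""),
    Allele("11", "[AGAT]11", "snpA"),
    Allele("12", "[AGAT]12", ""),
    Allele("12", "[AGAT]12", "snpA"),
    Allele("12", "[AGAT]12", "snpA+snpB"),
    Allele("12", "[AGAT]10[ACAT][AGAT]", ""),
]

by_length = {a.length for a in observed}              # length only
by_repeat = {(a.length, a.repeat) for a in observed}  # plus repeat sequence
by_full = set(observed)                               # plus flanking haplotype

# Assumed definition: extra distinct alleles seen only once flanks are included.
gained = len(by_full) - len(by_repeat)
print(len(by_length), len(by_repeat), len(by_full), gained)
```

In this toy set, two SNPs that occur together on some alleles and separately on others account for all of the gain, which is the pattern the second conclusion describes.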
  • 21. Flanking Region Polymorphisms in Practice
      Problem is ambiguity
      Full string is unambiguous, intractable for reporting
      Report differences from a reference genome
        • Delineate length of amplicons
        • Agree upon a reference genome
      Lab 1
        Sample    …CCAGTTACGACGCTAACTACTCTGAG…
        Reference …CCAGTTACGACGCTAACTACTTTGAG…
      Lab 3
        Sample    …CCAGTTACGACGCTAACT
        Reference …CCAGTTACGACGCTAACTACTTTGAG…
      Lab 2
        Sample    …CCAGTTACGACGCTAACTACTCTGAG…
        Reference …CCAGTTACGACGCTAACTACTCTGAG…
      Image: “My Wife and My Mother-in-Law”, William Ely Hill, 1915
  • 22. Flanking Region Polymorphisms in Practice
      Problem is ambiguity
      Full string is unambiguous, intractable for reporting
      Report differences from a reference genome
        • Delineate length of amplicons
        • Agree upon a reference genome
        • Report rs number (not always available)
        • Report position and base differences, e.g. Chr6:1004589:T
          – No allowance for new genome versions
        • InDel alignment parameters
      Image: “My Wife and My Mother-in-Law”, William Ely Hill, 1915
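The ambiguity described on these two slides can be sketched with the example sequences from slide 21: the same sample yields a reported difference, or none, depending on amplicon length and on which reference is used. The function name `differences` is hypothetical, positions are counted from the start of the excerpt rather than in genome coordinates, and the comparison is a plain base-by-base check with no indel alignment.

```python
def differences(sample: str, reference: str) -> list[tuple[int, str, str]]:
    """Return (1-based position, reference base, sample base) per mismatch.

    Only the overlapping prefix is compared, so a shorter amplicon cannot
    report anything beyond its own end. Indels are not handled.
    """
    return [
        (i + 1, r, s)
        for i, (s, r) in enumerate(zip(sample, reference))
        if s != r
    ]


SAMPLE = "CCAGTTACGACGCTAACTACTCTGAG"
REF_WITH_T = "CCAGTTACGACGCTAACTACTTTGAG"
REF_WITH_C = "CCAGTTACGACGCTAACTACTCTGAG"

full_vs_t = differences(SAMPLE, REF_WITH_T)        # full amplicon, reference has T
short_vs_t = differences(SAMPLE[:18], REF_WITH_T)  # amplicon ends before the site
full_vs_c = differences(SAMPLE, REF_WITH_C)        # different reference, has C

print(full_vs_t, short_vs_t, full_vs_c)
```

Only the first comparison reports a difference, which is why the slides ask for an agreed amplicon length and an agreed reference before differences are reported.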
  • 23. Looking Forward
      Central repository for population sequence data at forensic STR loci
      Labs generating worldwide population data will upload full strings
      Database converts strings to tractable notation
        • Bracketed repeat region
        • Flank polymorphisms with length delineated
      Database is searchable by repeat motif or string
      Gatekeeper for unusual sequences: consistency and back-compatible allele calling
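As a rough illustration of converting a full string to bracketed repeat notation, the sketch below collapses consecutive identical repeat units. It assumes the input is the repeat region only, already in frame, with a fixed four-base motif length. The function name `bracket_repeats` is hypothetical, and a real repository would also need to handle flanks, partial repeats, and locus-specific conventions.

```python
from itertools import groupby


def bracket_repeats(seq: str, motif_len: int = 4) -> str:
    """Collapse runs of identical motif_len-base units into [UNIT]n notation."""
    if len(seq) % motif_len != 0:
        raise ValueError("sequence length must be a multiple of the motif length")
    units = [seq[i:i + motif_len] for i in range(0, len(seq), motif_len)]
    parts = []
    for unit, run in groupby(units):
        count = len(list(run))
        # A single unit is written without a count, as in [ACAT][AGAT].
        parts.append(f"[{unit}]{count}" if count > 1 else f"[{unit}]")
    return "".join(parts)


example = bracket_repeats("AGAT" * 10 + "ACAT" + "AGAT")
print(example)
```

The example string follows the [AGAT]n[ACAT][AGAT] pattern shown on slide 19, with n = 10 chosen arbitrarily.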
  • 24. Acknowledgements
      NIST: Pete Vallone, Kevin Kiesler, Nate Olson, Jo Lynne Harenza, Mike Coble, Becky Hill, Margaret Kline
      Student Interns:
        Rachel Aponte - George Washington University
        Harish Swaminathan - Rutgers University
        Anna Blendermann - Montgomery College
      Promega: Doug Storts, Jay Patel
      Contact Information: katherine.gettings@nist.gov