SlideShare ist ein Scribd-Unternehmen logo
1 von 15
Downloaden Sie, um offline zu lesen
Transposable elements of
Agavoideae
Kate L Hertweck (@k8hert)
The University of Texas at Tyler
Alexandros Bousios
University of Sussex
Michael McKain
Donald Danforth Plant Science Center
en.wikipedia.org en.wikipedia.org
Why Agavoideae? (besides the obvious)
●
Asparagaceae subfamily Agavoideae: 23 genera, 637 species
●
agave, yucca, Joshua Tree
●
Economically important:
●
tequila, food starches
●
biofuels
●
ornamentals
●
interesting morphological, ecological, life history traits
●
Recent diversification correlated with ecological traits
(Good-Avila, 2006)
gizmodo.com
Hertweck et al., TEs in Agavoideae
commons.wikimedia.org
Agavoideae genomics
●
Emerging genomic/transcriptomic resources
●
Polyploidy, bimodality (McKain et al., 2012)
●
Variation in TEs (Bousios et al., 2007) and genome size (Zonneveld, 2003)
Darlington 1963
Hertweck et al., TEs in Agavoideae
Guadelupe et al., 2008
Transposable elements as a model system
●
TEs, mobile genetic elements, or jumping genes
●
Parasitic, self-replicating, move independently in the genome
●
Many different types; some similar to or derived from viruses
Class I: Retrotransposons
(copy and paste)
LTR (Gypsy,
Copia/Sireviruses,
Caulimoviruses)
LINE
SINE
Class II: DNA transposons
(cut and paste)
TIR (EnSpm, hAT, MuDR,
TcMar, PIF)
MITE
Helitron
Hertweck et al., TEs in Agavoideae
●
TE proliferation is associated with modifications across the genome,
including changes to gene expression and genome size
●
TE composition/abundance may interact with organismal changes, like
hybridization, polyploidy, phenotype, life history
Mine existing genomic resources across Agavoideae to characterize
repetitive elements
Estimate abundance and diversity of transposable elements (TEs)
Cross validate results from different methods
The big questions:
Is transposon composition in Agavoideae genomes related to
hypothesized patterns of genomic evolution?
Do transposon proliferation and other genomic traits correlate with life
history traits in Agavoideae?
Hertweck et al., TEs in Agavoideae
Our goals
Aphyllanthes
Lomandra
Sansevieria
Asparagus
Ledebouria
Dichelostemma
Agapanthis
Allium
Haworthia
Hosta
Scadoxus
0%
10%
20%
30%
40%
50%
60%
70%
0
5000
10000
15000
20000
25000
Agavoideae includes substantial diversity
(even by Asparagales standards)
Unknown contigs
Known repeats
Genomesize(Mb/1C)
Percentageofsequence
readsfromnucleargenome
Hertweck, 2013, Genome
●
Genomes are difficult to assemble
●
Genome size varies
Repeat characterization methods
Genome survey sequences
●
most from MonAToL
project (Illumina SE, 30-
100 bp)
●
quality control of fastq files
with PRINSEQ
●
assembled with
MaSuRCA v2.3.2 or
RepARK v1.3.0
●
organellar sequences
filtered with BLAST
●
0.02-0.38x coverage
●
12 taxa, only 8 with
sufficient contigs to analyze
Scripts available:
github.com/k8hertweck/REpipe
Hertweck et al., TEs in Agavoideae
Nuclear contigs
●
assembled contigs are
consensus of most
abundant TEs in the
genome
●
TEs must exist in high copy
to have sufficient reads for
detection (assembly)
●
the older a TE insertion,
the more likely it has
accumulated mutations
which will inhibit detection
●
data presented as
percentage of TE type in
nuclear genome (relative
abundance)
en.wikipedia.org
Repeat characterization methods
Genome survey sequences
Scripts available:
github.com/k8hertweck/REpipe
Hertweck et al., TEs in Agavoideae
Transcriptomes
●
various sources, tissues,
coverage, assembly
methods
●
downloaded assemblies
(no other filtering)
Nuclear contigs
●
contigs represent actively
transcribed TEs, which
may or may not relate to
abundance in the genome
●
even relatively rare TEs
may be detectable
●
data presented as
percentage of transcripts
(relative expressed
diversity)
en.wikipedia.org
Repeat characterization methods
Genome survey sequences
Scripts available:
github.com/k8hertweck/REpipe
Hertweck et al., TEs in Agavoideae
TranscriptomesNuclear contigs
RepeatMasker
●
Liliopsida library (mostly
references from grasses)
●
searches many types of
TEs, including parts
without genes
●
some ambiguous results
(same contig, multiple
types of TE)
Domain searching
●
rpstblastn against protein
domain models (CDD)
for TE-specific genes
●
clustering with
CD-HIT-EST
Repeat contigs
Unknown contigs
read mapping
Wikimedia
Commons
Detectable repeats vary across species
Hertweck et al., TEs in Agavoideae
Repeat abundance
●
percentage of total reads
●
repeat annotations from
RepeatMasker
●
most reads map to unannotated
contigs (or remain unmapped)
Repeat diversity
●
percentage of nuclear contigs
●
annotations from RepeatMasker
●
most contigs are LTRs
●
transcriptomes represent broader
variation in diverse TEs (because
of the overall number of contigs)
GSS transcriptome
Sampled taxa possess same diversity of DNA TE families,
but at different abundance
Hertweck et al., TEs in Agavoideae
GSS data
●
percentage of nuclear genome
●
annotations from RepeatMasker
●
most taxa have a single family
present in high abundance
●
may reflect karyotype
Transcriptome data
●
percentage of contigs
●
annotations from RepeatMasker
●
all families present (active?) in all
taxa
●
minor variation in family-level
diversity for some taxa
●
not incongruent with GSS data
Patterns of LTR abundance rely on annotation method
Hertweck et al., TEs in Agavoideae
●
Gypsy more abundant in
most genomes, although
proportions vary
●
no relationship with LTR
abundance and genome
size
●
including CDD annotations
can double LTR
abundance in some
genomes
●
Proportion of Copia:Gypsy
remains same for some
taxa (Schoenolirion), but
changes for others (Hosta)
●
LTR diversity (numbers of
contigs) shows similar
patterns
tetraploid,
largest (known) genome in dataset
Hertweck et al., TEs in Agavoideae
Conclusions
●
Mine existing genomic resources across Agavoideae to characterize
repetitive elements
●
Methods matter; bias is not evenly distributed and patterns difficult to
discern
●
Low proportion of GSS data assemble for Agavoideae
●
large numbers of ancestral (inactive) insertions, related to whole
genome duplication event?
●
low-level diversity in abundant TEs just different enough from available
libraries to remain undetectable
●
DNA transposon dominance may differ among clades
●
Gypsy more abundant in most genomes
Hertweck et al., TEs in Agavoideae
Future work
●
Future work:
●
Improve annotations (build custom repeat libraries) and analyze TE
subtaxonomy
●
improve quantification of repeats (P-clouds, RepeatExplorer)
●
validate results using multiple sequencing attempts/data types
●
Big questions:
●
Is transposon composition in Agaviodeae genomes related to
hypothesized patterns of genomic evolution?
●
Do transposon proliferation and other genomic traits correlate with life
history traits in Agavoideae?
Acknowledgements
MonAToL
Texas Advanced Computing Center (TACC)
National Evolutionary Synthesis Center (NESCent, Duke U)
Research
https://sites.google.com/site/k8hertweck
Blog:
k8hert.blogspot.com
Twitter @k8hert
Google+ k8hertweck@gmail.com

Weitere ähnliche Inhalte

Was ist angesagt?

Reverse-and forward-engineering specificity of carbohydrate-processing enzymes
Reverse-and forward-engineering specificity of carbohydrate-processing enzymesReverse-and forward-engineering specificity of carbohydrate-processing enzymes
Reverse-and forward-engineering specificity of carbohydrate-processing enzymesLeighton Pritchard
 
A statistical physics approach to system biology
A statistical physics approach to system biologyA statistical physics approach to system biology
A statistical physics approach to system biologySamir Suweis
 
BITS - Introduction to comparative genomics
BITS - Introduction to comparative genomicsBITS - Introduction to comparative genomics
BITS - Introduction to comparative genomicsBITS
 
EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18Jonathan Eisen
 
Application of genomics in animals
Application of genomics in animalsApplication of genomics in animals
Application of genomics in animalsUsman Arshad
 
Bioc4700 2014 Guest Lecture
Bioc4700   2014 Guest LectureBioc4700   2014 Guest Lecture
Bioc4700 2014 Guest LectureDan Gaston
 
What is comparative genomics
What is comparative genomicsWhat is comparative genomics
What is comparative genomicsUsman Arshad
 
NAISTビッグデータシンポジウム - バイオ久保先生
NAISTビッグデータシンポジウム - バイオ久保先生NAISTビッグデータシンポジウム - バイオ久保先生
NAISTビッグデータシンポジウム - バイオ久保先生ysuzuki-naist
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicsprateek kumar
 
Synonymous mutations as drivers in human cancer genomes.
Synonymous mutations as drivers in human cancer genomes.Synonymous mutations as drivers in human cancer genomes.
Synonymous mutations as drivers in human cancer genomes.Fran Supek
 
BITS - Comparative genomics on the genome level
BITS - Comparative genomics on the genome levelBITS - Comparative genomics on the genome level
BITS - Comparative genomics on the genome levelBITS
 
Phylogenomics talk in 2000 at University of Maryland by J. Eisen
Phylogenomics talk in 2000 at University of Maryland by J. EisenPhylogenomics talk in 2000 at University of Maryland by J. Eisen
Phylogenomics talk in 2000 at University of Maryland by J. EisenJonathan Eisen
 
"Phylogenomics: Combining Evolutionary Reconstructions and Genome Analysis in...
"Phylogenomics: Combining Evolutionary Reconstructions and Genome Analysis in..."Phylogenomics: Combining Evolutionary Reconstructions and Genome Analysis in...
"Phylogenomics: Combining Evolutionary Reconstructions and Genome Analysis in...Jonathan Eisen
 
Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14mhaendel
 
L14 human genome
L14 human genomeL14 human genome
L14 human genomeMUBOSScz
 
EVE 161 Winter 2018 Class 17
EVE 161 Winter 2018 Class 17EVE 161 Winter 2018 Class 17
EVE 161 Winter 2018 Class 17Jonathan Eisen
 

Was ist angesagt? (20)

Reverse-and forward-engineering specificity of carbohydrate-processing enzymes
Reverse-and forward-engineering specificity of carbohydrate-processing enzymesReverse-and forward-engineering specificity of carbohydrate-processing enzymes
Reverse-and forward-engineering specificity of carbohydrate-processing enzymes
 
A statistical physics approach to system biology
A statistical physics approach to system biologyA statistical physics approach to system biology
A statistical physics approach to system biology
 
Metagenomics
MetagenomicsMetagenomics
Metagenomics
 
BITS - Introduction to comparative genomics
BITS - Introduction to comparative genomicsBITS - Introduction to comparative genomics
BITS - Introduction to comparative genomics
 
EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18
 
Hertweck bbl2012
Hertweck bbl2012Hertweck bbl2012
Hertweck bbl2012
 
Testing for Food Authenticity
Testing for Food AuthenticityTesting for Food Authenticity
Testing for Food Authenticity
 
Application of genomics in animals
Application of genomics in animalsApplication of genomics in animals
Application of genomics in animals
 
Bioc4700 2014 Guest Lecture
Bioc4700   2014 Guest LectureBioc4700   2014 Guest Lecture
Bioc4700 2014 Guest Lecture
 
What is comparative genomics
What is comparative genomicsWhat is comparative genomics
What is comparative genomics
 
NAISTビッグデータシンポジウム - バイオ久保先生
NAISTビッグデータシンポジウム - バイオ久保先生NAISTビッグデータシンポジウム - バイオ久保先生
NAISTビッグデータシンポジウム - バイオ久保先生
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Synonymous mutations as drivers in human cancer genomes.
Synonymous mutations as drivers in human cancer genomes.Synonymous mutations as drivers in human cancer genomes.
Synonymous mutations as drivers in human cancer genomes.
 
BITS - Comparative genomics on the genome level
BITS - Comparative genomics on the genome levelBITS - Comparative genomics on the genome level
BITS - Comparative genomics on the genome level
 
Phylogenomics talk in 2000 at University of Maryland by J. Eisen
Phylogenomics talk in 2000 at University of Maryland by J. EisenPhylogenomics talk in 2000 at University of Maryland by J. Eisen
Phylogenomics talk in 2000 at University of Maryland by J. Eisen
 
"Phylogenomics: Combining Evolutionary Reconstructions and Genome Analysis in...
"Phylogenomics: Combining Evolutionary Reconstructions and Genome Analysis in..."Phylogenomics: Combining Evolutionary Reconstructions and Genome Analysis in...
"Phylogenomics: Combining Evolutionary Reconstructions and Genome Analysis in...
 
Comparitive genomics
Comparitive genomicsComparitive genomics
Comparitive genomics
 
Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14
 
L14 human genome
L14 human genomeL14 human genome
L14 human genome
 
EVE 161 Winter 2018 Class 17
EVE 161 Winter 2018 Class 17EVE 161 Winter 2018 Class 17
EVE 161 Winter 2018 Class 17
 

Andere mochten auch

SeqinR - biological data handling
SeqinR - biological data handlingSeqinR - biological data handling
SeqinR - biological data handlingpau_corral
 
Hertweck AB3ACBS presentation
Hertweck AB3ACBS presentationHertweck AB3ACBS presentation
Hertweck AB3ACBS presentationKate Hertweck
 
iEvoBio Hertweck presentation 2012
iEvoBio Hertweck presentation 2012iEvoBio Hertweck presentation 2012
iEvoBio Hertweck presentation 2012Kate Hertweck
 
Developing an undergraduate bioinformatics course
Developing an undergraduate bioinformatics courseDeveloping an undergraduate bioinformatics course
Developing an undergraduate bioinformatics courseKate Hertweck
 
regex-presentation_ed_goodwin
regex-presentation_ed_goodwinregex-presentation_ed_goodwin
regex-presentation_ed_goodwinschamber
 
Hertweck Monocots V Presentation
Hertweck Monocots V PresentationHertweck Monocots V Presentation
Hertweck Monocots V PresentationKate Hertweck
 
Getting More Phylotastic
Getting More PhylotasticGetting More Phylotastic
Getting More PhylotasticArlin Stoltzfus
 
Hertweck Evolution 2014
Hertweck Evolution 2014Hertweck Evolution 2014
Hertweck Evolution 2014Kate Hertweck
 
Phylogenetics in R
Phylogenetics in RPhylogenetics in R
Phylogenetics in Rschamber
 
Bayesian Divergence Time Estimation – Workshop Lecture
Bayesian Divergence Time Estimation – Workshop LectureBayesian Divergence Time Estimation – Workshop Lecture
Bayesian Divergence Time Estimation – Workshop LectureTracy Heath
 
Phylogeny in R - Bianca Santini Sheffield R Users March 2015
Phylogeny in R - Bianca Santini Sheffield R Users March 2015Phylogeny in R - Bianca Santini Sheffield R Users March 2015
Phylogeny in R - Bianca Santini Sheffield R Users March 2015Paul Richards
 
Chamberlain PhD Thesis
Chamberlain PhD ThesisChamberlain PhD Thesis
Chamberlain PhD Thesisschamber
 
R Introduction
R IntroductionR Introduction
R Introductionschamber
 
Web data from R
Web data from RWeb data from R
Web data from Rschamber
 
Digital Experimental Phylogenetics - Evolution2014
Digital Experimental Phylogenetics - Evolution2014Digital Experimental Phylogenetics - Evolution2014
Digital Experimental Phylogenetics - Evolution2014Cory Kohn
 

Andere mochten auch (20)

SeqinR - biological data handling
SeqinR - biological data handlingSeqinR - biological data handling
SeqinR - biological data handling
 
Phylolecture
PhylolecturePhylolecture
Phylolecture
 
Hertweck AB3ACBS presentation
Hertweck AB3ACBS presentationHertweck AB3ACBS presentation
Hertweck AB3ACBS presentation
 
iEvoBio Hertweck presentation 2012
iEvoBio Hertweck presentation 2012iEvoBio Hertweck presentation 2012
iEvoBio Hertweck presentation 2012
 
Developing an undergraduate bioinformatics course
Developing an undergraduate bioinformatics courseDeveloping an undergraduate bioinformatics course
Developing an undergraduate bioinformatics course
 
Poster
PosterPoster
Poster
 
Poster
PosterPoster
Poster
 
regex-presentation_ed_goodwin
regex-presentation_ed_goodwinregex-presentation_ed_goodwin
regex-presentation_ed_goodwin
 
Hertweck Monocots V Presentation
Hertweck Monocots V PresentationHertweck Monocots V Presentation
Hertweck Monocots V Presentation
 
Getting More Phylotastic
Getting More PhylotasticGetting More Phylotastic
Getting More Phylotastic
 
Hertweck Evolution 2014
Hertweck Evolution 2014Hertweck Evolution 2014
Hertweck Evolution 2014
 
Phylogenetics in R
Phylogenetics in RPhylogenetics in R
Phylogenetics in R
 
Careers in Botany
Careers in BotanyCareers in Botany
Careers in Botany
 
Evolution 2012
Evolution 2012Evolution 2012
Evolution 2012
 
Bayesian Divergence Time Estimation – Workshop Lecture
Bayesian Divergence Time Estimation – Workshop LectureBayesian Divergence Time Estimation – Workshop Lecture
Bayesian Divergence Time Estimation – Workshop Lecture
 
Phylogeny in R - Bianca Santini Sheffield R Users March 2015
Phylogeny in R - Bianca Santini Sheffield R Users March 2015Phylogeny in R - Bianca Santini Sheffield R Users March 2015
Phylogeny in R - Bianca Santini Sheffield R Users March 2015
 
Chamberlain PhD Thesis
Chamberlain PhD ThesisChamberlain PhD Thesis
Chamberlain PhD Thesis
 
R Introduction
R IntroductionR Introduction
R Introduction
 
Web data from R
Web data from RWeb data from R
Web data from R
 
Digital Experimental Phylogenetics - Evolution2014
Digital Experimental Phylogenetics - Evolution2014Digital Experimental Phylogenetics - Evolution2014
Digital Experimental Phylogenetics - Evolution2014
 

Ähnlich wie Transposable elements of Agavoideae

Burns_et_al-2016-Molecular_Ecology_Resources
Burns_et_al-2016-Molecular_Ecology_ResourcesBurns_et_al-2016-Molecular_Ecology_Resources
Burns_et_al-2016-Molecular_Ecology_ResourcesMercedes Burns
 
Utility of transcriptome sequencing for phylogenetic
Utility of transcriptome sequencing for phylogeneticUtility of transcriptome sequencing for phylogenetic
Utility of transcriptome sequencing for phylogeneticEdizonJambormias2
 
Improving pan-genome annotation using whole genome multiple alignment
Improving pan-genome annotation using whole genome multiple alignmentImproving pan-genome annotation using whole genome multiple alignment
Improving pan-genome annotation using whole genome multiple alignmentRaunak Shrestha
 
Comparative genomics to the rescue: How complete is your plant genome sequence?
Comparative genomics to the rescue: How complete is your plant genome sequence?Comparative genomics to the rescue: How complete is your plant genome sequence?
Comparative genomics to the rescue: How complete is your plant genome sequence?Klaas Vandepoele
 
Biochemical and molecular markers for characterization
Biochemical and molecular markers for characterizationBiochemical and molecular markers for characterization
Biochemical and molecular markers for characterizationmithraa thirumalai
 
Plant Chromosomes: European Cytogeneticists outline: Trude Schwarzacher and P...
Plant Chromosomes: European Cytogeneticists outline: Trude Schwarzacher and P...Plant Chromosomes: European Cytogeneticists outline: Trude Schwarzacher and P...
Plant Chromosomes: European Cytogeneticists outline: Trude Schwarzacher and P...Pat (JS) Heslop-Harrison
 
Genome to pangenome : A doorway into crops genome exploration
Genome to pangenome : A doorway into crops genome explorationGenome to pangenome : A doorway into crops genome exploration
Genome to pangenome : A doorway into crops genome explorationKiranKm11
 
The Human Genome Project - Part I
The Human Genome Project - Part IThe Human Genome Project - Part I
The Human Genome Project - Part Ihhalhaddad
 
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...Jonathan Eisen
 
Catalyzing Plant Science Research with RNA-seq
Catalyzing Plant Science Research with RNA-seqCatalyzing Plant Science Research with RNA-seq
Catalyzing Plant Science Research with RNA-seqManjappa Ganiger
 
Variant (SNP) calling - an introduction (with a worked example, using FreeBay...
Variant (SNP) calling - an introduction (with a worked example, using FreeBay...Variant (SNP) calling - an introduction (with a worked example, using FreeBay...
Variant (SNP) calling - an introduction (with a worked example, using FreeBay...Manikhandan Mudaliar
 
2015 beacon-metagenome-tutorial
2015 beacon-metagenome-tutorial2015 beacon-metagenome-tutorial
2015 beacon-metagenome-tutorialc.titus.brown
 
OKC Grand Rounds 2009
OKC Grand Rounds 2009OKC Grand Rounds 2009
OKC Grand Rounds 2009Sean Davis
 
Genetics of gene expression primer
Genetics of gene expression primerGenetics of gene expression primer
Genetics of gene expression primerChris Cotsapas
 
Domains of unknown function are essential in yeast
Domains of unknown function are essential in yeastDomains of unknown function are essential in yeast
Domains of unknown function are essential in yeastLaura Berry
 
Introduction to epigenetics and study design
Introduction to epigenetics and study designIntroduction to epigenetics and study design
Introduction to epigenetics and study designamlbinder
 
Transcriptomics: A time efficient tool for crop improvement
Transcriptomics: A time efficient tool for crop improvementTranscriptomics: A time efficient tool for crop improvement
Transcriptomics: A time efficient tool for crop improvementSajid Sheikh
 
21 kebere bezaweletaw 207-217
21 kebere bezaweletaw 207-21721 kebere bezaweletaw 207-217
21 kebere bezaweletaw 207-217Alexander Decker
 
Epigenetic Analysis Sequencing
Epigenetic Analysis SequencingEpigenetic Analysis Sequencing
Epigenetic Analysis SequencingLisa Martinez
 
Identification, annotation and visualisation of extreme changes in splicing w...
Identification, annotation and visualisation of extreme changes in splicing w...Identification, annotation and visualisation of extreme changes in splicing w...
Identification, annotation and visualisation of extreme changes in splicing w...Mar Gonzàlez-Porta
 

Ähnlich wie Transposable elements of Agavoideae (20)

Burns_et_al-2016-Molecular_Ecology_Resources
Burns_et_al-2016-Molecular_Ecology_ResourcesBurns_et_al-2016-Molecular_Ecology_Resources
Burns_et_al-2016-Molecular_Ecology_Resources
 
Utility of transcriptome sequencing for phylogenetic
Utility of transcriptome sequencing for phylogeneticUtility of transcriptome sequencing for phylogenetic
Utility of transcriptome sequencing for phylogenetic
 
Improving pan-genome annotation using whole genome multiple alignment
Improving pan-genome annotation using whole genome multiple alignmentImproving pan-genome annotation using whole genome multiple alignment
Improving pan-genome annotation using whole genome multiple alignment
 
Comparative genomics to the rescue: How complete is your plant genome sequence?
Comparative genomics to the rescue: How complete is your plant genome sequence?Comparative genomics to the rescue: How complete is your plant genome sequence?
Comparative genomics to the rescue: How complete is your plant genome sequence?
 
Biochemical and molecular markers for characterization
Biochemical and molecular markers for characterizationBiochemical and molecular markers for characterization
Biochemical and molecular markers for characterization
 
Plant Chromosomes: European Cytogeneticists outline: Trude Schwarzacher and P...
Plant Chromosomes: European Cytogeneticists outline: Trude Schwarzacher and P...Plant Chromosomes: European Cytogeneticists outline: Trude Schwarzacher and P...
Plant Chromosomes: European Cytogeneticists outline: Trude Schwarzacher and P...
 
Genome to pangenome : A doorway into crops genome exploration
Genome to pangenome : A doorway into crops genome explorationGenome to pangenome : A doorway into crops genome exploration
Genome to pangenome : A doorway into crops genome exploration
 
The Human Genome Project - Part I
The Human Genome Project - Part IThe Human Genome Project - Part I
The Human Genome Project - Part I
 
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
 
Catalyzing Plant Science Research with RNA-seq
Catalyzing Plant Science Research with RNA-seqCatalyzing Plant Science Research with RNA-seq
Catalyzing Plant Science Research with RNA-seq
 
Variant (SNP) calling - an introduction (with a worked example, using FreeBay...
Variant (SNP) calling - an introduction (with a worked example, using FreeBay...Variant (SNP) calling - an introduction (with a worked example, using FreeBay...
Variant (SNP) calling - an introduction (with a worked example, using FreeBay...
 
2015 beacon-metagenome-tutorial
2015 beacon-metagenome-tutorial2015 beacon-metagenome-tutorial
2015 beacon-metagenome-tutorial
 
OKC Grand Rounds 2009
OKC Grand Rounds 2009OKC Grand Rounds 2009
OKC Grand Rounds 2009
 
Genetics of gene expression primer
Genetics of gene expression primerGenetics of gene expression primer
Genetics of gene expression primer
 
Domains of unknown function are essential in yeast
Domains of unknown function are essential in yeastDomains of unknown function are essential in yeast
Domains of unknown function are essential in yeast
 
Introduction to epigenetics and study design
Introduction to epigenetics and study designIntroduction to epigenetics and study design
Introduction to epigenetics and study design
 
Transcriptomics: A time efficient tool for crop improvement
Transcriptomics: A time efficient tool for crop improvementTranscriptomics: A time efficient tool for crop improvement
Transcriptomics: A time efficient tool for crop improvement
 
21 kebere bezaweletaw 207-217
21 kebere bezaweletaw 207-21721 kebere bezaweletaw 207-217
21 kebere bezaweletaw 207-217
 
Epigenetic Analysis Sequencing
Epigenetic Analysis SequencingEpigenetic Analysis Sequencing
Epigenetic Analysis Sequencing
 
Identification, annotation and visualisation of extreme changes in splicing w...
Identification, annotation and visualisation of extreme changes in splicing w...Identification, annotation and visualisation of extreme changes in splicing w...
Identification, annotation and visualisation of extreme changes in splicing w...
 

Kürzlich hochgeladen

Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 

Kürzlich hochgeladen (20)

Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 

Transposable elements of Agavoideae

  • 1. Transposable elements of Agavoideae Kate L Hertweck (@k8hert) The University of Texas at Tyler Alexandros Bousios University of Sussex Michael McKain Donald Danforth Plant Science Center en.wikipedia.org en.wikipedia.org
  • 2. Why Agavoideae? (besides the obvious) ● Asparagaceae subfamily Agavoideae: 23 genera, 637 species ● agave, yucca, Joshua Tree ● Economically important: ● tequila, food starches ● biofuels ● ornamentals ● interesting morphological, ecological, life history traits ● Recent diversification correlated with ecological traits (Good-Avila, 2006) gizmodo.com Hertweck et al., TEs in Agavoideae commons.wikimedia.org
  • 3. Agavoideae genomics ● Emerging genomic/transcriptomic resources ● Polyploidy, bimodality (McKain et al., 2012) ● Variation in TEs (Bousios et al., 2007) and genome size (Zonneveld, 2003) Darlington 1963 Hertweck et al., TEs in Agavoideae Guadelupe et al., 2008
  • 4. Transposable elements as a model system ● TEs, mobile genetic elements, or jumping genes ● Parasitic, self-replicating, move independently in the genome ● Many different types; some similar to or derived from viruses Class I: Retrotransposons (copy and paste) LTR (Gypsy, Copia/Sireviruses, Caulimoviruses) LINE SINE Class II: DNA transposons (cut and paste) TIR (EnSpm, hAT, MuDR, TcMar, PIF) MITE Helitron Hertweck et al., TEs in Agavoideae ● TE proliferation is associated with modifications across the genome, including changes to gene expression and genome size ● TE composition/abundance may interact with organismal changes, like hybridization, polyploidy, phenotype, life history
  • 5. Mine existing genomic resources across Agavoideae to characterize repetitive elements Estimate abundance and diversity of transposable elements (TEs) Cross validate results from different methods The big questions: Is transposon composition in Agavoideae genomes related to hypothesized patterns of genomic evolution? Do transposon proliferation and other genomic traits correlate with life history traits in Agavoideae? Hertweck et al., TEs in Agavoideae Our goals
  • 6. Aphyllanthes Lomandra Sansevieria Asparagus Ledebouria Dichelostemma Agapanthis Allium Haworthia Hosta Scadoxus 0% 10% 20% 30% 40% 50% 60% 70% 0 5000 10000 15000 20000 25000 Agavoideae includes substantial diversity (even by Asparagales standards) Unknown contigs Known repeats Genomesize(Mb/1C) Percentageofsequence readsfromnucleargenome Hertweck, 2013, Genome ● Genomes are difficult to assemble ● Genome size varies
  • 7. Repeat characterization methods Genome survey sequences ● most from MonAToL project (Illumina SE, 30- 100 bp) ● quality control of fastq files with PRINSEQ ● assembled with MaSuRCA v2.3.2 or RepARK v1.3.0 ● organellar sequences filtered with BLAST ● 0.02-0.38x coverage ● 12 taxa, only 8 with sufficient contigs to analyze Scripts available: github.com/k8hertweck/REpipe Hertweck et al., TEs in Agavoideae Nuclear contigs ● assembled contigs are consensus of most abundant TEs in the genome ● TEs must exist in high copy to have sufficient reads for detection (assembly) ● the older a TE insertion, the more likely it has accumulated mutations which will inhibit detection ● data presented as percentage of TE type in nuclear genome (relative abundance) en.wikipedia.org
  • 8. Repeat characterization methods Genome survey sequences Scripts available: github.com/k8hertweck/REpipe Hertweck et al., TEs in Agavoideae Transcriptomes ● various sources, tissues, coverage, assembly methods ● downloaded assemblies (no other filtering) Nuclear contigs ● contigs represent actively transcribed TEs, which may or may not relate to abundance in the genome ● even relatively rare TEs may be detectable ● data presented as percentage of transcripts (relative expressed diversity) en.wikipedia.org
  • 9. Repeat characterization methods Genome survey sequences Scripts available: github.com/k8hertweck/REpipe Hertweck et al., TEs in Agavoideae TranscriptomesNuclear contigs RepeatMasker ● Liliopsida library (mostly references from grasses) ● searches many types of TEs, including parts without genes ● some ambiguous results (same contig, multiple types of TE) Domain searching ● rpstblastn against protein domain models (CDD) for TE-specific genes ● clustering with CD-HIT-EST Repeat contigs Unknown contigs read mapping Wikimedia Commons
  • 10. Detectable repeats vary across species Hertweck et al., TEs in Agavoideae Repeat abundance ● percentage of total reads ● repeat annotations from RepeatMasker ● most reads map to unannotated contigs (or remain unmapped) Repeat diversity ● percentage of nuclear contigs ● annotations from RepeatMasker ● most contigs are LTRs ● transcriptomes represent broader variation in diverse TEs (because of the overall number of contigs) GSS transcriptome
  • 11. Sampled taxa possess same diversity of DNA TE families, but at different abundance Hertweck et al., TEs in Agavoideae GSS data ● percentage of nuclear genome ● annotations from RepeatMasker ● most taxa have a single family present in high abundance ● may reflect karyotype Transcriptome data ● percentage of contigs ● annotations from RepeatMasker ● all families present (active?) in all taxa ● minor variation in family-level diversity for some taxa ● not incongruent with GSS data
  • 12. Patterns of LTR abundance rely on annotation method Hertweck et al., TEs in Agavoideae ● Gypsy more abundant in most genomes, although proportions vary ● no relationship with LTR abundance and genome size ● including CDD annotations can double LTR abundance in some genomes ● Proportion of Copia:Gypsy remains same for some taxa (Schoenolirion), but changes for others (Hosta) ● LTR diversity (numbers of contigs) shows similar patterns tetraploid, largest (known) genome in dataset
  • 13. Hertweck et al., TEs in Agavoideae Conclusions ● Mine existing genomic resources across Agavoideae to characterize repetitive elements ● Methods matter; bias is not evenly distributed and patterns difficult to discern ● Low proportion of GSS data assemble for Agavoideae ● large numbers of ancestral (inactive) insertions, related to whole genome duplication event? ● low-level diversity in abundant TEs just different enough from available libraries to remain undetectable ● DNA transposon dominance may differ among clades ● Gypsy more abundant in most genomes
  • 14. Hertweck et al., TEs in Agavoideae Future work ● Future work: ● Improve annotations (build custom repeat libraries) and analyze TE subtaxonomy ● improve quantification of repeats (P-clouds, RepeatExplorer) ● validate results using multiple sequencing attempts/data types ● Big questions: ● Is transposon composition in Agaviodeae genomes related to hypothesized patterns of genomic evolution? ● Do transposon proliferation and other genomic traits correlate with life history traits in Agavoideae?
  • 15. Acknowledgements MonAToL Texas Advanced Computing Center (TACC) National Evolutionary Synthesis Center (NESCent, Duke U) Research https://sites.google.com/site/k8hertweck Blog: k8hert.blogspot.com Twitter @k8hert Google+ k8hertweck@gmail.com