Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Talk by Jonathan Eisen for GSAC2000 on "Phylogenomics"
1. TIGRTIGR
Phylogenomics:
Combining Evolutionary
Reconstructions and Genome
Analysis into a Single
Composite Approach
0
250000
500000
750000
1000000
1250000
SubjectOrfPosition
0 250000 500000 750000 1000000 1250000
Query Orf Position
Mycobacterium tuberculosis
Bacillus subtilis
Synechocystis sp.
Caenorhabditis elegans
Drosophila melanogaster
Saccharomyces cerevisiae
Methanobacterium thermoautotrophicum
Archaeoglobus fulgidus
Pyrococcus horikoshii
Methanococcus jannaschii
Aeropyrum pernix
Aquifex aeolicus
Thermotoga maritima
Deinococcus radiodurans
Treponema pallidum
Borrelia burgdorferi
Helicobacter pylori
Campylobacter jejuni
Neisseria meningitidis
Escherichia coli
Vibrio cholerae
Haemophilus influenzae
Rickettsia prowazekii
Mycoplasma pneumoniae
Mycoplasma genitalium
Chlamydia trachomatis
Chlamydia pneumoniae
0.05 changes
Archaea
Bacteria
Eukarya
Tmf-penden
R-rubrum3
Azs-brasi2
Rm-vanniel
Rhb-legum8
Bdr-japoni
Spg-capsul
Ric-prowaz
Ste-maltop
Spr-voluta
Rub-gelat2
Rcy-purpur
Nis-gonor1
Hrh-halc
h2
Alm
-vin
osm
Ps-aerugi3
E-coliMyx-xanthu
Bde-stolpiDsv-desulfDsb-postgaC-leptum
C-butyric4
C-pasteuri
Eub-barker
C-quercico
Hel-chlor2
Acp-laidla
M-capricol
C-ramosum
B-stearoth
Eco-faecal
Lis-monoc3
B-cereus4
B-subtilis
Stc-therm3
L-delbruck
L-casei
Fus-nuclea
Glb-violac
Olst-lut_CZeamaysC
Nost-muscr
Syn-6301
Tnm
-lapsum
Flx-litora
Cy-lytica
Emb-brevi2
Bac-fragil
Prv-rumcol
Prb-difflu
Cy-hutchin
Flx-canada
Sap-grandi
Chl-limico
Wln-succi2
Hlb-pylor6
Cam-jejun5Stm-ambofa
Arb-globif
Cor-xerosi
Bif-bifidu
Cfx-aurant
Tmc-roseum
Aqu-pyroph
env-SBAR12
env-SBAR16
Msr-barker
Tpl-acidop
Msp-hungat
Hf-volcani
Mb-formici
Mt-fervid1
Tc-celer
Arg-fulgid
Mpy-kandl1
Mc-vanniel
Mc-jannasc
env-pJP27
Sul-acalda
Thp-tenax
env-pJP89
Tt-maritim
Fer-island
Mei-ruber4
D-radiodur
Chd-psitta
Acbt-capsl
env-MC18
Pir-staley
Lpn-illini
Lps-interKSpi-stenos
Trp-pallidBor-burgdo
Spi-haloph
Brs-hyodys
Fib-sucS85
Tmf-penden
R-rubrum3
Azs-brasi2
Rm-vanniel
Rhb-legum8
Bdr-japoni
Spg-capsul
Ric-prowaz
Ste-maltop
Spr-voluta
Rub-gelat2
Rcy-purpur
Nis-gonor1
Hrh-halch2
Alm
-vinosm
Ps-aerugi3
E-coliMyx-xanthu
Bde-stolpiDsv-desulfDsb-postgaC-leptum
C-butyric4
C-pasteuri
Eub-barker
C-quercico
Hel-chlor2
Acp-laidla
M-capricol
C-ramosum
B-stearoth
Eco-faecal
Lis-monoc3
B-cereus4
B-subtilis
Stc-therm3
L-delbruck
L-casei
Fus-nuclea
Glb-vio
lac
Olst-lut_CZeamaysC
Nost-muscr
Syn-6301
Tnm
-lapsum
Flx-litora
Cy-lytica
Emb-brevi2
Bac-fragil
Prv-rumcol
Prb-difflu
Cy-hutchin
Flx-canada
Sap-grandi
Chl-limico
Wln-succi2
Hlb-pylor6
Cam-jejun5Stm-ambofa
Arb-globif
Cor-xerosi
Bif-bifidu
Cfx-aurant
Tmc-roseum
Aqu-pyroph
env-SBAR12
env-SBAR16
Msr-barker
Tpl-acidop
Msp-hungat
Hf-volcani
Mb-formici
Mt-fervid1
Tc-celer
Arg-fulgid
Mpy-kandl1
M
c-vanniel
Mc-jannasc
env-pJP27
Sul-acalda
Thp-tenax
env-pJP89
Tt-maritim
Fer-island
Mei-ruber4
D-radiodur
Chd-psitta
Acbt-capsl
env-MC18
Pir-staley
Lpn-illini
Lps-interKSpi-stenos
Trp-pallidBor-burgdo
Spi-haloph
Brs-hyodys
Fib-sucS85
Bacteria Archaea Bacteria Archaea
A.rRNAtreeofBacterialandArchaealMajorGroups B.GroupswithCompletedGenomesHighlighted
A
B
CD
E
F
A
B
CD
E
F
A
B
C
D
E
F
A
B
C
D
E
F
A ’
B’
C’
D’
E ’
F’
A
B
C
D
E
F
A’
B’
C’
D’
E ’
F ’
A
C
D
F
A’
B’
E’
E. coli
E. coli
B
C
D
F
A’
B’
D’
E ’
V. cholerae
A
B
C
D
E
F
A ’
B’
C’
D’
E ’
F’
B1
A1
B2
A2
B3
A3
A2
A1 A2
A3
B2
B1
B3
B2
24
23
22
21
20
19
18171615
14
13
12
11
10
9
6
7
258
26
27
28
29
30
1 2
3
4
5
31 32
B1
3132
6
7
8
9
10
11
12
13
14
15161718
19
20
21
22
23
24
25
26
27
28
29
30
1 2
3
4
5
3132
B3 24
23
22
21
20
19
18171615
14
13
12
11
10
9
6
7
258
26
27
28
29
3
32 31
30
4
5
2 1
A1
3132
6
7
8
9
10
11
12
13
14
15161718
19
20
21
22
23
24
25
26
27
28
29
30
1 2
3
4
5
3132
A2
31 32
6
7
8
9
10
11
12
13
19
18171615
14
20
21
22
23
24
25
26
27
28
29
30
1 2
3
4
5
31 32
A 3
2
6
7
8
9
10
11
12
13
19
18171615
14
20
21
22
23
24
25
26
27
5
4
3
31
30
29
28
1 32
B2
Inversion
Around
Terminus (*)
Inversion
Around
Terminus (*)
Inversion
Around
Origin (*)
Inversion
Around
Origin (*)
* *
* *
* *
* *
Figure 4
C ommon
Ancestor of
A and B
31 32
6
7
8
9
10
11
12
13
14
15161718
19
20
21
22
23
24
25
26
27
28
29
30
1 2
3
4
5
31 32
Three V. cholerae
Photolyases
Phr.S thyp
PHR E. coli
O R FA0 0965*********
phr.neucr
Phr.Tricho
Phr.Yeast
Phr.B firm
phr.strpy
phr.haloba
PHR STRGR
p C RY1.huma
p hr.mouse
phr2.human
phr2.mouse
phr.drosop
p hr3.Sy nsp
O RF02295.Vib ch********
phr.neigo
O R F01792.Vib ch*******
Phr.Adiant
Phr2.Adian
Phr3.Adian
phr.tomato
C RY1 ARATH
phr.p hy com
C RY2 ARATH
PHH1.arath
PHR1 SINAL
phr.chlamy
PHR ANANI
phr.Sy nsp
PHR SYNY3
phr.Theth
Rh.caps
M TH F type
C la ss I CP D
Photolya se s
6-4
P hotolya se s
Blue
Light
R e ce ptors
8-HD F type
CPD
P hotolya se s
Three P hotoly ase H om ologs inV . chole ra e
UvrA2
UvrA2 S. coelicolor
DrrC S. peuce teus
UvrA2 D. radiodurans
Duplication
inUvrA
family
UvrA1
UvrA H. influenzae
UvrA E. coli
UvrA N. gonorrhoaea
UvrA R. prowazekii
UvrA S. mutans
UvrA S. pyogene s
UvrA S. pneumoniae
UvrA B. subtilis
UvrA M. luteus
UvrA M. tuberculosis
UvrA M. hermoautotrophicum
UvrA H. pylori
UvrA C. jejuni
UvrA P. gingivalis
UvrA C. tepidum
uvra1 D. radiodurans
UvrA T. thermophilus
UvrA T. pallidum
UvrA B. burgdorefi
UvrA T. maritima
UvrA A. aeolicus
UvrA Synechocystis sp.
UvrA1
UvrA2
OppDF
UUP
NodI
LivF
XylG
NrtDC
PstB
MDR
HlyB
TAP1
CFTR, SUR
A. ABC Transporters B. UvrA Subfamily
0
10
20
30
40
50
60
0 5 10 15 20
0
50
100
150
0 5 10 15 20
NumberofSpeciesWithHighHits
0
50
100
150
200
250
Frequency
0 5 10 15 20
PapaBear MamaBear BabyBear
0
100
200
300
400
500
0 5 10 15 20
E.coli
2. TIGRTIGR
Wolbachia Genome Project
• Information is available through TIGRs
unfinished genomes service (go to
www.tigr.org to find out more)
• Also, additional information will soon be
available on www.wolbachia.org
7. TIGRTIGR
Uses of Evolutionary Analysis in
Molecular Biology
• Identification of mutation patterns (e.g., ts/tv ratio)
• Amino-acid/nucleotide substitution patterns useful in
structural studies (e.g., rRNA)
• Sequence searching matrices (e.g., PAM, Blosum)
• Motif analysis (e.g., Blocks)
• Functional predictions
• Classifying multigene families
• Evolutionary history puts other information into
perspective (e.g., duplications, gene loss)
• HIV mutation patterns and classification
TIGRTIGR
8. TIGRTIGR
Evolutionary Studies Improve
Most Aspects of Genome Analysis
• Phylogeny of species places comparative data in perspective
• Evolution of genes and gene families
– Functional predictions
– Identification of orthologs and paralogs
– Species specific mutation patterns
• Evolution of pathways
– Convergence
– Prediction of function
• Evolution of gene order/genome rearrangements
• Phylogenetic distribution patterns
• Identification of novel features
9. TIGRTIGR
Genome Information and Analysis
Improves Studies of Evolution
• Complete genome information particularly useful
• Unbiased sampling
• More sequences of genes
• Presence/absence information needed to infer
certain events (e.g., gene loss, duplication)
• Genome wide mutation and substitution patterns
(e.g., strand bias)
• Diversification and duplication
10. TIGRTIGR
Phylogenomic Analysis
• There are feedback loop between evolutionary and
genome analysis such that for many studies,
genome and evolutionary analyses are
interdependent.
• Therefore, I have proposed that they actually be
combined into a single composite approach I refer
to as phylogenomics
• Phylogenomics involves combining evolutionary
reconstructions of genes, proteins, pathways, and
species with analysis of complete genome
sequences.
11. TIGRTIGR
Outline of Phylogenomics
Gene Evolution Events
Phenotype Predictions
Database
Species tree Presence/AbsenceGene trees
Congruence Evol. Distribution
F(x) Predictions
Pathway Evolution
TIGRTIGR
19. TIGRTIGR
Why Duplications Are Useful to Identify
• Allows division into orthologs and paralogs
• Aids functional predictions
• Recent duplications may be indicative of species’
specific adaptations
• Helps identify mechanisms of duplication
• Can be used to study mutation processes in
different parts of genome
20. TIGRTIGR
Expansion of MCP Family in V. cholerae
E.coligi1787690
B.subtilisgi2633766
Synechocystissp. gi1001299
Synechocystissp. gi1001300
Synechocystissp. gi1652276
Synechocystissp.gi1652103
H.pylori gi2313716
H.pylori99 gi4155097
C.jejuniCj1190c
C.jejuniCj1110c
A.fulgidusgi2649560
A.fulgidusgi2649548
B.subtilisgi2634254
B.subtilisgi2632630
B.subtilisgi2635607
B.subtilisgi2635608
B.subtilisgi2635609
B.subtilisgi2635610
B.subtilisgi2635882
E.coligi1788195
E.coligi2367378
E.coligi1788194
E.coligi1789453
C.jejuniCj0144
C.jejuniCj0262c
H.pylori gi2313186
H.pylori99 gi4154603
C.jejuniCj1564
C.jejuniCj1506c
H.pylori gi2313163
H.pylori99 gi4154575
H.pylori gi2313179
H.pylori99 gi4154599
C.jejuniCj0019c
C.jejuniCj0951c
C.jejuniCj0246c
B.subtilisgi2633374
T.maritima TM0014
T.pallidumgi3322777
T.pallidumgi3322939
T.pallidumgi3322938
B.burgdorferi gi2688522
T.pallidumgi3322296
B.burgdorferi gi2688521
T.maritima TM0429
T.maritima TM0918
T.maritima TM0023
T.maritima TM1428
T.maritima TM1143
T.maritima TM1146
P.abyssiPAB1308
P.horikoshiigi3256846
P.abyssiPAB1336
P.horikoshiigi3256896
P.abyssiPAB2066
P.horikoshiigi3258290
P.abyssiPAB1026
P.horikoshiigi3256884
D.radiodurans DR A00354
D.radiodurans DRA0353
D.radiodurans DRA0352
P.abyssiPAB1189
P.horikoshiigi3258414
B.burgdorferi gi2688621
M.tuberculosisgi1666149
V .c hole ra eV C0 5 1 2
V . c hol e ra eV CA1 0 3 4
V .c hole ra eV CA 0 9 7 4
V .c hole raeV CA 0 06 8
V . chol e ra eV C0 8 2 5
V . c hol e ra eV C0 28 2
V .c hol e raeV CA 0 9 0 6
V . chol e ra eV CA0 9 7 9
V .c hol e raeV CA 1 0 5 6
V . c hol e ra eV C1 64 3
V . c hol e ra eV C2 1 6 1
V .c hole ra eV CA 09 2 3
V .c hole raeV C0 5 1 4
V . c hol e ra eV C1 8 6 8
V . c hol era eV CA0 7 7 3
V .c hole raeV C1 3 1 3
V . c hol era eV C1 8 5 9
V . c hole ra eV C14 1 3
V .c hol e raeV CA 0 2 6 8
V .c hol e raeV CA0 6 5 8
V . c hole ra eV C14 0 5
V . c hol e ra eV C1 2 9 8
V . c hol e ra eV C1 2 4 8
V . c hol era eV CA0 8 6 4
V . c hole ra eV CA0 1 7 6
V. c hol e ra eV CA0 2 2 0
V .c hole ra eV C1 2 8 9
V .c hole ra eV CA 10 6 9
V . c hol e ra eV C2 43 9
V . chol e ra eV C1 9 6 7
V . chol e ra eV CA0 0 3 1
V . c hole ra eV C18 9 8
V . chol e ra eV CA0 6 6 3
V .c hole ra eV CA 0 9 8 8
V . c hol era eV C0 2 1 6
V . c hol era eV C0 4 4 9
V .c hole ra eV CA 0 0 0 8
V . c hole ra eV C14 0 6
V . chol e ra eV C1 5 3 5
V .c hole ra eV C0 8 4 0
V . c hol e raeV C0 0 98
V .c hole ra eV CA 1 0 9 2
V .c hole ra eV C1 4 0 3
V .c hole ra eV CA1 0 8 8
V . c hol e ra eV C1 3 9 4
V .c hole ra eV C0 6 2 2
NJ
* *
* *
* *
*
* *
* *
* *
* *
* *
*
* *
* *
* *
* *
*
* *
* *
* *
* *
* *
* *
* *
* *
* *
* *
*
* *
* *
* *
* ** *
* *
*
*
*
*
* *
*
* *
* *
* *
*
* *
* *
*
29. TIGRTIGR
Why Gene Loss is Useful to Identify
• Indicates that gene is not absolutely required for
survival
• Correlated loss of same gene in different species
may indicate selective advantage of loss of that
gene
• Correlated loss of genes in a pathway suggests a
conserved association among those genes
30. TIGRTIGR
EuksArch Bacteria
Loss
Evolutionary O rigin of Gene
MT MJ SC HS AA DR TA BS MG MP BB TP HP HI EC SS MT
Presence ( ) or Absence of Gene
Species Abbreviation
Kingdom
Example of Tracing Gene Loss
TIGRTIGR
32. TIGRTIGR
Need for Phylogenomics Example:
Gene Duplication and Loss
• Genome analysis required to determine number of
homologs in different species
• Evolutionary analysis required to divide into
orthology groups and identify gene duplications
• Genome analysis is then required to determine
presence and absence of orthologs
• Then loss of orthologs can be traced onto
evolutionary tree of species
34. TIGRTIGR
V. cholerae vs. E. coli All Hits
0
1000000
2000000
3000000
4000000
5000000E.coliCoordinates
0 1000000 2000000 3000000
V. cholerae CoordinatesTIGRTIGR
35. TIGRTIGR
V. cholerae vs. E. coli Top Hits
0
1000000
2000000
3000000
4000000
5000000
E.coliCoordinates
0 1000000 2000000 3000000
V. cholerae CoordinatesTIGRTIGR
36. TIGRTIGR
V. cholerae vs. E. coli
Only if EC-Orf is Closest in All Genomes
0
1000000
2000000
3000000
4000000
5000000
E.coliCoordinates
0 1000000 2000000 3000000
V. cholerae Coordinates
TIGRTIGR
37. TIGRTIGR
V. cholerae vs. E. coli F+R
0
1000000
2000000
3000000
4000000
5000000
Bert
Ecoli R
Ecoli
44. TIGRTIGR
Why Gene Transfers Are Useful to Identify
• Laterally transferred genes frequently involved in
environmental adaptations and/or pathogenicity
• Helps identify transposons, integrons, and other
vectors of gene transfer
• Helps identify species associations in the
environment
• Prediction of organellar targeting of nuclear
encoded genes
47. TIGRTIGR
Best Matches Per ORF
0
0.05
0.1
0.15
0.2
0.25
0.3
BM/Orfs
CHLTE
PORGI
BACSU
MCYTU
BBUR
TREPA
CHLPN
ECOLI
NEIME
RICPR
CAUCR
HELPY
SYNSP
AQUAE
DEIRA
THEMA
AERPE
ARCFU
METJA
METTH
PYRAB
CELEG
YEAST
DROME
B A E
48. TIGRTIGR
Possible Plastid ORFs
• So far, over 900 ORFs are candidates for being
derived from the plastid genome some time in the
past
– 50 have best match to plastid genomes from other plants
– more than 800 have best match to Syn. sp. complete
genome
– 100 have best matches to proteins from incomplete
cyanobacterial genomes but no match to the proteins from
Syn. sp.
– incredible diversity of putative functions as well as many
conserv hypothetical
52. TIGRTIGR
Acknowledgements
• Genome duplications: S. Salzberg, J. Heidelberg, O.
White, A. Stoltzfus, J. Peterson
• Genome sequences and analysis: J. Heidelberg, T.
Read, H. Tettelin, K. Nelson, J. Peterson, R.
Fleischmann, D. Bryant
• Horizontal transfers: K. Nelson, W. F. Doolittle
• TIGR: C. Fraser, J. Venter, M-I. Benito, S. Kaul,
Seqcore
• $$$: DOE, NSF, NIH, ONR
53. TIGRTIGR
True Phylogenetic Methods
Work Best
MutS2.Syns
MutS2.Bacs
MutS2.Help
MutS2.Deir
Mutsl.Mett
MSH4.Celeg
MSH4.Yeast
MSH4.human
mMutS.Saco
MSH3.yeast
C23C11.Spo
MSH1.Yeast
MSH3.Human
REP1.Mouse
GTBP.Mouse
GTBP.Human
MSH6.Yeast
MSH5.Human
MSH5.Celeg
MSH5.Yeast
MSH2.Human
MSH2.Mouse
MSH2.Yeast
MutS.Ecoli
MutS.Synsp
MutS.Deira
MutS.Bacsu
M utS.Ecoli
M utS.Synsp
M utS.B acsu
M utS.Deira
M SH 2.H uman
M SH 2.M ouse
M SH 2.Yeast
M SH 3.H uman
R EP1.M ouse
G TB P.M ouse
G TB P.H uman
M SH 6.Yeast
C 23C 11.Sp o
M SH 1.Yeast
M SH 3.yeast
M SH 4.C eleg
M SH 4.human
M SH 5.C eleg
M SH 5.Yeast
mM utS.Saco
M SH 5.H uman
M SH 4.Yeast
M utS2.Syns
M utS2.B acs
M utS2.Deir
M utS2.H elp
M utsl.M ett
UPGMANeighbor-Joining
58. TIGRTIGR
TIGTIG
RR
OtherOther
peoplepeople
Mom and DadMom and Dad
S. KarlinS. Karlin
M. FeldmanM. Feldman
A. M. CampbellA. M. Campbell
R. FernaldR. Fernald
R. ShaferR. Shafer
D. AckerlyD. Ackerly
D. GoldsteinD. Goldstein
M. EisenM. Eisen
J. CourcelleJ. Courcelle
R. MyersR. Myers
C. M. CavanaughC. M. Cavanaugh
P. HanawaltP. Hanawalt
NSFNSF
J. HeidelberJ. Heidelber
T.ReadT.Read
S. KaulS. Kaul
M-I BenitoM-I Benito
J. C. VenterJ. C. VenterC. FraseC. Fraser
S. SalzbergS. Salzberg
O. WhiteO. White
K. NelsonK. Nelson
$$$$$$
ONRONR
DOEDOE
NIHNIH
H. TettelinH. Tettelin
59. TIGRTIGR
Figure 4. Best matches by role
categoryHits By Role Category
0.00
0.10
0.20
0.30
0.40
0.50
0.60
0.70
0.80
Role Category
Arc/Bac
64. TIGRTIGR
Wolbachia sp.
• Endosymbionts of many invertebrates
• We are sequencing the symbiont of the fruit
fly and a parasite Brugia malayi.
• Some of required for host survival, some
are pathogens (e.g., in wasps, Wolbachia
kill all male offspring)
• Member of the α-Proteobacteria
65. TIGRTIGR
Methylococcus capsulatus
• Member of the γ-Proteobacteria
• Grows on methane
• Collaboration with University of Bergen in
Norway who are working on using this
species as Salmon feed
66. TIGRTIGR
Figure 6. Citrate lyase domains
Human Citrate lyase
CelegCitrate lyase
Cyanophora
A. thaliana F5E6.2 chrIII
Chlorobium 1164
Spombe
Citrate synthase I
Succinyl Co-A Synthase alpha
Succinyl Co-A Synthase beta
Chlorobium 2435
A. thaliana - many
Bacteria
Eukaryotic mitochondria
Chlorobium 3733
Bacteria
Chlorobium 1167
A. thaliana - many
Bacteria
67. TIGRTIGR
Chlorobium tepidum Strain TLS
C. tepidum mat in highly sulfidic
“Travelodge Stream”,
Rotorua, New Zealand
(from Castenholz and Pierson, 1995)
Phase contrast photomicrograph
of the 48-hours culture and electron
micrograph of thin cell section
(from Wahlund et al, 1991)
68. TIGRTIGR
N
N N
N
O
H 3 C
H 3 C
C HOH
C H 3
H 3C
O
O
M g
C H 3
C H 2 C H 2C H 3
C H 2 C H 3
H
H
C H 3
or
C H 2 C H(CH3 )2
C H 2 C (CH3 )3
ethyl
propyl
isobutyl
neopentyl
farnesyl
C H 2 C H 3
N
N N
N
O
H 3
C
C H 3H 3C
C H 2CH 3
O
O
M g
CH 3
COO C H3
CH
CH 2
N
N N
N
O
H 3
C
C H 3H 3C
C H 2CH 3
C
C H 3
O
O
M g
CH 3
COO C H3
O
Chlorophylls Found in
Chlorobium tepidum
Chlorophyll a-670Bacteriochlorophyll a
Bacteriochlorophyll c
∆-2, 6 phytadienol
phytol
69. TIGRTIGR
Protein Duplications
• Of 14,881 ORFs with matches to a
complete genome, 13,092 have a best match
to another A. thaliana ORF
• Two major classes
– tandem duplications
– large chromosomal duplications
70. TIGRTIGR
Best Matches to Complete Genomes
0
1000
2000
3000
4000
BestMatches
CHLTE
PORGI
BACSU
MCYTU
BBUR
TREPA
CHLPN
ECOLI
NEIME
RICPR
CAUCR
HELPY
SYNSP
AQUAE
DEIRA
THEMA
AERPE
ARCFU
METJA
METTH
PYRAB
CELEG
YEAST
DROME
B A E
75. TIGRTIGR
Evolution of Uracil Glycosylase
• Ung activity has evolve many times (many non-
homologous proteins have uracil-DNA glycosylase
activity)
• Therefore, absence of homologs of these genes
should not be used to infer likely absence of
activity
• However, presence of homologs of Ung and MUG
genes can be used to indicate presence of activity
because all homologs of these genes have this
activity
76. TIGRTIGR
Evolution of Photoreactivation
• All known enzymes that perform photoreactivation are part of
a single large photolyase gene family
• Some members of the family do not function as photolyases,
but instead work as blue-light receptors
• If a species does not encode a member of the photolyase gene
family, it likely does not have photoreactivation capability
• If a species encodes a photolyase, one cannot conclude it has
photolyase activity
• Position of photolyase homologs within photolyase tree helps
predict what activities they have
77. TIGRTIGR
Evolution of Alkyltransferases
• All known alkyltransferases share a conserved,
homologous alkyltransferase domain
• Therefore, if a species does not encode any
protein with this domain, it likely does not have
alkyltransferase activity
• If a species does encode an member of this gene
family, it likely has alkyltransferase activity
78. TIGRTIGR
Examples of Horizontal Transfers
• Antibiotic resistance genes on plasmids
• Insertion sequences
• Pathogenicity islands
• Toxin resistance genes on plasmids
• Agrobacterium Ti plasmid
• Viruses and viroids
• Organelle to nucleus transfers
79. TIGRTIGR
Archaeal genes in bacterial genomesArchaeal genes in bacterial genomes**
Bacterial speciesBacterial species Best hits to ArchaealBest hits to Archaeal
Thermotoga maritimaThermotoga maritima 451 (24%)451 (24%)
Aquifex aeolicusAquifex aeolicus 246 (16%)246 (16%)
SynechocystisSynechocystis sp.sp. 126 (4%)126 (4%)
Borrelia burgdorferiBorrelia burgdorferi 45 (3.6%)45 (3.6%)
Escherichia coliEscherichia coli 99 (2.3%)99 (2.3%)
** 1010-5-5
over 60% of sequenceover 60% of sequence
80. TIGRTIGR
• Possibilitiy of gene transfer criticized
because of possibility of shared descent
• C. tepidum – green sulfur bacteria – 15-20%
• C. hydrogenoformans – low GC gram + -
~25%