This document summarizes a presentation on the evolution of symmetry in protein structures. It discusses how CE-Symm, a software tool, can accurately detect internal symmetry in proteins. The presentation finds that 18% of SCOP superfamilies exhibit symmetry, with β proteins having the highest percentage at 24.6%. It also discusses how identifying symmetric subdomains can help find "protodomains" that are building blocks of protein domains and have spread across evolution. The use of symmetry to study protein evolution is demonstrated through several examples.
OECD bibliometric indicators: Selected highlights, April 2024
CE-Symm, protein symmetry, and the evolution of protein folds
1. The evolution of symmetry in protein
fold space
Presenter: Douglas Myers-Turnbull
undergraduate student, bioinformatics
Systematic detection of internal symmetry in proteins
using CE-Symm
Douglas Myers-Turnbulla, Spencer E. Blivenb, Peter W. Rosec, Zaid K.
Azidd, Philippe Youkharibachee, Philip E. Bournef,*, Andreas Prlicc,*
Journal of Molecular Biology, under second-pass review.
4. Why is symmetry so common?
Enzymatic function
Lowest energy state
5. Why is symmetry so common?
Enzymatic function
Lowest energy state
Fewest kinetic barriers
6. Why is symmetry so common?
Enzymatic function
Lowest energy state
Fewest kinetic barriers in folding
Easier to evolve complex structures from simple
building blocks
8. Building blocks of domains
Modular Evolution and the Origins of Symmetry:
“…symmetric protein structures can be constructed
from a set of basic ‘building blocks’ or subdomain
modules.”
9. What is a protodomain?
A building block for domains
10. What is a protodomain?
A building block for domains
A subdomain that occurs across distant folds
11. What is a protodomain?
A building block for domains
A subdomain that occurs across distant folds
Unlikely to have arisen by chance
19. Structural classification of proteins
SCOP: class→fold→superfamily→family→domain
Different superfamilies of the same fold often have
substantial differences in structure
21. Normalization by superfamilies
Normalize by number of domains per superfamily
A superfamily is symmetric if more than half of its
domains are symmetric.
22. A census of symmetry
SCOP class number of SFs % symmetric (SFs)
α 503 18.5%
β 354 24.6%
α/β 244 16.8%
α+β 549 14.3%
membrane 108 23.8%
overall 1824 18.0%
Complete results available at: http://source.rcsb.org
23. A census of symmetry
SCOP class number of SFs % symmetric (SFs)
α 503 18.5%
β 354 24.6%
α/β 244 16.8%
α+β 549 14.3%
membrane 108 23.8%
overall 1824 18.0%
Complete results available at: http://source.rcsb.org
24. A census of symmetry
SCOP class number of SFs % symmetric (SFs)
α 503 18.5%
β 354 24.6%
α/β 244 16.8%
α+β 549 14.3%
membrane 108 23.8%
overall 1824 18.0%
Complete results available at: http://source.rcsb.org
35. Identifying protodomains systematically
1. Identify subdomains of symmetric structures with
CE-Symm
2. Identify hits against other domains
3. Derive a non-redundant set of protodomains
37. Acknowledgments
Dr. Andreas Prlic, San Diego Supercomputer Center
Spencer Bliven, Bioinformatics and Systems Biology
Dr. Peter Rose, Skaggs School of Pharmacy
Zaid Aziz, Chemistry and Biochemistry
Dr. Philippe Youkharibache, Life Sciences R&D
Dr. Phil Bourne, Skaggs School of Pharmacy
43. Two methods for order-detection
Method 1: apply alignment repeatedly until the
composition becomes approximately the identity
Method 2: identify the lowest difference ε(θ):