SlideShare ist ein Scribd-Unternehmen logo
1 von 41
1
Protein Structure, Structure
Classification and Prediction
Bioinformatics X3
January 2005
P. Johansson, D. Madsen
Dept.of Cell & Molecular Biology,
Uppsala University
2
Overview
• Introduction to proteins, structure & classification
• Protein Folding
• Experimental techniques for structure determination
• Structure prediction
3
4
Proteins
• Proteins play a crucial role in virtually all biological processes
with a broad range of functions.
• The activity of an enzyme or the function of a protein is
governed by the three-dimensional structure
5
20 amino acids - the building blocks
6
The Amino Acids
7
Hydrophilic or hydrophobic..?
• Virtually all soluble proteins feature a hydrophobic core
surrounded by a hydrophilic surface
• But, peptide backbone is inherently polar ?
• Solution ; neutralize potential H-donors & acceptors using
ordered secondary structure
8
Secondary Structure: a-helix
9
• 3.6 residues / turn
• Axial dipole moment
• Not Proline & Glycine
• Protein surfaces
Secondary Structure: a-helix
10
Secondary Structure: b-sheets
11
Secondary Structure: b-sheets
• Parallel or antiparallel
• Alternating side-chains
• No mixing
• Loops often have polar amino acids
12
Structural classification
• Databases
– SCOP, ’Structural Classification of Proteins’,
manual classification
– CATH, ’Class Architecture Topology Homology’, based
on the SSAP algorithm
– FSSP, ’Family of Structurally Similar Proteins’, based
on the DALI algorithm
– PClass, ’Protein Classification’
based on the LOCK and 3Dsearch algorithms
13
Structural classification, CATH
• Class, four types :
– Mainly a
– a / b structures
– Mainly b
– No secondary structure
• Arhitecture (fold)
• Topology (superfamily)
• Homology (family)
14
Structural classification..
15
Structural classification..
• Two types of algorithms
– Inter-Molecular, 3D, Rigid Body ; structural alignment in a
common coordinate system (hard) e.g. VAST, LOCK.. alg.
– Intra-Molecular, 2D, Internal Geometry ; structural
alignment using internal distances and angles e.g. DALI,
STRUCTURAL, SSAP.. alg.
16
Structural classification, SSAP
• SSAP, ‘Sequential Structure Alignment Program’
Basic idea ; The similarity between residue i in molecule A
and residue k in molecule B is characterised in terms of their
structural surroundings
This similarity can be quantified into a score, Sik
Based on this similarity score and some specified gap penalty,
dynamic programming is used to find the optimal structural
alignment
17
Structural classification, SSAP
The structural neighborhood of residue i in A compared
to residue k in B
i k
18
Structural classification, SSAP..
Distance between residue i & j in molecule A ; dA
i,j
Similarity for two pairs of residues, i j in A & k l in B ;
,
,
b
d
d
a
s B
kl
A
ij
kl
ij


 a,b constants
Similarity between residue i in A and residue k in B ;


 
 


n
n
m
B
m
k
k
A
m
i
i
k
i
b
d
d
a
S
,
,
,
Idea ; Si,k is big if the distances from residue i in A to the 2n
nearest neighbours are similar to the corresponding distances
around k in B
19
Structural classification, SSAP..
This works well for small structures and local structural
alignments - however, insertions and deletions cause problems
 unrelated distances
HSERAHVFIM..
GQ-VMAC-NW..
i=5
k=4
A :
B :
- The real algorithm uses Dynamic programming on two levels,
first to find which distances to compare  Sik, then to align the
structures using these scores
20
Experimental techniques for structure
determination
• X-ray Crystallography
• Nuclear Magnetic Resonance
spectroscopy (NMR)
• Electron Microscopy/Diffraction
• Free electron lasers ?
21
X-ray Crystallography
22
X-ray Crystallography..
• From small molecules to viruses
• Information about the positions of
individual atoms
• Limited information about
dynamics
• Requires crystals
23
24
NMR
• Limited to molecules up to ~50kDa
(good quality up to 30 kDa)
• Distances between pairs of
hydrogen atoms
• Lots of information about dynamics
• Requires soluble, non-aggregating
material
• Assignment problem
25
Electron Microscopy/ Diffraction
• Low to medium resolution
• Limited information about
dynamics
• Can use very small crystals
(nm range)
• Can be used for very large
molecules and complexes
26
27
Structure Prediction
GPSRYIV…
?
28
Protein Folding
• Different sequence  Different
structure
• Free energy difference small due
to large entropy decrease,
DG = DH - TDS
29
Structure Prediction
Why is structure prediction and especially ab
initio calculations hard..?
• Many degrees of freedom / residue
• Remote noncovalent interactions
• Nature does not go through all conformations
• Folding assisted by enzymes & chaperones
30
Ab initio calculations used
for smaller problems ;
• Calculation of affinity
• Enzymatic pathways
Molecular dynamics
31
Sequence Classification rev.
• Class : Secondary structure content
• Fold : Major structural similarity.
• Superfamily : Probable common
evolutionary origin.
• Family : Clear evolutionary relationship.
32
• Search sequence data banks for homologs
• Search methods e.g. BLAST, PSIBLAST,
FASTA…
• Homologue in PDB..?
Structure Prediction
IVTY…PGGG HYW…QHG
33
Multiple sequence / structure alignment
• Contains more information than a single sequence
for applications like homology modeling and
secondary structure prediction
• Gives location of conserved parts
and residues likely to be buried in
the protein core or exposed to solvent
Structure Prediction
34
HFD fingerprint
Multiple alignment example
35
• Statistical Analysis (old fashioned):
– For each amino acid type assign it’s ‘propensity’
to be in a helix, sheet, or coil.
• Limited accuracy ~55-60%.
• Random prediction ~38%.
MTLLALGINHKTAP...
CCEEEEEECCCCCC...
Secondary Structure Prediction
36
• Each residue is classified as:
– Ha/Hb, strong helix / strand former.
– ha/hb, weak helix / strand former.
– I, indifferent.
– ba/bb, weak helix/strand breaker.
– Ba/Bb, strong helix / strand breaker.
The Chou & Fasman Method
37
The Chou & Fasman Method..
• Score each residue:
– Ha/ha=1, Ia=0 or ½, Ba/ba=-1.
– Hb/hb=1, Ib=0 or ½, Bb/bb=-1.
• Helix nucleation:
– Score > 4 in a “window” of 6 residues.
• Strand nucleation:
– Score > 3 in a “window” of 5 residues.
• Propagate until score < 1 in a 4 residue “window”.
38
GPSRYIVTLANGK
Helix:
Strand
-1 -1 0 0 -1 1 1 0 1 1 -1 -1 1
-1 -1 -1 .5 1 1 1 1 1 0 0 -1 -1
-2 0 1 2 3 3 1
No nucl.
-1.5 .5 2.5 4.5 5 4 3 1 -1
-2.5 -.5 1.5 … 3 1 -1
Nucleation
Propagate
GPSRYIVTLANGK
Result
The Chou & Fasman Method..
39
• Neural networks (e.g. the PHD server):
– Input: a number of protein sequences +
secondary structure.
– Output: a trained network that predicts
secondary structure elements with ~70%
accuracy.
• Use many different methods and compare
(e.g. the JPred server)!
Modern methods
40
Summary
• The function of a protein is governed by its structure
• Different sequence  Different structure
• PDB, protein data bank
• Secondary structure prediction is hard, tertiary
structure prediction is even harder
• Use homologs whenever possible or different methods
to assess quality
41

Weitere ähnliche Inhalte

Ähnlich wie 2005_lecture_01.ppt

Computer Aided Molecular Modeling
Computer Aided Molecular ModelingComputer Aided Molecular Modeling
Computer Aided Molecular Modelingpkchoudhury
 
Protein Structural predection
Protein Structural predectionProtein Structural predection
Protein Structural predectionSantu Chall
 
P. Joshi SBDD and docking.ppt
P. Joshi SBDD and docking.pptP. Joshi SBDD and docking.ppt
P. Joshi SBDD and docking.pptpranalpatilPranal
 
Bioinformatics t7-proteinstructure v2014
Bioinformatics t7-proteinstructure v2014Bioinformatics t7-proteinstructure v2014
Bioinformatics t7-proteinstructure v2014Prof. Wim Van Criekinge
 
Dicroismo circular
Dicroismo circularDicroismo circular
Dicroismo circularInês Silva
 
Molecular docking
Molecular dockingMolecular docking
Molecular dockingpalliyath91
 
Proteins ccc 21_08_2012 modified
Proteins ccc 21_08_2012 modifiedProteins ccc 21_08_2012 modified
Proteins ccc 21_08_2012 modifiedlifsnu
 
molecular docking its types and de novo drug design and application and softw...
molecular docking its types and de novo drug design and application and softw...molecular docking its types and de novo drug design and application and softw...
molecular docking its types and de novo drug design and application and softw...GAUTAM KHUNE
 
Structure based computer aided drug design
Structure based computer aided drug designStructure based computer aided drug design
Structure based computer aided drug designThanh Truong
 
Protein Crystallography of Lysozyme
Protein Crystallography of LysozymeProtein Crystallography of Lysozyme
Protein Crystallography of LysozymeGuillermo Llamas
 
QSAR : Activity Relationships Quantitative Structure
QSAR : Activity Relationships Quantitative StructureQSAR : Activity Relationships Quantitative Structure
QSAR : Activity Relationships Quantitative StructureSaramita De Chakravarti
 
NMR Random Coil Index & Protein Dynamics
NMR Random Coil Index & Protein Dynamics NMR Random Coil Index & Protein Dynamics
NMR Random Coil Index & Protein Dynamics Mark Berjanskii
 
Homology Modeling.pptx
Homology Modeling.pptxHomology Modeling.pptx
Homology Modeling.pptxAmnaAkram29
 
DHC Microbiome Presentation 4-23-19.pptx
DHC Microbiome Presentation 4-23-19.pptxDHC Microbiome Presentation 4-23-19.pptx
DHC Microbiome Presentation 4-23-19.pptxDivyanshGupta922023
 
Isoelectric Focusing.pptx
Isoelectric Focusing.pptxIsoelectric Focusing.pptx
Isoelectric Focusing.pptxrimshaijaz6
 
3 d qsar approaches structure
3 d qsar approaches structure3 d qsar approaches structure
3 d qsar approaches structureROHIT PAL
 

Ähnlich wie 2005_lecture_01.ppt (20)

Computer Aided Molecular Modeling
Computer Aided Molecular ModelingComputer Aided Molecular Modeling
Computer Aided Molecular Modeling
 
Cs273 structure prediction
Cs273 structure predictionCs273 structure prediction
Cs273 structure prediction
 
Protein Structural predection
Protein Structural predectionProtein Structural predection
Protein Structural predection
 
P. Joshi SBDD and docking.ppt
P. Joshi SBDD and docking.pptP. Joshi SBDD and docking.ppt
P. Joshi SBDD and docking.ppt
 
docking
docking docking
docking
 
Bioinformatics t7-proteinstructure v2014
Bioinformatics t7-proteinstructure v2014Bioinformatics t7-proteinstructure v2014
Bioinformatics t7-proteinstructure v2014
 
Protein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on RosettaProtein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on Rosetta
 
Dicroismo circular
Dicroismo circularDicroismo circular
Dicroismo circular
 
Molecular docking
Molecular dockingMolecular docking
Molecular docking
 
Proteins ccc 21_08_2012 modified
Proteins ccc 21_08_2012 modifiedProteins ccc 21_08_2012 modified
Proteins ccc 21_08_2012 modified
 
molecular docking its types and de novo drug design and application and softw...
molecular docking its types and de novo drug design and application and softw...molecular docking its types and de novo drug design and application and softw...
molecular docking its types and de novo drug design and application and softw...
 
Structure based computer aided drug design
Structure based computer aided drug designStructure based computer aided drug design
Structure based computer aided drug design
 
Protein Crystallography of Lysozyme
Protein Crystallography of LysozymeProtein Crystallography of Lysozyme
Protein Crystallography of Lysozyme
 
QSAR : Activity Relationships Quantitative Structure
QSAR : Activity Relationships Quantitative StructureQSAR : Activity Relationships Quantitative Structure
QSAR : Activity Relationships Quantitative Structure
 
NMR Random Coil Index & Protein Dynamics
NMR Random Coil Index & Protein Dynamics NMR Random Coil Index & Protein Dynamics
NMR Random Coil Index & Protein Dynamics
 
Homology Modeling.pptx
Homology Modeling.pptxHomology Modeling.pptx
Homology Modeling.pptx
 
Atindra-protein.pptx
Atindra-protein.pptxAtindra-protein.pptx
Atindra-protein.pptx
 
DHC Microbiome Presentation 4-23-19.pptx
DHC Microbiome Presentation 4-23-19.pptxDHC Microbiome Presentation 4-23-19.pptx
DHC Microbiome Presentation 4-23-19.pptx
 
Isoelectric Focusing.pptx
Isoelectric Focusing.pptxIsoelectric Focusing.pptx
Isoelectric Focusing.pptx
 
3 d qsar approaches structure
3 d qsar approaches structure3 d qsar approaches structure
3 d qsar approaches structure
 

Mehr von KhalidBassiouny1 (20)

05lipids2009.ppt
05lipids2009.ppt05lipids2009.ppt
05lipids2009.ppt
 
2-lipid.ppt
2-lipid.ppt2-lipid.ppt
2-lipid.ppt
 
2-12-2012-2.ppt
2-12-2012-2.ppt2-12-2012-2.ppt
2-12-2012-2.ppt
 
2-12-2012.ppt
2-12-2012.ppt2-12-2012.ppt
2-12-2012.ppt
 
PeptidesandProteins.ppt
PeptidesandProteins.pptPeptidesandProteins.ppt
PeptidesandProteins.ppt
 
PBIO4500 Protein Therapeutics.ppt
PBIO4500 Protein Therapeutics.pptPBIO4500 Protein Therapeutics.ppt
PBIO4500 Protein Therapeutics.ppt
 
Nut_Chapter_6_lecture.ppt
Nut_Chapter_6_lecture.pptNut_Chapter_6_lecture.ppt
Nut_Chapter_6_lecture.ppt
 
Lh6Ch03aAAs.ppt
Lh6Ch03aAAs.pptLh6Ch03aAAs.ppt
Lh6Ch03aAAs.ppt
 
Lecture 14 2013.ppt
Lecture 14 2013.pptLecture 14 2013.ppt
Lecture 14 2013.ppt
 
KSD_L8_CMB06.ppt
KSD_L8_CMB06.pptKSD_L8_CMB06.ppt
KSD_L8_CMB06.ppt
 
HSPs.ppt
HSPs.pptHSPs.ppt
HSPs.ppt
 
enzymes.ppt
enzymes.pptenzymes.ppt
enzymes.ppt
 
Classification_of_protein_structure.ppt
Classification_of_protein_structure.pptClassification_of_protein_structure.ppt
Classification_of_protein_structure.ppt
 
chapter6.ppt
chapter6.pptchapter6.ppt
chapter6.ppt
 
class_L02.ppt
class_L02.pptclass_L02.ppt
class_L02.ppt
 
centoni (2).ppt
centoni (2).pptcentoni (2).ppt
centoni (2).ppt
 
Background.ppt
Background.pptBackground.ppt
Background.ppt
 
Amino_Acids_and_Proteins.ppt
Amino_Acids_and_Proteins.pptAmino_Acids_and_Proteins.ppt
Amino_Acids_and_Proteins.ppt
 
142 proteinsoutline.ppt
142 proteinsoutline.ppt142 proteinsoutline.ppt
142 proteinsoutline.ppt
 
27-protease.ppt
27-protease.ppt27-protease.ppt
27-protease.ppt
 

Kürzlich hochgeladen

HiFi Call Girl Service Hyderabad | Whatsapp No 📞 9352988975 📞 VIP Escorts Ser...
HiFi Call Girl Service Hyderabad | Whatsapp No 📞 9352988975 📞 VIP Escorts Ser...HiFi Call Girl Service Hyderabad | Whatsapp No 📞 9352988975 📞 VIP Escorts Ser...
HiFi Call Girl Service Hyderabad | Whatsapp No 📞 9352988975 📞 VIP Escorts Ser...kajalverma014
 
Call girls Service Budvel ( 8250092165 ) Cheap rates call girls | Get low budget
Call girls Service Budvel ( 8250092165 ) Cheap rates call girls | Get low budgetCall girls Service Budvel ( 8250092165 ) Cheap rates call girls | Get low budget
Call girls Service Budvel ( 8250092165 ) Cheap rates call girls | Get low budgetSareena Khatun
 
ADSORPTIVE REMOVAL OF LEAD & ARSENIC FROM AQUEOUS.pptx
ADSORPTIVE REMOVAL OF LEAD & ARSENIC FROM AQUEOUS.pptxADSORPTIVE REMOVAL OF LEAD & ARSENIC FROM AQUEOUS.pptx
ADSORPTIVE REMOVAL OF LEAD & ARSENIC FROM AQUEOUS.pptxniranjanrauniyar
 
一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理
一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理
一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理hwoudye
 
Call Girls Vanasthalipuram - 8250092165 Our call girls are sure to provide yo...
Call Girls Vanasthalipuram - 8250092165 Our call girls are sure to provide yo...Call Girls Vanasthalipuram - 8250092165 Our call girls are sure to provide yo...
Call Girls Vanasthalipuram - 8250092165 Our call girls are sure to provide yo...kumargunjan9515
 
Dubai Call Girls Clim@X O525547819 Call Girls Dubai
Dubai Call Girls Clim@X O525547819 Call Girls DubaiDubai Call Girls Clim@X O525547819 Call Girls Dubai
Dubai Call Girls Clim@X O525547819 Call Girls Dubaikajalvid75
 
+97470301568>>buy weed in qatar>>buy thc oil in doha qatar>>
+97470301568>>buy weed in qatar>>buy thc oil in doha qatar>>+97470301568>>buy weed in qatar>>buy thc oil in doha qatar>>
+97470301568>>buy weed in qatar>>buy thc oil in doha qatar>>Health
 
thesis of copper nanoparticles and their relevance
thesis of copper nanoparticles and their relevancethesis of copper nanoparticles and their relevance
thesis of copper nanoparticles and their relevanceDiptiPriya6
 
FOOD PACKAGING AND LABELLING (Food Safety and Standards).pptx
FOOD PACKAGING AND LABELLING (Food Safety and Standards).pptxFOOD PACKAGING AND LABELLING (Food Safety and Standards).pptx
FOOD PACKAGING AND LABELLING (Food Safety and Standards).pptxDr. Trisha Manna
 
Call girls Service Nacharam - 8250092165 Our call girls are sure to provide y...
Call girls Service Nacharam - 8250092165 Our call girls are sure to provide y...Call girls Service Nacharam - 8250092165 Our call girls are sure to provide y...
Call girls Service Nacharam - 8250092165 Our call girls are sure to provide y...kumargunjan9515
 
The Clean Living Project Episode 18 - Seed Oils
The Clean Living Project Episode 18 - Seed OilsThe Clean Living Project Episode 18 - Seed Oils
The Clean Living Project Episode 18 - Seed OilsThe Clean Living Project
 
Essential oils for beginners slide .pptx
Essential oils for beginners slide .pptxEssential oils for beginners slide .pptx
Essential oils for beginners slide .pptxniranjanrauniyar
 
一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理
一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理
一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理hwoudye
 
Call Girls in Kondapur - 8250092165 Our call girls are sure to provide you wi...
Call Girls in Kondapur - 8250092165 Our call girls are sure to provide you wi...Call Girls in Kondapur - 8250092165 Our call girls are sure to provide you wi...
Call Girls in Kondapur - 8250092165 Our call girls are sure to provide you wi...kumargunjan9515
 
High Class Call Girls Hyderabad 10k @ I'm VIP Independent Escorts Girls 📞 935...
High Class Call Girls Hyderabad 10k @ I'm VIP Independent Escorts Girls 📞 935...High Class Call Girls Hyderabad 10k @ I'm VIP Independent Escorts Girls 📞 935...
High Class Call Girls Hyderabad 10k @ I'm VIP Independent Escorts Girls 📞 935...kajalverma014
 
Call Girls Surat ( 8250092165 ) Cheap rates call girls | Get low budget
Call Girls Surat ( 8250092165 ) Cheap rates call girls | Get low budgetCall Girls Surat ( 8250092165 ) Cheap rates call girls | Get low budget
Call Girls Surat ( 8250092165 ) Cheap rates call girls | Get low budgetSareena Khatun
 
Occupational Cancers and its prevention in an Industrial setting
Occupational Cancers and its prevention in an Industrial settingOccupational Cancers and its prevention in an Industrial setting
Occupational Cancers and its prevention in an Industrial settingRajeshKulkarni67
 
Top profile Call Girls In Chhapra [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In Chhapra [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In Chhapra [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In Chhapra [ 7014168258 ] Call Me For Genuine Models W...gajnagarg
 
Top profile Call Girls In Deoghar [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In Deoghar [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In Deoghar [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In Deoghar [ 7014168258 ] Call Me For Genuine Models W...gajnagarg
 

Kürzlich hochgeladen (20)

HiFi Call Girl Service Hyderabad | Whatsapp No 📞 9352988975 📞 VIP Escorts Ser...
HiFi Call Girl Service Hyderabad | Whatsapp No 📞 9352988975 📞 VIP Escorts Ser...HiFi Call Girl Service Hyderabad | Whatsapp No 📞 9352988975 📞 VIP Escorts Ser...
HiFi Call Girl Service Hyderabad | Whatsapp No 📞 9352988975 📞 VIP Escorts Ser...
 
Call girls Service Budvel ( 8250092165 ) Cheap rates call girls | Get low budget
Call girls Service Budvel ( 8250092165 ) Cheap rates call girls | Get low budgetCall girls Service Budvel ( 8250092165 ) Cheap rates call girls | Get low budget
Call girls Service Budvel ( 8250092165 ) Cheap rates call girls | Get low budget
 
ADSORPTIVE REMOVAL OF LEAD & ARSENIC FROM AQUEOUS.pptx
ADSORPTIVE REMOVAL OF LEAD & ARSENIC FROM AQUEOUS.pptxADSORPTIVE REMOVAL OF LEAD & ARSENIC FROM AQUEOUS.pptx
ADSORPTIVE REMOVAL OF LEAD & ARSENIC FROM AQUEOUS.pptx
 
17 Foods to avoid while breastfeeding.pdf
17 Foods to avoid while breastfeeding.pdf17 Foods to avoid while breastfeeding.pdf
17 Foods to avoid while breastfeeding.pdf
 
一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理
一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理
一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理
 
Call Girls Vanasthalipuram - 8250092165 Our call girls are sure to provide yo...
Call Girls Vanasthalipuram - 8250092165 Our call girls are sure to provide yo...Call Girls Vanasthalipuram - 8250092165 Our call girls are sure to provide yo...
Call Girls Vanasthalipuram - 8250092165 Our call girls are sure to provide yo...
 
Dubai Call Girls Clim@X O525547819 Call Girls Dubai
Dubai Call Girls Clim@X O525547819 Call Girls DubaiDubai Call Girls Clim@X O525547819 Call Girls Dubai
Dubai Call Girls Clim@X O525547819 Call Girls Dubai
 
+97470301568>>buy weed in qatar>>buy thc oil in doha qatar>>
+97470301568>>buy weed in qatar>>buy thc oil in doha qatar>>+97470301568>>buy weed in qatar>>buy thc oil in doha qatar>>
+97470301568>>buy weed in qatar>>buy thc oil in doha qatar>>
 
thesis of copper nanoparticles and their relevance
thesis of copper nanoparticles and their relevancethesis of copper nanoparticles and their relevance
thesis of copper nanoparticles and their relevance
 
FOOD PACKAGING AND LABELLING (Food Safety and Standards).pptx
FOOD PACKAGING AND LABELLING (Food Safety and Standards).pptxFOOD PACKAGING AND LABELLING (Food Safety and Standards).pptx
FOOD PACKAGING AND LABELLING (Food Safety and Standards).pptx
 
Call girls Service Nacharam - 8250092165 Our call girls are sure to provide y...
Call girls Service Nacharam - 8250092165 Our call girls are sure to provide y...Call girls Service Nacharam - 8250092165 Our call girls are sure to provide y...
Call girls Service Nacharam - 8250092165 Our call girls are sure to provide y...
 
The Clean Living Project Episode 18 - Seed Oils
The Clean Living Project Episode 18 - Seed OilsThe Clean Living Project Episode 18 - Seed Oils
The Clean Living Project Episode 18 - Seed Oils
 
Essential oils for beginners slide .pptx
Essential oils for beginners slide .pptxEssential oils for beginners slide .pptx
Essential oils for beginners slide .pptx
 
一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理
一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理
一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理
 
Call Girls in Kondapur - 8250092165 Our call girls are sure to provide you wi...
Call Girls in Kondapur - 8250092165 Our call girls are sure to provide you wi...Call Girls in Kondapur - 8250092165 Our call girls are sure to provide you wi...
Call Girls in Kondapur - 8250092165 Our call girls are sure to provide you wi...
 
High Class Call Girls Hyderabad 10k @ I'm VIP Independent Escorts Girls 📞 935...
High Class Call Girls Hyderabad 10k @ I'm VIP Independent Escorts Girls 📞 935...High Class Call Girls Hyderabad 10k @ I'm VIP Independent Escorts Girls 📞 935...
High Class Call Girls Hyderabad 10k @ I'm VIP Independent Escorts Girls 📞 935...
 
Call Girls Surat ( 8250092165 ) Cheap rates call girls | Get low budget
Call Girls Surat ( 8250092165 ) Cheap rates call girls | Get low budgetCall Girls Surat ( 8250092165 ) Cheap rates call girls | Get low budget
Call Girls Surat ( 8250092165 ) Cheap rates call girls | Get low budget
 
Occupational Cancers and its prevention in an Industrial setting
Occupational Cancers and its prevention in an Industrial settingOccupational Cancers and its prevention in an Industrial setting
Occupational Cancers and its prevention in an Industrial setting
 
Top profile Call Girls In Chhapra [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In Chhapra [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In Chhapra [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In Chhapra [ 7014168258 ] Call Me For Genuine Models W...
 
Top profile Call Girls In Deoghar [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In Deoghar [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In Deoghar [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In Deoghar [ 7014168258 ] Call Me For Genuine Models W...
 

2005_lecture_01.ppt

  • 1. 1 Protein Structure, Structure Classification and Prediction Bioinformatics X3 January 2005 P. Johansson, D. Madsen Dept.of Cell & Molecular Biology, Uppsala University
  • 2. 2 Overview • Introduction to proteins, structure & classification • Protein Folding • Experimental techniques for structure determination • Structure prediction
  • 3. 3
  • 4. 4 Proteins • Proteins play a crucial role in virtually all biological processes with a broad range of functions. • The activity of an enzyme or the function of a protein is governed by the three-dimensional structure
  • 5. 5 20 amino acids - the building blocks
  • 7. 7 Hydrophilic or hydrophobic..? • Virtually all soluble proteins feature a hydrophobic core surrounded by a hydrophilic surface • But, peptide backbone is inherently polar ? • Solution ; neutralize potential H-donors & acceptors using ordered secondary structure
  • 9. 9 • 3.6 residues / turn • Axial dipole moment • Not Proline & Glycine • Protein surfaces Secondary Structure: a-helix
  • 11. 11 Secondary Structure: b-sheets • Parallel or antiparallel • Alternating side-chains • No mixing • Loops often have polar amino acids
  • 12. 12 Structural classification • Databases – SCOP, ’Structural Classification of Proteins’, manual classification – CATH, ’Class Architecture Topology Homology’, based on the SSAP algorithm – FSSP, ’Family of Structurally Similar Proteins’, based on the DALI algorithm – PClass, ’Protein Classification’ based on the LOCK and 3Dsearch algorithms
  • 13. 13 Structural classification, CATH • Class, four types : – Mainly a – a / b structures – Mainly b – No secondary structure • Arhitecture (fold) • Topology (superfamily) • Homology (family)
  • 15. 15 Structural classification.. • Two types of algorithms – Inter-Molecular, 3D, Rigid Body ; structural alignment in a common coordinate system (hard) e.g. VAST, LOCK.. alg. – Intra-Molecular, 2D, Internal Geometry ; structural alignment using internal distances and angles e.g. DALI, STRUCTURAL, SSAP.. alg.
  • 16. 16 Structural classification, SSAP • SSAP, ‘Sequential Structure Alignment Program’ Basic idea ; The similarity between residue i in molecule A and residue k in molecule B is characterised in terms of their structural surroundings This similarity can be quantified into a score, Sik Based on this similarity score and some specified gap penalty, dynamic programming is used to find the optimal structural alignment
  • 17. 17 Structural classification, SSAP The structural neighborhood of residue i in A compared to residue k in B i k
  • 18. 18 Structural classification, SSAP.. Distance between residue i & j in molecule A ; dA i,j Similarity for two pairs of residues, i j in A & k l in B ; , , b d d a s B kl A ij kl ij    a,b constants Similarity between residue i in A and residue k in B ;         n n m B m k k A m i i k i b d d a S , , , Idea ; Si,k is big if the distances from residue i in A to the 2n nearest neighbours are similar to the corresponding distances around k in B
  • 19. 19 Structural classification, SSAP.. This works well for small structures and local structural alignments - however, insertions and deletions cause problems  unrelated distances HSERAHVFIM.. GQ-VMAC-NW.. i=5 k=4 A : B : - The real algorithm uses Dynamic programming on two levels, first to find which distances to compare  Sik, then to align the structures using these scores
  • 20. 20 Experimental techniques for structure determination • X-ray Crystallography • Nuclear Magnetic Resonance spectroscopy (NMR) • Electron Microscopy/Diffraction • Free electron lasers ?
  • 22. 22 X-ray Crystallography.. • From small molecules to viruses • Information about the positions of individual atoms • Limited information about dynamics • Requires crystals
  • 23. 23
  • 24. 24 NMR • Limited to molecules up to ~50kDa (good quality up to 30 kDa) • Distances between pairs of hydrogen atoms • Lots of information about dynamics • Requires soluble, non-aggregating material • Assignment problem
  • 25. 25 Electron Microscopy/ Diffraction • Low to medium resolution • Limited information about dynamics • Can use very small crystals (nm range) • Can be used for very large molecules and complexes
  • 26. 26
  • 28. 28 Protein Folding • Different sequence  Different structure • Free energy difference small due to large entropy decrease, DG = DH - TDS
  • 29. 29 Structure Prediction Why is structure prediction and especially ab initio calculations hard..? • Many degrees of freedom / residue • Remote noncovalent interactions • Nature does not go through all conformations • Folding assisted by enzymes & chaperones
  • 30. 30 Ab initio calculations used for smaller problems ; • Calculation of affinity • Enzymatic pathways Molecular dynamics
  • 31. 31 Sequence Classification rev. • Class : Secondary structure content • Fold : Major structural similarity. • Superfamily : Probable common evolutionary origin. • Family : Clear evolutionary relationship.
  • 32. 32 • Search sequence data banks for homologs • Search methods e.g. BLAST, PSIBLAST, FASTA… • Homologue in PDB..? Structure Prediction IVTY…PGGG HYW…QHG
  • 33. 33 Multiple sequence / structure alignment • Contains more information than a single sequence for applications like homology modeling and secondary structure prediction • Gives location of conserved parts and residues likely to be buried in the protein core or exposed to solvent Structure Prediction
  • 35. 35 • Statistical Analysis (old fashioned): – For each amino acid type assign it’s ‘propensity’ to be in a helix, sheet, or coil. • Limited accuracy ~55-60%. • Random prediction ~38%. MTLLALGINHKTAP... CCEEEEEECCCCCC... Secondary Structure Prediction
  • 36. 36 • Each residue is classified as: – Ha/Hb, strong helix / strand former. – ha/hb, weak helix / strand former. – I, indifferent. – ba/bb, weak helix/strand breaker. – Ba/Bb, strong helix / strand breaker. The Chou & Fasman Method
  • 37. 37 The Chou & Fasman Method.. • Score each residue: – Ha/ha=1, Ia=0 or ½, Ba/ba=-1. – Hb/hb=1, Ib=0 or ½, Bb/bb=-1. • Helix nucleation: – Score > 4 in a “window” of 6 residues. • Strand nucleation: – Score > 3 in a “window” of 5 residues. • Propagate until score < 1 in a 4 residue “window”.
  • 38. 38 GPSRYIVTLANGK Helix: Strand -1 -1 0 0 -1 1 1 0 1 1 -1 -1 1 -1 -1 -1 .5 1 1 1 1 1 0 0 -1 -1 -2 0 1 2 3 3 1 No nucl. -1.5 .5 2.5 4.5 5 4 3 1 -1 -2.5 -.5 1.5 … 3 1 -1 Nucleation Propagate GPSRYIVTLANGK Result The Chou & Fasman Method..
  • 39. 39 • Neural networks (e.g. the PHD server): – Input: a number of protein sequences + secondary structure. – Output: a trained network that predicts secondary structure elements with ~70% accuracy. • Use many different methods and compare (e.g. the JPred server)! Modern methods
  • 40. 40 Summary • The function of a protein is governed by its structure • Different sequence  Different structure • PDB, protein data bank • Secondary structure prediction is hard, tertiary structure prediction is even harder • Use homologs whenever possible or different methods to assess quality
  • 41. 41