Protein Structural Prediction
Protein Structure is Hierarchical
Structure Determines Function
What determines structure?
• Energy
• Kinematics
How can we determine structure?
• Experimental methods
• Computational predictions
The Protein Folding Problem
Primary Structure: Sequence
• The primary structure of a protein is the amino acid sequence
Primary Structure: Sequence
• Twenty different amino acids have distinct shapes and properties
Primary Structure: Sequence
A useful mnemonic for the hydrophobic amino acids is "FAMILY VW"
Secondary Structure: α, β, & loops
• α helices and β sheets are stabilized by hydrogen bonds between backbone oxygen and hydrogen atoms
Secondary Structure: α helix
Secondary Structure: β sheet
β sheet
β bulge
Second-and-a-half-ary Structure: Motifs
beta helix
beta barrel
beta trefoil
Tertiary Structure: Domains
Mosaic Proteins
Tertiary Structure: A Protein Fold
Protein Folds Composed of α, β, other
Quaternary Structure: Multimeric Proteins or Functional Assemblies
• Multimeric Proteins
  - Hemoglobin: a tetramer
• Macromolecular Assemblies
  - Ribosome: protein synthesis
  - Replisome: DNA copying
Protein Folding
• The amino-acid sequence of a protein determines the 3D fold [Anfinsen et al., 1950s]
Some exceptions:
  - All proteins can be denatured
  - Some proteins have multiple conformations
  - Some proteins get folding help from chaperones
• The function of a protein is determined by its 3D fold
• Can we predict the 3D fold of a protein given its amino-acid sequence?
The Levinthal Paradox
• Given a small protein (100 aa), assume 3 possible conformations per peptide bond
• 3^100 ≈ 5 × 10^47 conformations
• The fastest motions take about 10^-15 sec, so sampling all conformations would take 5 × 10^32 sec
• 60 × 60 × 24 × 365 = 31,536,000 seconds in a year
• Sampling all conformations would take 1.6 × 10^25 years
• Yet each protein folds quickly into a single stable native conformation: the Levinthal paradox
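The arithmetic on this slide can be checked in a few lines. This is only a sketch that plugs in the slide's own assumptions (100 residues, 3 conformations per peptide bond, 10^-15 sec per sample); the variable names are illustrative.

```python
# Levinthal-style estimate, using only the numbers quoted on the slide.
residues = 100               # small protein, about 100 amino acids
states_per_bond = 3          # assumed conformations per peptide bond
time_per_sample_s = 1e-15    # fastest motions, in seconds

conformations = states_per_bond ** residues
seconds_per_year = 60 * 60 * 24 * 365
total_seconds = conformations * time_per_sample_s
total_years = total_seconds / seconds_per_year

print(f"conformations: {conformations:.1e}")
print(f"seconds to sample all: {total_seconds:.1e}")
print(f"years to sample all: {total_years:.1e}")
```

The exponent dominates everything else: changing the per-sample time by a few orders of magnitude does not bring the estimate anywhere near the timescale on which real proteins fold.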
Quick Overview of Energy
Bond                          Strength (kcal/mole)
H-bonds                       3-7
Ionic bonds                   10
Hydrophobic interactions      1-2
van der Waals interactions    1
Disulfide bridge              51
The Hydrophobic Effect
• Important for folding, because every amino acid participates!
Residue  Hydrophobicity
Trp       2.25
Ile       1.80
Phe       1.79
Leu       1.70
Cys       1.54
Met       1.23
Val       1.22
Tyr       0.96
Pro       0.72
Ala       0.31
Thr       0.26
His       0.13
Gly       0.00
Ser      -0.04
Gln      -0.22
Asn      -0.60
Glu      -0.64
Asp      -0.77
Lys      -0.99
Arg      -1.01
Experimentally Determined Hydrophobicity Levels
Fauchère and Pliska (1983). Eur. J. Med. Chem. 18, 369-75.
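One common way to use such a scale is to average it over a sliding window along the sequence. The sketch below does that with the values from the table; the function name, the window size, and the example peptide are all assumptions made for illustration, not something the slides prescribe.

```python
# Values copied from the hydrophobicity table above (three-letter codes).
HYDROPHOBICITY = {
    "Trp": 2.25, "Ile": 1.80, "Phe": 1.79, "Leu": 1.70, "Cys": 1.54,
    "Met": 1.23, "Val": 1.22, "Tyr": 0.96, "Pro": 0.72, "Ala": 0.31,
    "Thr": 0.26, "His": 0.13, "Gly": 0.00, "Ser": -0.04, "Gln": -0.22,
    "Asn": -0.60, "Glu": -0.64, "Asp": -0.77, "Lys": -0.99, "Arg": -1.01,
}

def window_hydrophobicity(residues, window=5):
    """Mean hydrophobicity of each full window along a residue list."""
    scores = [HYDROPHOBICITY[r] for r in residues]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]

# Hypothetical peptide, chosen only to illustrate the call.
peptide = ["Leu", "Ile", "Val", "Lys", "Asp", "Glu", "Arg"]
profile = window_hydrophobicity(peptide, window=3)
print(profile)
```

For this made-up peptide the profile falls from the hydrophobic start to the charged end, which is the kind of signal used to guess buried versus exposed stretches.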
Protein Structure Determination
• Experimental
  - X-ray crystallography
  - NMR spectroscopy
• Computational – Structure Prediction (The Holy Grail)
Sequence implies structure, therefore in principle we can predict the structure from the sequence alone
Protein Structure Prediction
• ab initio
  - Use just first principles: energy, geometry, and kinematics
• Homology
  - Find the best match to a database of sequences with known 3D structure
• Threading
• Meta-servers and other methods
Ab initio Prediction
• Sampling the global conformation space
  - Lattice models / Discrete-state models
  - Molecular Dynamics
  - Pre-set libraries of fragment 3D motifs
• Picking native conformations with an energy function
  - Solvation model: how the protein interacts with water
  - Pair interactions between amino acids
• Predicting secondary structure
  - Local homology
  - Fragment libraries
Lattice String Folding
• HP model: the main modeled force is hydrophobic attraction
  - NP-hard on both the 2-D square and 3-D cubic lattices
  - Constant-factor approximation algorithms exist
  - Not so relevant biologically
Lattice String Folding
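To make the HP model concrete, here is a minimal sketch of its energy function on the 2-D square lattice. It assumes the usual convention of -1 for every pair of H residues that are lattice neighbors but not adjacent in the chain; the slides do not state the scoring, and the function name is illustrative.

```python
def hp_energy(sequence, coords):
    """Energy of an HP chain on the 2-D square lattice.

    sequence: string of 'H' (hydrophobic) and 'P' (polar).
    coords:   list of (x, y) lattice points, one per residue.
    Each pair of H residues that are lattice neighbors but not
    chain neighbors contributes -1 (assumed convention).
    """
    if len(sequence) != len(coords):
        raise ValueError("sequence and coords must have equal length")
    if len(set(coords)) != len(coords):
        raise ValueError("conformation is not self-avoiding")
    for (x1, y1), (x2, y2) in zip(coords, coords[1:]):
        if abs(x1 - x2) + abs(y1 - y2) != 1:
            raise ValueError("consecutive residues must be lattice neighbors")

    energy = 0
    for i in range(len(sequence)):
        for j in range(i + 2, len(sequence)):   # skip chain neighbors
            if sequence[i] == "H" and sequence[j] == "H":
                (x1, y1), (x2, y2) = coords[i], coords[j]
                if abs(x1 - x2) + abs(y1 - y2) == 1:
                    energy -= 1
    return energy

# Toy example: a 4-residue chain folded into a unit square.
square = [(0, 0), (1, 0), (1, 1), (0, 1)]
print(hp_energy("HPPH", square))
```

Evaluating one conformation is cheap; the hardness result quoted above concerns finding the conformation that minimizes this energy.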
ROSETTA
http://www.bioinfo.rpi.edu/~bystrc/hmmstr/server.php
http://depts.washington.edu/bakerpg/papers/Bonneau-ARBBS-v30-p173.pdf
• Monte Carlo based method
• Limit conformational search space by using the sequence-structure motif I-Sites library (http://isites.bio.rpi.edu/Isites/)
  - 261 patterns in library
  - Certain positions in motif favor certain residues
• Remove all sequences with <25% identity
• Find structures of the 25 nearest sequence neighbors of each 9-mer
Rationale
  - Local structures often fold independently of the full protein
  - Can predict large areas of a protein by matching sequence to I-Sites
I-Sites Examples
• Non-polar helix
  - Abundance of alanine at all positions
  - Non-polar side chains favored at positions 3, 6, 10 (methionine, leucine, isoleucine)
• Amphipathic helix
  - Non-polar side chains favored at positions 6, 9, 13, 16 (methionine, leucine, isoleucine)
  - Polar side chains favored at positions 1, 8, 11, 18 (glutamic acid, lysine)
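As a toy illustration of matching a sequence against such a motif, the sketch below counts how many of the amphipathic-helix preferences listed above a window satisfies. It is a deliberate simplification: the motif length of 18 is assumed from the largest position listed, hard matches stand in for whatever scoring the real I-Sites library uses, and all names are hypothetical.

```python
# Position preferences copied from the amphipathic-helix example above
# (1-based positions within the motif).
NONPOLAR_POSITIONS = (6, 9, 13, 16)
POLAR_POSITIONS = (1, 8, 11, 18)
NONPOLAR_RESIDUES = set("MLI")   # methionine, leucine, isoleucine
POLAR_RESIDUES = set("EK")       # glutamic acid, lysine
MOTIF_LENGTH = 18                # assumed: largest position listed

def motif_matches(window):
    """Count how many of the eight preferred positions a window satisfies."""
    if len(window) != MOTIF_LENGTH:
        raise ValueError("window must be 18 residues long")
    hits = sum(window[p - 1] in NONPOLAR_RESIDUES for p in NONPOLAR_POSITIONS)
    hits += sum(window[p - 1] in POLAR_RESIDUES for p in POLAR_POSITIONS)
    return hits

def best_window(sequence):
    """Return (start_index, hits) of the best-matching 18-residue window."""
    if len(sequence) < MOTIF_LENGTH:
        raise ValueError("sequence is shorter than the motif")
    candidates = [
        (motif_matches(sequence[i:i + MOTIF_LENGTH]), i)
        for i in range(len(sequence) - MOTIF_LENGTH + 1)
    ]
    # Highest hit count wins; ties go to the earliest window.
    hits, start = max(candidates, key=lambda c: (c[0], -c[1]))
    return start, hits

# Made-up sequence: an idealized motif flanked by glycines.
print(best_window("GG" + "EAAAALAKLAEAIAAMAK" + "GG"))
```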
ROSETTA Method
• New structures are generated by swapping compatible fragments
• Accepted structures are clustered based on energy and structural size
• The best cluster is the one with the greatest number of conformations within 4 Å rms deviation of the structure at its center
• Representative structures are taken from each of the best five clusters and returned to the user as predictions
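The cluster-selection step can be sketched from a precomputed matrix of pairwise rms deviations. The code below assumes a greedy scheme (take the largest cluster, remove its members, repeat), which is one plausible reading of the slide rather than a description of Rosetta's actual implementation; function names and the example matrix are hypothetical.

```python
def largest_cluster(rmsd, cutoff=4.0):
    """Pick the structure with the most neighbors within `cutoff` Å.

    rmsd: square, symmetric matrix (list of lists) of pairwise rms
          deviations between candidate structures, zero on the diagonal.
    Returns (center_index, member_indices).
    """
    best_center, best_members = None, []
    for i, row in enumerate(rmsd):
        members = [j for j, d in enumerate(row) if d <= cutoff]
        if len(members) > len(best_members):
            best_center, best_members = i, members
    return best_center, best_members

def top_clusters(rmsd, cutoff=4.0, count=5):
    """Greedily extract up to `count` clusters, largest first."""
    remaining = list(range(len(rmsd)))
    clusters = []
    while remaining and len(clusters) < count:
        sub = [[rmsd[i][j] for j in remaining] for i in remaining]
        center, members = largest_cluster(sub, cutoff)
        clusters.append((remaining[center], [remaining[m] for m in members]))
        remaining = [r for k, r in enumerate(remaining) if k not in members]
    return clusters

# Made-up distances: three similar decoys and one outlier.
example = [
    [0.0, 2.0, 3.0, 9.0],
    [2.0, 0.0, 2.5, 8.0],
    [3.0, 2.5, 0.0, 7.0],
    [9.0, 8.0, 7.0, 0.0],
]
print(top_clusters(example))
```

Each returned center would serve as the representative structure for its cluster.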
Robetta & Rosetta
Rosetta results in CASP
Rosetta Results
• In CASP4, Rosetta's best models ranged from 6–10 Å rmsd (Cα)
• For comparison, good comparative models give 2–5 Å rmsd (Cα)
• Most effective with small proteins (<100 residues) and structures with helices
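For reference, the rmsd figures above measure the root-mean-square distance between matched Cα atoms of a model and the experimental structure. A minimal sketch follows, assuming the two coordinate sets are already optimally superimposed (the superposition step itself is left out); the function name is illustrative.

```python
import math

def ca_rmsd(model, reference):
    """Root-mean-square deviation between matched Cα coordinates.

    Both arguments are equal-length lists of (x, y, z) tuples in Å and
    are assumed to be already superimposed.
    """
    if len(model) != len(reference) or not model:
        raise ValueError("need two non-empty coordinate lists of equal length")
    squared = sum(
        (a - b) ** 2
        for p, q in zip(model, reference)
        for a, b in zip(p, q)
    )
    return math.sqrt(squared / len(model))

# Two-atom toy case: every atom displaced by 2 Å along z.
print(ca_rmsd([(0, 0, 0), (1, 0, 0)], [(0, 0, 2), (1, 0, 2)]))
```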
Only a few folds are found in nature
The SCOP Database
Structural Classification Of Proteins
FAMILY: proteins that are >30% similar, or >15% similar and have similar known structure/function
SUPERFAMILY: proteins whose families have some sequence and function/structure similarity suggesting a common evolutionary origin
COMMON FOLD: superfamilies that have the same secondary structures in the same arrangement, probably resulting from physics and chemistry
CLASS: alpha, beta, alpha–beta, alpha+beta, multidomain
Status of Protein Databases
SCOP: Structural Classification of Proteins. 1.67 release
24037 PDB Entries (15 May 2004). 65122 Domains.
Class                                 Folds  Superfamilies  Families
All alpha proteins                      202            342       550
All beta proteins                       141            280       529
Alpha and beta proteins (a/b)           130            213       593
Alpha and beta proteins (a+b)           260            386       650
Multi-domain proteins                    40             40        55
Membrane and cell surface proteins       42             82        91
Small proteins                           71            104       162
Total                                   887           1447      2630
[Figure panels: EMBL, PDB]
Evolution of Proteins – Domains
• The number of members in different families obeys a power law
• 429 families common to all 14 eukaryotes; 80% of animal domains, 90% of fungal domains
• 80% of proteins in eukaryotes are multidomain; domains usually combine pairwise in the same order. Why?
• Evolution of proteins happens mainly through duplication, recombination, and divergence
Chothia, Gough, Vogel, Teichmann, Science 300:1701-1703, 2003
Homology-based Prediction
• Align the query sequence with sequences of known structure, usually >30% similar
• Superimpose the aligned sequence onto the structure template, according to the computed sequence alignment
• Perform local refinement of the resulting structure in 3D
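The template-selection rule of thumb can be sketched as a percent-identity check on an existing pairwise alignment. Note the assumptions: identity is used here as a simple stand-in for the slide's "similar", the alignment is taken as given, and the function names and 30% default are illustrative.

```python
def percent_identity(aligned_query, aligned_template, gap="-"):
    """Percent of aligned (non-gap) columns with identical residues."""
    if len(aligned_query) != len(aligned_template):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (q, t) for q, t in zip(aligned_query, aligned_template)
        if q != gap and t != gap
    ]
    if not columns:
        return 0.0
    matches = sum(q == t for q, t in columns)
    return 100.0 * matches / len(columns)

def usable_template(aligned_query, aligned_template, threshold=30.0):
    """Apply the rule of thumb above: keep templates over the threshold."""
    return percent_identity(aligned_query, aligned_template) > threshold

# Made-up alignment with one gap column, which is ignored.
print(percent_identity("ACDE-G", "ACDEFG"))
```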
90% of new structures submitted to PDB in the past three years have similar folds in PDB
The number of unique structural folds is small (possibly a few thousand)
Examples of Fold Classes
Homology-based Prediction
Raw model
Loop modeling
Side chain placement
Refinement
Homology-based Prediction

Cs273 structure prediction
