SlideShare ist ein Scribd-Unternehmen logo
1 von 14
Downloaden Sie, um offline zu lesen
1
Central University of Bihar
BIS 553: protein modelling and simulation
Denovo structure prediction
Submitted to:- Submitted by:-
Dr. Durg Vijay Singh Shweta Kumari
Roll no- 21
2 nd semester
Central University of Bihar
Patna
2
CONTENT
Sl. No Topic Page No.
1 Introduction 3
2 Need of Ab inito prediction 4
3 Challenges 4
4 Principle of ab inito method 4
5 Denovo Structure Prediction V/S Template
Based Structure Prediction
4-5
6 Successful De Novo Modeling Requirements 6
7 Results from ab­initio 7
8 Domain prediction 7-8
9 Advantages of This Method 8-9
10 Complexity of abinitio methods 9
11 Ab initio methods have recently received 
increased attention in the prediction of loops
10
12 Protein folding and de novo protein design for 
biotechnological applications
10
13 Limitations of De novo Prediction Methods 11
14 CASPs 11-12
15 Application of Denovo structure prediction 12
16 List of de novo protein structure prediction 
software
12-13
17 References 14
3
Introduction:
Predicting the 3D structure without any “prior
knowledge”
•Predicting protein 3D structures from the amino acid sequence still remains
as an unsolved problem after five decades of efforts. If the target protein has a
homologue already solved, the task is relatively easy and high-resolution
models can be built by copying the framework of the solved structure.
•However, such a modelling procedure does not help answer the question of
how and why a protein adopts its specific structure. If structure homologues
(occasionally analogues) do not exist, or exist but cannot be identified, models
have to be constructed from scratch. This procedure, called ab initio
modelling.
•Ab initio modelling is essential for a complete solution to the protein
structure prediction problem; it can also help us understand the
physicochemical principle of how proteins fold in nature.
Thus,” In computational biology, de novo protein structure prediction refers to an
algorithmic process by which protein tertiary structure is predicted from its
amino acid primary sequence.”
4
Need of Ab inito prediction:
• First, in some cases, even a remotely related structural homologue may not be 
available.
• Second, new structure continue to be discovered which could not have been 
identified by methods which rely on comparison to known structure.
• Third, knowledge­based methods have been criticized for predicting protein 
protein structures without having to obtain a fundamental understanding of the 
mechanisms and driving forces of structure formation. Ab initio methods, in 
contrast, base their predictions on physical models for these mechanisms
Challenges:
• Energy functions that can  reliable   discriminat e   native and non­native  
structures.
• Enormous amount of computations.
Principle of ab inito method:
•It is based on the ‘thermodynamic hypothesis’, which states that the native 
structure of a protein is the one for which the free energy achieves the global 
minimum.
•ANFINSEN (1973) showed that all the information necessary for a protein to fold 
to the native state residue in the protein sequence.
•In the absence of large kinetic barriers in the force energy landscape, Anfinsen's 
result and those of large numbers of researchers in the intervening year suggest that 
the native confoermations of most proteins are the lowest free energy conformation 
for their sequences.
Denovo Structure Prediction V/S Template Based
Structure Prediction:
•De novo protein structure modeling is distinguished from Template-based
modeling (TBM) by the fact that no solved homolog to the protein of interest is
known, making efforts to predict protein structure from amino acid sequence
exceedingly difficult.
5
Sl.
No.
Homology
modelling
Fold Reconition De novo
prediction
1 templete based
modelling
templete based remote
homology modelling
templete free
modelling
2 applicable to sequence
having >= 30% homogy
on PDB database
applicable to sequence
having <20% homogy on
PDB database
applicable to any
sequence does not
having homologue
on PDB
3 length of sequence is not
limited
greater than 150AA
not applicable
4 limited search space search space greater than
homology modelling
very large search
space
5 more accurate structure generate less accurate
structure than homology
generate least
accurate structure
6 model quality at atomic
level
model quality at fold level model quality at
atomic level
7 computationally less
expensive
computationally more
expensive than homology
modelling
computationally
most expensive
8 applicable in Drug
designing, virtual
screening, designing site
directed mutagenisis,
characterization of active
site
applicable in Prediction of
protein family, functional
characterization by folding
assignment
applicable in
genome annotation,
domain prediction
and structural
genomics initiatives
6
Template­based methods begin with a sequence, predict the secondary structure, and 
attempt to find a template structure and/or fragments from existing structures in the 
PDB that will fold similar to the target sequence.
source: http://isw3.naist.jp/IS/Bio-Info-Unit/gogroup/study/study-en.html
“Denovo structure prediction method mean it
doesnot relay on homology between the query
sequence and a sequence in the Protein Data
Bank(PDB) to create a template for structure
prediction”
Successful De Novo Modeling Requirements:
•De novo conformation predictors usually function by producing candidate 
conformations (decoys) and then choosing amongst them based on their 
thermodynamic stability and energy state. Most successful predictors will have the 
following three factors in common:
1.  An accurate energy function that corresponds the most thermodynamically 
stable state to the native structure of a protein
2.  An efficient search method capable of quickly identify low­energy states 
through conformational search
3. The ability to select native­like models from a collection of decoy structures.
7
Results from ab­initio:
•Average error 5 Å -Average error 5 Å - 10 Å10 Å
•Function cannot beFunction cannot be predictedpredicted
••Long simulationsLong simulations
fig:fig: Some protein from ESome protein from E.coli.coli predicted at 7.6 Åpredicted at 7.6 Å
(CASP3, H.Scheraga)(CASP3, H.Scheraga)
Domain prediction:
•Domain prediction is a critical pre resquisite to the structur prediction “As the size of
the protein increases, its conformational space also increases.”
•Current denovo methods are limited to protein domain of 150 amino acid domain
residue for alpha-beta protein.
•80 residue for beta folds and 150 for alpha fold only.
•To overcome this two approaches can apply-
1. Increase the size range of denovo structure prediction.
2. Dividing protein into domains prior to attempting two protein structure
prediction.
•"A domain is generally define as a portion of protein that folds independently of the
rest of the protein."
•So dividing a query sequence into their smallest component domain prior to folding
is straight forward way to increase the size of the predictio.
•For many proteins domains division can be easily found while several domain
remains beyond our ability to correctly detached them.
•The determination of domain, family membership and its boundries for multidomain
protein is a vital step in structure annotation/ prediction.
•In brief, most domain protein partial methods relay on hierarchy searching for
domains in query sequence with collection of primary sequence methods, domains
library search and matches to structural domains in the PDB.
8
Advantages of This Method:
•The method is fully automated, and the methodology is the same regardless of the
existing homology between the query protein and the proteins in the structural
database. Thus, it can be easily applied to the structural annotation on a genomic
scale.
•A large success rate, which is competitive with other methods (a large fraction of
correct and accurate predictions), could be expected for the following types of
proteins.
The most advanced ab­initio method is fragment assembly
•Consists by breaking up the sequence in small subsegments of 3 to 9 residues  and 
generating structure for these segments based on a large library of known fragments.
•Decoys are generated from all possible combinations of fragments.
•An energy minimization process is applied to all decoys.
9
•Decoys are clustered and the final models are selected from the center of the largest
clusters.
Complexity of abinitio methods Can be easily explained by the understanding 
conformational complexity
Complexity of    abinitio    methods:
•3100 = 5 × 1047 conformations
•Fastest motions 10- 15 sec so sampling all
•conformations would take 5 × 1032 sec
•60 × 60 × 24 × 365 = 31536000 = 3.1536 x 107
•seconds in a year
•Sampling all conformations will take 1.6 × 1025 years
•years, much longer than the age of the universe
10
Ab initio methods have recently received increased 
attention in the prediction of loops:
•Loops exhibit greater structural variability than Beta-sheets and Alpha helices.
•Loop structure therefore is considerably more difficult to predict than the structure
of the geometrically highly regular Beta-sheets and Alpha helices.
•Loops are often exposed to the surface of proteins and contribute to active and binding sites. 
Consequently, loops are crucial for protein function.
Protein folding and    de novo    protein design for 
biotechnological applications:
Advances and challenges in the fields of protein structure prediction and de novo
protein design focusing on the interplay necessary for success. schematically shows
the roadmap and key challenges in protein structure prediction and de novo protein
design. The past few years have shown impressive applications of computational
structure prediction and design to biotechnology, spanning peptide or antibody
therapeutics, novel biocatalysts, and self-assembling nanomaterials.
Fig: Roadmap of key challenges in understanding how to predict protein sequence to structure to
function and design. Structure prediction begins with a primary amino acid sequence
11
Table. Summary of recent successful computational de novo designed and
redesigned systems and their biotechnological applications
source: http://www.sciencedirect.com/science/article/pii/S0167779913002266#
Limitations of De novo Prediction Methods:
•Pure ab­initio modelling is still very costly  and ineffective  but hybrid 
homology/ab­ initio methods  such as fragment  assembly have better performance
•A major limitation of de novo protein prediction methods is the extraordinary 
amount of computer time required to successfully solve for the native confirmation of
a protein.
•Distributed methods, such as Rosetta@home, have attempted to ameliorate this by 
recruiting individuals who then volunteer idle home computer time in order to 
process data.
•Even these methods face challenges, however. For example, a distributed method 
was utilized by a team of researchers at the University of Washington and the 
Howard Hughes Medical Institute to predict the tertiary structure of the protein 
T0283 from its amino acid sequence. In a blind test comparing the accuracy of this 
distributed technique with the experimentally confirmed structure deposited within 
the Protein Databank (PDB), the predictor produced excellent agreement with the 
deposited structure.
•However, the time and number of computers required for this feat was enormous – 
almost two years and approximately 70,000 home computers, respectively.
“One method proposed to overcome such limitations involves the use of Markov 
models (see Markov chain Monte Carlo). One possibility is that such models could
be constructed in order to assist with free energy computation and protein 
structure prediction, perhaps by refining computational simulations”
CASPs:
•“Progress for all variants of computational protein structure prediction methods is 
assessed in the biannual, community wide Critical Assessment of Protein Structure 
Prediction (CASP) experiments.
12
•To assess the current status of protein structure prediction, John Moult proposed the 
CASP (Critical Assessment of Techniques for Protein Structure Prediction) 
community­wide protein structure prediction experiment.
•The idea is that experimentalists who are about to determine protein structures make 
the sequences of the proteins available and then the protein structure prediction 
community makes predictions that are then assessed by independent reviewers.
•Attendees tested recently developed ab initio protein structure predictions methods
during the CASP3 exercises, conducted in December 1998 in Asilomar, California.
•Among the best performing ab initio methods was the Rosetta method developed by 
David Baker and coworkers.
Application of Denovo structure prediction:
Genome functional annotation and structure genomics initiate two areas of
research where ab initio protein structured prediction could take important
contributions.
1. Genome annotation:
a. The annotation of open reading frames lacking detectable sequence homology to protein of
known function represents a promising applicable for ab initio model.
Low resolution ab initio predicted structure and functional relationships between proteins not
apparent from sequence similarity alone.
Note:- This concept is well illustrated by some example of prediction from CASP4.
b. Ab initio structure could be probed for the presense of residue adopting conserved geometric
motifs (eg. Serin protease catalysis traids).
2) structural genomics initiatives:
a) an initio structure prediction can help guide target selection by focussing experimental structure
determination on those proteins likely to adopt novel folds or to be of particular biological
importance
b) an initio technique do not face the limitations which comes in homology modelling applied on
genomic scale ( need for at least one homologue of known structure with good coverage).
Thus, may be a valuable adjunct to homology methods, filling in structural gaps and
producing much more complete set of model.
13
List of de novo protein structure prediction
software:
Name Method Description Link
EVfold
Evolutionary couplings calculated from correlated
mutations in a protein family, used to predict 3D
structure from sequences alone and to predict
functional residues from coupling strengths. Predicts
both globular and transmembrane proteins.
Webserver
http://evfold
.org/evfold-
web/evfold.
do
QUARK Monte Carlo fragment assembly
On-line server for
protein modeling (best
for ab initio folding in
CASP9)
http://zhang
lab.ccmb.m
ed.umich.ed
u/QUARK/
NovaFold Combination of threading and ab initio folding
Commercial protein
structure prediction
application
http://www.
dnastar.com
/t-products-
NovaFold.a
spx
I-TASSER Threading fragment structure reassembly
On-line server for
protein modeling
http://zhang
lab.ccmb.m
ed.umich.ed
u/I-
TASSER/
Selvita Protein
Modeling Platform
Package of tools for protein modeling
Interactive webserver
and standalone program
including: CABS ab
initio modeling
http://www.
selvita.com/
selvita-
protein-
modeling-
platform.ht
ml
ROBETTA
Rosetta homology modeling and ab initio fragment
assembly with Ginzu domain prediction
Webserver
http://www.
robetta.org/
Rosetta@home
Distributed-computing implementation of Rosetta
algorithm
Downloadable program
http://boinc.
bakerlab.org
/rosetta/
CABS Reduced modeling tool Downloadable program
CABS-FOLD
Server for de novo modeling, can also use alternative
templates (consensus modeling).
Webserver
http://bioco
mp.chem.u
w.edu.pl/C
ABSfold/
Bhageerath
A computational protocol for modeling and
predicting protein structures at the atomic level.
Webserver
http://www.
scfbio-
iitd.res.in/b
hageerath/in
dex.jsp
Abalone Molecular Dynamics folding Program
PEP-FOLD
De novo approach, based on a HMM structural
alphabet
On-line server for
peptide structure
prediction
http://bioser
v.rpbs.univ-
paris-
diderot.fr/se
rvices/PEP-
FOLD/
14
References:
1.Structural Bioinformatics, 2nd
edition (Wiely-Blackwell)
2.Introduction to Bioinfoematics by Zhumur Ghosh
3. http://spin.niddk.nih.gov/bax/software/CSROSETTA/
4 http://zhanglab.ccmb.med.umich.edu/papers/2009_8.pdf
5.http://www.ercim.eu/publication/Ercim_News/enw43/ber
nasconi.html
6.http://zhanglab.ccmb.med.umich.edu/papers/2009_4.pdf
7. http://www2.denizyuret.com/ref/simons/baker-rosetta.pdf
8.http://en.wikipedia.org/wiki/De_novo_protein_structure_
prediction

Weitere ähnliche Inhalte

Was ist angesagt?

Protein Structure Determination
Protein Structure DeterminationProtein Structure Determination
Protein Structure Determination
Amjad Ibrahim
 
Protein-Ligand Docking
Protein-Ligand DockingProtein-Ligand Docking
Protein-Ligand Docking
baoilleach
 

Was ist angesagt? (20)

Protein modeling
Protein modelingProtein modeling
Protein modeling
 
Secondary protein structure prediction
Secondary protein structure predictionSecondary protein structure prediction
Secondary protein structure prediction
 
RNA secondary structure prediction
RNA secondary structure predictionRNA secondary structure prediction
RNA secondary structure prediction
 
Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins
 
protein data bank
protein data bankprotein data bank
protein data bank
 
Homology modeling
Homology modelingHomology modeling
Homology modeling
 
Protein Structure Determination
Protein Structure DeterminationProtein Structure Determination
Protein Structure Determination
 
In silico structure prediction
In silico structure predictionIn silico structure prediction
In silico structure prediction
 
Protein Threading
Protein ThreadingProtein Threading
Protein Threading
 
Protein Structure Prediction
Protein Structure PredictionProtein Structure Prediction
Protein Structure Prediction
 
Role of bioinformatics of drug designing
Role of bioinformatics of drug designingRole of bioinformatics of drug designing
Role of bioinformatics of drug designing
 
Swiss pdb viewer
Swiss pdb viewerSwiss pdb viewer
Swiss pdb viewer
 
Protein Structure, Databases and Structural Alignment
Protein Structure, Databases and Structural AlignmentProtein Structure, Databases and Structural Alignment
Protein Structure, Databases and Structural Alignment
 
Protein-Ligand Docking
Protein-Ligand DockingProtein-Ligand Docking
Protein-Ligand Docking
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
Protein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on RosettaProtein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on Rosetta
 
Errat
ErratErrat
Errat
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Gene prediction method
Gene prediction method Gene prediction method
Gene prediction method
 
Scop database
Scop databaseScop database
Scop database
 

Andere mochten auch

Corrected draft programme dec 2014
Corrected draft programme dec 2014Corrected draft programme dec 2014
Corrected draft programme dec 2014
rahulsne
 
Puget Puget sound restoration 424 final
Puget Puget sound restoration 424 finalPuget Puget sound restoration 424 final
Puget Puget sound restoration 424 final
jketchu
 

Andere mochten auch (20)

Shweta ppt I1FFL
Shweta ppt I1FFLShweta ppt I1FFL
Shweta ppt I1FFL
 
1
11
1
 
Speaking out mena.def
Speaking out mena.defSpeaking out mena.def
Speaking out mena.def
 
Paper dhh
Paper dhhPaper dhh
Paper dhh
 
Be475 mid term j-b
Be475 mid term j-bBe475 mid term j-b
Be475 mid term j-b
 
พื้นฐาน Adobe Flash CS3 ตอนที่2
พื้นฐาน Adobe Flash CS3 ตอนที่2พื้นฐาน Adobe Flash CS3 ตอนที่2
พื้นฐาน Adobe Flash CS3 ตอนที่2
 
พื้นฐาน Adobe Flash CS3 ตอนที่5
พื้นฐาน Adobe Flash CS3 ตอนที่5พื้นฐาน Adobe Flash CS3 ตอนที่5
พื้นฐาน Adobe Flash CS3 ตอนที่5
 
L1 handout
L1 handoutL1 handout
L1 handout
 
Kurikulum 2013
Kurikulum 2013Kurikulum 2013
Kurikulum 2013
 
Ikeday
IkedayIkeday
Ikeday
 
Genre research.
Genre research.Genre research.
Genre research.
 
Sm11415 20140807
Sm11415 20140807Sm11415 20140807
Sm11415 20140807
 
Corrected draft programme dec 2014
Corrected draft programme dec 2014Corrected draft programme dec 2014
Corrected draft programme dec 2014
 
Démarrer une session de formation
Démarrer une session de formationDémarrer une session de formation
Démarrer une session de formation
 
Puget
PugetPuget
Puget
 
X ray
X ray X ray
X ray
 
Wings power point1
Wings power point1Wings power point1
Wings power point1
 
Be475 final j b version 3
Be475 final j b version 3Be475 final j b version 3
Be475 final j b version 3
 
Burak gundogdu
Burak gundogduBurak gundogdu
Burak gundogdu
 
Puget Puget sound restoration 424 final
Puget Puget sound restoration 424 finalPuget Puget sound restoration 424 final
Puget Puget sound restoration 424 final
 

Ähnlich wie De novo str_prediction

Computational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptxComputational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptx
ashharnomani
 
58.Comparative modelling of cellulase from Aspergillus terreus
58.Comparative modelling of cellulase from Aspergillus terreus58.Comparative modelling of cellulase from Aspergillus terreus
58.Comparative modelling of cellulase from Aspergillus terreus
Annadurai B
 
Comparative Protein Structure Modeling and itsApplications
Comparative Protein Structure Modeling and itsApplicationsComparative Protein Structure Modeling and itsApplications
Comparative Protein Structure Modeling and itsApplications
LynellBull52
 
protein structure prediction presentation .pptx
protein structure prediction presentation .pptxprotein structure prediction presentation .pptx
protein structure prediction presentation .pptx
abdulahad563527
 

Ähnlich wie De novo str_prediction (20)

protein design, principles and examples.pptx
protein design, principles and examples.pptxprotein design, principles and examples.pptx
protein design, principles and examples.pptx
 
protein Modeling Abi.pptx
protein Modeling Abi.pptxprotein Modeling Abi.pptx
protein Modeling Abi.pptx
 
Protien Structure Prediction
Protien Structure PredictionProtien Structure Prediction
Protien Structure Prediction
 
threading and homology modelling methods
threading and homology modelling methodsthreading and homology modelling methods
threading and homology modelling methods
 
Computational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptxComputational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptx
 
L1Protein_Structure_Analysis.pptx
L1Protein_Structure_Analysis.pptxL1Protein_Structure_Analysis.pptx
L1Protein_Structure_Analysis.pptx
 
Computational Prediction of Protein Structure.pptx
Computational Prediction of Protein Structure.pptxComputational Prediction of Protein Structure.pptx
Computational Prediction of Protein Structure.pptx
 
Computer Aided Molecular Modeling
Computer Aided Molecular ModelingComputer Aided Molecular Modeling
Computer Aided Molecular Modeling
 
Modelling Proteins By Computational Structural Biology
Modelling Proteins By Computational Structural BiologyModelling Proteins By Computational Structural Biology
Modelling Proteins By Computational Structural Biology
 
cloudapplications.pptx
cloudapplications.pptxcloudapplications.pptx
cloudapplications.pptx
 
Cloud applications - Protein Structure Predication and gene expression data...
Cloud applications - Protein Structure Predication  and  gene expression data...Cloud applications - Protein Structure Predication  and  gene expression data...
Cloud applications - Protein Structure Predication and gene expression data...
 
58.Comparative modelling of cellulase from Aspergillus terreus
58.Comparative modelling of cellulase from Aspergillus terreus58.Comparative modelling of cellulase from Aspergillus terreus
58.Comparative modelling of cellulase from Aspergillus terreus
 
Protein structure 2
Protein structure 2Protein structure 2
Protein structure 2
 
Protein struc pred-Ab initio and other methods as a short introduction.ppt
Protein struc pred-Ab initio and other methods as a short introduction.pptProtein struc pred-Ab initio and other methods as a short introduction.ppt
Protein struc pred-Ab initio and other methods as a short introduction.ppt
 
Homology modeling: Modeller
Homology modeling: ModellerHomology modeling: Modeller
Homology modeling: Modeller
 
Comparative Protein Structure Modeling and itsApplications
Comparative Protein Structure Modeling and itsApplicationsComparative Protein Structure Modeling and itsApplications
Comparative Protein Structure Modeling and itsApplications
 
Class powerpoint.ppt
Class powerpoint.pptClass powerpoint.ppt
Class powerpoint.ppt
 
Drug discovery presentation
Drug discovery presentationDrug discovery presentation
Drug discovery presentation
 
protein structure prediction presentation .pptx
protein structure prediction presentation .pptxprotein structure prediction presentation .pptx
protein structure prediction presentation .pptx
 
demonstration lecture on Homology modeling
demonstration lecture on Homology modelingdemonstration lecture on Homology modeling
demonstration lecture on Homology modeling
 

Mehr von ShwetA Kumari

Mehr von ShwetA Kumari (12)

Combanitorial approach for drug discovery
Combanitorial approach for drug discoveryCombanitorial approach for drug discovery
Combanitorial approach for drug discovery
 
Final shweta protein kinase cascade
Final shweta protein kinase cascadeFinal shweta protein kinase cascade
Final shweta protein kinase cascade
 
Data mining
Data miningData mining
Data mining
 
Distance measure between two biological sequences
Distance  measure between  two biological  sequences Distance  measure between  two biological  sequences
Distance measure between two biological sequences
 
modelling assignment
modelling assignmentmodelling assignment
modelling assignment
 
paper ppt 2ndry str in globular protein
paper ppt 2ndry str in globular proteinpaper ppt 2ndry str in globular protein
paper ppt 2ndry str in globular protein
 
Aminiacid Selenocysteine and Pyrrolysine
Aminiacid Selenocysteine and PyrrolysineAminiacid Selenocysteine and Pyrrolysine
Aminiacid Selenocysteine and Pyrrolysine
 
dot plot analysis
dot plot analysisdot plot analysis
dot plot analysis
 
protein kinase cascade
protein kinase cascadeprotein kinase cascade
protein kinase cascade
 
Evolution
EvolutionEvolution
Evolution
 
Natural selection
Natural selection Natural selection
Natural selection
 
Genetic drift
Genetic drift Genetic drift
Genetic drift
 

Kürzlich hochgeladen

Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
ssuser79fe74
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
AlMamun560346
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Lokesh Kothari
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
Sérgio Sacani
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
gindu3009
 

Kürzlich hochgeladen (20)

Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 

De novo str_prediction

  • 1. 1 Central University of Bihar BIS 553: protein modelling and simulation Denovo structure prediction Submitted to:- Submitted by:- Dr. Durg Vijay Singh Shweta Kumari Roll no- 21 2 nd semester Central University of Bihar Patna
  • 2. 2 CONTENT Sl. No Topic Page No. 1 Introduction 3 2 Need of Ab inito prediction 4 3 Challenges 4 4 Principle of ab inito method 4 5 Denovo Structure Prediction V/S Template Based Structure Prediction 4-5 6 Successful De Novo Modeling Requirements 6 7 Results from ab­initio 7 8 Domain prediction 7-8 9 Advantages of This Method 8-9 10 Complexity of abinitio methods 9 11 Ab initio methods have recently received  increased attention in the prediction of loops 10 12 Protein folding and de novo protein design for  biotechnological applications 10 13 Limitations of De novo Prediction Methods 11 14 CASPs 11-12 15 Application of Denovo structure prediction 12 16 List of de novo protein structure prediction  software 12-13 17 References 14
  • 3. 3 Introduction: Predicting the 3D structure without any “prior knowledge” •Predicting protein 3D structures from the amino acid sequence still remains as an unsolved problem after five decades of efforts. If the target protein has a homologue already solved, the task is relatively easy and high-resolution models can be built by copying the framework of the solved structure. •However, such a modelling procedure does not help answer the question of how and why a protein adopts its specific structure. If structure homologues (occasionally analogues) do not exist, or exist but cannot be identified, models have to be constructed from scratch. This procedure, called ab initio modelling. •Ab initio modelling is essential for a complete solution to the protein structure prediction problem; it can also help us understand the physicochemical principle of how proteins fold in nature. Thus,” In computational biology, de novo protein structure prediction refers to an algorithmic process by which protein tertiary structure is predicted from its amino acid primary sequence.”
  • 4. 4 Need of Ab inito prediction: • First, in some cases, even a remotely related structural homologue may not be  available. • Second, new structure continue to be discovered which could not have been  identified by methods which rely on comparison to known structure. • Third, knowledge­based methods have been criticized for predicting protein  protein structures without having to obtain a fundamental understanding of the  mechanisms and driving forces of structure formation. Ab initio methods, in  contrast, base their predictions on physical models for these mechanisms Challenges: • Energy functions that can  reliable   discriminat e   native and non­native   structures. • Enormous amount of computations. Principle of ab inito method: •It is based on the ‘thermodynamic hypothesis’, which states that the native  structure of a protein is the one for which the free energy achieves the global  minimum. •ANFINSEN (1973) showed that all the information necessary for a protein to fold  to the native state residue in the protein sequence. •In the absence of large kinetic barriers in the force energy landscape, Anfinsen's  result and those of large numbers of researchers in the intervening year suggest that  the native confoermations of most proteins are the lowest free energy conformation  for their sequences. Denovo Structure Prediction V/S Template Based Structure Prediction: •De novo protein structure modeling is distinguished from Template-based modeling (TBM) by the fact that no solved homolog to the protein of interest is known, making efforts to predict protein structure from amino acid sequence exceedingly difficult.
  • 5. 5 Sl. No. Homology modelling Fold Reconition De novo prediction 1 templete based modelling templete based remote homology modelling templete free modelling 2 applicable to sequence having >= 30% homogy on PDB database applicable to sequence having <20% homogy on PDB database applicable to any sequence does not having homologue on PDB 3 length of sequence is not limited greater than 150AA not applicable 4 limited search space search space greater than homology modelling very large search space 5 more accurate structure generate less accurate structure than homology generate least accurate structure 6 model quality at atomic level model quality at fold level model quality at atomic level 7 computationally less expensive computationally more expensive than homology modelling computationally most expensive 8 applicable in Drug designing, virtual screening, designing site directed mutagenisis, characterization of active site applicable in Prediction of protein family, functional characterization by folding assignment applicable in genome annotation, domain prediction and structural genomics initiatives
  • 6. 6 Template­based methods begin with a sequence, predict the secondary structure, and  attempt to find a template structure and/or fragments from existing structures in the  PDB that will fold similar to the target sequence. source: http://isw3.naist.jp/IS/Bio-Info-Unit/gogroup/study/study-en.html “Denovo structure prediction method mean it doesnot relay on homology between the query sequence and a sequence in the Protein Data Bank(PDB) to create a template for structure prediction” Successful De Novo Modeling Requirements: •De novo conformation predictors usually function by producing candidate  conformations (decoys) and then choosing amongst them based on their  thermodynamic stability and energy state. Most successful predictors will have the  following three factors in common: 1.  An accurate energy function that corresponds the most thermodynamically  stable state to the native structure of a protein 2.  An efficient search method capable of quickly identify low­energy states  through conformational search 3. The ability to select native­like models from a collection of decoy structures.
  • 7. 7 Results from ab­initio: •Average error 5 Å -Average error 5 Å - 10 Å10 Å •Function cannot beFunction cannot be predictedpredicted ••Long simulationsLong simulations fig:fig: Some protein from ESome protein from E.coli.coli predicted at 7.6 Åpredicted at 7.6 Å (CASP3, H.Scheraga)(CASP3, H.Scheraga) Domain prediction: •Domain prediction is a critical pre resquisite to the structur prediction “As the size of the protein increases, its conformational space also increases.” •Current denovo methods are limited to protein domain of 150 amino acid domain residue for alpha-beta protein. •80 residue for beta folds and 150 for alpha fold only. •To overcome this two approaches can apply- 1. Increase the size range of denovo structure prediction. 2. Dividing protein into domains prior to attempting two protein structure prediction. •"A domain is generally define as a portion of protein that folds independently of the rest of the protein." •So dividing a query sequence into their smallest component domain prior to folding is straight forward way to increase the size of the predictio. •For many proteins domains division can be easily found while several domain remains beyond our ability to correctly detached them. •The determination of domain, family membership and its boundries for multidomain protein is a vital step in structure annotation/ prediction. •In brief, most domain protein partial methods relay on hierarchy searching for domains in query sequence with collection of primary sequence methods, domains library search and matches to structural domains in the PDB.
  • 8. 8 Advantages of This Method: •The method is fully automated, and the methodology is the same regardless of the existing homology between the query protein and the proteins in the structural database. Thus, it can be easily applied to the structural annotation on a genomic scale. •A large success rate, which is competitive with other methods (a large fraction of correct and accurate predictions), could be expected for the following types of proteins. The most advanced ab­initio method is fragment assembly •Consists by breaking up the sequence in small subsegments of 3 to 9 residues  and  generating structure for these segments based on a large library of known fragments. •Decoys are generated from all possible combinations of fragments. •An energy minimization process is applied to all decoys.
  • 9. 9 •Decoys are clustered and the final models are selected from the center of the largest clusters. Complexity of abinitio methods Can be easily explained by the understanding  conformational complexity Complexity of    abinitio    methods: •3100 = 5 × 1047 conformations •Fastest motions 10- 15 sec so sampling all •conformations would take 5 × 1032 sec •60 × 60 × 24 × 365 = 31536000 = 3.1536 x 107 •seconds in a year •Sampling all conformations will take 1.6 × 1025 years •years, much longer than the age of the universe
  • 10. 10 Ab initio methods have recently received increased  attention in the prediction of loops: •Loops exhibit greater structural variability than Beta-sheets and Alpha helices. •Loop structure therefore is considerably more difficult to predict than the structure of the geometrically highly regular Beta-sheets and Alpha helices. •Loops are often exposed to the surface of proteins and contribute to active and binding sites.  Consequently, loops are crucial for protein function. Protein folding and    de novo    protein design for  biotechnological applications: Advances and challenges in the fields of protein structure prediction and de novo protein design focusing on the interplay necessary for success. schematically shows the roadmap and key challenges in protein structure prediction and de novo protein design. The past few years have shown impressive applications of computational structure prediction and design to biotechnology, spanning peptide or antibody therapeutics, novel biocatalysts, and self-assembling nanomaterials. Fig: Roadmap of key challenges in understanding how to predict protein sequence to structure to function and design. Structure prediction begins with a primary amino acid sequence
  • 11. 11 Table. Summary of recent successful computational de novo designed and redesigned systems and their biotechnological applications source: http://www.sciencedirect.com/science/article/pii/S0167779913002266# Limitations of De novo Prediction Methods: •Pure ab­initio modelling is still very costly  and ineffective  but hybrid  homology/ab­ initio methods  such as fragment  assembly have better performance •A major limitation of de novo protein prediction methods is the extraordinary  amount of computer time required to successfully solve for the native confirmation of a protein. •Distributed methods, such as Rosetta@home, have attempted to ameliorate this by  recruiting individuals who then volunteer idle home computer time in order to  process data. •Even these methods face challenges, however. For example, a distributed method  was utilized by a team of researchers at the University of Washington and the  Howard Hughes Medical Institute to predict the tertiary structure of the protein  T0283 from its amino acid sequence. In a blind test comparing the accuracy of this  distributed technique with the experimentally confirmed structure deposited within  the Protein Databank (PDB), the predictor produced excellent agreement with the  deposited structure. •However, the time and number of computers required for this feat was enormous –  almost two years and approximately 70,000 home computers, respectively. “One method proposed to overcome such limitations involves the use of Markov  models (see Markov chain Monte Carlo). One possibility is that such models could be constructed in order to assist with free energy computation and protein  structure prediction, perhaps by refining computational simulations” CASPs: •“Progress for all variants of computational protein structure prediction methods is  assessed in the biannual, community wide Critical Assessment of Protein Structure  Prediction (CASP) experiments.
  • 12. 12 •To assess the current status of protein structure prediction, John Moult proposed the  CASP (Critical Assessment of Techniques for Protein Structure Prediction)  community­wide protein structure prediction experiment. •The idea is that experimentalists who are about to determine protein structures make  the sequences of the proteins available and then the protein structure prediction  community makes predictions that are then assessed by independent reviewers. •Attendees tested recently developed ab initio protein structure predictions methods during the CASP3 exercises, conducted in December 1998 in Asilomar, California. •Among the best performing ab initio methods was the Rosetta method developed by  David Baker and coworkers. Application of Denovo structure prediction: Genome functional annotation and structure genomics initiate two areas of research where ab initio protein structured prediction could take important contributions. 1. Genome annotation: a. The annotation of open reading frames lacking detectable sequence homology to protein of known function represents a promising applicable for ab initio model. Low resolution ab initio predicted structure and functional relationships between proteins not apparent from sequence similarity alone. Note:- This concept is well illustrated by some example of prediction from CASP4. b. Ab initio structure could be probed for the presense of residue adopting conserved geometric motifs (eg. Serin protease catalysis traids). 2) structural genomics initiatives: a) an initio structure prediction can help guide target selection by focussing experimental structure determination on those proteins likely to adopt novel folds or to be of particular biological importance b) an initio technique do not face the limitations which comes in homology modelling applied on genomic scale ( need for at least one homologue of known structure with good coverage). Thus, may be a valuable adjunct to homology methods, filling in structural gaps and producing much more complete set of model.
  • 13. 13 List of de novo protein structure prediction software: Name Method Description Link EVfold Evolutionary couplings calculated from correlated mutations in a protein family, used to predict 3D structure from sequences alone and to predict functional residues from coupling strengths. Predicts both globular and transmembrane proteins. Webserver http://evfold .org/evfold- web/evfold. do QUARK Monte Carlo fragment assembly On-line server for protein modeling (best for ab initio folding in CASP9) http://zhang lab.ccmb.m ed.umich.ed u/QUARK/ NovaFold Combination of threading and ab initio folding Commercial protein structure prediction application http://www. dnastar.com /t-products- NovaFold.a spx I-TASSER Threading fragment structure reassembly On-line server for protein modeling http://zhang lab.ccmb.m ed.umich.ed u/I- TASSER/ Selvita Protein Modeling Platform Package of tools for protein modeling Interactive webserver and standalone program including: CABS ab initio modeling http://www. selvita.com/ selvita- protein- modeling- platform.ht ml ROBETTA Rosetta homology modeling and ab initio fragment assembly with Ginzu domain prediction Webserver http://www. robetta.org/ Rosetta@home Distributed-computing implementation of Rosetta algorithm Downloadable program http://boinc. bakerlab.org /rosetta/ CABS Reduced modeling tool Downloadable program CABS-FOLD Server for de novo modeling, can also use alternative templates (consensus modeling). Webserver http://bioco mp.chem.u w.edu.pl/C ABSfold/ Bhageerath A computational protocol for modeling and predicting protein structures at the atomic level. Webserver http://www. scfbio- iitd.res.in/b hageerath/in dex.jsp Abalone Molecular Dynamics folding Program PEP-FOLD De novo approach, based on a HMM structural alphabet On-line server for peptide structure prediction http://bioser v.rpbs.univ- paris- diderot.fr/se rvices/PEP- FOLD/
  • 14. 14 References: 1.Structural Bioinformatics, 2nd edition (Wiely-Blackwell) 2.Introduction to Bioinfoematics by Zhumur Ghosh 3. http://spin.niddk.nih.gov/bax/software/CSROSETTA/ 4 http://zhanglab.ccmb.med.umich.edu/papers/2009_8.pdf 5.http://www.ercim.eu/publication/Ercim_News/enw43/ber nasconi.html 6.http://zhanglab.ccmb.med.umich.edu/papers/2009_4.pdf 7. http://www2.denizyuret.com/ref/simons/baker-rosetta.pdf 8.http://en.wikipedia.org/wiki/De_novo_protein_structure_ prediction