SlideShare ist ein Scribd-Unternehmen logo
1 von 63
PROTEIN DATA BANK(PDB)
CONTENTS

 Introduction
 Supported and funded by

 History

 PDB Holdings list

 Member organizations

 Task forces

 PDB ID

 PDB File format

 Browse to WWW.RCSB.ORG/PDB/
PROTEIN DATA BANK
                                 PDB
   Single worldwide database and hundreds of secondary databases categorize
    the data differently.

   Key resource in the area of structural biology, stores 3D structural data of
    large biological molecules such as Proteins and Nucleic acids.

   Data is submitted by Biologists and Biochemists from all around the world
    to be freely accessible on internet via its member organizations’ websites
    and is updated weekly.

   The mission is to maintain a single Protein Data Bank Archive of
    Macromolecular Structural data.
SUPPORTED AND FUNDED
The Protein Data Bank (PDB) is operated by:

 Rutgers, The State University of New Jersey.
 The San Diego Supercomputer Center at the University of California, San
  Diego.

   The Center for Advanced Research in Biotechnology of the National
    Institute of Standards and Technology -- the Research Collaboratory for
    Structural Bioinformatics (RCSB)

   The PDB is supported by funds from the National Science Foundation, the
    Department of Energy, and the National Institutes of Health.
PDB HISTORY

Two forces to initiate PDB:
  Growing collection of sets of protein structural data by X-Ray diffraction.
   Brookhaven Raster Display (BRAD), a molecular graphics display to
  visualize protein structures in 3D, emerged in 1968.

   In 1969, Dr Edger Meyer began to write software to store atomic
    coordinates files in a common format to make them available for geometric
    and graphical evaluation (with sponsorship of Dr Walton Hamilton at
    Bookhaven National Laboratory.

   In 1971, one of Dr Meyer’s programs- SEARCH- enabled networking i.e
    enabled the researchers to access information from database to study protein
    structures offline.
   In 1973, upon Hamilton’s death, Dr Tom Koetzle took over direction of PDB
    for 20 years.
   mmCIF project completed and Structural genomics began in 1970s.

   In 1980s, IUCr guidelines established, number of structures deposited increases
    and independent biological databases established – e.g., the NDB.

   In Oct, 1998; PDB was transferred to Research Collaboratory for Structural
    Bioinformatics (RCSB), complete transfer since 1999. Dr Helen M Berman of
    Rutgers University was the new director.

   In 2003, with the formation of wwPDB, the PDB became an international
    organization having three member organizations.

   !n 2006, the BMRB joined PDB.
PDB HOLDINGS LIST ( AS OF 25 OCT, 2011)


        Experimental                                            Protein/Nucleic Acid
                                Proteins    Nucleic Acids                                   Other   Total
            Method                                                      complexes

X-ray diffraction                  62750              1323                        3050          2   67125

NMR                                 7962               960                         179          7    9108

Electron microscopy                  262                22                             96       0     380

Hybrid                                 41                   3                           1       1       46

Other                                133                    4                           5      13     155

                       Total:      71148              2312                        3331         23   76814
MEMBER ORGANIZATIONS
   Act as Data deposition, Data processing and Distribution centers for PDB data.

   Three are founding member organizations:

   PDBe…Protein Data Bank in Europe.
   PDBj…Protein Data Bank in Japan.
   RCSB…Research Collaboratory for Structural Bioinformatics.

   The Biological Magnetic Resonance Data Bank (BMRB) joined later in 2006.

   Another organization Worldwide Protein Data Bank (wwPDB) oversees PDB.
    wwPDB reviews and annotates each submitted entry and then it is automatically
    checked for plausibility( the source code for validation software is available.
TASK FORCES
   X-Ray diffraction (most of the structures)…approximations of the
    coordinates of atoms of proteins are obtained. E.g lyzozyme.

   NMR (about 15% e.g, haemoglobin)…estimations of distances between
    pairs of atoms of proteins. Final conformation is obtained after solving
    distance geometry problem.

   Cryo Electron Microscopy (very few protein e.g, crsysalin).
PDB IDENTIFIER (PDB ID)
   Each structure published in PDB receives a four character alphanumeric
    identifier or accession number. Like, 1ANG or 4hhb.

   However, this cant be used as an identifier for biomolecules. Because
    several structures for the same molecule in different environments or
    conformations-are contained in PDB with different PDB IDs.
PDB FILE FORMAT

   Standard data representation…encoded in data dictionary. The metadata
    model supporting this representation is used by all PDB data processing and
    database software tools.

   PDB file format was restricted to 80 characters per line initially.
   In 1996, macromolecular Crystallographic Information File (mmCIF) format
    started.
   In 2005, XML version called as PDBML, was described.

   The structure files can be downloaded in any of these three formats.
   The files are easily downloaded into graphics packages as well, using web
    services.
PDB 3D data file format

   ASCII
   column based: 80 columns per line
   KEYWORD for record type at col.#1
   Header records
   Structure records
   Atom records (containing coordinates)
   ATOM, HETATM, ..., TER
PDB format
                        Coordinate Section


ATOM record                          Biopolymer residue atom
HETATM record                        nonBiopolymer atom
TER record                           chain terminator




123456789012345678901234567890123456789012345678901234567
89012345678901234567890
ATOM 1 N ALA 1 11.104 6.134 -6.504 1.00 0.00 N
ATOM 2 CA ALA 1 11.639 6.071 -5.147 1.00 0.00 C
...
ATOM 293 1HG GLU 18 -14.861 -4.847 0.361 1.00 0.00 H
ATOM 294 2HG GLU 18 -13.518 -3.769 0.084 1.00 0.00 H
TER 295 GLU 18
HETATM 5555 CA 0.000 0.000 0.000
mmCIF

    mmCIF is the acronym for the macromolecular Crystallographic
    Information File.
    mmCIF is based on a subset of the syntax rules for the Self
    Defining Text Archive (STAR) file.
    A Dictionary Description Language (DDL) defines the structure of
    mmCIF dictionaries. Dictionaries provide the metadata which define
    the content of mmCIF data files.
    mmCIF data files, dictionaries and DDLs are all expressed in a
    common syntax.
POINT YOUR BROWSER TO:
WWW.RCSB.ORG/PDB/

put either a search term (for example, a protein name) or a
PDB number
2DN2
UMIN
       HAEMOGLOBIN
PROTEIN ANNOTATION
   If the contents of the PDB are thought of as primary
    data, then there are hundreds of derived (i.e.,
    secondary) databases that categorize the data
    differently. For example,
    both SCOPand CATH categorize structures
    according to type of structure and assumed
    evolutionary relations; GO categorize structures
    based on genes.
The Structural Classification
of Proteins (SCOP) database is
a largely manual classification of
protein structural domains based
on similarities of
their structures and amino
acid sequences
Class:the overall secondary-structure content of the domain
Architecture:high structural similarity but no evidence
of homology.
Topology:a large-scale grouping of topologies which share
particular structural features
Homologous superfamily:indicative of a demonstrable
evolutionary relationship.
Pfam is a
database
of protein
families that
includes their
annotations
and multiple
sequence
alignment
generated
using hidden
Markov models
CLICK
Select your
desire method
CLICK
CLICK
CLICK
CLICK
& FINALLY
Show the gradual updating
Structural View of Biology   released entries
You can also
select different
display view
can also download in
different view
VIEWING THE DATA
   56,523 structures In PDB have structure factor files.
   6410 structures In PDB have NMR restraint files.
   198 structures In PDB have chemical shifts files.
   Text file can be viewed or modified in editor.
   Structure files may be viewed using various free and commercial
    visualizations programs and Web browsers plug-ins like
   OPEN SOURCE PDB softweres
   Jmol
   Molekel
   MeshLab(able to import PDB data set and buildup surfaces from them)
   QuteMol
   Avogadro
   And others open but not free , like
   PYMOL , RASMOL, VIST PROT 3DS & STAR BIOCHEM
HTTP://WWW.RCSB.ORG/PDB/STATIC.DO
          ?P=SOFTWARE/SOFTWARE_LINKS/MOL
          ECULAR_GRAPHICS.HTML




The RCSB PDB website contains
an extensive list of both free and
commercial molecule visualization
programs and web browser plug-in.
LIMITATION
   The Protein Data Bank (PDB) is the central archive of
    experimentally solved biomolecular structures. However, the PDB
    only allows data retrieval and does not provide functionality for
    collaboration or user feedback.
    In contrast, PDBWiki allows for sharing expert knowledge about
    structures deposited in the PDB. It provides tools for discussing and
    annotating proteins in a collaborative way. The goal is to create a
    central and freely-accessible repository of user-contributed
    information that will be useful for anyone working with PDB
    structures. As such PDBWiki can be considered a part of a wider
    effort in community-based biological databases curation.
THANKS
protein data bank

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Protein database
Protein databaseProtein database
Protein database
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Structural databases
Structural databases Structural databases
Structural databases
 
Cath
CathCath
Cath
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformatics
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyana
 
Prosite
PrositeProsite
Prosite
 
Tools and database of NCBI
Tools and database of NCBITools and database of NCBI
Tools and database of NCBI
 
BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijay
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Protein Database
Protein DatabaseProtein Database
Protein Database
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
sequence of file formats in bioinformatics
sequence of file formats in bioinformaticssequence of file formats in bioinformatics
sequence of file formats in bioinformatics
 
NCBI
NCBINCBI
NCBI
 
Swiss pdb viewer
Swiss pdb viewerSwiss pdb viewer
Swiss pdb viewer
 
Fasta
FastaFasta
Fasta
 

Andere mochten auch

Kegg database resources
Kegg database resources Kegg database resources
Kegg database resources
innocent87
 
Protein databases
Protein databasesProtein databases
Protein databases
sarumalay
 
Intellectual property rights(I.P.R.) and traditional knowledge protection of ...
Intellectual property rights(I.P.R.) and traditional knowledge protection of ...Intellectual property rights(I.P.R.) and traditional knowledge protection of ...
Intellectual property rights(I.P.R.) and traditional knowledge protection of ...
Abhishek Singh
 
Protection of Traditional Knowledge
Protection of Traditional KnowledgeProtection of Traditional Knowledge
Protection of Traditional Knowledge
patent_unitedipr
 

Andere mochten auch (20)

Cytoscape plugins - GeneMania and CentiScape
Cytoscape plugins - GeneMania and CentiScapeCytoscape plugins - GeneMania and CentiScape
Cytoscape plugins - GeneMania and CentiScape
 
Kegg database resources
Kegg database resources Kegg database resources
Kegg database resources
 
Protein database
Protein databaseProtein database
Protein database
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Ab Initio Protein Structure Prediction
Ab Initio Protein Structure PredictionAb Initio Protein Structure Prediction
Ab Initio Protein Structure Prediction
 
Intellectual property rights(I.P.R.) and traditional knowledge protection of ...
Intellectual property rights(I.P.R.) and traditional knowledge protection of ...Intellectual property rights(I.P.R.) and traditional knowledge protection of ...
Intellectual property rights(I.P.R.) and traditional knowledge protection of ...
 
Distance based method
Distance based method Distance based method
Distance based method
 
Protecting Traditional Knowledge In India
Protecting Traditional Knowledge In IndiaProtecting Traditional Knowledge In India
Protecting Traditional Knowledge In India
 
Intellectual Property Rights : Indian Perspective
Intellectual Property Rights : Indian PerspectiveIntellectual Property Rights : Indian Perspective
Intellectual Property Rights : Indian Perspective
 
Traditional knowledge
Traditional knowledgeTraditional knowledge
Traditional knowledge
 
Wipo
WipoWipo
Wipo
 
Homology modeling
Homology modelingHomology modeling
Homology modeling
 
Protection of Traditional Knowledge
Protection of Traditional KnowledgeProtection of Traditional Knowledge
Protection of Traditional Knowledge
 
10 traditional knowledge
10 traditional knowledge10 traditional knowledge
10 traditional knowledge
 
Clustering and Visualisation using R programming
Clustering and Visualisation using R programmingClustering and Visualisation using R programming
Clustering and Visualisation using R programming
 
Structural Bioinformatics - Homology modeling & its Scope
Structural Bioinformatics - Homology modeling & its ScopeStructural Bioinformatics - Homology modeling & its Scope
Structural Bioinformatics - Homology modeling & its Scope
 
PowerMV
PowerMV PowerMV
PowerMV
 
PROTEIN DATABASE
PROTEIN DATABASEPROTEIN DATABASE
PROTEIN DATABASE
 
PERL- Bioperl modules
PERL- Bioperl modulesPERL- Bioperl modules
PERL- Bioperl modules
 

Ähnlich wie protein data bank

Molecular Structures 2009
Molecular Structures 2009Molecular Structures 2009
Molecular Structures 2009
lyonja
 
Protein structure
Protein structureProtein structure
Protein structure
Pooja Pawar
 
Biological databases.pptx
Biological databases.pptxBiological databases.pptx
Biological databases.pptx
PagudalaSangeetha
 

Ähnlich wie protein data bank (20)

Protein Data Bank
Protein Data BankProtein Data Bank
Protein Data Bank
 
Molecular Structures 2009
Molecular Structures 2009Molecular Structures 2009
Molecular Structures 2009
 
Bioinformatics lecture xxiii
Bioinformatics lecture xxiiiBioinformatics lecture xxiii
Bioinformatics lecture xxiii
 
Protein Data Bank ( PDB ) - Bioinformatics
Protein Data Bank ( PDB ) - BioinformaticsProtein Data Bank ( PDB ) - Bioinformatics
Protein Data Bank ( PDB ) - Bioinformatics
 
BITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequences
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
2009 CSBB LAB 新生訓練
2009 CSBB LAB 新生訓練 2009 CSBB LAB 新生訓練
2009 CSBB LAB 新生訓練
 
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINEPROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE
 
Protein Structure Prediction Using Support Vector Machine
Protein Structure Prediction Using Support Vector Machine  Protein Structure Prediction Using Support Vector Machine
Protein Structure Prediction Using Support Vector Machine
 
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)R.P Maurya ppt  on C C D C & DSSP(Bioinformatics)
R.P Maurya ppt on C C D C & DSSP(Bioinformatics)
 
Databases
DatabasesDatabases
Databases
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Types of biological databases-protein database
Types of biological databases-protein databaseTypes of biological databases-protein database
Types of biological databases-protein database
 
Biological database
Biological databaseBiological database
Biological database
 
A Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration FrameworkA Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration Framework
 
Protein structure
Protein structureProtein structure
Protein structure
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Biological databases.pptx
Biological databases.pptxBiological databases.pptx
Biological databases.pptx
 
Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptx
 

Kürzlich hochgeladen

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 

Kürzlich hochgeladen (20)

Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 

protein data bank

  • 2. CONTENTS  Introduction  Supported and funded by  History  PDB Holdings list  Member organizations  Task forces  PDB ID  PDB File format  Browse to WWW.RCSB.ORG/PDB/
  • 3. PROTEIN DATA BANK PDB  Single worldwide database and hundreds of secondary databases categorize the data differently.  Key resource in the area of structural biology, stores 3D structural data of large biological molecules such as Proteins and Nucleic acids.  Data is submitted by Biologists and Biochemists from all around the world to be freely accessible on internet via its member organizations’ websites and is updated weekly.  The mission is to maintain a single Protein Data Bank Archive of Macromolecular Structural data.
  • 4. SUPPORTED AND FUNDED The Protein Data Bank (PDB) is operated by:  Rutgers, The State University of New Jersey.  The San Diego Supercomputer Center at the University of California, San Diego.  The Center for Advanced Research in Biotechnology of the National Institute of Standards and Technology -- the Research Collaboratory for Structural Bioinformatics (RCSB)  The PDB is supported by funds from the National Science Foundation, the Department of Energy, and the National Institutes of Health.
  • 5. PDB HISTORY Two forces to initiate PDB: Growing collection of sets of protein structural data by X-Ray diffraction. Brookhaven Raster Display (BRAD), a molecular graphics display to visualize protein structures in 3D, emerged in 1968.  In 1969, Dr Edger Meyer began to write software to store atomic coordinates files in a common format to make them available for geometric and graphical evaluation (with sponsorship of Dr Walton Hamilton at Bookhaven National Laboratory.  In 1971, one of Dr Meyer’s programs- SEARCH- enabled networking i.e enabled the researchers to access information from database to study protein structures offline.
  • 6. In 1973, upon Hamilton’s death, Dr Tom Koetzle took over direction of PDB for 20 years.  mmCIF project completed and Structural genomics began in 1970s.  In 1980s, IUCr guidelines established, number of structures deposited increases and independent biological databases established – e.g., the NDB.  In Oct, 1998; PDB was transferred to Research Collaboratory for Structural Bioinformatics (RCSB), complete transfer since 1999. Dr Helen M Berman of Rutgers University was the new director.  In 2003, with the formation of wwPDB, the PDB became an international organization having three member organizations.  !n 2006, the BMRB joined PDB.
  • 7. PDB HOLDINGS LIST ( AS OF 25 OCT, 2011) Experimental Protein/Nucleic Acid Proteins Nucleic Acids Other Total Method complexes X-ray diffraction 62750 1323 3050 2 67125 NMR 7962 960 179 7 9108 Electron microscopy 262 22 96 0 380 Hybrid 41 3 1 1 46 Other 133 4 5 13 155 Total: 71148 2312 3331 23 76814
  • 8. MEMBER ORGANIZATIONS  Act as Data deposition, Data processing and Distribution centers for PDB data.  Three are founding member organizations:  PDBe…Protein Data Bank in Europe.  PDBj…Protein Data Bank in Japan.  RCSB…Research Collaboratory for Structural Bioinformatics.  The Biological Magnetic Resonance Data Bank (BMRB) joined later in 2006.  Another organization Worldwide Protein Data Bank (wwPDB) oversees PDB. wwPDB reviews and annotates each submitted entry and then it is automatically checked for plausibility( the source code for validation software is available.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14. TASK FORCES  X-Ray diffraction (most of the structures)…approximations of the coordinates of atoms of proteins are obtained. E.g lyzozyme.  NMR (about 15% e.g, haemoglobin)…estimations of distances between pairs of atoms of proteins. Final conformation is obtained after solving distance geometry problem.  Cryo Electron Microscopy (very few protein e.g, crsysalin).
  • 15.
  • 16.
  • 17.
  • 18. PDB IDENTIFIER (PDB ID)  Each structure published in PDB receives a four character alphanumeric identifier or accession number. Like, 1ANG or 4hhb.  However, this cant be used as an identifier for biomolecules. Because several structures for the same molecule in different environments or conformations-are contained in PDB with different PDB IDs.
  • 19. PDB FILE FORMAT  Standard data representation…encoded in data dictionary. The metadata model supporting this representation is used by all PDB data processing and database software tools.  PDB file format was restricted to 80 characters per line initially.  In 1996, macromolecular Crystallographic Information File (mmCIF) format started.  In 2005, XML version called as PDBML, was described.  The structure files can be downloaded in any of these three formats.  The files are easily downloaded into graphics packages as well, using web services.
  • 20. PDB 3D data file format  ASCII  column based: 80 columns per line  KEYWORD for record type at col.#1  Header records  Structure records  Atom records (containing coordinates)  ATOM, HETATM, ..., TER
  • 21. PDB format Coordinate Section ATOM record Biopolymer residue atom HETATM record nonBiopolymer atom TER record chain terminator 123456789012345678901234567890123456789012345678901234567 89012345678901234567890 ATOM 1 N ALA 1 11.104 6.134 -6.504 1.00 0.00 N ATOM 2 CA ALA 1 11.639 6.071 -5.147 1.00 0.00 C ... ATOM 293 1HG GLU 18 -14.861 -4.847 0.361 1.00 0.00 H ATOM 294 2HG GLU 18 -13.518 -3.769 0.084 1.00 0.00 H TER 295 GLU 18 HETATM 5555 CA 0.000 0.000 0.000
  • 22. mmCIF  mmCIF is the acronym for the macromolecular Crystallographic Information File.  mmCIF is based on a subset of the syntax rules for the Self Defining Text Archive (STAR) file.  A Dictionary Description Language (DDL) defines the structure of mmCIF dictionaries. Dictionaries provide the metadata which define the content of mmCIF data files.  mmCIF data files, dictionaries and DDLs are all expressed in a common syntax.
  • 23. POINT YOUR BROWSER TO: WWW.RCSB.ORG/PDB/ put either a search term (for example, a protein name) or a PDB number
  • 24.
  • 25. 2DN2
  • 26.
  • 27.
  • 28. UMIN HAEMOGLOBIN
  • 29.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35.
  • 36. PROTEIN ANNOTATION  If the contents of the PDB are thought of as primary data, then there are hundreds of derived (i.e., secondary) databases that categorize the data differently. For example, both SCOPand CATH categorize structures according to type of structure and assumed evolutionary relations; GO categorize structures based on genes.
  • 37.
  • 38. The Structural Classification of Proteins (SCOP) database is a largely manual classification of protein structural domains based on similarities of their structures and amino acid sequences
  • 39. Class:the overall secondary-structure content of the domain Architecture:high structural similarity but no evidence of homology. Topology:a large-scale grouping of topologies which share particular structural features Homologous superfamily:indicative of a demonstrable evolutionary relationship.
  • 40.
  • 41. Pfam is a database of protein families that includes their annotations and multiple sequence alignment generated using hidden Markov models
  • 42. CLICK
  • 44. CLICK
  • 45. CLICK
  • 46. CLICK
  • 47. CLICK
  • 49. Show the gradual updating Structural View of Biology released entries
  • 50.
  • 51.
  • 52.
  • 53.
  • 54.
  • 55. You can also select different display view
  • 56. can also download in different view
  • 57. VIEWING THE DATA  56,523 structures In PDB have structure factor files.  6410 structures In PDB have NMR restraint files.  198 structures In PDB have chemical shifts files.  Text file can be viewed or modified in editor.  Structure files may be viewed using various free and commercial visualizations programs and Web browsers plug-ins like  OPEN SOURCE PDB softweres  Jmol  Molekel  MeshLab(able to import PDB data set and buildup surfaces from them)  QuteMol  Avogadro  And others open but not free , like  PYMOL , RASMOL, VIST PROT 3DS & STAR BIOCHEM
  • 58. HTTP://WWW.RCSB.ORG/PDB/STATIC.DO ?P=SOFTWARE/SOFTWARE_LINKS/MOL ECULAR_GRAPHICS.HTML The RCSB PDB website contains an extensive list of both free and commercial molecule visualization programs and web browser plug-in.
  • 59.
  • 60.
  • 61. LIMITATION  The Protein Data Bank (PDB) is the central archive of experimentally solved biomolecular structures. However, the PDB only allows data retrieval and does not provide functionality for collaboration or user feedback.  In contrast, PDBWiki allows for sharing expert knowledge about structures deposited in the PDB. It provides tools for discussing and annotating proteins in a collaborative way. The goal is to create a central and freely-accessible repository of user-contributed information that will be useful for anyone working with PDB structures. As such PDBWiki can be considered a part of a wider effort in community-based biological databases curation.