SlideShare ist ein Scribd-Unternehmen logo
1 von 23
Protein Database
Presented by:-
Rajpal Choudhary
M.Sc. Biotechnology 2nd year
161103004
Protein database
• SWISS-PROT: Annotated Sequence Database
• TrEMBL: Database of EMBL nucleotide translated sequences
• InterPro: Integrated resource for protein families, domains and functional sites.
• CluSTr: Offers an automatic classification of SWISS-PROT and TrEMBL.
• IPI: A non-redundant human proteome set constructed from SWISS-PROT, TrEMBL, Ensembl and
RefSeq.
• GOA: Provides assignments of gene products to the Gene Ontology (GO) resource.
• Proteome Analysis: Statistical and comparative analysis of the predicted proteomes of fully
sequenced organisms
• Protein Profiles: Tables of SWISS-PROT and TrEMBL entries and alignments for the protein
families of the Protein Profile.
Swiss-Prot
• Annotated protein sequence database established in 1986 and
maintained collaboratively since 1987, by the Department of Medical
Biochemistry of the University of Geneva and EBI
• Complete, Curated, Non-redundant and cross-referenced with 34 other
databases
• Highly cross-referenced
• Available from a variety of servers and through sequence analysis
software tools
• More than 8,000 different species
• First 20 species represent about 42% of all sequences in the database
• More than 1,29,000 entries with 4.7 X 1010 amino acids
• More than 6,22,000 entries in TrEMBL
TrEMBL (Translation of EMBL)
• Computer-annotated supplement to SWISS-PROT, as it is impossible to
cope with the flow of data.
• Well-structure SWISS-PROT-like resource
• Derived from automated EMBL CDS translation maintained at the EBI,
UK.
• TrEMBL is automatically generated and annotated using software tools
(incompatible with the SWISS-PROT in terms of quality)
• TrEMBL contains all what is not yet in SWISS-PROT
SWISS-PROT file format
SWISS-PROT file format
SWISS-PROT file format
SWISS-PROT file format
GenBank
(http://www.ncbi.nlm.nih.gov/genbank/)
• The GenBank is a sequence database that stores nucleotide
sequences and the proteins obtained from them by translations. This
database is maintained by National Center for Biotechnology
Information (NCBI). As of April 2011, the GenBank contains
approximately 126,551,501,141 numbers of bases in 135,440,924
numbers of sequences. Each sequence submitted to GenBank is
assigned a unique GenBank identifier or GenBank accession number.
Structure Databases
•MSD: Macromolecular Structure Database - A relational database
representation of clean Protein Data Bank (PDB).
•3DSeq: 3D sequence alignment server- Annotation of the alignments
between sequence database and the PDB.
•FSSP: Based on exhaustive all-against-all 3D structure comparison of
protein structures currently in the Protein Data Bank (PDB)
•DALI: Fold Classification based on Structure-Structure Assignments.
•3Dee: Database of protein domain definitions wherein the domains
have been clustered on sequence and structural similarity.
•NDB: Nucleic Acid Structure Database
htttp://www.rcsb.org/pdb/
Protein Data Bank (PDB)
• Important in solving real problems in molecular biology
• Protein Databank
• PDB Established in 1972 at Brookhaven National Laboratory
(BNL)
• Sole international repository of macromolecular structure data
• Moved to Research Collaboratory for Structural Bioinformatics
http://www.rcsb.org/
Effective use of PDB
• Queries are of three types
• PDBid - As quoted in paper
• Search Lite - one or more keywords
• Search Fields - A detailed query form
• Query results
• Structure Explorer - details of the structure
• Query Result Browser - for multiple structures
• PDB Viewer
SCOP (Structural Classification of Protein)
(http://scop.mrc-lmb.cam.ac.uk/scop/)
• The Structural Classification of Proteins (SCOP) database is basically
a database with manual classification of protein structural
domains. The whole concept is based on similarities of the amino
acid sequences and three- dimensional structures of the proteins.
The database was originally published in 1995 and it is usually
updated at least once yearly by Alexei G. Murzin and his
colleagues.
• SCOP database uses the following protein structural hierarchy:
• Class—It is the general structural architecture of the protein domains.
• Fold—It represents similar arrangement of regular secondary
structures but without evidence of evolutionary relatedness.
• Superfamily—It represents whether the protein structures have
sufficient structural and functional similarities to each other to infer a
divergent evolutionary relation- ship but not necessarily a detectable
sequence homology.
• Family—Proteins belonging to the same family share some sequence
similarity.
• SCOP has the following classes:
• 1) Proteins with mostly α-helical domains;
• 2) Proteins with mostly β-sheet domains;
• 3) Proteins with α/β domains which contain beta- alpha-beta
structural units or motifs that form mainly parallel β-sheets;
• 4) Proteins with mostly α + β domains consisting of independent α-
helices and mainly antiparallel β-sheets;
CATH (Class Architecture Topology Homology)
(http://www.cathdb.info/)
• The CATH Protein Structure Classification method is a semi-automatic,
hierarchical classification of protein domains. The database was first
published in 1997 by Christine Orengo, Janet Thornton and their
colleagues. CATH carries many broad features with its principal rival,
SCOP. However there are also many areas in which the detailed
classifications in the two databases differ greatly.
• CATH clusters proteins at four major levels:
• 1) Class (C): Class is derived from secondary structure contents of proteins.
It is assigned for more than 90% of protein structures automatically.
• 2) Architecture (A): Architecture describes the gross orientation of
secondary structures, independent of connectivity in proteins.
• 3) Topology (T): Topology level of CATH clusters protein structures
according to their topological connections & numbers of secondary
structures.
• 4) Homologous Superfamily (H): Homologous super- families of CATH
cluster the proteins with highly similar structures & functions.
Conclusion
The different databases discussed here provide different information.
PDB gives both structural and sequence information of
macromolecules whereas SCOP & CATH have structures based on their
evolutionary relationships and folding classes. The structural
classifications of proteins are generally obtained from SCOP and CATH.
On the other hand, the UniProt/Swissprot database provides the
sequence annotations of proteins along with links to the external
databases like PDB. All these databases are increasing day by day. There
are other databases and some new databases are coming. But the
databases discussed here are considered to be the foundation stones
of bioinformatics.
REFERENCES
• G. Murzin, S. E. Brenner, T. Hubbard and C. Chothia, “SCOP: A Structural Classification of Proteins Database
for the Investigation of Sequences and Structures,” Jour- nal of Molecular Biology, Vol. 247, No. 4, 1995, pp.
536- 540. doi:10.1016/S0022-2836(05)80134-2
• L. Lo Conte, S. E. Brenner, T. J. Hubbard, C. Chothia and A. G. Murzin, “SCOP Database in 2002: Refinements
Accommodate Structural Genomics,” Nucleic Acids Re- search, Vol. 30, No. 1, 2002, pp. 264-267.
doi:10.1093/nar/30.1.264
• Andreeva, D. Howorth, S. E. Brenner, T. J. Hubbard, C. Chothia and A. G. Murzin, “SCOP Database in 2004: Re-
finements Integrate Structure and Sequence Family Data,” Nucleic Acids Research, Vol. 32, Suppl. 1, 2004,
pp. D226-D229. doi:10.1093/nar/gkh039
• R. Day, D. A. Beck, R. S. Armen and V. Daggett, “A Consensus View of Fold Space: Combining SCOP, CATH, and
the Dali Domain Dictionary,” Protein Scien- ce, Vol. 12, No. 10, 2003, pp. 2150-2160. doi:10.1110/ps.0306803
Thank you

Weitere ähnliche Inhalte

Was ist angesagt? (20)

Sequence Submission Tools
Sequence Submission ToolsSequence Submission Tools
Sequence Submission Tools
 
sequence of file formats in bioinformatics
sequence of file formats in bioinformaticssequence of file formats in bioinformatics
sequence of file formats in bioinformatics
 
Kegg
KeggKegg
Kegg
 
Cath
CathCath
Cath
 
DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijay
 
Ddbj
DdbjDdbj
Ddbj
 
EMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology LaboratoryEMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology Laboratory
 
Structural databases
Structural databases Structural databases
Structural databases
 
Gene prediction and expression
Gene prediction and expressionGene prediction and expression
Gene prediction and expression
 
Protein database
Protein databaseProtein database
Protein database
 
Prosite
PrositeProsite
Prosite
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyana
 
Tools and database of NCBI
Tools and database of NCBITools and database of NCBI
Tools and database of NCBI
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
NCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology InformationNCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology Information
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 

Ähnlich wie Protein database

Types of biological databases-protein database
Types of biological databases-protein databaseTypes of biological databases-protein database
Types of biological databases-protein databasechinmayeec
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEPrashantSharma807
 
Protein databases
Protein databasesProtein databases
Protein databasessarumalay
 
Protein Sequence Databases
Protein Sequence Databases Protein Sequence Databases
Protein Sequence Databases Hemant Bothe
 
Primary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxPrimary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxVandana Yadav03
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...SBituila
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...BibiQuinah
 
biological databases.pptx
biological databases.pptxbiological databases.pptx
biological databases.pptxscience lover
 
database retrival.pdf
database retrival.pdfdatabase retrival.pdf
database retrival.pdfSrimathideviJ
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...Elufer Akram
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introductionDrGopaSarma
 
protein databases.ppt
protein databases.pptprotein databases.ppt
protein databases.pptSanthiyaAK
 

Ähnlich wie Protein database (20)

Types of biological databases-protein database
Types of biological databases-protein databaseTypes of biological databases-protein database
Types of biological databases-protein database
 
Structural database and their classification by abdul qahar
Structural database and their classification by abdul qaharStructural database and their classification by abdul qahar
Structural database and their classification by abdul qahar
 
Protein database
Protein  databaseProtein  database
Protein database
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
 
Important protein databases and proteomics softwares
Important protein databases and proteomics softwaresImportant protein databases and proteomics softwares
Important protein databases and proteomics softwares
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Scop database
Scop databaseScop database
Scop database
 
Protein databases
Protein databasesProtein databases
Protein databases
 
PROTEIN DATABASE
PROTEIN DATABASEPROTEIN DATABASE
PROTEIN DATABASE
 
Protein Sequence Databases
Protein Sequence Databases Protein Sequence Databases
Protein Sequence Databases
 
Biological databases
Biological databases Biological databases
Biological databases
 
Primary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxPrimary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptx
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
biological databases.pptx
biological databases.pptxbiological databases.pptx
biological databases.pptx
 
database retrival.pdf
database retrival.pdfdatabase retrival.pdf
database retrival.pdf
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
protein databases.ppt
protein databases.pptprotein databases.ppt
protein databases.ppt
 

Mehr von Rajpal Choudhary

Chromosome theory of inheritance
Chromosome theory of inheritanceChromosome theory of inheritance
Chromosome theory of inheritanceRajpal Choudhary
 
Different Levels of protein
Different Levels of proteinDifferent Levels of protein
Different Levels of proteinRajpal Choudhary
 
Hybridoma technology and production of monoclonal antibody
Hybridoma technology and production of monoclonal antibodyHybridoma technology and production of monoclonal antibody
Hybridoma technology and production of monoclonal antibodyRajpal Choudhary
 
Linkage analysis and genome mapping
Linkage analysis and genome mappingLinkage analysis and genome mapping
Linkage analysis and genome mappingRajpal Choudhary
 
Cancer genetics and diagnosis
Cancer genetics and diagnosisCancer genetics and diagnosis
Cancer genetics and diagnosisRajpal Choudhary
 
Microbial and chemical analysis of potable water
Microbial and chemical analysis of potable waterMicrobial and chemical analysis of potable water
Microbial and chemical analysis of potable waterRajpal Choudhary
 
Animal cell culture and its techniques
Animal cell culture and its techniquesAnimal cell culture and its techniques
Animal cell culture and its techniquesRajpal Choudhary
 
Epistasis and its different types
Epistasis and its different typesEpistasis and its different types
Epistasis and its different typesRajpal Choudhary
 
Escherichia coli as water indicator
Escherichia coli as water indicatorEscherichia coli as water indicator
Escherichia coli as water indicatorRajpal Choudhary
 
Advanced techniques in animal cell culture
Advanced techniques in animal cell cultureAdvanced techniques in animal cell culture
Advanced techniques in animal cell cultureRajpal Choudhary
 
Cell wall structure and function
Cell wall structure and functionCell wall structure and function
Cell wall structure and functionRajpal Choudhary
 
Enzyme inhibition AND ITS TYPES
Enzyme inhibition AND ITS TYPES Enzyme inhibition AND ITS TYPES
Enzyme inhibition AND ITS TYPES Rajpal Choudhary
 
Biofertilizer and biopesticides
Biofertilizer and biopesticidesBiofertilizer and biopesticides
Biofertilizer and biopesticidesRajpal Choudhary
 
Adenoviral cloning vectors
Adenoviral cloning vectorsAdenoviral cloning vectors
Adenoviral cloning vectorsRajpal Choudhary
 
Antigen processing and presentation
Antigen processing and presentationAntigen processing and presentation
Antigen processing and presentationRajpal Choudhary
 
Antigen antibody interaction
Antigen antibody interactionAntigen antibody interaction
Antigen antibody interactionRajpal Choudhary
 

Mehr von Rajpal Choudhary (20)

Chromosome theory of inheritance
Chromosome theory of inheritanceChromosome theory of inheritance
Chromosome theory of inheritance
 
BLAST Search tool
BLAST Search toolBLAST Search tool
BLAST Search tool
 
Different Levels of protein
Different Levels of proteinDifferent Levels of protein
Different Levels of protein
 
Hybridoma technology and production of monoclonal antibody
Hybridoma technology and production of monoclonal antibodyHybridoma technology and production of monoclonal antibody
Hybridoma technology and production of monoclonal antibody
 
Linkage analysis and genome mapping
Linkage analysis and genome mappingLinkage analysis and genome mapping
Linkage analysis and genome mapping
 
Cancer genetics and diagnosis
Cancer genetics and diagnosisCancer genetics and diagnosis
Cancer genetics and diagnosis
 
Microbial and chemical analysis of potable water
Microbial and chemical analysis of potable waterMicrobial and chemical analysis of potable water
Microbial and chemical analysis of potable water
 
Animal cell culture and its techniques
Animal cell culture and its techniquesAnimal cell culture and its techniques
Animal cell culture and its techniques
 
Epistasis and its different types
Epistasis and its different typesEpistasis and its different types
Epistasis and its different types
 
Escherichia coli as water indicator
Escherichia coli as water indicatorEscherichia coli as water indicator
Escherichia coli as water indicator
 
Advanced techniques in animal cell culture
Advanced techniques in animal cell cultureAdvanced techniques in animal cell culture
Advanced techniques in animal cell culture
 
Cell wall structure and function
Cell wall structure and functionCell wall structure and function
Cell wall structure and function
 
Vaccines AND THEIR ROLE
Vaccines AND THEIR ROLEVaccines AND THEIR ROLE
Vaccines AND THEIR ROLE
 
Enzyme inhibition AND ITS TYPES
Enzyme inhibition AND ITS TYPES Enzyme inhibition AND ITS TYPES
Enzyme inhibition AND ITS TYPES
 
Elisa AND ITS APPLICATION
Elisa AND ITS APPLICATIONElisa AND ITS APPLICATION
Elisa AND ITS APPLICATION
 
Biofertilizer and biopesticides
Biofertilizer and biopesticidesBiofertilizer and biopesticides
Biofertilizer and biopesticides
 
Adenoviral cloning vectors
Adenoviral cloning vectorsAdenoviral cloning vectors
Adenoviral cloning vectors
 
Antigen processing and presentation
Antigen processing and presentationAntigen processing and presentation
Antigen processing and presentation
 
Agrobacterium tumefaciens
Agrobacterium tumefaciensAgrobacterium tumefaciens
Agrobacterium tumefaciens
 
Antigen antibody interaction
Antigen antibody interactionAntigen antibody interaction
Antigen antibody interaction
 

Kürzlich hochgeladen

Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsOrtegaSyrineMay
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flyPRADYUMMAURYA1
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Silpa
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Silpa
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxMohamedFarag457087
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai YoungDubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Youngkajalvid75
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)Areesha Ahmad
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptxryanrooker
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxSuji236384
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learninglevieagacer
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...chandars293
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusNazaninKarimi6
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceAlex Henderson
 

Kürzlich hochgeladen (20)

Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai YoungDubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
Dubai Call Girls Beauty Face Teen O525547819 Call Girls Dubai Young
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 

Protein database

  • 1. Protein Database Presented by:- Rajpal Choudhary M.Sc. Biotechnology 2nd year 161103004
  • 2. Protein database • SWISS-PROT: Annotated Sequence Database • TrEMBL: Database of EMBL nucleotide translated sequences • InterPro: Integrated resource for protein families, domains and functional sites. • CluSTr: Offers an automatic classification of SWISS-PROT and TrEMBL. • IPI: A non-redundant human proteome set constructed from SWISS-PROT, TrEMBL, Ensembl and RefSeq. • GOA: Provides assignments of gene products to the Gene Ontology (GO) resource. • Proteome Analysis: Statistical and comparative analysis of the predicted proteomes of fully sequenced organisms • Protein Profiles: Tables of SWISS-PROT and TrEMBL entries and alignments for the protein families of the Protein Profile.
  • 3. Swiss-Prot • Annotated protein sequence database established in 1986 and maintained collaboratively since 1987, by the Department of Medical Biochemistry of the University of Geneva and EBI • Complete, Curated, Non-redundant and cross-referenced with 34 other databases • Highly cross-referenced • Available from a variety of servers and through sequence analysis software tools • More than 8,000 different species • First 20 species represent about 42% of all sequences in the database • More than 1,29,000 entries with 4.7 X 1010 amino acids • More than 6,22,000 entries in TrEMBL
  • 4. TrEMBL (Translation of EMBL) • Computer-annotated supplement to SWISS-PROT, as it is impossible to cope with the flow of data. • Well-structure SWISS-PROT-like resource • Derived from automated EMBL CDS translation maintained at the EBI, UK. • TrEMBL is automatically generated and annotated using software tools (incompatible with the SWISS-PROT in terms of quality) • TrEMBL contains all what is not yet in SWISS-PROT
  • 9. GenBank (http://www.ncbi.nlm.nih.gov/genbank/) • The GenBank is a sequence database that stores nucleotide sequences and the proteins obtained from them by translations. This database is maintained by National Center for Biotechnology Information (NCBI). As of April 2011, the GenBank contains approximately 126,551,501,141 numbers of bases in 135,440,924 numbers of sequences. Each sequence submitted to GenBank is assigned a unique GenBank identifier or GenBank accession number.
  • 10. Structure Databases •MSD: Macromolecular Structure Database - A relational database representation of clean Protein Data Bank (PDB). •3DSeq: 3D sequence alignment server- Annotation of the alignments between sequence database and the PDB. •FSSP: Based on exhaustive all-against-all 3D structure comparison of protein structures currently in the Protein Data Bank (PDB) •DALI: Fold Classification based on Structure-Structure Assignments. •3Dee: Database of protein domain definitions wherein the domains have been clustered on sequence and structural similarity. •NDB: Nucleic Acid Structure Database
  • 12. Protein Data Bank (PDB) • Important in solving real problems in molecular biology • Protein Databank • PDB Established in 1972 at Brookhaven National Laboratory (BNL) • Sole international repository of macromolecular structure data • Moved to Research Collaboratory for Structural Bioinformatics http://www.rcsb.org/
  • 13. Effective use of PDB • Queries are of three types • PDBid - As quoted in paper • Search Lite - one or more keywords • Search Fields - A detailed query form • Query results • Structure Explorer - details of the structure • Query Result Browser - for multiple structures • PDB Viewer
  • 14.
  • 15.
  • 16. SCOP (Structural Classification of Protein) (http://scop.mrc-lmb.cam.ac.uk/scop/) • The Structural Classification of Proteins (SCOP) database is basically a database with manual classification of protein structural domains. The whole concept is based on similarities of the amino acid sequences and three- dimensional structures of the proteins. The database was originally published in 1995 and it is usually updated at least once yearly by Alexei G. Murzin and his colleagues.
  • 17. • SCOP database uses the following protein structural hierarchy: • Class—It is the general structural architecture of the protein domains. • Fold—It represents similar arrangement of regular secondary structures but without evidence of evolutionary relatedness. • Superfamily—It represents whether the protein structures have sufficient structural and functional similarities to each other to infer a divergent evolutionary relation- ship but not necessarily a detectable sequence homology. • Family—Proteins belonging to the same family share some sequence similarity.
  • 18. • SCOP has the following classes: • 1) Proteins with mostly α-helical domains; • 2) Proteins with mostly β-sheet domains; • 3) Proteins with α/β domains which contain beta- alpha-beta structural units or motifs that form mainly parallel β-sheets; • 4) Proteins with mostly α + β domains consisting of independent α- helices and mainly antiparallel β-sheets;
  • 19. CATH (Class Architecture Topology Homology) (http://www.cathdb.info/) • The CATH Protein Structure Classification method is a semi-automatic, hierarchical classification of protein domains. The database was first published in 1997 by Christine Orengo, Janet Thornton and their colleagues. CATH carries many broad features with its principal rival, SCOP. However there are also many areas in which the detailed classifications in the two databases differ greatly.
  • 20. • CATH clusters proteins at four major levels: • 1) Class (C): Class is derived from secondary structure contents of proteins. It is assigned for more than 90% of protein structures automatically. • 2) Architecture (A): Architecture describes the gross orientation of secondary structures, independent of connectivity in proteins. • 3) Topology (T): Topology level of CATH clusters protein structures according to their topological connections & numbers of secondary structures. • 4) Homologous Superfamily (H): Homologous super- families of CATH cluster the proteins with highly similar structures & functions.
  • 21. Conclusion The different databases discussed here provide different information. PDB gives both structural and sequence information of macromolecules whereas SCOP & CATH have structures based on their evolutionary relationships and folding classes. The structural classifications of proteins are generally obtained from SCOP and CATH. On the other hand, the UniProt/Swissprot database provides the sequence annotations of proteins along with links to the external databases like PDB. All these databases are increasing day by day. There are other databases and some new databases are coming. But the databases discussed here are considered to be the foundation stones of bioinformatics.
  • 22. REFERENCES • G. Murzin, S. E. Brenner, T. Hubbard and C. Chothia, “SCOP: A Structural Classification of Proteins Database for the Investigation of Sequences and Structures,” Jour- nal of Molecular Biology, Vol. 247, No. 4, 1995, pp. 536- 540. doi:10.1016/S0022-2836(05)80134-2 • L. Lo Conte, S. E. Brenner, T. J. Hubbard, C. Chothia and A. G. Murzin, “SCOP Database in 2002: Refinements Accommodate Structural Genomics,” Nucleic Acids Re- search, Vol. 30, No. 1, 2002, pp. 264-267. doi:10.1093/nar/30.1.264 • Andreeva, D. Howorth, S. E. Brenner, T. J. Hubbard, C. Chothia and A. G. Murzin, “SCOP Database in 2004: Re- finements Integrate Structure and Sequence Family Data,” Nucleic Acids Research, Vol. 32, Suppl. 1, 2004, pp. D226-D229. doi:10.1093/nar/gkh039 • R. Day, D. A. Beck, R. S. Armen and V. Daggett, “A Consensus View of Fold Space: Combining SCOP, CATH, and the Dali Domain Dictionary,” Protein Scien- ce, Vol. 12, No. 10, 2003, pp. 2150-2160. doi:10.1110/ps.0306803