SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Presented by – SWARUP MALAKAR
A database is a repository of sequence ( DNA or amino acids ) stored in a
computer which provide a centralized and homogenous view of its content.
or, it is a vast collection of data pertaining to a specific topic, e.g.,
nucleotide sequence, protein sequence etc.
Basically, it is an electronic environment.
Databases are at the heart of bioinformatics.
1. Sequence databases: - that involves the sequences of both proteins and nucleic
acids.
2. Structural databases:- that involves only protein databases.
In additionally, it is also classified into three categories:
A. Primary database B. Secondary databases C. Composite databases.
It contain information of the sequence or structure alone either protein or
nucleic acid .
Example: PIR, SWISS-PROT for protein sequences , NCBI, EMBL and DDBJ for
genome sequences.
PIR: It is functionally annotated
protein sequences and structure.
PIR has collaborated with EBI and
SIB to establish the UniProt (
United Protein Databases).
The central resource of
protein sequence and function.
TREMBL
NCBI ( National Centre of Biotechnology Information ):
- Nov 4, 1988 , the NCBI was established as division of the National Library of medicine for the
development of information systems in molecular biology.
- The NCBI is located in Bethesta, Maryland (U.S.A).
- NCBI built the GenBank, which is an annotated collection of publically available nucleotide and
protein sequences.
- In 1988, the three partners (DDBJ, EMBL and GenBank) of the international Nucelotide
Sequences Database collaboration had a meeting and agreed to use a common format.
i. Maintains collaboration with several NIH institutes, academia, industry and other governmental
agencies.
ii. Develops, distributes, supports and coordinates access to a variety of databases and software for
the scientific and medical communities.
iii. Develops and promotes standards for databases, data deposition and exchange, and biological
nomenclature.
iv. Engages the members of the international scientific community in informatics research and training
through the scientific visitors programs.
Link: https://www.ncbi.nlm.nih.gov/
 In 1992, NCBI has the responsibility for making available the
DNA sequence database to the GenBank.
 Coordinates with individual laboratories and other sequence
data base such those of EMBL and DDBJ.
 Moreover, NCBI has grown to provide other databases in
addition to GenBank.
 GenBank is a comprehensive sequence database that contains
publicly available DNA sequences for more than 1,19,000
different organisms obtained through the submission of
sequence data from individual lab and batch submissions from
large-scale of seq. projects.
 Daily data exchange with the EMBL data library in the UK and
the DNA Data Bank of Japan helps world wide coverage.
 Developed and maintained by European Molecular Biology Laboratory – European
Bioinformatics Institute (EMBL-EBI).
 Comprehensive data nucleotide sequence information.
 The European Molecular Biology Laboratory (EMBL) Nucleotide Sequence Database is a
comprehensive collection of primary nucleotide sequences maintained at the European
Bioinformatics Institute (EBI).
 Link: http:www.ebi.ac.uk/embl/
EMBL is supported by 22 member states, four prospect, and two associated states.
 The laboratory operatory operates from five sites: the main laboratory in Heidelberg, and
outstations Hinxton (EBI, in England), Grenoble (France), Hambury (Germany) and
Manterotando ( near Rome).
 EMBL groups and laboratories perform basic research in molecular biology and molecular
medicine as well as training for science student and visitors.
 Since 1982 this work has been done in collaboration with GenBank (NCBI, Bethesda, USA)
and the DNA Database of Japan (Mishima).
 For sequencing similar searching, a variety of tools (FASTA and BLAST
are available that allow external users to compare their own seq. against the data in
EMBL nucleotide sequence database and other database.
 The DNA Data Bank of Japan (DDBJ) is a biological database that collects DNA
sequences. It was established in 1986.
 Link: https://www.ddbj.nig.ac.jp
 It is located at the National Institute of Genetics (NIG) in the Shizuoka prefecture of
Japan.
 DDBJ is a member of the International Nucleotide Sequence Database
Collaboration or INSDC.
 It exchanges its data with European Molecular Biology Laboratory at the European
Bioinformatics Institute and with GenBank at the National Center for Biotechnology
Information on a daily basis.
 DDBJ Center collects nucleotide sequence data as a member of INSDC(International
Nucleotide Sequence Database Collaboration) and provides freely available nucleotide
sequence data and supercomputer system, to support research activities in life science.
 FEATURES
 group 1: biological source of the sequence (source) The feature, “source” (group 1) is
mandatory for all entries in the international nucleotide database. ...
 group 2: biological function features of the region. ...
 group 3: difference and/or change of the sequence data.
Data type Organism Accession numbers for annotated
sequences (number of entries)
Accession numbers for raw reads
Genome Radish (Raphanus sativus cv. Aokubi S-
h)
WGS: BAOO01000001-
BAOO01072909 (72 909 entries)
scaffold CON: DF196826-
DF236948 (40,123 entries)
DRR012610-DRR012624
Soybean (Glycine max cv. Enrei) BBNX02000001-BBNX02108601 (108
601 entries)
DRR021740-DRR021744
Common marmoset (Callithrix jacchus) WGS: BBXK01000001-
BBXK01109198 (109 198 entries)
scaffold CON: DG000097-
DG000120 (24 entries)
GSS: LB274659-LB427105 (152 447
entries)
DRR036754-DRR036764
List of notable data sets released from the DNA Data Bank of Japan (DDBJ) sequence databases from June 2015 to May 2016
 Hosted at National Institute of Genetics .
 Mainly from scientists in Japan and also from resources all over the world and shave this
nucleotide data with EMBL and GenBank.
 This officially , certified to collect nucleotide sequence from researchers sand to tissue the
internationally recognized number of data submitters.
 About 99% of the nucleotide data in INSDC are submitted by DDMJ
 This database plays a major role to improve the quality of INSDC.
 Each database entry include details of sequences, submitters details bibiliographic
references, biological significance and the scientific name and taxonomy of the organism.
 Features that identify coding regions transcription units, mutation sites etc. are displayed
in a feature table. Major activities of the database.
 Providing internationally recognized accession numbers to sequences.
 Bioinformatics database management developing tools for the analysis and visualization of
biological data.
 Conducting courses for beginners to reduce the complexity in the biological data analysis.
Primary Databases.pptx
Primary Databases.pptx

Weitere ähnliche Inhalte

Was ist angesagt?

Nucleic acid database
Nucleic acid databaseNucleic acid database
Nucleic acid databaseEsakkiammal S
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)ShivaniShewale2
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPuneet Kulyana
 
An Introduction to "Bioinformatics & Internet"
An Introduction to "Bioinformatics & Internet"An Introduction to "Bioinformatics & Internet"
An Introduction to "Bioinformatics & Internet"Asar Khan
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databasesPranavathiyani G
 

Was ist angesagt? (20)

Protein database
Protein databaseProtein database
Protein database
 
protein data bank
protein data bankprotein data bank
protein data bank
 
Nucleic acid database
Nucleic acid databaseNucleic acid database
Nucleic acid database
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)
 
NCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology InformationNCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology Information
 
Structural databases
Structural databases Structural databases
Structural databases
 
European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)
 
Gen bank
Gen bankGen bank
Gen bank
 
EMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology LaboratoryEMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology Laboratory
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyana
 
Gene prediction method
Gene prediction method Gene prediction method
Gene prediction method
 
Swiss PROT
Swiss PROT Swiss PROT
Swiss PROT
 
Biological databases
Biological databasesBiological databases
Biological databases
 
An Introduction to "Bioinformatics & Internet"
An Introduction to "Bioinformatics & Internet"An Introduction to "Bioinformatics & Internet"
An Introduction to "Bioinformatics & Internet"
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
Tools and database of NCBI
Tools and database of NCBITools and database of NCBI
Tools and database of NCBI
 
SWISS-PROT
SWISS-PROTSWISS-PROT
SWISS-PROT
 
TrEMBL
TrEMBLTrEMBL
TrEMBL
 
Protein sequence databases
Protein sequence databasesProtein sequence databases
Protein sequence databases
 

Ähnlich wie Primary Databases.pptx

Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu KAUSHAL SAHU
 
Nucleic Acid Databases (NDB ) of bioinformatics pptx
Nucleic Acid Databases (NDB ) of bioinformatics pptxNucleic Acid Databases (NDB ) of bioinformatics pptx
Nucleic Acid Databases (NDB ) of bioinformatics pptxkarmandeepkaur7
 
Nucleic acid and protein databanks
Nucleic acid and protein databanksNucleic acid and protein databanks
Nucleic acid and protein databanksNithyaNandapal
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformaticsVinaKhan1
 
databases.pptx
databases.pptxdatabases.pptx
databases.pptxifra27
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...Elufer Akram
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databasesSangeeta Das
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsRaj Varun
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...SBituila
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...BibiQuinah
 
Biological Databases | Access to sequence data and related information
Biological Databases | Access to sequence data and related information Biological Databases | Access to sequence data and related information
Biological Databases | Access to sequence data and related information NahalMalik1
 
Primary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxPrimary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxVandana Yadav03
 
Primary sequencing of nucleic acids
Primary sequencing of nucleic acidsPrimary sequencing of nucleic acids
Primary sequencing of nucleic acidsvibhakumari12
 

Ähnlich wie Primary Databases.pptx (20)

Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu Bioinformatics in biotechnology by kk sahu
Bioinformatics in biotechnology by kk sahu
 
Biological databases.pptx
Biological databases.pptxBiological databases.pptx
Biological databases.pptx
 
Biological database
Biological databaseBiological database
Biological database
 
Nucleic Acid Databases (NDB ) of bioinformatics pptx
Nucleic Acid Databases (NDB ) of bioinformatics pptxNucleic Acid Databases (NDB ) of bioinformatics pptx
Nucleic Acid Databases (NDB ) of bioinformatics pptx
 
Nucleic acid and protein databanks
Nucleic acid and protein databanksNucleic acid and protein databanks
Nucleic acid and protein databanks
 
Database in bioinformatics
Database in bioinformaticsDatabase in bioinformatics
Database in bioinformatics
 
databases.pptx
databases.pptxdatabases.pptx
databases.pptx
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databases
 
Data base in detail
Data base in detailData base in detail
Data base in detail
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...Sequence and Structural Databases of DNA and Protein, and its significance in...
Sequence and Structural Databases of DNA and Protein, and its significance in...
 
Biological Databases | Access to sequence data and related information
Biological Databases | Access to sequence data and related information Biological Databases | Access to sequence data and related information
Biological Databases | Access to sequence data and related information
 
Primary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptxPrimary Bioinformatics Database.pptx
Primary Bioinformatics Database.pptx
 
Primary sequencing of nucleic acids
Primary sequencing of nucleic acidsPrimary sequencing of nucleic acids
Primary sequencing of nucleic acids
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Major biological nucleotide databases
Major biological nucleotide databasesMajor biological nucleotide databases
Major biological nucleotide databases
 

Kürzlich hochgeladen

Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and ClassificationsAreesha Ahmad
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxRizalinePalanog2
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Creating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsCreating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsNurulAfiqah307317
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 

Kürzlich hochgeladen (20)

Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Creating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsCreating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening Designs
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 

Primary Databases.pptx

  • 1. Presented by – SWARUP MALAKAR
  • 2. A database is a repository of sequence ( DNA or amino acids ) stored in a computer which provide a centralized and homogenous view of its content. or, it is a vast collection of data pertaining to a specific topic, e.g., nucleotide sequence, protein sequence etc. Basically, it is an electronic environment. Databases are at the heart of bioinformatics.
  • 3. 1. Sequence databases: - that involves the sequences of both proteins and nucleic acids. 2. Structural databases:- that involves only protein databases. In additionally, it is also classified into three categories: A. Primary database B. Secondary databases C. Composite databases.
  • 4. It contain information of the sequence or structure alone either protein or nucleic acid . Example: PIR, SWISS-PROT for protein sequences , NCBI, EMBL and DDBJ for genome sequences.
  • 5. PIR: It is functionally annotated protein sequences and structure. PIR has collaborated with EBI and SIB to establish the UniProt ( United Protein Databases). The central resource of protein sequence and function.
  • 7. NCBI ( National Centre of Biotechnology Information ): - Nov 4, 1988 , the NCBI was established as division of the National Library of medicine for the development of information systems in molecular biology. - The NCBI is located in Bethesta, Maryland (U.S.A). - NCBI built the GenBank, which is an annotated collection of publically available nucleotide and protein sequences. - In 1988, the three partners (DDBJ, EMBL and GenBank) of the international Nucelotide Sequences Database collaboration had a meeting and agreed to use a common format.
  • 8. i. Maintains collaboration with several NIH institutes, academia, industry and other governmental agencies. ii. Develops, distributes, supports and coordinates access to a variety of databases and software for the scientific and medical communities. iii. Develops and promotes standards for databases, data deposition and exchange, and biological nomenclature. iv. Engages the members of the international scientific community in informatics research and training through the scientific visitors programs. Link: https://www.ncbi.nlm.nih.gov/
  • 9.  In 1992, NCBI has the responsibility for making available the DNA sequence database to the GenBank.  Coordinates with individual laboratories and other sequence data base such those of EMBL and DDBJ.  Moreover, NCBI has grown to provide other databases in addition to GenBank.  GenBank is a comprehensive sequence database that contains publicly available DNA sequences for more than 1,19,000 different organisms obtained through the submission of sequence data from individual lab and batch submissions from large-scale of seq. projects.  Daily data exchange with the EMBL data library in the UK and the DNA Data Bank of Japan helps world wide coverage.
  • 10.  Developed and maintained by European Molecular Biology Laboratory – European Bioinformatics Institute (EMBL-EBI).  Comprehensive data nucleotide sequence information.
  • 11.  The European Molecular Biology Laboratory (EMBL) Nucleotide Sequence Database is a comprehensive collection of primary nucleotide sequences maintained at the European Bioinformatics Institute (EBI).  Link: http:www.ebi.ac.uk/embl/ EMBL is supported by 22 member states, four prospect, and two associated states.  The laboratory operatory operates from five sites: the main laboratory in Heidelberg, and outstations Hinxton (EBI, in England), Grenoble (France), Hambury (Germany) and Manterotando ( near Rome).
  • 12.  EMBL groups and laboratories perform basic research in molecular biology and molecular medicine as well as training for science student and visitors.  Since 1982 this work has been done in collaboration with GenBank (NCBI, Bethesda, USA) and the DNA Database of Japan (Mishima).  For sequencing similar searching, a variety of tools (FASTA and BLAST are available that allow external users to compare their own seq. against the data in EMBL nucleotide sequence database and other database.
  • 13.  The DNA Data Bank of Japan (DDBJ) is a biological database that collects DNA sequences. It was established in 1986.  Link: https://www.ddbj.nig.ac.jp  It is located at the National Institute of Genetics (NIG) in the Shizuoka prefecture of Japan.  DDBJ is a member of the International Nucleotide Sequence Database Collaboration or INSDC.  It exchanges its data with European Molecular Biology Laboratory at the European Bioinformatics Institute and with GenBank at the National Center for Biotechnology Information on a daily basis.
  • 14.  DDBJ Center collects nucleotide sequence data as a member of INSDC(International Nucleotide Sequence Database Collaboration) and provides freely available nucleotide sequence data and supercomputer system, to support research activities in life science.  FEATURES  group 1: biological source of the sequence (source) The feature, “source” (group 1) is mandatory for all entries in the international nucleotide database. ...  group 2: biological function features of the region. ...  group 3: difference and/or change of the sequence data.
  • 15. Data type Organism Accession numbers for annotated sequences (number of entries) Accession numbers for raw reads Genome Radish (Raphanus sativus cv. Aokubi S- h) WGS: BAOO01000001- BAOO01072909 (72 909 entries) scaffold CON: DF196826- DF236948 (40,123 entries) DRR012610-DRR012624 Soybean (Glycine max cv. Enrei) BBNX02000001-BBNX02108601 (108 601 entries) DRR021740-DRR021744 Common marmoset (Callithrix jacchus) WGS: BBXK01000001- BBXK01109198 (109 198 entries) scaffold CON: DG000097- DG000120 (24 entries) GSS: LB274659-LB427105 (152 447 entries) DRR036754-DRR036764 List of notable data sets released from the DNA Data Bank of Japan (DDBJ) sequence databases from June 2015 to May 2016
  • 16.  Hosted at National Institute of Genetics .  Mainly from scientists in Japan and also from resources all over the world and shave this nucleotide data with EMBL and GenBank.  This officially , certified to collect nucleotide sequence from researchers sand to tissue the internationally recognized number of data submitters.  About 99% of the nucleotide data in INSDC are submitted by DDMJ  This database plays a major role to improve the quality of INSDC.  Each database entry include details of sequences, submitters details bibiliographic references, biological significance and the scientific name and taxonomy of the organism.
  • 17.  Features that identify coding regions transcription units, mutation sites etc. are displayed in a feature table. Major activities of the database.  Providing internationally recognized accession numbers to sequences.  Bioinformatics database management developing tools for the analysis and visualization of biological data.  Conducting courses for beginners to reduce the complexity in the biological data analysis.