SlideShare ist ein Scribd-Unternehmen logo
1 von 9
Bioinformatics
Introduction
When the Human Genome Project was begun in 1990 it was understood that to meet the
project's goals, the speed of DNA sequencing would have to increase and the cost would have to
come down. Over the life of the project virtually every aspect of DNA sequencing was improved.
It took the project approximately four years to sequence its first one billion bases but just four
months to sequence the second billion bases.
During the month of January, 2003, 1.5 billion bases were sequenced. As the speed of DNA
sequencing increased, the cost decreased from 10 dollars per base in 1990 to 10 cents per base at
the conclusion of the project in April 2003. Although the Human Genome Project is officially
over, improvements in DNA sequencing continue to be made. Researchers are experimenting with
new methods for sequencing DNA that have the potential to sequence a human genome in just a
matter of weeks for a few thousand dollars.
DNA sequencing performed on an industrial scale has produced
a vast amount of data to analyze. In August 2005 it was
announced that the three largest public collections of DNA and
RNA sequences together store one hundred billion bases,
representing over 165,000 different organisms. As sequence
data began to pile up, the need for new and better methods of
sequence analysis was critical.
Bioinformatics is the branch of biology that is concerned with the gaining, storage, and analysis
of the information found in nucleic acid and protein sequence data. Computers and bioinformatics
software are the tools of the trade.
Genetic data represent a treasure trove for researchers and
companies interested in how genes contribute to our
health and well being. Almost half of the genes identified
by the Human Genome Project have no known function.
Researchers are using bioinformatics to identify genes,
establish their functions, and develop gene-based
strategies for preventing, diagnosing, and treating
disease.
A DNA sequencing reaction produces a sequence that is
several hundred bases long. Gene sequences typically run for
thousands of bases. The largest known gene is that associated
with Duchenne muscular dystrophy. It is approximately 2.4
million bases in length. In order to study genes, scientists first
assemble long DNA sequences from series of shorter
overlapping sequences.
Scientists enter their assembled sequences into genetic databases so that other scientists may use
the data. Since the sequences of the two DNA strands are complementary, it is only necessary to
enter the sequence of one DNA strand into a database. By selecting an appropriate computer
program, scientists can use sequence data to look for genes, get clues to gene functions,
examine genetic variation, and explore evolutionary relationships. Bioinformatics is a young
and dynamic science. New bioinformatic software is being developed while existing software is
continually updated.
Definition:
Bioinformatics is an interdisciplinary field that develops methods and software tools for
understanding biological data. As an interdisciplinary field of science, bioinformatics combines
computer science, statistics, mathematics, and engineering to analyze and interpret biological data.
The National Center for Biotechnology Information (NCBI 2001) defines bioinformatics as:
“Bioinformatics is the field of science in which biology, computer science, and information
technology merge into a single discipline”.
Bioinformatics tools aid in the comparison of genetic and genomic data and more generally in the
understanding of evolutionary aspects of molecular biology. At a more integrative level, it helps
analyze and catalogue the biological pathways and networks that are an important part of systems
biology. In structural biology, it aids in the simulation and modeling of DNA, RNA, and protein
structures as well as molecular interactions.
History:
Historically, the term bioinformatics did not mean what it means today. Paulien Hogeweg and
Ben Hesper coined it in 1970 to refer to the study of information processes in biotic systems.
A Chronological History of Bioinformatics
• 1953 - Watson & Crick proposed the double helix model for DNA based x-ray data obtained
by Franklin & Wilkins.
• 1954 - Perutz's group develop heavy atom methods to solve the phase problem in protein
crystallography.
• 1955 - The sequence of the first protein to be analysed, bovine insulin, is announed by
F.Sanger.
• 1970 - The details of the Needleman-Wunsch algorithm for sequence comparison are
published.
• 1972 - The first recombinant DNA molecule is created by Paul Berg and his group.
• 1973 - The Brookhaven Protein DataBank is announeced.
• 1974 - Vint Cerf and Robert Khan develop the concept of connecting networks of computers
into an "internet" and develop the Transmission Control Protocol (TCP).
• 1975 - Microsoft Corporation is founded by Bill Gates and Paul Allen. Two-dimensional
electrophoresis, where separation of proteins on SDS polyacrylamide gel is combined with
separation according to isoelectric points, is announced by P.H.O'Farrel.
• 1988 - The National Centre for Biotechnology Information (NCBI) is established at the
National Cancer Institute. The Human Genome Intiative is started in 1988.
The FASTA algorith for sequence comparison is published by Pearson and Lupman.
• 1990 - The BLAST program (Altschul,et.al.) is implemented.
• 1991 -Myriad Genetics, Inc. is founded in Utah. The company's goal is to lead in the
discovery of major common human disease genes and their related pathways. The company
has discovered and sequenced, with its academic collaborators, the
Following major genes: BRCA1, BRACA1, CHD1, MMAC1, MMSC1, MMSC2, CtIP, p16,
p19 and MTS2.
• 1995 - The Haemophilus influenzea genome (1.8) is sequenced. The Mycoplasma genitalium
genome is sequenced.
• 1996 - The genome for Saccharomyces cerevisiae (baker's yeadt, 12.1 Mb) is sequenced.
• 1997 - The genome for E.coli (4.7 Mbp) is published.Oxford Molecualr Group acquires the
Genetics Computer Group.
• 1998 - The genomes for Caenorhabitis elegans and baker's yeast are published.The Swiss
Institute of Bioinformatics is established as a non-profit foundation.Craig Venter forms
Celera in Rockville, Maryland.
• 2000 - The genome for Pseudomonas aeruginosa (6.3 Mbp) is published. The Athaliana
genome (100 Mb) is secquenced.The D.melanogaster genome (180 Mb) is sequenced.
• 2001 - The human genome (3,000 Mbp) is published.
Goals:
 To study how normal cellular activities are altered in different disease states, the
biological data must be combined to form a comprehensive picture of these activities.
This includes nucleotide and amino acid sequences, protein domains, and protein
structures.
 The actual process of analyzing and interpreting data is referred to as computational
biology. Important sub-disciplines within bioinformatics and computational biology
include:
 Development and implementation of computer programs that enable efficient access
to, use and management of, various types of information
 Development of new algorithms (mathematical formulas) and statistical measures that
assess relationships among members of large data sets. For example, there are methods
to locate a gene within a sequence, to predict protein structure and/or function, and to
cluster protein sequences into families of related sequences.
 The primary goal of bioinformatics is to increase the understanding of biological
processes. Examples include: pattern recognition, data mining, machine learning
algorithms, and visualization.
 Major research efforts in the field include sequence alignment, gene finding, genome
assembly, drug design, drug discovery, protein structure alignment, protein structure
prediction, prediction of gene expression and protein–protein interactions, genome-
wide association studies, the modeling of evolution and cell division/mitosis.
 Common activities in bioinformatics include mapping and analyzing DNA and protein
sequences, aligning DNA and protein sequences to compare them, and creating and
viewing 3-D models of protein structures.
Applications of Bioinformatics:
1. Automatic Genome Sequencing:
Bioinformatics are used in genome sequencing techniques such as
a. Bioinformatics are used in the development of automated sequencing techniques that use
PCR or BAC based gene amplifications, two dimensional electrophoresis and also automated
reading of nucleotides.
b. Bioinformatics are used in joining the sequence of smaller fragments of DNA to form a
complete genome sequence.
c. Bioinformatics are also used in the prediction of promoter region of the genome.
d. Bioinformatics are also used in the prediction of promoter coding region of the genome.
Small fragments of genome can be amplified using polymerase chain reaction or bacterial
artificial chromosome. Amplified fragments may suffer from nucleotide reading errors,
repeats; these in turn generate multiple copies of the same genome fragments. These repeats
are removed just before the final assembly of all genome fragments via mathematical models.
2. Identification of Genes:
Bioinformatics programs such as GLIMMER and GenBank are used to identify the coding
region or open reading frames in the genome.
3. Identifying Gene Function:
After identifying the open reading frames present in the genome, nest step is to annotate the
structure and function of the genes. Bioinformatics tool such as sequence search and pair-wise
gene alignment technique are used to identify the gene function. The major four algorithms
such as BLAST, BLOSUM, ClustalX and SMART are used for the functional annotation of
genes.
4. Three-dimensional (3D) Structure Modelling:
Single protein may present in different conformational states depending upon its interaction
with other proteins present in the vicinity. Three-dimensional structure of the protein has got
some regions exposed for protein-protein or protein-DNA interactions. Also the function of
these proteins are also depends upon these interactions.
Bioinformatics techniques are also used to predict the possible conformation of the protein
coded by a gene; this in turn helps in identifying the function of the same protein.
5. Pair-wise Genome Comparison:
After the identification of functioning of a particular gene, bioinformatics is also used for pair-
wise genome comparisons. Pair-wise comparison of a genome with itself is done to get the
details of paralogous genes or duplicated genes that have the same base sequence with some
functional variations. Also pair-wise genome comparison is also done against other genome to
get information such as orthologous genes. These genes are equivalent genes present in two
different genomes due to speciation. This technique is also used to identify different types of
gene-groups and adjacent genes that occur in the close proximity as they are involved in
common higher level function, and also lateral-gene transfer that is transfer of genes from a
microorganism that is evolutionary distant. Bioinformatics technique is also used to analyse
gene-fusion or gene-fission, gene-group duplication and much more.
6. Vaccine Design:
Bioinformatics techniques are used in the effective and efficient designing of vaccine and also
in the designing of antimicrobial agents.
7. Microbial Genome Sequencing:
Bioinformatics is also used in automating microbial genome sequencing.
8. Genome Comparison:
Bioinformatics techniques is also used to genome as follows,
a. Identifying conserved function within a genome family
b. Identifying a specific genes in a group of genomes
c. Modelling 3D structure of proteins
d. Docking of biochemical compounds as well as receptors.
This helps in developing integrated data bases over the internet. This also helps in
understanding function of gene and also genome.
Bioinformatics has got major impacts on biotechnology and its applications. The vast amount
of data generated by human genome project or by other genome sequencing project would be
unmanageable without the bioinformatics technique. Without bioinformatics handling,
interpretation of these data would be unthinkable. Due to the bioinformatics application cost
of all these major projects comes down drastically.
9. Medicine
In the field of Medicine applications of bioinformatics is used for following areas:
a. Drug Discovery:
The Idea of using X ray Crystallography in drug discovery emerged more than 30 years ago,
when the first 3 dimensional structure of protein was determined. Within a decade, a radical
change in drug design had begun, incarporating the knowledge of 3 dimensional structures of
target protein into design process. Protein structure can influence drug discovery at every stage
in design process. Classically, it is used in lead optimization, a process that uses structure to
guide the chemical modification of a lead molecule to give an optimized fit in terms of shape,
hydrogen bonds and other non -covalent interactions with the target.
b. Personal Medicine:
Personalized medicine is a medical model that proposes the customization of healthcare, with
all decisions and practices being tailored to the individual patient by use of genetic or other
information. Practical application outside of long established considerations like a patient's
family history, social circumstances, environment and behaviors are very limited so far and
practically no progress has been made in the last decade. Personalized medicine research
attempts to identify individual solutions based on the susceptibility profile of each individual.
It is hoped that these fields will enable new approaches to diagnosis, drug development, and
individualized therapy.
c. Preventive Medicine:
Preventive medicine or preventive care consists of measures taken to prevent diseases, (or
injuries) rather than curing them or treating their symptoms. This contrasts in method with
curative and palliative medicine, and in scope with public health methods (which work at the
level of population health rather than individual health)[4]. Simple examples of preventive
medicine include hand washing, breastfeeding, and immunizations.
d. Gene Therapy:
Gene therapy is a novel form of drug delivery that enlists the synthetic machinery of the
patient's cell to produce a therapeutic agent. It involves the efficient introduction of functional
gene into the appropriate cells of the patient in order to produce sufficient amount of protein
encoded by transferred gene (transgene) so as to precisely and permanently correct the
disorder. Strategies of Gene Therapy are following:
- Gene addition
- Removal of harmful gene by antisense nucleotide or ribozymes
- Control of gene expression
10. Microbial Genome Applications
In the field of Microbial Genome Applications, applications of bioinformatics are used for
following areas:
a. Waste Cleanup:
In bioinformatics bacteria and microbes are identified which are helpful in cleaning waste.
Deinococcus radiodurans Bacterium is listed in the Guinness Book of World Records as "the
world's toughest bacterium." This bacterium has the ability to repair damaged DNA and small
fragments from chromosomes by isolating damage segments in a concentrated area. This is
because it has additional copies of its genome. Genes from other bacteria have been inserted
into D. radiodurans for environmental cleanup. It was used to break down organic chemicals,
solvents and heavy metals in radioactive waste sites.
b. Climate Change:
Climate change is caused by factors that include oceanic processes (such as oceanic
circulation), variations in solar radiation received by Earth, plate tectonics and volcanic
eruptions, and human-induced alterations of the natural world. By studying microorganisms
genome scientists can begin to understand these microbes at a very fundamental level and
isolated the genes that give them their unique abilities to survive under extreme conditions.
Rhodopseudomonas palustris is a purple non-sulfur phototrophic bacterium commonly found
in soils and water. It converts sunlight to cellular energy by absorbing atmospheric carbon
dioxide and converting it to biomass. This microbe can also degrade and recycle a variety of
aromatic compounds that comprise lignin, the main constituent of wood and the second most
abundant polymer on earth. R. palustris is acknowledged by microbiologists to be one of the
most metabolically versatile bacteria ever described. Not only can it convert carbon dioxide
gas into cell material but nitrogen gas into ammonia, and it can produce hydrogen gas. It grows
both in the absence and presence of oxygen. In the absence of oxygen, it prefers to generate all
its energy from light by photosynthesis.
c. Biotechnology:
The wide concept of "biotech" or "biotechnology" encompasses a wide range of procedures
for modifying living organisms according to human purposes, going back to domestication of
animals, cultivation of plants, and "improvements" to these through breeding programs that
employ artificial selection and hybridization. Modern usage also includes genetic engineering
as well as cell and tissue culture technologies. In the field of bioinformatics, biotechnology has
identified organisms and microorganisms which can be very useful in dairy industry and food
manufacturers. Lactococcus Lactis is one of the most important micro-organisms involved in
the dairy industry, it is a non-pathogenic rod-shaped bacterium that is critical for
manufacturing dairy products like buttermilk, yogurt and cheese. This bacterium, is also used
to prepare pickled vegetables, beer, wine, some breads and sausages and other fermented foods.
Researchers anticipate that understanding the physiology and genetic make-up of this
bacterium will prove invaluable for food manufacturers as well as the pharmaceutical industry,
which is exploring the capacity of L. lactis to serve as a vehicle for delivering drugs.
d. Alternative Energy:
Scientists are studying the genome of the microbe Chlorobium tepidum which has an unusual
capacity for generating energy from light. Chlorobium tepidumis a thermophilic Gram-
negative green sulfur bacterium isolated from a hot spring in New Zealand in which it forms a
dense mat. The bacterium carries out photosynthesis in ways that are different from plants and
other bacteria. Unlike plants, the green bacteria do not produce oxygen from photosynthesis.
According to some researchers, photosynthesis may have its evolutionary origins in organisms
like C. tepidum. Such species would have been able to harvest energy from light at a time when
the Earth's atmosphere had little oxygen. In addition, the organisms' ability to grow in low-
light environments may have helped them limit their exposure to UV irradiation, which was
likely at higher levels in the early days of Earth.
11. Agriculture
In the field of Agriculture, applications of bioinformatics are in following areas:
a. Crop Improvement:
Comparative genetics of the plant genomes has shown that the organisation of their genes has
remained more conserved over evolutionary time than was previously believed. These findings
suggest that information obtained from the model crop systems can be used to suggest
improvements to other food crops. Arabidopsis thaliana (water cress) and Oryza sativa (rice)
are examples of available complete plant genomes. Arabidopsis thaliana was the first plant to
be sequenced and is considered the model species for investigating plant genetics and biology.
There are many genes which are similar in all plants and the study of genes in a model organism
like A. thaliana facillitates our understanding of gene expression and function in all plants.
Furthermore, since animals and plants are both eukaryotes, many of the genes found in A.
thaliana have homologs in animals. Arabidopsis has the smallest genome of any flowering
plant, which is the main reason it was selected as a model organism for genome sequencing
The DNA of Arabidopsis is made up of about 140 million bases, which are parcelled into five
chromosomes. Oryza sativa (rice) is the most important crop for human consumption,
providing staple food for more than half of the world population. Oryza sativa was the cereal
selected to be sequenced as a priority and has gained the status "model organism". It has the
smallest genome of all the cereals: 430 million nucleotides and it can serve as a model genome
for one of the two main groups of flowering plants, the monocotyledons. Because it has been
the subject of studies on yield, hybrid vigor, genetic resistance to disease and adaptive
responses, scientists have taken advantage of the existence of a multitude of varieties that have
adapted to a very wide range of environmental conditions, from dry soil in temperate regions
to flooded cultures in tropical regions.
b. Insect Resistance:
Genes from Bacillus thuringiensis that can control a number of serious pests have been
successfully transferred to cotton, maize and potatoes. This new ability of the plants to resist
insect attack means that the amount of insecticides being used can be reduced and hence the
nutritional quality of the crops is increased. Bacillus thuringiensis is a pathogenic bacteria used
for insect control. It is Gram-positive spore-forming, rod-shaped aerobic bacteria in the genus
Bacillus. B. thuringiensisis an insecticidal bacterium, marketed worldwide for control of many
important plant pests - mainly caterpillars of the Lepidoptera (butterflies and moths) but also
mosquito larvae, and simuliid blackflies that vector river blindness in Africa. B. thuringiensis
products represent about 1% of the total 'agrochemical' market (fungicides, herbicides and
insecticides) across the world. The commercial B. thuringiensis products are powders
containing a mixture of dried spores and toxin crystals. They are applied to leaves or other
environments where the insect larvae feed. The toxin genes have also been genetically
engineered into several crop plants.
c. Improve Nutritional Quality:
Scientists have recently succeeded in transferring genes into rice to increase levels of Vitamin
A, iron and other micronutrients. This work could have a profound impact in reducing
occurrences of blindness and anaemia caused by deficiencies in Vitamin A and iron
respectively. Scientists have inserted a gene from yeast into the tomato, and the result is a plant
whose fruit stays longer on the vine and has an extended shelf life. One little gene may be all
that stands between a fresh, juicy, homegrown tomato and its bland, store-bought counterpart.
Biologists announced that they've identified the gene that controls the ripening process in the
humble fruit. If this "rin" gene can be manipulated effectively, scientists will be able to create
breeds of tomatoes that will be more flavorful even after the long journey from the vine to the
produce department. Today, tomatoes are plucked from the vine early, when still green and
firm, to ensure that they survive shipping without bruising and rotting. Picking tomatoes early
means they have less chance to develop flavor, color, and nutrients naturally. By manipulating
the "rin" gene, scientists will be able to slow the ripening process, letting the tomato develop
on the vine for longer - but still keeping it firm enough to ship safely. The scientists responsible
for the "rin" gene findings are from the U.S. Department of Agriculture and the Boyce
Thompson Institute for Plant Research, on the campus of Cornell University. They hope that
their technique may also be applied to other fruits - such as strawberries, bananas, bell peppers,
and melons - which suffer from the same shipping and storage complications.

Weitere ähnliche Inhalte

Was ist angesagt? (20)

Biological databases
Biological databasesBiological databases
Biological databases
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICS
 
SWISS-PROT
SWISS-PROTSWISS-PROT
SWISS-PROT
 
Tools and database of NCBI
Tools and database of NCBITools and database of NCBI
Tools and database of NCBI
 
Application of bioinformatics in agriculture sector
Application of bioinformatics in agriculture sectorApplication of bioinformatics in agriculture sector
Application of bioinformatics in agriculture sector
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Structural databases
Structural databases Structural databases
Structural databases
 
Bioinformatics & its scope in biotech.
Bioinformatics & its scope in biotech.Bioinformatics & its scope in biotech.
Bioinformatics & its scope in biotech.
 
Cath
CathCath
Cath
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
 
Computational Biology and Bioinformatics
Computational Biology and BioinformaticsComputational Biology and Bioinformatics
Computational Biology and Bioinformatics
 
Kegg databse
Kegg databseKegg databse
Kegg databse
 
Bioinformatics Applications in Biotechnology
Bioinformatics Applications in BiotechnologyBioinformatics Applications in Biotechnology
Bioinformatics Applications in Biotechnology
 
Protein Data Bank (PDB)
Protein Data Bank (PDB)Protein Data Bank (PDB)
Protein Data Bank (PDB)
 

Andere mochten auch

Plants as bioreactor
Plants as bioreactorPlants as bioreactor
Plants as bioreactorhina amir
 
Bioreactors for animal cell suspension culture
Bioreactors for animal cell suspension cultureBioreactors for animal cell suspension culture
Bioreactors for animal cell suspension cultureGrace Felciya
 
Bioreactors for plant cell cultures
Bioreactors for plant cell culturesBioreactors for plant cell cultures
Bioreactors for plant cell culturesDelince Samuel
 
Bioreactors for plant cell suspension culture
Bioreactors for plant cell suspension cultureBioreactors for plant cell suspension culture
Bioreactors for plant cell suspension cultureGrace Felciya
 
GENE TRANSFER TECHNIQUES
GENE TRANSFER TECHNIQUESGENE TRANSFER TECHNIQUES
GENE TRANSFER TECHNIQUESsavvysahana
 
Agrobacterium mediated gene transfer in plants.
Agrobacterium mediated gene transfer in plants.Agrobacterium mediated gene transfer in plants.
Agrobacterium mediated gene transfer in plants.ICHHA PURAK
 
Pulses: India and World
Pulses: India and WorldPulses: India and World
Pulses: India and WorldAnand Thokal
 
Agrobacterium mediated gene transfer
Agrobacterium mediated gene transferAgrobacterium mediated gene transfer
Agrobacterium mediated gene transfersushantasarma
 
Agrobacterium mediated gene transfer
Agrobacterium mediated gene transferAgrobacterium mediated gene transfer
Agrobacterium mediated gene transferPrabhu Thirusangu
 
Types of Bioreactors / Fermenters
Types of Bioreactors / FermentersTypes of Bioreactors / Fermenters
Types of Bioreactors / Fermentersajithnandanam
 
Biotransformation (metabolism) of drugs
Biotransformation (metabolism) of drugsBiotransformation (metabolism) of drugs
Biotransformation (metabolism) of drugsZaigham Hammad
 

Andere mochten auch (17)

Plants as bioreactor
Plants as bioreactorPlants as bioreactor
Plants as bioreactor
 
plant as bioreactor
plant as bioreactorplant as bioreactor
plant as bioreactor
 
Bioreactors for animal cell suspension culture
Bioreactors for animal cell suspension cultureBioreactors for animal cell suspension culture
Bioreactors for animal cell suspension culture
 
Bioreactors for plant cell cultures
Bioreactors for plant cell culturesBioreactors for plant cell cultures
Bioreactors for plant cell cultures
 
Bioreactors for plant cell suspension culture
Bioreactors for plant cell suspension cultureBioreactors for plant cell suspension culture
Bioreactors for plant cell suspension culture
 
GENE TRANSFER TECHNIQUES
GENE TRANSFER TECHNIQUESGENE TRANSFER TECHNIQUES
GENE TRANSFER TECHNIQUES
 
Ti plasmid
Ti plasmidTi plasmid
Ti plasmid
 
Agrobacterium mediated gene transfer in plants.
Agrobacterium mediated gene transfer in plants.Agrobacterium mediated gene transfer in plants.
Agrobacterium mediated gene transfer in plants.
 
Pulses: India and World
Pulses: India and WorldPulses: India and World
Pulses: India and World
 
Agrobacterium mediated gene transfer
Agrobacterium mediated gene transferAgrobacterium mediated gene transfer
Agrobacterium mediated gene transfer
 
Agrobacterium mediated gene transfer
Agrobacterium mediated gene transferAgrobacterium mediated gene transfer
Agrobacterium mediated gene transfer
 
Biotransformation
BiotransformationBiotransformation
Biotransformation
 
Types of Bioreactors / Fermenters
Types of Bioreactors / FermentersTypes of Bioreactors / Fermenters
Types of Bioreactors / Fermenters
 
Cereals & pulses
Cereals & pulsesCereals & pulses
Cereals & pulses
 
Biotransformation (metabolism) of drugs
Biotransformation (metabolism) of drugsBiotransformation (metabolism) of drugs
Biotransformation (metabolism) of drugs
 
Gene transfer (2)
Gene transfer (2)Gene transfer (2)
Gene transfer (2)
 
Gene transfer technologies
Gene transfer technologiesGene transfer technologies
Gene transfer technologies
 

Ähnlich wie Bioinformatics Introduction

Bioinformatics
BioinformaticsBioinformatics
BioinformaticsAmna Jalil
 
Human genome project by kk sahu
Human genome project by kk sahuHuman genome project by kk sahu
Human genome project by kk sahuKAUSHAL SAHU
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSsandeshGM
 
Pcmd bioinformatics-lecture i
Pcmd bioinformatics-lecture iPcmd bioinformatics-lecture i
Pcmd bioinformatics-lecture iMuhammad Younis
 
Genomics and Bioinformatics
Genomics and BioinformaticsGenomics and Bioinformatics
Genomics and BioinformaticsAmit Garg
 
Bioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST ToolBioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST ToolJesminBinti
 
Human genome project - Decoding the codes of life
Human genome project - Decoding the codes of lifeHuman genome project - Decoding the codes of life
Human genome project - Decoding the codes of lifearjunaa7
 
Genome data management
Genome data managementGenome data management
Genome data managementShareb Ismaeel
 
History and devolopment of bioinfomatics.ppt (1)
History and devolopment of bioinfomatics.ppt (1)History and devolopment of bioinfomatics.ppt (1)
History and devolopment of bioinfomatics.ppt (1)Madan Kumar Ca
 
Introducción a la bioinformatica
Introducción a la bioinformaticaIntroducción a la bioinformatica
Introducción a la bioinformaticaMartín Arrieta
 
The human genome project was started in 1990 with the goal of sequencing and ...
The human genome project was started in 1990 with the goal of sequencing and ...The human genome project was started in 1990 with the goal of sequencing and ...
The human genome project was started in 1990 with the goal of sequencing and ...Rania Malik
 
Describe in your own words the benefits, but also the problems of ha.pdf
Describe in your own words the benefits, but also the problems of ha.pdfDescribe in your own words the benefits, but also the problems of ha.pdf
Describe in your own words the benefits, but also the problems of ha.pdfarenamobiles123
 
PAPER 3.1 ~ HUMAN GENOME PROJECT
PAPER 3.1 ~  HUMAN GENOME PROJECTPAPER 3.1 ~  HUMAN GENOME PROJECT
PAPER 3.1 ~ HUMAN GENOME PROJECTNusrat Gulbarga
 

Ähnlich wie Bioinformatics Introduction (20)

Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Human genome project by kk sahu
Human genome project by kk sahuHuman genome project by kk sahu
Human genome project by kk sahu
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICS
 
Pcmd bioinformatics-lecture i
Pcmd bioinformatics-lecture iPcmd bioinformatics-lecture i
Pcmd bioinformatics-lecture i
 
rheumatoid arthritis
rheumatoid arthritisrheumatoid arthritis
rheumatoid arthritis
 
Basic of bioinformatics
Basic of bioinformaticsBasic of bioinformatics
Basic of bioinformatics
 
Genomics and Bioinformatics
Genomics and BioinformaticsGenomics and Bioinformatics
Genomics and Bioinformatics
 
Bioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST ToolBioinformatics Introduction and Use of BLAST Tool
Bioinformatics Introduction and Use of BLAST Tool
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Human genome project - Decoding the codes of life
Human genome project - Decoding the codes of lifeHuman genome project - Decoding the codes of life
Human genome project - Decoding the codes of life
 
Genome data management
Genome data managementGenome data management
Genome data management
 
Bioinformatics .pptx
Bioinformatics .pptxBioinformatics .pptx
Bioinformatics .pptx
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
History and devolopment of bioinfomatics.ppt (1)
History and devolopment of bioinfomatics.ppt (1)History and devolopment of bioinfomatics.ppt (1)
History and devolopment of bioinfomatics.ppt (1)
 
Introducción a la bioinformatica
Introducción a la bioinformaticaIntroducción a la bioinformatica
Introducción a la bioinformatica
 
Bio informatics
Bio informaticsBio informatics
Bio informatics
 
Bio informatics
Bio informaticsBio informatics
Bio informatics
 
The human genome project was started in 1990 with the goal of sequencing and ...
The human genome project was started in 1990 with the goal of sequencing and ...The human genome project was started in 1990 with the goal of sequencing and ...
The human genome project was started in 1990 with the goal of sequencing and ...
 
Describe in your own words the benefits, but also the problems of ha.pdf
Describe in your own words the benefits, but also the problems of ha.pdfDescribe in your own words the benefits, but also the problems of ha.pdf
Describe in your own words the benefits, but also the problems of ha.pdf
 
PAPER 3.1 ~ HUMAN GENOME PROJECT
PAPER 3.1 ~  HUMAN GENOME PROJECTPAPER 3.1 ~  HUMAN GENOME PROJECT
PAPER 3.1 ~ HUMAN GENOME PROJECT
 

Mehr von Vidya Kalaivani Rajkumar

Transgenic plants- Abiotic stress tolerance
Transgenic plants- Abiotic stress toleranceTransgenic plants- Abiotic stress tolerance
Transgenic plants- Abiotic stress toleranceVidya Kalaivani Rajkumar
 
Protein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOLProtein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOLVidya Kalaivani Rajkumar
 

Mehr von Vidya Kalaivani Rajkumar (20)

Recombinant vaccines-Peptide Vaccines
Recombinant vaccines-Peptide Vaccines Recombinant vaccines-Peptide Vaccines
Recombinant vaccines-Peptide Vaccines
 
Transgenic plants- Abiotic stress tolerance
Transgenic plants- Abiotic stress toleranceTransgenic plants- Abiotic stress tolerance
Transgenic plants- Abiotic stress tolerance
 
Bioreactors in tissue engineering
Bioreactors in tissue engineeringBioreactors in tissue engineering
Bioreactors in tissue engineering
 
Tissue assembly in microgravity
Tissue assembly in microgravityTissue assembly in microgravity
Tissue assembly in microgravity
 
In vivo synthesis of tissues and organs
In vivo synthesis of tissues and organsIn vivo synthesis of tissues and organs
In vivo synthesis of tissues and organs
 
Bioartificial pancreas
Bioartificial pancreasBioartificial pancreas
Bioartificial pancreas
 
Biomaterials for tissue engineering
Biomaterials for tissue engineeringBiomaterials for tissue engineering
Biomaterials for tissue engineering
 
Haematopoietic system
Haematopoietic systemHaematopoietic system
Haematopoietic system
 
Fasta
FastaFasta
Fasta
 
Water vascular system of star fish
Water vascular system of star fishWater vascular system of star fish
Water vascular system of star fish
 
Cephalopodes are advance molluscs
Cephalopodes are advance molluscsCephalopodes are advance molluscs
Cephalopodes are advance molluscs
 
Beat air pollution
Beat air pollution Beat air pollution
Beat air pollution
 
Birth control methods
Birth control methodsBirth control methods
Birth control methods
 
Future of human evolution
Future of human evolutionFuture of human evolution
Future of human evolution
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Assignment on developmental zoology
Assignment on developmental zoologyAssignment on developmental zoology
Assignment on developmental zoology
 
Development of chick
Development of chickDevelopment of chick
Development of chick
 
Major biological nucleotide databases
Major biological nucleotide databasesMajor biological nucleotide databases
Major biological nucleotide databases
 
Protein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOLProtein structure visualization tools-RASMOL
Protein structure visualization tools-RASMOL
 
Swiss pdb viewer
Swiss pdb viewerSwiss pdb viewer
Swiss pdb viewer
 

Kürzlich hochgeladen

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 

Kürzlich hochgeladen (20)

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 

Bioinformatics Introduction

  • 1. Bioinformatics Introduction When the Human Genome Project was begun in 1990 it was understood that to meet the project's goals, the speed of DNA sequencing would have to increase and the cost would have to come down. Over the life of the project virtually every aspect of DNA sequencing was improved. It took the project approximately four years to sequence its first one billion bases but just four months to sequence the second billion bases. During the month of January, 2003, 1.5 billion bases were sequenced. As the speed of DNA sequencing increased, the cost decreased from 10 dollars per base in 1990 to 10 cents per base at the conclusion of the project in April 2003. Although the Human Genome Project is officially over, improvements in DNA sequencing continue to be made. Researchers are experimenting with new methods for sequencing DNA that have the potential to sequence a human genome in just a matter of weeks for a few thousand dollars. DNA sequencing performed on an industrial scale has produced a vast amount of data to analyze. In August 2005 it was announced that the three largest public collections of DNA and RNA sequences together store one hundred billion bases, representing over 165,000 different organisms. As sequence data began to pile up, the need for new and better methods of sequence analysis was critical. Bioinformatics is the branch of biology that is concerned with the gaining, storage, and analysis of the information found in nucleic acid and protein sequence data. Computers and bioinformatics software are the tools of the trade. Genetic data represent a treasure trove for researchers and companies interested in how genes contribute to our health and well being. Almost half of the genes identified by the Human Genome Project have no known function. Researchers are using bioinformatics to identify genes, establish their functions, and develop gene-based strategies for preventing, diagnosing, and treating disease.
  • 2. A DNA sequencing reaction produces a sequence that is several hundred bases long. Gene sequences typically run for thousands of bases. The largest known gene is that associated with Duchenne muscular dystrophy. It is approximately 2.4 million bases in length. In order to study genes, scientists first assemble long DNA sequences from series of shorter overlapping sequences. Scientists enter their assembled sequences into genetic databases so that other scientists may use the data. Since the sequences of the two DNA strands are complementary, it is only necessary to enter the sequence of one DNA strand into a database. By selecting an appropriate computer program, scientists can use sequence data to look for genes, get clues to gene functions, examine genetic variation, and explore evolutionary relationships. Bioinformatics is a young and dynamic science. New bioinformatic software is being developed while existing software is continually updated. Definition: Bioinformatics is an interdisciplinary field that develops methods and software tools for understanding biological data. As an interdisciplinary field of science, bioinformatics combines computer science, statistics, mathematics, and engineering to analyze and interpret biological data. The National Center for Biotechnology Information (NCBI 2001) defines bioinformatics as: “Bioinformatics is the field of science in which biology, computer science, and information technology merge into a single discipline”. Bioinformatics tools aid in the comparison of genetic and genomic data and more generally in the understanding of evolutionary aspects of molecular biology. At a more integrative level, it helps analyze and catalogue the biological pathways and networks that are an important part of systems biology. In structural biology, it aids in the simulation and modeling of DNA, RNA, and protein structures as well as molecular interactions. History: Historically, the term bioinformatics did not mean what it means today. Paulien Hogeweg and Ben Hesper coined it in 1970 to refer to the study of information processes in biotic systems. A Chronological History of Bioinformatics • 1953 - Watson & Crick proposed the double helix model for DNA based x-ray data obtained by Franklin & Wilkins. • 1954 - Perutz's group develop heavy atom methods to solve the phase problem in protein crystallography. • 1955 - The sequence of the first protein to be analysed, bovine insulin, is announed by F.Sanger. • 1970 - The details of the Needleman-Wunsch algorithm for sequence comparison are published. • 1972 - The first recombinant DNA molecule is created by Paul Berg and his group.
  • 3. • 1973 - The Brookhaven Protein DataBank is announeced. • 1974 - Vint Cerf and Robert Khan develop the concept of connecting networks of computers into an "internet" and develop the Transmission Control Protocol (TCP). • 1975 - Microsoft Corporation is founded by Bill Gates and Paul Allen. Two-dimensional electrophoresis, where separation of proteins on SDS polyacrylamide gel is combined with separation according to isoelectric points, is announced by P.H.O'Farrel. • 1988 - The National Centre for Biotechnology Information (NCBI) is established at the National Cancer Institute. The Human Genome Intiative is started in 1988. The FASTA algorith for sequence comparison is published by Pearson and Lupman. • 1990 - The BLAST program (Altschul,et.al.) is implemented. • 1991 -Myriad Genetics, Inc. is founded in Utah. The company's goal is to lead in the discovery of major common human disease genes and their related pathways. The company has discovered and sequenced, with its academic collaborators, the
  • 4. Following major genes: BRCA1, BRACA1, CHD1, MMAC1, MMSC1, MMSC2, CtIP, p16, p19 and MTS2. • 1995 - The Haemophilus influenzea genome (1.8) is sequenced. The Mycoplasma genitalium genome is sequenced. • 1996 - The genome for Saccharomyces cerevisiae (baker's yeadt, 12.1 Mb) is sequenced. • 1997 - The genome for E.coli (4.7 Mbp) is published.Oxford Molecualr Group acquires the Genetics Computer Group. • 1998 - The genomes for Caenorhabitis elegans and baker's yeast are published.The Swiss Institute of Bioinformatics is established as a non-profit foundation.Craig Venter forms Celera in Rockville, Maryland. • 2000 - The genome for Pseudomonas aeruginosa (6.3 Mbp) is published. The Athaliana genome (100 Mb) is secquenced.The D.melanogaster genome (180 Mb) is sequenced. • 2001 - The human genome (3,000 Mbp) is published. Goals:  To study how normal cellular activities are altered in different disease states, the biological data must be combined to form a comprehensive picture of these activities. This includes nucleotide and amino acid sequences, protein domains, and protein structures.  The actual process of analyzing and interpreting data is referred to as computational biology. Important sub-disciplines within bioinformatics and computational biology include:  Development and implementation of computer programs that enable efficient access to, use and management of, various types of information  Development of new algorithms (mathematical formulas) and statistical measures that assess relationships among members of large data sets. For example, there are methods to locate a gene within a sequence, to predict protein structure and/or function, and to cluster protein sequences into families of related sequences.  The primary goal of bioinformatics is to increase the understanding of biological processes. Examples include: pattern recognition, data mining, machine learning algorithms, and visualization.  Major research efforts in the field include sequence alignment, gene finding, genome assembly, drug design, drug discovery, protein structure alignment, protein structure prediction, prediction of gene expression and protein–protein interactions, genome- wide association studies, the modeling of evolution and cell division/mitosis.  Common activities in bioinformatics include mapping and analyzing DNA and protein sequences, aligning DNA and protein sequences to compare them, and creating and viewing 3-D models of protein structures. Applications of Bioinformatics: 1. Automatic Genome Sequencing:
  • 5. Bioinformatics are used in genome sequencing techniques such as a. Bioinformatics are used in the development of automated sequencing techniques that use PCR or BAC based gene amplifications, two dimensional electrophoresis and also automated reading of nucleotides. b. Bioinformatics are used in joining the sequence of smaller fragments of DNA to form a complete genome sequence. c. Bioinformatics are also used in the prediction of promoter region of the genome. d. Bioinformatics are also used in the prediction of promoter coding region of the genome. Small fragments of genome can be amplified using polymerase chain reaction or bacterial artificial chromosome. Amplified fragments may suffer from nucleotide reading errors, repeats; these in turn generate multiple copies of the same genome fragments. These repeats are removed just before the final assembly of all genome fragments via mathematical models. 2. Identification of Genes: Bioinformatics programs such as GLIMMER and GenBank are used to identify the coding region or open reading frames in the genome. 3. Identifying Gene Function: After identifying the open reading frames present in the genome, nest step is to annotate the structure and function of the genes. Bioinformatics tool such as sequence search and pair-wise gene alignment technique are used to identify the gene function. The major four algorithms such as BLAST, BLOSUM, ClustalX and SMART are used for the functional annotation of genes. 4. Three-dimensional (3D) Structure Modelling: Single protein may present in different conformational states depending upon its interaction with other proteins present in the vicinity. Three-dimensional structure of the protein has got some regions exposed for protein-protein or protein-DNA interactions. Also the function of these proteins are also depends upon these interactions. Bioinformatics techniques are also used to predict the possible conformation of the protein coded by a gene; this in turn helps in identifying the function of the same protein. 5. Pair-wise Genome Comparison: After the identification of functioning of a particular gene, bioinformatics is also used for pair- wise genome comparisons. Pair-wise comparison of a genome with itself is done to get the details of paralogous genes or duplicated genes that have the same base sequence with some functional variations. Also pair-wise genome comparison is also done against other genome to get information such as orthologous genes. These genes are equivalent genes present in two different genomes due to speciation. This technique is also used to identify different types of gene-groups and adjacent genes that occur in the close proximity as they are involved in common higher level function, and also lateral-gene transfer that is transfer of genes from a microorganism that is evolutionary distant. Bioinformatics technique is also used to analyse gene-fusion or gene-fission, gene-group duplication and much more. 6. Vaccine Design:
  • 6. Bioinformatics techniques are used in the effective and efficient designing of vaccine and also in the designing of antimicrobial agents. 7. Microbial Genome Sequencing: Bioinformatics is also used in automating microbial genome sequencing. 8. Genome Comparison: Bioinformatics techniques is also used to genome as follows, a. Identifying conserved function within a genome family b. Identifying a specific genes in a group of genomes c. Modelling 3D structure of proteins d. Docking of biochemical compounds as well as receptors. This helps in developing integrated data bases over the internet. This also helps in understanding function of gene and also genome. Bioinformatics has got major impacts on biotechnology and its applications. The vast amount of data generated by human genome project or by other genome sequencing project would be unmanageable without the bioinformatics technique. Without bioinformatics handling, interpretation of these data would be unthinkable. Due to the bioinformatics application cost of all these major projects comes down drastically. 9. Medicine In the field of Medicine applications of bioinformatics is used for following areas: a. Drug Discovery: The Idea of using X ray Crystallography in drug discovery emerged more than 30 years ago, when the first 3 dimensional structure of protein was determined. Within a decade, a radical change in drug design had begun, incarporating the knowledge of 3 dimensional structures of target protein into design process. Protein structure can influence drug discovery at every stage in design process. Classically, it is used in lead optimization, a process that uses structure to guide the chemical modification of a lead molecule to give an optimized fit in terms of shape, hydrogen bonds and other non -covalent interactions with the target. b. Personal Medicine: Personalized medicine is a medical model that proposes the customization of healthcare, with all decisions and practices being tailored to the individual patient by use of genetic or other information. Practical application outside of long established considerations like a patient's family history, social circumstances, environment and behaviors are very limited so far and practically no progress has been made in the last decade. Personalized medicine research attempts to identify individual solutions based on the susceptibility profile of each individual. It is hoped that these fields will enable new approaches to diagnosis, drug development, and individualized therapy. c. Preventive Medicine: Preventive medicine or preventive care consists of measures taken to prevent diseases, (or injuries) rather than curing them or treating their symptoms. This contrasts in method with curative and palliative medicine, and in scope with public health methods (which work at the
  • 7. level of population health rather than individual health)[4]. Simple examples of preventive medicine include hand washing, breastfeeding, and immunizations. d. Gene Therapy: Gene therapy is a novel form of drug delivery that enlists the synthetic machinery of the patient's cell to produce a therapeutic agent. It involves the efficient introduction of functional gene into the appropriate cells of the patient in order to produce sufficient amount of protein encoded by transferred gene (transgene) so as to precisely and permanently correct the disorder. Strategies of Gene Therapy are following: - Gene addition - Removal of harmful gene by antisense nucleotide or ribozymes - Control of gene expression 10. Microbial Genome Applications In the field of Microbial Genome Applications, applications of bioinformatics are used for following areas: a. Waste Cleanup: In bioinformatics bacteria and microbes are identified which are helpful in cleaning waste. Deinococcus radiodurans Bacterium is listed in the Guinness Book of World Records as "the world's toughest bacterium." This bacterium has the ability to repair damaged DNA and small fragments from chromosomes by isolating damage segments in a concentrated area. This is because it has additional copies of its genome. Genes from other bacteria have been inserted into D. radiodurans for environmental cleanup. It was used to break down organic chemicals, solvents and heavy metals in radioactive waste sites. b. Climate Change: Climate change is caused by factors that include oceanic processes (such as oceanic circulation), variations in solar radiation received by Earth, plate tectonics and volcanic eruptions, and human-induced alterations of the natural world. By studying microorganisms genome scientists can begin to understand these microbes at a very fundamental level and isolated the genes that give them their unique abilities to survive under extreme conditions. Rhodopseudomonas palustris is a purple non-sulfur phototrophic bacterium commonly found in soils and water. It converts sunlight to cellular energy by absorbing atmospheric carbon dioxide and converting it to biomass. This microbe can also degrade and recycle a variety of aromatic compounds that comprise lignin, the main constituent of wood and the second most abundant polymer on earth. R. palustris is acknowledged by microbiologists to be one of the most metabolically versatile bacteria ever described. Not only can it convert carbon dioxide gas into cell material but nitrogen gas into ammonia, and it can produce hydrogen gas. It grows both in the absence and presence of oxygen. In the absence of oxygen, it prefers to generate all its energy from light by photosynthesis. c. Biotechnology: The wide concept of "biotech" or "biotechnology" encompasses a wide range of procedures for modifying living organisms according to human purposes, going back to domestication of
  • 8. animals, cultivation of plants, and "improvements" to these through breeding programs that employ artificial selection and hybridization. Modern usage also includes genetic engineering as well as cell and tissue culture technologies. In the field of bioinformatics, biotechnology has identified organisms and microorganisms which can be very useful in dairy industry and food manufacturers. Lactococcus Lactis is one of the most important micro-organisms involved in the dairy industry, it is a non-pathogenic rod-shaped bacterium that is critical for manufacturing dairy products like buttermilk, yogurt and cheese. This bacterium, is also used to prepare pickled vegetables, beer, wine, some breads and sausages and other fermented foods. Researchers anticipate that understanding the physiology and genetic make-up of this bacterium will prove invaluable for food manufacturers as well as the pharmaceutical industry, which is exploring the capacity of L. lactis to serve as a vehicle for delivering drugs. d. Alternative Energy: Scientists are studying the genome of the microbe Chlorobium tepidum which has an unusual capacity for generating energy from light. Chlorobium tepidumis a thermophilic Gram- negative green sulfur bacterium isolated from a hot spring in New Zealand in which it forms a dense mat. The bacterium carries out photosynthesis in ways that are different from plants and other bacteria. Unlike plants, the green bacteria do not produce oxygen from photosynthesis. According to some researchers, photosynthesis may have its evolutionary origins in organisms like C. tepidum. Such species would have been able to harvest energy from light at a time when the Earth's atmosphere had little oxygen. In addition, the organisms' ability to grow in low- light environments may have helped them limit their exposure to UV irradiation, which was likely at higher levels in the early days of Earth. 11. Agriculture In the field of Agriculture, applications of bioinformatics are in following areas: a. Crop Improvement: Comparative genetics of the plant genomes has shown that the organisation of their genes has remained more conserved over evolutionary time than was previously believed. These findings suggest that information obtained from the model crop systems can be used to suggest improvements to other food crops. Arabidopsis thaliana (water cress) and Oryza sativa (rice) are examples of available complete plant genomes. Arabidopsis thaliana was the first plant to be sequenced and is considered the model species for investigating plant genetics and biology. There are many genes which are similar in all plants and the study of genes in a model organism like A. thaliana facillitates our understanding of gene expression and function in all plants. Furthermore, since animals and plants are both eukaryotes, many of the genes found in A. thaliana have homologs in animals. Arabidopsis has the smallest genome of any flowering plant, which is the main reason it was selected as a model organism for genome sequencing The DNA of Arabidopsis is made up of about 140 million bases, which are parcelled into five chromosomes. Oryza sativa (rice) is the most important crop for human consumption, providing staple food for more than half of the world population. Oryza sativa was the cereal selected to be sequenced as a priority and has gained the status "model organism". It has the smallest genome of all the cereals: 430 million nucleotides and it can serve as a model genome for one of the two main groups of flowering plants, the monocotyledons. Because it has been the subject of studies on yield, hybrid vigor, genetic resistance to disease and adaptive responses, scientists have taken advantage of the existence of a multitude of varieties that have
  • 9. adapted to a very wide range of environmental conditions, from dry soil in temperate regions to flooded cultures in tropical regions. b. Insect Resistance: Genes from Bacillus thuringiensis that can control a number of serious pests have been successfully transferred to cotton, maize and potatoes. This new ability of the plants to resist insect attack means that the amount of insecticides being used can be reduced and hence the nutritional quality of the crops is increased. Bacillus thuringiensis is a pathogenic bacteria used for insect control. It is Gram-positive spore-forming, rod-shaped aerobic bacteria in the genus Bacillus. B. thuringiensisis an insecticidal bacterium, marketed worldwide for control of many important plant pests - mainly caterpillars of the Lepidoptera (butterflies and moths) but also mosquito larvae, and simuliid blackflies that vector river blindness in Africa. B. thuringiensis products represent about 1% of the total 'agrochemical' market (fungicides, herbicides and insecticides) across the world. The commercial B. thuringiensis products are powders containing a mixture of dried spores and toxin crystals. They are applied to leaves or other environments where the insect larvae feed. The toxin genes have also been genetically engineered into several crop plants. c. Improve Nutritional Quality: Scientists have recently succeeded in transferring genes into rice to increase levels of Vitamin A, iron and other micronutrients. This work could have a profound impact in reducing occurrences of blindness and anaemia caused by deficiencies in Vitamin A and iron respectively. Scientists have inserted a gene from yeast into the tomato, and the result is a plant whose fruit stays longer on the vine and has an extended shelf life. One little gene may be all that stands between a fresh, juicy, homegrown tomato and its bland, store-bought counterpart. Biologists announced that they've identified the gene that controls the ripening process in the humble fruit. If this "rin" gene can be manipulated effectively, scientists will be able to create breeds of tomatoes that will be more flavorful even after the long journey from the vine to the produce department. Today, tomatoes are plucked from the vine early, when still green and firm, to ensure that they survive shipping without bruising and rotting. Picking tomatoes early means they have less chance to develop flavor, color, and nutrients naturally. By manipulating the "rin" gene, scientists will be able to slow the ripening process, letting the tomato develop on the vine for longer - but still keeping it firm enough to ship safely. The scientists responsible for the "rin" gene findings are from the U.S. Department of Agriculture and the Boyce Thompson Institute for Plant Research, on the campus of Cornell University. They hope that their technique may also be applied to other fruits - such as strawberries, bananas, bell peppers, and melons - which suffer from the same shipping and storage complications.