Comparative and functional genomics

Jalormi Parekh
M.Sc. Final Medical Biotechnology

• Genomics - It is the study of genomes.
• The field of genomics comprises of two
main areas:
1.Comparative genomics
2. Functional genomics

• Comparative genomics is a large-scale, holistic approach that
compares two or more genomes to discover the similarities
and differences between the genomes and to study the
biology of the individual genomes
• The subject of comparative genomics impinges on
– Evolutionary biology and phylogenetic reconstructions of the tree of
life,
– Drug discovery programs,
– Function predictions of hypothetical proteins
– Identification of genes, regulatory motifs and other non-coding
DNA motifs
– Genome flux and dynamics

• Comparative analysis of genome structure
• Comparative analysis of coding regions
• Comparative analysis of non-coding regions
Methods for comparative genomics

• The structure of different genomes can be compared at
three levels:
– Overall nucleotide statistics,
– Genome structure at DNA level,and
– Genome structure at gene level

• Overall nucleotide statistics, such as
– Genome size,
– Overall (G+C) content,
– Regions of different (G+C) content,
– Genome signature such as codon usage biases,
– Amino acid usage biases, and the ratio of observed di-
nucleotide frequency and
– The expected frequency given random nucleotide
distribution

• Chromosomal breakage and exchange of chromosomal
fragments are common mode of gene evolution. They can be
studied by comparing genome structures at DNAlevel.
– Identification of conserved synteny and genome
 rearrangement events
– Analysis of breakpoints
– Analysis of content and distribution of DNArepeats

• Chromosomal breakage and exchange
of chromosomal fragments cause
disruption of gene order
• Therefore gene order correlates with
evolutionary distance between genomes

Identification of gene-coding regions
comparison of gene content
comparison of protein content
Comparative genome based function prediction

• Noncoding regions of the genome gained a lot
of attention in recent years because of its
predicted role in regulation of transcription,
DNA replication, and other biological
functions
• This approach is based on the presumption that
selective pressure causes regulatory elements
to evolve at a slower rate than that of non
regulatory sequences in the non coding
regions

• Comparative genomic studies throw important light
on the pathogenesis of organisms, throwing up
opportunities for therapeutic intervention as well as
help in understanding and identifying disease genes
• One of the most important fallouts of comparative
analyses at a genome-wide scale is in the ability to
identify and develop novel drug targets
• If one is looking for antibacterial, antifungal, or
antiprotozoal proteins to be used as targets,
comparative genome analysis can reveal virulence
genes, uncharacterized essential genes, species-
specific genes, organism-specific genes, while ensuring
that the chosen genes have no homologues in humans

• It is largely experiment based with a focus on
gene functions at the whole genome level
using high throughput approaches.
• The high throughput analysis of all expressed
genes is also termed transcriptome
analysis
• Transcriptome analysis can be conducted by
two approaches:
1) sequence based approaches
2) microarray based approaches

• Expressed sequence tags :- ESTs are short
sequences of cDNA typically 200-400 nucleotides in
length.
• Obtained from either 5’ end or 3’ end of cDNA inserts of cDNA
library.
 ADVANTAGESOF E.S.T :-
• Provide a rough estimate of genes that are actively
expressed in a genome under a particular physiological
condition.
• Help in discovering new genes, due to random
sequencing of cDNA clones.
• EST libraries can be easily generated

 DRAWBACKSOF USING E.S.Ts:-
• Automatically generated without verification thus
contain high error rates.
• There is often contamination by vector
sequence , introns, ribosomal RNA,
mitochondrial RNA.
• Weakly expressed genes are hardly found in EST
sequencing survey.
• ESTs represent only partial sequences of genes.

 Major EST index sequence databases
are:
• Unigene – is an NCBI EST cluster
database.
• TIGR Gene indices

• Serial analysis of gene expression
• Another high throughput, Sequence-based
approach for gene expression profile analysis.
• More quantitative in determining mRNA
expression in cell.
• Short fragments (taqs) of DNA, excised from
cDNA sequences , act as unique markers of gene
transcript
• SAGE invented at Johns Hopkins University in
USA (Oncology Center) by Dr. Victor in 1995.

• A short sequence tag (10-14bp) contains
sufficient information to uniquely identify a
transcript.
• Sequence tag can be linked together to
form long serial molecules that can be
cloned and sequenced.
• the number of times a particular tag is
observed provides the expression level of
the corresponding transcript

Trapping of RNA with beads
cDNA synthesis
Enzymatic cleavage of cDNA
Ligation of Linkers to bound cDNA
Isolation and concatamerization of ditags
PCR amplification of Ditags
Formation of Ditags
Cleaving with tagging enzyme

• Software tools for SAGEanalysis:
• SAGE map
• SAGE xprofiler
• SAGE Genie
• Advantages over EST analysis:
a. Detect weakly expressed genes
b. It uses a short nucleotide tag and allows
sequencing of multiple tags in a single clone

 A microarray is a pattern of ssDNA probes which are
immobilized on a surface called a chip or a slide.
• Microarrays use hybridization to detect a specific DNA
or RNA in a sample.
• DNA microarray uses a million different probes, fixed
on a solid surface.
• Microarray technology evolved from Southern
blotting.

• To analyze the expression of thousands of
genes in single reaction, very quickly and
in an efficient manner.
• To understand the genetic causes for
the abnormal functioning of the
human body.
• To understand which genes are active
and which genes are inactive in
different cell types.

• Length of oligonucleotides is in range of 25-70 bases
long
• Oligonucleotides are called probes
• Oligonucleotide should not form stable internal
secondary structure.
• All probes should have approx. equal Tm.
• OligoWiz & OligoArray are 2 programs used in
designing probe for microarray spotting.

Sample
preparation
and
labeling
Hybridisation Washing
Image processing
and
Data analysis

• Software programs to perform microarray
image analysis:
• ArrayDB
• TIGR Spotfinder

Comparative and functional genomics

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Ähnlich wie Comparative and functional genomics

Ähnlich wie Comparative and functional genomics (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Comparative and functional genomics

Hinweis der Redaktion