SlideShare ist ein Scribd-Unternehmen logo
1 von 25
1
Introduction
• Comparative genomics is a large-scale, holistic approach that
compares two or more genomes to discover the similarities and
differences between the genomes and to study the biology of the
individual genomes
• The subject of comparative genomics impinges on
– Evolutionary biology and phylogenetic reconstructions of the tree of life,
– Drug discovery programs,
– Function predictions of hypothetical proteins
– Identification of genes, regulatory motifs and other non-coding DNA
motifs
– Genome flux and dynamics
2
Computational tool for genome-scale
sequence alignment
• The first step in comparative genomics analysis is often the
alignment of two genome sequences
• It is a technically challenging problem
Algorithms/tools URL
BLASTN and
MEGABLAST
http://www.ncbi.nlm.nih.gov/BLAST/
GLASS http://crossspecies.lcs.mit.edu/
MUMmer http://www.tigr.org/software/mummer/
PatternHunter http://www.bioinformaticssolutions.com/
products/ph.php
PipMaker http://bio.cse.psu.edu/pipmaker/
VISTA http://www-gsd.lbl.gov/vista
WABA http://www.cse.ucsc.edu/kent/xenoAli/
3
L. Wei et. al. 2002
Related terms..
• Homology
• Homologous
• Orthologous
• Paralogous
• Xenologous
• Analogoues
• Horizontal gene transfer
4
• Homology is the relationship of any two characters (such as two proteins that
have similar sequences) that have descended, usually through divergence, from
a common ancestral character
• Homologues are thus components or characters (such as genes/proteins with
similar sequences) that can be attributed to a common ancestor of the two
organisms during evolution. Homologues can either be orthologues,
paralogues, or xenologues
• Orthologues are homologues that have evolved from a common ancestral gene
by speciation. They usually have similar functions
• Paralogues are homologues that are related or produced by duplication within a
genome. They often have evolved to perform different functions
• Xenologues are homologues that are related by an interspecies (horizontal
transfer) of the genetic material for one of the homologues
• Horizontal (Lateral) Gene Transfer is the movement of genetic material
between species (or genus) other than by vertical descent.
5
6
7
Methods for comparative genomics
• Comparative analysis of genome structure
• Comparative analysis of coding regions
• Comparative analysis of non-coding regions
8
Comparative analysis of genome structure
• Analysis of the global structure of genomes, such as nucleotide
composition, syntenic relationships, and gene ordering offer insight
into the similarities and differences between genomes.
• This provide information on the organization and evolution of the
genomes, and highlight the unique features of individual genomes
• The structure of different genomes can be compared at three levels:
– Overall nucleotide statistics,
– Genome structure at DNA level, and
– Genome structure at gene level.
9
Comparison of overall nucleotide statistics
• Overall nucleotide statistics, such as
– Genome size,
– Overall (G+C) content,
– Regions of different (G+C) content,
– Genome signature such as codon usage biases,
– Amino acid usage biases, and the ratio of observed di-
nucleotide frequency and
– The expected frequency given random nucleotide
distribution
• These all present a global view of the similarities and
differences of the genomes.
10
Comparison of genome structure at DNA level
• Chromosomal breakage and exchange of chromosomal
fragments are common mode of gene evolution. They can be
studied by comparing genome structures at DNA level.
– Identification of conserved synteny and genome
rearrangement events
– Analysis of breakpoints
– Analysis of content and distribution of DNA repeats
11
Comparison of genome structure at gene level
• Chromosomal breakage and exchange of chromosomal
fragments cause disruption of gene order
• Therefore gene order correlates with evolutionary distance
between genomes
12
Comparative analysis of coding regions
13
Number of algorithms that have been use in comparative genomics to aid function
prediction of genes.
Identification of gene-coding regions
comparison of gene content
comparison of protein content
Comparative genome based function prediction
Identification of gene-coding regions
• The analysis and comparison of the coding regions starts with the gene
identification algorithm that is used to infer what portions of the genomic
sequence actively code for genes.
• There are four basic approaches for gene identification
L. Wei et. al. 2002
14
Comparison of gene content
• After the predicted gene set is generated, it is very interesting
and important to compare the content of genes across genomes
• The first statistics to compare is the estimated total number of
genes in a genome, elucidate the similarities and differences
between the genomes include percentage of the genome that
code for genes, distribution of coding regions across the
genome (a.k.a. gene density), average gene length, codon
usage
• This is often done using a pairwise sequence comparison tool
such as BLASTN or TBLASTX
15
Comparison of protein content
• A second level of analysis that can be performed is to compare
the set of gene products (protein) between the genomes, which
has been termed ‘‘comparative proteomics”
• It is important to compare the protein contents in critical
pathways and important functional categories across genomes
• Two widely used resources for pathways and functional
categories are the KEGG pathway database and the Gene
Ontology (GO) hierarchy
L. Wei et. al. 2002
16
cntd…
• Interesting statistics to compare include
– Level of sequence identity between orthologous pairs
across genome
– Paralogous pairs within genome,
– Number of replicated copies in corresponding paralog
families
– Functions of the paralogs
– Locations of members of paralog families across the
genome
17
Comparative genomics-based function
prediction
• functional assignment of genes in a non similarity-based
manner
• This rely on the basic premise that genes; that are functionally
related, are genes that are closely associated across genomes in
some form
• This include three methods:
– Co-conservation across genomes
– Conservation of gene clusters and genomic context across species
– Physical fusion of functionally linked genes across species (Domain
fusion analysis)
18
Comparative analysis of noncoding regions
• Noncoding regions of the genome gained a lot of attention in
recent years because of its predicted role in regulation of
transcription, DNA replication, and other biological functions
• This approach is based on the presumption that selective
pressure causes regulatory elements to evolve at a slower rate
than that of non regulatory sequences in the non coding
regions
19
Insights into Genome Fluxes and the Processes of Evolution
• From an evolutionary biology perspective, whole genome
comparisons provide molecular insights into the processes of
evolution that include the molecular events responsible for the
variations and fluxes that occur through a genome. These
include processes like, inversions, translocations, deletions,
duplications and insertions.
20
The Impact of Comparative Genomics in Phylogenetic Analysis
Schematic depiction of Microsporidia's phylogenetic position based on Small Subunit RNA
(SSU rRNA) as an early branching eukaryote that evolved prior to the acquisiton of
mitochondria, and it's subsequent placement based on a composite gene phylogeny where it
was placed closer to fungi. The latter placement has been confirmed by the complete
sequenceof the micro-sporidia, Encephalitozoon cuniculi, where despite the absence of
mitochondria, the presence of several mitochondrial genes could be observed.
21
Comparative Genomics in Drug Discovery
• Comparative genomic studies throw important light on the
pathogenesis of organisms, throwing up opportunities for
therapeutic intervention as well as help in understanding and
identifying disease genes
• One of the most important fallouts of comparative analyses at a
genome-wide scale is in the ability to identify and develop novel
drug targets
• If one is looking for antibacterial, antifungal, or antiprotozoal
proteins to be used as targets, comparative genome analysis can
reveal virulence genes, uncharacterized essential genes, species-
specific genes, organism-specific genes, while ensuring that the
chosen genes have no homologues in humans
22
Comparative genomics in drug discovery programs. A flow chart
diagram explaining how comparative genomics can facilitate drug
discovery programs for the discovery of new antimicrobials
23
Looking beyond…
• As comparative genomics moves from between kingdoms to
between genus to between species analysis, the next step is to
carry out comparisions between individuals or strains that are
members of a particular species
• This would allow us to investigate variations at the individual
level and to enable one to determine the propensity of an
individual to respond to a drug or to come down with a disease
or infection
24
25

Weitere ähnliche Inhalte

Was ist angesagt?

Functional genomics, and tools
Functional genomics, and toolsFunctional genomics, and tools
Functional genomics, and toolsKAUSHAL SAHU
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENTMariya Raju
 
Whole genome shotgun sequencing
Whole genome shotgun sequencingWhole genome shotgun sequencing
Whole genome shotgun sequencingGoutham Sarovar
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijayVijay Hemmadi
 
shotgun sequncing
 shotgun sequncing shotgun sequncing
shotgun sequncingSAIFALI444
 
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCINGDNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCINGPuneet Kulyana
 
Complementary DNA (cDNA) Libraries
Complementary DNA 	(cDNA) LibrariesComplementary DNA 	(cDNA) Libraries
Complementary DNA (cDNA) LibrariesRamesh Pothuraju
 
Binary Vector, By KK Sahu sir
Binary Vector, By KK Sahu sirBinary Vector, By KK Sahu sir
Binary Vector, By KK Sahu sirKAUSHAL SAHU
 
cDNA Library Construction
cDNA Library ConstructioncDNA Library Construction
cDNA Library ConstructionStella Evelyn
 
Expressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular markerExpressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular markerKAUSHAL SAHU
 
History and scope in bioinformatics
History and scope in bioinformaticsHistory and scope in bioinformatics
History and scope in bioinformaticsKAUSHAL SAHU
 
PHYSICAL MAPPING STRATEGIES IN GENOMICS
PHYSICAL MAPPING STRATEGIES IN GENOMICSPHYSICAL MAPPING STRATEGIES IN GENOMICS
PHYSICAL MAPPING STRATEGIES IN GENOMICSUsman Arshad
 
bacterial artificial chromosome & yeast artificial chromosome
bacterial artificial chromosome & yeast artificial chromosomebacterial artificial chromosome & yeast artificial chromosome
bacterial artificial chromosome & yeast artificial chromosomeashapatel676
 

Was ist angesagt? (20)

Express sequence tags
Express sequence tagsExpress sequence tags
Express sequence tags
 
Functional genomics, and tools
Functional genomics, and toolsFunctional genomics, and tools
Functional genomics, and tools
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
 
Whole genome shotgun sequencing
Whole genome shotgun sequencingWhole genome shotgun sequencing
Whole genome shotgun sequencing
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijay
 
Exprssion vector
Exprssion vectorExprssion vector
Exprssion vector
 
shotgun sequncing
 shotgun sequncing shotgun sequncing
shotgun sequncing
 
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCINGDNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
 
Complementary DNA (cDNA) Libraries
Complementary DNA 	(cDNA) LibrariesComplementary DNA 	(cDNA) Libraries
Complementary DNA (cDNA) Libraries
 
Artificial Vectors
Artificial VectorsArtificial Vectors
Artificial Vectors
 
Binary Vector, By KK Sahu sir
Binary Vector, By KK Sahu sirBinary Vector, By KK Sahu sir
Binary Vector, By KK Sahu sir
 
Site directed mutagenesis by pcr
Site directed mutagenesis by pcrSite directed mutagenesis by pcr
Site directed mutagenesis by pcr
 
cDNA Library Construction
cDNA Library ConstructioncDNA Library Construction
cDNA Library Construction
 
Expressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular markerExpressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular marker
 
History and scope in bioinformatics
History and scope in bioinformaticsHistory and scope in bioinformatics
History and scope in bioinformatics
 
Gene prediction method
Gene prediction method Gene prediction method
Gene prediction method
 
Physical mapping
Physical mappingPhysical mapping
Physical mapping
 
PHYSICAL MAPPING STRATEGIES IN GENOMICS
PHYSICAL MAPPING STRATEGIES IN GENOMICSPHYSICAL MAPPING STRATEGIES IN GENOMICS
PHYSICAL MAPPING STRATEGIES IN GENOMICS
 
bacterial artificial chromosome & yeast artificial chromosome
bacterial artificial chromosome & yeast artificial chromosomebacterial artificial chromosome & yeast artificial chromosome
bacterial artificial chromosome & yeast artificial chromosome
 

Andere mochten auch

Andere mochten auch (20)

Comparative genomics presentation
Comparative genomics presentationComparative genomics presentation
Comparative genomics presentation
 
What is comparative genomics
What is comparative genomicsWhat is comparative genomics
What is comparative genomics
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomics
 
genomic comparison
genomic comparison genomic comparison
genomic comparison
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Genomics
GenomicsGenomics
Genomics
 
Genome evolution
Genome evolutionGenome evolution
Genome evolution
 
Comparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 formatComparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 format
 
Genome annotation
Genome annotationGenome annotation
Genome annotation
 
Comparative Genomics with GMOD and BioPerl
Comparative Genomics with GMOD and BioPerlComparative Genomics with GMOD and BioPerl
Comparative Genomics with GMOD and BioPerl
 
Genome evolution - tales of scales DNA to crops,months to billions of years, ...
Genome evolution - tales of scales DNA to crops,months to billions of years, ...Genome evolution - tales of scales DNA to crops,months to billions of years, ...
Genome evolution - tales of scales DNA to crops,months to billions of years, ...
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Gene order
Gene orderGene order
Gene order
 
Genome origin
Genome originGenome origin
Genome origin
 
Association mapping
Association mapping Association mapping
Association mapping
 
SNp mining in crops
SNp mining in cropsSNp mining in crops
SNp mining in crops
 
Bioinformatics and functional genomics
Bioinformatics and functional genomicsBioinformatics and functional genomics
Bioinformatics and functional genomics
 
Genotyping by Sequencing
Genotyping by SequencingGenotyping by Sequencing
Genotyping by Sequencing
 

Ähnlich wie Comparative genomics

Comparative and functional genomics
Comparative and functional genomicsComparative and functional genomics
Comparative and functional genomicsJalormi Parekh
 
Comparative genomics 2
Comparative genomics 2Comparative genomics 2
Comparative genomics 2GCUF
 
Bioinformatics, comparative genemics and proteomics
Bioinformatics, comparative genemics and proteomicsBioinformatics, comparative genemics and proteomics
Bioinformatics, comparative genemics and proteomicsjuancarlosrise
 
Role of genomics and proteomics
Role of genomics and proteomicsRole of genomics and proteomics
Role of genomics and proteomicsPavana K A
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformaticsmaulikchaudhary8
 
Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.sangeeta jadav
 
Comparative genomics.pdf
Comparative genomics.pdfComparative genomics.pdf
Comparative genomics.pdfshinycthomas
 
Genomics(functional genomics)
Genomics(functional genomics)Genomics(functional genomics)
Genomics(functional genomics)IndrajaDoradla
 
Molecular basis of evolution and softwares used in phylogenetic tree contruction
Molecular basis of evolution and softwares used in phylogenetic tree contructionMolecular basis of evolution and softwares used in phylogenetic tree contruction
Molecular basis of evolution and softwares used in phylogenetic tree contructionUdayBhanushali111
 
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...DevikaPatel12
 
Pathway analysis for genomics data
Pathway analysis for genomics dataPathway analysis for genomics data
Pathway analysis for genomics dataSakshiJha40
 
Genomics and proteomics by shreeman
Genomics and proteomics by shreemanGenomics and proteomics by shreeman
Genomics and proteomics by shreemanshreeman cs
 

Ähnlich wie Comparative genomics (20)

Comparative and functional genomics
Comparative and functional genomicsComparative and functional genomics
Comparative and functional genomics
 
Comparative genomics 2
Comparative genomics 2Comparative genomics 2
Comparative genomics 2
 
Bioinformatics, comparative genemics and proteomics
Bioinformatics, comparative genemics and proteomicsBioinformatics, comparative genemics and proteomics
Bioinformatics, comparative genemics and proteomics
 
Data mining ppt
Data mining pptData mining ppt
Data mining ppt
 
Role of genomics and proteomics
Role of genomics and proteomicsRole of genomics and proteomics
Role of genomics and proteomics
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Data Mining
Data Mining Data Mining
Data Mining
 
Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.
 
Comparative genomics.pdf
Comparative genomics.pdfComparative genomics.pdf
Comparative genomics.pdf
 
Genomics(functional genomics)
Genomics(functional genomics)Genomics(functional genomics)
Genomics(functional genomics)
 
Molecular basis of evolution and softwares used in phylogenetic tree contruction
Molecular basis of evolution and softwares used in phylogenetic tree contructionMolecular basis of evolution and softwares used in phylogenetic tree contruction
Molecular basis of evolution and softwares used in phylogenetic tree contruction
 
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
 
Pharmacogenomics
PharmacogenomicsPharmacogenomics
Pharmacogenomics
 
Genomics
GenomicsGenomics
Genomics
 
Genomics
GenomicsGenomics
Genomics
 
Genomics types
Genomics typesGenomics types
Genomics types
 
Pathway analysis for genomics data
Pathway analysis for genomics dataPathway analysis for genomics data
Pathway analysis for genomics data
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
genomeannotation-160822182432.pdf
genomeannotation-160822182432.pdfgenomeannotation-160822182432.pdf
genomeannotation-160822182432.pdf
 
Genomics and proteomics by shreeman
Genomics and proteomics by shreemanGenomics and proteomics by shreeman
Genomics and proteomics by shreeman
 

Kürzlich hochgeladen

General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024Janet Corral
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 

Kürzlich hochgeladen (20)

INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 

Comparative genomics

  • 1. 1
  • 2. Introduction • Comparative genomics is a large-scale, holistic approach that compares two or more genomes to discover the similarities and differences between the genomes and to study the biology of the individual genomes • The subject of comparative genomics impinges on – Evolutionary biology and phylogenetic reconstructions of the tree of life, – Drug discovery programs, – Function predictions of hypothetical proteins – Identification of genes, regulatory motifs and other non-coding DNA motifs – Genome flux and dynamics 2
  • 3. Computational tool for genome-scale sequence alignment • The first step in comparative genomics analysis is often the alignment of two genome sequences • It is a technically challenging problem Algorithms/tools URL BLASTN and MEGABLAST http://www.ncbi.nlm.nih.gov/BLAST/ GLASS http://crossspecies.lcs.mit.edu/ MUMmer http://www.tigr.org/software/mummer/ PatternHunter http://www.bioinformaticssolutions.com/ products/ph.php PipMaker http://bio.cse.psu.edu/pipmaker/ VISTA http://www-gsd.lbl.gov/vista WABA http://www.cse.ucsc.edu/kent/xenoAli/ 3 L. Wei et. al. 2002
  • 4. Related terms.. • Homology • Homologous • Orthologous • Paralogous • Xenologous • Analogoues • Horizontal gene transfer 4
  • 5. • Homology is the relationship of any two characters (such as two proteins that have similar sequences) that have descended, usually through divergence, from a common ancestral character • Homologues are thus components or characters (such as genes/proteins with similar sequences) that can be attributed to a common ancestor of the two organisms during evolution. Homologues can either be orthologues, paralogues, or xenologues • Orthologues are homologues that have evolved from a common ancestral gene by speciation. They usually have similar functions • Paralogues are homologues that are related or produced by duplication within a genome. They often have evolved to perform different functions • Xenologues are homologues that are related by an interspecies (horizontal transfer) of the genetic material for one of the homologues • Horizontal (Lateral) Gene Transfer is the movement of genetic material between species (or genus) other than by vertical descent. 5
  • 6. 6
  • 7. 7
  • 8. Methods for comparative genomics • Comparative analysis of genome structure • Comparative analysis of coding regions • Comparative analysis of non-coding regions 8
  • 9. Comparative analysis of genome structure • Analysis of the global structure of genomes, such as nucleotide composition, syntenic relationships, and gene ordering offer insight into the similarities and differences between genomes. • This provide information on the organization and evolution of the genomes, and highlight the unique features of individual genomes • The structure of different genomes can be compared at three levels: – Overall nucleotide statistics, – Genome structure at DNA level, and – Genome structure at gene level. 9
  • 10. Comparison of overall nucleotide statistics • Overall nucleotide statistics, such as – Genome size, – Overall (G+C) content, – Regions of different (G+C) content, – Genome signature such as codon usage biases, – Amino acid usage biases, and the ratio of observed di- nucleotide frequency and – The expected frequency given random nucleotide distribution • These all present a global view of the similarities and differences of the genomes. 10
  • 11. Comparison of genome structure at DNA level • Chromosomal breakage and exchange of chromosomal fragments are common mode of gene evolution. They can be studied by comparing genome structures at DNA level. – Identification of conserved synteny and genome rearrangement events – Analysis of breakpoints – Analysis of content and distribution of DNA repeats 11
  • 12. Comparison of genome structure at gene level • Chromosomal breakage and exchange of chromosomal fragments cause disruption of gene order • Therefore gene order correlates with evolutionary distance between genomes 12
  • 13. Comparative analysis of coding regions 13 Number of algorithms that have been use in comparative genomics to aid function prediction of genes. Identification of gene-coding regions comparison of gene content comparison of protein content Comparative genome based function prediction
  • 14. Identification of gene-coding regions • The analysis and comparison of the coding regions starts with the gene identification algorithm that is used to infer what portions of the genomic sequence actively code for genes. • There are four basic approaches for gene identification L. Wei et. al. 2002 14
  • 15. Comparison of gene content • After the predicted gene set is generated, it is very interesting and important to compare the content of genes across genomes • The first statistics to compare is the estimated total number of genes in a genome, elucidate the similarities and differences between the genomes include percentage of the genome that code for genes, distribution of coding regions across the genome (a.k.a. gene density), average gene length, codon usage • This is often done using a pairwise sequence comparison tool such as BLASTN or TBLASTX 15
  • 16. Comparison of protein content • A second level of analysis that can be performed is to compare the set of gene products (protein) between the genomes, which has been termed ‘‘comparative proteomics” • It is important to compare the protein contents in critical pathways and important functional categories across genomes • Two widely used resources for pathways and functional categories are the KEGG pathway database and the Gene Ontology (GO) hierarchy L. Wei et. al. 2002 16
  • 17. cntd… • Interesting statistics to compare include – Level of sequence identity between orthologous pairs across genome – Paralogous pairs within genome, – Number of replicated copies in corresponding paralog families – Functions of the paralogs – Locations of members of paralog families across the genome 17
  • 18. Comparative genomics-based function prediction • functional assignment of genes in a non similarity-based manner • This rely on the basic premise that genes; that are functionally related, are genes that are closely associated across genomes in some form • This include three methods: – Co-conservation across genomes – Conservation of gene clusters and genomic context across species – Physical fusion of functionally linked genes across species (Domain fusion analysis) 18
  • 19. Comparative analysis of noncoding regions • Noncoding regions of the genome gained a lot of attention in recent years because of its predicted role in regulation of transcription, DNA replication, and other biological functions • This approach is based on the presumption that selective pressure causes regulatory elements to evolve at a slower rate than that of non regulatory sequences in the non coding regions 19
  • 20. Insights into Genome Fluxes and the Processes of Evolution • From an evolutionary biology perspective, whole genome comparisons provide molecular insights into the processes of evolution that include the molecular events responsible for the variations and fluxes that occur through a genome. These include processes like, inversions, translocations, deletions, duplications and insertions. 20
  • 21. The Impact of Comparative Genomics in Phylogenetic Analysis Schematic depiction of Microsporidia's phylogenetic position based on Small Subunit RNA (SSU rRNA) as an early branching eukaryote that evolved prior to the acquisiton of mitochondria, and it's subsequent placement based on a composite gene phylogeny where it was placed closer to fungi. The latter placement has been confirmed by the complete sequenceof the micro-sporidia, Encephalitozoon cuniculi, where despite the absence of mitochondria, the presence of several mitochondrial genes could be observed. 21
  • 22. Comparative Genomics in Drug Discovery • Comparative genomic studies throw important light on the pathogenesis of organisms, throwing up opportunities for therapeutic intervention as well as help in understanding and identifying disease genes • One of the most important fallouts of comparative analyses at a genome-wide scale is in the ability to identify and develop novel drug targets • If one is looking for antibacterial, antifungal, or antiprotozoal proteins to be used as targets, comparative genome analysis can reveal virulence genes, uncharacterized essential genes, species- specific genes, organism-specific genes, while ensuring that the chosen genes have no homologues in humans 22
  • 23. Comparative genomics in drug discovery programs. A flow chart diagram explaining how comparative genomics can facilitate drug discovery programs for the discovery of new antimicrobials 23
  • 24. Looking beyond… • As comparative genomics moves from between kingdoms to between genus to between species analysis, the next step is to carry out comparisions between individuals or strains that are members of a particular species • This would allow us to investigate variations at the individual level and to enable one to determine the propensity of an individual to respond to a drug or to come down with a disease or infection 24
  • 25. 25