SlideShare a Scribd company logo
www.   .uni-rostock.de




        Bioinformatics
    Introduction to genomics and proteomics II


             ulf.schmitz@informatik.uni-rostock.de

Bioinformatics and Systems Biology Group
               www.sbi.informatik.uni-rostock.de




    Ulf Schmitz, Introduction to genomics and proteomics II                  1
www.   .uni-rostock.de
Outline

1. Proteomics
   •   Motivation
   •   Post -Translational Modifications
   •   Key technologies
   •   Data explosion
2. Maps of hereditary information
3. Single nucleotide polymorphisms




              Ulf Schmitz, Introduction to genomics and proteomics II                     2
www.   .uni-rostock.de
Protomics

   Proteomics:
                 • is the large-scale study of proteins, particularly their structures
                   and functions
                 • This term was coined to make an analogy with genomics, and
                   is often viewed as the "next step",
                 • but proteomics is much more complicated than genomics.
                 • Most importantly, while the genome is a rather constant entity,
                   the proteome is constantly changing through its biochemical
                   interactions with the genome.
                 • One organism will have radically different protein expression in
                   different parts of its body and in different stages of its life cycle.

   Proteome:
                 The entirety of proteins in existence in an organism are
                 referred to as the proteome.




                   Ulf Schmitz, Introduction to genomics and proteomics II                     3
www.   .uni-rostock.de
Proteomics
If the genome is a list of the instruments in an orchestra, the
proteome is the orchestra playing a symphony.
R.Simpson




                Ulf Schmitz, Introduction to genomics and proteomics II                     4
www.   .uni-rostock.de
Proteomics
•   Describing all 3D structures of proteins in the cell is called Structural
    Genomics
•   Finding out what these proteins do is called Functional Genomics


    DNA Microarray                         GENOME                                 Genetic Screens




                                          PROTEOME


     Protein – Ligand                                                       Protein – Protein
        Interactions                                                           Interactions



                                            Structure


                        Ulf Schmitz, Introduction to genomics and proteomics II                     5
www.   .uni-rostock.de
Proteomics


Motivation:
 • What kind of data would we like to measure?

 • What mature experimental techniques exist to
   determine them?

 • The basic goal is a spatio-temporal description of
   the deployment of proteins in the organism.



              Ulf Schmitz, Introduction to genomics and proteomics II                     6
www.   .uni-rostock.de
Proteomics
Things to consider:
• the rates of synthesis of different proteins vary among
  different tissues and different cell types and states of activity

• methods are available for efficient analysis of transcription
  patterns of multiple genes

• because proteins ‘turn over’ at different rates, it is also
  necessary to measure proteins directly

• the distribution of expressed protein levels is a kinetic
  balance between rates of protein synthesis and degradation




                  Ulf Schmitz, Introduction to genomics and proteomics II                     7
www.   .uni-rostock.de




Ulf Schmitz, Introduction to genomics and proteomics II                     8
www.   .uni-rostock.de
Why do Proteomics?
•   are there differences between amino acid sequences determined
    directly from proteins and those determined by translation from
    DNA?
     – pattern recognition programs addressing this questions have following
       errors:
         •   a genuine protein sequence may be missed entirely
         •   an incomplete protein may be reported
         •   a gene may be incorrectly spliced
         •   genes for different proteins may overlap
         •   genes may be assembled from exons in different ways in different tissues
     – often, molecules must be modified to make a mature protein that differs
       significantly from the one suggested by translation
         • in many cases the missing post-translational- modifications are quite
           important and have functional significance
         • post-transitional modifications include addition of ligands, glycosylation,
           methylation, excision of peptides, etc.
     – in some cases mRNA is edited before translation, creating changes in
       the amino acid sequence that are not inferrable from the genes
•   a protein inferred from a genome sequence is a hypothetical object
    until an experiment verifies its existence


                       Ulf Schmitz, Introduction to genomics and proteomics II                     9
www.   .uni-rostock.de
Post-translational modification
•   a protein is a polypeptide chain composed of 20 possible amino acids

•   there are far fewer genes that code for proteins in the human genome than there
    are proteins in the human proteome (~33,000 genes vs ~200,000 proteins).

•   each gene encodes as many as six to eight different proteins
     – due to post-translational modifications such as phosphorylation, glycosylation or cleavage
       (Spaltung)

•   posttranslational modification extends the range of possible functions a protein can
    have
     – changes may alter the hydrophobicity of a protein and thus determine if the modified
       protein is cytosolic or membrane-bound
     – modifications like phosphorylation are part of common mechanisms for controlling the
       behavior of a protein, for instance, activating or inactivating an enzyme.




                        Ulf Schmitz, Introduction to genomics and proteomics II                    10
www.   .uni-rostock.de
Post-translational modification
 Phosphorylation
 •   phosphorylation is the addition of a phosphate (PO4) group to a protein
     or a small molecule (usual to serine, tyrosine, threonine or histidine)
 •   In eukaryotes, protein phosphorylation is probably the most important
     regulatory event
 •   Many enzymes and receptors are switched "on" or "off" by
     phosphorylation and dephosphorylation
 •   Phosphorylation is catalyzed by various specific protein kinases,
     whereas phosphatases dephosphorylate.
 Acetylation
 •   Is the addition of an acetyl group, usually at the N-terminus of the protein
 Farnesylation
 •   farnesylation, the addition of a farnesyl group
 Glycosylation
 •   the addition of a glycosyl group to either asparagine, hydroxylysine,
     serine, or threonine, resulting in a glycoprotein
                        Ulf Schmitz, Introduction to genomics and proteomics II                    11
www.   .uni-rostock.de
Proteomics




             Ulf Schmitz, Introduction to genomics and proteomics II                    12
www.   .uni-rostock.de
Key technologies for proteomics

 1. 1-D electrophoresis and 2-D electrophoresis
    •   are for the separation and visualization of proteins.
 2. mass spectrometry, x-ray crystallography, and NMR
    (Nuclear magnetic resonance )
    •   are used to identify and characterize proteins
 3. chromatography techniques especially affinity
    chromatography
    •   are used to characterize protein-protein interactions.
 4. Protein expression systems like the yeast two-
    hybrid and FRET (fluorescence resonance energy
    transfer)
    •   can also be used to characterize protein-protein interactions.



                  Ulf Schmitz, Introduction to genomics and proteomics II                    13
www.   .uni-rostock.de
  Key technologies for proteomics

        High-resolution two-dimensional polyacrylamide gel
        electrophoresis (2D PAGE) shows the pattern of
        protein content in a sample.




Reference map of lympphoblastoid
cell linePRI, soluble proteins.
• 110 µg of proteins loaded
• Strip 17cm pH gradient 4-7, SDS
  PAGE gels 20 x 25 cm, 8-18.5% T.
• Staining by silver nitrate method
  (Rabilloud et al.,)
• Identification by mass spectrometry.
  The pinks labels on the spots indicate
  the ID in Swiss-prot database


                                            browse the SWISS-2DPAGE database for more 2d PAGE images

                               Ulf Schmitz, Introduction to genomics and proteomics II                    14
www.   .uni-rostock.de
Proteomics


X-ray crystallography is a means to
determine the detailed molecular
structure of a protein, nucleic acid or
small molecule.


With a crystal structure we can explain the
mechanism of an enzyme, the binding of an
inhibitor, the packing of protein domains, the
tertiary structure of a nucleic acid molecule
etc..



Typically, a sample is purified to
homogeneity, crystallized, subjected to an X-
ray beam and diffraction data are collected.



                          Ulf Schmitz, Introduction to genomics and proteomics II                    15
www.   .uni-rostock.de
High-throughput Biological Data

 • Enormous amounts of biological data are being
   generated by high-throughput capabilities; even
   more are coming
    –   genomic sequences
    –   gene expression data (microarrays)
    –   mass spec. data
    –   protein-protein interaction (chromatography)
    –   protein structures (x-ray christallography)
    –   ......




                 Ulf Schmitz, Introduction to genomics and proteomics II                    16
www.   .uni-rostock.de
Protein structural data explosion
Protein Data Bank (PDB): 33.367 Structures (1 November 2005)
28.522 x-ray crystallography, 4.845 NMR




                    Ulf Schmitz, Introduction to genomics and proteomics II                    17
www.   .uni-rostock.de
Maps of hereditary information

 Following maps are used to find out how hereditary information is
 stored, passed on, and implemented.


 1.    Linkage maps of
          genes
          mini- / microsatellites
 2.    Banding patterns of chromosomes
          physical objects with visible landmarks called banding patterns
 3.    DNA sequences
          Contig maps (contigous clone maps)
          Sequence tagged site (STS)
          SNPs (Single nucloetide polymorphisms)




                    Ulf Schmitz, Introduction to genomics and proteomics II                    18
www.   .uni-rostock.de




Linkage map
  Ulf Schmitz, Introduction to genomics and proteomics II                    19
www.   .uni-rostock.de
Maps of hereditary information
Variable number tandem repeats (VNTRs, also minisatellites)
• regions, 8-80bp long, repeated a variable number of times
• the distribution and the size of repeats is the marker
• inheritance of VNTRs can be followed in a family and
  mapped to a pathological phenotype
• first genetic data used for personal identification
    – Genetic fingerprints; in paternity and in criminal cases


Short tandem repeat polymorphism (STRPs, also microsatellites)
  • Regions of 2-7bp, repeated many times
     – Usually 10-30 consecutive copies



                   Ulf Schmitz, Introduction to genomics and proteomics II                    20
www.   .uni-rostock.de



               centromere




3bp

CGTCGTCGTCGTCGTCGTCGTCGT...
GCAGCAGCAGCAGCAGCAGCAGCA...


                Ulf Schmitz, Introduction to genomics and proteomics II                    21
www.   .uni-rostock.de
Maps of hereditary information



Banding patterns of
chromosomes




              Ulf Schmitz, Introduction to genomics and proteomics II                    22
www.   .uni-rostock.de
Maps of hereditary information

   Banding patterns of chromosomes

          petite – arm
           centromere
          queue - arm




             Ulf Schmitz, Introduction to genomics and proteomics II                    23
www.   .uni-rostock.de
Maps of hereditary information

    Contig map (also contiguous clone map)
•    Series of overlapping DNA clones of known
     order along a chromosome from an organism
     of interest, stored in yeast or bacterial cells as
     YACs (Yeast Artificial Chromosomes) or
     BACs (Bacterial Artificial Chromosomes)
•    A contig map produces a fine mapping (high
     resolution) of a genome
•    YAC can contain up to 106bp, a BAC about
     250.000bp



     Sequence tagged site (STS)
•     Short, sequenced region of DNA, 200-600bp
      long, that appears in a unique location in the
      genome
•     One type arises from an EST (expressed
      sequence tag), a piece of cDNA




                            Ulf Schmitz, Introduction to genomics and proteomics II                    24
www.   .uni-rostock.de
Maps of hereditary information
Imagine we know that a disease results from a specific
defective protein:
  1. if we know the protein involved, we can pursue
     rational approaches to therapy
  2. if we know the gene involved, we can devise
     tests to identify sufferers or carriers
  3. wereas the knowledge of the chromosomal
     location of the gene is unnecessary in many
     cases for either therapy or detection;
    • it is required only for identifying the gene, providing a
      bridge between the patterns of inheritance and the
      DNA sequence

               Ulf Schmitz, Introduction to genomics and proteomics II                    25
www.   .uni-rostock.de
Single nucleotide polymorphisms (SNPs)


•   SNP (pronounced ‘snip’) is a genetic
    variation between individuals
•   single base pairs that can be substituted,
    deleted or inserted
•   SNPs are distributed throughout the
    genome
     – average every 2000bp
•   provide markers for mapping genes
•   not all SNPs are linked to diseases




                    Ulf Schmitz, Introduction to genomics and proteomics II                    26
www.   .uni-rostock.de
Single nucleotide polymorphisms (SNPs)


 • nonsense mutations:
    – codes for a stop, which can truncate the
      protein
 • missense mutations:
    – codes for a different amino acid
 • silent mutations:
    – codes for the same amino acid, so has no
      effect



             Ulf Schmitz, Introduction to genomics and proteomics II                    27
www.   .uni-rostock.de
Outlook – coming lecture


• Bioinformatics Information Resources And Networks
   – EMBnet – European Molecular Biology Network
        • DBs and Tools
   – NCBI – National Center For Biotechnology Information
        • DBs and Tools

   –   Nucleic Acid Sequence Databases
   –   Protein Information Resources
   –   Metabolic Databases
   –   Mapping Databases
   –   Databases concerning Mutations
   –   Literature Databases




                  Ulf Schmitz, Introduction to genomics and proteomics II                    28
www.   .uni-rostock.de




Thanks for your
  attention!


 Ulf Schmitz, Introduction to genomics and proteomics II                    29

More Related Content

What's hot

Personalized Medicine and the Omics Revolution by Professor Mike Snyder
Personalized Medicine and the Omics Revolution by Professor Mike SnyderPersonalized Medicine and the Omics Revolution by Professor Mike Snyder
Personalized Medicine and the Omics Revolution by Professor Mike Snyder
The Hive
 
Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics
Senthil Natesan
 

What's hot (20)

Plant system biology
Plant system biologyPlant system biology
Plant system biology
 
Vijay protiomics current
Vijay protiomics currentVijay protiomics current
Vijay protiomics current
 
Personalized Medicine and the Omics Revolution by Professor Mike Snyder
Personalized Medicine and the Omics Revolution by Professor Mike SnyderPersonalized Medicine and the Omics Revolution by Professor Mike Snyder
Personalized Medicine and the Omics Revolution by Professor Mike Snyder
 
Transcriptomics and metabolomics
Transcriptomics and metabolomicsTranscriptomics and metabolomics
Transcriptomics and metabolomics
 
Techniques in proteomics
Techniques in proteomicsTechniques in proteomics
Techniques in proteomics
 
Proteomics ppt
Proteomics pptProteomics ppt
Proteomics ppt
 
Genomics, Transcriptomics, Proteomics, Metabolomics - Basic concepts for clin...
Genomics, Transcriptomics, Proteomics, Metabolomics - Basic concepts for clin...Genomics, Transcriptomics, Proteomics, Metabolomics - Basic concepts for clin...
Genomics, Transcriptomics, Proteomics, Metabolomics - Basic concepts for clin...
 
genomic comparison
genomic comparison genomic comparison
genomic comparison
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Genomics and proteomics ppt
Genomics and proteomics pptGenomics and proteomics ppt
Genomics and proteomics ppt
 
Functional proteomics, methods and tools
Functional proteomics, methods and toolsFunctional proteomics, methods and tools
Functional proteomics, methods and tools
 
Regulation of Gene Expression
Regulation of Gene Expression Regulation of Gene Expression
Regulation of Gene Expression
 
Genomics,proteomics and comparative genomics
Genomics,proteomics and comparative genomicsGenomics,proteomics and comparative genomics
Genomics,proteomics and comparative genomics
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Genomics
GenomicsGenomics
Genomics
 
Types of genomics ppt
Types of genomics pptTypes of genomics ppt
Types of genomics ppt
 
PROTEIN MICROARRAYS
PROTEIN MICROARRAYSPROTEIN MICROARRAYS
PROTEIN MICROARRAYS
 
Genome analysis
Genome analysisGenome analysis
Genome analysis
 
Proteomics
ProteomicsProteomics
Proteomics
 
Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics
 

Viewers also liked

Proteomics analysis: Basics and Applications
Proteomics analysis: Basics and ApplicationsProteomics analysis: Basics and Applications
Proteomics analysis: Basics and Applications
COST action BM1006
 
Archakov A. from human genome project to human proteome project
Archakov A. from human genome project to human proteome projectArchakov A. from human genome project to human proteome project
Archakov A. from human genome project to human proteome project
igorod
 
Data Integration, Mass Spectrometry Proteomics Software Development
Data Integration, Mass Spectrometry Proteomics Software DevelopmentData Integration, Mass Spectrometry Proteomics Software Development
Data Integration, Mass Spectrometry Proteomics Software Development
Neil Swainston
 
Protein Microarrays: Approaches to Printing
Protein Microarrays: Approaches to PrintingProtein Microarrays: Approaches to Printing
Protein Microarrays: Approaches to Printing
SCHOTT
 
Proteomics course 1
Proteomics course 1Proteomics course 1
Proteomics course 1
utpaltatu
 

Viewers also liked (20)

Proteomics
Proteomics Proteomics
Proteomics
 
Proteomics analysis: Basics and Applications
Proteomics analysis: Basics and ApplicationsProteomics analysis: Basics and Applications
Proteomics analysis: Basics and Applications
 
Archakov A. from human genome project to human proteome project
Archakov A. from human genome project to human proteome projectArchakov A. from human genome project to human proteome project
Archakov A. from human genome project to human proteome project
 
Golgi apparatus
Golgi apparatusGolgi apparatus
Golgi apparatus
 
Proteomics repositories
Proteomics repositoriesProteomics repositories
Proteomics repositories
 
Clinical proteomics in diseases lecture, 2014
Clinical proteomics in diseases lecture, 2014Clinical proteomics in diseases lecture, 2014
Clinical proteomics in diseases lecture, 2014
 
Genomics and Proteomics - Impact on Drug Discovery
Genomics and Proteomics - Impact on Drug DiscoveryGenomics and Proteomics - Impact on Drug Discovery
Genomics and Proteomics - Impact on Drug Discovery
 
Data Integration, Mass Spectrometry Proteomics Software Development
Data Integration, Mass Spectrometry Proteomics Software DevelopmentData Integration, Mass Spectrometry Proteomics Software Development
Data Integration, Mass Spectrometry Proteomics Software Development
 
Proteomics
ProteomicsProteomics
Proteomics
 
Advanced bioinformatics of proteomics datasets
Advanced bioinformaticsof proteomics datasetsAdvanced bioinformaticsof proteomics datasets
Advanced bioinformatics of proteomics datasets
 
OpenMS: Quantitative proteomics at large scale
OpenMS: Quantitative proteomics at large scaleOpenMS: Quantitative proteomics at large scale
OpenMS: Quantitative proteomics at large scale
 
Techniques used for separation in proteomics
Techniques used for separation in proteomicsTechniques used for separation in proteomics
Techniques used for separation in proteomics
 
Genomics
GenomicsGenomics
Genomics
 
Golgi apparatus ppt
Golgi apparatus pptGolgi apparatus ppt
Golgi apparatus ppt
 
Protein Microarrays: Approaches to Printing
Protein Microarrays: Approaches to PrintingProtein Microarrays: Approaches to Printing
Protein Microarrays: Approaches to Printing
 
Proteomics course 1
Proteomics course 1Proteomics course 1
Proteomics course 1
 
Proteins – Basics you need to know for Proteomics
Proteins – Basics you need to know for ProteomicsProteins – Basics you need to know for Proteomics
Proteins – Basics you need to know for Proteomics
 
Golgi apparatus
Golgi apparatusGolgi apparatus
Golgi apparatus
 
proteomics, mass spectrometry, science, bioinformatics, electrophoresis, liqu...
proteomics, mass spectrometry, science, bioinformatics, electrophoresis, liqu...proteomics, mass spectrometry, science, bioinformatics, electrophoresis, liqu...
proteomics, mass spectrometry, science, bioinformatics, electrophoresis, liqu...
 
Golgi complex structure and functions
Golgi complex structure and functionsGolgi complex structure and functions
Golgi complex structure and functions
 

Similar to Genomics and proteomics II

Genomicsandproteomicsii
GenomicsandproteomicsiiGenomicsandproteomicsii
Genomicsandproteomicsii
Shyam Kodi
 
OMICS.pptx
OMICS.pptxOMICS.pptx
OMICS.pptx
PagudalaSangeetha
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactions
Prianca12
 
Why Proteins Are Essential For Cellular Function
Why Proteins Are Essential For Cellular FunctionWhy Proteins Are Essential For Cellular Function
Why Proteins Are Essential For Cellular Function
Beth Salazar
 
Prion Protein
Prion ProteinPrion Protein
Prion Protein
mazraara
 
Presentation(malak eshtawi)
Presentation(malak eshtawi)Presentation(malak eshtawi)
Presentation(malak eshtawi)
Malak Ishtewi
 
PROTEOMICS IN BLOOD BANKING. DIFFERENT TYPES OF PROTEOMICS
PROTEOMICS IN BLOOD BANKING. DIFFERENT TYPES OF PROTEOMICSPROTEOMICS IN BLOOD BANKING. DIFFERENT TYPES OF PROTEOMICS
PROTEOMICS IN BLOOD BANKING. DIFFERENT TYPES OF PROTEOMICS
PoobalanRajan2
 

Similar to Genomics and proteomics II (20)

Genomicsandproteomicsii
GenomicsandproteomicsiiGenomicsandproteomicsii
Genomicsandproteomicsii
 
proteomics scope and its importance by aniqa atta
proteomics scope and its importance by aniqa attaproteomics scope and its importance by aniqa atta
proteomics scope and its importance by aniqa atta
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Biotech 2011-01-intro
Biotech 2011-01-introBiotech 2011-01-intro
Biotech 2011-01-intro
 
GENOMICS
GENOMICSGENOMICS
GENOMICS
 
4.0 CENTRAL DOGMA OF MOLLECULAR BIOLOGY V2.pptx
4.0 CENTRAL DOGMA OF MOLLECULAR BIOLOGY V2.pptx4.0 CENTRAL DOGMA OF MOLLECULAR BIOLOGY V2.pptx
4.0 CENTRAL DOGMA OF MOLLECULAR BIOLOGY V2.pptx
 
protein-protein interaction
protein-protein  interactionprotein-protein  interaction
protein-protein interaction
 
Proteomics, techniques, applications.pdf
Proteomics, techniques, applications.pdfProteomics, techniques, applications.pdf
Proteomics, techniques, applications.pdf
 
OMICS.pptx
OMICS.pptxOMICS.pptx
OMICS.pptx
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactions
 
Proteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programmeProteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programme
 
genomics proteomics metbolomics.pptx
genomics proteomics metbolomics.pptxgenomics proteomics metbolomics.pptx
genomics proteomics metbolomics.pptx
 
Biotech 2011-01-intro
Biotech 2011-01-introBiotech 2011-01-intro
Biotech 2011-01-intro
 
Why Proteins Are Essential For Cellular Function
Why Proteins Are Essential For Cellular FunctionWhy Proteins Are Essential For Cellular Function
Why Proteins Are Essential For Cellular Function
 
Prion Protein
Prion ProteinPrion Protein
Prion Protein
 
Protein protein interaction
Protein protein interactionProtein protein interaction
Protein protein interaction
 
New proteomics
New proteomicsNew proteomics
New proteomics
 
Proteomics
ProteomicsProteomics
Proteomics
 
Presentation(malak eshtawi)
Presentation(malak eshtawi)Presentation(malak eshtawi)
Presentation(malak eshtawi)
 
PROTEOMICS IN BLOOD BANKING. DIFFERENT TYPES OF PROTEOMICS
PROTEOMICS IN BLOOD BANKING. DIFFERENT TYPES OF PROTEOMICSPROTEOMICS IN BLOOD BANKING. DIFFERENT TYPES OF PROTEOMICS
PROTEOMICS IN BLOOD BANKING. DIFFERENT TYPES OF PROTEOMICS
 

More from Nikolay Vyahhi

Assembly and finishing
Assembly and finishingAssembly and finishing
Assembly and finishing
Nikolay Vyahhi
 
Molbiol 2011-13-organelles
Molbiol 2011-13-organellesMolbiol 2011-13-organelles
Molbiol 2011-13-organelles
Nikolay Vyahhi
 
Molbiol 2011-12-eukaryotic gene-expression
Molbiol 2011-12-eukaryotic gene-expressionMolbiol 2011-12-eukaryotic gene-expression
Molbiol 2011-12-eukaryotic gene-expression
Nikolay Vyahhi
 
Molbiol 2011-10-proteins
Molbiol 2011-10-proteinsMolbiol 2011-10-proteins
Molbiol 2011-10-proteins
Nikolay Vyahhi
 
Molbiol 2011-09-reparation-recombination
Molbiol 2011-09-reparation-recombinationMolbiol 2011-09-reparation-recombination
Molbiol 2011-09-reparation-recombination
Nikolay Vyahhi
 
Molbiol 2011-08-epigenetics
Molbiol 2011-08-epigeneticsMolbiol 2011-08-epigenetics
Molbiol 2011-08-epigenetics
Nikolay Vyahhi
 
Molbiol 2011-07-chromosomes-cell-cycle
Molbiol 2011-07-chromosomes-cell-cycleMolbiol 2011-07-chromosomes-cell-cycle
Molbiol 2011-07-chromosomes-cell-cycle
Nikolay Vyahhi
 
Molbiol 2011-06-transcription-translation
Molbiol 2011-06-transcription-translationMolbiol 2011-06-transcription-translation
Molbiol 2011-06-transcription-translation
Nikolay Vyahhi
 
Molbiol 2011-05-dna-rna-protein
Molbiol 2011-05-dna-rna-proteinMolbiol 2011-05-dna-rna-protein
Molbiol 2011-05-dna-rna-protein
Nikolay Vyahhi
 
Molbiol 2011-04-metabolism
Molbiol 2011-04-metabolismMolbiol 2011-04-metabolism
Molbiol 2011-04-metabolism
Nikolay Vyahhi
 
Molbiol 2011-03-biochem
Molbiol 2011-03-biochemMolbiol 2011-03-biochem
Molbiol 2011-03-biochem
Nikolay Vyahhi
 
Molbiol 2011-02-biology
Molbiol 2011-02-biologyMolbiol 2011-02-biology
Molbiol 2011-02-biology
Nikolay Vyahhi
 
Molbiol 2011-01-chemistry
Molbiol 2011-01-chemistryMolbiol 2011-01-chemistry
Molbiol 2011-01-chemistry
Nikolay Vyahhi
 
Molbiol 2011-11-role of-proteins
Molbiol 2011-11-role of-proteinsMolbiol 2011-11-role of-proteins
Molbiol 2011-11-role of-proteins
Nikolay Vyahhi
 
Biotech 2011-08-recombinant-dna
Biotech 2011-08-recombinant-dnaBiotech 2011-08-recombinant-dna
Biotech 2011-08-recombinant-dna
Nikolay Vyahhi
 
Biotech 2011-02-genetics
Biotech 2011-02-geneticsBiotech 2011-02-genetics
Biotech 2011-02-genetics
Nikolay Vyahhi
 
Biotech 2011-10-methods
Biotech 2011-10-methodsBiotech 2011-10-methods
Biotech 2011-10-methods
Nikolay Vyahhi
 
Biotech 2011-09-pcr and-in_situ_methods
Biotech 2011-09-pcr and-in_situ_methodsBiotech 2011-09-pcr and-in_situ_methods
Biotech 2011-09-pcr and-in_situ_methods
Nikolay Vyahhi
 
Biotech 2011-07-finding-orf-etc
Biotech 2011-07-finding-orf-etcBiotech 2011-07-finding-orf-etc
Biotech 2011-07-finding-orf-etc
Nikolay Vyahhi
 

More from Nikolay Vyahhi (20)

Assembly and finishing
Assembly and finishingAssembly and finishing
Assembly and finishing
 
Molbiol 2011-wetlab
Molbiol 2011-wetlabMolbiol 2011-wetlab
Molbiol 2011-wetlab
 
Molbiol 2011-13-organelles
Molbiol 2011-13-organellesMolbiol 2011-13-organelles
Molbiol 2011-13-organelles
 
Molbiol 2011-12-eukaryotic gene-expression
Molbiol 2011-12-eukaryotic gene-expressionMolbiol 2011-12-eukaryotic gene-expression
Molbiol 2011-12-eukaryotic gene-expression
 
Molbiol 2011-10-proteins
Molbiol 2011-10-proteinsMolbiol 2011-10-proteins
Molbiol 2011-10-proteins
 
Molbiol 2011-09-reparation-recombination
Molbiol 2011-09-reparation-recombinationMolbiol 2011-09-reparation-recombination
Molbiol 2011-09-reparation-recombination
 
Molbiol 2011-08-epigenetics
Molbiol 2011-08-epigeneticsMolbiol 2011-08-epigenetics
Molbiol 2011-08-epigenetics
 
Molbiol 2011-07-chromosomes-cell-cycle
Molbiol 2011-07-chromosomes-cell-cycleMolbiol 2011-07-chromosomes-cell-cycle
Molbiol 2011-07-chromosomes-cell-cycle
 
Molbiol 2011-06-transcription-translation
Molbiol 2011-06-transcription-translationMolbiol 2011-06-transcription-translation
Molbiol 2011-06-transcription-translation
 
Molbiol 2011-05-dna-rna-protein
Molbiol 2011-05-dna-rna-proteinMolbiol 2011-05-dna-rna-protein
Molbiol 2011-05-dna-rna-protein
 
Molbiol 2011-04-metabolism
Molbiol 2011-04-metabolismMolbiol 2011-04-metabolism
Molbiol 2011-04-metabolism
 
Molbiol 2011-03-biochem
Molbiol 2011-03-biochemMolbiol 2011-03-biochem
Molbiol 2011-03-biochem
 
Molbiol 2011-02-biology
Molbiol 2011-02-biologyMolbiol 2011-02-biology
Molbiol 2011-02-biology
 
Molbiol 2011-01-chemistry
Molbiol 2011-01-chemistryMolbiol 2011-01-chemistry
Molbiol 2011-01-chemistry
 
Molbiol 2011-11-role of-proteins
Molbiol 2011-11-role of-proteinsMolbiol 2011-11-role of-proteins
Molbiol 2011-11-role of-proteins
 
Biotech 2011-08-recombinant-dna
Biotech 2011-08-recombinant-dnaBiotech 2011-08-recombinant-dna
Biotech 2011-08-recombinant-dna
 
Biotech 2011-02-genetics
Biotech 2011-02-geneticsBiotech 2011-02-genetics
Biotech 2011-02-genetics
 
Biotech 2011-10-methods
Biotech 2011-10-methodsBiotech 2011-10-methods
Biotech 2011-10-methods
 
Biotech 2011-09-pcr and-in_situ_methods
Biotech 2011-09-pcr and-in_situ_methodsBiotech 2011-09-pcr and-in_situ_methods
Biotech 2011-09-pcr and-in_situ_methods
 
Biotech 2011-07-finding-orf-etc
Biotech 2011-07-finding-orf-etcBiotech 2011-07-finding-orf-etc
Biotech 2011-07-finding-orf-etc
 

Recently uploaded

Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 

Recently uploaded (20)

How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
PLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsPLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. Startups
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG Evaluation
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 

Genomics and proteomics II

  • 1. www. .uni-rostock.de Bioinformatics Introduction to genomics and proteomics II ulf.schmitz@informatik.uni-rostock.de Bioinformatics and Systems Biology Group www.sbi.informatik.uni-rostock.de Ulf Schmitz, Introduction to genomics and proteomics II 1
  • 2. www. .uni-rostock.de Outline 1. Proteomics • Motivation • Post -Translational Modifications • Key technologies • Data explosion 2. Maps of hereditary information 3. Single nucleotide polymorphisms Ulf Schmitz, Introduction to genomics and proteomics II 2
  • 3. www. .uni-rostock.de Protomics Proteomics: • is the large-scale study of proteins, particularly their structures and functions • This term was coined to make an analogy with genomics, and is often viewed as the "next step", • but proteomics is much more complicated than genomics. • Most importantly, while the genome is a rather constant entity, the proteome is constantly changing through its biochemical interactions with the genome. • One organism will have radically different protein expression in different parts of its body and in different stages of its life cycle. Proteome: The entirety of proteins in existence in an organism are referred to as the proteome. Ulf Schmitz, Introduction to genomics and proteomics II 3
  • 4. www. .uni-rostock.de Proteomics If the genome is a list of the instruments in an orchestra, the proteome is the orchestra playing a symphony. R.Simpson Ulf Schmitz, Introduction to genomics and proteomics II 4
  • 5. www. .uni-rostock.de Proteomics • Describing all 3D structures of proteins in the cell is called Structural Genomics • Finding out what these proteins do is called Functional Genomics DNA Microarray GENOME Genetic Screens PROTEOME Protein – Ligand Protein – Protein Interactions Interactions Structure Ulf Schmitz, Introduction to genomics and proteomics II 5
  • 6. www. .uni-rostock.de Proteomics Motivation: • What kind of data would we like to measure? • What mature experimental techniques exist to determine them? • The basic goal is a spatio-temporal description of the deployment of proteins in the organism. Ulf Schmitz, Introduction to genomics and proteomics II 6
  • 7. www. .uni-rostock.de Proteomics Things to consider: • the rates of synthesis of different proteins vary among different tissues and different cell types and states of activity • methods are available for efficient analysis of transcription patterns of multiple genes • because proteins ‘turn over’ at different rates, it is also necessary to measure proteins directly • the distribution of expressed protein levels is a kinetic balance between rates of protein synthesis and degradation Ulf Schmitz, Introduction to genomics and proteomics II 7
  • 8. www. .uni-rostock.de Ulf Schmitz, Introduction to genomics and proteomics II 8
  • 9. www. .uni-rostock.de Why do Proteomics? • are there differences between amino acid sequences determined directly from proteins and those determined by translation from DNA? – pattern recognition programs addressing this questions have following errors: • a genuine protein sequence may be missed entirely • an incomplete protein may be reported • a gene may be incorrectly spliced • genes for different proteins may overlap • genes may be assembled from exons in different ways in different tissues – often, molecules must be modified to make a mature protein that differs significantly from the one suggested by translation • in many cases the missing post-translational- modifications are quite important and have functional significance • post-transitional modifications include addition of ligands, glycosylation, methylation, excision of peptides, etc. – in some cases mRNA is edited before translation, creating changes in the amino acid sequence that are not inferrable from the genes • a protein inferred from a genome sequence is a hypothetical object until an experiment verifies its existence Ulf Schmitz, Introduction to genomics and proteomics II 9
  • 10. www. .uni-rostock.de Post-translational modification • a protein is a polypeptide chain composed of 20 possible amino acids • there are far fewer genes that code for proteins in the human genome than there are proteins in the human proteome (~33,000 genes vs ~200,000 proteins). • each gene encodes as many as six to eight different proteins – due to post-translational modifications such as phosphorylation, glycosylation or cleavage (Spaltung) • posttranslational modification extends the range of possible functions a protein can have – changes may alter the hydrophobicity of a protein and thus determine if the modified protein is cytosolic or membrane-bound – modifications like phosphorylation are part of common mechanisms for controlling the behavior of a protein, for instance, activating or inactivating an enzyme. Ulf Schmitz, Introduction to genomics and proteomics II 10
  • 11. www. .uni-rostock.de Post-translational modification Phosphorylation • phosphorylation is the addition of a phosphate (PO4) group to a protein or a small molecule (usual to serine, tyrosine, threonine or histidine) • In eukaryotes, protein phosphorylation is probably the most important regulatory event • Many enzymes and receptors are switched "on" or "off" by phosphorylation and dephosphorylation • Phosphorylation is catalyzed by various specific protein kinases, whereas phosphatases dephosphorylate. Acetylation • Is the addition of an acetyl group, usually at the N-terminus of the protein Farnesylation • farnesylation, the addition of a farnesyl group Glycosylation • the addition of a glycosyl group to either asparagine, hydroxylysine, serine, or threonine, resulting in a glycoprotein Ulf Schmitz, Introduction to genomics and proteomics II 11
  • 12. www. .uni-rostock.de Proteomics Ulf Schmitz, Introduction to genomics and proteomics II 12
  • 13. www. .uni-rostock.de Key technologies for proteomics 1. 1-D electrophoresis and 2-D electrophoresis • are for the separation and visualization of proteins. 2. mass spectrometry, x-ray crystallography, and NMR (Nuclear magnetic resonance ) • are used to identify and characterize proteins 3. chromatography techniques especially affinity chromatography • are used to characterize protein-protein interactions. 4. Protein expression systems like the yeast two- hybrid and FRET (fluorescence resonance energy transfer) • can also be used to characterize protein-protein interactions. Ulf Schmitz, Introduction to genomics and proteomics II 13
  • 14. www. .uni-rostock.de Key technologies for proteomics High-resolution two-dimensional polyacrylamide gel electrophoresis (2D PAGE) shows the pattern of protein content in a sample. Reference map of lympphoblastoid cell linePRI, soluble proteins. • 110 µg of proteins loaded • Strip 17cm pH gradient 4-7, SDS PAGE gels 20 x 25 cm, 8-18.5% T. • Staining by silver nitrate method (Rabilloud et al.,) • Identification by mass spectrometry. The pinks labels on the spots indicate the ID in Swiss-prot database browse the SWISS-2DPAGE database for more 2d PAGE images Ulf Schmitz, Introduction to genomics and proteomics II 14
  • 15. www. .uni-rostock.de Proteomics X-ray crystallography is a means to determine the detailed molecular structure of a protein, nucleic acid or small molecule. With a crystal structure we can explain the mechanism of an enzyme, the binding of an inhibitor, the packing of protein domains, the tertiary structure of a nucleic acid molecule etc.. Typically, a sample is purified to homogeneity, crystallized, subjected to an X- ray beam and diffraction data are collected. Ulf Schmitz, Introduction to genomics and proteomics II 15
  • 16. www. .uni-rostock.de High-throughput Biological Data • Enormous amounts of biological data are being generated by high-throughput capabilities; even more are coming – genomic sequences – gene expression data (microarrays) – mass spec. data – protein-protein interaction (chromatography) – protein structures (x-ray christallography) – ...... Ulf Schmitz, Introduction to genomics and proteomics II 16
  • 17. www. .uni-rostock.de Protein structural data explosion Protein Data Bank (PDB): 33.367 Structures (1 November 2005) 28.522 x-ray crystallography, 4.845 NMR Ulf Schmitz, Introduction to genomics and proteomics II 17
  • 18. www. .uni-rostock.de Maps of hereditary information Following maps are used to find out how hereditary information is stored, passed on, and implemented. 1. Linkage maps of genes mini- / microsatellites 2. Banding patterns of chromosomes physical objects with visible landmarks called banding patterns 3. DNA sequences Contig maps (contigous clone maps) Sequence tagged site (STS) SNPs (Single nucloetide polymorphisms) Ulf Schmitz, Introduction to genomics and proteomics II 18
  • 19. www. .uni-rostock.de Linkage map Ulf Schmitz, Introduction to genomics and proteomics II 19
  • 20. www. .uni-rostock.de Maps of hereditary information Variable number tandem repeats (VNTRs, also minisatellites) • regions, 8-80bp long, repeated a variable number of times • the distribution and the size of repeats is the marker • inheritance of VNTRs can be followed in a family and mapped to a pathological phenotype • first genetic data used for personal identification – Genetic fingerprints; in paternity and in criminal cases Short tandem repeat polymorphism (STRPs, also microsatellites) • Regions of 2-7bp, repeated many times – Usually 10-30 consecutive copies Ulf Schmitz, Introduction to genomics and proteomics II 20
  • 21. www. .uni-rostock.de centromere 3bp CGTCGTCGTCGTCGTCGTCGTCGT... GCAGCAGCAGCAGCAGCAGCAGCA... Ulf Schmitz, Introduction to genomics and proteomics II 21
  • 22. www. .uni-rostock.de Maps of hereditary information Banding patterns of chromosomes Ulf Schmitz, Introduction to genomics and proteomics II 22
  • 23. www. .uni-rostock.de Maps of hereditary information Banding patterns of chromosomes petite – arm centromere queue - arm Ulf Schmitz, Introduction to genomics and proteomics II 23
  • 24. www. .uni-rostock.de Maps of hereditary information Contig map (also contiguous clone map) • Series of overlapping DNA clones of known order along a chromosome from an organism of interest, stored in yeast or bacterial cells as YACs (Yeast Artificial Chromosomes) or BACs (Bacterial Artificial Chromosomes) • A contig map produces a fine mapping (high resolution) of a genome • YAC can contain up to 106bp, a BAC about 250.000bp Sequence tagged site (STS) • Short, sequenced region of DNA, 200-600bp long, that appears in a unique location in the genome • One type arises from an EST (expressed sequence tag), a piece of cDNA Ulf Schmitz, Introduction to genomics and proteomics II 24
  • 25. www. .uni-rostock.de Maps of hereditary information Imagine we know that a disease results from a specific defective protein: 1. if we know the protein involved, we can pursue rational approaches to therapy 2. if we know the gene involved, we can devise tests to identify sufferers or carriers 3. wereas the knowledge of the chromosomal location of the gene is unnecessary in many cases for either therapy or detection; • it is required only for identifying the gene, providing a bridge between the patterns of inheritance and the DNA sequence Ulf Schmitz, Introduction to genomics and proteomics II 25
  • 26. www. .uni-rostock.de Single nucleotide polymorphisms (SNPs) • SNP (pronounced ‘snip’) is a genetic variation between individuals • single base pairs that can be substituted, deleted or inserted • SNPs are distributed throughout the genome – average every 2000bp • provide markers for mapping genes • not all SNPs are linked to diseases Ulf Schmitz, Introduction to genomics and proteomics II 26
  • 27. www. .uni-rostock.de Single nucleotide polymorphisms (SNPs) • nonsense mutations: – codes for a stop, which can truncate the protein • missense mutations: – codes for a different amino acid • silent mutations: – codes for the same amino acid, so has no effect Ulf Schmitz, Introduction to genomics and proteomics II 27
  • 28. www. .uni-rostock.de Outlook – coming lecture • Bioinformatics Information Resources And Networks – EMBnet – European Molecular Biology Network • DBs and Tools – NCBI – National Center For Biotechnology Information • DBs and Tools – Nucleic Acid Sequence Databases – Protein Information Resources – Metabolic Databases – Mapping Databases – Databases concerning Mutations – Literature Databases Ulf Schmitz, Introduction to genomics and proteomics II 28
  • 29. www. .uni-rostock.de Thanks for your attention! Ulf Schmitz, Introduction to genomics and proteomics II 29