SlideShare ist ein Scribd-Unternehmen logo
1 von 38
Sifting the human genome for functional polymorphisms Pauline C. Ng, PhD
From genotype to phenotype ,[object Object],[object Object]
Variation around genes are most likely to contribute to phenotype Coding Nonsynonymous SNPs, variation that causes an amino acid substitution 3’UTR Change in  protein  function? 5’UTR upstream 5’UTR
Amino acid substitutions can cause disease ,[object Object],Hemoglobin    E6V     sickle-cell anemia
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],nsSNPs in humans are selected against ? some of the observed nsSNPs may be involved in disease
Predicting the effect of an amino acid substitution ,[object Object],[object Object],[object Object],[object Object]
Computational Tools for Predicting AA Substitution Effects ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Sequence conservation correlated with intolerance to substitutions Conservation    log 2 20 +   f aa log f aa
SIFT Choosing sequences a) Database search b) Choose closely  related sequences Obtain  alignment with related proteins. For each position, calculate scaled probabilities for each amino acid substitution. Query protein < cutoff > cutoff tolerated affects function
SIFT: Choosing sequences # of sequences:  1 2 3 4 5 6 7 8 9 10 11 12 13 14
SIFT: Calculating probabilities 1 0 2 0 1 0 4 0 1 0 1 0 1 0 1 0 1 0 2 0 3 0 1 0 p x /p max  < 0.05 => x  affects function  20 12 4 1 20 16 13 9 4 2 12 0 9 7 18 13 19 12 16 11 c 20 14 c 13 9 c 16 10 16 9 c 5 2 13 7 12 8 12 8
SIFT output Substitution Probability  Prediction Confidence M24S    0.04 Affect Function   Low S82T   0.36 Tolerated   High V247A    0.03 Affect Function    High !!!
Confidence is determined by the diversity of sequences in the alignment many highly  identical sequences Ideal case:  Diverse set of orthologous proteins few sequences available Low confidence examples
Case Study: LacI lac operon repressed LacI expressed lactose present normal state 4000 single amino acid substitutions assayed: throughout entire protein both neutral and affected phenotypes TIBS  22:334-339 c c
Prediction on LacI substitutions 63% 28% Substitutions that affect protein function Substitutions that give no phenotype Total prediction accuracy 68% (2726/4004) Pr(observe affected phenotype  | predicted to be damaging) 63% false - false + 37% 72% predicted to affect function predicted to be tolerated 37%
False negative error:  Positions not conserved among paralogues dimer & sugar interface not conserved
False positive error in LacI: surface with unknown function?
SIFTing human variant databases 69% 25% Substitutions involved in disease 7397 subst., 606 proteins  from SWISS-PROT Predicted on 76% proteins 71% subst nsSNPs in  normal individuals 19% Putative polymorphisms 5780 nsSNPs, 3005 proteins from dbSNP Predicted on 60% prot.,  53% subst. 185 nsSNPs, 69 proteins from Whitehead Institute Predicted on 77% prot. 62% subst 31% 81% 75%
[object Object],dbSNP nsSNPs in normal individuals Whitehead Institute Putative polymorphisms suggests that most nsSNPs are  functionally neutral What accounts for the 5% difference? 25% 19%
Account for 5% difference in dbSNP 16 genes with a high fraction of dbSNP variants predicted to affect function 1)  Substitutions found in patients  2)  Substitutions mapped to nonfunctional genes/regions 3)  Substitutions detected in error Supports SIFT as a prediction tool
Account for 5% difference in dbSNP 16 genes with a high fraction of dbSNP variants predicted to affect function 1)  Substitutions found in patients   2)  Substitutions mapped to nonfunctional genes/regions 3)  Substitutions detected in error Supports SIFT as a prediction tool
Mutations in  MSHR  increase skin cancer Mutations associated with cutaneous malignant melanoma 1 Mutations  not  associated with CMM 1-3 1   Am. J. Hum. Genet . 66: 176-186,  2   J. Invest. Dermatol . 116 :224-229,  3   J. Invest. Dermatol . 112: 512-513 R151C L60V    R151C   D294H   R160W  Tolerated Affect  function Prediction Substitution  L60V  R163Q  D84E Tolerated Affect  function Prediction Substitution
Mutations in  PPAR  ,  a candidate gene for diabetes ,[object Object],[object Object],[object Object],[object Object],1 Am. J. Hum. Genet . 63:abs997  2 Diabetologia  43:673-680  3 Diabetes Metab . 26:393-401  4 J.Lipid Res.  41: 945-952  5 J. Hum. Genet . 46: 285-288 Mutations in diabetics 1 Mutations in nondiabetics 1-5    R127Q   R409T   D304N  Tolerated Affect  function Prediction Substitution  V227A  A268V  *** L162V Tolerated Affect  function  Prediction Substitution
Mutations in  MTHFR Mutations with diminished enzyme activity 1-5 Unknown effect ,[object Object],[object Object],[object Object],[object Object],Found by contig comparison 1 Nat. Genet . 10:111-113  2 PNAS  96:12810-12815  3 PNAS  98:4004-4009  4 Cancer Res . 57:1098-1102 5 Mol. Genet. Metab . 64: 169-172    A222V  E429A Tolerated Affect function Prediction Substitution  R68Q
dbSNP variants from patients ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
16 genes with a high fraction of dbSNP variants predicted to affect function 1)  Substitutions found in patients  2)  Substitutions mapped to nonfunctional genes or    regions 3)  Substitutions detected in error
16 genes with a high fraction of dbSNP variants predicted to affect function 1)  Substitutions found in patients  2)  Substitutions mapped to nonfunctional genes or    regions 3)  Substitutions detected in error
16 genes with a high fraction of dbSNP variants predicted to affect function 1)  Substitutions found in patients 2)  Substitutions mapped to nonfunctional genes/regions 3)   Substitutions detected in error   Changes found in patients Confirms SIFT prediction and its sensitivity Unlikely to affect human health Irrelevant to human health
Comparison of Prediction Tools 69% 69% 63% 75% 28% 9% 25% 32% 15% 19% Variagenics SIFT SIFT EMBL disease  subst. LacI Variagenics SIFT LacI EMBL* 15% Variagenics SIFT SIFT EMBL SNP  databases normal individuals Substitutions that affect function Substitutions that do not affect function Polymorphisms 31% 72% 69% 91% 75% 68% 81% 85% SIFT has similar prediction accuracy to tools that use structure
[object Object],[object Object],[object Object],[object Object]
Association studies  for finding disease loci   ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],AATACGAT AATACGAT AATACGAT GATACAAC GATACAAC GATACAAC
Feasibility of direct approach ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],Possible effect? In LD with causative variant? SNPs in and near genes protein  function splicing regulation
[object Object],Double-hit and known-frequency SNPs in genes
Non-genic regions could potentially harbor disease variants ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Adding SNPs in conserved regions improves SNP density
Focusing on variation in functional regions ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Acknowledgments ,[object Object],[object Object],[object Object],[object Object]

Weitere ähnliche Inhalte

Was ist angesagt?

Monitoring of Minimal Residual Disease Principles and Applications
Monitoring of Minimal Residual Disease Principles and ApplicationsMonitoring of Minimal Residual Disease Principles and Applications
Monitoring of Minimal Residual Disease Principles and Applicationsspa718
 
Development and verification of an Ion AmpliSeq TP53 Panel
Development and verification of an Ion AmpliSeq TP53 PanelDevelopment and verification of an Ion AmpliSeq TP53 Panel
Development and verification of an Ion AmpliSeq TP53 PanelThermo Fisher Scientific
 
NSCLC: diagnóstico molecular, pronóstico y seguimiento; CTC
NSCLC: diagnóstico molecular, pronóstico y seguimiento; CTCNSCLC: diagnóstico molecular, pronóstico y seguimiento; CTC
NSCLC: diagnóstico molecular, pronóstico y seguimiento; CTCMauricio Lema
 
Innovación en terapia sistémica de cáncer de pulmón
Innovación en terapia sistémica de cáncer de pulmónInnovación en terapia sistémica de cáncer de pulmón
Innovación en terapia sistémica de cáncer de pulmónMauricio Lema
 
FLT3 INHIBITORS
FLT3 INHIBITORSFLT3 INHIBITORS
FLT3 INHIBITORSspa718
 
1-s2.0-S2211124714001612-main
1-s2.0-S2211124714001612-main1-s2.0-S2211124714001612-main
1-s2.0-S2211124714001612-mainAnirudh Prahallad
 
Identification and characterization of micro rn as expressed in hiv
Identification and characterization of micro rn as expressed in hivIdentification and characterization of micro rn as expressed in hiv
Identification and characterization of micro rn as expressed in hivTaahira Goolam Hoosen (Moola)
 
Regulatory T Cells and GVHD
Regulatory T Cells and GVHDRegulatory T Cells and GVHD
Regulatory T Cells and GVHDspa718
 
RNA splicing mutations and human disease: Pompe disease.
RNA splicing mutations and human disease: Pompe disease.RNA splicing mutations and human disease: Pompe disease.
RNA splicing mutations and human disease: Pompe disease.CentroMalattieRareFVG
 
Artesunate improves drug resistance of lung carcinomas via regulation of mi r...
Artesunate improves drug resistance of lung carcinomas via regulation of mi r...Artesunate improves drug resistance of lung carcinomas via regulation of mi r...
Artesunate improves drug resistance of lung carcinomas via regulation of mi r...Clinical Surgery Research Communications
 
E research feb2016 sifting the needles in the haystack
E research feb2016 sifting the needles in the haystackE research feb2016 sifting the needles in the haystack
E research feb2016 sifting the needles in the haystackTom Kelly
 
Introducing Genomic Testing Cooperative (GTC)
Introducing Genomic Testing Cooperative (GTC)Introducing Genomic Testing Cooperative (GTC)
Introducing Genomic Testing Cooperative (GTC)George Arndt
 
Ph D Swati Dhar
Ph D Swati DharPh D Swati Dhar
Ph D Swati DharSwati Dhar
 
Ultrasound reverses adriamycin resistance in non-small cell lung cancer via p...
Ultrasound reverses adriamycin resistance in non-small cell lung cancer via p...Ultrasound reverses adriamycin resistance in non-small cell lung cancer via p...
Ultrasound reverses adriamycin resistance in non-small cell lung cancer via p...Clinical Surgery Research Communications
 

Was ist angesagt? (20)

Monitoring of Minimal Residual Disease Principles and Applications
Monitoring of Minimal Residual Disease Principles and ApplicationsMonitoring of Minimal Residual Disease Principles and Applications
Monitoring of Minimal Residual Disease Principles and Applications
 
Development and verification of an Ion AmpliSeq TP53 Panel
Development and verification of an Ion AmpliSeq TP53 PanelDevelopment and verification of an Ion AmpliSeq TP53 Panel
Development and verification of an Ion AmpliSeq TP53 Panel
 
NSCLC: diagnóstico molecular, pronóstico y seguimiento; CTC
NSCLC: diagnóstico molecular, pronóstico y seguimiento; CTCNSCLC: diagnóstico molecular, pronóstico y seguimiento; CTC
NSCLC: diagnóstico molecular, pronóstico y seguimiento; CTC
 
EASL2015cfDNA2
EASL2015cfDNA2EASL2015cfDNA2
EASL2015cfDNA2
 
PNAS 1992 FBH.PDF
PNAS 1992 FBH.PDFPNAS 1992 FBH.PDF
PNAS 1992 FBH.PDF
 
Innovación en terapia sistémica de cáncer de pulmón
Innovación en terapia sistémica de cáncer de pulmónInnovación en terapia sistémica de cáncer de pulmón
Innovación en terapia sistémica de cáncer de pulmón
 
Crizo
CrizoCrizo
Crizo
 
FLT3 INHIBITORS
FLT3 INHIBITORSFLT3 INHIBITORS
FLT3 INHIBITORS
 
Cell death 2013
Cell death 2013Cell death 2013
Cell death 2013
 
1-s2.0-S2211124714001612-main
1-s2.0-S2211124714001612-main1-s2.0-S2211124714001612-main
1-s2.0-S2211124714001612-main
 
Identification and characterization of micro rn as expressed in hiv
Identification and characterization of micro rn as expressed in hivIdentification and characterization of micro rn as expressed in hiv
Identification and characterization of micro rn as expressed in hiv
 
nature13121
nature13121nature13121
nature13121
 
Regulatory T Cells and GVHD
Regulatory T Cells and GVHDRegulatory T Cells and GVHD
Regulatory T Cells and GVHD
 
CH_BBM Poster 2014
CH_BBM Poster 2014CH_BBM Poster 2014
CH_BBM Poster 2014
 
RNA splicing mutations and human disease: Pompe disease.
RNA splicing mutations and human disease: Pompe disease.RNA splicing mutations and human disease: Pompe disease.
RNA splicing mutations and human disease: Pompe disease.
 
Artesunate improves drug resistance of lung carcinomas via regulation of mi r...
Artesunate improves drug resistance of lung carcinomas via regulation of mi r...Artesunate improves drug resistance of lung carcinomas via regulation of mi r...
Artesunate improves drug resistance of lung carcinomas via regulation of mi r...
 
E research feb2016 sifting the needles in the haystack
E research feb2016 sifting the needles in the haystackE research feb2016 sifting the needles in the haystack
E research feb2016 sifting the needles in the haystack
 
Introducing Genomic Testing Cooperative (GTC)
Introducing Genomic Testing Cooperative (GTC)Introducing Genomic Testing Cooperative (GTC)
Introducing Genomic Testing Cooperative (GTC)
 
Ph D Swati Dhar
Ph D Swati DharPh D Swati Dhar
Ph D Swati Dhar
 
Ultrasound reverses adriamycin resistance in non-small cell lung cancer via p...
Ultrasound reverses adriamycin resistance in non-small cell lung cancer via p...Ultrasound reverses adriamycin resistance in non-small cell lung cancer via p...
Ultrasound reverses adriamycin resistance in non-small cell lung cancer via p...
 

Andere mochten auch

Thyroid and the Heart
Thyroid and the HeartThyroid and the Heart
Thyroid and the Heartcallroom
 
Test Presentation
Test PresentationTest Presentation
Test Presentationcallroom
 
Nondisjunction and its implications ppt
Nondisjunction and its implications pptNondisjunction and its implications ppt
Nondisjunction and its implications pptpoojakamble1609
 
37233502 amniocentesis-and-cvs
37233502 amniocentesis-and-cvs37233502 amniocentesis-and-cvs
37233502 amniocentesis-and-cvsscopulovic
 
281 lec14 eukaryptic_transcription
281 lec14 eukaryptic_transcription281 lec14 eukaryptic_transcription
281 lec14 eukaryptic_transcriptionhhalhaddad
 

Andere mochten auch (9)

PPT1
PPT1PPT1
PPT1
 
Thyroid and the Heart
Thyroid and the HeartThyroid and the Heart
Thyroid and the Heart
 
ppt6
ppt6ppt6
ppt6
 
PPT5
PPT5PPT5
PPT5
 
PPT2
PPT2PPT2
PPT2
 
Test Presentation
Test PresentationTest Presentation
Test Presentation
 
Nondisjunction and its implications ppt
Nondisjunction and its implications pptNondisjunction and its implications ppt
Nondisjunction and its implications ppt
 
37233502 amniocentesis-and-cvs
37233502 amniocentesis-and-cvs37233502 amniocentesis-and-cvs
37233502 amniocentesis-and-cvs
 
281 lec14 eukaryptic_transcription
281 lec14 eukaryptic_transcription281 lec14 eukaryptic_transcription
281 lec14 eukaryptic_transcription
 

Ähnlich wie testing123

Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Reid Robison
 
Assessing the clinical utility of cancer genomic and proteomic data across tu...
Assessing the clinical utility of cancer genomic and proteomic data across tu...Assessing the clinical utility of cancer genomic and proteomic data across tu...
Assessing the clinical utility of cancer genomic and proteomic data across tu...Gul Muneer
 
Rules and impact of nonsense-mediated mRNA decay in human cancers
Rules and impact of nonsense-mediated mRNA decay in human cancersRules and impact of nonsense-mediated mRNA decay in human cancers
Rules and impact of nonsense-mediated mRNA decay in human cancersFran Supek
 
Single nucleotide polymorphism
Single nucleotide polymorphismSingle nucleotide polymorphism
Single nucleotide polymorphismBipul Das
 
OncoRep: A n-of-1 reporting tool to support genome-guided treatment for breas...
OncoRep: A n-of-1 reporting tool to support genome-guided treatment for breas...OncoRep: A n-of-1 reporting tool to support genome-guided treatment for breas...
OncoRep: A n-of-1 reporting tool to support genome-guided treatment for breas...Tobias Meißner
 
Navigating through disease maps
Navigating through disease mapsNavigating through disease maps
Navigating through disease mapsJoaquin Dopazo
 
A New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicA New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicJoaquin Dopazo
 
High Sensitivity Sanger Sequencing for Minor Indel Detection and Characteriza...
High Sensitivity Sanger Sequencing for Minor Indel Detection and Characteriza...High Sensitivity Sanger Sequencing for Minor Indel Detection and Characteriza...
High Sensitivity Sanger Sequencing for Minor Indel Detection and Characteriza...Thermo Fisher Scientific
 
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practice
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practiceAug2013 Heidi Rehm integrating large scale sequencing into clinical practice
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practiceGenomeInABottle
 
Oncomine Cancer Research Panel (OCP) | ESHG 2015 Poster PS12.131
Oncomine Cancer Research Panel (OCP) | ESHG 2015 Poster PS12.131Oncomine Cancer Research Panel (OCP) | ESHG 2015 Poster PS12.131
Oncomine Cancer Research Panel (OCP) | ESHG 2015 Poster PS12.131Thermo Fisher Scientific
 
A next Generation Sequencing Approach to Detect Large Rearrangements in BRCA1...
A next Generation Sequencing Approach to Detect Large Rearrangements in BRCA1...A next Generation Sequencing Approach to Detect Large Rearrangements in BRCA1...
A next Generation Sequencing Approach to Detect Large Rearrangements in BRCA1...Thermo Fisher Scientific
 
IGNTU-eContent-328712472244-M.Sc-EnvironmentalScience-2-ManojkumarRai-Environ...
IGNTU-eContent-328712472244-M.Sc-EnvironmentalScience-2-ManojkumarRai-Environ...IGNTU-eContent-328712472244-M.Sc-EnvironmentalScience-2-ManojkumarRai-Environ...
IGNTU-eContent-328712472244-M.Sc-EnvironmentalScience-2-ManojkumarRai-Environ...sumitraDas14
 
Molecular mechanisms in crpc
Molecular mechanisms in crpcMolecular mechanisms in crpc
Molecular mechanisms in crpcAlok Gupta
 
miRNA-Target Site SNPs as Predictors for Cancer Risk and Treatment Response
miRNA-Target Site SNPs as Predictors for Cancer Risk and Treatment ResponsemiRNA-Target Site SNPs as Predictors for Cancer Risk and Treatment Response
miRNA-Target Site SNPs as Predictors for Cancer Risk and Treatment ResponseDavid W. Salzman
 
Presentation july 31_2015
Presentation july 31_2015Presentation july 31_2015
Presentation july 31_2015gkoytiger
 
Adaptive Molecular Evolution (dN/dS). 2011
Adaptive Molecular Evolution (dN/dS). 2011Adaptive Molecular Evolution (dN/dS). 2011
Adaptive Molecular Evolution (dN/dS). 2011Hernán Dopazo
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation SequencingShelomi Karoon
 
Verifying the role of AID in Chronic Lymphocytic Leukemia
Verifying the role of AID in Chronic Lymphocytic LeukemiaVerifying the role of AID in Chronic Lymphocytic Leukemia
Verifying the role of AID in Chronic Lymphocytic LeukemiaCharlotte Broadbent
 

Ähnlich wie testing123 (20)

Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
 
Assessing the clinical utility of cancer genomic and proteomic data across tu...
Assessing the clinical utility of cancer genomic and proteomic data across tu...Assessing the clinical utility of cancer genomic and proteomic data across tu...
Assessing the clinical utility of cancer genomic and proteomic data across tu...
 
Rules and impact of nonsense-mediated mRNA decay in human cancers
Rules and impact of nonsense-mediated mRNA decay in human cancersRules and impact of nonsense-mediated mRNA decay in human cancers
Rules and impact of nonsense-mediated mRNA decay in human cancers
 
Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...
Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...
Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...
 
Single nucleotide polymorphism
Single nucleotide polymorphismSingle nucleotide polymorphism
Single nucleotide polymorphism
 
OncoRep: A n-of-1 reporting tool to support genome-guided treatment for breas...
OncoRep: A n-of-1 reporting tool to support genome-guided treatment for breas...OncoRep: A n-of-1 reporting tool to support genome-guided treatment for breas...
OncoRep: A n-of-1 reporting tool to support genome-guided treatment for breas...
 
Navigating through disease maps
Navigating through disease mapsNavigating through disease maps
Navigating through disease maps
 
A New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicA New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The Clinic
 
High Sensitivity Sanger Sequencing for Minor Indel Detection and Characteriza...
High Sensitivity Sanger Sequencing for Minor Indel Detection and Characteriza...High Sensitivity Sanger Sequencing for Minor Indel Detection and Characteriza...
High Sensitivity Sanger Sequencing for Minor Indel Detection and Characteriza...
 
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practice
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practiceAug2013 Heidi Rehm integrating large scale sequencing into clinical practice
Aug2013 Heidi Rehm integrating large scale sequencing into clinical practice
 
Oncomine Cancer Research Panel (OCP) | ESHG 2015 Poster PS12.131
Oncomine Cancer Research Panel (OCP) | ESHG 2015 Poster PS12.131Oncomine Cancer Research Panel (OCP) | ESHG 2015 Poster PS12.131
Oncomine Cancer Research Panel (OCP) | ESHG 2015 Poster PS12.131
 
Snp
SnpSnp
Snp
 
A next Generation Sequencing Approach to Detect Large Rearrangements in BRCA1...
A next Generation Sequencing Approach to Detect Large Rearrangements in BRCA1...A next Generation Sequencing Approach to Detect Large Rearrangements in BRCA1...
A next Generation Sequencing Approach to Detect Large Rearrangements in BRCA1...
 
IGNTU-eContent-328712472244-M.Sc-EnvironmentalScience-2-ManojkumarRai-Environ...
IGNTU-eContent-328712472244-M.Sc-EnvironmentalScience-2-ManojkumarRai-Environ...IGNTU-eContent-328712472244-M.Sc-EnvironmentalScience-2-ManojkumarRai-Environ...
IGNTU-eContent-328712472244-M.Sc-EnvironmentalScience-2-ManojkumarRai-Environ...
 
Molecular mechanisms in crpc
Molecular mechanisms in crpcMolecular mechanisms in crpc
Molecular mechanisms in crpc
 
miRNA-Target Site SNPs as Predictors for Cancer Risk and Treatment Response
miRNA-Target Site SNPs as Predictors for Cancer Risk and Treatment ResponsemiRNA-Target Site SNPs as Predictors for Cancer Risk and Treatment Response
miRNA-Target Site SNPs as Predictors for Cancer Risk and Treatment Response
 
Presentation july 31_2015
Presentation july 31_2015Presentation july 31_2015
Presentation july 31_2015
 
Adaptive Molecular Evolution (dN/dS). 2011
Adaptive Molecular Evolution (dN/dS). 2011Adaptive Molecular Evolution (dN/dS). 2011
Adaptive Molecular Evolution (dN/dS). 2011
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencing
 
Verifying the role of AID in Chronic Lymphocytic Leukemia
Verifying the role of AID in Chronic Lymphocytic LeukemiaVerifying the role of AID in Chronic Lymphocytic Leukemia
Verifying the role of AID in Chronic Lymphocytic Leukemia
 

Mehr von callroom

Myocardial Viability - the STICH Trial NEJM May 2011
Myocardial Viability - the STICH Trial NEJM May 2011Myocardial Viability - the STICH Trial NEJM May 2011
Myocardial Viability - the STICH Trial NEJM May 2011callroom
 
Flail Leaflet
Flail LeafletFlail Leaflet
Flail Leafletcallroom
 
Fat versus Fit
Fat versus FitFat versus Fit
Fat versus Fitcallroom
 
Cardiac MR and viability
Cardiac MR and viabilityCardiac MR and viability
Cardiac MR and viabilitycallroom
 
Cardiac MR and viability
Cardiac MR and viabilityCardiac MR and viability
Cardiac MR and viabilitycallroom
 
LFT Review
LFT ReviewLFT Review
LFT Reviewcallroom
 
Hypertrophic Cardiomyopathy
Hypertrophic CardiomyopathyHypertrophic Cardiomyopathy
Hypertrophic Cardiomyopathycallroom
 
C. diff presentation
C. diff presentationC. diff presentation
C. diff presentationcallroom
 
Hemostasis and Thrombosis
Hemostasis and ThrombosisHemostasis and Thrombosis
Hemostasis and Thrombosiscallroom
 
safari test
safari test safari test
safari test callroom
 
safari test
safari test safari test
safari test callroom
 
Safari Test
Safari TestSafari Test
Safari Testcallroom
 

Mehr von callroom (20)

Myocardial Viability - the STICH Trial NEJM May 2011
Myocardial Viability - the STICH Trial NEJM May 2011Myocardial Viability - the STICH Trial NEJM May 2011
Myocardial Viability - the STICH Trial NEJM May 2011
 
Flail Leaflet
Flail LeafletFlail Leaflet
Flail Leaflet
 
Fat versus Fit
Fat versus FitFat versus Fit
Fat versus Fit
 
Cardiac MR and viability
Cardiac MR and viabilityCardiac MR and viability
Cardiac MR and viability
 
Cardiac MR and viability
Cardiac MR and viabilityCardiac MR and viability
Cardiac MR and viability
 
LFT Review
LFT ReviewLFT Review
LFT Review
 
Hypertrophic Cardiomyopathy
Hypertrophic CardiomyopathyHypertrophic Cardiomyopathy
Hypertrophic Cardiomyopathy
 
C. diff presentation
C. diff presentationC. diff presentation
C. diff presentation
 
test
testtest
test
 
Hemostasis and Thrombosis
Hemostasis and ThrombosisHemostasis and Thrombosis
Hemostasis and Thrombosis
 
 
 
qwqsqw
qwqsqwqwqsqw
qwqsqw
 
asaswwww
asaswwwwasaswwww
asaswwww
 
asas
asasasas
asas
 
wwww
wwwwwwww
wwww
 
safari test
safari test safari test
safari test
 
safari test
safari test safari test
safari test
 
Safari Test
Safari TestSafari Test
Safari Test
 
new
newnew
new
 

testing123

  • 1. Sifting the human genome for functional polymorphisms Pauline C. Ng, PhD
  • 2.
  • 3. Variation around genes are most likely to contribute to phenotype Coding Nonsynonymous SNPs, variation that causes an amino acid substitution 3’UTR Change in protein function? 5’UTR upstream 5’UTR
  • 4.
  • 5.
  • 6.
  • 7.
  • 8. Sequence conservation correlated with intolerance to substitutions Conservation  log 2 20 +  f aa log f aa
  • 9. SIFT Choosing sequences a) Database search b) Choose closely related sequences Obtain alignment with related proteins. For each position, calculate scaled probabilities for each amino acid substitution. Query protein < cutoff > cutoff tolerated affects function
  • 10. SIFT: Choosing sequences # of sequences: 1 2 3 4 5 6 7 8 9 10 11 12 13 14
  • 11. SIFT: Calculating probabilities 1 0 2 0 1 0 4 0 1 0 1 0 1 0 1 0 1 0 2 0 3 0 1 0 p x /p max < 0.05 => x affects function 20 12 4 1 20 16 13 9 4 2 12 0 9 7 18 13 19 12 16 11 c 20 14 c 13 9 c 16 10 16 9 c 5 2 13 7 12 8 12 8
  • 12. SIFT output Substitution Probability Prediction Confidence M24S 0.04 Affect Function Low S82T 0.36 Tolerated High V247A 0.03 Affect Function High !!!
  • 13. Confidence is determined by the diversity of sequences in the alignment many highly identical sequences Ideal case: Diverse set of orthologous proteins few sequences available Low confidence examples
  • 14. Case Study: LacI lac operon repressed LacI expressed lactose present normal state 4000 single amino acid substitutions assayed: throughout entire protein both neutral and affected phenotypes TIBS 22:334-339 c c
  • 15. Prediction on LacI substitutions 63% 28% Substitutions that affect protein function Substitutions that give no phenotype Total prediction accuracy 68% (2726/4004) Pr(observe affected phenotype | predicted to be damaging) 63% false - false + 37% 72% predicted to affect function predicted to be tolerated 37%
  • 16. False negative error: Positions not conserved among paralogues dimer & sugar interface not conserved
  • 17. False positive error in LacI: surface with unknown function?
  • 18. SIFTing human variant databases 69% 25% Substitutions involved in disease 7397 subst., 606 proteins from SWISS-PROT Predicted on 76% proteins 71% subst nsSNPs in normal individuals 19% Putative polymorphisms 5780 nsSNPs, 3005 proteins from dbSNP Predicted on 60% prot., 53% subst. 185 nsSNPs, 69 proteins from Whitehead Institute Predicted on 77% prot. 62% subst 31% 81% 75%
  • 19.
  • 20. Account for 5% difference in dbSNP 16 genes with a high fraction of dbSNP variants predicted to affect function 1) Substitutions found in patients 2) Substitutions mapped to nonfunctional genes/regions 3) Substitutions detected in error Supports SIFT as a prediction tool
  • 21. Account for 5% difference in dbSNP 16 genes with a high fraction of dbSNP variants predicted to affect function 1) Substitutions found in patients 2) Substitutions mapped to nonfunctional genes/regions 3) Substitutions detected in error Supports SIFT as a prediction tool
  • 22. Mutations in MSHR increase skin cancer Mutations associated with cutaneous malignant melanoma 1 Mutations not associated with CMM 1-3 1 Am. J. Hum. Genet . 66: 176-186, 2 J. Invest. Dermatol . 116 :224-229, 3 J. Invest. Dermatol . 112: 512-513 R151C L60V  R151C  D294H  R160W Tolerated Affect function Prediction Substitution  L60V  R163Q  D84E Tolerated Affect function Prediction Substitution
  • 23.
  • 24.
  • 25.
  • 26. 16 genes with a high fraction of dbSNP variants predicted to affect function 1) Substitutions found in patients 2) Substitutions mapped to nonfunctional genes or regions 3) Substitutions detected in error
  • 27. 16 genes with a high fraction of dbSNP variants predicted to affect function 1) Substitutions found in patients 2) Substitutions mapped to nonfunctional genes or regions 3) Substitutions detected in error
  • 28. 16 genes with a high fraction of dbSNP variants predicted to affect function 1) Substitutions found in patients 2) Substitutions mapped to nonfunctional genes/regions 3) Substitutions detected in error Changes found in patients Confirms SIFT prediction and its sensitivity Unlikely to affect human health Irrelevant to human health
  • 29. Comparison of Prediction Tools 69% 69% 63% 75% 28% 9% 25% 32% 15% 19% Variagenics SIFT SIFT EMBL disease subst. LacI Variagenics SIFT LacI EMBL* 15% Variagenics SIFT SIFT EMBL SNP databases normal individuals Substitutions that affect function Substitutions that do not affect function Polymorphisms 31% 72% 69% 91% 75% 68% 81% 85% SIFT has similar prediction accuracy to tools that use structure
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35.
  • 36. Adding SNPs in conserved regions improves SNP density
  • 37.
  • 38.

Hinweis der Redaktion

  1. hemoglobin with is a tetramer of 2 alpha and 2 beta subunits. structure on left from J.Mol. Biol he High Resolution Crystal Structure of Deoxyhemoglobin S Daniel J. Harrington, Kazuhiko Adachi, William E. Royer, Jr The Journal of Molecular Biology V272 No. 3 pp. 398-407 September 1997 http://web.wi.mit.edu/proteins/pub/BOA-2000/left.htm
  2. Start off by with slide by defining SNPs 529 SNP/Mb in exon 921 SNP/Mb intron Nucleotide diversity
  3. 40% of proteins belong to a family 70% has at least one other match
  4. (information content without correction) “In general,” “there are some exceptions”
  5. pseudocounts are based on prior knowledge of the most common amino acid distributions observed in a database of many protein alignments probabilities are calcualted for ever amino acid at every position position aa allowed 5Y all 20
  6. Ideal case: a variety of amino acids have had the time to evolve at positions not important for function. many highly identical sequences e.g. viral proteins, Ig’s (can be fixed by going to smaller database)
  7. 1764 substitutions that affect function 2240 substitutions that give no phenotype Intermediate grouped with null Mention intermediate grouped with null 15% better total prediction accuracy 10% increase in experimental prediction accuracy
  8. white: tolerate &gt;= 6 substitutions in assay red : positions high false positive error
  9. what genes in whitehead SNPs. candidate genes for coronary artery disease, type II diabetes, schizophrenia
  10. Substitutions were first identified in patients and then deposited into dbSNP. Thus it makes sense that the substitutions should be preicted as damaging.
  11. when purine repressor , a LacI paralogue, used for prediction on LacI, Variagenics only predicted 19% of the substitutions that have an effect were correctly predicted as damaging.
  12. There are two genetic approaches that make use of the variation around genes to find disease loci. Haplotypes may be stronger predictors of phenotype (mirvana, chakravarti) haplotype a set of alleles grouped together haplotype is a group of SNPs that are linked together tagSNPs are most informative Neil Risch – reduced positive with direct appraoch
  13. Is the direct approach possible? Hoogendoorn, Bastiaan used reporter gene assays in cell lines. We have used denaturing high performance liquid chromatography to screen the first 500 bp of the 5&apos; flanking region of 170 opportunistically selected genes identified from the Eukaryotic Promoter Database (EPD) for common polymorphisms. Using a screening set of 16 chromosomes, single-nucleotide polymorphisms were found in approximately 35% of genes. It was attempted to clone each of these promoters into a T-vector constructed from the reporter gene vector pGL3. The relative ability of each promoter haplotype to promote transcription of the luciferase gene was tested in each of three human cell lines (HEK293, JEG and TE671) using a co-transfected SEAP-CMV plasmid as a control. The findings suggest that around a third of promoter variants may alter gene expression to a functionally relevant extent .
  14. causal variant may not have been identified. 80% common identified in European. 50% in Africans (Nickerson) rare variant some genes have no coverage – there may be no nsSNP or it has not yet been identified and deposited in dbSNP
  15. dbSNP 120, 3.5 million double hit and snps with frequencies