SlideShare ist ein Scribd-Unternehmen logo
1 von 37
Downloaden Sie, um offline zu lesen
Protein-Protein Interactions Prediction
Sergey Knyazev
November 21, 2012
Outline
1) Introduction
2) Protein-Protein interaction
3) Protein-Protein interaction databases
4) Protein-Protein interaction prediction
Introduction
1) Introduction
2) Protein-Protein interaction
3) Protein-Protein interaction databases
4) Protein-Protein interaction prediction
There is Huge Ammount of
Interactions in a Cell
Example: Possible molecular interactions
in a spreading cell.
There is Many Ways Biomolecules
Interacts in a Cell.
Protein-Protein Interaction
1) Introduction
2) Protein-Protein interaction
3) Protein-Protein interaction databases
4) Protein-Protein interaction prediction
Protein-Protein Interaction
●
Physical contacts with molecular docking between
proteins that occur in a cell or in a living organism.
●
Not just a ‘‘functional contact’’: The existence of
many other types of functional links between
biomolecular entities (genes, proteins, metabolites,
etc.) in living organisms should not be confused
with protein physical interactions.
●
‘‘Specific contact’’, not just all proteins that bump
into each other by chance.
●
Should be excluded interactions that a protein
experiences when it is being made, folded, quality
checked, or degraded.
Protein-Protein Interaction (PPI)
detection
PPIs Detection Methods
Protein-Protein Interaction
Databases
1) Introduction
2) Protein-Protein interaction
3) Protein-Protein interaction
databases
4) Protein-Protein interaction prediction
Protein-Protein Interaction
Databases
●
BIND - Biomolecular Interaction Network Database;
●
BioGRID - Biological General Repository for
Interaction Datasets;
●
DIP - Database of Interacting Proteins;
●
IntAct - IntAct Molecular Interaction Database;
●
HPRD - Human Protein Reference Database
●
MINT - Molecular INTeraction database;
●
PIPs - Human PPI Prediction database;
●
STRING - Known and Predicted Protein-Protein
Interactions.
PPI Network Derived from
Databases
PIPs human PPIs database
●
Contains predictions of 37 000 high probability
interactions of which 34 000 are not reported in the
interaction databases HPRD, BIND, DIP or OPHID.
●
Interactions predicted by a naive Bayesian model.
The method combines information from gene co-
expression, orthology, co-occurrence of domains,
post-translational modifications, co-localization of
the proteins within the cell and analysis of the local
topology of the predicted PPI network.
●
Based on a prediction algorithm described bellow...
Protein-Protein interaction
prediction
1) Introduction
2) Protein-Protein interaction
3) Protein-Protein interaction databases
4) Protein-Protein interaction
prediction
Protein-Protein Interaction
Prediction
●
The prediction of human protein-protein
interactions was investigated in a Bayesian
framework by considering combinations of
individual protein features known to be indicative
of interaction.
●
The seven individual features are used.
●
The features are grouped into five distinct
modules: Expression (E), Ortology(O),
Combined(C), Disorder(D), Transitive(T).
Expression Module
●
Data Source:
–
GDS596 from the Gene Expression Omnibus
●
Description:
–
Gene co-expression profiles from 79 physiologically normal
tissues obtained from various sources
●
Scoring function:
–
Pearson correlation of coexpression over all conditions
●
Bins:
–
20 of equal size covering the correlation value
range (-1 to +1)
Orthology Module
●
Data Source:
–
InParanoid, BIND, DIP and GRID databases
●
Description:
–
Interactions of homologous protein pairs from yeast, fly, worm and human
●
Scoring function:
–
Organism-based using InParanoid score
●
13 Bins:
–
High, medium and low confidence bins were defined for human protein pairs that have
interacting orthologs in either yeast, fly or worm (for a total of 9 bins)
–
two bin for human pairs that have interacting paralogs in human (a medium and a low
confidence)
–
one bin for human pairs that have interacting homologs in more than one organism
–
one bin for human pairs that have only noninteracting orthologs
Combined Module
●
This module incorporates three distinct features in a nonnaïve
Bayesian framework: subcellular localization, domain co-
occurrence and post-translational modification co-occurrence.
●
Localization:
–
Data source:
●
PSLT predictions
–
Description:
●
PSLT is a human subcellular localization predictor that considers nine different
compartments (ER, Golgi, cytosol, nucleus, peroxisome, plasma membrane,
lysosome, mitochondria and extracellular)
–
Scoring function:
●
Qualitative score: proximity of compartments
–
4 bins:
●
same, neighboring, different compartments, or not localized
Combined module
●
Domain co-occurrence
–
Data source:
●
InterPro and Pfam
–
Description:
●
Protein domains and motifs
–
Scoring function:
●
The chi-square test was used as a measure of the likelihood of co-
occurrence of specific InterPro domains and motifs in protein pairs
●
Chi-square scores were calculated for all pairs of domains/motifs
that occurred in the training data
–
Bins:
●
5 covering range of Chi-square scores
Combined module
●
PTM co-occurrence
–
Data source:
●
HPRD and UniProt
–
Description:
●
Post-translational modifications
–
Scoring function:
–
Bins:
●
4 covering range of PTM scores
Disorder Module
●
Data source:
–
VLS2 predictions
●
Description:
–
Prediction of protein intrinsic disorder
●
Scoring function:
–
Sum of the percent disorder for each protein in a pair
●
Bins:
–
6 covering range of scoring function (0 to 200%)
Transitive Module
●
Description:
–
Module that considers local
topology of underlying network
predicted using combinations of
above features
●
Scoring function:
●
Bins:
–
5 covering range of scoring
function
Independence of the Modules
●
The final likelihood ratio output by the predictor is only
representative of the true likelihood of interaction of a protein pair if
the modules considered are independent. If the modules were not
independent, some likelihood ratios would likely be overestimated.
●
Previous studies have demonstrated that some of the features
considered here are indeed independent.
●
Independence of all modules used in our predictor was verified by
calculating Pearson correlation coefficients for all pairs of modules.
Architecture of the Predictor and
Likelihoods of the Modules
Posterior Odds Ratio Estimation
●
f1, … , fn — features
●
I — interaction
●
~I — non-interaction
Accuracy of the Predictors
●
In order to analyze the predictions, five-fold cross validation
experiments were performed and the area under partial ROC
(receiver operator characteristic) curves (partial AUCs) measured.
●
T is the total number of positives in the test set
●
Ti is the number of positives that score higher than the ith highest
scoring negative
Prediction Accuracy of Different Combinations of Modules
PPI Prediction by Single Module
PPI Prediction by Combination of
Modules
Receiver Operator Characteristic
(ROC)
Comparison with Other Interaction
Datasets
●
Estimated datasets:
–
Rhodes probabilistic dataset
–
LR400 (derived from our predictors)
–
Lehner orthology-derived dataset
●
The false positive rates:
●
Reference datasets:
–
Literature-mined Ramani dataset
–
Human Protein Reference Database
(HPRD)
Comparison with Other Interaction
Datasets
Independent Validation
Conclusion
●
Predicted over 37000 human protein
interactions
●
Explored a subspace of the human
interactome that has not been
investigated by previous large
interaction datasets.
References
●
Protein–Protein Interactions Essentials: Key Concepts to
Building and Analyzing Interactome Networks 2010
Javier De Las Rivas, Celia Fontanillo
●
PIPs: human protein–protein interaction prediction
database 2008
Mark D. McDowall, Michelle S. Scott and Geoffrey J. Barton
●
Probabilistic prediction and ranking of human protein-
protein interactions 2007
Michelle S Scott and Geoffrey J Barton
Thank you!

Weitere ähnliche Inhalte

Was ist angesagt?

co immunoprecipitation
co immunoprecipitationco immunoprecipitation
co immunoprecipitationssuser60e34a
 
Cytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networksCytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networksBITS
 
Proteomics and protein-protein interaction
Proteomics  and protein-protein interactionProteomics  and protein-protein interaction
Proteomics and protein-protein interactionSenthilkumarV25
 
Protein-protein interaction (PPI)
Protein-protein interaction (PPI)Protein-protein interaction (PPI)
Protein-protein interaction (PPI)N Poorin
 
Protein protein interaction basic
Protein protein interaction basicProtein protein interaction basic
Protein protein interaction basicAyesha Aftab
 
Protein interaction Creative Biomart
Protein interaction Creative BiomartProtein interaction Creative Biomart
Protein interaction Creative BiomartCreative BioMart
 
Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)Sai Ram
 
Protein protein interaction
Protein protein interactionProtein protein interaction
Protein protein interactionAashish Patel
 
The yeast two hybrid system and ChIP
The yeast two hybrid system and ChIPThe yeast two hybrid system and ChIP
The yeast two hybrid system and ChIPAbhishek M
 
yeast two hybrid system
yeast two hybrid systemyeast two hybrid system
yeast two hybrid systemSheetal Mehla
 
Yeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction StudiesYeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction Studiesajithnandanam
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactionsTasuduq Yaqoob
 
Biotech 2012 spring-6_protein_interactions_0
Biotech 2012 spring-6_protein_interactions_0Biotech 2012 spring-6_protein_interactions_0
Biotech 2012 spring-6_protein_interactions_0BioinformaticsInstitute
 
Protein-protein interaction
Protein-protein interactionProtein-protein interaction
Protein-protein interactionsigma-tau
 
Gene regulatory networks
Gene regulatory networksGene regulatory networks
Gene regulatory networksMadiheh
 
Yeast two hybrid
Yeast two hybridYeast two hybrid
Yeast two hybridhina ojha
 

Was ist angesagt? (20)

co immunoprecipitation
co immunoprecipitationco immunoprecipitation
co immunoprecipitation
 
Cytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networksCytoscape: Gene coexppression and PPI networks
Cytoscape: Gene coexppression and PPI networks
 
Ppi
PpiPpi
Ppi
 
Proteomics and protein-protein interaction
Proteomics  and protein-protein interactionProteomics  and protein-protein interaction
Proteomics and protein-protein interaction
 
Protein-protein interaction (PPI)
Protein-protein interaction (PPI)Protein-protein interaction (PPI)
Protein-protein interaction (PPI)
 
Protein protein interaction basic
Protein protein interaction basicProtein protein interaction basic
Protein protein interaction basic
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactions
 
Protein interaction Creative Biomart
Protein interaction Creative BiomartProtein interaction Creative Biomart
Protein interaction Creative Biomart
 
Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)
 
Protein protein interaction
Protein protein interactionProtein protein interaction
Protein protein interaction
 
The yeast two hybrid system and ChIP
The yeast two hybrid system and ChIPThe yeast two hybrid system and ChIP
The yeast two hybrid system and ChIP
 
yeast two hybrid system
yeast two hybrid systemyeast two hybrid system
yeast two hybrid system
 
Yeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction StudiesYeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction Studies
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactions
 
Biotech 2012 spring-6_protein_interactions_0
Biotech 2012 spring-6_protein_interactions_0Biotech 2012 spring-6_protein_interactions_0
Biotech 2012 spring-6_protein_interactions_0
 
Protein-protein interaction
Protein-protein interactionProtein-protein interaction
Protein-protein interaction
 
Proteomics
ProteomicsProteomics
Proteomics
 
Gene regulatory networks
Gene regulatory networksGene regulatory networks
Gene regulatory networks
 
Yeast two hybrid
Yeast two hybridYeast two hybrid
Yeast two hybrid
 
Yeast two hybrid
Yeast two hybrid Yeast two hybrid
Yeast two hybrid
 

Andere mochten auch

Bioinformatics and functional genomics
Bioinformatics and functional genomicsBioinformatics and functional genomics
Bioinformatics and functional genomicsAisha Kalsoom
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomicsPawan Kumar
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomicsajay301
 

Andere mochten auch (7)

Bioinformatics and functional genomics
Bioinformatics and functional genomicsBioinformatics and functional genomics
Bioinformatics and functional genomics
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Genomics
GenomicsGenomics
Genomics
 
Types of genomics ppt
Types of genomics pptTypes of genomics ppt
Types of genomics ppt
 
Genomics
GenomicsGenomics
Genomics
 

Ähnlich wie Slides 0

Protein protein interaction, functional proteomics
Protein protein interaction, functional proteomicsProtein protein interaction, functional proteomics
Protein protein interaction, functional proteomicsKAUSHAL SAHU
 
A Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products ResearchA Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products ResearchHuda Nazeer
 
Yeast Two Hybrid System
Yeast Two Hybrid SystemYeast Two Hybrid System
Yeast Two Hybrid SystemSuby Mon Benny
 
Role of genomics and proteomics
Role of genomics and proteomicsRole of genomics and proteomics
Role of genomics and proteomicsPavana K A
 
University of Texas at Austin
University of Texas at AustinUniversity of Texas at Austin
University of Texas at Austinbutest
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicshemantbreeder
 
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSTRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSHEETHUMOLKS
 
Proteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASyProteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASyChrist College, Rajkot
 
Systems Biology Approaches to Cancer
Systems Biology Approaches to CancerSystems Biology Approaches to Cancer
Systems Biology Approaches to CancerRaunak Shrestha
 
Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...Andrei KUCHARAVY
 
Proteomics - Analysis and integration of large-scale data sets
Proteomics - Analysis and integration of large-scale data setsProteomics - Analysis and integration of large-scale data sets
Proteomics - Analysis and integration of large-scale data setsLars Juhl Jensen
 
STRING - Modeling of pathways through cross-species integration of large-scal...
STRING - Modeling of pathways through cross-species integration of large-scal...STRING - Modeling of pathways through cross-species integration of large-scal...
STRING - Modeling of pathways through cross-species integration of large-scal...Lars Juhl Jensen
 
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...KarthigaRavichandran3
 
Proteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programmeProteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programmeSumanthBT1
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and toolsKAUSHAL SAHU
 

Ähnlich wie Slides 0 (20)

Protein protein interaction, functional proteomics
Protein protein interaction, functional proteomicsProtein protein interaction, functional proteomics
Protein protein interaction, functional proteomics
 
Systems biology
Systems biologySystems biology
Systems biology
 
A Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products ResearchA Systems Biology Approach to Natural Products Research
A Systems Biology Approach to Natural Products Research
 
Applied Bioinformatics Assignment 5docx
Applied Bioinformatics Assignment  5docxApplied Bioinformatics Assignment  5docx
Applied Bioinformatics Assignment 5docx
 
Yeast Two Hybrid System
Yeast Two Hybrid SystemYeast Two Hybrid System
Yeast Two Hybrid System
 
Role of genomics and proteomics
Role of genomics and proteomicsRole of genomics and proteomics
Role of genomics and proteomics
 
University of Texas at Austin
University of Texas at AustinUniversity of Texas at Austin
University of Texas at Austin
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSTRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
 
presentation
presentationpresentation
presentation
 
Proteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASyProteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASy
 
Systems Biology Approaches to Cancer
Systems Biology Approaches to CancerSystems Biology Approaches to Cancer
Systems Biology Approaches to Cancer
 
Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...
 
Proteomics
ProteomicsProteomics
Proteomics
 
Proteomics - Analysis and integration of large-scale data sets
Proteomics - Analysis and integration of large-scale data setsProteomics - Analysis and integration of large-scale data sets
Proteomics - Analysis and integration of large-scale data sets
 
STRING - Modeling of pathways through cross-species integration of large-scal...
STRING - Modeling of pathways through cross-species integration of large-scal...STRING - Modeling of pathways through cross-species integration of large-scal...
STRING - Modeling of pathways through cross-species integration of large-scal...
 
proteomics
 proteomics proteomics
proteomics
 
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
 
Proteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programmeProteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programme
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and tools
 

Mehr von BioinformaticsInstitute

Comparative Genomics and de Bruijn graphs
Comparative Genomics and de Bruijn graphsComparative Genomics and de Bruijn graphs
Comparative Genomics and de Bruijn graphsBioinformaticsInstitute
 
Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
 Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес... Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...BioinformaticsInstitute
 
Вперед в прошлое. Методы генетической диагностики древней днк
Вперед в прошлое. Методы генетической диагностики древней днкВперед в прошлое. Методы генетической диагностики древней днк
Вперед в прошлое. Методы генетической диагностики древней днкBioinformaticsInstitute
 
"Зачем биологам суперкомпьютеры", Александр Предеус
"Зачем биологам суперкомпьютеры", Александр Предеус"Зачем биологам суперкомпьютеры", Александр Предеус
"Зачем биологам суперкомпьютеры", Александр ПредеусBioinformaticsInstitute
 
Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...
Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...
Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...BioinformaticsInstitute
 
Рак 101 (Мария Шутова, ИоГЕН РАН)
Рак 101 (Мария Шутова, ИоГЕН РАН)Рак 101 (Мария Шутова, ИоГЕН РАН)
Рак 101 (Мария Шутова, ИоГЕН РАН)BioinformaticsInstitute
 
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...BioinformaticsInstitute
 
Инвестиции в биоинформатику и биотех (Андрей Афанасьев)
Инвестиции в биоинформатику и биотех (Андрей Афанасьев)Инвестиции в биоинформатику и биотех (Андрей Афанасьев)
Инвестиции в биоинформатику и биотех (Андрей Афанасьев)BioinformaticsInstitute
 

Mehr von BioinformaticsInstitute (20)

Graph genome
Graph genome Graph genome
Graph genome
 
Nanopores sequencing
Nanopores sequencingNanopores sequencing
Nanopores sequencing
 
A superglue for string comparison
A superglue for string comparisonA superglue for string comparison
A superglue for string comparison
 
Comparative Genomics and de Bruijn graphs
Comparative Genomics and de Bruijn graphsComparative Genomics and de Bruijn graphs
Comparative Genomics and de Bruijn graphs
 
Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
 Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес... Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
Биоинформатический анализ данных полноэкзомного секвенирования: анализ качес...
 
Вперед в прошлое. Методы генетической диагностики древней днк
Вперед в прошлое. Методы генетической диагностики древней днкВперед в прошлое. Методы генетической диагностики древней днк
Вперед в прошлое. Методы генетической диагностики древней днк
 
Knime & bioinformatics
Knime & bioinformaticsKnime & bioinformatics
Knime & bioinformatics
 
"Зачем биологам суперкомпьютеры", Александр Предеус
"Зачем биологам суперкомпьютеры", Александр Предеус"Зачем биологам суперкомпьютеры", Александр Предеус
"Зачем биологам суперкомпьютеры", Александр Предеус
 
Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...
Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...
Иммунотерапия раковых опухолей: взгляд со стороны системной биологии. Максим ...
 
Рак 101 (Мария Шутова, ИоГЕН РАН)
Рак 101 (Мария Шутова, ИоГЕН РАН)Рак 101 (Мария Шутова, ИоГЕН РАН)
Рак 101 (Мария Шутова, ИоГЕН РАН)
 
Плюрипотентность 101
Плюрипотентность 101Плюрипотентность 101
Плюрипотентность 101
 
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
Секвенирование как инструмент исследования сложных фенотипов человека: от ген...
 
Инвестиции в биоинформатику и биотех (Андрей Афанасьев)
Инвестиции в биоинформатику и биотех (Андрей Афанасьев)Инвестиции в биоинформатику и биотех (Андрей Афанасьев)
Инвестиции в биоинформатику и биотех (Андрей Афанасьев)
 
Biodb 2011-everything
Biodb 2011-everythingBiodb 2011-everything
Biodb 2011-everything
 
Biodb 2011-05
Biodb 2011-05Biodb 2011-05
Biodb 2011-05
 
Biodb 2011-04
Biodb 2011-04Biodb 2011-04
Biodb 2011-04
 
Biodb 2011-03
Biodb 2011-03Biodb 2011-03
Biodb 2011-03
 
Biodb 2011-01
Biodb 2011-01Biodb 2011-01
Biodb 2011-01
 
Biodb 2011-02
Biodb 2011-02Biodb 2011-02
Biodb 2011-02
 
Ngs 3 1
Ngs 3 1Ngs 3 1
Ngs 3 1
 

Kürzlich hochgeladen

GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 

Kürzlich hochgeladen (20)

GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

Slides 0

  • 2. Outline 1) Introduction 2) Protein-Protein interaction 3) Protein-Protein interaction databases 4) Protein-Protein interaction prediction
  • 3. Introduction 1) Introduction 2) Protein-Protein interaction 3) Protein-Protein interaction databases 4) Protein-Protein interaction prediction
  • 4. There is Huge Ammount of Interactions in a Cell Example: Possible molecular interactions in a spreading cell.
  • 5. There is Many Ways Biomolecules Interacts in a Cell.
  • 6. Protein-Protein Interaction 1) Introduction 2) Protein-Protein interaction 3) Protein-Protein interaction databases 4) Protein-Protein interaction prediction
  • 7. Protein-Protein Interaction ● Physical contacts with molecular docking between proteins that occur in a cell or in a living organism. ● Not just a ‘‘functional contact’’: The existence of many other types of functional links between biomolecular entities (genes, proteins, metabolites, etc.) in living organisms should not be confused with protein physical interactions. ● ‘‘Specific contact’’, not just all proteins that bump into each other by chance. ● Should be excluded interactions that a protein experiences when it is being made, folded, quality checked, or degraded.
  • 10. Protein-Protein Interaction Databases 1) Introduction 2) Protein-Protein interaction 3) Protein-Protein interaction databases 4) Protein-Protein interaction prediction
  • 11. Protein-Protein Interaction Databases ● BIND - Biomolecular Interaction Network Database; ● BioGRID - Biological General Repository for Interaction Datasets; ● DIP - Database of Interacting Proteins; ● IntAct - IntAct Molecular Interaction Database; ● HPRD - Human Protein Reference Database ● MINT - Molecular INTeraction database; ● PIPs - Human PPI Prediction database; ● STRING - Known and Predicted Protein-Protein Interactions.
  • 12.
  • 13. PPI Network Derived from Databases
  • 14. PIPs human PPIs database ● Contains predictions of 37 000 high probability interactions of which 34 000 are not reported in the interaction databases HPRD, BIND, DIP or OPHID. ● Interactions predicted by a naive Bayesian model. The method combines information from gene co- expression, orthology, co-occurrence of domains, post-translational modifications, co-localization of the proteins within the cell and analysis of the local topology of the predicted PPI network. ● Based on a prediction algorithm described bellow...
  • 15. Protein-Protein interaction prediction 1) Introduction 2) Protein-Protein interaction 3) Protein-Protein interaction databases 4) Protein-Protein interaction prediction
  • 16. Protein-Protein Interaction Prediction ● The prediction of human protein-protein interactions was investigated in a Bayesian framework by considering combinations of individual protein features known to be indicative of interaction. ● The seven individual features are used. ● The features are grouped into five distinct modules: Expression (E), Ortology(O), Combined(C), Disorder(D), Transitive(T).
  • 17. Expression Module ● Data Source: – GDS596 from the Gene Expression Omnibus ● Description: – Gene co-expression profiles from 79 physiologically normal tissues obtained from various sources ● Scoring function: – Pearson correlation of coexpression over all conditions ● Bins: – 20 of equal size covering the correlation value range (-1 to +1)
  • 18. Orthology Module ● Data Source: – InParanoid, BIND, DIP and GRID databases ● Description: – Interactions of homologous protein pairs from yeast, fly, worm and human ● Scoring function: – Organism-based using InParanoid score ● 13 Bins: – High, medium and low confidence bins were defined for human protein pairs that have interacting orthologs in either yeast, fly or worm (for a total of 9 bins) – two bin for human pairs that have interacting paralogs in human (a medium and a low confidence) – one bin for human pairs that have interacting homologs in more than one organism – one bin for human pairs that have only noninteracting orthologs
  • 19. Combined Module ● This module incorporates three distinct features in a nonnaïve Bayesian framework: subcellular localization, domain co- occurrence and post-translational modification co-occurrence. ● Localization: – Data source: ● PSLT predictions – Description: ● PSLT is a human subcellular localization predictor that considers nine different compartments (ER, Golgi, cytosol, nucleus, peroxisome, plasma membrane, lysosome, mitochondria and extracellular) – Scoring function: ● Qualitative score: proximity of compartments – 4 bins: ● same, neighboring, different compartments, or not localized
  • 20. Combined module ● Domain co-occurrence – Data source: ● InterPro and Pfam – Description: ● Protein domains and motifs – Scoring function: ● The chi-square test was used as a measure of the likelihood of co- occurrence of specific InterPro domains and motifs in protein pairs ● Chi-square scores were calculated for all pairs of domains/motifs that occurred in the training data – Bins: ● 5 covering range of Chi-square scores
  • 21. Combined module ● PTM co-occurrence – Data source: ● HPRD and UniProt – Description: ● Post-translational modifications – Scoring function: – Bins: ● 4 covering range of PTM scores
  • 22. Disorder Module ● Data source: – VLS2 predictions ● Description: – Prediction of protein intrinsic disorder ● Scoring function: – Sum of the percent disorder for each protein in a pair ● Bins: – 6 covering range of scoring function (0 to 200%)
  • 23. Transitive Module ● Description: – Module that considers local topology of underlying network predicted using combinations of above features ● Scoring function: ● Bins: – 5 covering range of scoring function
  • 24. Independence of the Modules ● The final likelihood ratio output by the predictor is only representative of the true likelihood of interaction of a protein pair if the modules considered are independent. If the modules were not independent, some likelihood ratios would likely be overestimated. ● Previous studies have demonstrated that some of the features considered here are indeed independent. ● Independence of all modules used in our predictor was verified by calculating Pearson correlation coefficients for all pairs of modules.
  • 25. Architecture of the Predictor and Likelihoods of the Modules
  • 26. Posterior Odds Ratio Estimation ● f1, … , fn — features ● I — interaction ● ~I — non-interaction
  • 27. Accuracy of the Predictors ● In order to analyze the predictions, five-fold cross validation experiments were performed and the area under partial ROC (receiver operator characteristic) curves (partial AUCs) measured. ● T is the total number of positives in the test set ● Ti is the number of positives that score higher than the ith highest scoring negative
  • 28. Prediction Accuracy of Different Combinations of Modules
  • 29. PPI Prediction by Single Module
  • 30. PPI Prediction by Combination of Modules
  • 32. Comparison with Other Interaction Datasets ● Estimated datasets: – Rhodes probabilistic dataset – LR400 (derived from our predictors) – Lehner orthology-derived dataset ● The false positive rates: ● Reference datasets: – Literature-mined Ramani dataset – Human Protein Reference Database (HPRD)
  • 33. Comparison with Other Interaction Datasets
  • 35. Conclusion ● Predicted over 37000 human protein interactions ● Explored a subspace of the human interactome that has not been investigated by previous large interaction datasets.
  • 36. References ● Protein–Protein Interactions Essentials: Key Concepts to Building and Analyzing Interactome Networks 2010 Javier De Las Rivas, Celia Fontanillo ● PIPs: human protein–protein interaction prediction database 2008 Mark D. McDowall, Michelle S. Scott and Geoffrey J. Barton ● Probabilistic prediction and ranking of human protein- protein interactions 2007 Michelle S Scott and Geoffrey J Barton