SlideShare ist ein Scribd-Unternehmen logo
1 von 1
Downloaden Sie, um offline zu lesen
Predicting Performance in Recommender Systems
                                                                                                  Alejandro Bellogín
                                                                     Supervised by Pablo Castells and Iván Cantador
                                                           Information Retrieval Group, Universidad Autónoma de Madrid, Spain
                                                                                alejandro.bellogin@uam.es
                                                                                                                                                                                                                                                            IRG
                                                                                                                                                                                                                                                            IR Group @ UAM


          Motivation                                                                                                    Research questions
 Is it possible to predict the accuracy of a recommendation?                                             1. Is it possible to define a performance prediction theory for recommender
 E.g., we can decide whether to deliver a recommendation or not,                                            systems in a sound, formal way?
 depending on such prediction. Or, even, to combine different
                                                                                                         2. Is it possible to adapt query performance techniques (from IR) to the
 recommenders according to the expected performance of each one.
                                                                                                            recommendation task?
                                                                                                         3. What kind of evaluation should be performed? Is IR evaluation still valid in our
          Hypothesis                                                                                        problem?
 Data that are commonly available to a Recommender System could
 contain signals that enable an a priori estimation of the success of
                                                                                                         4. What kind of recommendation problems can these models be applied to?
 the recommendation


          Research question 1                                                                 Research question 2
 Is it possible to define a performance prediction theory                                                                                                                                                                     User clarity
                                                                            Is it possible to adapt query performance
 for recommender systems in a sound, formal way?                            techniques (from IR) to the recommendation task? It captures the uncertainty in user’s data
                                                                            • In Information Retrieval: “Estimation of the system’s
                                                                                                                                                                       p  x | u   user’s model
 a) Define a predictor of performance                                          performance in response to a specific query”         clarity  u    p  x | u  log 
                                                                                                                                                                       p  x     
                                                                                                                                                    x X                                                                                                                   c                    system’s model
                         = (u, i, r, …)                                   •       Several predictors proposed                                                                                                               Distance between the user’s and the
 b) Agree on a performance metric                                           •       We focus on query clarity  user clarity                                                                                                  system’s probability model
                         = (u, i, r, …)
                                                                                        0.7                                                           RatItem                                       p_c(x)
                                                                                        0.6                                                                                                         p(x|u1)
                                                                                                                                                                                                    p(x|u2)
                                                                                                                                                                                                                              We propose 3 formulations (for space X):
 c) Check predictive power by measuring correlation
                                                                                        0.5



                                                                                                                                                                                                                              • Based on ratings
                                                                                        0.4




         corr([(x1), …, (xn)], [(x1), …, (xn)])
                                                                                        0.3

                                                                                        0.2
                                                                                                                                                                                                                              • Based on items
                                                                                                                                                                                                                              • Based on ratings and items
                                                                                        0.1

 d) Evaluate final performance: dynamic vs static                                       0.0
                                                                                              4           3         5           2         1        Ratings 4         3       5       2          1




Correlation coefficients
                                                        r ~ 0.57
                                                                                              Research question 4
                                                                            What kind of recommendation problems can these models be applied to?
Pearson: linear correlation                                                 •      Whenever a combination of strategies is available
Spearman: rank correlation
                                                                            • Example 1: dynamic neighbor weighting                                                                                             • Example 2: dynamic ensemble recommendation
                                                                            • The user’s neighbors are                                                                                weighted                  • Weight is the same for every item and user
                                                                              according to their similarity                                                                                                       (learnt from training)
                                                                            • Can we take into account the uncertainty in                                                                                       • What about boosting those users predicted to
          Research question 3                                                 neighbor’s data?                                                                                                                    perform better for some recommender?
 What kind of evaluation should be performed? Is IR                         • User neighbor weighting:                                                                                                          • Hybrid recommendation:
 evaluation still valid in our problem?                                            • Static:                                g  u, i   C            sim u, v   rat  v, i                                  • Static: g u, i     gR1 u, i   1     gR2 u, i 
                                                                                                                                                   vN [ u ]
 •    In IR: Mean Average Precision + correlation                             • Dynamic: g  u, i   C  γ  v   sim u, v   rat  v, i  • Dynamic: g u, i   γ u   gR1 u, i   1  γ u   gR2 u, i 
                                                                                                       vN [ u ]
           50 points (queries) vs 1000+ points (users)                      • Correlation analysis:                                           • Correlation analysis:
 •    Performance metric is not clear:
             error-based?
                                                                            • Performance:
             precision-based?                                                   0,98                               Standard CF                            0,88 b) Neighbourhood size: 500
                                                                                                                                                                                  Standard CF                            b) 80% training
                                                                                 0,96                               Clarity-enhanced CF                    0,87                    Clarity-enhanced CF


 • What is performance?                                                          0,94                                                                      0,86
                                                                                                                                                                                                                                                                 nDCG@50
                                                                                 0,92
                                                                                 0,90
                                                                                                                                                           0,85
                                                                                                                                                                                                                • Performance:
                                                                                                                                                     MAE




                                                                                                                                                                                                                                                      0.2
                                                                           MAE




          It may depend on the final application
                                                                                                                                                           0,84
                                                                                 0,88
                                                                                                                                                           0,83                                                                                      0.15
                                                                                 0,86
                                                                                 0,84                                                                      0,82

 • Possible bias                                                                 0,82
                                                                                 0,80
                                                                                                                                                           0,81
                                                                                                                                                           0,80
                                                                                                                                                                                                                                                      0.1


                                                                                                                                                                                                                                                     0.05

          E.g., towards users or items with larger profiles
                                                                                         10       20     30    40    50    60       70   80   90       10      20 100 150 200 250 300 350 400 450 500
                                                                                                                                                                   30 40 50 60 70 80 90                        100 150 200 250 300 350 400 450 500
                                                                                                       % of ratings for training                                         Neighbourhood size                                                            0
                                                                                                                                                                                                                                                            H1   H2                      H3           H4

                                                                                                                                                                                                                                                                 Adaptive       Static




          Future Work                                                                                                                                                                                                 Publications
                  Explore other input sources                              We need a theoretical background                                                                                                   • A Performance Prediction Aproach to Enhance
                                                                           Why do some predictors work better?                                                                                                  Collaborative Filtering Performance. A. Bellogín
      Implicit data (with time)       Social links
                                                                                                                                                                                                                and P. Castells. In ECIR 2010.
      Item predictors                 o Use graph-based measures
                                                                                                                                                                                                              • Predicting the Performance of Recommender
      o Different recommender          as indicators of user strength
                                                                                                                                                                                                                Systems: An Information Theoretic Approach. A.
      behavior depending on item       o   First results are positive                                                                                                                                           Bellogín, P. Castells, and I. Cantador. In ICTIR
      attributes                                                                                                                                                                                                2011.
      o They would allow to capture                                                                                                                                                                           • Self-adjusting Hybrid Recommenders based on
      popularity, diversity, etc.                                                                                                                                                                               Social Network Analysis. A. Bellogín, P. Castells,
                                                                           Larger datasets
                                                                                                                                                                                                                and I. Cantador. In SIGIR 2011.
                                                 5th ACM Conference on Recommender Systems (RecSys2011) – Doctoral Symposium Acknowledgments to the National Science Foundation
                                                                        Chicago, USA, 23-27 October 2011                            for the funding to attend the conference

Weitere ähnliche Inhalte

Was ist angesagt?

97cogsci.doc
97cogsci.doc97cogsci.doc
97cogsci.docbutest
 
Calling Dr Watson To Radiology - RSNA Presentation
Calling Dr Watson To Radiology - RSNA PresentationCalling Dr Watson To Radiology - RSNA Presentation
Calling Dr Watson To Radiology - RSNA PresentationNick van Terheyden
 
Meta analysis-sloan
Meta analysis-sloanMeta analysis-sloan
Meta analysis-sloanJohn Turner
 
IRJET - Ensembling Reinforcement Learning for Portfolio Management
IRJET -  	  Ensembling Reinforcement Learning for Portfolio ManagementIRJET -  	  Ensembling Reinforcement Learning for Portfolio Management
IRJET - Ensembling Reinforcement Learning for Portfolio ManagementIRJET Journal
 
Performance analysis of machine learning algorithms on self localization system1
Performance analysis of machine learning algorithms on self localization system1Performance analysis of machine learning algorithms on self localization system1
Performance analysis of machine learning algorithms on self localization system1Venkat Projects
 
Paradigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learningParadigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learningSalford Systems
 
New challenges monolixday2011
New challenges monolixday2011New challenges monolixday2011
New challenges monolixday2011blaudez
 
5004 implementing aggregate_awareness_in_sap_business_objects
5004 implementing aggregate_awareness_in_sap_business_objects5004 implementing aggregate_awareness_in_sap_business_objects
5004 implementing aggregate_awareness_in_sap_business_objectsYogeeswar Reddy
 
Statistically adaptive learning for a general class of..
Statistically adaptive learning for a general class of..Statistically adaptive learning for a general class of..
Statistically adaptive learning for a general class of..Accenture
 
Secondary mathematics wednesday august 22 2012
Secondary mathematics   wednesday august 22 2012Secondary mathematics   wednesday august 22 2012
Secondary mathematics wednesday august 22 2012brearatliff
 
Chapter6.doc
Chapter6.docChapter6.doc
Chapter6.docbutest
 

Was ist angesagt? (12)

97cogsci.doc
97cogsci.doc97cogsci.doc
97cogsci.doc
 
G
GG
G
 
Calling Dr Watson To Radiology - RSNA Presentation
Calling Dr Watson To Radiology - RSNA PresentationCalling Dr Watson To Radiology - RSNA Presentation
Calling Dr Watson To Radiology - RSNA Presentation
 
Meta analysis-sloan
Meta analysis-sloanMeta analysis-sloan
Meta analysis-sloan
 
IRJET - Ensembling Reinforcement Learning for Portfolio Management
IRJET -  	  Ensembling Reinforcement Learning for Portfolio ManagementIRJET -  	  Ensembling Reinforcement Learning for Portfolio Management
IRJET - Ensembling Reinforcement Learning for Portfolio Management
 
Performance analysis of machine learning algorithms on self localization system1
Performance analysis of machine learning algorithms on self localization system1Performance analysis of machine learning algorithms on self localization system1
Performance analysis of machine learning algorithms on self localization system1
 
Paradigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learningParadigm shifts in wildlife and biodiversity management through machine learning
Paradigm shifts in wildlife and biodiversity management through machine learning
 
New challenges monolixday2011
New challenges monolixday2011New challenges monolixday2011
New challenges monolixday2011
 
5004 implementing aggregate_awareness_in_sap_business_objects
5004 implementing aggregate_awareness_in_sap_business_objects5004 implementing aggregate_awareness_in_sap_business_objects
5004 implementing aggregate_awareness_in_sap_business_objects
 
Statistically adaptive learning for a general class of..
Statistically adaptive learning for a general class of..Statistically adaptive learning for a general class of..
Statistically adaptive learning for a general class of..
 
Secondary mathematics wednesday august 22 2012
Secondary mathematics   wednesday august 22 2012Secondary mathematics   wednesday august 22 2012
Secondary mathematics wednesday august 22 2012
 
Chapter6.doc
Chapter6.docChapter6.doc
Chapter6.doc
 

Andere mochten auch

2012 Digital Opportunities
2012 Digital Opportunities2012 Digital Opportunities
2012 Digital OpportunitiesCLOSEUP Agency
 
Leergang Opleidingsmanagement
Leergang OpleidingsmanagementLeergang Opleidingsmanagement
Leergang OpleidingsmanagementMarvinMoentadj
 
OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...
OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...
OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...EXPOO
 
PEShare.co.uk Shared Resource
PEShare.co.uk Shared ResourcePEShare.co.uk Shared Resource
PEShare.co.uk Shared Resourcepeshare.co.uk
 

Andere mochten auch (6)

2012 Digital Opportunities
2012 Digital Opportunities2012 Digital Opportunities
2012 Digital Opportunities
 
Leergang Opleidingsmanagement
Leergang OpleidingsmanagementLeergang Opleidingsmanagement
Leergang Opleidingsmanagement
 
OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...
OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...
OOcamp, presentatie Jongerenwelzijn, project OO voor ouders met tieners in ar...
 
Segundo examen
Segundo examenSegundo examen
Segundo examen
 
PEShare.co.uk Shared Resource
PEShare.co.uk Shared ResourcePEShare.co.uk Shared Resource
PEShare.co.uk Shared Resource
 
Pratica i
Pratica iPratica i
Pratica i
 

Ähnlich wie Predicting performance in Recommender Systems - Poster

Predicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - SlidesPredicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - SlidesAlejandro Bellogin
 
Predicting Employee Attrition
Predicting Employee AttritionPredicting Employee Attrition
Predicting Employee AttritionShruti Mohan
 
2013-1 Machine Learning Lecture 01 - Pattern Recognition
2013-1 Machine Learning Lecture 01 - Pattern Recognition2013-1 Machine Learning Lecture 01 - Pattern Recognition
2013-1 Machine Learning Lecture 01 - Pattern RecognitionDongseo University
 
LAK13 linkedup tutorial_evaluation_framework
LAK13 linkedup tutorial_evaluation_frameworkLAK13 linkedup tutorial_evaluation_framework
LAK13 linkedup tutorial_evaluation_frameworkHendrik Drachsler
 
Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...
Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...
Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...Kalle
 
Neural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learningNeural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learningFrancisco E. Figueroa-Nigaglioni
 
IRJET - A Novel Approach for Software Defect Prediction based on Dimensio...
IRJET -  	  A Novel Approach for Software Defect Prediction based on Dimensio...IRJET -  	  A Novel Approach for Software Defect Prediction based on Dimensio...
IRJET - A Novel Approach for Software Defect Prediction based on Dimensio...IRJET Journal
 
Adaptive Neural Fuzzy Inference System for Employability Assessment
Adaptive Neural Fuzzy Inference System for Employability AssessmentAdaptive Neural Fuzzy Inference System for Employability Assessment
Adaptive Neural Fuzzy Inference System for Employability AssessmentEditor IJCATR
 
A Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
A Review on Reasoning System, Types, and Tools and Need for Hybrid ReasoningA Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
A Review on Reasoning System, Types, and Tools and Need for Hybrid ReasoningBRNSSPublicationHubI
 
Screening of Mental Health in Adolescents using ML.pptx
Screening of Mental Health in Adolescents using ML.pptxScreening of Mental Health in Adolescents using ML.pptx
Screening of Mental Health in Adolescents using ML.pptxNitishChoudhary23
 
Chi1417 Richardson
Chi1417 RichardsonChi1417 Richardson
Chi1417 Richardsonmerichar
 
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...IRJET Journal
 
IRJET - House Price Predictor using ML through Artificial Neural Network
IRJET - House Price Predictor using ML through Artificial Neural NetworkIRJET - House Price Predictor using ML through Artificial Neural Network
IRJET - House Price Predictor using ML through Artificial Neural NetworkIRJET Journal
 
An Aspect Based Resource Recommendation System for Smart Hotels
An Aspect Based Resource Recommendation System for Smart HotelsAn Aspect Based Resource Recommendation System for Smart Hotels
An Aspect Based Resource Recommendation System for Smart HotelsEduardo Castillejo Gil
 
IRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data MiningIRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data MiningIRJET Journal
 
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...Francesca Lazzeri, PhD
 
IRJET- A Survey on Image Retrieval using Machine Learning
IRJET- A Survey on Image Retrieval using Machine LearningIRJET- A Survey on Image Retrieval using Machine Learning
IRJET- A Survey on Image Retrieval using Machine LearningIRJET Journal
 
CFAR-m Presentation English
CFAR-m Presentation EnglishCFAR-m Presentation English
CFAR-m Presentation Englishbusinessangeleu
 

Ähnlich wie Predicting performance in Recommender Systems - Poster (20)

Predicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - SlidesPredicting performance in Recommender Systems - Slides
Predicting performance in Recommender Systems - Slides
 
Predicting Employee Attrition
Predicting Employee AttritionPredicting Employee Attrition
Predicting Employee Attrition
 
2013-1 Machine Learning Lecture 01 - Pattern Recognition
2013-1 Machine Learning Lecture 01 - Pattern Recognition2013-1 Machine Learning Lecture 01 - Pattern Recognition
2013-1 Machine Learning Lecture 01 - Pattern Recognition
 
LAK13 linkedup tutorial_evaluation_framework
LAK13 linkedup tutorial_evaluation_frameworkLAK13 linkedup tutorial_evaluation_framework
LAK13 linkedup tutorial_evaluation_framework
 
Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...
Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...
Komogortsev Qualitative And Quantitative Scoring And Evaluation Of The Eye Mo...
 
14
1414
14
 
14
1414
14
 
Neural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learningNeural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learning
 
IRJET - A Novel Approach for Software Defect Prediction based on Dimensio...
IRJET -  	  A Novel Approach for Software Defect Prediction based on Dimensio...IRJET -  	  A Novel Approach for Software Defect Prediction based on Dimensio...
IRJET - A Novel Approach for Software Defect Prediction based on Dimensio...
 
Adaptive Neural Fuzzy Inference System for Employability Assessment
Adaptive Neural Fuzzy Inference System for Employability AssessmentAdaptive Neural Fuzzy Inference System for Employability Assessment
Adaptive Neural Fuzzy Inference System for Employability Assessment
 
A Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
A Review on Reasoning System, Types, and Tools and Need for Hybrid ReasoningA Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
A Review on Reasoning System, Types, and Tools and Need for Hybrid Reasoning
 
Screening of Mental Health in Adolescents using ML.pptx
Screening of Mental Health in Adolescents using ML.pptxScreening of Mental Health in Adolescents using ML.pptx
Screening of Mental Health in Adolescents using ML.pptx
 
Chi1417 Richardson
Chi1417 RichardsonChi1417 Richardson
Chi1417 Richardson
 
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
 
IRJET - House Price Predictor using ML through Artificial Neural Network
IRJET - House Price Predictor using ML through Artificial Neural NetworkIRJET - House Price Predictor using ML through Artificial Neural Network
IRJET - House Price Predictor using ML through Artificial Neural Network
 
An Aspect Based Resource Recommendation System for Smart Hotels
An Aspect Based Resource Recommendation System for Smart HotelsAn Aspect Based Resource Recommendation System for Smart Hotels
An Aspect Based Resource Recommendation System for Smart Hotels
 
IRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data MiningIRJET- Study and Evaluation of Classification Algorithms in Data Mining
IRJET- Study and Evaluation of Classification Algorithms in Data Mining
 
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
 
IRJET- A Survey on Image Retrieval using Machine Learning
IRJET- A Survey on Image Retrieval using Machine LearningIRJET- A Survey on Image Retrieval using Machine Learning
IRJET- A Survey on Image Retrieval using Machine Learning
 
CFAR-m Presentation English
CFAR-m Presentation EnglishCFAR-m Presentation English
CFAR-m Presentation English
 

Mehr von Alejandro Bellogin

Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?Alejandro Bellogin
 
Revisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenariosRevisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenariosAlejandro Bellogin
 
Evaluating decision-aware recommender systems
Evaluating decision-aware recommender systemsEvaluating decision-aware recommender systems
Evaluating decision-aware recommender systemsAlejandro Bellogin
 
Replicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender SystemsReplicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender SystemsAlejandro Bellogin
 
Implicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix FactorizationImplicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix FactorizationAlejandro Bellogin
 
RiVal - A toolkit to foster reproducibility in Recommender System evaluation
RiVal - A toolkit to foster reproducibility in Recommender System evaluationRiVal - A toolkit to foster reproducibility in Recommender System evaluation
RiVal - A toolkit to foster reproducibility in Recommender System evaluationAlejandro Bellogin
 
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...Alejandro Bellogin
 
CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013Alejandro Bellogin
 
CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013Alejandro Bellogin
 
Probabilistic Collaborative Filtering with Negative Cross Entropy
Probabilistic Collaborative Filtering with Negative Cross EntropyProbabilistic Collaborative Filtering with Negative Cross Entropy
Probabilistic Collaborative Filtering with Negative Cross EntropyAlejandro Bellogin
 
Understanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender SystemsUnderstanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender SystemsAlejandro Bellogin
 
Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?Alejandro Bellogin
 
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...Alejandro Bellogin
 
Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...Alejandro Bellogin
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Alejandro Bellogin
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Alejandro Bellogin
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Alejandro Bellogin
 
Predicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slamPredicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slamAlejandro Bellogin
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Alejandro Bellogin
 

Mehr von Alejandro Bellogin (19)

Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?Recommender Systems and Misinformation: The Problem or the Solution?
Recommender Systems and Misinformation: The Problem or the Solution?
 
Revisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenariosRevisiting neighborhood-based recommenders for temporal scenarios
Revisiting neighborhood-based recommenders for temporal scenarios
 
Evaluating decision-aware recommender systems
Evaluating decision-aware recommender systemsEvaluating decision-aware recommender systems
Evaluating decision-aware recommender systems
 
Replicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender SystemsReplicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender Systems
 
Implicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix FactorizationImplicit vs Explicit trust in Social Matrix Factorization
Implicit vs Explicit trust in Social Matrix Factorization
 
RiVal - A toolkit to foster reproducibility in Recommender System evaluation
RiVal - A toolkit to foster reproducibility in Recommender System evaluationRiVal - A toolkit to foster reproducibility in Recommender System evaluation
RiVal - A toolkit to foster reproducibility in Recommender System evaluation
 
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
HT2014 Tutorial: Evaluating Recommender Systems - Ensuring Replicability of E...
 
CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013CWI @ Contextual Suggestion track - TREC 2013
CWI @ Contextual Suggestion track - TREC 2013
 
CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013CWI @ Federated Web Track - TREC 2013
CWI @ Federated Web Track - TREC 2013
 
Probabilistic Collaborative Filtering with Negative Cross Entropy
Probabilistic Collaborative Filtering with Negative Cross EntropyProbabilistic Collaborative Filtering with Negative Cross Entropy
Probabilistic Collaborative Filtering with Negative Cross Entropy
 
Understanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender SystemsUnderstanding Similarity Metrics in Neighbour-based Recommender Systems
Understanding Similarity Metrics in Neighbour-based Recommender Systems
 
Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?Artist popularity: do web and social music services agree?
Artist popularity: do web and social music services agree?
 
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
 
Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...Performance prediction and evaluation in Recommender Systems: an Information ...
Performance prediction and evaluation in Recommender Systems: an Information ...
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
 
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
Using Graph Partitioning Techniques for Neighbour Selection in User-Based Col...
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
 
Predicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slamPredicting performance in Recommender Systems - Poster slam
Predicting performance in Recommender Systems - Poster slam
 
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
Precision-oriented Evaluation of Recommender Systems: An Algorithmic Comparis...
 

Kürzlich hochgeladen

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 

Kürzlich hochgeladen (20)

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 

Predicting performance in Recommender Systems - Poster

  • 1. Predicting Performance in Recommender Systems Alejandro Bellogín Supervised by Pablo Castells and Iván Cantador Information Retrieval Group, Universidad Autónoma de Madrid, Spain alejandro.bellogin@uam.es IRG IR Group @ UAM Motivation Research questions Is it possible to predict the accuracy of a recommendation? 1. Is it possible to define a performance prediction theory for recommender E.g., we can decide whether to deliver a recommendation or not, systems in a sound, formal way? depending on such prediction. Or, even, to combine different 2. Is it possible to adapt query performance techniques (from IR) to the recommenders according to the expected performance of each one. recommendation task? 3. What kind of evaluation should be performed? Is IR evaluation still valid in our Hypothesis problem? Data that are commonly available to a Recommender System could contain signals that enable an a priori estimation of the success of 4. What kind of recommendation problems can these models be applied to? the recommendation Research question 1 Research question 2 Is it possible to define a performance prediction theory User clarity Is it possible to adapt query performance for recommender systems in a sound, formal way? techniques (from IR) to the recommendation task? It captures the uncertainty in user’s data • In Information Retrieval: “Estimation of the system’s  p  x | u   user’s model a) Define a predictor of performance performance in response to a specific query” clarity  u    p  x | u  log   p  x   x X  c  system’s model  = (u, i, r, …) • Several predictors proposed Distance between the user’s and the b) Agree on a performance metric • We focus on query clarity  user clarity system’s probability model  = (u, i, r, …) 0.7 RatItem p_c(x) 0.6 p(x|u1) p(x|u2) We propose 3 formulations (for space X): c) Check predictive power by measuring correlation 0.5 • Based on ratings 0.4 corr([(x1), …, (xn)], [(x1), …, (xn)]) 0.3 0.2 • Based on items • Based on ratings and items 0.1 d) Evaluate final performance: dynamic vs static 0.0 4 3 5 2 1 Ratings 4 3 5 2 1 Correlation coefficients r ~ 0.57 Research question 4 What kind of recommendation problems can these models be applied to? Pearson: linear correlation • Whenever a combination of strategies is available Spearman: rank correlation • Example 1: dynamic neighbor weighting • Example 2: dynamic ensemble recommendation • The user’s neighbors are weighted • Weight is the same for every item and user according to their similarity (learnt from training) • Can we take into account the uncertainty in • What about boosting those users predicted to Research question 3 neighbor’s data? perform better for some recommender? What kind of evaluation should be performed? Is IR • User neighbor weighting: • Hybrid recommendation: evaluation still valid in our problem? • Static: g  u, i   C  sim u, v   rat  v, i  • Static: g u, i     gR1 u, i   1     gR2 u, i  vN [ u ] • In IR: Mean Average Precision + correlation • Dynamic: g  u, i   C  γ  v   sim u, v   rat  v, i  • Dynamic: g u, i   γ u   gR1 u, i   1  γ u   gR2 u, i  vN [ u ] 50 points (queries) vs 1000+ points (users) • Correlation analysis: • Correlation analysis: • Performance metric is not clear:  error-based? • Performance:  precision-based? 0,98 Standard CF 0,88 b) Neighbourhood size: 500 Standard CF b) 80% training 0,96 Clarity-enhanced CF 0,87 Clarity-enhanced CF • What is performance? 0,94 0,86 nDCG@50 0,92 0,90 0,85 • Performance: MAE 0.2 MAE  It may depend on the final application 0,84 0,88 0,83 0.15 0,86 0,84 0,82 • Possible bias 0,82 0,80 0,81 0,80 0.1 0.05  E.g., towards users or items with larger profiles 10 20 30 40 50 60 70 80 90 10 20 100 150 200 250 300 350 400 450 500 30 40 50 60 70 80 90 100 150 200 250 300 350 400 450 500 % of ratings for training Neighbourhood size 0 H1 H2 H3 H4 Adaptive Static Future Work Publications Explore other input sources We need a theoretical background • A Performance Prediction Aproach to Enhance Why do some predictors work better? Collaborative Filtering Performance. A. Bellogín  Implicit data (with time)  Social links and P. Castells. In ECIR 2010.  Item predictors o Use graph-based measures • Predicting the Performance of Recommender o Different recommender as indicators of user strength Systems: An Information Theoretic Approach. A. behavior depending on item o First results are positive Bellogín, P. Castells, and I. Cantador. In ICTIR attributes 2011. o They would allow to capture • Self-adjusting Hybrid Recommenders based on popularity, diversity, etc. Social Network Analysis. A. Bellogín, P. Castells, Larger datasets and I. Cantador. In SIGIR 2011. 5th ACM Conference on Recommender Systems (RecSys2011) – Doctoral Symposium Acknowledgments to the National Science Foundation Chicago, USA, 23-27 October 2011 for the funding to attend the conference