SlideShare a Scribd company logo
1 of 24
5th ACM International Conference on
                     Recommender Systems – RecSys 2011




                        Rank and Relevance
                 in Novelty and Diversity Metrics
                    for Recommender Systems

                          Saúl Vargas and Pablo Castells
                        Universidad Autónoma de Madrid
                                http://ir.ii.uam.es



IRG
                      Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                        5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                               Chicago, IL, 23-27 October 2011
Beyond accuracy: novelty and diversity
         You bought                                     So you are recommended…
        (or browsed)
                 Revolver

                                                    Rubber Soul            With The              Beatles             Let it be         Help!
                                                                           Beatles               for Sale




                                                     A Hard Day’s       Sgt. Pp’s Lonely         Yellow           Magical Mystery   The White
                                                        Night          Hearts Club Band        Submarine               Tour           Album

             Abbey Road


                            The recommendedPlease are…
                                                 items 1967-1970                               1962-1966           Past Masters     Past Masters
                              Very similar to each other (Blue)
                                              Please me                                          (Red)                                  Vol 2

                              Very similar to what the
                               user has already seen                                                              …          More Beatles’
                                                                                                                               albums
                              Very widely known
                                              Dark Side Some Girls                              Bob Dylan
                                                     of the Moon


IRG
                                    Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                      5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                             Chicago, IL, 23-27 October 2011
Novelty and diversity in Recommender Systems

     Algorithms to enhance novelty and diversity
          Greedy optimization of objective functions (accuracy + diversity), promotion of long-tail items, etc.
                 (Ziegler 2005, Zhang 2008, Celma 2008)

     Metrics and methodologies to measure and evaluate novelty and diversity
          Inverse popularity –mean self-information (Zhou 2010)  recommend in the long tail
                                 1
                       MSI  
                                 R
                                      log
                                     iR
                                             2   p i                              Novelty

          Intra-list diversity –average pairwise distance (Ziegler 2005, Zhang 2008)

                                     2
                       ILD                   R d ik , il 
                                 R  R  1 ik ,il                                 Diversity
                                                    k l


          Other: temporal diversity (Lathia 2010), diversity relative to other users & to other systems
                 (Bellogín 2010), aggregate diversity (Adomavicius 2011), unexpectedness (Adamopoulos 2011), etc.



IRG
                                      Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                        5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                               Chicago, IL, 23-27 October 2011
Some limitations

                               R1                                    R2

                                                                                         Metrics are insensitive to the
                 Diverse




                                                       Not diverse
                                                                                         order of recommender items




                                                                                               Same item sets  same
                                                                                               measured diversity/novelty
                 Not diverse




                                                       Diverse
                               …




                                                                      …




IRG
                                    Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                      5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                             Chicago, IL, 23-27 October 2011
Some limitations



                            Accuracy and diversity/novelty measured independently



                            Method A is better than B                                           Which one is better?

                                                 Method A
                                                 Method B
                 Accuracy




                                     Diversity




IRG
                                       Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                         5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                                Chicago, IL, 23-27 October 2011
Our research goals


           1. Further formalize recommendation novelty and diversity
                 metrics based on a few basic fundamental principles

           2. Build a unified metric framework where:
                  –   As many state of the art novelty and diversity metrics as possible
                      are related and generalized

                  –   New metrics can be defined

           3. Enhance the novelty and diversity metrics with rank
                 sensitivity and relevance awareness


IRG
                               Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                 5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                        Chicago, IL, 23-27 October 2011
Basic fundamental principles to build metrics upon

        Our approach: define and formalize novelty and diversity metrics
                 based on models of how users interact with items

        Three basic fundamental principles in user-item interaction
                  – Discovery – an item is seen by a user
                  – Relevance – an item would be liked by (or useful for, etc.) a user
                  – Choice – an item is actually accepted (bought, consumed, etc.) by a user

        Formalized as binary random variables
                  – seen, rel, choose taking values in {true, false}
                                                                                                   seen choose        rel
        Simplifying assumptions:
                  – seen and rel are mutually independent
                  – If a user sees an item that is relevant for her,                         p choose            p seen  p rel 
                     she chooses it


IRG
                                      Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                        5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                               Chicago, IL, 23-27 October 2011
Proposed metric framework

                     Expected effective novelty of items when a user interacts
         R
                     with a ranked list of recommended items in a context 


                     m  R    C  p choose i, u, R  nov i  
                                                 iR

                     Novelty is relative: item novelty context 
                 i
                     To (what we know about) what someone has seen sometime somewhere
                      Someone  the target user, a set of users, all users…
                      Sometime  a specific past time period, an ongoing session, “ever”…
                      Somewhere  past recommendations, the current recommendation R,
                       recommendations by other systems, “anywhere”…
                      “What we know about that”  context of observation: available observations
         …




IRG
                                Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                  5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                         Chicago, IL, 23-27 October 2011
Metric framework components




                 m  R    C  p choose i, u, R  nov i  
                                         iR



                    Item novelty model                            nov i  

                    Choice model                                  p choose i, u, R 




IRG
                        Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                          5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                 Chicago, IL, 23-27 October 2011
Item novelty models

                        Item novelty model nov(i|)
    Discovery-based                     (negative popularity)

                 – Popularity complement                nov i    1  p seen i,                            Forced discovery

                 – Self-information (surprisal) nov i     log2 p i seen,                                 Free discovery


    Distance-based                   ( here represents a set of items)

                 – Expected item distance                nov i     p  j choose, i,   d i, j 
                                                                                j 


                 – Minimum item distance                 nov i    min d i, j 
                                                                                 j 




IRG
                                   Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                     5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                            Chicago, IL, 23-27 October 2011
Metric framework components




                 m  R    C  p choose i, u, R  nov i  
                                         iR



                    Item novelty model                            nov i  

                    Choice model                                  p choose i, u, R 




IRG
                        Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                          5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                 Chicago, IL, 23-27 October 2011
Choice model


                     Choice model p(choose|i,u,R)

                 p choose     p seen  p rel 


                 p choose i, u, R           p seen i, u, R  p rel i, u 

                                                     Browsing                     Relevance                  Independent
                                                      model                        model                     from R




IRG
                               Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                 5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                        Chicago, IL, 23-27 October 2011
Browsing model

                 R
                         Browsing model where p(seen|ik,u,R) should decrease with k
         1
                          Can be formalized as different probabilistic discount functions
         2
                           (see e.g. Carterette 2011)
         3
                          In general, p(seen|ik,u,R) = disc(k)
         4
                             disc k 
         5
                               p k 1                       exponential, as in RBP (Moffat 2008)
k=6                  ?
                               1 log k  1               as in nDCG
         7
                               1k                           Zipfian, as in MRR, MAP, etc.
         8
                               1                            no discount
         9
                               ...                          many others...
                 …




IRG
                             Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                               5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                      Chicago, IL, 23-27 October 2011
Wrapping up: resulting metric scheme




                  m  R    C  disc k  p rel ik , u  nov ik  
                                         ik R
                                                       Rank                        Item                         Item
                                                     discount                   relevance                      novelty


                 Normalization – to get the novelty ratio by expected number of browsed items

                                 1
                         C 
                              disc k 
                                 ik R
                                                                              Expected browsing depth




IRG
                                 Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                   5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                          Chicago, IL, 23-27 October 2011
Implementation

       Ground model estimates

         observed interaction between all users and items in the system

        Discovery distributions can be estimated from rating data or access records

                  – Forced discovery p(seen|i,)  IUF (ratio of users who have interacted with i)

                  – Free discovery: p(i|seen,)  ICF (ratio of interactions involving i)

        Relevance distribution p(rel|i,u) is estimated by a mapping from ratings

                 to relevance (see definition of ERR in Chapelle 2009)




IRG
                                   Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                     5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                            Chicago, IL, 23-27 October 2011
Novelty and diversity metrics




                                 Putting all together
                 Some metric framework instantiations




IRG
                     Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                       5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                              Chicago, IL, 23-27 October 2011
Putting all together: metric framework instantiations


       Discovery-based metrics

         observed interaction between all users and items in the system

        Expected popularity complement

                   EPC R   C  disc k  p rel ik , u  1  p seen ik 
                                  ik R
                                                                                                        
                                                                                                                          Novelty
        Expected free discovery

                   EFD R   C  disc k  p rel ik , u  log p ik seen 
                                    ik R
                                                                                                                1
                 Without rank and relevance reduces to  MSI  R   
                                                                                                                R
                                                                                                                     log p i seen 
                                                                                                                    iR




IRG
                                  Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                    5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                           Chicago, IL, 23-27 October 2011
Putting all together: metric framework instantiations

   Distance-based metrics

     the observed interaction of the target user only
    Expected profile distance
                                                                                                                      Unexpectedness
                 EPD  R   C u  disc k  p rel ik , u  p rel j, u  d ik , j                                 (user-specific)
                                   ik R
                                    j u


     the recommended items the target user can see in R
    Expected intra-list diversity
                                                                                                                                    Diversity
                 EILD  R       C disc k  disc l k  p rel i , u  p rel i , u  d i , i 
                                ik R
                                           k                                             k                    l          k   l

                                il R
                                 k l                                2
             Without rank and relevance reduces to 
                                                     ILD  R                 d ik , il 
                                                                 R  R  1 ik ,il R
                                                                                                                                 k l



IRG
                                        Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                          5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                                 Chicago, IL, 23-27 October 2011
Novelty and diversity metrics




                              Some experiments




IRG
                  Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                    5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                           Chicago, IL, 23-27 October 2011
Experiments

      Datasets                                                                     Recommender algorithms
                 – MovieLens 1M                                                            – CB                    Content-based (ML only)
                 – Last.fm data by Òscar Celma                                             – UB                    User-based kNN
      Experiment design                                                                   – MF                    Matrix factorization
                 – Run baseline recommenders                                               – AVG                   Average rating
                 – Rerank top 500 recommended items                                        – RND                   Random
                   by diversification algorithms
                 – Measure metrics on top 50 items                                  Diversification algorithms
      Metrics                                                                             – MMR                   Greedy optimization
                                                                                                                   of relevance + diversity
                 – EPC@50          Novelty                                                                         (Zhang 2008)
                                   (popularity complement)
                                                                                           – IA-Select             Adaptation of IR
                 – EPD@50          Unexpectednes                                                                   diversity algorithm
                                   (profile distance)                                                              (Agrawal 2008)
                 – EILD@50         Intra-list diversity                                    – NGD                   Greedy optimization
                 Distance function: complement of Jaccard                                                          of relevance + novelty
                 (MovieLens genres) and Pearson (Last.fm)                                  – Random

IRG
                                     Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                       5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                              Chicago, IL, 23-27 October 2011
Experimental results on baseline recommenders (no rank discount)

                                  MovieLens 1M                                        Last.fm
                                                                                                                               Without relevance
                   1.0                                         1.00                                               CB
                   0.9                                                                                            MF
                                                                                                                                CB is good for long-
                                                               0.97
No relevance




                   0.8                                                                                            UB              tail, not so good at
                                                               0.94
                   0.7                                                                                            AVG             unexpectedness
                                                               0.91                                               RND
                   0.6                                                                                                            and diversity
                   0.5                                         0.88
                                                                                                                                AVG rating and RND
                   0.4                                         0.85
                         EPC@50      EPD@50      EILD@50                EPC@50       EPD@50        EILD@50                        stand out, especial-
                                                                                                                                  ly on Last.fm


                                  MovieLens 1M                                        Last.fm
                                                                                                                               With relevance
                  0.07                                          0.5                                               CB
                                                                                                                                MF stands out on
Relevance-aware




                  0.06                                                                                            MF
                                                                0.4
                  0.05                                                                                            UB              MovieLens
                  0.04                                          0.3                                               AVG
                  0.03                                                                                            RND
                                                                                                                                UB stands out on
                                                                0.2
                  0.02                                                                                                            Last.fm
                  0.01                                          0.1
                  0.00                                          0.0
                                                                                                                                AVG rating and RND
                         EPC@50     EPD@50    EILD@50                   EPC@50       EPD@50       EILD@50                         drop drastically




IRG
                                                 Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                                   5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                                          Chicago, IL, 23-27 October 2011
Experimental results with diversification algorithms
   Wilcoxon                                  MovieLens 1M                                                              Last.fm
   p < 0.001                  EPC@50            EPD@50                EILD@50                   EPC@50                 EPD@50        EILD@50
                disc (k)      1    0.85k–1     1       0.85k–1        1       0.85k–1          1       0.85k–1         1   0.85k–1   1      0.85k–1
                MF          0.9124 0.8876 0.7632 0.7466 0.7164 0.6191                      0.8754 0.8481 0.8949 0.8895 0.8862 0.7954
 No relevance




                IA-Select   0.9045 0.8886 0.8080 0.7577 0.8289 0.7483                      0.8840 0.9089 0.8912 0.8909 0.8878 0.8274
                MMR         0.9063 0.8769 0.7605 0.7428 0.7191 0.6247                      0.9068 0.8903 0.9133 0.9107 0.9166 0.8398
                NGD         0.9851 0.9795 0.7725 0.7551 0.6563 0.5430                      0.9722 0.9571 0.9423 0.9398 0.9485 0.8784
                Random      0.9525 0.9527 0.7699 0.7699 0.7283 0.6719                      0.9359 0.9357 0.9278 0.9279 0.9318 0.8619

                MF          0.0671 0.1043 0.0580 0.0944 0.0471 0.0551                      0.2501 0.2115 0.2671 0.2587 0.2518 0.1900
                IA-Select
 Relevance




                            0.0705 0.1161 0.0639 0.1032 0.0537 0.0648                      0.3343 0.4752 0.3462 0.3994 0.3343 0.4154
                MMR         0.0719 0.1131 0.0620 0.1020 0.0510 0.0610                      0.2351 0.1936 0.2439 0.2340 0.2360 0.1759
                NGD         0.0155 0.0223 0.0128 0.0200 0.0067 0.0017                      0.2286 0.3077 0.2212 0.2593 0.2165 0.2656
                Random      0.0222 0.0218 0.0182 0.0179 0.0117 0.0058                      0.1362 0.1368 0.1407 0.1405 0.1342 0.1113

  Improvement w.r.t. random reranking is clearer with relevance                                                                         best
  Rank sensitivity uncovers further improvements by diversification algorithms                                                          > random
  Different metrics appreciate different diversification algorithms consistently                                                        < baseline


IRG
                                         Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                           5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                                  Chicago, IL, 23-27 October 2011
Experimental results

          The metrics behave consistently
                  – E.g. content-based recommender scores high on novelty (long-tail) but low on
                     unexpectedness and diversity

                  – Diversified recommendations score higher than baselines

                  – Different diversification strategies met their specific target

          Relevance makes a large difference
                  – Probe recommenders such as random and average rating score high without
                     relevance and rank discount –and they drop with relevance

                  – Same effect for random diversification

          Rank sensitiveness uncovers further improvements by diversification
                 algorithms which otherwise go unnoticed



IRG
                                     Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                       5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                              Chicago, IL, 23-27 October 2011
Conclusion

    General metric framework for recommendation novelty and diversity evaluation

    Flexible and configurable, supports a fair range of variants and configurations
                 – Key configuration components: item novelty models, context , rank and relevance

    Unifies and generalizes state of the art metrics
                 – Further metrics can be unified taking alternative  : temporal novelty/diversity,
                   inter-system diversity, inter-user diversity

    Provides for rank sensitivity and relevance awareness (as an option)

    Provides for single metric assessing accuracy and diversity/novelty

    Further ongoing empirical testing, wide space for further exploration!




IRG
                                    Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems
                                      5th ACM International Conference on Recommender Systems (RecSys 2011)
IR Group @ UAM                                             Chicago, IL, 23-27 October 2011

More Related Content

What's hot

Past, Present & Future of Recommender Systems: An Industry Perspective
Past, Present & Future of Recommender Systems: An Industry PerspectivePast, Present & Future of Recommender Systems: An Industry Perspective
Past, Present & Future of Recommender Systems: An Industry PerspectiveJustin Basilico
 
Hands on Explainable Recommender Systems with Knowledge Graphs @ RecSys22
Hands on Explainable Recommender Systems with Knowledge Graphs @ RecSys22Hands on Explainable Recommender Systems with Knowledge Graphs @ RecSys22
Hands on Explainable Recommender Systems with Knowledge Graphs @ RecSys22GiacomoBalloccu
 
Deep Learning in Recommender Systems - RecSys Summer School 2017
Deep Learning in Recommender Systems - RecSys Summer School 2017Deep Learning in Recommender Systems - RecSys Summer School 2017
Deep Learning in Recommender Systems - RecSys Summer School 2017Balázs Hidasi
 
Tutorial on People Recommendations in Social Networks - ACM RecSys 2013,Hong...
Tutorial on People Recommendations in Social Networks -  ACM RecSys 2013,Hong...Tutorial on People Recommendations in Social Networks -  ACM RecSys 2013,Hong...
Tutorial on People Recommendations in Social Networks - ACM RecSys 2013,Hong...Anmol Bhasin
 
[DL輪読会]Toward Multimodal Image-to-Image Translation (NIPS'17)
[DL輪読会]Toward Multimodal Image-to-Image Translation (NIPS'17)[DL輪読会]Toward Multimodal Image-to-Image Translation (NIPS'17)
[DL輪読会]Toward Multimodal Image-to-Image Translation (NIPS'17)Deep Learning JP
 
Bpr bayesian personalized ranking from implicit feedback
Bpr bayesian personalized ranking from implicit feedbackBpr bayesian personalized ranking from implicit feedback
Bpr bayesian personalized ranking from implicit feedbackPark JunPyo
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender SystemsJustin Basilico
 
GTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerceGTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerceGrubhubTech
 
Personalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing RecommendationsPersonalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing RecommendationsJustin Basilico
 
Personalizing the listening experience
Personalizing the listening experiencePersonalizing the listening experience
Personalizing the listening experienceMounia Lalmas-Roelleke
 
Homepage Personalization at Spotify
Homepage Personalization at SpotifyHomepage Personalization at Spotify
Homepage Personalization at SpotifyOguz Semerci
 
Kdd 2014 Tutorial - the recommender problem revisited
Kdd 2014 Tutorial -  the recommender problem revisitedKdd 2014 Tutorial -  the recommender problem revisited
Kdd 2014 Tutorial - the recommender problem revisitedXavier Amatriain
 
Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016
Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016
Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016MLconf
 
Deep Learning for Recommender Systems - Budapest RecSys Meetup
Deep Learning for Recommender Systems  - Budapest RecSys MeetupDeep Learning for Recommender Systems  - Budapest RecSys Meetup
Deep Learning for Recommender Systems - Budapest RecSys MeetupAlexandros Karatzoglou
 
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Xavier Amatriain
 
Music Personalization At Spotify
Music Personalization At SpotifyMusic Personalization At Spotify
Music Personalization At SpotifyVidhya Murali
 
15. Transformerを用いた言語処理技術の発展.pdf
15. Transformerを用いた言語処理技術の発展.pdf15. Transformerを用いた言語処理技術の発展.pdf
15. Transformerを用いた言語処理技術の発展.pdf幸太朗 岩澤
 
Past present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectivePast present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectiveXavier Amatriain
 
レコメンド研究のあれこれ
レコメンド研究のあれこれレコメンド研究のあれこれ
レコメンド研究のあれこれMasahiro Sato
 

What's hot (20)

Past, Present & Future of Recommender Systems: An Industry Perspective
Past, Present & Future of Recommender Systems: An Industry PerspectivePast, Present & Future of Recommender Systems: An Industry Perspective
Past, Present & Future of Recommender Systems: An Industry Perspective
 
Hands on Explainable Recommender Systems with Knowledge Graphs @ RecSys22
Hands on Explainable Recommender Systems with Knowledge Graphs @ RecSys22Hands on Explainable Recommender Systems with Knowledge Graphs @ RecSys22
Hands on Explainable Recommender Systems with Knowledge Graphs @ RecSys22
 
Deep Learning in Recommender Systems - RecSys Summer School 2017
Deep Learning in Recommender Systems - RecSys Summer School 2017Deep Learning in Recommender Systems - RecSys Summer School 2017
Deep Learning in Recommender Systems - RecSys Summer School 2017
 
Tutorial on People Recommendations in Social Networks - ACM RecSys 2013,Hong...
Tutorial on People Recommendations in Social Networks -  ACM RecSys 2013,Hong...Tutorial on People Recommendations in Social Networks -  ACM RecSys 2013,Hong...
Tutorial on People Recommendations in Social Networks - ACM RecSys 2013,Hong...
 
[DL輪読会]Toward Multimodal Image-to-Image Translation (NIPS'17)
[DL輪読会]Toward Multimodal Image-to-Image Translation (NIPS'17)[DL輪読会]Toward Multimodal Image-to-Image Translation (NIPS'17)
[DL輪読会]Toward Multimodal Image-to-Image Translation (NIPS'17)
 
Bpr bayesian personalized ranking from implicit feedback
Bpr bayesian personalized ranking from implicit feedbackBpr bayesian personalized ranking from implicit feedback
Bpr bayesian personalized ranking from implicit feedback
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
GTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerceGTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerce
 
Personalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing RecommendationsPersonalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing Recommendations
 
Personalizing the listening experience
Personalizing the listening experiencePersonalizing the listening experience
Personalizing the listening experience
 
Homepage Personalization at Spotify
Homepage Personalization at SpotifyHomepage Personalization at Spotify
Homepage Personalization at Spotify
 
Kdd 2014 Tutorial - the recommender problem revisited
Kdd 2014 Tutorial -  the recommender problem revisitedKdd 2014 Tutorial -  the recommender problem revisited
Kdd 2014 Tutorial - the recommender problem revisited
 
Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016
Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016
Stephanie deWet, Software Engineer, Pinterest at MLconf SF 2016
 
Deep Learning for Recommender Systems - Budapest RecSys Meetup
Deep Learning for Recommender Systems  - Budapest RecSys MeetupDeep Learning for Recommender Systems  - Budapest RecSys Meetup
Deep Learning for Recommender Systems - Budapest RecSys Meetup
 
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
 
Music Personalization At Spotify
Music Personalization At SpotifyMusic Personalization At Spotify
Music Personalization At Spotify
 
15. Transformerを用いた言語処理技術の発展.pdf
15. Transformerを用いた言語処理技術の発展.pdf15. Transformerを用いた言語処理技術の発展.pdf
15. Transformerを用いた言語処理技術の発展.pdf
 
Temporal based Recommendation System
Temporal based Recommendation SystemTemporal based Recommendation System
Temporal based Recommendation System
 
Past present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectivePast present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry Perspective
 
レコメンド研究のあれこれ
レコメンド研究のあれこれレコメンド研究のあれこれ
レコメンド研究のあれこれ
 

Viewers also liked

Setting Goals and Choosing Metrics for Recommender System Evaluations
Setting Goals and Choosing Metrics for Recommender System EvaluationsSetting Goals and Choosing Metrics for Recommender System Evaluations
Setting Goals and Choosing Metrics for Recommender System EvaluationsGunnar Fabritius (Schroeder)
 
Diversity and novelty for recommendation system
Diversity and novelty for recommendation systemDiversity and novelty for recommendation system
Diversity and novelty for recommendation systemZhenv5
 
ACM RecSys 2012: Recommender Systems, Today
ACM RecSys 2012: Recommender Systems, TodayACM RecSys 2012: Recommender Systems, Today
ACM RecSys 2012: Recommender Systems, TodayData Science London
 
The continuous cold-start problem in e-commerce recommender systems
The continuous cold-start problem in e-commerce recommender systemsThe continuous cold-start problem in e-commerce recommender systems
The continuous cold-start problem in e-commerce recommender systemsMelanie JI Mueller
 
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...Joaquin Delgado PhD.
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender SystemsT212
 
Interactive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and SpotifyInteractive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and SpotifyChris Johnson
 

Viewers also liked (7)

Setting Goals and Choosing Metrics for Recommender System Evaluations
Setting Goals and Choosing Metrics for Recommender System EvaluationsSetting Goals and Choosing Metrics for Recommender System Evaluations
Setting Goals and Choosing Metrics for Recommender System Evaluations
 
Diversity and novelty for recommendation system
Diversity and novelty for recommendation systemDiversity and novelty for recommendation system
Diversity and novelty for recommendation system
 
ACM RecSys 2012: Recommender Systems, Today
ACM RecSys 2012: Recommender Systems, TodayACM RecSys 2012: Recommender Systems, Today
ACM RecSys 2012: Recommender Systems, Today
 
The continuous cold-start problem in e-commerce recommender systems
The continuous cold-start problem in e-commerce recommender systemsThe continuous cold-start problem in e-commerce recommender systems
The continuous cold-start problem in e-commerce recommender systems
 
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Interactive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and SpotifyInteractive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and Spotify
 

More from Pablo Castells

Rational and irrational bias in recommendation
Rational and irrational bias in recommendationRational and irrational bias in recommendation
Rational and irrational bias in recommendationPablo Castells
 
Bias in recommendation: avoid it or embrace it?
Bias in recommendation: avoid it or embrace it?Bias in recommendation: avoid it or embrace it?
Bias in recommendation: avoid it or embrace it?Pablo Castells
 
RecSys 2020 - On Target Item Sampling in Offline Recommender System Evaluation
RecSys 2020 - On Target Item Sampling in Offline Recommender System EvaluationRecSys 2020 - On Target Item Sampling in Offline Recommender System Evaluation
RecSys 2020 - On Target Item Sampling in Offline Recommender System EvaluationPablo Castells
 
REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...
REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...
REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...Pablo Castells
 
SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...
SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...
SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...Pablo Castells
 
SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...
SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...
SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...Pablo Castells
 
RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...
RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...
RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...Pablo Castells
 
SIGIR 2011 Poster - Intent-Oriented Diversity in Recommender Systems
SIGIR 2011 Poster - Intent-Oriented Diversity in Recommender SystemsSIGIR 2011 Poster - Intent-Oriented Diversity in Recommender Systems
SIGIR 2011 Poster - Intent-Oriented Diversity in Recommender SystemsPablo Castells
 
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented Information Retrie...
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented  Information Retrie...SIGIR 2012 - Explicit Relevance Models in Intent-Oriented  Information Retrie...
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented Information Retrie...Pablo Castells
 

More from Pablo Castells (9)

Rational and irrational bias in recommendation
Rational and irrational bias in recommendationRational and irrational bias in recommendation
Rational and irrational bias in recommendation
 
Bias in recommendation: avoid it or embrace it?
Bias in recommendation: avoid it or embrace it?Bias in recommendation: avoid it or embrace it?
Bias in recommendation: avoid it or embrace it?
 
RecSys 2020 - On Target Item Sampling in Offline Recommender System Evaluation
RecSys 2020 - On Target Item Sampling in Offline Recommender System EvaluationRecSys 2020 - On Target Item Sampling in Offline Recommender System Evaluation
RecSys 2020 - On Target Item Sampling in Offline Recommender System Evaluation
 
REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...
REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...
REVEAL @ RecSys 2018 - Characterization of Fair Experiments for Recommender S...
 
SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...
SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...
SIGIR 2018 - Should I Follow the Crowd? A Probabilistic Analysis of the Effec...
 
SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...
SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...
SIGIR 2017 - A Probabilistic Reformulation of Memory-Based Collaborative Filt...
 
RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...
RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...
RSWeb @ ACM RecSys 2014 - Exploring social network effects on popularity bias...
 
SIGIR 2011 Poster - Intent-Oriented Diversity in Recommender Systems
SIGIR 2011 Poster - Intent-Oriented Diversity in Recommender SystemsSIGIR 2011 Poster - Intent-Oriented Diversity in Recommender Systems
SIGIR 2011 Poster - Intent-Oriented Diversity in Recommender Systems
 
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented Information Retrie...
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented  Information Retrie...SIGIR 2012 - Explicit Relevance Models in Intent-Oriented  Information Retrie...
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented Information Retrie...
 

Recently uploaded

Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Recently uploaded (20)

Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

ACM RecSys 2011 - Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems

  • 1. 5th ACM International Conference on Recommender Systems – RecSys 2011 Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems Saúl Vargas and Pablo Castells Universidad Autónoma de Madrid http://ir.ii.uam.es IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 2. Beyond accuracy: novelty and diversity You bought So you are recommended… (or browsed) Revolver Rubber Soul With The Beatles Let it be Help! Beatles for Sale A Hard Day’s Sgt. Pp’s Lonely Yellow Magical Mystery The White Night Hearts Club Band Submarine Tour Album Abbey Road The recommendedPlease are… items 1967-1970 1962-1966 Past Masters Past Masters  Very similar to each other (Blue) Please me (Red) Vol 2  Very similar to what the user has already seen … More Beatles’ albums  Very widely known Dark Side Some Girls Bob Dylan of the Moon IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 3. Novelty and diversity in Recommender Systems Algorithms to enhance novelty and diversity  Greedy optimization of objective functions (accuracy + diversity), promotion of long-tail items, etc. (Ziegler 2005, Zhang 2008, Celma 2008) Metrics and methodologies to measure and evaluate novelty and diversity  Inverse popularity –mean self-information (Zhou 2010)  recommend in the long tail 1 MSI   R  log iR 2 p i  Novelty  Intra-list diversity –average pairwise distance (Ziegler 2005, Zhang 2008) 2 ILD  R d ik , il  R  R  1 ik ,il  Diversity k l  Other: temporal diversity (Lathia 2010), diversity relative to other users & to other systems (Bellogín 2010), aggregate diversity (Adomavicius 2011), unexpectedness (Adamopoulos 2011), etc. IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 4. Some limitations R1 R2 Metrics are insensitive to the Diverse Not diverse order of recommender items Same item sets  same measured diversity/novelty Not diverse Diverse … … IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 5. Some limitations Accuracy and diversity/novelty measured independently Method A is better than B Which one is better? Method A Method B Accuracy Diversity IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 6. Our research goals 1. Further formalize recommendation novelty and diversity metrics based on a few basic fundamental principles 2. Build a unified metric framework where: – As many state of the art novelty and diversity metrics as possible are related and generalized – New metrics can be defined 3. Enhance the novelty and diversity metrics with rank sensitivity and relevance awareness IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 7. Basic fundamental principles to build metrics upon  Our approach: define and formalize novelty and diversity metrics based on models of how users interact with items  Three basic fundamental principles in user-item interaction – Discovery – an item is seen by a user – Relevance – an item would be liked by (or useful for, etc.) a user – Choice – an item is actually accepted (bought, consumed, etc.) by a user  Formalized as binary random variables – seen, rel, choose taking values in {true, false} seen choose rel  Simplifying assumptions: – seen and rel are mutually independent – If a user sees an item that is relevant for her,  p choose  p seen  p rel  she chooses it IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 8. Proposed metric framework Expected effective novelty of items when a user interacts R with a ranked list of recommended items in a context  m  R    C  p choose i, u, R  nov i   iR Novelty is relative: item novelty context  i To (what we know about) what someone has seen sometime somewhere  Someone  the target user, a set of users, all users…  Sometime  a specific past time period, an ongoing session, “ever”…  Somewhere  past recommendations, the current recommendation R, recommendations by other systems, “anywhere”…  “What we know about that”  context of observation: available observations … IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 9. Metric framework components m  R    C  p choose i, u, R  nov i   iR  Item novelty model nov i    Choice model p choose i, u, R  IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 10. Item novelty models Item novelty model nov(i|)  Discovery-based (negative popularity) – Popularity complement nov i    1  p seen i,   Forced discovery – Self-information (surprisal) nov i     log2 p i seen,   Free discovery  Distance-based ( here represents a set of items) – Expected item distance nov i     p  j choose, i,   d i, j  j  – Minimum item distance nov i    min d i, j  j  IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 11. Metric framework components m  R    C  p choose i, u, R  nov i   iR  Item novelty model nov i    Choice model p choose i, u, R  IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 12. Choice model Choice model p(choose|i,u,R) p choose  p seen  p rel  p choose i, u, R  p seen i, u, R  p rel i, u  Browsing Relevance Independent model model from R IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 13. Browsing model R Browsing model where p(seen|ik,u,R) should decrease with k 1  Can be formalized as different probabilistic discount functions 2 (see e.g. Carterette 2011) 3  In general, p(seen|ik,u,R) = disc(k) 4 disc k  5 p k 1 exponential, as in RBP (Moffat 2008) k=6 ? 1 log k  1 as in nDCG 7 1k Zipfian, as in MRR, MAP, etc. 8 1 no discount 9 ... many others... … IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 14. Wrapping up: resulting metric scheme m  R    C  disc k  p rel ik , u  nov ik   ik R Rank Item Item discount relevance novelty Normalization – to get the novelty ratio by expected number of browsed items 1 C   disc k  ik R Expected browsing depth IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 15. Implementation Ground model estimates   observed interaction between all users and items in the system  Discovery distributions can be estimated from rating data or access records – Forced discovery p(seen|i,)  IUF (ratio of users who have interacted with i) – Free discovery: p(i|seen,)  ICF (ratio of interactions involving i)  Relevance distribution p(rel|i,u) is estimated by a mapping from ratings to relevance (see definition of ERR in Chapelle 2009) IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 16. Novelty and diversity metrics Putting all together Some metric framework instantiations IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 17. Putting all together: metric framework instantiations Discovery-based metrics   observed interaction between all users and items in the system  Expected popularity complement EPC R   C  disc k  p rel ik , u  1  p seen ik  ik R   Novelty  Expected free discovery EFD R   C  disc k  p rel ik , u  log p ik seen  ik R 1 Without rank and relevance reduces to  MSI  R    R  log p i seen  iR IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 18. Putting all together: metric framework instantiations Distance-based metrics   the observed interaction of the target user only  Expected profile distance Unexpectedness EPD  R   C u  disc k  p rel ik , u  p rel j, u  d ik , j  (user-specific) ik R j u   the recommended items the target user can see in R  Expected intra-list diversity Diversity EILD  R    C disc k  disc l k  p rel i , u  p rel i , u  d i , i  ik R k k l k l il R k l 2 Without rank and relevance reduces to  ILD  R    d ik , il  R  R  1 ik ,il R k l IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 19. Novelty and diversity metrics Some experiments IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 20. Experiments  Datasets  Recommender algorithms – MovieLens 1M – CB Content-based (ML only) – Last.fm data by Òscar Celma – UB User-based kNN  Experiment design – MF Matrix factorization – Run baseline recommenders – AVG Average rating – Rerank top 500 recommended items – RND Random by diversification algorithms – Measure metrics on top 50 items  Diversification algorithms  Metrics – MMR Greedy optimization of relevance + diversity – EPC@50 Novelty (Zhang 2008) (popularity complement) – IA-Select Adaptation of IR – EPD@50 Unexpectednes diversity algorithm (profile distance) (Agrawal 2008) – EILD@50 Intra-list diversity – NGD Greedy optimization Distance function: complement of Jaccard of relevance + novelty (MovieLens genres) and Pearson (Last.fm) – Random IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 21. Experimental results on baseline recommenders (no rank discount) MovieLens 1M Last.fm Without relevance 1.0 1.00 CB 0.9 MF  CB is good for long- 0.97 No relevance 0.8 UB tail, not so good at 0.94 0.7 AVG unexpectedness 0.91 RND 0.6 and diversity 0.5 0.88  AVG rating and RND 0.4 0.85 EPC@50 EPD@50 EILD@50 EPC@50 EPD@50 EILD@50 stand out, especial- ly on Last.fm MovieLens 1M Last.fm With relevance 0.07 0.5 CB  MF stands out on Relevance-aware 0.06 MF 0.4 0.05 UB MovieLens 0.04 0.3 AVG 0.03 RND  UB stands out on 0.2 0.02 Last.fm 0.01 0.1 0.00 0.0  AVG rating and RND EPC@50 EPD@50 EILD@50 EPC@50 EPD@50 EILD@50 drop drastically IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 22. Experimental results with diversification algorithms Wilcoxon MovieLens 1M Last.fm p < 0.001 EPC@50 EPD@50 EILD@50 EPC@50 EPD@50 EILD@50 disc (k) 1 0.85k–1 1 0.85k–1 1 0.85k–1 1 0.85k–1 1 0.85k–1 1 0.85k–1 MF 0.9124 0.8876 0.7632 0.7466 0.7164 0.6191 0.8754 0.8481 0.8949 0.8895 0.8862 0.7954 No relevance IA-Select 0.9045 0.8886 0.8080 0.7577 0.8289 0.7483 0.8840 0.9089 0.8912 0.8909 0.8878 0.8274 MMR 0.9063 0.8769 0.7605 0.7428 0.7191 0.6247 0.9068 0.8903 0.9133 0.9107 0.9166 0.8398 NGD 0.9851 0.9795 0.7725 0.7551 0.6563 0.5430 0.9722 0.9571 0.9423 0.9398 0.9485 0.8784 Random 0.9525 0.9527 0.7699 0.7699 0.7283 0.6719 0.9359 0.9357 0.9278 0.9279 0.9318 0.8619 MF 0.0671 0.1043 0.0580 0.0944 0.0471 0.0551 0.2501 0.2115 0.2671 0.2587 0.2518 0.1900 IA-Select Relevance 0.0705 0.1161 0.0639 0.1032 0.0537 0.0648 0.3343 0.4752 0.3462 0.3994 0.3343 0.4154 MMR 0.0719 0.1131 0.0620 0.1020 0.0510 0.0610 0.2351 0.1936 0.2439 0.2340 0.2360 0.1759 NGD 0.0155 0.0223 0.0128 0.0200 0.0067 0.0017 0.2286 0.3077 0.2212 0.2593 0.2165 0.2656 Random 0.0222 0.0218 0.0182 0.0179 0.0117 0.0058 0.1362 0.1368 0.1407 0.1405 0.1342 0.1113  Improvement w.r.t. random reranking is clearer with relevance best  Rank sensitivity uncovers further improvements by diversification algorithms > random  Different metrics appreciate different diversification algorithms consistently < baseline IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 23. Experimental results  The metrics behave consistently – E.g. content-based recommender scores high on novelty (long-tail) but low on unexpectedness and diversity – Diversified recommendations score higher than baselines – Different diversification strategies met their specific target  Relevance makes a large difference – Probe recommenders such as random and average rating score high without relevance and rank discount –and they drop with relevance – Same effect for random diversification  Rank sensitiveness uncovers further improvements by diversification algorithms which otherwise go unnoticed IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011
  • 24. Conclusion  General metric framework for recommendation novelty and diversity evaluation  Flexible and configurable, supports a fair range of variants and configurations – Key configuration components: item novelty models, context , rank and relevance  Unifies and generalizes state of the art metrics – Further metrics can be unified taking alternative  : temporal novelty/diversity, inter-system diversity, inter-user diversity  Provides for rank sensitivity and relevance awareness (as an option)  Provides for single metric assessing accuracy and diversity/novelty  Further ongoing empirical testing, wide space for further exploration! IRG Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems 5th ACM International Conference on Recommender Systems (RecSys 2011) IR Group @ UAM Chicago, IL, 23-27 October 2011