SlideShare ist ein Scribd-Unternehmen logo
1 von 39
Recommender System
    Introduction
  xiangliang@hulu.com
What is good recommender
         system?
Outline
• What is recommender system?
  – Mission
  – History
  – Problems
• What is good recommender system?
  – Experiment Methods
  – Evaluation Metric
Information Overload
How to solve information overload
• Catalog
  – Yahoo, DMOZ


• Search Engine
  – Google, Bing
Mission
• Help user find item of their interest.
• Help item provider deliver their item to
  right user.
• Help website improve user engagement.
Recommender
   System
Search Engine vs. Recommender
               System
• User will try search engine if
  – they have specific needs
  – they can use keywords to describe needs
• User will try recommender system if
  – they do not know what they want now
  – they can not use keywords to describe needs
History: Before 1992
• Content Filtering
  – An architecture for large scale information
    systems [1985] (Gifford, D.K)
  – MAFIA: An active mail-filter agent for an
    intelligent document processing support [1990]
    (Lutz, E.)
  – A rule-based message filtering system [1988]
    (Pollock, S. )
History: 1992-1998
• Tapestry by Xerox Palo Alto [1992]
  – First system designed by collaborative filtering
• Grouplens [1994]
  – First recommender system using rating data
• Movielens [1997]
  – First movie recommender system
  – Provide well-known dataset for researchers
History: 1992-1998
• Fab : content-based collaborative
  recommendation
  – First unified recommender system
• Empirical Analysis of Predictive Algorithms
  for Collaborative Filtering [1998] (John S.
  Breese)
  – Systematically evaluate user-based
    collaborative filtering
History: 1999-2005
• Amazon proposed item-based collaborative
  filtering (Patent is filed in 1998 and issued
  in 2001) [link]
• Thomas Hofmann proposed pLSA [1999]
  and apply similar method on collaborative
  filtering [2004]
• Pandora began music genome project
  [2000]
History: 1999-2005
• Lastfm using Audioscrobbler to generate
  user taste profile on musics.
• Evaluating collaborative filtering
  recommender systems [2004] (Jonathan L.
  Herlocker)
History: 2005-2009
• Toward the Next Generation of
  Recommender Systems: A Survey of the
  State-of-the-Art and Possible Extensions.
  [2005] (Alexander Tuzhilin)
• Netflix Prize [link]
  – Latent Factor Model (SVD, RSVD, NSVD, SVD++)
  – Temporal Dynamic Collaborative Filtering
  – Yehuda Koren [link]’s team get prize
History: 2005-2009
• ACM Conference on Recommender System
  [2007] (Minneapolis, Minnesota, USA)
• Digg, Youtube try recommender system.
History: 2010-now
•   Context-Aware Recommender Systems
•   Music Recommendation and Discovery
•   Recommender Systems and the Social Web
•   Information Heterogeneity and Fusion in
    Recommender Systems
•   Human Decision Making in Recommender Systems
•   Personalization in Mobile Applications
•   Novelty and Diversity in Recommender Systems
•   User-Centric Evaluation
History: 2010-now
• Facebook launches instant personalization
  [2010]
  – Clicker
  – Bing
  – Trip Advisor
  – Rotten Tomatoes
  – Pandora
  – ……
Problems
• Main Problems
  – Top-N Recommendation
  – Rating Prediction
Problems
• Top-N Recommendation
  – Input
        user      item
            A      a
            B      a
            B      b
            …      …

  – Output
Problems
• Top-N Recommendation
  – Input
        user      item     rating
            A      a
            B      a
            B      b
            …      …         …

  – Output
What is good recommender
         system?
Experiment Methods
• Offline Experiment
• User Survey
• Online Experiment
  – AB Testing
Experiment Methods
• Offline Experiment

                    DataSet




                Train                    Test


  • Advantage:
       • Only rely on dataset
       •
  • Disadvantage:
       • Offline metric can not reflect business goal
Experiment Methods
• User Survey
  – Advantage:
    • Can get subjective metrics
    • Lower risk than online testing
  – Disadvantage:
    • Higher cost than offline experiments
    • Some results may not have statistical significance
    • Users may have different behaviors under testing
      environment or real environment
    • It’s difficult to design double blink experiments.
Experiment Methods
• On line experiments (AB Testing)
  – Advantage:
     • Can get metrics related to business goal
  – Disadvantage:
     • High risk/cost
     • Need large user set to get statistical significant result
Experiment Metrics
•   User Satisfaction
•   Prediction Accuracy
•   Coverage
•   Diversity
•   Novelty
•   Serendipity
•   Trust
•   Robust
•   Real-time
Experiment Metrics
• User Satisfaction
  – Subjective metric
  – Measured by user survey or online experiments
Experiment Metrics
• Prediction Accuracy
  – Measured by offline experiments
  – Top-N Recommendation
     • Precision / Recall


  – Rating Prediction
     • MAE, RMSE
Experiment Metrics
• Coverage
  – Measure the ability of recommender system to
    recommend long-tail items.

                |         R (u , N ) |
                     u U
  Coverage
                           |I|

  – Entropy, Gini Index
Experiment Metrics
• Diversity
  – Measure the ability of recommender system to
    cover users’ different interests.
  – Different similarity metric generate different
    diversity metric.
Experiment Metrics
• Diversity (Example)




 Watch History           Related Items
Experiment Metrics
• Novelty
  – Measure the ability of recommender system to
    introduce long tail items to users.
  – International Workshop on Novelty and
    Diversity in Recommender Systems [link]
  – Music Recommendation and Discovery in the
    Long Tail [Oscar Celma]
Experiment Metrics
• Serendipity
  – A recommendation result is serendipity if:
     • it’s not related with user’s historical interest
     • it’s novelty to user
     • user will find it’s interesting after user view it
Experiment Metrics
• Trust
  – If user trust recommender system, they will
    interact with it.
  – Ways to improve trust:
     • Transparency
     • Social
     • Trust System (Epinion)
Experiment Metrics
• Robust
  – The ability of recommender system to prevent
    attack.
  – Neil Hurley. Tutorial on Robustness of
    Recommender System. ACM RecSys 2011.
Experiment Metrics
• Real-time
  – Generate new recommendations when user
    have new behaviors immediately.
Too many metric!
Which is most important?
How to do trade-off
• Business goal
• Our belief
• Making new algorithms by 3 steps
  experiments:
  – Offline testing
  – User survey
  – Online testing
Thanks!

Weitere ähnliche Inhalte

Was ist angesagt?

Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender Systems
Lei Guo
 
Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011
Ernesto Mislej
 
Recommendation engines
Recommendation enginesRecommendation engines
Recommendation engines
Georgian Micsa
 

Was ist angesagt? (20)

Boston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender SystemsBoston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender Systems
 
Recommendation system
Recommendation systemRecommendation system
Recommendation system
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Recommendation System Explained
Recommendation System ExplainedRecommendation System Explained
Recommendation System Explained
 
Recommendation Systems
Recommendation SystemsRecommendation Systems
Recommendation Systems
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation Systems
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspective
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender Systems
 
Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011
 
A Hybrid Recommendation system
A Hybrid Recommendation systemA Hybrid Recommendation system
A Hybrid Recommendation system
 
Recommendation engines
Recommendation enginesRecommendation engines
Recommendation engines
 
[Final]collaborative filtering and recommender systems
[Final]collaborative filtering and recommender systems[Final]collaborative filtering and recommender systems
[Final]collaborative filtering and recommender systems
 
Recommendation Systems Basics
Recommendation Systems BasicsRecommendation Systems Basics
Recommendation Systems Basics
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
 
Recommender systems
Recommender systemsRecommender systems
Recommender systems
 
Recent advances in deep recommender systems
Recent advances in deep recommender systemsRecent advances in deep recommender systems
Recent advances in deep recommender systems
 
Past present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectivePast present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry Perspective
 
Recommendation system by_arpit_sharma
Recommendation system by_arpit_sharmaRecommendation system by_arpit_sharma
Recommendation system by_arpit_sharma
 
Collaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CFCollaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CF
 

Andere mochten auch

Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
Lecture 6   lu factorization & determinants - section 2-5 2-7 3-1 and 3-2Lecture 6   lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
njit-ronbrown
 
Factorization Machines with libFM
Factorization Machines with libFMFactorization Machines with libFM
Factorization Machines with libFM
Liangjie Hong
 
Matrix factorization
Matrix factorizationMatrix factorization
Matrix factorization
rubyyc
 
Building a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engineBuilding a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engine
NYC Predictive Analytics
 

Andere mochten auch (14)

Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
Lecture 6   lu factorization & determinants - section 2-5 2-7 3-1 and 3-2Lecture 6   lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
 
Factorization Machines with libFM
Factorization Machines with libFMFactorization Machines with libFM
Factorization Machines with libFM
 
Matrix factorization
Matrix factorizationMatrix factorization
Matrix factorization
 
Neighbor methods vs matrix factorization - case studies of real-life recommen...
Neighbor methods vs matrix factorization - case studies of real-life recommen...Neighbor methods vs matrix factorization - case studies of real-life recommen...
Neighbor methods vs matrix factorization - case studies of real-life recommen...
 
Collaborative Filtering with Spark
Collaborative Filtering with SparkCollaborative Filtering with Spark
Collaborative Filtering with Spark
 
Intro to Factorization Machines
Intro to Factorization MachinesIntro to Factorization Machines
Intro to Factorization Machines
 
آموزش محاسبات عددی - بخش دوم
آموزش محاسبات عددی - بخش دومآموزش محاسبات عددی - بخش دوم
آموزش محاسبات عددی - بخش دوم
 
Nonnegative Matrix Factorization
Nonnegative Matrix FactorizationNonnegative Matrix Factorization
Nonnegative Matrix Factorization
 
Matrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender SystemsMatrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender Systems
 
Introduction to Matrix Factorization Methods Collaborative Filtering
Introduction to Matrix Factorization Methods Collaborative FilteringIntroduction to Matrix Factorization Methods Collaborative Filtering
Introduction to Matrix Factorization Methods Collaborative Filtering
 
Beginners Guide to Non-Negative Matrix Factorization
Beginners Guide to Non-Negative Matrix FactorizationBeginners Guide to Non-Negative Matrix Factorization
Beginners Guide to Non-Negative Matrix Factorization
 
Recommendation system
Recommendation system Recommendation system
Recommendation system
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation System
 
Building a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engineBuilding a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engine
 

Ähnlich wie Recommender system introduction

Preference Elicitation Interface
Preference Elicitation InterfacePreference Elicitation Interface
Preference Elicitation Interface
晓愚 孟
 
Data Mining and Recommendation Systems
Data Mining and Recommendation SystemsData Mining and Recommendation Systems
Data Mining and Recommendation Systems
Salil Navgire
 
Recommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptxRecommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptx
Satyam Sharma
 

Ähnlich wie Recommender system introduction (20)

Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in Mendeley
 
Use of data science in recommendation system
Use of data science in  recommendation systemUse of data science in  recommendation system
Use of data science in recommendation system
 
productionising-recommenders
productionising-recommendersproductionising-recommenders
productionising-recommenders
 
Preference Elicitation Interface
Preference Elicitation InterfacePreference Elicitation Interface
Preference Elicitation Interface
 
Recommender Systems in a nutshell
Recommender Systems in a nutshellRecommender Systems in a nutshell
Recommender Systems in a nutshell
 
Evaluation in Audio Music Similarity
Evaluation in Audio Music SimilarityEvaluation in Audio Music Similarity
Evaluation in Audio Music Similarity
 
Design Recommender systems from scratch
Design Recommender systems from scratchDesign Recommender systems from scratch
Design Recommender systems from scratch
 
NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...
NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...
NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...
 
recommendation system techunique and issue
recommendation system techunique and issuerecommendation system techunique and issue
recommendation system techunique and issue
 
Олександр Обєдніков “Рекомендательные системы”
Олександр Обєдніков “Рекомендательные системы”Олександр Обєдніков “Рекомендательные системы”
Олександр Обєдніков “Рекомендательные системы”
 
The Hive Think Tank: Machine Learning at Pinterest by Jure Leskovec
The Hive Think Tank: Machine Learning at Pinterest by Jure LeskovecThe Hive Think Tank: Machine Learning at Pinterest by Jure Leskovec
The Hive Think Tank: Machine Learning at Pinterest by Jure Leskovec
 
Recommandation systems -
Recommandation systems - Recommandation systems -
Recommandation systems -
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in Mendeley
 
Data Mining and Recommendation Systems
Data Mining and Recommendation SystemsData Mining and Recommendation Systems
Data Mining and Recommendation Systems
 
Demystifying Recommendation Systems
Demystifying Recommendation SystemsDemystifying Recommendation Systems
Demystifying Recommendation Systems
 
Recommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptxRecommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptx
 
Lecture Notes on Recommender System Introduction
Lecture Notes on Recommender System IntroductionLecture Notes on Recommender System Introduction
Lecture Notes on Recommender System Introduction
 
Agent technology for e commerce-recommendation systems
Agent technology for e commerce-recommendation systemsAgent technology for e commerce-recommendation systems
Agent technology for e commerce-recommendation systems
 
Mini-training: Personalization & Recommendation Demystified
Mini-training: Personalization & Recommendation DemystifiedMini-training: Personalization & Recommendation Demystified
Mini-training: Personalization & Recommendation Demystified
 
Paper prototype evaluation
Paper prototype evaluationPaper prototype evaluation
Paper prototype evaluation
 

Mehr von Liang Xiang

Phd. Thesis : Temporal Recommendation
Phd. Thesis : Temporal RecommendationPhd. Thesis : Temporal Recommendation
Phd. Thesis : Temporal Recommendation
Liang Xiang
 
How to do model ensemble
How to do model ensembleHow to do model ensemble
How to do model ensemble
Liang Xiang
 
Temporal recommendation on graphs via long and short-term
Temporal recommendation on graphs via long  and short-termTemporal recommendation on graphs via long  and short-term
Temporal recommendation on graphs via long and short-term
Liang Xiang
 

Mehr von Liang Xiang (6)

Kddcup2011
Kddcup2011Kddcup2011
Kddcup2011
 
Phd. Thesis : Temporal Recommendation
Phd. Thesis : Temporal RecommendationPhd. Thesis : Temporal Recommendation
Phd. Thesis : Temporal Recommendation
 
动态推荐系统关键技术研究
动态推荐系统关键技术研究动态推荐系统关键技术研究
动态推荐系统关键技术研究
 
动态推荐系统关键技术研究
动态推荐系统关键技术研究动态推荐系统关键技术研究
动态推荐系统关键技术研究
 
How to do model ensemble
How to do model ensembleHow to do model ensemble
How to do model ensemble
 
Temporal recommendation on graphs via long and short-term
Temporal recommendation on graphs via long  and short-termTemporal recommendation on graphs via long  and short-term
Temporal recommendation on graphs via long and short-term
 

Kürzlich hochgeladen

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 

Kürzlich hochgeladen (20)

🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 

Recommender system introduction

  • 1. Recommender System Introduction xiangliang@hulu.com
  • 2. What is good recommender system?
  • 3. Outline • What is recommender system? – Mission – History – Problems • What is good recommender system? – Experiment Methods – Evaluation Metric
  • 5. How to solve information overload • Catalog – Yahoo, DMOZ • Search Engine – Google, Bing
  • 6. Mission • Help user find item of their interest. • Help item provider deliver their item to right user. • Help website improve user engagement.
  • 7. Recommender System
  • 8. Search Engine vs. Recommender System • User will try search engine if – they have specific needs – they can use keywords to describe needs • User will try recommender system if – they do not know what they want now – they can not use keywords to describe needs
  • 9. History: Before 1992 • Content Filtering – An architecture for large scale information systems [1985] (Gifford, D.K) – MAFIA: An active mail-filter agent for an intelligent document processing support [1990] (Lutz, E.) – A rule-based message filtering system [1988] (Pollock, S. )
  • 10. History: 1992-1998 • Tapestry by Xerox Palo Alto [1992] – First system designed by collaborative filtering • Grouplens [1994] – First recommender system using rating data • Movielens [1997] – First movie recommender system – Provide well-known dataset for researchers
  • 11. History: 1992-1998 • Fab : content-based collaborative recommendation – First unified recommender system • Empirical Analysis of Predictive Algorithms for Collaborative Filtering [1998] (John S. Breese) – Systematically evaluate user-based collaborative filtering
  • 12. History: 1999-2005 • Amazon proposed item-based collaborative filtering (Patent is filed in 1998 and issued in 2001) [link] • Thomas Hofmann proposed pLSA [1999] and apply similar method on collaborative filtering [2004] • Pandora began music genome project [2000]
  • 13. History: 1999-2005 • Lastfm using Audioscrobbler to generate user taste profile on musics. • Evaluating collaborative filtering recommender systems [2004] (Jonathan L. Herlocker)
  • 14. History: 2005-2009 • Toward the Next Generation of Recommender Systems: A Survey of the State-of-the-Art and Possible Extensions. [2005] (Alexander Tuzhilin) • Netflix Prize [link] – Latent Factor Model (SVD, RSVD, NSVD, SVD++) – Temporal Dynamic Collaborative Filtering – Yehuda Koren [link]’s team get prize
  • 15. History: 2005-2009 • ACM Conference on Recommender System [2007] (Minneapolis, Minnesota, USA) • Digg, Youtube try recommender system.
  • 16. History: 2010-now • Context-Aware Recommender Systems • Music Recommendation and Discovery • Recommender Systems and the Social Web • Information Heterogeneity and Fusion in Recommender Systems • Human Decision Making in Recommender Systems • Personalization in Mobile Applications • Novelty and Diversity in Recommender Systems • User-Centric Evaluation
  • 17. History: 2010-now • Facebook launches instant personalization [2010] – Clicker – Bing – Trip Advisor – Rotten Tomatoes – Pandora – ……
  • 18. Problems • Main Problems – Top-N Recommendation – Rating Prediction
  • 19. Problems • Top-N Recommendation – Input user item A a B a B b … … – Output
  • 20. Problems • Top-N Recommendation – Input user item rating A a B a B b … … … – Output
  • 21. What is good recommender system?
  • 22. Experiment Methods • Offline Experiment • User Survey • Online Experiment – AB Testing
  • 23. Experiment Methods • Offline Experiment DataSet Train Test • Advantage: • Only rely on dataset • • Disadvantage: • Offline metric can not reflect business goal
  • 24. Experiment Methods • User Survey – Advantage: • Can get subjective metrics • Lower risk than online testing – Disadvantage: • Higher cost than offline experiments • Some results may not have statistical significance • Users may have different behaviors under testing environment or real environment • It’s difficult to design double blink experiments.
  • 25. Experiment Methods • On line experiments (AB Testing) – Advantage: • Can get metrics related to business goal – Disadvantage: • High risk/cost • Need large user set to get statistical significant result
  • 26. Experiment Metrics • User Satisfaction • Prediction Accuracy • Coverage • Diversity • Novelty • Serendipity • Trust • Robust • Real-time
  • 27. Experiment Metrics • User Satisfaction – Subjective metric – Measured by user survey or online experiments
  • 28. Experiment Metrics • Prediction Accuracy – Measured by offline experiments – Top-N Recommendation • Precision / Recall – Rating Prediction • MAE, RMSE
  • 29. Experiment Metrics • Coverage – Measure the ability of recommender system to recommend long-tail items. | R (u , N ) | u U Coverage |I| – Entropy, Gini Index
  • 30. Experiment Metrics • Diversity – Measure the ability of recommender system to cover users’ different interests. – Different similarity metric generate different diversity metric.
  • 31. Experiment Metrics • Diversity (Example) Watch History Related Items
  • 32. Experiment Metrics • Novelty – Measure the ability of recommender system to introduce long tail items to users. – International Workshop on Novelty and Diversity in Recommender Systems [link] – Music Recommendation and Discovery in the Long Tail [Oscar Celma]
  • 33. Experiment Metrics • Serendipity – A recommendation result is serendipity if: • it’s not related with user’s historical interest • it’s novelty to user • user will find it’s interesting after user view it
  • 34. Experiment Metrics • Trust – If user trust recommender system, they will interact with it. – Ways to improve trust: • Transparency • Social • Trust System (Epinion)
  • 35. Experiment Metrics • Robust – The ability of recommender system to prevent attack. – Neil Hurley. Tutorial on Robustness of Recommender System. ACM RecSys 2011.
  • 36. Experiment Metrics • Real-time – Generate new recommendations when user have new behaviors immediately.
  • 37. Too many metric! Which is most important?
  • 38. How to do trade-off • Business goal • Our belief • Making new algorithms by 3 steps experiments: – Offline testing – User survey – Online testing