SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Downloaden Sie, um offline zu lesen
Homepage Personalization
at Spotify
Oğuz Semerci, Aloïs Gruson, Clay Gibson, Ben Lacker, Catherine Edwards, Vladan Radosavljevic
Spotify is a global audio
subscription service
By the
numbers
232M
108M
79
50M+ 450k+
What’s at stake on the Homepage?
The Homepage is the first thing you see when you open the app. It
is many things: a discovery tool, a personal music assistant, a
marketplace for artists and their fans.
Spotify’s mission is to unlock the potential of human creativity —
by giving a million creative artists the opportunity to live off their art
and billions of fans the opportunity to enjoy and be inspired by it.
Personalization is powerful in this challenging content space with
vast volume and variety.
01 More on Spotify Homepage
02 Overview of the Ranking algorithm and the bandit policy
03 Sanity checks used in practice for policy debiasing and model behavior
Talk outline
Homepage
organization
The Homepage is made up of cards:
podcast shows or episodes, albums,
playlists, radio stations, artist pages,
etc.
Cards are organized into shelves.
Shelf A
Shelf B
Each user is eligible for hundreds of
candidate shelves, which can be
editorially or programmatically
curated. Shelves pull from a pool of
millions of cards.
All shelf candidates and their
respective cards are ranked in
real-time when you load Home.
Made for X
Your Favorite Albums
Similar to Y
Recommended for Today
Iconic 80s Soundtracks
Discovered in Greenwich Village
Programmatic Curation
Editorial Curation
Embedding
Network
Ranking
Recommendation
Funnel
Ranking Algorithm
and Bandit Policy
Log user feedback:
interactions such as clicks,
likes, streams
Learn to rank Homepage based on logged feedback data.
Homepage ranking as end-to-end ML problem
Ranking algorithm serves
recommendations
Train ranking
algorithm
using logged
feedback
Consequences of Feedback Loops
Without randomization in the feedback loop, you risk:
● Homogenized user behavior (Chaney et al. 2018)
● Diminishing diversity over time (Nguyen et al. 2014)
● Poor representation of the long tail (Mehrotra et al. 2018)
Continuous exploration and content pool expansion
are helpful (Jiang et al, 2019)
Log user feedback:
interactions such as clicks,
likes, streams
Ranking algorithm serves
recommendations
Train ranking
algorithm
using logged
feedback
Introduce exploration
Exploration policy
introduces
randomness
Log user feedback:
interactions such as clicks,
likes, streams
Ranking algorithm serves
recommendations
Train ranking
algorithm
using logged
feedback
+ policy
propensities
Introduce exploration
Random data collection
Randomize the Homepage
for a small fraction of
requests
Ways to introduce exploration
Bandit Policy
Explore/exploit as
Homepage is assembled
(McInerney et al., 2018)
Bandit approaches are becoming popular:
● Artwork personalization at Netflix (Amat et al. 2018)
● News article recommendation in Yahoo (Chu et al. 2012)
● Personalization at Amazon Music (ICML 2019)
● REVEAL ’19 workshop here
Fully randomized
experiment
Randomize the Homepage
for a small fraction of users
Explore/Exploit
on the Homepage
An example of an epsilon-greedy policy for
ranking the Spotify Homepage.
0.7 0.20.8
Card Candidates
Predicted stream rate
Explore/Exploit
on the Homepage
An example of an epsilon-greedy policy for
ranking the Spotify Homepage.
0.7 0.20.8
Card Candidates
0.8
𝜋 = (1- 𝝐) + 𝝐/ 3
Explore/Exploit
on the Homepage
An example of an epsilon-greedy policy for
ranking the Spotify Homepage.
0.7 0.20.8
Card Candidates
0.8 0.2
𝜋 = 𝝐/ 2
Explore/Exploit
on the Homepage
An example of an epsilon-greedy policy for
ranking the Spotify Homepage.
0.7 0.20.8
Card Candidates
0.8 0.2 0.7
𝜋 = 1
Training the reward model*
Counterfactual inference for model parameters
* Explore, Exploit, Explain: Personalizing Explainable Recommendations with Bandits. J McInerney, B Lacker, S Hansen,
K Higley, H.Bouchard, A Gruson & R Mehrotra. RecSys 2018.
Research Directions & Practical Challenges
Many research directions we work on:
● Designing better reward models (REVEAL, talk by Mounia Lalmas)
● Optimizing for the marketplace (Marketplaces tutorial, Rishabh and Ben)
● Careful feature engineering to mitigate feedback loop side effects and better
rank new content
● Creating a more representative Homepage (Henriette Cramer in Responsible
Recommendation Panel)
But we need to have integration tests (kind of) so that we are confident that we’ve
got the basics right.
Sanity Checks
used in Practice
Three examples
Need a way to validate that policy debiasing yields roughly unbiased training data.
Sanity Checks
for policy debiasing
Method:
● Remove position bias by using training data from top
position..
● Train a linear model with a single feature (shelf_name) to
predict a metric that’s observable online (CTR).
● Compare prediction from debiased model to observed
outcome during exploration in that position.
Need a way to validate that policy debiasing yields roughly unbiased training data.
Sanity Checks
for policy debiasing
With
importance
sampling
Without
importance
sampling
Product strategy
Sanity Checks
for problem specific model behavior
Aggregate ranking metrics (e.g. NDCG) have low resolution and offer little visibility into
model behavior. But stakeholders have expectations about what the model should do in
specific situations. We build trust in the model internally and externally by creating metrics
around these expectations and using them as sanity checks.
Artists
Curators
Users
Music has repetitive consumption patterns.
Users have habitual behavior on Home. If a
user has a clear preference for a specific shelf,
models should rank that shelf high on the
page, regardless of what it is.
A user has a “favorite” shelf if a significant
amount of their consumption can be attributed
to that shelf.
Measure the average row where that shelf is
placed for those users.
Favorite Shelf Position Sanity Check
modelA modelB
shelfX
shelfY
shelfZ
Daily & Hourly Patterns Sanity Check
“Why don’t I see “Peaceful Piano” on top of my
homepage every night?”
● Zoom into repetitive consumption patterns and
habitual behavior.
● Measure if the row position is higher at the right
time when applicable.
streamrate
01 Motivation for exploration when collecting training data
02 Methods for collection policies and an epsilon greedy example
03 Three examples of simple sanity checks we use in production while
navigating the complex ecosystem of the homepage personalization
Conclusions
Thank you!
References:
[1] Lihong Li, Wei Chu, John Langford, Robert E. Schapire, A Contextual-Bandit Approach to Personalized News Article Recommendation
arXiv preprint arXiv:1003.0146
[2] Rishabh Mehrotra, James McInerney, Hugues Bouchard, Mounia Lalmas, and Fernando Diaz. 2018. Towards a Fair Marketplace:
Counterfactual Evaluation of the trade-off between Relevance, Fairness & Satisfaction in Recommendation Systems. CIKM '18. ACM, New
York, NY, USA, 2243-2251
[3] Allison J. B. Chaney, Brandon Stewart, and Barbara Engelhardt. 2017. How algorithmic confounding in recommendation systems
increases homogeneity and decreases utility. arXiv preprint arXiv:1710.11214
[4] J. McInerney, B. Lacker, S. Hansen, K. Higley, H. Bouchard, A. Gruson, R. Mehrotra. Explore, Exploit, Explain: Personalizing Explainable
Recommendations with Bandits. In ACM Conference on Recommender Systems (RecSys), October 2018
[5] Ray Jiang, Silvia Chiappa, Tor Lattimore, Andras Agyorgy, and Pushmeet Kohli. 2019. Degenerate Feedback Loops in Recommender
Systems. arXiv:arXiv:1902.10730
[6] Thorsten Joachims, Adith Swaminathan, Tobias Schnabel Unbiased learning from biased user feedback arXiv:arXiv:1608.04468
[7] Fernando Amat, Ashok Chandrashekar, Tony Jebara, and Justin Basilico. 2018. Artwork personalization at netflix. In Proceedings of the
12th ACM Conference on Recommender Systems (RecSys '18).
https://www.spotifyjobs.com

Weitere ähnliche Inhalte

Was ist angesagt?

Interactive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and SpotifyInteractive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and Spotify
Chris Johnson
 
Interactive Recommender Systems
Interactive Recommender SystemsInteractive Recommender Systems
Interactive Recommender Systems
Roelof van Zwol
 
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
Zachary Schendel
 

Was ist angesagt? (20)

Déjà Vu: The Importance of Time and Causality in Recommender Systems
Déjà Vu: The Importance of Time and Causality in Recommender SystemsDéjà Vu: The Importance of Time and Causality in Recommender Systems
Déjà Vu: The Importance of Time and Causality in Recommender Systems
 
Recent Trends in Personalization at Netflix
Recent Trends in Personalization at NetflixRecent Trends in Personalization at Netflix
Recent Trends in Personalization at Netflix
 
Past, Present & Future of Recommender Systems: An Industry Perspective
Past, Present & Future of Recommender Systems: An Industry PerspectivePast, Present & Future of Recommender Systems: An Industry Perspective
Past, Present & Future of Recommender Systems: An Industry Perspective
 
Crafting Recommenders: the Shallow and the Deep of it!
Crafting Recommenders: the Shallow and the Deep of it! Crafting Recommenders: the Shallow and the Deep of it!
Crafting Recommenders: the Shallow and the Deep of it!
 
Making Netflix Machine Learning Algorithms Reliable
Making Netflix Machine Learning Algorithms ReliableMaking Netflix Machine Learning Algorithms Reliable
Making Netflix Machine Learning Algorithms Reliable
 
Sequential Decision Making in Recommendations
Sequential Decision Making in RecommendationsSequential Decision Making in Recommendations
Sequential Decision Making in Recommendations
 
Recommending and searching @ Spotify
Recommending and searching @ SpotifyRecommending and searching @ Spotify
Recommending and searching @ Spotify
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspective
 
Music Personalization At Spotify
Music Personalization At SpotifyMusic Personalization At Spotify
Music Personalization At Spotify
 
Interactive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and SpotifyInteractive Recommender Systems with Netflix and Spotify
Interactive Recommender Systems with Netflix and Spotify
 
Context Aware Recommendations at Netflix
Context Aware Recommendations at NetflixContext Aware Recommendations at Netflix
Context Aware Recommendations at Netflix
 
Learning a Personalized Homepage
Learning a Personalized HomepageLearning a Personalized Homepage
Learning a Personalized Homepage
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
Interactive Recommender Systems
Interactive Recommender SystemsInteractive Recommender Systems
Interactive Recommender Systems
 
Engagement, metrics and "recommenders"
Engagement, metrics and "recommenders"Engagement, metrics and "recommenders"
Engagement, metrics and "recommenders"
 
Algorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyAlgorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at Spotify
 
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
RecSys 2020 A Human Perspective on Algorithmic Similarity Schendel 9-2020
 
Artwork Personalization at Netflix Fernando Amat RecSys2018
Artwork Personalization at Netflix Fernando Amat RecSys2018 Artwork Personalization at Netflix Fernando Amat RecSys2018
Artwork Personalization at Netflix Fernando Amat RecSys2018
 
Time, Context and Causality in Recommender Systems
Time, Context and Causality in Recommender SystemsTime, Context and Causality in Recommender Systems
Time, Context and Causality in Recommender Systems
 
Recent Trends in Personalization at Netflix
Recent Trends in Personalization at NetflixRecent Trends in Personalization at Netflix
Recent Trends in Personalization at Netflix
 

Ähnlich wie Homepage Personalization at Spotify

SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptxSampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
20211a05p7
 
Designing for Evaluation
Designing for EvaluationDesigning for Evaluation
Designing for Evaluation
Kelly Page
 
Measuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kimMeasuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kim
Jin Young Kim
 

Ähnlich wie Homepage Personalization at Spotify (20)

Social Media and AI: Don’t forget the users
Social Media and AI: Don’t forget the usersSocial Media and AI: Don’t forget the users
Social Media and AI: Don’t forget the users
 
20120140506003
2012014050600320120140506003
20120140506003
 
UX STRAT Online 2021 Presentation by Rina Tambo Jensen
UX STRAT Online 2021 Presentation by Rina Tambo JensenUX STRAT Online 2021 Presentation by Rina Tambo Jensen
UX STRAT Online 2021 Presentation by Rina Tambo Jensen
 
MBA
MBAMBA
MBA
 
Sweeny group think-ias2015
Sweeny group think-ias2015Sweeny group think-ias2015
Sweeny group think-ias2015
 
Approaching Big Data: Lesson Plan
Approaching Big Data: Lesson Plan Approaching Big Data: Lesson Plan
Approaching Big Data: Lesson Plan
 
Fuzzy Logic Based Recommender System
Fuzzy Logic Based Recommender SystemFuzzy Logic Based Recommender System
Fuzzy Logic Based Recommender System
 
Using Contextual Information to Understand Searching and Browsing Behavior
Using Contextual Information to Understand Searching and Browsing BehaviorUsing Contextual Information to Understand Searching and Browsing Behavior
Using Contextual Information to Understand Searching and Browsing Behavior
 
Data science and ethics in fundraising
Data science and ethics in fundraisingData science and ethics in fundraising
Data science and ethics in fundraising
 
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptxSampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
 
Martin bazley evaluating digital learning resources leicester reduced for upl...
Martin bazley evaluating digital learning resources leicester reduced for upl...Martin bazley evaluating digital learning resources leicester reduced for upl...
Martin bazley evaluating digital learning resources leicester reduced for upl...
 
NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...
NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...
NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...
 
IRJET- Review on Different Recommendation Techniques for GRS in Online Social...
IRJET- Review on Different Recommendation Techniques for GRS in Online Social...IRJET- Review on Different Recommendation Techniques for GRS in Online Social...
IRJET- Review on Different Recommendation Techniques for GRS in Online Social...
 
Impersonal Recommendation system on top of Hadoop
Impersonal Recommendation system on top of HadoopImpersonal Recommendation system on top of Hadoop
Impersonal Recommendation system on top of Hadoop
 
Mini-training: Personalization & Recommendation Demystified
Mini-training: Personalization & Recommendation DemystifiedMini-training: Personalization & Recommendation Demystified
Mini-training: Personalization & Recommendation Demystified
 
Designing for Evaluation
Designing for EvaluationDesigning for Evaluation
Designing for Evaluation
 
Meaningful Metrics - Aligning Operational Metrics with Marketing & Customer E...
Meaningful Metrics - Aligning Operational Metrics with Marketing & Customer E...Meaningful Metrics - Aligning Operational Metrics with Marketing & Customer E...
Meaningful Metrics - Aligning Operational Metrics with Marketing & Customer E...
 
Survey Research In Empirical Software Engineering
Survey Research In Empirical Software EngineeringSurvey Research In Empirical Software Engineering
Survey Research In Empirical Software Engineering
 
Measuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kimMeasuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kim
 
Tutorial on metrics of user engagement -- Applications to Search & E- commerce
Tutorial on metrics of user engagement -- Applications to Search & E- commerceTutorial on metrics of user engagement -- Applications to Search & E- commerce
Tutorial on metrics of user engagement -- Applications to Search & E- commerce
 

Kürzlich hochgeladen

Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
vexqp
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
gajnagarg
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
wsppdmt
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Klinik kandungan
 
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
ptikerjasaptiker
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
vexqp
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
wsppdmt
 
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit RiyadhCytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Abortion pills in Riyadh +966572737505 get cytotec
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
gajnagarg
 

Kürzlich hochgeladen (20)

Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Data Analyst Tasks to do the internship.pdf
Data Analyst Tasks to do the internship.pdfData Analyst Tasks to do the internship.pdf
Data Analyst Tasks to do the internship.pdf
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........
 
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
 
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit RiyadhCytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 

Homepage Personalization at Spotify

  • 1. Homepage Personalization at Spotify Oğuz Semerci, Aloïs Gruson, Clay Gibson, Ben Lacker, Catherine Edwards, Vladan Radosavljevic
  • 2. Spotify is a global audio subscription service By the numbers 232M 108M 79 50M+ 450k+
  • 3. What’s at stake on the Homepage? The Homepage is the first thing you see when you open the app. It is many things: a discovery tool, a personal music assistant, a marketplace for artists and their fans. Spotify’s mission is to unlock the potential of human creativity — by giving a million creative artists the opportunity to live off their art and billions of fans the opportunity to enjoy and be inspired by it. Personalization is powerful in this challenging content space with vast volume and variety.
  • 4. 01 More on Spotify Homepage 02 Overview of the Ranking algorithm and the bandit policy 03 Sanity checks used in practice for policy debiasing and model behavior Talk outline
  • 5. Homepage organization The Homepage is made up of cards: podcast shows or episodes, albums, playlists, radio stations, artist pages, etc. Cards are organized into shelves. Shelf A Shelf B
  • 6. Each user is eligible for hundreds of candidate shelves, which can be editorially or programmatically curated. Shelves pull from a pool of millions of cards. All shelf candidates and their respective cards are ranked in real-time when you load Home. Made for X Your Favorite Albums Similar to Y Recommended for Today Iconic 80s Soundtracks Discovered in Greenwich Village Programmatic Curation Editorial Curation Embedding Network Ranking Recommendation Funnel
  • 8. Log user feedback: interactions such as clicks, likes, streams Learn to rank Homepage based on logged feedback data. Homepage ranking as end-to-end ML problem Ranking algorithm serves recommendations Train ranking algorithm using logged feedback
  • 9. Consequences of Feedback Loops Without randomization in the feedback loop, you risk: ● Homogenized user behavior (Chaney et al. 2018) ● Diminishing diversity over time (Nguyen et al. 2014) ● Poor representation of the long tail (Mehrotra et al. 2018) Continuous exploration and content pool expansion are helpful (Jiang et al, 2019)
  • 10. Log user feedback: interactions such as clicks, likes, streams Ranking algorithm serves recommendations Train ranking algorithm using logged feedback Introduce exploration
  • 11. Exploration policy introduces randomness Log user feedback: interactions such as clicks, likes, streams Ranking algorithm serves recommendations Train ranking algorithm using logged feedback + policy propensities Introduce exploration
  • 12. Random data collection Randomize the Homepage for a small fraction of requests Ways to introduce exploration Bandit Policy Explore/exploit as Homepage is assembled (McInerney et al., 2018) Bandit approaches are becoming popular: ● Artwork personalization at Netflix (Amat et al. 2018) ● News article recommendation in Yahoo (Chu et al. 2012) ● Personalization at Amazon Music (ICML 2019) ● REVEAL ’19 workshop here Fully randomized experiment Randomize the Homepage for a small fraction of users
  • 13. Explore/Exploit on the Homepage An example of an epsilon-greedy policy for ranking the Spotify Homepage. 0.7 0.20.8 Card Candidates Predicted stream rate
  • 14. Explore/Exploit on the Homepage An example of an epsilon-greedy policy for ranking the Spotify Homepage. 0.7 0.20.8 Card Candidates 0.8 𝜋 = (1- 𝝐) + 𝝐/ 3
  • 15. Explore/Exploit on the Homepage An example of an epsilon-greedy policy for ranking the Spotify Homepage. 0.7 0.20.8 Card Candidates 0.8 0.2 𝜋 = 𝝐/ 2
  • 16. Explore/Exploit on the Homepage An example of an epsilon-greedy policy for ranking the Spotify Homepage. 0.7 0.20.8 Card Candidates 0.8 0.2 0.7 𝜋 = 1
  • 17. Training the reward model* Counterfactual inference for model parameters * Explore, Exploit, Explain: Personalizing Explainable Recommendations with Bandits. J McInerney, B Lacker, S Hansen, K Higley, H.Bouchard, A Gruson & R Mehrotra. RecSys 2018.
  • 18. Research Directions & Practical Challenges Many research directions we work on: ● Designing better reward models (REVEAL, talk by Mounia Lalmas) ● Optimizing for the marketplace (Marketplaces tutorial, Rishabh and Ben) ● Careful feature engineering to mitigate feedback loop side effects and better rank new content ● Creating a more representative Homepage (Henriette Cramer in Responsible Recommendation Panel) But we need to have integration tests (kind of) so that we are confident that we’ve got the basics right.
  • 19. Sanity Checks used in Practice Three examples
  • 20. Need a way to validate that policy debiasing yields roughly unbiased training data. Sanity Checks for policy debiasing Method: ● Remove position bias by using training data from top position.. ● Train a linear model with a single feature (shelf_name) to predict a metric that’s observable online (CTR). ● Compare prediction from debiased model to observed outcome during exploration in that position.
  • 21. Need a way to validate that policy debiasing yields roughly unbiased training data. Sanity Checks for policy debiasing With importance sampling Without importance sampling
  • 22. Product strategy Sanity Checks for problem specific model behavior Aggregate ranking metrics (e.g. NDCG) have low resolution and offer little visibility into model behavior. But stakeholders have expectations about what the model should do in specific situations. We build trust in the model internally and externally by creating metrics around these expectations and using them as sanity checks. Artists Curators Users
  • 23. Music has repetitive consumption patterns. Users have habitual behavior on Home. If a user has a clear preference for a specific shelf, models should rank that shelf high on the page, regardless of what it is. A user has a “favorite” shelf if a significant amount of their consumption can be attributed to that shelf. Measure the average row where that shelf is placed for those users. Favorite Shelf Position Sanity Check modelA modelB shelfX shelfY shelfZ
  • 24. Daily & Hourly Patterns Sanity Check “Why don’t I see “Peaceful Piano” on top of my homepage every night?” ● Zoom into repetitive consumption patterns and habitual behavior. ● Measure if the row position is higher at the right time when applicable. streamrate
  • 25. 01 Motivation for exploration when collecting training data 02 Methods for collection policies and an epsilon greedy example 03 Three examples of simple sanity checks we use in production while navigating the complex ecosystem of the homepage personalization Conclusions
  • 26. Thank you! References: [1] Lihong Li, Wei Chu, John Langford, Robert E. Schapire, A Contextual-Bandit Approach to Personalized News Article Recommendation arXiv preprint arXiv:1003.0146 [2] Rishabh Mehrotra, James McInerney, Hugues Bouchard, Mounia Lalmas, and Fernando Diaz. 2018. Towards a Fair Marketplace: Counterfactual Evaluation of the trade-off between Relevance, Fairness & Satisfaction in Recommendation Systems. CIKM '18. ACM, New York, NY, USA, 2243-2251 [3] Allison J. B. Chaney, Brandon Stewart, and Barbara Engelhardt. 2017. How algorithmic confounding in recommendation systems increases homogeneity and decreases utility. arXiv preprint arXiv:1710.11214 [4] J. McInerney, B. Lacker, S. Hansen, K. Higley, H. Bouchard, A. Gruson, R. Mehrotra. Explore, Exploit, Explain: Personalizing Explainable Recommendations with Bandits. In ACM Conference on Recommender Systems (RecSys), October 2018 [5] Ray Jiang, Silvia Chiappa, Tor Lattimore, Andras Agyorgy, and Pushmeet Kohli. 2019. Degenerate Feedback Loops in Recommender Systems. arXiv:arXiv:1902.10730 [6] Thorsten Joachims, Adith Swaminathan, Tobias Schnabel Unbiased learning from biased user feedback arXiv:arXiv:1608.04468 [7] Fernando Amat, Ashok Chandrashekar, Tony Jebara, and Justin Basilico. 2018. Artwork personalization at netflix. In Proceedings of the 12th ACM Conference on Recommender Systems (RecSys '18). https://www.spotifyjobs.com