Lessons Learned from
Building real-life Recsys
Xavier Amatriain (Quora)
Deepak Agarwal (LinkedIn)
4. Recommendations
@Quora
Our Mission
“To share and grow the world’s
knowledge”
● Millions of questions & answers
● Millions of users
● Thousands of topics
● ...
What we care about
● Relevance
● Quality
● Demand
Data @ Quora
Lots of high-quality textual information
Lots of data relations
Recommendations
@Quora
Recommendations at Quora
● Homepage feed ranking
● Email digest
● Answer ranking
● Topic recommendation
● User recommendation
● Trending Topics
● Automated Topic Labelling
● Related Question
● ...
User actions: click, upvote, downvote, expand, share
Models
● Deep Neural Networks
● Logistic Regression
● Elastic Nets
● Gradient Boosted Decision Trees
● Random Forests
● LambdaMART
● Matrix Factorization
● LDA
● ...
5. Lessons learned
(Part II)
1. Implicit signals beat explicit ones (almost always)
Implicit vs. Explicit
● Many have acknowledged
that implicit feedback is more
useful
● Is implicit feedback really always
more useful?
● If so, why?
● Implicit data is (usually):
○ More dense, and available for all users
○ More representative of user behavior than of
user reflection
○ More related to final objective function
○ Better correlated with AB test results
● E.g. Rating vs watching
Implicit vs. Explicit
● However
○ It is not always the case that
direct implicit feedback correlates
well with long-term retention
○ E.g. clickbait
● Solution:
○ Combine different forms of
implicit + explicit to better represent
long-term goal
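
As a rough sketch of the "Solution" above (not Quora's actual method): an implicit signal such as a click can be blended with explicit signals such as upvotes and downvotes into one training label, so that a clicked-then-downvoted item (e.g. clickbait) ends up with a lower label than a clicked-and-upvoted one. The weights and the `blended_label` helper are hypothetical and only illustrate the idea.

```python
# Hypothetical weights (assumption); in practice they would be tuned
# against a long-term goal such as retention.
WEIGHTS = {"click": 0.2, "upvote": 1.0, "downvote": -1.0}


def blended_label(actions):
    """Combine implicit and explicit actions on one item into a label in [0, 1]."""
    raw = sum(WEIGHTS.get(action, 0.0) for action in set(actions))
    return min(1.0, max(0.0, raw))


if __name__ == "__main__":
    for actions in (["click"], ["click", "upvote"], ["click", "downvote"]):
        print(actions, blended_label(actions))
```

A click alone yields a weak positive, while a downvote cancels the click entirely, which is one simple way to keep clickbait from dominating the training data.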
2. Be thoughtful about your training data
Defining training/testing data
● Training a simple binary classifier for
good/bad answer
○ Defining positive and negative labels ->
Non-trivial task
○ Is this a positive or a negative?
■ funny uninformative answer with many
upvotes
■ short uninformative answer by a well-known
expert in the field
■ very long informative answer that nobody
reads/upvotes
■ informative answer with grammar/spelling
mistakes
■ ...
3. Your model will learn what you teach it to learn
Training a model
● Model will learn according to:
○ Training data (e.g. implicit and explicit)
○ Target function (e.g. probability of user reading an answer)
○ Metric (e.g. precision vs. recall)
● Example 1 (made up):
○ Optimize probability of a user going to the cinema to
watch a movie and rate it “highly” by using purchase history
and previous ratings. Use NDCG of the ranking as final
metric using only movies rated 4 or higher as positives.
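
The metric in Example 1 can be sketched as follows, assuming binary gains (1 for a movie rated 4 or higher, 0 otherwise) and the common log2 position discount. The function names are illustrative, not taken from the slides.

```python
import math


def dcg(gains):
    """Discounted cumulative gain with a log2 position discount."""
    return sum(g / math.log2(pos + 2) for pos, g in enumerate(gains))


def ndcg_at_k(ranked_ratings, k, threshold=4):
    """ranked_ratings: true ratings of the items, in the order the model ranked them.

    Items rated >= threshold count as positives (gain 1), all others as gain 0.
    """
    gains = [1.0 if r >= threshold else 0.0 for r in ranked_ratings]
    ideal = sorted(gains, reverse=True)
    best = dcg(ideal[:k])
    return dcg(gains[:k]) / best if best > 0 else 0.0


if __name__ == "__main__":
    # The model ranked a 2-star movie above a 5-star one.
    print(ndcg_at_k([2, 5], k=2))
```

Changing `threshold` changes what the model is rewarded for, which is the point of the lesson: the choice of positives is part of what you teach the model.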
Example 2 - Quora’s feed
● Training data = implicit + explicit
● Target function: value of showing a story to a user ~ weighted sum of actions:
$v = \sum_a v_a \, \mathbb{1}\{y_a = 1\}$
○ Predict probabilities for each action, then compute the expected value:
$v_{pred} = E[V \mid x] = \sum_a v_a \, p(a \mid x)$
● Metric: any ranking metric
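
A minimal sketch of the expected-value target above: given per-action probabilities p(a | x) from some upstream models, the score of a story is the sum of action values weighted by those probabilities. The action values and story data below are made up for illustration; the slides do not give Quora's actual weights.

```python
# Hypothetical action values v_a (assumption, not from the slides).
ACTION_VALUES = {"click": 1.0, "upvote": 5.0, "share": 10.0, "downvote": -10.0}


def expected_value(action_probs, action_values=ACTION_VALUES):
    """v_pred = sum over actions a of v_a * p(a | x)."""
    return sum(action_values[a] * p for a, p in action_probs.items())


def rank_stories(stories):
    """Sort stories by expected value, highest first."""
    return sorted(stories, key=lambda s: expected_value(s["probs"]), reverse=True)


if __name__ == "__main__":
    stories = [
        {"id": "clickbait", "probs": {"click": 0.9, "downvote": 0.05}},
        {"id": "solid", "probs": {"click": 0.5, "upvote": 0.1}},
    ]
    for story in rank_stories(stories):
        print(story["id"], round(expected_value(story["probs"]), 3))
```

In this toy data the story with the higher click probability ranks lower, because its downvote probability carries a large negative value.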
4. Explanations might matter more than the prediction
Explanation/Support for Recommendations
Social Support
5. If you have to pick one single approach, matrix factorization is your best bet
Matrix Factorization
● MF can be interpreted as
○ Unsupervised:
■ Dimensionality Reduction a la PCA
■ Clustering (e.g. NMF)
○ Supervised:
■ Labeled targets ~ regression
● Very useful variations of MF
○ BPR, ALS, SVD++
○ Tensor Factorization, Factorization Machines
● However...
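
For the supervised reading of MF listed above (labeled targets ~ regression), a minimal sketch is plain SGD on observed ratings with L2 regularization. This is a generic textbook-style formulation, not one of the named variants (BPR, ALS, SVD++), and all hyperparameters are assumptions chosen for a toy example.

```python
import random


def factorize(ratings, n_users, n_items, k=2, lr=0.05, reg=0.02, epochs=500, seed=0):
    """Learn user factors P and item factors Q so that P[u] . Q[i] ~ rating.

    ratings: list of (user_index, item_index, rating) tuples.
    """
    rng = random.Random(seed)
    P = [[rng.uniform(-0.1, 0.1) for _ in range(k)] for _ in range(n_users)]
    Q = [[rng.uniform(-0.1, 0.1) for _ in range(k)] for _ in range(n_items)]
    for _ in range(epochs):
        for u, i, r in ratings:
            err = r - predict(P, Q, u, i)
            for f in range(k):
                pu, qi = P[u][f], Q[i][f]
                # Gradient step on squared error plus L2 penalty.
                P[u][f] += lr * (err * qi - reg * pu)
                Q[i][f] += lr * (err * pu - reg * qi)
    return P, Q


def predict(P, Q, u, i):
    """Predicted rating: dot product of user and item factors."""
    return sum(pu * qi for pu, qi in zip(P[u], Q[i]))


if __name__ == "__main__":
    data = [(0, 0, 5), (0, 1, 3), (1, 0, 4), (1, 2, 1), (2, 1, 2), (2, 2, 5)]
    P, Q = factorize(data, n_users=3, n_items=3)
    print(round(predict(P, Q, 0, 2), 2))  # prediction for an unobserved pair
```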
6. Everything is an ensemble
Ensembles
● Netflix Prize was won by an ensemble
○ Initially Bellkor was using GBDTs
○ BigChaos introduced ANN-based ensemble
● Most practical applications of ML run an ensemble
○ Why wouldn’t you?
○ At least as good as the best of your methods
○ Can add completely different approaches
(e.g. CF and content-based)
○ You can use many different models at the
ensemble layer: LR, GBDTs, RFs, ANNs...
Ensembles & Feature Engineering
● Ensembles are the way to turn any model into a feature!
● E.g. Don’t know if the way to go is to use Factorization
Machines, Tensor Factorization, or RNNs?
○ Treat each model as a “feature”
○ Feed them into an ensemble
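
The "treat each model as a feature" idea above can be sketched as a simple stacking blender: the scores of the base models become the input features of a small logistic regression. The base-model scores below are made-up numbers, and the blender is a bare-bones batch gradient descent, assumed here only for illustration.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_blender(base_scores, labels, lr=0.5, epochs=2000):
    """Fit logistic regression on top of base-model scores.

    base_scores: list of rows, one score per base model; labels: 0 or 1.
    """
    n, n_models = len(labels), len(base_scores[0])
    w, b = [0.0] * n_models, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * n_models, 0.0
        for x, y in zip(base_scores, labels):
            err = sigmoid(b + sum(wi * xi for wi, xi in zip(w, x))) - y
            grad_b += err
            for j, xj in enumerate(x):
                grad_w[j] += err * xj
        b -= lr * grad_b / n
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
    return w, b


def blend(w, b, scores):
    """Ensemble prediction from the scores of the base models."""
    return sigmoid(b + sum(wi * xi for wi, xi in zip(w, scores)))


if __name__ == "__main__":
    # Column 0: e.g. a factorization model score; column 1: e.g. a content model score.
    scores = [[0.9, 0.5], [0.8, 0.1], [0.2, 0.6], [0.1, 0.4], [0.7, 0.9], [0.3, 0.2]]
    labels = [1, 1, 0, 0, 1, 0]
    w, b = fit_blender(scores, labels)
    print(round(blend(w, b, [0.85, 0.3]), 3))
```

The same pattern works with any model at the ensemble layer; logistic regression is used here only because it fits in a few lines.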
The Master Algorithm?
It definitely is an ensemble!
7. Building recommender systems is also about feature engineering
Need for feature engineering
In many cases an understanding of the domain will lead to
optimal results.
Feature Engineering
Feature Engineering Example - Quora Answer Ranking
What is a good Quora answer?
● truthful
● reusable
● provides explanation
● well formatted
● ...
Feature Engineering Example - Quora Answer Ranking
How are those dimensions translated into features?
● Features that relate to the answer quality itself
● Interaction features (upvotes/downvotes, clicks, comments, ...)
● User features (e.g. expertise in topic)
Feature Engineering
● Properties of a well-behaved
ML feature:
○ Reusable
○ Transformable
○ Interpretable
○ Reliable
8. Why you should care about answering questions (about your recsys)
Model debuggability
● Value of a model = value it brings to the product
● Product owners/stakeholders have expectations on
the product
● It is important to be able to answer why
something failed
● Bridge gap between product design and ML algos
● Model debuggability is so important it can
determine:
○ Particular model to use
○ Features to rely on
○ Implementation of tools
Model debuggability
● E.g. Why am I seeing or not seeing
this on my homepage feed?
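
One simple way to answer that kind of question, assuming the ranker is (or can be approximated by) a linear model, is to list the features that contribute most to a story's score. The feature names and weights below are hypothetical; the slides do not describe Quora's debugging tools in this detail.

```python
def explain(weights, features, top_n=3):
    """Return the top_n (feature, contribution) pairs of a linear score.

    contribution = weight * feature value; ranked by absolute size.
    """
    contributions = {
        name: weights.get(name, 0.0) * value for name, value in features.items()
    }
    ranked = sorted(contributions.items(), key=lambda kv: abs(kv[1]), reverse=True)
    return ranked[:top_n]


if __name__ == "__main__":
    # Hypothetical model weights and feature values for one story.
    weights = {"follows_topic": 2.0, "friend_upvoted": 1.5, "story_age_days": -0.25}
    story = {"follows_topic": 1.0, "friend_upvoted": 0.0, "story_age_days": 4.0}
    for name, contribution in explain(weights, story, top_n=2):
        print(name, contribution)
```

This kind of per-feature breakdown is one reason debuggability can influence which model and which features you choose.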
9. Data and models are great. You know what’s even better? The right evaluation approach!
Offline/Online testing process
Executing A/B tests
● Measure differences in metrics across statistically identical
populations that each experience a different algorithm.
● Decisions on the product are always data-driven
● Overall Evaluation Criteria (OEC) = member retention
○ Use long-term metrics whenever possible
○ Short-term metrics can be informative and allow faster decisions
■ But, not always aligned with OEC
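
As a sketch of "measuring differences across statistically identical populations", a two-proportion z-test can compare a binary metric (e.g. whether a member was retained) between control and treatment. This is a standard textbook test chosen here as an assumption; the slides do not say which test is used in practice.

```python
import math


def two_proportion_z(success_a, n_a, success_b, n_b):
    """Two-sided z-test for the difference between two proportions.

    Returns (z, p_value). Assumes both groups are non-empty and the pooled
    rate is strictly between 0 and 1.
    """
    p_a, p_b = success_a / n_a, success_b / n_b
    pooled = (success_a + success_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    # Two-sided p-value under the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


if __name__ == "__main__":
    # Made-up counts: retained members out of exposed members per bucket.
    z, p = two_proportion_z(success_a=400, n_a=1000, success_b=500, n_b=1000)
    print(round(z, 2), p)
```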
Offline testing
● Measure model performance,
using (IR) metrics
● Offline performance = indication
to make decisions on follow-up
A/B tests
● A critical (and mostly unsolved)
issue is how offline metrics
correlate with A/B test results.
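
For the offline side, two of the simplest IR metrics are precision and recall at a cutoff k. The sketch below assumes a ranked list of item ids and a set of items known to be relevant for the user; the names are illustrative.

```python
def precision_recall_at_k(ranked_items, relevant, k):
    """Precision@k and recall@k for one user.

    ranked_items: item ids in model order; relevant: set of relevant item ids.
    """
    top_k = ranked_items[:k]
    hits = sum(1 for item in top_k if item in relevant)
    precision = hits / k if k else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


if __name__ == "__main__":
    print(precision_recall_at_k(["a", "b", "c", "d"], {"a", "d", "e"}, k=2))
```

A model can improve on such a metric offline and still lose the A/B test, which is the unsolved correlation issue mentioned above.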
10. You don’t need to distribute your recsys
Distributing Recommender Systems
● Most of what people do in practice can fit
into a multi-core machine
○ As long as you use:
■ Smart data sampling
■ Offline schemes
■ Efficient parallel code
● (... but not Deep ANNs)
● Do you care about costs? How about latencies or
system complexity/debuggability?
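
"Smart data sampling" is not spelled out in the slides. One common interpretation, assumed here, is to keep every positive example and only a random fraction of the far more numerous negatives, giving each kept negative a weight of 1/keep_rate so that weighted counts stay roughly unbiased. This can shrink a training set enough to fit on one machine.

```python
import random


def downsample_negatives(examples, keep_rate, seed=0):
    """Keep all positives and a random fraction of the negatives.

    examples: list of (features, label) with label 1 or 0.
    Returns a list of (features, label, weight).
    """
    rng = random.Random(seed)
    sampled = []
    for features, label in examples:
        if label == 1:
            sampled.append((features, label, 1.0))
        elif rng.random() < keep_rate:
            # Upweight kept negatives to compensate for the dropped ones.
            sampled.append((features, label, 1.0 / keep_rate))
    return sampled


if __name__ == "__main__":
    data = [({"id": i}, 1 if i % 10 == 0 else 0) for i in range(1000)]
    sample = downsample_negatives(data, keep_rate=0.25)
    print(len(data), "->", len(sample))
```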
Matrix Factorization Example
Conclusions
● Recommender Systems are about much more than
just predicting a rating
● Designing a “real-life” recsys means paying
attention to issues such as:
○ Feature engineering
○ Training dataset
○ Metrics
○ Experimentation and AB Testing
○ System scalability
○ ...
● Lots of room for improvement & research
Questions?
Xavier Amatriain (Quora)
xavier@amatriain.net
Deepak Agarwal (LinkedIn)
dagarwal@linkedin.com

Recsys 2014 Tutorial - The Recommender Problem RevisitedRecsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem Revisited
 
Kdd 2014 Tutorial - the recommender problem revisited
Kdd 2014 Tutorial -  the recommender problem revisitedKdd 2014 Tutorial -  the recommender problem revisited
Kdd 2014 Tutorial - the recommender problem revisited
 
MMDS 2014 Talk - Distributing ML Algorithms: from GPUs to the Cloud
MMDS 2014 Talk - Distributing ML Algorithms: from GPUs to the CloudMMDS 2014 Talk - Distributing ML Algorithms: from GPUs to the Cloud
MMDS 2014 Talk - Distributing ML Algorithms: from GPUs to the Cloud
 
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix ScaleQcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
 

KĂŒrzlich hochgeladen

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vĂĄzquez
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 

KĂŒrzlich hochgeladen (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 

Recsys 2016 tutorial: Lessons learned from building real-life recommender systems

  • 1. Lessons Learned from Building real-life Recsys Xavier Amatriain (Quora) Deepak Agarwal (LinkedIn)
  • 3. Our Mission “To share and grow the world’s knowledge” • Millions of questions & answers • Millions of users • Thousands of topics • ...
  • 4. Demand What we care about Quality Relevance
  • 6. Lots of high-quality textual information
  • 7. Lots of data relations
  • 9. Recommendations at Quora ● Homepage feed ranking ● Email digest ● Answer ranking ● Topic recommendation ● User recommendation ● Trending Topics ● Automated Topic Labelling ● Related Question ● ... (Figure labels: click, upvote, downvote, expand, share)
  • 10. Models ● Deep Neural Networks ● Logistic Regression ● Elastic Nets ● Gradient Boosted Decision Trees ● Random Forests ● LambdaMART ● Matrix Factorization ● LDA ● ...
  • 13. Implicit vs. Explicit ● Many have acknowledged that implicit feedback is more useful ● Is implicit feedback really always more useful? ● If so, why?
  • 14. Implicit vs. Explicit ● Implicit data is (usually): ○ More dense, and available for all users ○ More representative of user behavior vs. user reflection ○ More related to the final objective function ○ Better correlated with A/B test results ● E.g. rating vs. watching
  • 15. Implicit vs. Explicit ● However: ○ Direct implicit feedback does not always correlate well with long-term retention ○ E.g. clickbait ● Solution: ○ Combine different forms of implicit + explicit feedback to better represent the long-term goal
  • 17. Defining training/testing data ● Training a simple binary classifier for good/bad answer ○ Defining positive and negative labels -> Non-trivial task ○ Is this a positive or a negative? ■ funny uninformative answer with many upvotes ■ short uninformative answer by a well-known expert in the field ■ very long informative answer that nobody reads/upvotes ■ informative answer with grammar/spelling mistakes ■ ...
  • 19. Training a model ● Model will learn according to: ○ Training data (e.g. implicit and explicit) ○ Target function (e.g. probability of user reading an answer) ○ Metric (e.g. precision vs. recall) ● Example 1 (made up): ○ Optimize probability of a user going to the cinema to watch a movie and rate it “highly” by using purchase history and previous ratings. Use NDCG of the ranking as final metric using only movies rated 4 or higher as positives.
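The metric in Example 1 can be sketched in a few lines. This is a minimal illustration, assuming binary gains (a movie rated 4 or higher counts as a positive) and the common log2 rank discount; the function names `dcg` and `ndcg_at_k` are hypothetical and not taken from the slides.

```python
import math

def dcg(gains):
    """Discounted cumulative gain for gains listed in ranked order."""
    return sum(g / math.log2(rank + 2) for rank, g in enumerate(gains))

def ndcg_at_k(ranked_ratings, k, positive_threshold=4):
    """NDCG@k with binary gains: rating >= positive_threshold is a positive."""
    gains = [1.0 if r >= positive_threshold else 0.0 for r in ranked_ratings]
    ideal_dcg = dcg(sorted(gains, reverse=True)[:k])
    if ideal_dcg == 0.0:
        return 0.0  # no positives at all: define NDCG as 0
    return dcg(gains[:k]) / ideal_dcg

# Ratings of the movies in the order the model ranked them (made-up data).
model_ranking = [5, 2, 4, 3, 1]
score = ndcg_at_k(model_ranking, k=5)
print(round(score, 3))
```

Because only the threshold decides what is a positive, moving it from 4 to 3 changes the metric without changing the model, which is one reason the slide treats the metric as part of the training setup.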
  • 20. Example 2 - Quora’s feed ● Training data = implicit + explicit ● Target function: Value of showing a story to a user ~ weighted sum of actions: $v = \sum_a v_a \, \mathbb{1}\{y_a = 1\}$ ○ predict probabilities for each action, then compute expected value: $v_{pred} = E[V \mid x] = \sum_a v_a \, p(a \mid x)$ ● Metric: any ranking metric
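The expected-value target on this slide can be sketched as follows. The per-action weights and the predicted probabilities are made-up numbers for illustration only; the slides do not give Quora's actual values, and `ACTION_VALUES`, `expected_value` and `rank_stories` are hypothetical names.

```python
# Hypothetical value v_a assigned to each action a (not from the slides).
ACTION_VALUES = {"click": 1.0, "upvote": 5.0, "share": 10.0, "downvote": -8.0}

def expected_value(action_probs, action_values=ACTION_VALUES):
    """v_pred = sum over actions a of v_a * p(a | x)."""
    return sum(action_values[a] * p for a, p in action_probs.items())

def rank_stories(candidates):
    """Sort (story_id, action_probs) pairs by expected value, best first."""
    return sorted(candidates, key=lambda s: expected_value(s[1]), reverse=True)

# Predicted action probabilities per story, as a per-action model might output.
candidates = [
    ("clickbait", {"click": 0.9, "downvote": 0.2}),
    ("solid_answer", {"click": 0.3, "upvote": 0.2}),
]
ranked = rank_stories(candidates)
print([story_id for story_id, _ in ranked])
```

With these assumed weights the high-click story ranks below the one that is more likely to be upvoted, which is the effect of combining several implicit and explicit signals instead of optimizing clicks alone.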
  • 24. Matrix Factorization ● MF can be interpreted as ○ Unsupervised: ■ Dimensionality Reduction a la PCA ■ Clustering (e.g. NMF) ○ Supervised: ■ Labeled targets ~ regression ● Very useful variations of MF ○ BPR, ALS, SVD++ ○ Tensor Factorization, Factorization Machines ● However...
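As a rough illustration of the supervised (regression-like) reading of MF, the sketch below fits user and item factors to observed ratings with plain SGD and L2 regularization. It is a toy under stated assumptions, not one of the named variants (BPR, ALS, SVD++); the data, hyperparameters and function names are hypothetical.

```python
import math
import random

def train_mf(ratings, n_users, n_items, n_factors=2,
             lr=0.02, reg=0.02, epochs=300, seed=0):
    """Fit r_ui ~ dot(P[u], Q[i]) on observed (user, item, rating) triples."""
    rng = random.Random(seed)
    P = [[rng.uniform(-0.1, 0.1) for _ in range(n_factors)] for _ in range(n_users)]
    Q = [[rng.uniform(-0.1, 0.1) for _ in range(n_factors)] for _ in range(n_items)]
    for _ in range(epochs):
        for u, i, r in ratings:
            err = r - predict(P, Q, u, i)
            for f in range(n_factors):
                pu, qi = P[u][f], Q[i][f]
                # Gradient step on squared error plus L2 penalty.
                P[u][f] += lr * (err * qi - reg * pu)
                Q[i][f] += lr * (err * pu - reg * qi)
    return P, Q

def predict(P, Q, u, i):
    """Predicted rating: dot product of user and item factor vectors."""
    return sum(pu * qi for pu, qi in zip(P[u], Q[i]))

def rmse(ratings, P, Q):
    """Root mean squared error over the given triples."""
    return math.sqrt(sum((r - predict(P, Q, u, i)) ** 2 for u, i, r in ratings)
                     / len(ratings))

# Made-up ratings: (user index, item index, rating on a 1-5 scale).
ratings = [(0, 0, 5), (0, 1, 4), (0, 2, 1), (1, 0, 4),
           (1, 2, 1), (2, 1, 2), (2, 2, 5)]
P, Q = train_mf(ratings, n_users=3, n_items=3)
print(rmse(ratings, P, Q))
```

The learned rows of `P` and `Q` are also low-dimensional embeddings, which is the unsupervised, dimensionality-reduction reading mentioned on the slide.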
  • 26. Ensembles ● Netflix Prize was won by an ensemble ○ Initially BellKor was using GBDTs ○ BigChaos introduced ANN-based ensemble ● Most practical applications of ML run an ensemble ○ Why wouldn’t you? ○ At least as good as the best of your methods ○ Can add completely different approaches (e.g. CF and content-based) ○ You can use many different models at the ensemble layer: LR, GBDTs, RFs, ANNs...
  • 27. Ensembles & Feature Engineering ● Ensembles are the way to turn any model into a feature! ● E.g. Don’t know if the way to go is to use Factorization Machines, Tensor Factorization, or RNNs? ○ Treat each model as a “feature” ○ Feed them into an ensemble
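One way to read "treat each model as a feature" is stacking: the scores of the base models become the input vector of a small model at the ensemble layer. The sketch below uses logistic regression trained by batch gradient descent as that layer; the base-model scores are made-up numbers and all names are hypothetical.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic_ensemble(model_scores, labels, lr=0.5, epochs=500):
    """Learn one weight per base model plus a bias, minimizing log loss."""
    n_models = len(model_scores[0])
    weights, bias, n = [0.0] * n_models, 0.0, len(labels)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * n_models, 0.0
        for x, y in zip(model_scores, labels):
            p = sigmoid(bias + sum(w * s for w, s in zip(weights, x)))
            for j, s in enumerate(x):
                grad_w[j] += (p - y) * s
            grad_b += p - y
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias

def ensemble_predict(weights, bias, scores):
    """Blend the base-model scores for one example into a probability."""
    return sigmoid(bias + sum(w * s for w, s in zip(weights, scores)))

# Each row: (score from model A, score from model B) for one example.
# In this toy data model A tracks the label and model B does not.
model_scores = [(0.9, 0.4), (0.8, 0.6), (0.7, 0.5),
                (0.3, 0.5), (0.2, 0.6), (0.1, 0.4)]
labels = [1, 1, 1, 0, 0, 0]
weights, bias = fit_logistic_ensemble(model_scores, labels)
print(weights, bias)
```

In practice the ensemble layer should be fit on held-out predictions rather than on the data the base models were trained on, otherwise it tends to overweight whichever base model overfits most.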
  • 28. The Master Algorithm? It definitely is an ensemble!
  • 30. Need for feature engineering ● In many cases an understanding of the domain will lead to optimal results.
  • 31. Feature Engineering Example - Quora Answer Ranking What is a good Quora answer? • truthful • reusable • provides explanation • well formatted • ...
  • 32. Feature Engineering Example - Quora Answer Ranking How are those dimensions translated into features? • Features that relate to the answer quality itself • Interaction features (upvotes/downvotes, clicks, comments…) • User features (e.g. expertise in topic)
  • 33. Feature Engineering ● Properties of a well-behaved ML feature: ○ Reusable ○ Transformable ○ Interpretable ○ Reliable
  • 35. Model debuggability ● Value of a model = value it brings to the product ● Product owners/stakeholders have expectations on the product ● It is important to be able to answer why something failed ● Bridge gap between product design and ML algos ● Model debuggability is so important it can determine: ○ Particular model to use ○ Features to rely on ○ Implementation of tools
  • 36. Model debuggability ● E.g. Why am I seeing or not seeing this on my homepage feed?
  • 39. Executing A/B tests ● Measure differences in metrics across statistically identical populations that each experience a different algorithm. ● Decisions on the product are always data-driven ● Overall Evaluation Criteria (OEC) = member retention ○ Use long-term metrics whenever possible ○ Short-term metrics can be informative and allow faster decisions ■ But they are not always aligned with the OEC
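A simple way to check whether a retention difference between two buckets is more than noise is a two-proportion z-test. The sketch below is a generic statistical illustration, not the procedure used by the authors; the counts are made up and the function name is hypothetical.

```python
import math

def two_proportion_z_test(successes_a, n_a, successes_b, n_b):
    """Return (z, two-sided p-value) for the difference in rates B minus A."""
    p_a = successes_a / n_a
    p_b = successes_b / n_b
    pooled = (successes_a + successes_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    # Two-sided tail probability of a standard normal.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value

# Hypothetical experiment: retained members out of members exposed.
z, p_value = two_proportion_z_test(successes_a=4000, n_a=10000,   # control
                                   successes_b=4200, n_b=10000)   # treatment
print(z, p_value)
```

The test assumes the two populations are independent and comparable, which is what "statistically identical populations" on the slide provides through random assignment.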
  • 40. Offline testing ● Measure model performance, using (IR) metrics ● Offline performance = indication to make decisions on follow-up A/B tests ● A critical (and mostly unsolved) issue is how offline metrics correlate with A/B test results.
  • 42. Distributing Recommender Systems ● Most of what people do in practice can fit into a multi-core machine ○ As long as you use: ■ Smart data sampling ■ Offline schemes ■ Efficient parallel code ● (… but not Deep ANNs) ● Do you care about costs? How about latencies or system complexity/debuggability?
  • 45. ● Recommender Systems are about much more than just predicting a rating ● Designing a “real-life” recsys means paying attention to issues such as: ○ Feature engineering ○ Training dataset ○ Metrics ○ Experimentation and A/B testing ○ System scalability ○ ... ● Lots of room for improvement & research
  • 46. Questions? Xavier Amatriain (Quora) xavier@amatriain.net Deepak Agarwal (LinkedIn) dagarwal@linkedin.com