SlideShare ist ein Scribd-Unternehmen logo
1 von 41
PRACTICAL BANDITS
FOR BUSINESS
Yan Xu
Houston Machine Learning Meetup
June 22, 2019
OUTLINE
- Recap on Bandit Problem
- A Contextual-Bandit Approach to Personalized News
Article Recommendation
http://rob.schapire.net/papers/www10.pdf
- An efficient bandit algorithm for realtime multivariate
optimization
https://www.kdd.org/kdd2017/papers/view/an-
efficient-bandit-algorithm-for-realtime-multivariate-
optimization
MULTI-ARMED BANDITS
DILEMMA: EXPLORATION VS.
EXPLOITATION
The exploration/exploitation trade-off is a dilemma we
frequently face in choosing between options.
Stay the same route to drive home, or try a new route?
Choose your favorite restaurant, or the new one?
Listen to your favorite music channel, or try a new artist?
Attend a new meetup?
HOW TO RESOLVE THE
DILEMMA
https://pavlov.tech/2019/03/02/animated-multi-
armed-bandit-policies/
Epsilon Greedy
UCB (Upper Confidence Bound)
Thompson Sampling
REWARD AND REGRET
REWARD AND REGRET
REWARD AND REGRET
MULTI-ARMED BANDITS
FORMULATION
PRACTICAL BANDITS
APPLICATION
BANDITS FOR PERSONALIZED
RECOMMENDATION
BANDITS FOR NEWS
RECOMMENDATION
CONTEXTUAL BANDITS
CONTEXTUAL BANDITS
CONTEXTUAL BANDITS
[0.1, 0.6]
[0.6, 0.4]
[0.7, 0.1]
[0.4, 0.2]
LINUCB ALGORITHM
LINUCB ALGORITHM
LINEAR DISJOINT MODEL
LINEAR DISJOINT MODEL
UPPER BOUND ILLUSTRATION
FEATURE FREE VS LINEAR
CONTEXTUAL BANDIT
BANDITS EVALUATION
BANDITS EVALUATION
BANDITS EVALUATION
BANDITS EVALUATION
DEALING WITH HIGH
DIMENSIONALITY
1000 binary features per user; ~100 binary feature per article
DEALING WITH HIGH
DIMENSIONALITY
DEALING WITH HIGH
DIMENSIONALITY
RESULT: PERSONALIZED
NEWS
Omniscient: always chooses the article with highest empirical
CONCLUSION
AMAZON: BANDITS FOR
MULTIVARIATE OPTIMIZATION
Published at KDD 2017, KDD 2019 is in Alaska!
AMAZON: BANDITS FOR
MULTIVARIATE OPTIMIZATION
OPTIMIZING WEB LAYOUT
PROBLEM FORMULATION
STEP 1: PROBIT REGRESSION
STEP 2: THOMPSON
SAMPLING
STEP 3: HILLING-CLIMBING TO
DECIDE
SIMULATION RESULT
SIMULATION RESULT
Control widget interaction in simulation
through alpha_2.
EXPERIMENT ON REAL
TRAFFIC
‱ After only a single week of online
optimization, we saw a 21%
conversion increase compared to the
median layout
SUMMARY
Contextual bandits
 Linear payoff
 Add interaction components
 UCB: Variance estimation of expected rewards
 Thompson sampling: Sample weights from posterior distribution
Applications
 Recommendation
 Multi-variate optimization
For more details
 - A Contextual-Bandit Approach to Personalized News Article
Recommendation
http://rob.schapire.net/papers/www10.pdf
- An efficient bandit algorithm for realtime multivariate
optimization
https://www.kdd.org/kdd2017/papers/view/an-efficient-
bandit-algorithm-for-realtime-multivariate-optimization

Weitere Àhnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

ćŠčæžœăźă‚ă‚‹ă‚ŻăƒȘă‚šă‚€ăƒ†ă‚Łăƒ–ćșƒć‘ŠăźèŠ‹ă€ă‘æ–č(Contextual Bandit + TS or UCB)
ćŠčæžœăźă‚ă‚‹ă‚ŻăƒȘă‚šă‚€ăƒ†ă‚Łăƒ–ćșƒć‘ŠăźèŠ‹ă€ă‘æ–č(Contextual Bandit + TS or UCB)ćŠčæžœăźă‚ă‚‹ă‚ŻăƒȘă‚šă‚€ăƒ†ă‚Łăƒ–ćșƒć‘ŠăźèŠ‹ă€ă‘æ–č(Contextual Bandit + TS or UCB)
ćŠčæžœăźă‚ă‚‹ă‚ŻăƒȘă‚šă‚€ăƒ†ă‚Łăƒ–ćșƒć‘ŠăźèŠ‹ă€ă‘æ–č(Contextual Bandit + TS or UCB)
 
TensorFlow XLAは、 äž­ă§äœ•ă‚’ă‚„ăŁăŠă„ă‚‹ăźă‹ïŒŸ
TensorFlow XLAは、 äž­ă§äœ•ă‚’ă‚„ăŁăŠă„ă‚‹ăźă‹ïŒŸTensorFlow XLAは、 äž­ă§äœ•ă‚’ă‚„ăŁăŠă„ă‚‹ăźă‹ïŒŸ
TensorFlow XLAは、 äž­ă§äœ•ă‚’ă‚„ăŁăŠă„ă‚‹ăźă‹ïŒŸ
 
捔èȘżăƒ•ă‚Łăƒ«ă‚żăƒȘăƒłă‚°ć…„é–€
捔èȘżăƒ•ă‚Łăƒ«ă‚żăƒȘăƒłă‚°ć…„é–€ć”èȘżăƒ•ă‚Łăƒ«ă‚żăƒȘăƒłă‚°ć…„é–€
捔èȘżăƒ•ă‚Łăƒ«ă‚żăƒȘăƒłă‚°ć…„é–€
 
【DLèŒȘèȘ­äŒšă€‘Llama 2: Open Foundation and Fine-Tuned Chat Models
【DLèŒȘèȘ­äŒšă€‘Llama 2: Open Foundation and Fine-Tuned Chat Models【DLèŒȘèȘ­äŒšă€‘Llama 2: Open Foundation and Fine-Tuned Chat Models
【DLèŒȘèȘ­äŒšă€‘Llama 2: Open Foundation and Fine-Tuned Chat Models
 
ćźŸç€ŸäŒšăƒ»ćźŸç’°ćąƒă«ăŠă‘ă‚‹ăƒ­ăƒœăƒƒăƒˆăźæ©Ÿæą°ć­Šçż’ ver. 2
ćźŸç€ŸäŒšăƒ»ćźŸç’°ćąƒă«ăŠă‘ă‚‹ăƒ­ăƒœăƒƒăƒˆăźæ©Ÿæą°ć­Šçż’ ver. 2ćźŸç€ŸäŒšăƒ»ćźŸç’°ćąƒă«ăŠă‘ă‚‹ăƒ­ăƒœăƒƒăƒˆăźæ©Ÿæą°ć­Šçż’ ver. 2
ćźŸç€ŸäŒšăƒ»ćźŸç’°ćąƒă«ăŠă‘ă‚‹ăƒ­ăƒœăƒƒăƒˆăźæ©Ÿæą°ć­Šçż’ ver. 2
 
Approximate nearest neighbor methods and vector models – NYC ML meetup
Approximate nearest neighbor methods and vector models – NYC ML meetupApproximate nearest neighbor methods and vector models – NYC ML meetup
Approximate nearest neighbor methods and vector models – NYC ML meetup
 
LightGBM: a highly efficient gradient boosting decision tree
LightGBM: a highly efficient gradient boosting decision treeLightGBM: a highly efficient gradient boosting decision tree
LightGBM: a highly efficient gradient boosting decision tree
 
Bandit algorithm overview_and_practice
Bandit algorithm overview_and_practiceBandit algorithm overview_and_practice
Bandit algorithm overview_and_practice
 
ć€šæ§˜ăȘćŒ·ćŒ–ć­Šçż’ăźæŠ‚ćż”ăšèȘČ題èȘè­˜
ć€šæ§˜ăȘćŒ·ćŒ–ć­Šçż’ăźæŠ‚ćż”ăšèȘČ題èȘè­˜ć€šæ§˜ăȘćŒ·ćŒ–ć­Šçż’ăźæŠ‚ćż”ăšèȘČ題èȘè­˜
ć€šæ§˜ăȘćŒ·ćŒ–ć­Šçż’ăźæŠ‚ćż”ăšèȘČ題èȘè­˜
 
BERT+XLNet+RoBERTa
BERT+XLNet+RoBERTaBERT+XLNet+RoBERTa
BERT+XLNet+RoBERTa
 
æ€œçŽąè©•äŸĄăƒ„ăƒŒăƒ«ă‚­ăƒƒăƒˆNTCIREVALă‚’ç”šă„ăŸæ§˜ă€…ăȘæƒ…ć ±ă‚ąă‚Żă‚»ă‚čæŠ€èĄ“ăźè©•äŸĄæ–čæł•
æ€œçŽąè©•äŸĄăƒ„ăƒŒăƒ«ă‚­ăƒƒăƒˆNTCIREVALă‚’ç”šă„ăŸæ§˜ă€…ăȘæƒ…ć ±ă‚ąă‚Żă‚»ă‚čæŠ€èĄ“ăźè©•äŸĄæ–čæł•æ€œçŽąè©•äŸĄăƒ„ăƒŒăƒ«ă‚­ăƒƒăƒˆNTCIREVALă‚’ç”šă„ăŸæ§˜ă€…ăȘæƒ…ć ±ă‚ąă‚Żă‚»ă‚čæŠ€èĄ“ăźè©•äŸĄæ–čæł•
æ€œçŽąè©•äŸĄăƒ„ăƒŒăƒ«ă‚­ăƒƒăƒˆNTCIREVALă‚’ç”šă„ăŸæ§˜ă€…ăȘæƒ…ć ±ă‚ąă‚Żă‚»ă‚čæŠ€èĄ“ăźè©•äŸĄæ–čæł•
 
CatBoost on GPU たăČみ぀
CatBoost on GPU たăČみ぀CatBoost on GPU たăČみ぀
CatBoost on GPU たăČみ぀
 
Optunaă‚’äœżăŁăŸHuman-in-the-loopæœ€é©ćŒ–ăźçŽč介 - 2023/04/27 W&B 東äșŹăƒŸăƒŒăƒˆă‚ąăƒƒăƒ— #3
Optunaă‚’äœżăŁăŸHuman-in-the-loopæœ€é©ćŒ–ăźçŽč介 - 2023/04/27 W&B 東äșŹăƒŸăƒŒăƒˆă‚ąăƒƒăƒ— #3Optunaă‚’äœżăŁăŸHuman-in-the-loopæœ€é©ćŒ–ăźçŽč介 - 2023/04/27 W&B 東äșŹăƒŸăƒŒăƒˆă‚ąăƒƒăƒ— #3
Optunaă‚’äœżăŁăŸHuman-in-the-loopæœ€é©ćŒ–ăźçŽč介 - 2023/04/27 W&B 東äșŹăƒŸăƒŒăƒˆă‚ąăƒƒăƒ— #3
 
[DLèŒȘèȘ­äŒš]Reward Augmented Maximum Likelihood for Neural Structured Prediction
[DLèŒȘèȘ­äŒš]Reward Augmented Maximum Likelihood for Neural Structured Prediction[DLèŒȘèȘ­äŒš]Reward Augmented Maximum Likelihood for Neural Structured Prediction
[DLèŒȘèȘ­äŒš]Reward Augmented Maximum Likelihood for Neural Structured Prediction
 
[DLèŒȘèȘ­äŒš]Unsupervised Learning by Predicting Noise
[DLèŒȘèȘ­äŒš]Unsupervised Learning by Predicting Noise[DLèŒȘèȘ­äŒš]Unsupervised Learning by Predicting Noise
[DLèŒȘèȘ­äŒš]Unsupervised Learning by Predicting Noise
 
ćźŸè·”ć€šă‚Żăƒ©ă‚č戆類 Kaggle Ottoから歩んだこべ
ćźŸè·”ć€šă‚Żăƒ©ă‚č戆類 Kaggle Ottoă‹ă‚‰ć­Šă‚“ă ă“ăšćźŸè·”ć€šă‚Żăƒ©ă‚č戆類 Kaggle Ottoから歩んだこべ
ćźŸè·”ć€šă‚Żăƒ©ă‚č戆類 Kaggle Ottoから歩んだこべ
 
ăƒ™ă‚€ă‚șæœ€é©ćŒ–
ăƒ™ă‚€ă‚șæœ€é©ćŒ–ăƒ™ă‚€ă‚șæœ€é©ćŒ–
ăƒ™ă‚€ă‚șæœ€é©ćŒ–
 
Kaggleぼテクニック
KaggleぼテクニックKaggleぼテクニック
Kaggleぼテクニック
 
æœ€èż‘ăźKaggleă«ć­Šă¶ăƒ†ăƒŒăƒ–ăƒ«ăƒ‡ăƒŒă‚żăźç‰čćŸŽé‡ă‚šăƒłă‚žăƒ‹ă‚ąăƒȘング
æœ€èż‘ăźKaggleă«ć­Šă¶ăƒ†ăƒŒăƒ–ăƒ«ăƒ‡ăƒŒă‚żăźç‰čćŸŽé‡ă‚šăƒłă‚žăƒ‹ă‚ąăƒȘăƒłă‚°æœ€èż‘ăźKaggleă«ć­Šă¶ăƒ†ăƒŒăƒ–ăƒ«ăƒ‡ăƒŒă‚żăźç‰čćŸŽé‡ă‚šăƒłă‚žăƒ‹ă‚ąăƒȘング
æœ€èż‘ăźKaggleă«ć­Šă¶ăƒ†ăƒŒăƒ–ăƒ«ăƒ‡ăƒŒă‚żăźç‰čćŸŽé‡ă‚šăƒłă‚žăƒ‹ă‚ąăƒȘング
 
Calibrated Recommendations
Calibrated RecommendationsCalibrated Recommendations
Calibrated Recommendations
 

Mehr von Yan Xu

Mehr von Yan Xu (20)

Kaggle winning solutions: Retail Sales Forecasting
Kaggle winning solutions: Retail Sales ForecastingKaggle winning solutions: Retail Sales Forecasting
Kaggle winning solutions: Retail Sales Forecasting
 
Basics of Dynamic programming
Basics of Dynamic programming Basics of Dynamic programming
Basics of Dynamic programming
 
Walking through Tensorflow 2.0
Walking through Tensorflow 2.0Walking through Tensorflow 2.0
Walking through Tensorflow 2.0
 
Introduction to Multi-armed Bandits
Introduction to Multi-armed BanditsIntroduction to Multi-armed Bandits
Introduction to Multi-armed Bandits
 
A Data-Driven Question Generation Model for Educational Content - by Jack Wang
A Data-Driven Question Generation Model for Educational Content - by Jack WangA Data-Driven Question Generation Model for Educational Content - by Jack Wang
A Data-Driven Question Generation Model for Educational Content - by Jack Wang
 
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
 
Deep Hierarchical Profiling & Pattern Discovery: Application to Whole Brain R...
Deep Hierarchical Profiling & Pattern Discovery: Application to Whole Brain R...Deep Hierarchical Profiling & Pattern Discovery: Application to Whole Brain R...
Deep Hierarchical Profiling & Pattern Discovery: Application to Whole Brain R...
 
Detecting anomalies on rotating equipment using Deep Stacked Autoencoders - b...
Detecting anomalies on rotating equipment using Deep Stacked Autoencoders - b...Detecting anomalies on rotating equipment using Deep Stacked Autoencoders - b...
Detecting anomalies on rotating equipment using Deep Stacked Autoencoders - b...
 
Introduction to Autoencoders
Introduction to AutoencodersIntroduction to Autoencoders
Introduction to Autoencoders
 
State of enterprise data science
State of enterprise data scienceState of enterprise data science
State of enterprise data science
 
Long Short Term Memory
Long Short Term MemoryLong Short Term Memory
Long Short Term Memory
 
Deep Feed Forward Neural Networks and Regularization
Deep Feed Forward Neural Networks and RegularizationDeep Feed Forward Neural Networks and Regularization
Deep Feed Forward Neural Networks and Regularization
 
Linear algebra and probability (Deep Learning chapter 2&3)
Linear algebra and probability (Deep Learning chapter 2&3)Linear algebra and probability (Deep Learning chapter 2&3)
Linear algebra and probability (Deep Learning chapter 2&3)
 
HML: Historical View and Trends of Deep Learning
HML: Historical View and Trends of Deep LearningHML: Historical View and Trends of Deep Learning
HML: Historical View and Trends of Deep Learning
 
Secrets behind AlphaGo
Secrets behind AlphaGoSecrets behind AlphaGo
Secrets behind AlphaGo
 
Optimization in Deep Learning
Optimization in Deep LearningOptimization in Deep Learning
Optimization in Deep Learning
 
Introduction to Recurrent Neural Network
Introduction to Recurrent Neural NetworkIntroduction to Recurrent Neural Network
Introduction to Recurrent Neural Network
 
Convolutional neural network
Convolutional neural network Convolutional neural network
Convolutional neural network
 
Introduction to Neural Network
Introduction to Neural NetworkIntroduction to Neural Network
Introduction to Neural Network
 
Nonlinear dimension reduction
Nonlinear dimension reductionNonlinear dimension reduction
Nonlinear dimension reduction
 

KĂŒrzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

KĂŒrzlich hochgeladen (20)

🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

Practical contextual bandits for business