SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
Volodymyrk
Bayesian Model Averaging
Bayesian Mixer, 27.09.2016
London, UK
Volodymyrk
Bayesian Model Averaging (BMA) - 1 minute version
New Project - how much does it worth?
CFO VP of Growth
Net Present Value: $50m $100m
Model M1
Model M2
30%CEO belief:
after evaluating both
models and market
data
70%
$15m + $70m = $85m
K = 2
Volodymyrk
Bayesian Model Averaging (BMA) - 3 minute version
VP of Growth
CLV assumptions
$10 $12 $15
CAC
$4 72 129 149
$6 62 112 133
$8 51 92 101
Average= $100.11m
Sensitivity Analysis for M2
DATA
Volodymyrk
Bayesian Model Averaging (BMA) - 5 minute version
Bayesian Model Averaging: A Tutorial
Jennifer A. Hoeting, David Madigan, Adrian E. Raftery and Chris T. Volinsky
How much do you trust your
VP and CFO, before you look
at models?
Scary normalising term
that you can ignore
Prior probability for
model parameter
Volodymyrk
Bayesian answer to overfitting
Frequentist:
- model selection
- regularisation
Bayesian:
- BMA
- marginalisation
Volodymyrk
Case Study
You just get the best job in the galaxy
Volodymyrk
Your new Boss Business domain Modelling case
Always test your models on synthetic data that you understand and control
Volodymyrk
Use Cases:
- Fraud Detection
- Inventory Sourcing
Data
Volodymyrk
Modelling goals
- Prediction range is needed, so that you can identify fraudulent transactions
(sand people under-reporting real transaction size and pocketing profit)
- Sale price should be easily explainable, as a function of various Droid Features
so that Jabba can invest in appropriate scavenging/sourcing projects
- You want lowest prediction error possible
so that you are not feeded to Sarlacc
Volodymyrk
Data Generation
Class-1
Class-2
Class-3
Class-4
durability
circuitry
height
weight
price
...
age
Volodymyrk
Data Collection
Volodymyrk
Model Selection - classical method
credits ~ height + weight + power + dents + rad + wheels + legs + red + blue + black + temperature + lat + long + ir_emit + dents_log + height_log + weight_log + power_log + rad_log
Adj. R2: 0.884974385182
Volodymyrk
Model Selection - backward elimination
Volodymyrk
Final Model
credits ~ weight + power + dents + rad + wheels + blue + black + temperature + lat + dents_log + height_log + weight_log + power_log
Adj. R2: 0.903544333611
Volodymyrk
Model Evaluation (out-of-sample)
Volodymyrk
Ridge regression (L2 regularisation)
Volodymyrk
Bayesian Model Averaging for Linear Models - a special case
Inclusion probability for (regression coefficients) are weighted across all possible models
Number of models = combinations of all K features (include/exclude) = 2K
Volodymyrk
How to actually do BMA? (in R)
cran.r-project.org/web/packages/BMA cran.r-project.org/web/packages/BAScran.r-project.org/web/packages/BMS
Mature. A.k.a. “the original”
Developed by PhD duringresearch. Not maintained
Newest. Maintained by Chair
of the Department ofStatistical Science at Duke
Volodymyrk
BMA using BMS (R) package
Model Selection L2 Regularisation BMA
MSE 9736.49 7782.21 7329.44
It worked!
But you can find inputs into data generator script that will not work as well!
Volodymyrk
Nice things you get from BMA
Posterior Inclusion Probability!
How cool is that!
Volodymyrk
Model ranking!
MCMC can beused, if number of
features is large
Best model, according toBMA
Volodymyrk
Can we use it for more complex models?
normalising term
that you can ignore
http://www.ssc.wisc.edu/~bhansen/718/NonParametrics15.pdf
http://www.ejwagenmakers.com/2004/aic.pdf
Warning:Very questionable math.
Does not work
Volodymyrk
Can we use BMA to combine complex (incl. hierarchical) models?
1
3
2
Model order is somewhat similar. Relative probabilities are not.
We need working Reverse-Jump MCMC or something more sophisticated.
Not available in common bayesian MCMC packages yet.
Volodymyrk
In Summary
- BMA is a Bayesian version of ML Model Ensembles
- Math behind is quite beautiful
- Model Averaging is useful for interpretation, not only prediction
- Invest in synthetic data generation,
- before applying new modelling techniques to real-world data
- Even if you are not using BMA, fit different models
- And combine them, if your goal is prediction
- BMA works very well for common GLMs, but does not work yet for arbitrary
models
- Do try it next time you need to fit OLS, though!
Volodymyrk
Of course we are hiring!
● (Snr, Mid) Data Scientists
● Solutions Architect
● Ruby Developer
● Data Engineer
● Senior Artist
● Technical Artist
● Unity Developers
● Senior Product Manager
● Product Director
http://jobs.productmadness.com/

Weitere ähnliche Inhalte

Was ist angesagt?

Feature engineering mean encodings
Feature engineering   mean encodingsFeature engineering   mean encodings
Feature engineering mean encodingsChode Amarnath
 
Substructure Search Face-off
Substructure Search Face-offSubstructure Search Face-off
Substructure Search Face-offNextMove Software
 
Explainability for Natural Language Processing
Explainability for Natural Language ProcessingExplainability for Natural Language Processing
Explainability for Natural Language ProcessingYunyao Li
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysisVishwas N
 
Causal inference in practice
Causal inference in practiceCausal inference in practice
Causal inference in practiceAmit Sharma
 
Opinion Dynamics on Generalized Networks
Opinion Dynamics on Generalized NetworksOpinion Dynamics on Generalized Networks
Opinion Dynamics on Generalized NetworksMason Porter
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Md. Main Uddin Rony
 
Machine Learning Unit 3 Semester 3 MSc IT Part 2 Mumbai University
Machine Learning Unit 3 Semester 3  MSc IT Part 2 Mumbai UniversityMachine Learning Unit 3 Semester 3  MSc IT Part 2 Mumbai University
Machine Learning Unit 3 Semester 3 MSc IT Part 2 Mumbai UniversityMadhav Mishra
 
Machine Learning Interpretability
Machine Learning InterpretabilityMachine Learning Interpretability
Machine Learning Interpretabilityinovex GmbH
 
07 dimensionality reduction
07 dimensionality reduction07 dimensionality reduction
07 dimensionality reductionMarco Quartulli
 
Reactive Programming with Cassandra
Reactive Programming with CassandraReactive Programming with Cassandra
Reactive Programming with CassandraCédrick Lunven
 
Image Segmentation: Approaches and Challenges
Image Segmentation: Approaches and ChallengesImage Segmentation: Approaches and Challenges
Image Segmentation: Approaches and ChallengesApache MXNet
 
Neural Collaborative Filtering Explanation & Implementation
Neural Collaborative Filtering Explanation & ImplementationNeural Collaborative Filtering Explanation & Implementation
Neural Collaborative Filtering Explanation & ImplementationKung-hsiang (Steeve) Huang
 
Model selection and cross validation techniques
Model selection and cross validation techniquesModel selection and cross validation techniques
Model selection and cross validation techniquesVenkata Reddy Konasani
 
Neural collaborative filtering-발표
Neural collaborative filtering-발표Neural collaborative filtering-발표
Neural collaborative filtering-발표hyunsung lee
 
Bpr bayesian personalized ranking from implicit feedback
Bpr bayesian personalized ranking from implicit feedbackBpr bayesian personalized ranking from implicit feedback
Bpr bayesian personalized ranking from implicit feedbackPark JunPyo
 
Machine Learning and Data Mining: 16 Classifiers Ensembles
Machine Learning and Data Mining: 16 Classifiers EnsemblesMachine Learning and Data Mining: 16 Classifiers Ensembles
Machine Learning and Data Mining: 16 Classifiers EnsemblesPier Luca Lanzi
 

Was ist angesagt? (20)

Feature engineering mean encodings
Feature engineering   mean encodingsFeature engineering   mean encodings
Feature engineering mean encodings
 
Substructure Search Face-off
Substructure Search Face-offSubstructure Search Face-off
Substructure Search Face-off
 
Explainability for Natural Language Processing
Explainability for Natural Language ProcessingExplainability for Natural Language Processing
Explainability for Natural Language Processing
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysis
 
Causal inference in practice
Causal inference in practiceCausal inference in practice
Causal inference in practice
 
Opinion Dynamics on Generalized Networks
Opinion Dynamics on Generalized NetworksOpinion Dynamics on Generalized Networks
Opinion Dynamics on Generalized Networks
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
 
Vertica
VerticaVertica
Vertica
 
Machine Learning Unit 3 Semester 3 MSc IT Part 2 Mumbai University
Machine Learning Unit 3 Semester 3  MSc IT Part 2 Mumbai UniversityMachine Learning Unit 3 Semester 3  MSc IT Part 2 Mumbai University
Machine Learning Unit 3 Semester 3 MSc IT Part 2 Mumbai University
 
Machine Learning Interpretability
Machine Learning InterpretabilityMachine Learning Interpretability
Machine Learning Interpretability
 
07 dimensionality reduction
07 dimensionality reduction07 dimensionality reduction
07 dimensionality reduction
 
Reactive Programming with Cassandra
Reactive Programming with CassandraReactive Programming with Cassandra
Reactive Programming with Cassandra
 
NLP_KASHK:N-Grams
NLP_KASHK:N-GramsNLP_KASHK:N-Grams
NLP_KASHK:N-Grams
 
Topic Modeling
Topic ModelingTopic Modeling
Topic Modeling
 
Image Segmentation: Approaches and Challenges
Image Segmentation: Approaches and ChallengesImage Segmentation: Approaches and Challenges
Image Segmentation: Approaches and Challenges
 
Neural Collaborative Filtering Explanation & Implementation
Neural Collaborative Filtering Explanation & ImplementationNeural Collaborative Filtering Explanation & Implementation
Neural Collaborative Filtering Explanation & Implementation
 
Model selection and cross validation techniques
Model selection and cross validation techniquesModel selection and cross validation techniques
Model selection and cross validation techniques
 
Neural collaborative filtering-발표
Neural collaborative filtering-발표Neural collaborative filtering-발표
Neural collaborative filtering-발표
 
Bpr bayesian personalized ranking from implicit feedback
Bpr bayesian personalized ranking from implicit feedbackBpr bayesian personalized ranking from implicit feedback
Bpr bayesian personalized ranking from implicit feedback
 
Machine Learning and Data Mining: 16 Classifiers Ensembles
Machine Learning and Data Mining: 16 Classifiers EnsemblesMachine Learning and Data Mining: 16 Classifiers Ensembles
Machine Learning and Data Mining: 16 Classifiers Ensembles
 

Andere mochten auch

Customer segmentation - Games Analytics and Business Intelligence, Sep 2015
Customer segmentation - Games Analytics and Business Intelligence, Sep 2015Customer segmentation - Games Analytics and Business Intelligence, Sep 2015
Customer segmentation - Games Analytics and Business Intelligence, Sep 2015Volodymyr Kazantsev
 
Games Analytics and players segmentation
Games Analytics and players segmentationGames Analytics and players segmentation
Games Analytics and players segmentationmichele de meo
 
How to conclude online experiments in python
How to conclude online experiments in pythonHow to conclude online experiments in python
How to conclude online experiments in pythonVolodymyr Kazantsev
 
Agile data science
Agile data scienceAgile data science
Agile data scienceJoel Horwitz
 
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...Thoughtworks
 
Clean code in Jupyter notebooks
Clean code in Jupyter notebooksClean code in Jupyter notebooks
Clean code in Jupyter notebooksKaterina Nerush
 
Churn prediction in mobile social games towards a complete assessment using ...
Churn prediction in mobile social games  towards a complete assessment using ...Churn prediction in mobile social games  towards a complete assessment using ...
Churn prediction in mobile social games towards a complete assessment using ...Alain Saas
 
Soft Launch Strategies for Mobile App Companies
Soft Launch Strategies for MobileApp CompaniesSoft Launch Strategies for MobileApp Companies
Soft Launch Strategies for Mobile App CompaniesFiksu
 
Scientific Computing with Python Webinar 9/18/2009:Curve Fitting
Scientific Computing with Python Webinar 9/18/2009:Curve FittingScientific Computing with Python Webinar 9/18/2009:Curve Fitting
Scientific Computing with Python Webinar 9/18/2009:Curve FittingEnthought, Inc.
 
Introduction to Bayesian Statistics
Introduction to Bayesian StatisticsIntroduction to Bayesian Statistics
Introduction to Bayesian StatisticsPhilipp Singer
 
KPIs for Mobile Game Soft Launch
KPIs for Mobile Game Soft LaunchKPIs for Mobile Game Soft Launch
KPIs for Mobile Game Soft LaunchZENNA
 
Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.
Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.
Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.Venveo
 
Bayes theorem explained
Bayes theorem explainedBayes theorem explained
Bayes theorem explainedDaniel Ross
 
Agile Data Science by Russell Jurney_ The Hive_Janruary 29 2014
Agile Data Science by Russell Jurney_ The Hive_Janruary 29 2014Agile Data Science by Russell Jurney_ The Hive_Janruary 29 2014
Agile Data Science by Russell Jurney_ The Hive_Janruary 29 2014The Hive
 

Andere mochten auch (20)

Clean Code in Jupyter notebook
Clean Code in Jupyter notebookClean Code in Jupyter notebook
Clean Code in Jupyter notebook
 
Customer segmentation - Games Analytics and Business Intelligence, Sep 2015
Customer segmentation - Games Analytics and Business Intelligence, Sep 2015Customer segmentation - Games Analytics and Business Intelligence, Sep 2015
Customer segmentation - Games Analytics and Business Intelligence, Sep 2015
 
Agile Data Science
Agile Data ScienceAgile Data Science
Agile Data Science
 
Games Analytics and players segmentation
Games Analytics and players segmentationGames Analytics and players segmentation
Games Analytics and players segmentation
 
How to conclude online experiments in python
How to conclude online experiments in pythonHow to conclude online experiments in python
How to conclude online experiments in python
 
Agile data science
Agile data scienceAgile data science
Agile data science
 
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
 
Agile data visualisation
Agile data visualisationAgile data visualisation
Agile data visualisation
 
RFM Segmentation
RFM SegmentationRFM Segmentation
RFM Segmentation
 
Clean code in Jupyter notebooks
Clean code in Jupyter notebooksClean code in Jupyter notebooks
Clean code in Jupyter notebooks
 
Churn prediction in mobile social games towards a complete assessment using ...
Churn prediction in mobile social games  towards a complete assessment using ...Churn prediction in mobile social games  towards a complete assessment using ...
Churn prediction in mobile social games towards a complete assessment using ...
 
NumPy/SciPy Statistics
NumPy/SciPy StatisticsNumPy/SciPy Statistics
NumPy/SciPy Statistics
 
Soft Launch Strategies for Mobile App Companies
Soft Launch Strategies for MobileApp CompaniesSoft Launch Strategies for MobileApp Companies
Soft Launch Strategies for Mobile App Companies
 
4Front Game Data Science
4Front Game Data Science4Front Game Data Science
4Front Game Data Science
 
Scientific Computing with Python Webinar 9/18/2009:Curve Fitting
Scientific Computing with Python Webinar 9/18/2009:Curve FittingScientific Computing with Python Webinar 9/18/2009:Curve Fitting
Scientific Computing with Python Webinar 9/18/2009:Curve Fitting
 
Introduction to Bayesian Statistics
Introduction to Bayesian StatisticsIntroduction to Bayesian Statistics
Introduction to Bayesian Statistics
 
KPIs for Mobile Game Soft Launch
KPIs for Mobile Game Soft LaunchKPIs for Mobile Game Soft Launch
KPIs for Mobile Game Soft Launch
 
Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.
Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.
Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.
 
Bayes theorem explained
Bayes theorem explainedBayes theorem explained
Bayes theorem explained
 
Agile Data Science by Russell Jurney_ The Hive_Janruary 29 2014
Agile Data Science by Russell Jurney_ The Hive_Janruary 29 2014Agile Data Science by Russell Jurney_ The Hive_Janruary 29 2014
Agile Data Science by Russell Jurney_ The Hive_Janruary 29 2014
 

Ähnlich wie Bayesian model averaging

Can Machine Learning Models be Trusted? Explaining Decisions of ML Models
Can Machine Learning Models be Trusted? Explaining Decisions of ML ModelsCan Machine Learning Models be Trusted? Explaining Decisions of ML Models
Can Machine Learning Models be Trusted? Explaining Decisions of ML ModelsDarek Smyk
 
From Prototyping to Deployment at Scale with R and sparklyr with Kevin Kuo
From Prototyping to Deployment at Scale with R and sparklyr with Kevin KuoFrom Prototyping to Deployment at Scale with R and sparklyr with Kevin Kuo
From Prototyping to Deployment at Scale with R and sparklyr with Kevin KuoDatabricks
 
Machine Learning on IBM Watson Studio
Machine Learning on IBM Watson StudioMachine Learning on IBM Watson Studio
Machine Learning on IBM Watson StudioUpkar Lidder
 
Customer choice probabilities
Customer choice probabilitiesCustomer choice probabilities
Customer choice probabilitiesAllan D. Butler
 
Converter Simulation - Beyond the Evaluation Board
Converter Simulation - Beyond the Evaluation BoardConverter Simulation - Beyond the Evaluation Board
Converter Simulation - Beyond the Evaluation BoardAnalog Devices, Inc.
 
[OFW 14] Prediction of Flow Characteristics by Applying Machine Learning of S...
[OFW 14] Prediction of Flow Characteristics by Applying Machine Learning of S...[OFW 14] Prediction of Flow Characteristics by Applying Machine Learning of S...
[OFW 14] Prediction of Flow Characteristics by Applying Machine Learning of S...Geon-Hong Kim
 
Tensors Are All You Need: Faster Inference with Hummingbird
Tensors Are All You Need: Faster Inference with HummingbirdTensors Are All You Need: Faster Inference with Hummingbird
Tensors Are All You Need: Faster Inference with HummingbirdDatabricks
 
“Introduction to Simultaneous Localization and Mapping (SLAM),” a Presentatio...
“Introduction to Simultaneous Localization and Mapping (SLAM),” a Presentatio...“Introduction to Simultaneous Localization and Mapping (SLAM),” a Presentatio...
“Introduction to Simultaneous Localization and Mapping (SLAM),” a Presentatio...Edge AI and Vision Alliance
 
Managing cost models, baseline budgets and actual costs to deliver successful...
Managing cost models, baseline budgets and actual costs to deliver successful...Managing cost models, baseline budgets and actual costs to deliver successful...
Managing cost models, baseline budgets and actual costs to deliver successful...Association for Project Management
 
joyglobalpresentationsiemenstrifectamar2016-160429150056
joyglobalpresentationsiemenstrifectamar2016-160429150056joyglobalpresentationsiemenstrifectamar2016-160429150056
joyglobalpresentationsiemenstrifectamar2016-160429150056Darren Simoni
 
Data Con LA 2022 - AutoDC + AutoML = your AI development superpower
Data Con LA 2022 - AutoDC + AutoML = your AI development superpowerData Con LA 2022 - AutoDC + AutoML = your AI development superpower
Data Con LA 2022 - AutoDC + AutoML = your AI development superpowerData Con LA
 
11.1. PPT on How to crack ML Competitions all steps explained.pptx
11.1. PPT on How to crack ML Competitions all steps explained.pptx11.1. PPT on How to crack ML Competitions all steps explained.pptx
11.1. PPT on How to crack ML Competitions all steps explained.pptxhu153574
 
Realtime pothole detection system using improved CNN Models
Realtime pothole detection system using improved CNN ModelsRealtime pothole detection system using improved CNN Models
Realtime pothole detection system using improved CNN Modelsnithinsai2992
 
Wireless Project Using CAPsMAN
Wireless Project Using CAPsMANWireless Project Using CAPsMAN
Wireless Project Using CAPsMANAidan Venn MSc
 
A Systematic Approach to Creating Behavioral Models (CDNLive Slides)
A Systematic Approach to Creating Behavioral Models (CDNLive Slides)A Systematic Approach to Creating Behavioral Models (CDNLive Slides)
A Systematic Approach to Creating Behavioral Models (CDNLive Slides)Robert O. Peruzzi, PhD, PE, DFE
 
Visualizing, Analyzing and Optimizing Automotive Architecture Models using Si...
Visualizing, Analyzing and Optimizing Automotive Architecture Models using Si...Visualizing, Analyzing and Optimizing Automotive Architecture Models using Si...
Visualizing, Analyzing and Optimizing Automotive Architecture Models using Si...Obeo
 
Human-Centered Interpretable Machine Learning
Human-Centered Interpretable  Machine LearningHuman-Centered Interpretable  Machine Learning
Human-Centered Interpretable Machine LearningPrzemek Biecek
 

Ähnlich wie Bayesian model averaging (20)

Can Machine Learning Models be Trusted? Explaining Decisions of ML Models
Can Machine Learning Models be Trusted? Explaining Decisions of ML ModelsCan Machine Learning Models be Trusted? Explaining Decisions of ML Models
Can Machine Learning Models be Trusted? Explaining Decisions of ML Models
 
From Prototyping to Deployment at Scale with R and sparklyr with Kevin Kuo
From Prototyping to Deployment at Scale with R and sparklyr with Kevin KuoFrom Prototyping to Deployment at Scale with R and sparklyr with Kevin Kuo
From Prototyping to Deployment at Scale with R and sparklyr with Kevin Kuo
 
Machine Learning on IBM Watson Studio
Machine Learning on IBM Watson StudioMachine Learning on IBM Watson Studio
Machine Learning on IBM Watson Studio
 
Customer choice probabilities
Customer choice probabilitiesCustomer choice probabilities
Customer choice probabilities
 
Simulations Partners.pdf
Simulations Partners.pdfSimulations Partners.pdf
Simulations Partners.pdf
 
Converter Simulation - Beyond the Evaluation Board
Converter Simulation - Beyond the Evaluation BoardConverter Simulation - Beyond the Evaluation Board
Converter Simulation - Beyond the Evaluation Board
 
[OFW 14] Prediction of Flow Characteristics by Applying Machine Learning of S...
[OFW 14] Prediction of Flow Characteristics by Applying Machine Learning of S...[OFW 14] Prediction of Flow Characteristics by Applying Machine Learning of S...
[OFW 14] Prediction of Flow Characteristics by Applying Machine Learning of S...
 
Tensors Are All You Need: Faster Inference with Hummingbird
Tensors Are All You Need: Faster Inference with HummingbirdTensors Are All You Need: Faster Inference with Hummingbird
Tensors Are All You Need: Faster Inference with Hummingbird
 
“Introduction to Simultaneous Localization and Mapping (SLAM),” a Presentatio...
“Introduction to Simultaneous Localization and Mapping (SLAM),” a Presentatio...“Introduction to Simultaneous Localization and Mapping (SLAM),” a Presentatio...
“Introduction to Simultaneous Localization and Mapping (SLAM),” a Presentatio...
 
Report
ReportReport
Report
 
Managing cost models, baseline budgets and actual costs to deliver successful...
Managing cost models, baseline budgets and actual costs to deliver successful...Managing cost models, baseline budgets and actual costs to deliver successful...
Managing cost models, baseline budgets and actual costs to deliver successful...
 
joyglobalpresentationsiemenstrifectamar2016-160429150056
joyglobalpresentationsiemenstrifectamar2016-160429150056joyglobalpresentationsiemenstrifectamar2016-160429150056
joyglobalpresentationsiemenstrifectamar2016-160429150056
 
Data Con LA 2022 - AutoDC + AutoML = your AI development superpower
Data Con LA 2022 - AutoDC + AutoML = your AI development superpowerData Con LA 2022 - AutoDC + AutoML = your AI development superpower
Data Con LA 2022 - AutoDC + AutoML = your AI development superpower
 
11.1. PPT on How to crack ML Competitions all steps explained.pptx
11.1. PPT on How to crack ML Competitions all steps explained.pptx11.1. PPT on How to crack ML Competitions all steps explained.pptx
11.1. PPT on How to crack ML Competitions all steps explained.pptx
 
Realtime pothole detection system using improved CNN Models
Realtime pothole detection system using improved CNN ModelsRealtime pothole detection system using improved CNN Models
Realtime pothole detection system using improved CNN Models
 
Wireless Project Using CAPsMAN
Wireless Project Using CAPsMANWireless Project Using CAPsMAN
Wireless Project Using CAPsMAN
 
A Systematic Approach to Creating Behavioral Models (CDNLive Slides)
A Systematic Approach to Creating Behavioral Models (CDNLive Slides)A Systematic Approach to Creating Behavioral Models (CDNLive Slides)
A Systematic Approach to Creating Behavioral Models (CDNLive Slides)
 
Visualizing, Analyzing and Optimizing Automotive Architecture Models using Si...
Visualizing, Analyzing and Optimizing Automotive Architecture Models using Si...Visualizing, Analyzing and Optimizing Automotive Architecture Models using Si...
Visualizing, Analyzing and Optimizing Automotive Architecture Models using Si...
 
Human-Centered Interpretable Machine Learning
Human-Centered Interpretable  Machine LearningHuman-Centered Interpretable  Machine Learning
Human-Centered Interpretable Machine Learning
 
Software-Development.pptx
Software-Development.pptxSoftware-Development.pptx
Software-Development.pptx
 

Kürzlich hochgeladen

❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxRizalinePalanog2
 
IDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicineIDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicinesherlingomez2
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET
 
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONSTS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONrouseeyyy
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY1301aanya
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Mohammad Khajehpour
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learninglevieagacer
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...ssuser79fe74
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...chandars293
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxBhagirath Gogikar
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 

Kürzlich hochgeladen (20)

❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
IDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicineIDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicine
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
 
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONSTS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 

Bayesian model averaging

  • 1. Volodymyrk Bayesian Model Averaging Bayesian Mixer, 27.09.2016 London, UK
  • 2. Volodymyrk Bayesian Model Averaging (BMA) - 1 minute version New Project - how much does it worth? CFO VP of Growth Net Present Value: $50m $100m Model M1 Model M2 30%CEO belief: after evaluating both models and market data 70% $15m + $70m = $85m K = 2
  • 3. Volodymyrk Bayesian Model Averaging (BMA) - 3 minute version VP of Growth CLV assumptions $10 $12 $15 CAC $4 72 129 149 $6 62 112 133 $8 51 92 101 Average= $100.11m Sensitivity Analysis for M2 DATA
  • 4. Volodymyrk Bayesian Model Averaging (BMA) - 5 minute version Bayesian Model Averaging: A Tutorial Jennifer A. Hoeting, David Madigan, Adrian E. Raftery and Chris T. Volinsky How much do you trust your VP and CFO, before you look at models? Scary normalising term that you can ignore Prior probability for model parameter
  • 5. Volodymyrk Bayesian answer to overfitting Frequentist: - model selection - regularisation Bayesian: - BMA - marginalisation
  • 6. Volodymyrk Case Study You just get the best job in the galaxy
  • 7. Volodymyrk Your new Boss Business domain Modelling case Always test your models on synthetic data that you understand and control
  • 8. Volodymyrk Use Cases: - Fraud Detection - Inventory Sourcing Data
  • 9. Volodymyrk Modelling goals - Prediction range is needed, so that you can identify fraudulent transactions (sand people under-reporting real transaction size and pocketing profit) - Sale price should be easily explainable, as a function of various Droid Features so that Jabba can invest in appropriate scavenging/sourcing projects - You want lowest prediction error possible so that you are not feeded to Sarlacc
  • 12. Volodymyrk Model Selection - classical method credits ~ height + weight + power + dents + rad + wheels + legs + red + blue + black + temperature + lat + long + ir_emit + dents_log + height_log + weight_log + power_log + rad_log Adj. R2: 0.884974385182
  • 13. Volodymyrk Model Selection - backward elimination
  • 14. Volodymyrk Final Model credits ~ weight + power + dents + rad + wheels + blue + black + temperature + lat + dents_log + height_log + weight_log + power_log Adj. R2: 0.903544333611
  • 17. Volodymyrk Bayesian Model Averaging for Linear Models - a special case Inclusion probability for (regression coefficients) are weighted across all possible models Number of models = combinations of all K features (include/exclude) = 2K
  • 18. Volodymyrk How to actually do BMA? (in R) cran.r-project.org/web/packages/BMA cran.r-project.org/web/packages/BAScran.r-project.org/web/packages/BMS Mature. A.k.a. “the original” Developed by PhD duringresearch. Not maintained Newest. Maintained by Chair of the Department ofStatistical Science at Duke
  • 19. Volodymyrk BMA using BMS (R) package Model Selection L2 Regularisation BMA MSE 9736.49 7782.21 7329.44 It worked! But you can find inputs into data generator script that will not work as well!
  • 20. Volodymyrk Nice things you get from BMA Posterior Inclusion Probability! How cool is that!
  • 21. Volodymyrk Model ranking! MCMC can beused, if number of features is large Best model, according toBMA
  • 22. Volodymyrk Can we use it for more complex models? normalising term that you can ignore http://www.ssc.wisc.edu/~bhansen/718/NonParametrics15.pdf http://www.ejwagenmakers.com/2004/aic.pdf Warning:Very questionable math. Does not work
  • 23. Volodymyrk Can we use BMA to combine complex (incl. hierarchical) models? 1 3 2 Model order is somewhat similar. Relative probabilities are not. We need working Reverse-Jump MCMC or something more sophisticated. Not available in common bayesian MCMC packages yet.
  • 24. Volodymyrk In Summary - BMA is a Bayesian version of ML Model Ensembles - Math behind is quite beautiful - Model Averaging is useful for interpretation, not only prediction - Invest in synthetic data generation, - before applying new modelling techniques to real-world data - Even if you are not using BMA, fit different models - And combine them, if your goal is prediction - BMA works very well for common GLMs, but does not work yet for arbitrary models - Do try it next time you need to fit OLS, though!
  • 25. Volodymyrk Of course we are hiring! ● (Snr, Mid) Data Scientists ● Solutions Architect ● Ruby Developer ● Data Engineer ● Senior Artist ● Technical Artist ● Unity Developers ● Senior Product Manager ● Product Director http://jobs.productmadness.com/