Predictive Model Selection in PLS-PM (SCECR 2015)
1. Predictive Model Selection
in PLS Path Modeling
Galit Shmueli, National Tsing Hua University, Taiwan
With:
Pratyush Sharma, U. Delaware
Marko Sarstedt, Otto-von-Guericke University Magdeburg
Kevin H. Kim†
SCECR 2015, Addis Ababa
4. Why Model Selection?
Researchers using structural models are often confident about the overall
model structure, but less certain about the specific paths (arrows)
Model selection is common practice in many fields
5. How to Compare PLS Models?
Suppose...
● theory cannot help, and/or
● all models yield satisfactory results in terms of significant paths
Predictive power!
Choose the model with best ability to predict out of sample.
6. Measuring Predictive Power
Classic predictive approach:
out-of-sample
1. Partition data randomly into
training and holdout
samples
2. Fit model to training data;
evaluate predictive power
by predicting holdout
records (RMSE, MAPE...)
For parametric models:
“information theoretic”
(IT) criteria
● In-sample metrics
● Measure out-of-sample
predictive power by
penalizing in-sample fit
● (Similar to adj-R2)
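The classic out-of-sample approach can be sketched as follows. This is a hypothetical toy example using OLS regression as a simple stand-in for a PLS path model; the data, split sizes, and coefficients are all illustrative assumptions, not from the slides.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy data standing in for a path model's scores:
# y depends on two predictors plus noise (offset keeps y away from zero
# so MAPE stays well defined).
X = rng.normal(size=(200, 2))
y = 5 + X @ np.array([0.5, 0.3]) + rng.normal(scale=0.5, size=200)

# 1. Partition data randomly into training and holdout samples.
idx = rng.permutation(len(y))
train, hold = idx[:150], idx[150:]

# 2. Fit the model to the training data (OLS stand-in for PLS),
#    then evaluate predictive power on the holdout records.
Xc = np.column_stack([np.ones(len(train)), X[train]])
beta, *_ = np.linalg.lstsq(Xc, y[train], rcond=None)
pred = np.column_stack([np.ones(len(hold)), X[hold]]) @ beta

rmse = float(np.sqrt(np.mean((y[hold] - pred) ** 2)))      # root mean squared error
mape = float(np.mean(np.abs((y[hold] - pred) / y[hold])))  # mean abs. percentage error
```

Any out-of-sample loss (RMSE, MAPE, ...) computed this way reflects predictive power on data the model never saw.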
7. Information Theoretic (IT) Model Selection
Criteria: General Form
IT criterion = -2 log likelihood + penalty
penalty = f(sample size, #parameters)
Smaller values = better
Balance data fit (likelihood) against parsimony (penalty)
8. Two Classes of IT Model Selection Criteria
AIC-type criteria:
● AIC = n [log(SSE/n) + 2p/n]
● AICc = n [log(SSE/n) + (n+p)/(n-p-2)]
● AICu = n [log(SSE/(n-p)) + 2p/n]
● Further variants: Final Prediction Error (FPE) and Mallows’ Cp
BIC-type criteria:
● BIC = n [log(SSE/n) + p*log(n)/n]
● HQ = n [log(SSE/n) + 2p*log(log(n))/n]
● HQc = n [log(SSE/n) + 2p*log(log(n))/(n-p-2)]
● Further variant: Geweke-Meese Criterion (GM)
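The regression-form criteria above are direct to compute from the training-sample SSE, the sample size n, and the number of parameters p. A minimal sketch (function names are my own; the formulas transcribe the slide):

```python
import math

def aic(sse: float, n: int, p: int) -> float:
    """AIC = n[log(SSE/n) + 2p/n]"""
    return n * (math.log(sse / n) + 2 * p / n)

def aicc(sse: float, n: int, p: int) -> float:
    """AICc = n[log(SSE/n) + (n+p)/(n-p-2)]"""
    return n * (math.log(sse / n) + (n + p) / (n - p - 2))

def bic(sse: float, n: int, p: int) -> float:
    """BIC = n[log(SSE/n) + p*log(n)/n]"""
    return n * (math.log(sse / n) + p * math.log(n) / n)

def hq(sse: float, n: int, p: int) -> float:
    """HQ = n[log(SSE/n) + 2p*log(log(n))/n]"""
    return n * (math.log(sse / n) + 2 * p * math.log(math.log(n)) / n)
```

For moderate-to-large n, BIC's log(n)-based penalty exceeds AIC's fixed penalty of 2 per parameter, which is why BIC tends to select sparser models.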
9. Advantages of IT Criteria
● Commonly used for model selection in predictive modeling
(with parametric models)
● Asymptotic equivalence to cross-validation
● Useful for small samples: do not require data partitioning
● Their use is well-established in econometrics & statistics
11. Simulation Study
1. Simulate data from a specific PLS model
2. Partition data into a (small) training and a (big) holdout sample
3. Estimate all possible PLS models from the training sample
Establish “best model”
● Use each model to predict the holdout sample
● Compute holdout RMSE for each model
● Lowest RMSE -> best predictive model
Find “best” criterion
● Compute all IT criteria for each model (from training data)
● Which model does each criterion choose?
● Best criterion = agrees with the RMSE choice
● Benchmark criterion: Q2
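The simulation design can be sketched in miniature. This is an illustrative toy, not the study's actual code: OLS subset regression stands in for PLS estimation, the candidate models are all predictor subsets, and BIC represents the IT criteria being benchmarked against the holdout-RMSE choice.

```python
import itertools
import math

import numpy as np

rng = np.random.default_rng(1)

# 1. Simulate data from a known model (OLS stand-in for a PLS model):
#    predictors 0 and 1 matter, predictor 2 is noise.
X = rng.normal(size=(300, 3))
y = 0.6 * X[:, 0] + 0.4 * X[:, 1] + rng.normal(scale=0.5, size=300)

# 2. Partition into a (small) training and a (big) holdout sample.
train, hold = np.arange(60), np.arange(60, 300)

def fit_sse_rmse(cols):
    """Fit one candidate model on training data; return (SSE_train, RMSE_holdout)."""
    Xt, Xh = X[train][:, cols], X[hold][:, cols]
    beta, *_ = np.linalg.lstsq(Xt, y[train], rcond=None)
    sse = float(np.sum((y[train] - Xt @ beta) ** 2))
    rmse = float(np.sqrt(np.mean((y[hold] - Xh @ beta) ** 2)))
    return sse, rmse

def bic(sse, n, p):
    return n * (math.log(sse / n) + p * math.log(n) / n)

# 3. Estimate all candidate models; record each one's training SSE
#    (for the IT criterion) and holdout RMSE (the benchmark).
candidates = [c for r in (1, 2, 3) for c in itertools.combinations(range(3), r)]
results = {c: fit_sse_rmse(list(c)) for c in candidates}

# Best model by holdout RMSE vs. the model the IT criterion picks
# from the training sample alone.
best_by_rmse = min(results, key=lambda c: results[c][1])
best_by_bic = min(results, key=lambda c: bic(results[c][0], len(train), len(c)))
```

A criterion "wins" in the study to the extent that its training-only choice (`best_by_bic` here) matches the holdout-RMSE choice across simulated datasets.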
15. What’s Next
● Get meaningful results!
● More complex models
(e.g., interaction effects, hierarchical component models,
nonlinear relationships)
● Broader set of data constellations (e.g., collinearity)
● Design of IT criteria that take the specifics of the PLS method
into account