SlideShare ist ein Scribd-Unternehmen logo
1 von 15
Multinomial Logistic Regression Models




January 1, 2013     ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
 Logistic regression CAN handle dependant variables
   with more than two categories
  It is important to note whether the response variable
   is ordinal (consisting of ordered categories like young,
   middle-aged, old) or nominal (dependant is unordered
   like red, blue, black)
  Some multinomial logistic models are appropriate only
   for ordered response
  It is not mathematically necessary to consider the
   natural ordering when modeling ordinal response but,
  Considering the natural ordering
       Leads to a more parsimonious model
       Increase power to detect relationships with other variables

January 1, 2013    ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
 Applying logistic regression considering the natural
   order is done using a modeling technique called the
   “Proportional Odds Model”
  Say the dependant variable Y has 4 states measuring
   the impact of radiation on the human body; fine,
   sick, serious,dead
  Let p1=prob of fine, p2=prob of sick, p3=prob of
   serious, p4=prob of dead
  Let us define a baseline category: fine, since this is
   the normal stage (we shall see why we need this
   later)
January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    What if we break up the modeling of the 4 level
      ordered dependant into 3 binary logistic
      situations: 1 – (fine,sick), 2 – (fine,serious),3 –
      (fine,dead)?
     Then we would have 3 logit equations:
       Log(p2/p1)=B11+B12X1+B13X2
       Log(p3/p1)=B21+B22X1+B23X2
       Log(p4/p1)=B31+B32X1+B33X2
      X is the degree of radiation dummy with 3 levels so
        broken into 2 binary dummies
     So, 9 parameters to be estmated

January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Now consider an alternative model for the same
      situation
     Cumulative logit model:
       L1=log(p1/p2+p3+p4)
       L2=log(p1+p2/p3+p4)
       L3=log(p1+p2+p3/p4)
     The obvious way to introduce covariates is
       L1=B11+B12X1+B13X2
       L2=B21+B22X1+B23X2
       L3=B31+B32X1+B33X2

January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Let us simplyfy the model by specifying that
      the slope parameters are identical over the
      logit equations. Then,
       L1=A1+B1X1+B2X2
       L2=A2+B1X1+B2X2
       L3=A3+B1X1+B2X2
     This is the proportional odds cumulative logit
      model

January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
 Suppose that the categorical outcome is actually a
   categorized version of an unobservable (latent)
   continuous variable which has a logistic distribution
  The continuous scale is divided into five regions by
   four cut-points c1, c2, c3, c4 which are determined by
   nature
  If Z ≤ c1 we observe Y = 1; if c1 < Z ≤ c2 we observe Y =
   2; and so on
  Suppose that the Z is related to the X’s through a linear
   regression
  Then, the coarsened categorical variable would be
   related Y will be related to the X’s by a proportional-
   odds cumulative logit model
January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Let us go back to the model
       L1=A1+B1X1+B2X2
       L2=A2+B1X1+B2X2
       L3=A3+B1X1+B2X2
     Note that Lj is the log-odds of falling into or below category j
      versus falling above it
     Aj is the log-odds of falling into or below category j when X1 =
      X2 = 0
     B1 is the increase in log-odds of falling into or below any
      category associated with a one-unit increase in Xk, holding all
      the other X-variables constant.
     Therefore, a positive slope indicates a tendency for the
      response level to decrease as the variable decreases
January 1, 2013     ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Our example of 4 levels of impact of radiation
      corresponding to 3 levels of radiation

 proc logistic data=radiation_impact;
 freq count;
 class radiation / order=data param=ref ref=first;
 model sickness (order=data descending) = radiation /
   link=logit
 aggregate=(radiation) scale=none;
 run;
January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Freq=count
       This is important for specifying grouped data
       Count is the variable that contains the frequency of
        occurrance of each observation
       In its absence, each row would be considered a
        separate row of data
     Class=radiation
       Specifies that radiation is a classification variable to
        be used in the analysis
       SAS would automatically generate n-1 binary
        dummies for n categories of radiation with param=ref
        option
January 1, 2013    ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Order=data
       Simply tells SAS to arrange the response categories in
          the order they occur in the input data 1,2,3,4
     Param=ref
       This implies that there is going to be dummy coding
          for the classification variable ‘radiation’listed in class
     Ref=first
       Designates the first ordered level, in this case ‘fine’ as
          the reference level

January 1, 2013      ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Order=data descending
         This tells SAS to reverse the order of the logits
         So, instead of the cumulative logit model being
         L1=log(p1/p2+p3+p4)
         L2=log(p1+p2/p3+p4)
         L3=log(p1+p2+p3/p4), it becomes
         L1=log(p4/p1+p2+p3)
         L2=log(p4+p3/p1+p2)
         L3=log(p4+p3+p2/p1)
     Now, a positive B1 indicates that a higher value of X1
      leads to greater chance of radiation sickness
January 1, 2013      ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Link=logit
       fits the cumulative logit model when there are more
          than two response categories
     Aggregate=radiation
       Indicates that the goodness of fit statistics are to be
          calculated on the subpopulations of the variable:
          radiation
     Scale=none
       No correction is need for the dispersion parameter
       To understand this, read up. This happens when the
          goodness of fit statistic exceeds its degrees of freedom
          and need to be corrected for
January 1, 2013      ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    When we fit this model, the first output we
      see:
 Score Test for the Proportional Odds Assumption
 Chi-Square DF         Pr > ChiSq
 17.2866        21     0.6936

  Null hypothesis is that the current proportional-odds
   cumulative logit model is true
  Seems like we fail to reject the null and so can proceed to the
   rest of the output under the current assumption


January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Ultimately we are interested in the predicted
      probabilities
          OUTPUT <OUT=SAS-data-set><options>
     Predicted=
       For a cumulative model, it is the predicted cumulative
          probability (that is, the probability that the response
          variable is less than or equal to the value of _LEVEL_);
     PREDPROBS=I or C
       Individual|I requests the predicted probability of each
        response level.
       CUMULATIVE | C requests the cumulative predicted
        probability of each response level
January 1, 2013      ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India

Weitere ähnliche Inhalte

Was ist angesagt?

A review of automatic differentiationand its efficient implementation
A review of automatic differentiationand its efficient implementationA review of automatic differentiationand its efficient implementation
A review of automatic differentiationand its efficient implementation
ssuserfa7e73
 
Linear Programming 1
Linear Programming 1Linear Programming 1
Linear Programming 1
irsa javed
 
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
Navodaya Institute of Technology
 

Was ist angesagt? (19)

ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...
ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...
ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...
 
Mm2521542158
Mm2521542158Mm2521542158
Mm2521542158
 
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
 
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
 
A review of automatic differentiationand its efficient implementation
A review of automatic differentiationand its efficient implementationA review of automatic differentiationand its efficient implementation
A review of automatic differentiationand its efficient implementation
 
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMSA NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
 
Linear programming
Linear programmingLinear programming
Linear programming
 
Cg32519523
Cg32519523Cg32519523
Cg32519523
 
Chapter two
Chapter twoChapter two
Chapter two
 
Fractional calculus and applications
Fractional calculus and applicationsFractional calculus and applications
Fractional calculus and applications
 
An Approach to Mathematically Establish the Practical Use of Assignment Probl...
An Approach to Mathematically Establish the Practical Use of Assignment Probl...An Approach to Mathematically Establish the Practical Use of Assignment Probl...
An Approach to Mathematically Establish the Practical Use of Assignment Probl...
 
Linear Programming 1
Linear Programming 1Linear Programming 1
Linear Programming 1
 
Heptagonal Fuzzy Numbers by Max Min Method
Heptagonal Fuzzy Numbers by Max Min MethodHeptagonal Fuzzy Numbers by Max Min Method
Heptagonal Fuzzy Numbers by Max Min Method
 
Linear programming class 12 investigatory project
Linear programming class 12 investigatory projectLinear programming class 12 investigatory project
Linear programming class 12 investigatory project
 
A new approach for ranking of intuitionistic fuzzy numbers
A new approach for ranking of intuitionistic fuzzy numbersA new approach for ranking of intuitionistic fuzzy numbers
A new approach for ranking of intuitionistic fuzzy numbers
 
linear programming
linear programming linear programming
linear programming
 
Certified global minima
Certified global minimaCertified global minima
Certified global minima
 
Dr azimifar pattern recognition lect2
Dr azimifar pattern recognition lect2Dr azimifar pattern recognition lect2
Dr azimifar pattern recognition lect2
 
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
 

Andere mochten auch

Financialmodeling
FinancialmodelingFinancialmodeling
Financialmodeling
Talal Tahir
 
1.5.1 measures basic concepts
1.5.1 measures basic concepts1.5.1 measures basic concepts
1.5.1 measures basic concepts
A M
 
(마더세이프 라운드) Logistic regression
(마더세이프 라운드) Logistic regression(마더세이프 라운드) Logistic regression
(마더세이프 라운드) Logistic regression
mothersafe
 
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
mohamedchaouche
 

Andere mochten auch (20)

Boosted Tree-based Multinomial Logit Model for Aggregated Market Data
Boosted Tree-based Multinomial Logit Model for Aggregated Market DataBoosted Tree-based Multinomial Logit Model for Aggregated Market Data
Boosted Tree-based Multinomial Logit Model for Aggregated Market Data
 
The power of RapidMiner, showing the direct marketing demo
The power of RapidMiner, showing the direct marketing demoThe power of RapidMiner, showing the direct marketing demo
The power of RapidMiner, showing the direct marketing demo
 
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARNHadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
 
Statisticsfor businessproject solution
Statisticsfor businessproject solutionStatisticsfor businessproject solution
Statisticsfor businessproject solution
 
Financialmodeling
FinancialmodelingFinancialmodeling
Financialmodeling
 
Chapter 16
Chapter 16Chapter 16
Chapter 16
 
Midterm
MidtermMidterm
Midterm
 
Ordinal Logistic Regression
Ordinal Logistic RegressionOrdinal Logistic Regression
Ordinal Logistic Regression
 
Logistic Regression/Markov Chain presentation
Logistic Regression/Markov Chain presentationLogistic Regression/Markov Chain presentation
Logistic Regression/Markov Chain presentation
 
Qam formulas
Qam formulasQam formulas
Qam formulas
 
Transparency7
Transparency7Transparency7
Transparency7
 
1.5.1 measures basic concepts
1.5.1 measures basic concepts1.5.1 measures basic concepts
1.5.1 measures basic concepts
 
Regression
Regression Regression
Regression
 
(마더세이프 라운드) Logistic regression
(마더세이프 라운드) Logistic regression(마더세이프 라운드) Logistic regression
(마더세이프 라운드) Logistic regression
 
Regression: A skin-deep dive
Regression: A skin-deep diveRegression: A skin-deep dive
Regression: A skin-deep dive
 
ForecastIT 2. Linear Regression & Model Statistics
ForecastIT 2. Linear Regression & Model StatisticsForecastIT 2. Linear Regression & Model Statistics
ForecastIT 2. Linear Regression & Model Statistics
 
Generalized Logistic Regression - by example (Anthony Kilili)
Generalized Logistic Regression - by example (Anthony Kilili)Generalized Logistic Regression - by example (Anthony Kilili)
Generalized Logistic Regression - by example (Anthony Kilili)
 
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
 
C2.1 intro
C2.1 introC2.1 intro
C2.1 intro
 
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
 

Ähnlich wie Logistic regression for ordered dependant variable with more than 2 levels

IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
ijceronline
 

Ähnlich wie Logistic regression for ordered dependant variable with more than 2 levels (20)

How principal components analysis is different from factor
How principal components analysis is different from factorHow principal components analysis is different from factor
How principal components analysis is different from factor
 
Supervised Learning.pdf
Supervised Learning.pdfSupervised Learning.pdf
Supervised Learning.pdf
 
ML MODULE 4.pdf
ML MODULE 4.pdfML MODULE 4.pdf
ML MODULE 4.pdf
 
A Theory of the Learnable; PAC Learning
A Theory of the Learnable; PAC LearningA Theory of the Learnable; PAC Learning
A Theory of the Learnable; PAC Learning
 
GRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHM
GRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHMGRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHM
GRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHM
 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms Comparison
 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms Comparison
 
autocorrelation.pptx
autocorrelation.pptxautocorrelation.pptx
autocorrelation.pptx
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Study the different
Study the differentStudy the different
Study the different
 
Yahya Almalki SOED 2016
Yahya Almalki SOED 2016Yahya Almalki SOED 2016
Yahya Almalki SOED 2016
 
Design and analysis of ra sort
Design and analysis of ra sortDesign and analysis of ra sort
Design and analysis of ra sort
 
working with python
working with pythonworking with python
working with python
 
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
 
2018 p 2019-ee-a2
2018 p 2019-ee-a22018 p 2019-ee-a2
2018 p 2019-ee-a2
 
Ica group 3[1]
Ica group 3[1]Ica group 3[1]
Ica group 3[1]
 
Big Data Analysis
Big Data AnalysisBig Data Analysis
Big Data Analysis
 
ppt0320defenseday
ppt0320defensedayppt0320defenseday
ppt0320defenseday
 
Prediction model of algal blooms using logistic regression and confusion matrix
Prediction model of algal blooms using logistic regression and confusion matrix Prediction model of algal blooms using logistic regression and confusion matrix
Prediction model of algal blooms using logistic regression and confusion matrix
 
A new generalized lindley distribution
A new generalized lindley distributionA new generalized lindley distribution
A new generalized lindley distribution
 

Mehr von Arup Guha (6)

Structural breaks, unit root tests and long time series
Structural breaks, unit root tests and long time seriesStructural breaks, unit root tests and long time series
Structural breaks, unit root tests and long time series
 
Partnership with a Premier Business School
Partnership with a Premier Business SchoolPartnership with a Premier Business School
Partnership with a Premier Business School
 
Beer Market Analytics Solutions
Beer Market Analytics SolutionsBeer Market Analytics Solutions
Beer Market Analytics Solutions
 
Database marketing
Database marketingDatabase marketing
Database marketing
 
Using survival analysis results
Using survival analysis results Using survival analysis results
Using survival analysis results
 
Measuring Actual Effect Of Tv Ads On Sales Lk
Measuring Actual Effect Of Tv Ads On Sales   LkMeasuring Actual Effect Of Tv Ads On Sales   Lk
Measuring Actual Effect Of Tv Ads On Sales Lk
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Kürzlich hochgeladen (20)

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

Logistic regression for ordered dependant variable with more than 2 levels

  • 1. Multinomial Logistic Regression Models January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 2.  Logistic regression CAN handle dependant variables with more than two categories  It is important to note whether the response variable is ordinal (consisting of ordered categories like young, middle-aged, old) or nominal (dependant is unordered like red, blue, black)  Some multinomial logistic models are appropriate only for ordered response  It is not mathematically necessary to consider the natural ordering when modeling ordinal response but,  Considering the natural ordering  Leads to a more parsimonious model  Increase power to detect relationships with other variables January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 3.  Applying logistic regression considering the natural order is done using a modeling technique called the “Proportional Odds Model”  Say the dependant variable Y has 4 states measuring the impact of radiation on the human body; fine, sick, serious,dead  Let p1=prob of fine, p2=prob of sick, p3=prob of serious, p4=prob of dead  Let us define a baseline category: fine, since this is the normal stage (we shall see why we need this later) January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 4. What if we break up the modeling of the 4 level ordered dependant into 3 binary logistic situations: 1 – (fine,sick), 2 – (fine,serious),3 – (fine,dead)?  Then we would have 3 logit equations:  Log(p2/p1)=B11+B12X1+B13X2  Log(p3/p1)=B21+B22X1+B23X2  Log(p4/p1)=B31+B32X1+B33X2 X is the degree of radiation dummy with 3 levels so broken into 2 binary dummies  So, 9 parameters to be estmated January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 5. Now consider an alternative model for the same situation  Cumulative logit model:  L1=log(p1/p2+p3+p4)  L2=log(p1+p2/p3+p4)  L3=log(p1+p2+p3/p4)  The obvious way to introduce covariates is  L1=B11+B12X1+B13X2  L2=B21+B22X1+B23X2  L3=B31+B32X1+B33X2 January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 6. Let us simplyfy the model by specifying that the slope parameters are identical over the logit equations. Then,  L1=A1+B1X1+B2X2  L2=A2+B1X1+B2X2  L3=A3+B1X1+B2X2  This is the proportional odds cumulative logit model January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 7.  Suppose that the categorical outcome is actually a categorized version of an unobservable (latent) continuous variable which has a logistic distribution  The continuous scale is divided into five regions by four cut-points c1, c2, c3, c4 which are determined by nature  If Z ≤ c1 we observe Y = 1; if c1 < Z ≤ c2 we observe Y = 2; and so on  Suppose that the Z is related to the X’s through a linear regression  Then, the coarsened categorical variable would be related Y will be related to the X’s by a proportional- odds cumulative logit model January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 8. Let us go back to the model  L1=A1+B1X1+B2X2  L2=A2+B1X1+B2X2  L3=A3+B1X1+B2X2  Note that Lj is the log-odds of falling into or below category j versus falling above it  Aj is the log-odds of falling into or below category j when X1 = X2 = 0  B1 is the increase in log-odds of falling into or below any category associated with a one-unit increase in Xk, holding all the other X-variables constant.  Therefore, a positive slope indicates a tendency for the response level to decrease as the variable decreases January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 9. Our example of 4 levels of impact of radiation corresponding to 3 levels of radiation proc logistic data=radiation_impact; freq count; class radiation / order=data param=ref ref=first; model sickness (order=data descending) = radiation / link=logit aggregate=(radiation) scale=none; run; January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 10. Freq=count  This is important for specifying grouped data  Count is the variable that contains the frequency of occurrance of each observation  In its absence, each row would be considered a separate row of data  Class=radiation  Specifies that radiation is a classification variable to be used in the analysis  SAS would automatically generate n-1 binary dummies for n categories of radiation with param=ref option January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 11. Order=data  Simply tells SAS to arrange the response categories in the order they occur in the input data 1,2,3,4  Param=ref  This implies that there is going to be dummy coding for the classification variable ‘radiation’listed in class  Ref=first  Designates the first ordered level, in this case ‘fine’ as the reference level January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 12. Order=data descending  This tells SAS to reverse the order of the logits  So, instead of the cumulative logit model being  L1=log(p1/p2+p3+p4)  L2=log(p1+p2/p3+p4)  L3=log(p1+p2+p3/p4), it becomes  L1=log(p4/p1+p2+p3)  L2=log(p4+p3/p1+p2)  L3=log(p4+p3+p2/p1)  Now, a positive B1 indicates that a higher value of X1 leads to greater chance of radiation sickness January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 13. Link=logit  fits the cumulative logit model when there are more than two response categories  Aggregate=radiation  Indicates that the goodness of fit statistics are to be calculated on the subpopulations of the variable: radiation  Scale=none  No correction is need for the dispersion parameter  To understand this, read up. This happens when the goodness of fit statistic exceeds its degrees of freedom and need to be corrected for January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 14. When we fit this model, the first output we see: Score Test for the Proportional Odds Assumption Chi-Square DF Pr > ChiSq 17.2866 21 0.6936  Null hypothesis is that the current proportional-odds cumulative logit model is true  Seems like we fail to reject the null and so can proceed to the rest of the output under the current assumption January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 15. Ultimately we are interested in the predicted probabilities OUTPUT <OUT=SAS-data-set><options>  Predicted=  For a cumulative model, it is the predicted cumulative probability (that is, the probability that the response variable is less than or equal to the value of _LEVEL_);  PREDPROBS=I or C  Individual|I requests the predicted probability of each response level.  CUMULATIVE | C requests the cumulative predicted probability of each response level January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India