SlideShare a Scribd company logo
1 of 27
Download to read offline
Substructural Surrogates for Learning
                   g                g
Decomposable Classification Problems:
  Implementation and First Results


       Albert Orriols Puig1,2              Kumara Sastry2
              Orriols-Puig
       David E. Goldberg2          Ester Bernadó-Mansilla1

               1Research  Group in Intelligent Systems
      Enginyeria i Arquitectura La Salle, Ramon Llull University
               2Illinois
                       Genetic Algorithms Laboratory
                                   g                 y
     Department of Industrial and Enterprise Systems Engineering
             University of Illinois at Urbana Champaign
Motivation


       Nearly decomposable functions in engineering systems
           • (Simon 69; Gibson 79; Goldberg 02)
             (Simon,69; Gibson,79; Goldberg,02)


       Design decomposition principle in:

       – Genetic Algorithms (GAs)
           • (Goldberg,02; Pelikan,05; Pelikan, Sastry & Cantú-Paz, 06)
       – Genetic Programming (GPs)
           • (Sastry & Goldberg, 03)
       – Learning Classifier Systems (LCS)
           • (Butz et al., 06; Llorà et al., 06)


                                    Enginyeria i Arquitectura la Salle    Slide 2
GRSI
Aim


       Our approach: build surrogates for classification.
       – Probabilistic models     Induce the form of a surrogate
       – Regression     Estimate the coefficients of the surrogate
       – Use the surrogate to classify new examples


       Surrogates used for efficiency enhancement of:
       – GAs:
           • (Sastry, Pelikan & Goldberg, 2004)
           • (Pelikan & Sastry, 2004)
           • (Sastry, Lima & Goldberg, 2006)
       – LCSs:
           • (Llorà, Sastry, Yu & Goldberg, 2007)


                                 Enginyeria i Arquitectura la Salle   Slide 3
GRSI
Outline


        1. Methodology

        2. Implementation

        3. Test P bl
        3 T t Problems

        4.
        4 Results

        5. Discussion

        6. Conclusions




                            Enginyeria i Arquitectura la Salle   Slide 4
GRSI
1. Methodology
                                                                                   2. Implementation
                                                                                   3. Test Problems
       Overview of the Methodology
                                gy                                                 3. Results
                                                                                   4. Discussion
                                                                                   4 Di      i
                                                                                   5. Conclusions




          Structural                  Surrogate                           Classification
         Model Layer                 Model Layer                          Model Layer




                                 •Infer structural form                 •Use the surrogate
        • Extract dependencies
                                  and the coefficients                   function to predict the
         between attributes
         bet een attrib tes
                                  based on the                           class of new
        • Expressed as:
                                  structural model                       examples
           • Linkage Groups
           • Matrices            (Sastry, Lima & Goldberg, 2006)
           • Bayesian Networks
        • Example: x-or



                                   Enginyeria i Arquitectura la Salle                                  Slide 5
GRSI
1. Methodology
                                                                         2. Implementation
                                                                         3. Test Problems
       Surrogate Model Layer
            g            y                                               3. Results
                                                                         4. Discussion
                                                                         4 Di      i
                                                                         5. Conclusions




        Input: Database of labeled examples with nominal attributes


        Substructure identified by the structural model layer
         – Example: [0 2] [1]


        Surrogate form induced from the structural model:

                f(x
                f( 0, x1, x2) = cI + c0x0 + c1x1 + c2x2 + c0,2x0x2


        Coefficients f
        C ffi i t of surrogates: li
                            t    linear regression methods
                                               i     th d



                                    Enginyeria i Arquitectura la Salle                       Slide 6
GRSI
1. Methodology
                                                                                               2. Implementation
                                                                                               3. Test Problems
       Surrogate Model Layer
            g            y                                                                     3. Results
                                                                                               4 Di      i
                                                                                               4. Discussion
                                                                                               5. Conclusions




        Map input examples to schema-index matrix
         – Consider all the schemas:



                           {0*0, 0*1, 1*0, 1*1, *0*, *1*}
                                [0 2]           [1]
         – I general, th number of schemas t be considered i
           In      l the    b    fh        to b     id d is:


                                               Number of linkage
                                                   groups


                                                                       Cardinality of the variable jth
                                                                         of the ith linkage group




                                  Enginyeria i Arquitectura la Salle                                               Slide 7
GRSI
1. Methodology
                                                                                                 2. Implementation
                                                                                                 3. Test Problems
       Surrogate Model Layer
            g            y                                                                       3. Results
                                                                                                 4. Discussion
                                                                                                 4 Di      i
                                                                                                 5. Conclusions




        Map each example to a binary vector of size msch
         – For each building block, the entry corresponding to the schemata contained
           in the example is one. All the other are zero.

                                                  [0 2]                              [1]

                                     0*0         0*1         1*0           1*1   *0*       *1*

                         010          1            0           0           0     0         1

                         100          0            0           1           0     1         0

                         111          0            0           0           1     0         1




                                      Enginyeria i Arquitectura la Salle                                             Slide 8
GRSI
1. Methodology
                                                                                2. Implementation
                                                                                3. Test Problems
       Surrogate Model Layer
            g            y                                                      3. Results
                                                                                4. Discussion
                                                                                4 Di      i
                                                                                5. Conclusions




        This results in the following matrix



                                                                      n input
                                                                     examples




        Each class is mapped to an integer




                                Enginyeria i Arquitectura la Salle                                  Slide 9
GRSI
1. Methodology
                                                                                             2. Implementation
                                                                                             3. Test Problems
       Surrogate Model Layer
            g            y                                                                   3. Results
                                                                                             4. Discussion
                                                                                             4 Di      i
                                                                                             5. Conclusions




        Solve the following linear system to estimate the
        coefficients of the surrogate model

             Input instances mapped                                 Class or label of each
              to the schemas matrix                                    input instance
                                                                         p

                                      Coefficients of the
                                       surrogate model



        To solve it, we use multi-dimensional least squares fitting
        approach:


         – Minimize:




                                          Enginyeria i Arquitectura la Salle                                     Slide 10
GRSI
1. Methodology
                                                                                   2. Implementation
                                                                                   3. Test Problems
       Overview of the Methodology
                                gy                                                 3. Results
                                                                                   4. Discussion
                                                                                   4 Di      i
                                                                                   5. Conclusions




          Structural                  Surrogate                           Classification
         Model Layer                 Model Layer                          Model Layer




                                 •Infer structural form                 •Use the surrogate
        • Extract dependencies
                                  and the coefficients                   function to predict the
         between attributes
         bet een attrib tes
                                  based on the                           class of new
        • Expressed as:
                                  structural model                       examples
           • Linkage Groups
           • Matrices            (Sastry, Lima & Goldberg, 2006)
           • Bayesian Networks
        • Example: x-or



                                   Enginyeria i Arquitectura la Salle                                  Slide 11
GRSI
1. Methodology
                                                                                             2. Implementation
                                                                                             3. Test Problems
       Classification Model Layer
                              y                                                              3. Results
                                                                                             4. Discussion
                                                                                             4 Di      i
                                                                                             5. Conclusions




        Use the surrogate to predict the class of a new input instance



                                     Map the example
           New input
                 p              to a vector e of length msch                       Output = round (x e)
                                                                                                  (x·e)
            example              according to the schemas
                                                                                                Coefficients of the
                                                                                                 surrogate model




                           0*0     0*1       1*0      1*1        *0*         *1*
                                                                                      Output = x·(001010)’
             001            0       0         1         0         1          0




                                        Enginyeria i Arquitectura la Salle                                       Slide 12
GRSI
1. Methodology
                                                                                   2. Implementation
                                                                                   3. Test Problems
       Overview of the Methodology
                                gy                                                 3. Results
                                                                                   4. Discussion
                                                                                   4 Di      i
                                                                                   5. Conclusions




          Structural                  Surrogate                           Classification
         Model Layer                 Model Layer                          Model Layer




                                 •Infer structural form                 •Use the surrogate
        • Extract dependencies
                                  and the coefficients                   function to predict the
         between attributes
         bet een attrib tes
                                  based on the                           class of new
        • Expressed as:
                                  structural model                       examples
           • Linkage Groups
           • Matrices            (Sastry, Lima & Goldberg, 2006)
           • Bayesian Networks
        • Example: x-or



                                   Enginyeria i Arquitectura la Salle                                  Slide 13
GRSI
1. Methodology

       How do we obtain the structural                                           2. Implementation
                                                                                 3. Test Problems
                                                                                 3. Results

       model?                                                                    4 Di      i
                                                                                 4. Discussion
                                                                                 5. Conclusions




         Greedy extraction of the structural model for classification
         (gESMC): Greedy search à la eCGA (Harik, 98)
         1.
         1 Start considering all variables independent sModel = [1] [2] … [n]
                                           independent.
         2. Divide the dataset in ten-fold cross-validation sets: train and test
         3. Build the surrogate with the train set and evaluate with the test set
         4. While there is improvement
             1. Create all combinations of two subset merges
                  Eg. { [1,2] .. [n]; [1,n] .. [2]; [1]..[2,n]}
             2. Build the surrogate for all candidates. Evaluate the quality with the test set
             3. Select the candidate with minimum error
             4 If the quality of the candidate is better than the parent, accept the model
                  th     lit f th       did t i b tt th th             t       t th    dl
             4.
                (update smodel) and go to 4.
             5. Otherwise, stop. sModel is optimal.
                         ,    p             p


                                          Enginyeria i Arquitectura la Salle                         Slide 14
GRSI
1. Methodology
                                                                         2. Implementation
                                                                         3. Test Problems
       Test Problems                                                     3. Results
                                                                         4 Di      i
                                                                         4. Discussion
                                                                         5. Conclusions




        Two-level hierarchical problems (Butz, Pelikan, Llorà & Goldberg, 2006)




                                    Enginyeria i Arquitectura la Salle                       Slide 15
GRSI
1. Methodology
                                                                                     2. Implementation
                                                                                     3. Test Problems
       Test Problems                                                                 3. Results
                                                                                     4. Discussion
                                                                                     4 Di      i
                                                                                     5. Conclusions




        Problems in the lower level

                                Condition
                                 length (l)
                                                                    Position of the left-most
         – Position Problem:   000110 :3                                 one-valued bit



                                Condition
                                 length (l)

         – Parity Problem:                                            Number of 1 mod 2
                               01001010 :1




                               Enginyeria i Arquitectura la Salle                                        Slide 16
GRSI
1. Methodology
                                                                                         2. Implementation
                                                                                         3. Test Problems
       Test Problems                                                                     3. Results
                                                                                         4. Discussion
                                                                                         4 Di      i
                                                                                         5. Conclusions




        Problems in the higher level

                                     Condition
                                      length (l)
         – Decoder Problem:                                            Integer value of the input
                                    000110 :5



                                        Condition
                                         length (l)
         – Count-ones Problem:                                         Number of ones of the input
                                       000110 :2

        Combinations of both levels:
         – Parity + decoder (
                            (hParDec)                  - Parity + count-ones (hParCount)
                y                   )                         y              (         )
         – Position + decoder (hPosDec)                - Position + count-ones (hPosCount)



                                  Enginyeria i Arquitectura la Salle                                         Slide 17
GRSI
1. Methodology
                                                                                              2. Implementation
                                                                                              3. Test Problems
       Results with 2-bit lower level blocks                                                  3. Results
                                                                                              4. Discussion
                                                                                              4 Di      i
                                                                                              5. Conclusions




         First experiment:
         –   hParDec, hPosDec, hParCount,
             hParDec hPosDec hParCount hPosCount
         –   Binary inputs of length 17.
         –   The first 6 bits are relevant The others are irrelevant
                                  relevant.
         –   3 blocks of two-bits low order problems
         –   Eg: hPosDec


                                                                                          Interacting variables:
                             10 01 00 00101010101                                         [x0 x1] [x2 x3] [x4 x5]

                                                                        irrelevant bits
                              210
                                                    Output = 21


                                   Enginyeria i Arquitectura la Salle                                             Slide 18
GRSI
1. Methodology
                                                                2. Implementation
                                                                3. Test Problems
       Results with 2-bit lower level blocks                    3. Results
                                                                4. Discussion
                                                                4 Di      i
                                                                5. Conclusions




         Performance of gESMC compared to SMO and C4.5:




                           Enginyeria i Arquitectura la Salle                       Slide 19
GRSI
1. Methodology
                                                                       2. Implementation
                                                                       3. Test Problems
       Results with 2-bit lower level blocks                           3. Results
                                                                       4 Di      i
                                                                       4. Discussion
                                                                       5. Conclusions




         Form and coefficients of the surrogate models
         F      d    ffi i t f th            t    dl
         – Interacting variables [x0 x1] [x2 x3] [x4 x5]




         – Linkages between variables are detected
         – Easy visualization of the salient variable interactions
         – All models only consist of the six relevant variables

                                  Enginyeria i Arquitectura la Salle                       Slide 20
GRSI
1. Methodology
                                                                         2. Implementation
                                                                         3. Test Problems
       Results with 2-bit lower level blocks                             3. Results
                                                                         4 Di      i
                                                                         4. Discussion
                                                                         5. Conclusions




         C4.5
         C4 5 created t
                  t d trees:
         – HPosDec and HPosCount
             • Trees of 53 nodes and 27 leaves on average
             • Only the six relevant variables in the decision nodes
         – HParDec
             • Trees of 270 nodes and 136 leaves on average
             • Irrelevant attributes in the decision nodes
         – HP C
           HParCount
                   t
             • Trees of 283 nodes and 142 leaves on average
             • Irrelevant attributes in the decision nodes


         SMO built support vector machines:
         – 351, 6, 6, and 28 machines with 16 weights each one were
           created for HPosDec, HPosCount, HParDec, and HParCount


                                    Enginyeria i Arquitectura la Salle                       Slide 21
GRSI
1. Methodology
                                                                                              2. Implementation
                                                                                              3. Test Problems
       Results with 3-bit lower level blocks                                                  3. Results
                                                                                              4. Discussion
                                                                                              4 Di      i
                                                                                              5. Conclusions




         Second experiment:
         –   hParDec, hPosDec, hParCount,
             hParDec hPosDec hParCount hPosCount
         –   Binary inputs of length 17.
         –   The first 6 bits are relevant The others are irrelevant
                                  relevant.
         –   2 blocks of three-bits low order problems
         –   Eg: hPosDec


                                                                                          Interacting variables:
                             000 100 00101010101                                           [x0 x1 x2] [x3 x4 x5]

                                                                        irrelevant bits
                               0      3
                                                    Output = 3


                                   Enginyeria i Arquitectura la Salle                                             Slide 22
GRSI
1. Methodology
                                                                               2. Implementation
                                                                               3. Test Problems
       Results with 3-bit lower level blocks                                   3. Results
                                                                               4 Di      i
                                                                               4. Discussion
                                                                               5. Conclusions




         Second experiment:
         – 2 blocks of three-bits low order problems




                                       Interesting observations:
                                       •As expected, the method stops the search when no
  M1: [0][1][2][3][4][5]               improvement is found
  M2: [0 1][2][3][4][5]                •In HParDec, half of the times the system gets a
                                       model that represents the salient interacting
  M3: [0 1 2][3][4][5]                 variables due to the stochasticity of the 10-fold CV
                                       estimation. Eg. [0 1 2 8] [3 4 9 5 6]

                                 Enginyeria i Arquitectura la Salle                                Slide 23
GRSI
1. Methodology
                                                                      2. Implementation
                                                                      3. Test Problems
       Discussion                                                     3. Results
                                                                      4. Discussion
                                                                      4 Di      i
                                                                      5. Conclusions




        Lack of guidance from low-order substructures
         – Increase the order of substructural merges




         – Select randomly one of the new structural models
            • Follow schemes such as simulated annealing (Korst & Aarts, 1997)




                                 Enginyeria i Arquitectura la Salle                       Slide 24
GRSI
1. Methodology
                                                                        2. Implementation
                                                                        3. Test Problems
       Discussion                                                       3. Results
                                                                        4. Discussion
                                                                        4 Di      i
                                                                        5. Conclusions




        Creating substructural models with overlapping
        structures
         – Problems with overlapping linkages
         – Eg: multiplexer problems
         – gESMC on the 6-bit mux results in: [0 1 2 3 4 5 ]
         – What we would like to obtain is: [0 1 2] [0 1 3] [0 1 4] [0 1 5]
         – Enhance the methodology permitting to represent overlapping
           linkages, using methods such as the design structure matrix
           genetic algorithm (DMSGA) (Yu, 2006)




                                   Enginyeria i Arquitectura la Salle                       Slide 25
GRSI
1. Methodology
                                                                        2. Implementation
                                                                        3. Test Problems
       Conclusions                                                      3. Results
                                                                        4 Di      i
                                                                        4. Discussion
                                                                        5. Conclusions




        Methodology to learn from decomposable problems
         – Structural model
         – Surrogate model
         – Classification model
        First implementation approach
         – greedy search of the best structural model
        Main advantage of gESMC over other learners
         – It provides the structural model j
              p                             jointly with the classification
                                                  y
           model.
        Some limitations of gESMC:
         – It depends on the guidance on the low-order structure
         – Several approaches outlined as further work.


                                   Enginyeria i Arquitectura la Salle                       Slide 26
GRSI
Substructural Surrogates for Learning
                   g                g
Decomposable Classification Problems:
  Implementation and First Results


       Albert Orriols Puig1,2              Kumara Sastry2
              Orriols-Puig
       David E. Goldberg2          Ester Bernadó-Mansilla1

               1Research  Group in Intelligent Systems
      Enginyeria i Arquitectura La Salle, Ramon Llull University
               2Illinois
                       Genetic Algorithms Laboratory
                                   g                 y
     Department of Industrial and Enterprise Systems Engineering
             University of Illinois at Urbana Champaign

More Related Content

Viewers also liked

JAEM'2007: Aprendizaje Supervisado de Reglas Difusas mediante un Sistema Clas...
JAEM'2007: Aprendizaje Supervisado de Reglas Difusas mediante un Sistema Clas...JAEM'2007: Aprendizaje Supervisado de Reglas Difusas mediante un Sistema Clas...
JAEM'2007: Aprendizaje Supervisado de Reglas Difusas mediante un Sistema Clas...Albert Orriols-Puig
 
HAIS'2008: Approximate versus Linguistic Representation in Fuzzy-UCS
HAIS'2008: Approximate versus Linguistic Representation in Fuzzy-UCSHAIS'2008: Approximate versus Linguistic Representation in Fuzzy-UCS
HAIS'2008: Approximate versus Linguistic Representation in Fuzzy-UCSAlbert Orriols-Puig
 
HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...
HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...
HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...Albert Orriols-Puig
 
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...Albert Orriols-Puig
 
IWLCS'2007: Fuzzy-UCS: Preliminary Results
IWLCS'2007: Fuzzy-UCS: Preliminary ResultsIWLCS'2007: Fuzzy-UCS: Preliminary Results
IWLCS'2007: Fuzzy-UCS: Preliminary ResultsAlbert Orriols-Puig
 
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...Albert Orriols-Puig
 
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...Albert Orriols-Puig
 
ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...
ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...
ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...Albert Orriols-Puig
 
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSHIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSAlbert Orriols-Puig
 
CCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data setsCCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data setsAlbert Orriols-Puig
 
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...Albert Orriols-Puig
 
HAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsHAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsAlbert Orriols-Puig
 
Lecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceLecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceAlbert Orriols-Puig
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentationlpaviglianiti
 

Viewers also liked (18)

JAEM'2007: Aprendizaje Supervisado de Reglas Difusas mediante un Sistema Clas...
JAEM'2007: Aprendizaje Supervisado de Reglas Difusas mediante un Sistema Clas...JAEM'2007: Aprendizaje Supervisado de Reglas Difusas mediante un Sistema Clas...
JAEM'2007: Aprendizaje Supervisado de Reglas Difusas mediante un Sistema Clas...
 
HAIS'2008: Approximate versus Linguistic Representation in Fuzzy-UCS
HAIS'2008: Approximate versus Linguistic Representation in Fuzzy-UCSHAIS'2008: Approximate versus Linguistic Representation in Fuzzy-UCS
HAIS'2008: Approximate versus Linguistic Representation in Fuzzy-UCS
 
HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...
HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...
HIS'2008: Artificial Data Sets based on Knowledge Generators: Analysis of Lea...
 
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
 
IWLCS'2007: Fuzzy-UCS: Preliminary Results
IWLCS'2007: Fuzzy-UCS: Preliminary ResultsIWLCS'2007: Fuzzy-UCS: Preliminary Results
IWLCS'2007: Fuzzy-UCS: Preliminary Results
 
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...
HIS'2008: Genetic-based Synthetic Data Sets for the Analysis of Classifiers B...
 
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...
New Challenges in Learning Classifier Systems: Mining Rarities and Evolving F...
 
ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...
ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...
ESTYLF'2008: Modelado Causal en Marketing mediante Aprendizaje no Supervisado...
 
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSHIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
 
CCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data setsCCIA'2008: On the dimensions of data complexity through synthetic data sets
CCIA'2008: On the dimensions of data complexity through synthetic data sets
 
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...
GECCO'2007: Modeling Selection Pressure in XCS for Proportionate and Tourname...
 
Lecture7 - IBk
Lecture7 - IBkLecture7 - IBk
Lecture7 - IBk
 
Lecture24
Lecture24Lecture24
Lecture24
 
Lecture23
Lecture23Lecture23
Lecture23
 
HAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsHAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasets
 
Lecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceLecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligence
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentation
 
Artificial inteligence
Artificial inteligenceArtificial inteligence
Artificial inteligence
 

Similar to IWLCS'2007: Substructural Surrogates for Learning Decomposable Classification Problems: Implementation and First Results

Transcription Factor DNA Binding Prediction
Transcription Factor DNA Binding PredictionTranscription Factor DNA Binding Prediction
Transcription Factor DNA Binding PredictionUT, San Antonio
 
Testing survey by_directions
Testing survey by_directionsTesting survey by_directions
Testing survey by_directionsTao He
 
Linking CSCL script design patterns: connections between assessment
Linking CSCL script design patterns: connections between assessmentLinking CSCL script design patterns: connections between assessment
Linking CSCL script design patterns: connections between assessmentevilfer
 
GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...
GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...
GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...Albert Orriols-Puig
 
COM623M1.doc.doc
COM623M1.doc.docCOM623M1.doc.doc
COM623M1.doc.docbutest
 
COM623M1.doc.doc
COM623M1.doc.docCOM623M1.doc.doc
COM623M1.doc.docbutest
 
Agile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity managementAgile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity managementAgnirudra Sikdar
 
Pal gov.tutorial1.session7 2.finalcheckandschemaengineeringissues
Pal gov.tutorial1.session7 2.finalcheckandschemaengineeringissuesPal gov.tutorial1.session7 2.finalcheckandschemaengineeringissues
Pal gov.tutorial1.session7 2.finalcheckandschemaengineeringissuesMustafa Jarrar
 
Applying soft computing techniques to corporate mobile security systems
Applying soft computing techniques to corporate mobile security systemsApplying soft computing techniques to corporate mobile security systems
Applying soft computing techniques to corporate mobile security systemsPaloma De Las Cuevas
 
Puzzle-Based Automatic Testing: Bringing Humans Into the Loop by Solving Puzz...
Puzzle-Based Automatic Testing: Bringing Humans Into the Loop by Solving Puzz...Puzzle-Based Automatic Testing: Bringing Humans Into the Loop by Solving Puzz...
Puzzle-Based Automatic Testing: Bringing Humans Into the Loop by Solving Puzz...Sung Kim
 
Large Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLarge Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLionel Briand
 
Self-supervised Learning Lecture Note
Self-supervised Learning Lecture NoteSelf-supervised Learning Lecture Note
Self-supervised Learning Lecture NoteSangwoo Mo
 
New challenges monolixday2011
New challenges monolixday2011New challenges monolixday2011
New challenges monolixday2011blaudez
 
Programacion multiobjetivo
Programacion multiobjetivoProgramacion multiobjetivo
Programacion multiobjetivoDiego Bass
 
Network Metrics and Measurements in the Era of the Digital Economies
Network Metrics and Measurements in the Era of the Digital EconomiesNetwork Metrics and Measurements in the Era of the Digital Economies
Network Metrics and Measurements in the Era of the Digital EconomiesPavel Loskot
 
Tackling Open Images Challenge (2019)
Tackling Open Images Challenge (2019)Tackling Open Images Challenge (2019)
Tackling Open Images Challenge (2019)Hiroto Honda
 

Similar to IWLCS'2007: Substructural Surrogates for Learning Decomposable Classification Problems: Implementation and First Results (20)

Transcription Factor DNA Binding Prediction
Transcription Factor DNA Binding PredictionTranscription Factor DNA Binding Prediction
Transcription Factor DNA Binding Prediction
 
ICSE10b.ppt
ICSE10b.pptICSE10b.ppt
ICSE10b.ppt
 
Testing survey by_directions
Testing survey by_directionsTesting survey by_directions
Testing survey by_directions
 
Ppt19
Ppt19Ppt19
Ppt19
 
Linking CSCL script design patterns: connections between assessment
Linking CSCL script design patterns: connections between assessmentLinking CSCL script design patterns: connections between assessment
Linking CSCL script design patterns: connections between assessment
 
GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...
GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...
GECCO'2007: Modeling XCS in Class Imbalances: Population Size and Parameter S...
 
COM623M1.doc.doc
COM623M1.doc.docCOM623M1.doc.doc
COM623M1.doc.doc
 
COM623M1.doc.doc
COM623M1.doc.docCOM623M1.doc.doc
COM623M1.doc.doc
 
Agile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity managementAgile analytics : An exploratory study of technical complexity management
Agile analytics : An exploratory study of technical complexity management
 
Pal gov.tutorial1.session7 2.finalcheckandschemaengineeringissues
Pal gov.tutorial1.session7 2.finalcheckandschemaengineeringissuesPal gov.tutorial1.session7 2.finalcheckandschemaengineeringissues
Pal gov.tutorial1.session7 2.finalcheckandschemaengineeringissues
 
Applying soft computing techniques to corporate mobile security systems
Applying soft computing techniques to corporate mobile security systemsApplying soft computing techniques to corporate mobile security systems
Applying soft computing techniques to corporate mobile security systems
 
Puzzle-Based Automatic Testing: Bringing Humans Into the Loop by Solving Puzz...
Puzzle-Based Automatic Testing: Bringing Humans Into the Loop by Solving Puzz...Puzzle-Based Automatic Testing: Bringing Humans Into the Loop by Solving Puzz...
Puzzle-Based Automatic Testing: Bringing Humans Into the Loop by Solving Puzz...
 
Large Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLarge Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and Repair
 
Self-supervised Learning Lecture Note
Self-supervised Learning Lecture NoteSelf-supervised Learning Lecture Note
Self-supervised Learning Lecture Note
 
RCIM 2008 - Modello Generale
RCIM 2008 - Modello GeneraleRCIM 2008 - Modello Generale
RCIM 2008 - Modello Generale
 
Geog2 question 2
Geog2 question 2Geog2 question 2
Geog2 question 2
 
New challenges monolixday2011
New challenges monolixday2011New challenges monolixday2011
New challenges monolixday2011
 
Programacion multiobjetivo
Programacion multiobjetivoProgramacion multiobjetivo
Programacion multiobjetivo
 
Network Metrics and Measurements in the Era of the Digital Economies
Network Metrics and Measurements in the Era of the Digital EconomiesNetwork Metrics and Measurements in the Era of the Digital Economies
Network Metrics and Measurements in the Era of the Digital Economies
 
Tackling Open Images Challenge (2019)
Tackling Open Images Challenge (2019)Tackling Open Images Challenge (2019)
Tackling Open Images Challenge (2019)
 

More from Albert Orriols-Puig (20)

Lecture22
Lecture22Lecture22
Lecture22
 
Lecture21
Lecture21Lecture21
Lecture21
 
Lecture20
Lecture20Lecture20
Lecture20
 
Lecture19
Lecture19Lecture19
Lecture19
 
Lecture18
Lecture18Lecture18
Lecture18
 
Lecture17
Lecture17Lecture17
Lecture17
 
Lecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART IIILecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART III
 
Lecture15 - Advances topics on association rules PART II
Lecture15 - Advances topics on association rules PART IILecture15 - Advances topics on association rules PART II
Lecture15 - Advances topics on association rules PART II
 
Lecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rulesLecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rules
 
Lecture13 - Association Rules
Lecture13 - Association RulesLecture13 - Association Rules
Lecture13 - Association Rules
 
Lecture12 - SVM
Lecture12 - SVMLecture12 - SVM
Lecture12 - SVM
 
Lecture11 - neural networks
Lecture11 - neural networksLecture11 - neural networks
Lecture11 - neural networks
 
Lecture10 - Naïve Bayes
Lecture10 - Naïve BayesLecture10 - Naïve Bayes
Lecture10 - Naïve Bayes
 
Lecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-TheoryLecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-Theory
 
Lecture8 - From CBR to IBk
Lecture8 - From CBR to IBkLecture8 - From CBR to IBk
Lecture8 - From CBR to IBk
 
Lecture6 - C4.5
Lecture6 - C4.5Lecture6 - C4.5
Lecture6 - C4.5
 
Lecture5 - C4.5
Lecture5 - C4.5Lecture5 - C4.5
Lecture5 - C4.5
 
Lecture4 - Machine Learning
Lecture4 - Machine LearningLecture4 - Machine Learning
Lecture4 - Machine Learning
 
Lecture3 - Machine Learning
Lecture3 - Machine LearningLecture3 - Machine Learning
Lecture3 - Machine Learning
 
Lecture2 - Machine Learning
Lecture2 - Machine LearningLecture2 - Machine Learning
Lecture2 - Machine Learning
 

Recently uploaded

Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...Pooja Nehwal
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 

Recently uploaded (20)

Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 

IWLCS'2007: Substructural Surrogates for Learning Decomposable Classification Problems: Implementation and First Results

  • 1. Substructural Surrogates for Learning g g Decomposable Classification Problems: Implementation and First Results Albert Orriols Puig1,2 Kumara Sastry2 Orriols-Puig David E. Goldberg2 Ester Bernadó-Mansilla1 1Research Group in Intelligent Systems Enginyeria i Arquitectura La Salle, Ramon Llull University 2Illinois Genetic Algorithms Laboratory g y Department of Industrial and Enterprise Systems Engineering University of Illinois at Urbana Champaign
  • 2. Motivation Nearly decomposable functions in engineering systems • (Simon 69; Gibson 79; Goldberg 02) (Simon,69; Gibson,79; Goldberg,02) Design decomposition principle in: – Genetic Algorithms (GAs) • (Goldberg,02; Pelikan,05; Pelikan, Sastry & Cantú-Paz, 06) – Genetic Programming (GPs) • (Sastry & Goldberg, 03) – Learning Classifier Systems (LCS) • (Butz et al., 06; Llorà et al., 06) Enginyeria i Arquitectura la Salle Slide 2 GRSI
  • 3. Aim Our approach: build surrogates for classification. – Probabilistic models Induce the form of a surrogate – Regression Estimate the coefficients of the surrogate – Use the surrogate to classify new examples Surrogates used for efficiency enhancement of: – GAs: • (Sastry, Pelikan & Goldberg, 2004) • (Pelikan & Sastry, 2004) • (Sastry, Lima & Goldberg, 2006) – LCSs: • (Llorà, Sastry, Yu & Goldberg, 2007) Enginyeria i Arquitectura la Salle Slide 3 GRSI
  • 4. Outline 1. Methodology 2. Implementation 3. Test P bl 3 T t Problems 4. 4 Results 5. Discussion 6. Conclusions Enginyeria i Arquitectura la Salle Slide 4 GRSI
  • 5. 1. Methodology 2. Implementation 3. Test Problems Overview of the Methodology gy 3. Results 4. Discussion 4 Di i 5. Conclusions Structural Surrogate Classification Model Layer Model Layer Model Layer •Infer structural form •Use the surrogate • Extract dependencies and the coefficients function to predict the between attributes bet een attrib tes based on the class of new • Expressed as: structural model examples • Linkage Groups • Matrices (Sastry, Lima & Goldberg, 2006) • Bayesian Networks • Example: x-or Enginyeria i Arquitectura la Salle Slide 5 GRSI
  • 6. 1. Methodology 2. Implementation 3. Test Problems Surrogate Model Layer g y 3. Results 4. Discussion 4 Di i 5. Conclusions Input: Database of labeled examples with nominal attributes Substructure identified by the structural model layer – Example: [0 2] [1] Surrogate form induced from the structural model: f(x f( 0, x1, x2) = cI + c0x0 + c1x1 + c2x2 + c0,2x0x2 Coefficients f C ffi i t of surrogates: li t linear regression methods i th d Enginyeria i Arquitectura la Salle Slide 6 GRSI
  • 7. 1. Methodology 2. Implementation 3. Test Problems Surrogate Model Layer g y 3. Results 4 Di i 4. Discussion 5. Conclusions Map input examples to schema-index matrix – Consider all the schemas: {0*0, 0*1, 1*0, 1*1, *0*, *1*} [0 2] [1] – I general, th number of schemas t be considered i In l the b fh to b id d is: Number of linkage groups Cardinality of the variable jth of the ith linkage group Enginyeria i Arquitectura la Salle Slide 7 GRSI
  • 8. 1. Methodology 2. Implementation 3. Test Problems Surrogate Model Layer g y 3. Results 4. Discussion 4 Di i 5. Conclusions Map each example to a binary vector of size msch – For each building block, the entry corresponding to the schemata contained in the example is one. All the other are zero. [0 2] [1] 0*0 0*1 1*0 1*1 *0* *1* 010 1 0 0 0 0 1 100 0 0 1 0 1 0 111 0 0 0 1 0 1 Enginyeria i Arquitectura la Salle Slide 8 GRSI
  • 9. 1. Methodology 2. Implementation 3. Test Problems Surrogate Model Layer g y 3. Results 4. Discussion 4 Di i 5. Conclusions This results in the following matrix n input examples Each class is mapped to an integer Enginyeria i Arquitectura la Salle Slide 9 GRSI
  • 10. 1. Methodology 2. Implementation 3. Test Problems Surrogate Model Layer g y 3. Results 4. Discussion 4 Di i 5. Conclusions Solve the following linear system to estimate the coefficients of the surrogate model Input instances mapped Class or label of each to the schemas matrix input instance p Coefficients of the surrogate model To solve it, we use multi-dimensional least squares fitting approach: – Minimize: Enginyeria i Arquitectura la Salle Slide 10 GRSI
  • 11. 1. Methodology 2. Implementation 3. Test Problems Overview of the Methodology gy 3. Results 4. Discussion 4 Di i 5. Conclusions Structural Surrogate Classification Model Layer Model Layer Model Layer •Infer structural form •Use the surrogate • Extract dependencies and the coefficients function to predict the between attributes bet een attrib tes based on the class of new • Expressed as: structural model examples • Linkage Groups • Matrices (Sastry, Lima & Goldberg, 2006) • Bayesian Networks • Example: x-or Enginyeria i Arquitectura la Salle Slide 11 GRSI
  • 12. 1. Methodology 2. Implementation 3. Test Problems Classification Model Layer y 3. Results 4. Discussion 4 Di i 5. Conclusions Use the surrogate to predict the class of a new input instance Map the example New input p to a vector e of length msch Output = round (x e) (x·e) example according to the schemas Coefficients of the surrogate model 0*0 0*1 1*0 1*1 *0* *1* Output = x·(001010)’ 001 0 0 1 0 1 0 Enginyeria i Arquitectura la Salle Slide 12 GRSI
  • 13. 1. Methodology 2. Implementation 3. Test Problems Overview of the Methodology gy 3. Results 4. Discussion 4 Di i 5. Conclusions Structural Surrogate Classification Model Layer Model Layer Model Layer •Infer structural form •Use the surrogate • Extract dependencies and the coefficients function to predict the between attributes bet een attrib tes based on the class of new • Expressed as: structural model examples • Linkage Groups • Matrices (Sastry, Lima & Goldberg, 2006) • Bayesian Networks • Example: x-or Enginyeria i Arquitectura la Salle Slide 13 GRSI
  • 14. 1. Methodology How do we obtain the structural 2. Implementation 3. Test Problems 3. Results model? 4 Di i 4. Discussion 5. Conclusions Greedy extraction of the structural model for classification (gESMC): Greedy search à la eCGA (Harik, 98) 1. 1 Start considering all variables independent sModel = [1] [2] … [n] independent. 2. Divide the dataset in ten-fold cross-validation sets: train and test 3. Build the surrogate with the train set and evaluate with the test set 4. While there is improvement 1. Create all combinations of two subset merges Eg. { [1,2] .. [n]; [1,n] .. [2]; [1]..[2,n]} 2. Build the surrogate for all candidates. Evaluate the quality with the test set 3. Select the candidate with minimum error 4 If the quality of the candidate is better than the parent, accept the model th lit f th did t i b tt th th t t th dl 4. (update smodel) and go to 4. 5. Otherwise, stop. sModel is optimal. , p p Enginyeria i Arquitectura la Salle Slide 14 GRSI
  • 15. 1. Methodology 2. Implementation 3. Test Problems Test Problems 3. Results 4 Di i 4. Discussion 5. Conclusions Two-level hierarchical problems (Butz, Pelikan, Llorà & Goldberg, 2006) Enginyeria i Arquitectura la Salle Slide 15 GRSI
  • 16. 1. Methodology 2. Implementation 3. Test Problems Test Problems 3. Results 4. Discussion 4 Di i 5. Conclusions Problems in the lower level Condition length (l) Position of the left-most – Position Problem: 000110 :3 one-valued bit Condition length (l) – Parity Problem: Number of 1 mod 2 01001010 :1 Enginyeria i Arquitectura la Salle Slide 16 GRSI
  • 17. 1. Methodology 2. Implementation 3. Test Problems Test Problems 3. Results 4. Discussion 4 Di i 5. Conclusions Problems in the higher level Condition length (l) – Decoder Problem: Integer value of the input 000110 :5 Condition length (l) – Count-ones Problem: Number of ones of the input 000110 :2 Combinations of both levels: – Parity + decoder ( (hParDec) - Parity + count-ones (hParCount) y ) y ( ) – Position + decoder (hPosDec) - Position + count-ones (hPosCount) Enginyeria i Arquitectura la Salle Slide 17 GRSI
  • 18. 1. Methodology 2. Implementation 3. Test Problems Results with 2-bit lower level blocks 3. Results 4. Discussion 4 Di i 5. Conclusions First experiment: – hParDec, hPosDec, hParCount, hParDec hPosDec hParCount hPosCount – Binary inputs of length 17. – The first 6 bits are relevant The others are irrelevant relevant. – 3 blocks of two-bits low order problems – Eg: hPosDec Interacting variables: 10 01 00 00101010101 [x0 x1] [x2 x3] [x4 x5] irrelevant bits 210 Output = 21 Enginyeria i Arquitectura la Salle Slide 18 GRSI
  • 19. 1. Methodology 2. Implementation 3. Test Problems Results with 2-bit lower level blocks 3. Results 4. Discussion 4 Di i 5. Conclusions Performance of gESMC compared to SMO and C4.5: Enginyeria i Arquitectura la Salle Slide 19 GRSI
  • 20. 1. Methodology 2. Implementation 3. Test Problems Results with 2-bit lower level blocks 3. Results 4 Di i 4. Discussion 5. Conclusions Form and coefficients of the surrogate models F d ffi i t f th t dl – Interacting variables [x0 x1] [x2 x3] [x4 x5] – Linkages between variables are detected – Easy visualization of the salient variable interactions – All models only consist of the six relevant variables Enginyeria i Arquitectura la Salle Slide 20 GRSI
  • 21. 1. Methodology 2. Implementation 3. Test Problems Results with 2-bit lower level blocks 3. Results 4 Di i 4. Discussion 5. Conclusions C4.5 C4 5 created t t d trees: – HPosDec and HPosCount • Trees of 53 nodes and 27 leaves on average • Only the six relevant variables in the decision nodes – HParDec • Trees of 270 nodes and 136 leaves on average • Irrelevant attributes in the decision nodes – HP C HParCount t • Trees of 283 nodes and 142 leaves on average • Irrelevant attributes in the decision nodes SMO built support vector machines: – 351, 6, 6, and 28 machines with 16 weights each one were created for HPosDec, HPosCount, HParDec, and HParCount Enginyeria i Arquitectura la Salle Slide 21 GRSI
  • 22. 1. Methodology 2. Implementation 3. Test Problems Results with 3-bit lower level blocks 3. Results 4. Discussion 4 Di i 5. Conclusions Second experiment: – hParDec, hPosDec, hParCount, hParDec hPosDec hParCount hPosCount – Binary inputs of length 17. – The first 6 bits are relevant The others are irrelevant relevant. – 2 blocks of three-bits low order problems – Eg: hPosDec Interacting variables: 000 100 00101010101 [x0 x1 x2] [x3 x4 x5] irrelevant bits 0 3 Output = 3 Enginyeria i Arquitectura la Salle Slide 22 GRSI
  • 23. 1. Methodology 2. Implementation 3. Test Problems Results with 3-bit lower level blocks 3. Results 4 Di i 4. Discussion 5. Conclusions Second experiment: – 2 blocks of three-bits low order problems Interesting observations: •As expected, the method stops the search when no M1: [0][1][2][3][4][5] improvement is found M2: [0 1][2][3][4][5] •In HParDec, half of the times the system gets a model that represents the salient interacting M3: [0 1 2][3][4][5] variables due to the stochasticity of the 10-fold CV estimation. Eg. [0 1 2 8] [3 4 9 5 6] Enginyeria i Arquitectura la Salle Slide 23 GRSI
  • 24. 1. Methodology 2. Implementation 3. Test Problems Discussion 3. Results 4. Discussion 4 Di i 5. Conclusions Lack of guidance from low-order substructures – Increase the order of substructural merges – Select randomly one of the new structural models • Follow schemes such as simulated annealing (Korst & Aarts, 1997) Enginyeria i Arquitectura la Salle Slide 24 GRSI
  • 25. 1. Methodology 2. Implementation 3. Test Problems Discussion 3. Results 4. Discussion 4 Di i 5. Conclusions Creating substructural models with overlapping structures – Problems with overlapping linkages – Eg: multiplexer problems – gESMC on the 6-bit mux results in: [0 1 2 3 4 5 ] – What we would like to obtain is: [0 1 2] [0 1 3] [0 1 4] [0 1 5] – Enhance the methodology permitting to represent overlapping linkages, using methods such as the design structure matrix genetic algorithm (DMSGA) (Yu, 2006) Enginyeria i Arquitectura la Salle Slide 25 GRSI
  • 26. 1. Methodology 2. Implementation 3. Test Problems Conclusions 3. Results 4 Di i 4. Discussion 5. Conclusions Methodology to learn from decomposable problems – Structural model – Surrogate model – Classification model First implementation approach – greedy search of the best structural model Main advantage of gESMC over other learners – It provides the structural model j p jointly with the classification y model. Some limitations of gESMC: – It depends on the guidance on the low-order structure – Several approaches outlined as further work. Enginyeria i Arquitectura la Salle Slide 26 GRSI
  • 27. Substructural Surrogates for Learning g g Decomposable Classification Problems: Implementation and First Results Albert Orriols Puig1,2 Kumara Sastry2 Orriols-Puig David E. Goldberg2 Ester Bernadó-Mansilla1 1Research Group in Intelligent Systems Enginyeria i Arquitectura La Salle, Ramon Llull University 2Illinois Genetic Algorithms Laboratory g y Department of Industrial and Enterprise Systems Engineering University of Illinois at Urbana Champaign