The Compact Classifier System:
Scalability Analysis and First Results

      Xavier Llorà, Kumara Sastry, & David E. Goldberg

                  Illinois Genetic Algorithms Lab
             University of Illinois at Urbana-Champaign


            {xllora,kumara,deg}@illigal.ge.uiuc.edu
Motivation
    • Pittsburgh classifier systems
    • Can we apply Wilson’s ideas for evolving rule sets
      formed only by maximally accurate and general rules?
    • Bottom up approach for evolving such rules
             – The compact classifier system
    • Previous multiobjective approaches (Llorà, Goldberg, Traus,
      Bernadó, 2003) were top down
             – Explicitly address accuracy and generality
             – Use them to push toward and produce compact rule sets
    • Side product:
             – Scalability challenge of De Jong & Spears (1991) representation

IWLCS 2005, Llorà, Sastry, & Goldberg, 2005
Binary Rule Encoding
    • De Jong & Spears (1991)
    • Widely used in Pittsburgh classifiers
    • GALE, MOLS, GAssist have used it
        attribute | color                | shape        | size
        value     | red green blue white | round square | huge large medium small
        bit       | 1   1     1    1     | 0     1      | 0    1     1      0


    • A rule is expressed as (1111|01|0110)
    • Equivalent to Holland's (1975) representation (#11,#12)
    • A rule set is a disjunction of such rules
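The encoding above can be sketched in a few lines of Python. This is a minimal illustration, not code from the talk: the `ATTRIBUTES` table copies the attribute and value order from the slide, while the helper names `parse_rule`, `matches`, and `rule_set_matches` are hypothetical.

```python
from typing import Dict, List

# Attribute order and value order follow the example table on the slide.
ATTRIBUTES: Dict[str, List[str]] = {
    "color": ["red", "green", "blue", "white"],
    "shape": ["round", "square"],
    "size": ["huge", "large", "medium", "small"],
}


def parse_rule(text: str) -> List[List[int]]:
    """Turn '(1111|01|0110)' into one list of bits per attribute."""
    return [[int(bit) for bit in block] for block in text.strip("()").split("|")]


def matches(rule: List[List[int]], instance: Dict[str, str]) -> bool:
    """A rule matches when, for every attribute, the bit of the instance's value is 1."""
    for (name, values), bits in zip(ATTRIBUTES.items(), rule):
        if bits[values.index(instance[name])] != 1:
            return False
    return True


def rule_set_matches(rules: List[List[List[int]]], instance: Dict[str, str]) -> bool:
    """A rule set is a disjunction: it matches if any of its rules matches."""
    return any(matches(rule, instance) for rule in rules)


rule = parse_rule("(1111|01|0110)")
print(matches(rule, {"color": "red", "shape": "square", "size": "large"}))
print(matches(rule, {"color": "red", "shape": "round", "size": "large"}))
```

Each attribute block is an internal disjunction (any value whose bit is 1 is accepted), and the blocks are combined conjunctively.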


Previous Efforts Using Multiobjective Optimization

    [Figure not recovered] (Llorà, Goldberg, Traus, Bernadó, 2003)

Maximally Accurate and General Rules
    • Accuracy and generality can be computed using the data set

          \alpha(r) = \frac{n_{t+}(r) + n_{t-}(r)}{n_t}
          \qquad
          \varepsilon(r) = \frac{n_{t+}(r)}{n_m}

    • Fitness should combine accuracy and generality

          f(r) = \alpha(r) \cdot \varepsilon(r)^{\gamma}

    • Such a measure can be applied either to rules or to rule sets
    • The compact classifier system uses this fitness and a
      compact genetic algorithm (cGA) to evolve such rules
    • Each cGA run uses a different initial perturbed probability
      vector
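As a rough illustration of the fitness above, the sketch below evaluates f(r) on the 3-input multiplexer. The slide does not define the counts, so the reading used here is an assumption: n_t+ is the number of positive examples the rule matches, n_t- the number of negative examples it leaves unmatched, n_t the size of the data set, and n_m the number of examples the rule matches. The value of γ and the function name `rule_fitness` are also assumptions.

```python
from itertools import product
from typing import Callable, List, Tuple

Instance = Tuple[int, ...]


def rule_fitness(
    matches: Callable[[Instance], bool],
    data: List[Tuple[Instance, int]],
    gamma: float = 1.0,  # assumed exponent; the slides do not give a value
) -> float:
    """f(r) = alpha(r) * epsilon(r)**gamma under the assumed reading of the counts."""
    n_t = len(data)
    n_m = sum(1 for x, _ in data if matches(x))
    n_tp = sum(1 for x, y in data if matches(x) and y == 1)
    n_tn = sum(1 for x, y in data if not matches(x) and y == 0)
    alpha = (n_tp + n_tn) / n_t
    epsilon = n_tp / n_m if n_m else 0.0  # assumed: a rule matching nothing scores 0
    return alpha * epsilon ** gamma


# 3-input multiplexer: one address bit a selects data bit d0 or d1.
data = [((a, d0, d1), d1 if a else d0) for a, d0, d1 in product((0, 1), repeat=3)]

rule_01x = lambda x: x[0] == 0 and x[1] == 1  # the rule 01#
rule_any = lambda x: True                     # the rule ###

print(rule_fitness(rule_01x, data))
print(rule_fitness(rule_any, data))
```

Under these assumptions the accurate rule 01# scores higher than the overgeneral rule ###, which matches every instance but is right only half of the time.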
The Compact Genetic Algorithm Can Make It

    • Rules may be obtained by optimizing

          f(r) = \alpha(r) \cdot \varepsilon(r)^{\gamma}

    • The basic cGA scheme
             1. Initialization: p^{0}_{x_i} = 0.5
             2. Model sampling (two individuals are generated)
             3. Evaluation (f(r))
             4. Selection (tournament selection)
             5. Probabilistic model update
             6. Repeat steps 2-5 until the termination criteria are met
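The six steps can be sketched as follows. This is a minimal version of the standard cGA update, in which the probability vector moves by 1/n toward the tournament winner wherever the two sampled individuals differ. The simulated population size `n`, the iteration cap, and the convergence test are assumptions; the slides do not state the settings used in the experiments.

```python
import random
from typing import Callable, List, Optional, Sequence


def compact_ga(
    fitness: Callable[[Sequence[int]], float],
    length: int,
    n: int = 50,                 # assumed simulated population size
    max_iters: int = 10_000,     # assumed safety cap on iterations
    p0: Optional[Sequence[float]] = None,
    rng: Optional[random.Random] = None,
) -> List[float]:
    """Return the final probability vector of a basic compact GA."""
    rng = rng or random.Random()
    # 1. Initialization: every gene starts at 0.5 unless a vector is supplied.
    p = list(p0) if p0 is not None else [0.5] * length
    eps = 1e-9
    for _ in range(max_iters):
        # 6. Termination: stop once every probability has converged to 0 or 1.
        if all(pi < eps or pi > 1.0 - eps for pi in p):
            break
        # 2. Model sampling: draw two individuals from the probability vector.
        a = [1 if rng.random() < pi else 0 for pi in p]
        b = [1 if rng.random() < pi else 0 for pi in p]
        # 3-4. Evaluation and tournament selection between the two.
        winner, loser = (a, b) if fitness(a) >= fitness(b) else (b, a)
        # 5. Model update: shift p by 1/n toward the winner where they differ.
        for i in range(length):
            if winner[i] != loser[i]:
                shift = 1.0 / n if winner[i] == 1 else -1.0 / n
                p[i] = min(1.0, max(0.0, p[i] + shift))
    return p


# Usage on a toy problem (count of ones), not on the rule fitness itself.
print(compact_ga(sum, length=8, rng=random.Random(1)))
```

Passing a different `p0` on each run is how an initial perturbed probability vector would be supplied to this sketch.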



cGA Model Perturbation
    • Facilitate the evolution of different rules
    • Explore the frequency of appearance of each optimal
      rule
    • Initial model perturbation

          p^{0}_{x_i} = 0.5 + U(-0.4, 0.4)

    • Experiments using the 3-input multiplexer
    • 1,000 independent runs
    • Visualize the pair-wise relations of the genes
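A perturbed initial vector of this kind could be drawn as in the sketch below. The function name and the `spread` parameter are illustrative; the length of 6 assumes two bits per binary attribute for the 3-input multiplexer, as in the rules 100111 and 011101 shown on the following slides.

```python
import random
from typing import List, Optional


def perturbed_initial_vector(
    length: int,
    spread: float = 0.4,
    rng: Optional[random.Random] = None,
) -> List[float]:
    """Draw p_i = 0.5 + U(-spread, spread) independently for every gene."""
    rng = rng or random.Random()
    return [0.5 + rng.uniform(-spread, spread) for _ in range(length)]


# One perturbed vector per independent run, here for 6-bit rules.
runs = [perturbed_initial_vector(6, rng=random.Random(seed)) for seed in range(3)]
for vector in runs:
    print([round(p, 2) for p in vector])
```

With a spread of 0.4 every initial probability stays between 0.1 and 0.9, so no gene starts out fixed.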



Initial Perturbed Vectors Leading to Rule 100111 (01#)

    [Figure not recovered; annotation: problem structure]

Initial Perturbed Vectors Leading to Rule 011101 (1#1)

    [Figure not recovered; annotation: problem structure]

Perturbation Summary
    • 97% of the runs lead to a maximally general and
      accurate rule
    • The probability of evolving each of the optimal rules was
      roughly 1/3
    • The initial perturbed probability vectors that lead to an
      optimal rule show pair-wise relations among genes
    • The pair-wise relations reflect the problem structure




But One Rule Is Not Enough
    • Model perturbation in the cGA evolves different rules
    • The goal: evolve a population of rules that solve the
      problem together
    • The fitness measure (f(r)) can also be applied to rule
      sets
    • Two mechanisms:
             – Spawn a new population until the solution is met
             – Fuse populations when they represent the same rule




Spawning and Fusing Populations of Rules

    [Figure not recovered]

Experiments & Scalability
    • Analysis using multiplexer problems (3-, 6-, and 11-input)
    • The number of rules in [O] grows exponentially
             – 2^i, where i is the number of inputs
    • cGA success as a function of the problem size
             – 3-input: 97%
             – 6-input: 73.93%
             – 11-input: 43.03%
    • Scalability over 10,000 independent runs
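For readers unfamiliar with the benchmark, the sketch below defines the multiplexer problems named above. It assumes the usual layout of k address bits followed by 2^k data bits, with the address read most-significant bit first; the slides do not spell out the layout, although it agrees with the rules 01# and 1#1 shown for the 3-input case.

```python
from itertools import product
from typing import List, Sequence, Tuple


def multiplexer(bits: Sequence[int], k: int) -> int:
    """Return the data bit selected by the first k (address) bits."""
    if len(bits) != k + 2 ** k:
        raise ValueError("expected k address bits followed by 2**k data bits")
    address = 0
    for bit in bits[:k]:
        address = 2 * address + bit  # most-significant address bit first (assumed)
    return bits[k + address]


def multiplexer_dataset(k: int) -> List[Tuple[Tuple[int, ...], int]]:
    """Enumerate every input string together with its multiplexer output."""
    return [(x, multiplexer(x, k)) for x in product((0, 1), repeat=k + 2 ** k)]


# k = 1, 2, 3 give the 3-, 6-, and 11-input problems.
for k in (1, 2, 3):
    print(k + 2 ** k, "inputs:", len(multiplexer_dataset(k)), "instances")
```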




Scalability of CCS

    [Figure not recovered]

Unmatchable Rules: A Byproduct
    • A rule is unmatchable if:
             – At least one attribute in the condition has all its possible values
               set to 0

        attribute | color                | shape        | size
        value     | red green blue white | round square | huge large medium small
        bit       | 1   1     1    1     | 0     0      | 0    1     1      0

    • The rule (1111|00|0110) forces the shape to be neither
      round nor square
    • Hence no data instance will ever match it
    • Direct impact on the scalability of LCS/GBML systems using
      it (as simple experiments with the multiplexer show)
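The unmatchability condition is easy to test mechanically. The helper below is a hypothetical sketch that takes a rule in the `(…|…|…)` notation used on the slide and reports whether any attribute block is all zeros.

```python
def is_unmatchable(rule: str) -> bool:
    """True when at least one attribute block contains no 1 at all."""
    blocks = rule.strip("()").split("|")
    return any("1" not in block for block in blocks)


print(is_unmatchable("(1111|00|0110)"))  # shape block is 00
print(is_unmatchable("(1111|01|0110)"))  # every block allows some value
```

A system using this encoding could run such a check before evaluating a rule, since an unmatchable rule cannot cover any instance regardless of the data.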
3-Input Multiplexer

    [Figure not recovered]
    37 unmatchable rules (57.8%)

6-Input Multiplexer

    [Figure not recovered]
    3,367 unmatchable rules (82.2%)

11-Input Multiplexer

    [Figure not recovered]
    4,017,157 unmatchable rules (95.7%)

Growth Ratio of Unmatchable Rules (I/III)
    • An unmatchable rule has at least one attribute with all its values set to 0
    • Analysis for problems with binary attributes (worst case)
    • The total number of rules, for rules of length l bits

          \Sigma(l) = 2^{l}

    • Number of matchable rules (every attribute set to
      01, 10, or 11)

          \Psi(l) = 3^{l/2}

    • Size of the unmatchable rule set plateau

          \Phi(l) = \Sigma(l) - \Psi(l) = 2^{l} - 3^{l/2}
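These counts can be checked directly. The sketch below evaluates the closed forms for binary attributes (two bits per attribute, so l is twice the number of attributes) and confirms them by brute-force enumeration for the small cases; the function names are illustrative.

```python
from itertools import product
from typing import Tuple


def count_rules(num_attributes: int) -> Tuple[int, int, int]:
    """Return (total, matchable, unmatchable) rule counts for binary attributes."""
    l = 2 * num_attributes          # rule length in bits
    total = 2 ** l                  # Sigma(l)
    matchable = 3 ** num_attributes # Psi(l) = 3^(l/2): each attribute is 01, 10, or 11
    return total, matchable, total - matchable


def brute_force_unmatchable(num_attributes: int) -> int:
    """Count rules with at least one attribute block equal to 00 by enumeration."""
    count = 0
    for bits in product("01", repeat=2 * num_attributes):
        blocks = [bits[i:i + 2] for i in range(0, len(bits), 2)]
        if any(block == ("0", "0") for block in blocks):
            count += 1
    return count


for inputs in (3, 6, 11):
    total, matchable, unmatchable = count_rules(inputs)
    print(inputs, total, matchable, unmatchable, round(100 * unmatchable / total, 2))
```

The closed forms give 37, 3,367, and 4,017,157 unmatchable rules for the 3-, 6-, and 11-input multiplexers, the same counts reported on the multiplexer slides.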



Growth Ratio of Unmatchable Rules (II/III)
    • Growth ratio of unmatchable rules

          \rho(l) = \frac{\Phi(l)}{\Psi(l)} = \frac{2^{l}}{3^{l/2}} - 1

    • It can be approximated by

          \rho(l) \approx e^{cl}, \qquad c = \ln\left(\frac{2}{\sqrt{3}}\right) \approx 0.1438

    • The growth ratio (ρ) for this representation grows
      exponentially
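The approximation follows in one step from the definitions Σ(l) = 2^l and Ψ(l) = 3^(l/2). The derivation below is a worked restatement, with the constant written with the square root that the l/2 exponent implies; the last line, the fraction of all rules that are unmatchable, is an added consequence rather than a formula from the slides.

```latex
\begin{align*}
\rho(l) &= \frac{\Phi(l)}{\Psi(l)}
         = \frac{2^{l} - 3^{l/2}}{3^{l/2}}
         = \left(\frac{2}{\sqrt{3}}\right)^{l} - 1 \\
        &= e^{\,l \ln(2/\sqrt{3})} - 1 \;\approx\; e^{cl},
        \qquad c = \ln 2 - \tfrac{1}{2}\ln 3 \approx 0.1438 \\[4pt]
\frac{\Phi(l)}{\Sigma(l)} &= 1 - \left(\frac{\sqrt{3}}{2}\right)^{l}
\end{align*}
```

For l = 6, 12, and 22 the last expression evaluates to roughly 0.578, 0.822, and 0.958, in line with the percentages reported for the 3-, 6-, and 11-input multiplexers.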

Growth Ratio of Unmatchable Rules (III/III)

    [Figure not recovered]

Weitere ähnliche Inhalte

Andere mochten auch

The role organizational rewards on employees motivation
The role organizational rewards on employees motivationThe role organizational rewards on employees motivation
The role organizational rewards on employees motivationAwais Sargana
 
Questionnaire on training
Questionnaire on trainingQuestionnaire on training
Questionnaire on trainingswati_tiwari
 
Time and space complexity
Time and space complexityTime and space complexity
Time and space complexityAnkit Katiyar
 
The Secret To Employee Motivation
The Secret To Employee MotivationThe Secret To Employee Motivation
The Secret To Employee MotivationD B
 
Questionnaire on training and development
Questionnaire on training and developmentQuestionnaire on training and development
Questionnaire on training and developmentSusmitha Chowdary
 

Andere mochten auch (6)

Complexity of Algorithm
Complexity of AlgorithmComplexity of Algorithm
Complexity of Algorithm
 
The role organizational rewards on employees motivation
The role organizational rewards on employees motivationThe role organizational rewards on employees motivation
The role organizational rewards on employees motivation
 
Questionnaire on training
Questionnaire on trainingQuestionnaire on training
Questionnaire on training
 
Time and space complexity
Time and space complexityTime and space complexity
Time and space complexity
 
The Secret To Employee Motivation
The Secret To Employee MotivationThe Secret To Employee Motivation
The Secret To Employee Motivation
 
Questionnaire on training and development
Questionnaire on training and developmentQuestionnaire on training and development
Questionnaire on training and development
 

Ähnlich wie The compact classifier system: Motivation, analysis and first results

Linkage Learning for Pittsburgh LCS: Making Problems Tractable
Linkage Learning for Pittsburgh LCS: Making Problems TractableLinkage Learning for Pittsburgh LCS: Making Problems Tractable
Linkage Learning for Pittsburgh LCS: Making Problems TractableXavier Llorà
 
Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...
Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...
Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...Xavier Llorà
 
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...Albert Orriols-Puig
 
Evolution of regression ols to gps to mars
Evolution of regression   ols to gps to marsEvolution of regression   ols to gps to mars
Evolution of regression ols to gps to marsSalford Systems
 
Extending R-squared beyond ordinary least-squares linear regression
Extending R-squared beyond ordinary least-squares linear regressionExtending R-squared beyond ordinary least-squares linear regression
Extending R-squared beyond ordinary least-squares linear regressionpcdjohnson
 
Reporting.ppt
Reporting.pptReporting.ppt
Reporting.pptasodiatel
 
Revisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural NetworksRevisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural NetworksSungchul Kim
 
Pittsburgh Learning Classifier Systems for Protein Structure Prediction: Sca...
Pittsburgh Learning Classifier Systems for Protein  Structure Prediction: Sca...Pittsburgh Learning Classifier Systems for Protein  Structure Prediction: Sca...
Pittsburgh Learning Classifier Systems for Protein Structure Prediction: Sca...Xavier Llorà
 
Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...
Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...
Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...Sangwoo Mo
 

Ähnlich wie The compact classifier system: Motivation, analysis and first results (9)

Linkage Learning for Pittsburgh LCS: Making Problems Tractable
Linkage Learning for Pittsburgh LCS: Making Problems TractableLinkage Learning for Pittsburgh LCS: Making Problems Tractable
Linkage Learning for Pittsburgh LCS: Making Problems Tractable
 
Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...
Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...
Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...
 
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
 
Evolution of regression ols to gps to mars
Evolution of regression   ols to gps to marsEvolution of regression   ols to gps to mars
Evolution of regression ols to gps to mars
 
Extending R-squared beyond ordinary least-squares linear regression
Extending R-squared beyond ordinary least-squares linear regressionExtending R-squared beyond ordinary least-squares linear regression
Extending R-squared beyond ordinary least-squares linear regression
 
Reporting.ppt
Reporting.pptReporting.ppt
Reporting.ppt
 
Revisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural NetworksRevisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural Networks
 
Pittsburgh Learning Classifier Systems for Protein Structure Prediction: Sca...
Pittsburgh Learning Classifier Systems for Protein  Structure Prediction: Sca...Pittsburgh Learning Classifier Systems for Protein  Structure Prediction: Sca...
Pittsburgh Learning Classifier Systems for Protein Structure Prediction: Sca...
 
Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...
Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...
Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...
 

Mehr von Xavier Llorà

Meandre 2.0 Alpha Preview
Meandre 2.0 Alpha PreviewMeandre 2.0 Alpha Preview
Meandre 2.0 Alpha PreviewXavier Llorà
 
Soaring the Clouds with Meandre
Soaring the Clouds with MeandreSoaring the Clouds with Meandre
Soaring the Clouds with MeandreXavier Llorà
 
From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0
From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0
From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0Xavier Llorà
 
Large Scale Data Mining using Genetics-Based Machine Learning
Large Scale Data Mining using   Genetics-Based Machine LearningLarge Scale Data Mining using   Genetics-Based Machine Learning
Large Scale Data Mining using Genetics-Based Machine LearningXavier Llorà
 
Data-Intensive Computing for Competent Genetic Algorithms: A Pilot Study us...
Data-Intensive Computing for  Competent Genetic Algorithms:  A Pilot Study us...Data-Intensive Computing for  Competent Genetic Algorithms:  A Pilot Study us...
Data-Intensive Computing for Competent Genetic Algorithms: A Pilot Study us...Xavier Llorà
 
Scalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new Trends
Scalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new TrendsScalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new Trends
Scalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new TrendsXavier Llorà
 
Towards a Theoretical Towards a Theoretical Framework for LCS Framework fo...
Towards a Theoretical  Towards a Theoretical  Framework for LCS  Framework fo...Towards a Theoretical  Towards a Theoretical  Framework for LCS  Framework fo...
Towards a Theoretical Towards a Theoretical Framework for LCS Framework fo...Xavier Llorà
 
Learning Classifier Systems for Class Imbalance Problems
Learning Classifier Systems  for Class Imbalance  ProblemsLearning Classifier Systems  for Class Imbalance  Problems
Learning Classifier Systems for Class Imbalance ProblemsXavier Llorà
 
A Retrospective Look at A Retrospective Look at Classifier System ResearchCl...
A Retrospective Look at  A Retrospective Look at  Classifier System ResearchCl...A Retrospective Look at  A Retrospective Look at  Classifier System ResearchCl...
A Retrospective Look at A Retrospective Look at Classifier System ResearchCl...Xavier Llorà
 
XCS: Current capabilities and future challenges
XCS: Current capabilities and future  challengesXCS: Current capabilities and future  challenges
XCS: Current capabilities and future challengesXavier Llorà
 
Negative Selection for Algorithm for Anomaly Detection
Negative Selection for Algorithm for Anomaly DetectionNegative Selection for Algorithm for Anomaly Detection
Negative Selection for Algorithm for Anomaly DetectionXavier Llorà
 
Searle, Intentionality, and the Future of Classifier Systems
Searle, Intentionality, and the  Future of Classifier SystemsSearle, Intentionality, and the  Future of Classifier Systems
Searle, Intentionality, and the Future of Classifier SystemsXavier Llorà
 
Computed Prediction: So far, so good. What now?
Computed Prediction:  So far, so good. What now?Computed Prediction:  So far, so good. What now?
Computed Prediction: So far, so good. What now?Xavier Llorà
 
Meandre: Semantic-Driven Data-Intensive Flows in the Clouds
Meandre: Semantic-Driven Data-Intensive Flows in the CloudsMeandre: Semantic-Driven Data-Intensive Flows in the Clouds
Meandre: Semantic-Driven Data-Intensive Flows in the CloudsXavier Llorà
 
ZigZag: The Meandring Language
ZigZag: The Meandring LanguageZigZag: The Meandring Language
ZigZag: The Meandring LanguageXavier Llorà
 
HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...
HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...
HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...Xavier Llorà
 
Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...
Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...
Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...Xavier Llorà
 
Visualizing content in metadata stores
Visualizing content in metadata storesVisualizing content in metadata stores
Visualizing content in metadata storesXavier Llorà
 

Mehr von Xavier Llorà (20)

Meandre 2.0 Alpha Preview
Meandre 2.0 Alpha PreviewMeandre 2.0 Alpha Preview
Meandre 2.0 Alpha Preview
 
Soaring the Clouds with Meandre
Soaring the Clouds with MeandreSoaring the Clouds with Meandre
Soaring the Clouds with Meandre
 
From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0
From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0
From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0
 
Large Scale Data Mining using Genetics-Based Machine Learning
Large Scale Data Mining using   Genetics-Based Machine LearningLarge Scale Data Mining using   Genetics-Based Machine Learning
Large Scale Data Mining using Genetics-Based Machine Learning
 
Data-Intensive Computing for Competent Genetic Algorithms: A Pilot Study us...
Data-Intensive Computing for  Competent Genetic Algorithms:  A Pilot Study us...Data-Intensive Computing for  Competent Genetic Algorithms:  A Pilot Study us...
Data-Intensive Computing for Competent Genetic Algorithms: A Pilot Study us...
 
Scalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new Trends
Scalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new TrendsScalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new Trends
Scalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new Trends
 
Towards a Theoretical Towards a Theoretical Framework for LCS Framework fo...
Towards a Theoretical  Towards a Theoretical  Framework for LCS  Framework fo...Towards a Theoretical  Towards a Theoretical  Framework for LCS  Framework fo...
Towards a Theoretical Towards a Theoretical Framework for LCS Framework fo...
 
Learning Classifier Systems for Class Imbalance Problems
Learning Classifier Systems  for Class Imbalance  ProblemsLearning Classifier Systems  for Class Imbalance  Problems
Learning Classifier Systems for Class Imbalance Problems
 
A Retrospective Look at A Retrospective Look at Classifier System ResearchCl...
A Retrospective Look at  A Retrospective Look at  Classifier System ResearchCl...A Retrospective Look at  A Retrospective Look at  Classifier System ResearchCl...
A Retrospective Look at A Retrospective Look at Classifier System ResearchCl...
 
XCS: Current capabilities and future challenges
XCS: Current capabilities and future  challengesXCS: Current capabilities and future  challenges
XCS: Current capabilities and future challenges
 
Negative Selection for Algorithm for Anomaly Detection
Negative Selection for Algorithm for Anomaly DetectionNegative Selection for Algorithm for Anomaly Detection
Negative Selection for Algorithm for Anomaly Detection
 
Searle, Intentionality, and the Future of Classifier Systems
Searle, Intentionality, and the  Future of Classifier SystemsSearle, Intentionality, and the  Future of Classifier Systems
Searle, Intentionality, and the Future of Classifier Systems
 
Computed Prediction: So far, so good. What now?
Computed Prediction:  So far, so good. What now?Computed Prediction:  So far, so good. What now?
Computed Prediction: So far, so good. What now?
 
NIGEL 2006 welcome
NIGEL 2006 welcomeNIGEL 2006 welcome
NIGEL 2006 welcome
 
Meandre: Semantic-Driven Data-Intensive Flows in the Clouds
Meandre: Semantic-Driven Data-Intensive Flows in the CloudsMeandre: Semantic-Driven Data-Intensive Flows in the Clouds
Meandre: Semantic-Driven Data-Intensive Flows in the Clouds
 
ZigZag: The Meandring Language
ZigZag: The Meandring LanguageZigZag: The Meandring Language
ZigZag: The Meandring Language
 
HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...
HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...
HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...
 
Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...
Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...
Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...
 
The DISCUS project
The DISCUS projectThe DISCUS project
The DISCUS project
 
Visualizing content in metadata stores
Visualizing content in metadata storesVisualizing content in metadata stores
Visualizing content in metadata stores
 

Kürzlich hochgeladen

Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 

Kürzlich hochgeladen (20)

Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Real Time Object Detection Using Open CV

The compact classifier system: Motivation, analysis and first results

  • 1. The Compact Classifier System: Scalability Analysis and First Results Xavier Llorà, Kumara Sastry, & David E. Goldberg Illinois Genetic Algorithms Lab University of Illinois at Urbana-Champaign {xllora,kumara,deg}@illigal.ge.uiuc.edu
  • 2. Motivation
      • Pittsburgh classifier systems
      • Can we apply Wilson’s ideas for evolving rule sets formed only by maximally accurate and general rules?
      • Bottom-up approach for evolving such rules
          – The compact classifier system
      • Previous multiobjective approaches (Llorà, Goldberg, Traus, Bernadó, 2003) were top-down
          – Explicitly address accuracy and generality
          – Use them to push for and produce compact rule sets
      • Side product:
          – Scalability challenge of the De Jong & Spears (1991) representation
      IWLCS 2005 Llorà, Sastry, & Goldberg, 2005
  • 3. Binary Rule Encoding
      • De Jong & Spears (1991)
      • Widely used in Pittsburgh classifiers
      • GALE, MOLS, and GAssist have used it

        color                | shape        | size
        red green blue white | round square | huge large medium small
        1   1     1    1     | 0     1      | 0    1     1      0

      • A rule is expressed as (1111|01|0110)
      • Equivalent to Holland's (1975) representation (#11,#12)
      • A rule set is a disjunction of such rules
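The matching semantics of this encoding can be sketched in a few lines. This is only an illustration: the attribute table mirrors the slide's example, and the names `ATTRIBUTES`, `parse_rule`, `matches`, and `rule_set_matches` are hypothetical, not taken from any of the systems mentioned.

```python
from typing import Dict, List

# Attribute layout copied from the slide's example (illustrative only).
ATTRIBUTES: Dict[str, List[str]] = {
    "color": ["red", "green", "blue", "white"],
    "shape": ["round", "square"],
    "size": ["huge", "large", "medium", "small"],
}


def parse_rule(text: str) -> List[List[int]]:
    """Turn '(1111|01|0110)' into one list of bits per attribute."""
    return [[int(bit) for bit in group] for group in text.strip("()").split("|")]


def matches(rule: List[List[int]], instance: Dict[str, str]) -> bool:
    """A rule matches when, for every attribute, the bit of the instance's value is 1."""
    for bits, (name, values) in zip(rule, ATTRIBUTES.items()):
        if bits[values.index(instance[name])] == 0:
            return False
    return True


def rule_set_matches(rules: List[List[List[int]]], instance: Dict[str, str]) -> bool:
    """A rule set is a disjunction: it matches if any of its rules matches."""
    return any(matches(rule, instance) for rule in rules)
```

With the slide's rule, `matches(parse_rule("(1111|01|0110)"), {"color": "red", "shape": "square", "size": "large"})` holds, while any round or huge object is rejected.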
  • 4. Previous Efforts Using Multiobjective Optimization (Llorà, Goldberg, Traus, Bernadó, 2003)
  • 5. Maximally Accurate and General Rules
      • Accuracy and generality can be computed using the data set:
        $\alpha(r) = \frac{n_{t+}(r) + n_{t-}(r)}{n_t}$, $\varepsilon(r) = \frac{n_{t+}(r)}{n_m}$
      • Fitness should combine accuracy and generality:
        $f(r) = \alpha(r) \cdot \varepsilon(r)^{\gamma}$
      • Such a measure can be applied to either rules or rule sets
      • The compact classifier system uses this fitness and a compact genetic algorithm (cGA) to evolve such rules
      • Each cGA run uses a different initial perturbed probability vector
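A minimal sketch of how this fitness could be computed from a data set. The slide does not define the counts, so the sketch assumes that n_{t+}(r) is the number of positive instances the rule matches, n_{t-}(r) the number of negative instances it does not match, n_t the total number of instances, and n_m the number of instances the rule matches. The function name and the default `gamma=2.0` are illustrative choices, not values given in the slides.

```python
from typing import Sequence


def rule_fitness(matched: Sequence[bool], positive: Sequence[bool], gamma: float = 2.0) -> float:
    """Compute f(r) = alpha(r) * epsilon(r) ** gamma under the assumptions above.

    matched[i]  -- whether the rule matches instance i
    positive[i] -- whether instance i belongs to the positive class
    """
    n_t = len(positive)                                    # all instances
    n_m = sum(1 for m in matched if m)                     # instances matched by the rule
    n_tp = sum(1 for m, y in zip(matched, positive) if m and y)
    n_tn = sum(1 for m, y in zip(matched, positive) if not m and not y)
    alpha = (n_tp + n_tn) / n_t                            # accuracy term
    epsilon = n_tp / n_m if n_m else 0.0                   # assumed 0 when nothing is matched
    return alpha * epsilon ** gamma
```

Under these assumptions, a rule that matches only positive instances keeps epsilon at 1, so its fitness is governed by how many instances it classifies correctly overall.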
  • 6. The Compact Genetic Algorithm Can Make It
      • Rules may be obtained by optimizing $f(r) = \alpha(r) \cdot \varepsilon(r)^{\gamma}$
      • The basic cGA scheme:
          1. Initialization ($p^{0}_{x_i} = 0.5$)
          2. Model sampling (two individuals are generated)
          3. Evaluation ($f(r)$)
          4. Selection (tournament selection)
          5. Probabilistic model update
          6. Repeat steps 2-5 until termination criteria are met
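The scheme above can be sketched as a standard compact GA over bit strings. This is a sketch under stated assumptions rather than the authors' implementation: the update step of 1/`pop_size`, the convergence test, the iteration cap, and all parameter names are illustrative choices.

```python
import random
from typing import Callable, List, Optional, Sequence


def compact_ga(
    fitness: Callable[[List[int]], float],
    length: int,
    pop_size: int = 50,            # assumed virtual population size
    max_iters: int = 100_000,      # assumed safety cap on iterations
    init: Optional[Sequence[float]] = None,
    rng: Optional[random.Random] = None,
) -> List[float]:
    """Return the final probability vector of a basic cGA run."""
    if rng is None:
        rng = random.Random()
    # Step 1: every bit starts at probability 0.5 unless a vector is supplied.
    p = list(init) if init is not None else [0.5] * length
    step = 1.0 / pop_size
    for _ in range(max_iters):
        # Termination: stop once every entry has reached 0 or 1.
        if all(x in (0.0, 1.0) for x in p):
            break
        # Step 2: sample two individuals from the model.
        a = [1 if rng.random() < x else 0 for x in p]
        b = [1 if rng.random() < x else 0 for x in p]
        # Steps 3-4: evaluate both and keep the tournament winner in `a`.
        if fitness(b) > fitness(a):
            a, b = b, a
        # Step 5: shift the model toward the winner where the two differ.
        for i in range(length):
            if a[i] != b[i]:
                delta = step if a[i] == 1 else -step
                p[i] = min(1.0, max(0.0, p[i] + delta))
    return p
```

Passing a perturbed vector through `init` instead of the default 0.5 values is one way to start each run from a different model.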
  • 7. cGA Model Perturbation
      • Facilitate the evolution of different rules
      • Explore the frequency of appearance of each optimal rule
      • Initial model perturbation: $p^{0}_{x_i} = 0.5 + U(-0.4, 0.4)$
      • Experiments using the 3-input multiplexer
      • 1,000 independent runs
      • Visualize the pair-wise relations of the genes
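As a small illustration of the perturbation formula, the initial vector can be drawn as follows. The function name and the `spread` parameter are hypothetical; only the 0.5 center and the U(−0.4, 0.4) noise come from the slide.

```python
import random
from typing import List, Optional


def perturbed_model(length: int, spread: float = 0.4,
                    rng: Optional[random.Random] = None) -> List[float]:
    """Draw an initial probability vector with entries 0.5 + U(-spread, spread)."""
    if rng is None:
        rng = random.Random()
    return [0.5 + rng.uniform(-spread, spread) for _ in range(length)]
```

With the default spread, every entry stays between 0.1 and 0.9, so no bit is fixed before the run starts.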
  • 8. Initial Perturbed Vectors Leading to Rule 100111 (01#) [figure: problem structure]
  • 9. Initial Perturbed Vectors Leading to Rule 011101 (1#1) [figure: problem structure]
  • 10. Perturbation Summary
      • 97% of the runs lead to a maximally general and accurate rule
      • The probability of evolving each of the optimal rules was roughly 1/3
      • The initial perturbed probability vectors that lead to an optimal rule show pair-wise relations among genes
      • The pair-wise relations reflect the problem structure
  • 11. But One Rule Is Not Enough
      • Model perturbation in the cGA evolves different rules
      • The goal: evolve a population of rules that solve the problem together
      • The fitness measure ($f(r)$) can also be applied to rule sets
      • Two mechanisms:
          – Spawning populations until the solution is met
          – Fusing populations when they represent the same rule
  • 12. Spawning and Fusing Populations of Rules
  • 13. Experiments & Scalability
      • Analysis using multiplexer problems (3-, 6-, and 11-input)
      • The number of rules in [O] grows exponentially
          – $2^{i}$, where i is the number of inputs
      • cGA success as a function of the problem size
          – 3-input: 97%
          – 6-input: 73.93%
          – 11-input: 43.03%
      • Scalability over 10,000 independent runs
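For reference, the multiplexer benchmark can be written as a short function. This sketch assumes the usual definition, in which the first k bits form an address (most significant bit first, an assumed convention) that selects one of the remaining 2^k data bits; the 3-, 6-, and 11-input problems then correspond to k = 1, 2, and 3.

```python
from typing import Sequence


def multiplexer(bits: Sequence[int], address_bits: int) -> int:
    """Return the data bit selected by the leading address bits."""
    if len(bits) != address_bits + 2 ** address_bits:
        raise ValueError("expected k address bits followed by 2**k data bits")
    address = 0
    for bit in bits[:address_bits]:
        address = address * 2 + bit      # most significant address bit first
    return bits[address_bits + address]
```

Under this convention, the 3-input rules 01# and 1#1 both describe inputs whose output is 1.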
  • 14. Scalability of CCS
  • 15. Unmatchable Rules: A Byproduct
      • A rule is unmatchable if:
          – At least one attribute in the condition has all its possible values set to 0

        color                | shape        | size
        red green blue white | round square | huge large medium small
        1   1     1    1     | 0     0      | 0    1     1      0

      • The rule (1111|00|0110) forces the shape to be neither round nor square
      • Hence no data instance will ever match it
      • Direct impact on the scalability of LCS/GBML systems using it (as simple experiments with the multiplexer show)
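The unmatchability condition is easy to test directly on the string form of a rule. The helper below is a hypothetical illustration that assumes rules are written with `|` between attributes, as in the example above.

```python
def is_unmatchable(rule: str) -> bool:
    """True when at least one attribute group contains no 1 bit."""
    groups = rule.strip("()").split("|")
    return any("1" not in group for group in groups)
```

Here `is_unmatchable("(1111|00|0110)")` is true because the shape group is `00`, while `(1111|01|0110)` remains matchable.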
  • 16. 3-Input Multiplexer: 37 unmatchable rules (57.8%)
  • 17. 6-Input Multiplexer: 3,367 unmatchable rules (82.2%)
  • 18. 11-Input Multiplexer: 4,017,157 unmatchable rules (95.7%)
  • 19. Growth Ratio of Unmatchable Rules (I/III)
      • An unmatchable rule has all the values of at least one attribute set to 0
      • Analysis for problems with binary attributes (worst case), where l is the rule length in bits
      • The total number of rules: $\Sigma(l) = 2^{l}$
      • Number of matchable rules (every attribute set to 01, 10, or 11): $\Psi(l) = 3^{l/2}$
      • Size of the unmatchable rule set plateau: $\Phi(l) = \Sigma(l) - \Psi(l) = 2^{l} - 3^{l/2}$
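These counts can be checked numerically. The sketch below assumes two bits per binary attribute, so a multiplexer with i inputs has rules of length l = 2i; the function names are illustrative.

```python
from itertools import product


def count_unmatchable_brute(l: int) -> int:
    """Enumerate all 2**l rules and count those with some attribute encoded as 00."""
    count = 0
    for bits in product((0, 1), repeat=l):
        if any(bits[i] == 0 and bits[i + 1] == 0 for i in range(0, l, 2)):
            count += 1
    return count


def count_unmatchable_closed(l: int) -> int:
    """Closed form Phi(l) = 2**l - 3**(l/2) for an even rule length l."""
    return 2 ** l - 3 ** (l // 2)
```

For l = 6, 12, and 22 the closed form gives 37, 3,367, and 4,017,157, the figures quoted for the 3-, 6-, and 11-input multiplexers.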
  • 20. Growth Ratio of Unmatchable Rules (II/III)
      • Growth ratio of unmatchable rules: $\rho(l) = \frac{\Phi(l)}{\Psi(l)} = \frac{2^{l}}{3^{l/2}} - 1$
      • It can be approximated by $\rho(l) \approx e^{cl}$, with $c = \ln\left(\frac{2}{\sqrt{3}}\right) \approx 0.143$
      • The growth ratio ($\rho$) for this representation grows exponentially
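One way to see where the exponential approximation comes from is to expand the ratio using $\Sigma(l) = 2^{l}$ and $\Psi(l) = 3^{l/2}$ from the previous slide. The derivation assumes the constant is $c = \ln(2/\sqrt{3})$, which agrees with the quoted value of 0.143.

```latex
\begin{align*}
\rho(l) &= \frac{\Phi(l)}{\Psi(l)}
         = \frac{2^{l} - 3^{l/2}}{3^{l/2}}
         = \left(\frac{2}{\sqrt{3}}\right)^{l} - 1 \\
        &= e^{\,l \ln(2/\sqrt{3})} - 1
         \approx e^{cl},
         \qquad c = \ln\frac{2}{\sqrt{3}} \approx 0.1438
\end{align*}
```

For the 11-input multiplexer ($l = 22$), $2^{22}/3^{11} \approx 23.68$, so $\rho \approx 22.68$: there are roughly 23 unmatchable rules for every matchable one.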
  • 21. Growth Ratio of Unmatchable Rules (III/III)
  • 22. Conclusions
      • Initial steps to evolve rule sets formed only by maximally accurate and general rules using Pittsburgh systems
      • Using a cGA and the appropriate fitness function (CCS), we can evolve such rules
      • Rule representation has a direct connection to the scalability of any GBML system
          – A wrong choice makes the problem extremely hard
      • Further analysis for different representations is needed (Stone, 2004)