SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Downloaden Sie, um offline zu lesen
Structural Accuracy of
Probabilistic Models in BOA

                                 Claudio F. Lima
                 University of Algarve, Portugal

  Work in collaboration with Martin Pelikan, David E. Goldberg,
      Fernando G. Lobo, Kumara Sastry, and Mark Hauschild
Outline

    BOA in one slide


    Motivation


    Measuring structural accuracy of PMs in BOA


    Influence of BOA’s parameters


        Population Size
    

        Selection Strategy
    

        Replacement Strategy
    

    Conclusions

BOA: Bayesian optimization
algorithm (Pelikan et al., 1999)

    EDA that builds and samples from a Bayesian network to

    guide search

    Similar to EBNA (Etxeberria et al., 1999) and LFDA

    (Muhlenbein, 2000)

    BNs can model complex multivariate dependencies

    between variables

    Has been sucessfuly applied in real-world problems

    (Pelikan; 2005, 2006)
Overfitting in EDAs
    EDAs can solve efficiently challeging optimization

    problems
        Where exploiting problem structure is a plus
    



    But oftentimes their probabilistic models do not exactly

    reflect the problem structure

    Why?

        PMs are learned from a sample of limited size (population)
    

        Particular features of that specific sample are also encoded
    

        Well-known problem in ML → overfitting
    

        See Wu & Shapiro (2006)
    
A Closer Look at BOA (w/ DTs)

    Looking at the PMs in BOA, one can see:

        Important dependencies are discovered
    

        But with additional spurious linkages
    



    While BN structure captures such excessive

    complexity...

    ...the corresponding conditional probabilities nearly

    express independency between spurious and correct
    variables
When is Structural Accuracy Crucial?


    Model-based efficiency enhancement techniques


        Evaluation relaxation (Sastry et al., 2004, Pelikan et al., 2004,
    
        Sastry et al., 2006)
        Time continuation (Lima et al., 2005; Lima et al., 2006)
    




    Off-line usage of linkage information


        Learn about the problem while trying to solve it
    

        Build problem-specific operators with gained knowledge
    
What can affect model accuracy in
BOA?
    Parameters

        Population Size (Pelikan, 2005)
    
        Selection Strategy (Johnson et al., 2001; Santana et al., 2005)
    
        Replacement Strategy
    



    Problem Structure

        Subfunction Size
    
        Subfunction Overlapping Degree
    
        Subfunction Fitness Distribution
    
        Hierarchy
    



    Bayesian Network Learning

        Search Procedure (Wu et al., 2006; Echegoyen et al., 2007)
    
        Score Metric (Pelikan, 2005; Correa et al., 2006)
    
Experimental Setup for Measuring
Structural Accuracy of PMs


    Test problem(s) structure should be known


    Learning problem structure should be crucial to solve it

    efficiently
    Easy control of problem size and difficulty in order to test

    scalability
    m-k deceptive trap functions:


        Concatenated m trap functions of size k each
    

        Total problem size is m.k (in case of overlapping, m(k-o)+o )
    

        k-order statistics are required to (efficient) success
    

        Lower-order statictics lead the search away from the optimum
    
Analyzing Probabilistic Models in
BOA
    The big question: When does the BN structure matches

    problem structure?

    Looking for answers:

        Edges: The good, the bad, and the missing ones!
    

            Mark Hauschild’s talk on Monday, 15:15h
        

        Problem substructure “represented” in the BN?
    
            This talk
        




    Measure the proportion of substructures represented

    and their accuracy
A simple example for Trap3
    Ideal BN for Trap3:        Ideally, relations should
                          
                               respect a certain chain

                               But the dependency that
                           
                               relates all k variables is
                               crucial!
    Dependency relations

        0←{}
    
                               And most difficult to learn
                           
        1←{0}
    

        2 ← { 0,1 }
    
                               In this way, k-order
                           
        3←{}
    
                               statistics are maintained
        4←{3}
    

        5 ← { 2,3 }
    
What are we going to measure?
    Proportion of subfunctions with correct linkage group

        Example: 2 ← { 0,1 }
    



    Proportion of subfunctions with spurious linkage group

        Example: 2 ← { 0,1,3,4,5 }
    



    Proportion of subfunctions with any of the above

        Sum of the two previous proportions
    



    Average size of spurious linkage

        Number or spurious variables
    
Influence of BOA’s parameters

    Population Size




    Selection Strategy

        Selection method
    

        Selection intensity
    




    Replacement Strategy


        Replacement method
    

        Elitist intensity, diversity preservation
    
Influence of Population Size

                           • m = 10 and k = 5
                           • n0 is the minimal
                           population size required to
                           solve problem
                           • Exponentialy increasing
                           population sizes: n0, 2n0,
                           4n0, and 8n0
                           • Increasing population
                           size slightly improves
                           model accuracy
Influence of Selection Strategy


    Tournament Selection


        Pick best solution from a tournament of s individuals
    

        Repeat n times (w/ or wo/ replacement)
    



    Truncation Selection

        Choose the best δ % proportion of the population
    



    Different selection dynamics


        In terms of selection variance and lost of diversity (Blickle &
    
        Thiele, 1997)
Influence of Selection Intensity on n,
nfe, and resulting Speedup




    Good news: Increasing selection pressure reduces the

    required population size and num. of evals
    Not so good news: the resulting speedup seems to

    decrease with problem size
Tournament Selection

                       • m = 24 and k = 5 (l = 120)
                       • Tournament sizes s = 2,3,4
                       • Proportion of accurate
                       structures is not significative
                       (in particular for larger s)
                       • However, w/ or wo/ spurious
                       linkage, all substructures are
                       represented
                       • Higher selection intensity
                       increases spurious linkage
                       size
Truncation Selection

                       • m = 24 and k = 5 (l = 120)
                       • Truncation thresholds δ =
                       66%, 47%, 36%
                       • Same selection intensity as
                       s = 2,3,4
                       • Proportion of accurate
                       structures is now close to
                       100%
                       • Number of spurious
                       variables is low
                       • Higher selection intensity
                       has small impact on model
                       structural accuracy
Influence of Selection Strategy




    n and nfe required for truncation is higher than for

    tournament by a significant however constant factor (2-3
    times more)
Tournament vs. Truncation:
Is it a matter of population size?

                           • m = 24 and k = 5 (l = 120)


                           • Comparing tournament (s =
                           2) and truncation (δ = 66%)
                           with the same:
                               • population size
                               • selection intensity


                           • Tournament selection
                           improves accuracy but is still
                           not close to truncation
What is it then?

    For the same selection intensity:

        Truncation has a higher loss of diversity...
    
        ...and a lower selection variance (Blickle & Thiele, 1997)
    


    But... distribution of the number of copies in the selected

    population affects BN learning
        In tournament the number of copies is somewhat proportional to
    
        its rank (best guy gets exactly s copies)
        In truncation no particular relevance is given to top solutions
    



    Model overfitting of top individuals takes place (in

    particular for increasing s)
Influence of Replacement Strategy


    Full replacement (FR)


        Offspring population fully replaces the parents
    




    Elitist replacement (ER)

        Worst portion of the parents is replaced by offspring
    



    Restricted tournament replacement (RTR)


        Offspring directly competes with similar parent for a place in the
    
        next population (diversity preservation)
Influence of Replacement
(Tournament Selection)

                           • m = 24 and k = 5 (l = 120)
                           • Comparing ER50%, FR, and
                           RTR with tournament
                           selection (s = 2)
                           • Structural accuracy is higher
                           with FR
                           • Replacement strategy does
                           not have the same impact as
                           selection on the spurious
                           linkage size
                           • Spurious linkage is more
                           frequent with RTR because of
                           the smaller pop. size
Influence of Replacement
(Tournament Selection)




    Additional elitist proportions are shown (1-50%)


    RTR clearly requires less n and nfe


    However, linkage information is not so accurate

Influence of Replacement
(Truncation selection)

                           • m = 24 and k = 5 (l = 120)
                           • Same comparison with
                           truncation selection (δ = 66%)
                           • Structural accuracy is close
                           to 100% for all replacement
                           strategies
                           • FR is still slightly better
                           because it needs larger n
                           • Size of spurious linkage is
                           quite low
Conclusions

    Trade-off between model accuracy and overall

    performance in BOA

    Truncation is better than tournament selection for

    accurate modelling

    For the same purpose, the replacement method is more

    relevant if tournament selection is used (in which case
    full replacement is better)

    If overall performance (num. of evals) is our main

    concern, tournament and RTR are the best options
Structural Accuracy of
Probabilistic Models in BOA

                                 Claudio F. Lima
                 University of Algarve, Portugal

  Work in collaboration with Martin Pelikan, David E. Goldberg,
      Fernando G. Lobo, Kumara Sastry, and Mark Hauschild

Weitere ähnliche Inhalte

Ähnlich wie Structural Accuracy of Probabilistic Models in BOA

Scatterplots and Cautions of Correlation
Scatterplots and Cautions of CorrelationScatterplots and Cautions of Correlation
Scatterplots and Cautions of Correlation
Oleg Janke
 
STAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docx
STAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docxSTAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docx
STAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docx
whitneyleman54422
 
2010 smg training_cardiff_day2_session2_dias
2010 smg training_cardiff_day2_session2_dias2010 smg training_cardiff_day2_session2_dias
2010 smg training_cardiff_day2_session2_dias
rgveroniki
 
Probability and Statistics part 3.pdf
Probability and Statistics part 3.pdfProbability and Statistics part 3.pdf
Probability and Statistics part 3.pdf
Almolla Raed
 
MDPSO_SDM_2012_Souma
MDPSO_SDM_2012_SoumaMDPSO_SDM_2012_Souma
MDPSO_SDM_2012_Souma
MDO_Lab
 

Ähnlich wie Structural Accuracy of Probabilistic Models in BOA (20)

Sct2013 boston,randomizationmetricsposter,d6.2
Sct2013 boston,randomizationmetricsposter,d6.2Sct2013 boston,randomizationmetricsposter,d6.2
Sct2013 boston,randomizationmetricsposter,d6.2
 
Population sizing for entropy-based model buliding In genetic algorithms
Population sizing for entropy-based model buliding In genetic algorithms Population sizing for entropy-based model buliding In genetic algorithms
Population sizing for entropy-based model buliding In genetic algorithms
 
Scatterplots and Cautions of Correlation
Scatterplots and Cautions of CorrelationScatterplots and Cautions of Correlation
Scatterplots and Cautions of Correlation
 
ECONOMETRICS I ASA
ECONOMETRICS I ASAECONOMETRICS I ASA
ECONOMETRICS I ASA
 
Statistics.pdf
Statistics.pdfStatistics.pdf
Statistics.pdf
 
STAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docx
STAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docxSTAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docx
STAT 350 (Spring 2017) Homework 11 (20 points + 1 point BONUS).docx
 
2010 smg training_cardiff_day2_session2_dias
2010 smg training_cardiff_day2_session2_dias2010 smg training_cardiff_day2_session2_dias
2010 smg training_cardiff_day2_session2_dias
 
"intelligent" intelligence testing: Evaluating wihtin CHC domain test score ...
"intelligent" intelligence testing:  Evaluating wihtin CHC domain test score ..."intelligent" intelligence testing:  Evaluating wihtin CHC domain test score ...
"intelligent" intelligence testing: Evaluating wihtin CHC domain test score ...
 
Ga
GaGa
Ga
 
Structured equation model
Structured equation modelStructured equation model
Structured equation model
 
Machine Learning Foundations
Machine Learning FoundationsMachine Learning Foundations
Machine Learning Foundations
 
Probability and Statistics part 3.pdf
Probability and Statistics part 3.pdfProbability and Statistics part 3.pdf
Probability and Statistics part 3.pdf
 
SEM
SEMSEM
SEM
 
Genetic Algorithm
Genetic AlgorithmGenetic Algorithm
Genetic Algorithm
 
MDPSO_SDM_2012_Souma
MDPSO_SDM_2012_SoumaMDPSO_SDM_2012_Souma
MDPSO_SDM_2012_Souma
 
Cz24655657
Cz24655657Cz24655657
Cz24655657
 
Causally regularized machine learning
Causally regularized machine learningCausally regularized machine learning
Causally regularized machine learning
 
Structural Optimization using Genetic Algorithms - Artificial Intelligence Fu...
Structural Optimization using Genetic Algorithms - Artificial Intelligence Fu...Structural Optimization using Genetic Algorithms - Artificial Intelligence Fu...
Structural Optimization using Genetic Algorithms - Artificial Intelligence Fu...
 
Probability distribution Function & Decision Trees in machine learning
Probability distribution Function  & Decision Trees in machine learningProbability distribution Function  & Decision Trees in machine learning
Probability distribution Function & Decision Trees in machine learning
 
GAN(と強化学習との関係)
GAN(と強化学習との関係)GAN(と強化学習との関係)
GAN(と強化学習との関係)
 

Kürzlich hochgeladen

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Kürzlich hochgeladen (20)

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 

Structural Accuracy of Probabilistic Models in BOA

  • 1. Structural Accuracy of Probabilistic Models in BOA Claudio F. Lima University of Algarve, Portugal Work in collaboration with Martin Pelikan, David E. Goldberg, Fernando G. Lobo, Kumara Sastry, and Mark Hauschild
  • 2. Outline BOA in one slide  Motivation  Measuring structural accuracy of PMs in BOA  Influence of BOA’s parameters  Population Size  Selection Strategy  Replacement Strategy  Conclusions 
  • 3. BOA: Bayesian optimization algorithm (Pelikan et al., 1999) EDA that builds and samples from a Bayesian network to  guide search Similar to EBNA (Etxeberria et al., 1999) and LFDA  (Muhlenbein, 2000) BNs can model complex multivariate dependencies  between variables Has been sucessfuly applied in real-world problems  (Pelikan; 2005, 2006)
  • 4. Overfitting in EDAs EDAs can solve efficiently challeging optimization  problems Where exploiting problem structure is a plus  But oftentimes their probabilistic models do not exactly  reflect the problem structure Why?  PMs are learned from a sample of limited size (population)  Particular features of that specific sample are also encoded  Well-known problem in ML → overfitting  See Wu & Shapiro (2006) 
  • 5. A Closer Look at BOA (w/ DTs) Looking at the PMs in BOA, one can see:  Important dependencies are discovered  But with additional spurious linkages  While BN structure captures such excessive  complexity... ...the corresponding conditional probabilities nearly  express independency between spurious and correct variables
  • 6. When is Structural Accuracy Crucial? Model-based efficiency enhancement techniques  Evaluation relaxation (Sastry et al., 2004, Pelikan et al., 2004,  Sastry et al., 2006) Time continuation (Lima et al., 2005; Lima et al., 2006)  Off-line usage of linkage information  Learn about the problem while trying to solve it  Build problem-specific operators with gained knowledge 
  • 7. What can affect model accuracy in BOA? Parameters  Population Size (Pelikan, 2005)  Selection Strategy (Johnson et al., 2001; Santana et al., 2005)  Replacement Strategy  Problem Structure  Subfunction Size  Subfunction Overlapping Degree  Subfunction Fitness Distribution  Hierarchy  Bayesian Network Learning  Search Procedure (Wu et al., 2006; Echegoyen et al., 2007)  Score Metric (Pelikan, 2005; Correa et al., 2006) 
  • 8. Experimental Setup for Measuring Structural Accuracy of PMs Test problem(s) structure should be known  Learning problem structure should be crucial to solve it  efficiently Easy control of problem size and difficulty in order to test  scalability m-k deceptive trap functions:  Concatenated m trap functions of size k each  Total problem size is m.k (in case of overlapping, m(k-o)+o )  k-order statistics are required to (efficient) success  Lower-order statictics lead the search away from the optimum 
  • 9. Analyzing Probabilistic Models in BOA The big question: When does the BN structure matches  problem structure? Looking for answers:  Edges: The good, the bad, and the missing ones!  Mark Hauschild’s talk on Monday, 15:15h  Problem substructure “represented” in the BN?  This talk  Measure the proportion of substructures represented  and their accuracy
  • 10. A simple example for Trap3 Ideal BN for Trap3: Ideally, relations should   respect a certain chain But the dependency that  relates all k variables is crucial! Dependency relations  0←{}  And most difficult to learn  1←{0}  2 ← { 0,1 }  In this way, k-order  3←{}  statistics are maintained 4←{3}  5 ← { 2,3 } 
  • 11. What are we going to measure? Proportion of subfunctions with correct linkage group  Example: 2 ← { 0,1 }  Proportion of subfunctions with spurious linkage group  Example: 2 ← { 0,1,3,4,5 }  Proportion of subfunctions with any of the above  Sum of the two previous proportions  Average size of spurious linkage  Number or spurious variables 
  • 12. Influence of BOA’s parameters Population Size  Selection Strategy  Selection method  Selection intensity  Replacement Strategy  Replacement method  Elitist intensity, diversity preservation 
  • 13. Influence of Population Size • m = 10 and k = 5 • n0 is the minimal population size required to solve problem • Exponentialy increasing population sizes: n0, 2n0, 4n0, and 8n0 • Increasing population size slightly improves model accuracy
  • 14. Influence of Selection Strategy Tournament Selection  Pick best solution from a tournament of s individuals  Repeat n times (w/ or wo/ replacement)  Truncation Selection  Choose the best δ % proportion of the population  Different selection dynamics  In terms of selection variance and lost of diversity (Blickle &  Thiele, 1997)
  • 15. Influence of Selection Intensity on n, nfe, and resulting Speedup Good news: Increasing selection pressure reduces the  required population size and num. of evals Not so good news: the resulting speedup seems to  decrease with problem size
  • 16. Tournament Selection • m = 24 and k = 5 (l = 120) • Tournament sizes s = 2,3,4 • Proportion of accurate structures is not significative (in particular for larger s) • However, w/ or wo/ spurious linkage, all substructures are represented • Higher selection intensity increases spurious linkage size
  • 17. Truncation Selection • m = 24 and k = 5 (l = 120) • Truncation thresholds δ = 66%, 47%, 36% • Same selection intensity as s = 2,3,4 • Proportion of accurate structures is now close to 100% • Number of spurious variables is low • Higher selection intensity has small impact on model structural accuracy
  • 18. Influence of Selection Strategy n and nfe required for truncation is higher than for  tournament by a significant however constant factor (2-3 times more)
  • 19. Tournament vs. Truncation: Is it a matter of population size? • m = 24 and k = 5 (l = 120) • Comparing tournament (s = 2) and truncation (δ = 66%) with the same: • population size • selection intensity • Tournament selection improves accuracy but is still not close to truncation
  • 20. What is it then? For the same selection intensity:  Truncation has a higher loss of diversity...  ...and a lower selection variance (Blickle & Thiele, 1997)  But... distribution of the number of copies in the selected  population affects BN learning In tournament the number of copies is somewhat proportional to  its rank (best guy gets exactly s copies) In truncation no particular relevance is given to top solutions  Model overfitting of top individuals takes place (in  particular for increasing s)
  • 21. Influence of Replacement Strategy Full replacement (FR)  Offspring population fully replaces the parents  Elitist replacement (ER)  Worst portion of the parents is replaced by offspring  Restricted tournament replacement (RTR)  Offspring directly competes with similar parent for a place in the  next population (diversity preservation)
  • 22. Influence of Replacement (Tournament Selection) • m = 24 and k = 5 (l = 120) • Comparing ER50%, FR, and RTR with tournament selection (s = 2) • Structural accuracy is higher with FR • Replacement strategy does not have the same impact as selection on the spurious linkage size • Spurious linkage is more frequent with RTR because of the smaller pop. size
  • 23. Influence of Replacement (Tournament Selection) Additional elitist proportions are shown (1-50%)  RTR clearly requires less n and nfe  However, linkage information is not so accurate 
  • 24. Influence of Replacement (Truncation selection) • m = 24 and k = 5 (l = 120) • Same comparison with truncation selection (δ = 66%) • Structural accuracy is close to 100% for all replacement strategies • FR is still slightly better because it needs larger n • Size of spurious linkage is quite low
  • 25. Conclusions Trade-off between model accuracy and overall  performance in BOA Truncation is better than tournament selection for  accurate modelling For the same purpose, the replacement method is more  relevant if tournament selection is used (in which case full replacement is better) If overall performance (num. of evals) is our main  concern, tournament and RTR are the best options
  • 26. Structural Accuracy of Probabilistic Models in BOA Claudio F. Lima University of Algarve, Portugal Work in collaboration with Martin Pelikan, David E. Goldberg, Fernando G. Lobo, Kumara Sastry, and Mark Hauschild