Portfolios of Artificial Intelligences 
+ playing with random seeds 
1. What is a portfolio 
2. Offline portfolio 
3. Online portfolio 
4. Mathematics (sorry) 
5. Experiments 
J.-B. Hoock, D. L. St-Pierre, O. Teytaud
Portfolio 
● I have K algorithms for solving a given task : 
– MCTS
– Alpha-Beta 
– Parametric script 
– Nested MC 
– … 
● I want to choose the best one
Two frameworks 
● Offline 
– I do some work before the competition 
– I combine all my algorithms into 1 
– Simple version : 
● Compute some probability vector p 
● For each game, use Algo(i) with probability p(i) 
● Online 
– For each game, 
● Use Algo(i) with probability p(i) 
● Update p when the game is over
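A minimal Python sketch of the two frameworks, assuming each algorithm is a callable that plays one game and returns True on a win. The function names and the online update rule (p proportional to smoothed win frequencies) are illustrative assumptions; the slides do not prescribe how p is updated.

```python
import random

def pick(p, rng):
    """Sample an index i with probability p[i]."""
    return rng.choices(range(len(p)), weights=p, k=1)[0]

def offline_portfolio(algos, p, n_games, rng):
    """Offline: p is computed before the competition and never changes."""
    wins = 0
    for _ in range(n_games):
        i = pick(p, rng)
        wins += bool(algos[i]())
    return wins

def online_portfolio(algos, n_games, rng):
    """Online: start uniform, update p after every game.

    Assumption: p is proportional to the smoothed win frequencies
    (wins + 1) / (plays + 2). This is only one possible update rule.
    """
    K = len(algos)
    wins, plays = [0] * K, [0] * K
    p = [1.0 / K] * K
    for _ in range(n_games):
        i = pick(p, rng)
        plays[i] += 1
        wins[i] += bool(algos[i]())
        freq = [(wins[k] + 1) / (plays[k] + 2) for k in range(K)]
        total = sum(freq)
        p = [f / total for f in freq]
    return p
```

With two toy algorithms, one that always wins and one that always loses, the online version shifts probability mass toward the winner as games accumulate.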
Offline Nash portfolio 
● K algorithms for black BAI(1),..., BAI(K) 
● K' algorithms for white WAI(1),...,WAI(K') 
● Def : M(i,j) = probability that BAI(i) beats WAI(j)
● Define (p,q) = Nash equilibrium of M 
– p = best stochastic portfolio for Black (Nash sense) 
– q = best stochastic portfolio for White (Nash sense) 
● Portfolio : 
– Black : Play BAI(i) with probability p(i) 
– White : Play WAI(j) with probability q(j)
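As an illustration, the equilibrium (p,q) of a small matrix M can be approximated with fictitious play. This is a swapped-in technique chosen because it fits in a few lines and is known to converge for zero-sum matrix games; the slides do not say which solver was actually used. The function name and the iteration count are assumptions.

```python
def fictitious_play(M, iterations=10000):
    """Approximate a Nash equilibrium of the zero-sum game M.

    M[i][j] is the probability that BAI(i) beats WAI(j).
    Black maximizes it, White minimizes it.
    Returns (p, q, value) where value = p^T M q.
    """
    K, Kp = len(M), len(M[0])
    row_counts = [0] * K
    col_counts = [0] * Kp
    i, j = 0, 0  # arbitrary first choices
    for _ in range(iterations):
        row_counts[i] += 1
        col_counts[j] += 1
        # Black best-responds to White's empirical mix so far.
        row_payoff = [sum(M[a][b] * col_counts[b] for b in range(Kp))
                      for a in range(K)]
        # White best-responds to Black's empirical mix so far.
        col_payoff = [sum(M[a][b] * row_counts[a] for a in range(K))
                      for b in range(Kp)]
        i = max(range(K), key=row_payoff.__getitem__)
        j = min(range(Kp), key=col_payoff.__getitem__)
    p = [c / iterations for c in row_counts]
    q = [c / iterations for c in col_counts]
    value = sum(p[a] * M[a][b] * q[b] for a in range(K) for b in range(Kp))
    return p, q, value
```

When one row dominates, the approximation concentrates on it; when no pure choice is safe (as in a matching-pennies matrix), both p and q approach a mixed strategy.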
Other offline portfolios 
● K algorithms for black BAI(1),..., BAI(K) 
● K' algorithms for white WAI(1),...,WAI(K') 
● Definitions : 
– Uniform portfolio : p(i) = 1/K, q(j) = 1/K' 
– Fixed seed : p(i) = 1, q(j) = 1 for some i,j 
– Best arm : fixed seed with i = best row of M, j = best column of M 
● Portfolio : 
– Black : Play BAI(i) with probability p(i) 
– White : Play WAI(j) with probability q(j)
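These three baselines can be sketched as follows. One assumption is made loudly here: "best row" is read as the row of M with the highest average win rate for Black, and "best column" as the column with the lowest average (best for White). The slides do not spell out the criterion, and the function names are illustrative.

```python
def uniform_portfolio(K):
    """p(i) = 1/K for every i."""
    return [1.0 / K] * K

def fixed_seed_portfolio(K, i):
    """All probability mass on a single index i."""
    p = [0.0] * K
    p[i] = 1.0
    return p

def best_arm_portfolios(M):
    """Fixed-seed portfolios on the best row and best column of M.

    Assumption: "best" means best average against a uniform opponent.
    """
    K, Kp = len(M), len(M[0])
    row_means = [sum(row) / Kp for row in M]
    col_means = [sum(M[i][j] for i in range(K)) / K for j in range(Kp)]
    i_best = max(range(K), key=row_means.__getitem__)   # best for Black
    j_best = min(range(Kp), key=col_means.__getitem__)  # best for White
    return fixed_seed_portfolio(K, i_best), fixed_seed_portfolio(Kp, j_best)
```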
Online portfolio (for Black) 
● Just apply UCBT (or your favorite bandit) 
● Before playing a game : 
– p(i) = win frequency of BAI(i) so far 
– n(i) = number of times BAI(i) was used 
– N = sum of the n(i) 
– sc(i) = p(i) + C log(N)/n(i) + C' sqrt( p(i)(1-p(i)) log(N)/n(i) ) 
– choose i* maximizing sc(i) 
● Play with BAI(i*)
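The score above translates almost directly into code. In this sketch the default values of C and C' are placeholders, and giving an unused algorithm an infinite score (so that it is tried first) is an assumption; the slide does not say how n(i) = 0 is handled.

```python
import math

def ucbt_scores(wins, plays, C=1.0, C_prime=1.0):
    """sc(i) = p(i) + C log(N)/n(i) + C' sqrt(p(i)(1-p(i)) log(N)/n(i))."""
    N = sum(plays)
    scores = []
    for w, n in zip(wins, plays):
        if n == 0:
            # Assumption: an algorithm never used yet is tried first.
            scores.append(math.inf)
            continue
        p = w / n
        bonus = C * math.log(N) / n
        variance_term = C_prime * math.sqrt(p * (1 - p) * math.log(N) / n)
        scores.append(p + bonus + variance_term)
    return scores

def choose_arm(wins, plays, C=1.0, C_prime=1.0):
    """Return i* maximizing sc(i); ties go to the lowest index."""
    scores = ucbt_scores(wins, plays, C, C_prime)
    return max(range(len(scores)), key=scores.__getitem__)
```

After each game, increment plays[i*] and, on a win, wins[i*], then call choose_arm again before the next game.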
Nash 
Computed 
● exactly in polynomial time 
● with precision e in expected time O( (K+K') log(K+K')^2 / e^2 ) 
The best portfolio in terms of 
● Worst case winrate against the WAI(i) 
● Worst case winrate against WAI(i) for i ~ some probability distribution
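To make the worst-case criterion concrete, here is a small sketch that evaluates a Black portfolio p against every pure White choice and against a White mix q. The helper names and the toy matrix in the usage note are illustrative assumptions.

```python
def winrate_against(p, M, j):
    """Expected win rate of Black portfolio p against the single WAI(j)."""
    return sum(p[i] * M[i][j] for i in range(len(p)))

def worst_case_winrate(p, M):
    """Minimum over all White algorithms of the expected win rate."""
    return min(winrate_against(p, M, j) for j in range(len(M[0])))

def winrate_against_mix(p, M, q):
    """Expected win rate when White draws WAI(j) with probability q[j]."""
    return sum(q[j] * winrate_against(p, M, j) for j in range(len(q)))
```

On the toy matrix M = [[1, 0], [0, 1]], the pure portfolio p = [1, 0] has worst-case win rate 0, while the mixed portfolio p = [0.5, 0.5] guarantees 0.5. No White mix q can push the win rate below the worst case over pure WAI(j), since the mix is an average of those values.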
UCBT for Black 
● Nearly zero computational overhead 
● Asymptotically optimal winning rate against a stationary opponent, among the BAI(i) 
● We did not try discounted UCB
Experiments on 9x9 Go
First portfolio : random seeds 
● Pick a stochastic algorithm 
● Choose K random seeds 
● You get K algorithms 
Hint : the random seed has a significant impact. 
In effect, this is rote learning (a kind of opening book).
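The construction can be sketched as follows: fixing the seed of a stochastic player yields a deterministic one, so K seeds give K distinct algorithms. The player here picks uniformly among legal moves and is only a hypothetical stand-in for a real stochastic Go engine.

```python
import random

def stochastic_player(rng, legal_moves):
    """Hypothetical stochastic algorithm: picks a move at random."""
    return rng.choice(legal_moves)

def make_seeded_player(seed):
    """Freeze the randomness: the returned player is deterministic.

    Re-create the player from the same seed before each game to
    replay exactly the same sequence of random draws.
    """
    rng = random.Random(seed)
    def player(legal_moves):
        return stochastic_player(rng, legal_moves)
    return player

def make_portfolio(seeds):
    """K seeds give K deterministic algorithms."""
    return [make_seeded_player(s) for s in seeds]
```

Two players built from the same seed produce identical choices when given the same sequence of positions.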
Performance of Nash portfolio (learnt offline), in generalization 
● Against « new » seeds 
● Vs uniform 
==> this means we outperform the default version (which uses randomized seeds). 
Portfolios are here distributions on random seeds. 
We get an improved algorithm (winning rate 66%) just with that.
X-axis = K = K'
Remarks 
● Nash portfolio : good 
● « Best Arm » seed : very good 
● But we will see that « best arm » has weaknesses ==> it can be « overfitted », i.e. easily beaten by a « learning » opponent.
UCBT crushes fixedSeed and wins against uniform 
Dots decreasing to 0. 
Fixed seeds (deterministic algorithms) are overfitted after 64 games. 
X-axis = log2 (nb of games) (max. 512 games)
Other experiments : variants of some algorithm 
● GNU Go with options (32 variants) 
● Nash-portfolio or UCBT portfolio : only a few percent of improvement over a single ad hoc variant. 
==> less impressive than with random seeds
Conclusions 
● Nice application for Nash-portfolio: 
– Choose a stochastic algorithm 
– Build a matrix M of games randomSeed vs randomSeed 
– Compute the Nash equilibrium 
– You get a new probability distribution on random seeds 
– It should be stronger than the original algorithm. 
● Nice application for UCBT-portfolio 
– Play against it 
– As long as you lose, it will keep the same line of play
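The matrix-building step of the Nash-portfolio recipe above can be sketched as follows. The function play_game is a hypothetical stand-in that derives a reproducible result from the two seeds; in a real experiment it would run one full game between two seeded copies of the stochastic algorithm.

```python
import random

def play_game(black_seed, white_seed):
    """Hypothetical stand-in for one game between two seeded copies.

    Returns 1 if Black wins, 0 otherwise. The outcome depends only on
    the pair of seeds, as it would for two deterministic players.
    """
    rng = random.Random(black_seed * 1_000_003 + white_seed)
    return 1 if rng.random() < 0.5 else 0

def build_matrix(black_seeds, white_seeds, play=play_game):
    """M[i][j] = result of Black seed i against White seed j."""
    return [[play(b, w) for w in white_seeds] for b in black_seeds]
```

Since fixed seeds make both players deterministic, one game per pair is enough to fill each entry in this sketch. Any zero-sum matrix-game solver can then be applied to M to obtain the new distribution on seeds.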
Conclusions 
● Further work 
– Better Nash approximation 
– Increase fun (should UCBT explore more or less ? discount ?) 
– Bigger experiments (bigger games ? 19x19 ?) 
● Comments ? 
We forgot to cite your paper ? 
We did not try on your favorite game ? 
Our results are bullshit ? Please tell us:-)