Meta-heuristics have been successfully used to solve a wide variety of problems. However, one issue many techniques share is the risk of being trapped in local optima, or of producing a limited variety of solutions (a problem known as ``population drift''). Over the years, different kinds of techniques have been proposed to deal with population drift, for example hybridizing genetic algorithms with local search techniques or using niching techniques.
This paper proposes a technique, based on Singular Value Decomposition (SVD), to enhance the population diversity of Genetic Algorithms (GAs). SVD helps to estimate the evolution direction and to drive the next generations towards orthogonal dimensions.
The proposed SVD-based GA has been evaluated on 11 benchmark problems and compared with a simple GA and a GA with a distance-crowding scheme. Results indicate that the SVD-based GA achieves significantly better solutions and exhibits quicker convergence than the alternative techniques.
Estimating the Evolution Direction of Populations to Improve Genetic Algorithms
1. Estimating the Evolution Direction of
Populations to Improve GAs
Andrea De Lucia*, Massimiliano Di Penta+
Rocco Oliveto×, Annibale Panichella*
The authors are listed in alphabetical order
* Software Engineering Lab, University of Salerno, Italy
+ RCOST, University of Sannio, Italy
× STAT Department, University of Molise, Italy
2. Motivations
• Why estimate evolution direction?
• Orthogonal evolution directions
How?
• Singular Value Decomposition (SVD)
• SVD-based Genetic Algorithm
Empirical Evaluation
• GA vs. SVD-GA
3. What is the evolution direction?
P(t) = population at generation t
4. What is the evolution direction?
P(t+k) = population after k generations
P(t) = population at generation t
7. Why estimate evolution direction?
Motivating Example 1
min f(x) = sin^8(x − 1) + 1, x ∈ [0; 2]
[plot of f(x) on [0; 2]]
1. Optimum is x = 1
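The motivating function (assuming the garbled extraction reads sin^8(x − 1) + 1 on [0; 2]) can be checked numerically with a short Python sketch:

```python
import math

# Motivating Example 1, as reconstructed from the slide:
# minimize f(x) = sin^8(x - 1) + 1 over x in [0, 2]
def f(x):
    return math.sin(x - 1.0) ** 8 + 1.0

# The optimum is x = 1, where f(1) = 1; the function is extremely
# flat around the optimum, which slows down a GA's convergence.
print(f(1.0))  # 1.0
print(f(0.0))  # noticeably above 1
```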
8. Why estimate evolution direction?
Motivating Example 1
min f(x) = sin^8(x − 1) + 1, x ∈ [0; 2]
[plot of f(x) on [0; 2]]
1. Optimum is x = 1
2. f(x) is convex
9. Why estimate evolution direction?
Motivating Example 1
min f(x) = sin^8(x − 1) + 1, x ∈ [0; 2]
[plot of f(x) on [0; 2]]
1. Optimum is x = 1
2. f(x) is convex
The flat region around the optimum causes poor or slow convergence of the GA
10. Why estimate evolution direction?
Motivating Example 1
min f(x) = sin^8(x − 1) + 1, x ∈ [0; 2]
[plot showing the current evolution direction Δx]
11. Why estimate evolution direction?
Motivating Example 1
min f(x) = sin^8(x − 1) + 1, x ∈ [0; 2]
[plot showing the current direction Δx and the probable future direction]
The estimated current evolution direction can be used to improve the convergence speed of a GA
12. Why estimate evolution direction?
Motivating Example 2
min f(x) = sin^8(x − 1) + 1, x ∈ [0; 2.3]
[plot showing the current evolution direction Δx]
13. Why estimate evolution direction?
Motivating Example 2
min f(x) = sin^8(x − 1) + 1, x ∈ [0; 2.3]
[plot showing the current evolution direction Δx]
We can use the evolution direction to drive the next generations along orthogonal directions
In this way, we improve the probability of converging to a global optimum
17. Linear Algebra
Given an m × n matrix A, whose rows are the observations and whose columns are the factors:
A · A^T (an m × m matrix) measures the relationship between observations within the factor space.
A^T · A (an n × n matrix) measures the relationship between factors within the observation space.
18. Eigenvalues and Eigenvectors
The eigenvectors of (A · A^T) form an orthogonal basis of the space R^m.
Each column vector of A can be expressed as a linear combination of U = {u1, u2, …, um}.
EigenVectors(A · A^T) = {u1, u2, …, um}, with u_i · u_j^T = 0 for i ≠ j
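This orthogonality property can be verified numerically; a sketch with numpy (the random matrix is illustrative only):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 6))  # m = 4 observations, n = 6 factors

# Eigenvectors of the symmetric matrix A @ A.T form an orthogonal
# basis of R^m (numpy's eigh returns them as orthonormal columns).
eigvals, U = np.linalg.eigh(A @ A.T)

# Orthogonality: U.T @ U is the identity, i.e. u_i . u_j = 0 for i != j
print(np.allclose(U.T @ U, np.eye(4)))  # True
```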
19. Eigenvalues and Eigenvectors
The eigenvectors of (A^T · A) form an orthogonal basis of the space R^n.
Each row vector of A can be expressed as a linear combination of V = {v1, v2, …, vn}.
EigenVectors(A^T · A) = {v1, v2, …, vn}, with v_i · v_j^T = 0 for i ≠ j
20. How? Singular Value Decomposition
THEOREM:
Let A be an m × n matrix with rank k. Then there exist three matrices U, Σ, V such that
A = U · Σ · V^T, where U is m × k, Σ is k × k, and V^T is k × n.
Where:
• U contains the left singular vectors of A; they are eigenvectors of (A · A^T).
• V contains the right singular vectors of A; they are eigenvectors of (A^T · A).
• Σ is a diagonal matrix; its diagonal entries, the non-zero singular values of A, are the square roots of the non-zero eigenvalues of both (A^T · A) and (A · A^T).
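A numerical check of the theorem with numpy (illustrative matrix; numpy's svd uses the same A = U · Σ · V^T convention):

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((5, 3))  # m = 5, n = 3, full column rank k = 3

# numpy returns A = U @ diag(s) @ Vt with the singular values s
# already sorted in descending order, matching the theorem.
U, s, Vt = np.linalg.svd(A, full_matrices=False)

print(np.allclose(U @ np.diag(s) @ Vt, A))  # True: reconstruction holds
print(np.all(s[:-1] >= s[1:]))              # True: descending order
# Singular values are the square roots of the eigenvalues of A.T @ A:
print(np.allclose(np.sort(s**2), np.sort(np.linalg.eigvalsh(A.T @ A))))  # True
```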
21. How? Singular Value Decomposition
THEOREM:
Let A be an m × n matrix with rank k. Then there exist three matrices U, Σ, V such that A = U · Σ · V^T (U is m × k, Σ is k × k, V^T is k × n).
[plot: column vectors of A with the left singular vectors u1, u2, u3]
22. How? Singular Value Decomposition
THEOREM:
Let A be an m × n matrix with rank k. Then there exist three matrices U, Σ, V such that A = U · Σ · V^T (U is m × k, Σ is k × k, V^T is k × n).
[plots: column vectors of A with u1, u2, u3; row vectors of A with the right singular vectors v1, v2, v3]
23. How? Singular Value Decomposition
THEOREM:
Let A be an m × n matrix with rank k. Then there exist three matrices U, Σ, V such that A = U · Σ · V^T (U is m × k, Σ is k × k, V^T is k × n).
[plots: column vectors of A with u1, u2, u3; row vectors of A with v1, v2, v3]
Σ = diag(σ1, …, σk), with the singular values in descending order: σ1 ≥ σ2 ≥ … ≥ σk > 0
24. Using SVD for Evolution Direction
The basic idea is that a population of solutions P provided by the GA at generation t can be viewed as an m × n matrix, where each row is an individual (Individual 1, Individual 2, …, Individual m).
25. Using SVD for Evolution Direction
From the algebraic theorem: P = U · Σ · V^T
[plot: the population with its right singular vectors v1, v2]
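As an illustrative sketch (made-up population data, not from the paper), the dominant right singular vector v1 of the population matrix points along the direction in which the individuals vary the most:

```python
import numpy as np

rng = np.random.default_rng(2)
# A population of m = 100 individuals with n = 2 genes, viewed as a
# matrix P; the individuals are deliberately spread along (1, 1).
base = rng.standard_normal(100)
P = np.column_stack([base, base + 0.05 * rng.standard_normal(100)])

U, s, Vt = np.linalg.svd(P, full_matrices=False)
v1 = Vt[0]  # dominant right singular vector = main variation direction

print(np.round(np.abs(v1), 2))  # close to (0.71, 0.71), i.e. (1, 1)/sqrt(2)
```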
26. Using SVD for Evolution Direction
Suppose we have two different populations P(t) and P(t+k) at two different generations t and t + k.
We can compute two SVD decompositions.
27. Using SVD for Evolution Direction
Suppose we have two different populations P(t) and P(t+k) at two different generations t and t + k.
We can compute two SVD decompositions.
The current evolution direction is related to d ≈ U · V^T
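Which U and V the slide combines is not fully recoverable from the extraction; one plausible reading, sketched below, takes the SVD of the displacement P(t+k) − P(t), whose factor U · V^T is the orthogonal part of its polar decomposition:

```python
import numpy as np

rng = np.random.default_rng(3)
Pt = rng.standard_normal((100, 5))               # population at generation t
Ptk = Pt + 0.1 * rng.standard_normal((100, 5))   # population k generations later

# Hedged reading of the slide: SVD of the displacement between the two
# populations; U @ Vt is then an orthogonal "direction" factor (the
# rotation part of the polar decomposition of the displacement).
U, s, Vt = np.linalg.svd(Ptk - Pt, full_matrices=False)
d = U @ Vt

# d has orthonormal columns: d.T @ d is the identity
print(np.allclose(d.T @ d, np.eye(5)))  # True
```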
28. Using SVD for Evolution Direction
By definition, Σ is a scaling operator and V is a rotation operator.
Then, we construct a new orthogonal population as follows.
32. Case Study
f1–f6 are multimodal functions
f7–f11 are convex unimodal functions
33. Case Study
We compared three GAs over 50 independent runs:
- simple GA
- simple GA with distance crowding
- SVD-GA
Population size = 100
Initial population = uniformly and randomly generated within the solution spaces
Fitness scaling function = rank-based fitness scaling scheme
Crossover function = arithmetic crossover with probability Pc = 0.60
Mutation function = uniform mutation with probability Pm = 0.02 for f1 and f8, and Pm = 0.05 for all the other functions
Elitism = 2 individuals
SVD frequency = orthogonal sub-populations are generated by SVD every two generations
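The crossover and mutation operators listed above can be sketched as follows (helper names are hypothetical; only Pc, Pm, and the operator types come from the slide):

```python
import random

Pc, Pm = 0.60, 0.02
LOWER, UPPER = 0.0, 2.0  # illustrative solution-space bounds

def arithmetic_crossover(p1, p2):
    """Whole arithmetic crossover, applied with probability Pc:
    children are convex combinations of the parents."""
    if random.random() >= Pc:
        return list(p1), list(p2)
    a = random.random()
    c1 = [a * x + (1 - a) * y for x, y in zip(p1, p2)]
    c2 = [(1 - a) * x + a * y for x, y in zip(p1, p2)]
    return c1, c2

def uniform_mutation(ind):
    """Each gene is redrawn uniformly in its bounds with probability Pm."""
    return [random.uniform(LOWER, UPPER) if random.random() < Pm else g
            for g in ind]

random.seed(42)
c1, c2 = arithmetic_crossover([0.2, 1.8], [1.0, 1.0])
# Gene by gene, children never leave the interval spanned by the parents.
print(all(min(0.2, 1.0) <= g <= max(0.2, 1.0) for g in (c1[0], c2[0])))  # True
```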