Solving Linear Optimization Problems with MOSEK

Bo Jensen and Erling D. Andersen
MOSEK ApS,
Fruebjergvej 3, Box 16, 2100 Copenhagen, Denmark.
Email: bo.jensen@mosek.com
http://www.mosek.com

INFORMS Annual Meeting, Seattle, Nov. 7, 2007
Introduction
Topics

- The problem:

      (P)  min  c^T x
           st   Ax = b,
                x ≥ 0.

- The linear optimizers:
  - Interior-point optimizer (not the main focus of this talk).
  - Simplex optimizer.
- What are the recent improvements?
- What is the (relative) performance?
The linear optimizer

The general flow:
- Presolve.
- Form the reduced primal or dual.
- Scale (optimizer specific).
- Optimize (interior-point or simplex).
- Basis identification (interior-point only).
- Undo scaling and dualizing.
- Postsolve.
The simplex optimizers
What makes a good simplex optimizer?

- Exploit sparsity (i.e. the LU, FTRAN, and BTRAN routines).
- Exploit problem-dependent structure.
- Choose the right path (i.e. a good pricing strategy).
- Long steps (i.e. avoid degeneracy).
- Numerical stability (i.e. reliable and consistent results).
- Fast hotstarts (i.e. MIP and other hotstart applications).
- Other tricks.
MOSEK simplex overview

- Primal and dual simplex optimizer.
  - Efficient cold start and warm start.
  - Crashes an initial basis.
  - Multiple pricing options:
    - Full (Dantzig).
    - Partial.
    - Approximate/exact steepest edge.
    - Hybrid.
  - Degeneracy handling.
- Revised simplex algorithm + many enhancements.
- Many enhancements still possible!
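Full (Dantzig) pricing picks the entering variable with the most negative reduced cost. A minimal sketch (the helper and the toy data below are hypothetical, not MOSEK's API): compute z_j = c_j - y^T A_j over the nonbasic columns and return the winner, or None if the basis is already optimal.

```python
def dantzig_pricing(y, c_N, N_cols, tol=1e-9):
    """Full (Dantzig) pricing: y = B^-T c_B, c_N = nonbasic costs,
    N_cols[j] = nonbasic column A_j. Returns the entering index or None."""
    best_j, best_z = None, -tol
    for j, (cj, col) in enumerate(zip(c_N, N_cols)):
        z = cj - sum(yi * aij for yi, aij in zip(y, col))
        if z < best_z:               # most negative reduced cost wins
            best_j, best_z = j, z
    return best_j

# Toy data (illustrative only): y already computed from the current basis.
y = [1.0, 0.0]
c_N = [2.0, 0.5]
N_cols = [[1.0, 1.0], [3.0, 2.0]]
print(dantzig_pricing(y, c_N, N_cols))   # column 1 has z = 0.5 - 3 = -2.5
```

Partial pricing would scan only a subset of the nonbasic columns per iteration, trading pricing quality for speed.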
Exploiting sparsity aggressively

- Simplex algorithms require the solution of the linear equation systems

      B f = A:j  and  B^T g = ei

  in each iteration.
- Assume a sparse LU factorization of the basis, B = LU.
- f can be computed as follows: solve

      L f̄ = A:j

  and then

      U f = f̄.

- A simple implementation requires O(nz(L) + nz(U)) flops.
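The two-stage solve can be illustrated with a minimal dense sketch (toy numbers, not MOSEK's sparse code): with B = LU, first forward-substitute L f̄ = A:j, then back-substitute U f = f̄.

```python
def solve_lu(L, U, b):
    """Solve (LU) x = b by forward then backward substitution."""
    n = len(b)
    # Forward substitution: L y = b (L lower triangular).
    y = [0.0] * n
    for i in range(n):
        y[i] = (b[i] - sum(L[i][k] * y[k] for k in range(i))) / L[i][i]
    # Back substitution: U x = y (U upper triangular).
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        x[i] = (y[i] - sum(U[i][k] * x[k] for k in range(i + 1, n))) / U[i][i]
    return x

# Hypothetical 2x2 basis B = LU, with L unit lower triangular.
L = [[1.0, 0.0], [2.0, 1.0]]
U = [[2.0, 1.0], [0.0, 3.0]]
b = [4.0, 11.0]        # the entering column A:j
print(solve_lu(L, U, b))   # → [1.5, 1.0]
```

A dense loop like this costs O(n^2); the point of the slide is that a sparse implementation should instead cost O(nz(L) + nz(U)), and ideally less when the right-hand side is sparse.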
Exploiting sparsity aggressively (continued)

- Consider the simple example:

      [ 1       ] [ f̄1 ]   [ 0 ]
      [ 0  1    ] [ f̄2 ] = [ x ]
      [ x  0  1 ] [ f̄3 ]   [ 0 ]

- Clearly sparsity in the RHS can be exploited! (done extensively in MOSEK).
- Gilbert and Peierls [GIL:88] demonstrate how to solve the triangular system in O(minimal number of flops).
- Aim: solves with L and U and updates to the LU should run in O(minimal number of flops), not in O(m) for instance.
- Drawback: both L and U must be stored row- and column-wise because solves with L^T and U^T are required too.
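The RHS-sparsity idea can be sketched in a Gilbert-Peierls style (a hedged pure-Python illustration, not MOSEK's implementation): a symbolic DFS from the nonzeros of the right-hand side finds exactly which entries of the solution can become nonzero, and the numeric phase then touches only those columns of L.

```python
def sparse_lower_solve(L_cols, b):
    """Solve L x = b for unit lower-triangular L with sparse b.
    L_cols[j]: list of (i, v) strictly-below-diagonal entries of column j.
    b: dict {row: value} of right-hand-side nonzeros.
    Returns x as a dict of its nonzero entries."""
    seen, order = set(), []

    def dfs(j):                      # post-order DFS over the pattern of L
        seen.add(j)
        for i, _ in L_cols.get(j, ()):
            if i not in seen:
                dfs(i)               # recursion: fine for a small sketch
        order.append(j)

    for j in b:                      # symbolic phase: reachability from b
        if j not in seen:
            dfs(j)

    x = dict(b)
    for j in reversed(order):        # topological order: dependencies first
        xj = x.get(j, 0.0)
        if xj:
            for i, v in L_cols.get(j, ()):
                x[i] = x.get(i, 0.0) - v * xj
    return {i: v for i, v in x.items() if v}

# The slide's 3x3 example: one off-diagonal entry in column 1, b = (0, x, 0);
# only f̄2 is ever touched, regardless of the matrix dimension.
print(sparse_lower_solve({0: [(2, 4.0)]}, {1: 7.0}))   # → {1: 7.0}
```

The work is proportional to the number of entries actually visited, which is the "O(minimal number of flops)" behaviour the slide aims for.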
Primal (dual) Degeneracy

The simplex optimizer may take very small or zero step sizes. Why?

- Primal step size δp:

      l_B ≤ x_B − δp B^-1 a_q ≤ u_B

- Basic variables on a bound may imply a zero primal step.
- Dual step size δd:

      c_j − y^T A_j − (+) δd (e_i^T B^-1 N)_j ≥ 0   ∀ j ∈ N_L
      c_j − y^T A_j − (+) δd (e_i^T B^-1 N)_j ≤ 0   ∀ j ∈ N_U

- Nonbasic variables with zero reduced cost may imply a zero dual step.

Degeneracy poses both a theoretical and a practical problem for the simplex optimizer!

What are our options? One approach is to perturb l_j and u_j (c_j).
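The primal ratio test can be made concrete with toy numbers (illustrative only): δp is the largest step keeping l_B ≤ x_B − δp d ≤ u_B, where d = B^-1 a_q, and a basic variable sitting exactly on a bound with d pointing outward forces δp = 0.

```python
def primal_step_size(xB, lB, uB, d, tol=1e-12):
    """Largest δp ≥ 0 with lB ≤ xB − δp*d ≤ uB (elementwise ratio test)."""
    step = float("inf")
    for xi, li, ui, di in zip(xB, lB, uB, d):
        if di > tol:                 # xi decreases; limited by its lower bound
            step = min(step, (xi - li) / di)
        elif di < -tol:              # xi increases; limited by its upper bound
            step = min(step, (ui - xi) / -di)
    return step

# Non-degenerate basis: strictly interior values allow a positive step.
print(primal_step_size([2.0, 3.0], [0.0, 0.0], [5.0, 5.0], [1.0, -1.0]))  # → 2.0
# Degenerate basis: the second variable is at its lower bound 0 with d > 0,
# so the ratio test returns δp = 0 and the iteration makes no progress.
print(primal_step_size([2.0, 0.0], [0.0, 0.0], [5.0, 5.0], [1.0, 1.0]))   # → 0.0
```

Perturbing l_j and u_j nudges such variables off their bounds, so the ratio test returns small positive steps instead of zeros.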
Primal (dual) Degeneracy (continued)

MOSEK 5 has been improved on degenerate problems:

- Better and more aggressive perturbation scheme.
- Sparsity issues are important (very tricky).
- Clean up perturbations with the dual (primal) simplex.
- Many examples where "tailed" solves are substantially reduced.
- Still room for improvement.
Dual bound flipping idea used more aggressively

Dual step size δd:

      c_j − y^T A_j − (+) δd (e_i^T B^-1 N)_j ≥ 0   ∀ j ∈ N_L
      c_j − y^T A_j − (+) δd (e_i^T B^-1 N)_j ≤ 0   ∀ j ∈ N_U

A ranged variable, i.e. −∞ < l_j < x_j < u_j < ∞, may not be binding in the dual min-ratio test if profitable.

- This involves flipping nonbasic variables to the opposite bound to remain dual feasible, and costs one extra solve.
- Longer dual step lengths.
- Reduces degeneracy.
- Fewer iterations.
- More flexibility in pivot choice (i.e. potentially more stable).
- Improves sparsity of the basis when degenerate! (i.e. if x_Bi becomes feasible, no basis exchange is needed).
Dual bound flipping idea used more aggressively (continued)

Bound flipping examples:

  Problem      Rows     Cols   Iter (NB)   Iter (WB)   Time (NB)   Time (WB)
  osa-60      10280   232966       6938        5111       58.12        8.84
  world       34506    32734      54566       32606      218.81       50.03
  pds-40      66844   212859      34274       26599       96.51       18.48
  ken-18     105127   154699     151203       51452      258.18       13.92
  client      27216    20567      80555       63660      208.40       84.09

  WB = MOSEK 5 dual simplex with bound flips
  NB = MOSEK 5 dual simplex with no bound flips
Numerical stability

Introduction
The simplex optimizers
What makes a good simplex optimizer ?
MOSEK simplex-overview
Exploiting sparsity aggressively
Primal (dual) Degeneracy
Dual bound flipping idea used more aggressively
Numerical stability
Network optimizer
Computational results
Conclusions

s   Improving numerical stability.
    x   Moved the LU update before the solution update.
        s   Saves one solve with L when computing eiᵀB⁻¹ [GOL:77].
        s   A more stable approach.
    x   Better handling of singularities (singular variables are
        temporarily fixed).
    x   Switch to a safe mode if the iteration is deemed unstable.

                                                                    14 / 26
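As a toy illustration of the two systems behind the remark above — the FTRAN solve B f = a_q and the BTRAN solve Bᵀ g = e_i, whose computation the LU-update reordering makes cheaper — here is a minimal dense sketch. The matrix and entering column are invented for illustration; a real simplex code keeps sparse LU factors of B and updates them rather than calling a dense solver each iteration.

```python
import numpy as np

# Toy basis matrix B (in a real code B is sparse and factorized as B = LU).
B = np.array([[2.0, 0.0, 1.0],
              [1.0, 3.0, 0.0],
              [0.0, 1.0, 4.0]])

# FTRAN: solve B f = a_q for the entering column a_q.
a_q = np.array([1.0, 0.0, 2.0])
f = np.linalg.solve(B, a_q)

# BTRAN: the pivot row ei^T B^-1 is obtained from one solve with B^T,
# i.e. B^T g = e_i.
i = 1
e_i = np.zeros(3)
e_i[i] = 1.0
g = np.linalg.solve(B.T, e_i)
```

The row eiᵀB⁻¹ is then simply gᵀ, so a single transposed solve delivers the whole pivot row.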
Network optimizer

Introduction
The simplex optimizers
What makes a good simplex optimizer ?
MOSEK simplex-overview
Exploiting sparsity aggressively
Primal (dual) Degeneracy
Dual bound flipping idea used more aggressively
Numerical stability
Network optimizer
Computational results
Conclusions

MOSEK 5 features a network simplex optimizer.

s   Solves pure network flow problems (i.e. LPs whose columns have
    two non-zeros, one +1 and one -1).
s   Can extract an embedded network structure from a model (i.e. a
    network with side constraints).
s   Using the standard interface, only one parameter has to be set.
s   Huge problems can be solved in limited time; for instance, a
    problem with 8 million variables can be solved in less than
    200 seconds.

                                                                    15 / 26
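A pure network flow problem of the kind described above can be written down directly from its node-arc incidence matrix, whose columns have exactly one +1 and one -1. The sketch below builds a tiny min-cost flow LP and solves it with SciPy's `linprog`; the network data is invented for illustration, and MOSEK's own API is not shown.

```python
import numpy as np
from scipy.optimize import linprog

# Min-cost flow as the LP  min c^T x  s.t.  Ax = b, 0 <= x <= u,
# where A is the node-arc incidence matrix.
# Nodes: 0 (source, supply 4), 1 (transshipment), 2 (sink, demand 4).
arcs = [(0, 1), (0, 2), (1, 2)]
n_nodes = 3
A = np.zeros((n_nodes, len(arcs)))
for j, (tail, head) in enumerate(arcs):
    A[tail, j] = 1.0    # flow leaves the tail node
    A[head, j] = -1.0   # flow enters the head node

b = np.array([4.0, 0.0, -4.0])    # net supply at each node
c = np.array([1.0, 3.0, 1.0])     # arc costs
bounds = [(0, 3)] * len(arcs)     # arc capacities

res = linprog(c, A_eq=A, b_eq=b, bounds=bounds, method="highs")
print(res.x, res.fun)   # optimal flow [3, 1, 3], cost 9.0
```

The cheap path 0→1→2 is filled to its capacity of 3, and the remaining unit takes the direct arc.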
Computational results




                        16 / 26
Test setup

Introduction
The simplex optimizers
Computational results
Test setup
Network Vs. Standard simplex
Primal Simplex
Dual Simplex
Numerically difficult problems - primal simplex
Numerically difficult problems - dual simplex
Conclusions

s   577 problems (mixed size).
s   A Dual Core server with 4GB RAM running Windows 2003 (Intel CPU).
s   A Quad Core server with 8GB RAM running Windows 2003 (Intel CPU).
s   See [HM:07] for a benchmark comparing MOSEK with other solvers.

All results presented in a given table are obtained on one of the
two computers only.

                                                                    17 / 26
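The "G. avg." rows in the tables that follow are presumably geometric averages of solve times, the standard aggregate in LP benchmarking (cf. [HM:07]); a shifted variant is often used so that near-zero times do not dominate the average. A small sketch, with an illustrative shift value and made-up sample times:

```python
import math

def geometric_mean(times, shift=0.0):
    """(Shifted) geometric mean: exp(mean(log(t + shift))) - shift.
    A positive shift damps the influence of very small solve times."""
    logs = [math.log(t + shift) for t in times]
    return math.exp(sum(logs) / len(logs)) - shift

# Plain geometric mean of three hypothetical solve times (seconds):
print(geometric_mean([0.5, 2.0, 8.0]))   # -> 2.0
```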
Network Vs. Standard simplex

Introduction
The simplex optimizers
Computational results
Test setup
Network Vs. Standard simplex
Primal Simplex
Dual Simplex
Numerically difficult problems - primal simplex
Numerically difficult problems - dual simplex
Conclusions

                           small                      medium
               netw    psim    dsim      netw      psim     dsim
   Num.          30      30      30        43        43       43
   Firsts        30       0       1        43         0        0
   Total time  13.7   114.8    27.8     589.9   10676.6   3015.2
   G. avg.     0.39    2.42    0.70      6.30     91.74    19.70

                           large
               netw      psim     dsim
   Num.           2         2        2
   Firsts         2         0        0
   Total time 366.3    2905.8    968.9
   G. avg.   182.98   1115.71   468.76

Table 1: Performance of the network flow, primal simplex and
dual simplex optimizers on pure network problems.

                                                                    18 / 26
Primal Simplex

Introduction
The simplex optimizers
Computational results
Test setup
Network Vs. Standard simplex
Primal Simplex
Dual Simplex
Numerically difficult problems - primal simplex
Numerically difficult problems - dual simplex
Conclusions

                    small            medium                large
                   5      4         5        4          5         4
   Num.          399    399       148      148         30        30
   Firsts        329    245        91       62         22        11
   Total time  100.4  101.7    2425.3   8962.3    29905.2   39333.2
   G. avg.      0.06   0.07      7.49     9.24     591.39    746.01

Table 2: Performance of the version 4 and version 5 primal
simplex optimizer.

                                                                    19 / 26
Dual Simplex

Introduction
The simplex optimizers
Computational results
Test setup
Network Vs. Standard simplex
Primal Simplex
Dual Simplex
Numerically difficult problems - primal simplex
Numerically difficult problems - dual simplex
Conclusions

                    small            medium                large
                   5      4         5        4          5         4
   Num.          412    412       150      150         21        21
   Firsts        198    286       133       22         18         5
   Total time   84.8  106.4    1852.9   7611.3    23678.9   38994.3
   G. avg.      0.10   0.08      4.65     8.70     544.44   1065.24

Table 3: Performance of the version 4 and version 5 dual
simplex optimizer.

                                                                    20 / 26
Numerically difficult problems - primal simplex

Introduction
The simplex optimizers
Computational results
Test setup
Network Vs. Standard simplex
Primal Simplex
Dual Simplex
Numerically difficult problems - primal simplex
Numerically difficult problems - dual simplex
Conclusions

                    small          medium              large
                   5      4       5        4         5        4
   Num.            9      9      19       19         2        2
   Firsts          5      5      13        6         2        0
   Total time    2.7    2.8   235.9    319.6    1297.7   1503.3
   G. avg.      0.19   0.18    7.19     9.54    413.26   464.04
   Fails           0      0       0        3         0        3

Table 4: Performance of versions 4 and 5 of the primal simplex
optimizer on numerically difficult problems.

                                                                    21 / 26
Numerically difficult problems - dual simplex

Introduction
The simplex optimizers
Computational results
Test setup
Network Vs. Standard simplex
Primal Simplex
Dual Simplex
Numerically difficult problems - primal simplex
Numerically difficult problems - dual simplex
Conclusions

                    small           medium               large
                   5      4        5        4         5         4
   Num.           11     11       19       19         4         4
   Firsts          7      6       13        6         4         0
   Total time    3.9    6.6   3198.3    345.9    4736.3   12820.5
   G. avg.      0.24   0.31     8.44     9.67    802.24   2525.35
   Fails           0      0        0        1         0         1

Table 5: Performance of versions 4 and 5 of the dual simplex
optimizer on numerically difficult problems.

                                                                    22 / 26
Conclusions




              23 / 26
Conclusions

Introduction
The simplex optimizers
Computational results
Conclusions
A number of open issues exist
References

s   Simplex:
    x   MOSEK 5 is substantially faster than MOSEK 4.
    x   MOSEK 5 is more stable than MOSEK 4.
    x   The dual simplex is faster than the primal.

                                                                    24 / 26
A number of open issues exist

Introduction
The simplex optimizers
Computational results
Conclusions
A number of open issues exist
References

s   Simplex:
    x   Degeneracy (a non-perturbation method might be needed in
        extreme cases).
    x   Improve primal pricing.
    x   Better crashing on special problems.
    x   Choose a sparser path.

                                                                    25 / 26
References

Introduction
The simplex optimizers
Computational results
Conclusions
A number of open issues exist
References

[HM:07] H. Mittelmann, http://plato.la.asu.edu/bench.html

[GIL:88] J. R. Gilbert and T. Peierls, "Sparse partial pivoting in time
    proportional to arithmetic operations", SIAM J. Sci. Statist.
    Comput., 9, 1988, pp. 862-874.

[GOL:77] D. Goldfarb, "On the Bartels-Golub decomposition for
    linear programming bases", Mathematical Programming, 13,
    1977, pp. 272-279.

[KOS:02] E. Kostina, "The Long Step Rule in the Bounded-Variable
    Dual Simplex Method: Numerical Experiments", Mathematical
    Methods of Operations Research, 55, 2002, Issue 3.

[MAR:03] I. Maros, "A Generalized Dual Phase-2 Simplex
    Algorithm", European Journal of Operational Research, 149,
    2003, pp. 1-16.

                                                                    26 / 26


2007 : Solving Linear Problems with MOSEK (Seattle 2007)

  • 1. Solving Linear Optimization Problems with MOSEK. Bo Jensen ∗ MOSEK ApS, Fruebjergvej 3, Box 16, 2100 Copenhagen, Denmark. Email: bo.jensen@mosek.com INFORMS Annual Meeting Seattle Nov. 7, 2007 ∗ http://www.mosek.com Erling D. Andersen
  • 2. Introduction 2 / 26
  • 3. Topics Introduction Topics s The problem: The linear optimizer (P ) min cT x The simplex st Ax = b, optimizers Computational x ≥ 0. results Conclusions s The linear optimizers. x Interior-point optimizer (Not main focus in this talk). x Simplex optimizer. s What is the recent improvements? s What is the (relative) performance? 3 / 26
  • 4. The linear optimizer Introduction Topics The general flow : The linear optimizer s Presolve. The simplex optimizers s Form the reduced primal or dual. Computational s Scale (optimizer specific). results s Optimize (interior-point or simplex). Conclusions s Basis identification (interior-point only). s Undo scaling and dualizing. s Postsolve. 4 / 26
  • 6. What makes a good simplex optimizer ? Introduction s Exploit sparsity (i.e. LU and FTRAN and BTRAN The simplex optimizers routines). What makes a good simplex optimizer ? s Exploit problem dependent structure. MOSEK simplex-overview s Choose right path (i.e. good pricing strategy). Exploiting sparsity s Long steps (i.e. avoid degeneracy). aggressively Primal (dual) s Numerical stability (i.e. reliable and consistent results). Degeneracy Dual bound flipping s Fast hotstarts (i.e. MIP and other hotstart applications). idea used more aggressively s Other tricks. Numerical stability Network optimizer Computational results Conclusions 6 / 26
  • 7. MOSEK simplex-overview Introduction s Primal and dual simplex optimizer. The simplex optimizers What makes a good x Efficient cold start and warm start. simplex optimizer ? x Crashes an initial basis. MOSEK simplex-overview x Multiple pricing options: Exploiting sparsity aggressively s Full (Dantzig). Primal (dual) Degeneracy s Partial. Dual bound flipping idea used more s Approximate/exact steepest edge. aggressively Numerical stability s Hybrid. Network optimizer x Degeneration handling. Computational results s Revised simplex algorithm + many enhancements. Conclusions s Many enhancements still possible!. 7 / 26
  • 8. Exploiting sparsity aggressively Introduction s Simplex algs. require solution of the linear equation The simplex optimizers systems What makes a good simplex optimizer ? Bf = A:j and B T g = ei . MOSEK simplex-overview in each iteration. Exploiting sparsity aggressively Primal (dual) Degeneracy Dual bound flipping idea used more aggressively Numerical stability Network optimizer Computational results Conclusions 8 / 26
  • 9. Exploiting sparsity aggressively Introduction s Simplex algs. require solution of the linear equation The simplex optimizers systems What makes a good simplex optimizer ? Bf = A:j and B T g = ei . MOSEK simplex-overview in each iteration. Exploiting sparsity aggressively s Assume a sparse LU factorization of the basis Primal (dual) Degeneracy Dual bound flipping B = LU. idea used more aggressively Numerical stability Network optimizer Computational results Conclusions 8 / 26
  • 10. Exploiting sparsity aggressively Introduction s Simplex algs. require solution of the linear equation The simplex optimizers systems What makes a good simplex optimizer ? Bf = A:j and B T g = ei . MOSEK simplex-overview in each iteration. Exploiting sparsity aggressively s Assume a sparse LU factorization of the basis Primal (dual) Degeneracy Dual bound flipping B = LU. idea used more aggressively Numerical stability s f can be computed as follow. Solve Network optimizer Computational results ¯ Lf = A:j Conclusions and then ¯ Uf = f. 8 / 26
  • 11. Exploiting sparsity aggressively Introduction s Simplex algs. require solution of the linear equation The simplex optimizers systems What makes a good simplex optimizer ? Bf = A:j and B T g = ei . MOSEK simplex-overview in each iteration. Exploiting sparsity aggressively s Assume a sparse LU factorization of the basis Primal (dual) Degeneracy Dual bound flipping B = LU. idea used more aggressively Numerical stability s f can be computed as follow. Solve Network optimizer Computational results ¯ Lf = A:j Conclusions and then ¯ Uf = f. s Simple implementation requires O(nz(L) + nz(U )) flops. 8 / 26
  • 12. Exploiting sparsity aggressively (continued) Introduction s Consider the simple example: The simplex optimizers ¯      What makes a good 1 f1 0 simplex optimizer ? MOSEK  0 1 ¯   f2  =  x  simplex-overview x 0 1 ¯ f3 0 Exploiting sparsity aggressively Primal (dual) Degeneracy Dual bound flipping idea used more aggressively Numerical stability Network optimizer Computational results Conclusions 9 / 26
  • 13. Exploiting sparsity aggressively (continued) Introduction s Consider the simple example: The simplex optimizers ¯      What makes a good 1 f1 0 simplex optimizer ? MOSEK  0 1 ¯   f2  =  x  simplex-overview x 0 1 ¯ f3 0 Exploiting sparsity aggressively Primal (dual) Degeneracy Dual bound flipping s Clearly sparsity in the RHS can be exploited! (done idea used more aggressively extensively in MOSEK). Numerical stability Network optimizer Computational results Conclusions 9 / 26
  • 14. Exploiting sparsity aggressively (continued) Introduction s Consider the simple example: The simplex optimizers ¯      What makes a good 1 f1 0 simplex optimizer ? MOSEK  0 1 ¯   f2  =  x  simplex-overview x 0 1 ¯ f3 0 Exploiting sparsity aggressively Primal (dual) Degeneracy Dual bound flipping s Clearly sparsity in the RHS can be exploited! (done idea used more aggressively extensively in MOSEK). Numerical stability s Gilbert and Peierls [GIL:88] demonstrate how to solve the Network optimizer triangular system in O(minimal number of flops). Computational results Conclusions 9 / 26
  • 15. Exploiting sparsity aggressively (continued) Introduction s Consider the simple example: The simplex optimizers ¯      What makes a good 1 f1 0 simplex optimizer ? MOSEK  0 1 ¯   f2  =  x  simplex-overview x 0 1 ¯ f3 0 Exploiting sparsity aggressively Primal (dual) Degeneracy Dual bound flipping s Clearly sparsity in the RHS can be exploited! (done idea used more aggressively extensively in MOSEK). Numerical stability s Gilbert and Peierls [GIL:88] demonstrate how to solve the Network optimizer triangular system in O(minimal number of flops). Computational results s Aim: Solves with L and U and updates to the LU should Conclusions run in O(minimal number of flops) and not in O(m) for instance. 9 / 26
  • 16. Exploiting sparsity aggressively (continued) Introduction s Consider the simple example: The simplex optimizers ¯      What makes a good 1 f1 0 simplex optimizer ? MOSEK  0 1 ¯   f2  =  x  simplex-overview x 0 1 ¯ f3 0 Exploiting sparsity aggressively Primal (dual) Degeneracy Dual bound flipping s Clearly sparsity in the RHS can be exploited! (done idea used more aggressively extensively in MOSEK). Numerical stability s Gilbert and Peierls [GIL:88] demonstrate how to solve the Network optimizer triangular system in O(minimal number of flops). Computational results s Aim: Solves with L and U and updates to the LU should Conclusions run in O(minimal number of flops) and not in O(m) for instance. s Drawback: Both L and U must be stored row and column wise because solves with LT and U T are required too. 9 / 26
  • 17. Primal (dual) Degeneracy Introduction The simplex optimizer may take very small or zero step sizes, The simplex optimizers why ? What makes a good simplex optimizer ? MOSEK simplex-overview Exploiting sparsity aggressively Primal (dual) Degeneracy Dual bound flipping idea used more aggressively Numerical stability Network optimizer Computational results Conclusions 10 / 26
  • 18. Primal (dual) Degeneracy Introduction The simplex optimizer may take very small or zero step sizes, The simplex optimizers why ? What makes a good simplex optimizer ? s Primal step size δp : MOSEK simplex-overview lB ≤ xB − δp B −1 aq ≤ uB Exploiting sparsity aggressively Primal (dual) Degeneracy Dual bound flipping idea used more aggressively Numerical stability Network optimizer Computational results Conclusions 10 / 26
  • 19. Primal (dual) Degeneracy Introduction The simplex optimizer may take very small or zero step sizes, The simplex optimizers why ? What makes a good simplex optimizer ? s Primal step size δp : MOSEK simplex-overview lB ≤ xB − δp B −1 aq ≤ uB Exploiting sparsity aggressively s Basic variables on a bound may imply a zero primal step. Primal (dual) Degeneracy Dual bound flipping idea used more aggressively Numerical stability Network optimizer Computational results Conclusions 10 / 26
  • 20. Primal (dual) Degeneracy Introduction The simplex optimizer may take very small or zero step sizes, The simplex optimizers why ? What makes a good simplex optimizer ? s Primal step size δp : MOSEK simplex-overview lB ≤ xB − δp B −1 aq ≤ uB Exploiting sparsity aggressively s Basic variables on a bound may imply a zero primal step. Primal (dual) Degeneracy s Dual step size δd : Dual bound flipping cj − y T Aj − (+)δd (ei B −1 N )j ≥ 0 ∀j ∈ NL idea used more aggressively cj − y T Aj − (+)δd (ei B −1 N )j ≤ 0 ∀j ∈ NU Numerical stability Network optimizer Computational results Conclusions 10 / 26
  • 21. Primal (dual) Degeneracy Introduction The simplex optimizer may take very small or zero step sizes, The simplex optimizers why ? What makes a good simplex optimizer ? s Primal step size δp : MOSEK simplex-overview lB ≤ xB − δp B −1 aq ≤ uB Exploiting sparsity aggressively s Basic variables on a bound may imply a zero primal step. Primal (dual) Degeneracy s Dual step size δd : Dual bound flipping cj − y T Aj − (+)δd (ei B −1 N )j ≥ 0 ∀j ∈ NL idea used more aggressively cj − y T Aj − (+)δd (ei B −1 N )j ≤ 0 ∀j ∈ NU Numerical stability Network optimizer s Non basic variables with zero reduced cost may imply a Computational results zero dual step. Conclusions Degeneration posses both a theoretical and a practical problem for the simplex optimizer ! 10 / 26
  • 22. Primal (dual) Degeneracy Introduction The simplex optimizer may take very small or zero step sizes, The simplex optimizers why ? What makes a good simplex optimizer ? s Primal step size δp : MOSEK simplex-overview lB ≤ xB − δp B −1 aq ≤ uB Exploiting sparsity aggressively s Basic variables on a bound may imply a zero primal step. Primal (dual) Degeneracy s Dual step size δd : Dual bound flipping cj − y T Aj − (+)δd (ei B −1 N )j ≥ 0 ∀j ∈ NL idea used more aggressively cj − y T Aj − (+)δd (ei B −1 N )j ≤ 0 ∀j ∈ NU Numerical stability Network optimizer s Non basic variables with zero reduced cost may imply a Computational results zero dual step. Conclusions Degeneration posses both a theoretical and a practical problem for the simplex optimizer ! What is our options ? 10 / 26
  • 23. Primal (dual) Degeneracy Introduction The simplex optimizer may take very small or zero step sizes, The simplex optimizers why ? What makes a good simplex optimizer ? s Primal step size δp : MOSEK simplex-overview lB ≤ xB − δp B −1 aq ≤ uB Exploiting sparsity aggressively s Basic variables on a bound may imply a zero primal step. Primal (dual) Degeneracy s Dual step size δd : Dual bound flipping cj − y T Aj − (+)δd (ei B −1 N )j ≥ 0 ∀j ∈ NL idea used more aggressively cj − y T Aj − (+)δd (ei B −1 N )j ≤ 0 ∀j ∈ NU Numerical stability Network optimizer s Non basic variables with zero reduced cost may imply a Computational results zero dual step. Conclusions Degeneration posses both a theoretical and a practical problem for the simplex optimizer ! What is our options ? One approach is to perturb lj and uj (cj ). 10 / 26
  • 24. Primal (dual) Degeneracy (continued) Introduction MOSEK 5 has been improved on degenerated problems: The simplex optimizers What makes a good s Better and more aggressive perturbation scheme. simplex optimizer ? MOSEK simplex-overview Exploiting sparsity aggressively Primal (dual) Degeneracy Dual bound flipping idea used more aggressively Numerical stability Network optimizer Computational results Conclusions 11 / 26
  • 25. Primal (dual) Degeneracy (continued) Introduction MOSEK 5 has been improved on degenerated problems: The simplex optimizers What makes a good s Better and more aggressive perturbation scheme. simplex optimizer ? s Sparsity issues important (very tricky). MOSEK simplex-overview Exploiting sparsity aggressively Primal (dual) Degeneracy Dual bound flipping idea used more aggressively Numerical stability Network optimizer Computational results Conclusions 11 / 26
  • 26. Primal (dual) Degeneracy (continued) Introduction MOSEK 5 has been improved on degenerated problems: The simplex optimizers What makes a good s Better and more aggressive perturbation scheme. simplex optimizer ? s Sparsity issues important (very tricky). MOSEK simplex-overview s Clean up perturbations with dual (primal) simplex. Exploiting sparsity aggressively Primal (dual) Degeneracy Dual bound flipping idea used more aggressively Numerical stability Network optimizer Computational results Conclusions 11 / 26
  • 27. Primal (dual) Degeneracy (continued) Introduction MOSEK 5 has been improved on degenerated problems: The simplex optimizers What makes a good s Better and more aggressive perturbation scheme. simplex optimizer ? s Sparsity issues important (very tricky). MOSEK simplex-overview s Clean up perturbations with dual (primal) simplex. Exploiting sparsity aggressively s Many examples where ”tailed” solves are substantial Primal (dual) Degeneracy reduced. Dual bound flipping idea used more aggressively Numerical stability Network optimizer Computational results Conclusions 11 / 26
• 28. Primal (dual) Degeneracy (continued)
MOSEK 5 has been improved on degenerate problems:
s Better and more aggressive perturbation scheme.
s Sparsity issues are important (and very tricky).
s Clean up perturbations with the dual (primal) simplex.
s Many examples where "tailed" solves are substantially reduced.
s Still room for improvement.
• 35. Dual bound flipping idea used more aggressively
Dual step size δd:

    cj − yT Aj − (+) δd (eiT B−1 N)j ≥ 0   ∀ j ∈ NL
    cj − yT Aj − (+) δd (eiT B−1 N)j ≤ 0   ∀ j ∈ NU

A ranged variable, i.e. −∞ < lj < xj < uj < ∞, need not be binding in the dual min-ratio test if passing it is profitable.
s This involves flipping nonbasic variables to the opposite bound to remain dual feasible, at the cost of one extra solve.
s Longer dual step lengths.
s Reduces degeneracy.
s Fewer iterations.
s More flexibility in the pivot choice (i.e. potentially more stable).
s Improves sparsity of the basis when degenerate! (If xBi becomes feasible, no basis exchange is needed.)
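The long-step rule above (see [KOS:02] and [MAR:03]) can be sketched as follows. This is a simplified illustration with hypothetical helper names, not MOSEK's actual implementation: breakpoints are the step lengths at which nonbasic reduced costs would change sign, and a ranged variable reached by the dual step is bound-flipped (which lowers the slope of the dual objective by |alpha_j|·(u_j − l_j)) instead of entering the basis, for as long as the step remains profitable.

```python
import math

def long_step_dual_ratio_test(ratios, weights, boxes, slope):
    """Sketch of the long-step (bound-flipping) dual ratio test.

    ratios[j]  breakpoint step length d_j / alpha_j (assumed >= 0)
    weights[j] |alpha_j|, the pivot-row magnitude of candidate j
    boxes[j]   u_j - l_j; finite means j is ranged and may be flipped
    slope      initial rate of dual objective gain, i.e. the primal
               infeasibility of the leaving basic variable

    Returns (entering index or None, list of variables to bound-flip).
    """
    flips = []
    for j in sorted(range(len(ratios)), key=ratios.__getitem__):
        new_slope = slope - weights[j] * boxes[j]
        # A non-ranged blocker must enter the basis; a ranged one enters
        # only if passing it would stop the dual objective improving.
        if not math.isfinite(boxes[j]) or new_slope <= 0:
            return j, flips
        slope = new_slope   # flip j to its opposite bound and pass it
        flips.append(j)
    return None, flips      # every breakpoint passed: dual step unbounded
```

With a large initial infeasibility the test skips past the first (ranged) breakpoint by flipping it; with a small one the classical minimal ratio is chosen and nothing is flipped.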
• 36. Dual bound flipping idea used more aggressively (continued)
Bound flipping examples:

                                    Iter              Time
    Problem   Rows     Cols      NB      WB        NB      WB
    osa-60    10280    232966   6938    5111      58.12    8.84
    world     34506    32734    54566   32606    218.81   50.03
    pds-40    66844    212859   34274   26599     96.51   18.48
    ken-18    105127   154699   151203  51452    258.18   13.92
    client    27216    20567    80555   63660    208.40   84.09

WB = MOSEK 5 dual simplex with bound flips
NB = MOSEK 5 dual simplex with no bound flips
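A quick way to read the table above is as speedup ratios; the short script below recomputes them from the slide's numbers:

```python
# Per-problem improvements from the bound-flipping table
# (MOSEK 5 dual simplex; NB = no bound flips, WB = with bound flips).
rows = {
    # name:   (iter NB, iter WB, time NB, time WB)
    "osa-60": (6938,    5111,   58.12,   8.84),
    "world":  (54566,   32606,  218.81,  50.03),
    "pds-40": (34274,   26599,  96.51,   18.48),
    "ken-18": (151203,  51452,  258.18,  13.92),
    "client": (80555,   63660,  208.40,  84.09),
}

speedups = {name: (it_nb / it_wb, t_nb / t_wb)
            for name, (it_nb, it_wb, t_nb, t_wb) in rows.items()}
for name, (it_ratio, t_ratio) in speedups.items():
    print(f"{name:8s} iterations x{it_ratio:4.1f}   time x{t_ratio:5.1f}")
```

Note that the time speedups exceed the iteration speedups (on ken-18, roughly 18x in time versus 3x in iterations), consistent with the earlier point that bound flips also keep the basis sparser and thus make individual iterations cheaper.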
• 40. Numerical stability
s Improving numerical stability.
  x Moved the LU update before updating the solution.
    s Saves one solve with L in eiT B−1 [GOL:77].
    s More stable approach.
  x Better handling of singularities (singular variables are temporarily fixed).
  x Switch to a safe mode if deemed unstable.
• 41. Network optimizer
MOSEK 5 features a network simplex optimizer.
s Solves pure network flow problems (i.e. LPs where each column has two non-zeros, either 1 or -1).
s Can extract embedded network structure in a model (i.e. a network with side constraints).
s Using the standard interface, only one parameter has to be set.
s Huge problems can be solved in limited time; for instance, a problem with 8 million variables can be solved in less than 200 seconds.
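The structural condition in the first bullet, that every column is a node-arc incidence column, is easy to test. A minimal sketch follows; it is only a simple check for a pure network, not MOSEK's extraction heuristic, which can also find networks embedded in a larger model:

```python
def is_pure_network(columns):
    """True if every column has exactly two nonzeros, one +1 and one -1,
    i.e. the matrix is the node-arc incidence matrix of a directed graph."""
    for col in columns:
        nz = sorted(v for v in col if v != 0)
        if nz != [-1, 1]:
            return False
    return True

# Columns of the incidence matrix of a 3-node directed cycle
# (arcs 1->2, 2->3, 3->1), one column per arc.
cycle_cols = [[1, -1, 0], [0, 1, -1], [-1, 0, 1]]
print(is_pure_network(cycle_cols))
```

Any matrix passing this check is a min-cost-flow constraint matrix and can be handed to the network optimizer directly.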
• 43. Test setup
s 577 problems (mixed sizes).
s A dual-core server with 4GB RAM running Windows 2003 (Intel CPU).
s A quad-core server with 8GB RAM running Windows 2003 (Intel CPU).
s See [HM:07] for a benchmark comparing MOSEK with other solvers.
All results presented in any one table are obtained using only one of the two computers.
• 44. Network Vs. Standard simplex

                    small                     medium                   large
                netw   psim    dsim     netw   psim     dsim     netw    psim     dsim
    Num.        30     30      30       43     43       43       2       2        2
    Firsts      30     0       1        43     0        0        2       0        0
    Total time  13.7   114.8   27.8     589.9  10676.6  3015.2   366.3   2905.8   968.9
    G. avg.     0.39   2.42    0.70     6.30   91.74    19.70    182.98  1115.71  468.76

Table 1: Performance of the network flow, primal simplex and dual simplex optimizers on pure network problems.
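The "G. avg." rows in these tables report a geometric average of solve times, a common way to summarize LP benchmarks since it is less dominated by a few large instances than the arithmetic total. A minimal sketch follows; the optional shift is an assumption on my part (benchmark reports often use a shifted variant to damp the influence of near-zero times), the slides do not state one:

```python
import math

def geometric_average(times, shift=0.0):
    """(Shifted) geometric mean of a list of solve times in seconds.
    shift > 0 damps the influence of near-zero times."""
    n = len(times)
    return math.exp(sum(math.log(t + shift) for t in times) / n) - shift

print(geometric_average([2.0, 8.0]))   # plain geometric mean
```

For example, the geometric mean of 2 s and 8 s is 4 s, whereas their arithmetic mean is 5 s.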
• 45. Primal Simplex

                   small            medium              large
                5      4        5        4         5        4
    Num.        399    399      148      148       30       30
    Firsts      329    245      91       62        22       11
    Total time  100.4  101.7    2425.3   8962.3    29905.2  39333.2
    G. avg.     0.06   0.07     7.49     9.24      591.39   746.01

Table 2: Performance of the version 4 and version 5 primal simplex optimizers.
• 46. Dual Simplex

                   small            medium              large
                5      4        5        4         5        4
    Num.        412    412      150      150       21       21
    Firsts      198    286      133      22        18       5
    Total time  84.8   106.4    1852.9   7611.3    23678.9  38994.3
    G. avg.     0.10   0.08     4.65     8.70      544.44   1065.24

Table 3: Performance of the version 4 and version 5 dual simplex optimizers.
• 47. Numerically difficult problems - primal simplex

                   small            medium              large
                5      4        5        4         5        4
    Num.        9      9        19       19        2        2
    Firsts      5      5        13       6         2        0
    Total time  2.7    2.8      235.9    319.6     1297.7   1503.3
    G. avg.     0.19   0.18     7.19     9.54      413.26   464.04
    Fails       0      0        0        3         0        3

Table 4: Performance of versions 4 and 5 of the primal simplex optimizer on numerically difficult problems.
• 48. Numerically difficult problems - dual simplex

                   small            medium              large
                5      4        5        4         5        4
    Num.        11     11       19       19        4        4
    Firsts      7      6        13       6         4        0
    Total time  3.9    6.6      3198.3   345.9     4736.3   12820.5
    G. avg.     0.24   0.31     8.44     9.67      802.24   2525.35
    Fails       0      0        0        1         0        1

Table 5: Performance of versions 4 and 5 of the dual simplex optimizer on numerically difficult problems.
• 49. Conclusions
• 50. Conclusions
s Simplex:
  x MOSEK 5 is substantially faster than MOSEK 4.
  x MOSEK 5 is more stable than MOSEK 4.
  x The dual simplex is faster than the primal.
• 51. A number of open issues exist
s Simplex:
  x Degeneracy (a non-perturbation method might be needed in extreme cases).
  x Improve primal pricing.
  x Better crashing on special problems.
  x Choose a more sparse path.
• 52. References

[HM:07] H. Mittelmann, http://plato.la.asu.edu/bench.html

[GIL:88] J. R. Gilbert and T. Peierls, "Sparse partial pivoting in time proportional to arithmetic operations", SIAM J. Sci. Statist. Comput., 9, 1988, pp. 862–874.

[GOL:77] D. Goldfarb, "On the Bartels-Golub decomposition for linear programming bases", Mathematical Programming, 13, 1977, pp. 272–279.

[KOS:02] E. Kostina, "The Long Step Rule in the Bounded-Variable Dual Simplex Method: Numerical Experiments", Mathematical Methods of Operations Research, 55, 2002, Issue 3.

[MAR:03] I. Maros, "A Generalized Dual Phase-2 Simplex Algorithm", European Journal of Operational Research, 149, 2003, pp. 1–16.