Linear Programming (Optimization)

Download Report

Transcript Linear Programming (Optimization)

Chap 2. The Geometry of LP
ļ± In the text, polyhedron is defined as š‘ƒ = {š‘„ āˆˆ š‘…š‘› : š“š‘„ ā‰„ š‘}. So some of our
earlier results should be taken with modifications.
ļ± Thm 2.1:
(a) The intersections of convex sets is convex.
(b) Every polyhedron is a convex set.
(c) Convex combination of a finite number of elements of a convex set also
belongs to that set.
(recall that S closed for convex combination of 2 points.
ļƒ› S closed for convex combination of a finite number of points)
(d) Convex hull of a finite number of vectors (polytope) is convex.
Linear Programming 2012
1
(a) Let š‘„, š‘¦ āˆˆ š‘–āˆˆš¼ š‘†š‘– , š‘†š‘– convex ļƒž š‘„, š‘¦ āˆˆ š‘†š‘– , āˆ€ š‘–
šœ†š‘„ + 1 āˆ’ šœ† š‘¦ āˆˆ š‘†š‘– , āˆ€ š‘– since š‘†š‘– convex
šœ†š‘„ + 1 āˆ’ šœ† š‘¦ āˆˆ š‘–āˆˆš¼ š‘†š‘– ļƒž
š‘–āˆˆš¼ š‘†š‘– is convex.
Halfspace {š‘„: š‘Žā€² š‘„ ā‰„ š‘} is convex.
Polyhedron š‘ƒ is intersection of halfspaces ļƒž From (a), P is convex.
( or we may directly show š“(šœ†š‘„ + 1 āˆ’ šœ† š‘¦) ā‰„ š‘.)
(c) Use induction. True for š‘˜ = 2 by definition.
Suppose statement holds for š‘˜ elements. Suppose šœ†š‘˜+1 ā‰  1.
Pf)
ļƒž
ļƒž
(b)
Then
š‘˜+1
š‘–
š‘–=1 šœ†š‘– š‘„
= šœ†š‘˜+1 š‘„ š‘˜+1 + (1 āˆ’ šœ†š‘˜+1 )(
šœ†š‘–
š‘˜
š‘–=1 1āˆ’šœ†
š‘˜+1
š‘„š‘–)
šœ†š‘– /(1 āˆ’ šœ†š‘˜+1 ) ā‰„ 0 and sum up to 1, hence
š‘˜
š‘–=1(šœ†š‘– /(1 āˆ’
(d) Let š‘† be the convex hull of
ļƒž š‘¦=
š‘˜+1
š‘–
š‘–=1 šœ†š‘– š‘„ āˆˆ š‘†
vectors š‘„ 1 , ā€¦ , š‘„ š‘˜
šœ†š‘˜+1 ))š‘„ š‘– āˆˆ š‘† ļƒž
š‘˜
š‘–
š‘–=1 šœ‰š‘– š‘„
šœ†š‘¦ + 1 āˆ’ šœ† š‘§ = šœ†
, š‘§=
š‘˜
š‘–=1 šœ‰š‘–
š‘˜
š‘–
š‘–=1 šœƒš‘– š‘„
š‘„š‘– + 1 āˆ’
and š‘¦, š‘§ āˆˆ š‘†
for some šœ‰š‘– , šœƒš‘– .
šœ†
š‘˜
š‘–
š‘–=1 šœƒš‘– š‘„
=
š‘˜
š‘–=1(šœ†šœ‰š‘–
+ 1 āˆ’ šœ† šœƒš‘– )š‘„ š‘–
šœ†šœ‰š‘– + (1 āˆ’ šœ†)šœƒš‘– ā‰„ 0 and sum up to 1 ļƒž convex combination of š‘„ š‘– ā€²š‘ 
ļƒž šœ†š‘¦ + 1 āˆ’ šœ† š‘§ āˆˆ š‘†.
Linear Programming 2012
ļ‚œ
2
Extreme points, vertices, and b.f.sā€™s
ļ± Def: (a) Extreme point ( as we defined earlier)
(b) š‘„ āˆˆ š‘ƒ is a vertex if āˆƒ š‘ āˆˆ š‘…š‘› such that š‘ ā€² š‘„ < š‘ ā€² š‘¦, āˆ€ š‘¦ āˆˆ š‘ƒ and š‘¦ ā‰  š‘„.
( š‘„ is a unique optimal solution of min š‘ ā€² š‘„, š‘„ āˆˆ š‘ƒ )
(c) Consider polyhedron š‘ƒ and š‘„ āˆ— āˆˆ š‘…š‘› . Then š‘„ āˆ— is a basic solution if all
equality constraints are active at š‘„ āˆ— and āˆƒ š‘› linearly independent active
constraints among the constraints active at š‘„ āˆ— .
( basic feasible solution if š‘„ āˆ— is basic solution and š‘„ āˆ— āˆˆ š‘ƒ )
ļ± Note: Earlier, we defined the extreme point same as in the text.
Vertex as 0-dimensional face ( dim(š‘ƒ) + rank š“= , š‘ = = š‘› ) which is the same
as the basic feasible solution defined in the text.
We defined basic solution (and b.f.s) only for the standard LP. (š‘„šµ = šµāˆ’1 š‘,
š‘„š‘ = 0)
Definition (b) is new. It gives an equivalent characterization of extreme point.
(b) can be extended to characterize a face š¹ of š‘ƒ.
Linear Programming 2012
3
š‘„3
š‘„2
A
P
C
E
š‘„1
D
B
ļ± Fig. 2.6: š‘ƒ = { š‘„1 , š‘„2 , š‘„3 : š‘„1 + š‘„2 + š‘„3 = 1, š‘„1 , š‘„2 , š‘„3 ā‰„ 0}
Three constraints active at A, B, C, D. Only two constraints active at E.
Note that D is not a basic solution since it does not satisfy the equality
constraint. However, if š‘ƒ is given as š‘ƒ = { š‘„1 , š‘„2 , š‘„3 : š‘„1 + š‘„2 + š‘„3 ā‰„
1, š‘„1 + š‘„2 + š‘„3 ā‰¤ 1, š‘„1 , š‘„2 , š‘„3 ā‰„ 0}, D is a basic solution by the definition in
the text, i.e. whether a solution is basic depends on the representation of š‘ƒ.
Linear Programming 2012
4
A
E
D
P
F
B
C
ļ± Fig. 2.7: A, B, C, D, E, F are all basic solutions. C, D, E, F are basic
feasible solutions.
Linear Programming 2012
5
ļ± Comparison of definitions in the notes and the text
Notes
Text
Extreme
point
Geometric definition
Geometric definition
Vertex
0-dimensional face
Existence of š‘ vector which makes š‘„ āˆ—
as the unique optimal solution for the
LP
Basic
solution,
b.f.s.
Defined for standard
form. Set š‘› āˆ’ š‘š
variables at 0 and solve
the remaining system.
b.f.s. if nonnegative.
Defined for general polyhedron.
Satisfy equality constraints and š‘›
linearly independent constraints are
active. ( 0-dimensional face if feasible)
Linear Programming 2012
6
ļ± Thm 2.3: š‘„ āˆ— āˆˆ š‘ƒ, then š‘„ āˆ— vertex, extreme point, and b.f.s. are equivalent
statements.
Pf) We follow the definitions given in the text. We already showed in the
notes that extreme point and 0-dimensional face ( š“š¼ š‘„ āˆ— = š‘š¼ , š“š¼ : rank š‘›,
b.f.s. in the text) are equivalent.
To show all are equivalent, take the following steps:
š‘„ āˆ— vertex (1) ļƒž š‘„ āˆ— extreme point (2) ļƒž š‘„ āˆ— b.f.s. (3) ļƒž š‘„ āˆ— vertex
(1) š‘„ āˆ— vertex ļƒž š‘„ āˆ— extreme point
Suppose š‘„ āˆ— is vertex, i.e. āˆƒ š‘ āˆˆ š‘…š‘› such that š‘„ āˆ— is unique minimum of
min š‘ ā€² š‘„, š‘„ āˆˆ š‘ƒ.
If š‘¦, š‘§ āˆˆ š‘ƒ, š‘¦, š‘§ ā‰  š‘„ āˆ— , then š‘ ā€² š‘„ āˆ— < š‘ ā€² š‘¦ and š‘ ā€² š‘„ āˆ— < š‘ ā€² š‘§.
Hence š‘ ā€² š‘„ āˆ— < š‘ ā€² šœ†š‘¦ + 1 āˆ’ šœ† š‘§ , 0 ā‰¤ šœ† ā‰¤ 1 ļƒžšœ†š‘¦ + 1 āˆ’ šœ† š‘§ ā‰  š‘„ āˆ— .
Hence š‘„ āˆ— cannot be expressed as convex combination of two other points in
š‘ƒ.
ļƒž š‘„ āˆ— extreme point.
Linear Programming 2012
7
(continued)
(2) š‘„ āˆ— extreme point ļƒž š‘„ āˆ— b.f.s.
Suppose š‘„ āˆ— is not a b.f.s.. Let š¼ = {š‘–: š‘Žš‘–ā€² š‘„ āˆ— = š‘š‘– }.
Since š‘„ āˆ— is not a b.f.s., the number of linearly independent vectors š‘Žš‘– in š¼ < š‘›.
Hence āˆƒ nonzero š‘‘ āˆˆ š‘…š‘› such that š‘Žš‘–ā€² š‘‘ = 0, āˆ€ š‘– āˆˆ š¼.
Consider š‘¦ = š‘„ āˆ— + šœ€š‘‘, š‘§ = š‘„ āˆ— āˆ’ šœ€š‘‘. But, š‘¦, š‘§ āˆˆ š‘ƒ for sufficiently small
positive šœ€, and š‘„ āˆ— = (š‘¦ + š‘§)/2, which implies š‘„ āˆ— is not an extreme point.
(3) š‘„ āˆ— b.f.s. ļƒž š‘„ āˆ— vertex
Let š‘„ āˆ— be a b.f.s. and let š¼ = {š‘–: š‘Žš‘–ā€² š‘„ āˆ— = š‘š‘– }.
Let š‘ = š‘–āˆˆš¼ š‘Žš‘– . Then š‘ ā€² š‘„ āˆ— = š‘–āˆˆš¼ š‘Žš‘– ā€²š‘„ āˆ— = š‘–āˆˆš¼ š‘š‘– .
āˆ€ š‘„ āˆˆ š‘ƒ, we have š‘ ā€² š‘„ = š‘–āˆˆš¼ š‘Žš‘– ā€²š‘„ ā‰„ š‘–āˆˆš¼ š‘š‘– = š‘ ā€² š‘„ āˆ— , hence š‘„ āˆ— optimal.
For uniqueness, equality holds ļƒ› š‘Žš‘– ā€²š‘„ = š‘š‘– , š‘– āˆˆ š¼.
Since š‘„ āˆ— is b.f.s., it is the unique solution of š‘Žš‘– ā€²š‘„ = š‘š‘– , š‘– āˆˆ š¼.
Hence š‘„ āˆ— is a vertex.
ļ‚œ
Linear Programming 2012
8
ļ± Note: Whether š‘„ āˆ— is a basic solution depends on the representation of š‘ƒ.
However, š‘„ āˆ— is b.f.s. if and only if š‘„ āˆ— extreme point and š‘„ āˆ— being extreme point
is independent of the representation of š‘ƒ. Hence the property of being a b.f.s.
is also independent of the representation of š‘ƒ.
ļ± Cor 2.1: For polyhedron š‘ƒ ā‰  āˆ…, there can be finite number of basic or basic
feasible solutions.
ļ± Def: Two distinct basic solutions are said to be adjacent if we can find š‘› āˆ’ 1
linearly independent constraints that are active at both of them. ( In Fig 2.7, D
and E are adjacent to B; A and C are adjacent to D.)
If two adjacent basic solutions are also feasible, then the line segment that joins
them is called an edge of the feasible set (one dimensional face).
Linear Programming 2012
9
2.3 Polyhedra in standard form
ļ± Thm 2.4: š‘ƒ = {š‘„: š“š‘„ = š‘, š‘„ ā‰„ 0}, š“: š‘š × š‘›, full row rank.
Then š‘„ is a basic solution ļƒ› š‘„ satisfies š“š‘„ = š‘ and āˆƒ indices
šµ 1 , ā€¦ , šµ(š‘š) such that š“šµ(1) , ā€¦ , š“šµ(š‘š) are linearly independent and š‘„š‘– =
0, š‘– ā‰  šµ 1 , ā€¦ , šµ(š‘š).
Pf) see text.
( To find a basic solution, choose š‘š linearly independent columns
š“šµ(1) , ā€¦ , š“šµ(š‘š) . Set š‘„š‘– = 0 for all š‘– ā‰  šµ 1 , ā€¦ , šµ(š‘š), then solve š“š‘„ = š‘
for š‘„šµ(1) , ā€¦ , š‘„šµ(š‘š) . )
ļ± Def: basic variable, nonbasic variable, basis, basic columns, basis matrix šµ.
(see text)
( šµš‘„šµ = š‘ ļ‚® š‘„šµ = šµāˆ’1 š‘ )
Linear Programming 2012
10
ļ± Def: For standard form problems, we say that two bases are adjacent if they
share all but one basic column.
ļ± Note: A basis uniquely determines a basic solution.
Hence if have two different basic solutions ļƒž have different basis.
But two different bases may correspond to the same basic solution. (e.g. when
š‘=0)
Similarly, two adjacent basic solutions ļƒž two adjacent bases
Two adjacent bases with different basic solutions ļƒž two adjacent basic
solutions.
However, two adjacent bases only not necessarily imply two adjacent basic
solutions. The two solutions may be the same solution.
Linear Programming 2012
11
ļ± Check that full row rank assumption on š“ results in no loss of generality.
ļ± Thm 2.5: š‘ƒ = {š‘„: š“š‘„ = š‘, š‘„ ā‰„ 0} ā‰  āˆ…, š“: š‘š × š‘›, rank is š‘˜ < š‘š.
š‘„ = {š‘„: š“š¼ š‘„ = š‘š¼ , š‘„ ā‰„ 0}, š¼ = {š‘–1 , ā€¦ , š‘–š‘˜ } with linearly independent rows.
Then š‘ƒ = š‘„.
Pf) Suppose first š‘˜ rows of š“ are linearly independent.
š‘ƒ āŠ† š‘„ is clear. Show š‘„ āŠ† š‘ƒ.
Every row š‘Žš‘– ā€² of š“ can be expressed as š‘Žš‘– ā€² =
Hence, for š‘„ āˆˆ š‘ƒ, š‘š‘– = š‘Žš‘– ā€²š‘„ =
š‘˜
š‘—=1 šœ†š‘–š‘— š‘Žš‘— ā€²š‘„
=
š‘˜
š‘—=1 šœ†š‘–š‘— š‘Žš‘— ā€².
š‘˜
š‘—=1 šœ†š‘–š‘— š‘š‘— ,
š‘– = 1, ā€¦ , š‘š
i.e. š‘š‘– is also linear combination of š‘š‘— , š‘— āˆˆ š¼.
Suppose š‘¦ āˆˆ š‘„, then āˆ€ š‘– = 1, ā€¦ , š‘š,
š‘Žš‘– ā€²š‘¦ =
š‘˜
š‘—=1 šœ†š‘–š‘— š‘Žš‘— ā€²š‘¦
=
š‘˜
š‘—=1 šœ†š‘–š‘— š‘š‘—
Hence, š‘¦ āˆˆ š‘ƒ ļƒž š‘„ āŠ† š‘ƒ
Linear Programming 2012
= š‘š‘–
ļ‚œ
12
2.4 Degeneracy
ļ± Def 2.10: A basic solution š‘„ āˆˆ š‘…š‘› is said to be degenerate if more than š‘› of
the constraints are active at š‘„.
ļ± Def 2.11: š‘ƒ = {š‘„ āˆˆ š‘…š‘› : š“š‘„ = š‘, š‘„ ā‰„ 0}, š“: š‘š × š‘›, full row rank.
Then š‘„ is a degenerate basic solution if more than š‘› āˆ’ š‘š of the components
of š‘„ are 0 ( i.e. some basic variables have 0 value)
ļ± For standard LP, if we have more than š‘› āˆ’ š‘š variables at 0 for a basic
feasible solution š‘„ āˆ— , it means that more than š‘› āˆ’ š‘š of the nonnegativity
constraints are active at š‘„ āˆ— in addition to the š‘š constraints in š“š‘„ = š‘.
The solution can be identified by defining š‘› āˆ’ š‘š nonbasic variables (value
= 0). Hence, depending on the choice of nonbasic variables, we have
different bases, but the solution is the same.
Linear Programming 2012
13
A
D
C
P
B
E
ļ± Fig 2.9: A and C are degenerate basic feasible solutions. B and E are
nondegenerate. D is a degenerate basic solution.
Linear Programming 2012
14
š‘„3 = 0
š‘„4 = 0
A
B
š‘„5 = 0
š‘ƒ
š‘„2 = 0
š‘„1 = 0
š‘„6 = 0
ļ± Fig 2.11: (š‘› āˆ’ š‘š)-dimensional illustration of degeneracy. Here, š‘› = 6,
š‘š = 4. A is nondegenerate and basic variables are š‘„1 , š‘„2 , š‘„3 , š‘„6 . B is
degenerate. We can choose š‘„1 , š‘„6 as the nonbasic variables. Other
possibilities are to choose š‘„1 , š‘„5 , or to choose š‘„5 , š‘„6 .
Linear Programming 2012
15
ļ± Degeneracy is not purely geometric property, it may depend on
representation of the polyhedrom
ex) š‘ƒ = {š‘„: š“š‘„ = š‘, š‘„ ā‰„ 0}, š“: š‘š × š‘›
š‘ƒā€² = {š‘„: š“š‘„ ā‰„ š‘, āˆ’š“š‘„ ā‰„ āˆ’š‘, š‘„ ā‰„ 0}
We know that š‘ƒ = š‘ƒā€², but representation is different.
Suppose š‘„ āˆ— is a nondegenerate basic feasible solution of š‘ƒ.
Then exactly š‘› āˆ’ š‘š of the variables š‘„š‘–āˆ— are equal to 0.
For š‘ƒā€², at the basic feasible solution š‘„ āˆ— , we have š‘› āˆ’ š‘š variables set to 0
and additional 2š‘š constraints are satisfied with equality. Hence, we have
š‘› + š‘š active constraints and š‘„ āˆ— is degenerate.
Linear Programming 2012
16
2.5 Existence of extreme points
ļ± Def 2.12: Polyhedron š‘ƒ āŠ† š‘…š‘› contains a line if āˆƒ a vector š‘„ āˆˆ š‘ƒ and a
nonzero š‘‘ āˆˆ š‘…š‘› such that š‘„ + šœ†š‘‘ āˆˆ š‘ƒ for all šœ† āˆˆ š‘….
Note that if š‘‘ is a line in š‘ƒ, then š“(š‘„ + šœ†š‘‘) ā‰¤ š‘ for all šœ† āˆˆ š‘…
ļƒ› š“š‘‘ = 0
Hence š‘‘ is a vector in the lineality space š‘†. (in š‘ƒ = š‘† + š¾ + š‘„)
ļ± Thm 2.6: š‘ƒ = {š‘„ āˆˆ š‘…š‘› : š‘Žš‘– ā€²š‘„ ā‰„ š‘š‘– , š‘– = 1, ā€¦ , š‘š} ā‰  āˆ…, then the following are
equivalent.
(a) š‘ƒ has at least one extreme point.
(b) š‘ƒ does not contain a line.
(c) āˆƒ š‘› vectors out of š‘Ž1 , ā€¦ , š‘Žš‘š , which are linearly independent.
Pf) see proof in the text.
Linear Programming 2012
17
ļ± Note that the conditions given in Thm 2.6 means that the lineality space š‘† =
{0}.
ļ± Cor 2.2: Every nonempty bounded polyhedron (polytope) and every
nonempty polyhedron in standard form has at least one basic feasible
solution (extreme point).
Linear Programming 2012
18
2.6 Optimality of extreme points
ļ± Thm 2.7: Consider the LP of minimizing š‘ ā€² š‘„ over a polyhedron š‘ƒ. Suppose
š‘ƒ has at least one extreme point and there exists an optimal solution.
Then there exists an optimal solution which is an extreme point of š‘ƒ.
Pf) see text.
ļ± Thm 2.8: Consider the LP of minimizing š‘ ā€² š‘„ over a polyhedron š‘ƒ. Suppose
š‘ƒ has at least one extreme point.
Then, either the optimal cost is āˆ’āˆž, or there exists an extreme point which is
optimal.
Linear Programming 2012
19
(continued)
Idea of proof in the text)
Consider any š‘„ āˆˆ š‘ƒ. Let š¼ = {š‘–: š‘Žš‘– ā€²š‘„ = š‘š‘– }
Then we move to š‘¦ = š‘„ + šœ†š‘‘, where š‘Žš‘– ā€²š‘‘ = 0, š‘– āˆˆ š¼ and š‘ ā€² š‘‘ ā‰¤ 0.
Then either the optimal cost is āˆ’āˆž ( if the half line š‘‘ is in š‘ƒ and š‘ ā€² š‘‘ < 0 )
or we meet a new inequality which becomes active ( cost does not increase).
By repeating the process, we eventually arrive at an extreme point which
has value not inferior to š‘„.
Therefore, for any š‘„ in š‘ƒ, there exists an extreme point š‘¦ such that š‘ ā€² š‘¦ ā‰¤
š‘ ā€² š‘„. Then we choose the extreme point which gives the smallest objective
value with respect to š‘.
Linear Programming 2012
20
ļ± ( alternative proof of Thm 2.8)
š‘ƒ = š‘† + š¾ + š‘„. Pointedness of š‘ƒ implies š‘† = {0}.
Hence š‘„ āˆˆ š‘ƒ ļƒ› š‘„ = š‘– š‘žš‘– š‘‘ š‘– + š‘— š‘Ÿš‘— š‘¢ š‘— , where š‘‘ š‘– ā€²š‘  are extreme rays of š¾
and š‘¢ š‘— ā€²š‘  are extreme points of š‘ƒ and š‘žš‘– ā‰„ 0, āˆ€ š‘–, š‘Ÿš‘— ā‰„ 0, āˆ€ š‘—, š‘— š‘Ÿš‘— = 1.
Suppose āˆƒ š‘– such that š‘ ā€² š‘‘ š‘– < 0, then LP is unbounded.
( For š‘„ āˆˆ š‘ƒ, š‘„ + šœ†š‘‘ š‘– āˆˆ š‘ƒ for šœ† ā‰„ 0. Then š‘ā€²(š‘„ + šœ†š‘‘ š‘– ) ļ‚® āˆ’āˆž as šœ† ā†’ āˆž)
Otherwise, š‘ā€²š‘‘ š‘– ā‰„ 0 for all š‘–, take š‘¢āˆ— such that š‘ ā€² š‘¢āˆ— = š‘šš‘–š‘›š‘— š‘ā€²š‘¢ š‘— .
Then āˆ€ š‘„ āˆˆ š‘ƒ,
š‘ā€²š‘„ =
š‘–
š‘– š‘žš‘– (š‘ā€²š‘‘ ) +
š‘—
š‘— š‘Ÿš‘— (š‘ā€²š‘¢ ) ā‰„
š‘—
ā€²š‘¢
š‘— š‘Ÿš‘— (š‘ā€²š‘¢ ) ā‰„ š‘
āˆ—
š‘— š‘Ÿš‘—
= š‘ā€²š‘¢āˆ— .
Hence LP is either unbounded or āˆƒ an extreme point of š‘ƒ which is an optimal
solution.
Proof here shows that the existence of an extreme ray š‘‘ š‘– of the pointed
recession cone š“š‘„ ā‰„ 0 ( if have min problem and polyhedron is š“š‘„ ā‰„ š‘)
such that š‘ ā€² š‘‘ š‘– < 0 is the necessary and sufficient condition for unboundedness
of the LP.
( If š‘ƒ has at least one extreme point, then LP is unbounded
ļƒ› āˆƒ an extreme ray š‘‘ š‘– in recession cone š¾ such that š‘ ā€² š‘‘ š‘– < 0)
Linear Programming 2012
ļ‚œ
21