IB Optimisation - The method of Lagrange multipliers

2.4 Supporting hyperplanes and convexity

We use the fancy term “hyperplane” to denote planes in higher dimensions (in an n-dimensional space, a hyperplane has n − 1 dimensions).

Definition (Supporting hyperplane). A hyperplane $\alpha : \mathbb{R}^m \to \mathbb{R}$ is supporting to $\phi$ at $b$ if $\alpha$ intersects $\phi$ at $b$ and $\phi(c) \geq \alpha(c)$ for all $c$.
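For illustration, here is a small worked example under assumptions of our own: take $m = 1$ and the hypothetical function $\phi(c) = c^2$. At any point $b$, consider the line through $(b, \phi(b))$ with slope $\lambda = 2b$:

\[
\alpha(c) = b^2 + 2b(c - b), \qquad \phi(c) - \alpha(c) = c^2 - 2bc + b^2 = (c - b)^2 \geq 0.
\]

So $\alpha$ intersects $\phi$ at $b$ and lies below it everywhere, i.e. it is supporting to $\phi$ at $b$.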

Theorem. (P) satisfies strong duality iff $\phi(c) = \inf_{x \in X(c)} f(x)$ has a supporting hyperplane at $b$.

Note that here we fix a b, and let φ be a function of c.

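Proof. ($\Leftarrow$) Suppose there is a supporting hyperplane. Then since the plane passes through $\phi(b)$, it must be of the form

\[
\alpha(c) = \phi(b) + \lambda^T(c - b).
\]

Since this is supporting, for all $c \in \mathbb{R}^m$,

\[
\phi(b) + \lambda^T(c - b) \leq \phi(c),
\]

or equivalently

\[
\phi(b) \leq \phi(c) - \lambda^T(c - b).
\]

This implies that

\[
\begin{aligned}
\phi(b) &\leq \inf_{c \in \mathbb{R}^m} \left(\phi(c) - \lambda^T(c - b)\right) \\
&= \inf_{c \in \mathbb{R}^m} \inf_{x \in X(c)} \left(f(x) - \lambda^T(h(x) - b)\right) \\
&= \inf_{x \in X} L(x, \lambda) \\
&= g(\lambda).
\end{aligned}
\]

The second line holds since $\phi(c) = \inf_{x \in X(c)} f(x)$ and $h(x) = c$ for $x \in X(c)$. The third line holds since $\bigcup_{c \in \mathbb{R}^m} X(c) = X$, which is true since for any $x \in X$, we have $x \in X(h(x))$.

By weak duality, $g(\lambda) \leq \phi(b)$. So $\phi(b) = g(\lambda)$. So strong duality holds.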

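($\Rightarrow$) Assume now that we have strong duality. Then there exists $\lambda$ such that for all $c \in \mathbb{R}^m$,

\[
\begin{aligned}
\phi(b) &= g(\lambda) \\
&= \inf_{x \in X} L(x, \lambda) \\
&\leq \inf_{x \in X(c)} L(x, \lambda) \\
&= \inf_{x \in X(c)} \left(f(x) - \lambda^T(h(x) - b)\right) \\
&= \phi(c) - \lambda^T(c - b).
\end{aligned}
\]

So $\phi(b) + \lambda^T(c - b) \leq \phi(c)$. So this defines a supporting hyperplane.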
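Both directions of the theorem can be checked numerically on a toy problem. The sketch below assumes a hypothetical instance that is not taken from the notes: minimise $f(x) = x_1^2 + x_2^2$ subject to $h(x) = x_1 + x_2 = b$ with $X = \mathbb{R}^2$. The closed forms for `phi` and `g` are derived by hand for this instance only, using $L(x, \lambda) = f(x) - \lambda(h(x) - b)$.

```python
# Toy problem (an assumption for illustration, not from the notes):
#   minimise f(x) = x1^2 + x2^2  subject to  h(x) = x1 + x2 = b,  X = R^2.

def phi(c):
    """Value function: on x1 + x2 = c the minimiser is x1 = x2 = c / 2."""
    return c * c / 2.0

def g(lam, b):
    """Dual function inf_x L(x, lam), with L = f(x) - lam * (h(x) - b).
    Each term x_i^2 - lam * x_i is minimised at x_i = lam / 2."""
    return -lam * lam / 2.0 + lam * b

def alpha(c, b, lam):
    """Candidate supporting hyperplane through (b, phi(b)) with slope lam."""
    return phi(b) + lam * (c - b)

b = 3.0
lam = b  # maximiser of g(., b), since dg/dlam = -lam + b = 0

# Strong duality: the dual value matches the primal value at b.
duality_gap = phi(b) - g(lam, b)

# Supporting hyperplane: phi(c) - alpha(c) should be non-negative for all c.
grid = [k / 10.0 for k in range(-100, 101)]
min_slack = min(phi(c) - alpha(c, b, lam) for c in grid)

print("duality gap:", duality_gap)
print("smallest slack phi(c) - alpha(c):", min_slack)
```

In this instance the multiplier that closes the duality gap is also the slope of the supporting hyperplane, which is what the proof predicts.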

We are making some progress now. To show that Lagrange multipliers work, we need to show that (P) satisfies strong duality. To show that (P) satisfies strong duality, we need to show that $\phi$ has a supporting hyperplane at $b$. How can we show that there is a supporting hyperplane? A sufficient condition is convexity.

Theorem (Supporting hyperplane theorem). Suppose that $\phi : \mathbb{R}^m \to \mathbb{R}$ is convex and $b \in \mathbb{R}^m$ lies in the interior of the set of points where $\phi$ is finite. Then there exists a supporting hyperplane to $\phi$ at $b$.

The proof follows rather straightforwardly from the definition of convexity, and is omitted.

This is even better progress. However, the definition of $\phi$ is rather convoluted. How can we show that it is convex? We have the following helpful theorem:

Theorem. Let

\[
\phi(b) = \inf_{x \in X} \{f(x) : h(x) \leq b\}.
\]

If $X$, $f$, $h$ are convex, then so is $\phi$ (assuming feasibility and boundedness).

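Proof. Consider $b_1, b_2 \in \mathbb{R}^m$ such that $\phi(b_1)$ and $\phi(b_2)$ are defined. Let $\delta \in [0, 1]$ and define $b = \delta b_1 + (1 - \delta) b_2$. We want to show that $\phi(b) \leq \delta \phi(b_1) + (1 - \delta) \phi(b_2)$.

Consider $x_1 \in X(b_1)$, $x_2 \in X(b_2)$, where here $X(b) = \{x \in X : h(x) \leq b\}$, and let $x = \delta x_1 + (1 - \delta) x_2$. By convexity of $X$, $x \in X$.

By convexity of $h$,

\[
\begin{aligned}
h(x) &= h(\delta x_1 + (1 - \delta) x_2) \\
&\leq \delta h(x_1) + (1 - \delta) h(x_2) \\
&\leq \delta b_1 + (1 - \delta) b_2 \\
&= b.
\end{aligned}
\]

So $x \in X(b)$. Since $\phi(b)$ is the optimal value over $X(b)$, by convexity of $f$,

\[
\begin{aligned}
\phi(b) &\leq f(x) \\
&= f(\delta x_1 + (1 - \delta) x_2) \\
&\leq \delta f(x_1) + (1 - \delta) f(x_2).
\end{aligned}
\]

This holds for any $x_1 \in X(b_1)$ and $x_2 \in X(b_2)$. So by taking the infimum of the right hand side,

\[
\phi(b) \leq \delta \phi(b_1) + (1 - \delta) \phi(b_2).
\]

So $\phi$ is convex.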
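The steps of this proof can be spot-checked numerically. The sketch below assumes a hypothetical one-dimensional instance that is not from the notes: $X = \mathbb{R}$, $f(x) = (x - 2)^2$ and $h(x) = x^2$, with $b \geq 0$ so the problem is feasible. The closed form used in `phi` is derived by hand for this instance only. Random sampling can only fail to find a counterexample; it is not a proof.

```python
import math
import random

# Hypothetical convex instance (an assumption for illustration):
#   X = R,  f(x) = (x - 2)^2,  h(x) = x^2,  constraint h(x) <= b with b >= 0.

def f(x):
    return (x - 2.0) ** 2

def h(x):
    return x * x

def phi(b):
    """inf{ f(x) : h(x) <= b } for b >= 0.
    The feasible set is [-sqrt(b), sqrt(b)], so the best x is min(sqrt(b), 2)."""
    return f(min(math.sqrt(b), 2.0))

random.seed(0)
TOL = 1e-9  # slack for floating-point rounding

proof_steps_hold = True
phi_is_convex = True

for _ in range(10000):
    b1 = random.uniform(0.0, 9.0)
    b2 = random.uniform(0.0, 9.0)
    d = random.random()              # plays the role of delta in [0, 1)
    b = d * b1 + (1.0 - d) * b2

    # Pick x1 feasible for b1 and x2 feasible for b2, then combine them.
    x1 = random.uniform(-math.sqrt(b1), math.sqrt(b1))
    x2 = random.uniform(-math.sqrt(b2), math.sqrt(b2))
    x = d * x1 + (1.0 - d) * x2

    feasible = h(x) <= b + TOL                               # convexity of h
    f_bound = f(x) <= d * f(x1) + (1.0 - d) * f(x2) + TOL    # convexity of f
    phi_bound = phi(b) <= f(x) + TOL                         # phi(b) is optimal
    proof_steps_hold = proof_steps_hold and feasible and f_bound and phi_bound

    # The conclusion of the theorem, checked directly on phi.
    convex_here = phi(b) <= d * phi(b1) + (1.0 - d) * phi(b2) + TOL
    phi_is_convex = phi_is_convex and convex_here

print("proof steps hold on all samples:", proof_steps_hold)
print("phi convex on all samples:", phi_is_convex)
```

Each boolean in the loop corresponds to one inequality in the proof, so a failure would point to the step where an assumption (convexity of $X$, $f$ or $h$) is violated.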

$h(x) = b$ is equivalent to $h(x) \leq b$ and $-h(x) \leq -b$. So the result holds for problems with equality constraints if both $h$ and $-h$ are convex, i.e. if $h(x)$ is linear.

Theorem. If a linear program is feasible and bounded, then it satisfies strong duality.
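As a sanity check, consider a toy linear program of our own choosing (an assumption for illustration, not from the notes): minimise $x_1 + 2x_2$ subject to $x_1 + x_2 = b$ with $X = \{x \in \mathbb{R}^2 : x \geq 0\}$ and $b \geq 0$. Using $L(x, \lambda) = f(x) - \lambda(h(x) - b)$,

\[
L(x, \lambda) = (1 - \lambda) x_1 + (2 - \lambda) x_2 + \lambda b,
\qquad
g(\lambda) = \inf_{x \geq 0} L(x, \lambda) =
\begin{cases}
\lambda b & \text{if } \lambda \leq 1, \\
-\infty & \text{otherwise.}
\end{cases}
\]

So $\sup_\lambda g(\lambda) = b$, attained at $\lambda = 1$. The primal optimum is $x = (b, 0)$ with value $\phi(b) = b$, so the two values agree. The corresponding supporting hyperplane is $\alpha(c) = \phi(b) + 1 \cdot (c - b) = c$, which coincides with $\phi(c) = c$ for $c \geq 0$.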