IB Variational Principles - The second variation

5The second variation

IB Variational Principles

5.1 The second variation

So far, we have only looked at the “first derivatives” of functionals. We can

identify stationary points, but we don’t know if it is a maximum, minimum or a

saddle. To distinguish between these, we have to look at the “second derivatives”,

or the second variation.

Suppose x(t) = x

(t) is a solution of

δF [x]

δy(x)

= 0,

i.e. F [x] is stationary at y = y

To determine what type of stationary point it is, we need to expand

[

δx

]

to second order in

δx

. For convenience, let

δx

(

) =

εξ

(

) with constant

ε 

We will also only consider functionals of the form

F [x] =

f(x, ˙x, t) dt

with fixed-end boundary conditions, i.e.

(

) =

(

) = 0. We will use both dots

( ˙x) and dashes (x

) to denote derivatives.

We consider a variation x 7→ x + δx and expand the integrand to obtain

f(x + εξ, ˙x + ε

ξ, t) − f(x, ˙x, t)

= ε



∂f

∂x

∂f

∂ ˙x





∂

∂x

+ 2ξ

∂

∂x∂ ˙x

∂

∂ ˙x



+ O(ε

)

Noting that 2ξ

ξ = (ξ

)

and integrating by parts, we obtain

= εξ



∂f

∂x

−



∂f

∂ ˙x







∂

∂x

−



∂

∂x∂ ˙x



∂f

∂ ˙x



plus some boundary terms which vanish. So

F [x + εξ] − F [x] =

εξ



∂f

∂x

−



∂f

∂ ˙x



dt +

F [x, ξ] + O(ε

where

F [x, ξ] =





∂

∂x

−



∂

∂x∂ ˙x



∂

∂ ˙x



is a functional of both x(t) and ξ(t). This is analogous to the term

δx

H(x)δx

appearing in the expansion of a regular function

(

). In the case of normal

functions, if

(

) is positive,

(

) is convex for all

, and the stationary point

is hence a global minimum. A similar result holds for functionals.

In this case, if

[

x, ξ

]

0 for all non-zero

and all allowed

, then a

solution x

(t) of

δF

δx

= 0 is an absolute minimum.

Example

(Geodesics in the plane)

We previously shown that a straight line is

a stationary point for the curve-length functional, but we didn’t show it is in

fact the shortest distance! Maybe it is a maximum, and we can get the shortest

distance by routing to the moon and back.

Recall that f =

1 + (y

)

. Then

∂f

∂y

= 0,

∂f

∂y

1 + (y

)

∂

∂y

1 + (y

)

with the other second derivatives zero. So we have

F [y, ξ] =

(1 + (y

)

3/2

dx > 0

So if we have a stationary function satisfying the boundary conditions, it is an

absolute minimum. Since the straight line is a stationary function, it is indeed

the minimum.

However, not all functions are convex

[citation needed]

. We can still ask whether

a solution

(

) of the Euler-Lagrange equation is a local minimum. For these,

we need to consider

F [x

, ξ] =

(ρ(t)

+ σ(t)ξ

) dt,

where

ρ(t) =

∂

∂ ˙x



x=x

, σ(t) =



∂

∂x

−



∂

∂x∂ ˙x



x=x

This is of the same form as the Sturm-Liouville problem. For

to minimize

F [x] locally, we need δ

F [x

, ξ] > 0. A necessary condition for this is

ρ(t) ≥ 0,

which is the Legendre condition.

The intuition behind this necessary condition is as follows: suppose that

(

) is negative in some interval

I ⊆

[

α, β

]. Then we can find a

(

) that makes

[

, ξ

] negative. We simply have to make

zero outside

, and small but

crazily oscillating inside

. Then inside

˙x

wiill be very large while

is kept

tiny. So we can make δ

F [y, ξ] arbitrarily negative.

Turning the intuition into a formal proof is not difficult but is tedious and

will be omitted.

However, this is not a sufficient condition. Even if we had a strict inequality

ρ(t) > 0 for all α < t < β, it is still not sufficient.

Of course, a sufficient (but not necessary) condition is

(

)

, σ

(

)

≥

0, but

this is not too interesting.

Example. In the Branchistochrone problem, we have

T [x] ∝

1 + ˙x

dt.

Then

ρ(t) =

∂

∂ ˙x



> 0

σ(t) =

x(1 + ˙x

)

> 0.

So the cycloid does minimize the time T .