IB Variational Principles

0Introduction

0 Introduction

Consider a light ray travelling towards a mirror and being reflected.

We see that the light ray travels towards the mirror, gets reflected at

, and hits

the (invisible) eye. What determines the path taken? The usual answer would

be that the reflected angle shall be the same as the incident angle. However,

ancient Greek mathematician Hero of Alexandria provided a different answer:

the path of the light minimizes the total distance travelled.

We can assume that light travels in a straight line except when reflected.

Then we can characterize the path by a single variable

, the point where the

light ray hits the mirror. Then we let

(

) to be the length of the path, and we

can solve for z by setting L

(z) = 0.

This principle sounds reasonable - in the absence of mirrors, light travels in

a straight line - which is the shortest path between two points. But is it always

true that the shortest path is taken? No! We only considered a plane mirror, and

this doesn’t hold if we have, say, a spherical mirror. However, it turns out that

in all cases, the path is a stationary point of the length function, i.e. L

(z) = 0.

Fermat put this principle further. Assuming that light travels at a finite

speed, the shortest path is the path that takes the minimum time. Fermat’s

principle thus states that

Light travels on the path that takes the shortest time.

This alternative formulation has the advantage that it applies to refraction as

well. Light travels at different speeds in different mediums. Hence when they

travel between mediums, they will change direction such that the total time

taken is minimized.

We usually define the refractive index

of a medium to be

= 1

, where

is the velocity of light in the medium. Then we can write the variational

principle as

minimize

path

n ds,

where d

is the path length element. This is easy to solve if we have two distinct

mediums. Since light travels in a straight line in each medium, we can simply

characterize the path by the point where the light crosses the boundary. However,

in the general case, we should be considering any possible path between two

points. In this case, we could no longer use ordinary calculus, and need new

tools - calculus of variations.

In calculus of variations, the main objective is to find a function

(

) that

minimizes an integral

(

) d

for some function

. For example, we might

want to minimize

(

) d

. This differs greatly from ordinary minimization

problems. In ordinary calculus, we minimize a function

(

) for all possible

values of

x ∈ R

. However, in calculus of variations, we will be minimizing the

integral

f(x) dt over all possible functions x(t).