3.2 Least squares approximation
So far, we have expanded functions in terms of infinite series. However, in real
life, when we ask a computer to do this for us, it is incapable of storing and
calculating an infinite series. So we need to truncate it at some point.
Suppose we approximate some function $f: \Omega \to \mathbb{C}$ by a finite set of eigenfunctions $y_n(x)$. Suppose we write the approximation $g$ as
\[
g(x) = \sum_{k=1}^n c_k y_k(x).
\]
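For example, on $\Omega = [0, 1]$ with weight $w(x) = 1$, we may take the Dirichlet eigenfunctions $y_k(x) = \sqrt{2}\sin(k\pi x)$ of $-\frac{\mathrm{d}^2}{\mathrm{d}x^2}$, so that $g$ is a truncated Fourier sine series
\[
g(x) = \sum_{k=1}^n c_k \sqrt{2}\sin(k\pi x).
\]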
The objective here is to figure out what values of the coefficients $c_k$ are “the best”, i.e. make $g$ represent $f$ as closely as possible.
Firstly, we have to make it explicit what we mean by “as closely as possible”. Here we take this to mean that we want to minimize the $w$-norm $(f - g, f - g)_w$.
By sesquilinearity of the inner product, we know that
\[
(f - g, f - g)_w = (f, f)_w + (g, g)_w - (f, g)_w - (g, f)_w.
\]
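To evaluate these terms, recall that the eigenfunctions are orthonormal with respect to the weight, $(y_k, y_m)_w = \delta_{km}$, and write $\hat{f}_k = (y_k, f)_w$ as before. Then
\[
(g, g)_w = \sum_{k=1}^n \sum_{m=1}^n c_k^* c_m (y_k, y_m)_w = \sum_{k=1}^n |c_k|^2, \qquad
(f, g)_w = \sum_{k=1}^n \hat{f}_k^* c_k, \qquad
(g, f)_w = \sum_{k=1}^n c_k^* \hat{f}_k.
\]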
To minimize this norm, we want
\[
0 = \frac{\partial}{\partial c_j}(f - g, f - g)_w = \frac{\partial}{\partial c_j}\left[(f, f)_w + (g, g)_w - (f, g)_w - (g, f)_w\right].
\]
The $(f, f)_w$ term vanishes under differentiation, since it does not depend on $c_j$. Expanding our definition of $g$ using the expressions above, we get
\[
0 = \frac{\partial}{\partial c_j}\left(\sum_{k=1}^n |c_k|^2 - \sum_{k=1}^n \hat{f}_k^* c_k - \sum_{k=1}^n c_k^* \hat{f}_k\right) = c_j^* - \hat{f}_j^*.
\]
Note that here we are treating $c_j^*$ and $c_j$ as distinct quantities. So when we vary $c_j$, $c_j^*$ is unchanged. To formally justify this treatment, we can vary the real and imaginary parts separately.
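Concretely, writing $c_j = a_j + ib_j$ with $a_j, b_j$ real, the operators
\[
\frac{\partial}{\partial c_j} = \frac{1}{2}\left(\frac{\partial}{\partial a_j} - i\frac{\partial}{\partial b_j}\right), \qquad
\frac{\partial}{\partial c_j^*} = \frac{1}{2}\left(\frac{\partial}{\partial a_j} + i\frac{\partial}{\partial b_j}\right)
\]
do treat $c_j$ and $c_j^*$ as independent, and setting both to zero is equivalent to setting the derivatives with respect to $a_j$ and $b_j$ to zero.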
Hence, the extremum is achieved at $c_j^* = \hat{f}_j^*$. Similarly, varying with respect to $c_j^*$, we get that $c_j = \hat{f}_j$.
To check that this is indeed a minimum, we can look at the second derivatives
\[
\frac{\partial^2}{\partial c_i \partial c_j}(f - g, f - g)_w = \frac{\partial^2}{\partial c_i^* \partial c_j^*}(f - g, f - g)_w = 0,
\]
while
\[
\frac{\partial^2}{\partial c_i^* \partial c_j}(f - g, f - g)_w = \delta_{ij} \geq 0.
\]
Hence this is indeed a minimum.
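Alternatively, we can see this by completing the square: using the expansions above,
\[
(f - g, f - g)_w = (f, f)_w - \sum_{k=1}^n |\hat{f}_k|^2 + \sum_{k=1}^n |c_k - \hat{f}_k|^2,
\]
which is manifestly minimized by taking $c_k = \hat{f}_k$ for every $k$.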
Thus we know that $(f - g, f - g)_w$ is minimized over all $g(x)$ when
\[
c_k = \hat{f}_k = (y_k, f)_w.
\]
These are exactly the coefficients in our infinite expansion. Hence if we truncate our series at an arbitrary point, the truncation is still the best approximation we can get with that number of terms.
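As a quick numerical sanity check, the following Python sketch (using the concrete sine basis from the example above; the function $f$, the number of terms, and all names are chosen purely for illustration) verifies that the coefficients $\hat{f}_k$ do minimize the error: any perturbation of them makes the squared $w$-norm of $f - g$ larger.

import numpy as np

# On [0, 1] with weight w(x) = 1, the functions y_k(x) = sqrt(2) sin(k pi x)
# are orthonormal. We compute the truncated-series coefficients
# c_k = (y_k, f)_w and check that perturbing any of them increases the
# squared w-norm of the error.

x = np.linspace(0.0, 1.0, 10001)
w = np.ones_like(x)              # weight function w(x) = 1
f = x * (1.0 - x)                # an arbitrary function to approximate

def y(k):
    """Orthonormal eigenfunction y_k(x) = sqrt(2) sin(k pi x)."""
    return np.sqrt(2.0) * np.sin(k * np.pi * x)

def inner(u, v):
    """(u, v)_w = integral of conj(u) * v * w, via the trapezoidal rule."""
    integrand = np.conj(u) * v * w
    return np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(x))

n = 5
c_hat = np.array([inner(y(k), f) for k in range(1, n + 1)])  # c_k = (y_k, f)_w

def squared_error(c):
    g = sum(ck * y(k) for k, ck in enumerate(c, start=1))
    return inner(f - g, f - g).real

best = squared_error(c_hat)
for j in range(n):
    for eps in (-0.01, 0.01):
        perturbed = c_hat.copy()
        perturbed[j] += eps
        assert squared_error(perturbed) > best  # c_hat is optimal

print(f"minimal squared error with n = {n} terms: {best:.3e}")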