4 Hilbert spaces

II Linear Analysis



4.1 Inner product spaces
We have just looked at continuous functions on some compact space $K$. Another
important space is the space of square-integrable functions. Consider the space
\[
  L^2(\mathbb{R}) = \left\{f : f \text{ is Lebesgue integrable}, \int |f|^2 < \infty\right\}/\sim,
\]
where $f \sim g$ if $f = g$ Lebesgue-almost everywhere.
One thing we like to think about is the Fourier series. Recall that for
$f \in C(S^1)$, we have defined, for each $k \in \mathbb{Z}$,
\[
  \hat{f}(k) = \frac{1}{2\pi} \int_{-\pi}^{\pi} e^{-ikx} f(x) \,\mathrm{d}x,
\]
and we have defined the partial sum
\[
  S_N(f)(x) = \sum_{n=-N}^{N} e^{inx} \hat{f}(n).
\]
We have previously seen that even if
f
is continuous, it is possible that the partial
sums
S
N
do not converge, even pointwise. However, we can ask for something
weaker:
Proposition. Let $f \in C(S^1)$. Then
\[
  \lim_{N \to \infty} \frac{1}{2\pi} \int_{-\pi}^{\pi} |f(x) - S_N(f)(x)|^2 \,\mathrm{d}x = 0.
\]
We will prove this later. However, the key point of the proof is the "orthog-
onality" of $\{e^{inx}\}$. More precisely, we have
\[
  \frac{1}{2\pi} \int_{-\pi}^{\pi} e^{inx} e^{-imx} \,\mathrm{d}x = 0 \quad \text{if } m \neq n.
\]
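This orthogonality relation is easy to check numerically. The following sketch (assuming NumPy is available) approximates the integral by a Riemann sum on an equally spaced grid; for integer frequencies over a full period this is exact up to rounding:

```python
import numpy as np

# Approximate (1/2π) ∫_{-π}^{π} e^{inx} e^{-imx} dx by a Riemann sum.
# On an equally spaced grid over a full period this is exact (up to
# rounding) for integer frequencies n, m.
N = 1024
x = np.linspace(-np.pi, np.pi, N, endpoint=False)
dx = 2 * np.pi / N

def inner(n, m):
    """Grid approximation of the normalized inner product on C(S^1)."""
    return np.sum(np.exp(1j * n * x) * np.exp(-1j * m * x)) * dx / (2 * np.pi)

print(abs(inner(3, 3)))  # 1.0 (same frequency)
print(abs(inner(3, 5)))  # ≈ 0 (distinct frequencies are orthogonal)
```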
The introduction of Hilbert spaces is in particular a way to put this in a
general framework. We want to introduce an extra structure that gives rise to
“orthogonality”.
Definition (Inner product). Let $V$ be a vector space over $\mathbb{R}$ or $\mathbb{C}$. We say
$p : V \times V \to \mathbb{R}$ or $\mathbb{C}$ is an inner product on $V$ if it satisfies
(i) $p(v, w) = \overline{p(w, v)}$ for all $v, w \in V$. (conjugate symmetry)
(ii) $p(\lambda_1 v_1 + \lambda_2 v_2, w) = \lambda_1 p(v_1, w) + \lambda_2 p(v_2, w)$. (linearity in first argument)
(iii) $p(v, v) \geq 0$ for all $v \in V$, and equality holds iff $v = 0$. (non-negativity)
We will often denote an inner product by $p(v, w) = \langle v, w\rangle$. We call $(V, \langle\cdot,\cdot\rangle)$
an inner product space.
Definition (Orthogonality). In an inner product space, $v$ and $w$ are orthogonal
if $\langle v, w\rangle = 0$.
Orthogonality and the inner product is important when dealing with vector
spaces. For example, recall that when working with finite-dimensional spaces, we
had things like Hermitian matrices, orthogonal matrices and normal matrices. All
these are in some sense defined in terms of the inner product and orthogonality.
More fundamentally, when we have a finite-dimensional vector space, we often
write the vectors as a set of $n$ coordinates. To define this coordinate system, we
start by picking $n$ orthogonal vectors (which are in fact orthonormal), and then
the coordinates are just the projections onto these orthogonal vectors.
Hopefully, you are convinced that inner products are important. So let's see
what we can get if we put inner products on arbitrary vector spaces.
We will look at some easy properties of the inner product.
Proposition (Cauchy–Schwarz inequality). Let $(V, \langle\cdot,\cdot\rangle)$ be an inner product
space. Then for all $v, w \in V$,
\[
  |\langle v, w\rangle| \leq \sqrt{\langle v, v\rangle \langle w, w\rangle},
\]
with equality iff there is some $\lambda \in \mathbb{R}$ or $\mathbb{C}$ such that $v = \lambda w$ or $w = \lambda v$.
Proof. wlog, we can assume $w \neq 0$; otherwise, this is trivial. Moreover, assume
$\langle v, w\rangle \in \mathbb{R}$; otherwise, we can just multiply $w$ by some $e^{i\theta}$.
By non-negativity, we know that for all $t \in \mathbb{R}$, we have
\[
  0 \leq \langle v + tw, v + tw\rangle = \langle v, v\rangle + 2t\langle v, w\rangle + t^2 \langle w, w\rangle.
\]
Therefore, the discriminant of this quadratic polynomial in $t$ is non-positive, i.e.
\[
  4(\langle v, w\rangle)^2 - 4\langle v, v\rangle\langle w, w\rangle \leq 0,
\]
from which the result follows.
Finally, note that if equality holds, then the discriminant is 0. So the
quadratic has exactly one root. So there exists $t$ such that $v + tw = 0$, which of
course implies $v = -tw$.
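The inequality can be sanity-checked numerically. A minimal sketch in $\mathbb{C}^5$ (assuming NumPy; note `np.vdot` conjugates its *first* argument, so an inner product conjugate-linear in the second slot is `np.vdot(b, a)`):

```python
import numpy as np

# Numerical sanity check of Cauchy-Schwarz in C^5 (illustrative only).
rng = np.random.default_rng(0)
v = rng.normal(size=5) + 1j * rng.normal(size=5)
w = rng.normal(size=5) + 1j * rng.normal(size=5)

def ip(a, b):
    """<a, b> = sum_i a_i conj(b_i); np.vdot conjugates its first argument."""
    return np.vdot(b, a)

def norm(a):
    return np.sqrt(ip(a, a).real)

lhs = abs(ip(v, w))
rhs = norm(v) * norm(w)
print(lhs <= rhs)  # True

# Equality iff one vector is a scalar multiple of the other:
print(abs(abs(ip(2j * w, w)) - norm(2j * w) * norm(w)))  # ≈ 0
```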
Proposition. Let $(V, \langle\cdot,\cdot\rangle)$ be an inner product space. Then
\[
  \|v\| = \sqrt{\langle v, v\rangle}
\]
defines a norm.
Proof. The first two axioms of the norm are easy to check, since it follows directly
from the definition of the inner product that $\|v\| \geq 0$ with equality iff $v = 0$, and
$\|\lambda v\| = |\lambda|\|v\|$.
The only non-trivial thing to check is the triangle inequality. We have
\begin{align*}
  \|v + w\|^2 &= \langle v + w, v + w\rangle \\
              &= \|v\|^2 + \|w\|^2 + \langle v, w\rangle + \langle w, v\rangle \\
              &\leq \|v\|^2 + \|w\|^2 + 2\|v\|\|w\| \\
              &= (\|v\| + \|w\|)^2,
\end{align*}
where the inequality uses $\langle v, w\rangle + \langle w, v\rangle = 2\operatorname{Re}\langle v, w\rangle \leq 2|\langle v, w\rangle|$ and
Cauchy–Schwarz. Hence we know that $\|v + w\| \leq \|v\| + \|w\|$.
This motivates the following definition:
Definition (Euclidean space). A normed vector space $(V, \|\cdot\|)$ is a Euclidean
space if there exists an inner product $\langle\cdot,\cdot\rangle$ such that
\[
  \|v\| = \sqrt{\langle v, v\rangle}.
\]
Proposition. Let $(E, \|\cdot\|)$ be a Euclidean space. Then there is a unique inner
product $\langle\cdot,\cdot\rangle$ such that $\|v\| = \sqrt{\langle v, v\rangle}$.
Proof. The real and complex cases are slightly different.
First suppose $E$ is a vector space over $\mathbb{R}$, and suppose also that we have an
inner product $\langle\cdot,\cdot\rangle$ such that $\|v\| = \sqrt{\langle v, v\rangle}$. Then
\[
  \langle v + w, v + w\rangle = \|v\|^2 + 2\langle v, w\rangle + \|w\|^2.
\]
So we get
\[
  \langle v, w\rangle = \frac{1}{2}\left(\|v + w\|^2 - \|v\|^2 - \|w\|^2\right). \quad (*)
\]
In particular, the inner product is completely determined by the norm. So this
must be unique.
Now suppose $E$ is a vector space over $\mathbb{C}$. We have
\begin{align}
  \langle v + w, v + w\rangle &= \|v\|^2 + \|w\|^2 + \langle v, w\rangle + \langle w, v\rangle \tag{1}\\
  \langle v - w, v - w\rangle &= \|v\|^2 + \|w\|^2 - \langle v, w\rangle - \langle w, v\rangle \tag{2}\\
  \langle v + iw, v + iw\rangle &= \|v\|^2 + \|w\|^2 - i\langle v, w\rangle + i\langle w, v\rangle \tag{3}\\
  \langle v - iw, v - iw\rangle &= \|v\|^2 + \|w\|^2 + i\langle v, w\rangle - i\langle w, v\rangle \tag{4}
\end{align}
Now consider $(1) - (2) + i(3) - i(4)$. Then we obtain
\[
  \|v + w\|^2 - \|v - w\|^2 + i\|v + iw\|^2 - i\|v - iw\|^2 = 4\langle v, w\rangle. \quad (\dagger)
\]
So again $\langle v, w\rangle$ is determined by the norm.
The identities $(*)$ and $(\dagger)$ are sometimes known as the polarization identities.
Definition (Hilbert space). A Euclidean space $(E, \|\cdot\|)$ is a Hilbert space if it
is complete.
We will prove certain properties of the inner product.
Proposition (Parallelogram law). Let $(E, \|\cdot\|)$ be a Euclidean space. Then
for $v, w \in E$, we have
\[
  \|v - w\|^2 + \|v + w\|^2 = 2\|v\|^2 + 2\|w\|^2.
\]
This is called the parallelogram law because it says that for any parallelogram
with sides $v$ and $w$ and diagonals $v + w$ and $v - w$, the sum of the squares of
the lengths of the diagonals is twice the sum of the squares of the lengths of the
two sides.
Proof. This is just simple algebraic manipulation. We have
\begin{align*}
  \|v - w\|^2 + \|v + w\|^2 &= \langle v - w, v - w\rangle + \langle v + w, v + w\rangle \\
  &= \langle v, v\rangle - \langle v, w\rangle - \langle w, v\rangle + \langle w, w\rangle \\
  &\quad + \langle v, v\rangle + \langle v, w\rangle + \langle w, v\rangle + \langle w, w\rangle \\
  &= 2\langle v, v\rangle + 2\langle w, w\rangle.
\end{align*}
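The law is also a useful test of whether a given norm is Euclidean at all. A small numerical sketch (assuming NumPy): it holds for the usual norm on $\mathbb{R}^3$, but fails for the sup norm, which therefore is induced by no inner product.

```python
import numpy as np

# The parallelogram law holds for the Euclidean norm on R^3...
rng = np.random.default_rng(2)
v, w = rng.normal(size=3), rng.normal(size=3)
lhs1 = np.linalg.norm(v - w)**2 + np.linalg.norm(v + w)**2
rhs1 = 2 * np.linalg.norm(v)**2 + 2 * np.linalg.norm(w)**2
print(abs(lhs1 - rhs1))  # ≈ 0

# ...but fails for the sup norm on R^2, so the sup norm is not induced
# by any inner product.
e1, e2 = np.array([1.0, 0.0]), np.array([0.0, 1.0])
lhs2 = np.linalg.norm(e1 - e2, np.inf)**2 + np.linalg.norm(e1 + e2, np.inf)**2
rhs2 = 2 * np.linalg.norm(e1, np.inf)**2 + 2 * np.linalg.norm(e2, np.inf)**2
print(lhs2, rhs2)  # 2.0 vs 4.0: the law fails
```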
Proposition (Pythagoras theorem). Let $(E, \|\cdot\|)$ be a Euclidean space, and
let $v, w \in E$ be orthogonal. Then
\[
  \|v + w\|^2 = \|v\|^2 + \|w\|^2.
\]
Proof.
\begin{align*}
  \|v + w\|^2 &= \langle v + w, v + w\rangle \\
  &= \langle v, v\rangle + \langle v, w\rangle + \langle w, v\rangle + \langle w, w\rangle \\
  &= \langle v, v\rangle + 0 + 0 + \langle w, w\rangle \\
  &= \|v\|^2 + \|w\|^2.
\end{align*}
By induction, if $v_i \in E$ for $i = 1, \cdots, n$ are such that $\langle v_i, v_j\rangle = 0$ for $i \neq j$, i.e.
they are mutually orthogonal, then
\[
  \left\|\sum_{i=1}^n v_i\right\|^2 = \sum_{i=1}^n \|v_i\|^2.
\]
Proposition. Let $(E, \|\cdot\|)$ be a Euclidean space. Then $\langle\cdot,\cdot\rangle : E \times E \to \mathbb{C}$ is
continuous.
Proof. Let $(v, w) \in E \times E$, and $(\tilde{v}, \tilde{w}) \in E \times E$. We have
\begin{align*}
  |\langle v, w\rangle - \langle \tilde{v}, \tilde{w}\rangle|
  &= |\langle v, w\rangle - \langle v, \tilde{w}\rangle + \langle v, \tilde{w}\rangle - \langle \tilde{v}, \tilde{w}\rangle| \\
  &\leq |\langle v, w\rangle - \langle v, \tilde{w}\rangle| + |\langle v, \tilde{w}\rangle - \langle \tilde{v}, \tilde{w}\rangle| \\
  &= |\langle v, w - \tilde{w}\rangle| + |\langle v - \tilde{v}, \tilde{w}\rangle| \\
  &\leq \|v\|\|w - \tilde{w}\| + \|v - \tilde{v}\|\|\tilde{w}\|.
\end{align*}
Hence for $(v, w)$ sufficiently close to $(\tilde{v}, \tilde{w})$, we can get $|\langle v, w\rangle - \langle \tilde{v}, \tilde{w}\rangle|$
arbitrarily small. So it is continuous.
When we have an incomplete Euclidean space, we can of course take the
completion of it to form a complete extension of the original normed vector
space. However, it is not immediately obvious that the inner product can also
be extended to the completion to give a Hilbert space. The following proposition
tells us we can do so.
Proposition. Let $(E, \|\cdot\|)$ denote a Euclidean space, and $\bar{E}$ its completion.
Then the inner product extends to an inner product on $\bar{E}$, turning $\bar{E}$ into a
Hilbert space.
Proof. Recall we constructed the completion of a space as the equivalence classes
of Cauchy sequences (where two Cauchy sequences $(x_n)$ and $(x_n')$ are equivalent
if $\|x_n - x_n'\| \to 0$). Let $(x_n), (y_n)$ be two Cauchy sequences in $E$, and let
$\tilde{x}, \tilde{y} \in \bar{E}$ denote their equivalence classes. We define the inner product as
\[
  \langle \tilde{x}, \tilde{y}\rangle = \lim_{n\to\infty} \langle x_n, y_n\rangle. \quad (\ddagger)
\]
We want to show this is well-defined. Firstly, we need to make sure the limit
exists. We can show this by showing that $(\langle x_n, y_n\rangle)$ is a Cauchy sequence. We have
\begin{align*}
  |\langle x_n, y_n\rangle - \langle x_m, y_m\rangle|
  &= |\langle x_n, y_n\rangle - \langle x_m, y_n\rangle + \langle x_m, y_n\rangle - \langle x_m, y_m\rangle| \\
  &\leq |\langle x_n, y_n\rangle - \langle x_m, y_n\rangle| + |\langle x_m, y_n\rangle - \langle x_m, y_m\rangle| \\
  &= |\langle x_n - x_m, y_n\rangle| + |\langle x_m, y_n - y_m\rangle| \\
  &\leq \|x_n - x_m\|\|y_n\| + \|x_m\|\|y_n - y_m\|.
\end{align*}
So $\langle x_n, y_n\rangle$ is a Cauchy sequence, since $(x_n)$ and $(y_n)$ are Cauchy (and hence
bounded).
We also need to show that $(\ddagger)$ does not depend on the representatives for $\tilde{x}$
and $\tilde{y}$. This is left as an exercise for the reader.
We also need to show that $\langle\cdot,\cdot\rangle_{\bar{E}}$ induces the norm $\|\cdot\|_{\bar{E}}$, which is yet
another exercise.
Example. Consider the space
\[
  \ell^2 = \left\{(x_1, x_2, \cdots) : x_i \in \mathbb{C}, \sum_{i=1}^{\infty} |x_i|^2 < \infty\right\}.
\]
We already know that this is a complete Banach space. We can also define an
inner product on this space by
\[
  \langle a, b\rangle_{\ell^2} = \sum_{i=1}^{\infty} a_i \bar{b}_i.
\]
We need to check that this actually converges. We prove this by showing absolute
convergence. For each $n$, we can use Cauchy–Schwarz to obtain
\[
  \sum_{i=1}^n |a_i \bar{b}_i|
  \leq \left(\sum_{i=1}^n |a_i|^2\right)^{\frac{1}{2}} \left(\sum_{i=1}^n |b_i|^2\right)^{\frac{1}{2}}
  \leq \|a\|_{\ell^2} \|b\|_{\ell^2}.
\]
So it converges. Now notice that the $\ell^2$ norm is indeed induced by this inner
product.
This is a significant example, since we will later show that every separable (i.e.
has a countable basis) infinite-dimensional Hilbert space is isometrically isomorphic
to $\ell^2$.
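As a concrete sketch (assuming NumPy), we can approximate an $\ell^2$ inner product by truncation. For the square-summable sequences $a_i = 1/i$ and $b_i = 1/i^2$, the inner product is $\sum 1/i^3 = \zeta(3) \approx 1.202$, and the Cauchy–Schwarz bound above is visible numerically:

```python
import numpy as np

# Truncated l^2 inner product for a_i = 1/i, b_i = 1/i^2; the tail beyond
# N contributes less than 1/(2N^2), so truncation is very accurate here.
N = 100_000
i = np.arange(1, N + 1, dtype=float)
a, b = 1 / i, 1 / i**2

ip = np.sum(a * np.conj(b))                    # partial sum of <a, b>
bound = np.linalg.norm(a) * np.linalg.norm(b)  # ||a|| ||b||
print(ip)            # ≈ 1.2021 (ζ(3))
print(ip <= bound)   # True: Cauchy-Schwarz
```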
Definition (Orthogonal space). Let $E$ be a Euclidean space and $S \subseteq E$ an
arbitrary subset. Then the orthogonal space of $S$, denoted by $S^\perp$, is given by
\[
  S^\perp = \{v \in E : \forall w \in S, \langle v, w\rangle = 0\}.
\]
Proposition. Let $E$ be a Euclidean space and $S \subseteq E$. Then $S^\perp$ is a closed
subspace of $E$, and moreover
\[
  S^\perp = (\operatorname{span} S)^\perp.
\]
Proof. We first show it is a subspace. Let $u, v \in S^\perp$ and $\lambda, \mu \in \mathbb{C}$. We want to
show $\lambda u + \mu v \in S^\perp$. Let $w \in S$. Then
\[
  \langle \lambda u + \mu v, w\rangle = \lambda\langle u, w\rangle + \mu\langle v, w\rangle = 0.
\]
To show it is closed, let $u_n \in S^\perp$ be a sequence such that $u_n \to u \in E$. Let
$w \in S$. Then we know that
\[
  \langle u_n, w\rangle = 0.
\]
Hence, by the continuity of the inner product, we have
\[
  0 = \lim_{n\to\infty} \langle u_n, w\rangle = \langle \lim u_n, w\rangle = \langle u, w\rangle.
\]
The remaining part is left as an exercise.
Note that if $V$ is a linear subspace, then $V \cap V^\perp = \{0\}$, since any $v \in V \cap V^\perp$
has to satisfy $\langle v, v\rangle = 0$. So $V + V^\perp$ is a direct sum.
Theorem. Let $(E, \|\cdot\|)$ be a Euclidean space, and $F \subseteq E$ a complete subspace.
Then $F \oplus F^\perp = E$.
Hence, by definition of the direct sum, for $x \in E$, we can write $x = x_1 + x_2$,
where $x_1 \in F$ and $x_2 \in F^\perp$. Moreover, $x_1$ is uniquely characterized by
\[
  \|x_1 - x\| = \inf_{y \in F} \|y - x\|.
\]
(Figure: $x_1$ is the foot of the perpendicular from $x$ to $F$.)
Note that this is not necessarily true if F is not complete.
Proof. We already know that $F \oplus F^\perp$ is a direct sum. It thus suffices to show
that the sum is the whole of $E$.
Let $y_i \in F$ be a sequence with
\[
  \lim_{i\to\infty} \|y_i - x\| = \inf_{y\in F} \|y - x\| = d.
\]
We want to show that $(y_i)$ is a Cauchy sequence. Let $\varepsilon > 0$ be given. Let $n_0 \in \mathbb{N}$
be such that for all $i \geq n_0$, we have
\[
  \|y_i - x\|^2 \leq d^2 + \varepsilon.
\]
We now use the parallelogram law for $v = x - y_i$, $w = x - y_j$ with $i, j \geq n_0$.
Then the parallelogram law says:
\[
  \|v + w\|^2 + \|v - w\|^2 = 2\|v\|^2 + 2\|w\|^2,
\]
or
\[
  \|y_j - y_i\|^2 + \|2x - y_i - y_j\|^2 = 2\|y_i - x\|^2 + 2\|y_j - x\|^2.
\]
Hence, since $\frac{y_i + y_j}{2} \in F$ and so $\left\|x - \frac{y_i + y_j}{2}\right\| \geq d$, we know that
\begin{align*}
  \|y_i - y_j\|^2 &\leq 2\|y_i - x\|^2 + 2\|y_j - x\|^2 - 4\left\|x - \frac{y_i + y_j}{2}\right\|^2 \\
  &\leq 2(d^2 + \varepsilon) + 2(d^2 + \varepsilon) - 4d^2 = 4\varepsilon.
\end{align*}
So $(y_i)$ is a Cauchy sequence. Since $F$ is complete, $y_i \to y$ for some $y \in F$.
Moreover, by continuity of $\|\cdot\|$, we know that
\[
  d = \lim_{i\to\infty} \|y_i - x\| = \|y - x\|.
\]
Now let $x_1 = y$ and $x_2 = x - y$. The only thing left over is to show $x_2 \in F^\perp$.
Suppose not. Then there is some $\tilde{y} \in F$ such that
\[
  \langle \tilde{y}, x_2\rangle \neq 0.
\]
The idea is that we can perturb $y$ by a little bit to get a point even closer to $x$.
By multiplying $\tilde{y}$ with a scalar, we can assume
\[
  \langle \tilde{y}, x_2\rangle > 0.
\]
Then for $t > 0$, we have
\begin{align*}
  \|(y + t\tilde{y}) - x\|^2 &= \langle y + t\tilde{y} - x, y + t\tilde{y} - x\rangle \\
  &= \langle y - x, y - x\rangle + \langle t\tilde{y}, y - x\rangle + \langle y - x, t\tilde{y}\rangle + t^2 \langle \tilde{y}, \tilde{y}\rangle \\
  &= d^2 - 2t\langle \tilde{y}, x_2\rangle + t^2 \|\tilde{y}\|^2.
\end{align*}
Hence for sufficiently small $t$, the $t^2$ term is negligible, and we can make this
less than $d^2$. This is a contradiction, since $y + t\tilde{y} \in F$.
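A finite-dimensional sketch of the theorem (assuming NumPy): take $F$ to be the column span of a matrix in $\mathbb{R}^4$ (finite-dimensional subspaces are complete). Least squares produces the closest point $x_1$, and the residual $x_2 = x - x_1$ is indeed orthogonal to $F$:

```python
import numpy as np

# F = column span of A, a complete subspace of R^4; decompose x = x1 + x2
# with x1 in F and x2 in F^perp, via least squares.
A = np.array([[1.0, 0.0],
              [1.0, 1.0],
              [0.0, 1.0],
              [0.0, 0.0]])
x = np.array([1.0, 2.0, 3.0, 4.0])

coef, *_ = np.linalg.lstsq(A, x, rcond=None)  # minimizes ||A c - x||
x1 = A @ coef                                  # closest point of F to x
x2 = x - x1

print(np.abs(A.T @ x2).max())  # ≈ 0: x2 is orthogonal to F
# Any other point of F is strictly farther from x:
y = A @ np.array([1.0, 1.0])
print(np.linalg.norm(x1 - x) < np.linalg.norm(y - x))  # True
```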
As a corollary, we can define the projection map as follows:
Corollary. Let $E$ be a Euclidean space and $F \subseteq E$ a complete subspace. Then
there exists a projection map $P : E \to E$ defined by $P(x) = x_1$, where $x_1 \in F$ is
as defined in the theorem above. Moreover, $P$ satisfies the following properties:
(i) $P(E) = F$ and $P(F^\perp) = \{0\}$, and $P^2 = P$. In other words, $F^\perp \subseteq \ker P$.
(ii) $(I - P)(E) = F^\perp$, $(I - P)(F) = \{0\}$, $(I - P)^2 = (I - P)$.
(iii) $\|P\|_{B(E,E)} \leq 1$ and $\|I - P\|_{B(E,E)} \leq 1$, with equality if and only if $F \neq \{0\}$
and $F^\perp \neq \{0\}$ respectively.
Here $P$ projects our space onto $F$, while $I - P$ projects our space onto $F^\perp$.
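These properties are easy to observe in a finite-dimensional sketch (assuming NumPy), where the orthogonal projection onto $F = \operatorname{col}(A)$ has the explicit matrix $P = A(A^\top A)^{-1}A^\top$:

```python
import numpy as np

# Orthogonal projection onto F = col(A) in R^4 and its properties.
A = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [1.0, 1.0],
              [0.0, 0.0]])
P = A @ np.linalg.inv(A.T @ A) @ A.T
I = np.eye(4)

print(np.abs(P @ P - P).max())    # ≈ 0: P^2 = P
print(np.abs(P @ (I - P)).max())  # ≈ 0: (I - P) maps into ker P = F^perp
print(np.linalg.norm(P, 2))       # 1.0: ||P|| = 1 since F != {0}
print(np.linalg.norm(I - P, 2))   # 1.0: ||I - P|| = 1 since F^perp != {0}
```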