8 Inner product spaces

IB Linear Algebra

8.4 Spectral theory

We are going to classify matrices in inner product spaces. Recall that for general vector spaces, what we effectively did was to find the orbits of the conjugation action of GL(V) on Mat_n(F). If we have inner product spaces, we will want to look at the action of O(V) or U(V) on Mat_n(F). In more human language, instead of allowing arbitrary basis transformations, we only allow transformations between orthonormal bases.

We are not going to classify all endomorphisms, but just the self-adjoint and orthogonal/unitary ones.

Lemma. Let V be a finite-dimensional inner product space, and α ∈ End(V) self-adjoint. Then

(i) α has a real eigenvalue, and all eigenvalues of α are real.

(ii) Eigenvectors of α with distinct eigenvalues are orthogonal.

Proof. We are going to do the real and complex cases separately.

(i) Suppose first that V is a complex inner product space. Then by the fundamental theorem of algebra, α has an eigenvalue, say λ. We pick v ∈ V \ {0} such that αv = λv. Then

λ̄(v, v) = (λv, v) = (αv, v) = (v, αv) = (v, λv) = λ(v, v).

Since v ≠ 0, we know (v, v) ≠ 0. So λ = λ̄.

For the real case, we pretend we are in the complex case. Let (e_1, ..., e_n) be an orthonormal basis for V. Then α is represented by a symmetric matrix A (with respect to this basis). Since real symmetric matrices are Hermitian when viewed as complex matrices, A gives a self-adjoint endomorphism of C^n. By the complex case, A has real eigenvalues only. But the eigenvalues of A are the eigenvalues of α, and M_A(t) = M_α(t). So done.

Alternatively, we can prove this without reducing to the complex case. We know every irreducible factor of M_α(t) in R[t] must have degree 1 or 2, since the roots are either real or come in complex conjugate pairs. Suppose f(t) were an irreducible factor of degree 2. Then (M_α/f)(α) ≠ 0, since M_α/f has degree less than the minimal polynomial. So there is some v ∈ V such that w = (M_α/f)(α)(v) ≠ 0. Since M_α(α) = 0, it must be that f(α)(w) = 0. Let U = ⟨w, α(w)⟩. Then this is an α-invariant subspace of V, since f has degree 2.

Now α|_U ∈ End(U) is self-adjoint. So if (e_1, e_2) is an orthonormal basis of U, then α|_U is represented by a real symmetric matrix, say

( a  b )
( b  c )

But then χ_{α|_U}(t) = (t − a)(t − c) − b², which has real roots, namely

(a + c)/2 ± √(((a − c)/2)² + b²).

This is a contradiction: since f(α)(w) = 0 and U is spanned by w and α(w), we have f(α|_U) = 0, so M_{α|_U} divides the irreducible f, forcing M_{α|_U} = f; but the real roots of χ_{α|_U} are roots of M_{α|_U} = f, which has none.

(ii) Now suppose αv = λv, αw = µw and λ ≠ µ. We need to show (v, w) = 0. We know

(αv, w) = (v, αw)

by definition. Since λ and µ are real by (i), this gives

λ(v, w) = µ(v, w).

Since λ ≠ µ, we must have (v, w) = 0.
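Both parts of the lemma can be sanity-checked numerically. The following sketch (assuming numpy is available; the matrix is an arbitrary illustrative choice, not from the notes) verifies that a real symmetric matrix has real eigenvalues, and that eigenvectors for distinct eigenvalues are orthogonal:

```python
import numpy as np

# An arbitrary real symmetric matrix (illustrative choice).
A = np.array([[2.0, 1.0],
              [1.0, 3.0]])

# np.linalg.eigh is the routine for symmetric/Hermitian input; it
# returns eigenvalues in ascending order with orthonormal eigenvectors.
eigenvalues, eigenvectors = np.linalg.eigh(A)

# (i) All eigenvalues are real (eigh returns a real array here).
assert eigenvalues.dtype.kind == 'f'

# (ii) The eigenvectors for the two distinct eigenvalues are orthogonal.
v, w = eigenvectors[:, 0], eigenvectors[:, 1]
assert abs(np.dot(v, w)) < 1e-12
```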

Theorem. Let V be a finite-dimensional inner product space, and α ∈ End(V) self-adjoint. Then V has an orthonormal basis of eigenvectors of α.

Proof. By the previous lemma, α has a real eigenvalue, say λ. Then we can find an eigenvector v ∈ V \ {0} such that αv = λv.

Let U = ⟨v⟩⊥. Then we can write

V = ⟨v⟩ ⊥ U.

We now want to prove α sends U into U. Suppose u ∈ U. Then

(v, α(u)) = (αv, u) = λ(v, u) = 0.

So α(u) ∈ ⟨v⟩⊥ = U. So α|_U ∈ End(U) and is self-adjoint.

By induction on dim V, U has an orthonormal basis (v_2, ..., v_n) of α-eigenvectors. Now let

v_1 = v/‖v‖.

Then (v_1, v_2, ..., v_n) is an orthonormal basis of eigenvectors for α.

Corollary. Let V be a finite-dimensional inner product space and α ∈ End(V) self-adjoint. Then V is the orthogonal (internal) direct sum of its α-eigenspaces.

Corollary. Let A ∈ Mat_n(R) be symmetric. Then there exists an orthogonal matrix P such that P^T AP = P^{−1}AP is diagonal.

Proof. Let (·, ·) be the standard inner product on R^n. Then A is self-adjoint as an endomorphism of R^n. So R^n has an orthonormal basis of eigenvectors for A, say (v_1, ..., v_n). Taking P = (v_1 v_2 ··· v_n) gives the result.
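A numerical sketch of this corollary (assuming numpy; A is an arbitrary symmetric example): `np.linalg.eigh` produces exactly such an orthogonal P.

```python
import numpy as np

# An arbitrary real symmetric matrix.
A = np.array([[4.0, 1.0, 0.0],
              [1.0, 3.0, 1.0],
              [0.0, 1.0, 2.0]])

# The columns of P are orthonormal eigenvectors of A.
eigenvalues, P = np.linalg.eigh(A)

# P is orthogonal: P^T P = I, so P^T = P^{-1}.
assert np.allclose(P.T @ P, np.eye(3))

# P^T A P is the diagonal matrix of eigenvalues.
assert np.allclose(P.T @ A @ P, np.diag(eigenvalues))
```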

Corollary. Let V be a finite-dimensional real inner product space and ψ : V × V → R a symmetric bilinear form. Then there exists an orthonormal basis (v_1, ..., v_n) for V with respect to which ψ is represented by a diagonal matrix.

Proof. Let (u_1, ..., u_n) be any orthonormal basis for V. Then ψ is represented by a symmetric matrix A. So there exists an orthogonal matrix P such that P^T AP is diagonal. Now let v_i = Σ_k P_{ki} u_k. Then (v_1, ..., v_n) is an orthonormal basis, since

(v_i, v_j) = (Σ_k P_{ki} u_k, Σ_ℓ P_{ℓj} u_ℓ) = Σ_{k,ℓ} P^T_{ik} (u_k, u_ℓ) P_{ℓj} = [P^T P]_{ij} = δ_{ij}.

Also, ψ is represented by P^T AP with respect to (v_1, ..., v_n).

Note that the diagonal entries of P^T AP are just the eigenvalues of A. So the signature of ψ is just the number of positive eigenvalues of A minus the number of negative eigenvalues of A.
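So the signature can be read off from the spectrum. A sketch (assuming numpy; the matrix is an arbitrary example representing ψ in some orthonormal basis):

```python
import numpy as np

# A symmetric matrix representing psi in an orthonormal basis.
A = np.array([[1.0, 2.0],
              [2.0, 1.0]])

# eigvalsh returns the (real) eigenvalues of a symmetric matrix.
# Here the characteristic polynomial is (t-1)^2 - 4, with roots -1 and 3.
eigenvalues = np.linalg.eigvalsh(A)

# Signature = (# positive eigenvalues) - (# negative eigenvalues).
signature = int(np.sum(eigenvalues > 0)) - int(np.sum(eigenvalues < 0))
```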

Corollary. Let V be a finite-dimensional real vector space and φ, ψ symmetric bilinear forms on V such that φ is positive-definite. Then we can find a basis (v_1, ..., v_n) for V such that both φ and ψ are represented by diagonal matrices with respect to this basis.

Proof. We use φ to define an inner product on V. Choose an orthonormal basis (v_1, ..., v_n) for V (equipped with φ) with respect to which ψ is diagonal. Then φ is represented by I with respect to this basis, since φ(v_i, v_j) = δ_{ij}. So done.

Corollary. If A, B ∈ Mat_n(R) are symmetric and A is positive definite (i.e. v^T Av > 0 for all v ∈ R^n \ {0}), then there exists an invertible matrix Q such that Q^T AQ and Q^T BQ are both diagonal.
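The proof idea translates directly into a two-step computation. A sketch assuming numpy (A and B are arbitrary illustrative choices, with A positive definite): first normalize A to the identity using the spectral theorem, then orthogonally diagonalize what B becomes.

```python
import numpy as np

A = np.array([[2.0, 1.0],     # symmetric, positive definite (eigenvalues 1, 3)
              [1.0, 2.0]])
B = np.array([[1.0,  0.0],    # symmetric, indefinite
              [0.0, -1.0]])

# Step 1: A = P diag(d) P^T with P orthogonal; then Q1 = P diag(1/sqrt(d))
# satisfies Q1^T A Q1 = I (this uses d > 0, i.e. positive definiteness).
d, P = np.linalg.eigh(A)
Q1 = P @ np.diag(1.0 / np.sqrt(d))

# Step 2: Q1^T B Q1 is still symmetric; diagonalize it orthogonally.
e, R = np.linalg.eigh(Q1.T @ B @ Q1)
Q = Q1 @ R                     # invertible: a product of invertible matrices

assert np.allclose(Q.T @ A @ Q, np.eye(2))   # A becomes I
assert np.allclose(Q.T @ B @ Q, np.diag(e))  # B becomes diagonal
```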

We can deduce similar results for complex finite-dimensional vector spaces, with the same proofs. In particular,

Proposition.

(i) If A ∈ Mat_n(C) is Hermitian, then there exists a unitary matrix U ∈ Mat_n(C) such that U^{−1}AU = U†AU is diagonal.

(ii) If ψ is a Hermitian form on a finite-dimensional complex inner product space V, then there is an orthonormal basis for V diagonalizing ψ.

(iii) If φ, ψ are Hermitian forms on a finite-dimensional complex vector space and φ is positive definite, then there exists a basis for which φ and ψ are diagonalized.

(iv) Let A, B ∈ Mat_n(C) be Hermitian, with A positive definite (i.e. v†Av > 0 for all v ∈ C^n \ {0}). Then there exists some invertible Q such that Q†AQ and Q†BQ are diagonal.
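Part (i) is what `np.linalg.eigh` computes in the complex case too. A sketch (assuming numpy; the Hermitian matrix is an arbitrary example):

```python
import numpy as np

# An arbitrary Hermitian matrix: A equals its conjugate transpose.
A = np.array([[2.0,        1.0 - 1.0j],
              [1.0 + 1.0j, 3.0       ]])
assert np.allclose(A, A.conj().T)

# For Hermitian input, eigh returns real eigenvalues and a unitary U.
eigenvalues, U = np.linalg.eigh(A)

assert eigenvalues.dtype.kind == 'f'                          # real eigenvalues
assert np.allclose(U.conj().T @ U, np.eye(2))                 # U is unitary
assert np.allclose(U.conj().T @ A @ U, np.diag(eigenvalues))  # U'AU diagonal
```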

That’s all for self-adjoint matrices. How about unitary matrices?

Theorem. Let V be a finite-dimensional complex inner product space and α ∈ U(V) be unitary. Then V has an orthonormal basis of α-eigenvectors.

Proof. By the fundamental theorem of algebra, there exist v ∈ V \ {0} and λ ∈ C such that αv = λv. Now consider W = ⟨v⟩⊥. Then

V = W ⊥ ⟨v⟩.

We want to show α restricts to a (unitary) endomorphism of W. Let w ∈ W. We need to show α(w) is orthogonal to v. We have

(αw, v) = (w, α^{−1}v) = (w, λ^{−1}v) = 0.

So α(w) ∈ W, and α|_W ∈ End(W). Also, α|_W is unitary since α is. So by induction on dim V, W has an orthonormal basis of α-eigenvectors. If we add v/‖v‖ to this basis, we get an orthonormal basis of V itself consisting of α-eigenvectors.
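Numerically (a sketch assuming numpy; the unitary matrix is generated from a seeded QR factorization, so this is an illustration rather than a proof): the eigenvalues of a unitary matrix lie on the unit circle, and when the eigenvalues are distinct the normalized eigenvectors returned by `np.linalg.eig` come out numerically orthonormal.

```python
import numpy as np

rng = np.random.default_rng(0)

# Build a unitary matrix as the Q factor of a complex QR decomposition.
M = rng.standard_normal((3, 3)) + 1j * rng.standard_normal((3, 3))
Uni, _ = np.linalg.qr(M)
assert np.allclose(Uni.conj().T @ Uni, np.eye(3))   # Uni is unitary

eigenvalues, V = np.linalg.eig(Uni)

# Unitary maps are isometries, so every eigenvalue has modulus 1.
assert np.allclose(np.abs(eigenvalues), 1.0)

# With distinct eigenvalues, the normalized eigenvectors are orthogonal,
# giving an orthonormal eigenbasis as in the theorem.
assert np.allclose(V.conj().T @ V, np.eye(3), atol=1e-6)
```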

This theorem and the analogous one for self-adjoint endomorphisms have a common generalization, at least for complex inner product spaces. The key fact that leads to the existence of an orthonormal basis of eigenvectors is that α and α* commute. This is clearly a necessary condition: if α is diagonal with respect to an orthonormal basis, then α* is represented by the conjugate transpose, which is diagonal in the same basis, and hence they commute. It turns out this is also a sufficient condition, as you will show in example sheet 4.
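This sufficient direction can be illustrated numerically. A sketch (assuming numpy): conjugating a diagonal matrix with arbitrary complex entries by a unitary matrix yields a normal matrix that is neither Hermitian nor unitary, yet still has an orthonormal eigenbasis.

```python
import numpy as np

rng = np.random.default_rng(1)

# A unitary matrix from a complex QR decomposition.
M = rng.standard_normal((3, 3)) + 1j * rng.standard_normal((3, 3))
U, _ = np.linalg.qr(M)

# Distinct eigenvalues that are neither all real (the Hermitian case)
# nor all of modulus 1 (the unitary case).
D = np.diag([1.0 + 2.0j, 3.0, -1.0j])
A = U @ D @ U.conj().T

# A is normal: it commutes with its adjoint.
assert np.allclose(A @ A.conj().T, A.conj().T @ A)

# ...and it has an orthonormal basis of eigenvectors.
eigenvalues, V = np.linalg.eig(A)
assert np.allclose(V.conj().T @ V, np.eye(3), atol=1e-6)
```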

However, we cannot generalize this to the real orthogonal case. For example,

(  cos θ   sin θ )
( −sin θ   cos θ )  ∈ O(R²)

cannot be diagonalized (if θ ∉ πZ). However, in example sheet 4, you will find a classification of O(V), and you will see that the above counterexample is the worst that can happen in some sense.
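The obstruction is visible in the spectrum: the rotation matrix above has no real eigenvalues at all. A quick check (assuming numpy, with θ = 1 as an arbitrary angle not in πZ):

```python
import numpy as np

theta = 1.0   # any angle that is not a multiple of pi

R = np.array([[ np.cos(theta), np.sin(theta)],
              [-np.sin(theta), np.cos(theta)]])

# The characteristic polynomial is t^2 - 2cos(theta) t + 1, whose roots
# are the complex conjugate pair e^{+i theta}, e^{-i theta}: no real
# roots, so R is not diagonalizable over R.
eigenvalues = np.linalg.eigvals(R)

assert np.all(np.abs(eigenvalues.imag) > 1e-6)   # genuinely complex
assert np.allclose(np.abs(eigenvalues), 1.0)     # on the unit circle
```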