5.8 Eigenvalues and eigenvectors of a Hermitian matrix
5.8.1 Eigenvalues and eigenvectors
Theorem. The eigenvalues of a Hermitian matrix $H$ are real.
Proof. Suppose that $H$ has eigenvalue $\lambda$ with eigenvector $v \neq 0$. Then
\[ Hv = \lambda v. \]
We pre-multiply by $v^\dagger$, a $1 \times n$ row vector, to obtain
\[ v^\dagger H v = \lambda v^\dagger v. \tag{$*$} \]
We take the Hermitian conjugate of both sides. The left hand side is
\[ (v^\dagger H v)^\dagger = v^\dagger H^\dagger v = v^\dagger H v, \]
since $H$ is Hermitian. The right hand side is
\[ (\lambda v^\dagger v)^\dagger = \lambda^* v^\dagger v. \]
So we have
\[ v^\dagger H v = \lambda^* v^\dagger v. \]
From $(*)$, we know that $\lambda v^\dagger v = \lambda^* v^\dagger v$. Since $v \neq 0$, we know that $v^\dagger v = v \cdot v \neq 0$. So $\lambda = \lambda^*$ and $\lambda$ is real.
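As a quick numerical illustration (not part of the proof), here is a minimal NumPy sketch: we build a random Hermitian matrix as $A + A^\dagger$ and check that its eigenvalues have no imaginary part beyond floating-point round-off.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))
H = A + A.conj().T                       # Hermitian by construction: H† = H

eigenvalues = np.linalg.eigvals(H)
# The imaginary parts vanish up to floating-point round-off.
print(np.max(np.abs(eigenvalues.imag)))  # ~1e-15
```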
Theorem. The eigenvectors of a Hermitian matrix $H$ corresponding to distinct eigenvalues are orthogonal.
Proof. Let
\[ Hv_i = \lambda_i v_i, \tag{i} \]
\[ Hv_j = \lambda_j v_j. \tag{ii} \]
Pre-multiply (i) by $v_j^\dagger$ to obtain
\[ v_j^\dagger H v_i = \lambda_i v_j^\dagger v_i. \tag{iii} \]
Pre-multiply (ii) by $v_i^\dagger$ and take the Hermitian conjugate, using $H^\dagger = H$ and the fact that $\lambda_j$ is real (by the previous theorem), to obtain
\[ v_j^\dagger H v_i = \lambda_j v_j^\dagger v_i. \tag{iv} \]
Equating (iii) and (iv) yields
\[ \lambda_i v_j^\dagger v_i = \lambda_j v_j^\dagger v_i. \]
Since $\lambda_i \neq \lambda_j$, we must have $v_j^\dagger v_i = 0$. So their inner product is zero and they are orthogonal.
So we know that if a Hermitian matrix has $n$ distinct eigenvalues, then the normalized eigenvectors form an orthonormal basis. However, if there are degenerate eigenvalues, it is more difficult, and requires the Gram-Schmidt process.
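Both properties can be checked numerically. A minimal sketch, assuming NumPy, whose `eigh` routine is specialized to Hermitian matrices:

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))
H = A + A.conj().T                     # Hermitian

# eigh returns real eigenvalues and orthonormal eigenvectors (as columns).
lam, V = np.linalg.eigh(H)

# V† V = I: the eigenvectors form an orthonormal basis.
print(np.allclose(V.conj().T @ V, np.eye(4)))  # True
```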
5.8.2 Gram-Schmidt orthogonalization (non-examinable)
Suppose we have a set $B = \{w_1, w_2, \cdots, w_r\}$ of linearly independent vectors. We want to find an orthogonal set $\tilde{B} = \{v_1, v_2, \cdots, v_r\}$.
Define the projection of $w$ onto $v$ by
\[ \mathcal{P}_v(w) = \frac{\langle v \mid w \rangle}{\langle v \mid v \rangle} v. \]
Now construct $\tilde{B}$ iteratively:
(i) $v_1 = w_1$
(ii) $v_2 = w_2 - \mathcal{P}_{v_1}(w_2)$
Then we get that
\[ \langle v_1 \mid v_2 \rangle = \langle v_1 \mid w_2 \rangle - \frac{\langle v_1 \mid w_2 \rangle}{\langle v_1 \mid v_1 \rangle}\langle v_1 \mid v_1 \rangle = 0. \]
(iii) $v_3 = w_3 - \mathcal{P}_{v_1}(w_3) - \mathcal{P}_{v_2}(w_3)$
(iv) $\vdots$
(v) $\displaystyle v_r = w_r - \sum_{j=1}^{r-1} \mathcal{P}_{v_j}(w_r)$
At each step, we subtract out the components of $w_k$ that lie in the space spanned by $\{v_1, \cdots, v_{k-1}\}$. This ensures that all the vectors are orthogonal. Finally, we normalize each basis vector individually to obtain an orthonormal basis.
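The iteration translates directly into code. Here is a minimal NumPy sketch (the function name and test vectors are ours, purely for illustration; this is classical Gram-Schmidt, not the numerically sturdier modified variant):

```python
import numpy as np

def gram_schmidt(ws):
    """Orthonormalize a list of linearly independent vectors.

    Each v_k is w_k minus its projections onto the previously
    constructed v_1, ..., v_{k-1}, using <v|w> = v† w.
    """
    vs = []
    for w in ws:
        v = w.astype(complex)
        for u in vs:
            v = v - (np.vdot(u, w) / np.vdot(u, u)) * u  # subtract P_u(w)
        vs.append(v)
    # Normalize at the end to obtain an orthonormal set.
    return [v / np.linalg.norm(v) for v in vs]

ws = [np.array([1.0, 1.0, 0.0]),
      np.array([1.0, 0.0, 1.0]),
      np.array([0.0, 1.0, 1.0])]
vs = gram_schmidt(ws)
print(abs(np.vdot(vs[0], vs[1])) < 1e-12)  # True: orthogonal
```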
5.8.3 Unitary transformation
Suppose $U$ is the transformation between one orthonormal basis and a new orthonormal basis $\{u_1, u_2, \cdots, u_n\}$, i.e. $\langle u_i \mid u_j \rangle = \delta_{ij}$. Then
\[ U = \begin{pmatrix}
(u_1)_1 & (u_2)_1 & \cdots & (u_n)_1 \\
(u_1)_2 & (u_2)_2 & \cdots & (u_n)_2 \\
\vdots & \vdots & \ddots & \vdots \\
(u_1)_n & (u_2)_n & \cdots & (u_n)_n
\end{pmatrix} \]
Then, with summation over the repeated index $k$ understood,
\[ (U^\dagger U)_{ij} = (U^\dagger)_{ik} U_{kj} = U_{ki}^* U_{kj} = (u_i)_k^* (u_j)_k = \langle u_i \mid u_j \rangle = \delta_{ij}. \]
So U is a unitary matrix.
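As an illustrative sketch (assuming NumPy; the QR factorization is simply a convenient way to manufacture orthonormal columns), we can confirm $U^\dagger U = I$ numerically:

```python
import numpy as np

# The Q factor of a QR factorization has orthonormal columns,
# so it serves as a basis-change matrix U of the kind above.
rng = np.random.default_rng(2)
A = rng.standard_normal((3, 3)) + 1j * rng.standard_normal((3, 3))
U, _ = np.linalg.qr(A)

# (U† U)_{ij} = <u_i | u_j> = δ_{ij}
print(np.allclose(U.conj().T @ U, np.eye(3)))  # True
```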
5.8.4 Diagonalization of n × n Hermitian matrices
Theorem. An $n \times n$ Hermitian matrix has precisely $n$ orthogonal eigenvectors.
Proof. (Non-examinable) Let $\lambda_1, \lambda_2, \cdots, \lambda_r$ be the distinct eigenvalues of $H$ ($r \leq n$), with a set of corresponding orthonormal eigenvectors $B = \{v_1, v_2, \cdots, v_r\}$.
Extend to a basis of the whole of $\mathbb{C}^n$:
\[ B' = \{v_1, v_2, \cdots, v_r, w_1, w_2, \cdots, w_{n-r}\}. \]
Now use Gram-Schmidt to create an orthonormal basis
\[ \tilde{B} = \{v_1, v_2, \cdots, v_r, u_1, u_2, \cdots, u_{n-r}\}. \]
Now write
\[ P = \begin{pmatrix}
\uparrow & \uparrow & & \uparrow & \uparrow & & \uparrow \\
v_1 & v_2 & \cdots & v_r & u_1 & \cdots & u_{n-r} \\
\downarrow & \downarrow & & \downarrow & \downarrow & & \downarrow
\end{pmatrix} \]
We have shown above that this is a unitary matrix, i.e. $P^{-1} = P^\dagger$. So if we change basis, we have
\[ P^{-1} H P = P^\dagger H P = \begin{pmatrix}
\lambda_1 & 0 & \cdots & 0 & 0 & 0 & \cdots & 0 \\
0 & \lambda_2 & \cdots & 0 & 0 & 0 & \cdots & 0 \\
\vdots & \vdots & \ddots & \vdots & \vdots & \vdots & & \vdots \\
0 & 0 & \cdots & \lambda_r & 0 & 0 & \cdots & 0 \\
0 & 0 & \cdots & 0 & c_{11} & c_{12} & \cdots & c_{1,n-r} \\
0 & 0 & \cdots & 0 & c_{21} & c_{22} & \cdots & c_{2,n-r} \\
\vdots & \vdots & & \vdots & \vdots & \vdots & \ddots & \vdots \\
0 & 0 & \cdots & 0 & c_{n-r,1} & c_{n-r,2} & \cdots & c_{n-r,n-r}
\end{pmatrix} \]
Here $C = (c_{ij})$ is an $(n - r) \times (n - r)$ Hermitian matrix. The eigenvalues of $C$ are also eigenvalues of $H$, because
\[ \det(H - \lambda I) = \det(P^\dagger H P - \lambda I) = (\lambda_1 - \lambda) \cdots (\lambda_r - \lambda) \det(C - \lambda I). \]
We can keep repeating the process on $C$ until we finish all rows. For example, if the eigenvalues of $C$ are all distinct, there are $n - r$ orthonormal eigenvectors $w_j$ (for $j = r + 1, \cdots, n$) of $C$. Let
\[ Q = \begin{pmatrix}
1 & & & & & & \\
& \ddots & & & & & \\
& & 1 & & & & \\
& & & \uparrow & \uparrow & & \uparrow \\
& & & w_{r+1} & w_{r+2} & \cdots & w_n \\
& & & \downarrow & \downarrow & & \downarrow
\end{pmatrix} \]
with all other entries 0, i.e. an $r \times r$ identity block in the top left corner and an $(n - r) \times (n - r)$ block whose columns are formed by the $w_j$.
Since the columns of $Q$ are orthonormal, $Q$ is unitary. So
\[ Q^\dagger P^\dagger H P Q = \operatorname{diag}(\lambda_1, \lambda_2, \cdots, \lambda_r, \lambda_{r+1}, \cdots, \lambda_n), \]
where the first $r$ $\lambda$s are distinct and the remaining ones are copies of previous ones.
The $n$ linearly independent eigenvectors are the columns of $PQ$.
So it now follows that $H$ is diagonalizable via the transformation $U = PQ$. $U$ is a unitary matrix because $P$ and $Q$ are. We have
\[ D = U^\dagger H U, \qquad H = U D U^\dagger. \]
Note that a real symmetric matrix $S$ is a special case of a Hermitian matrix. So we have
\[ D = Q^T S Q, \qquad S = Q D Q^T \]
with $Q$ orthogonal.
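A minimal numerical sketch of these identities, assuming NumPy (`eigh` returns the unitary $U$ directly):

```python
import numpy as np

rng = np.random.default_rng(3)
A = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))
H = A + A.conj().T                      # Hermitian

lam, U = np.linalg.eigh(H)              # columns of U are orthonormal eigenvectors
D = np.diag(lam)

print(np.allclose(U.conj().T @ H @ U, D))   # D = U† H U
print(np.allclose(U @ D @ U.conj().T, H))   # H = U D U†
```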
Example. Find the orthogonal matrix which diagonalizes the following real symmetric matrix:
\[ S = \begin{pmatrix} 1 & \beta \\ \beta & 1 \end{pmatrix} \]
with $\beta \neq 0$, $\beta \in \mathbb{R}$.
We find the eigenvalues by solving the characteristic equation $\det(S - \lambda I) = 0$, and obtain $\lambda = 1 \pm \beta$.
The corresponding eigenvectors satisfy $(S - \lambda I)x = 0$, which gives
\[ x = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 \\ \pm 1 \end{pmatrix}. \]
We change the basis from the standard basis to
\[ \frac{1}{\sqrt{2}} \begin{pmatrix} 1 \\ 1 \end{pmatrix}, \quad \frac{1}{\sqrt{2}} \begin{pmatrix} 1 \\ -1 \end{pmatrix} \]
(which is just a rotation by $\pi/4$).
The transformation matrix is
\[ Q = \begin{pmatrix} 1/\sqrt{2} & 1/\sqrt{2} \\ 1/\sqrt{2} & -1/\sqrt{2} \end{pmatrix}. \]
Then we know that $S = QDQ^T$ with $D = \operatorname{diag}(1 + \beta, 1 - \beta)$.
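We can check the factorization numerically; a minimal sketch, assuming NumPy and picking an arbitrary nonzero $\beta$:

```python
import numpy as np

beta = 0.5                               # any nonzero real beta
S = np.array([[1.0, beta], [beta, 1.0]])

Q = np.array([[1.0,  1.0],
              [1.0, -1.0]]) / np.sqrt(2)
D = np.diag([1 + beta, 1 - beta])

print(np.allclose(Q @ D @ Q.T, S))       # True: S = Q D Q^T
```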
5.8.5 Normal matrices
We have seen that the eigenvalues and eigenvectors of Hermitian matrices satisfy
some nice properties. More generally, we can define the following:
Definition (Normal matrix). A normal matrix is a matrix that commutes with its own Hermitian conjugate, i.e.
\[ N N^\dagger = N^\dagger N. \]
Hermitian, real symmetric, skew-Hermitian, real antisymmetric, orthogonal and unitary matrices are all special cases of normal matrices.
It can be shown that:
Proposition.
(i) If $\lambda$ is an eigenvalue of $N$, then $\lambda^*$ is an eigenvalue of $N^\dagger$.
(ii) The eigenvectors of distinct eigenvalues are orthogonal.
(iii) A normal matrix can always be diagonalized with an orthonormal basis of eigenvectors.
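For instance (a NumPy sketch; the rotation matrix is our choice of a normal but non-Hermitian example), a real rotation has complex eigenvalues $e^{\pm i\theta}$, yet it still diagonalizes with orthonormal eigenvectors:

```python
import numpy as np

# A rotation matrix is orthogonal, hence normal, but not symmetric.
theta = 0.7
N = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])

print(np.allclose(N @ N.conj().T, N.conj().T @ N))   # N N† = N† N: normal

# Eigenvalues are e^{±iθ} (complex), but the eigenvectors are orthonormal,
# so N = V diag(λ) V† with V unitary.
lam, V = np.linalg.eig(N)
print(np.allclose(V.conj().T @ V, np.eye(2)))        # True
print(np.allclose(V @ np.diag(lam) @ V.conj().T, N)) # True
```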