3.4 Matrices
In the examples above, we have represented our linear maps by some object $R$ such that $x_i' = R_{ij}x_j$. We call $R$ the matrix for the linear map. In general, let $\alpha: \mathbb{R}^n \to \mathbb{R}^m$ be a linear map, and $x' = \alpha(x)$.
Let $\{e_i\}$ be a basis of $\mathbb{R}^n$. Then $x = x_j e_j$ for some $x_j$. Then we get
\[ x' = \alpha(x_j e_j) = x_j \alpha(e_j). \]
So we get that
\[ x_i' = [\alpha(e_j)]_i x_j. \]
We now define $A_{ij} = [\alpha(e_j)]_i$. Then $x_i' = A_{ij}x_j$. We write
\[ A = \{A_{ij}\} = \begin{pmatrix} A_{11} & \cdots & A_{1n} \\ \vdots & A_{ij} & \vdots \\ A_{m1} & \cdots & A_{mn} \end{pmatrix}. \]
Here $A_{ij}$ is the entry in the $i$th row of the $j$th column. We say that $A$ is an $m \times n$ matrix, and write $x' = Ax$.
We see that the columns of the matrix are the images of the standard basis vectors under the mapping $\alpha$.
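This observation gives a direct way to compute the matrix of any linear map: feed it the standard basis and stack the results as columns. A minimal NumPy sketch (the helper `matrix_of` and the example map, a rotation of the plane by $90°$, are our own illustrative choices, not part of the notes):

```python
import numpy as np

def matrix_of(alpha, n):
    """Columns are alpha(e_1), ..., alpha(e_n): the matrix of the linear map alpha."""
    return np.column_stack([alpha(e) for e in np.eye(n)])

# Illustrative linear map: rotation of R^2 by 90 degrees, (x, y) -> (-y, x).
alpha = lambda x: np.array([-x[1], x[0]])

A = matrix_of(alpha, 2)
x = np.array([3.0, 4.0])
# x' = A x agrees with alpha(x), since A_ij = [alpha(e_j)]_i.
assert np.allclose(A @ x, alpha(x))
print(A)  # [[ 0. -1.]
          #  [ 1.  0.]]
```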
3.4.1 Examples

Example.
(i) In $\mathbb{R}^2$, consider a reflection in a line at an angle $\theta$ to the $x$-axis. We know that $\hat{\imath} \mapsto \cos 2\theta\, \hat{\imath} + \sin 2\theta\, \hat{\jmath}$, with $\hat{\jmath} \mapsto -\cos 2\theta\, \hat{\jmath} + \sin 2\theta\, \hat{\imath}$. Then the matrix is
\[ \begin{pmatrix} \cos 2\theta & \sin 2\theta \\ \sin 2\theta & -\cos 2\theta \end{pmatrix}. \]
(ii) In $\mathbb{R}^3$, as we've previously seen, a rotation by $\theta$ about the $z$-axis is given by
\[ R = \begin{pmatrix} \cos\theta & -\sin\theta & 0 \\ \sin\theta & \cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix}. \]
(iii) In $\mathbb{R}^3$, a reflection in the plane with normal $\hat{n}$ is given by $R_{ij} = \delta_{ij} - 2\hat{n}_i\hat{n}_j$. Written as a matrix, we have
\[ \begin{pmatrix} 1 - 2\hat{n}_1^2 & -2\hat{n}_1\hat{n}_2 & -2\hat{n}_1\hat{n}_3 \\ -2\hat{n}_2\hat{n}_1 & 1 - 2\hat{n}_2^2 & -2\hat{n}_2\hat{n}_3 \\ -2\hat{n}_3\hat{n}_1 & -2\hat{n}_3\hat{n}_2 & 1 - 2\hat{n}_3^2 \end{pmatrix}. \]
(iv) Dilation ("stretching") $\alpha: \mathbb{R}^3 \to \mathbb{R}^3$ is given by the map $(x, y, z) \mapsto (\lambda x, \mu y, \nu z)$ for some $\lambda, \mu, \nu$. The matrix is
\[ \begin{pmatrix} \lambda & 0 & 0 \\ 0 & \mu & 0 \\ 0 & 0 & \nu \end{pmatrix}. \]
(v) Shear: Consider $S: \mathbb{R}^3 \to \mathbb{R}^3$ that shears in the $x$ direction:

[Diagram: a shear in the $x$ direction, carrying $x$ to $x'$ while leaving $y$ fixed.]

We have $(x, y, z) \mapsto (x + \lambda y, y, z)$. Then
\[ S = \begin{pmatrix} 1 & \lambda & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}. \]
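These matrices are easy to check numerically. For example (iii), a short NumPy sketch (the particular unit normal is an arbitrary choice) that builds $R_{ij} = \delta_{ij} - 2\hat{n}_i\hat{n}_j$ and confirms it reverses $\hat{n}$ while fixing the plane:

```python
import numpy as np

n = np.array([1.0, 2.0, 2.0])
n /= np.linalg.norm(n)                 # the normal must be a unit vector

R = np.eye(3) - 2 * np.outer(n, n)     # R_ij = delta_ij - 2 n_i n_j

v = np.array([0.0, 1.0, -1.0])         # v . n = 0, so v lies in the plane
assert np.allclose(R @ n, -n)          # the normal is reversed
assert np.allclose(R @ v, v)           # in-plane vectors are fixed
assert np.allclose(R @ R, np.eye(3))   # reflecting twice gives the identity
```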
3.4.2 Matrix Algebra
This part consists mostly of a whole lot of definitions, saying what we can do with matrices and classifying them into different types.
Definition (Addition of matrices). Consider two linear maps $\alpha, \beta: \mathbb{R}^n \to \mathbb{R}^m$. The sum of $\alpha$ and $\beta$ is defined by
\[ (\alpha + \beta)(x) = \alpha(x) + \beta(x). \]
In terms of the matrix, we have
\[ (A + B)_{ij}x_j = A_{ij}x_j + B_{ij}x_j, \]
or
\[ (A + B)_{ij} = A_{ij} + B_{ij}. \]
Definition (Scalar multiplication of matrices). Define $(\lambda\alpha)(x) = \lambda[\alpha(x)]$. So $(\lambda A)_{ij} = \lambda A_{ij}$.
Definition (Matrix multiplication). Consider maps $\alpha: \mathbb{R}^\ell \to \mathbb{R}^n$ and $\beta: \mathbb{R}^n \to \mathbb{R}^m$. The composition is $\beta\alpha: \mathbb{R}^\ell \to \mathbb{R}^m$. Take $x \in \mathbb{R}^\ell \mapsto x'' \in \mathbb{R}^m$. Then $x'' = (BA)x = Bx'$, where $x' = Ax$. Using suffix notation, we have
\[ x_i'' = (Bx')_i = B_{ik}x_k' = B_{ik}A_{kj}x_j. \]
But $x_i'' = (BA)_{ij}x_j$. So
\[ (BA)_{ij} = B_{ik}A_{kj}. \]
Generally, an $m \times n$ matrix multiplied by an $n \times \ell$ matrix gives an $m \times \ell$ matrix. $(BA)_{ij}$ is given by the $i$th row of $B$ dotted with the $j$th column of $A$.
Note that the number of columns of $B$ has to be equal to the number of rows of $A$ for the multiplication to be defined. If $\ell = m$ as well, then both $BA$ and $AB$ make sense, but $AB \neq BA$ in general. In fact, they don't even have to have the same dimensions.
Also, since function composition is associative, we get $A(BC) = (AB)C$.
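For instance (a NumPy sketch with arbitrarily chosen small integer matrices):

```python
import numpy as np

A = np.array([[1, 2], [3, 4]])
B = np.array([[0, 1], [1, 0]])
C = np.array([[2, 0], [0, 3]])

print(np.array_equal(A @ B, B @ A))              # False: AB != BA in general
print(np.array_equal(A @ (B @ C), (A @ B) @ C))  # True: multiplication is associative
```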
Definition (Transpose of matrix). If $A$ is an $m \times n$ matrix, the transpose $A^T$ is an $n \times m$ matrix defined by $(A^T)_{ij} = A_{ji}$.
Proposition.
(i) $(A^T)^T = A$.
(ii) If $x$ is a column vector $\begin{pmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{pmatrix}$, then $x^T$ is the row vector $(x_1\ x_2\ \cdots\ x_n)$.
(iii) $(AB)^T = B^TA^T$, since $((AB)^T)_{ij} = (AB)_{ji} = A_{jk}B_{ki} = B_{ki}A_{jk} = (B^T)_{ik}(A^T)_{kj} = (B^TA^T)_{ij}$.
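A quick numerical check of (iii), using NumPy with arbitrary rectangular matrices (note that $A^TB^T$ is not even defined for these shapes):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 4))
B = rng.standard_normal((4, 2))

# (AB)^T = B^T A^T; here A^T B^T would be a 4x3 times 2x4 product, undefined.
assert np.allclose((A @ B).T, B.T @ A.T)
```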
Definition (Hermitian conjugate). Define $A^\dagger = (A^T)^*$. Similarly, $(AB)^\dagger = B^\dagger A^\dagger$.
Definition (Symmetric matrix). A matrix is symmetric if $A^T = A$.
Definition (Hermitian matrix). A matrix is Hermitian if $A^\dagger = A$. (The diagonal of a Hermitian matrix must be real.)
Definition (Anti/skew symmetric matrix). A matrix is anti-symmetric or skew symmetric if $A^T = -A$. The diagonals are all zero.
Definition (Skew-Hermitian matrix). A matrix is skew-Hermitian if $A^\dagger = -A$. The diagonals are pure imaginary.
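In NumPy the Hermitian conjugate is `M.conj().T`; a small sketch checking the diagonal claims on two hand-picked matrices:

```python
import numpy as np

dagger = lambda M: M.conj().T          # Hermitian conjugate: (M^T)^*

H = np.array([[2.0, 1 - 1j],
              [1 + 1j, 3.0]])          # Hermitian: H^dagger = H
S = np.array([[1j, 2 + 1j],
              [-2 + 1j, 3j]])          # skew-Hermitian: S^dagger = -S

assert np.allclose(dagger(H), H)
assert np.allclose(dagger(S), -S)
assert np.allclose(H.diagonal().imag, 0)   # Hermitian diagonal is real
assert np.allclose(S.diagonal().real, 0)   # skew-Hermitian diagonal is pure imaginary
```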
Definition (Trace of matrix). The trace of an $n \times n$ matrix $A$ is the sum of the diagonal: $\operatorname{tr}(A) = A_{ii}$.
Example. Consider the reflection matrix $R_{ij} = \delta_{ij} - 2\hat{n}_i\hat{n}_j$. We have $\operatorname{tr}(R) = R_{ii} = 3 - 2\hat{n}\cdot\hat{n} = 3 - 2 = 1$.
Proposition. $\operatorname{tr}(BC) = \operatorname{tr}(CB)$.

Proof. $\operatorname{tr}(BC) = B_{ik}C_{ki} = C_{ki}B_{ik} = (CB)_{kk} = \operatorname{tr}(CB)$.
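A numerical check of this, assuming NumPy (note that $B$ and $C$ need not be square, only of compatible shapes):

```python
import numpy as np

rng = np.random.default_rng(1)
B = rng.standard_normal((3, 5))
C = rng.standard_normal((5, 3))

# BC is 3x3 and CB is 5x5, yet their traces agree.
assert np.isclose(np.trace(B @ C), np.trace(C @ B))
```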
Definition (Identity matrix). $I_{ij} = \delta_{ij}$.
3.4.3 Decomposition of an n × n matrix
Any $n \times n$ matrix $B$ can be split as a sum of symmetric and antisymmetric parts. Write
\[ B_{ij} = \underbrace{\frac{1}{2}(B_{ij} + B_{ji})}_{S_{ij}} + \underbrace{\frac{1}{2}(B_{ij} - B_{ji})}_{A_{ij}}. \]
We have $S_{ij} = S_{ji}$, so $S$ is symmetric, while $A_{ji} = -A_{ij}$, and $A$ is antisymmetric. So $B = S + A$.
Furthermore, we can decompose $S$ into an isotropic part (a scalar multiple of the identity) plus a trace-less part (i.e. sum of diagonal = 0). Write
\[ S_{ij} = \underbrace{\frac{1}{n}\operatorname{tr}(S)\delta_{ij}}_{\text{isotropic part}} + \underbrace{\left(S_{ij} - \frac{1}{n}\operatorname{tr}(S)\delta_{ij}\right)}_{T_{ij}}. \]
We have $\operatorname{tr}(T) = T_{ii} = S_{ii} - \frac{1}{n}\operatorname{tr}(S)\delta_{ii} = \operatorname{tr}(S) - \frac{1}{n}\operatorname{tr}(S)(n) = 0$.
Putting all these together,
\[ B = \frac{1}{n}\operatorname{tr}(B)I + \left(\frac{1}{2}(B + B^T) - \frac{1}{n}\operatorname{tr}(B)I\right) + \frac{1}{2}(B - B^T). \]
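A NumPy sketch of the full decomposition, applied to an arbitrary random matrix:

```python
import numpy as np

rng = np.random.default_rng(2)
n = 3
B = rng.standard_normal((n, n))

iso = (np.trace(B) / n) * np.eye(n)   # isotropic part
T = (B + B.T) / 2 - iso               # traceless symmetric part
A = (B - B.T) / 2                     # antisymmetric part

assert np.allclose(B, iso + T + A)    # the three parts reassemble B
assert np.isclose(np.trace(T), 0)     # T really is traceless
assert np.allclose(A.T, -A)           # A really is antisymmetric
```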
In three dimensions, we can write the antisymmetric part $A$ in terms of a single vector: we have
\[ A = \begin{pmatrix} 0 & a & b \\ -a & 0 & c \\ -b & -c & 0 \end{pmatrix}, \]
and we can consider
\[ \varepsilon_{ijk}\omega_k = \begin{pmatrix} 0 & \omega_3 & -\omega_2 \\ -\omega_3 & 0 & \omega_1 \\ \omega_2 & -\omega_1 & 0 \end{pmatrix}. \]
So if we have $\omega = (c, -b, a)$, then $A_{ij} = \varepsilon_{ijk}\omega_k$.
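Concretely, we can rebuild $A$ from $\omega$ via $A_{ij} = \varepsilon_{ijk}\omega_k$ (a NumPy sketch; the explicit Levi-Civita tensor is constructed by hand from its three even permutations):

```python
import numpy as np

# Antisymmetric A in the form ((0, a, b), (-a, 0, c), (-b, -c, 0)).
a, b, c = 1.0, 2.0, 3.0
A = np.array([[0, a, b],
              [-a, 0, c],
              [-b, -c, 0]])

omega = np.array([c, -b, a])          # omega = (c, -b, a)

# Levi-Civita tensor: +1 on even permutations, -1 on odd ones.
eps = np.zeros((3, 3, 3))
for i, j, k in [(0, 1, 2), (1, 2, 0), (2, 0, 1)]:
    eps[i, j, k], eps[j, i, k] = 1, -1

assert np.allclose(np.einsum('ijk,k->ij', eps, omega), A)  # A_ij = eps_ijk omega_k
```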
This decomposition can be useful in certain physical applications. For
example, if the matrix represents the stress of a system, different parts of the
decomposition will correspond to different types of stresses.
3.4.4 Matrix inverse
Definition (Inverse of matrix). Consider an $m \times n$ matrix $A$ and $n \times m$ matrices $B$ and $C$. If $BA = I$, then we say $B$ is the left inverse of $A$. If $AC = I$, then we say $C$ is the right inverse of $A$. If $A$ is square ($n \times n$), then $B = B(AC) = (BA)C = C$, i.e. the left and right inverses coincide. Both are denoted by $A^{-1}$, the inverse of $A$. Therefore we have
\[ AA^{-1} = A^{-1}A = I. \]
Note that not all square matrices have inverses. For example, the zero matrix
clearly has no inverse.
Definition (Invertible matrix). If A has an inverse, then A is invertible.
Proposition. $(AB)^{-1} = B^{-1}A^{-1}$.

Proof. $(B^{-1}A^{-1})(AB) = B^{-1}(A^{-1}A)B = B^{-1}B = I$.
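Numerically (a NumPy sketch; random matrices are invertible with probability 1, so the inverses below exist):

```python
import numpy as np

rng = np.random.default_rng(3)
A = rng.standard_normal((3, 3))
B = rng.standard_normal((3, 3))

inv = np.linalg.inv
assert np.allclose(inv(A @ B), inv(B) @ inv(A))  # (AB)^-1 = B^-1 A^-1
```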
Definition (Orthogonal and unitary matrices). A real $n \times n$ matrix is orthogonal if $A^TA = AA^T = I$, i.e. $A^T = A^{-1}$. A complex $n \times n$ matrix is unitary if $U^\dagger U = UU^\dagger = I$, i.e. $U^\dagger = U^{-1}$.
Note that an orthogonal matrix $A$ satisfies $A_{ik}(A^T)_{kj} = \delta_{ij}$, i.e. $A_{ik}A_{jk} = \delta_{ij}$. We can see this as saying "the scalar product of two distinct rows is 0, and the scalar product of a row with itself is 1". Alternatively, the rows (and columns, by considering $A^T$) of an orthogonal matrix form an orthonormal set.
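For example, the rotation matrix of Example (ii) in Section 3.4.1 passes this check (a NumPy sketch with an arbitrary angle):

```python
import numpy as np

t = 0.7  # arbitrary angle
R = np.array([[np.cos(t), -np.sin(t), 0],
              [np.sin(t),  np.cos(t), 0],
              [0,          0,         1]])

# A_ik A_jk = delta_ij: the rows of R form an orthonormal set.
assert np.allclose(R @ R.T, np.eye(3))
assert np.allclose(R.T @ R, np.eye(3))   # likewise the columns
```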
Similarly, for a unitary matrix, $U_{ik}(U^\dagger)_{kj} = \delta_{ij}$, i.e. $u_{ik}u_{jk}^* = u_{ik}^*u_{jk} = \delta_{ij}$, i.e. the rows are orthonormal, using the definition of the complex scalar product.
Example.
(i) The reflection in a plane is an orthogonal matrix. Since $R_{ij} = \delta_{ij} - 2n_in_j$, we have
\begin{align*}
R_{ik}R_{jk} &= (\delta_{ik} - 2n_in_k)(\delta_{jk} - 2n_jn_k) \\
&= \delta_{ik}\delta_{jk} - 2\delta_{jk}n_in_k - 2\delta_{ik}n_jn_k + 4n_in_kn_jn_k \\
&= \delta_{ij} - 2n_in_j - 2n_jn_i + 4n_in_j(n_kn_k) \\
&= \delta_{ij}.
\end{align*}
(ii) The rotation is an orthogonal matrix. We could multiply out using suffix notation, but it would be cumbersome to do so. Alternatively, denote the rotation matrix by $\theta$ about $\hat{n}$ as $R(\theta, \hat{n})$. Clearly, $R(\theta, \hat{n})^{-1} = R(-\theta, \hat{n})$. We have
\begin{align*}
R_{ij}(-\theta, \hat{n}) &= (\cos\theta)\delta_{ij} + n_in_j(1 - \cos\theta) + \varepsilon_{ijk}n_k\sin\theta \\
&= (\cos\theta)\delta_{ji} + n_jn_i(1 - \cos\theta) - \varepsilon_{jik}n_k\sin\theta \\
&= R_{ji}(\theta, \hat{n}).
\end{align*}
In other words, $R(-\theta, \hat{n}) = R(\theta, \hat{n})^T$. So $R(\theta, \hat{n})^{-1} = R(\theta, \hat{n})^T$.
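We can also verify this numerically. The sketch below (NumPy; the `rotation` helper is ours, implementing $R_{ij} = (\cos\theta)\delta_{ij} + n_in_j(1-\cos\theta) - \varepsilon_{ijk}n_k\sin\theta$ via the cross-product matrix) checks both $R(\theta, \hat{n})^{-1} = R(-\theta, \hat{n})$ and $R(\theta, \hat{n})^{-1} = R(\theta, \hat{n})^T$:

```python
import numpy as np

def rotation(theta, n):
    """Rotation by theta about unit axis n: cos(t) I + (1 - cos(t)) n n^T + sin(t) K."""
    K = np.array([[0, -n[2], n[1]],
                  [n[2], 0, -n[0]],
                  [-n[1], n[0], 0]])   # cross-product matrix: K x = n cross x
    return (np.cos(theta) * np.eye(3)
            + (1 - np.cos(theta)) * np.outer(n, n)
            + np.sin(theta) * K)

n = np.array([1.0, 2.0, 2.0]) / 3.0    # arbitrary unit axis
R = rotation(0.7, n)

assert np.allclose(np.linalg.inv(R), rotation(-0.7, n))  # R(theta)^-1 = R(-theta)
assert np.allclose(np.linalg.inv(R), R.T)                # ... which equals R^T
```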