5 Determinants of matrices
We probably all know what the determinant is. Here we are going to give a slightly more abstract definition, and spend quite a lot of time trying to motivate this definition.
Recall that $S_n$ is the group of permutations of $\{1, \cdots, n\}$, and there is a unique group homomorphism $\varepsilon: S_n \to \{\pm 1\}$ such that $\varepsilon(\sigma) = 1$ if $\sigma$ can be written as a product of an even number of transpositions, and $\varepsilon(\sigma) = -1$ if $\sigma$ can be written as a product of an odd number of transpositions. It is proved in IA Groups that this is well-defined.
Definition (Determinant). Let $A \in \mathrm{Mat}_{n,n}(\mathbb{F})$. Its determinant is
\[
  \det A = \sum_{\sigma \in S_n} \varepsilon(\sigma) \prod_{i=1}^n A_{i\sigma(i)}.
\]
This is a big scary definition. Hence, we will spend the first half of the
chapter trying to understand what this really means, and how it behaves. We
will eventually prove a formula that is useful for computing the determinant,
which is probably how you were first exposed to the determinant.
Example. If $n = 2$, then $S_2 = \{\mathrm{id}, (1\ 2)\}$. So
\[
  \det A = A_{11}A_{22} - A_{12}A_{21}.
\]
When $n = 3$, then $S_3$ has 6 elements, and
\begin{align*}
  \det A ={}& A_{11}A_{22}A_{33} + A_{12}A_{23}A_{31} + A_{13}A_{21}A_{32} \\
  &- A_{11}A_{23}A_{32} - A_{22}A_{31}A_{13} - A_{33}A_{12}A_{21}.
\end{align*}
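To see the definition in action, here is a direct transcription into Python, a minimal sketch of our own (the names `sign` and `det_leibniz` are not from the course, and summing over all $n!$ permutations is hopelessly inefficient for large $n$):

```python
from itertools import permutations
from math import prod

def sign(perm):
    """epsilon(sigma): +1 for an even permutation, -1 for an odd one.
    Counting inversions gives the same parity as any decomposition
    into transpositions."""
    n = len(perm)
    inversions = sum(1 for i in range(n) for j in range(i + 1, n)
                     if perm[i] > perm[j])
    return -1 if inversions % 2 else 1

def det_leibniz(A):
    """det A = sum over sigma in S_n of eps(sigma) * prod_i A[i][sigma(i)],
    with rows and columns 0-indexed."""
    n = len(A)
    return sum(sign(sigma) * prod(A[i][sigma[i]] for i in range(n))
               for sigma in permutations(range(n)))

print(det_leibniz([[1, 2], [3, 4]]))  # A11*A22 - A12*A21 = -2
```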
We will first prove a few easy and useful lemmas about the determinant.
Lemma. $\det A = \det A^T$.
Proof.
\[
  \det A^T = \sum_{\sigma \in S_n} \varepsilon(\sigma) \prod_{i=1}^n A_{\sigma(i)i}
  = \sum_{\sigma \in S_n} \varepsilon(\sigma) \prod_{j=1}^n A_{j\sigma^{-1}(j)}
  = \sum_{\tau \in S_n} \varepsilon(\tau^{-1}) \prod_{j=1}^n A_{j\tau(j)}.
\]
Since $\varepsilon(\tau) = \varepsilon(\tau^{-1})$, we get
\[
  \det A^T = \sum_{\tau \in S_n} \varepsilon(\tau) \prod_{j=1}^n A_{j\tau(j)} = \det A.
\]
Lemma. If $A$ is an upper triangular matrix, i.e.
\[
  A =
  \begin{pmatrix}
    a_{11} & a_{12} & \cdots & a_{1n} \\
    0 & a_{22} & \cdots & a_{2n} \\
    \vdots & \vdots & \ddots & \vdots \\
    0 & 0 & \cdots & a_{nn}
  \end{pmatrix},
\]
then
\[
  \det A = \prod_{i=1}^n a_{ii}.
\]
Proof. We have
\[
  \det A = \sum_{\sigma \in S_n} \varepsilon(\sigma) \prod_{i=1}^n A_{i\sigma(i)}.
\]
But $A_{i\sigma(i)} = 0$ whenever $i > \sigma(i)$. So
\[
  \prod_{i=1}^n A_{i\sigma(i)} = 0
\]
if there is some $i \in \{1, \cdots, n\}$ such that $i > \sigma(i)$.
However, the only permutation in which $i \leq \sigma(i)$ for all $i$ is the identity. So the only thing that contributes in the sum is $\sigma = \mathrm{id}$. So
\[
  \det A = \prod_{i=1}^n A_{ii}.
\]
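As a quick sanity check of the lemma, reusing `det_leibniz` from the sketch above (the matrix is an arbitrary choice of ours):

```python
U = [[2, 5, 7],
     [0, 3, 1],
     [0, 0, 4]]
print(det_leibniz(U))  # 24 = 2 * 3 * 4, the product of the diagonal entries
```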
To motivate this definition, we need a notion of volume. How can we define volume on a vector space? It should be clear that the “volume” cannot be uniquely determined, since it depends on what units we are using. For example, saying the volume is “1” is meaningless unless we provide the units, e.g. $\mathrm{cm}^3$. So we have an axiomatic definition for what it means for something to denote a “volume”.
Definition (Volume form). A volume form on $\mathbb{F}^n$ is a function $d: \mathbb{F}^n \times \cdots \times \mathbb{F}^n \to \mathbb{F}$ that is

(i) Multilinear, i.e. for all $i$ and all $v_1, \cdots, v_{i-1}, v_{i+1}, \cdots, v_n \in \mathbb{F}^n$, we have
\[
  d(v_1, \cdots, v_{i-1}, \,\cdot\,, v_{i+1}, \cdots, v_n) \in (\mathbb{F}^n)^*.
\]
(ii) Alternating, i.e. if $v_i = v_j$ for some $i \neq j$, then
\[
  d(v_1, \cdots, v_n) = 0.
\]
We should think of $d(v_1, \cdots, v_n)$ as the $n$-dimensional volume of the parallelepiped spanned by $v_1, \cdots, v_n$.
We can view $A \in \mathrm{Mat}_n(\mathbb{F})$ as $n$-many vectors in $\mathbb{F}^n$ by considering its columns $A = (A^{(1)}\ A^{(2)}\ \cdots\ A^{(n)})$, with $A^{(i)} \in \mathbb{F}^n$. Then we have
Lemma. $\det A$ is a volume form.
Proof. To see that $\det$ is multilinear, it is sufficient to show that each
\[
  \prod_{i=1}^n A_{i\sigma(i)}
\]
is multilinear for all $\sigma \in S_n$, since linear combinations of multilinear forms are multilinear. But each such product contains precisely one entry from each column, and so is multilinear.
To show it is alternating, suppose now there are some $k, \ell$ distinct such that $A^{(k)} = A^{(\ell)}$. We let $\tau$ be the transposition $(k\ \ell)$. By Lagrange's theorem, we can write
\[
  S_n = A_n \sqcup \tau A_n,
\]
where $A_n = \ker \varepsilon$ and $\sqcup$ is the disjoint union. We also know that
\[
  \sum_{\sigma \in A_n} \prod_{i=1}^n A_{i\sigma(i)} = \sum_{\sigma \in A_n} \prod_{i=1}^n A_{i, \tau\sigma(i)},
\]
since if $\sigma(i)$ is not $k$ or $\ell$, then $\tau$ does nothing; if $\sigma(i)$ is $k$ or $\ell$, then $\tau$ just swaps them around, but $A^{(k)} = A^{(\ell)}$. So we get
\[
  \sum_{\sigma \in A_n} \prod_{i=1}^n A_{i\sigma(i)} = \sum_{\sigma' \in \tau A_n} \prod_{i=1}^n A_{i\sigma'(i)}.
\]
But we know that
\[
  \det A = \mathrm{LHS} - \mathrm{RHS} = 0.
\]
So done.
We have shown that determinants are volume forms, but is this the only volume form? Well obviously not, since $2 \det A$ is also a valid volume form. However, in some sense, all volume forms are “derived” from the determinant. Before we show that, we need the following
Lemma. Let $d$ be a volume form on $\mathbb{F}^n$. Then swapping two entries changes the sign, i.e.
\[
  d(v_1, \cdots, v_i, \cdots, v_j, \cdots, v_n) = -d(v_1, \cdots, v_j, \cdots, v_i, \cdots, v_n).
\]
Proof. By linearity, we have
\begin{align*}
  0 &= d(v_1, \cdots, v_i + v_j, \cdots, v_i + v_j, \cdots, v_n) \\
  &= d(v_1, \cdots, v_i, \cdots, v_i, \cdots, v_n) + d(v_1, \cdots, v_i, \cdots, v_j, \cdots, v_n) \\
  &\quad + d(v_1, \cdots, v_j, \cdots, v_i, \cdots, v_n) + d(v_1, \cdots, v_j, \cdots, v_j, \cdots, v_n) \\
  &= d(v_1, \cdots, v_i, \cdots, v_j, \cdots, v_n) + d(v_1, \cdots, v_j, \cdots, v_i, \cdots, v_n),
\end{align*}
where the first and last terms vanish since $d$ is alternating. So done.
Corollary. If $\sigma \in S_n$, then
\[
  d(v_{\sigma(1)}, \cdots, v_{\sigma(n)}) = \varepsilon(\sigma)\, d(v_1, \cdots, v_n)
\]
for any $v_i \in \mathbb{F}^n$.
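For a concrete instance of the corollary (with $d = \det$ and $\sigma$ a single transposition of two columns), again reusing `det_leibniz`; the matrix is an arbitrary choice of ours:

```python
A = [[1, 2, 3],
     [4, 5, 6],
     [7, 8, 10]]
# Swapping two columns applies a transposition, so eps(sigma) = -1
# and the determinant should flip sign.
A_swapped = [[row[2], row[1], row[0]] for row in A]
print(det_leibniz(A), det_leibniz(A_swapped))  # -3 3
```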
Theorem. Let $d$ be any volume form on $\mathbb{F}^n$, and let $A = (A^{(1)}\ \cdots\ A^{(n)}) \in \mathrm{Mat}_n(\mathbb{F})$. Then
\[
  d(A^{(1)}, \cdots, A^{(n)}) = (\det A)\, d(e_1, \cdots, e_n),
\]
where $\{e_1, \cdots, e_n\}$ is the standard basis.
Proof. We can compute
\begin{align*}
  d(A^{(1)}, \cdots, A^{(n)}) &= d\left(\sum_{i=1}^n A_{i1} e_i,\ A^{(2)}, \cdots, A^{(n)}\right) \\
  &= \sum_{i=1}^n A_{i1}\, d(e_i, A^{(2)}, \cdots, A^{(n)}) \\
  &= \sum_{i,j=1}^n A_{i1} A_{j2}\, d(e_i, e_j, A^{(3)}, \cdots, A^{(n)}) \\
  &= \sum_{i_1, \cdots, i_n} d(e_{i_1}, \cdots, e_{i_n}) \prod_{j=1}^n A_{i_j j}.
\end{align*}
We know that lots of these terms are zero, since if $i_k = i_j$ for some $k \neq j$, then the term is zero. So we are just summing over distinct tuples, i.e. those for which there is some $\sigma \in S_n$ such that $i_j = \sigma(j)$. So we get
\[
  d(A^{(1)}, \cdots, A^{(n)}) = \sum_{\sigma \in S_n} d(e_{\sigma(1)}, \cdots, e_{\sigma(n)}) \prod_{j=1}^n A_{\sigma(j)j}.
\]
However, by the corollary above, this is just
\[
  d(A^{(1)}, \cdots, A^{(n)}) = \sum_{\sigma \in S_n} \varepsilon(\sigma)\, d(e_1, \cdots, e_n) \prod_{j=1}^n A_{\sigma(j)j} = (\det A)\, d(e_1, \cdots, e_n),
\]
where the last equality uses $\det A = \det A^T$. So done.
We can rewrite the formula as
\[
  d(Ae_1, \cdots, Ae_n) = (\det A)\, d(e_1, \cdots, e_n).
\]
It is not hard to see that the same proof shows that for any $v_1, \cdots, v_n$, we have
\[
  d(Av_1, \cdots, Av_n) = (\det A)\, d(v_1, \cdots, v_n).
\]
So we know that $\det A$ is the volume rescaling factor of an arbitrary parallelepiped, and this is true for any volume form $d$.
Theorem. Let $A, B \in \mathrm{Mat}_n(\mathbb{F})$. Then $\det(AB) = \det(A)\det(B)$.
Proof. Let $d$ be a non-zero volume form on $\mathbb{F}^n$ (e.g. the determinant). Then we can compute
\[
  d(ABe_1, \cdots, ABe_n) = (\det AB)\, d(e_1, \cdots, e_n),
\]
but we also have
\[
  d(ABe_1, \cdots, ABe_n) = (\det A)\, d(Be_1, \cdots, Be_n) = (\det A)(\det B)\, d(e_1, \cdots, e_n).
\]
Since $d$ is non-zero, we must have $\det AB = \det A \det B$.
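A numerical spot check of multiplicativity (not a proof, of course), reusing `det_leibniz`; the helper `matmul` and the matrices are ad hoc choices of ours:

```python
def matmul(A, B):
    # (AB)_{ij} = sum_k A_{ik} B_{kj}, for square matrices as lists of rows
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

A = [[1, 2, 0], [0, 1, 3], [4, 0, 1]]
B = [[2, 1, 1], [0, 1, 0], [5, 2, 2]]
print(det_leibniz(matmul(A, B)))        # -25
print(det_leibniz(A) * det_leibniz(B))  # -25, as the theorem predicts
```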
Corollary. If $A \in \mathrm{Mat}_n(\mathbb{F})$ is invertible, then $\det A \neq 0$. In fact, when $A$ is invertible, then $\det(A^{-1}) = (\det A)^{-1}$.
Proof. We have
\[
  1 = \det I = \det(AA^{-1}) = \det A \det A^{-1}.
\]
So done.
Definition (Singular matrices). A matrix $A$ is singular if $\det A = 0$. Otherwise, it is non-singular.
We have just shown that if $\det A = 0$, then $A$ is not invertible. Is the converse true? If $\det A \neq 0$, then can we conclude that $A$ is invertible? The answer is yes. We are now going to prove it in an abstract and clean way. We will later prove this fact again by constructing an explicit formula for the inverse, which involves dividing by the determinant. So if the determinant is non-zero, then we know an inverse exists.
Theorem. Let $A \in \mathrm{Mat}_n(\mathbb{F})$. Then the following are equivalent:

(i) $A$ is invertible.

(ii) $\det A \neq 0$.

(iii) $r(A) = n$.
Proof. We have proved that (i) $\Rightarrow$ (ii) above, and the rank-nullity theorem implies (iii) $\Rightarrow$ (i). We will prove (ii) $\Rightarrow$ (iii). In fact we will show the contrapositive. Suppose $r(A) < n$. By the rank-nullity theorem, $n(A) > 0$. So there is some
\[
  x = \begin{pmatrix} \lambda_1 \\ \vdots \\ \lambda_n \end{pmatrix}
\]
such that $Ax = 0$. Suppose $\lambda_k \neq 0$. We define $B$ as follows:
\[
  B =
  \begin{pmatrix}
    1 & & & \lambda_1 & & & \\
    & \ddots & & \vdots & & & \\
    & & 1 & \lambda_{k-1} & & & \\
    & & & \lambda_k & & & \\
    & & & \lambda_{k+1} & 1 & & \\
    & & & \vdots & & \ddots & \\
    & & & \lambda_n & & & 1
  \end{pmatrix},
\]
i.e. the identity matrix with the $k$th column replaced by $x$. So $AB$ has the $k$th column identically zero, since that column is $Ax = 0$. So $\det(AB) = 0$. So it is sufficient to prove that $\det(B) \neq 0$. But $\det B = \lambda_k \neq 0$. So done.
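For instance (an ad hoc example of ours), a matrix with a dependent row has rank $< n$ and, as the theorem demands, determinant zero:

```python
S = [[1, 2, 3],
     [2, 4, 6],   # twice the first row, so r(S) < 3
     [0, 1, 1]]
print(det_leibniz(S))  # 0
```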
We are now going to come up with an alternative formula for the determinant (which is probably the one you are familiar with). To do so, we introduce the following notation:

Notation. Write $\hat{A}_{ij}$ for the matrix obtained from $A$ by deleting the $i$th row and $j$th column.
Lemma. Let $A \in \mathrm{Mat}_n(\mathbb{F})$. Then

(i) We can expand $\det A$ along the $j$th column by
\[
  \det A = \sum_{i=1}^n (-1)^{i+j} A_{ij} \det \hat{A}_{ij}.
\]
(ii) We can expand $\det A$ along the $i$th row by
\[
  \det A = \sum_{j=1}^n (-1)^{i+j} A_{ij} \det \hat{A}_{ij}.
\]
We could prove this directly from the definition, but that is messy and scary,
so let’s use volume forms instead.
Proof. Since $\det A = \det A^T$, (i) and (ii) are equivalent. So it suffices to prove just one of them. We have
\[
  \det A = d(A^{(1)}, \cdots, A^{(n)}),
\]
where $d$ is the volume form induced by the determinant. Then we can write this as
\begin{align*}
  \det A &= d\left(A^{(1)}, \cdots, \sum_{i=1}^n A_{ij} e_i, \cdots, A^{(n)}\right) \\
  &= \sum_{i=1}^n A_{ij}\, d(A^{(1)}, \cdots, e_i, \cdots, A^{(n)}).
\end{align*}
The volume form on the right is the determinant of a matrix with the $j$th column replaced with $e_i$. We can move our columns and rows around so that our matrix becomes
\[
  B = \begin{pmatrix} \hat{A}_{ij} & 0 \\ \text{stuff} & 1 \end{pmatrix}.
\]
We get that $\det B = \det \hat{A}_{ij}$, since the only permutations that give a non-zero sum are those that send $n$ to $n$. In the row and column swapping, we have made $n - j$ column transpositions and $n - i$ row transpositions. So we have
\begin{align*}
  \det A &= \sum_{i=1}^n A_{ij} (-1)^{n-j} (-1)^{n-i} \det B \\
  &= \sum_{i=1}^n A_{ij} (-1)^{i+j} \det \hat{A}_{ij}.
\end{align*}
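Here is a sketch of the row expansion as a recursive Python function, expanding along the first row; `minor` plays the role of $\hat{A}_{ij}$, and with 0-indexing the sign $(-1)^{i+j}$ becomes $(-1)^j$ (the names are our own):

```python
def minor(A, i, j):
    # A-hat_{ij}: delete row i and column j (0-indexed)
    return [row[:j] + row[j + 1:] for r, row in enumerate(A) if r != i]

def det_laplace(A):
    # Expand along the first row: det A = sum_j (-1)^j A[0][j] det(minor)
    if len(A) == 1:
        return A[0][0]
    return sum((-1) ** j * A[0][j] * det_laplace(minor(A, 0, j))
               for j in range(len(A)))

print(det_laplace([[1, 2, 3], [4, 5, 6], [7, 8, 10]]))  # -3
```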
This is not only useful for computing determinants, but also for computing inverses.
Definition (Adjugate matrix). Let $A \in \mathrm{Mat}_n(\mathbb{F})$. The adjugate matrix of $A$, written $\mathrm{adj}\, A$, is the $n \times n$ matrix such that $(\mathrm{adj}\, A)_{ij} = (-1)^{i+j} \det \hat{A}_{ji}$.
The relevance is the following result:
Theorem. If $A \in \mathrm{Mat}_n(\mathbb{F})$, then $A(\mathrm{adj}\, A) = (\det A) I_n = (\mathrm{adj}\, A) A$. In particular, if $\det A \neq 0$, then
\[
  A^{-1} = \frac{1}{\det A} \mathrm{adj}\, A.
\]
Note that this is not an efficient way to compute the inverse.
Proof. We compute
\[
  [(\mathrm{adj}\, A)A]_{jk} = \sum_{i=1}^n (\mathrm{adj}\, A)_{ji} A_{ik} = \sum_{i=1}^n (-1)^{i+j} \det \hat{A}_{ij}\, A_{ik}. \tag{$*$}
\]
So if $j = k$, then $[(\mathrm{adj}\, A)A]_{jk} = \det A$ by the lemma.

Otherwise, if $j \neq k$, consider the matrix $B$ obtained from $A$ by replacing the $j$th column by the $k$th column. Then the right hand side of $(*)$ is just $\det B$ by the lemma. But we know that if two columns are the same, the determinant is zero. So the right hand side of $(*)$ is zero. So
\[
  [(\mathrm{adj}\, A)A]_{jk} = (\det A)\, \delta_{jk}.
\]
The calculation for $A\, \mathrm{adj}\, A = (\det A) I_n$ can be done in a similar manner, or by considering
\[
  (A\, \mathrm{adj}\, A)^T = (\mathrm{adj}\, A)^T A^T = (\mathrm{adj}(A^T)) A^T = (\det A) I_n.
\]
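A sketch of the adjugate and the resulting inverse in Python, reusing `det_laplace`, `minor` and `matmul` from above; `Fraction` keeps the arithmetic exact, and note the transposed indices coming from the $\hat{A}_{ji}$ in the definition:

```python
from fractions import Fraction

def adjugate(A):
    # (adj A)_{ij} = (-1)^{i+j} det(A-hat_{ji}): the minor at the
    # *transposed* position (j, i)
    n = len(A)
    return [[(-1) ** (i + j) * det_laplace(minor(A, j, i))
             for j in range(n)] for i in range(n)]

A = [[1, 2], [3, 4]]
adjA = adjugate(A)      # [[4, -2], [-3, 1]]
print(matmul(A, adjA))  # [[-2, 0], [0, -2]] = (det A) * I_2
d = det_laplace(A)      # -2
print([[Fraction(x, d) for x in row] for row in adjA])  # A^{-1}
```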
Note that the coefficients of $\mathrm{adj}\, A$ are just given by polynomials in the entries of $A$, and so is the determinant. So if $A$ is invertible, then its inverse is given by a rational function (i.e. a ratio of two polynomials) in the entries of $A$.

This is very useful theoretically, but not computationally, since the polynomials are very large. There are better ways computationally, such as Gaussian elimination.
We'll end with a useful trick to compute the determinant.
Lemma. Let $A, B$ be square matrices. Then for any $C$, we have
\[
  \det \begin{pmatrix} A & C \\ 0 & B \end{pmatrix} = (\det A)(\det B).
\]
Proof. Suppose $A \in \mathrm{Mat}_k(\mathbb{F})$ and $B \in \mathrm{Mat}_\ell(\mathbb{F})$, so $C \in \mathrm{Mat}_{k,\ell}(\mathbb{F})$. Let
\[
  X = \begin{pmatrix} A & C \\ 0 & B \end{pmatrix}.
\]
Then by definition, we have
\[
  \det X = \sum_{\sigma \in S_{k+\ell}} \varepsilon(\sigma) \prod_{i=1}^{k+\ell} X_{i\sigma(i)}.
\]
If $j \leq k$ and $i > k$, then $X_{ij} = 0$. We only want to sum over permutations $\sigma$ such that $\sigma(i) > k$ if $i > k$. So we are permuting the last $\ell$ things among themselves, and hence the first $k$ things among themselves. So we can decompose $\sigma = \sigma_1 \sigma_2$, where $\sigma_1$ is a permutation of $\{1, \cdots, k\}$ and fixes the remaining things, while $\sigma_2$ fixes $\{1, \cdots, k\}$ and permutes the remaining. Then
\begin{align*}
  \det X &= \sum_{\sigma = \sigma_1 \sigma_2} \varepsilon(\sigma_1 \sigma_2) \prod_{i=1}^k X_{i\sigma_1(i)} \prod_{j=1}^\ell X_{k+j,\, \sigma_2(k+j)} \\
  &= \left(\sum_{\sigma_1 \in S_k} \varepsilon(\sigma_1) \prod_{i=1}^k A_{i\sigma_1(i)}\right) \left(\sum_{\sigma_2 \in S_\ell} \varepsilon(\sigma_2) \prod_{j=1}^\ell B_{j\sigma_2(j)}\right) \\
  &= (\det A)(\det B).
\end{align*}
Corollary.
\[
  \det
  \begin{pmatrix}
    A_1 & & & \text{stuff} \\
    & A_2 & & \\
    & & \ddots & \\
    0 & & & A_n
  \end{pmatrix}
  = \prod_{i=1}^n \det A_i.
\]
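A quick check of the lemma with explicit $2 \times 2$ blocks (our own choice of matrices), reusing `det_leibniz`:

```python
A = [[1, 2], [3, 4]]           # det A = -2
B = [[5, 6], [7, 9]]           # det B = 3
X = [[1, 2, 1, 1],             # X = [[A, C], [0, B]] with C all ones;
     [3, 4, 1, 1],             # C should not affect det X at all
     [0, 0, 5, 6],
     [0, 0, 7, 9]]
print(det_leibniz(X))                   # -6
print(det_leibniz(A) * det_leibniz(B))  # -6
```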