IB Numerical Analysis - Numerical linear algebra

8Numerical linear algebra

IB Numerical Analysis

8.2 LU factorization

In general, we don’t always have triangular matrices. The idea is, for every

matrix A, find some lower and triangular matrices L, U such that A = LU.

If we can do this, then we can solve

in two steps — we first find a

such that Ly = b. Then we find an x such that Ux = y. Then

Ax = LUx = Ly = b.

So what we want is such a factorization. To guarantee uniqueness of the

factorization, we require that

is unit, i.e. the diagonals are all 1. Otherwise,

given any such factorization, we can divide

by some (non-zero) constant and

multiply L by the same constant to get another factorization.

Definition

(LU factorization)

. A

is an LU factorization if

is upper

triangular and L is unit lower triangular (i.e. the diagonals of L are all 1).

Note that since

has to be unit, it must be non-singular. However, we still

allow A and U to be singular. Note that

det(A) = det(L) det(U ) = det(U ).

So A is singular if and only if U is.

Unfortunately, even if

is non-singular, it may not have an LU factorization.

We take a simple example of



0 1

1 1



This is clearly non-singular, with determinant

−

1. However, we can manually

check that there is no LU factorization of this.

On the other hand, while we don’t really like singular matrices, singular

matrices can still have LU factorizations. For example,



0 0

0 1





1 0

0 1



0 0

0 1



is trivially an LU factorization of a singular matrix.

Fortunately, if

is non-singular and has an LU factorization, then this

factorization is unique (this is not necessarily true if

is singular. For example,

we know



0 0

0 1





1 0

a 1



0 0

0 1



for any real number a).

To understand when LU factorizations exist, we try to construct one, and

see when it could fail.

We write L in terms of its columns, and U in terms of its rows:

L =



··· l



, U =













Clearly, these rows and columns cannot be arbitrary, since

and

are triangular.

In particular, l

, u

must be zero in the first i −1 entries.

Suppose this is an LU factorization of A. Then we can write

A = L · U = l

+ l

+ ··· + l

What do these matrices look like? For each

, we know

and

have the first

i −

1 entries zero. So the first

i −

1 rows and columns of

are zero. In

particular, the first row and columns only have contributions from

, the

second row/column only has contributions from l

and l

etc.

The plan is as follows:

(i)

Obtain

and

from the first row and column of

. Since the first entry

is 1,

is exactly the first row of

. We can then obtain

by taking

the first column of A and dividing by U

= A

(ii) Obtain l

and u

form the second row and column of A − l

similarly.

(iii) ···

(iv) Obtain l

and u

from the nth row and column of A −

n−1

i=1

We can turn this into an algorithm. We define the intermediate matrices, starting

with

(0)

= A.

For k = 1, ··· , n, we let

= A

(k−1)

j = k, ··· , n

(k−1)

i = k, ··· , n

(k)

= A

(k−1)

− L

i, j ≥ k

When

, we end up with a zero matrix, and then

and

are completely

filled.

We can now see when this will break down. A sufficient condition for

to exist is that

(k−1)

= 0 for all

. Since

(k−1)

, this sufficient condition

ensures

, and hence

is non-singular. Conversely, if

is non-singular and

an LU factorization exists, then this would always work, since we must have

(k−1)

= 0. Moreover, the LU factorization must be given by this

algorithm. So we get uniqueness.

The problem with this sufficient condition is that most of these coefficients

do not appear in the matrix

. They are constructed during the algorithm. We

don’t know easily what they are in terms of the coefficients of

. We will later

come up with an equivalent condition on our original A that is easier to check.

Note that as long as this method does not break down, we need

(

)

operations to perform this factorization. Recall we only needed

(

) operations

to solve the equation after factorization. So the bulk of the work in solving

Ax = b is in doing the LU factorization.

As before, this allows us to find the inverse of

if it is non-singular. In

particular, solving

gives the

th column of

−1

. Note that we are

solving the system for the same

for each

. So we only have to perform the

LU factorization once, and then solve

different equations. So in total we need

O(n

) operations.

However, we still have the problem that factorization is not always possible.

Requiring that we must factorize

is too restrictive. The idea is to factor

something closely related, but not exactly A. Instead, we want a factorization

P A = LU,

where

is a permutation matrix. Recall a permutation matrix acting on a

column vector just permutes the elements of the vector, and

acting on

would just permute the rows of

. So we want to factor

up to a permutation

of rows.

Our goal now is to extend the previous algorithm to allow permutations of

rows, and then we shall show that we will be able to perform this factorization

all the time.

Suppose our breakdown occurs at

= 1, i.e.

(0)

= 0. We find a

permutation matrix

and let it act via

(0)

. The idea is to look down the

first column of

, and find a row starting with a non-zero element, say

. Then

we use

to interchange rows 1 and

such that

(0)

has a non-zero top-most

entry. For simplicity, we assume we always need a

, and if

(0)

is non-zero in

the first place, we just take P

to be the identity.

After that, we can carry on. We construct

and

from

(0)

as before,

and set A

(1)

= P

(0)

− l

But what happens if the first column of

is completely zero? Then no

interchange will make the (1

1) entry non-zero. However, in this case, we don’t

actually have to do anything. We can immediately find our

and

, namely

set

(or anything) and let

be the first row of

(0)

. Then this already

works. Note however that this corresponds to

(and hence

) being singular,

and we are not too interested with these.

The later steps are exactly analogous. Suppose we have

(k−1)

= 0. Again

we find a

such that

(k−1)

has a non-zero (

k, k

) entry. We then construct

and u

from P

(k−1)

and set

(k)

= P

(n−1)

− l

Again, if the

th column of

(k−1)

is completely zero, we set

and

be the kth row of A

(k−1)

. But again this implies A and U will be singular.

However, as we do this, the permutation matrices appear all over the place

inside the algorithm. It is not immediately clear that we do get a factorization

of the form

P A

. Fortunately, keeping track of the interchanges, we do

have an LU factorization

P A =

LU,

where U is what we got from the algorithm,

P = P

n−1

···P

while

L is given by

L =



···



= P

n−1

···P

k−1

Note that in particular, we have

n−1

= l

n−1

= l

One problem we have not considered is the problem of inexact arithmetic. While

these formula are correct mathematically, when we actually implement things,

we do them on computers with finite precision. As we go through the algorithm,

errors will accumulate, and the error might be amplified to a significant amount

when we get the reach the end. We want an algorithm that is insensitive to

errors. In order to work safely in inexact arithmetic, we will put the element of

largest modulus in the (

k, k

)th position, not just an arbitrary non-zero one, as

this minimizes the error when dividing.