IB Groups, Rings and Modules

2Rings

2.5 Factorization in polynomial rings

Since polynomial rings are a bit more special than general integral domains, we

can say a bit more about them.

Recall that for

a field, we know

[

] is a Euclidean domain, hence a

principal ideal domain, hence a unique factorization domain. Therefore we know

(i) If I C F [X], then I = (f) for some f ∈ F [X].

(ii) If f ∈ F [X], then f is irreducible if and only if f is prime.

(iii)

Let

be irreducible, and suppose (

)

⊆ J ⊆ F

[

]. Then

= (

) for some

. Since (

)

⊆

(

), we must have

for some

. But

is irreducible.

So either

is a unit. If

is a unit, then (

) =

[

]. If

is a unit,

then (

) = (

). So (

) is a maximal ideal. Note that this argument is valid

for any PID, not just polynomial rings.

(iv)

Let (

) be a prime ideal. Then

is prime. So

is irreducible. So (

) is

maximal. But we also know in complete generality that maximal ideals are

prime. So in

[

], prime ideals are the same as maximal ideals. Again,

this is true for all PIDs in general.

(v) Thus f is irreducible if and only if F [X]/(f) is a field.

To use the last item, we can first show that

[

]

(

) is a field, and then use this

to deduce that

is irreducible. But we can also do something more interesting

— find an irreducible f, and then generate an interesting field F [X]/(f).

So we want to understand reducibility, i.e. we want to know whether we can

factorize a polynomial

. Firstly, we want to get rid of the trivial case where we

just factor out a scalar, e.g. 2

+ 2 = 2(

+ 1)

∈ Z

[

] is a boring factorization.

Definition (Content). Let

be a UFD and

···

∈ R

[

The content c(f) of f is

c(f) = gcd(a

, a

, ··· , a

) ∈ R.

Again, since the gcd is only defined up to a unit, so is the content.

Definition (Primitive polynomial). A polynomial is primitive if

(

) is a unit,

i.e. the a

are coprime.

Note that this is the best we can do. We cannot ask for

(

) to be exactly 1,

since the gcd is only well-defined up to a unit.

We now want to prove the following important lemma:

Lemma (Gauss’ lemma). Let

be a UFD, and

f ∈ R

[

] be a primitive

polynomial. Then

is reducible in

[

] if and only if

is reducible

[

], where

F is the field of fractions of R.

We can’t do this right away. We first need some preparation. Before that,

we do some examples.

Example. Consider

+ 1

∈ Z

[

]. This has content 1 so is primitive. We

show it is not reducible in Z[X], and hence not reducible in Q[X].

Suppose

is reducible in

[

]. Then by Gauss’ lemma, this is reducible in

Z[X]. So we can write

+ X + 1 = gh,

for some polynomials

g, h ∈ Z

[

], with

g, h

not units. But if

and

are not

units, then they cannot be constant, since the coefficients of

+ 1 are all

1 or 0. So they have degree at least 1. Since the degrees add up to 3, we wlog

suppose g has degree 1 and h has degree 2. So suppose

g = b

+ b

X, h = c

+ c

X + c

Multiplying out and equating coefficients, we get

= 1

and

must be

1. So

is either 1 +

− X, −

1 +

−

− X

, and

hence has

1 as a root. But this is a contradiction, since

1 is not a root of

+ X + 1. So f is not reducible in Q. In particular, f has no root in Q.

We see the advantage of using Gauss’ lemma — if we worked in

instead,

we could have gotten to the step

= 1, and then we can do nothing, since

and c

can be many things if we live in Q.

Now we start working towards proving this.

Lemma. Let R be a UFD. If f, g ∈ R[X] are primitive, then so is f g.

Proof. We let

f = a

+ a

X + ··· + a

g = b

+ b

X + ··· + b

where

, b

= 0, and

f, g

are primitive. We want to show that the content of

fg is a unit.

Now suppose

is not primitive. Then

(

) is not a unit. Since

is a

UFD, we can find an irreducible p which divides c(fg).

By assumption,

(

) and

(

) are units. So

p - c

(

) and

p - c

(

). So suppose

p | a

, . . . ,

p | a

k−1

but

p - a

. Note it is possible that

= 0. Similarly,

suppose p | b

, p | b

, ··· , p | b

`−1

, p - b

We look at the coefficient of X

k+`

in f g. It is given by

i+j=k+`

= a

k+`

+ ··· + a

k+1

`−1

+ a

k−1

`+1

+ ··· + a

`+k

By assumption, this is divisible by p. So

p |

i+j=k+`

However, the terms

k+`

···

k+1

`−1

, is divisible by

, as

p | b

for

j < `

Similarly,

k−1

`+1

···

`+k

is divisible by

. So we must have

p | a

is irreducible, and hence prime, we must have

p | a

p | b

. This is a

contradiction. So c(f g) must be a unit.

Corollary. Let

be a UFD. Then for

f, g ∈ R

[

], we have that

(

) is an

associate of c(f)c(g).

Again, we cannot say they are equal, since content is only well-defined up to

a unit.

Proof.

We can write

(

)

and

(

)

, with

and

primitive. Then

fg = c(f)c(g)f

Since

is primitive, so

(

)

(

) is a gcd of the coefficients of

, and so is

c(fg), by definition. So they are associates.

Finally, we can prove Gauss’ lemma.

Lemma (Gauss’ lemma). Let

be a UFD, and

f ∈ R

[

] be a primitive

polynomial. Then

is reducible in

[

] if and only if

is reducible

[

], where

F is the field of fractions of R.

Proof.

We will show that a primitive

f ∈ R

[

] is reducible in

[

] if and only

if f is reducible in F [X].

One direction is almost immediately obvious. Let

be a product in

[

] with

g, h

not units. As

is primitive, so are

and

. So both have degree

> 0. So g, h are not units in F [X]. So f is reducible in F [X].

The other direction is less obvious. We let

[

], with

g, h

not units.

and

have degree

0, since

is a field. So we can clear denominators

by finding

a, b ∈ R

such that (

)

(

)

∈ R

[

] (e.g. let

be the product of

denominators of coefficients of g). Then we get

abf = (ag)(bh),

and this is a factorization in

[

]. Here we have to be careful — (

) is one

thing that lives in

[

], and is not necessarily a product in

[

], since

might

not be in R[X]. So we should just treat it as a single symbol.

We now write

(ag) = c(ag)g

(bh) = c(bh)h

where g

, h

are primitive. So we have

ab = c(abf) = c((ag)(bh)) = u ·c(ag)c(bh),

where u ∈ R is a unit, by the previous corollary. But also we have

abf = c(ag)c(gh)g

= u

−1

abg

So cancelling ab gives

f = u

−1

∈ R[X].

So f is reducible in R[X].

If this looks fancy and magical, you can try to do this explicitly in the case

where R = Z and F = Q. Then you will probably get enlightened.

We will do another proof performed in a similar manner.

Proposition. Let

be a UFD, and

be its field of fractions. Let

g ∈ R

[

] be

primitive. We let

J = (g) C R[X], I = (g) C F [X].

Then

J = I ∩ R[X].

In other words, if

f ∈ R

[

] and we can write it as

, with

h ∈ F

[

], then

in fact h ∈ R[X].

Proof.

The strategy is the same — we clear denominators in the equation

and then use contents to get that down in R[X].

We certainly have J ⊆ I ∩ R[X]. Now let f ∈ I ∩ R[X]. So we can write

f = gh,

with h ∈ F [X]. So we can choose b ∈ R such that bh ∈ R[X]. Then we know

bf = g(bh) ∈ R[X].

We let

(bh) = c(bh)h

for h

∈ R[X] primitive. Thus

bf = c(bh)gh

Since

is primitive, so is

. So

(

) =

(

) for

a unit. But

is really a

product in R[X]. So we have

c(bf) = c(b)c(f) = bc(f).

So we have

bf = ubc(f)gh

Cancelling b gives

f = g(uc(f)h

So g | f in R[X]. So f ∈ J.

From this we can get ourselves a large class of UFDs.

Theorem. If R is a UFD, then R[X] is a UFD.

In particular, if R is a UFD, then R[X

, ··· , X

] is also a UFD.

Proof.

We know

[

] has a notion of degree. So we will combine this with the

fact that R is a UFD.

Let

f ∈ R

[

]. We can write

(

)

, with

primitive. Firstly, as

is a

UFD, we may factor

c(f) = p

···p

for

∈ R

irreducible (and also irreducible in

[

]). Now we want to deal with

If f

is not irreducible, then we can write

= f

with

, f

both not units. Since

is primitive,

, f

also cannot be constants.

So we must have

deg f

, deg f

0. Also, since

deg f

, we must

have

deg f

, deg f

< deg f

. If

, f

are irreducible, then done. Otherwise, keep

on going. We will eventually stop since the degrees have to keep on decreasing.

So we can write it as

= q

···q

with q

irreducible. So we can write

f = p

···p

···q

a product of irreducibles.

For uniqueness, we first deal with the p’s. We note that

c(f) = p

···p

is a unique factorization of the content, up to reordering and associates, as

a UFD. So cancelling the content, we only have to show that primitives can be

factored uniquely.

Suppose we have two factorizations

= q

···q

= r

···r

Note that each

and each

is a factor of the primitive polynomial

, so are

also primitive. Now we do (maybe) the unexpected thing. We let

be the

field of fractions of

, and consider

, r

∈ F

[

]. Since

is a field,

[

] is

a Euclidean domain, hence principal ideal domain, hence unique factorization

domain.

By Gauss’ lemma, since the

and

are irreducible in

[

], they are also

irreducible in

[

]. As

[

] is a UFD, we find that

, and after reordering,

and q

are associates, say

= u

with

∈ F

[

] a unit. What we want to say is that

is a unit times

[

Firstly, note that u

∈ F as it is a unit. Clearing denominators, we can write

= b

∈ R[X].

Taking contents, since

, q

are primitives, we know

and

are associates, say

= v

with

∈ R

a unit. Cancelling

on both sides, we know

as required.

The key idea is to use Gauss’ lemma to say the reducibility in

[

] is the

same as reducibility in

[

], as long as we are primitive. The first part about

contents is just to turn everything into primitives.

Note that the last part of the proof is just our previous proposition. We

could have applied it, but we decide to spell it out in full for clarity.

Example. We know

[

] is a UFD, and if

is a UFD, then

[

, ··· , X

] is

also a UFD.

This is a useful thing to know. In particular, it gives us examples of UFDs

that are not PIDs. However, in such rings, we would also like to have an easy to

determine whether something is reducible. Fortunately, we have the following

criterion:

Proposition (Eisenstein’s criterion). Let R be a UFD, and let

f = a

+ a

X + ··· + a

∈ R[X]

be primitive with a

6= 0. Let p ∈ R be irreducible (hence prime) be such that

(i) p - a

;

(ii) p | a

for all 0 ≤ i < n;

(iii) p

- a

Then

is irreducible in

[

], and hence in

[

] (where

is the field of fractions

of R).

It is important that we work in

[

] all the time, until the end where we

apply Gauss’ lemma. Otherwise, we cannot possibly apply Eisenstein’s criterion

since there are no primes in F .

Proof. Suppose we have a factorization f = gh with

g = r

+ r

X + ··· + r

h = s

+ s

X + ··· + s

for r

, s

6= 0.

We know

. Since

p - a

, so

p - r

and

p - s

. We can also look at

bottom coefficients. We know

. We know

p | a

and

- a

. So

divides exactly one of r

and s

. wlog, p | r

and p - s

Now let j be such that

p | r

, p | r

, ··· , p | r

j−1

, p - r

We now look at a

. This is, by definition,

= r

+ r

j−1

+ ··· + r

j−1

+ r

We know r

, ··· , r

j−1

are all divisible by p. So

p | r

+ r

j−1

+ ··· + r

j−1

Also, since

p - r

and

p - s

, we know

p - r

, using the fact that

is prime. So

p - a

. So we must have j = n.

We also know that

j ≤ k ≤ n

. So we must have

. So

deg g

Hence

n − h

= 0. So

is a constant. But we also know

is primitive. So

must be a unit. So this is not a proper factorization.

Example. Consider the polynomial

− p ∈ Z

[

] for

a prime. Apply

Eisenstein’s criterion with

, and observe all the conditions hold. This is

certainly primitive, since this is monic. So

− p

is irreducible in

[

], hence

[

]. In particular,

− p

has no rational roots, i.e.

√

is irrational (for

n > 1).

Example. Consider a polynomial

f = X

p−1

+ X

p−2

+ ··· + X

+ X + 1 ∈ Z[X],

where

is a prime number. If we look at this, we notice Eisenstein’s criteria

does not apply. What should we do? We observe that

f =

− 1

X −1

So it might be a good idea to let Y = X −1. Then we get a new polynomial

f =

f(Y ) =

(Y + 1)

− 1

= Y

p−1





p−2





p−3

+ ··· +



p − 1



When we look at it hard enough, we notice Eisenstein’s criteria can be applied —

we know

p |





for 1

≤ i ≤ p −

1, but



p−1



. So

is irreducible in

[

Now if we had a factorization

f(X) = g(X)h(X) ∈ Z[X],

then we get

f(Y ) = g(Y + 1)h(Y + 1)

in Z[Y ]. So f is irreducible.

Hence none of the roots of

are rational (but we already know that — they

are not even real!).