IB Groups, Rings and Modules

2Rings

2.2 Homomorphisms, ideals, quotients and isomorphisms

Just like groups, we will come up with analogues of homomorphisms, normal

subgroups (which are now known as ideals), and quotients.

Definition (Homomorphism of rings). Let

R, S

be rings. A function

R → S

is a ring homomorphism if it preserves everything we can think of, i.e.

(i) φ(r

+ r

) = φ(r

) + φ(r

(ii) φ(0

) = 0

(iii) φ(r

· r

) = φ(r

) · φ(r

(iv) φ(1

) = 1

Definition (Isomorphism of rings). If a homomorphism

R → S

is a bijection,

we call it an isomorphism.

Definition (Kernel). The kernel of a homomorphism φ : R → S is

ker(φ) = {r ∈ R : φ(r) = 0

Definition (Image). The image of φ : R → S is

im(φ) = {s ∈ S : s = φ(r) for some r ∈ R}.

Lemma. A homomorphism φ : R → S is injective if and only if ker φ = {0

Proof.

A ring homomorphism is in particular a group homomorphism

(

)

→

(

) of abelian groups. So this follows from the case of

groups.

In the group scenario, we had groups, subgroups and normal subgroups,

which are special subgroups. Here, we have a special kind of subsets of a ring

that act like normal subgroups, known as ideals.

Definition (Ideal). A subset I ⊆ R is an ideal, written I C R, if

(i)

It is an additive subgroup of (

), i.e. it is closed under addition and

additive inverses. (additive closure)

(ii) If a ∈ I and b ∈ R, then a · b ∈ I. (strong closure)

We say I is a proper ideal if I 6= R.

Note that the multiplicative closure is stronger than what we require for

subrings — for subrings, it has to be closed under multiplication by its own

elements; for ideals, it has to be closed under multiplication by everything in

the world. This is similar to how normal subgroups not only have to be closed

under internal multiplication, but also conjugation by external elements.

Lemma. If φ : R → S is a homomorphism, then ker(φ) C R.

Proof.

Since

: (

)

→

(

) is a group homomorphism, the kernel is

a subgroup of (R, +, 0

For the second part, let

a ∈ ker

(

b ∈ R

. We need to show that their

product is in the kernel. We have

φ(a · b) = φ(a) · φ(b) = 0 · φ(b) = 0.

So a · b ∈ ker(φ).

Example. Suppose

I C R

is an ideal, and 1

∈ I

. Then for any

r ∈ R

, the

axioms entail 1

· r ∈ I. But 1

· r = r. So if 1

∈ I, then I = R.

In other words, every proper ideal does not contain 1. In particular, every

proper ideal is not a subring, since a subring must contain 1.

We are starting to diverge from groups. In groups, a normal subgroup is a

subgroup, but here an ideal is not a subring.

Example. We can generalize the above a bit. Suppose

I C R

and

u ∈ I

is a

unit, i.e. there is some

v ∈ R

such that

u · v

= 1

. Then by strong closure,

= u · v ∈ I. So I = R.

Hence proper ideals are not allowed to contain any unit at all, not just 1

Example. Consider the ring

of integers. Then every ideal of

is of the form

nZ = {··· , −2n, −n, 0, n, 2n, ···} ⊆ Z.

It is easy to see this is indeed an ideal.

To show these are all the ideals, let

I C Z

. If

{

}

, then

= 0

. Otherwise,

let

n ∈ N

be the smallest positive element of

. We want to show in fact

Certainly nZ ⊆ I by strong closure.

Now let m ∈ I. By the Euclidean algorithm, we can write

m = q · n + r

with 0

≤ r < n

. Now

n, m ∈ I

. So by strong closure,

m, q · n ∈ I

. So

m −q ·n ∈ I

. As

is the smallest positive element of

, and

r < n

, we must

have r = 0. So m = q · n ∈ nZ. So I ⊆ nZ. So I = nZ.

The key to proving this was that we can perform the Euclidean algorithm on

. Thus, for any ring

in which we can “do Euclidean algorithm”, every ideal

is of the form

{a · r

r ∈ R}

for some

a ∈ R

. We will make this notion

precise later.

Definition (Generator of ideal). For an element a ∈ R, we write

(a) = aR = {a · r : r ∈ R} C R.

This is the ideal generated by a.

In general, let a

, a

, ··· , a

∈ R, we write

, a

, ··· , a

) = {a

+ ··· + a

: r

, ··· , r

∈ R}.

This is the ideal generated by a

, ··· , a

We can also have ideals generated by infinitely many objects, but we have to

be careful, since we cannot have infinite sums.

Definition (Generator of ideal). For

A ⊆ R

a subset, the ideal generated by

(A) =

(

a∈A

· a : r

∈ R, only finitely-many non-zero

)

These ideals are rather nice ideals, since they are easy to describe, and often

have some nice properties.

Definition (Principal ideal). An ideal

is a principal ideal if

= (

) for some

a ∈ R.

So what we have just shown for

is that all ideals are principal. Not all

rings are like this. These are special types of rings, which we will study more in

depth later.

Example. Consider the following subset:

{f ∈ R[X] : the constant coefficient of f is 0}.

This is an ideal, as we can check manually (alternatively, it is the kernel of the

“evaluate at 0” homomorphism). It turns out this is a principal ideal. In fact, it

is (X).

We have said ideals are like normal subgroups. The key idea is that we can

divide by ideals.

Definition (Quotient ring). Let

I C R

. The quotient ring

R/I

consists of the

(additive) cosets

with the zero and one as 0

and 1

, and operations

+ I) + (r

+ I) = (r

+ r

) + I

+ I) ·(r

+ I) = r

+ I.

Proposition. The quotient ring is a ring, and the function

R → R/I

r 7→ r + I

is a ring homomorphism.

This is true, because we defined ideals to be those things that can be

quotiented by. So we just have to check we made the right definition.

Just as we could have come up with the definition of a normal subgroup by

requiring operations on the cosets to be well-defined, we could have come up

with the definition of an ideal by requiring the multiplication of cosets to be

well-defined, and we would end up with the strong closure property.

Proof.

We know the group (

R/I,

R/I

) is well-defined, since

is a (normal)

subgroup of R. So we only have to check multiplication is well-defined.

Suppose

and

. Then

− r

∈ I

and

− r

= a

∈ I. So

= (r

+ a

)(r

+ a

) = r

+ r

+ a

By the strong closure property, the last three objects are in

. So

+ I.

It is easy to check that 0

and 1

are indeed the zero and one, and

the function given is clearly a homomorphism.

Example. We have the ideals

nZ C Z

. So we have the quotient rings

Z/nZ

The elements are of the form m + nZ, so they are just

0 + nZ, 1 + nZ, 2 + nZ, ··· , (n − 1) + nZ.

Addition and multiplication are just what we are used to — addition and

multiplication modulo n.

Note that it is easier to come up with ideals than normal subgroups — we

can just pick up random elements, and then take the ideal generated by them.

Example. Consider (

)

C C

[

]. What is

[

]

(

)? Elements are represented

+ a

X + a

+ ··· + a

+ (X).

But everything but the first term is in (

). So every such thing is equivalent to

+ (

). It is not hard to convince yourself that this representation is unique.

So in fact C[X]/(X)

∼

C, with the bijection a

+ (X) ↔ a

If we want to prove things like this, we have to convince ourselves this

representation is unique. We can do that by hand here, but in general, we want

to be able to do this properly.

Proposition (Euclidean algorithm for polynomials). Let

be a field and

f, g ∈ F[X]. Then there is some r, q ∈ F[X] such that

f = gq + r,

with deg r < deg g.

This is like the usual Euclidean algorithm, except that instead of the absolute

value, we use the degree to measure how “big” the polynomial is.

Proof. Let deg(f) = n. So

f =

i=0

and a

6= 0. Similarly, if deg g = m, then

g =

i=0

with b

6= 0. If n < m, we let q = 0 and r = f, and done.

Otherwise, suppose n ≥ m, and proceed by induction on n.

We let

= f − a

−1

n−m

This is possible since

= 0, and

is a field. Then by construction, the

coefficients of X

cancel out. So deg(f

) < n.

If n = m, then deg(f

) < n = m. So we can write

f = (a

−1

n−m

)g + f

and

deg

(

)

< deg

(

). So done. Otherwise, if

n > m

, then as

deg

(

)

< n

, by

induction, we can find r

, q

such that

= gq

+ r

and deg(r

) < deg g = m. Then

f = a

−1

n−m

g + q

g + r

= (a

−1

n−m

+ q

)g + r

So done.

Now that we have a Euclidean algorithm for polynomials, we should be able

to show that every ideal of

[

] is generated by one polynomial. We will not

prove it specifically here, but later show that in general, in every ring where the

Euclidean algorithm is possible, all ideals are principal.

We now look at some applications of the Euclidean algorithm.

Example. Consider

[

], and consider the principal ideal (

+ 1)

C R

[

We let R = R[X]/(X

+ 1).

Elements of R are polynomials

+ a

X + a

+ ··· + a

| {z }

+(X

+ 1).

By the Euclidean algorithm, we have

f = q(X

+ 1) + r,

with

deg

(

)

2, i.e.

. Thus

+ (

+ 1) =

+ (

+ 1). So every

element of R[X]/(X

+ 1) is representable as a + bX for some a, b ∈ R.

Is this representation unique? If

+ (

+ 1) =

+ (

+ 1),

then the difference (

a −a

) + (

b −b

)

X ∈

(

+ 1). So it is (

+ 1)

for some

This is possible only if

= 0, since for non-zero

, we know (

+ 1)

has degree

at least 2. So we must have (

a − a

) + (

b − b

)

= 0. So

. So

the representation is unique.

What we’ve got is that every element in

is of the form

, and

+ 1 = 0, i.e.

−

1. This sounds like the complex numbers, just that we

are calling it X instead of i.

To show this formally, we define the function

φ : R[X]/(X

+ 1) → C

a + bX + (X

+ 1) 7→ a + bi.

This is well-defined and a bijection. It is also clearly additive. So to prove this

is an isomorphism, we have to show it is multiplicative. We check this manually.

We have

φ((a + bX + (X

+ 1))(c + dX + (X

+ 1)))

= φ(ac + (ad + bc)X + bdX

+ (X

+ 1))

= φ((ac − bd) + (ad + bc)X + (X

+ 1))

= (ac − bd) + (ad + bc)i

= (a + bi)(c + di)

= φ(a + bX + (X

+ 1))φ(c + dX + (X

+ 1)).

So this is indeed an isomorphism.

This is pretty tedious. Fortunately, we have some helpful results we can use,

namely the isomorphism theorems. These are exactly analogous to those for

groups.

Theorem (First isomorphism theorem). Let

R → S

be a ring homomorphism.

Then ker(φ) C R, and

ker(φ)

∼

im(φ) ≤ S.

Proof. We have already seen ker(φ) C R. Now define

Φ : R/ ker(φ) → im(φ)

r + ker(φ) 7→ φ(r).

This well-defined, since if

ker

(

) =

ker

(

), then

r − r

∈ ker

(

). So

φ(r −r

) = 0. So φ(r) = φ(r

We don’t have to check this is bijective and additive, since that comes for

free from the (proof of the) isomorphism theorem of groups. So we just have to

check it is multiplicative. To show Φ is multiplicative, we have

Φ((r + ker(φ))(t + ker(φ))) = Φ(rt + ker(φ))

= φ(rt)

= φ(r)φ(t)

= Φ(r + ker(φ))Φ(t + ker(φ)).

This is more-or-less the same proof as the one for groups, just that we had a

few more things to check.

Since there is the first isomorphism theorem, we, obviously, have more

coming.

Theorem (Second isomorphism theorem). Let

R ≤ S

and

J C S

. Then

J ∩RC R

and

R + J

= {r + J : r ∈ R} ≤

is a subring, and

R ∩ J

∼

R + J

Proof. Define the function

φ : R → S/J

r 7→ r + J.

Since this is the quotient map, it is a ring homomorphism. The kernel is

ker(φ) = {r ∈ R : r + J = 0, i.e. r ∈ J} = R ∩J.

Then the image is

im(φ) = {r + J : r ∈ R} =

R + J

Then by the first isomorphism theorem, we know

R ∩J C R

, and

R+J

≤ S

, and

R ∩ J

∼

R + J

Before we get to the third isomorphism theorem, recall we had the subgroup

correspondence for groups. Analogously, for I C R,

{subrings of R/I} ←→ {subrings of R which contain I}

L ≤

−→ {x ∈ R : x + I ∈ L}

≤

←− I C S ≤ R.

This is exactly the same formula as for groups.

For groups, we had a correspondence for normal subgroups. Here, we have a

correspondence between ideals

{ideals of R/I} ←→ {ideals of R which contain I}

It is important to note here that quotienting in groups and rings have different

purposes. In groups, we take quotients so that we have simpler groups to work

with. In rings, we often take quotients to get more interesting rings. For example,

[

] is quite boring, but

[

]

(

+ 1)

∼

is more interesting. Thus this ideal

correspondence allows us to occasionally get interesting ideals from boring ones.

Theorem (Third isomorphism theorem). Let

I C R

and

J C R

, and

I ⊆ J

Then J/I C R/I and











∼

Proof. We define the map

φ : R/I → R/J

r + I 7→ r + J.

This is well-defined and surjective by the groups case. Also it is a ring homo-

morphism since multiplication in

R/I

and

R/J

are “the same”. The kernel

ker(φ) = {r + I : r + J = 0, i.e. r ∈ J} =

So the result follows from the first isomorphism theorem.

Note that for any ring

, there is a unique ring homomorphism

Z → R

, given

ι : Z → R

n ≥ 0 7→ 1

+ 1

+ ··· + 1

| {z }

n times

n ≤ 0 7→ −(1

+ 1

+ ··· + 1

| {z }

−n times

)

Any homomorphism

Z → R

must be given by this formula, since it must send the

unit to the unit, and we can show this is indeed a homomorphism by distributivity.

So the ring homomorphism is unique. In fancy language, we say

is the initial

object in (the category of) rings.

We then know ker(ι) C Z. Thus ker(ι) = nZ for some n.

Definition (Characteristic of ring). Let

be a ring, and

Z → R

be the

unique such map. The characteristic of

is the unique non-negative

such

that ker(ι) = nZ.

Example. The rings

Z, Q, R, C

all have characteristic 0. The ring

Z/nZ

has

characteristic n. In particular, all natural numbers can be characteristics.

The notion of the characteristic will not be too useful in this course. How-

ever, fields of non-zero characteristic often provide interesting examples and

counterexamples to some later theory.