II Galois Theory - Field extensions

2Field extensions

II Galois Theory

2.6 Separable extensions

Here we will define what it means for an extension to be separable. This is

done via defining separable polynomials, and then an extension is separable if

all minimal polynomials are separable.

At first, the definition of separability might seem absurd — surely every

polynomial should be separable. Indeed, polynomials that are not separable

tend to be weird, and our theories often break without separability. Hence it is

important to figure out when polynomials are separable, and when they are not.

Fortunately, we will end up with a result that tells us exactly when a polynomial

is not separable, and this is just a very small, specific class. In particular, in

fields of characteristic zero, all polynomials are separable.

Definition (Separable polynomial). Let

be a field,

f ∈ K

[

] non-zero, and

a splitting field of

. For an irreducible

, we say it is separable if

has no

repeated roots, i.e.

|Root

(

)

deg f

. For a general polynomial

, we say it is

separable if all its irreducible factors in K[t] are separable.

It should be obvious from definition that if

is separable and

Q | P

, then

is also separable.

Note that some people instead define a separable polynomial to be one with

no repeated roots, so (

x−

over

would not be separable under this definition.

Example. Any linear polynomial t − a (with a ∈ K) is separable.

This is, however, not a very interesting example. To get to more interesting

examples, we need even more preparation.

Definition (Formal derivative). Let

be a field,

f ∈ K

[

]. (Formal) differenti-

ation the K-linear map K[t] → K[t] defined by t

7→ nt

n−1

The image of a polynomial f is the derivative of f , written f

′

This is similar to how we differentiate real or complex polynomials (in case

that isn’t obvious).

The following lemma summarizes the properties of the derivative we need.

Lemma. Let K be a field, f, g ∈ K[t]. Then

(i) (f + g)

′

= f

′

+ g

′

, (fg)

′

= fg

′

+ f

′

(ii)

Assume

f 

= 0 and

is a splitting field of

. Then

has a repeated root in

if and only if

and

′

have a common (non-constant) irreducible factor

in K[t] (if and only if f and f

′

have a common root in L).

This will allow us to show when irreducible polynomials are separable.

Proof.

(i) (f + g)

′

= f

′

+ g

′

is true by linearity.

To show that (

)

′

, we use linearity to reduce to the case

where

, g

. Then both sides are (

)

n+m−1

. So this holds.

(ii)

First assume that

has a repeated root. So let

= (

t −α

)

h ∈ L

[

] where

α ∈ L

. Then

′

= 2(

t − α

)

+ (

t − α

)

′

= (

t − α

)(2

+ (

t − α

)

′

). So

(

) =

′

(

) = 0. So

and

′

have common roots. However, we want a

common irreducible factor in

[

], not

[

]. So we let

be the minimal

polynomial of α over K. Then P

| f and P

| f

′

. So done.

Conversely, suppose

is a common irreducible factor of

and

′

[

with deg e > 0. Pick α ∈ Root

(L). Then α ∈ Root

(L) ∩Root

′

(L).

Since α is a root of f, we can write f = (t −α)q ∈ L[t] for some q. Then

′

= (t −α)q

′

+ q.

Since (t −α) | f

′

, we must have (t − α) | q. So (t −α)

| f.

Recall that the characteristic of a field

char K

is the minimum

such that

p ·

= 0. If no such

exists, we say

char K

= 0. For example,

has

characteristic 0 while Z

has characteristic p.

Corollary. Let K be a field, f ∈ K[t] non-zero irreducible. Then

(i) If char K = 0, then f is separable.

(ii)

char K

p >

0, then

is not separable iff

deg f >

0 and

f ∈ K

[

]. For

example, t

+ 3t

+ 1 is not separable.

Proof.

By definition, for irreducible

is not separable iff

has a repeated

root. So by our previous lemma,

is not separable if and only if

and

′

have a common irreducible factor of positive degree in

[

]. However, since

irreducible, its only factors are 1 and itself. So this can happen if and only if

′

= 0.

To make it more explicit, we can write

f = a

+ ··· + a

t + a

Then we can write

′

= na

n−1

+ ··· + a

Now f

′

= 0 if and only if all coefficients ia

= 0 for all i.

(i)

Suppose

char K

= 0, then if

deg f

= 0, then

is trivially separable. If

deg f >

0, then

is not separable iff

′

= 0 iff

= 0 for all

iff

= 0

for

i ≥

1. But we cannot have a polynomial of positive degree with all its

coefficients zero (apart from the constant term). So f must be separable.

(ii) If deg f = 0, then f is trivially separable. So assume deg f > 0.

Then

is not separable

⇔ f

′

= 0

⇔ ia

= 0 for

i ≥

⇔ a

= 0 for all

i ≥ 1 not multiples of p ⇔ f ∈ K[t

Using this, it should be easy to find lots of examples of separable polynomials.

Definition (Separable elements and extensions). Let

K ⊆ L

be an algebraic

field extension. We say

α ∈ L

is separable over

is separable, where

is the minimal polynomial of α over K.

We say

is separable over

(or

K ⊆ L

is separable) if all

α ∈ L

are

separable.

Example.

–

The extensions

Q ⊆ Q

(

√

) and

R ⊆ C

are separable because

char Q

char R = 0. So we can apply our previous corollary.

–

Let

(

) be the field of rational functions in

over

(which is the

fraction field of

[

]), and

(

). We have

K ⊆ L

, and

(

Since

∈ K

is a root of

− s

∈ K

[

]. So

is algebraic over

and

hence

is algebraic over

. In fact

−s

is the minimal polynomial

of s over K.

Now

−s

= (

t−s

)

since the field has characteristic

. So

Root

−s

(

) =

{s}. So P

is not separable.

As mentioned in the beginning, separable extensions are nice, or at least

non-weird. One particular nice result about separable extensions is that all finite

separable extensions are simple, i.e. if

K ⊆ L

is finite separable, then

(

)

for some

α ∈ L

. This is what we will be working towards for the remaining of

the section.

Example. Consider

Q ⊆ Q

(

√

). This is a separable finite extension. So

we should be able to generate

(

√

) by just one element, not just two. In

fact, we can use α =

√

2 +

√

3, since we have

= 11

√

2 + 9

√

3 = 2

√

2 + 9α.

So since α

∈ Q(α), we know that

√

2 ∈ Q(α). So we also have

√

3 ∈ Q(α).

In general, it is not easy to find an

that works, but we our later result will

show that such an α exists.

Before that, we will prove some results about the K-homomorphisms.

Lemma. Let

L/F/K

be finite extensions, and

E/K

be a field extension. Then

for all α ∈ L, we have

|Hom

(F (α), E)| ≤ [F (α) : F ]|Hom

(F, E)|.

Note that if

is the minimal polynomial of

over

, then [

(

) :

] =

deg P

. So we can interpret this intuitively as follows: for each

ψ ∈ Hom

(

F, E

we can obtain a

-homomorphism in

Hom

(

)

, E

) by sending things in

according to

, and then send

to any root of

. Then there are at

most [

(

) :

]

-homomorphisms generated this way. Moreover, each

homomorphism in

Hom

(

)

, E

) can be created this way. So we get this

result.

Proof.

We show that for each

ψ ∈ Hom

(

F, E

), there are at most [

(

) :

]

-isomorphisms in

Hom

(

)

, E

) that restrict to

. Since each

isomorphism in

Hom

(

)

, E

) has to restrict to something, it follows that

there are at most [

(

) :

]

|Hom

(

F, E

)

| K

-homomorphisms from

(

) to

Now let

be the minimal polynomial for

, and let

ψ ∈ Hom

(

F, E

To extend ψ to a morphism F (α) → E, we need to decide where to send α. So

there should be some sort of correspondence

Root

(E) ←→ {ϕ ∈ Hom

(F (α), E) : ϕ|

= ψ}.

Except that the previous sentence makes no sense, since

∈ F

[

] but we are

not told that F is a subfield of E. So we use our ψ to “move” our things to E.

We let

(

)

⊆ E

, and

q ∈ M

[

] be the image of

under the

homomorphism

[

]

→ M

[

] induced by

. As we have previously shown, there

is a one-to-one correspondence

Root

(E) ←→ Hom

(M[t]/⟨q⟩, E).

What we really want to show is the correspondence between

Root

(

) and the

-homomorphisms

[

]

/⟨P

⟩ → E

that restrict to

. Let’s ignore the

quotient for the moment and think: what does it mean for

ϕ ∈ Hom

(

[

]

, E

) to

restrict to

? We know that any

ϕ ∈ Hom

(

[

]

, E

) is uniquely determined

by the values it takes on

and

. Hence if

ϕ|

, then our

must send

(

) =

, and can send

to anything in

. This corresponds exactly to

the

-homomorphisms

[

]

→ E

that does nothing to

and sends

to that

“anything” in E.

The situation does not change when we put back the quotient. Changing

from

[

]

→ E

[

]

/⟨q⟩ → E

just requires that the image of

must be

a root of

. On the other hand, using

[

]

/⟨P

⟩

instead of

[

] requires that

(

)) = 0. But we know that

(

) =

(

) =

. So this just requires

q(t) = 0 as well. So we get the one-to-one correspondence

Hom

(M[t]/⟨q⟩, E) ←→ {ϕ ∈ Hom

(F [t]/⟨P

⟩, E) : ϕ|

= ψ}.

Since F [t]/⟨P

⟩ = F (α), there is a one-to-one correspondence

Root

(E) ←→ {ϕ ∈ Hom

(F (α), E) : ϕ|

= ψ}.

So done.

Theorem. Let L/K and E/K be field extensions. Then

(i) |Hom

(L, E)| ≤ [L : K]. In particular, |Aut

(L)| ≤ [L : K].

(ii) If equality holds in (i), then for any intermediate field K ⊆ F ⊆ L:

(a) We also have |Hom

(F, E)| = [F : K].

(b) The map Hom

(L, E) → Hom

(F, E) by restriction is surjective.

Proof.

(i) We have previously shown we can find a sequence of field extensions

K = F

⊆ F

⊆ ··· ⊆ F

= L

such that for each

, there is some

such that

i−1

(

). Then by

our previous lemma, we have

|Hom

(L, E)| ≤ [F

: F

n−1

]|Hom

n−1

, E)|

≤ [F

: F

n−1

][F

n−1

: F

n−2

]|Hom

n−2

, E)|

≤ [F

: F

n−1

][F

n−1

: F

n−2

] ···[F

: F

]|Hom

, E)|

= [F

: F

]

= [L : K]

(ii) (a)

If equality holds in (i), then every inequality in the proof above has

to an equality. Instead of directly decomposing

K ⊆ L

as a chain

above, we can first decompose

K ⊆ F

, then

F ⊆ L

, then join them

together. Then we can assume that F = F

for some i. Then we get

|Hom

(L, E)| = [L : F ]|Hom

(F, E)| = [L : K].

Then the tower law says

|Hom

(F, E)| = [F : K].

(b)

By the proof of the lemma, for each

ψ ∈ Hom

(

F, E

), we know that

{ϕ : Hom

(L, E) : ϕ|

= ψ} ≤ [L : F ]. (∗)

As we know that

|Hom

(F, E)| = [F : K], |Hom

(L, E)| = [L : K]

we must have had equality in (

∗

), or else we won’t have enough

elements. So in particular

{ϕ

Hom

(

L, E

) :

ϕ|

ψ} ≥

1. So the

map is surjective.

With this result, we can prove prove the following result characterizing

separable extensions.

Theorem. Let

L/K

be a finite field extension. Then the following are equivalent:

(i) There is some extension E of K such that |Hom

(L, E)| = [L : K].

(ii) L/K is separable.

(iii) L

(

, ··· , α

) such that

, the minimal polynomial of

over

is separable for all i.

(iv) L

(

, ··· , α

) such that

, the minimal polynomial of

over

K(α

, ··· , α

i−1

) is separable for all i.

Proof.

–

(i)

⇒

(ii): For all

α ∈ L

, if

is the minimal polynomial of

over

then since K(α) is a subfield of L, by our previous theorem, we have

|Hom

(K(α), E)| = [K(α) : K].

We also know that

|Root

(

)

|Hom

(

)

, E

)

, and that [

(

) :

] =

deg P

. So we know that

has no repeated roots in any splitting

field. So P

is a separable. So L/K is a separable extension.

– (ii) ⇒ (iii): Obvious from definition

–

(iii)

⇒

(iv): Since

is a minimal polynomial in

(

, ··· , α

i−1

), we

know that R

| P

. So R

is separable as P

is separable.

–

(iv)

⇒

(i): Let

be the splitting field of

, ··· , P

. We do induction

to show that this satisfies the properties we want. If

= 1, then

L = K(α

). Then we have

|Hom

(L, E)| = |Root

(E)| = deg P

= [K(α

) : K] = [L : K].

We now induct on

. So we can assume that (iv)

⇒

(i) holds for smaller

number of generators. For convenience, we write

(

, ··· , α

Then we have

|Hom

n−1

, E)| = [K

n−1

: K].

We also know that

|Hom

, E)| ≤ [K

: K

n−1

]|Hom

n−1

, E)|.

What we actually want is equality. We now re-do (parts of) the proof of

this result, and see that separability guarantees that equality holds. If

we pick

ψ ∈ Hom

(

n−1

, E

), then there is a one-to-one correspondence

between

{ϕ ∈ Hom

(

, E

) :

ϕ|

n−1

ψ}

and

Root

(

), where

q ∈ M

[

]

is defined as the image of

under

n−1

[

]

→ M

[

], and

is the image

of ψ.

Since

∈ K

[

] and

| P

, then

q | P

. So

splits over

. By

separability assumption , we get that

|Root

(E)| = deg q = deg R

= [K

: K

n−1

Hence we know that

|Hom

(L, E)| = [K

: K

n−1

]|Hom

n−1

, E)|

= [K

: K

n−1

][K

n−1

: K]

= [K

: K].

So done.

Before we finally get to the primitive element theorem, we prove the following

lemma. This will enable us to prove the trivial case of the primitive element

theorem, and will also be very useful later on.

Lemma. Let

be a field,

∗

L \{

}

be the multiplicative group of

. If

is a finite subgroup of L

∗

, then G is cyclic.

Proof.

Since

∗

is abelian,

is also abelian. Then by the structure theorem on

finite abelian groups,

∼

⟨n

⟩

× ··· ×

⟨n

⟩

for some

∈ N

. Let

be the least common multiple of

, ··· , n

, and let

f = t

− 1.

If α ∈ G, then α

= 1. So f(α) = 0 for all α ∈ G. Therefore

|G| = n

···n

≤ |Root

(L)| ≤ deg f = m.

Since

is the least common multiple of

, ··· , n

, we must have

···n

and thus (

, n

) = 1 for all

i 

. Then by the Chinese remainder theorem, we

have

∼

⟨n

⟩

× ··· ×

⟨n

⟩

⟨n

···n

⟩

So G is cyclic.

We now come to the main theorem of the lecture:

Theorem (Primitive element theorem). Assume

L/K

is a finite and separable

extension. Then L/K is simple, i.e. there is some α ∈ L such that L = K(α).

Proof.

At some point in our proof, we will require that

is infinite. So we

first do the finite case first. If

is finite, then

is also finite, which in turns

implies

∗

is finite too. So by the lemma,

∗

is a cyclic group (since it is a finite

subgroup of itself). So there is some

α ∈ L

∗

such that every element in

∗

is a

power of α. So L = K(α).

So focus on the case where

is infinite. Also, assume

K 

. Then since

L/K

is a finite extension, there is some intermediate field

K ⊆ F ⊊ L

such that

(

) for some

. Now

L/K

is separable. So

F/K

is also separable, and

[

]

[

]. Then by induction on degree of extension, we can assume

F/K

is simple. In other words, there is some

λ ∈ F

such that

(

). Now

L = K(λ, β). In the rest of the proof, we will try to replace the two generators

λ, β with just a single generator.

Unsurprisingly, the generator of

will be chosen to be a linear combination

of β and λ. We set

α = β + aλ

for some

a ∈ K

to be chosen later. We will show that

(

) =

. Actually,

almost any choice of

will do, but at the end of the proof, we will see which

ones are the bad ones.

Let

and

be the minimal polynomial of

and

over

respectively.

Consider the polynomial f = P

(α −at) ∈ K(α)[t]. Then we have

f(λ) = P

(α −aλ) = P

(β) = 0.

On the other hand, P

(λ) = 0. So λ is a common root of P

and f.

We now want to pick an

such that

is the only common root of

and

(in

). If so, then the gcd of

and

(

) must only have

as a root.

But since

is separable, it has no double roots. So the gcd must be

t −λ

. In

particular, we must have

λ ∈ K

(

). Since

aλ

, it follows that

β ∈ K

(

)

as well, and so K(α) = L.

Thus, it remains to choose an

such that there are no other common roots.

We work in a splitting field of P

, and write

= (t −β

) ···(t −β

)

= (t −λ

) ···(t −λ

We wlog β

= β and λ

= λ.

Now suppose θ is a common root of f and P

. Then

(

f(θ) = 0

(θ) = 0

⇒

(

(α −aθ) = 0

(θ) = 0

⇒

(

α −aθ = β

θ = λ

for some i, j. Then we know that

α = β

+ aλ

However, by definition, we also know that

α = β + aλ

Now we see how we need to choose

. We need to choose

such that the elements

β + aλ = β

+ aλ

for all i, j. But if they were equal, then we have

a =

λ −λ

− β

and there are only finitely many elements of this form. So we just have to pick

an a not in this list.

Corollary. Any finite extension

L/K

of field of characteristic 0 is simple, i.e.

L = K(α) for some α ∈ L.

Proof.

This follows from the fact that all extensions of fields of characteristic

zero are separable.

We have previously seen that

(

√

)

is a simple extension, but that

is of course true from this theorem. A more interesting example would be one in

which this fails. We will need a field with non-zero characteristic.

Example. Let

(

s, u

), the fraction field of

[

s, u

]. Let

(

, u

We have L/K. We want to show this is not simple.

α ∈ L

, then

∈ K

. So

is a root of

− α

∈ K

[

]. Thus the minimal

polynomial

has degree at most

. So [

(

) :

] =

deg P

≤ p

. On the other

hand, we have [

] =

, since

: 0

≤ i, j < p}

is a basis. So for any

we have

(

)



. So

L/K

is not a simple extension. This then implies

L/K

is not separable.

At this point, one might suspect that all fields with positive characteristic

are not separable. This is not true by considering a rather silly example.

Example. Consider

and

[

]

/⟨s

+ 1

⟩

. We can check manually

that

+ 1 has no roots and hence irreducible. So

is a field. So

L/F

is a

finite extension. Note that L only has 4 elements.

Now if

α ∈ L \ F

, and

is the minimal polynomial of

over

, then

| t

+ t + 1. So P

is separable as a polynomial. So L/F

is separable.

In fact, we have

Proposition. Let

L/K

be an extension of finite fields. Then the extension is

separable.

Proof.

Let the characteristic of the fields be

. Suppose the extension were not

separable. Then there is some non-separable element

α ∈ L

. Then its minimal

polynomial must be of the form P

Now note that the map

K → K

given by

x 7→ x

is injective, hence surjective.

So we can write a

= b

for all i. Then we have





and so P

is not irreducible, which is a contradiction.