II Number Fields - L-functions, Dirichlet series*

8L-functions, Dirichlet series*

II Number Fields

8 L-functions, Dirichlet series*

This section is non-examinable.

We start by proving the exciting fact that there are infinitely many primes.

Theorem (Euclid). There are infinitely many primes.

Proof. Consider the function

p primes



1 −



−1

p prime



1 +

+ ···



n>0

This is since every

···p

factors uniquely as a product of primes, and

each such product appears exactly once in this. If there were finitely many

primes, as

converges to



1 −



−1

, the sum

n≥1

p prime



1 −



must be finite. But the harmonic series diverges. This is a contradiction.

We all knew that. What we now want to prove is something more interesting.

Theorem

(Dirichlet’s theorem)

Let

a, q ∈ Z

be coprime. Then there exists

infinitely many primes in the sequence

a, a + q, a + 2q, ··· ,

i.e. there are infinitely many primes in any such arithmetic progression.

We want to imitate the Euler proof, but then that would amount to showing

that

p≡a mod q

p prime



1 −



−1

is divergent, and there is no nice expression for this. So it will be a lot more

work.

To begin with, we define the Riemann zeta function.

Definition

(Riemann zeta function)

The Riemann zeta function is defined as

ζ(s) =

n≥1

−s

for s ∈ C.

There are some properties we will show (or assert):

Proposition.

(i) The Riemann zeta function ζ(s) converges for Re(s) > 1.

(ii) The function

ζ(s) −

s − 1

extends to a holomorphic function when Re(s) > 0.

In other words,

(

) extends to a meromorphic function on

(

)

0 with

a simple pole at 1 with residue 1.

(iii) We have the expression

ζ(s) =

p prime



1 −



−1

for

(

)

1, and the product is absolutely convergent. This is the Euler

product.

The first part follows from the following general fact about Dirichlet series.

Definition

(Dirichlet series)

A Dirichlet series is a series of the form

−s

where a

, a

, ··· ∈ C.

Lemma. If there is a real number r ∈ R such that

+ ··· + a

= O(N

then

−s

converges for Re(s) > r, and is a holomorphic function there.

Then (i) is immediate by picking

= 1, since in the Riemann zeta function,

= a

= ··· = 1.

Recall that x

= e

s log x

has

| = |x

Re(s)

if x ∈ R, x > 0.

Proof. This is just IA Analysis. Suppose Re(s) > r. Then we can write

n=1

−s

= a

−s

− 2

−s

) + (a

+ a

)(2

−s

− 3

−s

) + ···

+ (a

+ ··· + a

N−1

)((N − 1)

−s

− N

−s

) + R

where

+ ··· + a

This is getting annoying, so let’s write

T (N) = a

+ ··· + a

We know



T (N)



T (N)



Re(s)−r

→ 0

as N → ∞, by assumption. Thus we have

n≥1

−s

n≥1

T (n)(n

−s

− (n + 1)

−s

)

(

)

> r

. But again by assumption,

(

)

≤ B · n

for some constant

and

all n. So it is enough to show that

−s

− (n + 1)

−s

)

converges. But

−s

− (n + 1)

−s

n+1

s+1

dx,

and if x ∈ [n, n + 1], then n

≤ x

. So we have

−s

− (n + 1)

−s

) ≤

n+1

s+1

dx = s

n+1

s+1−r

It thus suffices to show that

s+1−r

converges, which it does (to

s−r

We omit the proof of (ii). The idea is to write

s − 1

∞

n=1

n+1

and show that

is uniformly convergent when Re(s) > 0, where

= n

−s

−

n+1

For (iii), consider the first r primes p

, ··· , p

, and

i=1

(1 − p

−s

)

−1

−s

where the sum is over the positive integers

whose prime divisors are among

, ··· , p

. Notice that 1, ··· , r are certainly in the set.



ζ(s) −

i=1

(1 − p

−s

)

−1



≤

n≥r

−s

| =

n≥r

−Re(s)

But

n≥r

−Re(s)

→

0 as

r → ∞

, proving the result, if we also show that it

converges absolutely. We omit this proof, but it follows from the fact that

p prime

−s

≤

−s

and the latter converges absolutely, plus the fact that

− a

) converges if

and only if

converges, by IA Analysis I.

This is good, but not what we want. Let’s mimic this definition for an

arbitrary number field!

Definition

(Zeta function)

Let

L ⊇ Q

be a number field, and [

] =

. We

define the zeta function of L by

(s) =

aCO

N(a)

−s

It is clear that if

and

, then this is just the Riemann zeta

function.

Theorem.

(i) ζ

(s) converges to a holomorphic function if Re(s) > 1.

(ii)

Analytic class number formula:

(

) is a meromorphic function if

(

)

1 −

and has a simple pole at s = 1 with residue

|cl

(2π)

1/2

|µ

where

is the class group,

and

are the number of real and complex

embeddings, you know what

is,

is the regulator,

is the discriminant

and µ

is the roots of unity in L.

(iii)

(s) =

pCO

prime ideal

(1 − N(p)

−s

)

−1

This is again known as the Euler product.

We will not prove this, but the proof does not actually require any new ideas.

Note that

aCO

N(a)

−s

pCO

,p prime

(1 − N(p)

−s

)

−1

holds “formally”, as in the terms match up when you expand, as an immediate

consequence of the unique factorization of ideals into a product of prime ideals.

The issue is to study convergence of

(

)

−s

, and this comes down to estimating

the number of ideals of fixed norm geometrically, and that is where all the factors

in the pole come it.

Example.

We try to compute

(

), where

(

√

). This has discriminant

D, which may be d or 4d. We first look at the prime ideals.

is a prime ideal in

, then

p | hpi

for a unique

. So let’s enumerate

the factors of η

controlled by p ∈ Z.

Now if

p | |D

, then

hpi

ramifies, and

(

) =

. So this contributes a

factor of (1 − p

−s

)

−1

Now if p remains prime, then we have N(hpi) = p

. So we get a factor of

(1 − p

−2s

)

−1

= (1 − p

−s

)

−1

(1 + p

−s

)

−1

If p splits completely, then

hpi = p

N(p

) = p,

and so we get a factor of

(1 − p

−s

)

−1

(1 − p

−s

)

−1

So we find that

(s) = ζ(s)L(χ

, s),

where we define

Definition (L-function). We define the L-function by

L(χ, s) =

p prime

(1 − χ(p)p

−s

)

−1

In our case, χ is given by

(p) =











0 p | D

−1 p remains prime

1 p splits

(





p is odd

depends on d mod 8 p = 2

Example. If L = Q(

√

−1), then we know



−4





−1



= (−1)

p−1

if p 6= 2,

and χ

(2) = 0 as 2 ramifies. We then have

L(χ

, s) =

p>2 prime

(1 − (−1)

p−1

−s

)

−1

= 1 −

−

+ ··· .

Note that

was defined for primes only, but we can extend it to a function

: Z → C by imposing

(nm) = χ

(n)χ

(m),

i.e. we define

···p

) = χ

)

···χ

)

Example. Let L = Q(

√

−1). Then

−4

(m) =

(

(−1)

m−1

m odd

0 m even.

It is an exercise to show that this is really the extension, i.e.

−4

(mn) = χ

−4

(m)χ

−4

(n).

Notice that this has the property that

−4

(m − 4) = χ

−4

(m).

We give these some special names

Definition

(Dirichlet character)

A function

Z → C

is a Dirichlet character

of modulus D if there exists a group homomorphism

w :





→ C

such that

χ(m) =

(

w(m mod D) gcd(m, D) = 1

0 otherwise

We say χ is non-trivial if ω is non-trivial.

Example. χ

−4

is a Dirichlet character of modulus 4.

Note that

χ(mn) = χ(m)χ(n)

for such Dirichlet characters, and so

L(χ, s) =

p prime

(1 − χ(p)p

−s

)

−1

n≥1

χ(n)

for such χ.

Proposition. χ

, as defined for

(

√

) is a Dirichlet character of modulus

Note that this is a very special Dirichlet character, as it only takes values

0, ±1. We call this a quadratic Dirichlet character.

Proof. We must show that

(p + Da) = χ

(p)

for all p, a.

(i) If d ≡ 3 (mod 4), then D = 4d. Then

(2) = 0,

as (2) ramifies. So χ

(even) = 0. For p > 2, we have

(p) =













(−1)

p−1

d−1

≡ 1 (mod 2), by quadratic reciprocity. So

(p + Da) =



p + Da



(−1)

p−1

(−1)

4da/2

= χ

(p).

(ii) If d ≡ 1, 2 (mod 4), see example sheet.

Lemma.

Let

be any non-trivial Dirichlet character. Then

(

χ, s

) is holomor-

phic for Re(s) > 0.

Proof. By our lemma on convergence of Dirichlet series, we have to show that

i=1

χ(i) = O(1),

i.e. it is bounded. Recall from Representation Theory that distinct irreducible

characters of a finite group G are orthogonal, i.e.

|G|

g∈G

(g)χ

(g) =

(

1 χ

= χ

0 otherwise

We apply this to

= (

Z/DZ

)

, where

is trivial and

. So orthogonality

gives

aD<i≤(a+1)D

χ(i) =

i∈(Z/DZ)

χ(i) = 0,

using that χ(i) = 0 if i is not coprime to D. So we are done.

Corollary. For quadratic characters χ

, we have

L(χ

, 1) 6= 0.

For example, if D < 0, then

L(χ

, 1) =

2π|cl

√

|D|

1/2

|µ

√

Proof. We have shown that

√

(s) = ζ

(s)L(χ

, s).

Note that

√

(

) and

(

) have simple poles at

= 1, while

(

, s

) is

holomorphic at s = 1.

Since the residue of

(

) at

= 1 is 1, while the residue of

√

= 1

is non-zero by the analytic class number formula. So

(

1) is non-zero, and

given by the analytic class number formula.

Example. If L = Q(

√

−1), then

1 −

−

+ ··· =

2π ·1

2 · 4

In general, for any field whose class number we know, we can get a series

expansion for π. And it converges incredibly slow.

Note that this corollary required two things — the analytic input for the

analytic class number formula, and quadratic reciprocity (to show that

is a

Dirichlet character).

More ambitiously, we now compute the zeta function of a cyclotomic field,

(

), where

is the primitive

th root of unity and

q ∈ N

. We need to

know the following facts about cyclotomic extensions:

Proposition.

(i) We have [L : Q] = ϕ(q), where

ϕ(q) = |(Z/qZ)

(ii) L ⊇ Q is a Galois extension, with

Gal(L/Q) = (Z/qZ)

where if

r ∈

(

Z/qZ

)

, then

acts on

(

) by sending

7→ ω

. This is

what plays the role of quadratic reciprocity for cyclotomic fields.

(iii) The ring of integers is

= Z[ω

] = Z[x]/Φ

(x),

where

(x) =

− 1

d|q,d6=q

(x)

is the qth cyclotomic polynomial.

(iv)

Let

be a prime. Then

ramifies in

if and only if

p | D

, if and only

p | q

. So while

might be messy, the prime factors of

are the prime

factors of q.

(v)

Let

be a prime and

p - q

. Then

hpi

factors as a product of

(

)

distinct

prime ideals, each of norm p

, where f is the order of p in (Z/qZ)

Proof.

(i) In the Galois theory course.

(ii) In the Galois theory course.

(iii) In the example sheet.

(iv) In the example sheet.

(v) Requires proof, but is easy Galois theory, and is omitted.

Example. Let q = 8. Then

− 1

(x + 1)(x − 1)(x

+ 1)

− 1

= x

+ 1.

So given a prime p (that is not 2), we need to understand

/p =

[x]

i.e. how Φ

factors factors mod p (Dedekind’s criterion). We have

(Z/8)

= {1, 3, 5, 7} = {1, 3, −3, −1} = Z/2 × Z/2.

Then (v) says if p = 17, then x

factors into 4 linear factors, which it does.

If p = 3, then (v) says x

factors into 2 quadratic factors. Indeed, we have

− x − 1)(x

+ x − 1) = (x

− 1)

− x

= x

+ 1.

Given all of these, let’s compute the zeta function! Recall that

Q(ω

)

(s) =

(1 − N(p)

−s

)

−1

We consider the prime ideals

dividing

hpi

, where

is a fixed integer prime

number. If p - q, then (v) says this contributes a factor of

(1 − p

−fs

)

−ϕ(q)/f

to the zeta function, where

is the order of

in (

Z/qZ

)

. We observe that this

thing factors, since

1 − t

γ∈µ

(1 − γt),

with

= {γ ∈ C : γ

= 1},

and we can put t = p

−s

We let

, ··· , ω

ϕ(q)

: (Z/qZ)

→ C

be the distinct irreducible (one-dimensional) representations of (

Z/qZ

)

, with

being the trivial representation, i.e. ω

(a) = 1 for all a ∈ (Z/qZ)

The claim is that

(

)

, ··· , ω

ϕ(q)

(

) are

th roots of 1, each repeated

(

)

times. We either say this is obvious, or we can use some representation theory.

We know

generates a cyclic subgroup

hpi

of (

Z/qZ

)

of order

, by definition

. So this is equivalent to saying the restrictions of

, ··· , ω

ϕ(q)

are the

f distinct irreducible characters of hpi

∼

Z/f, each repeated ϕ(q)/f times.

Equivalently, note that

Res

(Z/qZ)

hpi

(ω

⊕ ··· ⊕ ω

ϕ(q)

) = Res

(Z/qZ)

hpi

(regular representation of (Z/qZ)

So this claims that

Res

(Z/qZ)

hpi

(regular rep. of (Z/qZ)

) =

ϕ(q)

(regular rep. of Z/f ).

But this is true for any group, since

Res

CG = |G/H|CH,

as the character of both sides is |G|δ

So we have

(1 − p

−fs

)

−ϕ(q)/f

ϕ(q)

i=1

(1 − ω

(p)p

−s

)

−1

So we let

(n) =

(

(n mod q) gcd(n, q) = 1

0 otherwise

be the corresponding Dirichlet characters. So we have just shown that

Proposition. We have

Q(ω

)

(s) =

ϕ(q)

i=1

L(χ

, s) · (corr. factor) = ζ

(s)

ϕ(q)

i=2

L(χ

, s) · (corr. factor)

where the correction factor is a finite product coming from the primes that divide

By defining the

functions in a slightly more clever way, we can hide the

correction factors into the

(

χ, s

), and then the

function is just the product of

these L-functions.

Proof.

Our analysis covered all primes

p - q

, and the correction factor is just to

include the terms with p | q. The second part is just saying that

(s) = L(χ

, s)

p|q

(1 − p

−s

)

−1

This allows us to improve our result on the non-vanishing of

(

χ,

1) to all

Dirichlet characters, and not just quadratic Dirichlet characters.

Corollary. If χ is any non-trivial Dirichlet character, then L(χ, 1) 6= 0.

Proof.

By definition, Dirichlet characters come from representations of some

(

Z/qZ

)

, so they appear in the formula of the

function of some cyclotomic

extension.

Consider the formula

Q(ω

)

(s) = ζ

(s)

ϕ(q)

i=2

L(χ

, s) · (corr. factor)

= 1. We know that the

(

, s

) are all holomorphic at

= 1. Moreover,

both

Q(ω

)

and

have a simple pole at 0. Since the correction terms are finite,

it must be the case that all L(χ

, s) are non-zero.

Theorem

(Dirichlet, 1839)

Let

a, q ∈ N

be coprime, i.e.

gcd

(

a, q

) = 1. Then

there are infinitely many primes in the arithmetic progression

a, a + q, a + 2q, a + 3q, ··· .

Proof. As before, let

, ··· , ω

ϕ(q)

: (Z/qZ)

→ C

be the irreducible characters, and let

, ··· , χ

ϕ(q)

: Z → C

be the corresponding Dirichlet character, with ω

the trivial one.

Recall the orthogonality of columns of the character table, which says that if

gcd(p, q) = 1, then

ϕ(q)

(a)ω

(p) =

(

1 a ≡ p (mod q)

0 otherwise

Hence we know

ϕ(q)

(a)χ

(p) =

(

1 a ≡ p (mod q)

0 otherwise

even if gcd(p, q) 6= 1, as then χ

(p) = 0. So

p≡a mod q

p prime

−s

ϕ(q)

(a)

all primes p

(p)p

−s

. (‡)

We want to show this has a pole at s = 1, as in Euclid’s proof.

To do so, we show that

(

)

−s

is “essentially”

log L

(

, s

), up to some

bounded terms. We Taylor expand

log L(χ, s) = −

log(1 − χ(p)p

−s

) =

n≥1

p prime

χ(p)

n≥1

p prime

χ(p

)

What we care about is the n = 1 term. So we claim that

n≥2,p prime

χ(p

)

converges at s = 1. This follows from the geometric sum



n≥2

χ(p

)



≤

n≥2

−ns

p prime

− 1)

≤

n≥2

− 1)

< ∞.

Hence we know

log L(χ, s) =

(p)p

−s

+ bounded stuff

near s = 1.

So at s = 1, we have

(‡) ∼

ϕ(q)

(a) log L(χ

, s).

and we have to show that the right hand side has a pole at s = 1.

We know that for

i 6

= 1, i.e.

non-trivial,

(

, s

) is holomorphic and

non-zero at

= 1. So we just have to show that

log L

(

, s

) has a pole. Note

that L(χ

, s) is essentially ζ

(s). Precisely, we have

L(χ

, s) = ζ

(s)

p|q

(1 − p

−s

Moreover, we already know that ζ

(s) blows up at s = 1. We have

(s) =

s − 1

+ holomorphic function

s − 1

(1 + (s − 1)(holomorphic function)).

So we know

log L(χ

, s) ∼ log ζ

(s) ∼ log



s − 1



and this does blow up at s = 1.

So far, we have been working with abelian extensions over

, i.e. extensions

L/Q

whose Galois group is abelian. By the Kronecker–Weber theorem, every

abelian extension of

is contained within some cyclotomic extension. So in

some sense, we have considered the “most general” abelian extension.

Can we move on to consider more complicated number fields? In general,

suppose

L/Q

is Galois, and

Gal

(

L/Q

). We can still make sense of the

functions, and it turns out it always factors as

(s) =

L(ρ, s)

dim ρ

where

ranges over all the irreducible representations of

, and

(

ρ, s

) is the

Artin

-function. It takes some effort to define the Artin

-function, and we

shall not do so here. However, it is worth noting that

, s

) is just

(

), and

for ρ 6= 1, we still have a factorization of the form

L(ρ, s) =

p prime

(ρ, s).

This L

(ρ, s) is known as the Euler factor.

One can show that

(

ρ, s

) is always a meromorphic function of

, and is

conjectured to be holomorphic for all s (if ρ 6= 1, of course).

is one-dimensional, then

(

ρ, s

) is a Dirichlet series

(

χ, s

) for some

Recall that to establish this fact for quadratic fields, we had to use quadratic

reciprocity. In general given a

, finding

is a higher version of “quadratic

reciprocity”. This area is known as class field theory. If dim ρ > 1, then this is

“non-abelian class field theory”, known as Langlands programme.