III Ramsey Theory

3 Partition Regularity
In the previous chapter, the problems we studied were mostly “linear” in nature.
We had some linear system, namely that encoding the fact that a sequence is an
AP, and we wanted to know if it had a monochromatic solution. More generally,
we can define the following:
Definition (Partition regularity). We say an $m \times n$ matrix $A$ over $\mathbb{Q}$ is partition regular if whenever $\mathbb{N}$ is finitely coloured, there exists an $x \in \mathbb{N}^n$ such that $Ax = 0$ and $x$ is monochromatic, i.e. all coordinates of $x$ have the same colour. Recall that $\mathbb{N}$ does not include 0.
There is no loss of generality in assuming $A$ in fact has entries in $\mathbb{Z}$, by scaling $A$, but sometimes it is (notationally) convenient to consider cases where the entries are rational.
The question of the chapter is the following: when is a matrix partition regular? We begin by looking at some examples.
Example (Schur's theorem). Schur's theorem says whenever $\mathbb{N}$ is finitely coloured, there exists a monochromatic set of the form $\{x, y, x + y\}$. In other words the matrix $(1\ 1\ {-1})$ is partition regular, since
\[ \begin{pmatrix} 1 & 1 & -1 \end{pmatrix} \begin{pmatrix} x \\ y \\ z \end{pmatrix} = 0 \iff z = x + y. \]
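As a quick sanity check (our own brute force, not part of the notes), one can verify that every 2-colouring of $\{1, \cdots, 5\}$ already contains a monochromatic triple $x, y, x + y$, allowing $x = y$, while the colouring $\{1, 4\}$, $\{2, 3\}$ shows that $\{1, \cdots, 4\}$ does not suffice:

```python
from itertools import product

def mono_schur_triple(colour):
    """colour: dict n -> 0/1. Is there x, y, x+y (x <= y) in one colour?"""
    return any(colour[x] == colour[y] == colour[x + y]
               for x in colour for y in colour
               if x <= y and x + y in colour)

# {1,...,4} admits a triple-free 2-colouring, but {1,...,5} does not.
assert not mono_schur_triple({1: 0, 4: 0, 2: 1, 3: 1})
assert all(mono_schur_triple(dict(zip(range(1, 6), cs)))
           for cs in product([0, 1], repeat=5))
```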
Example. How about $(2\ 3\ {-5})$? This is partition regular, because we can pick any $x$, and we have
\[ \begin{pmatrix} 2 & 3 & -5 \end{pmatrix} \begin{pmatrix} x \\ x \\ x \end{pmatrix} = 0. \]
This is a trivial example.
How about van der Waerden’s theorem?
Example. Van der Waerden's theorem says there is a monochromatic 3-AP $\{x_1, x_2, x_3\}$ whenever $\mathbb{N}$ is finitely-coloured. We know $x_1, x_2, x_3$ forms a 3-AP iff
\[ x_3 - x_2 = x_2 - x_1, \]
or equivalently
\[ x_3 + x_1 = 2 x_2. \]
This implies that $(1\ {-2}\ 1)$ is partition regular. But this is actually not a very interesting thing to say, because $x_1 = x_2 = x_3$ is always a solution to this equation. So this falls into the previous "trivial" case.
On the second example sheet we saw a stronger version of van der Waerden. It says we can always find a monochromatic set of the form
\[ \{d, a, a + d, a + 2d, \cdots, a + md\}. \]
By including this variable, we can write down the property of being a progression in a non-trivial format by
\[ \begin{pmatrix} 1 & 1 & -1 & 0 \\ 2 & 1 & 0 & -1 \end{pmatrix} \begin{pmatrix} d \\ a \\ x_2 \\ x_3 \end{pmatrix} = 0. \]
This obviously generalizes to an arbitrary $m$-AP, with matrix
\[ \begin{pmatrix}
1 & 1 & -1 & 0 & 0 & \cdots & 0 \\
1 & 2 & 0 & -1 & 0 & \cdots & 0 \\
1 & 3 & 0 & 0 & -1 & \cdots & 0 \\
\vdots & \vdots & \vdots & \vdots & \vdots & \ddots & \vdots \\
1 & m & 0 & 0 & 0 & \cdots & -1
\end{pmatrix}. \]
We’ve seen three examples of partition regular matrices. Of course, not every
matrix is partition regular. The matrix
1 1
is not partition regular, for the
silly reason that two positive things cannot add up to zero.
Let’s now look at some non-trivial first matrix that is not partition regular.
Example. The matrix $(2\ {-1})$ is not partition regular, since we can put $c(x) = (-1)^n$, where $n$ is the maximum integer such that $2^n \mid x$. Then $\{x, 2x\}$ is never monochromatic.

A similar argument shows that if $\lambda \in \mathbb{Q}$ is such that $(\lambda\ {-1})$ is partition regular, then $\lambda = 1$.
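This colouring is easy to experiment with. Here is a quick sketch (the function name is ours) checking that under $c(x) = (-1)^n$, with $2^n$ the largest power of 2 dividing $x$, the pair $\{x, 2x\}$ is never monochromatic:

```python
def c(x):
    """Colour (-1)**n, where 2**n is the largest power of 2 dividing x."""
    n = 0
    while x % 2 == 0:
        x //= 2
        n += 1
    return (-1) ** n

# x and 2x differ by exactly one factor of 2, so their colours differ.
assert all(c(x) != c(2 * x) for x in range(1, 10_000))
```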
But what if we write down a random matrix, say $(2\ 3\ {-6})$? The goal of this chapter is to find a complete characterization of matrices that are partition regular.
Definition (Columns property). Let
\[ A = \begin{pmatrix} c^{(1)} & c^{(2)} & \cdots & c^{(n)} \end{pmatrix}. \]
We say $A$ has the columns property if there is a partition $[n] = B_1 \cup B_2 \cup \cdots \cup B_d$ such that
\[ \sum_{i \in B_s} c^{(i)} \in \operatorname{span}\{c^{(i)} : i \in B_1 \cup \cdots \cup B_{s-1}\} \]
for $s = 1, \cdots, d$. In particular,
\[ \sum_{i \in B_1} c^{(i)} = 0, \]
since the span of the empty set is $\{0\}$.
What does this mean? Let’s look at the matrices we’ve seen so far.
Example. $(1\ 1\ {-1})$ has the columns property by picking $B_1 = \{1, 3\}$ and $B_2 = \{2\}$.
Example. $(2\ 3\ {-5})$ has the columns property by picking $B_1 = \{1, 2, 3\}$.
Example. The matrix
\[ \begin{pmatrix}
1 & 1 & -1 & 0 & 0 & \cdots & 0 \\
1 & 2 & 0 & -1 & 0 & \cdots & 0 \\
1 & 3 & 0 & 0 & -1 & \cdots & 0 \\
\vdots & \vdots & \vdots & \vdots & \vdots & \ddots & \vdots \\
1 & m & 0 & 0 & 0 & \cdots & -1
\end{pmatrix} \]
has the columns property. Indeed, we take $B_1 = \{1, 3, \cdots, m + 2\}$, and since $\{c^{(3)}, \cdots, c^{(m+2)}\}$ spans all of $\mathbb{R}^m$, we know picking $B_2 = \{2\}$ works.
Example. $(\ell\ {-1})$ has the columns property iff $\ell = 1$. In particular, $(1\ 1)$ does not have the columns property.
Given these examples, it is natural to conjecture the following:
Theorem (Rado's theorem). A matrix $A$ is partition regular iff it has the columns property.
This is a remarkable theorem! The property of being partition regular involves a lot of quantifiers, over infinitely many things, but the columns property is entirely finite, and we can get a computer to check it for us easily.
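The claim that a computer can check the columns property can be made concrete. Below is a sketch (our own code, not from the notes) that brute-forces all ordered partitions of the columns and tests the span condition with exact rational arithmetic; it is only sensible for small $n$:

```python
from fractions import Fraction
from itertools import product

def _rank(rows):
    """Rank of a list of rational row vectors, by Gaussian elimination."""
    rows = [list(r) for r in rows]
    r = 0
    ncols = len(rows[0]) if rows else 0
    for col in range(ncols):
        piv = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if piv is None:
            continue
        rows[r], rows[piv] = rows[piv], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][col] != 0:
                f = rows[i][col] / rows[r][col]
                rows[i] = [a - f * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def _in_span(v, vecs):
    """v lies in span(vecs) iff appending v does not raise the rank."""
    if all(x == 0 for x in v):
        return True
    return bool(vecs) and _rank(vecs + [v]) == _rank(vecs)

def has_columns_property(A):
    """Brute-force the definition: some ordered partition B_1, ..., B_d of
    the columns has each block-sum in the span of the earlier blocks."""
    A = [[Fraction(x) for x in row] for row in A]
    m, n = len(A), len(A[0])
    cols = [[A[i][j] for i in range(m)] for j in range(n)]
    for f in product(range(1, n + 1), repeat=n):  # f[j] = block of column j
        d = max(f)
        if set(f) != set(range(1, d + 1)):
            continue  # block labels must be exactly 1..d, none empty
        if all(_in_span([sum(cols[j][i] for j in range(n) if f[j] == s)
                         for i in range(m)],
                        [cols[j] for j in range(n) if f[j] < s])
               for s in range(1, d + 1)):
            return True
    return False
```

For instance, `has_columns_property([[1, 1, -1]])` is `True` (take $B_1 = \{1, 3\}$, $B_2 = \{2\}$), while `has_columns_property([[1, 1]])` is `False`.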
Another remarkable property of this theorem is that neither direction is obvious! It turns out that showing partition regularity implies the columns property is (marginally) easier, because if we know something is partition regular, then we can try to cook up some interesting colourings and see what happens. The other direction is harder.
To get a feel of the result, we will first prove it in the case of a single equation. The columns property in particular becomes much simpler: it just means that there exists a non-empty subset of the $a_i$'s that sums up to zero.
Theorem. If $a_1, \cdots, a_n \in \mathbb{Q} \setminus \{0\}$, then $(a_1\ \cdots\ a_n)$ is partition regular iff there exists a non-empty $I \subseteq [n]$ such that
\[ \sum_{i \in I} a_i = 0. \]
For a fixed prime $p$, we let $d(x)$ denote the last non-zero digit of $x$ in base $p$, i.e. if
\[ x = d_n p^n + d_{n-1} p^{n-1} + \cdots + d_0, \]
then
\[ L(x) = \min\{i : d_i \neq 0\} \]
and $d(x) = d_{L(x)}$. We now prove the easy direction of the theorem.
Proposition. If $a_1, \cdots, a_n \in \mathbb{Q} \setminus \{0\}$ and $(a_1\ a_2\ \cdots\ a_n)$ is partition regular, then
\[ \sum_{i \in I} a_i = 0 \]
for some non-empty $I \subseteq [n]$.
Proof. We wlog $a_i \in \mathbb{Z}$, by scaling. Fix a suitably large prime
\[ p > \sum_{i=1}^n |a_i|, \]
and consider the $(p - 1)$-colouring of $\mathbb{N}$ where $x$ is coloured $d(x)$. We find $x_1, \cdots, x_n$ such that
\[ \sum a_i x_i = 0 \]
and $d(x_i) = d$ for some $d \in \{1, \cdots, p - 1\}$. We write out everything in base $p$, and let
\[ L = \min\{L(x_i) : 1 \leq i \leq n\}, \]
and set
\[ I = \{i : L(x_i) = L\}. \]
Then for all $i \in I$, we have
\[ x_i \equiv d p^L \pmod{p^{L+1}}, \]
while $x_i \equiv 0 \pmod{p^{L+1}}$ for $i \notin I$. On the other hand, we are given that
\[ \sum a_i x_i = 0. \]
Taking mod $p^{L+1}$ gives us
\[ \sum_{i \in I} a_i d p^L \equiv 0 \pmod{p^{L+1}}. \]
Since $d p^L$ is invertible mod $p$, we know
\[ \sum_{i \in I} a_i \equiv 0 \pmod{p}. \]
But by our choice of $p$, this implies that $\sum_{i \in I} a_i = 0$.
Here we knew ahead of time what prime $p$ to pick. If we didn't, then we could consider all primes $p$, and for each $p$ find an $I_p \subseteq [n]$ such that
\[ \sum_{i \in I_p} a_i \equiv 0 \pmod{p}. \]
Then some $I_p$ has to occur for infinitely many primes, and then we are done. Note that this merely uses the fact that if a fixed number is $0 \bmod n$ for arbitrarily large $n$, then it must be zero. This is not some deep local-global correspondence number theoretic fact.
It turns out this is essentially the only way we know for proving this theorem.
One possible variation is to use the “first non-zero digit” to do the colouring,
but this is harder.
Let’s now try and do the other direction. Before we do that, we start by
doing a warm up. Last time, we proved that if we had
1 λ
, then this is
partition regular iff λ = 1. We now prove a three element version.
Proposition. The equation $(1\ \lambda\ {-1})$ is partition regular for all $\lambda \in \mathbb{Q}$.
Proof. We may wlog $\lambda > 0$. If $\lambda = 0$, then this is trivial, and if $\lambda < 0$, then we can multiply the whole equation by $-1$.

Say
\[ \lambda = \frac{r}{s}. \]
The idea is to try to find solutions of this in long arithmetic progressions. Suppose $\mathbb{N}$ is $k$-coloured. We let
\[ \{a, a + d, \cdots, a + nd\} \]
be a monochromatic AP, for $n$ sufficiently large.

If $sd$ were the same colour as this AP, then we'd be done, as we can take
\[ x = a, \quad y = sd, \quad z = a + \frac{r}{s} \cdot sd. \]
In fact, if any of $sd, 2sd, \cdots, \frac{n}{r} sd$ have the same colour as the AP, then we'd be done by taking
\[ x = a, \quad y = isd, \quad z = a + \frac{r}{s} \cdot isd = a + ird \leq a + nd. \]
If this were not the case, then $\{sd, 2sd, \cdots, \frac{n}{r} sd\}$ is $(k - 1)$-coloured, and this is just a scaled up copy of $\mathbb{N}$. So we are done by induction on $k$.
Note that when $\lambda = 1$, the statement that $(1\ 1\ {-1})$ is partition regular may be proved by Ramsey's theorem. Can we prove this more general result by Ramsey's theorem as well? The answer is, we don't know.
It turns out this is not just a warm up, but the main ingredient of what we
are going to do.
Theorem. If $a_1, \cdots, a_n \in \mathbb{Q} \setminus \{0\}$, then $(a_1\ \cdots\ a_n)$ is partition regular iff there exists a non-empty $I \subseteq [n]$ such that
\[ \sum_{i \in I} a_i = 0. \]
Proof. One direction was done. To do the other direction, recall that we had a really easy case of, say, $(2\ 3\ {-5})$, because we could just make all the variables the same. In the general case, we can't quite do this, but we may try to solve the equation with as few distinct variables as possible. In fact, we shall find some monochromatic $x, y, z$, and then assign each of $x_1, \cdots, x_n$ to be one of $x, y, z$.
We know
\[ \sum_{i \in I} a_i = 0. \]
We now partition $I$ into two pieces: we fix $i_0 \in I$, and set
\[ x_i = \begin{cases} x & i = i_0 \\ y & i \in I \setminus \{i_0\} \\ z & i \notin I. \end{cases} \]
We would be done if whenever we finitely colour $\mathbb{N}$, we can find monochromatic $x, y, z$ such that
\[ a_{i_0} x + \sum_{i \in I \setminus \{i_0\}} a_i y + \sum_{i \notin I} a_i z = 0. \]
But, since
\[ \sum_{i \in I} a_i = 0, \]
this is equivalent to
\[ a_{i_0} x - a_{i_0} y + (\text{something}) z = 0. \]
Since $a_{i_0} \neq 0$, we can divide out by $a_{i_0}$ to obtain an equation of the form $x - y + \lambda z = 0$ with $\lambda \in \mathbb{Q}$, and we are done by the previous case.
Note that our proof of the theorem shows that if an equation is partition regular for all last-digit base $p$ colourings, then it is partition regular for all colourings. This might sound like an easier thing to prove than the full-blown Rado's theorem, but it turns out the only proof we have for this implication is via Rado's theorem.
We now prove the general case. We first do the easy direction, because it is
largely the same as the single-equation case.
Proposition. If $A$ is an $m \times n$ matrix with rational entries which is partition regular, then $A$ has the columns property.
Proof. We again wlog all entries of $A$ are integers. Let the columns of $A$ be $c^{(1)}, \cdots, c^{(n)}$. Given a prime $p$, we consider the $(p - 1)$-colouring of $\mathbb{N}$, where $x$ is coloured $d(x)$, the last non-zero digit in the base $p$ expansion. Since $A$ is partition regular, we obtain a monochromatic solution.
We then get a monochromatic $x_1, \cdots, x_n$ such that $Ax = 0$, i.e.
\[ \sum x_i c^{(i)} = 0. \]
Any such solution with colour $d$ induces a partition of $[n] = B_1 \cup B_2 \cup \cdots \cup B_r$, where
- for all $i, j \in B_s$, we have $L(x_i) = L(x_j)$; and
- for all $s < t$ and $i \in B_s$, $j \in B_t$, we have $L(x_i) < L(x_j)$.
Last time, with the benefit of hindsight, we were able to choose some large prime $p$ that made the argument work. So we use the trick we mentioned after the proof last time. Since there are finitely many possible partitions of $[n]$, we may assume that this particular partition is induced by infinitely many primes $p$. Call this set of primes $P$. We introduce some notation: we say two vectors $u, v \in \mathbb{Z}^m$ satisfy $u \equiv v \pmod{p}$ if $u_i \equiv v_i \pmod{p}$ for all $i = 1, \cdots, m$.
Now we know that
\[ \sum x_i c^{(i)} = 0. \]
Looking at the last non-zero digit in the base $p$ expansion, we have
\[ \sum_{i \in B_1} d c^{(i)} \equiv 0 \pmod{p}. \]
From this we conclude, by multiplying by $d^{-1}$, that
\[ \sum_{i \in B_1} c^{(i)} \equiv 0 \pmod{p} \]
for all $p \in P$. So we deduce that
\[ \sum_{i \in B_1} c^{(i)} = 0. \]
Similarly, for higher $s$, we find that for each base $p$ colouring, we have
\[ \sum_{i \in B_s} p^t d c^{(i)} + \sum_{i \in B_1 \cup \cdots \cup B_{s-1}} x_i c^{(i)} \equiv 0 \pmod{p^{t+1}} \]
for all $s \geq 2$, and some $t$ dependent on $s$ and $p$. Multiplying by $d^{-1}$, we find
\[ \sum_{i \in B_s} p^t c^{(i)} + \sum_{i \in B_1 \cup \cdots \cup B_{s-1}} (d^{-1} x_i) c^{(i)} \equiv 0 \pmod{p^{t+1}}. \quad (*) \]
We claim that this implies
\[ \sum_{i \in B_s} c^{(i)} \in \langle c^{(i)} : i \in B_1 \cup \cdots \cup B_{s-1} \rangle. \]
This is not exactly immediate, because the values of $x_i$ in $(*)$ may change as we change our $p$. But it is still some easy linear algebra.
Suppose this were not true. Since we are living in a Euclidean space, we have an inner product, and we can find some $v \in \mathbb{Z}^m$ such that
\[ \langle v, c^{(i)} \rangle = 0 \text{ for all } i \in B_1 \cup \cdots \cup B_{s-1}, \]
and
\[ \left\langle v, \sum_{i \in B_s} c^{(i)} \right\rangle \neq 0. \]
But then taking the inner product of $v$ with $(*)$ kills all the terms from $B_1 \cup \cdots \cup B_{s-1}$, and gives
\[ p^t \left\langle v, \sum_{i \in B_s} c^{(i)} \right\rangle \equiv 0 \pmod{p^{t+1}}. \]
Equivalently, we have
\[ \left\langle v, \sum_{i \in B_s} c^{(i)} \right\rangle \equiv 0 \pmod{p}, \]
but since this inner product is a fixed non-zero integer and $p \in P$ can be arbitrarily large, this is a contradiction. So we have shown that $A$ has the columns property with respect to the partition $[n] = B_1 \cup \cdots \cup B_r$.
We now prove the hard direction. We want an analogous gadget to the $(1\ \lambda\ {-1})$ we had for the single-equation case. The definition will seem rather mysterious, but it turns out to be what we want, and its purpose becomes clearer as we look at some examples.
Definition ($(m, p, c)$-set). For $m, p, c \in \mathbb{N}$, a set $S \subseteq \mathbb{N}$ is an $(m, p, c)$-set with generators $x_1, \cdots, x_m$ if $S$ has the form
\[ S = \left\{ \sum_{i=1}^m \lambda_i x_i : \text{for some } j, \text{ we have } \lambda_i = 0 \text{ for all } i < j,\ \lambda_j = c, \text{ and } \lambda_i \in [-p, p] \text{ for } i > j \right\}. \]
In other words, we have
\[ S = \bigcup_{j=1}^m \{ c x_j + \lambda_{j+1} x_{j+1} + \cdots + \lambda_m x_m : \lambda_i \in [-p, p] \}. \]
For each $j$, the set $\{ c x_j + \lambda_{j+1} x_{j+1} + \cdots + \lambda_m x_m : \lambda_i \in [-p, p] \}$ is called a row of $S$.
Example. What does a $(2, p, 1)$-set look like? It has the form
\[ \{x_1 - p x_2, x_1 - (p - 1) x_2, \cdots, x_1 + p x_2\} \cup \{x_2\}. \]
In other words, this is just an arithmetic progression together with its common difference.
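To get a feel for these sets, here is a short generator (our own sketch; the name `mpc_set` is made up) that enumerates an $(m, p, c)$-set row by row, directly from the definition:

```python
from itertools import product

def mpc_set(gens, p, c):
    """Union over j of the rows {c*x_j + sum of lambda_i * x_i for i > j},
    with each lambda_i ranging over the integers in [-p, p]."""
    S = set()
    for j in range(len(gens)):
        for lam in product(range(-p, p + 1), repeat=len(gens) - 1 - j):
            S.add(c * gens[j] + sum(l * x for l, x in zip(lam, gens[j + 1:])))
    return S
```

For generators $(100, 1)$ with $p = 2$, $c = 1$, this returns $\{1, 98, 99, 100, 101, 102\}$: an AP of length $2p + 1$ together with its common difference, matching the example above.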
Example. A $(2, p, 3)$-set has the form
\[ \{3 x_1 - p x_2, \cdots, 3 x_1, \cdots, 3 x_1 + p x_2\} \cup \{3 x_2\}. \]
The idea of an $(m, p, c)$-set is that we "iterate" this process many times, and so an $(m, p, c)$-set consists of "iterated APs" together with various patterns of their common differences.
Our proof strategy is to show that whenever we finitely-colour $\mathbb{N}$, we can always find a monochromatic $(m, p, c)$-set, and that given any matrix $A$ with the columns property and any $(m, p, c)$-set (for suitable $m$, $p$, $c$), there will always be a solution in there.
Proposition. Let $m, p, c \in \mathbb{N}$. Then whenever $\mathbb{N}$ is finitely coloured, there exists a monochromatic $(m, p, c)$-set.
Proof. It suffices to find an $(m, p, c)$-set all of whose rows are monochromatic, since when $\mathbb{N}$ is $k$-coloured, an $(m', p, c)$-set with $m' = mk + 1$ has $m$ monochromatic rows of the same colour by pigeonhole, and these rows contain a monochromatic $(m, p, c)$-set, by restricting to the elements where a lot of the $\lambda_i$ are zero. In this proof, whenever we say $(m, p, c)$-set, we mean one all of whose rows are monochromatic.
We will prove this by induction. We have a $k$-colouring of $[n]$, where $n$ is very very very large. This contains a $k$-colouring of
\[ B = \left\{ c, 2c, \cdots, \left\lfloor \frac{n}{c} \right\rfloor c \right\}. \]
Since $c$ is fixed, we can pick this so that $\frac{n}{c}$ is large. By van der Waerden, we find some monochromatic set
\[ A = \{c x_1 - N d, c x_1 - (N - 1) d, \cdots, c x_1 + N d\} \subseteq B, \]
with $N$ very very large. Since each element of $B$ is a multiple of $c$, we know that $c \mid d$. By induction, we may find an $(m - 1, p, c)$-set in the set $\{d, 2d, \cdots, M d\}$, where $M$ is large. We are now done by taking the $(m, p, c)$-set on generators $x_1, \cdots, x_m$, provided
\[ c x_1 + \sum_{i=2}^m \lambda_i x_i \in A \]
for all $\lambda_i \in [-p, p]$, which is easily seen to be the case, provided $N \geq (m - 1) p M$.
Note that the argument itself is quite similar to that in the $(1\ \lambda\ {-1})$ case.
Recall that Schur’s theorem said that whenever we finitely-colour
N
, we can
find a monochromatic
{x, y, x
+
y}
. More generally, for
x
1
, x
2
, ··· , x
m
N
, we
let
F S(x
1
, ··· , x
m
) =
(
X
iI
x
i
: I [n], I 6=
)
.
The existence of monochromatic $(m, 1, 1)$-sets gives us

Corollary (Finite sums theorem). For every fixed $m$, whenever we finitely-colour $\mathbb{N}$, there exist $x_1, \cdots, x_m$ such that $FS(x_1, \cdots, x_m)$ is monochromatic.
This is since an $(m, 1, 1)$-set contains more things than $FS(x_1, \cdots, x_m)$. This was discovered independently by a bunch of different people, including Folkman, Rado and Sanders.
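For small $m$, the finite sums set is easy to enumerate directly (a sketch with our own helper name):

```python
from itertools import combinations

def FS(xs):
    """All non-empty subset sums of the generators xs."""
    return {sum(sub) for r in range(1, len(xs) + 1)
                     for sub in combinations(xs, r)}
```

For instance `FS([1, 2, 4])` is $\{1, 2, \cdots, 7\}$, and an $(m, 1, 1)$-set on the same generators contains all of these elements (take each $\lambda_i \in \{0, 1\}$).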
Similarly, if we let
\[ FP(x_1, \cdots, x_m) = \left\{ \prod_{i \in I} x_i : I \subseteq [m], I \neq \emptyset \right\}, \]
then we can find these as well. For example, we can restrict attention to $\{2^n : n \in \mathbb{N}\}$, and use the finite sums theorem. This is the same idea as we had when we used van der Waerden to find geometric progressions.
But what if we want both? Can we have $FS(x_1, \cdots, x_m) \cup FP(x_1, \cdots, x_m)$ in the same colour class? The answer is actually not known! Even the case $m = 2$, i.e. finding a monochromatic set of the form $\{x, y, x + y, xy\}$, is open. Until mid 2016, we did not know if we could find $\{x + y, xy\}$ monochromatic (with $x, y > 2$).
To finish the proof of Rado's theorem, we need the following proposition:
Proposition. If $A$ is a rational matrix with the columns property, then there are some $m, p, c \in \mathbb{N}$ such that $Ax = 0$ has a solution inside any $(m, p, c)$-set, i.e. all entries of the solution lie in the $(m, p, c)$-set.
In the case of a single equation, we reduced the general problem to the case of 3 variables only. Here we are going to do something similar: we will use the columns property to reduce the solution to something much smaller.
Proof. We again write
\[ A = \begin{pmatrix} c^{(1)} & c^{(2)} & \cdots & c^{(n)} \end{pmatrix}. \]
Re-ordering the columns of $A$ if necessary, we assume that we have $[n] = B_1 \cup \cdots \cup B_r$ such that $\max(B_s) < \min(B_{s+1})$ for all $s$, and we have
\[ \sum_{i \in B_s} c^{(i)} = \sum_{i \in B_1 \cup \cdots \cup B_{s-1}} q_{is} c^{(i)} \]
for some $q_{is} \in \mathbb{Q}$. These $q_{is}$ only depend on the matrix. In other words, we have
\[ \sum_i d_{is} c^{(i)} = 0, \]
where
\[ d_{is} = \begin{cases} -q_{is} & i \in B_1 \cup \cdots \cup B_{s-1} \\ 1 & i \in B_s \\ 0 & \text{otherwise.} \end{cases} \]
For a fixed $s$, if we scan these coefficients starting from $i = n$ and keep decreasing $i$, then the first non-zero coefficient we see is 1, which is good, because it looks like what we see in an $(m, p, c)$-set.
Now we try to write down a general solution with $r$ many free variables. Given $x_1, \cdots, x_r \in \mathbb{N}$, we look at
\[ y_i = \sum_{s=1}^r d_{is} x_s. \]
It is easy to check that $Ay = 0$, since
\[ \sum_i y_i c^{(i)} = \sum_i \sum_s d_{is} x_s c^{(i)} = \sum_s x_s \sum_i d_{is} c^{(i)} = 0. \]
Now take $m = r$, pick $c$ large enough such that $c d_{is} \in \mathbb{Z}$ for all $i, s$, and finally set $p = \max\{|c d_{is}| : i, s\}$.
Thus, if we consider the $(m, p, c)$-set on generators $(x_m, \cdots, x_1)$ and $y_i$ as defined above, then we have $Ay = 0$ and hence $A(cy) = 0$. Since $cy$ is integral and lies in the $(m, p, c)$-set, we are done!
We have thus proved Rado’s theorem.
Theorem (Rado's theorem). A matrix $A$ is partition regular iff it has the columns property.
So we have a complete characterization of all partition regular matrices.
Note that Rado’s theorem reduces Schur’s theorem, van der Waerden’s
theorem, finite sums theorem etc. to just checking if certain matrices have the
columns property, which are rather straightforward computations.
More interestingly, we can prove some less obvious “theoretical” results.
Corollary (Consistency theorem). If $A$, $B$ are partition regular in independent variables, then
\[ \begin{pmatrix} A & 0 \\ 0 & B \end{pmatrix} \]
is partition regular. In other words, we can solve $Ax = 0$ and $By = 0$ simultaneously in the same colour class.

Proof. The matrix
\[ \begin{pmatrix} A & 0 \\ 0 & B \end{pmatrix} \]
has the columns property if $A$ and $B$ do.
In fact, much much more is true.
Corollary. Whenever $\mathbb{N}$ is finitely-coloured, one colour class contains solutions to all partition regular systems!

Proof. Suppose not. Then we have $\mathbb{N} = D_1 \cup \cdots \cup D_k$ such that for each $D_i$, there is some partition regular matrix $A_i$ such that we cannot solve $A_i x = 0$ inside $D_i$. But this contradicts the fact that $\operatorname{diag}(A_1, A_2, \cdots, A_k)$ is partition regular (by applying the consistency theorem many times).
Where did this whole idea of the $(m, p, c)$-sets come from? The original proof by Rado didn't use $(m, p, c)$-sets, and this idea appeared only a while later, when we tried to prove a more general result.
In general, we call a set $D \subseteq \mathbb{N}$ partition regular if we can solve any partition regular system in $D$. Then we know that $\mathbb{N}$ and $2\mathbb{N}$ are partition regular sets, but $2\mathbb{N} + 1$ is not (because we can't solve $x + y = z$, say). Then what Rado's theorem says is that whenever we finitely partition $\mathbb{N}$, one piece of $\mathbb{N}$ is partition regular.
In the 1930's, Rado conjectured that there is nothing special about $\mathbb{N}$ to begin with: whenever we break up a partition regular set, one of the pieces is partition regular. This conjecture was proved by Deuber in the 1970s, who introduced the idea of the $(m, p, c)$-set.
It is not hard to check that $D$ is partition regular iff $D$ contains an $(m, p, c)$-set for every $m, p, c$. Then Deuber's proof involves showing that for all $m, p, c, k \in \mathbb{N}$, there exist $n, q, d \in \mathbb{N}$ such that any $k$-colouring of an $(n, q, d)$-set contains a monochromatic $(m, p, c)$-set. The proof is quite similar to how we found $(m, p, c)$-sets in the naturals, but instead of using van der Waerden's theorem, we need the Hales–Jewett theorem.
We end by mentioning an open problem in this area. Suppose $A$ is an $m \times n$ matrix that is not partition regular. That is, there is some $k$-colouring of $\mathbb{N}$ with no solution to $Ax = 0$ in a colour class. Can we find some bound $f(m, n)$ such that every such $A$ has a "bad" colouring with $k < f(m, n)$? This is an open problem, first conjectured by Rado, and we think the answer is yes.
What do we actually know about this problem? The answer is trivially yes for $f(1, 2)$, as there aren't many matrices of size $1 \times 2$, up to rescaling. It is a non-trivial theorem that $f(1, 3)$ exists, and in fact $f(1, 3) \leq 24$. We don't know anything more than that.