2Ramsey theory on the integers
III Ramsey Theory
2 Ramsey theory on the integers
So far, we’ve been talking about what happens when we finitely colour graphs.
What if we k-colour the integers N? What can we say about it?
It is a trivial statement that this colouring has a monochromatic subset, by the
pigeonhole principle. Interesting questions arise when we try to take the additive
structure of
N
into account. So we could ask, can we find a monochromatic
“copy” of N.
One way to make this question concrete is to ask if there is an infinite
monochromatic arithmetic progression.
The answer is easily a “no”! There are only countably many progressions, so
for each arithmetic progression, we pick two things in the progression and colour
them differently.
We can also construct this more concretely. We can colour the first number
red, the next two blue, the next three red etc. then it is easy to see that it
doesn’t have an infinite arithmetic progression.
···
But this is somewhat silly, because there is clearly a significant amount of
structure in the sequence there. It turns out the following is true:
Theorem
(van der Waerden theorem)
.
Let
m, k ∈ N
. Then there is some
N
=
W
(
m, k
) such that whenever [
N
] is
k
-coloured, then there is a monochromatic
arithmetic progression of length n.
The idea is to do induction on
m
. We will be using colourings with much
greater than k colours to deduce the existence of W (m, k).
We can try a toy example first. Let’s try to show that
W
(3
,
2) exists. Suppose
we have three natural numbers:
By the pigeonhole principle, there must be two things that are the same colour,
say
If this is the case, then if we don’t want to have an arithmetic progression of
length 3, then the fifth number must be blue
We now cut the universe into blocks into 5 things. Again by the pigeonhole
principle, there must be two blocks that look the same. Say it’s this block again.
Now we have two sequences, and the point at the end belongs to both of the
two sequences. And no matter what colour it is, we are done.
For
k
= 3, we can still find such a block, but now that third point could be a
third colour, say, green. This will not stop us. We can now look at these big
blocks, and we know that eventually, these big blocks must repeat themselves.
Here we did the case
m
= 2, and we used the pigeonhole principle. When we do
m > 2, we will use van der Waerden theorem for smaller m inductively.
We now come up with names to describe the scenario we had above.
Definition
(Focused progression)
.
We say a collection of arithmetic progressions
A
1
, A
2
, ··· , A
r
of length m with
A
i
= {a
i
, a
i
+ d
i
, ··· , a
i
+ (m − 1)d
i
}
are focused at f if a
i
+ md
i
= f for all 1 ≤ i ≤ r.
Example. {1, 4} and {5, 6} are focused at 7.
Definition
(Colour focused progression)
.
If
A
1
, ··· , A
r
are focused at
f
, and
each
A
i
is monochromatic and no two are the same colour, then we say they are
colour focused at f.
We can now write the proof.
Proof.
We induct on
m
. The result is clearly trivial when
m
= 1, and follows
easily from the pigeonhole principle when m = 2.
Suppose
m >
1, and assume inductively that
W
(
m −
1
, k
0
) exists for any
k
0
∈ N.
Here is the claim we are trying to established:
Claim.
For each
r ≤ k
, there is a natural number
n
such that whenever we
k-colour [n], then either
(i) There exists a monochromatic AP of length m; or
(ii) There are r colour-focused AP’s of length m −1.
It is clear that this claim implies the theorem, as we can pick
r
=
k
. Then if
there isn’t a monochromatic AP of length
m
, then we look at the colour of the
common focus, and it must be one of the colours of those AP’s.
To prove the claim, we induct on
r
. When
n
= 1, we may take
W
(
m −
1
, k
).
Now suppose
r >
1, and some
n
0
works for
r −
1. With the benefit of hindsight,
we shall show that
n = W (m −1, k
2n
0
)2n
0
works for r.
We consider any
k
-colouring of [
n
], and suppose it has no monochromatic
AP of length m. We need to find r colour-focused progressions of length n − 1.
We view this
k
-colouring of [
n
] as a
k
2n
0
colouring of blocks of length 2
n
0
, of
which there are W (m − 1, k
2n
0
).
Then by definition of W (m −1, k
2n
0
), we can find blocks
B
s
, B
s+t
, ··· , B
s+(m−2)t
which are coloured identically. By the inductive hypothesis, we know each
B
s
contains
r −
1 colour-focused AP’s of length
m −
1, say
A
1
, .., A
r−1
with first
terms
a
1
, ··· , a
r
and common difference
d
1
, ··· , d
r−1
, and also their focus
f
,
because the length of B
s
is 2n
0
, not just n
0
.
Since we assumed there is no monochromatic progression of length
n
, we can
assume f has a different colour than the A
i
.
Now consider
A
0
1
, A
0
2
, ··· , A
0
r−1
, where
A
0
i
has first term
a
i
, common difference
d
i
+ 2
n
0
t
, and length
m −
1. This difference sends us to the next block, and then
the next term in the AP. We also pick
A
0
r
to consist of the all the focus of the
blocks B
i
, namely
A
0
r
= {f, f + 2n
0
t, ··· , f + 2n
0
t(m − 2)}
These progressions are monochromatic with distinct colours, and focused at
f + (2n
0
t)(m − 1). So we are done.
This argument, where one looks at the induced colourings of blocks, is called
a product argument.
The bounds we obtain from this argument are, as one would expect, terrible.
We have
W (3, k) ≤ k
.
.
.
k
4k
,
where the tower of k’s has length k −1.
Now we can generalize in a different way. What can we say about monochro-
matic structures a
k
-colouring of
N
d
? What is the right generalization of van
der Waerden theorem?
To figure out the answer, we need to find the right generalization of arithmetic
progressions.
Definition
(Homothetic copy)
.
Given a finite
S ⊆ N
d
, a homothetic copy of
S
is a set of the form
`S + M,
where `, M ∈ N
d
and ` 6= 0.
Example.
An arithmetic progression of length
m
is just a homothetic copy of
[m] = {1, 2, ··· , m}.
Thus, the theorem we want to say is the following:
Theorem
(Gallai)
.
Whenever
N
d
is
k
-coloured, there exists a monochromatic
(homothetic) copy of S for each finite S ⊆ N
d
.
In order to prove this, we prove the beautiful Hales–Jewett theorem, which
is in some sense the abstract core of the argument we used for van der Waerden
theorem.
We need a bit of set up. As usual, we write
[m]
n
= {(x
1
, ··· , x
n
) : x
i
∈ [m]}.
Here is the important definition:
Definition
(Combinatorial line)
.
A combinatorial line
L ⊆
[
m
]
n
is a set of the
form
{(x
1
, ··· , x
n
n) : x
i
= x
j
for all i, j ∈ I; x
i
= a
i
for all i 6∈ I}
for some fixed non-empty set of coordinates I ⊆ [n] and a
i
∈ [m].
I is called the set of active coordinates.
Example. Here is a line in [3]
2
This line is given by I = {1}, a
2
= 1.
The following shows all the lines we have:
It is easy to see that any line has exactly [
m
] elements. We write
L
−
and
L
+
for the first and last point of the line, i.e. the points where the active coordinates
are all 1 and
m
respectively. It is clear that any line
L
is uniquely determined
by L
−
and its active coordinates.
Example. In [3]
3
, we have the line
L = {(1, 2, 1), (2, 2, 2), (3, 2, 3)}.
This is a line with
I
=
{
1
,
3
}
and
a
2
= 2. The first and last points are (1
,
2
,
1)
and (3, 2, 3).
Then we have the following theorem:
Theorem
(Hales–Jewett theorem)
.
For all
m, k ∈ N
, there exists
N
=
HJ
(
m, k
)
such that whenever [m]
N
is k-coloured, there exists a monochromatic line.
Note that this theorem implies van der Waerden’s theorem easily. The idea
is to embed [
m
]
N
into
N
linearly, so that a monochromatic line in [
m
]
N
gives an
arithmetic progression of length
m
. Explicitly, given a
k
-colouring
c
:
N →
[
k
],
we define c
0
: [m]
N
→ [k] by
c
0
(x
1
, x
2
, ··· , x
N
) = c(x
1
+ x
2
+ ··· + x
N
).
Now a monochromatic line gives us an arithmetic progression of length m. For
example, if the line is
L = {(1, 2, 1), (2, 2, 2), (3, 2, 3)},
then we get the monochromatic progression 4
,
6
,
8 of length 3. In general, the
monochromatic AP defined has d = |I|.
The proof is essentially what we did for van der Waerden theorem. We are
going to build a lot of almost-lines that point at the same vertex, and then no
matter what colour the last vertex is, we are happy.
Definition
(Focused lines)
.
We say lines
L
1
, ··· , L
r
are focused at
f ∈
[
m
]
N
if
L
+
i
= f for all i = 1, ··· , r.
Definition
(Colour focused lines)
.
If
L
1
, ··· , L
r
are focused lines, and
L
i
\{L
+
i
}
is monochromatic for each
i
= 1
, ··· , r
and all these colours are distinct, then
we say L
1
, ··· , L
r
are colour focused at f.
Proof.
We proceed by induction on
m
. This is clearly trivial for
m
= 1, as a line
only has a single point.
Now suppose
m >
1, and that
HJ
(
m −
1
, k
) exists for all
k ∈ N
. As before,
we will prove the following claim:
Claim.
For each 1
≤ r ≤ k
, there is an
n ∈ N
such that in any
k
-colouring of
[m]
n
, either
(i) there exists a monochromatic line; or
(ii) there exists r colour-focused lines.
Again, the result is immediate from the claim, as we just use it for
r
=
k
and
look at the colour of the focus.
The prove this claim, we induct on
r
. If
r
= 1, then picking
n
=
HJ
(
m−
1
, k
)
works, as a single colour-focused line of length
n
is just a monochromatic line of
length n −1, and [m −1]
n
⊆ [m]
n
naturally.
Now suppose
r >
1 and
n
works for
r −
1. Then we will show that
n
+
n
0
works, where
n
0
= HJ(m − 1, k
m
n
).
Consider a colouring
c
: [
m
]
n+n
0
→
[
k
], and we assume this has no monochromatic
lines.
We think of [
m
]
n+n
0
as [
m
]
n
×
[
m
]
n
0
. So for each point in [
m
]
n
0
, we have
a whole cube [
m
]
n
. Consider a
k
m
n
colouring of [
m
]
n
0
as follows — given any
x ∈
[
m
]
n
0
, we look at the subset of [
m
]
n+n
0
containing the points with last
n
0
coordinates
x
. Then we define the new colouring of [
m
]
n
0
to be the restriction
of c to this [m]
n
, and there are m
n
possibilities.
Now there exists a line
L
such that
L \ L
+
is monochromatic for the new
colouring. This means for all a ∈ [m]
n
and for all b, b
0
∈ L \ L
+
, we have
c(a, b) = c(a, b
0
).
Let
c
00
(
a
) denote this common colour for all
a ∈
[
m
]
n
. this is a
k
-colouring of
[
m
]
n
with no monochromatic line (because
c
doesn’t have any). So by definition
of
n
, there exists lines
L
1
, L
2
, ··· , L
r−1
in [
m
]
n
which are colour-focused at some
f ∈ [m]
n
for c
00
.
In the proof of van der Waerden, we had a progression within each block,
and also how we just between blocks. Here we have the same thing. We have
the lines in [m]
n
, and also the “external” line L.
L
Consider the line L
0
1
, L
0
2
, ··· , L
0
r−1
in [m]
n+n
0
, where
(L
0
i
)
−
= (L
−
i
, L
−
),
and the active coordinate set is
I
i
∪ I
, where
I
is the active coordinate set of
L
.
Also consider the line F with F
−
= (f, L
−
) and active coordinate set I.
Then we see that
L
0
1
, ··· , L
0
r−1
, F
are
r
colour-focused lines with focus
(f, L
+
).
We can now prove our generalized van der Waerden.
Theorem
(Gallai)
.
Whenever
N
d
is
k
-coloured, there exists a monochromatic
(homothetic) copy of S for each finite S ⊆ N
d
.
Proof.
Let
S
=
{S
(1)
, S
(2)
, ··· , S
(
m
)
} ⊆ N
d
. Given a
k
-colouring
N
d
→
[
k
], we
induce a k-colouring c : [m]
N
→ [k] by
c
0
(x
1
, ··· , x
N
) = c(S(x
1
) + S(x
2
) + ··· + S(x
N
)).
By Hales-Jewett, for sufficiently large
N
, there exists a monochromatic line
L
for
c
0
, which gives us a monochromatic homothetic copy of
S
in
N
d
. For example, if
the line is (1 2 1), (2 2 2) and (3 2 3), then we know
c(S(1) + S(2) + S(1)) = c(S(2) + S(2) + S(2)) = c(S(3) + S(2) + S(3)).
So we have the monochromatic homothetic copy
λS
+
µ
, where
λ
= 2 (the
number of active coordinates), and µ = S(2).
Largeness in Ramsey theory*
In van der Waerden, we proved that for each
k, m
, there is some
n
such that
whenever we
k
-colour [
n
], then there is a monochromatic AP of length
m
. How
much of this is relies on the underlying set being [
n
]? Or is it just that if we
finitely colour [
n
], then one of the colours must contain a lot of numbers, and if
we have a lot of numbers, then we are guaranteed to have a monochromatic AP?
Of course, by “contains a lot of numbers”, we do not mean the actual number
of numbers we have. It is certainly not true that for some
n
, whenever we
k
-colour any set of
n
integers, we must have a monochromatic
m
-AP, because
an arbitrary set of
n
integers need not even contain an
m
-AP at all, let alone a
monochromatic one. Thus, what we really mean is density.
Definition (Density). For A ⊆ N, we let the density of A as
¯
d(A) = lim sup
(b−a)→∞
A ∩ [a, b]
|b − a|
.
Clearly, in any finite
k
-colouring of
N
, there exists a colour class with positive
density. Thus, we want to know if merely a positive density implies the existence
of progressions. Remarkably, the answer is yes!
Theorem
(Szemer´edi theorem)
.
Let
δ >
0 and
m ∈ N
. Then there exists some
N
=
S
(
m, δ
)
∈ N
such that any subset
A ⊆
[
N
] with
|A| ≥ δN
contains an
m-term arithmetic progression.
The proof of this theorem is usually the subject of an entire lecture course,
so we are not going to attempt to prove this. Even the particular case
m
= 3 is
very hard.
This theorem has a lot of very nice Ramsey consequences. In the case of
graph colourings, we asked ourselves what happens if we colour a graph with
infinitely many colours. Suppose we now have a colouring
c
:
N → N
. Can we
still find a monochromatic progression of length
m
? Of course not, because
c
can be injective.
Theorem. For any c : N → N, there exists a m-AP on which either
(i) c is constant; or
(ii) c is injective.
It is also possible to prove this directly, but it is easy with Szemer´edi.
Proof. We set
δ =
1
m
2
(m + 1)
2
.
We let N = S(m, δ). We write
[N] = A
1
∪ A
2
∪ ··· ∪ A
k
,
where the
A
i
’s are the colour-classes of
c|
[N]
. By choice of
N
, we are done if
|A
i
| ≥ δN for some 1 ≤ i ≤ k. So suppose not.
Let’s try to count the number of arithmetic progressions in [
N
]. There are
more than
N
2
/
(
m
+ 1)
2
of these, as we can take any
a, d ∈
[
N/m
+ 1]. We want
to show that there is an AP that hits each A
i
at most once.
So, fixing an
i
, how many AP’s are there that hit
A
i
at least twice? We need
to pick two terms in
A
i
, and also decide which two terms in the progression they
are in, e.g. they can be the first and second term, or the 5th and 17th term. So
there are at most m
2
|A
i
|
2
terms.
So the number of AP’s on which c is injective is greater than
N
2
(m + 1)
2
− k|A
i
|
2
m
2
≥
N
2
(m + 1)
2
−
1
δ
(δN)
2
m
2
=
N
2
(m + 1)
2
− δN
2
m
2
≥ 0.
So we are done. Here the first inequality follows from the fact that
P
|A
i
|
= [
N
]
and each |A
i
| < δN.
Our next theorem will mix arithmetic and graph-theoretic properties. Con-
sider a colouring
c
:
N
(2)
→
[2]. As before, we say a set
X
is monochromatic if
c|
X
(2)
is constant. Now we want to try to find a monochromatic set with some
arithmetic properties.
The first obvious question to ask is — can we find a monochromatic 3-term
arithmetic progression? The answer is no. For example, we can define
c
(
ij
) to be
the parity of largest power of 2 dividing
j −i
, and then there is no monochromatic
3-term arithmetic progression.
What if we make some concessions — can we find a blue 10-AP, or if not,
find an infinite red set? The answer is again no. This construction is slightly
more involved. To construct a counterexample, we can make progressively larger
red cliques, and take all other edges blue. If we double the size of the red cliques
every time, it is not hard to check that there is no blue 4-AP, and no infinite
red set.
···
What if we further relax our condition, and only require an arbitrarily large red
set?
Theorem. For any c : N
(2)
→ {red, blue}, either
(i) There exists a blue m-AP for each m ∈ N; or
(ii) There exists arbitrarily large red sets.
Proof.
Suppose we can’t find a blue
m
-AP for some fixed
m
. We induct on
r
,
and try to find a red set of size r.
Say
A ⊆ N
is a progression of length
N
. Since
A
has no blue
m
-term
progression, so it must contain many red edges. Indeed, each
m
-AP in
A
must
contain a red edge. Also each edge specifies two points, and this can be extended
to an
m
-term progression in at most
m
2
ways. Since there are
N
2
/
(
m
+ 1)
2
. So
there are at least
N
2
m
2
(m + 1)
2
red edges. With the benefit of hindsight, we set
δ =
1
2m
2
(m + 1)
2
.
The idea is that since we have lots of red edges, we can try to find a point with
a lot of red edges connected to it, and we hope to find a progression in it.
We say X, Y ⊆ N form an (r, k)-structure if
(i) They are disjoint
(ii) X is red;
(iii) Y is an arithmetic progression;
(iv) All X-Y edges are red;
(v) |X| = r and |Y | = k.
···
We show by induction that there is an (r, k)-structure for each r and k.
A (1
, k
) structure is just a vertex connected by red edges to a
k
-point structure.
If we take
N
=
S
(
δ, k
), we know among the first
N
natural numbers, there are
at least
N
2
/
(
m
2
(
m
+ 1)
2
) red edges inside [
N
]. So in particular, some
v ∈
[
N
]
has at least
δN
red neighbours in [
N
], and so we know
v
is connected to some
k-AP by red edges. That’s the base case done.
Now suppose we can find an (r − 1, k
0
)-structure for all k
0
∈ N. We set
k
0
= S
1
2m
2
(m + 1)
2
, k
.
We let (
X, Y
) be an (
r −
1
, k
0
)-structure. As before, we can find
v ∈ Y
such
that
v
has
δ|Y |
red neighbours in
Y
. Then we can find a progression
Y
0
of
length
k
in the red neighbourhood of
v
, and we are done, as (
X ∪ {v}, Y
0
) is an
(
r, k
)-structure, and an “arithmetic progression” within an arithmetic progression
is still an arithmetic progression.
Before we end this chapter, we make a quick remark. Everything we looked
for in this chapter involved the additive structure of the naturals. What about
the multiplicative structure? For example, given a finite colouring of
N
, can we
find a monochromatic geometric progression? The answer is trivially yes. We
can look at
{
2
x
:
x ∈ N}
, and then multiplication inside this set just looks like
addition in the naturals.
But what if we want to mix additive and multiplicative structures? For
example, can we always find a monochromatic set of the form
{x
+
y, xy}
? Of
course, there is the trivial answer
x
=
y
= 2, but is there any other? This
question was answered positively in 2016! We will return to this at the end of
the course.