III Extremal Graph Theory - Szemerédi’s regularity lemma

4 Szemerédi’s regularity lemma

Szemerédi’s regularity lemma tells us that, given a very large graph, we can always equipartition it into pieces that are “uniform” in some sense. The lemma is arguably “trivial”, but it has many interesting consequences. To state the lemma, we need to know what we mean by “uniform”.

Definition (Density). Let $U, W$ be disjoint subsets of the vertex set of some graph. The number of edges between $U$ and $W$ is denoted by $e(U, W)$, and the density is
\[ d(U, W) = \frac{e(U, W)}{|U||W|}. \]

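Definition ($\varepsilon$-uniform pair). Let $0 < \varepsilon < 1$. We say a pair $(U, W)$ is $\varepsilon$-uniform if
\[ |d(U', W') - d(U, W)| < \varepsilon \]
whenever $U' \subseteq U$, $W' \subseteq W$, and $|U'| \geq \varepsilon|U|$, $|W'| \geq \varepsilon|W|$.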

Note that it is necessary to impose some conditions on how small $U'$ and $W'$ can be. For example, if $|U'| = |W'| = 1$, then $d(U', W')$ is either $0$ or $1$. So we cannot have a sensible definition if we want to require the inequality to hold for arbitrary $U', W'$.

But we might be worried that it is unnatural to use the same $\varepsilon$ for two different purposes. This is not something one should worry about. Szemerédi’s regularity lemma is a fairly robust result, and everything goes through if we use different $\varepsilon$’s for the two different purposes. However, it is annoying to have many different $\varepsilon$’s floating around.
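To make the two definitions concrete, here is a minimal brute-force sketch in Python. The function names (`density`, `large_subsets`, `is_eps_uniform`) and the representation of a graph as a set of `frozenset` edges are choices made for this illustration only, not part of the notes. The check enumerates every sufficiently large pair of subsets, so it is only usable for very small vertex sets.

```python
from itertools import combinations
from math import ceil


def density(edges, U, W):
    """d(U, W) = e(U, W) / (|U||W|) for disjoint vertex sets U and W."""
    e = sum(1 for u in U for w in W if frozenset((u, w)) in edges)
    return e / (len(U) * len(W))


def large_subsets(S, min_size):
    """All subsets of S with at least min_size (and at least 1) elements."""
    S = sorted(S)
    for size in range(max(min_size, 1), len(S) + 1):
        yield from combinations(S, size)


def is_eps_uniform(edges, U, W, eps):
    """Brute-force check of the definition of an eps-uniform pair."""
    d = density(edges, U, W)
    for U1 in large_subsets(U, ceil(eps * len(U))):
        for W1 in large_subsets(W, ceil(eps * len(W))):
            if abs(density(edges, U1, W1) - d) >= eps:
                return False
    return True


U, W = {0, 1, 2}, {3, 4, 5}
complete = {frozenset((u, w)) for u in U for w in W}
matching = {frozenset((0, 3)), frozenset((1, 4)), frozenset((2, 5))}
print(is_eps_uniform(complete, U, W, 0.1))  # every sub-pair has density 1
print(is_eps_uniform(matching, U, W, 0.5))  # only large subsets are tested
print(is_eps_uniform(matching, U, W, 0.1))  # single vertices are now allowed
```

The perfect matching illustrates the remark about small subsets: once single vertices are admitted as $U'$ and $W'$, the density of a sub-pair can be $0$ or $1$, and the pair stops being uniform.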

Before we state and prove Szemerédi’s regularity lemma, let’s first try to understand why uniformity is good. The following is an elementary observation.

Lemma. Let $(U, W)$ be an $\varepsilon$-uniform pair with $d(U, W) = d$. Then
\[ |\{u \in U : |\Gamma(u) \cap W| > (d - \varepsilon)|W|\}| \geq (1 - \varepsilon)|U|, \]
\[ |\{u \in U : |\Gamma(u) \cap W| < (d + \varepsilon)|W|\}| \geq (1 - \varepsilon)|U|, \]
where $\Gamma(u)$ is the set of neighbours of $u$.

Proof. Let
\[ X = \{u \in U : |\Gamma(u) \cap W| \leq (d - \varepsilon)|W|\}. \]
Then $e(X, W) \leq (d - \varepsilon)|X||W|$. So
\[ d(X, W) \leq d - \varepsilon = d(U, W) - \varepsilon. \]
So the pair $(X, W)$ fails the uniformity condition. Since $W$ is definitely not small, we must have $|X| < \varepsilon|U|$.

The other case is similar, or observe that the complementary bipartite graph between $U$ and $W$ has density $1 - d$ and is $\varepsilon$-uniform.

What is good about $\varepsilon$-uniform pairs is that if we have enough of them, then we can construct essentially any subgraph we like. Later, Szemerédi’s regularity lemma says any graph large enough has $\varepsilon$-uniform equipartitions, and together, they can give us some pretty neat results.

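Lemma (Graph building lemma). Let $G$ be a graph containing disjoint vertex subsets $V_1, \ldots, V_r$ with $|V_i| = u$, such that $(V_i, V_j)$ is $\varepsilon$-uniform and $d(V_i, V_j) \geq \lambda$ for all $1 \leq i < j \leq r$. Let $H$ be a graph with maximum degree $\Delta(H) \leq \Delta$. Suppose $H$ has an $r$-colouring in which no colour is used more than $s$ times, i.e. $H \subseteq K_r(s)$, and suppose $(\Delta + 1)\varepsilon \leq \lambda^\Delta$ and $s \leq \lfloor \varepsilon u \rfloor$. Then $H \subseteq G$.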

To prove this, we just do what we did in the previous lemma, and find lots

of vertices connected to lots of other vertices, and then we are done.

Proof. We wlog assume $V(H) = \{1, \ldots, k\}$, and let $c: V(H) \to \{1, \ldots, r\}$ be a colouring of $V(H)$ using no colour more than $s$ times. We want to pick vertices $x_1, \ldots, x_k$ in $G$ so that $x_i x_j \in E(G)$ if $ij \in E(H)$.

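We claim that, for $0 \leq \ell \leq k$, we can choose distinct vertices $x_1, \ldots, x_\ell$ so that $x_j \in V_{c(j)}$, and for $\ell < j \leq k$, a set $X_j^\ell$ of candidates for $x_j$ such that

(i) $X_j^\ell \subseteq V_{c(j)}$;

(ii) $x_i y_j \in E(G)$ for all $y_j \in X_j^\ell$ and $i \leq \ell$ such that $ij \in E(H)$;

(iii) $|X_j^\ell| \geq (\lambda - \varepsilon)^{|N(j, \ell)|}|V_{c(j)}|$, where
\[ N(j, \ell) = \{x_i : 1 \leq i \leq \ell \text{ and } ij \in E(H)\}. \]

The claim holds for $\ell = 0$ by taking $X_j^0 = V_{c(j)}$.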

By induction, suppose it holds for $\ell$. To pick $x_{\ell+1}$, of course we should pick it from our candidate set $X_{\ell+1}^\ell$. Then the first condition is automatically satisfied. Define the set
\[ T = \{j > \ell + 1 : (\ell + 1)j \in E(H)\}. \]
Then each $t \in T$ presents an obstruction to (ii) and (iii) being satisfied. To satisfy (ii), for each $t \in T$, we should set
\[ X_t^{\ell+1} = X_t^\ell \cap \Gamma(x_{\ell+1}). \]
Thus, to satisfy (iii), we want to exclude those $x_{\ell+1}$ that make this set too small.

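We define
\[ Y_t = \{y \in X_{\ell+1}^\ell : |\Gamma(y) \cap X_t^\ell| \leq (\lambda - \varepsilon)|X_t^\ell|\}. \]
So we want to find something in $X_{\ell+1}^\ell \setminus \bigcup_{t \in T} Y_t$. We also cannot choose one of the $x_i$ already used, and at most $s - 1$ of these lie in $V_{c(\ell+1)}$. So our goal is to show that
\[ \left|X_{\ell+1}^\ell - \bigcup_{t \in T} Y_t\right| > s - 1. \]
This will follow simply from counting the sizes of $X_{\ell+1}^\ell$ and $Y_t$. We already have a bound on the size of $X_{\ell+1}^\ell$, and we shall show that if $Y_t$ is too large, then it violates $\varepsilon$-uniformity.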

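Indeed, by definition of $Y_t$, we have
\[ d(Y_t, X_t^\ell) \leq \lambda - \varepsilon \leq d(V_{c(t)}, V_{c(\ell+1)}) - \varepsilon. \]
So either $|X_t^\ell| < \varepsilon|V_{c(t)}|$ or $|Y_t| < \varepsilon|V_{c(\ell+1)}|$. But the first cannot occur.

Indeed, write $m = |N(\ell + 1, \ell)|$. Then $m + |T| \leq \Delta$. In particular, since the $|T| = 0$ case is trivial, we may assume $m \leq \Delta - 1$. Similarly, each $t \in T$ has the neighbour $\ell + 1$, which has not yet been placed, so $|N(t, \ell)| \leq \Delta - 1$, and we can easily bound
\[ |X_t^\ell| \geq (\lambda - \varepsilon)^{\Delta - 1}|V_{c(t)}| \geq (\lambda^{\Delta - 1} - (\Delta - 1)\varepsilon)|V_{c(t)}| > \varepsilon|V_{c(t)}|. \]
Thus, by $\varepsilon$-uniformity, it must be the case that
\[ |Y_t| \leq \varepsilon|V_{c(\ell+1)}|. \]
Therefore, we can bound
\[ \left|X_{\ell+1}^\ell - \bigcup_{t \in T} Y_t\right| \geq (\lambda - \varepsilon)^m|V_{c(\ell+1)}| - (\Delta - m)\varepsilon|V_{c(\ell+1)}| \geq (\lambda^m - m\varepsilon - (\Delta - m)\varepsilon)u \geq \varepsilon u > s - 1. \]
So we are done.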

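Corollary. Let $H$ be a graph with vertex set $\{v_1, \ldots, v_r\}$. Let $0 < \lambda, \varepsilon < 1$ satisfy $r\varepsilon \leq \lambda^{r-1}$. Let $G$ be a graph with disjoint vertex subsets $V_1, \ldots, V_r$, each of size $u \geq 1$. Suppose each pair $(V_i, V_j)$ is $\varepsilon$-uniform, and $d(V_i, V_j) \geq \lambda$ if $v_i v_j \in E(H)$, and $d(V_i, V_j) \leq 1 - \lambda$ if $v_i v_j \notin E(H)$. Then there exist $x_i \in V_i$ so that the map $v_i \mapsto x_i$ is an isomorphism $H \to G[\{x_1, \ldots, x_r\}]$.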

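Proof. By replacing the edges between $V_i$ and $V_j$ by the complementary set whenever $v_i v_j \notin E(H)$, we may assume $d(V_i, V_j) \geq \lambda$ for all $i \neq j$, and $H$ is a complete graph. We then apply the previous lemma with $\Delta = r - 1$ and $s = 1$. (Its proof only uses $\varepsilon u > s - 1$, which certainly holds when $s = 1$.)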

Szemerédi showed that every graph that is sufficiently large can be partitioned into finitely many classes, with most pairs being $\varepsilon$-uniform. The idea is simple: whenever we see something that is not uniform, we partition it further into subsets that are more uniform. The “hard part” of the proof is to come up with a measure of how far we are from being uniform.

Definition (Equipartition). An equipartition of $V(G)$ into $k$ parts is a partition into sets $V_1, \ldots, V_k$, where $\lfloor n/k \rfloor \leq |V_i| \leq \lceil n/k \rceil$, where $n = |G|$.

We say that the partition is $\varepsilon$-uniform if $(V_i, V_j)$ is $\varepsilon$-uniform for all but $\varepsilon\binom{k}{2}$ pairs.
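As a small illustration of the definition, the following Python sketch builds an equipartition by handing out the $n \bmod k$ leftover vertices one per part, and tests the counting condition for a partition to be $\varepsilon$-uniform. The helper names `equipartition` and `partition_is_eps_uniform` are hypothetical, and the number of non-uniform pairs is assumed to be computed elsewhere.

```python
from math import comb


def equipartition(vertices, k):
    """Split vertices into k parts whose sizes are floor(n/k) or ceil(n/k)."""
    vertices = list(vertices)
    n = len(vertices)
    small, extra = divmod(n, k)  # 'extra' parts receive one more vertex
    parts, start = [], 0
    for i in range(k):
        size = small + 1 if i < extra else small
        parts.append(vertices[start:start + size])
        start += size
    return parts


def partition_is_eps_uniform(num_bad_pairs, k, eps):
    """At most eps * C(k, 2) of the pairs (V_i, V_j) may fail to be uniform."""
    return num_bad_pairs <= eps * comb(k, 2)


parts = equipartition(range(10), 3)
print([len(p) for p in parts])  # [4, 3, 3]
```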

Theorem (Szemerédi’s regularity lemma). Let $0 < \varepsilon < 1$ and let $\ell$ be some natural number. Then there exists some $L = L(\ell, \varepsilon)$ such that every graph has an $\varepsilon$-uniform equipartition into $m$ parts for some $\ell \leq m \leq L$, depending on the graph.

This lemma was proved by Szemerédi in order to prove his theorem on arithmetic progressions in dense subsets of integers.

When we want to apply this, we usually want at least $\ell$ many parts. For example, having $1$ part is usually not very helpful. The upper bound on $m$ is helpful for us to ensure the parts are large enough, by picking graphs with sufficiently many vertices.

We first need a couple of trivial lemmas.

Lemma. Let $U' \subseteq U$ and $W' \subseteq W$, where $|U'| \geq (1 - \delta)|U|$ and $|W'| \geq (1 - \delta)|W|$. Then
\[ |d(U', W') - d(U, W)| \leq 2\delta. \]

Proof. Let $d = d(U, W)$ and $d' = d(U', W')$. Then
\[ d = \frac{e(U, W)}{|U||W|} \geq \frac{e(U', W')}{|U||W|} = d'\frac{|U'||W'|}{|U||W|} \geq d'(1 - \delta)^2. \]
Thus,
\[ d' - d \leq d'(1 - (1 - \delta)^2) \leq 2\delta d' \leq 2\delta. \]
The other inequality follows from considering the complementary graph, which tells us
\[ (1 - d') - (1 - d) \leq 2\delta. \]

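Lemma. Let $x_1, \ldots, x_n$ be real numbers with
\[ X = \frac{1}{n}\sum_{i=1}^n x_i, \]
and, for some $1 \leq m < n$, let
\[ x = \frac{1}{m}\sum_{i=1}^m x_i. \]
Then
\[ \frac{1}{n}\sum_{i=1}^n x_i^2 \geq X^2 + \frac{m}{n - m}(x - X)^2 \geq X^2 + \frac{m}{n}(x - X)^2. \]
If we ignore the second term on the right, then this is just Cauchy–Schwarz.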

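Proof. We have
\[ \frac{1}{n}\sum_{i=1}^n x_i^2 = \frac{1}{n}\sum_{i=1}^m x_i^2 + \frac{1}{n}\sum_{i=m+1}^n x_i^2 \geq \frac{m}{n}x^2 + \frac{n - m}{n}\left(\frac{nX - mx}{n - m}\right)^2 \geq X^2 + \frac{m}{n - m}(x - X)^2 \]
by two applications of Cauchy–Schwarz.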
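The inequality is easy to sanity-check numerically. The sketch below (the function name `defect_terms` is ours) computes the three quantities in the lemma for a list of numbers and a choice of $m$, then tests them on random inputs. The first example has both blocks constant, which is exactly the case where Cauchy–Schwarz is tight on each block.

```python
import random


def defect_terms(xs, m):
    """Return the three quantities compared in the lemma, for 1 <= m < len(xs)."""
    n = len(xs)
    X = sum(xs) / n        # mean of all n numbers
    x = sum(xs[:m]) / m    # mean of the first m numbers
    mean_square = sum(v * v for v in xs) / n
    middle = X * X + m / (n - m) * (x - X) ** 2
    lower = X * X + m / n * (x - X) ** 2
    return mean_square, middle, lower


# Both blocks are constant, so the first inequality is an equality.
print(defect_terms([1.0, 1.0, 0.0, 0.0], 2))  # (0.5, 0.5, 0.375)

random.seed(0)
for _ in range(1000):
    n = random.randint(2, 20)
    m = random.randint(1, n - 1)
    xs = [random.random() for _ in range(n)]
    a, b, c = defect_terms(xs, m)
    assert a >= b - 1e-12 and b >= c - 1e-12
```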

We can now prove Szemerédi’s regularity lemma.

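Proof. Define the index $\mathrm{ind}(P)$ of an equipartition $P$ into $k$ parts $V_i$ to be
\[ \mathrm{ind}(P) = \frac{1}{k^2}\sum_{i<j} d^2(V_i, V_j). \]
We show that if $P$ is not $\varepsilon$-uniform, then there is a refinement equipartition $Q$ into $k4^k$ parts, with $\mathrm{ind}(Q) \geq \mathrm{ind}(P) + \frac{\varepsilon^5}{8}$.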

This is enough to prove the theorem. For choose $t \geq \ell$ with $4^t\varepsilon^5 \geq 100$. Define recursively a function $f$ by
\[ f(0) = t, \quad f(j + 1) = f(j)4^{f(j)}. \]
Let
\[ N = f(\lceil 4\varepsilon^{-5}\rceil), \]
and pick $L = N16^N$.

Then, if $n \leq L$, just take an equipartition into single vertices. Otherwise, begin with a partition into $t$ parts. As long as the current partition into $k$ parts is not $\varepsilon$-uniform, replace it by a refinement into $k4^k$ parts. The point is that $\mathrm{ind}(P) \leq \frac{1}{2}$ for any partition, since there are $\binom{k}{2} \leq \frac{k^2}{2}$ pairs and each density is at most $1$. So we can’t do this more than $4\varepsilon^{-5}$ times, at which point we have partitioned into at most $N \leq L$ parts.
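To get a feel for the quantities in this argument, here is a short Python sketch. The function names are ours, and the densities are passed in as a dictionary rather than computed from a graph. It evaluates the index of a partition, the maximum number of refinement steps, and the first few values of the recursion for $f$, which already shows the tower-type growth.

```python
from math import ceil


def index(densities, k):
    """ind(P) = (1/k^2) * sum_{i<j} d(V_i, V_j)^2.

    `densities` maps each pair (i, j) with i < j to d(V_i, V_j).
    """
    return sum(d * d for d in densities.values()) / k ** 2


def max_refinements(eps):
    """Each refinement adds eps^5 / 8 to an index that never exceeds 1/2."""
    return ceil(4 * eps ** -5)


def parts_after(t, steps):
    """Iterate f(0) = t, f(j + 1) = f(j) * 4^f(j) for the given number of steps."""
    f = t
    for _ in range(steps):
        f = f * 4 ** f
    return f


# Partition of K_4 into singletons: all 6 densities equal 1.
k4 = {(i, j): 1.0 for i in range(4) for j in range(i + 1, 4)}
print(index(k4, 4))          # 0.375, below the universal bound 1/2
print(max_refinements(0.5))  # 128
print(parts_after(1, 2))     # 1 -> 4 -> 4 * 4^4 = 1024
```

Python integers are unbounded, but a third step from $1024$ already produces a number with several hundred digits, so the recursion should only be run for very few steps.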

Note that the reason we had to set $L = N16^N$ is that in our proof, we want to assume we have many vertices lying around.

The proof really is just one line, but students tend to complain about such short proofs, so let’s try to explain it in a bit more detail. If the partition is not $\varepsilon$-uniform, this means we can further partition each part into uneven pieces. Then our previous lemma tells us this discrepancy allows us to push up $\frac{1}{n}\sum x_i^2$.

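So given an equipartition $P$ that is not $\varepsilon$-uniform, for each non-uniform pair $(V_i, V_j)$ of $P$, we pick witness sets
\[ X_{ij} \subseteq V_i, \quad X_{ji} \subseteq V_j \]
with $|X_{ij}| \geq \varepsilon|V_i|$, $|X_{ji}| \geq \varepsilon|V_j|$ and $|d(X_{ij}, X_{ji}) - d(V_i, V_j)| \geq \varepsilon$.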

Fix $i$. Then the sets $X_{ij}$ partition $V_i$ into at most $2^{k-1}$ atoms. Let $m = \lfloor n/(k4^k) \rfloor$, and let $n = k4^k m + ak + b$, where $0 \leq a < 4^k$ and $0 \leq b < k$. Then we see that
\[ \lfloor n/k \rfloor = 4^k m + a \]
and the parts of $P$ have size $4^k m + a$ or $4^k m + a + 1$, with $b$ of the larger size.

Partition each part of $P$ into $4^k$ sets, of size $m$ or $m + 1$. The smaller $V_i$ have $a$ parts of size $m + 1$, and the larger have $a + 1$ such parts.

We see that any such partition is an equipartition into $k4^k$ parts of size $m$ or $m + 1$, with $ak + b$ parts of the larger size $m + 1$.
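The bookkeeping with $m$, $a$ and $b$ is easy to get wrong, so here is a minimal Python sketch of it. The function name `refinement_sizes` is hypothetical, and the sketch only computes the sizes of the pieces, not which vertices go where.

```python
def refinement_sizes(n, k):
    """Sizes of the 4^k pieces cut out of each of the k parts of P.

    Writes n = k * 4^k * m + a * k + b with 0 <= a < 4^k and 0 <= b < k.
    """
    q = 4 ** k
    m = n // (k * q)
    a, b = divmod(n - k * q * m, k)
    pieces = []
    for i in range(k):
        # b parts of P are "larger" and get one extra piece of size m + 1
        big = a + 1 if i < b else a
        pieces.append([m + 1] * big + [m] * (q - big))
    return m, a, b, pieces


m, a, b, pieces = refinement_sizes(101, 2)
print(m, a, b)                   # 3 2 1
print([sum(p) for p in pieces])  # [51, 50]
```

With $n = 101$ and $k = 2$ the two parts of $P$ have sizes $51$ and $50$, each is cut into $16$ pieces of size $3$ or $4$, and in total $ak + b = 5$ pieces have the larger size.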

Let’s choose such an equipartition $Q$ with parts as nearly as possible inside atoms, so each atom is a union of parts of $Q$ with at most $m$ extra vertices.

All that remains is to check that $\mathrm{ind}(Q) \geq \mathrm{ind}(P) + \frac{\varepsilon^5}{8}$.

Let the sets of $Q$ within $V_i$ be $V_i(s)$, where $1 \leq s \leq 4^k \equiv q$. So
\[ V_i = \bigcup_{s=1}^q V_i(s). \]
Now
\[ \sum_{1 \leq s, t \leq q} e(V_i(s), V_j(t)) = e(V_i, V_j). \]

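We’d like to divide by some numbers and convert these to densities, but this is where we have to watch out. But this is still quite easy to handle. We have
\[ \frac{m}{m + 1}q|V_i(s)| \leq |V_i| \leq \frac{m + 1}{m}q|V_i(s)| \]
for all $s$. So we want $m$ to be large for this to not hurt us too much. Dividing, we get
\[ \left(\frac{m}{m + 1}\right)^2 d(V_i, V_j) \leq \frac{1}{q^2}\sum_{s,t} d(V_i(s), V_j(t)) \leq \left(\frac{m + 1}{m}\right)^2 d(V_i, V_j). \]
Using $n \geq k16^k$, and hence
\[ \left(\frac{m}{m + 1}\right)^2 \geq 1 - \frac{2}{m} \geq 1 - \frac{2}{4^k} \geq 1 - \frac{\varepsilon^5}{50}, \]
we have
\[ \left|\frac{1}{q^2}\sum_{s,t} d(V_i(s), V_j(t)) - d(V_i, V_j)\right| \leq \frac{\varepsilon^5}{49}\,d(V_i, V_j) \leq \frac{\varepsilon^5}{49}. \]
In particular, by Cauchy–Schwarz,
\[ \frac{1}{q^2}\sum_{s,t} d^2(V_i(s), V_j(t)) \geq d^2(V_i, V_j) - \frac{\varepsilon^5}{25}. \]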

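The lower bound can be improved if $(V_i, V_j)$ is not $\varepsilon$-uniform.

Let $X_{ij}^*$ be the largest subset of $X_{ij}$ that is the union of parts of $Q$. We may assume
\[ X_{ij}^* = \bigcup_{1 \leq s \leq r_i} V_i(s). \]
By an argument similar to the above, we have
\[ \frac{1}{r_i r_j}\sum_{1 \leq s \leq r_i}\sum_{1 \leq t \leq r_j} d(V_i(s), V_j(t)) \leq d(X_{ij}^*, X_{ji}^*) + \frac{\varepsilon^5}{49}. \]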

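By the choice of parts of $Q$ within atoms, and because $|V_i| \geq qm = 4^k m$, we have
\[ |X_{ij}^*| \geq |X_{ij}| - 2^{k-1}m \geq |X_{ij}|\left(1 - \frac{2^{k-1}m}{\varepsilon|V_i|}\right) \geq |X_{ij}|\left(1 - \frac{1}{2^{k+1}\varepsilon}\right) \geq |X_{ij}|\left(1 - \frac{\varepsilon}{10}\right). \]
So by the lemma, we know
\[ |d(X_{ij}^*, X_{ji}^*) - d(X_{ij}, X_{ji})| < \frac{\varepsilon}{5}. \]
Recalling that
\[ |d(X_{ij}, X_{ji}) - d(V_i, V_j)| \geq \varepsilon, \]
we have
\[ \left|\frac{1}{r_i r_j}\sum_{1 \leq s \leq r_i}\sum_{1 \leq t \leq r_j} d(V_i(s), V_j(t)) - d(V_i, V_j)\right| > \frac{3}{4}\varepsilon. \]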

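We can now apply our Cauchy–Schwarz inequality with (in the notation of that lemma) $n = q^2$ and $m = r_i r_j$, which gives
\[ \frac{1}{q^2}\sum_{1 \leq s,t \leq q} d^2(V_i(s), V_j(t)) \geq d^2(V_i, V_j) - \frac{\varepsilon^5}{25} + \frac{r_i r_j}{q^2}\cdot\frac{9\varepsilon^2}{16} \geq d^2(V_i, V_j) - \frac{\varepsilon^5}{25} + \frac{\varepsilon^4}{3}, \]
using the fact that
\[ \frac{r_i}{q} \geq \frac{m}{m + 1}\cdot\frac{|X_{ij}^*|}{|V_i|} \geq \left(1 - \frac{1}{m}\right)\frac{|X_{ij}^*|}{|V_i|} \geq \left(1 - \frac{1}{m}\right)\left(1 - \frac{\varepsilon}{10}\right)\frac{|X_{ij}|}{|V_i|} > \frac{4\varepsilon}{5}. \]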

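Therefore, dropping the pairs of parts of $Q$ that lie inside the same $V_i$,
\[ \mathrm{ind}(Q) \geq \frac{1}{k^2q^2}\sum_{1 \leq i<j \leq k}\ \sum_{1 \leq s,t \leq q} d^2(V_i(s), V_j(t)) \geq \frac{1}{k^2}\sum_{1 \leq i<j \leq k}\left(d^2(V_i, V_j) - \frac{\varepsilon^5}{25}\right) + \frac{1}{k^2}\,\varepsilon\binom{k}{2}\frac{\varepsilon^4}{3} \geq \mathrm{ind}(P) + \frac{\varepsilon^5}{8}. \]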

The proof gives something like
\[ L \sim 2^{2^{\cdot^{\cdot^{\cdot^{2}}}}}, \]
where the tower is $\varepsilon^{-5}$ tall. Can we do better than that?

In 1997, Gowers showed that a tower of height at least $\varepsilon^{-1/16}$ is needed. More generally, we can define $V_1, \ldots, V_k$ to be $(\varepsilon, \delta, \eta)$-uniform if all but $\eta\binom{k}{2}$ pairs $(V_i, V_j)$ satisfy $|d(V_i, V_j) - d(V_i', V_j')| \leq \varepsilon$ whenever $|V_i'| \geq \delta|V_i|$. Then there is a graph for which every $(1 - \delta^{1/16}, \delta, 1 - 20\delta^{1/16})$-uniform partition needs a tower of height $\delta^{-1/16}$ parts.

More recently, Moshkovitz and Shapira (2012) improved these bounds. Most recently, a reformulation of the lemma due to Lovász and Szegedy (2007), for which the upper bound is $\mathrm{tower}(\varepsilon^{-2})$, was shown to have lower bound $\mathrm{tower}(\varepsilon^{-2})$ by Fox and Lovász (2014) (note that these are different Lovász’s!).

Let’s now turn to some applications of Szemerédi’s regularity lemma. Recall that Ramsey’s theorem says there exists $R(k)$ so that every red-blue colouring of the edges of $K_n$ yields a monochromatic $K_k$ provided $n \geq R(k)$. There are known bounds
\[ 2^{k/2} \leq R(k) \leq 4^k. \]
The existence of $R(k)$ implies that for every graph $G$, there exists a number $r(G)$ minimal so that if $n \geq r(G)$ and we red-blue colour the edges of $K_n$, we obtain a monochromatic $G$. Clearly, we have
\[ r(G) \leq R(|G|). \]
How much smaller can $r(G)$ be compared to $R(|G|)$?

Theorem. Given an integer d, there exists c(d) such that

r(G) ≤ c|G|

for every graph G with ∆(G) ≤ d.

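Proof. Let $t = R(d + 1)$. Pick $\varepsilon \leq \min\left\{\frac{1}{t}, \frac{1}{2^d(d + 1)}\right\}$. Let $\ell \geq t^2$, and let $L = L(\ell, \varepsilon)$. We show that $c = \frac{L}{\varepsilon}$ works.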

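Indeed, let $G$ be a graph. Colour the edges of $K_n$ by red and blue, where $n \geq c|G|$. Apply Szemerédi’s regularity lemma to the red graph with $\ell, \varepsilon$ as above. Let $H$ be the graph whose vertices are $\{V_1, \ldots, V_m\}$, where $V_1, \ldots, V_m$ is the partition of the red graph. Let $V_iV_j \in E(H)$ if $(V_i, V_j)$ is $\varepsilon$-uniform. Notice that $m = |H| \geq \ell \geq t^2$, and $e(\bar{H}) \leq \varepsilon\binom{m}{2}$. So $H \supseteq K_t$, or else by Turán’s theorem, there are integers $d_1, \ldots, d_{t-1}$ with $\sum d_i = m$ and
\[ e(\bar{H}) \geq \sum_i \binom{d_i}{2} \geq (t - 1)\binom{m/(t - 1)}{2} > \varepsilon\binom{m}{2} \]
by our choice of $\varepsilon$ and $\ell$, which is a contradiction.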

We may as well assume all pairs $V_i, V_j$ for $1 \leq i < j \leq t$ are $\varepsilon$-uniform. We colour the edge $V_iV_j$ of $K_t$ green if $d(V_i, V_j) \geq \frac{1}{2}$ (in the red graph), or white if $d(V_i, V_j) < \frac{1}{2}$ (i.e. density $> \frac{1}{2}$ in the blue graph).

By Ramsey’s theorem, we may assume all pairs $V_iV_j$ for $1 \leq i < j \leq d + 1$ are the same colour. We may wlog assume the colour is green, and we shall find a red $G$ (a similar argument gives a blue $G$ if the colour is white).

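Indeed, take a vertex colouring of $G$ with at most $d + 1$ colours (using $\Delta(G) \leq d$), with no colour used more than $|G|$ times. We apply the building lemma with $H$ (in the lemma) being $G$ (in this proof), and $G$ (in the lemma) equal to the subgraph of the red graph spanned by $V_1, \ldots, V_{d+1}$ (here), with
\[ u = |V_i| \geq \frac{n}{L} \geq \frac{c|G|}{L} \geq \frac{|G|}{\varepsilon}, \quad r = d + 1, \quad \lambda = \frac{1}{2}, \]
as well as $\Delta = d$ and $s = |G| \leq \varepsilon u$, and we are done.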

This proof is due to Chvátal, Rödl, Szemerédi and Trotter (1983). It was extended to more general graphs by Chen and Schelp (1993), including planar graphs. It was conjectured by Burr and Erdős (1978) to be true for $d$-degenerate graphs ($e(H) \leq d|H|$ for all $H \subseteq G$).

Kostochka–Rödl (2004) introduced “dependent random choice”, which was used by Fox–Sudakov (2009), and finally Lee (2015) proved the full conjecture.

An important application of Szemerédi’s regularity lemma is the triangle removal lemma.

Theorem (Triangle removal lemma). Given $\varepsilon > 0$, there exists $\delta > 0$ such that if $|G| = n$ and $G$ contains at most $\delta n^3$ triangles, then there exists a set of at most $\varepsilon n^2$ edges whose removal leaves no triangles.

Proof. Exercise. (See example sheet)

Appropriate modifications hold for general graphs, not just triangles.

Corollary (Roth, 1950’s). Let $\varepsilon > 0$. Then if $n$ is large enough, and $A \subseteq [n] = \{1, 2, \ldots, n\}$ with $|A| \geq \varepsilon n$, then $A$ contains a 3-term arithmetic progression.

Roth originally proved this by some sort of Fourier analysis argument, while Szemerédi later proved this for all lengths.

Proof. Define
\[ B = \{(x, y) \in [2n]^2 : x - y \in A\}. \]
Then certainly $|B| \geq \varepsilon n^2$. We form a 3-partite graph $G$ with disjoint vertex classes $X = [2n]$, $Y = [2n]$ and $Z = [4n]$. If we have $x \in X$, $y \in Y$ and $z \in Z$, we join $x$ to $y$ if $(x, y) \in B$; join $x$ to $z$ if $(x, z - x) \in B$; and join $y$ to $z$ if $(z - y, y) \in B$.

A triangle in $G$ is a triple $(x, y, z)$ where
\[ (x, y),\ (x, y + w),\ (x + w, y) \in B, \]
where $w = z - x - y$. Note that $w < 0$ is okay. A $0$-triangle is one with $w = 0$. There are at least $\varepsilon n^2$ of these, one for each $(x, y) \in B$, and these are edge disjoint, because $z = x + y$.

Hence the triangles cannot be killed by removing $\leq \varepsilon n^2/2$ edges. By the triangle removal lemma, we must have $\geq \delta n^3$ triangles for some $\delta$. In particular, for $n$ large enough, there is some triangle that is not a $0$-triangle, since there are at most $|B| \leq 4n^2$ of the latter.

But then we are done, since
\[ x - y - w,\ x - y,\ x - y + w \in A \]
where $w \neq 0$, and this is a 3-term arithmetic progression.
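The construction in this proof can be run directly on small examples. The following Python sketch (function names `triangles` and `progression` are ours, and the triple loop is only meant for tiny $n$) lists all triangles of the 3-partite graph and reads off the three elements of $A$ attached to each one.

```python
def triangles(A, n):
    """Triangles (x, y, z) of the 3-partite graph on X = Y = [2n], Z = [4n]."""
    A = set(A)

    def in_B(x, y):
        # B = {(x, y) in [2n]^2 : x - y in A}
        return 1 <= x <= 2 * n and 1 <= y <= 2 * n and (x - y) in A

    return [
        (x, y, z)
        for x in range(1, 2 * n + 1)
        for y in range(1, 2 * n + 1)
        if in_B(x, y)
        for z in range(1, 4 * n + 1)
        if in_B(x, z - x) and in_B(z - y, y)
    ]


def progression(x, y, z):
    """The three elements of A read off from a triangle, with w = z - x - y."""
    w = z - x - y
    return (x - y - w, x - y, x - y + w)


# A = {1, 2, 4} has no 3-term progression, so only 0-triangles appear.
print(all(z == x + y for x, y, z in triangles({1, 2, 4}, 4)))
# A = {1, 2, 3} does, and the triangles with w != 0 exhibit it.
print({progression(*t) for t in triangles({1, 2, 3}, 3) if t[2] != t[0] + t[1]})
```

For a progression-free set the triangles are exactly the $0$-triangles, one per element of $B$, which is the situation the triangle removal lemma rules out for large $n$.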

There is a simple generalization of this argument which yields $k$-term arithmetic progressions provided we have a suitable removal lemma. This needs a Szemerédi regularity lemma for $(k - 1)$-uniform hypergraphs, instead of just graphs, and a corresponding version of the building lemma.

The natural generalization of Szemerédi’s lemma to hypergraphs is easily shown to be true (exercise). The catch is that this version of Szemerédi’s lemma does not give us a strong enough result to apply the building lemma.

What we need is a stronger version of regularity that allows us to build graphs, but not so strong that we can’t prove the corresponding regularity lemma. A workable hypergraph regularity lemma along these lines was proved by Nagle, Rödl, Skokan, and by Gowers (those weren’t quite the same lemma, though).