III Percolation and Random Walks on Graphs

1Percolation

1.1 The critical probability

There are two models of percolation — bond percolation and site percolation. In

this course, we will focus on bond percolation, but we will look at site percolation

in the example sheets.

The very basic set up of percolation theory involves picking a graph

(

V, E

), where

is the set of vertices and

is the set of edges. We also pick a

percolation probability

p ∈

1]. For each edge

e ∈ E

, we keep it with probability

and throw it with probability 1

− p

. In the first case, we say the edge is open,

and in the latter, we say it is closed.

More precisely, we define the probability space to be Ω =

{

}

, where 0

denotes a closed edge and 1 denotes an open one (in the case of site percolation,

we have Ω =

{

}

). We endow Ω with the

-algebra generated by cylinder

sets

{ω ∈ Ω : ω(e) = x

for all e ∈ A},

where

is a finite set and

∈ {

}

for all

. In other words, this is the

product

-algebra. As probability measure, we take the product measure

, i.e.

every edge is 1 with probability

and 0 with probability 1

− p

. We will write

∈ {0, 1}

for the state of the system.

Now what can we say about the graph resulting from this process? One

question we may ask is whether we can connect two points in the graphs via the

edges that remain. To further the discussion, we introduce some notation.

Notation. We write x ↔ y if there is an open path of edges from x to y.

Notation. We write C(x) = {y ∈ V : y ↔ x}, the cluster of x.

Notation. We write x ↔ ∞ if |C(x)| = ∞.

From now on, we shall take

= (

, E

(

)), the

-dimensional integer

lattice. Then by translation invariance,

(

)

has the same distribution as

(0)

for all x. We now introduce a key piece of notation:

Definition (θ(p)). We define θ(p) = P

(|C(0)| = ∞).

Most of the questions we ask surround this

(

). We first make the most

elementary observations:

Example. θ(0) = 0 and θ(1) = 1.

A natural question to ask is then if we can find

p ∈

1) such that

(

)

But even before answering that question, we can ask a more elementary one —

is θ an increasing function of p?

Intuitively, it must be. And we can prove it. The proof strategy is known as

coupling. We have already seen coupling in IB Markov Chains, where we used it

to prove the convergence to the invariant distribution under suitable conditions.

Here we are going to couple all percolation processes for different values of P .

Lemma. θ is an increasing function of p.

Proof.

We let (

(

))

e∈E(Z

)

be iid

1] random variables. For each

p ∈

1],

we define

(e) =

(

1 U(e) ≤ p

0 otherwise

Then

(

) = 1) =

(

)

< p

) =

. Since the

(

) are independent, so are

. Thus η

has the law of bond percolation with probability p.

Moreover, if p ≤ q, then η

(e) ≤ η

(e). So the result follows.

Note that this is not only useful as a theoretical tool. If we want to simulate

percolation with different probabilities

, we can simply generate a set of

variables, and use it to produce a percolation for all p.

If we wish, we can provide an abstract definition of what coupling is, but the

detailed definition is not of much practical use:

Definition

(Coupling)

Let

and

be two probability measures on (potentially)

different probability spaces. A coupling is a pair of random variables (

X, Y

)

defined on the same probability space such that the marginal distribution of

is µ and the marginal distribution of Y is ν.

With the lemma, we can make the definition

Definition (Critical probability). We define p

(d) = sup{p ∈ [0, 1] : θ(p) = 0}.

Recall we initially asked whether

(

) can be non-zero for

p ∈

1). We

can now rephrase and strengthen this question by asking for the value of p

(d).

There are a lot more questions we can ask about p

and θ(p).

For example, we know that

(

) is a

∞

function on (

1]. However, we

do not know if

is continuous at

= 3. We will see soon that

d = 2, but the exact value of p

is not known in higher dimensions.

Let’s start actually proving things about p

. We previously noted that

Proposition. p

(1) = 1.

The first actually interesting theorem is the following:

Theorem. For all d ≥ 2, we have p

(d) ∈ (0, 1).

We shall break this up into two natural parts:

Lemma. For d ≥ 2, p

(d) > 0.

Proof.

Write Σ

for the number of open self-avoiding paths of length

starting

at 0. We then note that

(|C(0)| = ∞) = P

(∀n ≥ 1 : Σ

≥ 1) = lim

n→∞

(Σ

≥ 1) ≤ lim

n→∞

[Σ

We can now compute

[Σ

]. The point is that expectation is linear, which

makes this much easier to compute. We let

be the number of self-avoiding

paths of length n from 0. Then we simply have

[Σ

] = σ

We can bound

by 2

d ·

d −

n−1

, since we have 2

choices of the first step,

and at most 2d − 1 choices in each subsequent step. So we have

[Σ

] ≤ 2d(2d − 1)

n−1

2d − 1

(p(2d − 1))

So if p(2d − 1) < 1, then θ(p) = 0. So we know that

(d) ≥

2d − 1

Before we move on to the other half of the theorem, we talk a bit more about

self-avoiding paths.

Definition

(

)

We write

for the number of self-avoiding paths of length

starting from 0.

In the proof, we used the rather crude bound

≤ 2d · (2d − 1)

n−1

More generally, we can make the following bound:

Lemma. We have σ

n+m

≤ σ

Proof.

A self-avoiding path of length

can be written as a concatenation of

self-avoiding paths of length

starting from 0 and another one of length

Taking the logarithm, we know that

log σ

is a subadditive sequence. It turns

out this property alone is already quite useful. It is an exercise in analysis to

prove the following lemma:

Lemma

(Fekete’s lemma)

If (

) is a subadditive sequence of real numbers,

then

lim

n→∞

= inf

: k ≥ 1

∈ [−∞, ∞).

In particular, the limit exists.

This allows us to define

Definition (λ and κ). We define

λ = lim

n→∞

log σ

, κ = e

κ is known as the connective constant.

Then by definition, we have

= e

nλ(1+o(1))

= κ

n+o(n)

as n → ∞.

Thus, asymptotically, the growth rate of

is determined by

. It is natural

to then seek for the value of

, but unfortunately we don’t know the value of

for the Euclidean lattice. A bit more on the positive side, the value for the

hexagonal lattice has been found recently:

Theorem (Duminil-Copin, Smirnov, 2010). The hexagonal lattice has

hex

2 +

√

We might want to be a bit more precise about how

grows. For

d ≥

5, we

have the following theorem:

Theorem

(Hara and Slade, 1991)

For

d ≥

5, there exists a constant

such

that

= Aκ

(1 + O(n

−ε

))

for any ε <

We don’t really know what happens when

d <

5, but we have the following

conjecture:

Conjecture.

≈











11/32

d = 2

d = 3

(log n)

1/4

d = 4

One can also instead try to bound

from above. We have the following

classic theorem:

Theorem (Hammersley and Welsh, 1962). For all d ≥ 2, we have

≤ Cκ

exp(c

√

for some constants C and c

In fact, a better bound was recently found:

Theorem (Hutchcroft, 2017). For d ≥ 2, we have

≤ Cκ

exp(o(

√

n)).

This will be proved in the example sheet.

What would be a good way to understand self-avoiding walks? Fixing an

there are only finitely many self-avoiding walks of length

. So we can sample

such a self-avoiding walk uniformly at random. In general, we would expect

the total displacement of the walk to be

∼

√

. Thus, what we can try to do

is to take

n → ∞

while simultaneously shrinking space by a factor of

√

. We

would then hope that the result converges toward some Brownian motion-like

trajectory. If we can characterize the precise behaviour of this scaling limit,

then we might be able to say something concrete about

and the asymptotic

behaviour of σ

But we don’t really know what the scaling limit is. In the case

= 2, it is

conjectured to be

SLE

(

). Curiously, it was proven by Gwynne and Miller in

2016 that if we instead looked at self-avoiding walks on a random surface, then

the scaling limit is SLE(

That’s enough of a digression. Let’s finish our proof and show that

(

)

A first observation is that it suffices to show this for

= 2. Indeed, since

embeds into

d+1

for all

, if we can find an infinite cluster in

, then the same

is true for

d+1

. Thus, it is natural to restrict to the case of

= 2, where duality

will be prove to be an extremely useful tool.

Definition

(Planar graph)

A graph

is called planar if it can be embedded

on the plane in such a way that no two edges cross.

Definition

(Dual graph)

Let

be a planar graph (which we call the primal

graph). We define the dual graph by placing a vertex in each face of

, and

connecting 2 vertices if their faces share a boundary edge.

Example. The dual of Z

is isomorphic to Z

The dual lattice will help us prove a lot of properties for percolation in Z

Lemma. p

(d) < 1 for all d ≥ 2.

Proof.

It suffices to show this for

= 2. Suppose we perform percolation on

Then this induces a percolation on the dual lattice by declaring an edge of the

dual is open if it crosses an open edge of Z

, and closed otherwise.

Suppose

(0)

| < ∞

in the primal lattice. Then there is a closed circuit in

the dual lattice, given by the “boundary” of

(0). Let

be the number of

closed dual circuits of length

that surround 0. Then the union bound plus

Markov’s inequality tells us

(|C(0)| < ∞) = P

(∃n ≥ D

≥ 1) ≤

∞

n=4

using the union bound and Markov’s inequality.

It is a simple exercise to show that

Exercise.

Show that the number of dual circuits of length

that contain 0 is

at most n · 4

From this, it follows that

(|C(0)| < ∞) ≤

∞

n=4

n · 4

(1 − p)

Thus, if

is sufficiently near 1, then

(

(0)

| < ∞

) is bounded away from 1.

By definition, if

p < p

(

), then 0 is almost surely not contained in an infinite

cluster. If

p > p

(

), then there is a positive probability that 0 is contained

in an infinite cluster. However, it is of course not necessarily the case that

0 is connected to

∞

with probability 1. In fact, there is at least probability

− p

)

that 0 is not connected to

∞

, since 0 cannot be connected to

∞

if all

its neighbouring edges are closed. However, it is still possible that there is some

infinite cluster somewhere. It’s just that it does not contain 0.

Proposition. Let A

∞

be the event that there is an infinite cluster.

(i) If θ(p) = 0, then P

∞

) = 0.

(ii) If θ(p) > 0, then P

∞

) = 1.

Proof.

(i) We have

∞

) = P

(∃x : |C(x)| = ∞) ≤

x∈Z

(|C(x)| = ∞) =

θ(p) = 0.

(ii)

We need to apply the Kolmogorov 0-1 law . Recall that if

, X

, . . .

are

independent random variables, and

(

k ≥ n

∞

n≥0

Then F

∞

is trivial, i.e. for all A ∈ F

∞

, P(A) ∈ {0, 1}.

So we order the edges of Z

as e

, e

, . . . and denote their states

w(e

), w(e

), . . . .

These are iid random variables. We certainly have

(

∞

)

≥ θ

(

)

So if we can show that

∞

∈ F

∞

, then we are done. But this is clear,

since changing the states of a finite number of edges does not affect the

occurrence of A

∞

The next follow up questions is how many infinite clusters do we expect to

get?

Theorem

(Burton and Keane)

p > p

, then there exists a unique infinite

cluster with probability 1.

This proof is considerably harder than the ones we have previously done. We

might think we can use the Kolmogorov 0-1 law, but we can’t, since changing

a finite number of edges can break up or join together infinite clusters, so the

event that there are

infinite clusters for

k >

0 is not in

∞

. However, we can

exploit the fact that N is translation invariant.

Exercise.

Let

be an event that is translation invariant. Then

(

) = 0 or 1

almost surely.

Proof.

Let

be the number of infinite clusters. Then by the lemma, we know

is constant almost surely. So there is some

k ∈ N ∪{∞}

such that

(

) = 1.

First of all, we know that

k 6

= 0, since

(

)

0. We shall first exclude 2

≤ k < ∞

and then exclude k = ∞.

Assume that

k < ∞

. We will show that

(

= 1)

0, and hence it must

be the case that P

(n = 1) = 1.

To bound this probability, we let

(

) = [

−n, n

]

∩ Z

(which we will

sometimes write as B

), and let ∂B(n) be its boundary. We know that

(all infinite clusters intersect ∂B(n)) → 1

n → ∞

. This is since with probability 1, there are only finitely many clusters

by assumption, and for each of these configurations, all infinite clusters intersect

∂B(n) for sufficiently large n.

In particular, we can take n large enough such that

(all infinite clusters intersect ∂B(n)) ≥

We can then bound

(N = 1) ≥ P

(all infinite clusters intersect ∂B(n)

and all edges in B(n) are open).

Finally, note that the two events in there are independent, since they involve

different edges. But the probability that all edges in

(

) are open is just

E(B(n))

. So

(N = 1) ≥

E(B(n))

> 0.

So we are done.

We now have to show that

k 6

∞

. This involves the notion of a trifurcation.

The idea is that we will show that if k = ∞, then the probability that a vertex

is a trifurcation is positive. This implies the expected number of trifurcations

∼ n

. We will then show deterministically that the number of trifurcations

inside

(

) must be

≤ |∂B

(

)

, and so there are

(

d−1

) trifurcations, which is

a contradiction.

We say a vertex x is a trifurcation if the following three conditions hold:

(i) x is in an infinite open cluster C

∞

;

(ii) There exist exactly three open edges adjacent to x;

(iii) C

∞

\ {x} contains exactly three infinite clusters and no finite ones.

This is clearly a translation invariant notion. So

(0 is a trifurcation) = P

(x is a trifurcation)

for all x ∈ Z

Claim. P

(0 is a trifurcation) > 0.

We need to use something slightly different from

(

). We define

(

) =

{x ∈ Z

: kxk

≤ n}.

The crucial property of this is that for any

, x

∈ ∂S

(

), there exist three

disjoint self-avoiding paths joining

to 0 (exercise!). For each triple

, x

we arbitrarily pick a set of three such paths, and define the event

J(x

, x

) = {all edges on these 3 paths are open

and everything else inside S(n) is closed}.

Next, for every possible infinite cluster in

\ S

(

) that intersects

∂S

(

) at at

least one point, we pick a designated point of intersection arbitrarily.

Then we can bound

(0 is a trifurcation) ≥ P

(∃C

∞

, C

∞

, C

∞

⊆ Z

\ S(n)

infinite clusters which intersect ∂S(n) at x

, x

, and J(x

, x

)).

Rewrite the right-hand probability as

(J(x

, x

) | ∃C

∞

, C

∞

, C

∞

⊆ Z intersecting ∂S(n))

× P

(∃C

∞

, C

∞

, C

∞

⊆ Z

\ ∂S(n))

We can bound the first term by

min(p, 1 − p)

E(S(n))

To bound the second probability, we have already assumed that

(

∞

) = 1.

(

∃C

∞

, C

∞

, C

∞

⊆ Z

\ S

(

)

intersecting ∂S

(

))

→

1 as

n → ∞

. We can

then take

large enough such that the probability is

≥

. So we have shown

that c ≡ P

(0 is a trifurcation) > 0.

Using the linearity of expectation, it follows that

[number of trifurcations inside B(n))] ≥ c|B(n)| ∼ n

On the other hand, we can bound the number of trifurcations in

(

) by

|∂B

(

)

To see this, suppose

is a trifurcation in

(

). By definition, there exists

3 open paths to

∂B

(

). Fix three such paths. Let

be another trifurcation.

It also has 3 open paths to the

∂B

(

), and its paths to the boundary could

intersect those of

. However, they cannot create a cycle, by definition of a

trifurcation. For simplicity, we add the rule that when we produce the paths for

, once we intersect the path of x

, we continue following the path of x

Exploring all trifurcations this way, we obtain a forest inside B(n), and the

boundary points will be the leaves of the forest. Now the trifurcations have

degree 3 in this forest. The rest is just combinatorics.

Exercise.

For any tree, the number of degree 3 vertices is always less than the

number of leaves.