3 Long-run behaviour

IB Markov Chains

3.2 Convergence to equilibrium
So far, we have seen that if a chain converges, then it must converge to an
invariant distribution. We then proved that the chain has a (unique) invariant
distribution if and only if it is positive recurrent.
Now, we want to understand when convergence actually occurs.
Theorem. Consider a Markov chain that is irreducible, positive recurrent and
aperiodic. Then
\[
  p_{i,k}(n) \to \pi_k
\]
as $n \to \infty$, where $\pi$ is the unique invariant distribution.
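Before proving this, we can watch the convergence numerically. The sketch below uses a small made-up 3-state chain (the matrix is purely illustrative): every row of $P^n$, i.e. every starting state, approaches the same $\pi$.

```python
import numpy as np

# A made-up irreducible, aperiodic 3-state chain (illustrative only).
P = np.array([[0.5, 0.3, 0.2],
              [0.2, 0.6, 0.2],
              [0.3, 0.3, 0.4]])

# The invariant distribution pi solves pi = pi P; numerically, it is the
# left eigenvector of P for eigenvalue 1, normalised to sum to 1.
w, v = np.linalg.eig(P.T)
pi = np.real(v[:, np.argmin(np.abs(w - 1))])
pi = pi / pi.sum()

# p_{i,k}(n) is the (i, k) entry of P^n; each row converges to pi.
Pn = np.linalg.matrix_power(P, 50)
print(np.abs(Pn - pi).max())  # essentially zero
```

The eigenvector route is just one convenient way to find $\pi$; solving the linear system $\pi(P - I) = 0$ with $\sum_i \pi_i = 1$ works equally well.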
We will prove this by “coupling”. Here we have two families of probabilities,
and we want to prove a relation between them. The first step is to move our
attention to random variables, by considering random variables that give rise
to these probability distributions. In other words, we look at the Markov
chains themselves instead of the probabilities. In general, random variables
are nicer to work with, since they are functions, not discrete, unrelated
numbers.
However, we have a problem: we get two random variables, but they are
completely unrelated. This is bad. So we will need to do some “coupling” to
correlate the two random variables with each other.
Proof.
(non-examinable) The idea of the proof is to show that for any $i, j, k \in S$,
we have $p_{i,k}(n) - p_{j,k}(n) \to 0$ as $n \to \infty$. Then we can argue that no matter where
we start, we will tend to the same distribution, and hence any distribution tends
to the same distribution as $\pi$, since $\pi$ doesn't change.
As mentioned, instead of working with probability distributions, we will work
with the chains themselves. In particular, we have two Markov chains, and we
imagine one starts at $i$ and the other starts at $j$. To do so, we define the pair
$Z = (X, Y)$ of two independent chains, with $X = (X_n)$ and $Y = (Y_n)$ each
having state space $S$ and transition matrix $P$.
We can let $Z = (Z_n)$, where $Z_n = (X_n, Y_n)$; this is a Markov chain on state space
$S^2$, with transition probabilities
\[
  p_{ij,k\ell} = p_{i,k}\, p_{j,\ell}
\]
by independence of the chains. We would like to apply theorems to $Z$, so we
need to make sure it has nice properties. First, we want to check that $Z$ is
irreducible. We have
\[
  p_{ij,k\ell}(n) = p_{i,k}(n)\, p_{j,\ell}(n).
\]
We want this to be strictly positive for some $n$. We know that there is some $m$ such
that $p_{i,k}(m) > 0$, and some $r$ such that $p_{j,\ell}(r) > 0$. However, what we need is
an $n$ that makes both simultaneously positive. This is where aperiodicity comes
in: for an irreducible aperiodic chain, $p_{i,k}(n) > 0$ for all sufficiently large
$n$, so we can indeed find such an $n$.
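As an aside, if we flatten the pair $(i, j)$ to the index $|S| \cdot i + j$, the transition matrix of $Z$ is exactly the Kronecker product $P \otimes P$, which gives a quick way to check the formula above numerically. The 3-state matrix below is made up purely for illustration.

```python
import numpy as np

# A made-up 3-state transition matrix (illustrative only).
P = np.array([[0.5, 0.3, 0.2],
              [0.2, 0.6, 0.2],
              [0.3, 0.3, 0.4]])

# Flatten (i, j) to 3*i + j; then p_{ij,kl} = p_{i,k} p_{j,l} is exactly
# the Kronecker product of P with itself.
PZ = np.kron(P, P)

assert PZ.shape == (9, 9)
assert np.allclose(PZ.sum(axis=1), 1)  # each row still sums to 1
# Spot-check one entry: p_{(0,1),(2,0)} = p_{0,2} * p_{1,0}.
assert np.isclose(PZ[3*0 + 1, 3*2 + 0], P[0, 2] * P[1, 0])
```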
Now we want to show positive recurrence. We know that $X$, and hence $Y$,
is positive recurrent. By our previous theorem, there is a unique invariant
distribution $\pi$ for $P$. It is then easy to check that $Z$ has invariant distribution
$\nu = (\nu_{ij} : (i, j) \in S^2)$ given by
\[
  \nu_{ij} = \pi_i \pi_j.
\]
This works because $X$ and $Y$ are independent. So $Z$ is also positive recurrent.
So $Z$ is nice.
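The claim that $\nu = \pi \otimes \pi$ is invariant follows from $(\pi \otimes \pi)(P \otimes P) = (\pi P) \otimes (\pi P) = \pi \otimes \pi$, and we can confirm it numerically with a made-up 3-state chain (illustrative only), flattening $(i, j)$ to $3i + j$ as before.

```python
import numpy as np

# A made-up 3-state chain (illustrative only).
P = np.array([[0.5, 0.3, 0.2],
              [0.2, 0.6, 0.2],
              [0.3, 0.3, 0.4]])

# Invariant distribution of P: left eigenvector for eigenvalue 1.
w, v = np.linalg.eig(P.T)
pi = np.real(v[:, np.argmin(np.abs(w - 1))])
pi = pi / pi.sum()

# nu_{ij} = pi_i pi_j, flattened to match the 3*i + j state indexing.
nu = np.kron(pi, pi)
PZ = np.kron(P, P)

assert np.isclose(nu.sum(), 1)
assert np.allclose(nu @ PZ, nu)  # nu is invariant for the product chain Z
```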
The next step is to couple the two chains together. The idea is to fix some
state $s \in S$, and let $T$ be the earliest time at which $X_n = Y_n = s$. Because of
recurrence, such a $T$ is always finite. After this time $T$, $X$ and $Y$ behave
according to the exact same distribution. We define
\[
  T = \inf\{n : Z_n = (X_n, Y_n) = (s, s)\}.
\]
We have
\begin{align*}
  p_{i,k}(n) &= \mathbb{P}_i(X_n = k)\\
  &= \mathbb{P}_{ij}(X_n = k)\\
  &= \mathbb{P}_{ij}(X_n = k, T \leq n) + \mathbb{P}_{ij}(X_n = k, T > n).
\end{align*}
Note that if $T \leq n$, then at time $T$, we have $X_T = Y_T$. Thus the evolution of $X$ and $Y$
after time $T$ is equal. So this is equal to
\begin{align*}
  &= \mathbb{P}_{ij}(Y_n = k, T \leq n) + \mathbb{P}_{ij}(X_n = k, T > n)\\
  &\leq \mathbb{P}_{ij}(Y_n = k) + \mathbb{P}_{ij}(T > n)\\
  &= p_{j,k}(n) + \mathbb{P}_{ij}(T > n).
\end{align*}
Hence we know that
\[
  |p_{i,k}(n) - p_{j,k}(n)| \leq \mathbb{P}_{ij}(T > n).
\]
As $n \to \infty$, we know that $\mathbb{P}_{ij}(T > n) \to 0$ since $Z$ is recurrent. So
\[
  |p_{i,k}(n) - p_{j,k}(n)| \to 0.
\]
With this result, we can prove what we want. First, by the invariance of $\pi$, we
have $\pi = \pi P^n$ for all $n$. So we can write
\[
  \pi_k = \sum_j \pi_j p_{j,k}(n).
\]
Hence we have
\[
  |\pi_k - p_{i,k}(n)| = \Big|\sum_j \pi_j (p_{j,k}(n) - p_{i,k}(n))\Big| \leq \sum_j \pi_j |p_{j,k}(n) - p_{i,k}(n)|.
\]
We know that each individual $|p_{j,k}(n) - p_{i,k}(n)|$ tends to zero. So by bounded
convergence, we know
\[
  \pi_k - p_{i,k}(n) \to 0.
\]
So done.
What happens in the null recurrent case? We would still be able to prove that
$p_{i,k}(n) - p_{j,k}(n) \to 0$, since $T$ is finite by recurrence. However,
we do not have a $\pi$ with which to perform the last step.
Recall that we motivated our definition of $\pi_i$ as the proportion of time we
spend in state $i$. Can we prove that this is indeed the case?
More concretely, we let
\[
  V_i(n) = |\{m \leq n : X_m = i\}|.
\]
We thus want to know what happens to $V_i(n)/n$ as $n \to \infty$. We think this
should tend to $\pi_i$.
Note that technically, this is not a well-formed question, since we don’t exactly
know how convergence of random variables should be defined. Nevertheless, we
can give an informal proof of this result.
The idea is to look at the average time between successive visits. We assume
$X_0 = i$. We let $T_m$ be the time of the $m$th visit to $i$. In particular, $T_0 = 0$. We
define $U_m = T_m - T_{m-1}$. These are all iid by the strong Markov property,
and have mean $\mu_i$ by the definition of $\mu_i$.
Hence, by the law of large numbers,
\[
  \frac{1}{m} T_m = \frac{1}{m} \sum_{r=1}^m U_r \to \mathbb{E}[U_1] = \mu_i. \tag{$*$}
\]
We now want to look at $V_i$. If we stare at it hard enough, we see that
$V_i(n) \geq k$ if and only if $T_k \leq n$. We can write an equivalent statement by letting
$k$ be a real number. We write $\lceil x \rceil$ for the least integer greater than $x$. Then we
have
\[
  V_i(n) \geq x \iff T_{\lceil x \rceil} \leq n.
\]
Putting a funny value of $x$ in, we get
\[
  \frac{V_i(n)}{n} \geq \frac{A}{\mu_i} \iff \frac{1}{n} T_{\lceil An/\mu_i \rceil} \leq 1.
\]
However, using $(*)$, we know that
\[
  \frac{T_{\lceil An/\mu_i \rceil}}{\lceil An/\mu_i \rceil} \to \mu_i.
\]
Multiply both sides by $A/\mu_i$ to get
\[
  \frac{A}{\mu_i} \cdot \frac{T_{\lceil An/\mu_i \rceil}}{\lceil An/\mu_i \rceil} = \frac{T_{\lceil An/\mu_i \rceil}}{n} \to \frac{A}{\mu_i}\, \mu_i = A.
\]
So if $A < 1$, then the event $\frac{1}{n} T_{\lceil An/\mu_i \rceil} \leq 1$ occurs with probability close to $1$
for large $n$. Otherwise, it happens with probability close to $0$. So in some sense,
\[
  \frac{V_i(n)}{n} \to \frac{1}{\mu_i} = \pi_i.
\]
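This ergodic statement is easy to see in simulation. The sketch below uses a made-up 3-state chain (illustrative only; the trajectory length is also an arbitrary choice) and compares the empirical fractions $V_i(n)/n$ against $\pi_i$ computed by linear algebra.

```python
import numpy as np

rng = np.random.default_rng(1)

# A made-up 3-state chain (illustrative only).
P = np.array([[0.5, 0.3, 0.2],
              [0.2, 0.6, 0.2],
              [0.3, 0.3, 0.4]])

# Invariant distribution, for comparison.
w, v = np.linalg.eig(P.T)
pi = np.real(v[:, np.argmin(np.abs(w - 1))])
pi = pi / pi.sum()

# Simulate one long trajectory and record V_i(n)/n for each state i.
n = 50_000
visits = np.zeros(3)
x = 0
for _ in range(n):
    visits[x] += 1
    x = rng.choice(3, p=P[x])

print(visits / n)  # each entry is close to the corresponding pi_i
```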