IB Markov Chains - Classification of chains and states

2 Classification of chains and states

2.3 Hitting probabilities

Recurrence and transience tell us whether we are going to return to the original state with (almost) certainty. Often, we would like to know something more quantitative. What is the actual probability of returning to the state $i$? If we return, what is the expected duration of returning?

We can formulate this in a more general setting. Let $S$ be our state space, and let $A \subseteq S$. We want to know how likely it is that we reach $A$, and how long it takes. For example, if we are in a casino, we want to win, say, a million, and don't want to go bankrupt. So we would like to know the probability of reaching $A = \{1 \text{ million}\}$ and $A = \{0\}$.

Definition (Hitting time). The hitting time of $A \subseteq S$ is the random variable $H^A = \min\{n \geq 0 : X_n \in A\}$. In particular, if we start in $A$, then $H^A = 0$. We also have

\[
  h_i^A = \mathbb{P}_i(H^A < \infty) = \mathbb{P}_i(\text{ever reach } A).
\]
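To make the definition concrete, here is a minimal sketch that reads the hitting time off an observed trajectory. The function name `hitting_time` and the sample path are illustrative assumptions, not part of the notes. For a finite observed path that never enters $A$, the sketch returns infinity by convention, although strictly we only learn that $H^A$ exceeds the observed length.

```python
import math

def hitting_time(path, A):
    """Return min{n >= 0 : path[n] in A}, or math.inf if the path never enters A."""
    for n, state in enumerate(path):
        if state in A:
            return n
    return math.inf

# Hypothetical trajectory X_0, X_1, ..., X_7 of some chain on {0, 1, 2, ...}.
path = [3, 2, 3, 4, 3, 2, 1, 0]
print(hitting_time(path, {0}))  # 7
print(hitting_time(path, {3}))  # 0, since we start in A
print(hitting_time(path, {9}))  # inf, A is never reached on this path
```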

To determine hitting probabilities, we mostly rely on the following result:

Theorem. The vector $(h_i^A : i \in S)$ satisfies

\[
  h_i^A =
  \begin{cases}
    1 & i \in A \\
    \sum_{j \in S} p_{i,j} h_j^A & i \notin A
  \end{cases}
\]

and is minimal in that for any non-negative solution $(x_i : i \in S)$ to these equations, we have $h_i^A \leq x_i$ for all $i$.

It is easy to show that $h_i^A$ satisfies the formula given, but it takes some more work to show that $h_i^A$ is minimal. Recall, however, that we have proved a similar result for random walks in IA Probability, and the proof is more-or-less the same.

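Proof. By definition, $h_i^A = 1$ if $i \in A$. Otherwise, we have

\[
  h_i^A = \mathbb{P}_i(H^A < \infty) = \sum_{j \in S} \mathbb{P}_i(H^A < \infty \mid X_1 = j)\, p_{i,j} = \sum_{j \in S} h_j^A\, p_{i,j}.
\]

So $h_i^A$ is indeed a solution to the equations.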

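To show that $h_i^A$ is the minimal solution, suppose $x = (x_i : i \in S)$ is a non-negative solution, i.e.

\[
  x_i =
  \begin{cases}
    1 & i \in A \\
    \sum_{j \in S} p_{i,j} x_j & i \notin A
  \end{cases}
\]

If $i \in A$, we have $h_i^A = x_i = 1$. Otherwise, we can write

\begin{align*}
  x_i &= \sum_{j} p_{i,j} x_j \\
      &= \sum_{j \in A} p_{i,j} x_j + \sum_{j \notin A} p_{i,j} x_j \\
      &= \sum_{j \in A} p_{i,j} + \sum_{j \notin A} p_{i,j} x_j \\
      &\geq \sum_{j \in A} p_{i,j} \\
      &= \mathbb{P}_i(H^A = 1).
\end{align*}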

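By iterating this process, we can write

\begin{align*}
  x_i &= \sum_{j \in A} p_{i,j} + \sum_{j \notin A} p_{i,j} \left( \sum_{k} p_{j,k} x_k \right) \\
      &= \sum_{j \in A} p_{i,j} + \sum_{j \notin A} p_{i,j} \left( \sum_{k \in A} p_{j,k} x_k + \sum_{k \notin A} p_{j,k} x_k \right) \\
      &\geq \mathbb{P}_i(H^A = 1) + \sum_{j \notin A,\, k \in A} p_{i,j}\, p_{j,k} \\
      &= \mathbb{P}_i(H^A = 1) + \mathbb{P}_i(H^A = 2) \\
      &= \mathbb{P}_i(H^A \leq 2).
\end{align*}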

By induction, we obtain $x_i \geq \mathbb{P}_i(H^A \leq n)$ for all $n$. Taking the limit as $n \to \infty$, we get

\[
  x_i \geq \mathbb{P}_i(H^A < \infty) = h_i^A.
\]

So $h_i^A$ is minimal.
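The proof suggests a way to compute the minimal solution numerically on a finite chain: start from the indicator of $A$ and apply the equations repeatedly, so that after $n$ rounds the $i$-th entry is $\mathbb{P}_i(H^A \leq n)$. The following is a minimal sketch under that assumption. The function name `hitting_probabilities`, the dict-of-dicts encoding of the transition matrix, and the four-state example chain are illustrative choices, not taken from the notes.

```python
def hitting_probabilities(P, A, n_steps=200):
    """Approximate h_i^A by iterating the hitting equations from the indicator of A.

    P is a dict of dicts with P[i][j] = p_{i,j}. After n iterations,
    x[i] equals P_i(H^A <= n), which increases to the minimal solution.
    """
    x = {i: (1.0 if i in A else 0.0) for i in P}  # P_i(H^A <= 0)
    for _ in range(n_steps):
        x = {i: (1.0 if i in A else sum(p * x[j] for j, p in P[i].items()))
             for i in P}
    return x

# Hypothetical chain on {0, 1, 2, 3}: states 0 and 3 are absorbing,
# and from 1 or 2 we step left or right with probability 1/2 each.
P = {
    0: {0: 1.0},
    1: {0: 0.5, 2: 0.5},
    2: {1: 0.5, 3: 0.5},
    3: {3: 1.0},
}
h = hitting_probabilities(P, A={0})
print(h)  # approximately {0: 1, 1: 2/3, 2: 1/3, 3: 0}
```

In this example the equations alone do not determine $h_3$, since the equation at state 3 reads $h_3 = h_3$. Taking $h_3 = 1$ gives the solution $h \equiv 1$, while the iteration picks out the minimal solution with $h_3 = 0$.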

The next question we want to ask is how long it will take for us to hit $A$. We want to find $\mathbb{E}_i(H^A) = k_i^A$. Note that we have to be careful: if there is a chance that we never hit $A$, then $H^A$ could be infinite, and $\mathbb{E}_i(H^A) = \infty$. This occurs if $h_i^A < 1$. So often we are only interested in the case where $h_i^A = 1$ (note that $h_i^A = 1$ does not imply that $k_i^A < \infty$. It is merely a necessary condition).

We get a similar result characterizing the expected hitting time.

Theorem. $(k_i^A : i \in S)$ is the minimal non-negative solution to

\[
  k_i^A =
  \begin{cases}
    0 & i \in A \\
    1 + \sum_{j} p_{i,j} k_j^A & i \notin A.
  \end{cases}
\]

Note that we have this “1+” since when we move from $i$ to $j$, one step has already passed.

The proof is almost the same as the proof we had above.

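Proof. The proof that $(k_i^A)$ satisfies the equations is the same as before.

Now let $(y_i : i \in S)$ be a non-negative solution. We show that $y_i \geq k_i^A$.

If $i \in A$, we get $y_i = k_i^A = 0$. Otherwise, suppose $i \notin A$. Then we have

\begin{align*}
  y_i &= 1 + \sum_{j} p_{i,j} y_j \\
      &= 1 + \sum_{j \in A} p_{i,j} y_j + \sum_{j \notin A} p_{i,j} y_j \\
      &= 1 + \sum_{j \notin A} p_{i,j} y_j \\
      &= 1 + \sum_{j \notin A} p_{i,j} \left( 1 + \sum_{k \notin A} p_{j,k} y_k \right) \\
      &\geq 1 + \sum_{j \notin A} p_{i,j} \\
      &= \mathbb{P}_i(H^A \geq 1) + \mathbb{P}_i(H^A \geq 2).
\end{align*}

By induction, we know that

\[
  y_i \geq \mathbb{P}_i(H^A \geq 1) + \cdots + \mathbb{P}_i(H^A \geq n)
\]

for all $n$. Let $n \to \infty$. Then we get

\[
  y_i \geq \sum_{m \geq 1} \mathbb{P}_i(H^A \geq m) = \sum_{m \geq 1} m\, \mathbb{P}_i(H^A = m) = k_i^A.
\]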
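The same iteration idea gives a numerical check of the theorem on a finite chain. Starting from $y = 0$ and applying the equations $n$ times produces $\mathbb{P}_i(H^A \geq 1) + \cdots + \mathbb{P}_i(H^A \geq n)$, which increases to $k_i^A$. This is a minimal sketch. The function name `expected_hitting_times` and the example (a symmetric walk on $\{0, \ldots, 4\}$ absorbed at both ends) are assumptions made for illustration. For that example the equations can be solved by hand to give $k_i = i(4 - i)$.

```python
def expected_hitting_times(P, A, n_steps=500):
    """Approximate k_i^A by iterating y <- (0 on A, 1 + P y off A) from y = 0.

    P is a dict of dicts with P[i][j] = p_{i,j}. After n iterations,
    y[i] equals P_i(H^A >= 1) + ... + P_i(H^A >= n).
    """
    y = {i: 0.0 for i in P}
    for _ in range(n_steps):
        y = {i: (0.0 if i in A else 1.0 + sum(p * y[j] for j, p in P[i].items()))
             for i in P}
    return y

# Hypothetical example: symmetric random walk on {0, ..., 4}, stopped at 0 and 4.
N = 4
P = {i: {i - 1: 0.5, i + 1: 0.5} for i in range(1, N)}
P[0] = {0: 1.0}
P[N] = {N: 1.0}

k = expected_hitting_times(P, A={0, N})
print([round(k[i], 6) for i in range(N + 1)])  # [0.0, 3.0, 4.0, 3.0, 0.0]
```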

Example (Gambler’s ruin). This time, we will consider a random walk on $\mathbb{N}$. In each step, we either move to the right with probability $p$, or to the left with probability $q = 1 - p$. What is the probability of ever hitting 0 from a given initial point? In other words, we want to find $h_i = h_i^{\{0\}}$.

We know $h_i$ is the minimal solution to

\[
  h_i =
  \begin{cases}
    1 & i = 0 \\
    q h_{i-1} + p h_{i+1} & i \neq 0.
  \end{cases}
\]

What are the solutions to these equations? We can view this as a difference equation

\[
  p h_{i+1} - h_i + q h_{i-1} = 0, \quad i \geq 1,
\]

with the boundary condition that $h_0 = 1$. We all know how to solve difference equations, so let’s just jump to the solution.
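For readers who want the skipped step, the following is a sketch of the standard method for linear difference equations with constant coefficients. The trial solution $h_i = \lambda^i$ with $\lambda \neq 0$ is the usual ansatz and is not spelled out in the notes.

```latex
\begin{align*}
  h_i = \lambda^i
    &\implies p\lambda^2 - \lambda + q = 0 \\
    &\iff (\lambda - 1)(p\lambda - q) = 0 \qquad (\text{using } p + q = 1) \\
    &\iff \lambda = 1 \ \text{ or } \ \lambda = \frac{q}{p}.
\end{align*}
% Distinct roots (p \neq q): the general solution is h_i = A + B (q/p)^i.
% Repeated root (p = q = 1/2): the general solution is h_i = A + B i.
```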

If p = q, i.e. p =

, then the solution has the form

= A + B





for

i ≥

0. If

p < q

, then for large





is very large and blows up. However,

since

is a probability, it can never blow up. So we must have

= 0. So

constant. Since h

= 1, we have h

= 1 for all i. So we always get to 0.

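If $p > q$, since $h_0 = 1$, we have $A + B = 1$. So

\[
  h_i = \left( \frac{q}{p} \right)^i + A \left( 1 - \left( \frac{q}{p} \right)^i \right).
\]

This is in fact a solution for all $A$. So we want to find the smallest solution.

As $i \to \infty$, we get $h_i \to A$. Since $h_i \geq 0$, we know that $A \geq 0$. Subject to this constraint, the minimum is attained when $A = 0$ (since $(q/p)^i$ and $(1 - (q/p)^i)$ are both positive). So we have

\[
  h_i = \left( \frac{q}{p} \right)^i.
\]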

There is another way to solve this. We can give ourselves a ceiling, i.e. we also stop when we hit $k > 0$ and ask for the probability of reaching 0 before $k$, so that $h_k = 0$. We now have two boundary conditions and can find a unique solution. Then we take the limit as $k \to \infty$. This is the approach taken in IA Probability.

Here if $p = q$, then by the same arguments, we get $h_i = 1$ for all $i$.
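The ceiling approach can be checked numerically. The sketch below assumes $p \neq q$ and the boundary conditions $h_0 = 1$ and $h_k = 0$ (the walk is stopped at $k$ without having reached 0). Fitting $A + B(q/p)^i$ to these conditions gives the closed form used in the code. The function name `ruin_before_ceiling` is a hypothetical helper, and exact rational arithmetic is used only to keep the small case free of rounding.

```python
from fractions import Fraction

def ruin_before_ceiling(i, k, p):
    """P_i(hit 0 before k) for p != 1/2, obtained from h_0 = 1 and h_k = 0."""
    r = (1 - p) / p                      # r = q / p
    return (r**i - r**k) / (1 - r**k)

p = Fraction(2, 3)                       # so q / p = 1/2
for k in (4, 8, 16, 32):
    print(k, float(ruin_before_ceiling(2, k, p)))
# The values increase towards (q/p)^2 = 0.25 as the ceiling k grows.
```

With $k = 4$ and $i = 2$ the formula gives exactly $1/5$. Raising the ceiling gives the walk more room to come back, so the probability increases to the limit $(q/p)^i$.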

Example (Birth-death chain). Let $(p_i : i \geq 1)$ be an arbitrary sequence such that $p_i \in (0, 1)$. We let $q_i = 1 - p_i$. We let $\mathbb{N}$ be our state space and define the transition probabilities to be

\[
  p_{i,i+1} = p_i, \quad p_{i,i-1} = q_i.
\]

This is a more general case of the random walk: in the random walk we have a constant $p_i$ sequence.

This is a general model for population growth, where the change in population depends on what the current population is. Here each “step” does not correspond to some unit time, since births and deaths occur rather randomly. Instead, we just make a “step” whenever some birth or death occurs, regardless of when it occurs.

Here, if we have no people left, then it is impossible for us to reproduce and get more population. So we have

\[
  p_{0,0} = 1.
\]

We say 0 is absorbing in that $\{0\}$ is closed. We let $h_i = h_i^{\{0\}}$. We know that

\[
  h_0 = 1, \quad p_i h_{i+1} - h_i + q_i h_{i-1} = 0, \quad i \geq 1.
\]

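This is no longer a difference equation with constant coefficients, since the coefficients depend on the index $i$. To solve this, we need magic. We rewrite this as

\begin{align*}
  p_i h_{i+1} - h_i + q_i h_{i-1}
    &= p_i h_{i+1} - (p_i + q_i) h_i + q_i h_{i-1} \\
    &= p_i (h_{i+1} - h_i) - q_i (h_i - h_{i-1}).
\end{align*}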

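We let $u_i = h_{i-1} - h_i$ (picking $h_i - h_{i-1}$ might seem more natural, but this definition makes $u_i$ positive). Then our equation becomes

\[
  u_{i+1} = \frac{q_i}{p_i} u_i.
\]

We can iterate this to become

\[
  u_{i+1} = \left( \frac{q_i}{p_i} \right) \left( \frac{q_{i-1}}{p_{i-1}} \right) \cdots \left( \frac{q_1}{p_1} \right) u_1.
\]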

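We let

\[
  \gamma_i = \frac{q_1 q_2 \cdots q_i}{p_1 p_2 \cdots p_i}.
\]

Then we get $u_{i+1} = \gamma_i u_1$. For convenience, we let $\gamma_0 = 1$. Now we want to retrieve our $h_i$. We can do this by summing the equation $u_i = h_{i-1} - h_i$. So we get

\[
  h_0 - h_i = u_1 + u_2 + \cdots + u_i.
\]

Using the fact that $h_0 = 1$, we get

\[
  h_i = 1 - u_1 (\gamma_0 + \gamma_1 + \cdots + \gamma_{i-1}).
\]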
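As a sanity check on the derivation, the sketch below builds $h_i = 1 - u_1(\gamma_0 + \cdots + \gamma_{i-1})$ in exact rational arithmetic and confirms that it satisfies $p_i h_{i+1} - h_i + q_i h_{i-1} = 0$ for an arbitrary value of $u_1$. The function name `h_family`, the particular sequence $p_i = (i+1)/(2i+1)$ and the choice $u_1 = 1/3$ are hypothetical and chosen only for illustration.

```python
from fractions import Fraction

def h_family(u1, p, n):
    """Return [h_0, ..., h_n] with h_i = 1 - u1 * (gamma_0 + ... + gamma_{i-1})."""
    h = [Fraction(1)]                 # h_0 = 1
    gamma = Fraction(1)               # gamma_0 = 1
    partial = Fraction(0)             # running sum gamma_0 + ... + gamma_{i-1}
    for i in range(1, n + 1):
        partial += gamma
        h.append(1 - u1 * partial)
        gamma *= (1 - p(i)) / p(i)    # gamma_i = gamma_{i-1} * q_i / p_i
    return h

def p(i):
    """Hypothetical birth probabilities p_i in (0, 1), for i >= 1."""
    return Fraction(i + 1, 2 * i + 1)

h = h_family(Fraction(1, 3), p, 10)
residuals = [p(i) * h[i + 1] - h[i] + (1 - p(i)) * h[i - 1] for i in range(1, 10)]
print(all(r == 0 for r in residuals))  # True
```

The check passes for any $u_1$, which is why a further condition (minimality) is needed to pin down its value.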

Here we have a parameter $u_1$, and we need to find out what this is. Our theorem tells us the value of $u_1$ minimizes $h_i$. This all depends on the value of

\[
  S = \sum_{i=0}^{\infty} \gamma_i.
\]

By the law of excluded middle, $S$ either diverges or converges. If $S = \infty$, then we must have $u_1 = 0$. Otherwise, $h_i \to -\infty$ as $i \to \infty$, but we know that $0 \leq h_i \leq 1$. In this case $h_i = 1$ for all $i$.

If $S$ is finite, then $u_1$ can be non-zero. We know that the $\gamma_i$ are all positive. So to minimize $h_i$, we need to maximize $u_1$. We cannot make $u_1$ arbitrarily large, since this will make $h_i$ negative. To find the maximum possible value of $u_1$, we take the limit as $i \to \infty$. Then we know that the maximum value of $u_1$ satisfies

\[
  0 = 1 - u_1 S.
\]

In other words, $u_1 = 1/S$. So we have

\[
  h_i = \frac{\sum_{k=i}^{\infty} \gamma_k}{\sum_{k=0}^{\infty} \gamma_k}.
\]
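The final formula can be evaluated approximately by truncating both sums. This is a minimal sketch that assumes $S$ is finite and that the tail beyond `n_terms` is negligible. If $S = \infty$ the truncated ratio is not meaningful, and the result above gives $h_i = 1$ instead. The function name `extinction_probability` and the test sequence are hypothetical. With constant $p_i = 2/3$ we have $\gamma_k = (1/2)^k$, so the answer should agree with the gambler's ruin value $(q/p)^i$.

```python
def extinction_probability(i, p, n_terms=500):
    """Approximate h_i = sum_{k>=i} gamma_k / sum_{k>=0} gamma_k by truncation.

    p is a function with p(j) = p_j in (0, 1) for j >= 1. Assumes the series
    S = sum_k gamma_k converges and that n_terms terms capture it well.
    """
    gammas = [1.0]                                     # gamma_0 = 1
    for j in range(1, n_terms):
        gammas.append(gammas[-1] * (1 - p(j)) / p(j))  # gamma_j = gamma_{j-1} q_j / p_j
    return sum(gammas[i:]) / sum(gammas)

# Constant p_i = 2/3 reduces to the random walk with q/p = 1/2.
for i in range(4):
    print(i, extinction_probability(i, lambda j: 2 / 3))
# The printed values are close to 1, 0.5, 0.25, 0.125.
```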