IA Probability - Central limit theorem

7 Central limit theorem

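Suppose $X_1, \cdots, X_n$ are iid random variables with mean $\mu$ and variance $\sigma^2$. Let $S_n = X_1 + \cdots + X_n$. Then we have previously shown that
\[
  \operatorname{var}(S_n/\sqrt{n}) = \operatorname{var}\left(\frac{S_n - n\mu}{\sqrt{n}}\right) = \sigma^2.
\]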

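Theorem (Central limit theorem). Let $X_1, X_2, \cdots$ be iid random variables with $E[X_i] = \mu$, $\operatorname{var}(X_i) = \sigma^2 < \infty$. Define
\[
  S_n = X_1 + \cdots + X_n.
\]
Then for all finite intervals $(a, b)$,
\[
  \lim_{n \to \infty} P\left(a \leq \frac{S_n - n\mu}{\sigma\sqrt{n}} \leq b\right) = \int_a^b \frac{1}{\sqrt{2\pi}} e^{-\frac{1}{2}t^2} \, dt.
\]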

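Note that the integrand on the right is the pdf of a standard normal. We say that the normalised sum converges in distribution to a standard normal, and write
\[
  \frac{S_n - n\mu}{\sigma\sqrt{n}} \to_D N(0, 1).
\]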

To show this, we will use the continuity theorem without proof:

Theorem (Continuity theorem). If the random variables $X_1, X_2, \cdots$ have mgf’s $m_1(\theta), m_2(\theta), \cdots$ and $m_n(\theta) \to m(\theta)$ as $n \to \infty$ for all $\theta$, then $X_n \to_D$ the random variable with mgf $m(\theta)$.

We now provide a sketch-proof of the central limit theorem:

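Proof. wlog, assume $\mu = 0$, $\sigma^2 = 1$ (otherwise replace $X_i$ with $\frac{X_i - \mu}{\sigma}$). Then
\begin{align*}
  m_{X_i}(\theta) = E[e^{\theta X_i}] &= 1 + \theta E[X_i] + \frac{\theta^2}{2!} E[X_i^2] + \cdots \\
  &= 1 + \frac{1}{2}\theta^2 + \frac{1}{3!}\theta^3 E[X_i^3] + \cdots
\end{align*}
Now consider $S_n/\sqrt{n}$. Then
\begin{align*}
  E[e^{\theta S_n/\sqrt{n}}] &= E[e^{\theta(X_1 + \cdots + X_n)/\sqrt{n}}] \\
  &= E[e^{\theta X_1/\sqrt{n}}] \cdots E[e^{\theta X_n/\sqrt{n}}] \\
  &= \left(E[e^{\theta X_1/\sqrt{n}}]\right)^n \\
  &= \left(1 + \frac{1}{2}\theta^2 \frac{1}{n} + \frac{1}{3!}\theta^3 E[X_1^3] \frac{1}{n^{3/2}} + \cdots\right)^n \\
  &\to e^{\frac{1}{2}\theta^2}
\end{align*}
as $n \to \infty$ since $(1 + a/n)^n \to e^a$. And this is the mgf of the standard normal.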

So the result follows from the continuity theorem.

Note that this is not a very formal proof, since we have to require $E[X_i^3]$ to be finite. Also, sometimes the moment generating function is not defined. But this will work for most of the “nice” distributions we will meet.
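The limit in the sketch-proof can be checked numerically for one concrete distribution. The following is a minimal sketch, assuming each $X_i$ takes the values $+1$ and $-1$ with probability $1/2$ (so $\mu = 0$, $\sigma^2 = 1$ and the mgf is $\cosh\theta$); this choice of distribution and the function names are illustrative assumptions, not part of the notes.

```python
import math


def mgf_scaled_sum(theta: float, n: int) -> float:
    """mgf of S_n / sqrt(n) when each X_i is +1 or -1 with probability 1/2.

    Each X_i has mean 0, variance 1 and mgf cosh(theta), so by independence
    the mgf of S_n / sqrt(n) is cosh(theta / sqrt(n)) ** n.
    """
    return math.cosh(theta / math.sqrt(n)) ** n


def mgf_standard_normal(theta: float) -> float:
    """mgf of N(0, 1), namely exp(theta^2 / 2)."""
    return math.exp(0.5 * theta * theta)


if __name__ == "__main__":
    theta = 1.0
    # The first column should approach the second as n grows.
    for n in (1, 10, 100, 10_000):
        print(n, mgf_scaled_sum(theta, n), mgf_standard_normal(theta))
```

For this distribution every moment is finite, so the caveat about $E[X_i^3]$ does not arise.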

The proper proof uses the characteristic function
\[
  \chi_X(\theta) = E[e^{i\theta X}].
\]

An important application is to use the normal distribution to approximate a

large binomial.

Let $X_i \sim B(1, p)$. Then $S_n \sim B(n, p)$. So $E[S_n] = np$ and $\operatorname{var}(S_n) = np(1 - p)$. So
\[
  \frac{S_n - np}{\sqrt{np(1 - p)}} \to_D N(0, 1).
\]
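To see how good the normal approximation to a binomial is for a given $n$, one can compare the exact probability that the standardised sum lies in $[a, b]$ with $\Phi(b) - \Phi(a)$. The sketch below does this with the standard library only; the function names are illustrative, and it assumes $0 < p < 1$.

```python
import math


def normal_cdf(x: float) -> float:
    """Standard normal cdf Phi(x), written via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def binomial_pmf(n: int, k: int, p: float) -> float:
    """P(S_n = k) for S_n ~ B(n, p) with 0 < p < 1, computed in log space."""
    log_pmf = (
        math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
        + k * math.log(p) + (n - k) * math.log(1.0 - p)
    )
    return math.exp(log_pmf)


def standardised_interval_prob(n: int, p: float, a: float, b: float) -> float:
    """Exact P(a <= (S_n - np) / sqrt(np(1-p)) <= b) for S_n ~ B(n, p)."""
    mean = n * p
    sd = math.sqrt(n * p * (1.0 - p))
    return sum(
        binomial_pmf(n, k, p)
        for k in range(n + 1)
        if a <= (k - mean) / sd <= b
    )


def normal_interval_prob(a: float, b: float) -> float:
    """Limiting value Phi(b) - Phi(a) from the central limit theorem."""
    return normal_cdf(b) - normal_cdf(a)


if __name__ == "__main__":
    for n in (10, 100, 1000):
        exact = standardised_interval_prob(n, 0.5, -1.0, 1.0)
        print(n, exact, normal_interval_prob(-1.0, 1.0))
```

The two values need not agree closely for small $n$, since the binomial is discrete while the normal is continuous.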

Example. Suppose two planes fly a route. Each of $n$ passengers chooses a plane at random. The number of people choosing plane 1 is $S \sim B(n, \frac{1}{2})$. Suppose each has $s$ seats. What is
\[
  F(s) = P(S > s),
\]
i.e. the probability that plane 1 is over-booked? We have
\[
  F(s) = P(S > s) = P\left(\frac{S - n/2}{\sqrt{n \cdot \frac{1}{2} \cdot \frac{1}{2}}} > \frac{s - n/2}{\sqrt{n}/2}\right).
\]
Since
\[
  \frac{S - n/2}{\sqrt{n}/2} \sim N(0, 1)
\]
approximately, we have
\[
  F(s) \approx 1 - \Phi\left(\frac{s - n/2}{\sqrt{n}/2}\right).
\]

For example, if $n = 1000$ and $s = 537$, then $\frac{s - n/2}{\sqrt{n}/2} \approx 2.34$, $\Phi(2.34) \approx 0.99$, and $F(s) \approx 0.01$. So with only 74 seats as buffer between the two planes, the probability of overbooking is just 1/100.
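The arithmetic in this example is easy to reproduce. Here is a minimal sketch using only the standard library; the function names are illustrative, and $\Phi$ is computed from the error function rather than read from tables.

```python
import math


def normal_cdf(x: float) -> float:
    """Standard normal cdf Phi(x), written via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def standardised_seats(n: int, s: int) -> float:
    """The threshold (s - n/2) / (sqrt(n)/2) for S ~ B(n, 1/2)."""
    return (s - n / 2) / (math.sqrt(n) / 2)


def overbooking_probability(n: int, s: int) -> float:
    """Normal approximation 1 - Phi((s - n/2) / (sqrt(n)/2)) to P(S > s)."""
    return 1.0 - normal_cdf(standardised_seats(n, s))


if __name__ == "__main__":
    n, s = 1000, 537
    print(standardised_seats(n, s))       # threshold in standard deviations
    print(overbooking_probability(n, s))  # approximate P(plane 1 over-booked)
```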

Example. An unknown proportion $p$ of the electorate will vote Labour. It is desired to find $p$ with an error not exceeding $0.005$. How large should the sample be?

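We estimate $p$ by
\[
  p' = \frac{S_n}{n},
\]
where $X_i \sim B(1, p)$. Then
\begin{align*}
  P(|p' - p| \leq 0.005) &= P(|S_n - np| \leq 0.005n) \\
  &= P\left(\underbrace{\frac{|S_n - np|}{\sqrt{np(1 - p)}}}_{\approx N(0, 1)} \leq \frac{0.005n}{\sqrt{np(1 - p)}}\right).
\end{align*}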

We want $|p' - p| \leq 0.005$ with probability $\geq 0.95$. Then we want
\[
  \frac{0.005n}{\sqrt{np(1 - p)}} \geq \Phi^{-1}(0.975) = 1.96.
\]

(We use 0.975 instead of 0.95 since we are doing a two-tailed test.) Rearranging gives $n \geq 1.96^2 \, p(1 - p)/0.005^2$. Since the maximum possible value of $p(1 - p)$ is $1/4$, it suffices to have
\[
  n \geq 38416.
\]

In practice, we don’t have that many samples. Instead, we go by
\[
  P(|p' - p| \leq 0.03) \geq 0.95.
\]
This just requires $n \geq 1068$.
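Both sample sizes come from the same worst-case bound $n \geq z^2/(4\varepsilon^2)$, obtained by setting $p(1 - p) = 1/4$. The sketch below evaluates it with exact rational arithmetic so that the rounding up is not affected by floating-point error; the function name and the choice to pass decimals as strings are illustrative assumptions, and $z = 1.96$ is the rounded value of $\Phi^{-1}(0.975)$ used in the text.

```python
import math
from fractions import Fraction


def required_sample_size(margin: str, z: str = "1.96") -> int:
    """Smallest integer n with n >= z^2 / (4 * margin^2).

    This is the worst case p(1 - p) = 1/4 of the condition
    margin * n / sqrt(n p (1 - p)) >= z.
    Arguments are decimal strings so that the arithmetic is exact.
    """
    z_exact = Fraction(z)
    margin_exact = Fraction(margin)
    return math.ceil(z_exact**2 / (4 * margin_exact**2))


if __name__ == "__main__":
    print(required_sample_size("0.005"))  # error at most 0.005
    print(required_sample_size("0.03"))   # error at most 0.03
```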

Example (Estimating $\pi$ with Buffon’s needle). Recall that if we randomly toss a needle of length $\ell$ to a floor marked with parallel lines a distance $L$ apart, the probability that the needle hits the line is $p = \frac{2\ell}{\pi L}$ (for $\ell \leq L$).

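Suppose we toss the pin $n$ times, and it hits the line $N$ times. Then
\[
  N \approx N(np, np(1 - p))
\]
by the Central limit theorem. Write $p' = N/n$ for the actual proportion observed. Then
\begin{align*}
  \hat{\pi} &= \frac{2\ell}{(N/n)L} \\
  &= \frac{\pi \cdot 2\ell/(\pi L)}{p'} \\
  &= \frac{\pi p}{p + (p' - p)} \\
  &= \pi\left(1 - \frac{p' - p}{p} + \cdots\right).
\end{align*}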

Hence
\[
  \hat{\pi} - \pi \approx \pi \, \frac{p - p'}{p}.
\]

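We know
\[
  p' \sim N\left(p, \frac{p(1 - p)}{n}\right).
\]
So we can find
\[
  \hat{\pi} - \pi \sim N\left(0, \frac{\pi^2 p(1 - p)}{np^2}\right) = N\left(0, \frac{\pi^2(1 - p)}{np}\right).
\]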

We want a small variance, and that occurs when $p$ is the largest. Since $p = 2\ell/\pi L$, this is maximized with $\ell = L$. In this case,
\[
  p = \frac{2}{\pi},
\]
and
\[
  \hat{\pi} - \pi \approx N\left(0, \frac{(\pi - 2)\pi^2}{2n}\right).
\]

If we want to estimate $\pi$ to 3 decimal places, then we need
\[
  P(|\hat{\pi} - \pi| \leq 0.001) \geq 0.95.
\]
This is true if and only if
\[
  0.001 \sqrt{\frac{2n}{(\pi - 2)(\pi^2)}} \geq \Phi^{-1}(0.975) = 1.96.
\]
So $n \geq 2.16 \times 10^7$. So we can obtain $\pi$ to 3 decimal places just by throwing a stick 20 million times! Isn’t that exciting?
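The bound on $n$ and the estimator $\hat{\pi} = 2\ell n/(NL)$ can both be tried out in a few lines. This is a rough sketch under stated assumptions: it takes $\ell = L = 1$, uses the rounded value $1.96$ for $\Phi^{-1}(0.975)$, and models a toss by drawing the distance from the needle's centre to the nearest line and the needle's angle uniformly. The function names are illustrative, and the simulation itself uses `math.pi` to sample the angle, so it only illustrates the statistics and is not a way of computing $\pi$ from scratch.

```python
import math
import random

Z_975 = 1.96  # Phi^{-1}(0.975), rounded as in the text


def required_tosses(tolerance: float) -> float:
    """Lower bound on n from tolerance * sqrt(2n / ((pi - 2) pi^2)) >= 1.96."""
    n_times_variance = (math.pi - 2) * math.pi**2 / 2
    return (Z_975 / tolerance) ** 2 * n_times_variance


def estimate_pi(n: int, rng: random.Random) -> float:
    """Simulate n tosses with l = L = 1 and return the estimate 2n / N.

    Raises ZeroDivisionError if no toss hits a line (only plausible for tiny n).
    """
    hits = 0
    for _ in range(n):
        centre = rng.uniform(0.0, 0.5)          # distance from centre to nearest line
        angle = rng.uniform(0.0, math.pi / 2)   # acute angle between needle and lines
        if centre <= 0.5 * math.sin(angle):     # needle crosses the line
            hits += 1
    return 2 * n / hits


if __name__ == "__main__":
    print(required_tosses(0.001))                    # tosses needed for 3 decimal places
    print(estimate_pi(200_000, random.Random(0)))    # one simulated estimate
```

With a few hundred thousand tosses the standard deviation of $\hat{\pi}$ predicted by the formula above is still of order $10^{-3}$ to $10^{-2}$, which is consistent with needing tens of millions of tosses for three decimal places.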