IA Probability - Continuous random variables

5Continuous random variables

IA Probability

5.5 The normal distribution

Definition (Normal distribution). The normal distribution with parameters

µ, σ

, written N(µ, σ

) has pdf

f(x) =

√

2πσ

exp



−

(x − µ)

2σ



for −∞ < x < ∞.

It looks approximately like this:

The standard normal is when µ = 0, σ

= 1, i.e. X ∼ N(0, 1).

We usually write

(

) for the pdf and Φ(

) for the cdf of the standard normal.

This is a rather important probability distribution. This is partly due to the

central limit theorem, which says that if we have a large number of iid random

variables, then the distribution of their averages are approximately normal. Many

distributions in physics and other sciences are also approximately or exactly

normal.

We first have to show that this makes sense, i.e.

Proposition.

∞

−∞

√

2πσ

−

2σ

(x−µ)

dx = 1.

Proof. Substitute z =

(x−µ)

. Then

I =

∞

−∞

√

2π

−

dz.

Then

∞

−∞

√

2π

−x

∞

√

2π

−y

∞

2π

−r

r dr dθ

= 1.

We also have

Proposition. E[X] = µ.

Proof.

E[X] =

√

2πσ

∞

−∞

−(x−µ)

/2σ

√

2πσ

∞

−∞

(x − µ)e

−(x−µ)

/2σ

dx +

√

2πσ

∞

−∞

µe

−(x−µ)

/2σ

dx.

The first term is antisymmetric about

and gives 0. The second is just

times

the integral we did above. So we get µ.

Also, by symmetry, the mode and median of a normal distribution are also

both µ.

Proposition. var(X) = σ

Proof.

We have

var

(

) =

[

]

−

(

[

])

. Substitute

X−µ

. Then

[

] = 0,

E[Z

] =

E[X

Then

var(Z) =

√

2π

∞

−∞

−z



−

√

2π

−z



∞

−∞

√

2π

∞

−∞

−z

= 0 + 1

= 1

So var X = σ

Example. UK adult male heights are normally distributed with mean 70” and

standard deviation 3”. In the Netherlands, these figures are 71” and 3”.

What is

(

Y > X

), where

and

are the heights of randomly chosen UK

and Netherlands males, respectively?

We have

X ∼ N

(70

) and

Y ∼ N

(71

). Then (as we will show in later

lectures) Y − X ∼ N(1, 18).

P(Y > X) = P(Y − X > 0) = P



Y − X − 1

√

−1

√



= 1 − Φ(−1/

√

18),

since

(Y −X)−1

√

∼ N (0, 1), and the answer is approximately 0.5931.

Now suppose that in both countries, the Olympic male basketball teams are

selected from that portion of male whose hight is at least above 4” above the

mean (which corresponds to the 9

1% tallest males of the country). What is the

probability that a randomly chosen Netherlands player is taller than a randomly

chosen UK player?

For the second part, we have

P(Y > X | X ≥ 74, Y ≥ 75) =

x=74

(x) dx +

∞

x=75

∞

y=x

(y)ϕ

(x) dy dx

∞

x=74

(x) dx

∞

y=75

(y) dy

which is approximately 0.7558. So even though the Netherlands people are only

slightly taller, if we consider the tallest bunch, the Netherlands people will be

much taller on average.