2 Hypothesis testing

IB Statistics

2.6 Student’s t-distribution

Definition ($t$-distribution). Suppose that $Z$ and $Y$ are independent, with $Z \sim N(0, 1)$ and $Y \sim \chi_k^2$. Then
\[
  T = \frac{Z}{\sqrt{Y/k}}
\]
is said to have a $t$-distribution on $k$ degrees of freedom, and we write $T \sim t_k$.
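As a sanity check on this definition, one can simulate $T = Z/\sqrt{Y/k}$ directly and compare a tail probability against SciPy's implementation of $t_k$; the degrees of freedom, sample size, and seed below are arbitrary choices for illustration.

```python
import numpy as np
from scipy import stats

k = 5                                # degrees of freedom (arbitrary choice)
rng = np.random.default_rng(0)
n = 200_000

Z = rng.standard_normal(n)           # Z ~ N(0, 1)
Y = rng.chisquare(k, size=n)         # Y ~ chi^2_k, independent of Z
T = Z / np.sqrt(Y / k)               # T ~ t_k by the definition above

# The empirical tail probability should agree with scipy.stats.t
emp = np.mean(T > 1.0)
theory = stats.t.sf(1.0, df=k)
print(emp, theory)
```

With this many samples the empirical and theoretical tail probabilities should agree to about two decimal places.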

The density of $t_k$ turns out to be
\[
  f_T(t) = \frac{\Gamma((k+1)/2)}{\Gamma(k/2)} \frac{1}{\sqrt{\pi k}} \left(1 + \frac{t^2}{k}\right)^{-(k+1)/2}.
\]
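The density formula above can be checked numerically against SciPy's built-in $t$ density; the degrees of freedom and the evaluation grid are arbitrary.

```python
import numpy as np
from scipy.special import gamma
from scipy import stats

def t_pdf(t, k):
    """Density of t_k, transcribed from the formula above."""
    return (gamma((k + 1) / 2) / gamma(k / 2)
            * (1 / np.sqrt(np.pi * k))
            * (1 + t**2 / k) ** (-(k + 1) / 2))

ts = np.linspace(-4, 4, 9)
for k in (1, 2, 5, 30):
    assert np.allclose(t_pdf(ts, k), stats.t.pdf(ts, df=k))
```

Note that $k = 1$ recovers the Cauchy density $1/(\pi(1+t^2))$, since $\Gamma(1)/\Gamma(1/2) = 1/\sqrt{\pi}$.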

This density is symmetric, bell-shaped, and has a maximum at $t = 0$, rather like the standard normal density. However, it can be shown that $P(T > t) > P(Z > t)$ for $t > 0$, i.e. the $t$-distribution has a “fatter” tail. Also, as $k \to \infty$, $t_k$ approaches the standard normal distribution.
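Both claims are easy to verify numerically: the upper tail of $t_k$ dominates that of $N(0,1)$, and the gap shrinks as $k$ grows. The cutoff $t = 2$ below is an arbitrary choice.

```python
from scipy import stats

t_cut = 2.0
for k in (1, 5, 30):
    # Fatter upper tail: P(T > t) > P(Z > t) for t > 0
    assert stats.t.sf(t_cut, df=k) > stats.norm.sf(t_cut)

# As k grows, the tail probability approaches the normal one
gaps = [stats.t.sf(t_cut, df=k) - stats.norm.sf(t_cut) for k in (2, 10, 100)]
assert gaps[0] > gaps[1] > gaps[2] > 0
print(gaps)
```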

Proposition. If $k > 1$, then $E(T) = 0$. If $k > 2$, then $\operatorname{var}(T) = \frac{k}{k-2}$. If $k = 2$, then $\operatorname{var}(T) = \infty$.

In all other cases the mean and variance are undefined. In particular, the $k = 1$ case has undefined mean and variance; this is known as the Cauchy distribution.
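These moment formulas agree with what SciPy reports for the $t$ family; the particular values of $k$ tried below are arbitrary.

```python
import numpy as np
from scipy import stats

for k in (3, 5, 10):
    mean, var = stats.t.stats(df=k, moments="mv")
    assert abs(mean) < 1e-12                # E(T) = 0 for k > 1
    assert abs(var - k / (k - 2)) < 1e-12   # var(T) = k/(k-2) for k > 2

# k = 2 gives infinite variance; k = 1 is the Cauchy case,
# whose mean and variance are not finite (SciPy reports inf/nan)
assert np.isinf(stats.t.var(df=2))
assert not np.isfinite(stats.t.mean(df=1))
```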

Notation. We write $t_k(\alpha)$ for the upper $100\alpha\%$ point of the $t_k$ distribution, so that $P(T > t_k(\alpha)) = \alpha$.
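In SciPy terms, $t_k(\alpha)$ is the $(1-\alpha)$-quantile of $t_k$; the helper name `t_upper` and the values $k = 9$, $\alpha = 0.025$ below are illustrative choices.

```python
from scipy import stats

def t_upper(k, alpha):
    """Upper 100*alpha% point t_k(alpha), i.e. P(T > t_k(alpha)) = alpha."""
    return stats.t.ppf(1 - alpha, df=k)    # equivalently stats.t.isf(alpha, df=k)

q = t_upper(9, 0.025)
assert abs(stats.t.sf(q, df=9) - 0.025) < 1e-9
print(q)  # roughly 2.26, the familiar two-sided 95% multiplier for n = 10
```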

Why would we define such a weird distribution? The typical application is

to study random samples with unknown mean and unknown variance.

Let $X_1, \cdots, X_n$ be iid $N(\mu, \sigma^2)$. Then $\bar{X} \sim N(\mu, \sigma^2/n)$. So
\[
  Z = \frac{\sqrt{n}(\bar{X} - \mu)}{\sigma} \sim N(0, 1).
\]

Also, $S_{XX}/\sigma^2 \sim \chi^2_{n-1}$ and is independent of $\bar{X}$, and hence of $Z$. So
\[
  \frac{\sqrt{n}(\bar{X} - \mu)/\sigma}{\sqrt{S_{XX}/((n-1)\sigma^2)}} \sim t_{n-1},
\]
or
\[
  \frac{\sqrt{n}(\bar{X} - \mu)}{\sqrt{S_{XX}/(n-1)}} \sim t_{n-1}.
\]
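This statistic is exactly what a one-sample $t$ test computes, so it can be checked against `scipy.stats.ttest_1samp`; the sample and the hypothesised mean below are made up for illustration.

```python
import numpy as np
from scipy import stats

x = np.array([4.2, 5.1, 3.8, 4.9, 5.4, 4.4, 5.0, 4.6])  # made-up sample
mu0 = 4.0                         # hypothesised mean (arbitrary)
n = len(x)

xbar = x.mean()
Sxx = np.sum((x - xbar) ** 2)
sigma_tilde = np.sqrt(Sxx / (n - 1))          # sqrt of the unbiased estimator

T = np.sqrt(n) * (xbar - mu0) / sigma_tilde   # ~ t_{n-1} when mu = mu0

t_scipy, p = stats.ttest_1samp(x, mu0)
assert abs(T - t_scipy) < 1e-12
print(T, p)
```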

We write $\tilde{\sigma}^2 = \frac{S_{XX}}{n-1}$ (note that this is the unbiased estimator). Then a $100(1-\alpha)\%$ confidence interval for $\mu$ is found from
\[
  1 - \alpha = P\left(-t_{n-1}\left(\frac{\alpha}{2}\right) \le \frac{\sqrt{n}(\bar{X} - \mu)}{\tilde{\sigma}} \le t_{n-1}\left(\frac{\alpha}{2}\right)\right).
\]
This has endpoints
\[
  \bar{X} \pm \frac{\tilde{\sigma}}{\sqrt{n}}\, t_{n-1}\left(\frac{\alpha}{2}\right).
\]
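A short sketch of these endpoints in code, using a made-up sample and $\alpha = 0.05$; the result is compared against SciPy's built-in `t.interval` as a cross-check.

```python
import numpy as np
from scipy import stats

x = np.array([4.2, 5.1, 3.8, 4.9, 5.4, 4.4, 5.0, 4.6])  # made-up sample
alpha = 0.05
n = len(x)

xbar = x.mean()
sigma_tilde = np.sqrt(np.sum((x - xbar) ** 2) / (n - 1))

# Endpoints X-bar +/- (sigma-tilde / sqrt(n)) * t_{n-1}(alpha/2)
half = sigma_tilde / np.sqrt(n) * stats.t.ppf(1 - alpha / 2, df=n - 1)
lo, hi = xbar - half, xbar + half

# Cross-check against SciPy's built-in interval
lo2, hi2 = stats.t.interval(1 - alpha, n - 1,
                            loc=xbar, scale=sigma_tilde / np.sqrt(n))
assert np.isclose(lo, lo2) and np.isclose(hi, hi2)
print(lo, hi)
```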