2.2 Composite hypotheses
For composite hypotheses like H : θ ≥ 0, the error probabilities do not have a single value. We define:
Definition (Power function). The power function is

W(θ) = P(X ∈ C | θ) = P(reject H₀ | θ).
We want W(θ) to be small on H₀ and large on H₁.
Definition (Size). The size of the test is

α = sup_{θ∈Θ₀} W(θ).

This is the worst-case probability of a Type I error, taken over all θ ∈ Θ₀.
For θ ∈ Θ₁, 1 − W(θ) = P(Type II error | θ).
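As a quick illustration of these definitions, here is a minimal numerical sketch in Python (the binomial model, critical region, and numbers are invented for illustration and are not from the notes):

```python
import numpy as np
from scipy.stats import binom

# Illustrative setup: X ~ Binomial(10, theta), testing H0: theta <= 0.5
# against H1: theta > 0.5 with critical region C = {x : x >= 8}.
n, c = 10, 8

def W(theta):
    """Power function W(theta) = P(X in C | theta) = P(X >= c | theta)."""
    return binom.sf(c - 1, n, theta)

theta_grid = np.linspace(0, 0.5, 501)   # grid over Theta_0 = [0, 0.5]
size = W(theta_grid).max()              # size = sup over Theta_0 of W(theta)
print(size)                             # W is increasing, so this is W(0.5)
print(1 - W(0.7))                       # P(Type II error | theta = 0.7)
```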
Sometimes the Neyman-Pearson theory can be extended to one-sided alternatives.
For example, in the previous example, we have shown that the most powerful size α test of H₀ : µ = µ₀ versus H₁ : µ = µ₁ (where µ₁ > µ₀) is given by

C = {x : √n(x̄ − µ₀)/σ₀ > z_α}.
The critical region depends on µ₀, n, σ₀, α, and the fact that µ₁ > µ₀. It does not depend on the particular value of µ₁. This test is then uniformly the most powerful size α test of H₀ : µ = µ₀ against H₁ : µ > µ₀.
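This critical region is easy to turn into a decision rule. The following is a minimal Python sketch (the data and parameter values are made up):

```python
import numpy as np
from scipy.stats import norm

def one_sided_z_test(x, mu0, sigma0, alpha=0.05):
    """Reject H0: mu = mu0 in favour of H1: mu > mu0 when
    sqrt(n) * (xbar - mu0) / sigma0 > z_alpha."""
    n = len(x)
    z = np.sqrt(n) * (np.mean(x) - mu0) / sigma0
    z_alpha = norm.ppf(1 - alpha)   # upper alpha-point: Phi(z_alpha) = 1 - alpha
    return z > z_alpha

# Hypothetical data: n = 25 observations with mu0 = 0, sigma0 = 1.
rng = np.random.default_rng(0)
x = rng.normal(0.4, 1.0, size=25)
print(one_sided_z_test(x, mu0=0.0, sigma0=1.0))
```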
Definition (Uniformly most powerful test). A test specified by a critical region C is a uniformly most powerful (UMP) size α test for testing H₀ : θ ∈ Θ₀ against H₁ : θ ∈ Θ₁ if

(i) sup_{θ∈Θ₀} W(θ) = α;

(ii) for any other test C* with size ≤ α and with power function W*, we have W(θ) ≥ W*(θ) for all θ ∈ Θ₁.
Note that UMP tests may not exist. However, the likelihood ratio test often works.
Example. Suppose X₁, ···, Xₙ are iid N(µ, σ₀²) where σ₀ is known, and we wish to test H₀ : µ ≤ µ₀ against H₁ : µ > µ₀.
First consider testing H₀′ : µ = µ₀ against H₁′ : µ = µ₁, where µ₁ > µ₀. The Neyman-Pearson test of size α of H₀′ against H₁′ has

C = {x : √n(x̄ − µ₀)/σ₀ > z_α}.
We show that C is in fact UMP for the composite hypotheses H₀ against H₁.
For µ ∈ R, the power function is

W(µ) = P_µ(reject H₀)
     = P_µ(√n(X̄ − µ₀)/σ₀ > z_α)
     = P_µ(√n(X̄ − µ)/σ₀ > z_α + √n(µ₀ − µ)/σ₀)
     = 1 − Φ(z_α + √n(µ₀ − µ)/σ₀).
To show this is UMP, first note that W(µ₀) = α (by plugging in) and that W(µ) is an increasing function of µ. So

sup_{µ≤µ₀} W(µ) = α,

and the first condition is satisfied.
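Both facts are easy to verify numerically; a minimal sketch (with arbitrary illustrative values µ₀ = 0, σ₀ = 1, n = 25):

```python
import numpy as np
from scipy.stats import norm

def power(mu, mu0=0.0, sigma0=1.0, n=25, alpha=0.05):
    """W(mu) = 1 - Phi(z_alpha + sqrt(n) * (mu0 - mu) / sigma0)."""
    z_alpha = norm.ppf(1 - alpha)
    return norm.sf(z_alpha + np.sqrt(n) * (mu0 - mu) / sigma0)  # sf = 1 - cdf

mus = np.linspace(-1.0, 1.0, 201)
assert np.all(np.diff(power(mus)) > 0)   # W is increasing in mu
assert np.isclose(power(0.0), 0.05)      # W(mu0) = alpha
# Since W is increasing, sup over {mu <= mu0} of W is W(mu0) = alpha.
```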
For the second condition, observe that for any µ₁ > µ₀, the Neyman-Pearson size α test of H₀′ vs H₁′ has critical region C. Let C* and W* belong to any other test of H₀ vs H₁ of size ≤ α. Then C* can also be regarded as a test of H₀′ vs H₁′ of size ≤ α, and the Neyman-Pearson lemma says that W*(µ₁) ≤ W(µ₁). This holds for all µ₁ > µ₀. So the second condition is satisfied, and C is UMP.
We now consider likelihood ratio tests for more general situations.
Definition (Likelihood of a composite hypothesis). The likelihood of a composite hypothesis H : θ ∈ Θ given data x is

L_x(H) = sup_{θ∈Θ} f(x | θ).
So far we have considered disjoint hypotheses Θ₀, Θ₁, but we are not interested in any specific alternative. So it is easier to take Θ₁ = Θ rather than Θ \ Θ₀. Then
Λ_x(H₀; H₁) = L_x(H₁)/L_x(H₀) = [sup_{θ∈Θ₁} f(x | θ)] / [sup_{θ∈Θ₀} f(x | θ)] ≥ 1,
with large values of Λ indicating departure from H₀.
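Computationally, Λ_x is just two maximisations of the likelihood, one over Θ₁ and one over Θ₀. Here is a minimal sketch for the normal model used in the next example, deliberately using numerical maximisation in place of the closed-form mle (the data are illustrative):

```python
import numpy as np
from scipy.optimize import minimize_scalar
from scipy.stats import norm

def log_lik(mu, x, sigma0):
    """Log-likelihood of mu for iid N(mu, sigma0^2) data x."""
    return np.sum(norm.logpdf(x, loc=mu, scale=sigma0))

def two_log_lambda(x, mu0, sigma0):
    """2 log Lambda_x(H0; H1) with Theta_0 = {mu0} and Theta_1 = R."""
    # sup over Theta_1 = R, found numerically (it is xbar in closed form)
    res = minimize_scalar(lambda mu: -log_lik(mu, x, sigma0))
    sup_l1 = -res.fun
    sup_l0 = log_lik(mu0, x, sigma0)   # Theta_0 is a point: sup = evaluation
    return 2 * (sup_l1 - sup_l0)

rng = np.random.default_rng(1)
x = rng.normal(0.2, 1.0, size=30)
print(two_log_lambda(x, mu0=0.0, sigma0=1.0))   # always >= 0, i.e. Lambda >= 1
```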
Example. Suppose that X₁, ···, Xₙ are iid N(µ, σ₀²), with σ₀² known, and we wish to test H₀ : µ = µ₀ against H₁ : µ ≠ µ₀ (for a given constant µ₀). Here Θ₀ = {µ₀} and Θ = R.
For the numerator, we have sup_Θ f(x | µ) = f(x | µ̂), where µ̂ is the mle. We know that µ̂ = x̄. Hence

Λ_x(H₀; H₁) = [(2πσ₀²)^(−n/2) exp(−Σ(xᵢ − x̄)²/(2σ₀²))] / [(2πσ₀²)^(−n/2) exp(−Σ(xᵢ − µ₀)²/(2σ₀²))].
Then H₀ is rejected if Λ_x is large.
To make our lives easier, we can use the logarithm instead:

2 log Λ_x(H₀; H₁) = (1/σ₀²)[Σ(xᵢ − µ₀)² − Σ(xᵢ − x̄)²] = n(x̄ − µ₀)²/σ₀².
So we can reject H₀ if

|√n(x̄ − µ₀)/σ₀| > c

for some c.
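The identity above can be sanity-checked numerically; a throwaway sketch (any data set would do):

```python
import numpy as np

rng = np.random.default_rng(2)
x, mu0, sigma0 = rng.normal(0.3, 1.0, size=50), 0.0, 1.0
xbar = x.mean()
lhs = (np.sum((x - mu0)**2) - np.sum((x - xbar)**2)) / sigma0**2
rhs = len(x) * (xbar - mu0)**2 / sigma0**2
assert np.isclose(lhs, rhs)   # 2 log Lambda, computed both ways
```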
We know that under H₀, Z = √n(X̄ − µ₀)/σ₀ ∼ N(0, 1). So the size α generalised likelihood ratio test rejects H₀ if

|√n(x̄ − µ₀)/σ₀| > z_{α/2}.
Alternatively, since n(X̄ − µ₀)²/σ₀² ∼ χ²₁, we reject H₀ if

n(x̄ − µ₀)²/σ₀² > χ²₁(α)

(check that z_{α/2}² = χ²₁(α)).
Note that this is a two-tailed test, i.e. we reject H₀ both for high and low values of x̄.
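A quick check that the two forms of the test agree (a sketch; the quantiles come from scipy):

```python
from scipy.stats import chi2, norm

alpha = 0.05
z = norm.ppf(1 - alpha / 2)      # z_{alpha/2}: upper alpha/2 point of N(0, 1)
c = chi2.ppf(1 - alpha, df=1)    # chi^2_1(alpha): upper alpha point of chi^2_1
assert abs(z**2 - c) < 1e-8
# So |sqrt(n)(xbar - mu0)/sigma0| > z  iff  n(xbar - mu0)^2/sigma0^2 > c.
```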
The next theorem allows us to use likelihood ratio tests even when we cannot
find the exact relevant null distribution.
First consider the “size” or “dimension” of our hypotheses: suppose that H₀ imposes p independent restrictions on Θ. So for example, if Θ = {θ : θ = (θ₁, ···, θ_k)}, and we have
– H₀ : θ_{i₁} = a₁, θ_{i₂} = a₂, ···, θ_{i_p} = a_p; or
– H₀ : Aθ = b (with A p × k and b p × 1 given); or
– H₀ : θᵢ = fᵢ(φ), i = 1, ···, k, for some φ = (φ₁, ···, φ_{k−p}).
We say Θ has k free parameters and Θ₀ has k − p free parameters, and we write |Θ₀| = k − p and |Θ| = k.
Theorem (Generalised likelihood ratio theorem). Suppose Θ₀ ⊆ Θ₁ and |Θ₁| − |Θ₀| = p. Let X = (X₁, ···, Xₙ) with all Xᵢ iid. If H₀ is true, then as n → ∞,

2 log Λ_X(H₀; H₁) ∼ χ²_p.
If H₀ is not true, then 2 log Λ tends to be larger. We reject H₀ if 2 log Λ > c, where c = χ²_p(α), for a test of approximately size α.
We will not prove this result here. In our example above, |Θ₁| − |Θ₀| = 1, and we saw that under H₀, 2 log Λ ∼ χ²₁ exactly for all n, rather than just approximately.
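This exactness is easy to see by simulation; a sketch (the deliberately small n would be useless for an asymptotic result, but is fine here because the χ²₁ distribution is exact):

```python
import numpy as np
from scipy.stats import kstest

rng = np.random.default_rng(3)
n, mu0, sigma0 = 5, 0.0, 1.0                      # deliberately small n
x = rng.normal(mu0, sigma0, size=(100_000, n))    # 100,000 samples under H0
two_log_lambda = n * (x.mean(axis=1) - mu0)**2 / sigma0**2
print(kstest(two_log_lambda, 'chi2', args=(1,)))  # large p-value expected
```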