2.4 Tests of homogeneity, and connections to confidence intervals
2.4.1 Tests of homogeneity
Example. 150 patients were randomly allocated to three groups of 50 patients
each. Two groups were given a new drug at different dosage levels, and the third
group received a placebo. The responses were as shown in the table below.
            Improved   No difference   Worse   Total
Placebo        18            17          15      50
Half dose      20            10          20      50
Full dose      25            13          12      50
Total          63            40          47     150
Here the row totals are fixed in advance, in contrast to our last section, where
the row totals are random variables.
For the above, we may be interested in testing $H_0$: the probability of “improved” is the same for each of the three treatment groups, and so are the probabilities of “no difference” and “worse”, i.e. $H_0$ says that we have homogeneity down the rows.
In general, we have independent observations from $r$ multinomial distributions, each of which has $c$ categories, i.e. we observe an $r \times c$ table $(n_{ij})$, for $i = 1, \cdots, r$ and $j = 1, \cdots, c$, where
\[
  (N_{i1}, \cdots, N_{ic}) \sim \operatorname{multinomial}(n_{i+}; p_{i1}, \cdots, p_{ic})
\]
independently for each $i = 1, \cdots, r$. We want to test
\[
  H_0: p_{1j} = p_{2j} = \cdots = p_{rj} = p_j, \quad \text{for } j = 1, \cdots, c,
\]
and
\[
  H_1: p_{ij} \text{ are unrestricted}.
\]
Using $H_1$, for any matrix of probabilities $(p_{ij})$,
\[
  \operatorname{like}((p_{ij})) = \prod_{i=1}^r \frac{n_{i+}!}{n_{i1}! \cdots n_{ic}!}\, p_{i1}^{n_{i1}} \cdots p_{ic}^{n_{ic}},
\]
and
\[
  \log \operatorname{like} = \text{constant} + \sum_{i=1}^r \sum_{j=1}^c n_{ij} \log p_{ij}.
\]
Using Lagrangian methods, we find that $\hat{p}_{ij} = \dfrac{n_{ij}}{n_{i+}}$.
Under $H_0$,
\[
  \log \operatorname{like} = \text{constant} + \sum_{j=1}^c n_{+j} \log p_j.
\]
By Lagrangian methods, we have $\hat{p}_j = \dfrac{n_{+j}}{n_{++}}$.
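To see where these estimates come from, here is the Lagrangian step spelt out under $H_1$ (the $H_0$ case is identical, with the single constraint $\sum_j p_j = 1$). We maximise the log likelihood subject to $\sum_{j=1}^c p_{ij} = 1$ for each $i$:
\[
  \mathcal{L} = \sum_{i=1}^r \sum_{j=1}^c n_{ij} \log p_{ij} - \sum_{i=1}^r \lambda_i \Big( \sum_{j=1}^c p_{ij} - 1 \Big), \qquad \frac{\partial \mathcal{L}}{\partial p_{ij}} = \frac{n_{ij}}{p_{ij}} - \lambda_i = 0 \implies p_{ij} = \frac{n_{ij}}{\lambda_i}.
\]
Summing over $j$ and using the constraint gives $\lambda_i = \sum_j n_{ij} = n_{i+}$, so that $\hat{p}_{ij} = n_{ij}/n_{i+}$ as claimed.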
Hence
\[
  2 \log \Lambda = 2 \sum_{i=1}^r \sum_{j=1}^c n_{ij} \log \frac{\hat{p}_{ij}}{\hat{p}_j} = 2 \sum_{i=1}^r \sum_{j=1}^c n_{ij} \log \frac{n_{ij}}{n_{i+} n_{+j}/n_{++}},
\]
which is the same as what we had last time, when the row totals are unrestricted!
We have $|\Theta_1| = r(c - 1)$ and $|\Theta_0| = c - 1$. So the degrees of freedom is $r(c - 1) - (c - 1) = (r - 1)(c - 1)$, and under $H_0$, $2 \log \Lambda$ is approximately $\chi^2_{(r-1)(c-1)}$. Again, it is exactly the same as what we had last time!
We reject $H_0$ if $2 \log \Lambda > \chi^2_{(r-1)(c-1)}(\alpha)$ for an approximate size $\alpha$ test.
If we let $o_{ij} = n_{ij}$, $e_{ij} = \dfrac{n_{i+} n_{+j}}{n_{++}}$ and $\delta_{ij} = o_{ij} - e_{ij}$, then using the same approximating steps as for Pearson's chi-squared, we obtain
\[
  2 \log \Lambda \approx \sum \frac{(o_{ij} - e_{ij})^2}{e_{ij}}.
\]
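In code, both statistics fall straight out of the table. Below is a minimal sketch (assuming NumPy and SciPy are available; `homogeneity_test` is our own illustrative name, not a library routine):

    import numpy as np
    from scipy.stats import chi2

    def homogeneity_test(table, alpha=0.05):
        """Likelihood ratio and Pearson statistics for an r x c table."""
        n = np.asarray(table, dtype=float)
        row = n.sum(axis=1, keepdims=True)    # n_{i+}, fixed in advance
        col = n.sum(axis=0, keepdims=True)    # n_{+j}
        e = row * col / n.sum()               # e_{ij} = n_{i+} n_{+j} / n_{++}

        # 2 log Lambda = 2 * sum n_{ij} log(n_{ij}/e_{ij}); a zero count
        # contributes 0, so replace 0 by 1 inside the log to avoid log(0).
        safe = np.where(n > 0, n, 1.0)
        lr = 2 * (n * np.log(safe / e)).sum()

        pearson = ((n - e) ** 2 / e).sum()    # Pearson's approximation

        df = (n.shape[0] - 1) * (n.shape[1] - 1)
        crit = chi2.ppf(1 - alpha, df)        # chi^2_{(r-1)(c-1)}(alpha)
        return lr, pearson, df, crit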
Example. Continuing our previous example, our data is

            Improved   No difference   Worse   Total
Placebo        18            17          15      50
Half dose      20            10          20      50
Full dose      25            13          12      50
Total          63            40          47     150
The expected counts under $H_0$ are

            Improved   No difference   Worse   Total
Placebo        21           13.3        15.7     50
Half dose      21           13.3        15.7     50
Full dose      21           13.3        15.7     50
Total          63           40          47      150
We find $2 \log \Lambda = 5.129$, and we refer this to $\chi^2_4$. Clearly this is not significant, as the mean of $\chi^2_4$ is 4, and a value of 5.129 is something we would expect to happen solely by chance.
We can calculate the $p$-value: from tables, $\chi^2_4(0.05) = 9.488$, so our observed value is not significant at 5%, and the data are consistent with $H_0$.
We conclude that there is no evidence for a difference between the drug at
the given doses and the placebo.
For interest,
\[
  \sum \frac{(o_{ij} - e_{ij})^2}{e_{ij}} = 5.173,
\]
giving the same conclusion.
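These figures are easy to verify in SciPy (a quick check, not part of the original notes: `chi2_contingency` computes Pearson's statistic, and passing `lambda_="log-likelihood"` gives $2 \log \Lambda$ instead):

    import numpy as np
    from scipy.stats import chi2_contingency

    table = np.array([[18, 17, 15],
                      [20, 10, 20],
                      [25, 13, 12]])

    # Likelihood ratio statistic: 2 log Lambda = 5.129 on 4 degrees of freedom
    lr, p_lr, df, expected = chi2_contingency(table, lambda_="log-likelihood")

    # Pearson's chi-squared: 5.173 on the same 4 degrees of freedom
    pearson, p_pearson, _, _ = chi2_contingency(table)

    print(df, round(lr, 3), round(pearson, 3))   # 4 5.129 5.173
    print(round(p_lr, 3), round(p_pearson, 3))   # both around 0.27, well above 0.05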
2.4.2 Confidence intervals and hypothesis tests
Confidence intervals or sets can be obtained by inverting hypothesis tests, and vice versa.
Definition (Acceptance region). The acceptance region $A$ of a test is the complement of the critical region $C$.
Note that when we say “acceptance”, we really mean “non-rejection”! The
name is purely for historical reasons.
Theorem (Duality of hypothesis tests and confidence intervals). Suppose $X_1, \cdots, X_n$ have joint pdf $f_X(x \mid \theta)$ for $\theta \in \Theta$.
(i) Suppose that for every $\theta_0 \in \Theta$ there is a size $\alpha$ test of $H_0: \theta = \theta_0$. Denote the acceptance region by $A(\theta_0)$. Then the set $I(X) = \{\theta : X \in A(\theta)\}$ is a $100(1 - \alpha)\%$ confidence set for $\theta$.
(ii) Suppose $I(X)$ is a $100(1 - \alpha)\%$ confidence set for $\theta$. Then $A(\theta_0) = \{X : \theta_0 \in I(X)\}$ is an acceptance region for a size $\alpha$ test of $H_0: \theta = \theta_0$.
Intuitively, this says that “confidence intervals” and “hypothesis acceptance/rejection” are the same thing. After gathering some data $X$, we can produce a, say, 95% confidence interval $(a, b)$. Then if we want to test the hypothesis $H_0: \theta = \theta_0$, we simply have to check whether $\theta_0 \in (a, b)$.
On the other hand, if we have a test for $H_0: \theta = \theta_0$, then the confidence interval is the set of all $\theta_0$ for which we would accept $H_0: \theta = \theta_0$.
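Both directions of the duality are mechanical enough to write down generically. A minimal sketch (assuming NumPy; the function names and the grid search are ours, purely for illustration):

    import numpy as np

    # Direction 1: given a 100(1 - alpha)% confidence interval (a, b),
    # test H_0: theta = theta_0 at level alpha by checking membership.
    def reject_from_ci(a, b, theta0):
        return not (a < theta0 < b)

    # Direction 2: given an acceptance rule for H_0: theta = theta_0,
    # recover the confidence set as all theta_0 at which we accept
    # (here by brute-force search over a grid of candidate values).
    def ci_from_test(accepts, grid):
        kept = [t for t in grid if accepts(t)]
        return min(kept), max(kept)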
Proof. First note that $\theta_0 \in I(X)$ iff $X \in A(\theta_0)$.
For (i), since the test is size $\alpha$, we have
\[
  \mathbb{P}(\text{accept } H_0 \mid H_0 \text{ is true}) = \mathbb{P}(X \in A(\theta_0) \mid \theta = \theta_0) = 1 - \alpha.
\]
And so
\[
  \mathbb{P}(\theta_0 \in I(X) \mid \theta = \theta_0) = \mathbb{P}(X \in A(\theta_0) \mid \theta = \theta_0) = 1 - \alpha.
\]
For (ii), since $I(X)$ is a $100(1 - \alpha)\%$ confidence set, we have
\[
  \mathbb{P}(\theta_0 \in I(X) \mid \theta = \theta_0) = 1 - \alpha.
\]
So
\[
  \mathbb{P}(X \in A(\theta_0) \mid \theta = \theta_0) = \mathbb{P}(\theta_0 \in I(X) \mid \theta = \theta_0) = 1 - \alpha.
\]
Example. Suppose $X_1, \cdots, X_n$ are iid $N(\mu, 1)$ random variables and we want a 95% confidence set for $\mu$.

One way is to use the theorem and find the confidence set that belongs to the hypothesis test that we found in the previous example. We find a test of size 0.05 of $H_0: \mu = \mu_0$ against $H_1: \mu \neq \mu_0$ that rejects $H_0$ when $|\sqrt{n}(\bar{x} - \mu_0)| > 1.96$ (where 1.96 is the upper 2.5% point of $N(0, 1)$).
Then $I(X) = \{\mu : X \in A(\mu)\} = \{\mu : |\sqrt{n}(\bar{X} - \mu)| < 1.96\}$. So a 95% confidence set for $\mu$ is $(\bar{X} - 1.96/\sqrt{n}, \bar{X} + 1.96/\sqrt{n})$.
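As a quick sanity check on the inversion (a simulation sketch, assuming NumPy; the sample size, true mean and seed are arbitrary choices of ours), the resulting interval covers the true $\mu$ close to 95% of the time:

    import numpy as np

    rng = np.random.default_rng(0)
    n, mu_true, trials = 25, 3.0, 10_000

    covered = 0
    for _ in range(trials):
        x = rng.normal(mu_true, 1.0, size=n)
        xbar = x.mean()
        # Accept H_0: mu = mu_0 iff |sqrt(n)(xbar - mu_0)| < 1.96, i.e. the
        # confidence set is (xbar - 1.96/sqrt(n), xbar + 1.96/sqrt(n)).
        lo, hi = xbar - 1.96 / np.sqrt(n), xbar + 1.96 / np.sqrt(n)
        covered += lo < mu_true < hi

    print(covered / trials)   # approximately 0.95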