1 Estimation

IB Statistics

1.1 Estimators
The goal of estimation is as follows: we are given iid X_1, ··· , X_n, and we know
that their probability density/mass function is f_X(x; θ) for some unknown θ.
We know f_X but not θ. For example, we might know that they follow a Poisson
distribution, but we do not know what the mean is. The objective is to estimate
the value of θ.
Definition (Statistic). A statistic is an estimate of θ. It is a function T of the
data. If we write the data as x = (x_1, ··· , x_n), then our estimate is written as
θ̂ = T(x). T(X) is an estimator of θ.
The distribution of T = T (X) is the sampling distribution of the statistic.
Note that we adopt the convention where capital X denotes a random variable
and x is an observed value. So T(X) is a random variable and T(x) is a particular
value we obtain after experiments.
Example. Let X_1, ··· , X_n be iid N(µ, 1). A possible estimator for µ is

    T(X) = (1/n) Σ X_i.

Then for any particular observed sample x, our estimate is

    T(x) = (1/n) Σ x_i.
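As a quick numerical illustration of the estimator/estimate distinction (a Python sketch, not part of the notes; the sample values below are made up):

```python
# T is the statistic: a function of the data.
# Applying it to one observed sample x gives a single number, the estimate.
x = [1.2, 0.7, 1.9, 1.1, 0.6]  # observed sample x = (x_1, ..., x_n), made up

def T(data):
    """The sample mean: T(x) = (1/n) * sum(x_i)."""
    return sum(data) / len(data)

estimate = T(x)  # our estimate of mu for this particular sample
print(estimate)
```

Applied to a random sample X, the same function T is a random variable (the estimator); applied to observed values x, it is just a number.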
What is the sampling distribution of T? Recall from IA Probability that in
general, if X_i ~ N(µ_i, σ_i²), then Σ X_i ~ N(Σ µ_i, Σ σ_i²), which is something we
can prove by considering moment-generating functions.
So we have T(X) ~ N(µ, 1/n). Note that by the Central Limit Theorem,
even if X_i were not normal, we still have approximately T(X) ~ N(µ, 1/n) for
large values of n, but here we get exactly the normal distribution even for small
values of n.
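The claim T(X) ~ N(µ, 1/n) can be checked empirically; the following Python sketch (not part of the notes; the values of µ, n and the repetition count are arbitrary choices) simulates many samples and compares the mean and variance of T against µ and 1/n:

```python
import random
import statistics

# Simulate the sampling distribution of T(X) = (1/n) * sum(X_i)
# when X_1, ..., X_n are iid N(mu, 1).
random.seed(0)
mu, n, reps = 2.0, 25, 20000

samples_of_T = [
    statistics.fmean(random.gauss(mu, 1) for _ in range(n))
    for _ in range(reps)
]

print(statistics.fmean(samples_of_T))     # close to mu = 2.0
print(statistics.variance(samples_of_T))  # close to 1/n = 0.04
```

With n = 25 the simulated variance comes out near 1/n = 0.04, matching the exact result above.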
The estimator (1/n) Σ X_i we had above is a rather sensible estimator. Of course,
we can also have silly estimators such as T(X) = X_1, or even T(X) = 0.32
always.
One way to decide if an estimator is silly is to look at its bias.
Definition (Bias). Let θ̂ = T(X) be an estimator of θ. The bias of θ̂ is the
difference between its expected value and the true value:

    bias(θ̂) = E_θ(θ̂) − θ.

Note that the subscript θ does not represent the random variable, but the thing
we want to estimate. This is inconsistent with the use for, say, the probability
mass function.
An estimator is unbiased if it has no bias, i.e. E_θ(θ̂) = θ.
To find out E_θ(T), we can either find the distribution of T and find its
expected value, or evaluate T as a function of X directly and find its expected
value.
Example. In the above example, E_µ(T) = µ. So T is unbiased for µ.
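A simulation can also estimate the bias directly, by averaging θ̂ over many samples. The Python sketch below (not part of the notes; the parameter values are arbitrary) checks that the sample mean is unbiased for µ, and contrasts it with the silly constant estimator T(X) = 0.32, whose bias is exactly 0.32 − µ:

```python
import random
import statistics

# Estimate bias(theta_hat) = E_theta(theta_hat) - theta by simulation,
# for X_1, ..., X_n iid N(mu, 1).
random.seed(1)
mu, n, reps = 2.0, 10, 50000

mean_estimates = []
for _ in range(reps):
    sample = [random.gauss(mu, 1) for _ in range(n)]
    mean_estimates.append(sum(sample) / n)

bias_mean = statistics.fmean(mean_estimates) - mu  # near 0: unbiased
bias_const = 0.32 - mu  # the constant estimator has bias 0.32 - mu exactly
print(bias_mean, bias_const)
```

Note that the simulated bias of the sample mean is only approximately zero, since it is itself a random quantity; the constant estimator's bias needs no simulation at all.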