IB Analysis II - Differentiation from ℝ<sup>m</sup> to ℝ<sup>n</sup>

6Differentiation from ℝ^m to ℝⁿ

IB Analysis II

6.3 Mean value inequalities

So far, we have just looked at cases where we assume the function is differentiable

at a point. We are now going to assume the function is differentiable in a region,

and see what happens to the derivative.

Recall the mean value theorem from single-variable calculus: if

: [

a, b

]

→ R

is continuous on [a, b] and differentiable on (a, b), then

f(b) − f(a) = f

′

(c)(b − a)

for some

c ∈

(

a, b

). This is our favorite theorem, and we have used it many

times in IA Analysis. Here we have an exact equality. However, in general, for

vector-valued functions, i.e. if we are mapping to

, this is no longer true.

Instead, we only have an inequality.

We first prove it for the case when the domain is a subset of

, and then

reduce the general case to this special case.

Theorem. Let f : [

a, b

]

→ R

be continuous on [

a, b

] and differentiable on (

a, b

Suppose we can find some

such that for all

t ∈

(

a, b

), we have

∥D

)

∥ ≤ M

Then

∥f(b) − f(a)∥ ≤ M(b − a).

Proof. Let v = f(b) −f(a). We define

g(t) = v · f (t) =

i=1

(t).

Since each

is differentiable,

is continuous on [

a, b

] and differentiable on (

a, b

)

with

′

(t) =

′

(t).

Hence, we know

′

(t)| ≤



i=1

′

(t)



≤ ∥v∥

i=1

′2

(t)

1/2

= ∥v∥∥Df(t)∥ ≤ M∥v∥.

We now apply the mean value theorem to g to get

g(b) −g(a) = g

′

(t)(b − a)

for some t ∈ (a, b). By definition of g, we get

v · (f (b) − f (a)) = g

′

(t)(b − a).

By definition of v, we have

∥f(b) − f(a)∥

= |g

′

(t)(b − a)| ≤ (b − a)M∥f(b) − f(a)∥.

If f (

) = f (

), then there is nothing to prove. Otherwise, divide by

∥

)

−

)

∥

and done.

We now apply this to prove the general version.

Theorem (Mean value inequality). Let a

∈ R

and f :

(a)

→ R

differentiable on B

(a) with ∥Df (x)∥ ≤ M for all x ∈ B

(a). Then

∥f(b

) − f(b

)∥ ≤ M∥b

− b

∥

for any b

, b

∈ B

(a).

Proof. We will reduce this to the previous theorem.

Fix b

, b

∈ B

(a). Note that

+ (1 − t)b

∈ B

(a)

for all t ∈ [0, 1]. Now consider g : [0, 1] → R

g(t) = f (tb

+ (1 − t)b

By the chain rule, g is differentiable and

′

(t) = Dg(t) = (Df(tb

+ (1 − t)b

))(b

− b

)

Therefore

∥Dg(t)∥ ≤ ∥Df(tb

+ (1 − t)b

)∥∥b

− b

∥ ≤ M∥b

− b

∥.

Now we can apply the previous theorem, and get

∥f(b

) − f(b

)∥ = ∥g(1) − g(0)∥ ≤ M∥b

− b

∥.

Note that here we worked in a ball. In general, we could have worked in a

convex set, since all we need is for tb

+ (1 − t)b

to be inside the domain.

But with this, we have the following easy corollary.

Corollary. Let f :

(a)

⊆ R

→ R

have

f(x) = 0 for all x

∈ B

(a). Then

f is constant.

Proof. Apply the mean value inequality with M = 0.

We would like to extend this corollary. Does this corollary extend to differ-

entiable maps f with Df = 0 defined on any open set U ⊆ R

The answer is clearly no. Even for functions

R → R

, this is not true, since

we can have two disjoint intervals [1, 2] ∪ [3, 4], and define f(t) to be 1 on [1, 2]

and 2 on [3

4]. Then

= 0 but

is not constant.

is just locally constant on

each interval.

The problem with this is that the sets are disconnected. We cannot connect

points in [1

2] and points in [3

4] with a line. If we can do so, then we would be

able to show that f is constant.

Definition (Path-connected subset). A subset

E ⊆ R

is path-connected if for

any a, b ∈ E, there is a continuous map γ : [0, 1] → E such that

γ(0) = a, γ(1) = b.

Theorem. Let

U ⊆ R

be open and path-connected. Then for any differentiable

f : U → R

, if Df (x) = 0 for all x ∈ U, then f is constant on U.

A naive attempt would be to replace

−

−t

in the proof of the mean

value theorem with a path γ(t). However, this is not a correct proof, since this

has to assume γ is differentiable. So this doesn’t work. We have to think some

more.

Proof.

We are going to use the fact that f is locally constant. wlog, assume

= 1. Given any a

∈ U

, we show that

(a) =

(b). Let

: [0

→ U

a (continuous) path from a to b. For any

s ∈

1), there exists some

such

that

(

))

⊆ U

since

is open. By continuity of

, there is a

such that

(s − δ, s + δ) ⊆ [0, 1] with γ((s − δ, s + δ)) ⊆ B

(γ(s)) ⊆ U.

Since

is constant on

(

)) by the previous corollary, we know that

(

) =

f ◦ γ

(

) is constant on (

s − δ, s

). In particular,

is differentiable at

with derivative 0. This is true for all

. So the map

: [0

→ R

has zero

derivative on (0

1) and is continuous on (0

1). So

is constant. So

(0) =

(1),

i.e. f(a) = f(b).

were differentiable, then this is much easier, since we can show

′

= 0 by

the chain rule:

′

(t) = Df(γ(t))γ

′

(t).