1 Manifolds

III Differential Geometry



1.2 Smooth functions and derivatives
From now on, $M$ and $N$ will be manifolds. As usual, we would like to talk about maps between manifolds. What does it mean for such a map to be smooth? In the case of a function $M \to \mathbb{R}$, we had to check it on each chart of $M$. Now that we have functions $M \to N$, we need to check it on charts of both $M$ and $N$.
Definition (Smooth function). A function $f: M \to N$ is smooth at a point $p \in M$ if there are charts $(U, \varphi)$ for $M$ and $(V, \xi)$ for $N$ with $p \in U$ and $f(p) \in V$ such that $\xi \circ f \circ \varphi^{-1}: \varphi(U) \to \xi(V)$ is smooth at $\varphi(p)$.

A function is smooth if it is smooth at all points $p \in M$.

A diffeomorphism is a smooth $f$ with a smooth inverse.
We write $C^\infty(M, N)$ for the space of smooth maps $f: M \to N$. We write $C^\infty(M)$ for $C^\infty(M, \mathbb{R})$, and this has the additional structure of an algebra, i.e. a vector space with multiplication.
(Diagram: the composite $\xi \circ f \circ \varphi^{-1}: \varphi(U) \to \xi(V)$ obtained from $f$ via the charts $\varphi$ and $\xi$.)
Equivalently, $f$ is smooth at $p$ if $\xi \circ f \circ \varphi^{-1}$ is smooth at $\varphi(p)$ for any such charts $(U, \varphi)$ and $(V, \xi)$.
Example. Let $\varphi: U \to \mathbb{R}^n$ be a chart. Then $\varphi: U \to \varphi(U)$ is a diffeomorphism.
Definition (Curve). A curve is a smooth map $I \to M$, where $I \subseteq \mathbb{R}$ is a non-empty open interval.
To discuss derivatives, we first look at the case where $U \subseteq \mathbb{R}^n$ is open. Suppose $f: U \to \mathbb{R}$ is smooth. If $p \in U$ and $v \in \mathbb{R}^n$, recall that the directional derivative is defined by
\[
  Df|_p(v) = \lim_{t \to 0} \frac{f(p + tv) - f(p)}{t}.
\]
If $v = e_i = (0, \cdots, 0, 1, 0, \cdots, 0)$, then we write
\[
  Df|_p(e_i) = \left.\frac{\partial f}{\partial x_i}\right|_p.
\]
Also, we know $Df|_p: \mathbb{R}^n \to \mathbb{R}$ is a linear map (by definition of smooth).
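As a concrete check, take $f(x_1, x_2) = x_1^2 x_2$ on $\mathbb{R}^2$ (a function chosen purely for illustration). The limit can be computed directly:

```latex
\begin{align*}
  f(p + tv) &= (p_1 + t v_1)^2 (p_2 + t v_2)
             = f(p) + t\,(2 p_1 p_2 v_1 + p_1^2 v_2) + O(t^2),\\
  Df|_p(v)  &= \lim_{t \to 0} \frac{f(p + tv) - f(p)}{t}
             = 2 p_1 p_2 v_1 + p_1^2 v_2.
\end{align*}
```

This is linear in $v$, with $Df|_p(e_1) = \frac{\partial f}{\partial x_1}\big|_p = 2 p_1 p_2$ and $Df|_p(e_2) = \frac{\partial f}{\partial x_2}\big|_p = p_1^2$.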
Note that here $p$ and $v$ are both vectors, but they play different roles: $p$ is an element in the domain $U$, while $v$ is an arbitrary vector in $\mathbb{R}^n$. Even if $v$ is enormous, by taking a small enough $t$, we find that $p + tv$ will eventually be inside $U$.
If we have a general manifold, we can still talk about the point $p$. However, we don't have anything that plays the role of a vector. Our first goal is to define the tangent space to a manifold, which captures where the "directions" live.
An obvious way to do so would be to use a curve. Suppose $\gamma: I \to M$ is a curve with $\gamma(0) = p \in U \subseteq M$, and $f: U \to \mathbb{R}$ is smooth. We can then take the derivative of $f$ along $\gamma$ as before. We let
\[
  X(f) = \left.\frac{\mathrm{d}}{\mathrm{d}t}\right|_{t=0} f(\gamma(t)).
\]
It is an exercise to see that $X: C^\infty(U) \to \mathbb{R}$ is a linear map, and it satisfies the Leibniz rule
\[
  X(fg) = f(p) X(g) + g(p) X(f).
\]
We denote $X$ by $\dot\gamma(0)$. We might think of defining the tangent space as curves up to some equivalence relation, but if we do this, there is no obvious vector space structure on it. The trick is to instead define a tangent vector to be the derivation $X$ the curve induces. This then has an obvious vector space structure.
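In the special case $M = \mathbb{R}^n$ with the straight-line curve $\gamma(t) = p + tv$ (an illustrative choice), the ordinary chain rule shows this recovers the directional derivative from before:

```latex
X(f) = \left.\frac{\mathrm{d}}{\mathrm{d}t}\right|_{t=0} f(p + tv)
     = \sum_{i=1}^n \left.\frac{\partial f}{\partial x_i}\right|_p v_i
     = Df|_p(v).
```

So every vector $v \in \mathbb{R}^n$ gives rise to a derivation in this way.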
Definition (Derivation). A derivation on an open subset $U \subseteq M$ at $p \in U$ is a linear map $X: C^\infty(U) \to \mathbb{R}$ satisfying the Leibniz rule
\[
  X(fg) = f(p) X(g) + g(p) X(f).
\]
Definition (Tangent space). Let $p \in U \subseteq M$, where $U$ is open. The tangent space of $M$ at $p$ is the vector space
\[
  T_p M = \{\text{derivations on } U \text{ at } p\} \equiv \operatorname{Der}_p(C^\infty(U)).
\]
The subscript p tells us the point at which we are taking the tangent space.
Why is this the "right" definition? There are two things we would want to be true:

(i) The definition doesn't actually depend on $U$.

(ii) This definition agrees with the usual definition of tangent vectors in $\mathbb{R}^n$.

We will do the first part at the end using bump functions, and will do the second part now. Note that it follows from the second part that every tangent vector comes from the derivative of a path, because this is certainly true for the usual definition of tangent vectors in $\mathbb{R}^n$ (take a straight line), and this is a completely local problem.
Example. Let $U \subseteq \mathbb{R}^n$ be open, and let $p \in U$. Then we have tangent vectors
\[
  \left.\frac{\partial}{\partial x_i}\right|_p \in T_p \mathbb{R}^n, \quad i = 1, \ldots, n.
\]
These correspond to the canonical basis vectors in $\mathbb{R}^n$.
Lemma. The derivations
\[
  \left.\frac{\partial}{\partial x_1}\right|_p, \cdots, \left.\frac{\partial}{\partial x_n}\right|_p
\]
form a basis of $T_p \mathbb{R}^n$. So these are all the derivations.

The idea of the proof is to show that a derivation can only depend on the first order derivatives of a function, and all possibilities are covered by the $\frac{\partial}{\partial x_i}$.
Proof. Independence is clear, as
\[
  \frac{\partial x_j}{\partial x_i} = \delta_{ij}.
\]
We need to show spanning. For notational convenience, we wlog take $p = 0$. Let $X \in T_0 \mathbb{R}^n$.

We first show that if $g \in C^\infty(U)$ is the constant function $g = 1$, then $X(g) = 0$. Indeed, we have
\[
  X(g) = X(g^2) = g(0) X(g) + X(g) g(0) = 2 X(g).
\]
Thus, if $h$ is any constant function, say $h = c$, then $X(h) = X(cg) = cX(g) = 0$. So the derivative of any constant function vanishes.
In general, let $f \in C^\infty(U)$. By Taylor's theorem, we have
\[
  f(x_1, \cdots, x_n) = f(0) + \sum_{i=1}^n \left.\frac{\partial f}{\partial x_i}\right|_0 x_i + \varepsilon,
\]
where $\varepsilon$ is a sum of terms of the form $x_i x_j h$ with $h \in C^\infty(U)$.
We set $\lambda_i = X(x_i) \in \mathbb{R}$. We first claim that $X(\varepsilon) = 0$. Indeed, we have
\[
  X(x_i x_j h) = x_i(0) X(x_j h) + (x_j h)(0) X(x_i) = 0.
\]
So we have
\[
  X(f) = \sum_{i=1}^n \lambda_i \left.\frac{\partial f}{\partial x_i}\right|_0.
\]
So we have
\[
  X = \sum_{i=1}^n \lambda_i \left.\frac{\partial}{\partial x_i}\right|_0.
\]
Given this definition of a tangent vector, we have a rather silly and tautological
definition of the derivative of a smooth function.
Definition (Derivative). Suppose $F \in C^\infty(M, N)$, say $F(p) = q$. We define $DF|_p: T_p M \to T_q N$ by
\[
  DF|_p(X)(g) = X(g \circ F)
\]
for $X \in T_p M$ and $g \in C^\infty(V)$ with $q \in V \subseteq N$.

This is a linear map called the derivative of $F$ at $p$.
(Diagram: $F: M \to N$, $g: N \to \mathbb{R}$, and the composite $g \circ F: M \to \mathbb{R}$.)
With a silly definition of a derivative comes a silly definition of the chain
rule.
Proposition (Chain rule). Let $M, N, P$ be manifolds, and $F \in C^\infty(M, N)$, $G \in C^\infty(N, P)$, and $p \in M$, $q = F(p)$. Then we have
\[
  D(G \circ F)|_p = DG|_q \circ DF|_p.
\]
Proof. Let $h \in C^\infty(P)$ and $X \in T_p M$. We have
\[
  DG|_q(DF|_p(X))(h) = DF|_p(X)(h \circ G) = X(h \circ G \circ F) = D(G \circ F)|_p(X)(h).
\]
Note that this does not provide a new, easy proof of the chain rule. Indeed,
to come this far into the course, we have used the actual chain rule something
like ten thousand times.
Corollary. If $F$ is a diffeomorphism, then $DF|_p$ is a linear isomorphism, and
\[
  (DF|_p)^{-1} = D(F^{-1})|_{F(p)}.
\]
In the special case where the domain is $\mathbb{R}$, there is a canonical choice of tangent vector at each point, namely $1 \in \mathbb{R}$.

Definition (Derivative). Let $\gamma: \mathbb{R} \to M$ be a smooth function. Then we write
\[
  \frac{\mathrm{d}\gamma}{\mathrm{d}t}(t) = \dot\gamma(t) = D\gamma|_t(1).
\]
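For example (in $M = \mathbb{R}^2$, chosen for concreteness), take the circle curve $\gamma(t) = (\cos t, \sin t)$. Unwinding the definition, for $f \in C^\infty(\mathbb{R}^2)$ we get

```latex
\dot\gamma(t)(f)
  = \frac{\mathrm{d}}{\mathrm{d}t} f(\gamma(t))
  = -\sin t \, \frac{\partial f}{\partial x_1}\bigg|_{\gamma(t)}
    + \cos t \, \frac{\partial f}{\partial x_2}\bigg|_{\gamma(t)},
```

so $\dot\gamma(t)$ corresponds to the usual velocity vector $(-\sin t, \cos t)$.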
We now go back to understanding what $T_p M$ is if $p \in M$. We let $p \in U$, where $(U, \varphi)$ is a chart. Then if $q = \varphi(p)$, the map $D\varphi|_p: T_p M \to T_q \mathbb{R}^n$ is a linear isomorphism.
Definition ($\frac{\partial}{\partial x_i}$). Given a chart $\varphi: U \to \mathbb{R}^n$ with $\varphi = (x_1, \cdots, x_n)$, we define
\[
  \left.\frac{\partial}{\partial x_i}\right|_p = (D\varphi|_p)^{-1}\left(\left.\frac{\partial}{\partial x_i}\right|_{\varphi(p)}\right) \in T_p M.
\]
So
\[
  \left.\frac{\partial}{\partial x_1}\right|_p, \cdots, \left.\frac{\partial}{\partial x_n}\right|_p
\]
is a basis for $T_p M$.
Recall that if $f: U \to \mathbb{R}$ is smooth, then we can write $f = f(x_1, \cdots, x_n)$. Then we have
\[
  \left.\frac{\partial}{\partial x_i}\right|_p (f) = \left.\frac{\partial f}{\partial x_i}\right|_{\varphi(p)}.
\]
So we have a consistent notation.
Now, how does this basis change when we change coordinates? Suppose we also have coordinates $y_1, \cdots, y_n$ near $p$ given by some other chart. We then have $\left.\frac{\partial}{\partial y_i}\right|_p \in T_p M$. So we have
\[
  \left.\frac{\partial}{\partial y_i}\right|_p = \sum_{j=1}^n \alpha_j \left.\frac{\partial}{\partial x_j}\right|_p
\]
for some $\alpha_j$. To figure out what they are, we apply them to the function $x_k$. So we have
\[
  \left.\frac{\partial}{\partial y_i}\right|_p (x_k) = \frac{\partial x_k}{\partial y_i}(p) = \alpha_k.
\]
So we obtain
\[
  \left.\frac{\partial}{\partial y_i}\right|_p = \sum_{j=1}^n \frac{\partial x_j}{\partial y_i}(p) \left.\frac{\partial}{\partial x_j}\right|_p.
\]
This is the usual change-of-coordinates formula!
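For instance (a standard illustration, not taken from the discussion above), consider Cartesian coordinates $(x, y)$ and polar coordinates $(r, \theta)$ on a suitable open subset of $\mathbb{R}^2 \setminus \{0\}$, related by $x = r\cos\theta$ and $y = r\sin\theta$. The formula gives

```latex
\begin{align*}
  \frac{\partial}{\partial r}\bigg|_p
    &= \frac{\partial x}{\partial r}(p)\, \frac{\partial}{\partial x}\bigg|_p
     + \frac{\partial y}{\partial r}(p)\, \frac{\partial}{\partial y}\bigg|_p
     = \cos\theta\, \frac{\partial}{\partial x}\bigg|_p
     + \sin\theta\, \frac{\partial}{\partial y}\bigg|_p,\\
  \frac{\partial}{\partial \theta}\bigg|_p
    &= -r\sin\theta\, \frac{\partial}{\partial x}\bigg|_p
     + r\cos\theta\, \frac{\partial}{\partial y}\bigg|_p.
\end{align*}
```

Note that $\frac{\partial}{\partial r}\big|_p$ is the unit radial direction, while $\frac{\partial}{\partial \theta}\big|_p$ has length $r$, as one would expect.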
Now let $F \in C^\infty(M, N)$, let $(U, \varphi)$ be a chart on $M$ containing $p$ with coordinates $x_1, \cdots, x_n$, and $(V, \xi)$ a chart on $N$ containing $q = F(p)$ with coordinates $y_1, \cdots, y_m$. By abuse of notation, we confuse $F$ and $\xi \circ F \circ \varphi^{-1}$. So we write
\[
  F = (F_1, \cdots, F_m) \quad\text{with}\quad F_i = F_i(x_1, \cdots, x_n): U \to \mathbb{R}.
\]
As before, we have a basis
\[
  \left.\frac{\partial}{\partial x_1}\right|_p, \cdots, \left.\frac{\partial}{\partial x_n}\right|_p \quad\text{for } T_p M,
\]
\[
  \left.\frac{\partial}{\partial y_1}\right|_q, \cdots, \left.\frac{\partial}{\partial y_m}\right|_q \quad\text{for } T_q N.
\]
Lemma. We have
\[
  DF|_p\left(\left.\frac{\partial}{\partial x_i}\right|_p\right) = \sum_{j=1}^m \frac{\partial F_j}{\partial x_i}(p) \left.\frac{\partial}{\partial y_j}\right|_q.
\]
In other words, $DF|_p$ has matrix representation
\[
  \left(\frac{\partial F_j}{\partial x_i}(p)\right)_{ij}.
\]
Proof. We let
\[
  DF|_p\left(\left.\frac{\partial}{\partial x_i}\right|_p\right) = \sum_{j=1}^m \lambda_j \left.\frac{\partial}{\partial y_j}\right|_q
\]
for some $\lambda_j$. We apply this to the local function $y_k$ to obtain
\begin{align*}
  \lambda_k &= \sum_{j=1}^m \lambda_j \left.\frac{\partial}{\partial y_j}\right|_q (y_k)\\
  &= DF|_p\left(\left.\frac{\partial}{\partial x_i}\right|_p\right)(y_k)\\
  &= \left.\frac{\partial}{\partial x_i}\right|_p (y_k \circ F)\\
  &= \left.\frac{\partial}{\partial x_i}\right|_p (F_k)\\
  &= \frac{\partial F_k}{\partial x_i}(p).
\end{align*}
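As a sanity check (with $F$ chosen purely for illustration), take $F: \mathbb{R}^2 \to \mathbb{R}^2$ given by $F(x_1, x_2) = (x_1^2 + x_2,\; x_1 x_2)$. The partial derivatives are $\frac{\partial F_1}{\partial x_1} = 2x_1$, $\frac{\partial F_1}{\partial x_2} = 1$, $\frac{\partial F_2}{\partial x_1} = x_2$, $\frac{\partial F_2}{\partial x_2} = x_1$, so

```latex
DF|_p\left(\frac{\partial}{\partial x_1}\bigg|_p\right)
  = 2 p_1 \frac{\partial}{\partial y_1}\bigg|_q + p_2 \frac{\partial}{\partial y_2}\bigg|_q,
\qquad
DF|_p = \begin{pmatrix} 2 p_1 & 1 \\ p_2 & p_1 \end{pmatrix},
```

which is just the familiar Jacobian matrix of $F$ at $p$.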
Example. Let $f \in C^\infty(U)$, where $U \subseteq M$ is an open set containing $p$. Then $Df|_p: T_p M \to T_{f(p)} \mathbb{R} \cong \mathbb{R}$ is a linear map. So $Df|_p$ is an element in the dual space $(T_p M)^*$, called the differential of $f$ at $p$, and is denoted $\mathrm{d}f|_p$. Then we have
\[
  \mathrm{d}f|_p(X) = X(f).
\]
(This can, e.g., be checked in local coordinates.)
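Concretely, if $(U, \varphi)$ is a chart with coordinates $x_1, \cdots, x_n$, then applying this identity to the basis vectors above gives

```latex
\mathrm{d}f|_p\left(\frac{\partial}{\partial x_i}\bigg|_p\right)
  = \frac{\partial}{\partial x_i}\bigg|_p (f)
  = \frac{\partial f}{\partial x_i}\bigg|_{\varphi(p)},
```

so with respect to the basis dual to the $\frac{\partial}{\partial x_i}\big|_p$, the differential $\mathrm{d}f|_p$ has components $\left(\frac{\partial f}{\partial x_1}, \cdots, \frac{\partial f}{\partial x_n}\right)$ at $p$.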