In the previous section, our solutions f(x,t) had the property that f(−,t)→f0 in L2. One might hope that, for example, if f0 is continuous, then this converges in C0. Another shortcoming of our previous approach is that we have no control over what Ht looks like as a function as the whole construction is manifestly global.
In this section, we will construct the heat kernel via an alternative method. This method involves starting with an initial “guess” of what the heat kernel should be. We then show that if our initial guess is “good enough”, then there is an iterative procedure that gives us the actual heat kernel. Moreover, this iterative procedure is sufficiently explicit that we can relate the asymptotic behaviour of Ht as t→0 back to that of our original guess.
We will now restrict to the case of the Laplace–Beltrami operator, since we can use the explicit solution for Rd as an inspiration for our initial guess. We fix a p and consider the Laplace–Beltrami operator on Ωp.
Lemma2.1
For any N, there exists a smooth section Kt(x,y)=Kt(y,x)T∈Γ(Ωp⊠(Ωp)∗) such that
Kt(x,y)→δ(x,y) as t→0.
More precisely,
t→0lim∫MKt(x,y)α(y)dy=α(x) in C0
for all continuous α∈Ωp(M).
There is a constant C and an exponent N such that
∣∣(∂t∂−Δx)Kt(x,y)∣∣≤CtN
for all x,y∈M.
For any other continuous section Ft(x,y)∈C0(Ωp⊠(Ωp)∗×[0,∞)), the convolution
(K∗F)t(x,y)=∫0tds∫MdzKt−s(x,z)Gs(z,y)
is a C2 section of Ωp⊠(Ωp)∗×(0,∞).
To motivate why one should care about the convolution, notice that Duhamel's principle essentially says that the heat kernel H satisfies
(dtd+Δx)H∗F=F
for all F. The condition that the convolution is C2 is a technical point that will be useful later on.
Proof
□
Write r(x,y) for the geodesic distance between x,y∈M. The very first naive guess for Kt(x,y) might just be
(4πt)d/21e−r2/4t.
This definitely satisfies (i). To see how good an attempt we made, we compute ∂t∂+Δx of this.
To do so, we fix a y∈M and express x in geodesic polar coordinates about y. Then there is a formula
If we were in flat space, then drdg=0, and so this is in fact a legitimate fundamental solution, as one would expect. However, when drdg=0, then we can't quite bound this as t→0.
To solve this, we introduce some correction terms, which are small enough to preserve the property (i). We shall find ui(x,y)∈Γ(Ωp⊠(Ωp)∗) with f0(x,x)=I such that
The precise shape of the right-hand side does not matter. It is just what happens to be convenient. The point is that the exponent of t is N−2d, which is eventually positive. Then we simply have to take K=KN for large enough N (note that this N is different from the exponent N in (ii) by 2d). Actually, we will only succeed in finding uN in a neighbourhood of the diagonal, since we work in geodesic polars. But we can simply take K=KNφ(x,y), where φ is a bump function supported near the diagonal. ((iii) is left as an exercise)
To find the ui, we assume KN is of the given form, and compute
The diligent reader will observe that the left-hand side is exact, so we can write this as
∇r∂r∂(rig1/4ui(x,y))=rig1/4Δxui−1(x,y).
The existence and uniqueness of a local solution then follows easily (one has to be slightly careful about r=0, where the original equation gives an initial condition iui=Δxui−1).
Proof
□
For future reference, we note that ∇r∂r∂=0 when r=0, so
Kt(x,x)=const⋅i=0∑Nti−d/2ui(x,x),
where ui(x,x) is a rational function in gij and its derivatives.
We will now fix an N large enough and take K to be the function given by the previous lemma.
To find the actual fundamental solution, we have to understand the convolution a bit better. Recall that we expect a genuine fundamental solution H to satisfy (dtd+Δx)(H∗F)=F. Using the fact that Kt(x,y)→δ(x,y) as t→0, we can calculate
Lemma2.2
(∂t∂+Δx)(K∗F)=F+[(∂t∂+Δx)K]∗F.
So at least formally, the heat kernel should be given by
H=m=0∑∞(−1)mK∗[(∂t∂+Δx)K]∗m.
Barring convergence issues, the formula above also shows that (dtd+Δx)H=0.
The terms in the denominator are the interesting bits that come from doing the bound more carefully. They ensure the sum above converges by the ratio test. This is a standard inductive calculation
Adding these up and using the ratio test, we find that
H=K+K∗m=1∑∞(−1)G∗m
converges, and we have the following theorem:
Theorem2.3
There is a fundamental solution Ht(x,y), and it satisfies
Ht(x,y)=Kt(x,y)+O(tN+1).
In particular, we have
Ht(x,y)→δ(x,y) as t→0.
From this, we can read off the leading term of Ht(x,x) as t→0, which is, unsurprisingly, (4πt)d/2e−r2/4t. As a recap, to produce this estimate, we used this as an initial approximation to the heat kernel, and then perform some procedures to turn this into an actual heat kernel. We then observe that all the correction terms we add are of higher order, so we are happy.
Note that all we used about Kt was the properties stated in Lemma 2.1, and the procedure is pretty general. One can interpret the success of this procedure as testifying to the locality of the heat kernel. The properties of Kt we required were always local conditions, namely that Kt looks locally like a heat kernel for small t. We then get to produce an actual heat kernel Ht, whose difference from Kt is something we can control very well, and in particular vanishes in the limit t→0.
In general, if our manifold is written as a union of two open submanifolds, and we have produced heat kernels on each submanifold (suitably interpreted), then we can attempt to glue them together using a partition of unity. This will not give us an actual heat kernel, but the failure to be a heat kernel decreases exponentially as t→0 (since Ht(x,y) falls exponentially as t→0 whenever x=y). The above argument then lets us produce a heat kernel whose difference from the “fake” one vanishes to all orders as t→0.