7The Riemann Integral
IA Analysis I
7.1 Riemann Integral
Definition (Dissections). Let [
a, b
] be a closed interval. A dissection of [
a, b
] is
a sequence a = x
0
< x
1
< x
2
< ··· < x
n
= b.
Definition (Upper and lower sums). Given a dissection
D
, the upper sum and
lower sum are defined by the formulae
U
D
(f) =
n
X
i=1
(x
i
− x
i−1
) sup
x∈[x
i−1
,x
i
]
f(x)
L
D
(f) =
n
X
i=1
(x
i
− x
i−1
) inf
x∈[x
i−1
,x
i
]
f(x)
Sometimes we use the shorthand
M
i
= sup
x∈[x
i−1
,x
i
]
f(x), m
i
= inf
x∈[x
i−1
−x
i
]
f(x).
The upper sum is the total area of the red rectangles, while the lower sum is
the total area of the black rectangles:
x
y
a x
1
x
2
x
3
···
x
i
x
i+1
···
···
b
Definition (Refining dissections). If
D
1
and
D
2
are dissections of [
a, b
], we say
that D
2
refines D
1
if every point of D
1
is a point of D
2
.
Lemma. If D
2
refines D
1
, then
U
D
2
f ≤ U
D
1
f and L
D
2
f
≥ L
D
1
f.
Using the picture above, this is because if we cut up the dissections into
smaller pieces, the red rectangles can only get chopped into shorter pieces and
the black rectangles can only get chopped into taller pieces.
x
y
x
0
x
1
x
y
x
0
x
1
x
2
x
3
Proof.
Let
D
be
x
0
< x
1
< ··· < x
n
. Let
D
2
be obtained from
D
1
by the
addition of one point z. If z ∈ (x
i−1
, x
i
), then
U
D
2
f −U
D
1
f =
"
(z − x
i−1
) sup
x∈[x
i−1
,z]
f(x)
#
+
"
(x
i
− z) sup
x∈[z,x
i
]
f(x)
#
− (x
i
− x
i−1
)M
i
.
But
sup
x∈[x
i−1
,z]
f
(
x
) and
sup
x∈[z,x
i
]
f
(
x
) are both at most
M
i
. So this is at
most M
i
(z − x
i−1
+ x
i
− z − (x
i
− x
i−1
)) = 0. So
U
D
2
f ≤ U
D
1
f.
By induction, the result is true whenever D
2
refines D
1
.
A very similar argument shows that L
D
2
f
≥ L
D
1
f.
Definition (Least common refinement). If
D
1
and
D
2
be dissections of [
a, b
].
Then the least common refinement of
D
1
and
D
2
is the dissection made out of
the points of D
1
and D
2
.
Corollary. Let D
1
and D
2
be two dissections of [a, b]. Then
U
D
1
f ≥ L
D
2
f.
Proof.
Let
D
be the least common refinement (or indeed any common refinement).
Then by lemma above (and by definition),
U
D
1
f ≥ U
D
f ≥ L
D
f ≥ L
D
2
f.
Finally, we can define the integral.
Definition (Upper, lower, and Riemann integral). The upper integral is
Z
b
a
f(x) dx = inf
D
U
D
f.
The lower integral is
Z
b
a
f(x) dx = sup
D
L
D
f.
If these are equal, then we call their common value the Riemann integral of
f
,
and is denoted
R
b
a
f(x) dx.
If this exists, we say f is Riemann integrable.
We will later prove the fundamental theorem of calculus, which says that
integration is the reverse of differentiation. But why don’t we simply define
integration as anti-differentiation, and prove that it is the area of the curve?
There are things that we cannot find (a closed form of) the anti-derivative of,
like
e
−x
2
. In these cases, we wouldn’t want to say the integral doesn’t exist — it
surely does according to this definition!
There is an immediate necessary condition for Riemann integrability — bound-
edness. If
f
is unbounded above in [
a, b
], then for any dissection
D
, there must
be some
i
such that
f
is unbounded on [
x
i−1
, x
i
]. So
M
i
=
∞
. So
U
D
f
=
∞
.
Similarly, if
f
is unbounded below, then
L
D
f
=
−∞
. So unbounded functions
are not Riemann integrable.
Example. Let
f
(
x
) =
x
on [
a, b
]. Intuitively, we know that the integral is
(
b
2
− a
2
)
/
2, and we will show this using the definition above. Let
D
=
x
0
<
x
1
< ··· < x
n
be a dissection. Then
U
D
f =
n
X
i=1
(x
i
− x
i−1
)x
i
We know that the integral is
b
2
−a
2
2
. So we put each term of the sum into the
form
x
2
i
−x
2
i−1
2
plus some error terms.
=
n
X
i=1
x
2
i
2
−
x
2
i−1
2
+
x
2
i
2
− x
i−1
x
i
+
x
2
i−1
2
=
1
2
n
X
i=1
(x
2
i
− x
2
i−1
+ (x
i
− x
i−1
)
2
)
=
1
2
(b
2
− a
2
) +
1
2
n
X
i=1
(x
i
− x
i−1
)
2
.
Definition (Mesh). The mesh of a dissection D is max
i
(x
i+1
− x
i
).
Then if the mesh is < δ, then
1
2
n
X
i=1
(x
i
− x
i−1
)
2
≤
δ
2
n
X
i=1
(x
i
− x
i−1
) =
δ
2
(b − a).
So by making δ small enough, we can show that for any ε > 0,
Z
b
a
x dx <
1
2
(b
2
− a
2
) + ε.
Similarly,
Z
b
a
x dx >
1
2
(b
2
− a
2
) − ε.
So
Z
b
a
x dx =
1
2
(b
2
− a
2
).
Example. Define f : [0, 1] → R by
f(x) =
(
1 x ∈ Q
0 x 6∈ Q
.
Let
x
0
< x
1
< ··· < x
n
be a dissection. Then for every
i
, we have
m
i
= 0 (since
there is an irrational in every interval), and
M
i
= 1 (since there is a rational in
every interval). So
U
D
f =
n
X
i=1
M
i
(x
i
− x
i−1
) =
n
X
i=1
(x
i
− x
i−1
) = 1.
Similarly, L
D
f = 0. Since D was arbitrary, we have
Z
1
0
f(x) dx = 1,
Z
1
0
f(x) dx = 0.
So f is not Riemann integrable.
Of course, this function is not interesting at all. The whole point of its
existence is to show undergraduates that there are some functions that are not
integrable!
Note that it is important to say that the function is not Riemann integrable.
There are other notions for integration in which this function is integrable. For
example, this function is Lebesgue-integrable.
Using the definition to show integrability is often cumbersome. Most of
the time, we use the Riemann’s integrability criterion, which follows rather
immediately from the definition, but is much nicer to work with.
Proposition (Riemann’s integrability criterion). This is sometimes known as
Cauchy’s integrability criterion.
Let
f
: [
a, b
]
→ R
. Then
f
is Riemann integrable if and only if for every
ε > 0, there exists a dissection D such that
U
D
− L
D
< ε.
Proof.
(
⇒
) Suppose that
f
is integrable. Then (by definition of Riemann
integrability), there exist D
1
and D
2
such that
U
D
1
<
Z
b
a
f(x) dx +
ε
2
,
and
L
D
2
>
Z
b
a
f(x) dx −
ε
2
.
Let D be a common refinement of D
1
and D
2
. Then
U
D
f −L
D
f ≤ U
D
1
f −L
D
2
f < ε.
(⇐) Conversely, if there exists D such that
U
D
f −L
D
f < ε,
then
inf U
D
f −sup L
D
f < ε,
which is, by definition, that
Z
b
a
f(x) dx −
Z
b
a
f(x) dx < ε.
Since ε > 0 is arbitrary, this gives us that
Z
b
a
f(x) dx =
Z
b
a
f(x) dx.
So f is integrable.
The next big result we want to prove is that integration is linear, ie
Z
b
a
(λf(x) + µg(x)) dx = λ
Z
b
a
f(x) dx + µ
Z
b
a
g(x) dx.
We do this step by step:
Proposition. Let
f
: [
a, b
]
→ R
be integrable, and
λ ≥
0. Then
λf
is integrable,
and
Z
b
a
λf(x) dx = λ
Z
b
a
f(x) dx.
Proof. Let D be a dissection of [a, b]. Since
sup
x∈[x
i−1
,x
i
]
λf(x) = λ sup
x∈[x
i−1
,x
i
]
f(x),
and similarly for inf, we have
U
D
(λf) = λU
D
f
L
D
(λf) = λL
D
f.
So if we choose
D
such that
U
D
f − L
D
f < ε/λ
, then
U
D
(
λf
)
−L
D
(
λf
)
< ε
. So
the result follows from Riemann’s integrability criterion.
Proposition. Let f : [a, b] → R be integrable. Then −f is integrable, and
Z
b
a
−f(x) dx = −
Z
b
a
f(x) dx.
Proof. Let D be a dissection. Then
sup
x∈[x
i−1
,x
i
]
−f(x) = − inf
x∈[x
i−1
,x
i
]
f(x)
inf
x∈[x
i−1
,x
i
]
−f(x) = − sup
x∈[x
i−1
,x
i
]
f(x).
Therefore
U
D
(−f) =
n
X
i=1
(x
i
− x
i−1
)(−m
i
) = −L
D
(f).
Similarly,
L
D
(−f) = −U
D
f.
So
U
D
(−f) − L
D
(−f) = U
D
f −L
D
f.
Hence if
f
is integrable, then
−f
is integrable by the Riemann integrability
criterion.
Proposition. Let
f, g
: [
a, b
]
→ R
be integrable. Then
f
+
g
is integrable, and
Z
b
a
(f(x) + g(x)) dx =
Z
b
a
f(x) dx +
Z
b
a
g(x) dx.
Proof. Let D be a dissection. Then
U
D
(f + g) =
n
X
i=1
(x
i
− x
i−1
) sup
x∈[x
i−1
,x
i
]
(f(x) + g(x))
≤
n
X
i=1
(x
i
− x
i−1
)
sup
u∈[x
i−1
,x
i
]
f(u) + sup
v∈[x
i−1
,x
i
]
g(v)
!
= U
D
f + U
D
g
Therefore,
Z
b
a
(f(x) + g(x)) dx ≤
Z
b
a
f(x) dx +
Z
b
a
g(x) dx =
Z
b
a
f(x) dx +
Z
b
a
g(x) dx.
Similarly,
Z
b
a
(f(x) + g(x)) dx ≥
Z
b
a
f(x) dx +
Z
b
a
g(x) dx.
So the upper and lower integrals are equal, and the result follows.
So we now have that
Z
b
a
(λf(x) + µg(x)) dx = λ
Z
b
a
f(x) dx + µ
Z
b
a
g(x) dx.
We will prove more “obvious” results.
Proposition. Let
f, g
: [
a, b
]
→ R
be integrable, and suppose that
f
(
x
)
≤ g
(
x
)
for every x. Then
Z
b
a
f(x) dx ≤
Z
b
a
g(x) dx.
Proof. Follows immediately from the definition.
Proposition. Let f : [a, b] → R be integrable. Then |f| is integrable.
Proof. Note that we can write
sup
x∈[x
i−1
,x
i
]
f(x) − inf
x∈[x
i−1
,x
i
]
f(x) = sup
u,v∈[x
i−1
,x
i
]
|f(u) − f(v)|.
Similarly,
sup
x∈[x
i−1
,x
i
]
|f(x)| − inf
x∈[x
i−1
,x
i
]
|f(x)| = sup
u,v∈[x
i−1
,x
i
]
||f(u)| − |f(v)||.
For any pair of real numbers,
x, y
, we have that
||x| − |y|| ≤ |x − y|
by the
triangle inequality. Then for any interval u, v ∈ [x
i−1
, x
i
], we have
||f(u)| − |f(v)|| ≤ |f(u) − f(v)|.
Hence we have
sup
x∈[x
i−1
,x
i
]
|f(x)| − inf
x∈[x
i−1
,x
i
]
|f(x)| ≤ sup
x∈[x
i−1
,x
i
]
f(x) − inf
x∈[x
i−1
,x
i
]
f(x).
So for any dissection D, we have
U
D
(|f|) − L
D
(|f|) ≤ U
D
(f) − L
D
(f).
So the result follows from Riemann’s integrability criterion.
Combining these two propositions, we get that if
|f(x) − g(x)| ≤ C,
for every x ∈ [a, b], then
Z
b
a
f(x) dx −
Z
b
a
g(x) dx
≤ C(b − a).
Proposition (Additivity property). Let
f
: [
a, c
]
→ R
be integrable, and let
b ∈
(
a, c
). Then the restrictions of
f
to [
a, b
] and [
b, c
] are Riemann integrable,
and
Z
b
a
f(x) dx +
Z
c
b
f(x) dx =
Z
c
a
f(x) dx
Similarly, if
f
is integrable on [
a, b
] and [
b, c
], then it is integrable on [
a, c
] and
the above equation also holds.
Proof.
Let
ε >
0, and let
a
=
x
0
< x
1
< ··· < x
n
=
c
be a dissection of
D
of
[a, c] such that
U
D
(f) ≤
Z
c
a
f(x) dx + ε,
and
L
D
(f) ≥
Z
c
a
f(x) dx − ε.
Let
D
0
be the dissection made of
D
plus the point
b
. Let
D
1
be the dissection of
[
a, b
] made of points of
D
0
from
a
to
b
, and
D
2
be the dissection of [
b, c
] made of
points of D
0
from b to c. Then
U
D
1
(f) + U
D
2
(f) = U
D
0
(f) ≤ U
D
(f),
and
L
D
1
(f) + L
D
2
(f) = L
D
0
(f) ≥ L
D
(f).
Since
U
D
(
f
)
− L
D
(
f
)
<
2
ε
, and both
U
D
2
(
f
)
− L
D
2
(
f
) and
U
D
1
(
f
)
− L
D
1
(
f
)
are non-negative, we have
U
D
1
(
f
)
− L
D
1
(
f
) and
U
D
2
(
f
)
− L
D
2
(
f
) are less than
2
ε
. Since
ε
is arbitrary, it follows that the restrictions of
f
to [
a, b
] and [
b, c
] are
both Riemann integrable. Furthermore,
Z
b
a
f(x) dx +
Z
c
b
f(x) dx ≤ U
D
1
(f) + U
D
2
(f) = U
D
0
(f) ≤ U
D
(f)
≤
Z
c
a
f(x) dx + ε.
Similarly,
Z
b
a
f(x) dx +
Z
c
b
f(x) dx ≥ L
D
1
(f) + L
D
2
(f) = L
D
0
(f) ≥ L
D
(f)
≥
Z
c
a
f(x) dx − ε.
Since ε is arbitrary, it follows that
Z
b
a
f(x) dx +
Z
c
b
f(x) dx =
Z
c
a
f(x) dx.
The other direction is left as an (easy) exercise.
Proposition. Let f, g : [a, b] → R be integrable. Then fg is integrable.
Proof.
Let
C
be such that
|f
(
x
)
|, |g
(
x
)
| ≤ C
for every
x ∈
[
a, b
]. Write
L
i
and
`
i
for the
sup
and
inf
of
g
in [
x
i−1
, x
i
]. Now let
D
be a dissection, and for each
i, let u
i
and v
i
be two points in [x
i−1
, x
i
].
We will pretend that
u
i
and
v
i
are the minimum and maximum when we
write the proof, but we cannot assert that they are, since
fg
need not have
maxima and minima. We will then note that since our results hold for arbitrary
u
i
and v
i
, it must hold when f g is at its supremum and infimum.
We find what we pretend is the difference between the upper and lower sum:
n
X
i=1
x
i
− x
i−1
)(f(v
i
)g(v
i
) − f(u
i
)g(u
i
)
=
n
X
i=1
(x
i
− x
i−1
)
f(v
i
)(g(v
i
) − g(u
i
)) + (f(v
i
) − f(u
i
))g(u
i
)
≤
n
X
i=1
C(L
i
− `
i
) + (M
i
− m
i
)C
= C(U
D
g −L
D
g + U
D
f −L
D
f).
Since u
i
and v
i
are arbitrary, it follows that
U
D
(fg) − L
D
(fg) ≤ C(U
D
f −L
D
f + U
D
g −L
D
g).
Since
C
is fixed, and we can get
U
D
f − L
D
f
and
U
D
g − L
D
g
arbitrary small
(since
f
and
g
are integrable), we can get
U
D
(
fg
)
−L
D
(
fg
) arbitrarily small. So
the result follows.
Theorem. Every continuous function
f
on a closed bounded interval [
a, b
] is
Riemann integrable.
Proof. wlog assume [a, b] = [0, 1].
Suppose the contrary. Let
f
be non-integrable. This means that there exists
some
ε
such that for every dissection
D
,
U
D
− L
D
> ε
. In particular, for every
n, let D
n
be the dissection 0,
1
n
,
2
n
, ··· ,
n
n
.
Since
U
D
n
− L
D
n
> ε
, there exists some interval
k
n
,
k+1
n
in which
sup f −
inf f > ε
. Suppose the supremum and infimum are attained at
x
n
and
y
n
respectively. Then we have |x
n
− y
n
| <
1
n
and f(x
n
) − f(y
n
) > ε.
By Bolzano Weierstrass, (
x
n
) has a convergent subsequence, say (
x
n
i
). Say
x
n
i
→ x
. Since
|x
n
− y
n
| <
1
n
→
0, we must have
y
n
i
→ x
. By continuity, we
must have
f
(
x
n
i
)
→ f
(
x
) and
f
(
y
n
i
)
→ f
(
x
), but
f
(
x
n
i
) and
f
(
y
n
i
) are always
apart by ε. Contradiction.
With this result, we know that a lot of things are integrable, e.g. e
−x
2
.
To prove this, we secretly used the property of uniform continuity:
Definition (Uniform continuity*). Let
A ⊆ R
and let
f
:
A → R
. Then
f
is
uniformly continuous if
(∀ε)(∃δ > 0)(∀x)(∀y) |x − y| < δ ⇒ |f(x) −f(y)| ≤ ε.
This is different from regular continuity. Regular continuity says that at any
point
x
, we can find a
δ
that works for this point. Uniform continuity says that
we can find a δ that works for any x.
It is easy to show that a uniformly continuous function is integrable, since by
uniformly continuity, as long as the mesh of a dissection is sufficiently small, the
difference between the upper sum and the lower sum can be arbitrarily small by
uniform continuity. Thus to prove the above theorem, we just have to show that
continuous functions on a closed bounded interval are uniformly continuous.
Theorem (non-examinable). Let
a < b
and let
f
: [
a, b
]
→ R
be continuous.
Then f is uniformly continuous.
Proof. Suppose that f is not uniformly continuous. Then
(∃ε)(∀δ > 0)(∃x)(∃y) |x − y| < δ and |f(x) − f(y)| ≥ ε.
Therefore, we can find sequences (x
n
), (y
n
) such that for every n, we have
|x
n
− y
n
| ≤
1
n
and |f(x
n
) − f(y
n
)| ≥ ε.
Then by Bolzano-Weierstrass theorem, we can find a subsequence (
x
n
k
) converg-
ing to some
x
. Since
|x
n
k
−y
n
k
| ≤
1
n
k
,
y
n
k
→ x
as well. But
|f
(
x
n
k
)
−f
(
y
n
k
)
| ≥ ε
for every
k
. So
f
(
x
n
k
) and
f
(
y
n
k
) cannot both converge to the same limit. So
f
is not continuous at x.
This proof is very similar to the proof that continuous functions are integrable.
In fact, the proof that continuous functions are integrable is just a fuse of this
proof and the (simple) proof that uniformly continuously functions are integrable.
Theorem. Let f : [a, b] → R be monotone. Then f is Riemann integrable.
Note that monotone functions need not be “nice”. It can even have in-
finitely many discontinuities. For example, if
f
: [0
,
1]
→ R
maps
x
to the
1/(first non-zero digit in the binary expansion of x), with f (0) = 0.
Proof. let ε > 0. Let D be a dissection of mesh less than
ε
f(b)−f (a)
. Then
U
D
f −L
D
f =
n
X
i=1
(x
i
− x
i−1
)(f(x
i
) − f(x
i−1
))
≤
ε
f(b) − f(a)
n
X
i=1
(f(x
i
) − f(x
i−1
))
= ε.
Pictorially, we see that the difference between the upper and lower sums is
total the area of the red rectangles.
x
y
To calculate the total area, we can stack the red areas together to get something
of width
ε
f(b)−f (a)
and height f(b) − f(a). So the total area is just ε.
Lemma. Let
a < b
and let
f
be a bounded function from [
a, b
]
→ R
that is
continuous on (a, b). Then f is integrable.
An example where this would apply is
R
1
0
sin
1
x
. It gets nasty near
x
= 0, but
its “nastiness” is confined to
x
= 0 only. So as long as its nastiness is sufficiently
contained, it would still be integrable.
The idea of the proof is to integrate from a point
x
1
very near
a
up to a
point
x
n−1
very close to
b
. Since
f
is bounded, the regions [
a, x
1
] and [
x
n−1
, b
]
are small enough to not cause trouble.
Proof.
Let
ε >
0. Suppose that
|f
(
x
)
| ≤ C
for every
x ∈
[
a, b
]. Let
x
0
=
a
and
pick
x
1
such that
x
1
− x
0
<
ε
8C
. Also choose
z
between
x
1
and
b
such that
b − z <
ε
8C
.
Then
f
is continuous [
x
1
, z
]. Therefore it is integrable on [
x
1
, z
]. So we can
find a dissection D
0
with points x
1
< x
2
< ··· < x
n−1
= z such that
U
D
0
f −L
D
0
f <
ε
2
.
Let D be the dissection a = x
0
< x
1
< ··· < x
n
= b. Then
U
D
f −L
D
f <
ε
8C
· 2C +
ε
2
+
ε
8C
· 2C = ε.
So done by Riemann integrability criterion.
Example.
– f (x) =
(
sin
1
x
x 6= 0
0 x = 0
defined on [−1, 1] is integrable.
– g(x) =
(
x x ≤ 1
x
2
+ 1 x > 1
defined on [0, 1] is integrable.
Corollary. Every piecewise continuous and bounded function on [
a, b
] is inte-
grable.
Proof.
Partition [
a, b
] into intervals
I
1
, ··· , I
k
, on each of which
f
is (bounded
and) continuous. Hence for every
I
j
with end points
x
j−1
,
x
j
,
f
is integrable on
[
x
j−1
, x
j
] (which may not equal
I
j
, e.g.
I
j
could be [
x
j−1
, x
j
)). But then by the
additivity property of integration, we get that f is integrable on [a, b]
We defined Riemann integration in a very general way — we allowed arbitrary
dissections, and took the extrema over all possible dissection. Is it possible to
just consider some particular nice dissections instead? Perhaps unsurprisingly,
yes! It’s just that we opt to define it the general way so that we can easily talk
about things like least common refinements.
Lemma. Let
f
: [
a, b
]
→ R
be Riemann integrable, and for each
n
, let
D
n
be
the dissection
a
=
x
0
< x
1
< ··· < x
n
=
b
, where
x
i
=
a
+
i(b−a)
n
for each
i
.
Then
U
D
n
f →
Z
b
a
f(x) dx
and
L
D
n
f →
Z
b
a
f(x) dx.
Proof.
Let
ε >
0. We need to find an
N
. The only thing we know is that
f
is
Riemann integrable, so we use it:
Since
f
is integrable, there is a dissection
D
, say
u
0
< u
1
< ··· < u
m
, such
that
U
D
f −
Z
b
a
f(x) dx <
ε
2
.
We also know that f is bounded. Let C be such that |f(x)| ≤ C.
For any n, let D
0
be the least common refinement of D
n
and D. Then
U
D
0
f ≤ U
D
f.
Also, the sums
U
D
n
f
and
U
D
0
f
are the same, except that at most
m
of the
subintervals [x
i−1
, x
i
] are subdivided in D
0
.
For each interval that gets chopped up, the upper sum decreases by at most
b−a
n
· 2C. Therefore
U
D
n
f −U
D
0
f ≤
b − a
n
2C ·m.
Pick n such that 2Cm(b − a)/n <
ε
2
. Then
U
D
n
f −U
D
f <
ε
2
.
So
U
D
n
f −
Z
b
a
f(x) dx < ε.
This is true whenever
n >
4C(b−a)m
ε
. Since we also have
U
D
n
f ≥
R
b
a
f
(
x
) d
x
,
therefore
U
D
n
f →
Z
b
a
f(x) dx.
The proof for lower sums is similar.
For convenience, we define the following:
Notation. If b > a, we define
Z
a
b
f(x) dx = −
Z
b
a
f(x) dx.
We now prove that the fundamental theorem of calculus, which says that
integration is the reverse of differentiation.
Theorem (Fundamental theorem of calculus, part 1). Let
f
: [
a, b
]
→ R
be
continuous, and for x ∈ [a, b], define
F (x) =
Z
x
a
f(t) dt.
Then F is differentiable and F
0
(x) = f(x) for every x.
Proof.
F (x + h) − F (x)
h
=
1
h
Z
x+h
x
f(t) dt
Let
ε >
0. Since
f
is continuous, at
x
, then there exists
δ
such that
|y −x| < δ
implies |f(y) − f(x)| < ε.
If |h| < δ, then
1
h
Z
x+h
x
f(t) dt − f(x)
=
1
h
Z
x+h
x
(f(t) − f(x)) dt
≤
1
|h|
Z
x+h
x
|f(t) − f(x)| dt
≤
ε|h|
|h|
= ε.
Corollary. If f is continuously differentiable on [a, b], then
Z
b
a
f
0
(t) dt = f(b) − f(a).
Proof. Let
g(x) =
Z
x
a
f
0
(t) dt.
Then
g
0
(x) = f
0
(x) =
d
dx
(f(x) − f(a)).
Since
g
0
(
x
)
− f
0
(
x
) = 0,
g
(
x
)
− f
(
x
) must be a constant function by the mean
value theorem. We also know that
g(a) = 0 = f(a) − f(a)
So we must have
g
(
x
) =
f
(
x
)
−f
(
a
) for every
x
, and in particular, for
x
=
b
.
Theorem (Fundamental theorem of calculus, part 2). Let
f
: [
a, b
]
→ R
be a
differentiable function, and suppose that f
0
is integrable. Then
Z
b
a
f
0
(t) dt = f(b) − f(a).
Note that this is a stronger result than the corollary above, since it does not
require that f
0
is continuous.
Proof.
Let
D
be a dissection
x
0
< x
1
< ··· < x
n
. We want to make use of this
dissection. So write
f(b) − f(a) =
n
X
i=1
(f(x
i
) − f(x
i−1
)).
For each
i
, there exists
u
i
∈
(
x
i−1
, x
i
) such that
f
(
x
i
)
− f
(
x
i−1j
) = (
x
i
−
x
i−1
)f
0
(u
i
) by the mean value theorem. So
f(b) − f(a) =
n
X
i=1
(x
i
− x
i−1
)f
0
(u
i
).
We know that
f
0
(
u
i
) is somewhere between
sup
x∈[x
i
,x
i−1
]
f
0
(
x
) and
inf
x∈[x
i
,x
i−1
]
f
0
(
x
)
by definition. Therefore
L
D
f
0
≤ f(b) − f(a) ≤ U
D
f
0
.
Since
f
0
is integrable and
D
was arbitrary,
L
D
f
0
and
U
D
f
0
can both get arbitrarily
close to
R
b
a
f
0
(t) dt. So
f(b) − f(a) =
Z
b
a
f
0
(t) dt.
Note that the condition that
f
0
is integrable is essential. It is possible to find
a differentiable function whose derivative is not integrable! You will be asked to
find it in the example sheet.
Using the fundamental theorem of calculus, we can easily prove integration
by parts:
Theorem (Integration by parts). Let
f, g
: [
a, b
]
→ R
be integrable such that
everything below exists. Then
Z
b
a
f(x)g
0
(x) dx = f(b)g(b) − f (a)g(a) −
Z
b
a
f
0
(x)g(x) dx.
Proof. By the fundamental theorem of calculus,
Z
b
a
(f(x)g
0
(x) + f
0
(x)g(x)) dx =
Z
b
a
(fg)
0
(x) dx = f(b)g(b) − f (a)g(a).
The result follows after rearrangement.
Recall that when we first had Taylor’s theorem, we said it had the Lagrange
form of the remainder. There are many other forms of the remainder term. Here
we will look at the integral form:
Theorem (Taylor’s theorem with the integral form of the remainder). Let
f
be
n + 1 times differentiable on [a, b] with with f
(n+1)
continuous. Then
f(b) = f(a) + (b − a)f
0
(a) +
(b − a)
2
2!
f
(2)
(a) + ···
+
(b − a)
n
n!
f
(n)
(a) +
Z
b
a
(b − t)
n
n!
f
(n+1)
(t) dt.
Proof. Induction on n.
When n = 0, the theorem says
f(b) − f(a) =
Z
b
a
f
0
(t) dt.
which is true by the fundamental theorem of calculus.
Now observe that
Z
b
a
(b − t)
n
n!
f
(n+1)
(t) dt =
−(b − t)
n+1
(n + 1)!
f
(n+1)
(t)
b
a
+
Z
b
a
(b − t)
n+1
(n + 1)!
f
(n+1)
(t) dt
=
(b − a)
n+1
(n + 1)!
f
(n+1)
(a) +
Z
b
a
(b − t)
n+1
(n + 1)!
f
(n+2)
(t) dt.
So the result follows by induction.
Note that the form of the integral remainder is rather weird and unexpected.
How could we have come up with it? We might start with the fundamental
theorem of algebra and integrate by parts. The first attempt would be to
integrate 1 to t and differentiate f
0
(t) to f
(2)
(t). So we have
f(b) = f(a) +
Z
b
a
f
0
(t) dt
= f(a) + [tf
0
(t)]
b
a
−
Z
b
a
tf
(2)
(t) dt
= f(a) + bf
0
(b) − af
0
(a) −
Z
b
a
tf
(2)
(t) dt
We want something in the form (
b − a
)
f
0
(
a
), so we take that out and see what
we are left with.
= f(a) + (b − a)f
0
(a) + b(f
0
(b) − f
0
(a)) −
Z
b
a
tf
(2)
(t) dt
Then we note that f
0
(b) − f
0
(a) =
R
b
a
f
(2)
(t) dt. So we have
= f(a) + (b − a)f
0
(a) +
Z
b
a
(b − t)f
(2)
(t) dt.
Then we can see that the right thing to integrate is (
b −t
) and continue to obtain
the result.
Theorem (Integration by substitution). Let
f
: [
a, b
]
→ R
be continuous. Let
g
: [
u, v
]
→ R
be continuously differentiable, and suppose that
g
(
u
) =
a, g
(
v
) =
b
,
and f is defined everywhere on g([u, v]) (and still continuous). Then
Z
b
a
f(x) dx =
Z
v
u
f(g(t))g
0
(t) dt.
Proof.
By the fundamental theorem of calculus,
f
has an anti-derivative
F
defined on g([u, v]). Then
Z
v
u
f(g(t))g
0
(t) dt =
Z
v
u
F
0
(g(t))g
0
(t) dt
=
Z
v
u
(F ◦ g)
0
(t) dt
= F ◦ g(v) − F ◦ g(u)
= F (b) − F (a)
=
Z
b
a
f(x) dx.
We can think of “integration by parts” as what you get by integrating the
product rule, and “integration by substitution” as what you get by integrating
the chain rule.