Part IB — Metric and Topological Spaces
Based on lectures by J. Rasmussen
Notes taken by Dexter Chua
Easter 2015
These notes are not endorsed by the lecturers, and I have modified them (often
significantly) after lectures. They are nowhere near accurate representations of what
was actually lectured, and in particular, all errors are almost surely mine.
Metrics
Definition and examples. Limits and continuity. Open sets and neighbourhoods.
Characterizing limits and continuity using neighbourhoods and open sets. [3]
Topology
Definition of a topology. Metric topologies. Further examples. Neighbourhoods, closed
sets, convergence and continuity. Hausdorff spaces. Homeomorphisms. Topologi-
cal and non-topological properties. Completeness. Subspace, quotient and product
top ologies. [3]
Connectedness
Definition using open sets and integer-valued functions. Examples, including inter-
vals. Components. The continuous image of a connected space is connected. Path-
connectedness. Path-connected spaces are connected but not conversely. Connected
op en sets in Euclidean space are path-connected. [3]
Compactness
Definition using open covers. Examples: finite sets and [0, 1]. Closed subsets of
compact spaces are compact. Compact subsets of a Hausdorff space must b e closed.
The compact subsets of the real line. Continuous images of compact sets are compact.
Quotient spaces. Continuous real-valued functions on a compact space are bounded
and attain their bounds. The product of two compact spaces is compact. The compact
subsets of Euclidean space. Sequential compactness. [3]
Contents
0 Introduction
1 Metric spaces
1.1 Definitions
1.2 Examples of metric spaces
1.3 Norms
1.4 Open and closed subsets
2 Top ological spaces
2.1 Definitions
2.2 Sequences
2.3 Closed sets
2.4 Closure and interior
2.4.1 Closure
2.4.2 Interior
2.5 New topologies from old
2.5.1 Subspace topology
2.5.2 Product topology
2.5.3 Quotient topology
3 Connectivity
3.1 Connectivity
3.2 Path connectivity
3.2.1 Higher connectivity*
3.3 Components
3.3.1 Path components
3.3.2 Connected components
4 Compactness
4.1 Compactness
4.2 Products and quotients
4.2.1 Products
4.2.2 Quotients
4.3 Sequential compactness
4.4 Completeness
0 Introduction
The course Metric and Topological Spaces is divided into two parts. The first is
on metric spaces, and the second is on topological spaces (duh).
In Analysis, we studied real numbers a lot. We defined many properties such
as convergence of sequences and continuity of functions. For example, if (
x
n
) is
a sequence in R, x
n
→ x means
(∀ε > 0)(∃N)(∀n > N) |x
n
− x| < ε.
Similarly, a function f is continuous at x
0
if
(∀ε > 0)(∃δ > 0)(∀x) |x − x
0
| < δ ⇒ |f(x) − f(x
0
)| < ε.
However, the definition of convergence doesn’t really rely on
x
n
being real
numbers, except when calculating values of
|x
n
− x|
. But what does
|x
n
− x|
really mean? It is the distance between
x
n
and
x
. To define convergence, we
don’t really need notions like subtraction and absolute values. We simply need a
(sensible) notion of distance between points.
Given a set
X
, we can define a metric (“distance function”)
d
:
X × X → R
,
where
d
(
x, y
) is the distance between the points
x
and
y
. Then we say a sequence
(x
n
) in X converges to x if
(∀ε > 0)(∃N)(∀n > N) d(x
n
, x) < ε.
Similarly, a function f : X → X is continuous if
(∀ε > 0)(∃δ > 0)(∀x) d(x, x
0
) < δ ⇒ d(f(x), f(x
0
)) < ε.
Of course, we will need the metric
d
to satisfy certain conditions such as being
non-negative, and we will explore these technical details in the first part of the
course.
As people studied metric spaces, it soon became evident that metrics are not
really that useful a notion. Given a set
X
, it is possible to find many different
metrics that completely agree on which sequences converge and what functions
are continuous.
It turns out that it is not the metric that determines (most of) the properties
of the space. Instead, it is the open sets induced by the metric (intuitively, an
open set is a subset of
X
without “boundary”, like an open interval). Metrics
that induce the same open sets will have almost identical properties (apart from
the actual distance itself).
The idea of a topological space is to just keep the notion of open sets and
abandon metric spaces, and this turns out to be a really good idea. The second
part of the course is the study of these topological spaces and defining a lot of
interesting properties just in terms of open sets.
1 Metric spaces
1.1 Definitions
As mentioned in the introduction, given a set
X
, it is often helpful to have a
notion of distance between points. This distance function is known as the metric.
Definition (Metric space). A metric space is a pair (
X, d
X
) where
X
is a set
(the space) and
d
X
is a function
d
X
:
X × X → R
(the metric) such that for all
x, y, z,
– d
X
(x, y) ≥ 0 (non-negativity)
– d
X
(x, y) = 0 iff x = y (identity of indiscernibles)
– d
X
(x, y) = d
X
(y, x) (symmetry)
– d
X
(x, z) ≤ d
X
(x, y) + d
X
(y, z) (triangle inequality)
We will have two quick examples of metrics before going into other important
definitions. We will come to more examples afterwards.
Example.
– (Euclidean “usual” metric) Let X = R
n
. Let
d(v, w) = |v − w| =
v
u
u
t
n
X
i=1
(v
i
− w
i
)
2
.
This is the usual notion of distance we have in the
R
n
vector space. It is
not difficult to show that this is indeed a metric (the fourth axiom follows
from the Cauchy-Schwarz inequality).
– (Discrete metric) Let X be a set, and
d
X
(x, y) =
(
1 x 6= y
0 x = y
To show this is indeed a metric, we have to show it satisfies all the axioms.
The first three axioms are trivially satisfied. How about the fourth? We
can prove this by exhaustion.
Since the distance function can only return 0 or 1,
d
(
x, z
) can be 0 or 1,
while
d
(
x, y
) +
d
(
y, z
) can be 0, 1 or 2. For the fourth axiom to fail, we
must have RHS
<
LHS. This can only happen if the right hand side is 0.
But for the right hand side to be 0, we must have
x
=
y
=
z
. So the left
hand side is also 0. So the fourth axiom is always satisfied.
Given a metric space (
X, d
), we can generate another metric space by picking
out a subset of X and reusing the same metric.
Definition (Metric subspace). Let (
X, d
X
) be a metric space, and
Y ⊆ X
.
Then (
Y, d
Y
) is a metric space, where
d
Y
(
a, b
) =
d
X
(
a, b
), and said to be a
subspace of X.
Example.
S
n
=
{
v
∈ R
n+1
:
|
v
|
= 1
}
, the
n
-dimensional sphere, is a subspace
of R
n+1
.
Finally, as promised, we come to the definition of convergent sequences and
continuous functions.
Definition (Convergent sequences). Let (
x
n
) be a sequence in a metric space
(
X, d
X
). We say (
x
n
) converges to
x ∈ X
, written
x
n
→ x
, if
d
(
x
n
, x
)
→
0 (as a
real sequence). Equivalently,
(∀ε > 0)(∃N )(∀n > N ) d(x
n
, x) < ε.
Example.
–
Let (v
n
) be a sequence in
R
k
with the Euclidean metric. Write v
n
=
(
v
1
n
, ··· , v
k
n
), and v = (
v
1
, ··· , v
k
)
∈ R
k
. Then v
n
→
v iff (
v
i
n
)
→ v
i
for
all i.
–
Let
X
have the discrete metric, and suppose
x
n
→ x
. Pick
ε
=
1
2
.
Then there is some
N
such that
d
(
x
n
, x
)
<
1
2
whenever
n > N
. But if
d
(
x
n
, x
)
<
1
2
, we must have
d
(
x
n
, x
) = 0. So
x
n
=
x
. Hence if
x
n
→ x
,
then eventually all x
n
are equal to x.
Similar to what we did in Analysis, we can show that limits are unique (if
exist).
Proposition. If (
X, d
) is a metric space, (
x
n
) is a sequence in
X
such that
x
n
→ x, x
n
→ x
0
, then x = x
0
.
Proof.
For any
ε >
0, we know that there exists
N
such that
d
(
x
n
, x
)
< ε/
2 if
n > N . Similarly, there exists some N
0
such that d(x
n
, x
0
) < ε/2 if n > N
0
.
Hence if n > max(N, N
0
), then
0 ≤ d(x, x
0
)
≤ d(x, x
n
) + d(x
n
, x
0
)
= d(x
n
, x) + d(x
n
, x
0
)
≤ ε.
So 0 ≤ d(x, x
0
) ≤ ε for all ε > 0. So d(x, x
0
) = 0, and x = x
0
.
Note that to prove the above proposition, we used all of the four axioms. In
the first line, we used non-negativity to say 0
≤ d
(
x, x
0
). In the second line, we
used triangle inequality. In the third line, we used symmetry to swap
d
(
x, x
n
)
with
d
(
x
n
, x
). Finally, we used the identity of indiscernibles to conclude that
x = x
0
.
To define a continuous function, here, we opt to use the sequence definition.
We will later show that this is indeed equivalent to the
ε − δ
definition, as well
as a few other more useful definitions.
Definition (Continuous function). Let (
X, d
X
) and (
Y, d
Y
) be metric spaces,
and
f
:
X → Y
. We say
f
is continuous if
f
(
x
n
)
→ f
(
x
) (in
Y
) whenever
x
n
→ x (in X).
Example. Let
X
=
R
with the Euclidean metric. Let
Y
=
R
with the discrete
metric. Then
f
:
X → Y
that maps
f
(
x
) =
x
is not continuous. This is since
1/n → 0 in the Euclidean metric, but not in the discrete metric.
On the other hand,
g
:
Y → X
by
g
(
x
) =
x
is continuous, since a sequence
in Y that converges is eventually constant.
1.2 Examples of metric spaces
In this section, we will give four different examples of metrics, where the first
two are metrics on
R
2
. There is an obvious generalization to
R
n
, but we will
look at R
2
specifically for the sake of simplicity.
Example (Manhattan metric). Let X = R
2
, and define the metric as
d(x, y) = d((x
1
, x
2
), (y
1
, y
2
)) = |x
1
− y
1
| + |x
2
− y
2
|.
The first three axioms are again trivial. To prove the triangle inequality, we have
d(x, y) + d(y, z) = |x
1
− y
1
| + |x
2
− y
2
| + |y
1
− z
1
| + |y
2
− z
2
|
≥ |x
1
− z
1
| + |x
2
− z
2
|
= d(x, z),
using the triangle inequality for R.
This metric represents the distance you have to walk from one point to
another if you are only allowed to move horizontally and vertically (and not
diagonally).
Example (British railway metric). Let X = R
2
. We define
d(x, y) =
(
|x − y| if x = ky
|x| + |y| otherwise
To explain the name of this metric, think of Britain with London as the origin.
Since the railway system is
stupid
less than ideal, all trains go through London.
For example, if you want to go from Oxford to Cambridge (and obviously not
the other way round), you first go from Oxford to London, then London to
Cambridge. So the distance traveled is the distance from London to Oxford plus
the distance from London to Cambridge.
The exception is when the two destinations lie along the same line, in which
case, you can directly take the train from one to the other without going through
London, and hence the “if x = ky” clause.
Example (
p
-adic metric). Let
p ∈ Z
be a prime number. We first define the
norm
|n|
p
to be
p
−k
, where
k
is the highest power of
p
that divides
n
. If
n
= 0,
we let |n|
p
= 0. For example, |20|
2
= |2
2
· 5|
2
= 2
−2
.
Now take
X
=
Z
, and let
d
p
(
a, b
) =
|a−b|
p
. The first three axioms are trivial,
and the triangle inequality can be proved by making some number-theoretical
arguments about divisibility.
This metric has rather interesting properties. With respect to
d
2
, we have
1
,
2
,
4
,
8
,
16
,
32
, ··· →
0, while 1
,
2
,
3
,
4
, ···
does not converge. We can also use it
to prove certain number-theoretical results, but we will not go into details here.
Example (Uniform metric). Let
X
=
C
[0
,
1] be the set of all continuous
functions on [0, 1]. Then define
d(f, g) = max
x∈[0,1]
|f(x) − g(x)|.
The maximum always exists since continuous functions on [0
,
1] are bounded
and attain their bounds.
Now let
F
:
C
[0
,
1]
→ R
be defined by
F
(
f
) =
f
(
1
2
). Then this is continuous
with respect to the uniform metric on C[0, 1] and the usual metric on R:
Let
f
n
→ f
in the uniform metric. Then we have to show that
F
(
f
n
)
→ F
(
f
),
i.e. f
n
(
1
2
) → f (
1
2
). This is easy, since we have
0 ≤ |F (f
n
) − F (f)| = |f
n
(
1
2
) − f(
1
2
)| ≤ max |f
n
(x) − f(x)| → 0.
So |f
n
(
1
2
) − f(
1
2
)| → 0. So f
n
(
1
2
) → f (
1
2
).
1.3 Norms
There are several notions on vector spaces that are closely related to metrics.
We’ll look at norms and inner products of vector spaces, and show that they all
naturally induce a metric on the space.
First of all, we define the norm. This can be thought of as the “length” of a
vector in the vector space.
Definition (Norm). Let
V
be a real vector space. A norm on
V
is a function
k · k : V → R such that
– kvk ≥ 0 for all v ∈ V
– kvk = 0 if and only if v = 0.
– kλvk = |λ|kvk
– kv + wk ≤ kvk + kwk.
Example. Let
V
=
R
n
. There are several possible norms we can define on
R
n
:
kvk
1
=
n
X
i=1
|v
i
|
kvk
2
=
v
u
u
t
n
X
i=1
v
2
i
kvk
∞
= max{|v
i
| : 1 ≤ i ≤ n}.
In general, we can define the norm
kvk
p
=
n
X
i=1
|v
i
|
p
!
1/p
.
for any 1 ≤ p ≤ ∞, and kvk
∞
is the limit as p → ∞.
Proof that these are indeed norms is left as an exercise for the reader (in the
example sheets).
A norm naturally induces a metric on V :
Lemma. If k · k is a norm on V , then
d(v, w) = kv − wk
defines a metric on V .
Proof.
(i) d(v, w) = kv − wk ≥ 0 by the definition of the norm.
(ii) d(v, w) = 0 ⇔ kv − wk = 0 ⇔ v − w = 0 ⇔ v = w.
(iii) d(w, v) = kw − vk = k(−1)(v − w)k = | − 1|kv − wk = d(v, w).
(iv) d(u, v) + d(v, w) = ku − vk+ kv − w k ≥ ku − wk = d(u, w).
Example. We have the following norms on C[0, 1]:
kfk
1
=
Z
1
0
|f(x)| dx
kfk
2
=
s
Z
1
0
f(x)
2
dx
kfk
∞
= max
x∈[0,1]
|f(x)|
The first two are known as the
L
1
and
L
2
norms. The last is called the uniform
norm, since it induces the uniform metric.
It is easy to show that these are indeed norms. The only slightly tricky part
is to show that kf k = 0 iff f = 0, which we obtain via the following lemma.
Lemma. Let
f ∈ C
[0
,
1] satisfy
f
(
x
)
≥
0 for all
x ∈
[0
,
1]. If
f
(
x
) is not
constantly 0, then
R
1
0
f(x) dx > 0.
Proof.
Pick
x
0
∈
[0
,
1] with
f
(
x
0
) =
a >
0. Then since
f
is continuous, there is
a
δ
such that
|f
(
x
)
−f
(
x
0
)
| < a/
2 if
|x −x
0
| < δ
. So
|f
(
x
)
| > a/
2 in this region.
Take
g(x) =
(
a/2 |x −x
0
| < δ
0 otherwise
Then f (x) ≥ g(x) for all x ∈ [0, 1]. So
Z
1
0
f(x) dx ≥
Z
1
0
g(x) dx =
a
2
· (2δ) > 0.
Example. Let X = C[0, 1], and let
d
1
(f, g) = kf − gk
1
=
Z
1
0
|f(x) − g(x)| dx.
Define the sequence
f
n
=
(
1 − nx x ∈ [0,
1
n
]
0 x ≥
1
n
.
f
x
1
n
1
Then
kfk
1
=
1
2
·
1
n
· 1 =
1
2n
→ 0
as n → ∞. So f
n
→ 0 in (X, d
1
) where 0(x) = 0.
On the other hand,
kf
n
k
∞
= max
x∈[0,1]
kf(x)k = 1.
So f
n
6→ 0 in the uniform metric.
So the function (
C
[0
,
1]
, d
1
)
→
(
C
[0
,
1]
, d
∞
) that maps
f 7→ f
is not continu-
ous. This is similar to the case that the identity function from the usual metric
of
R
to the discrete metric of
R
is not continuous. However, the discrete metric
is a silly metric, but d
1
is a genuine useful metric here.
Using the same example, we can show that the function
G
: (
C
[0
,
1]
, d
1
)
→
(R, usual) with G(f ) = f(0) is not continuous.
We’ll now define the inner product of a real vector space. This is a general-
ization of the notion of the “dot product”.
Definition (Inner product). Let
V
be a real vector space. An inner product on
V is a function h·, ·i : V × V → R such that
(i) hv, vi ≥ 0 for all v ∈ V
(ii) hv, vi = 0 if and only if v = 0.
(iii) hv, wi = hw, vi.
(iv) hv
1
+ λv
2
, w) = hv
1
, wi + λhv
2
, wi.
Example.
(i) Let V = R
n
. Then
hv, wi =
n
X
i=1
v
i
w
i
is an inner product.
(ii) Let V = C[0, 1]. Then
hf, gi =
Z
1
0
f(x)g(x) dx
is an inner product.
We just showed that norms induce metrics. The proof was completely trivial
as the definitions were almost the same. Now we want to prove that inner
products induce norms. However, this is slightly less trivial. To do so, we need
the Cauchy-Schwarz inequality.
Theorem (Cauchy-Schwarz inequality). If h·, ·i is an inner product, then
hv, wi
2
≤ hv, vihw, wi.
Proof. For any x, we have
hv + xw, v + xwi = hv, vi + 2xhv, wi+ x
2
hw, wi ≥ 0.
Seen as a quadratic in
x
, since it is always non-negative, it can have at most one
real root. So
(2hv, wi)
2
− 4hv, vihw, wi ≤ 0.
So the result follows.
With this, we can show that inner products induce norms (and hence metrics).
Lemma. If h·, ·i is an inner product on V , then
kvk =
p
hv, vi
is a norm.
Proof.
(i) kvk =
p
hv, vi ≥ 0.
(ii) kvk = 0 ⇔ hv, vi = 0 ⇔ v = 0.
(iii) kλvk =
p
hλv, λvi =
p
λ
2
hv, vi = |λ|kvk.
(iv)
(kvk + kwk)
2
= kvk
2
+ 2kvkkwk+ kwk
2
≥ hv, vi + 2hv, wi + hw, wi
= kv + wk
2
1.4 Open and closed subsets
In this section, we will study open and closed subsets of metric spaces. We will
then proceed to prove certain key properties of open and closed subsets, and
show that they are all we need to define continuity. This will lead to the next
chapter (and the rest of the course), where we completely abandon the idea of
metrics and only talk about open and closed subsets.
To define open and closed subsets, we first need the notion of balls.
Definition (Open and closed balls). Let (
X, d
) be a metric space. For any
x ∈ X, r ∈ R,
B
r
(x) = {y ∈ X : d(y, x) < r}
is the open ball centered at x.
¯
B
r
(x) = {y ∈ X : d(y, x) ≤ r}
is the closed ball centered at x.
Example.
(i) When X = R, B
r
(x) = (x − r, x + r).
¯
B
r
(x) = [x − r, x + r].
(ii) When X = R
2
,
(a)
If
d
is the metric induced by the
k
v
k
1
=
|v
1
|
+
|v
2
|
, then an open ball
is a rotated square.
v
1
v
2
(b)
If
d
is the metric induced by the
k
v
k
2
=
p
v
2
1
+ v
2
2
, then an open ball
is an actual disk.
v
1
v
2
(c)
If
d
is the metric induced by the
k
v
k
∞
=
max{|v
1
|, |v
2
|}
, then an
open ball is a square.
v
1
v
2
Definition (Open subset).
U ⊆ X
is an open subset if for every
x ∈ U
,
∃δ >
0
such that B
δ
(x) ⊆ U .
C ⊆ X is a closed subset if X \ C ⊆ X is open.
As if we’ve not emphasized this enough, this is a very very important
definition.
We first prove that this is a sensible definition.
Lemma. The open ball
B
r
(
x
)
⊆ X
is an open subset, and the closed ball
¯
B
r
(x) ⊆ X is a closed subset.
Proof. Given y ∈ B
r
(x), we must find δ > 0 with B
δ
(y) ⊆ B
r
(x).
v
1
v
2
r
y
Since
y ∈ B
r
(
x
), we must have
a
=
d
(
y, x
)
< r
. Let
δ
=
r − a >
0. Then if
z ∈ B
δ
(y), then
d(z, x) ≤ d(z, y) + d(y, x) < (r − a) + a = r.
So z ∈ B
r
(x). So B
δ
(y) ⊆ B
r
(x) as desired.
The second statement is equivalent to
X \
¯
B
r
(
x
) =
{y ∈ X
:
d
(
y, x
)
> r}
is
open. The proof is very similar.
Note that openness is a property of a subset.
A ⊆ X
being open depends on
both
A
and
X
, not just
A
. For example, [0
,
1
2
) is not an open subset of
R
, but
is an open subset of [0
,
1] (since it is
B
1
2
(0)), both with the Euclidean metric.
However, we are often lazy and just say “open set” instead of “open subset”.
Example.
(i)
(0
,
1)
⊆ R
is open, while [0
,
1]
⊆ R
is closed. [0
,
1)
⊆ R
is neither closed
nor open.
(ii) Q ⊆ R
is neither open nor closed, since any open interval contains both
rational numbers and irrational numbers. So any open interval cannot be
a subset of Q or R \ Q.
(iii)
Let
X
= [
−
1
,
1]
\ {
0
}
with the Euclidean metric. Let
A
= [
−
1
,
0)
⊆ X
.
Then
A
is open since it is equal to
B
1
(
−
1).
A
is also closed since it is
equal to
¯
B
1
2
(−
1
2
).
Definition (Open neighborhood). If
x ∈ X
, an open neighborhood of
x
is an
open U ⊆ X with x ∈ U.
This is not really an interesting definition, but is simply a convenient short-
hand for “open subset containing x”.
Lemma. If
U
is an open neighbourhood of
x
and
x
n
→ x
, then
∃N
such that
x
n
∈ U for all n > N.
Proof.
Since
U
is open, there exists some
δ >
0 such that
B
δ
(
x
)
⊆ U
. Since
x
n
→ x
,
∃N
such that
d
(
x
n
, x
)
< δ
for all
n > N
. This implies that
x
n
∈ B
δ
(
x
)
for all n > N. So x
n
∈ U for all n > N.
Definition (Limit point). Let
A ⊆ X
. Then
x ∈ X
is a limit point of
A
if there
is a sequence x
n
→ x such that x
n
∈ A for all n.
Intuitively, a limit point is a point we can get arbitrarily close to.
Example.
(i)
If
a ∈ A
, then
a
is a limit point of
A
, by taking the sequence
a, a, a, a, ···
.
(ii) If A = (0, 1) ⊆ R, then 0 is a limit point of A, e.g. take x
n
=
1
n
.
(iii) Every x ∈ R is a limit point of Q.
It is possible to characterize closed subsets by limit points. This is often a
convenient way of proving that sets are closed.
Proposition.
C ⊆ X
is a closed subset if and only if every limit point of
C
is
an element of C.
Proof.
(
⇒
) Suppose
C
is closed and
x
n
→ x
,
x
n
∈ C
. We have to show that
x ∈ C.
Since
C
is closed,
A
=
X \ C ⊆ X
is open. Suppose the contrary that
x 6∈ C
.
Then
x ∈ A
. Hence
A
is an open neighbourhood of
x
. Then by our previous
lemma, we know that there is some
N
such that
x
n
∈ A
for all
n ≥ N
. So
x
N
∈ A
. But we know that
x
N
∈ C
by assumption. This is a contradiction. So
we must have x ∈ C.
(⇐) Suppose that C is not closed. We have to find a limit point not in C.
Since
C
is not closed,
A
is not open. So
∃x ∈ A
such that
B
δ
(
x
)
6⊆ A
for all
δ > 0. This means that B
δ
(x) ∩ C 6= ∅ for all δ > 0.
So pick
x
n
∈ B
1
n
(
x
)
∩C
for each
n >
0. Then
x
n
∈ C
,
d
(
x
n
, x
) =
1
n
→
0. So
x
n
→ x. So x is a limit point of C which is not in C.
Finally, we get to the Really Important Result
TM
that tells us metrics are
useless.
Proposition (Characterization of continuity). Let (
X, d
x
) and (
Y, d
y
) be metric
spaces, and f : X → Y . The following conditions are equivalent:
(i) f is continuous
(ii) If x
n
→ x, then f (x
n
) → f (x) (which is the definition of continuity)
(iii) For any closed subset C ⊆ Y , f
−1
(C) is closed in X.
(iv) For any open subset U ⊆ Y , f
−1
(U) is open in X.
(v)
For any
x ∈ X
and
ε >
0,
∃δ >
0 such that
f
(
B
δ
(
x
))
⊆ B
ε
(
f
(
x
)).
Alternatively, d
x
(x, z) < δ ⇒ d
y
(f(x), f(z)) < ε.
Proof.
– 1 ⇔ 2: by definition
–
2
⇒
3: Suppose
C ⊆ Y
is closed. We want to show that
f
−1
(
C
) is closed.
So let x
n
→ x, where x
n
∈ f
−1
(C).
We know that
f
(
x
n
)
→ f
(
x
) by (2) and
f
(
x
n
)
∈ C
. So
f
(
x
) is a limit
point of
C
. Since
C
is closed,
f
(
x
)
∈ C
. So
x ∈ f
−1
(
C
). So every limit
point of f
−1
(C) is in f
−1
(C). So f
−1
(C) is closed.
–
3
⇒
4: If
U ⊆ Y
is open, then
Y \ U
is closed in Y. So
f
−1
(
Y \ U
) =
X \ f
−1
(U) is closed in X. So f
−1
(U) ⊆ X is open.
–
4
⇒
5: Given
x ∈ X, ε >
0,
B
ε
(
f
(
x
)) is open in
Y
. By (4), we know
f
−1
(
B
ε
(
f
(
x
))) =
A
is open in
X
. Since
x ∈ A
,
∃δ >
0 with
B
δ
(
x
)
⊆ A
.
So
f(B
δ
(x)) ⊆ f (A) = f(f
−1
(B
ε
(f(x)))) = B
ε
(f(x))
–
5
⇒
2: Suppose
x
n
→ x
. Given
ε >
0,
∃δ >
0 such that
f
(
B
δ
(
x
))
⊆
B
ε
(
f
(
x
)). Since
x
n
→ x
,
∃N
such that
x
n
∈ B
δ
(
x
) for all
n > N
. Then
f(x
n
) ∈ f (B
δ
(x)) ⊆ B
ε
(f(x)) for all n > N . So f(x
n
) → f (x).
The third and fourth condition can allow us to immediately decide if a subset
is open or closed in some cases.
Example. Let f : R
3
→ R be defined as
f(x
1
, x
2
, x
3
) = x
2
1
+ x
4
2
x
6
3
+ x
8
1
x
2
3
.
Then this is continuous. So
{
x
∈ R
3
:
f
(x)
≤
1
}
=
f
−1
((
−∞,
1]) is closed in
R
3
.
Before we end, we prove some key properties of open subsets. These will be
used as the defining properties of open subsets in the next chapter.
Lemma.
(i) ∅ and X are open subsets of X.
(ii) Suppose V
α
⊆ X is open for all α ∈ A. Then U =
[
α∈A
V
α
is open in X.
(iii) If V
1
, ··· , V
n
⊆ X are open, then so is V =
n
\
i=1
V
i
.
Proof.
(i) ∅
satisfies the definition of an open subset vacuously.
X
is open since for
any x, B
1
(x) ⊆ X.
(ii)
If
x ∈ U
, then
x ∈ V
α
for some
α
. Since
V
α
is open, there exists
δ >
0
such that B
δ
(x) ⊆ V
α
. So B
δ
(x) ⊆
[
α∈A
V
α
= U. So U is open.
(iii)
If
x ∈ V
, then
x ∈ V
i
for all
i
= 1
, ··· , n
. So
∃δ
i
>
0 with
B
δ
i
(
x
)
⊆ V
i
.
Take
δ
=
min{δ
1
, ··· , δ
n
}
. So
B
δ
(
x
)
⊆ V
i
for all
i
. So
B
δ
(
x
)
⊆ V
. So
V
is
open.
Note that we can take infinite unions or finite intersection, but not infinite
intersections. For example, the intersection of all (
−
1
n
,
1
n
) is
{
0
}
, which is not
open.
2 Topological spaces
2.1 Definitions
We have previously shown that a function
f
is continuous iff
f
−1
(
U
) is open
whenever
U
is open. Convergence can also be characterized using open sets only.
This suggests that we can dump the metric and just focus on the open sets.
What we will do is to define the topology of a space
X
to be the set of open
sets of
X
. Then this topology would define most of the structure or geometry of
X, and we don’t need to care about metrics.
Definition (Topological space). A topological space is a set
X
(the space)
together with a set U ⊆ P(X) (the topology) such that:
(i) ∅, X ∈ U
(ii) If V
α
∈ U for all α ∈ A, then
[
α∈A
V
α
∈ U.
(iii) If V
1
, ··· , V
n
∈ U, then
n
\
i=1
V
i
∈ U.
The elements of
X
are the points, and the elements of
U
are the open subsets of
X.
Definition (Induced topology). Let (
X, d
) be a metric space. Then the topology
induced by d is the set of all open sets of X under d.
Example. Let
X
=
R
n
and consider the two metrics
d
1
(x
,
y) =
k
x
−
y
k
1
and
d
∞
(x, y) = kx − yk
∞
. We will show that they induce the same topology.
Recall that the metrics are defined by
kvk =
n
X
i=1
|v
i
|, kvk
∞
= max
1≤i≤n
|v
i
|.
This implies that
kvk
∞
≤ kvk
1
≤ nkvk
∞
.
This in turn implies that
B
∞
r
(x) ⊇ B
1
r
(x) ⊇ B
∞
r/n
(x).
v
1
v
2
B
1
B
∞
To show that the metrics induce the same topology, suppose that
U
is open with
respect to
d
1
, and we show that it is open with respect to
d
∞
. Let
x ∈ U
. Since
U
is open with respect to
d
1
, there exists some
δ >
0 such that
B
1
δ
(
x
)
⊆ U
. So
B
∞
δ/n
(x) ⊆ B
1
δ
(x) ⊆ U . So U is open with respect to d
∞
.
The other direction is similar.
Example. Let
X
=
C
[0
,
1]. Let
d
1
(
f, g
) =
kf − gk
1
and
d
∞
(
f, g
) =
kf − gk
∞
.
Then they do not induce the same topology, since (
X, d
1
)
→
(
X, d
∞
) by
f 7→ f
is not continuous.
It is possible to have some other topologies that are not induced by metrics.
Example.
(i) Let X be any set.
(a) U = {∅, X} is the coarse topology on X.
(b) U
=
P
(
X
) is the discrete topology on
X
, since it is induced by the
discrete metric.
(c) U
=
{A ⊆ X
:
X \A is finite or A
=
∅}
is the cofinite topology on
X
.
(ii) Let X = R, and U = {(a, ∞) : a ∈ R} is the right order topology on R.
Now we can define continuous functions in terms of the topology only.
Definition (Continuous function). Let
f
:
X → Y
be a map of topological
spaces. Then f is continuous if f
−1
(U) is open in X whenever U is open in Y .
When the topologies are induced by metrics, the topological and metric
notions of continuous functions coincide, as we showed in the previous chapter.
Example.
(i) Any function f : X → Y is continuous if X has the discrete topology.
(ii) Any function f : X → Y is continuous if Y has the coarse topology.
(iii)
If
X
and
Y
both have cofinite topology, then
f
:
X → Y
is continuous iff
f
−1
({y}) is finite for every y ∈ Y .
Lemma. If
f
:
X → Y
and
g
:
Y → Z
are continuous, then so is
g ◦ f
:
X → Z
.
Proof.
If
U ⊆ Z
is open,
g
is continuous, then
g
−1
(
U
) is open in
Y
. Since
f
is
also continuous, f
−1
(g
−1
(U)) = (g ◦ f)
−1
(U) is open in X.
In group theory, we had the notion of isomorphisms between groups. Isomor-
phic groups are equal-up-to-renaming-of-elements, and are considered to be the
same for most purposes.
Similarly, we will define homeomorphisms between topological spaces, and
homeomorphic topological spaces will be considered to be the same (notice the
“e” in homeomorphism).
Definition (Homeomorphism). f : X → Y is a homeomorphism if
(i) f is a bijection
(ii) Both f and f
−1
are continuous
Equivalently, f is a bijection and U ⊆ X is open iff f (U) ⊆ Y is open.
Two spaces are homeomorphic if there exists a homeomorphism between
them, and we write X ' Y .
Note that we specifically require
f
and
f
−1
to be both continuous. In
group theory, if
φ
is a bijective homomorphism, then
φ
−1
is automatically a
homomorphism as well. However, this is not true for topological spaces.
f
being continuous does not imply
f
−1
is continuous, as illustrated by the example
below.
Example. Let
X
=
C
[0
,
1] with the topology induced by
k · k
1
and
Y
=
C
[0
,
1]
with the topology induced by
k · k
∞
. Then
F
:
Y → X
by
f 7→ f
is continuous
but F
−1
is not.
Example. Let
X
= [0
,
2
π
) and
Y
=
S
1
=
{z ∈ C
:
|z|
= 1
}
. Then
f
:
X → Y
given by f(x) = e
ix
is continuous but its inverse is not.
Similar to isomorphisms, we can show that homeomorphism is an equivalence
relation.
Lemma. Homeomorphism is an equivalence relation.
Proof.
(i) The identity map I
X
: X → X is always a homeomorphism. So X ' X.
(ii)
If
f
:
X → Y
is a homeomorphism, then so is
f
−1
:
Y → X
. So
X ' Y ⇒ Y ' X.
(iii) If f : X → Y and g : Y → Z are homeomorphisms, then g ◦ f : X → Z is
a homeomorphism. So X ' Y and Y ' Z implies X ' Z.
Example.
(i)
Under the usual topology, the open intervals (0
,
1)
'
(
a, b
) for all
a, b ∈ R
,
using the homeomorphism x 7→ a + (b − a)x.
Similarly, [0, 1] ' [a, b]
(ii) (−1, 1) ' R by x 7→ tan(
π
2
x).
(iii) R ' (0, ∞) by x 7→ e
x
.
(iv) (a, ∞) ' (b, ∞) by x 7→ x + (b − a).
The fact that
'
is an equivalence relation implies that any 2 open intervals in
R
are homeomorphic.
It is relatively easy to show that two spaces are homeomorphic. We just have
to write down a homeomorphism. However, it is rather difficult to prove that
two spaces are not homeomorphic.
For example, is (0
,
1) homeomorphic to [0
,
1]? No, but we aren’t able
to immediately prove it now. How about
R
and
R
2
? Again they are not
homeomorphic, but to show this we will need some tools that we’ll develop in
the next few lectures.
How about
R
m
and
R
n
in general? They are not homeomorphic, but we
won’t be able to prove this rigorously in the course. To properly prove this, we
will need tools from algebraic topology.
So how can we prove that two spaces are not homeomorphic? In group theory,
we could prove that two groups are not isomorphic by, say, showing that they
have different orders. Similarly, to distinguish between topological spaces, we
have to define certain topological properties. Then if two spaces have different
topological properties, we can show that they are not homeomorphic.
But before that, we will first define many many useful definitions in topological
spaces, including sequences, subspaces, products and quotients. The remainder
of the chapter will be mostly definitions that we will use later.
2.2 Sequences
To define the convergence of a sequence using open sets, we again need the
concept of open neighbourhoods.
Definition (Open neighbourhood). An open neighbourhood of
x ∈ X
is an open
set U ⊆ X with x ∈ U .
Now we can use this to define convergence of sequences.
Definition (Convergent sequence). A sequence
x
n
→ x
if for every open
neighbourhood U of x, ∃N such that x
n
∈ U for all n > N.
Example.
(i)
If
X
has the coarse topology, then any sequence
x
n
converges to every
x ∈ X, since there is only one open neighbourhood of x.
(ii)
If
X
has the cofinite topology, no two
x
n
s are the same, then
x
n
→ x
for every
x ∈ X
, since every open set can only have finitely many
x
n
not
inside it.
This looks weird. This is definitely not what we used to think of sequences.
At least, we would want to have unique limits.
Fortunately, there is a particular class of spaces where sequences are well-
behaved and have at most one limit.
Definition (Hausdorff space). A topological space
X
is Hausdorff if for all
x
1
, x
2
∈ X
with
x
1
6
=
x
2
, there exist open neighbourhoods
U
1
of
x
1
,
U
2
of
x
2
such that U
1
∩ U
2
= ∅.
Lemma. If
X
is Hausdorff,
x
n
is a sequence in
X
with
x
n
→ x
and
x
n
→ x
0
,
then x = x
0
, i.e. limits are unique.
Proof.
Suppose the contrary that
x 6
=
x
0
. Then by definition of Hausdorff, there
exist open neighbourhoods U, U
0
of x, x
0
respectively with U ∩ U
0
= ∅.
Since
x
n
→ x
and
U
is a neighbourhood of
x
, by definition, there is some
N
such that whenever
n > N
, we have
x
n
∈ U
. Similarly, since
x
n
→ x
0
, there is
some N
0
such that whenever n > N
0
, we have x
n
∈ U
0
.
This means that whenever
n > max
(
N, N
0
), we have
x
n
∈ U
and
x
n
∈ U
0
.
So x
n
∈ U ∩ U
0
. This contradicts the fact that U ∩ U
0
= ∅.
Hence we must have x = x
0
.
Example.
(i)
If
X
has more than 1 element, then the coarse topology on
X
is not
Hausdorff.
(ii)
If
X
has infinitely many elements, the cofinite topology on
X
is not
Hausdorff.
(iii) The discrete topology is always Hausdorff.
(iv)
If (
X, d
) is a metric space, the topology induced by
d
is Hausdorff: for
x
1
6
=
x
2
, let
r
=
d
(
x
1
, x
2
)
>
0. Then take
U
i
=
B
r/2
(
x
i
). Then
U
1
∩U
2
=
∅
.
2.3 Closed sets
We will define closed sets similarly to what we did for metric spaces.
Definition (Closed sets). C ⊆ X is closed if X \ C is an open subset of X.
Lemma.
(i) If C
α
is a closed subset of X for all α ∈ A, then
T
α∈A
C
α
is closed in X.
(ii) If C
1
, ··· , C
n
are closed in X, then so is
S
n
i=1
C
i
.
Proof.
(i)
Since
C
α
is closed in
X
,
X \ C
α
is open in
X
. So
S
α∈A
(
X \ C
α
) =
X \
T
α∈A
C
α
is open. So
T
α∈A
C
α
is closed.
(ii)
If
C
i
is closed in
X
, then
X \ C
i
is open. So
T
n
i=1
(
X \ C
i
) =
X \
S
n
i=1
C
i
is open. So
S
n
i=1
C
i
is closed.
This time we can take infinite intersections and finite unions, which is the
opposite of what we have for open sets.
Note that it is entirely possible to define the topology to be the collection of
all closed sets instead of open sets, but people seem to like open sets more.
Corollary. If X is Hausdorff and x ∈ X, then {x} is closed in X.
Proof.
For all
y ∈ X
, there exist open subsets
U
y
, V
y
with
y ∈ U
y
, x ∈ V
y
,
U
y
∩ V
y
= ∅.
Let
C
y
=
X \ U
y
. Then
C
y
is closed,
y 6∈ C
y
,
x ∈ C
y
. So
{x}
=
T
y6=x
C
y
is
closed since it is an intersection of closed subsets.
2.4 Closure and interior
2.4.1 Closure
Given a subset
A ⊆ X
, if
A
is not closed, we would like to find the smallest
closed subset containing A. This is known as the closure of A.
Officially, we define the closure as follows:
Definition. Let X be a topological space and A ⊆ X. Define
C
A
= {C ⊆ X : A ⊆ C and C is closed in X}
Then the closure of A in X is
¯
A =
\
C∈C
A
C.
First we do a sanity check: since
¯
A
is defined as an intersection, we should
make sure we are not taking an intersection of no sets. This is easy: since
X
is closed in
X
(its complement
∅
is open),
C
A
6
=
∅
. So we can safely take the
intersection.
Since
¯
A
is an intersection of closed sets, it is closed in
X
. Also, if
C ∈ C
A
,
then A ⊆ C. So A ⊆
T
C∈C
A
C =
¯
A. In fact, we have
Proposition.
¯
A is the smallest closed subset of X which contains A.
Proof.
Let
K ⊆ X
be a closed set containing
A
. Then
K ∈ C
A
. So
¯
A
=
T
C∈C
A
C ⊆ K. So
¯
A ⊆ K.
We basically defined the closure such that it is the smallest closed subset of
X which contains A.
However, while this “clever” definition makes it easy to prove the above
property, it is rather difficult to directly use it to compute the closure.
To compute the closure, we define the limit point analogous to what we did
for metric spaces.
Definition (Limit point). A limit point of
A
is an
x ∈ X
such that there is a
sequence x
n
→ x with x
n
∈ A for all n.
In general, limit points are easier to compute, and can be a useful tool for
determining the closure of A.
Now let
L(A) = {x ∈ X : x is a limit point of A}.
We immediately get the following lemma.
Lemma. If C ⊆ X is closed, then L(C) = C.
Proof.
Exactly the same as that for metric spaces. We will also prove a more
general result very soon that implies this.
Recall that we proved the converse of this statement for metric spaces.
However, the converse is not true for topological spaces in general.
Example. Let
X
be an uncountable set (e.g.
R
), and define a topology on
X
by saying a set is open if it is empty or has countable complement. One can
check that this indeed defines a topology. We claim that the only sequences that
converge are those that are eventually constant.
Indeed, if x
n
is a sequence and x ∈ X, then consider the open set
U = (X \ {x
n
: n ∈ N}) ∪ {x}.
Then the only element in the sequence
x
n
that can possibly be contained in
U
is x itself. So if x
n
→ x, this implies that x
n
is eventually always x.
In particular, it follows that L(A) = A for all A ⊆ X.
However, we do have the following result:
Proposition. L(A) ⊆
¯
A.
Proof.
If
A ⊆ C
, then
L
(
A
)
⊆ L
(
C
). If
C
is closed, then
L
(
C
) =
C
. So
C ∈ C
A
⇒ L(A) ⊆ C. So L(A) ⊆
T
C∈C
A
C =
¯
A.
This in particular implies the previous lemma, since for any
A
, we have
A ⊆ L(A) ⊆
¯
A, and when A is closed, we have A =
¯
A.
Finally, we have the following corollary that can help us find the closure of
subsets:
Corollary. Given a subset
A ⊆ X
, if we can find some closed
C
such that
A ⊆ C ⊆ L(A), then we in fact have C =
¯
A.
Proof. C ⊆ L
(
A
)
⊆
¯
A ⊆ C
, where the last step is since
¯
A
is the smallest closed
set containing A. So C = L(A) =
¯
A.
Example.
– Let (a, b) ⊆ R. Then (a, b) = [a, b].
– Let Q ⊆ R. Then
¯
Q = R.
– R \ Q = R.
–
In
R
n
with the Euclidean metric,
B
r
(x)
=
¯
B
r
(
x
). In general,
B
r
(x) ⊆
¯
B
r
(
x
), since
¯
B
r
(
x
) is closed and
B
r
(
x
)
⊆
¯
B
r
(
x
), but these need not be
equal.
For example, if
X
has the discrete metric, then
B
1
(
x
) =
{x}
. Then
B
1
(x) = {x}, but
¯
B
1
(x) = X.
In the above example, we had
¯
Q
=
R
. In some sense, all points of
R
are
“surrounded” by points in Q. We say that Q is dense in R.
Definition (Dense subset). A ⊆ X is dense in X if
¯
A = X.
Example. Q and R \ Q are both dense in R with the usual topology.
2.4.2 Interior
We defined the closure of
A
to be the smallest closed subset containing
A
. We
can similarly define the interior of
A
to be the largest open subset contained in
A.
Definition (Interior). Let A ⊆ X, and let
O
A
= {U ⊆ X : U ⊆ A, U is open in X}.
The interior of A is
Int(A) =
[
U∈O
A
U.
Proposition. Int(A) is the largest open subset of X contained in A.
The proof is similar to proof for closure.
To find the closure, we could use limit points. What trick do we have to find
the interior?
Proposition. X \ Int(A) = X \ A
Proof. U ⊆ A ⇔
(
X \U
)
⊇
(
X \A
). Also,
U open in X ⇔ X \U is closed in X
.
So the complement of the largest open subset of
X
contained in
A
will be
the smallest closed subset containing X \ A.
Example. Int(Q) = Int(R \ Q) = ∅.
2.5 New topologies from old
In group theory, we had the notions of subgroups, products and quotients. There
are exact analogies for topological spaces. In this section, we will study the
subspace topology, product topology and quotient topology.
2.5.1 Subspace topology
Definition (Subspace topology). Let
X
be a topological space and
Y ⊆ X
.
The subspace topology on
Y
is given by:
V
is an open subset of
Y
if there is
some U open in X such that V = Y ∩ U.
If we simply write
Y ⊆ X
and don’t specify a topology, the subspace topology
is assumed. For example, when we write Q ⊆ R, we are thinking of Q with the
subspace topology inherited from R.
Example. If (
X, d
) is a metric space and
Y ⊆ X
, then the metric topology on
(Y, d) is the subspace topology, since B
Y
r
(y) = Y ∩ B
X
r
(y).
To show that this is indeed a topology, we need the following set theory facts:
Y ∩
[
α∈A
V
α
!
=
[
α∈A
(Y ∩ V
α
)
Y ∩
\
α∈A
V
α
!
=
\
α∈A
(Y ∩ V
α
)
Proposition. The subspace topology is a topology.
Proof.
(i) Since ∅ is open in X, ∅ = Y ∩ ∅ is open in Y .
Since X is open in X, Y = Y ∩ X is open in Y .
(ii) If V
α
is open in Y , then V
α
= Y ∩ U
α
for some U
α
open in X. Then
[
α∈A
V
α
=
[
α∈A
(Y ∩ U
α
) = Y ∩
[
α∈U
U
α
!
.
Since
S
U
α
is open in X, so
S
V
α
is open in Y .
(iii) If V
i
is open in Y , then V
i
= Y ∩ U
i
for some open U
i
⊆ X. Then
n
\
i=1
V
i
=
n
\
i=1
(Y ∩ U
i
) = Y ∩
n
\
i=1
U
i
!
.
Since
T
U
i
is open,
T
V
i
is open.
Recall that if
Y ⊆ X
, there is an inclusion function
ι
:
Y → X
that sends
y 7→ y
. We can use this to obtain the following defining property of a subspace.
Proposition. If
Y
has the subspace topology,
f
:
Z → Y
is continuous iff
ι ◦ f : Z → X is continuous.
Proof.
(
⇒
) If
U ⊆ X
is open, then
ι
−1
(
U
) =
Y ∩ U
is open in
Y
. So
ι
is
continuous. So if f is continuous, so is ι ◦ f .
(
⇐
) Suppose we know that
ι ◦ f
is continuous. Given
V ⊆ Y
is open, we
know that
V
=
Y ∩ U
=
ι
−1
(
U
). So
f
−1
(
V
) =
f
−1
(
ι
−1
(
U
))) = (
ι ◦ f
)
−1
(
U
) is
open since ι ◦ f is continuous. So f is continuous.
This property is “defining” in the sense that it can be used to define a
subspace:
Y
is a subspace of
X
if there exists some function
ι
:
Y → X
such
that for any f , f is continuous iff ι ◦ f is continuous.
Example.
D
n
=
{
v
∈ R
n
:
|
v
| ≤
1
}
is the
n
-dimensional closed unit disk.
S
n−1
= {v ∈ R
n
: |v| = 1} is the n − 1-dimensional sphere.
We have
Int(D
n
) = {v ∈ R
n
: |v| < 1} = B
1
(0).
This is, in fact, homeomorphic to
R
n
. To show this, we can first pick our favorite
homeomorphism
f
: [0
,
1)
7→
[1
, ∞
). Then v
7→ f
(
|
v
|
)v is a homeomorphism
Int(D
n
) → R
n
.
2.5.2 Product topology
If X and Y are sets, the product is defined as
X × Y = {(x, y) : x ∈ X, y ∈ Y }
We have the projection functions π
1
: X × Y → X, π
2
: X × Y → Y given by
π
1
(x, y) = x, π
2
(x, y) = y.
If A ⊆ X, B ⊆ Y , then we have A × B ⊆ X × Y .
Given topological spaces
X
,
Y
, we can define a topology on
X ×Y
as follows:
Definition (Product topology). Let
X
and
Y
be topological spaces. The product
topology on X × Y is given by:
U ⊆ X × Y
is open if: for every (
x, y
)
∈ U
, there exist
V
x
⊆ X, W
y
⊆ Y
open neighbourhoods of x and y such that V
x
× W
y
⊆ U .
Example.
–
If
V ⊆ X
and
W ⊆ Y
are open, then
V × W ⊆ X × Y
is open (take
V
x
= V , W
y
= W ).
–
The product topology on
R × R
is same as the topology induced by the
k · k
∞
, hence is also the same as the topology induced by
k · k
2
or
k · k
1
.
Similarly, the product topology on
R
n
=
R
n−1
×R
is also the same as that
induced by k · k
∞
.
– (0, 1) × (0, 1) × ··· × (0, 1) ⊆ R
n
is the open n-dimensional cube in R
n
.
Since (0, 1) ' R, we have (0, 1)
n
' R
n
' Int(D
n
).
–
[0
,
1]
× S
n
'
[1
,
2]
× S
n
' {
v
∈ R
n+1
: 1
≤ |
v
| ≤
2
}
, where the last
homeomorphism is given by (
t,
w)
7→ t
w with inverse v
7→
(
|
v
|,
ˆ
v
). This is
a thickened sphere.
–
Let
A ⊆ {
(
r, z
) :
r >
0
} ⊆ R
2
,
R
(
A
) be the set obtained by rotating
A
around the z axis. Then R(A) ' S × A by
(x, y, z) = (v, z) 7→ (
ˆ
v, (|v|, z)).
In particular, if
A
is a circle, then
R
(
A
)
' S
1
× S
1
=
T
2
is the two-
dimensional torus.
r
z
The defining property is that
f
:
Z → X × Y
is continuous iff
π
1
◦ f
and
π
2
◦ f are continuous.
Note that our definition of the product topology is rather similar to the
definition of open sets for metrics. We have a special class of subsets of the
form
V × W
, and a subset
U
is open iff every point
x ∈ U
is contained in some
V × W ⊆ U . In some sense, these subsets “generate” the open sets.
Alternatively, if U ⊆ X × Y is open, then
U =
[
(x,y)∈U
V
x
× W
y
.
So
U ⊆ X ×Y
is open if and only if it is a union of members of our special class
of subsets.
We call this special class the basis.
Definition (Basis). Let
U
be a topology on
X
. A subset
B ⊆ U
is a basis if
“U ∈ U iff U is a union of sets in B”.
Example.
– {V ×W
:
V ⊆ X, W ⊆ Y are open}
is a basis for the product topology for
X × Y .
– If (X, d) is a metric space, then
{B
1/n
(x) : n ∈ N
+
, x ∈ X}
is a basis for the topology induced by d.
2.5.3 Quotient topology
If
X
is a set and
∼
is an equivalence relation on
X
, then the quotient
X/∼
is the
set of equivalence classes. The projection
π
:
X → X/∼
is defined as
π
(
x
) = [
x
],
the equivalence class containing x.
Definition (Quotient topology). If
X
is a topological space, the quotient topology
on X/∼ is given by: U is open in X/∼ if π
−1
(U) is open in X.
We can think of the quotient as “gluing” the points identified by
∼
together.
The defining property is
f
:
X/∼ → Y
is continuous iff
f ◦ π
:
X → Y
is
continuous.
Example.
–
Let
X
=
R
,
x ∼ y
iff
x − y ∈ Z
. Then
X/∼
=
R/Z ' S
1
, given by
[x] 7→ (cos 2πx, sin 2πx).
–
Let
X
=
R
2
. Then v
∼
w iff v
−
w
∈ Z
2
. Then
X/∼
=
R
2
/Z
2
=
(
R/Z
)
×
(
R/Z
)
' S
1
×S
1
=
T
2
. Similarly,
R
n
/Z
n
=
T
n
=
S
1
×S
1
×···×S
1
.
–
If
A ⊆ X
, define
∼
by
x ∼ y
iff
x
=
y
or
x, y ∈ A
. This glues everything
in A together and leaves everything else alone.
We often write this as
X/A
. Note that this is not consistent with the
notation we just used above!
◦
Let
X
= [0
,
1] and
A
=
{
0
,
1
}
, then
X/A ' S
1
by, say,
t 7→
(
cos
2
πt, sin
2
πt
). Intuitively, the equivalence relation says that the
two end points of [0
,
1] are “the same”. So we join the ends together
to get a circle.
◦
Let
X
=
D
n
and
A
=
S
n−1
. Then
X/A ' S
n
. This can be pictured
as pulling the boundary of the disk together to a point to create a
closed surface
–
Let
X
= [0
,
1]
×
[0
,
1] with
∼
given by (0
, y
)
∼
(1
, y
) and (
x,
0)
∼
(
x,
1),
then X/∼ ' S
1
× S
1
= T
2
, by, say
(x, y) 7→
(cos 2πx, sin 2πx), (cos 2πy, sin 2πy)
Similarly, T
3
= [0, 1]
3
/∼, where the equivalence is analogous to above.
Note that even if
X
is Hausdorff,
X/∼
may not be! For example,
R/Q
is not
Hausdorff.
3 Connectivity
Finally we can get to something more interesting. In this chapter, we will study
the connectivity of spaces. Intuitively, we would want to say that a space is
“connected” if it is one-piece. For example,
R
is connected, while
R \ {
0
}
is
not. We will come up with two different definitions of connectivity - normal
connectivity and path connectivity, where the latter implies the former, but not
the other way round.
3.1 Connectivity
We will first look at normal connectivity.
Definition (Connected space). A topological space
X
is disconnected if
X
can
be written as
A ∪ B
, where
A
and
B
are disjoint, non-empty open subsets of
X
.
We say A and B disconnect X.
A space is connected if it is not disconnected.
Note that being connected is a property of a space, not a subset. When we
say “
A
is a connected subset of
X
”, it means
A
is connected with the subspace
topology inherited from X.
Being (dis)connected is a topological property, i.e. if
X
is (dis)connected,
and
X ' Y
, then
Y
is (dis)connected. To show this, let
f
:
X → Y
be the
homeomorphism. By definition,
A
is open in
X
iff
f
(
A
) is open in
Y
. So
A
and
B disconnect X iff f(A) and f (B) disconnect Y .
Example.
– If X has the coarse topology, it is connected.
– If X has the discrete topology and at least 2 elements, it is disconnected.
–
Let
X ⊆ R
. If there is some
α ∈ R \ X
such that there is some
a, b ∈ X
with
a < α < b
, then
X
is disconnected. In particular,
X ∩
(
−∞, α
) and
X ∩ (α, ∞) disconnect X.
For example, (0, 1) ∪ (1, 2) is disconnected (α = 1).
We can also characterize connectivity in terms of continuous functions:
Proposition.
X
is disconnected iff there exists a continuous surjective
f
:
X →
{0, 1} with the discrete topology.
Alternatively,
X
is connected iff any continuous map
f
:
X → {
0
,
1
}
is
constant.
Proof. (⇒) If A and B disconnect X, define
f(x) =
(
0 x ∈ A
1 x ∈ B
Then
f
−1
(
∅
) =
∅
,
f
−1
(
{
0
,
1
}
) =
X
,
f
−1
(
{
0
}
) =
A
and
f
−1
(
{
1
}
) =
B
are all
open. So f is continuous. Also, since A, B are non-empty, f is surjective.
(
⇐
) Given
f
:
X 7→ {
0
,
1
}
surjective and continuous, define
A
=
f
−1
(
{
0
}
),
B = f
−1
({1}). Then A and B disconnect X.
Theorem. [0, 1] is connected.
Note that
Q ∩
[0
,
1] is disconnected, since we can pick our favorite irrational
number
a
, and then
{x
:
x < a}
and
{x
:
x > a}
disconnect the interval. So we
better use something special about [0, 1].
The key property of R is that every non-empty A ⊆ [0, 1] has a supremum.
Proof.
Suppose
A
and
B
disconnect [0
,
1]. wlog, assume 1
∈ B
. Since
A
is
non-empty, α = sup A exists. Then either
– α ∈ A
. Then
α <
1, since 1
∈ B
. Since
A
is open,
∃ε >
0 with
B
ε
(
α
)
⊆ A
.
So α +
ε
2
∈ A, contradicting supremality of α; or
– α 6∈ A
. Then
α ∈ B
. Since
B
is open,
∃ε >
0 such that
B
ε
(
α
)
⊆ B
. Then
a ≤ α −ε
for all
a ∈ A
. This contradicts
α
being the least upper bound of
A.
Either option gives a contradiction. So
A
and
B
cannot exist and [0
,
1] is
connected.
To conclude the section, we will prove the intermediate value property. The
key proposition needed is the following.
Proposition. If
f
:
X → Y
is continuous and
X
is connected, then
im f
is also
connected.
Proof.
Suppose
A
and
B
disconnect
im f
. We will show that
f
−1
(
A
) and
f
−1
(
B
)
disconnect X.
Since
A, B ⊆ im f
are open, we know that
A
=
im f ∩ A
0
and
B
=
im f ∩ B
0
for some
A
0
, B
0
open in
Y
. Then
f
−1
(
A
) =
f
−1
(
A
0
) and
f
−1
(
B
) =
f
−1
(
B
0
) are
open in X.
Since
A, B
are non-empty,
f
−1
(
A
) and
f
−1
(
B
) are non-empty. Also,
f
−1
(
A
)
∩
f
−1
(
B
) =
f
−1
(
A ∩ B
) =
f
−1
(
∅
) =
∅
. Finally,
A ∪ B
=
im f
. So
f
−1
(
A
)
∪
f
−1
(B) = f
−1
(A ∪ B) = X.
So
f
−1
(
A
) and
f
−1
(
B
) disconnect
X
, contradicting our hypothesis. So
im f
is connected.
Alternatively, if
im f
is not connected, let
g
:
im f → {
0
,
1
}
be continuous
surjective. Then g ◦ f : X → {0, 1} is continuous surjective. Contradiction.
Theorem (Intermediate value theorem). Suppose
f
:
X → R
is continuous
and
X
is connected. If
∃x
0
, x
1
such that
f
(
x
0
)
<
0
< f
(
x
1
), then
∃x ∈ X
with
f(x) = 0.
Proof.
Suppose no such
x
exists. Then 0
6∈ im f
while 0
> f
(
x
0
)
∈ im f
,
0
< f
(
x
1
)
∈ im f
. Then
im f
is disconnected (from our previous example),
contradicting X being connected.
Alternatively, if
f
(
x
)
6
= 0 for all
x
, then
f(x)
|f(x)|
is a continuous surjection from
X to {−1, +1}, which is a contradiction.
Corollary. If
f
: [0
,
1]
→ R
is continuous with
f
(0)
<
0
< f
(1), then
∃x ∈
[0
,
1]
with f (x) = 0.
Is the converse of the intermediate value theorem true? If
X
is disconnected,
can we find a function g that doesn’t satisfy the intermediate value property?
The answer is yes. Since
X
is disconnected, let
f
:
X → {
0
,
1
}
be continu-
ous. Then let
g
(
x
) =
f
(
x
)
−
1
2
. Then
g
is continuous but doesn’t satisfy the
intermediate value property.
3.2 Path connectivity
The other notion of connectivity is path connectivity. A space is path connected
if we can join any two points with a path. First, we need a definition of a path.
Definition (Path). Let
X
be a topological space, and
x
0
, x
1
∈ X
. Then a
path from
x
0
to
x
1
is a continuous function
γ
: [0
,
1]
7→ X
such that
γ
(0) =
x
0
,
γ(1) = x
1
.
Definition (Path connectivity). A topological space
X
is path connected if for
all points x
0
, x
1
∈ X, there is a path from x
0
to x
1
.
Example.
(i)
(
a, b
)
,
[
a, b
)
,
(
a, b
]
, R
are all path connected (using paths given by linear
functions).
(ii) R
n
is path connected (e.g. γ(t) = tx
1
+ (1 − t)x
0
).
(iii) R
n
\{
0
}
is path-connected for
n >
1 (the paths are either line segments or
bent line segments to get around the hole).
Path connectivity is a stronger condition than connectivity, in the sense that
Proposition. If X is path connected, then X is connected.
Proof.
Let
X
be path connected, and let
f
:
X → {
0
,
1
}
be a continuous function.
We want to show that f is constant.
Let
x, y ∈ X
. By path connectedness, there is a map
γ
: [0
,
1]
→ X
such that
γ
(0) =
x
and
γ
(1) =
y
. Composing with
f
gives a map
f ◦ γ
: [0
,
1]
→ {
0
,
1
}
.
Since [0
,
1] is connected, this must be constant. In particular,
f
(
γ
(0)) =
f
(
γ
(1)),
i.e. f (x) = f(y). Since x, y were arbitrary, we know f is constant.
We can use connectivity to distinguish spaces. Apart from the obvious “
X
is connected while
Y
is not”, we can also try to remove points and see what
happens. We will need the following lemma:
Lemma. Suppose
f
:
X → Y
is a homeomorphism and
A ⊆ X
, then
f|
A
:
A →
f(A) is a homeomorphism.
Proof.
Since
f
is a bijection,
f|
A
is a bijection. If
U ⊆ f
(
A
) is open, then
U
=
f
(
A
)
∩ U
0
for some
U
0
open in
Y
. So
f|
−1
A
(
U
) =
f
−1
(
U
0
)
∩ A
is open in
A
.
So f |
A
is continuous. Similarly, we can show that (f|
A
)
−1
is continuous.
Example. [0
,
1]
6'
(0
,
1). Suppose it were. Let
f
: [0
,
1]
→
(0
,
1) be a homeomor-
phism. Let
A
= (0
,
1]. Then
f|
A
: (0
,
1]
→
(0
,
1)
\ {f
(0)
}
is a homeomorphism.
But (0, 1] is connected while (0, 1) \ {f (0)} is disconnected. Contradiction.
Similarly, [0
,
1)
6'
[0
,
1] and [0
,
1)
6'
(0
,
1). Also,
R
n
6' R
for
n >
1, and
S
1
is
not homeomorphic to any subset of R.
3.2.1 Higher connectivity*
We were able to use path connectivity to determine that
R
is not homeomorphic
to
R
n
for
n >
1. If we want to distinguish general
R
n
and
R
m
, we will need to
generalize the idea of path connectivity to higher connectivity.
To do so, we have to formulate path connectivity in a different way. Recall
that
S
0
=
{−
1
,
1
} ' {
0
,
1
} ⊆ R
, while
D
1
= [
−
1
,
1]
'
[0
,
1]
⊆ R
. Then we
can formulate path connectivity as:
X
is path-connected iff any continuous
f : S
0
→ X extends to a continuous γ : D
1
→ X with γ|
S
0
= f.
This is much more easily generalized:
Definition (
n
-connectedness).
X
is
n
-connected if for any
k ≤ n
, any continuous
f : S
k
→ X extends to a continuous F : D
k+1
→ X such that F |
S
k
= f.
For any point
p ∈ R
n
,
R
n
\{p}
is
m
-connected iff
m ≤ n −
2. So
R
n
\{p} 6'
R
m
\ {q} unless n = m. So R
n
6' R
m
.
Unfortunately, we have not yet proven that
R
n
6' R
m
. We casually stated
that
R
n
\ {p}
is
m
-connected iff
m ≤ n −
2. However, while this intuitively
makes sense, it is in fact very difficult to prove. To actually prove this, we will
need tools from algebraic topology.
3.3 Components
If a space is disconnected, we could divide the space into different components,
each of which is (maximally) connected. This is what we will investigate in this
section. Of course, we would have a different notion of “components” for each
type of connectivity. We will first start with path connectivity.
3.3.1 Path components
Defining path components (i.e. components with respect to path connectedness)
is relatively easy. To do so, we need the following lemma.
Lemma. Define
x ∼ y
if there is a path from
x
to
y
in
X
. Then
∼
is an
equivalence relation.
Proof.
(i)
For any
x ∈ X
, let
γ
x
: [0
,
1]
→ X
be
γ
(
t
) =
x
, the constant path. Then
this is a path from x to x. So x ∼ x.
(ii)
If
γ
: [0
,
1]
→ X
is a path from
x
to
y
, then
¯γ
: [0
,
1]
→ X
by
t 7→ γ
(1
− t
)
is a path from y to x. So x ∼ y ⇒ y ∼ x.
(iii)
If
γ
1
is a path from
x
to
y
and
γ
2
is a path from
y
to
z
, then
γ
2
∗γ
1
defined
by
t 7→
(
γ
1
(2t) t ∈ [0, 1/2]
γ
2
(2t − 1) t ∈ [1/2, 1]
is a path from x to z. So x ∼ y, y ∼ z ⇒ x ∼ z.
With this lemma, we can immediately define the path components:
Definition (Path components). Equivalence classes of the relation “
x ∼ y
if
there is a path from x to y” are path components of X.
3.3.2 Connected components
Defining connected components (i.e. components with respect to regular connec-
tivity) is slightly more involved. The key proposition we need is as follows:
Proposition. Suppose
Y
α
⊆ X
is connected for all
α ∈ T
and that
T
α∈T
Y
α
6
=
∅
.
Then Y =
S
α∈T
Y
α
is connected.
Proof.
Suppose the contrary that
A
and
B
disconnect
Y
. Then
A
and
B
are
open in
Y
. So
A
=
Y ∩ A
0
and
B
=
Y ∩ B
0
, where
A
0
, B
0
are open in
X
. For
any fixed α, let
A
α
= Y
α
∩ A = Y
α
∩ A
0
, B
α
= Y
α
∩ B = Y
α
∩ B
0
.
Then they are open in Y
α
. Since Y = A ∪ B, we have
Y
α
= Y ∩ Y
α
= (A ∪ B) ∩ Y
α
= A
α
∪ B
α
.
Since A ∩ B = ∅, we have
A
α
∩ B
α
= Y
α
∩ (A ∩ B) = ∅.
So
A
α
, B
α
are disjoint. So
Y
α
is connected but is the disjoint union of open
subsets A
α
, B
α
.
By definition of connectivity, this can only happen if A
α
= ∅ or B
α
= ∅.
However, by assumption,
\
α∈T
Y
α
6
=
∅
. So pick
y ∈
\
α∈T
Y
α
. Since
y ∈ Y
,
either
y ∈ A
or
y ∈ B
. wlog, assume
y ∈ A
. Then
y ∈ Y
α
for all
α
implies that
y ∈ A
α
for all
α
. So
A
α
is non-empty for all
α
. So
B
α
is empty for all
α
. So
B = ∅.
So A and B did not disconnect Y after all. Contradiction.
Using this lemma, we can define connected components:
Definition (Connected component). If x ∈ X, define
C(x) = {A ⊆ X : x ∈ A and A is connected}.
Then C(x) =
[
A∈C(x)
A is the connected component of x.
C
(
x
) is the largest connected subset of
X
containing
x
. To show this, we
first note that
{x} ∈ C
(
x
). So
x ∈ C
(
x
). To show that it is connected, just note
that
x ∈
T
A∈C(x)
A
. So
T
A∈C(x)
A
is non-empty. By our previous proposition,
this implies that C(x) is connected.
Lemma. If y ∈ C(x), then C(y) = C(x).
Proof.
Since
y ∈ C
(
x
) and
C
(
x
) is connected,
C
(
x
)
⊆ C
(
y
). So
x ∈ C
(
y
). Then
C(y) ⊆ C(x). So C(x) = C(y).
It follows that
x ∼ y
if
x ∈ C
(
y
) is an equivalence relation and the connected
components of X are the equivalence classes.
Example.
–
Let
X
= (
−∞,
0)
∪
(0
, ∞
)
⊆ R
. Then the connected components are
(−∞, 0) and (0, ∞), which are also the path components.
–
Let
X
=
Q ⊆ R
. Then
C
(
x
) =
{x}
for all
x ∈ X
. In this case, we say
X
is totally disconnected.
Note that
C
(
x
) and
X \ C
(
x
) need not disconnect
X
, even though it is the
case in our first example. For this, we must need
C
(
x
) and
X \C
(
x
) to be open
as well. For example, in Example 2, C(x) = {x} is not open.
It is also important to note that path components need not be equal to the
connected components, as illustrated by the following example. However, since
path connected spaces are connected, the path component containing
x
must be
a subset of C(x).
Example. Let Y = {(0, y) : y ∈ R} ⊆ R
2
be the y axis.
Let Z = {(x,
1
x
sin
1
x
) : x ∈ (0, ∞)}.
x
y
Y
Z
Let
X
=
Y ∪ Z ⊆ R
2
. We claim that
Y
and
Z
are the path components of
X
.
Since
Y
and
Z
are individually path connected, it suffices to show that there is
no continuous γ : [0, 1] → X with γ(0) = (0, 0), γ(1) = (1, sin 1).
Suppose
γ
existed. Then the function
π
2
◦ γ
: [0
,
1]
→ R
projecting the
path to the
y
direction is continuous. So it is bounded. Let
M
be such that
π
2
◦ γ
(
t
)
≤ M
for all
t ∈
[0
,
1]. Let
W
=
X ∩
(
R ×
(
−∞, M
]) be the part of
X
that lies below y = M. Then im γ ⊆ W .
However,
W
is disconnected: pick
t
0
with
1
t
0
sin
1
t
0
> M
. Then
W ∩
((
−∞, t
0
)
× R
) and
W ∩
((
t
0
, ∞
)
× R
) disconnect
W
. This is a contradiction,
since γ is continuous and [0, 1] is connected.
We also claim that
X
is connected: suppose
A
and
B
disconnect
X
. Then
since
Y
and
Z
are connected, either
Y ⊆ A
or
Y ⊆ B
;
Z ⊆ A
or
Z ⊆ B
. If both
Y ⊆ A, Z ⊆ A, then B = ∅, which is not possible.
So wlog assume
A
=
Y
,
B
=
Z
. This is also impossible, since
Y
is not open
in
X
as it is not a union of balls (any open ball containing a point in
Y
will also
contain a point in Z). Hence X must be connected.
Finally, recall that we showed that path-connected subsets are connected.
While the converse is not true in general, there are special cases where it is true.
Proposition. If U ⊆ R
n
is open and connected, then it is path-connected.
Proof. Let A be a path component of U. We first show that A is open.
Let
a ∈ A
. Since
U
is open,
∃ε >
0 such that
B
ε
(
a
)
⊆ U
. We know that
B
ε
(
a
)
' Int
(
D
n
) is path-connected (e.g. use line segments connecting the points).
Since
A
is a path component and
a ∈ A
, we must have
B
ε
(
a
)
⊆ A
. So
A
is an
open subset of U.
Now suppose b ∈ U \ A. Then since U is open, ∃ε > 0 such that B
ε
(b) ⊆ U .
Since
B
ε
(
b
) is path-connected, so if
B
ε
(
b
)
∩ A 6
=
∅
, then
B
ε
(
b
)
⊆ A
. But this
implies
b ∈ A
, which is a contradiction. So
B
ε
(
b
)
∩ A
=
∅
. So
B
ε
(
b
)
⊆ U \ A
.
Then U \ A is open.
So
A, U \ A
are disjoint open subsets of
U
. Since
U
is connected, we must
have U \ A empty (since A is not). So U = A is path-connected.
4 Compactness
Compactness is an important concept in topology. It can be viewed as a
generalization of being “closed and bounded” in
R
. Alternatively, it can also be
viewed as a generalization of being finite. Compact sets tend to have a lot of
really nice properties. For example, if
X
is compact and
f
:
X → R
is continuous,
then f is bounded and attains its bound.
There are two different definitions of compactness - one based on open covers
(which we will come to shortly), and the other based on sequences. In metric
spaces, these two definitions are equal. However, in general topological spaces,
these notions can be different. The first is just known as “compactness” and the
second is known as “sequential compactness”.
The actual definition of compactness is rather weird and unintuitive, since it
is an idea we haven’t seen before. To quote Qiaochu Yuan’s math.stackexchange
answer (http://math.stackexchange.com/a/371949),
The following story may or may not be helpful. Suppose you live
in a world where there are two types of animals: Foos, which are
red and short, and Bars, which are blue and tall. Naturally, in your
language, the word for Foo over time has come to refer to things
which are red and short, and the word for Bar over time has come to
refer to things which are blue and tall. (Your language doesn’t have
separate words for red, short, blue, and tall.)
One day a friend of yours tells you excitedly that he has discovered
a new animal. “What is it like?” you ask him. He says, “well, it’s
sort of Foo, but. . . ”
The reason he says it’s sort of Foo is that it’s short. However,
it’s not red. But your language doesn’t yet have a word for “short,”
so he has to introduce a new word — maybe “compact”. . .
The situation with compactness is sort of like the above. It
turns out that finiteness, which you think of as one concept (in the
same way that you think of “Foo” as one concept above), is really
two concepts: discreteness and compactness. You’ve never seen
these concepts separated before, though. When people say that
compactness is like finiteness, they mean that compactness captures
part of what it means to be finite in the same way that shortness
captures part of what it means to be Foo.
But in some sense you’ve never encountered the notion of com-
pactness by itself before, isolated from the notion of discreteness (in
the same way that above you’ve never encountered the notion of
shortness by itself before, isolated from the notion of redness). This
is just a new concept and you will to some extent just have to deal
with it on its own terms until you get comfortable with it.
4.1 Compactness
Definition (Open cover). Let
U ⊆ P
(
X
) be a topology on
X
. An open cover of
X is a subset V ⊆ U such that
[
V ∈V
V = X.
We say V covers X.
If V
0
⊆ V, and V
0
covers X, then we say V
0
is a subcover of V.
Definition (Compact space). A topological space
X
is compact if every open
cover V of X has a finite subcover V
0
= {V
1
, ··· , V
n
} ⊆ V.
Note that some people (especially algebraic geometers) call this notion “quasi-
compact”, and reserve the name “compact” for “quasi-compact and Hausdorff”.
We will not adapt this notion.
Example.
(i)
If
X
is finite, then
P
(
X
) is finite. So any open cover of
X
is finite. So
X
is compact.
(ii)
Let
X
=
R
and
V
=
{
(
−R, R
) :
R ∈ R, R >
0
}
. Then this is an open cover
with no finite subcover. So
R
is not compact. Hence all open intervals are
not compact since they are homeomorphic to R.
(iii) Let X = [0, 1] ∩ Q. Let
U
n
= X \ (α − 1/n, α + 1/n).
for some irrational α in (0, 1) (e.g. α = 1/
√
2).
Then
S
n>0
U
n
=
X
since
α
is irrational. Then
V
=
{U
n
:
n ∈ Z >
0
}
is
an open cover of X. Since this has no finite subcover, X is not compact.
Theorem. [0, 1] is compact.
Again, since this is not true for [0
,
1]
∩ Q
, we must use a special property of
the reals.
Proof. Suppose V is an open cover of [0, 1]. Let
A = {a ∈ [0, 1] : [0, a] has a finite subcover of V}.
First show that
A
is non-empty. Since
V
covers [0
,
1], in particular, there is some
V
0
that contains 0. So {0} has a finite subcover V
0
. So 0 ∈ A.
Next we note that by definition, if 0 ≤ b ≤ a and a ∈ A, then b ∈ A.
Now let α = sup A. Suppose α < 1. Then α ∈ [0, 1].
Since
V
covers
X
, let
α ∈ V
α
. Since
V
α
is open, there is some
ε
such that
B
ε
(
α
)
⊆ V
α
. By definition of
α
, we must have
α − ε/
2
∈ A
. So [0
, α − ε/
2]
has a finite subcover. Add
V
α
to that subcover to get a finite subcover of
[0
, α
+
ε/
2]. Contradiction (technically, it will be a finite subcover of [0
, η
] for
η = min(α + ε/2, 1), in case α + ε/2 gets too large).
So we must have α = sup A = 1.
Now we argue as before:
∃V
1
∈ V
such that 1
∈ V
1
and
∃ε >
0 with
(1
− ε,
1]
⊆ V
1
. Since 1
− ε ∈ A
, there exists a finite
V
0
⊆ V
which covers
[0, 1 − ε/2]. Then W = V
0
∪ {V
1
} is a finite subcover of V.
We mentioned that compactness is a generalization of “closed and bounded”.
We will now show that compactness is indeed in some ways related to closedness.
Proposition. If
X
is compact and
C
is a closed subset of
X
, then
C
is also
compact.
Proof.
To prove this, given an open cover of
C
, we need to find a finite subcover.
To do so, we need to first convert it into an open cover of
X
. We can do so by
adding
X \C
, which is open since
C
is closed. Then since
X
is compact, we can
find a finite subcover of this, which we can convert back to a finite subcover of
C.
Formally, suppose
V
is an open cover of
C
. Say
V
=
{V
α
:
α ∈ T }
. For
each
α
, since
V
α
is open in
C
,
V
α
=
C ∩ V
0
α
for some
V
0
α
open in
X
. Also, since
S
α∈T
V
a
= C, we have
S
α∈T
V
0
α
⊇ C.
Since
C
is closed,
U
=
X \ C
is open in
X
. So
W
=
{V
0
α
:
α ∈ T } ∪ {U}
is an open cover of
X
. Since
X
is compact,
W
has a finite subcover
W
0
=
{V
0
α
1
, ··· , V
0
α
n
, U}
(
U
may or may not be in there, but it doesn’t matter). Now
U ∩ C = ∅. So {V
α
1
, ··· , V
α
n
} is a finite subcover of C.
The converse is not always true, but holds for Hausdorff spaces.
Proposition. Let
X
be a Hausdorff space. If
C ⊆ X
is compact, then
C
is
closed in X.
Proof. Let U = X \ C. We will show that U is open.
For any
x
, we will find a
U
x
such that
U
x
⊆ U
and
x ∈ U
x
. Then
U
=
S
x∈U
U
x
will be open since it is a union of open sets.
To construct
U
x
, fix
x ∈ U
. Since
X
is Hausdorff, for each
y ∈ C
,
∃U
xy
, W
xy
open neighbourhoods of x and y respectively with U
xy
∩ W
xy
= ∅.
W
xy
U
xy
C
x
y
Then
W
=
{W
xy
∩ C
:
y ∈ C}
is an open cover of
C
. Since
C
is compact, there
exists a finite subcover W
0
= {W
xy
1
∩ C, ··· , W
xy
n
∩ C}.
Let
U
x
=
T
n
i=1
U
xy
i
. Then
U
x
is open since it is a finite intersection of open
sets. To show
U
x
⊆ U
, note that
W
x
=
S
n
i=1
W
xy
i
⊇ C
since
{W
xy
i
∩ C}
is an
open cover. We also have W
x
∩ U
x
= ∅. So U
x
⊆ U . So done.
W
x
U
x
C
x
After relating compactness to closedness, we will relate it to boundedness.
First we need to define boundedness for general metric spaces.
Definition (Bounded metric space). A metric space (X, d) is bounded if there
exists M ∈ R such that d(x, y) ≤ M for all x, y ∈ X.
Example. A ⊆ R is bounded iff A ⊆ [−N, N] for some N ∈ R.
Note that being bounded is not a topological property. For example, (0
,
1)
'
R
but (0
,
1) is bounded while
R
is not. It depends on the metric
d
, not just the
topology it induces.
Proposition. A compact metric space (X, d) is bounded.
Proof.
Pick
x ∈ X
. Then
V
=
{B
r
(
x
) :
r ∈ R
+
}
is an open cover of
X
. Since
X is compact, there is a finite subcover {B
r
1
(x), ··· , B
r
n
(x)}.
Let
R
=
max{r
1
, ··· , r
n
}
. Then
d
(
x, y
)
< R
for all
y ∈ X
. So for all
y, z ∈ X,
d(y, z) ≤ d(y, x) + d(x, z) < 2R
So X is bounded.
Theorem (Heine-Borel). C ⊆ R is compact iff C is closed and bounded.
Proof. Since R is a metric space (hence Hausdorff), C is also a metric space.
So if
C
is compact,
C
is closed in
R
, and
C
is bounded, by our previous two
propositions.
Conversely, if
C
is closed and bounded, then
C ⊆
[
−N, N
] for some
N ∈ R
.
Since [
−N, N
]
'
[0
,
1] is compact, and
C
=
C ∩
[
−N, N
] is closed in [
−N, N
],
C
is compact.
Corollary. If A ⊆ R is compact, ∃α ∈ A such that α ≥ a for all a ∈ A.
Proof.
Since
A
is compact, it is bounded. Let
α
=
sup A
. Then by definition,
α ≥ a for all a ∈ A. So it is enough to show that α ∈ A.
Suppose
α 6∈ A
. Then
α ∈ R \ A
. Since
A
is compact, it is closed in
R
. So
R \A
is open. So
∃ε >
0 such that
B
ε
(
α
)
⊆ R \ A
, which implies that
a ≤ α − ε
for all
a ∈ A
. This contradicts the assumption that
α
=
sup A
. So we can
conclude α ∈ A.
We call α = max A the maximum element of A.
We have previously proved that if
X
is connected and
f
:
X → Y
, then
im f ⊆ Y is connected. The same statement is true for compactness.
Proposition. If
f
:
X → Y
is continuous and
X
is compact, then
im f ⊆ Y
is
also compact.
Proof.
Suppose
V
=
{V
α
:
α ∈ T }
is an open cover of
im f
. Since
V
α
is open in
im f, we have V
α
= im f ∩ V
0
α
, where V
0
α
is open in Y . Then
W
α
= f
−1
(V
α
) = f
−1
(V
0
α
)
is open in
X
. If
x ∈ X
then
f
(
x
) is in
V
α
for some
α
, so
x ∈ W
α
. Thus
W = {W
α
: α ∈ T } is an open cover of X.
Since X is compact, there’s a finite subcover {W
α
1
, ··· , W
α
n
} of W.
Since V
α
⊆ im f , f(W
α
) = f(f
−1
(V
α
)) = V
α
. So
{V
α
1
, ··· , V
α
n
}
is a finite subcover of V.
Theorem (Maximum value theorem). If
f
:
X → R
is continuous and
X
is
compact, then ∃x ∈ X such that f(x) ≥ f(y) for all y ∈ X.
Proof.
Since
X
is compact,
im f
is compact. Let
α
=
max{im f}
. Then
α ∈ im f
.
So ∃x ∈ X with f(x) = α. Then by definition f(x) ≥ f (y) for all y ∈ X.
Corollary. If
f
: [0
,
1]
→ R
is continuous, then
∃x ∈
[0
,
1] such that
f
(
x
)
≥ f
(
y
)
for all y ∈ [0, 1]
Proof. [0, 1] is compact.
4.2 Products and quotients
4.2.1 Products
Recall the product topology on
X × Y
.
U ⊆ X × Y
is open if it is a union of
sets of the form V × W such that V ⊆ X, W ⊆ Y are open.
The major takeaway of this section is the following theorem:
Theorem. If X and Y are compact, then so is X × Y .
Proof.
First consider the special type of open cover
V
of
X × Y
such that every
U ∈ V has the form U = V × W , where V ⊆ X and W ⊆ Y are open.
For every (x, y) ∈ X × Y , there is U
xy
∈ V with (x, y) ∈ U
xy
. Write
U
xy
= V
xy
× W
xy
,
where V
xy
⊆ X, W
xy
⊆ Y are open, x ∈ V
xy
, y ∈ W
xy
.
Fix
x ∈ X
. Then
W
x
=
{W
xy
:
y ∈ Y }
is an open cover of
Y
. Since
Y
is
compact, there is a finite subcover {W
xy
1
, ··· , W
xy
n
}.
Then
V
x
=
T
n
i=1
V
xy
i
is a finite intersection of open sets. So
V
x
is open in
X
.
Moreover, V
x
= {U
xy
1
, ··· , U
xy
n
} covers V
x
× Y .
x
U
xy
i
V
x
× Y
X
Y
Now
O
=
{V
x
:
x ∈ X}
is an open cover of
X
. Since
X
is compact, there is
a finite subcover
{V
x
1
, ··· , V
x
m
}
. Then
V
0
=
S
m
i=1
V
x
i
is a finite subset of
V
,
which covers all of X × Y .
In the general, case, suppose
V
is an open cover of
X × Y
. For each (
x, y
)
∈
X × Y
,
∃U
xy
∈ V
with (
x, y
)
∈ U
xy
. Since
U
xy
is open,
∃V
xy
⊆ X, W
xy
⊆ Y
open with V
xy
× W
xy
⊆ U
xy
and x ∈ V
xy
, y ∈ W
xy
.
Then
Q
=
{V
xy
× W
xy
: (
x, y
)
∈
(
X, Y
)
}
is an open cover of
X × Y
of
the type we already considered above. So it has a finite subcover
{V
x
1
y
1
×
W
x
1
y
1
, ··· , V
x
n
y
n
×W
x
n
y
n
}
. Now
V
x
i
y
i
×W
x
i
y
i
⊆ U
x
i
y
i
. So
{U
x
1
y
1
, ··· , U
x
n
y
n
}
is a finite subcover of X × Y .
Example. The unit cube [0, 1]
n
= [0, 1] × [0, 1] × ··· × [0, 1] is compact.
Corollary (Heine-Borel in
R
n
).
C ⊆ R
n
is compact iff
C
is closed and bounded.
Proof.
If
C
is bounded,
C ⊆
[
−N, N
]
n
for some
N ∈ R
, which is compact. The
rest of the proof is exactly the same as for n = 1.
4.2.2 Quotients
It is easy to show that the quotient of a compact space is compact, since every
open subset in the quotient space can be projected back to an open subset the
original space. Hence we can project an open cover from the quotient space to
the original space, and get a finite subcover. The details are easy to fill in.
Instead of proving the above, in this section, we will prove that compact
quotients have some nice properties. We start with a handy proposition.
Proposition. Suppose
f
:
X → Y
is a continuous bijection. If
X
is compact
and Y is Hausdorff, then f is a homeomorphism.
Proof.
We show that
f
−1
is continuous. To do this, it suffices to show (
f
−1
)
−1
(
C
)
is closed in
Y
whenever
C
is closed in
X
. By hypothesis,
f
is a bijection . So
(f
−1
)
−1
(C) = f(C).
Supposed
C
is closed in
X
. Since
X
is compact,
C
is compact. Since
f
is
continuous,
f
(
C
) = (
im f|
C
) is compact. Since
Y
is Hausdorff and
f
(
C
)
⊆ Y
is
compact, f (C) is closed.
We will apply this to quotients.
Recall that if
∼
is an equivalence relation on
X
,
π
:
X → X/∼
is continuous
iff f ◦ π : X → Y is continuous.
Corollary. Suppose
f
:
X/∼ → Y
is a bijection,
X
is compact,
Y
is Hausdorff,
and f ◦ π is continuous, then f is a homeomorphism.
Proof.
Since
X
is compact and
π
:
X 7→ X/∼
is continuous,
im π ⊆ X/∼
is
compact. Since
f ◦ π
is continuous,
f
is continuous. So we can apply the
proposition.
Example. Let
X
=
D
2
and
A
=
S
1
⊆ X
. Then
f
:
X/A 7→ S
2
by (
r, θ
)
7→
(1, πr, θ) in spherical coordinates is a homeomorphism.
We can check that
f
is a continuous bijection and
D
2
is compact. So
X/A ' S
2
.
4.3 Sequential compactness
The other definition of compactness is sequential compactness. We will not
do much with it, but only prove that it is the same as compactness for metric
spaces.
Definition (Sequential compactness). A topological space
X
is sequentially
compact if every sequence (
x
n
) in
X
has a convergent subsequence (that converges
to a point in X!).
Example. (0
,
1)
⊆ R
is not sequentially compact since no subsequence of (1
/n
)
converges to any x ∈ (0, 1).
To show that sequential compactness is the same as compactness, we will
first need a lemma.
Lemma. Let (
x
n
) be a sequence in a metric space (
X, d
) and
x ∈ X
. Then (
x
n
)
has a subsequence converging to
x
iff for every
ε >
0,
x
n
∈ B
ε
(
x
) for infinitely
many n (∗).
Proof.
If (
x
n
i
)
→ x
, then for every
ε
, we can find
I
such that
i > I
implies
x
n
i
∈ B
ε
(x) by definition of convergence. So (∗) holds.
Now suppose (
∗
) holds. We will construct a sequence
x
n
i
→ x
inductively.
Take n
0
= 0. Suppose we have defined x
n
0
, ··· , x
n
i−1
.
By hypothesis,
x
n
∈ B
1/i
(
x
) for infinitely many
n
. Take
n
i
to be smallest
such n with n
i
> n
i−1
.
Then d(x
n
i
, x) <
1
i
implies that x
n
i
→ x.
Here we will only prove that compactness implies sequential compactness,
and the other direction is left as an exercise for the reader.
Theorem. If (
X, d
) is a compact metric space, then
X
is sequentially compact.
Proof.
Suppose
x
n
is a sequence in
X
with no convergent subsequence. Then
for any
y ∈ X
, there is no subsequence converging to
y
. By lemma, there exists
ε > 0 such that x
n
∈ B
ε
(y) for only finitely many n.
Let
U
y
=
B
ε
(
y
). Now
V
=
{U
y
:
y ∈ X}
is an open cover of
X
. Since
X
is
compact, there is a finite subcover
{U
y
1
, ··· , U
y
m
}
. Then
x
n
∈
S
m
i=1
U
y
i
=
X
for only finitely many n. This is nonsense, since x
n
∈ X for all n!
So x
n
must have a convergent subsequence.
Example. Let
X
=
C
[0
,
1] with the topology induced
d
∞
(uniform norm). Let
f
n
(x) =
nx x ∈ [0, 1/n]
2 − nx x ∈ [1/n, 2/n]
0 x ∈ [2/n, 1]
x
y
Then
f
n
(
x
)
→
0 for all
x ∈
[0
,
1]. We now claim that
f
n
has no convergent
subsequence.
Suppose
f
n
i
→ f
. Then
f
n
i
(
x
)
→ f
(
x
) for all
x ∈
[0
,
1]. However, we know
that
f
n
i
(
x
)
→
0 for all
x ∈
[0
,
1]. So
f
(
x
) = 0. However,
d
∞
(
f
n
i
,
0) = 1. So
f
n
i
6→ 0.
It follows that
B
1
(0)
⊆ X
is not sequentially compact. So it is not compact.
4.4 Completeness
The course ends with a small discussion about completeness. This really should
belong to the chapter on metric spaces, since this is all about metric spaces. How-
ever, we put it here because the (only) proposition we have is about compactness,
which was just defined recently.
Similar to what we did in Analysis, we can define Cauchy sequences.
Definition (Cauchy sequence). Let (X, d) be a metric space. A sequence (x
n
)
in
X
is Cauchy if for every
ε >
0,
∃N
such that
d
(
x
n
, x
m
)
< ε
for all
n, m ≥ N
.
Example.
(i) x
n
=
P
n
k=1
1/k is not Cauchy.
(ii)
Let
X
= (0
,
1)
⊆ R
with
x
n
=
1
n
. Then this is Cauchy but does not
converge.
(iii)
If
x
n
→ x ∈ X
, then
x
n
is Cauchy. The proof is the same as that in
Analysis I.
(iv)
Let
X
=
Q ⊆ R
. Then the sequence (2
,
2
.
7
,
2
.
71
,
2
.
718
, ···
) is Cauchy but
does not converge in Q.
Exactly analogous to what we did in Analysis, we can also define a complete
space.
Definition (Complete space). A metric space (
X, d
) is complete if every Cauchy
sequence in X converges to a limit in X.
Example. (0, 1) and Q are not complete.
Proposition. If X is a compact metric space, then X is complete.
Proof.
Let
x
n
be a Cauchy sequence in
X
. Since
X
is sequentially compact,
there is a convergent subsequence x
n
i
→ x. We will show that x
n
→ x.
Given
ε >
0, pick
N
such that
d
(
x
n
, x
m
)
< ε/
2 for
n, m ≥ N
. Pick
I
such that
n
I
≥ N
and
d
(
x
n
i
, x
)
< ε/
2 for all
i > I
. Then for
n ≥ n
I
,
d(x
n
, x) ≤ d(x
n
, x
n
I
) + d(x
n
I
, x) < ε. So x
n
→ x.
Corollary. R
n
is complete.
Proof.
If (
x
n
)
⊆ R
n
is Cauchy, then (
x
n
)
⊆
¯
B
R
(0) for some
R
, and
¯
B
R
(0) is
compact. So it converges.
Note that completeness is not a topological property.
R '
(0
,
1) but
R
is
complete while (0, 1) is not.