II Logic and Set Theory - Posets and Zorn's lemma

3Posets and Zorn's lemma

II Logic and Set Theory

3.1 Partial orders

Definition (Partial ordering (poset)). A partially ordered set or poset is a pair

(X, ≤), where X is a set and ≤ is a relation on X that satisfies

(i) x ≤ x for all x ∈ X (reflexivity)

(ii) x ≤ y and y ≤ z ⇒ x ≤ z (transitivity)

(iii) x ≤ y and y ≤ x ⇒ x = y (antisymmetry)

We write

x < y

to mean

x ≤ y

and

x 

. We can also define posets in terms

of <:

(i) x < x for all x ∈ X (irreflexive)

(ii) x < y and y < z ⇒ x < z (transitive)

Example.

(i) Any total order is (trivially) a partial order.

(ii) N with “x ≤ y” if x | y is a partial order.

(iii) P(S) with ⊆ for any set S is a partial order.

(iv) Any subset of P(S) with inclusion is a partial order.

(v) We can use a diagram

Where “above” means “greater”. So

a ≤ b ≤ c

a ≤ d ≤ e

, and what

follows by transitivity. This is a Hasse diagram.

Definition (Hasse diagram). A Hasse diagram for a poset

consists of

a drawing of the points of

in the plane with an upwards line from

y if y covers x:

Definition (Cover). In a poset,

covers

y > x

and no

has

y > z > x

Hasse diagrams can be useful — e.g. N, or useless, e.g. Q.

(vi)

The following example shows that we cannot assign “heights” or “ranks”

to posets:

(vii) We can also have complicated structures:

(viii)

Or the empty poset (let

be any set and nothing is less than anything

else).

While there are many examples of posets, all we care about are actually

power sets and their subsets only.

Often, we want to study subsets of posets. For example, we might want to

know if a subset has a least element. All subsets are equal, but some subsets are

more equal than others. A particular interesting class of subsets is a chain.

Definition (Chain and antichain). In a poset, a subset

is a chain if it is

totally ordered, i.e. for all

x ≤ y

y ≤ x

. An antichain is a subset in

which no two things are related.

Example. In (N, |), 1, 2, 4, 8, 16, ··· is a chain.

In (v), {a, b, c} or {a, c} are chains.

R is a chain in R.

Definition (Upper bound and supremum). For

S ⊂ X

, an upper bound for

an x ∈ X such that ∀y ∈ S : x ≥ y.

x ∈ X

is a least upper bound, supremum or join of

, written

sup S

, if

is an upper bound for

, and for all

y ∈ X

, if

is an upper bound,

then y ≥ x.

Example.

(i) In R, {x : x <

√

2} has an upper bound 7, and has a supremum

√

(ii)

In (v) above, consider

{a, b}

. Upper bounds are

and

. So

sup

However, {b, d} has no upper bound!

(iii) In (vii), {a, b} has upper bounds c, d, e, but has no least upper bound.

Definition (Complete poset). A poset

is complete if every

S ⊆ X

has a

supremum. In particular, it has a greatest element (i.e.

such that

∀y

x ≥ y

namely sup X, and least element (i.e. x such that ∀y : x ≤ y), namely sup ∅.

It is very important to remember that this definition does not require that

the subset

is bounded above or non-empty. This is different from the definition

of metric space completeness.

Example.

– R is not complete because R itself has no supremum.

–

1] is complete because every subset is bounded above, and so has a least

upper bound. Also, ∅ has a supremum of 0.

– (0, 1) is not complete because (0, 1) has no upper bound.

– P

(

) for any

is always complete, because given any

i ∈ A}

, where

each A

⊆ S,

is its supremum.

Now we are going to derive fixed-point theorems for complete posets. We

start with a few definitions:

Definition (Fixed point). A fixed point of a function

X → X

is an

such

that f(x) = x.

Definition (Order-preserving function). For a poset

X → X

is order-

preserving of x ≤ y ⇒ f(x) ≤ f(y).

Example.

– On N, x 7→ x + 1 is order-preserving

– On Z, x 7→ x − 1 is order-preserving

–

On (0

1),

x 7→

1+x

is order-preserving (this function halves the distance

from x to 1).

– On P(S), let some fixed i ∈ S. Then A 7→ A ∪ {i} is order-preserving.

Not every order-preserving

has a fixed point (e.g. first two above). However,

we have

Theorem (Knaster-Tarski fixed point theorem). Let

be a complete poset,

and f : X → X be a order-preserving function. Then f has a fixed point.

Proof.

To show that

(

) =

, we need

(

)

≤ x

and

(

)

≥ x

. Let’s not be

too greedy and just want half of it:

Let

x ≤ f

(

)

}

. Let

sup E

. We claim that this is a fixed point,

by showing f(s) ≤ s and s ≤ f(s).

To show

s ≤ f

(

), we use the fact that

is the least upper bound. So if

we can show that

(

) is also an upper bound, then

s ≤ f

(

). Now let

x ∈ E

x ≤ s

. Therefore

(

)

≤ f

(

) by order-preservingness. Since

x ≤ f

(

) (by

definition of E) x ≤ f(x) ≤ f(s). So f(s) is an upper bound.

To show

(

)

≤ s

, we simply have to show

(

)

∈ E

, since

is an upper

bound. But we already know

s ≤ f

(

). By order-preservingness,

(

)

≤ f

(

)).

So f(s) ∈ E by definition.

While this proof looks rather straightforward, we need to first establish that

s ≤ f

(

), then use this fact to show

(

)

≤ s

. If we decided to show

(

)

≤ s

first, then we would fail!

The very typical application of Knaster-Tarski is the quick, magic proof of

Cantor-Shr¨oder-Bernstein theorem.

Corollary (Cantor-Schr¨oder-Bernstein theorem). Let

A, B

be sets. Let

A →

B and g : B → A be injections. Then there is a bijection h : A → B.

Proof.

We try to partition

into

and

, and

into

and

, such that

f(P ) = R and g(S) = Q. Then we let h = f on R and g

−1

on Q.

Since S = B \ R and Q = A \ P , so we want

P = A \g(B \f(P ))

Since the function

P 7→ A \ g

(

B \ f

(

)) from

(

) to

(

) is order-preserving

(and P(a) is complete), the result follows.

The next result we have is Zorn’s lemma. The main focus of Zorn’s lemma is

on maximal elements.

Definition (Maximal element). In a poset

x ∈ X

is maximal if no

y ∈ X

has y > x.

Caution! Under no circumstances confuse a maximal element with a maximum

element, except under confusing circumstances! A maximum element is defined

as an

such that all

y ∈ X

satisfies

y ≤ x

. These two notions are the same in

totally ordered sets, but are very different in posets.

Example. In the poset

c and e are maximal.

Not every poset has a maximal element, e.g.

N, Q, R

. In each of these, not

only are they incomplete. They have chains that are not bounded above.

Theorem (Zorn’s lemma). Assuming Axiom of Choice, let

be a (non-empty)

poset in which every chain has an upper bound. Then it has a maximal element.

Note that “non-empty” is not a strictly necessary condition, because if

an empty poset, then the empty chain has no upper bound. So the conditions

can never be satisfied.

The actual proof of Zorn’s lemma is rather simple, given what we’ve had so

far. We “hunt” for the maximal element. We start with

. If it is maximal,

done. If not, we find a bigger

. If

is maximal, done. Otherwise, keep go on.

If we never meet a maximal element, then we have an infinite chain. This

has an upper bound

. If this is maximal, done. If not, find

ω+1

> x

. Keep

going on.

We have not yet reached a contradiction. But suppose we never meet a

maximal element. If

is countable, and we can reach

, then we have found

uncountably many elements in a countable set, which is clearly nonsense!

Since the ordinals can be arbitrarily large (Hartogs’ lemma), if we never

reach a maximal element, then we can get find more elements that X has.

Proof.

Suppose not. So for each

x ∈ X

, we have

′

∈ X

with

′

> x

. We denote

the-element-larger-than-x by x

′

We know that each chain C has an upper bound, say u(C).

Let γ = γ(X), the ordinal-larger-than-X by Hartogs’ lemma.

We pick x ∈ X, and define x

for α < γ recursively by

– x

= x

– x

= x

′

– x

= u({x

: α < λ})

′

for non-zero limit λ

Of course, we have to show that

α < λ}

is a chain. This is trivial by

induction.

Then α 7→ x

is an injection from γ → X. Contradiction.

Note that we could as well have defined

(

α < λ}

), and we can

easily prove it is still an injection. However, we are lazy and put the “prime”

just to save a few lines of proof.

This proof was rather easy. However, this is only because we are given

ordinals, definition by recursion, and Hartogs’ lemma. Without these tools, it is

rather difficult to prove Zorn’s lemma.

A typical application of Zorn’s lemma is: Does every vector space have a

basis? Recall that a basis of

is a subset of

that is linearly independent (no

finite linear combination = 0) and spanning (ie every

x ∈ V

is a finite linear

combination from it).

Example.

– Let V be the space of all real polynomials. A basis is {1, x, x

, x

, ···}.

–

Let

be the space of all real sequences. Let e

be the sequence with all

0 except 1 in the

th place. However,

{

}

is not a basis, since 1

, ···

cannot be written as a finite linear combination of them. In fact, there is

no countable basis (easy exercise). It turns out that there is no “explicit”

basis.

–

Take

as a vector space over

. A basis here, if exists, is called a Hamel

basis.

Using Zorn’s lemma, we can prove that the answer is positive.

Theorem. Every vector space V has a basis.

Proof. We go for a maximal linearly independent subset.

Let

be the set of all linearly independent subsets of

, ordered by inclusion.

We want to find a maximal

B ∈ X

. Then

is a basis. Otherwise, if

does

not span

, choose

x ∈ span B

. Then

B ∪ {x}

is independent, contradicting

maximality.

So we have to find such a maximal

. By Zorn’s lemma, we simply have to

show that every chain has an upper bound.

Given a chain

i ∈ I}

, a reasonable guess is to try the union. Let

. Then

A ⊆ A

for all

, by definition. So it is enough to check that

A ∈ X, i.e. is linearly independent.

Suppose not. Say

···

= 0 for some

···λ

scalars (not all

0). Suppose

∈ A

, ···x

∈ A

for some

, ···i

∈ I

. Then there is some

that contains all

, since they form a finite chain. So

contains all

This contradicts the independence of A

Hence by Zorn’s lemma, X has a maximal element. Done.

Another application is the completeness theorem for propositional logic when

P , the primitives, can be uncountable.

Theorem (Model existence theorem (uncountable case)). Let

S ⊆ L

(

) for any

set of primitive propositions P . Then if S is consistent, S has a model.

Proof.

We need a consistent

S ⊆ S

such that

∀t ∈ L

t ∈

¬t ∈

. Then we

have a valuation

(

) =

(

1 t ∈

0 t ∈

, as in our original proof for the countable

case.

So we seek a maximal consistent

S ⊇ S

. If

is maximal, then if

t ∈

, then

we must have

S ∪ {t}

inconsistent, i.e.

S ∪ {t} ⊢ ⊥

. By deduction theorem, this

means that

S ⊢ ¬t

. By maximality, we must have

¬t ∈

. So either

¬t

is in

Now we show that there is such a maximal

. Let

{T ⊆ L

T is consistent , T ⊇ S}

. Then

X 

∅

since

S ∈ X

. We show that any

non-empty chain has an upper bound. An obvious choice is, again the union.

Let

i ∈ I}

be a non-empty chain. Let

. Then

T ⊇ T

for all

So to show that T is an upper bound, we have to show T ∈ X.

Certainly,

T ⊇ S

, as any

contains

(and the chain is non-empty). So we

want to show

is consistent. Suppose

T ⊢ ⊥

. So we have

, ··· , t

∈ T

with

, ··· , t

} ⊢ ⊥

, since proofs are finite. Then some

contains all

since

are nested. So

is inconsistent. This is a contradiction. Therefore

must be

consistent.

Hence by Zorn’s lemma, there is a maximal element of X.

This proof is basically the same proof that every vector space has a basis! In

fact, most proofs involving Zorn’s lemma are similar.