II Logic and Set Theory - Predicate logic

4Predicate logic

II Logic and Set Theory

4.3 Syntactic implication

Again, to define syntactic implication, we need axioms and deduction rules.

Definition (Axioms of predicate logic). The axioms of predicate logic consists

of the 3 usual axioms, 2 to explain how = works, and 2 to explain how

∀

works.

They are

1. p ⇒ (q ⇒ p) for any formulae p, q.

2. [p ⇒ (q ⇒ r)] ⇒ [(p ⇒ q) ⇒ (p ⇒ r)] for any formulae p, q, r.

3. (¬¬p ⇒ p) for any formula p.

4. (∀x)(x = x) for any variable x.

(

∀x

)(

∀y

)



(

)

⇒

(

p ⇒ p

[

y/x

])



for any variable

x, y

and formula

with y not occurring bound in p.

[(

∀x

)

]

⇒ p

[

t/x

] for any formula

, variable

, term

with no free variable

of t occurring bound in p.

[(

∀x

)(

p ⇒ q

)]

⇒

[

p ⇒

(

∀x

)

] for any formulae

p, q

with variable

not

occurring free in p.

The deduction rules are

1. Modus ponens: From p and p ⇒ q, we can deduce q.

Generalization: From

, we can deduce (

∀x

)

, provided that no premise

used in the proof so far had x as a free variable.

Again, we can define proofs, theorems etc.

Definition (Proof). A proof of

from

is a sequence of statements, in which

each statement is either an axiom, a statement in

, or obtained via modus

ponens or generalization.

Definition (Syntactic implication). If there exists a proof a formula

form a

set of formulae S, we write S ⊢ p “S proves t”.

Definition (Theorem). If

S ⊢ p

, we say

is a theorem of

. (e.g. a theorem of

group theory)

Note that these definitions are exactly the same as those we had in propo-

sitional logic. The only thing that changed is the set of axioms and deduction

rules.

Example. {x = y, x = z} ⊢ y = z.

We go for x = z giving y = z using Axiom 5.

1. (∀x)(∀y)((x = y) ⇒ (x = z ⇒ y = z)) Axiom 5

2. [(∀x)(∀y)((x = y) ⇒ (x = z ⇒ y = z))] ⇒ (∀y)(x = y ⇒ y = z) Axiom 6

3. (∀y)((x = y) ⇒ x = z ⇒ y = z) MP on 1, 2

4. [(∀y)((x = y) ⇒ x = z ⇒ y = z)] ⇒ [(x = y) ⇒ (x = z ⇒ y = z) Axiom 6

5. (x = y) ⇒ [(x = z) ⇒ (y = z)] MP on 3, 4

6. x = y Premise

7. (x = z) ⇒ (y = z) MP 6, 7

8. x = z Premise

9. y = z MP on 7, 8

Note that in the first 5 rows, we are merely doing tricks to get rid of the

∀

signs.

We can now revisit why we forbid

∅

from being a structure. If we allowed

∅

then (

∀x

)

⊥

holds in

∅

. However, axioms 6 states that ((

∀x

)

⊥

)

⇒ ⊥

. So we can

deduce

⊥

in the empty structure! To fix this, we will have to add some weird

clauses to our axioms, or simply forbid the empty structure!

Now we will prove the theorems we had for propositional logic.

Proposition (Deduction theorem). Let

S ⊆ L

, and

p, q ∈ L

. Then

S ∪ {p} ⊢ q

if and only if S ⊢ p ⇒ q.

Proof.

The proof is exactly the same as the one for propositional logic, except

in the ⇒ case, we have to check Gen.

Suppose we have lines

– r

– (∀x)r Gen

and we have a proof of

S ⊢ p ⇒ r

(by induction). We want to seek a proof of

p ⇒ (∀x)r from S.

We know that no premise used in the proof of

from

S ∪{p}

had

as a free

variable, as required by the conditions of the use of Gen. Hence no premise used

in the proof of p ⇒ r from S had x as a free variable.

Hence S ⊢ (∀x)(p ⇒ r).

If x is not free in p, then we get S ⊢ p ⇒ (∀x)r by Axiom 7 (and MP).

is free in

, then we did not use premise

in our proof

from

S ∪ {p}

(by the conditions of the use of Gen). So

S ⊢ r

, and hence

S ⊢

(

∀x

)

by Gen.

So S ⊢ p ⇒ (∀x)r.

Now we want to show

S ⊢ p

iff

S |

. For example, a sentence that holds in

all groups should be deducible from the axioms of group theory.

Proposition (Soundness theorem). Let

be a set of sentences,

a sentence.

Then S ⊢ p implies S |= p.

Proof.

(non-examinable) We have a proof of

from

, and want to show that

for every model of S, p holds.

This is an easy induction on the lines of the proof, since our axioms are

tautologies and our rules of deduction are sane.

The hard part is proving

S |= p ⇒ S ⊢ p.

This is, by the deduction theorem,

S ∪ {¬p} |= ⊥ ⇒ S ∪ {¬p} ⊢ ⊥.

This is equivalent to the contrapositive:

S ∪ {¬p} ⊢ ⊥ ⇒ S ∪ {¬p} |= ⊥.

Theorem (Model existence lemma). Let

be a consistent set of sentences.

Then S has a model.

We need several ideas to prove the lemma:

(i)

We need to find a structure. Where can we start from? The only thing we

have is the language. So we start form the language. Let

= set of all

closed terms, with the obvious operations.

For example, in the theory of fields, we have “1 + 1”, “0 + 1“ etc in the

structure. Then (1 + 1) +

(0 + 1) = (1 + 1) + (0 + 1).

(ii)

However, we have a problem. In, say, the language of fields, and

our

field axioms, our

has distinct elements “1 + 0”, “0 + 1”, “0 + 1 + 0”

etc. However,

S ⊢

1 + 0 = 0 + 1 etc. So we can’t have them as distinct

elements. The solution is to quotient out by the equivalence relation

s ∼ t

S ⊢

(

), i.e. our structure is the set of equivalence classes. It is

trivial check to check that the +,

operations are well-defined for the

equivalence classes.

(iii)

We have the next problem: If

is ”field of characteristic 2 or 3“, i.e.

has a field axiom plus 1 + 1 = 0

∨

1 + 1 + 1 = 0. Then

S ⊢

1 + 1 = 0. Also

S ⊢

1 + 1 + 1 = 0. So [1 + 1]



= [0], and [1 + 1 + 1]



= [0]. But then our

has neither characteristic 2 or 3.

This is similar to the problem we had in the propositional logic case, where

we didn’t know what to do with

only talks about

and

. So we

first extend S to a maximal consistent (or complete)

(iv)

Next problem: Let

= “fields with a square root of 2”, i.e.

is the

field axioms plus (

∃x

)(

= 1 + 1). However, there is no closed term

which is equivalent to

√

. We say we lack witnesses to the statement

(

∃x

)(

= 1 + 1). So we add a witness. We add a constant

to the

language, and add the axiom “

= 1 + 1” to

. We do this for each such

existential statement.

(v)

Now what? We have added new symbols to

, so our new

is no longer

complete! Of course, we go back to (iii), and take the completion again.

Then we have new existential statements to take care of, and we do (iv)

again. Then we’re back to (iii) again! It won’t terminate!

So we keep on going, and finally take the union of all stages.

Proof.

(non-examinable) Suppose we have a consistent

in the language

(Ω

Π). Extend

to a consistent

such that

p ∈ S

or (

¬p

)

∈ S

for each

sentence

p ∈ L

(by applying Zorn’s lemma to get a maximal consistent

). In

particular, S

is complete, meaning S

⊢ p or S

⊢ ¬p for all p.

Then for each sentence of the form (

∃x

)

, add a new constant

and add

[

c/x

] to

— obtaining

in language

(Ω

∪ C

Π). It is easy

to check that T

is consistent.

Extend

to a complete theory

⊆ L

, and add witnesses to form

⊆

= L(Ω ∪C

∪ C

, Π). Continue inductively.

Let

∪ S

∪ ···

in language

∪ L

∪ ···

(i.e.

(Ω

∪ C

∪

∪ ··· , Π)).

Claim.

is consistent, complete, and has witnesses, i.e. if (

∃x

)

p ∈

, then

p[t/x] ∈

S For some term t.

It is consistent since if

S ⊢ ⊥

, then some

⊢ ⊥

since proofs are finite. But

all S

are consistent. So

S is consistent.

To show completeness, for sentence

p ∈

, we have

p ∈ L

for some

, as

has only finitely many symbols. So

n+1

⊢ p

n+1

⊢ ¬p

. Hence

S ⊢ p

S ⊢ ¬p.

To show existence of witnesses, if (

∃x

)

p ∈

, then (

∃x

)

p ∈ S

for some

Hence (by construction of T

), we have p[c/x] ∈ T

for some constant c.

Now define an equivalence relation

∼

on closed term of

s ∼ t

S ⊢

(

). It is easy to check that this is indeed an equivalence relation. Let

be the set of equivalence classes. Define

(i) f

([t

], ··· , [t

]) = [ft

, ··· , t

] for each formula f ∈ Ω, α(f ) = n.

(ii) ϕ

{

([

]

, ··· ,

[

]) :

S ⊢ ϕ

(

, ··· , t

)

}

for each relation

ϕ ∈

Π and

α(ϕ) = n.

It is easy to check that this is well-defined.

Claim. For each sentence

S ⊢ p

(i.e.

p ∈

) if and only if

holds in

, i.e.

= 1.

We prove this by an easy induction.

– Atomic sentences:

◦ ⊥:

S ⊢ ⊥, and ⊥

= 0. So good.

◦ s

S ⊢ s

iff [

] = [

] (by definition) iff

(by definition

of s

) iff (s = t)

. So done.

◦ ϕt

, ··· , t

is the same.

– Induction step:

◦ p ⇒ q

S ⊢

(

p ⇒ q

) iff

S ⊢

(

¬p

) or

S ⊢ q

(justification: if

S ⊢ ¬p

and

S ⊢ q

, then

S ⊢ p

and

S ⊢ ¬q

by completeness, hence

S ⊢ ¬

(

p ⇒ q

contradiction). This is true iff p

= 0 or q

= 1 iff (p ⇒ q)

= 1.

◦

(

∃x

)

S ⊢

(

∃x

)

iff

S ⊢ p

[

t/x

] for some closed term

. This is true

since

has witnesses. Now this holds iff

[

t/x

]

= 1 for some closed

term

(by induction). This is the same as saying (

∃x

)

holds in

because A is the set of (equivalence classes of) closed terms.

Here it is convenient to pretend

∃

is the primitive symbol instead of

∀

Then we can define (

∀x

)

to be

(

∃x

)

¬p

, instead of the other way round.

It is clear that the two approaches are equivalent, but using

∃

as primitive

makes the proof look clearer here.

Hence A is a model of

S. Hence it is also a model of S. So S has a model.

Again, if

is countable (i.e. Ω

Π are countable), then Zorn’s Lemma is not

needed.

From the Model Existence lemma, we obtain:

Corollary (Adequacy theorem). Let

be a theory, and

a sentence. Then

S |= p implies S ⊢ p.

Theorem (G¨odel’s completeness theorem (for first order logic)). Let

be a

theory, p a sentence. Then S ⊢ p if and only if S |= p.

Proof. (⇒) Soundness, (⇐) Adequacy.

Corollary (Compactness theorem). Let

be a theory such that every finite

subset of S has a model. Then so does S.

Proof.

Trivial if we replace “has a model” with “is consistent”, because proofs

are finite.

We can look at some applications of this:

Can we axiomatize the theory of finite groups (in the language of groups)?

i.e. is there a set of sentences T such that models of T are finite groups.

Corollary. The theory of finite groups cannot be axiomatized (in the language

of groups).

It is extraordinary that we can prove this, as opposed to just “believing it

should be true”.

Proof.

Suppose theory

has models all finite groups and nothing else. Let

′

be T together with

– (∃x

)(∃x

)(x

= x

) (intuitively, |G| ≥ 2)

– (∃x

)(∃x

)(x

= x

) (intuitively, |G| ≥ 3)

– ···

Then

′

has no model, since each model has to be simultaneously arbitrarily

large and finite. But every finite subset of

′

does have a model (e.g.

for

some n). Contradiction.

This proof looks rather simple, but it is not “easy” in any sense. We are

using the full power of completeness (via compactness), and this is not easy to

prove!

Corollary. Let

be a theory with arbitrarily large models. Then

has an

infinite model.

“Finiteness is not a first-order property”

Proof. Same as above.

Similarly, we have

Corollary (Upward L¨owenheim-Skolem theorem). Let

be a theory with an

infinite model. Then S has an uncountable model.

Proof. Add constants {c

: i ∈ I} to L for some uncountable I.

Let T = S

{“c

= c

” : i, j ∈ I, i = j}.

Then any finite

′

⊆ T

has a model, since it can only mention finitely many

of the

. So any infinite model of

will do. Hence by compactness,

has a

model

Similarly, we have a model for

that does not inject into

, for any chosen

set X. For example, we can add γ(X) constants, or P(X) constants.

Example. There exists an infinite field (

). So there exists an uncountable

field (e.g. R). Also, there is a field that does not inject into P(P(R)), say,

Theorem (Downward L¨owenheim-Skolem theorem). Let

be a countable

language (i.e. Ω and Π are countable). Then if

has a model, then it has a

countable model.

Proof.

The model constructed in the proof of model existence theorem is count-

able.

Note that the proof of the model existence theorem is non-examinable, but

the proof of this is examinable! So we are supposed to magically know that the

model constructed in the proof is countable without knowing what the proof

itself is!