Gödel didn't just prove a theorem or two, he invented a whole new branch of mathematics with new techniques, and so what we really care about is what of philosophical interest can be shown using those techniques.
Many of the results I wish to state will be for "sufficiently strong" theories. To define sufficiently strong, we shall fix a set of axioms
. The weaker the set of axioms
, the more theories will be sufficiently strong, and hence the stronger our theorems will be. As a first approximation, we may take
to be Peano's axioms.
weaker systems will work. Here are two examples:
The second set of axioms is finite.
. Since the theories I have proposed to use for
in their language, any sufficiently strong theory must be in a language that includes those symbols. Since standard axiomatizations of set theory, for example, do not include those symbols, they will not be, in the sense in which I defined, sufficiently strong. It is intuitively obvious what to do: though the symbols are not included in the language of set theory, for example, one could give them suitable definitions, and that would be good enough: the resulting theories would be, in the precise sense I've given, sufficiently strong, and the Gödel and related theorems for the usual theories of sets will follow from those for the theories to which we have added the requisite symbols. I outline the method in some detail in
.
What I have called Gödel's first incompleteness theorem is actually due to Gödel and Rosser. Gödel proved a somewhat weaker result.
Now, back to the mathematical development.
If the theory is inconsistent, it is easy to decide whether or not a sentence is in it: every sentence is in it. So suppose from now on that the theory is consistent.
To decide if a sentence
is in a complete axiomatizable theory, systematically generate all finite sequences of sentences and check whether each sequence is a proof—one can check whether a sentence is an axiom because the set of axioms is, by hypothesis (axiomatizable) decidable, and it is easy to check whether a sentence follows from previous ones by a rule of logic. When you find a proof check whether the theorem proved (that is, the last sentence of the proof) is
or
. If it is
, then
is in the theory, and you are done. If it is
, then
is not in the theory (since the theory is consistent), and you are done. If it is neither, keep looking for more proofs. Since the theory is complete, you will eventually find either a proof of
or of
, and so the method outlined always terminates with a decision.
By the lemma, if an axiomatizable theory is undecidable, it is incomplete, and so it is a consequence of Gödel's first incompleteness theorem that
- Corollary to Gödel's first incompleteness theorem Any sufficiently strong consistent axiomatizable theory is incomplete.
It is that version of the theorem that explains the name. In particular, Peano arithmetic (and also any consistent axiomatizable extension of it) is incomplete.
- Philosophical remark. The corollary to Gödel's first incompleteness theorem destroys another pillar of Hilbert's program: since he intended to introduce, for example, arithmetic solely on the basis of finitistically justified axioms, the truths of arithmetic will be exactly those that follow from the axioms. He therefore supposed that the axioms would be complete: otherwise, there would be sentences about the numbers that were neither provable nor disprovable from the axioms and that are, therefore, since the axioms are the sole basis on which arithmetic is introduced, neither true nor false. But what Gödel showed is that any axiomatizable theory of arithmetic is incomplete.
Indication of the proofs
First incompleteness theorem
I shall first discuss the corollary to Gödel's first incompleteness theorem. I mention how to extend the proof outlined here to the full first incompleteness theorem
below. The techniques that permit that extension also make it possible to give a simpler proof of the theorem (also discussed below), but the proof I give here is what leads to the second incompleteness theorem, and so it needs to be presented in any case.
- First, a rough version, to give the basic idea, with some details omitted and some outright mistakes.
Gödel showed that a sufficiently strong consistent axiomatizable theory is incomplete by showing how to write down a sentence that is true if and only if it is not provable in that theory. His sentence is called "the" Gödel sentence for the theory. How does it help? Is the Gödel sentence

provable in the theory or not? If

is provable, it follows that

. That can't happen, so

must not be provable. But

says that it is not provable, and so it turns out that

is true. Thus, the theory is incomplete: it can't prove

and, since

is true, it certainly can't prove

.
One of the big problems with the "proof" I just indicated is that it relies on the notion "

is true." It is not clear what that means: the theory may have more than one interpretation. True in which one? The careful statement of the proof must not rely on the notion of truth at all.
The second big problem is that I did not distinguish between what can be proved and what can be proved in the theory.
Let

be a complete axiomatizable sufficiently strong theory. Recall that "sufficiently strong" means that

includes the theory

, and hence that anything that can be proved in

can,
a fortiori be proved in

.
- Trivial but useful fact. Since
includes
, and
has 0 and
, successor, in its language,
has numerals: "0" for 0, "
' for 7, and so forth. Since it would be tiresome to actually write out, say 100
s to form the numeral for 100, we abbreviate it as follows:
. The superscript is a notation for how many times to repeat the
.
Gödel showed (see
below) how to associate a number with each sentence or sequence of sentences of the language of

, called "the" Gödel number of the sentence or sequence of sentences in such a way that there is some horribly complicated number-theoretic predicate, which we write "

," that, in the theory

, expresses the notion "provability in

." In detail:
If
is a sentence in the language of
with Gödel number
, we write
for the numeral
, that is, for the numeral that denotes the Gödel number of
. We think of that numeral as a notation for
within arithmetic, a name for a linguistic expression, which make it somewhat analogous to the way we use quotation names in philosophical English.
A proof is a sequence of sentences, and so if
is a proof in the language of
with Gödel number
, we may, similarly, write
for the numeral
, that is, for the numeral that denotes the Gödel number of
.
Gödel showed (see below) that there is a horribly complicated number-theoretic predicate
, the proof predicate, such that
is a proof of
if and only if
is a theorem of
if and only if
is not theorem of
.
There are, of course, many allied results. Here is one we shall need below: there is a number-theoretic predicate
such that
is the negation of
if and only if
is a theorem of
if and only if
is not theorem of
.
.
Now, the provability predicate
is defined by
—the definition is just a formalized counterpart of the metalinguistic idea that
is provable if there is a proof of
. It follows from what was said above about the proof predicate that the provability predicate is such that, for any sentence
,
is provable in
if and only if
is provable in
.
"The" Gödel sentence for
is now a sentence
such that one can prove in
that
.
I can now prove half the corollary to Gödel's first incompleteness theorem:
-
is not provable in
. Suppose
is provable in
. Then
is provable in
and hence in
. But then
is provable in
, which contradicts that
is consistent.
To complete the proof, I would now have to show that

is not provable in

. Unfortunately, that doesn't follow from our hypotheses. To show what goes wrong, here is an unsuccessful attempt at a proof:
-
is not provable in
. Suppose
is provable in
. Obviously,
is not provable in
, since if it were that would contradict the assumption that
is consistent. Thus,
is not provable in
. If we could show that, in fact,
is provable in
(which doesn't seem unreasonable to hope, since we just showed that no natural number is the Gödel number of a proof of
), then we would be done, since
is, by the definition of
, equivalent to
, and so we would have a proof of
, contradiction.
We can patch things up by using "the" Gödel-Rosser sentence instead of the Gödel sentence: Everything is the same as above, except that one starts with the notion of a Rosser proof:
- For any
and
,
is a Rosser proof of
,
, if and only if
.
- Rosser provability is now defined by
.
That is a formalization of the notion defined above, where we have used the Gödel number of a proof as its length. It follows from what was said about the Rosser proof predicate that the Rosser provability predicate is such that, for any sentence

,

is Rosser provable in

if and only if

is provable in

. "The" Gödel-Rosser sentence for

is now a sentence

such that one can prove in

that

.
Now, finally, I can give a reasonably precise proof of the corollary to Gödel's first incompleteness theorem. You will be able to follow my discussion of Gödel's second incompleteness theorem, which immediately follows the proof, even if you skip the proof. Here is the
Proof.
-
is not provable in
. Suppose
is provable in
. Since
is consistent, there is then no proof at all of
in
, and hence no such proof shorter than a proof of
in
. Thus,
is Rosser provable in
. Then
is provable in
and hence in
. But then
is provable in
, which contradicts that
is consistent.
-
is not provable in
. Suppose
is provable in
. Then, for some
, it is provable by a proof of length
.We now prove
, that is,
, in
. Since
is provable in
by a proof of length
,
is provable in
. Thus, it is provable in
that no
is a Rosser proof of
—that is just a formalization of the following argument: since any Gödel number of any proof that is greater than
is preceded by a proof of
, it cannot be a Rosser proof of
. It is easy to prove in
that there is also no Rosser proof of
of length exactly
, since the proof of length
is not even a proof of
(since it is a proof of
), let alone a Rosser proof of
. Finally, we can show in
that there is no Rosser proof of
of length less than
: We know that no such proof exists, because
is consistent, and we can verify that in
since there are only finitely many cases to check: for each
,
is not the Gödel number of a Rosser proof of
and so it is a theorem of
that
. Putting the three cases together, we have shown that it is a theorem of
that
. But then, by the definition of
, it is a theorem of
, and hence of
that
. Since, by hypothesis,
is provable in
, that contradicts the consistency of
.
Second incompleteness theorem
For the second incompleteness theorem, we use the original Gödel sentence (no more Rosser) and consider the proof that

is not provable in

. What we showed, in effect, is that if

is provable in

, then

is inconsistent. That is,
if
is consistent, then
is not provable in
.
We can now continue as follows to give an
incorrect proof that

is inconsistent: But "

is not provable in

," by the definition of

yields

, and so, from the hypothesis that

is consistent, we proved

. Since, by the first incompleteness theorem,

is not provable, we have a contradiction, and so

is not consistent.
The proof is obviously wrong: the first incompleteness theorem does not state that

is not provable, but that

is not provable in

. Our proof of

was not given in

, and so there is no contradiction. But what happens if we take the proof of

and formalize it in

? We get a proof of the second incompleteness theorem. Here it is, where I have paired the formal statements within

with the informal statements about

that led to them:
Suppose . |
Suppose is provable in . |
Then |
Then is provable in . |
But then . |
But then is provable in . |
So . |
So is inconsistent. |
In the final step,

may be taken to be an abbreviation for

. We may now argue correctly as follows: Suppose

is provable in

. Then, by the argument above,

is provable in

. However, from that, by the definition of

, we obtain that

is provable in

. But that contradicts what we showed earlier, that

is not provable in

and disproves the assumption that

is provable in

. That is, finally,

is not provable in

. \square
The proof outline above, while correct, is misleading: the hard part is showing that it is possible to formalize the proof in

, and I have not said anything about that, nor do I intend to, except for this: the formalization is difficult and requires that

be quite strong. The requirement that it contain

for the weak theories

I listed above, which was good enough for the proof of the first incompleteness theorem, is not good enough for the second incompleteness theorem. However, it suffices to let

be Peano arithmetic, which is plenty good enough for the philosophical applications we envisioned.
The fixed-point theorem
The piece of the proof of the two incompleteness theorems that seems most magical is the phenomenon usually inaccurately characterized in terms of self-reference, that is, the fixed-point theorem:
Fixed-point theorem In any sufficiently strong theory, for any predicate

expressible in the language of the theory, one can show that there is a sentence

such that the theory proves
The Gödel and Gödel-Rosser sentences are both obtained by applying the fixed-point theorem for suitable

. There is a proof of the fixed-point theorem for a simple theory
here. The usual proofs of fixed-point theorems for more traditional theories, like the candidates for

described above, use the same technique, but they are muddied with details about how to use Gödel numbers and the like. In addition, the usual fixed-point theorems for more traditional theories can be obtained as corollaries of the fixed-point theorem stated here, by showing that the theory given here can, in a suitable sense, be interpreted in those theories.
The fixed-point theorem can also be used to prove
- Tarski's theorem. The set of true sentences of a sufficiently strong theory is not definable.
One just uses the fixed-point sentence for

.
- A version of Gödel's first incompleteness theorem. No sufficiently strong axiomatizable theory can be both complete and consistent.
If a sufficiently strong theory were both complete and consistent, the set of theorems would be decidable (by the lemma
above), hence definable, and the set of theorems would be the set of true sentences, contradicting Tarski's theorem. I prove versions of those two results
here.
Abstract versions of some of the results
The point of proving the abstract versions of incompleteness and related results here is to show that the usual claim that the theorems make use of self-referential sentences is greatly exaggerated. The theorems are here proved about very simple interpreted languages that cannot express the relationships that are usually taken to be evidence of self reference. In fact, the languages, in a certain sense exhibited below, have only atomic sentences.
Formal proofs
Let

be a nonempty set called the set of constants. Fix a binary relation

on

, the
extension function; and a pair of functions

and

,
subject and
predicate,
from

to

such that for every pair

of constant symbols
there is an constant

such that

and

. We think of

as
expressing the sentence

of

, where

is the subject, and

is the
predicate with extension

. When thinking of a constant as a predicate, I shall usually use letters

for it; when thinking of a constant as a name, I shall usually use

. While that aids understanding, the predicates and the constants are in fact the same objects.
Definition. A predicate

is a
predicate of predicates if for
all predicates

and

, if

and

are coextensive (that is, if for all

,

if and only if

),
then

is in the extension of

if and only if

is in the extension of

(that is,

if and only if

).
Since they are the only predicates of interest for present purposes, we shall assume that all predicates are predicates
of predicates. That is, our results will concern
languages: structures of the form

in which every
predicate is a predicate of predicates. That is a class of structures that is defined by an axiomatizable theory, the
theory of languages: the theory consists of the single first-order sentence that says that every pair of a predicate and a constant is expressed by a constant and that every predicate is a predicate of predicates.
I have used the same set of constants to serve as constants and predicates and
to express sentences, and assumed that every constant is also a predicate and expresses a
sentence, and that every predicate is a predicate of predicates. None of
those assumptions are essential, but they simplify the statements of the
claims below.
- A constant
expresses a true sentence if, by definition, the subject is in the extension of the predicate, that is
.
- A norm of a predicate
is a constant
that expresses a sentence 'the predicate
holds of the constant
,' that is, a constant
such that
is coextensive with
and
.
A norm of a predicate

expresses a true sentence if

is autological.
- Say that the language
has appended norms if there is a function
from
to
(to be thought of as a function from predicates to predicates) such that for any constant
,
is in the extension of the predicate
(that is,
) if and only if there is a norm of the predicate
in the extension of
(that is, if and only if there is
in the extension of
such that
and
are coextensive and
.)
In human language, not part of the official notation, and abusing use and mention, a language has
appended norms if there is a function

from constants to their norms such
that for every predicate

there is a predicate

such that

. Sloppily,

, or

is

with the norm function
appended.
- Fixed point theorem. Suppose that the language
has appended norms, and let
be a function witnessing that fact. Let
be a predicate. Then a norm of
is in the extension of
if and only if a norm of
expresses a true sentence.
Proof. A norm of

is a constant that expresses '

of

.' Thus, a norm of

expresses a true sentence if and only if

. By the definition of the function

,

if and only if there is a norm of

in the extension of

, which completes the proof.
- Useful lemma. If
and
are both norms of the same predicate
, then
expresses a true sentence if and only if
expresses a true sentence.
Proof. 
expresses '

holds of

' and

expresses '

holds of

,' where

,

, and

are coextensive. Since

is a predicate of predicates, it holds of

if and only if it holds of

. \square
Applications
Say that a set of constants is
definable in the language 
if it is the extension of a
predicate of

. More precisely,

is
definable in the language 
if for some

in

.
- "Tarski's theorem." Suppose the language
has appended norms. The set
of constants that do not express true sentences is not definable in
.
Proof. Since the language

has appended norms, let

be a function witnessing that fact. If

is definable, then there is a predicate

such that

. By the fixed point theorem, a norm of

is
in the extension of

if and only if a norm of

expresses a true sentence. That is, a
norm of

is in

(does not express a true sentence) if and only if a norm
of

expresses a true sentence. That contradicts the useful lemma.
Remark. If the predicates are closed under complement (as they are in familiar applications to languages in which there is a negation sign), it follows that
the set of constants that express true sentences is not definable, which is what is usually stated.
- "Gödel's first incompleteness theorem." Suppose the language
has appended norms. Let
(the "nontheorems") be a definable set of false sentences. Then either there is a false sentence not in
("
is incomplete") or there is a true sentence in
("
is inconsistent").
Proof. If not, that is, if every false sentence is in

and no true sentence
is in

, then

, and so

cannot be definable by Tarski's theorem.
Remark. If

is a consistent definable set of sentences, the theorem not only
applies to

but to every definable consistent set of sentences that includes

. Thus,

is essentially incomplete.
The treatment above is suggested by, but by no means the same as,
Smullyan57.
Conclusions suggested by the formal results
The point of the above is that it makes clear that the sense in which "self reference" is required for the proof of the fixed point theorem and hence
Tarski's and Godel's theorems is entirely metalinguistic. I have proved a fixed-point theorem for languages in which, at best, there are atomic sentences, but no others, there are no relations between sentences (other than identity or diversity) that can be derived from the theory of languages, and the "Gödel numbers" (the constants associated with each sentence and predicate) are not given by any computational method. The fixed-point theorem—even an instance of the theorem for a single predicate—cannot be proved in the theory of languages: after all, a fixed point is a constant that bears a certain relationship to a predicate, and no nontrivial such relationships follow from the theory of languages.
Here are precise versions of the claims I just made.
more... close
- There is a language with domain a single constant, and, thus, with a single sentence. In that language, there are no sentences that bear nontrivial relations to each other. Since every theorem of the theory of languages must be true of that language, no such theorem can assert that two sentence are related in a nontrivial way.
- Let
be language, and let
be a permutation of
. Then
,
,
, and
are also languages. Thus, the "Gödel numbers" of the predicates and sentences are subject to fully general unrelated permutations, and they are therefore not given by any computational method.
For self reference to be possible
within a language, the language must have a definable set of constants expressing (Gödel numbering) theorems that
includes a constant expressing a formalization of the fixed point theorem and includes
constants expressing the instances of the following reflection scheme:
,
where

holds of the "Gödel numbers" of the theorems.
For self reference to be possible for a language user (by language, all I mean
is something that fits the thin definition above---it could be a system of
representations, not a language in an ordinary sense), the language user must
implement the definition of

and be competent to determine that

holds of
the names of the formalization of the fixed point scheme and the instances of
the reflection scheme. A standard way to ensure that is to require that

be
definable from axiom schemes and rule schemes in the usual way, that the
language user be committed to the axioms and becomes committed to whatever
follows from prior commitments in virtue of the rules whenever that is
appropriately presented. That isn't actually enough: one must require in
addition that the language user has (1) the capacity to prove the fixed point
theorem, (2) the capacity to express an instance of the scheme for each
sentence

she has the capacity to use, and (3) the capacity to recognize
that names of the instances of the scheme she has the capacity to express are
in

. Nothing so far has required that the proofs of the various instances of
the reflection scheme have anything in common. Capacity (3) seems to me to
only be plausible if there is a schematic proof of the reflection principle,
that is, a proof scheme that has as instances proofs of every instance of the
reflection scheme, and if the language user has the capacity to obtain proofs
of instances of the scheme that language user has the capacity to express in
virtue of the scheme.
The account of what it is for a language user to be capable of self-reference
doesn't require an explicit mechanism of self reference, but it doesn't prohibit such a
mechanism either. If a mechanism of self reference requires the capacity to
name predicates, apply predicates to those names, to systematically introduce
new predicates via simple logical definitions that take predicates and their
names as inputs, and to draw systematic conclusions on the basis of such
definitions, which seems to me to be true (emphasizing once again that by
language, predicates, names, definitions, I mean something very thin, as
shown above), then a capacity for self reference brings with it the
capacities I outlined above.
A language in which a fixed-point theorem can be proved
A simple way to nearly define a language in which the fixed point theorem can
be proved follows. (My definitions of functions have to be fixed up to be
total and onto in pretty much any way you like. This is a notational variant
of something in the Smullyan article): there is a single predicate

, a
function symbol

(quotation), and a function symbol

(norm).
A sentence is

followed by a term,

followed by a sentence or a term is a term, and
otherwise terms are formed in the usual way. The only rule is that

is
interdeducible with

, for any term

with one free variable

. It
follows that
(*)
if and only if
.
Now, adopt the following semantics:

is a predicate of expressions of the
language. Denotation of a term is computed as follows, where

is an
arbitrary term: The denotation of

is the denotation of

. The
denotation of

is

.
The denotation of

is the denotation of

, which, in turn, denotes
the expression

. Thus (*) says that
if and only if "
" has
the property
,
the fixed point theorem. Of course, this simple language
doesn't have the reflection principle.
The sentence

is often rendered, "This sentence has property

," or the
like. That isn't a bad a translation as one might at first think, if "This
sentence has property

" is to be understood in terms of taking the
denotation of "this" to be "This sentence has property P," so that the
sentence ""This sentence has property P" has property P" is taken to be
obtained from it using Leibniz's law. The construction I used is, admittedly,
a bit closer to the Quinean (see
Quine63 255,
Smullyan57 56n)
"Yields when appended to its quotation a
sentence with property
" yields when appended to its quotation a
sentence with property
.
The construction didn't depend on any special properties of

, and so, if the language had more predicates (formulas with one free variable), they would have fixed points too.
--
ShaughanLavine - 31 Oct 2006 - written 23 Oct 2005
Quotation, decidability, recursiveness, representability
Use a language that is such that only finitely many symbols are used—for example, one often uses infinitely many different symbols,

as variables, but we must use the symbols, for example,

and

, instead, and take variables to be the sequences of symbols

. Assign a nonzero number to every symbol of the language of a theory. (That is, for example, assign 2 to "(", 3 to ")", 4 to "0", 5 to "

", 6 to "

", 7 to "

", and so on.) Every formula is now a sequence of numbers. We can turn every sequence of numbers, say 12, 6, 10, 12, 13 into a number as follows:

. That is what I am going to use for "quotes," or Gödel numbers: it gives me a one-to-one correspondence between expressions (arbitrary sequences of the symbols) and some numbers. A proof is a sequence of formulas, and so I can use the same trick: a proof is a sequence of formulas, each of which is a sequence of symbols, and so a proof corresponds to the sequence of Gödel numbers for its formulas, which, in turn, corresponds to a single number for that sequence of numbers, the Gödel number of the proof.
With a little bit of ingenuity and a lot of patience, one can show that all of the following are monstrous formulas of arithmetic: "

is the Godel number of a formula," "the sequence with Godel number

has length

," "

is the Godel number of a proof of the formula with Godel number

," and so on.
The type of Gödel numbering I have proposed makes use of exponentiation, but the theories

I have been using only include + and

. By being more careful, that is, using a better coding, one can make do with just + and

, and so + and

is enough. There is a straightforward way to code any sequence of numbers using only + and

, not as a single number, but as a pair of numbers. It is annoying to use pairs of numbers instead of single numbers. One could use a special way to code pairs of numbers as single numbers, and then use those, but it is traditional, following Gödel, to use the code as pairs using just + and

for only one purpose: defining exponentiation, and to then use a code like the one I have described thereafter. The curious may investigateDefiningExponentiationfromPlusandTimes.
So much for Godel numbering. Now, decidability. Decidability is an intuitive notion, and so its formalization requires some kind of analysis. It isn't even clear to what the notion of decidability applies. I'll just take it, somewhat arbitrarily, but most conveniently for our purposes here, to take it to apply to sets of numbers.
- Intuitive characterization. A set of numbers is decidable if there is a fully specified method for telling whether or not a number is in it.
What is a fully specified method? It is a finite axiomatization