Dmytro Taranovsky
January 24, 2006

Ordinal Notation

Outline: We define strong yet simple ordinal notation systems. The first section defines the general structure for the notations. The notation in the second section reaches ATR₀ plus "for every ordinal a, there is recursively a-inaccessible ordinal." The third section reaches up to Π¹₂-CA₀. The final section attempts to reach full second order arithmetic. Proofs are not included.

A General Notation

We first introduce a general structure for the notations. The notation is built from 0 (the least ordinal), other constants, and a total binary function C. C(a, b) is the least ordinal above b that has degree a.

Every ordinal has degree 0. If a<b, then every ordinal of degree b has degree a. A limit of ordinals of degree a has degree a. An ordinal has degree a+1 iff it is a limit of ordinals of degree a or is not above a certain ordinal and has degree a. For a limit ordinal a, an ordinal in the notation has degree a iff it has every degree b<a. C(a, b) = b + ω^a iff C(a, b) ≥ a (in particular, if a is less than the least fix-point of x→ω^x above b).

The standard representation of the constants are their names, and for other ordinals, it is C(a, b) where a is maximal and b is minimal, and a and b are in the standard form. The standard representation of an ordinal should be the one with least number of applications of C. This should remain true even if all ordinals below a particular one were to be added as a constant.

The ordinals are compared as follows:

C(a, b) is in the standard form iff a is maximal and b is minimal, and a and b are in the standard form, and C(a, b) does not equal to a constant in the notation.
C(a, b) is monotonic in a and b.
If a is maximal, C(a, b) < C(c, d) iff C(a, b) ≤ d, or b < C(c, d) and a < c
Note: Maximal a exists for all ordinals in the notation. If a is not maximal, use ∀g (C(g, b) = C(a, b) → g<c) instead of a<c.
In C(a, b), b is minimal iff b = 0 or b = C(c, d) where c is maximal in C(c, d) and a≤c.
Note: To minimize b, recursively replace it with the second argument until it is minimal.

Besides the constants, the only unspecified issue is when a is maximal in C(a, b). For example, if a is always maximal, we just have addition and exponentiation. Note that we can compute whether a<b without verifying that b is in the standard form. In fact, definitions of standard forms usually use such computations.

For example, to reach the Bachmann-Howard ordinal, we can add constant Ω with C(a, b)<Ω if b<Ω and an ordinal above a is in the notation. Define a to be maximal if its standard representation (equivalently, some representation) in terms of ordinals not above b (equivalently, ordinals below C(a, b)) only uses ordinals below a.
For C(Ω*a+b, c) (a, b, c < Ω), we have either Ω*a+b or Ω*(a+1) or Ω²is maximal. C(Ω², c) is the least point in the range of Γ above c. If Ω*a+b is maximal and a>0, then C(Ω*a+b, c) is point number ω^b above c in the range if the ath fixpoint function (ε is the first fixpoint function). The notations below are essentially extensions of this notation.

An Ordinal Notation

In this section, we define the notation for rudimentary set theory plus "for every ordinal a, there is recursively a-inaccessible ordinal". The large ordinal hypothesis is equivalent over V=L to (boldface) Δ⁰₃ determinacy. Note that instead of rudimentary set theory (a set theoretical analogue of ACA₀), we can use a stronger theory like Π¹₁-CA₀.

Definition of the Notation

The notation is built from the constant 0 (the least ordinal) and a function C as follows:

C(a, b, c) is the least ordinal e of admissibility degree a that is above c and is not in H(b, e).
H(b, e) is the least set of ordinals that contains all members of e, and is closed under h, i, j → C(h, i, j) where i<b.
If an ordinal e is of degree a, then C(h, i, j)<e whenever h<a and j<e. 0 is of degree 0.

A canonical notion of the degrees is:
3'. Ordinals of degree a+1 are the recursively a-inaccessible ordinals and their limits.
For limit a, having degree a is the same as having every degree below a.
(An ordinal is recursively a-inaccessible iff it is admissible >ω and for every b<a is a limit of recursively b-inaccessible ordinals.).

1, 2, and 3 define the comparison relation for ordinals in the notation. 1, 2, and 3' uniquely fix C and imply 3.

Interpretation: C(a, b, c) is the least ordinal above c of degree a that is not reachable from below using "collapses" of ordinals less than b. If C(a, b, c) < b, then C(a, b, c) may be viewed as a collapse of b above c.
Note: The pair (a, b) may also be viewed as a degree and is the notion of degrees used in the other sections.

Comparison Relation

A polynomial time comparison algorithm is obtained as follows:

In C(a, b, c), treat (a, b) like an ordinal and use the general notation.
(a, b) < (c, d) iff a<c, or a=c and b<d.
a is always maximal in C(a, b, c).
b is maximal in e = C(a, b, c) iff b is in H(b, e).
k is in H(b, e) iff k < e, or k = C(g, h, i), h is maximal, i is minimal, each of g, h, i is in H(b, e), and h<b.

Instead of 3 and 4, we can use the following: b is maximal in C(a, b, c) iff it has a representation (equivalently, standard representation) in terms of ordinals below C(a, b, c) (used as constants) using only ordinals below b.

Examples

Here are some examples where C(a, b)=C(0, a, b), and f(x)=C(1, 0, x):
C(0, x) = x+1,
C(b, c) is the least ordinal not in H(b, c+1),
C(b, c) = min(c+ω^b, C(f(c), c)) if b<f(c),
C(f(a), a) is the least fixed point of x → ω^x above a.
{x: c<x<C(a, b, c) and x has degree a} has order type 0 if b=0 and min(ω^b, C(a, b, c)) otherwise.
C(a, C(a+1, 0, b), b) is the least ordinal c above b such that there are c ordinals <c of degree a.
C(f(0)², 0) = Γ₀.
C(f(f(0)), 0) is the Bachmann-Howard ordinal.
C(C(1, 1, 0), 0) is the proof-theoretical ordinal of Π¹₁-CA₀.
C(fC(1, 1, 0), 0) is the ordinal for Π¹₁-CA + TI.
C(C(2, 0, 0), 0) is the ordinal for Π¹₁ Transfinite Recursion (with induction limited to sets).
C(f(C(2, 0, 0)), 0) is the ordinal for KPi.
C(f(C(3, 0, 0)), 0) is the ordinal for KP + Σ¹₁ Monotonic Induction.

A Stronger Ordinal Notation

We define a strong ordinal notation as follows. Let Ω be a large ordinal and let O be a notation system for ordinals in terms of ordinals below Ω (ordinals below Ω are treated as given). The notation uses C(a, b) for b<Ω (which for ordinals in the notation implies C(a, b)<Ω), and O for larger ordinals. a is maximal in C(a, b) iff for every ordinal d which is included in the O representation of a, the following holds. The standard representation of d does not use ordinals that are below Ω but greater than d, excluding instances in the scope of an ordinal less than C(a, b). (If d is …f…, and f<C(a, b), then do not parse f for ordinals larger than d.)

Examples
Using a canonical O, and setting gaps in the canonical way, we have
C(Ω*a+b, c) = C(a, b, c) of the previous notation.
Ordinals (below Ω) of degree Ω^a+1 are recursively a-Mahlo ordinals (and their limits)
Ordinals (below Ω) of degree Ω^Ω are Π₃ reflecting ordinals (and their limits)
C(C(Ω, C(Ω², 0)), 0) is the ordinal for KPM (KP + the universe is recursively Mahlo, equivalently, KP + Δ₁ Monotonic Induction).
In general, for appropriate "…", the ordinal for KP + "the universe is …" is C(C(Ω, a), 0) (which equals C(ε_a+1, 0)) where a is the least ordinal such L_a satisfies or can be forced to satisfy ….
However, the ordinal for KP + Δ⁰₃ determinacy is C(C(Ω², 0)+ε_a+1, 0) since C(Ω², 0) is needed to reach a = C(Ω*C(Ω², 0) + Ω, 0), the least ordinal a that is recursively a-inaccessible.
C(C(ε_Ω+1, 0), 0) is the proof theoretical ordinal of KP+{Π_n reflection}_n, and also of ACA₀ + lightface Π¹₂ comprehension.
C(C(Γ_Ω+1, 0), 0) would be (but, below, we define C differently) the ordinal for ATR₀ + lightface Π¹₂ comprehension.

An ordinal that is a limit of ordinals of maximum degree a has degree a+b iff it is a "level" b limit of ordinals of degree a. For example, C(Ω^Ω+Ω²+Ω+1, 0) is the least limit of admissible limits of recursively Mahlo limits of Π₃ reflecting ordinals. An ordinal below Ω has degree a+Ω iff it is not above a certain ordinal and has degree a, or it is an admissible limit or a limit of admissible limits of ordinals of degree a.

Note that we are not defining C(a, b) when both a and C(a, b) are in a gap in the notation. One possibility is to allow ordinals not representable in the notation from below (that is using lesser ordinals as constants) to have the largest possible degrees. However, in that case some ordinals below C(Ω, 0) would have every degree below Ω.

The notation can just as well be defined above an arbitrary ordinal. Since O is a parameter, we can "stack" an ordinal notation on top of "itself" any finite number of times, thus reaching the proof theoretical ordinal of Π¹₂-CA₀.

Second Order Arithmetic

Full second order arithmetic can be reached by using the idea of "correctness". The following is a canonical definition of correctness. Let ρ be the least ordinal such that L_ρ satisfies ZFC minus power set.

An ordinal κ has correctness 1 iff it is admissible >ω or is a limit of admissible ordinals.
An ordinal κ has correctness n+1 with n>0 iff L_κ is a Σ_n elementary substructure of L_ρ.

For every positive integer i, the constant Ω_i is for the least ordinal of correctness i. (In the above, Ω corresponds to Ω₂.) C(a, b) has correctness n>0 iff there is c with correctness n+1 and b<c≤a. Every ordinal has correctness 0, and if m<n, every ordinal of correctness n has correctness m.
Thus, if an ordinal a has a positive but not infinite correctness n, b<a, and d is less than the least ordinal above a of correctness n+1, then C(d, b)<a.

For Ω of correctness at least two, the maximality condition in the notation in the previous section is a necessary but not a sufficient one. We propose the following condition, which implies the previous one.

If C(a, b) has maximum correctness n>0, let Ω be the least ordinal above b of correctness n+1, and Ω' the least ordinal above b of correctness n+2. Thus, we have Ω≤a<Ω'. For maximality of a, we require that the standard representation of a does not use ordinals that are above a but below Ω' except in the scope of an ordinal less than Ω. In addition, as in the previous section, parse "a" from the root to branches until a constant or an ordinal below Ω is reached on every branch (at this stage, do not parse the ordinals below Ω). For every such ordinal d<Ω, we require that its standard form does not use ordinals strictly between d and Ω except in the scope of an ordinal less than C(a, b). If C(a, b) only has correctness 0, we simply require that the standard form of a does not use ordinals strictly between a and Ω except in the scope of an ordinal less than C(a, b), where Ω is the least ordinal above b of correctness 2.

This leads to a polynomial time comparison algorithm. Note that the least ordinal of correctness n>0 above b can be found by taking the least Ω_i with i≥n and b<Ω_i, and applying x→C(x, b) i-n times.

The three questions about the notation are: Is the notation system well-founded? Does it reach full second order arithmetic? Is it the most canonical way to reach full second order arithmetic?

An issue with the notation is that it does not include the least ordinal a that is stable up to an admissible ordinal above a. A good notation is expected to include all ordinals having a canonical provable representation in the theory. Nevertheless, the present notation has means of obtaining strength. For example, C(C(Ω₃+1, 0), 0) is the least ordinal a that is stable up to the analogue of the proof-theoretical ordinal of Π¹₂-CA₀ above a.

If a has correctness n>0 (but not infinite correctness) and b is the least ordinal above a of correctness n and c<a, then C(ε_{a + 1}, c) = C(b, c).
C( C(Ω₃, Ω₂) + Γ_{Ω_2 + 1}, 0) is C(Γ_Ω+1, 0) of the previous section.
C(C(C(Ω₃, C(Ω₃+1, 0)), 0), 0) is the ordinal for Π¹₂-CA + TI.
C(Ω₃*2, 0) is the least ordinal κ of correctness 2 with L_κ satisfying KP + Δ₂ separation.
C(C(C(Ω₃*2, 0), 0), 0) is the ordinal for Π¹₂-TR₀.
C(C(C(Ω₃, C(Ω₃*2, 0)), 0), 0) may be the ordinal for Δ¹₃-CA + TI.
C(C(C(Ω₃, C(Ω₃², 0)), 0), 0) may be the ordinal for KP + Δ₂ Monotonic Induction.
For n>0, C(…C(Ω_n+1+1, 0)…, 0) (with n+1 Cs) is the ordinal for Π¹_n-CA₀.

Beyond Second Order Arithmetic

Canonical ordinal notation systems are not yet known for uncountable set theory, but some understanding is here today. A key idea is that there are transfinitely many degrees of correctness, and cardinals are ordinals that cannot be reached from below no matter how large the degree of correctness is. Let Ω be the least uncountable cardinal and b a countable ordinal, as computed in the model. If a<b, then b having correctness ω*(a+1) may be defined as L_b+a being elementarily embeddable L_Ω+a. Correctness Ω+ω may correspond to L_b+b being elementarily embeddable in L_Ω+Ω, Ω²+ω to L_Ω*Ω, and so on.

For a notation, we can use a total ternary function C such that C(a, b, c) is the least ordinal above c of correctness a and for that correctness of degree b. If we treat (a, b) like an ordinal, then the function satisfies formal requirements of C (as described in the general notation), so the only issue is specifying when (a, b) is maximal. For a=0, the maximality of b is like in the second section, but the general case is more complicated.