Stability theory is the most important part of model theory, which was studied by Saharon Shelah.
Without special explanation, we let T be a complete countableL-theory that has infinite models. M denotes the monster model and its universe of T. κ,λ are used to denote infinite cardinals.
Definition and Results
We say the theory T is κ-stable (or stable in κ) if for any subset A⊂M,
∣A∣≤κ⇒∣S(A)∣≤κ.
It is called stable if there is a cardinal where it is stable, and it’s called superstable if there is a cardinal κ such that for all λ>κ, T is λ-stable. A structure M is called κ-stable if Th(M) is κ-stable.
The definition can be replaced by a 1-type version, which is to say that a theory T is κ-stable if for any subset A⊂M, ∣A∣≤κ⇒∣S1(A)∣≤κ. To show this, we suppose the alternative definition holds but the original definition fails. Since S(A)=⋃i∈ωSi(A), there must be some i such that ∣Si(A)∣>κ. If i=1, then the contradiction completes the proof. Otherwise, we can find without loss of generality that ∣Si(A)∣>κ but ∣Si−1(A)∣≤κ. Now, for a p(x1,…,xi)∈Si(A), we can define p∗(x1,…,xi−1)={∃xiφ(x1,…,xi);φ∈p}. Consider an arbitrary Γ⊂Si(A) such that ∣Γ∣>κ. By the Pigeonhole principle, there is a q∈Si−1(A) such that
∣{p∈Γ∣p∗⊂q}∣>κ.
Finally, we choose a solution a1,…,ai−1∈M of q. For each p∈Γ, p(a1,…,ai−1,xi) defines a 1-type on A∪{a1,…,ai−1}. By choosing a formula φ∈p1 while ¬φ∈p2, we can find out that if p1=p2, then p1(a1,…,ai−1,xi)=p2(a1,…,ai−1,xi). Hence,
∣S1(A∪{a1,…,ai−1})∣=∣Γ∣>κ,
which contradicts the assumption that for any subset A⊂M, ∣A∣≤κ⇒∣S1(A)∣≤κ.
If you read the definitions carefully, you must be confused why there isn’t a definition that a theory is stable for all cardinals. The following theorem explains.
(ω-Stable) If the theory T is ω-stable, then it is κ-stable for any κ.
The proof of this theorem is quite complex, so it is omitted here by the low cost-performance ratio. But one should realize the relation between these stabilities.
ω-stable⇒superstable⇒stable
Such property is so strong that a lot of properties can be deduced directly from it. Let me show one. If the theory T is ω-stable, then there is a countable model A which is saturated.
To prove this, we choose an arbitrary countable model M0, then construct a chain of elementary extensions inductively. Suppose Mi is constructed and it is countable. Since T is ω-stable, then the set S=⋃{S(A);A is a finite subset of Mi} is countable. Then for each p∈S, we introduce some irrelevant constants aˉp∈M to satisfy p. Now the set Mi∪{aˉp} is countable. By the Löwenheim-Skolem theorem, there is a countable elementary extension Mi+1 of Mi. Finally, we take A=⋃i∈ωMi. By Tarski’s chain lemma, it is an elementary extension of M0, and you might see that it is ω-saturated, which is saturated by definition.
Definability
To characterize stability, it’s easy for us to think of the characterization of types. Stability is nothing more than “the types won’t ‘split’ into too many pieces”, therefore we need a way to describe types. It is the definability of a type p(x).
Given a set B of tuples of constants, we say it is definable on A0⊂A⊂M if there is a formula θ(xˉ)∈L(A0) such that
bˉ∈B⇔M⊨θ(bˉ).
Now consider a type p(xˉ)∈S(A). We may consider whether an L(A)-formula φ(xˉ)∈p(xˉ). We may detach all parameters from it, and then it is degenerated to an L-formula ψ(xˉ,yˉ). Consider the set
B={bˉ∈A∣ψ(xˉ,bˉ)∈p(xˉ)}.
If it is definable on A0, we say ψ(xˉ,yˉ) is definable on A0. If every L-formula ψ(xˉ,yˉ) is definable on A0, then we say the L(A)-formula is definable on A0.
Refining the definition: let A0⊂A. The type p(xˉ)∈S(A) is definable on A0 if for all formulas φ(xˉ,yˉ), there is an L(A0)-formula θφ(yˉ) such that
∀aˉ∈A,φ(xˉ,aˉ)∈p(xˉ)⇔M⊨θφ(aˉ).
If we take A0=A, we simply say p is definable.
(Char.A) If every type is definable, then T is a stable theory.
This proof is easy. Take λ to be a cardinal such that λω=λ (for example, 2ω). We can prove that T is λ-stable. For an arbitrary type p(xˉ) on set A⊂M where ∣A∣≤λ and L-formula φ(xˉ,yˉ), we can define the set
Now, ∣S(A)∣≤(∣L∣+∣A∣+ω)∣L∣+ω=λω=λ. Hence T is stable.
Binary Tree
The set p∣φ points out how we actually see a type. The formula θφ(yˉ) actually gives a 0/1 criterion to a sequence of parameters aˉi,i<ω. If there are only finitely many boolean patterns for parameters, we will find the type is definable easily. Therefore, it’s necessary to study this.
Think. Given a set of formulas Σ(xˉ) and a formula φ(xˉ,yˉ), a binary tree gives the possibilities of boolean values of given parameters of yˉ and the result of adding each layer into Σ to make it a type.
Σ(xˉ)⎩⎨⎧φ(xˉ,yˉ∅)⎩⎨⎧φ(xˉ,yˉ0){φ(xˉ,yˉ00)⋯x000 is the solution of this branch¬φ(xˉ,yˉ01)⋯x001 is the solution of this branch¬φ(xˉ,yˉ1){φ(xˉ,yˉ10)⋯x010 is the solution of this branch¬φ(xˉ,yˉ11)⋯x011 is the solution of this branch¬φ(xˉ,yˉ∅)⎩⎨⎧φ(xˉ,yˉ0){φ(xˉ,yˉ00)⋯x100 is the solution of this branch¬φ(xˉ,yˉ01)⋯x101 is the solution of this branch¬φ(xˉ,yˉ1){φ(xˉ,yˉ10)⋯x110 is the solution of this branch¬φ(xˉ,yˉ11)⋯x111 is the solution of this branch
Formalizing such idea, we get the following definition.
Let Σ(xˉ) be a satisfiable set of L(M)-formulas, and φ(xˉ,yˉ) be an L-formula. For each n∈ω, consider a set of variables
{xˉν∣ν∈n2}∪{yˉη∣η∈n>2}.
Σn is the union of the following sets, which is the n-th layer of the binary tree:
⋃ν∈n2Σ(xˉν),
{φ(xˉν,yˉν∣k);ν∈n2,k<n,ν(k)=0},
{¬φ(xˉν,yˉν∣k);ν∈n2,k<n,ν(k)=1}.
We call Σn the φ-binary tree with height n from Σ. In the case when Σ is empty, we simply say it’s a φ-tree with height n.
The greatest n that makes Σn satisfiable is written as R(Σ,φ,2)=n, called the (φ,2)-rank of Σ. If there is no greatest one, we simply write ω.
A simple fact is that when Γ⊂Σ are two sets of formulas, R(Γ,φ,2)≥R(Σ,φ,2). And you can always find a finite subset Π of Σ to make R(Π,φ,2)=R(Σ,φ,2). To show this, think of M as the monster model which makes any finitely satisfiable set of formulas satisfiable.
By this definition, we have a characterization of definability.
(Char.B) If for each L-formula φ(xˉ,yˉ), the satisfiable φ-tree has a finite height, namely R(xˉ=xˉ,φ,2)<ω, then all types p are definable.
Let p∈S(A) be a type and φ(xˉ,yˉ) be an L-formula. Let n=R(p,φ,2)<ω. Then there is a formula ψ∈p such that R(ψ,φ,2)=n.
We claim that for each aˉ∈A,
φ(xˉ,aˉ)∈p⇔R(ψ(xˉ)∧φ(xˉ,aˉ),φ(xˉ,yˉ),2)≥n.
The ⇒ part is obvious. Now assume φ(xˉ,aˉ)∈p. Since p is a type, ¬φ(xˉ,aˉ)∈p, then R(ψ∧¬φ(xˉ,aˉ),φ,2)≥n. At the same time, R(ψ(xˉ)∧φ(xˉ,aˉ),φ,2)≥n. Now we can insert φ(xˉ,aˉ) and ¬φ(xˉ,yˉ) in front of the binary tree starting from ψ, and this gives a φ-tree starting from ψ with height n+1 and satisfiable. This implies R(ψ,φ,2)=n+1, which is a contradiction to the fact that R(ψ,φ,2)=n. Hence the ⇐ part is proved.
Now, we denote the φ-tree starting from ψ∧φ(xˉ,aˉ) as Γ(aˉ) with height n. Then R(ψ(xˉ)∧φ(xˉ,aˉ),φ,2)≥n is equivalent to saying M⊨Γ(aˉ). Now take
θφ(aˉ)=∃{xˉν,yˉη∣ν∈n2,η∈n>2}γ∈Γ⋀γ.
We have φ(xˉ,aˉ)∈p⇔θφ(aˉ). Hence p is definable.
Order Property
Order property is an important thing in model theory. We say a theory T has the order property if there is an L-formula φ(xˉ,yˉ) and a sequence of elements (aˉi)i∈ω∈M such that
i<j⇔M⊨φ(ai,aj).
A simple example is taking ai=i and letting φ(i,j) be i<j, which is an order. This definition seems lovely, but unlike most other properties, we actually do not want this property. It may cause chaos in a theory. The following proposition explains.
(Char.C) If T doesn’t have the order property, then every formula φ has R(xˉ=xˉ,φ,2)<ω.
We define some temporary notations. Let X={xˉν;ν∈ω2}, Y={yˉη;η∈ω>2}, and
Before we give the proof of (Char.C), we give a lemma first. Let Γ(x) be a set of L(A)-formulas. If ⋃ν∈ω2Γ(xˉν)∪Σ(X,Y) is satisfiable, then we can find a type r(yˉ)∈S(A) such that ⋃ν∈ω2Γ(xˉν)∪Σ(X,Y)∪⋃η∈ω>2r(yˉη) is also satisfiable.
Choose a set of constants D={dη}η∈ω>2 in M which makes ⋃ν∈ω2Γ(xˉν)∪Σ(X,D) satisfiable. Now, for each L(A)-formula ψ(yˉ), there are only two possibilities:
There is an η∈ω>2 that makes {dˉι;η⊂ι,M⊨ψ(dι)} finite (the tree stops somewhere).
For all η∈ω>2, {dˉι;η⊂ι,M⊨ψ(dι)} is infinite.
If it’s 1, then ⋃ν∈ω2Γ(xˉν)∪Σ(X,Y)∪{¬ψ(yˉη);η∈ω>2} is satisfiable. Otherwise, ⋃ν∈ω2Γ(xˉν)∪Σ(X,Y)∪{ψ(yˉη);η∈ω>2} is satisfiable. Now, if the situation is 1, then we put ¬ψ(yˉ) into r(yˉ); otherwise we put ψ(yˉ) into r(yˉ). By enumeration, type r(yˉ) is constructed. By compactness, ⋃ν∈ω2Γ(xˉν)∪Σ(X,Y)∪⋃η∈ω>2r(yˉη) is satisfiable.
Now, we prove (Char.C) by contradiction. Assume there is a formula φ that has R(xˉ=xˉ,φ,2)=ω.
Since Σ(X,Y) is satisfiable, we find solutions aˉ0,bˉ0,bˉ0′ to xˉν,yˉ∅,yˉ0, where ν(0)=0, ν(1)=1. Intuitively, aˉ0 is the solution of the branch after ¬φ(xˉ,yˉ1), therefore we have M⊨φ(aˉ0,bˉ0)∧¬φ(aˉ0,bˉ0′). Hence,
M⊨φ(aˉ0,bˉ0)↔φ(aˉ0,bˉ0′),
Σ(X,Y)∪{φ(xˉν,bˉ0)↔φ(xˉν,bˉ0′);ν∈ω2} is satisfiable.
Now, we construct sequences of aˉn,bˉn,bˉn′ recursively. If the sequences for i<n are already constructed and
M⊨φ(aˉi,bˉj)↔φ(aˉi,bˉj′)⇔i<j,
Σ(X,Y)∪{φ(xˉν,bˉi)↔φ(xˉν,bˉi′);ν∈ω2} is satisfiable,
then by the lemma, we can construct a type r(yˉ) that makes
satisfiable. Now we take solutions aˉn,bˉn,bˉn′ to xˉν,yˉ∅,yˉ0. It’s easy to verify that this satisfies the previous two demands. Therefore, M⊨φ(aˉi,bˉj)↔φ(aˉi,bˉj′)⇔i<j.
Now, we concatenate the three sequences as one as cn=an⌢bn⌢bn′, and let
This gives an order property of T, which is a contradiction. Hence (Char.C) is proved.
Characterization
(Char.A), (Char.B) and (Char.C) give the sufficiency part of the characterization of stability, but we need more, which is the following theorem.
(Shelah’s Characterization of Stability) The following four are equivalent:
T is stable.
T doesn’t have the order property.
For each L-formula φ, R(xˉ=xˉ,φ,2)<ω.
All types p are definable.
The theorems (Char.A), (Char.B) and (Char.C) are 4⇒1, 3⇒4 and 2⇒3 respectively. Now we prove 1⇒2 to complete the loop.
We prove by contraposition. Assume T has the order property, which is given by φ(xˉ,yˉ) and {aˉi}i∈ω.
Let κ be an arbitrary cardinal and λ be the least cardinal such that κλ>κ. Take (D,<) to be a dense well-ordering without endpoints where ∣D∣=κ. Choose a set of constants {aη}η∈λ>D. There is a lexicographic order on it. Define the set
Σ={φ(aη,aη′)∣η<η′}∪{¬φ(aη,aη′)∣η≥η′}.
It is finitely satisfiable by using {ai}i∈ω. Therefore, it can be satisfied by a set of elements in M and we simply denote these elements by {aη}η∈λ>D, which is a harmless abuse of notation. This gives a PRO version of the order property, which means
We claim that it is finitely satisfiable. Take arbitrary finitely many indices
η1<⋯<ηk<ν<ηk+1<⋯<ηn.
Since λ>D is dense, we can find an η0 between ηk and ηk+1. Then the corresponding finite subset of Γν(xˉ) is satisfiable.
Now, since κ<λ=κ (by the minimality of λ), we have ∣A∣=κ where A={aη;η∈λ>D}. For each ν∈λD, Γν can be extended to a type pν∈S(A). Meanwhile, for different ν and ν′, the corresponding types are different. Therefore, we have ∣S(A)∣≥∣λD∣>κ, which means T is not κ-stable.