実直苦闘 シンセリティとストラグル

On Model Theory: More about Models

In this chapter, we will dive deeper into models (but still basic things). It’s mainly about the relationship between models.

Elementarity and Tarski’s Test

In the previous discussion, we have introduced the thing called embedding. However, the property of it is still not good enough for us. Thus, we introduce a stronger property called elementarity.

Generally, a map h:ABh:A\rightarrow B between the universes of two structures A\mathfrak{A} and B\mathfrak{B} is called elementary if for all a1,,anAa_1,\cdots,a_n\in A, Aφ(a1,,an)Bφ(h(a1),,h(an)).\mathfrak{A}\models \varphi(a_1,\cdots,a_n)\Leftrightarrow \mathfrak{B}\models \varphi(h(a_1),\cdots,h(a_n)). If it is an embedding, then we say it is an elementary embedding. Furthermore, we say A\mathfrak{A} is the elementary substructure of B\mathfrak{B} if it is a substructure, and B\mathfrak{B} is also called the elementary extension of A\mathfrak{A}, denoted as AB\mathfrak{A}\prec\mathfrak{B}.

The nontriviality of this definition lies in the fact that it preserves the truth value of existential and universal propositions. For example, (N,)(\mathbb{N},\le) is a substructure of (Z,)(\mathbb{Z},\le), but the sentence x(x0)\exists x(x\le 0) lives differently in those two structures. The following theorem characterizes this, which is called the Tarski’s Test. Consider an LL-structure A\mathfrak{A} and a subset BB of its universe, then BB is the universe of an elementary substructure B\mathfrak{B} of A\mathfrak{A} if and only if every L(B)L(B)-formula φ(x)\varphi(x) which is satisfied in A\mathfrak{A} can be satisfied by an element of BB.

The part of “only if” is almost obvious. We shall see each φ(x)\varphi(x) satisfied in A\mathfrak{A} implies Axφ(x)\mathfrak{A}\models\exists x\varphi(x), then by elementarity, this sentence holds in B\mathfrak{B}, and then an element exists to satisfy φ(x)\varphi(x).

To prove the part of “if”, we need to construct a substructure based on the subset, and then verify the elementarity. We can assign the interpretation of each constant directly from A\mathfrak{A}. By the condition, for any fLf\in L and a1,,anAa_1,\cdots,a_n\in A, the formula f(a1,,an)=xf(a_1,\cdots,a_n)=x is always satisfied by some element of BB, hence the set is closed under fAf^\mathfrak{A}. The interpretation of relations can be inherited from A\mathfrak{A} naturally.

Finally, let’s check the fact that AψBψ\mathfrak{A}\models\psi\Leftrightarrow\mathfrak{B}\models\psi for all L(A)L(A)-sentences ψ\psi. When ψ\psi is atomic, this is clear from the construction. When ψ\psi is of the form ψ=¬φ\psi=\neg\varphi or ψ=φ1φ2\psi=\varphi_1\wedge\varphi_2, it is quite easy to check by the table of truth values. When ψ\psi is of the form of ψ=xφ(x)\psi=\exists x\varphi(x), if ψ\psi holds in A\mathfrak{A}, then there is some aa in AA that satisfies φ(x)\varphi(x), then by the condition, some bb exists in BB to satisfy φ(x)\varphi(x). Hence xφ(x)\exists x\varphi(x) holds in B\mathfrak{B}. Conversely, when some bb in BB satisfies φ(x)\varphi(x), then Aφ(b)\mathfrak{A}\models\varphi(b) vacuously. Hence Axφ(x)Bxφ(x)\mathfrak{A}\models\exists x\varphi(x)\Leftrightarrow\mathfrak{B}\models\exists x\varphi(x). With the discussion previously on the negation normal form, we can induce the elementarity of B\mathfrak{B}.

Here the Tarski’s Test is proved. It’s quite useful to generate elementary substructures, but the usage of it will be shown later. Now, another proposition named after Tarski is going to be introduced, which concerns chains.

For the reader’s convenience, we introduce the idea of chains first. A partial order (I,)(I,\le) is called directed if for all i,jIi,j\in I, there exists some kIk\in I such that iki\le k and jkj\le k. A family of structures (Ai)iI(\mathfrak{A}_i)_{i\in I} is said to be directed if for all ijIi\le j\in I, AiAj\mathfrak{A}_i\subset\mathfrak{A}_j. Moreover, it is called a chain if (I,)(I,\le) is a linear order, and it is called elementary if we replace all the \subset by \prec in the definition.

The Tarski’s Chain Lemma asserts that, the union of an elementary directed family of structures is an elementary extension of all its members. To prove this, we assume (Ai)iI(\mathfrak{A}_i)_{i\in I} is the family and A=iIAi\mathfrak{A}=\bigcup_{i\in I}\mathfrak{A}_i. For all iIi\in I and tuple aˉAi\bar{a} \in A_i, we need to verify that Aiφ(aˉ)Aφ(aˉ).\mathfrak{A}_i\models\varphi(\bar{a})\Leftrightarrow\mathfrak{A}\models\varphi(\bar{a}).

For the situation where φ(xˉ)\varphi(\bar{x}) is an atomic formula, negation or a conjunction, it’s quite easy to see. When it’s of the form φ(xˉ)=yψ(xˉ,y)\varphi(\bar{x})=\exists y\psi(\bar{x},y), we can see φ(aˉ)\varphi(\bar{a}) holds in A\mathfrak{A} if and only if some bAb\in A exists with Aψ(aˉ,b)\mathfrak{A}\models\psi(\bar{a},b). By the directedness, there always exists some jIj\in I such that AiAj\mathfrak{A}_i\prec\mathfrak{A}_j and bAjb\in A_j. Hence Ayψ(xˉ,y)Ajyψ(xˉ,y)Aiyψ(xˉ,y).\mathfrak{A}\models\exists y\psi(\bar{x},y)\Leftrightarrow\mathfrak{A}_j\models\exists y\psi(\bar{x},y)\Leftrightarrow\mathfrak{A}_i\models\exists y\psi(\bar{x},y).

The Compactness Theorem

After introducing the idea of elementarity, we can talk about the so-called Löwenheim-Skolem Theorem. But before detailed introduction, an important theorem is going to be shown first, as a preliminary, which is the Compactness Theorem.

We say a theory TT is finitely satisfiable if every finite subset of TT is satisfiable. The Compactness Theorem says that each finitely satisfiable theory is consistent. To prove this, we need to construct a structure for the theory TT. One way is to extend this theorem to a “better” one, which we call a finitely complete Henkin theory, and find a model for it.

An LL-theory TT is informally defined as finitely complete if it is finitely satisfiable and for every LL-sentence φ\varphi, either φT\varphi \in T or ¬φT\neg\varphi\in T. Adding a set CC of new constant symbols into LL to get a new language L(C)L(C), an L(C)L(C)-theory TT' is called a Henkin theory if for every L(C)L(C)-formula φ(x)\varphi(x), there exists some cCc\in C such that xφ(x)φ(c)T.\exists x\varphi(x)\rightarrow \varphi(c)\in T'. The elements of CC are called the Henkin constants of TT'.

Choose an arbitrary LL-theory TT, we now extend it to a finitely complete Henkin theory. We first construct a set of Henkin constants. Let C0=C_0 = \emptyset, then inductively, we define a set of new symbols Ci+1={cφ(x);φ(x) a L(Ci)-formula}C_{i+1} = \{c_{\varphi(x)};\varphi(x) \text{ a } L(C_i)\text{-formula}\}, where φ\varphi here is used as indices to separate constants. Then we get an ascending chain C0C1C_0\subset C_1\subset \cdots. Let CC be the union of such a chain. Then for any L(C)L(C)-formula φ(x)\varphi(x), there is a unique constant in CC corresponding to it. Therefore, we construct a set THT^H of Henkin axioms for all L(C)L(C)-formula φ(x)\varphi(x) that xφ(x)φ(cφ(x))\exists x\varphi(x)\rightarrow \varphi(c_\varphi(x)). It’s obvious for one to extend a model of TT to a model of TTHT\cup T^H, and this theory is finitely satisfiable. We can find a maximal finitely satisfiable theory TT^* containing TTHT\cup T^H by Zorn’s Lemma. Finally, consider some L(C)L(C)-sentence φ\varphi. If neither φ\varphi nor ¬φ\neg\varphi belongs to TT^*, then there must be a finite subset SS of TT^* such that both S{φ}S\cup\{\varphi\} and S{¬φ}S\cup\{\neg\varphi\} are inconsistent, and this leads to the fact that SS is not satisfiable, which contradicts the finite satisfiability of TT^*. Hence it is finitely complete.

Now, we can then prove that every finitely complete Henkin theory has a model. To construct such a model, we first construct a universe to interpret all the Henkin constants. For c,dCc,d\in C, we define a relation cd(c=d)Tc\sim d\Leftrightarrow (c = d)\in T^*. This is definitely an equivalence relation, which leads to equivalence classes of cCc\in C, denoted by aca_c. The universe we need is defined as A={ac;cC}A=\{a_c;c\in C\}.

To find a corresponding structure A\mathfrak{A}, we define

fA(ac1,,acn)=ac0f(c1,,cn)=c0T,f^\mathfrak{A}(a_{c_1},\cdots,a_{c_n})=a_{c_0}\Leftrightarrow f(c_1,\cdots,c_n)=c_0\in T^*, RA(ac1,,acn)R(c1,,cn)T.R^\mathfrak{A}(a_{c_1},\cdots,a_{c_n})\Leftrightarrow R(c_1,\cdots,c_n)\in T^* .

Noting that A\mathfrak{A} is just an LL-structure, let A=(A,ac)cC\mathfrak{A}^*=(\mathfrak{A},a_c)_{c\in C}. To see that it is a model of TT^*, we need to verify for every L(C)L(C)-sentence φ\varphi that AφφT.\mathfrak{A}^*\models\varphi\Leftrightarrow \varphi\in T^*. When φ\varphi is of the form c=dc=d or R(c1,,cn)R(c_1,\cdots,c_n), it is clear from the definition.

When φ\varphi is of the form ψ(f(c1,,cn))\psi(f(c_1,\cdots,c_n)) for a function symbol fLf\in L and a formula ψ(x)\psi(x), there is some c0=f(c1,,cn)Tc_0=f(c_1,\cdots,c_n)\in T^*. Then AφAψ(c0)\mathfrak{A}^*\models\varphi\Leftrightarrow\mathfrak{A}^*\models\psi(c_0) and φTψ(c0)T\varphi\in T^*\Leftrightarrow \psi(c_0)\in T^*. Since c0c_0 here is not a function, it actually follows from the definition with ψ(c0)TAψ(c0)\psi(c_0)\in T^*\Leftrightarrow\mathfrak{A}^*\models\psi(c_0). Hence AφφT\mathfrak{A}^*\models\varphi\Leftrightarrow \varphi\in T^*.

When φ\varphi is a negation or a conjunction, it’s very easy to check.

When φ=xψ(x)\varphi=\exists x\psi(x), we have

AφAψ(c) for some cCψ(c)T for some cCφT.\begin{align*} \mathfrak{A}^*\models\varphi &\Leftrightarrow \mathfrak{A}^*\models\psi(c)\text{ for some } c\in C\\ &\Leftrightarrow\psi(c)\in T^*\text{ for some } c\in C\\ &\Leftrightarrow\varphi\in T^*. \end{align*}

Then by induction on the complexity of φ\varphi, we have checked that A\mathfrak{A}^* is the model of TT^*. And by simply forgetting the Henkin constants, we get a model of TT, which is a finitely satisfiable theory. Therefore, every satisfiable theory is consistent, which is the statement of the Compactness Theorem.

Ultraproduct: Another Proof

The proof above is quite fabulous, but I would say, it’s not elegant. Here, another proof can be given with the idea of ultraproduct, which is more “algebraic”. One can simply skip since this is not the main thread.

Firstly, we need to define the idea of a filter. For a given set II, a filter F\mathcal{F} is a collection of subsets of II ($\mathcal{F}\subset\mathcal{P}(I)) satisfying

  1. ∉F\emptyset\not\in\mathcal{F} and IFI\in\mathcal{F},
  2. For A,BFA,B\in\mathcal{F}, ABFA\cap B\in\mathcal{F},
  3. For AFA\in\mathcal{F} and BIB\subset I, ABA\subset B implies BFB\in\mathcal{F}.

Simply put, the idea of filter defines the “inescapable” part of a set, which may relate to the idea of “almost everywhere”. Especially, if for any subset AIA\subset I, either AFA\in\mathcal{F} or IAFI\setminus A\in\mathcal{F} holds, then we say it is an ultrafilter, which means we can accurately distinguish whether a part is “important”. And it’s clear that the ultrafilter is exactly the maximal filter of the ascending chain containing itself (because it contains exactly all the situations).

For a family of LL-structures {Ai}iI\{\mathfrak{A}_i\}_{i\in I}, there is a Cartesian product A=iIAiA=\prod_{i\in I}A_i. An ultrafilter U\mathcal{U} of II defines an equivalence relation of AA by (most parts of entries are equal) (ai)iI(bi)iI{iI;ai=bi}U.(a_i)_{i\in I}\sim (b_i)_{i\in I}\Leftrightarrow \{i\in I;a_i=b_i\}\in \mathcal{U}.

Then the quotient set A/A/\sim of equivalence classes [(ai)iI][(a_i)_{i\in I}] is the universe of an LL-structure, called the ultraproduct, denoted as iIAi/U\prod_{i\in I}\mathfrak{A}_i/\mathcal{U}, by letting

  1. for constants cc, ciIAi/U=[(cAi)iI]c^{\prod_{i\in I}\mathfrak{A}_i/\mathcal{U}}=[(c^{\mathfrak{A}_i})_{i\in I}],
  2. for functions ff and elements [(ai1)iI],,[(ain)iI][(a^1_i)_{i\in I}],\cdots,[(a^n_i)_{i\in I}], fiIAi/U([(ai1)iI],,[(ain)iI])=[(fiIAi/U(ai1,,ain))iI]f^{\prod_{i\in I}\mathfrak{A}_i/\mathcal{U}}\left([(a^1_i)_{i\in I}],\cdots,[(a^n_i)_{i\in I}]\right)=[(f^{\prod_{i\in I}\mathfrak{A}_i/\mathcal{U}}(a^1_i,\cdots,a^n_i))_{i\in I}],
  3. for relations RR and elements [(ai1)iI],,[(ain)iI][(a^1_i)_{i\in I}],\cdots,[(a^n_i)_{i\in I}], RiIAi/U([(ai1)iI],,[(ain)iI]){iI;RAi(ai1,,ain)}UR^{\prod_{i\in I}\mathfrak{A}_i/\mathcal{U}}([(a^1_i)_{i\in I}],\cdots,[(a^n_i)_{i\in I}])\Leftrightarrow \{i\in I; R^{\mathfrak{A}_i}(a^1_i,\cdots,a^n_i)\}\in\mathcal{U}.

These definitions seem to be complicated, but one will find it clear by realizing symbols are explained as the Cartesian product of what it is explained in “almost every” given structures.

With the definition of ultraproduct, we have the thing called Łos’s Theorem. Given a family of LL-structures {Ai}iI\{\mathfrak{A}_i\}_{i\in I} and an ultrafilter U\mathcal{U} of II, the ultraproduct A\mathfrak{A} holds, for every LL-formula φ(x1,,xn)\varphi(x_1,\cdots,x_n) and [(ai1)iI],,[(ain)iI]A=iIAi/[(a^1_i)_{i\in I}],\cdots,[(a^n_i)_{i\in I}]\in A=\prod_{i\in I}A_i/\sim, that Aφ([(ai1)iI],,[(ain)iI]){iI;Aiφ(ai1,,ain)}U.\mathfrak{A}\models\varphi([(a^1_i)_{i\in I}],\cdots,[(a^n_i)_{i\in I}])\leftrightarrow\{i\in I;\mathfrak{A}_i\models\varphi(a^1_i,\cdots,a^n_i)\}\in\mathcal{U}.

Now, let’s think about the Compactness Theorem. If the LL-theory TT is finitely satisfiable, then there is a set II of the finite subsets of TT, and for each finite subset ii, there is a model Ai\mathfrak{A}_i of ii.

Construct a class of subsets of II that S={Iσ={iI,σi};σT}S=\{I_\sigma=\{i\in I,\sigma\in i\};\sigma\in T\}. It is clear that for the intersection of any finite elements of SS is not empty (this is the property called finite intersection property), then we can construct a filter containing SS on II by F={A;AI and A1,,AnS such that AA1An}.\mathcal{F}=\{A;A\subset I \text{ and } \exists A_1,\cdots,A_n\in S\text{ such that } A\supset A_1\cap \cdots\cap A_n\}.

Therefore, there is an ultrafilter of II. Then, an ultraproduct can be constructed. And it is easy to check by Łos’s theorem that this ultraproduct is the model of TT. Hence the Compactness Theorem is proved.

Final Preparations

It’s so close to the end of this chapter, the Löwenheim-Skolem Theorem. But still, we have some necessary lemmas to be prepared.

Recall that the symbol AA\mathfrak{A}_A is the extended structure of the extended language L(A)L(A). The models of Th(AA)Th(\mathfrak{A}_A) are exactly the structures of the form (B,h(a))aA(\mathfrak{B},h(a))_{a\in A} for elementary embeddings h:ABh:\mathfrak{A}\rightarrow\mathfrak{B}.

It is vacuous that (B,h(a))aA(\mathfrak{B},h(a))_{a\in A} is a model of Th(AA)Th(\mathfrak{A}_A). Suppose there is a model C\mathfrak{C}. For each aAa\in A, choose c{cC;Cφ(c) where φ(x) is an L(A)-formula with Aφ(a)}c\in \{c\in C;\mathfrak{C}\models\varphi(c)\text{ where }\varphi(x)\text{ is an }L(A)\text{-formula with }\mathfrak{A}\models\varphi(a) \}, and then construct a function that h(a)=ch(a)=c. Then hh is an elementary embedding we need.

Now consider an LL-structure A\mathfrak{A}. Let SS be a subset of AA. Then there is an elementary substructure B\mathfrak{B} containing SS and Bmax(S,L,0)|\mathfrak{B}|\le \max(|S|,|L|,\aleph_0). Here L|L| stands for the number of constants, functions and relations.

To prove this, we need to construct such a substructure. Let S0=SS_0= S, then inductively, let Si+1=Si{aφA;φ a L(A)-structure andAφ(a)}S_{i+1}=S_i\cup\{a_\varphi\in A;\varphi\text{ a }L(A)\text{-structure and}\mathfrak{A}\models\varphi(a)\}. Then the union of SnS_n, denoted as BB, is the universe of a substructure B\mathfrak{B} by the Tarski’s Test.

Now we compute the cardinality of BB. Since the construction of φ(x)\varphi(x) is a finite sequence of symbols in LL, then we can assert that there are max(L,0)\max(|L|,\aleph_0) many LL-formulas. For each nn, there are max(Sn,L,0)\max(|S_n|,|L|,\aleph_0) many L(Sn)L(S_n)-formulas. Therefore inductively, there are max(S,L,0)\max(|S|,|L|,\aleph_0) formulas. Hence Bmax(S,L,0)|\mathfrak{B}|\le \max(|S|,|L|,\aleph_0).

The Löwenheim-Skolem Theorem

Finally, we arrive at the climax of this chapter. Simply put, this theorem asserts that, in the context of the first-order logic, once an LL-theory has an infinite model, then there is a model with cardinality of any infinite cardinal.

Formally, let A\mathfrak{A} be an infinite LL-structure, SS a subset of AA, κ\kappa an infinite cardinal.

  1. (Downward) If max(S,L)κA\max(|S|,|L|)\le\kappa\le|A|, then there is an elementary substructure B\mathfrak{B} of cardinality κ\kappa containing SS.
  2. (Upward) If Aκ|A|\le\kappa, then there is an elementary extension B\mathfrak{B} of cardinality κ\kappa.

To prove the downward one, we choose a subset SS' of AA with cardinality exactly κ\kappa, then apply the result of the last section, an elementary substructure containing SS' exists.

To prove the upward one, we construct a set CC with cardinality κ\kappa. A theory can then be given as Th(AA){¬c=d;cdC}Th(\mathfrak{A}_A)\cup\{\neg c=d;c\not= d\in C\}. Obviously, it is finitely satisfiable, and then the Compactness Theorem implies a structure K\mathfrak{K} exists with cardinality greater than κ\kappa. By the result stated in the last section, AK\mathfrak{A}\prec\mathfrak{K}. Finally, by the downward part, we can obtain an elementary extension B\mathfrak{B} with exactly B=κ|B|=\kappa.