On Model Theory: The Thing So Called Model
Alright, this series is a set of notes on Model Theory, which is based on the book A Course in Model Theory. Basically, there will be an article for each chapter, but this is just my plan, since I don’t know how dense the later chapters will be.
And here is the first article, about some basic things; most of them are the knowledge that one should have learnt in First Order Logic. Actually, I have another article, not published though, which concentrates on FOL, but since there is something more in the context of Model Theory, I’m going to write here once more.
Language and Structure
Let’s talk about a metaphor. Is the sentence “あ、天気がいいかな” true? No matter whether the answer is yes or no, such a judgment must go through 2 steps. Firstly, one needs to know what this sentence means, which is to say, one needs to understand this language and find the signified of each word. Moreover, one needs to learn the grammar to understand what it states, rather than what it talks about. Secondly, one needs to check the real situation: is the weather good? After that, one can make a judgment.
That is what we mainly think about when we deal with mathematical logic. We try to understand what a formula means, both the signified of symbols and the rules of deduction. And then we give a language a designated set (one should realize here that model theory is based on set theory), which we call a structure, and check the truth of a formula.
So, what is a language, or more precisely, a first order language?
First of all, we need a bunch of logic symbols, which don’t vary as the language changes, because the things we care about are mainly in mathematics. They are some variables (use any symbols you like; sometimes , sometimes ), some operations like , and two quantifier symbols . One who has learnt propositional logic may know that they’re not all necessary since they can represent each other by some combinations. Actually, we can also just “read” from it. For example, can be replaced by . But for the sake of convenience, we just introduce all of them here and do not analyze further until necessary.
After that, some symbols can be introduced for use in different categories. They are some constants, some functions, and some relations. Constants here are used to represent the significant elements we want in a theory, such as the identity in group theory. Functions here are used to represent the operations in a mathematical system, like or . A relation is the most important thing here, which is indispensable, used to construct statements, and is also called a predicate. Sometimes, a special relation is explicitly written as , which always indicates equality.
Once a language is given, we can talk about structures on it. Basically, a structure is an assignment based on a set (we call it the universe of the structure), which interprets each constant as an element of , each -ary function as a function from to , and each -ary relation as an -ary relation on . Usually, we use the fraktur style of a letter to represent the assignment of a set; for example, is the structure whose universe is . And we write when is a constant symbol, to denote its interpretation.
Wait, what have we missed? Ah, the grammar; we still need grammar to construct sentences. We call each variable and each constant a term, and when we fill a function with terms, the result is also a term. And with terms, we call a relation filled by terms an (atomic) formula, and the combination of formulas with logic symbols is also a formula. With some variable and a formula , and are both formulas. In particular, if all the variables in a formula are constrained by quantifiers, then we call it a sentence.
This may seem to be a mess, so let’s consider an example. is a language with one relation, one function and no constants. is a structure of it. is a term. and are both formulas. is a sentence.
After these are defined, we can talk about “truth”. But, given an arbitrary formula , we still can’t say if it’s true, because there are some variables not interpreted. So we need an assignment on variables. Suppose is a formula with variables , then we write . For , we write to denote replacing the variable with if is not under the scope of some quantifier. For example, , then . Then, we can define the notation for some when
- if is an atomic formula,
- and if ,
- or if ,
- if ,
- or if ,
- both and , or both and , if ,
- there exists some such that if ,
- for any , if . With 7 and 8, we shall see that the assignment of variables doesn’t affect the truth of a formula if such a variable is in the scope of some quantifier. We call such variables bounded variables; otherwise we say they are free. Therefore, a formula without any free variable has a unique truth value. We call such a formula a sentence. And if a sentence is true under a structure, we say the sentence is satisfied by the structure, and we call the structure a model of the sentence.
Language and Language, Structure and Structure
One may find some relationship between languages and languages, structures and structures. For example, a language of the ring seems to contain the symbols of the group. Given a language of the group, a group is a set and its subgroup seems to be another structure for the language.
Given a language , we say a subset of is a sublanguage, which also forms a language. For example, is a language, and is a sublanguage of it. Clearly, a structure of some language is naturally a structure of its sublanguage, which can be obtained easily by forgetting the interpretation of those symbols that are not used.
We can expand a language manually with a structure. Consider a language and its structure ; if is a subset of , then we can add the elements of into as new constants and interpret them as exactly what they are in . Such a new language is denoted by , and a new structure is naturally given by .
Between two -structures and , if there is a map such that
- , for a -constant,
- , for a -function,
- , for a -relation, then we call a homomorphism from to . If is an injection and , then we say it is an embedding. Once is a bijective embedding, we call it an isomorphism.
Then, for two structures and with , we say is a substructure of if the inclusion map is an embedding, denoted as .
As we have seen in group theory, we can define a substructure generated from a subset. By simple analysis, one may realize that the intersection of a family of substructures is a substructure. Then for a subset of a structure , there is a smallest substructure that contains , which is denoted by and is called the substructure generated by . If is finite, then is called finitely generated.
Moreover,
The World of Theories
However, we do not really care about language when learning mathematics, right? Everything we deal with when doing maths is the sentences, which we generally call axioms, theorems and propositions.
In mathematical logic, a theory is a set of sentences, such as the axioms of groups, rings and fields.
Generally, we are going to introduce the deduction of sentences, which is quite important in the first order language. However, we won’t describe it here, since it is not the topic of Model Theory, and we only give some conclusions.
If a theory has a model of it, which means there is a structure such that every sentence in it is satisfied by the structure, it is called consistent. Consistency means there are no inner contradictions within the set of sentences. This definition is also usable on theories; for two sets of sentences and , we say is consistent with if is consistent.
For two sentences and , if any model of is a model of , i.e., for any , if and only if , then we write . This definition can be generalized to two theories and easily: if all models of are models of . And we say and are equivalent if they have the same models, which means and , denoted as .
Given a language and an -theory , we say is complete if for any -sentence , either or . It’s quite clear that the relation is independent of the language, but the completeness does depend.
Before we give a characterization of completeness, a characterization of formulas and some kinds of sets are going to be introduced first.
We call a formula basic if it is an atomic formula or it is the negation form of some atomic formula. And the formula constructed from basic formulas with the logic symbols is said to be in negation normal form.
Actually, every formula is equivalent to a formula in negation normal form. One only needs to realize that is equivalent to , is equivalent to , and De Morgan’s law allows “putting” a into a given formula.
Some Sets
Given a formula with some free variables , there is a set It is called the realization of . We can see this as a generalization of for some a relation.
More generally, given a subset of the universe of a structure , for some -formula , is said to be a -definable set. In particular, the realization of is a -definable set (the notation denotes the empty set in set theory).
For a structure , the atomic diagram is defined as And the theory of is defined as Two models are called elementarily equivalent if they have the same theory, written .
With these, we can characterize the completeness of a theory. Let be a consistent theory; then the following are equivalent:
- is complete.
- All models of are elementarily equivalent.
- There exists a model with . To prove it, we separate it into 3 parts. "": Suppose two models and of . For any , without loss of generality, suppose , then and . "": If there is no model satisfying 3, since is consistent, there must be some such that is a proper extension of . Let , then is still consistent, and then the model of , which is also a model of , is not elementarily equivalent to , which contradicts 2. "": By definition.
-Sorted Structure
As the closure of this chapter, we introduce the so-called -sorted structure. Basically, it is something assigning types to a language, which is just like what TypeScript is to JavaScript.
Formally, given a set of sorts whose elements are sorts, a language is a set of constants assigned to some sorts, functions’ domains and ranges are constrained by some sorts, and relations with sorts. The -sorted structure is a structure whose universe is a family of non-empty sets, and
- ,
- ,
- .