54
$\begingroup$

I wonder about the foundations of set theory and my question can be stated in some related forms:

  • If we base Zermelo–Fraenkel set theory on first order logic, does that mean first order logic is not allowed to contain the notion of sets?

  • The axioms of Zermelo–Fraenkel set theory seem to already expect the notion of a set to be defined. Is there are pre-definition of what we are dealing with? And where?

  • In set theory, if a function is defined as a set using tuples, why or how does first order logic and the axioms of Zermelo–Fraenkel set theory contain parameter dependend properties $\psi(u_1,u_2,q,...)$, which basically are functions?

$\endgroup$
10
  • 3
    $\begingroup$ I don’t understand your second version: the axioms of ZF are in effect a definition of the formal notion of set. They are of course intended to capture the pre-existing informal, intuitive notion. $\endgroup$ Commented Mar 16, 2012 at 23:43
  • 2
    $\begingroup$ I wrestled with this exact same question many years ago. To study logic and set theory, you simply have to have an intuitive definition of sets, functions and the natural numbers. groups.google.com/group/sci.logic/msg/… $\endgroup$
    – dls
    Commented Mar 16, 2012 at 23:45
  • 4
    $\begingroup$ Related MO question. $\endgroup$
    – Zhen Lin
    Commented Mar 16, 2012 at 23:47
  • 5
    $\begingroup$ Of course: that’s part of the intuitive concept of set that the axioms are an attempt to formalize, so it’s altogether unsurprising that it appears to say something about sets. But in fact nothing compels you to interpret $\in$ as your intuitive notion of membership: the axioms tell you how it works, so to speak, not what it ‘means’. $\endgroup$ Commented Mar 16, 2012 at 23:51
  • 2
    $\begingroup$ @Nick: No -- formally, sets are just things that can either be in some distinguished relation to each other, or not be, where this distinguished relation must be such that the axioms happen to be true. The fact that we write the distinguished relation with an $\in$ symbol and pronounce it "is an element of" has no bearing on the formal structure of the therory. It is just a suggestive notation that help us remember how things fit together; it doesn't influence whether a formal proof from the axioms is valid or not. $\endgroup$ Commented Mar 16, 2012 at 23:54

7 Answers 7

38
$\begingroup$

(1) This is actually not a problem in the form you have stated it -- the rules of what is a valid proof in first-order logic can be stated without any reference to sets, such as by speaking purely about operations on concrete strings of symbols, or by arithmetization with Gödel numbers.

However, if you want to do model theory on your first-order theory you need sets. And even if you take the syntactical viewpoint and say that it is all just strings, that just pushes the fundamental problem down a level, because how can we then formalize reasoning about natural numbers (or symbol strings) if first-order logic itself "depends on" natural numbers (or symbol strings)?

The answer to that is that is just how it is -- the formalization of first-order logic is not really the ultimate basis for all of mathematics, but a mathematical model of mathematical reasoning itself. The model is not the thing, and mathematical reasoning is ultimately not really a formal theory, but something we do because we intuitively believe that it works.

(2) This is a misunderstanding. In axiomatic set theory, the axioms themselves are the definition of the notion of a set: A set is whatever behaves like the axioms say sets behave.

(3) What you quote is how functions usually are modeled in set theory. Again, the model is not the thing, and just because we can create a model of our abstract concept of functional relation in set theory, it doesn't mean that our abstract concept an sich is necessarily a creature of set theory. Logic has its own way of modeling functional relations, namely by writing down syntactic rules for how they must behave -- this is less expressive but sufficient for logic's need, and is no less valid as a model of functional relations than the set-theoretic model is.

$\endgroup$
5
  • $\begingroup$ Thanks for the answer! (1) So you base this on the ability of logic to survive on its own, just as rule based manipulation of string, and without sets in mind. I think like that. The second paragraph basically abstracts the relations and throws away the human idea of a set, right? Lastly, if we see how the string manipulation work out, why is "intuitively believe" still necessary? (2) In view of that, wouldn't it be better if we'd use other notation for the relations as $\in$ in logic and set theory. That would point out that we just use the way to manipulate these things - no "sets". $\endgroup$
    – Nikolaj-K
    Commented Mar 17, 2012 at 0:18
  • $\begingroup$ And to point (3): So you say that the set theoretic function is only used after it is defined there like that, while logic usees a version of function, which is not necessary related to it? And related to (2): "mathematical model of mathematical reasoning" sounds difficult, because I then don't know what 'mathematical reasoning' an sich is. I mean is there mathematical reasoning at all, if there is a notion, such that we can model it, i.e. is it there before there is a model of it. And do we say ZF set theory a model of logic, or of sets, or of what? $\endgroup$
    – Nikolaj-K
    Commented Mar 17, 2012 at 0:32
  • 1
    $\begingroup$ I have to run; will try to clarify the technical points tomorrow. For now, just this: If you can "see how the string manipulation works out", that is the intuitive belief I speak about. Certainly I intuitively believe that strings of symbols exist and can be concatenated and taken apart, and that if I take two different strings and append the same string to each of them, the results will be different, etc. But I cannot justify that belief by appealing to any concept that feels more basic to me. It's just there. I can model strings using integers or sets, but that is something different. $\endgroup$ Commented Mar 17, 2012 at 3:23
  • $\begingroup$ Mathematical reasoning "an sich" is whatever a group of mathematicians clustered around a blackboard do; whatever goes on in the head of a mathematician when he reads a paper and forms a conclusion either that "this looks valid" or "this proof doesn't work". This is somewhat informed by the mathematician's knowledge of the formalization of logic, but is by no means wholly caused by it. First, mathematical reasoning in this sense had been going on for centuries because formal logic developed as a subject; second, the mathematician probably knows severa different forms of logic ... $\endgroup$ Commented Apr 21, 2012 at 12:02
  • $\begingroup$ (cont.d) which do not always allow the same arguments -- and chooses among them based on some prior notion of his about what kind of reasoning step he already has decided to believe in. $\endgroup$ Commented Apr 21, 2012 at 12:04
20
$\begingroup$

For those concerned with secure foundations for mathematics, the problem is basically this: In order to even state the axioms of ZF set theory, we need to have developed first-order logic; but in order to recognize conditions under which first-order logical formulas can be regarded as "true" or "valid", we need to give a model-theoretic semantics for first-order logic, and this in turn requires certain notions from set theory. At first glance, this seems troublingly circular.

Obviously, this is an extremely subtle area. But as someone who has occasionally been troubled by these kinds of questions, let me offer a few personal thoughts, for whatever they may be worth.

My own view goes something like this. Let's first distinguish carefully between logic and metalogic.

In logic, which we regard as a pre-mathematical discipline, all we can do is give grammatical rules for constructing well-formed statements and derivation rules for constructing "proofs" (or perhaps, so as not to beg the question, we should call them "persuasive demonstrations"); these should be based purely on the grammatical forms of the various statements involved. There is, at this point, no technical or mathematical theory of how to assign meanings to these formulas; instead, we regard them merely as symbolic abbreviations for the underlying natural language utterances, with whatever informal psychological interpretations these already have attached to them. We "justify" the derivation rules informally by persuading ourselves through illustrative examples and informal semantical arguments that they embody a reasonable facsimile of some naturally occurring reasoning patterns we find in common use. What we don't do, at this level, is attempt to specify any formal semantics. So we can't yet prove or even make sense of the classical results from metalogic such as the soundness and/or completeness of a particular system of derivation rules. For us, "soundness" means only that each derivation rule represents a commonly accepted and intuitively legitimate pattern of linguistic inference; "completeness" means basically nothing. We think of first-order logic as merely a transcription into symbolic notation of certain relatively uncontroversial aspects of our pre-critical reasoning habits. We insist on this symbolic representation for two reasons: (i) regimentation, precisely because it severely limits expressive creativity, is a boon to precision; and (ii) without such a symbolic representation, there would be no hope of eventually formalizing anything.

Now we take our first step into true formalization by setting down criteria (in a natural language) for what should constitute formal expression in the first place. This involves taking as primitives (i) the notion of a token (i.e., an abstract stand-in for a "symbol"), (ii) a binary relation of distinctness between tokens, (iii) the notion of a string, and (iv) a ternary relation of "comprisal" between strings, subject to certain axioms. The axioms governing these primitives would go something like this. First, each token is a string; these strings are not comprised of any other strings. By definition, two strings that are both tokens are distinct if the two tokens are distinct, and any string that is a token is distinct from any string that is not a token. Secondly, for any token t and any string S, there is a unique string U comprised of t and S (we can denote this U by 'tS'; this is merely a notation for the string U, just as 't' is merely a notation for the abstract token involved --- it should not be thought of as being the string itself). By definition, tS and t'S' are distinct strings if either t and t' are distinct tokens or S and S' are distinct strings (this is intuitively a legitimate recursive definition). Thirdly, nothing is a string except by virtue of the first two axioms. We can define the concatenation UV of strings U and V recursively: if U is a token t, define UV as the string tV; otherwise, U must be t for some token t and string S, and we define UV as the string tW, where W is the (already defined) concatenation SV. This operation can be proven associative by induction. All this is part of intuitive, rather than formal, mathematics. There is simply no way to get mathematics off the ground without understanding that some portion of it is just part of the linguistic activity we routinely take part in as communicative animals. This said, it is worth the effort to formulate things so that this part of it becomes as small as possible.

Toward that end, we can insist that any further language used in mathematics be formal (or at least capable of formal representation). This means requiring that some distinguishable symbol be set aside and agreed upon for each distinct token we propose to take for granted, and that everything asserted of the tokens can ultimately be formulated in terms of the notions of distinctness, strings, comprisal, and concatenation discussed above. Notice that there is no requirement to have a finite set of tokens, nor do we have recourse within a formal language to any notion of a set or a number at all.

Coming back to our symbolic logic, we can now prove as a theorem of intuitive (informal) mathematics that, with the symbolic abbreviations (the connectives, quantifiers, etc.) mapped onto distinct tokens, we can formulate the grammatical and derivational rules of logic strictly in terms of strings and the associated machinery of formal language theory. This brings one foot of symbolic logic within the realm of formal mathematics, giving us a language in which to express further developments; but one foot must remain outside the formal world, connecting our symbolic language to the intuitions that gave birth to it. The point is that we have an excellent, though completely extra-mathematical, reason for using this particular formal language and its associated deduction system: namely, we believe it captures as accurately as possible the intuitive notions of correct reasoning that were originally only written in shorthand. This is the best we can hope to do; there must be a leap of faith at some point.

With our technically uninterpreted symbolic/notational view of logic agreed upon (and not everyone comes along even this far, obviously -- the intuitionists, for example), we can proceed to formulate the axioms of ZF set theory in the usual way. This adds sets and membership to the primitive notions already taken for granted (tokens, strings, etc.). The motivating considerations for each new axiom, for instance the desire to avoid Russell's paradox, make sense within the intuitive understanding of first-order logic, and these considerations are at any rate never considered part of the formal development of set theory, on any approach to foundational questions.

Once we have basic set theory, we can go back and complete our picture of formal reasoning by defining model-theoretic semantics for first-order logic (or indeed, higher-order logic) in terms of sets and set-theoretic ideas like inclusion and n-tuples. The purpose of this is twofold: first, to deepen our intuitive understanding of correct reasoning by embedding all of our isolated instincts about valid patterns of deduction within a single rigid structure, as well as to provide an independent check on those instincts; and secondly, to enable the development of the mathematical theory of logic, i.e., metalogic, where beautiful results about reasoning systems in their own right can be formulated and proven.

Once we have a precise understanding of the semantics (and metatheory) of first-order logic, we can of course go back to the development of set theory and double-check that any logical deductions we used in proving theorems there were indeed genuinely valid in the formal sense (and they all turn out to be, of course). This doesn't count as a technical justification of one by means of the other, but only shows that the two together have a comforting sort of coherence. I think of their relation as somehow analogous to the way in which the two concepts of "term" and "formula" in logical grammar cannot be defined in isolation from one another, but must instead be defined by a simultaneous recursion. That's obviously just a vague analogy, but it's one I find helpful and comforting.

I hope this made sense and was somewhat helpful. My apologies for being so wordy. Cheers. - Joseph

$\endgroup$
2
  • $\begingroup$ Isn't there a minimal version of set theory which you can develope on it's own and then use for the semantics of first order logic (which which you build your strong ZF set theory only later)? - Also, but this isn't too relevant for the matter of the question: I don't need any talk about truth, or justification for the certain rules of logic. I have a pretty formalist point of view, to me it's moving symbols around $\endgroup$
    – Nikolaj-K
    Commented Mar 7, 2013 at 13:24
  • $\begingroup$ @NikolajK: I know this is a very old post, but I just noticed your comment here and you may be interested in math.stackexchange.com/a/1808558/21820. $\endgroup$
    – user21820
    Commented Sep 24, 2016 at 17:23
11
$\begingroup$

Properly, logic shouldn't concern itself with notions of set. We can use the notion in an almost metaphorical sense, such as saying that our (first-order) language consists of a set of symbols, and from these symbols we define the set of formulae. This is not really a problem, since the basic objects of logic are symbols that can be written down, and our set or symbols can be just a listing of these symbols (or a description of how to write them), and our set of formulae is just a method for analysing a written expression and determining whether or not it is a formula. As Henning Makholm mentions in his answer, when one moves from logic to model theory the distinction can become murkier.

In (axiomatic) set theory, we consider the notion of a "set" as a basic or undefined. This is for mainly pragmatic purposes: you simply cannot define everything in terms of more basic concepts, and it is hard to think of a concept that "set" could be reduced to. This is really no different than Euclid, who defined a point to be "that which has no part," but left as totally undefined what a part is, or how something could lack one. What the axioms of ZF(C) do tell us is the basic properties that we assume are true about sets, or at least those properties of set that seem to be true after some serious thinking about it. (This may seem a little flippant, but consider the paradoxes/antinomies from the late 19th and early 20th centuries. These caused mathematicians -- or set theorists -- to consider very closely what operations or set construction principles will not obviously lead to contradictions, and these discussions resulted in many of the axioms we have today.)

In terms of functions a predicates, you are correct that a set theorist will define a relation to be a set consisting of ordered pairs, where an ordered pair is a set of the form $\{ \{ a \} , \{ a , b \} \}$, but the set theorist is not actually making any metaphysical/ontological claim about the true nature of these concepts. Instead, the set theorist is translating these notions from ordinary mathematics into the language of sets. This is done because, in the formal theory, one cannot speak about objects that are not sets. If this translation could not be done, there would be no possibility of using set theory as a foundation of all of mathematics, and set theory would be a rather bland area of study. Of course, after we have set up this translation, we must prove that these defined concepts have the same properties as our "informal" conception of them. For instance, a set theorist will prove that $\{ \{ a \} , \{ a , b \} \} = \{ \{ c \} , \{ c , d \} \}$ iff $a=c$ and $b=d$. This being accomplished, we may define the ordered pair $(a,b)$ to be the set $\{ \{ a \} , \{ a , b \} \}$, because our intuitive notion of an ordered pair is of something that can distinguish the first coordinate from the second.

These "parameter dependent" properties you speak of are just formulae (possibly) with free variables. The manner in which we handle such expressions syntactically and semantically may lead one to think about them as functions or predicates -- and this is extremely useful in practice -- but their "true nature" is far from this.

$\endgroup$
5
$\begingroup$

The axioms of ZFC are stated in first order logic. A statement is in first order logic based on its form, not its content. The two undefined terms in set theory are "set" and $\epsilon$. The things we talk about are called "sets", whatever those might be, and the one undefined relation is the binary predicate $\epsilon$, whatever that might mean. The axioms implicitly determine how these notions interact. The axioms are designed to capture our intuitive notions of how sets act, but once formalized, we no longer have any control over the meaning.

This resembles, for instance, Euclid's axioms for geometry in which "line", "between", etc. are never explicity defined.

A formula $\psi(u_1, u_2, q, \ldots)$ with "free" variables may be true or false depending on its inputs. It defines a relation in general and may define a function. A function is required to have a unique "output" for each set of "inputs"; a relation has no such requirement.

$\endgroup$
2
$\begingroup$

There actually is a deeper aspect to the second problem, assuming you're talking not purely about the theory of ZFC, but also the theory of its models.

It turns out that certain notions in ZFC are "absolute": That is, if you have a countable transitive standard model $ M $ of ZFC, these properties are true when considered in the "outer" set universe V exactly when they are true in M. For "is an element of", this is by definition of "standard model". But there are less obvious properties that can be shown to be absolute as well, such as "is equal to $ \aleph_0 $". Of course, by far not all properties are absolute. "is equal to $ \aleph_1 $", for instance, most definitely is not absolute, which is shown by the mere existance of countable transitive standard models.

In aspects of model theory where one operates with "real" sets to construct models, one certainly must very carefully distinguish between properties of "sets" in the model and sets in the outer meta-theory.

For more on this see, for instance, Kunen's "Introduction to Independence Proofs" (Chapter 4 specifically) - but most introduction books on set theory will discuss this issue in some form.

$\endgroup$
0
1
$\begingroup$

Henning and Arthur gave good answers, but I feel that I need to add a point on your third question.

The reason we allow parameters is that we want a rich world of possible functions. Note that internally the collection of sets which are definable just by a formula with no parameters is quite dull. Sometimes we even have to use axioms to prove that a set defined in a certain way is non-empty.

Now consider the function which, when given $x$ returns a function $f_x\colon x\to P(x)$ defined as $$f_x(a)=\{a\}, \forall a\in x$$

It is quite clear how this function is defined from $x$, but if $x$ is not a definable set then we have to allow parameters in order to assert something on $f_x$ in a first order sentence.

In particular if we want to assert that given a function definable with parameters then if its domain is a set, then so is its range (that is the replacement axiom schema) we must allow parameters otherwise we cannot assert that $f_x$ is a function whose range must be a set.

This is the reason that the replacement axiom schema says that given a formula, if after setting certain parameters the formula is now functional then if the domain is taken to be a set, then the range is also a set.

$\endgroup$
3
  • $\begingroup$ I didn't see an answer by Arthur. Did it get deleted, or did Arthur change username? $\endgroup$
    – ziggurism
    Commented Nov 27, 2017 at 17:03
  • $\begingroup$ user642796 was formerly known as arjafi, and before that as Arthur Fischer. $\endgroup$
    – Asaf Karagila
    Commented Nov 27, 2017 at 17:09
  • $\begingroup$ thanks for clarifying $\endgroup$
    – ziggurism
    Commented Nov 27, 2017 at 17:34
0
$\begingroup$

Unique book, where I found formal definition of set is Nicolas Bourbaki Theory of sets, English translation 1968, on page 68. There "set" is based on concept of collectivizing relation, so initially are defined terms and relations as some type of signs (symbols) assemblies. it needs hard work to understand, but it's worth it.

$\endgroup$
7
  • $\begingroup$ Yeah, I remember 8 years ago (when I asked this question), I read into that book. Here's some of the funky symbols. $\endgroup$
    – Nikolaj-K
    Commented Jun 8, 2020 at 0:25
  • $\begingroup$ They are quantifiers, as I think, one of the most important symbols/concepts of mathematics. IMHO research in them will give new power and deepness to mathematics and I am continuously thinking about them and searching thinking alike people. $\endgroup$
    – zkutch
    Commented Jun 8, 2020 at 0:36
  • $\begingroup$ I agree, e.g. I like the constructive realizations Curry–Howard correspondence or their realization in categories. And there's also other quantifiers, which haven't found so much traction in math. $\endgroup$
    – Nikolaj-K
    Commented Jun 8, 2020 at 20:29
  • $\begingroup$ Thanks for links. Their are really good. If/when you'll have some others I'll be grateful for info. $\endgroup$
    – zkutch
    Commented Jun 8, 2020 at 21:59
  • $\begingroup$ Depends on what. $\endgroup$
    – Nikolaj-K
    Commented Jun 9, 2020 at 18:31

You must log in to answer this question.

Not the answer you're looking for? Browse other questions tagged .