Am I doing a forcing argument here?

Question

I have an argument of the following form:

Executive Summary:

We have a $\mathbb R$-valued function $L$ which we want to show is $\mathbb Z$-valued. We approximate it by $\mathbb Q$-valued functions $\lim_{n \to \infty} L_n = L$. Magically, when we take this limit as an ultralimit in a bigger field $\mathcal R = \prod_{\mathcal F} \mathbb R$, we still get $L$, which tells us that $L$ is $\mathbb Q$-valued. We have control over the denominators to see that $L$ is $\mathbb Z$-valued.

EDIT: Here is a pdf (2 pages) with the complete argument.

Question: Can such an argument be conceptually streamlined by more systematically employing tools / language from model theory / nonstandard analysis / forcing — or maybe just from number theory?

Here is a more detailed summary:

Theorem Sketch: Let $C$ be a set equipped with two distinguished elements $0,1$ and a ternary relation $\Delta \subseteq C^3$ satisfying certain axioms [1]. Let $L : C \to \mathbb R_{\geq 0}$ be a “gapped norm” — a function satisfying certain inequalities [2]. Then $L$ is integer-valued.

Proof Sketch:

Step 0: Choose a prime $p$.

Step 1: Construct a sequence of functions $\lceil L \rceil =L_0 \geq L_1 \geq \dotsb \geq L : C \to \mathbb N[1/p]$ which decreases pointwise from the least integer $\geq L$ down to $L$ itself. The construction is such that every “norm” inequality is satisfied by $L_n$ for all but finitely many $n$ [4], along with some other “forcing” conditions which will be useful in Step 3 [5].

Step 2: Choose a nonprincipal ultrafilter $\mathcal F$ on the finite-powerset-lattice of the set of “norm” inequalities. Let $L_\infty : C \to \mathcal R := \prod_\mathcal{F} \mathbb R$ be the “ultralimit” of the $L_n$’s, but with values in the ultrapower field $\mathcal R$ (i.e. $L_\infty(C)(n) := L_n(C)$).

Step 3: Observe that by construction, we have $L_\infty \geq L$. Argue inductively [6] using the additional “forcing” conditions from Step 1 that the reverse inequality also holds, so that $L = L_\infty$.

Step 4: Since the equation $L = L_\infty$ is independent of the choice of $\mathcal F$ from Step 2, conclude that for each $X \in \mathcal C$, the sequence $n \mapsto L_n(X)$ is eventually constant, so that $L_\infty(X) = L(X) \in \mathbb N[1/p]$.

Step 5: Since the prime $p$ from Step 0 was arbitrary, we have $L(X) \in \mathbb N$ for all $X \in \mathcal C$ as desired.

Notes:

I think this form of argument must be a “standard forcing argument” in some sense. My guess, in fact, would be that there are some fields of logic where arguments of exactly the above form are considered “routine”, as well as other fields of logic where one might not routinely make arguments of exactly the above form but nevertheless the pieces are all quite familiar and the whole thing together looks quite natural.
By contrast, in algebraic topology (where I intend to make this argument) these tools are all quite unfamiliar, so that I think I am bumbling around and not putting things very elegantly.
I also suspect that a typical number theorist would find it very straightforward to make an argument equivalent to the above one, except that they would somehow never mention ultrafilters. I would also be keen to hear about an ultrafilter-free way to think about an argument like this one.
I’m also reminded of the fact that Cohen was an analyst, and some (to-me-cryptic) quote of his indicating that when he originally introduced forcing, he was “just” adapting some sort of (well-known?) analytic techniques to logic. I say this because the above argument only occurred to me after a very long period of becoming well-acquainted with the “minimization” procedure in footnote [3] below, which I thought of as very much an “analytic” phenomenon.

Footnotes:

[1]: The axioms say that $C$ is the object set of a triangulated category with $0$ the zero object and $\Delta$ the set of triples $X,Y,Z$ such that there exists a triangle $X \to Y \to Z$. Moreover, we impose the “monogencity” requirement that $C$ be generated by $1$ in the sense that the smallest triangulated subcategory of $C$ containing $1$ is $C$ itself.

[2]: The “norm” axioms say that $L(0) = 0$, $L(1) \leq 1$, and whenever $(X,Y,Z) \in \Delta$ we have $L(Y) \leq L(X) + L(Z)$. The norm is “gapped” if $L(1) = 1$ and $L(X) \leq 1 \implies L(X) = 0 \vee L(X) = 1$. I think of this condition as a sort of “seed” of integrality which the theorem “spreads” to the rest of $C$ like a crystalizing liquid :) Technically, we also require that a “gapped” norm satisfy $L(X \oplus Y) = L(X) + L(Y)$ and $L(X[1]) = L(X)$, which I suppose can’t quite be stated without referring to the triangulated structure in footnote [1], but I don’t think that’s important to the outline of the argument.

[3]: It seems to be relevant that these “norm inequalities” are set up in such a way that if $f : C \to \mathbb R_{\geq 0}$ is an arbitrary function, then there is a unique maximal $\mathbb R_{\geq 0}$-valued norm $L \leq f$, which can be obtained from $f$ by a straightforward “greedy minimization” procedure — if some “norm” inequality is violated, then just decrease the relevant value of $f$ to “force” it to be satisfied. Rinse and repeat. At any rate, the fact that this is so allows us to conclude already in step 2 that the “standard part” of $L_\infty(X)$ is exactly $L(X)$.

[4]: Moreover, any $\mathbb R_{\geq 0}$-valued function $\leq \lceil L \rceil$ which satisfies all of the “norm” inequalities must equal $L$. In practice the construction proceeds by listing out all of the “norm” inequalities which must be satisfied, and then guaranteeing that $L_n$ satisfies the first $n$ of them. But I don’t think this point is crucial.

[5]: Although these additional conditions are in some sense the crux of the proof, I think for the current question their precise form is not so important. In case I’m wrong about this: they say that if $(X,Y,Z) \in \Delta$, and if $L(X),L(Z) < L(Y)$, then for all but finitely many $n$ we have $L_n(X) - L_n(Y) + L_n(Z) \geq L(X) - L(Y) + L(Z)$.

[6]: The induction is over the “number of cells” in $X \in \mathcal C$ (which makes sense due to the “monogenicity” requirement in footnote [1]), which feels somehow akin to inducting over the “length of the name” of $X \in \mathcal C$….

Am I correct in thinking that if in [3] you happened to stumble on an $f$ that was somehow "eventually optimal" then everything would work out using such an $f$? (But there may be no reason to believe such an $f$ actually exists and, in fact, it might even provably not exist.) — François G. Dorais, Commented Jun 27 at 23:56
@FrançoisG.Dorais I’m not quite sure I understand what you mean by “eventually optimal”, but maybe it’s related to the following: since I assume at the beginning that $L(X \oplus Y) = L(X) + L(Y)$, I have that $L(X) = \lim_{n \to \infty} \frac 1 n \lceil L(X^{\oplus n})\rceil$. This is a kind of “eventual correctness” of the function $f = \lceil L \rceil = L_0$, sufficient for the whole argument. Perhaps also relevant: there is an “a priori estimate” which says that for any norm $L$, $L(X)$ is bounded above by the “number of cells” in $X$…. — Tim Campion, Commented Jun 28 at 0:16
… So ultralimits of norms can be taken freely— they never escape to $\infty$. Also: if $f$ is integer-valued, the “minimization procedure” in [3] preserves this integrality at each step, so by well-foundedness of $\mathbb N$ it eventually stabilizes for each $X \in C$. So without ultrafilters, we see that there is a maximal norm, which is integral. The more elaborate argument seems necessary to show that every gapped norm is integral. — Tim Campion, Commented Jun 28 at 0:16
It seems to hold "by construction" which suggests there should be a direct proof (to make the construction work) but I don't see one. — Will Sawin, Commented Jun 28 at 19:17
@WillSawin you may be right, I will have to think about this. And a reasonable resolution to the conundrum of “this argument seems mysterious” would be “this argument has a mistake”… — Tim Campion, Commented Jun 29 at 3:10

Will Sawin · Accepted Answer · 2024-06-29 16:47:24Z

This is not quite an answer to the question you asked, but:

The argument, and any similar argument, can't work because the theorem to be proven is false. Let me quote the result:

Let $\mathcal C$ be a triangulated category and $\Delta \in (\operatorname{Ob} \mathcal C)^3$ be the set of objects in a triple. Let $1\in \mathcal C$ be a distinguished object. Assume that $\mathcal C$ is monogenic in the sense that the smallest triangulated category of $\mathcal C$ containing 1 is $\mathcal C$ itself.

A function $L\colon \operatorname{Ob} \mathcal C\to \mathbb R^{\geq 0}$ is said to be a norm if $L(0)=0$, $L(1)\leq 1$, and for every triangle $X \to Y \to Z$ we have $L(Y) \leq L(X) + L(Z)$.

A norm is said to be gapped if there are no $X \in \mathcal C$ with $0< L(X) <1$, and moreover $L(X \oplus Y) = L(X)+ L(Y) $ and $L(X) =L(X[1])$ for all $X, Y \in \mathcal C$.

I will assume for simplicity that $\mathcal C$ has (countably many objects and) countably many morphisms, although this doesn't seem to be essential.

Theorem: Let $\mathcal C$ be a triangulated category and $1\in \mathcal C$ a distinguished object. Let $L\colon \operatorname{Ob} \mathcal C\to \mathbb R^{\geq 0}$ be a gapped norm. Then $L$ is integer-valued.

I will now give a counterexample. I will take $\mathcal C = D^b_{fg} ( \mathbb Q[x]/(x^2))$, the subcategory of the derived category of $\mathbb Q[x]/(x^2)$-modules consisting of bounded complexes with finitely generated cohomology. Each object is represented by a finite complex of finitely generated modules, so there are indeed countably many objects and morphisms. We take our distinguished object to be $\mathbb Q$, which generates $\mathbb Q[x]/x^2$ via the distinguished triangle arising from the short exact sequence $0 \to \mathbb Q \to Q[x]/x^2 \to \mathbb Q \to 0$ and therefore generates the whole category.

We fix $\lambda \in (0,1)$ and set $$L(M) = \sum_{i\in \mathbb Z} (\lambda \dim H^i(M) + (1-\lambda) \dim H^i(M)/xH^i(M) )).$$

To motivate this formula, note that because every $\mathbb Q[x]/x^2$-module is a sum of copies of $\mathbb Q$ and $\mathbb Q[x]/x^2$, so a gapped norm which, like this one, depends only on the cohomology of the module is determined via the direct sum and shift conditions by its value on $\mathbb Q$ and $\mathbb Q[x]/x^2$. The value on $\mathbb Q$ must be $1$ by definition so the value on $\mathbb Q[x]/x^2$ must be $1+\lambda$ for some $\lambda\in [0,1]$. The formula for the norm one obtains from this perspective is the above one.

Let's check that $L$ is actually a gapped norm. We have $L(0)=0$ and $L(1)=\delta+(1-\delta)=1$ by definition. For a triangle $X \to Y \to Z$, the exact sequence $H^i(X) \to H^i(Y) \to H^i(Z)$ gives $\dim H^i(Y) \leq \dim H^i(X) + \dim H^i(Z)$. The sequence $H^i(X)/x H^i(X) \to H^i(Y)/ x H^i(Y) \to H^i(Z)/ x H^i(Z)$ is not necessarily exact, but the kernel mod the image consists of elements of $H^i(Y)$ whose image in $H^i(Z)$ is divisible by $x$, and then dividing by $x$ gives an element of $H^i(Z)/ ( H^i(Z)[x], H^i(Y))$ and the natural map from the kernel mod the image to $H^i(Z)/ ( H^i(Z)[x], H^i(Y))$ is injective, but $H^i(Z)/ ( H^i(Z)[x], H^i(Y))$ is a quotient of $H^i(Z)/ (x H^i(Z), H^i(Y)) = (H^i(Z)/ x H^i(Z))/ (H^i(Y)/xH^i(Y))$ so we have $\dim H^i(Y) \leq \dim H^i(X) + \dim H^i(Z)$ anyways.

Everything else is easier to check. It is gapped since if $X$ is nontrivial then both dimensions are at least $1$. $L(X \oplus Y) =L(X) + L(Y)$ and $L(X) = L(X[1])$ hold by definition.

However $L( \mathbb Q[x]/x^2) = 2\lambda + (1-\lambda)=1+\lambda \notin\mathbb Z$, contradicting the theorem statement.

The problem with the proof is in step 1, where a construction is supposed to satisfy a list of conditions but not all the conditions are completely checked, and in fact one can observe in this case that (unless $\lambda \in \mathbb Z[1/p]$) one of the conditions will fail.

Thanks! I wish I’d been self-critical enough to see this myself. — Tim Campion, Commented Jun 29 at 19:21

3 revs · Accepted Answer · 2024-06-30 07:36:33Z

It looks like the "logic aspects" of the argument boil down to using compactness. [However, this appears to be moot since the argument may have flaws pointed out by Will Sawin.]

First, given a bound $f:C\to[0,\infty)$. The set $\mathcal{L}_f$ of all norms $L \leq f$ is a closed (and hence compact) subset of $\prod_{X \in C} [0,f(X)]$ because each inequality is a closed requirement.

Note that if $L_1, L_2 \in \mathcal{L}_f$ then the pointwise maximum $L(X) = \max(L_1(X),L_2(X))$ is also a norm in $\mathcal{L}_f$. Therefore, unless empty, $\mathcal{L}_f$ has a maximal element.

Given a gapped norm $L$ (including the requirement $L(X \oplus Y) = L(X) + L(Y)$. Set $f_n(X) = \frac{1}{n}\lceil L(X^{\oplus n}) \rceil$. Since $L \leq f_n$, $\mathcal{L}_{f_n}$ is nonempty, so it contains a maximal element $L_n$. Note that $\lim_{n\to\infty} L_n = L$.

Claim 1: $L_n$ takes values in $\frac{1}{n}\mathbb{N}$.

Proof. Given a finite list of distinguished triangles $(A_i,B_i,C_i)$, $i < \ell$. Construct a sequence of functions $h_0 \geq h_1 \geq \cdots$ as follows: Start with $h_0 = f_n$. Once $h_k$ is defined, set $h_{k+1}(B_i) = \min(h_k(B_i),h_k(A_i) + h_k(C_i))$.

Each $h_k$ takes values in $\frac{1}{n}\mathbb{N}$. Since $\frac{1}{n}\mathbb{N}$ is discrete, the sequence must stabilize to a limit $h$. Moreover, $L_n \leq h$.

By compactness, there is an accumulation point to all such $h$'s along the net of all finite lists of distinguished triangles. That accumulation point must be a norm and therefore it must be exactly $L_n$, by maximality.

Claim 2: Given $X \in C$ and any prime $p$, there is a subsequence of $L_{p^i}$ along which $L_{p^i}(X)$ is constant.

[I don't follow all the details here but this appears to be shown in the proof outline. Of course, this claim is obvious from the conclusion since $L_n$ must in fact be a constant sequence.]

From claim 2, we see that $L(X) \in \mathbb{N}[\frac{1}{p}]$. Since $p$ is arbitrary, it follows that $L(X) \in \mathbb{N}$.

Hmmm… yes, maybe I should rework things to emphasize this — we pretty immediately have all the properties we need to ensure that $L = L_\infty$ except possibly for the additivity property $L_\infty(X \oplus Y) \overset ? = L_\infty(X) + L_\infty(Y)$. In some sense forcing this to also hold gets us to $L = L_\infty$… BTW I added to the question a link to a pdf with the complete argument — Tim Campion, Commented Jun 28 at 23:08
If my memory serves me right, there's a fun little way of presenting forcing, from a model theoretic point of view (i.e., without worrying out well-founded models and adding ordinals) where we use compactness and type omitting to produce generic extensions. And in a way, if you study the way AC is involved in compactness, you see that compactness, to an extent, is not too far from forcing-style arguments. — Asaf Karagila, Commented Jun 29 at 1:08
Thanks so much! Sorry, I feel like I’ve sent you on a wild goose chase trying to interpret an argument which was simply wrong. — Tim Campion, Commented Jun 29 at 19:24

Stack Exchange Network

Am I doing a forcing argument here?

2 Answers 2

Not the answer you're looking for? Browse other questions tagged
nt.number-theory
lo.logic
forcing
nonstandard-analysis
or ask your own question.

Am I doing a forcing argument here?

2 Answers 2

Not the answer you're looking for? Browse other questions tagged nt.number-theorylo.logicforcingnonstandard-analysis or ask your own question.

Related

Not the answer you're looking for? Browse other questions tagged
nt.number-theory
lo.logic
forcing
nonstandard-analysis
or ask your own question.