13
$\begingroup$

Sorry for the provocative title. There's probably something wrong with my reasoning, and maybe someone will bother to point it out.

I did not understand Bell's Theorem, so I looked for easy explanations. I found one that looked very intuitive. They set up something like light polarization, but simpler. They explained that given "reasonable" assumptions, you must predict a linear change in correlation with the angle between detectors. The difference between 90 degrees and 91 degrees must be the same as the difference between 45 degrees and 46 degrees. But in fact it doesn't work that way, so the "reasonable" assumptions must be wrong.

picture

That made sense. Then I found a coherent explanation with equations.

This elegantly describes what properties a "reasonable" set of assumptions must have, and describes an experiment that no "reasonable" assumptions can give a correlation better than .75 for. The claim is that QM predicts more than .75, and experiment is consistent with QM and not with "reasonable" assumptions.

Background

We assume there is some system state $\lambda$. There are two detectors $x, y$ which have outcomes $a, b$. Each outcome can only be 0 or 1.

The experimenters measure $p(ab|xy)$. They don't know $\lambda$, but they have estimated probabilities for some states that $\lambda$ might be in, so what they're really measuring is

$\underset{\lambda}{\sum}{p(ab|xy\lambda) p(\lambda|xy)}$

Further assumptions:

  1. $\lambda$ is not correlated with $x$ or $y$.

$$p(\lambda|xy)=p(\lambda)$$

Given that,

$$p(ab|xy) = \underset{\lambda}{\sum}{p(ab|xy\lambda) p(\lambda)}$$

  1. $x, y$ and $\lambda$ are enough to completely determine $a, b$

$$p(ab|xy) \in \{0, 1\}$$

That results in

$$p(ab|xy) = \underset{\lambda}{\sum}{p(a|xy\lambda) p(b|xy\lambda) p(\lambda)}$$

  1. $x$ does not influence $b$ and $y$ does not influence $a$.

$$p(a|xy\lambda)=p(a|x\lambda)$$

$$p(b|xy\lambda)=p(b|y\lambda)$$

So

$$p(ab|xy) = \underset{\lambda}{\sum}{p(a|x\lambda) p(b|y\lambda) p(\lambda)}$$

And from there he goes on to demonstrate the thing that can't correlate better than .75. It's very clear, I highly recommend it. He expands on that in a couple of later posts.

My efforts

Given this understanding, I tried an example, polarized light. Given a photon with linear polarization angle 0 and a polarizer with polarization angle $\theta$, that has two outcomes 0 and 1, the probability that the photon has outcome 0 is $\cos^2{\theta}$. I ran various simulations using Glowscript, a simple graphics programming system that often lets you see your mistakes in the way the pictures come out.

For every x and y between 0 and $2\pi$, I integrated over $\theta$ to get the probability that the outcomes a and b were the same.

If a and b were independent, then

$$p(a|xy\theta)=cos^2(x-\theta)$$

$$p(b|xy\theta)=cos^2(y-\theta)$$

$$p(ab|xy\theta)=cos^2(x-\theta)cos^2(y-\theta)$$

But for entangled photons, a and b are not independent. The closer x and y are to each other, the more likely that a and b are the same, regardless of $\theta$. (Oops! For experimental entangled photons, they are opposite and not the same. I'm sure the results are equivalent, though.) The formula is

$$p(ab|xy\theta)=p(a|x\theta)p(b|axy\theta)=cos^2(x-\theta)cos^2(x-y)$$

Graphed out, this looks like

3D graph

(I went from 0 to $2\pi$ in each direction when it would have been enough to go from 0 to $\pi/2$, just in case my thinking was wrong and it would have made a difference.)

The correlation is the same whenever x-y is the same. This is just like the simple graph above except that one of the simplifications was to put the zeroes at 90 and 270 degrees, instead of at 45, 135, 225, and 315 degrees etc.

Again, the reason we get this result is that

$$p(ab|xy\theta)=p(a|x\theta)p(b|axy\theta)\neq p(a|x\lambda) p(b|y\lambda)$$

Photon polarization does not meet the criteria for Bell's inequality, which of course it must not or it could not be what it is. It looks to me like the assumptions behind Bell's inequality require that we must assume that the photons are not entangled. Assuming they are not entangled, of course we cannot get the same correlations we get when they are entangled.

Given that this is what it takes to get photon polarization to work, I didn't see why a deterministic system must fail. I quick found something that looked like it would work.

Each photon has three hidden variables. One is a polarization angle $\theta$ between 0 and 2 $\pi$.

Then there are two hidden variables $\rho$ and r between 0 and 1.

Same-entangled photons have all three of these variables the same.

To find whether a photon passes a filter with angle $\lambda$, calculate $\cos(\lambda-\theta)$. If $\rho, r$ are inside the square defined by (0,0) $(\cos(\lambda-\theta),\cos(\lambda-\theta))$ then it passes the filter.

Then set $\theta$ to $\lambda$. Multiply $\rho$ and r by 7 and discard any part of the results that are bigger than 1, to get their new values.

Question:

Since Bell's Theorem is correct, where did I go wrong?

$\endgroup$
4
  • $\begingroup$ Comments are not for extended discussion; this conversation has been moved to chat. $\endgroup$ Commented Sep 10, 2022 at 0:34
  • $\begingroup$ Please do not edit a question by including responses to specific answers. Edits are for clarifying the question, if you have a new (follow-up) question, please just ask a new question. If you want to respond to or criticize a specific answer, that's what comments are for. $\endgroup$
    – ACuriousMind
    Commented Sep 12, 2022 at 13:23
  • $\begingroup$ This question has changed significantly, I do not know if the users below will take enough effort to update their answers $\endgroup$
    – Mauricio
    Commented Sep 17, 2022 at 8:08
  • $\begingroup$ Agreed! What would you suggest I do? I considered offering a bounty, but thought I should wait at least a few days to make sure I don't get any responses without it. $\endgroup$
    – J Thomas
    Commented Sep 17, 2022 at 8:30

5 Answers 5

18
$\begingroup$

I'm really not a fan of the blog post you link, both due to the notation and to the specific focus on the somewhat opaque experiment described at the end. So I'll diverge from it to address your question. I'll start with some preliminary definitions and the statement/proof of the theorem. If you are already familiar with this, you can skip to the end of the answer.

Consider two detectors $X$ and $Y$ which return measurement values $x$ and $y$, respectively, where $x$ and $y$ may take the values $\pm 1$. Each detector has some additional settings $\alpha_X$ and $\alpha_Y$ which may be chosen freely and independently by the graduate students operating each detector. Finally, there is an additional "hidden variable" $\lambda$ which underlies the experiment and which may influence the experimental outcomes, but which is not influenced by the settings $\alpha_X$ and $\alpha_Y$ of the two detectors.

Under these conditions, we may define the correlation between the measurement results - which depend on the settings $\alpha_X$ and $\alpha_Y$ - as follows: $$C(\alpha_X,\alpha_Y) := \int \mathbb E_{\alpha_X,\alpha_Y}\big[xy|\lambda\big] \ p(\lambda) \mathrm d\lambda$$ where $$\mathbb E_{\alpha_X,\alpha_Y}[xy|\lambda]:= \sum_{x,y\in \{\pm 1\}} xy \ P^{XY}_{\alpha_X,\alpha_Y}(x,y|\lambda)$$ is the expected value of the product $xy$ given the settings $\alpha_X$ and $\alpha_Y$ and the hidden variable $\lambda$, $p(\lambda)$ is the PDF for the hidden variable, and $P^{XY}_{\alpha_X,\alpha_Y}$ is the joint PDF for the two experimental outcomes.

If we assume that the detector settings may be chosen independently in such a way that the measurement result at $X$ does not depend on the result at $Y$, then the joint PDF factorizes into $$P^{XY}_{\alpha_X,\alpha_Y}(x,y|\lambda) = P^X_{\alpha_X}(x|\lambda) P^Y_{\alpha_Y}(y|\lambda)$$ and so $$\mathbb E_{\alpha_X,\alpha_Y}\big[xy|\lambda\big] = \mathbb E_{\alpha_X}\big[x|\lambda\big] \mathbb E_{\alpha_Y}\big[y|\lambda\big]$$

If this is the case, then the CHSH inequality - the most common of a category of expressions which fall under the heading of "Bell inequalities" - says the following:

$$|C(a,b)-C(a,c)| + |C(d,b)+C(d,c)|\leq 2$$

for any choices of detector settings $a,b,c,d$.


Proof: The proof is very straightforward. Note that $$|C(a,b)-C(a,c)| = \left|\int \mathrm d\lambda\ p(\lambda) \mathbb E_a\big[x|\lambda\big] \big(\mathbb E_b\big[y|\lambda\big]-\mathbb E_c\big[y|\lambda\big]\big)\right|$$ $$\leq \int \mathrm d\lambda\ p(\lambda)\bigg|\mathbb E_b\big[y|\lambda\big]-\mathbb E_c\big[y|\lambda\big]\bigg|$$ Similarly, $$|C(d,b)+C(d,c)| \leq \int \mathrm d\lambda \ p(\lambda)\bigg|\mathbb E_b\big[y|\lambda\big] + \mathbb E_c\big[y|\lambda\big]\bigg|$$ It is an easily provable fact about real numbers $p,q\in[-1,1]$ that $|p-q|+|p+q|\leq 2$; the CHSH inequality follows immediately.


With those preliminaries out of the way, we can address your questions.

$\lambda$ is not correlated with $x$ or $y$. $p(\lambda|xy)=p(\lambda)$.

This notation suggests that $x$ and $y$ are random variables and that we're thinking about conditional probabilities. This is not the case - $x$ and $y$ (which I call $\alpha_X$ and $\alpha_Y$ in my notation) are the settings on the detectors, and are chosen freely and independently by our faithful graduate students. The assumption is that the PDF for our hidden variable $\lambda$ is not a function of what we choose for these settings.

As an example, we might imagine that the hidden variable turns out to be the temperature of the laboratory. In this context, the assumption is that the settings we choose for the dials on the experimental apparatuses do not influence the laboratory temperature.

After detector $x$ finds outcome $a$, now $\lambda$ is no longer independent. Different values for $\lambda$ would give different values for $a$, or else $\lambda$ wouldn't matter for predicting $a$. So if we know $x, a$ that gives us a clue about $\lambda$.

The fact that $p(\lambda)$ is not a function of $\alpha_X$ or $\alpha_Y$ does not imply that a given measurement gives us no clues about the possible values $\lambda$ might take. In particular, we are not claiming that the hidden variable $\lambda$ and the outcome of one of the detector measurements $x$ are uncorrelated for every choice of detector setting $\alpha_X$ - if we were, then Bayes' rule would tell us that $P^X_{\alpha_X}(x|\lambda) = P^X_{\alpha_X}(x)$, rendering the hidden variable pointless.

This is a weakness of the linked blog post, in my view. The word correlation implies (to me) that we're talking about two random variables. Instead, as mentioned above, our claim is that the PDF for $\lambda$ is not a function of the parameters $\alpha_X$ and $\alpha_Y$. In the context of our temperature example, the claim would be that the temperature of the lab is not influenced by the settings on the knobs of our detectors.

Is this how QM gets its correlation, or is it some other way that I have not noticed?

The general take-away from Bell's theorem is not that there is a limit on the correlation between our measurement results for any one choice of detector settings. Instead, the point is that if we are allowed to vary the detector settings locally and independently, then there is a limit on how the corresponding correlations may vary.

In other words, a violation of a Bell inequality means that our graduate students freely and independently changed their detector settings and that the subsequent correlations can not be accounted for by the assumption that a local hidden variable was determining their results.


I'll conclude by addressing your photon example. If we send an entangled pair of photons described by a spin-singlet state to detectors outfitted with polarizers whose orientation is defined by unit vectors $\vec a$ and $\vec b$ (these unit vectors are the "detector settings" described above) and whose outcomes are $+1$ for a detected photon and $-1$ for an undetected photon, then the correlation between our two detectors is given by $C(\vec a,\vec b)=-\vec a \cdot \vec b$ as per quantum mechanics. Plugging this into the CHSH inequality yields

$$|\vec a\cdot(\vec b-\vec c)| + |\vec d\cdot (\vec b+\vec c)| \leq 2$$

However, if we let $\vec a = \hat z$ and $\vec d = \hat x$ (so our first graduate student may turn her detector by $90^\circ$ at her leisure) and then let $\vec b = (\cos(\phi)\hat x+\sin(\phi)\hat z)/\sqrt{2}$ and $\vec c = (\sin(\phi)\hat x-\cos(\phi)\hat z)/\sqrt{2}$ (so our second graduate student may turn his detector by $90^\circ$ as well, but his orientations are offset from hers), then we find that the CHSH inequality becomes

$$2[\cos(\phi)+\sin(\phi)] \leq 2$$ which is clearly violated for $\phi \in (0,\pi/2)$.

As we see, the problem is not that the correlation between detectors is too high for any particular choice of orientations (indeed for appropriately chosen $\vec a$ and $\vec b$, we can clearly arrange for $C(\vec a,\vec b)=-\vec a \cdot \vec b$ to take any value in $[-1,1]$), but rather that upon independent variation of these orientations, we find correlations which violate the CHSH inequality and therefore are not consistent with our assumptions regarding local hidden variables.

$\endgroup$
6
  • $\begingroup$ Comments are not for extended discussion; this conversation has been moved to chat. $\endgroup$
    – ACuriousMind
    Commented Sep 12, 2022 at 13:24
  • $\begingroup$ If I understood correctly, violation of the CHSH inequality for configurations $a,b,c,d$ tells us no $\lambda$ satisfying our assumptions exists for both joint configurations $(a,b)$ and $(c,d)$. But if there are some other configurations $e,f$ so that the CHSH inequality does hold for $a,b,e,f$, it is possible a hidden variable satisfying our assumptions exists for both $(a,b)$ and $(e,f)$. Is this correct? $\endgroup$
    – jet457
    Commented Jul 29, 2023 at 17:37
  • $\begingroup$ @jet457 The CHSH inequality is a statement about the correlations which could exist over the space of all possible detector settings. In other words, if there exist detector configurations which violate the CHSH inequality, then the physics underlying the experiment cannot be explained by the presence of a local hidden variable. The fact that the CHSH may hold for some particular $(a,b,e,f)$ is irrelevant - the question is whether there are any configurations $(a,b,c,d)$ which violate it. $\endgroup$
    – J. Murray
    Commented Jul 29, 2023 at 21:18
  • $\begingroup$ @jet457 Put differently, you're asking whether it's possible for a hidden variable explanation to work for particular settings on the detectors, which doesn't really make sense. Either the hidden variable explanation can explain the results of the experiment or it cannot; if there are any detector settings such that our results violate the CHSH inequality, then we have the latter. $\endgroup$
    – J. Murray
    Commented Jul 29, 2023 at 21:19
  • $\begingroup$ I see, I think I was confusing what $\lambda$ is mathematically, and what it is physically (a hidden variable). I can find for some configurations a random variable satisfying the assumptions (e.g. for detector settings that always measure 1, any variable will do). But this is not what "hidden variable" means; a "hidden variable" is a random variable satisfying our assumptions for all detector settings. I come from a math background, and I think my lack of familiarity with physics is beginning to show 🙂 $\endgroup$
    – jet457
    Commented Jul 29, 2023 at 22:54
8
$\begingroup$

Bell's Theorem is fine mathematics, but the universe is under no obligation to agree with it. Mathematics is not physics.

The idea that we use to capture the actual phenomenon is that once you have measured one photon in the pair, you've forced the other photon's state to conform to the measurement. But it doesn't possess that state before your first measurement: if it did, Bell's Theorem would apply.

$\endgroup$
1
  • 17
    $\begingroup$ The universe is under no obligation to agree with physics either. Rather, it is up to physicists to ensure that physics agrees with the universe. ;) $\endgroup$
    – Galen
    Commented Sep 9, 2022 at 2:28
4
$\begingroup$

In the quantum mechanical case $\lambda$ is fixed; in particular it is not correlated with $a,b,x,y$. It is simply the quantum state $\psi$.

In the simplest version of Bell's theorem, the assumption that quantum mechanics violates is that of determinism, i.e. that $p(ab|xy) \in \{0,1\}$. This is already enough to demonstrate you won't be able to reproduce the quantum correlations with a deterministic model that is local, that is, such that $p(a|xy\lambda) = p(a|x\lambda)$ and $p(b|xy\lambda) = p(b|y\lambda)$.

One might think, though, that one can nevertheless reproduce the quantum correlations just by giving up on determinism, and still maintaining some sort of locality. That's why the nonlocal version of Bell's theorem is important: it shows that it is also impossible.

More specifically, it is not possible to violate a Bell inequality if $p(a|bxy\lambda) = p(a|x\lambda)$ and $p(b|axy\lambda) = p(b|y\lambda)$. That's the local causality assumption.

The interesting thing is that since we're not assuming determinism, this also applies to quantum mechanics, which implies that quantum mechanics must violate local causality in order to violate a Bell inequality, and indeed it does.

$\endgroup$
5
  • 1
    $\begingroup$ What is the nonlocal version of Bell's theorem? $\endgroup$
    – Plop
    Commented Sep 9, 2022 at 16:02
  • $\begingroup$ It is the version Bell proved in his 1975 paper "The theory of local beables". I've written about it here: mateusaraujo.info/2016/07/21/… $\endgroup$ Commented Sep 10, 2022 at 18:33
  • $\begingroup$ I may have misunderstood something here, and I want to find out. Theta (I'll call it t) is fixed but unknown. If we know x and a, that tells us something about t. If t is a polarization state between 0 and 2 pi, and a is 1, then the difference between t and x is more likely to be between -pi/4 and p/i4, than between pi/4 and 3pi/4. Before we didn't know that. Now we know more about theta than we used to, and our old p(b|yt) is out of date. Is that how QM gets its higher correlation, or is it because of something else? $\endgroup$
    – J Thomas
    Commented Sep 12, 2022 at 5:00
  • $\begingroup$ No, that's not it. $\theta$ is assumed to be completely known, so if the probability of $b$ were determined only by $\theta$ and $y$, then $p(b|y\theta)$ wouldn't change at all, no matter what you do at Alice's site. The point is that the probability of $b$ does depend on what Alice is doing, to get the real probability you need to compute $p(b|axy\theta)$, which is not equal to $p(b|y\theta)$. This is what we mean when we say that quantum mechanics violates local causality. $\endgroup$ Commented Sep 13, 2022 at 6:47
  • $\begingroup$ Note that this doesn't work if you have a single photon polarization, as in your example, you do need an entangled state of two photons. $\endgroup$ Commented Sep 13, 2022 at 6:48
2
$\begingroup$

People tend to overcomplicate this. When we have two entangled polarised photons and pass them through polarising analysers (A and B) the correlation probability is given by $$p = \cos^2(\theta_A - \theta_B),$$

where $\theta_A$ and $\theta_B$ are the angles of the respective polarising analysers. If at the time of emission we secretly encode the position of polariser A in photon B as a hidden variable we could obtain the correct correlation, but the problem is that the angle of analyser A can be changed mid flight and somehow the hidden variable has to be updated to this new position to get the correct result. The problem for local hidden variable theories, is that if the polarising filters are spaced far apart and the photons arrive at their respective polarisers near enough simultaneously, there is not enough time for the position of polariser A to be communicated to photon B, if the communication is limited to the speed of light. It is not possible for photon B to know the new position of polariser A using only local information, so the local hidden variable theory fails and can not match actual measurements made in labs or the predictions of quantum theory.

A super-deterministic local hidden variable theory can replicate the results of quantum theory if by "super-deterministic" we mean the future is absolutely pre-determined and can be predicted with 100% accuracy. In this model if Anne is operating the A analyser, Anne has no free will and the position she will turn the analyser to, is inevitable and predictable. We could then encode photon B with information of the predicted position of analyser A at the future time when photon B arrives at analyser B.

$\endgroup$
3
  • $\begingroup$ @Maurico. I think you are right. I will edit that. $\endgroup$
    – KDP
    Commented 2 days ago
  • $\begingroup$ Nice answer! (+1) There was a comment by Tony Leggett about how pretty much the only remaining no-go zone in physical theories now is that the future can affect the past. youtube.com/watch?v=1JrqtZxmT40 (~minute 35`00) eg. we can solve the Schrodinger/Dirac equation as an analytic function involving space and time. GR solutions are analytic in 4 space-time dimensions, etc. How is this viewpoint the same/different as superdeterminism do you think? $\endgroup$
    – James
    Commented 2 days ago
  • 1
    $\begingroup$ @James retrocausality is indeed another solution to the EPR problem but I would not call retrocausality superdeterministic or local. $\endgroup$
    – Mauricio
    Commented 2 days ago
-5
$\begingroup$

The problem I have with Bell's Theorem is that its trying to use three vectors (lambda, x, and y here) to expand a two dimensional spin space. Any 3 vectors in that space will be linearly dependent, invalidating the independence assumption he's making and thus any steps afterwards.

Factoring out that relationship is the linear algebra equivalent of dividing by zero. After a move like that, you can "prove" that 2 = 1 a clear inconsistency. That's where I suspect the issue with his inequality comes from, and why both QM & experiment are correct while Bell's Theorem isn't.

$\endgroup$
15
  • $\begingroup$ Thank you for saying that so clearly! That's basicly the problem I ran into. If you assume determinism and that proves that you cannot satisfy the requirements for Bell's inequality, then Bell's inequality does not apply to you. But it doesn't seem plausible that large numbers of physicists would make such a simple mistake, so maybe someone will explain why this reasoning is wrong. $\endgroup$
    – J Thomas
    Commented Sep 13, 2022 at 5:55
  • $\begingroup$ This is complete nonsense. $x$ and $y$ are bits, not vectors. Bell's theorem is not "trying to expand a [vector] space". A system of two spin-half particles has 15 dimensions in any case. And the independence assumption in Bell's theorem is statistical independence, it is unrelated to linear independence in linear algebra. $\endgroup$ Commented Sep 13, 2022 at 6:54
  • $\begingroup$ I'm unaware of any positive experimental results supporting Bell's theorem, only negatives. I'm more willing to believe that a few theoretical physicists like Bell made a mistake that experimentalists were unable to verify, than I am to believe something extreme about the universe - like violations of determinism - that aren't experimentally supported by any evidence. @MateusAraújo the flaw in your post is where you assume 𝜆 exists and is statistically independent of x, y, a, b. Such 𝜆 don't always exist. $\endgroup$ Commented Sep 13, 2022 at 20:37
  • $\begingroup$ What would you consider a positive experimental result supporting Bell's theorem? $\endgroup$ Commented Sep 14, 2022 at 7:20
  • 1
    $\begingroup$ @StephenFedele I think you've significantly off base in terms of what you think Bell's Theorem is, and what evidence exists that supports it, and what that means. $\endgroup$
    – TKoL
    Commented Oct 14, 2022 at 11:42

Not the answer you're looking for? Browse other questions tagged or ask your own question.