Tsirelson’s Bound Prohibits Communication through a Disconnected Channel

Carmi, Avishy; Moskovich, Daniel

doi:10.3390/e20030151

Open AccessArticle

Tsirelson’s Bound Prohibits Communication through a Disconnected Channel

by

Avishy Carmi

^1,2,* and

Daniel Moskovich

^1,2,*

¹

Center for Quantum Information Science and Technology, Ben-Gurion University of the Negev, Beersheba 8410501, Israel

²

Faculty of Engineering Sciences, Ben-Gurion University of the Negev, Beersheba 8410501, Israel

^*

Authors to whom correspondence should be addressed.

Entropy 2018, 20(3), 151; https://doi.org/10.3390/e20030151

Submission received: 4 December 2017 / Revised: 18 February 2018 / Accepted: 24 February 2018 / Published: 27 February 2018

(This article belongs to the Special Issue Entropy in Foundations of Quantum Physics)

Download

Browse Figures

Versions Notes

Abstract

Why does nature only allow nonlocal correlations up to Tsirelson’s bound and not beyond? We construct a channel whose input is statistically independent of its output, but through which communication is nevertheless possible if and only if Tsirelson’s bound is violated. This provides a statistical justification for Tsirelson’s bound on nonlocal correlations in a bipartite setting.

Keywords:

nonlocality; Bell inequality; Tsirelson’s bound; no-signaling; information causality; Fisher information

1. Introduction

Some of the predictions made by quantum mechanics appear to be at odds with common sense. Yet quantum mechanics remains the most precisely tested and successful quantitative theory of nature. It is therefore believed that even if quantum mechanics is someday replaced, any successor will have to inherit at least some of its “preposterous” but highly predictive principles. Perhaps the most counter-intuitive quantum mechanical feature is nonlocality [1]: the correlations exhibited by remote parties may exceed those allowed by any local realistic model.

The mystery of nonlocality is not only why nature is as nonlocal as it is, but why nature is not more nonlocal than it is. There are alternative Non-Signaling theories which permit nonlocality beyond the quantum limit [2,3]; why doesn’t nature choose one of these theories over quantum mechanics? In Section 1.1 we review several previously proposed explanations. This paper presents another explanation, from statistics.

In this paper we construct a protocol (a repeated oblivious transfer) which sends messages through a disconnected channel. We show that Alice can communicate nontrivial information to Bob via this protocol if and only if the maximal quantum mechanical violation of the Bell–CHSH inequality [1,4], Tsirelson’s bound [5], is exceeded. We thus provide a statistical explanation of this bound that is independent of the mathematical formalism of quantum mechanics.

We briefly recall the setting for the Bell–CHSH experiment. Section 2 provides a more detailed account. A famous application of nonlocality is to construct an 1-2 oblivious transfer protocol between two distant agents (A)lice and (B)ob. Alice and Bob each hold a box. Alice’s box might, for example, contain one half of a singlet state of spin–

\frac{1}{2}

particles, with Bob’s box containing the other half [1,4]. In addition, Alice possesses a pair of bits

x_{0}

and

x_{1}

, each of which is a zero or a one. Using boolean algebra and her boxes (the protocol will be described later), Alice encodes her pair of bits into a single bit

x^{(1)}

which she sends across a classical channel to Bob. Bob wants to know the value either of

x_{0}

or of

x_{1}

, but Alice doesn’t know which of these Bob wants to know. Bob uses the received bit

x^{(1)}

, his box, and some boolean algebra to construct an estimate

y_{i}

for his desired bit

x_{i}

. See Figure 2 later on.

What is the probability that Bob correctly estimates the bit he wishes to know? He has two possible sources of knowledge—the bit

x^{(1)}

he received from Alice, and some mysterious “nonlocal” correlation between his box and Alice’s. The strength of such a nonlocal coordination between two systems is captured by a parameter

c \in [- 1, 1]

called the Bell–CHSH correlator. Bob’s probability of guessing the value of Alice’s bit correctly is

(1 + |c|) / 2

. The Bell–CHSH inequality states that

|c| \leq 1 / 2

in a world governed by classical (non-quantum) mechanics [1,4]. Nonlocality is the state of affairs in which the Bell–CHSH inequality is violated. To the best of our knowledge, real world physics is nonlocal. Over the years, the violation of the Bell–CHSH inequality has been measured in increasingly accurate and loophole-free experiments, culminating in celebrated loophole-free verifications [6,7,8].

Thus, we know that

|c|

can exceed

1 / 2

. How large can

|c|

be? Tsirelson’s bound tells us that

|c|

cannot exceed

1 / \sqrt{2}

in a world described by quantum mechanics [5]. This quantum bound on nonlocality:

|c| \leq \frac{1}{\sqrt{2}},

(1)

has been tested experimentally, with the current state of the art being an experiment which has achieved a value of c which is only

0.00084 \pm 0.00051

distant from Tsirelson’s bound [9]. Such experimental evidence supports the contention that Tsirelson’s bound indeed holds true in the real world. Tsirelson’s result as presented in the original paper is a specifically quantum mechanical fact, following from the Hilbert-space mathematical formalism for quantum mechanics, for which there has been no good conceptual physical explanation. How fundamental is Tsirelson’s bound? Must this inequality also hold for any future theory which might someday supercede quantum mechanics [10]? We are led to the following question: Can we identify a plausible physical principle, independent of quantum mechanics (or independent of functional analysis), which is necessary and sufficient to guarantee that

|c| \leq 1 / \sqrt{2}

?

1.1. Existing Principles

For the last two decades, people have searched for physical principles that bound nonlocality. It was initially expected that the physical principle of relativistic causality (no-signaling) itself restricts the strength of nonlocality [11,12,13]. But then it was discovered that no-signaling theories may exist for which

|c| > 1 / \sqrt{2}

. This led to the device-independent formalism of No-Signaling (NS)–boxes [2,14] (see also [3]). In particular, maximum violation of the Bell–CHSH inequality is achieved by Popescu–Rohrlich (PR)–boxes which are consistent with relativistic causality.

So relativistic causality doesn’t limit nonlocality after all; Why then does nature not permit (1) to be violated (as far as we know)? Several suggestions have been made. Superquantum correlations lead to violations of the Heisenberg uncertainty principle [15,16], which is another seemingly purely quantum result. PR–boxes would allow distributed computation to be performed with only one bit of communication [17], which looks unlikely but doesn’t violate any known physical law. Similarly, in stronger-than-quantum nonlocal theories some computations exceed reasonable performance limits [18]. The principle of Information Causality [19] shows that no sensible measure of mutual information exists between pairs of systems in superquantum nonlocal theories. Our approach is most directly comparable with Information Causality, with a conceptual difference being that we use variance of an efficient estimator, therefore Fisher information, whereas information causality uses mutual information (Shannon information). The relationship between our approach and theirs is the topic of Section 6. Finally, it was shown that superquantum nonlocality does not permit local (non-nonlocal) physics to emerge in the limit of infinitely many microscopic systems [20,21].

1.2. Tsirelson’s Bound from a Statistical No-Signaling Condition

Here we show that Tsirelson’s bound follows from the following principle applied to a certain limiting Bell–CHSH setting:

Statistical No-Signaling: It is impossible to communicate a nontrivial message through a channel whose output is independent of its input.

Our strategy is to construct a channel whose input is a Bernoulli random variable X of mean

θ

and whose output is another Bernoulli random variable Y (Section 3.2). The construction of our channel is not new— it is a reinterpretation of the well-known van Dam protocol [17]. Through the channel, Alice sends

2^{n}

samples

A \overset{def}{=} \{x_{0}, x_{1}, \dots, x_{2^{n} - 1}\}

from X, and at the other end Bob receives a set of values

B \overset{def}{=} \{y_{0}, y_{1}, \dots, y_{m - 1}\}

.

We imagine

θ \in [- 1, 1]

as encoding a message, perhaps in the digits of its binary expansion. Bob’s task is to estimate

θ

. The following theorem states that he can do so if and only if Tsirelson’s bound fails.

Theorem 1.1.

(1): The channel from X to Y we construct is described by the conditional probability $p (Y = x ∣ X = x) = (1 + c^{n}) / 2$ , where c is the Bell–CHSH correlator. Its output satisfies:

$p (Y = 1 ∣ θ) = \frac{1}{2} + \frac{c^{n} \cdot θ}{2} .$

In the $n \to \infty$ limit it disconnects for $p (Y ∣ X) = p (Y)$ (i.e., we can arrange that $c < 1$ ).
(2): The unbiased estimator:

$\hat{θ} \overset{d e f}{=} \frac{1}{2^{n} c^{n}} \sum_{i = 0}^{2^{n} - 1} y_{i},$

for θ has variance:

$Var [\hat{θ} ∣ θ] = \lim_{n \to \infty} \frac{1 - c^{2 n} θ^{2}}{{(2 c^{2})}^{n}} = \{\begin{matrix} 0, & 2 c^{2} > 1 (signaling) \\ 1, & 2 c^{2} = 1 (randomness) \\ \infty, & 2 c^{2} < 1 (no - signaling) \end{matrix}$
(3): The estimator $\hat{θ}$ isefficient, i.e., it has the minimal variance of any estimator of θ constructed from Bob’s set of samples $B$ for all $n \in N$ .

The theorem is visually summarized by Figure 1.

The theorem shows that failure of Tsirelson’s bound leads to failure of the following consequence of Statistical No-Signaling—Consequence of Statistical No-Signaling—In the above notation, if X and Y are independent, then no estimator constructed from

B

has both mean

θ

and variance 0.

Section 5 shows that a violation of Uffink’s inequality [22], a generalization of Tsirelson’s bound, also leads to the failure of the same consequence of Statistical No-Signaling. Uffink’s inequality is also known to be recovered by Information Causality [23].

Theorem 1.1 is formulated as an asymptotic construction, but in practice a finite number of samples suffices because for any experimental setup there exists a nonzero minimal possible environmental noise level

ϵ > 0

. By Theorem 1.1,

p (Y = 1 ∣ θ)

is physically indistinguishable from

1 / 2

when the absolute value of

c^{n} θ / 2

is less than

ϵ

. Since

|θ| \leq 1

, we need

n \geq \ln 2 ϵ / \ln c

trials. As an example, for a photon pair where

ϵ

is greater than or equal to the reduced Planck constant ℏ, we find that

n \geq 244

suffices to make

p (Y = 1 ∣ θ)

physically indistinguishable from

1 / 2

when

|c| \leq 1 / \sqrt{2}

. Thus, if we can still distinguish

p (Y = 1 ∣ θ)

from

1 / 2

for

n = 244

, we know that Tsirelson’s bound has been violated, and if not then it holds.

1.3. Organization of This Paper

Section 2 recalls the bipartite Bell experiment and exhibits the Bell–CHSH correlator c as the correlator of a certain noisy symmetric channel. Section 3 presents the van Dam protocol as an extension of the Bell–CHSH setup, and explain how it defines a noisy symmetric channel with correlator

c^{n}

. Section 4 computes the means and variance of an estimator

\hat{θ}

for

θ

, and proves that

\hat{θ}

is an efficient estimator. Section 5 extends Theorem 1.1 to recover Uffink’s inequality [22,23] for anisotropic correlators from Statistical No-Signaling. Finally, Section 6 discusses the relationship of Statistical No-Signaling with Information Causality.

2. The Bipartite Bell Experiment as a Noisy Symmetric Channel

In this section we recall the definition of the Bell–CHSH correlator c and we formulate the Bell–CHSH inequality, establishing notation. We then exhibit c as the correlator of a symmetric binary channel.

2.1. The Bell–CHSH Inequality

Let us recall the classical bipartite Bell experiment [1]. Alice and Bob each hold one half of an EPR pair (a pair of particles with certain properties summarized below) such as a singlet state of spin–

\frac{1}{2}

particles. They each possess two different measuring instruments. Alice measures her particle using one of the instruments, and Bob measures his particles using one of his. We write i for the index of the instrument used by Alice, and a for its reading. Similarly, we let j and b denote the index of an instrument chosen by Bob and its reading correspondingly. In the language of probability, a and b are

\pm 1

–valued Bernoulli random variables. The choices of measuring instrument, i and j, may be either parameters or

0 / 1

–valued Bernoulli random variables.

Repeating the experiment for many different EPR pairs, Alice and Bob may compute the two-point correlator

E [a b ∣ i, j]

of their readings a and b for any given pair of indices i and j, where

E [\cdot]

is the statistical expectation operator. We now define the Bell–CHSH correlator c by the formula:

c \overset{def}{=} \frac{1}{4} \{E [a b ∣ 0, 0] + E [a b ∣ 0, 1] + E [a b ∣ 1, 0] - E [a b ∣ 1, 1]\} .

(2)

In a theory in which both Alice and Bob’s choices, and the readings of their measuring devices, are local, the Bell–CHSH inequality [4] holds:

|c| \leq \frac{1}{2} .

(3)

Operationally speaking, locality means that Alice’s readings may only be affected by her own choices (and perhaps by other variables hidden locally at her site), and similarly for Bob’s readings. Quantum mechanically, however, Alice and Bob may violate (3). Correlators violating (3) are said to be nonlocal.

2.2. The Bell–CHSH Correlator c as a Channel Correlator

Non-signaling (NS)–boxes provide an abstraction and an extension of the Bell–CHSH experiment [2,14]. This time, Alice and Bob each owns a box. Such a box may be thought of as a complete laboratory containing two measuring devices. Either participants inserts their choice of measuring device into their box. The box output is the respective reading of the chosen measuring device.

Alice and Bob share a pair of NS–boxes whose

0 / 1

–valued inputs are i and j and whose

\pm 1

–valued outputs are Bernoulli random variables a and b. We will show that the Bell–CHSH correlator (2) represents the correlator of a symmetric binary channel whose input is the Bernoulli random variable

X \overset{def}{=} {(- 1)}^{i j}

and whose output is the Bernoulli random variable

Y \overset{def}{=} a \cdot b

.

Let

x \in {- 1, 1}

. Define the channel correlators

c_{x}

as follows:

c_{x} \overset{def}{=} E [X Y ∣ X = x] = p (Y = x ∣ X = x) - p (Y \neq x ∣ X = x) = 2 p (Y = x ∣ X = x) - 1 .

(4)

With respect to a particular choice of measuring devices i and j and for

x = {(- 1)}^{i j}

, (4) becomes:

c_{x} (i, j) = E [a \cdot b \cdot {(- 1)}^{i j} ∣ i, j] = 2 p (a \cdot b = {(- 1)}^{i j} ∣ i, j) - 1 .

(5)

Assume the underlying channel is symmetric and therefore that

c_{x} (i, j)

is fixed for all

i, j

. By (5) the Bell–CHSH correlator (2) may be written as:

c = \frac{1}{4} (c_{1} (0, 0) + c_{1} (0, 1) + c_{1} (1, 0) + c_{- 1} (1, 1)) = c_{x} (i, j) = 2 p (a \cdot b = i j ∣ i, j) - 1 .

(6)

which is our promised interpretation of the Bell–CHSH correlator as a correlator of a noisy symmetric binary channel.

3. The Van Dam Protocol as a Noisy Symmetric Channel

In this section we recall the construction of the van-Dam protocol [17,19]. We then reinterpret this protocol as underlying a noisy symmetric binary channel, as a special case of the construction of Section 2. We compute its correlator, and establish the effect of noise on its classical component.

3.1. The Van Dam Protocol

The van Dam protocol realizes an oblivious transfer protocol by means of a classical channel and a collection of NS-boxes. Each of Alice’s boxes has a corresponding box on Bob’s side, and different pairs of boxes are statistically independent. Suppose that Alice has in her possession the bits

x_{0}, \dots, x_{m - 1}

where

m = 2^{n}

,

n \geq 1

. Bob wishes to know the value of one of her bits. He may do so by specifying the address of the bit whose value he wishes to know via its binary address

j = j_{n - 1} j_{n - 2} \dots j_{0}

. For example, if

n = 2

then Bob may specify which of the bits

x_{0}

to

x_{3}

he wants by specifying a binary address, 00, 01, 10, or 11. Alice bits and Bob addresses are encoded into the inputs of

2^{n} - 1

NS-boxes following a particular protocol which is described next.

Alice uses outputs of boxes and choices of measuring device to determine choices of measuring device for other boxes. Such a procedure is called wiring. The wiring of boxes on Alice side admits a recursive description which we now give. Let

a_{i}^{k, l}

denote the output of Alice’s lth box on the kth level for the input i. We follow the convention that box outputs for the van Dam protocol are 0/1–valued (rather than

\pm 1

–valued) random variables. Let also:

f^{k, l} (q_{1}, q_{2}) \overset{def}{=} q_{1} \oplus a_{q_{1} \oplus q_{2}}^{k, l} .

(7)

Suppose that Alice wishes to encode

m = 4

bits with her boxes. To do so, she first picks two boxes and computes:

x_{1}^{(1)} \overset{def}{=} f^{1, 1} (x_{0}, x_{1}), x_{2}^{(1)} \overset{def}{=} f^{1, 2} (x_{2}, x_{3}) .

(8)

This forms the first level in her construction. The second level then follows:

x^{(2)} \overset{def}{=} f^{2, 1} (x_{1}^{(1)}, x_{2}^{(1)}) .

(9)

In this example there are only two levels and so

x^{(2)}

is the bit which Alice transmits to Bob through the classical channel. In case where

m = 2^{n}

there will be n levels and thus

x^{(n)}

is the bit Bob will receive from Alice.

Unbeknownst to Alice, Bob now decides which bit

x_{j}

he would like to know the value of. He takes its binary address

j = j_{n - 1} j_{i - 2} \dots j_{0}

, and inserts

j_{k - 1}

into all of his boxes whose counterparts are on the k level on Alice’s side. He then uses the values

b_{j_{k - 1}}^{k, l}

that he obtains, together with the bit

x^{(n)}

he received from Alice, to construct the decoding function:

y_{j} \overset{def}{=} x^{(n)} \oplus b_{j_{0}}^{1, l_{1}} \oplus b_{i_{1}}^{2, l_{2}} \oplus \dots \oplus b_{j_{n - 1}}^{n, l_{n}} .

(10)

The values

l_{1}, \dots, l_{n}

(which boxes Bob uses) are determined by the binary address

j = j_{n - 1} j_{n - 2} \dots j_{0}

via the recursive formula

l_{h - 1} = 2 l_{h} - 1 + l_{h - 1}

for

h = 1, 2, \dots n - 1

starting from

l_{n} = 1

.

The van Dam protocol we have described above is summarized in Figure 2.

The probability that Bob will decode the correct value of the bit he desires is governed by the NS–box correlator c. In general, decoding any bit out of

2^{n}

possible bits involves using n pairs of NS boxes. Noting that an even number of errors,

a \oplus b \neq i j

, will cancel out in such a construction, we obtain the following expression [19]:

c^{n} = 2 p (y_{j} = x_{j} ∣ x_{j}) - 1 .

(11)

For example, for

n = 2

:

\begin{array}{l} p (a_{i_{1}} \oplus b_{j_{1}} \oplus a_{j_{2}} \oplus b_{j_{2}} = & i_{1} j_{1} \oplus i_{2} j_{2} ∣ i_{1, 2}, j_{1, 2}, i_{1} j_{1} \oplus i_{2} j_{2}) = \\ p (a_{i_{1}} \oplus b_{j_{1}} = i_{1} j_{1} ∣ a_{1}, b_{1}) p (a_{i_{2}} \oplus b_{j_{2}} = i_{2} j_{2} ∣ i_{2}, j_{2}) + \\ p (a_{i_{1}} \oplus b_{j_{1}} \neq i_{1} j_{1} ∣ i_{1}, j_{1}) p (a_{i_{2}} \oplus b_{j_{2}} \neq i_{2} j_{2} ∣ i_{2}, j_{2}) = \\ \frac{1}{2} (1 + c) \cdot \frac{1}{2} (1 + c) + \frac{1}{2} (1 - c) \cdot \frac{1}{2} (1 - c) = \frac{1}{2} (1 + c^{2}) . \end{array}

(12)

3.2. Van Dam Protocol as a Symmetric Channel

This section describes the modification of the van Dam protocol that we use.

Alice has in her possession an information source that is a

\pm 1

-valued Bernoulli random variable X whose mean is

θ

. Alice takes m iid samples,

{\tilde{x}}_{0}, \dots, {\tilde{x}}_{m - 1}

, from X and converts them into

0 / 1

-valued bits,

x_{0}, x_{1}, \dots, x_{m - 1}

by mapping 0 to

- 1

and 1 to 1. Alice and Bob repeat the van Dam protocol m times, once for each of Alice’s samples. Each time, Bob uses the protocol to estimate Alice’s bit, first

x_{0}

, then

x_{1}

, and so on until

x_{m - 1}

.

As in (12), the van Dam protocol has a memoryless property:

p (y_{i} = x_{i} ∣ x_{0}, x_{1}, \dots, x_{m - 1}) = p (y_{i} = x_{i} ∣ x_{i}) .

(13)

From this it follows that if Alice’s inputs

x_{0}, x_{1}, \dots, x_{m - 1}

are iid then Bob’s outputs

y_{0}, y_{1}, \dots, y_{m - 1}

are also iid. Therefore the set of

{\tilde{y}}_{i} \overset{def}{=} {(- 1)}^{y_{i}}

determines a Bernoulli random variable Y. In this way, the van Dam protocol may be viewed as a symmetric binary channel whose input is X and whose output is Y. By (11) the channel correlator is:

E [X Y ∣ X = {\tilde{x}}_{i}] = 2 p (Y = {\tilde{x}}_{i} ∣ X = {\tilde{x}}_{i}) - 1 = 2 p (y_{i} = x_{i} ∣ x_{i}) - 1 = c^{n} .

(14)

We generalize slightly, for the purpose of treating the

|c| = 1

case in the next section. Suppose that Alice’s bits are contaminated with noise and therefore might be flipped once injected into her boxes. Let

[1 - {(c^{'})}^{n}] / 2

be the probability that the bit

x_{i}

is flipped where

|c^{'}| \leq 1

. In this case the corresponding channel correlator (14) is

E [X Y ∣ X = {\tilde{x}}_{i}] = {(c c^{'})}^{n}

, which follows from (4) and:

\begin{matrix} p (Y = {\tilde{x}}_{i} ∣ X = {\tilde{x}}_{i}) = p (Y = {\tilde{x}}_{i} ∣ X^{'} & = {\tilde{x}}_{i}) p (X^{'} = {\tilde{x}}_{i} ∣ X = {\tilde{x}}_{i}) + \\ p (Y = {\tilde{x}}_{i} ∣ X^{'} \neq {\tilde{x}}_{i}) p (X^{'} \neq {\tilde{x}}_{i} ∣ X = {\tilde{x}}_{i}) = \frac{1}{2} [1 + {(c c^{'})}^{n}], \end{matrix}

(15)

where

p (Y = {\tilde{x}}_{i} ∣ X^{'} = {\tilde{x}}_{i}) = [1 + c^{n}] / 2

underlies the channel defined by the ordinary van Dam protocol, and

p (X^{'} \neq {\tilde{x}}_{i} ∣ X = {\tilde{x}}_{i}) = [1 - {(c^{'})}^{n}] / 2

is the probability of

x_{i}

having been flipped.

3.3. The Van Dam Channel Disconnects in the $n \to \infty$ Limit

If

|c| < 1

or

|c^{'}| < 1

then it follows that:

E [X Y] = 2 p (Y = i ∣ X = i) - 1 = {(c c^{'})}^{n} \overset{n \to \infty}{⟶} 0 .

(16)

Therefore, in the

n \to \infty

limit:

p (Y = i ∣ X = i) = 1 / 2 .

(17)

But also:

p (Y = i) = p (Y = i ∣ X = i) p (X = i) + p (Y = i ∣ X \neq i) p (X \neq i) = \frac{1}{2} (p (X = i) + p (X \neq i)) = \frac{1}{2} .

(18)

Combining (17) with (18) gives:

p (Y ∣ X) \overset{n \to \infty}{⟶} p (Y) .

(19)

Thus X and Y are statistically independent in the

n \to \infty

limit, proving the first part of Theorem 1.1.

4. Bob’s Estimator

4.1. Bob’s Estimator

In Section 3 we used the van Dam protocol to construct a symmetric channel whose input is a

\pm 1

–valued Bernoulli random variable X and whose output is another

\pm 1

–valued Bernoulli random variable Y. The channel correlator is

c^{n}

.

Alice sends m iid random samples

X \overset{def}{=} \{X_{1}, \dots, X_{m}\}

through the channel. Denote the set of respective outputs

Y \overset{def}{=} \{Y_{1}, \dots, Y_{m}\}

. Assume a prior distribution for X given by:

p (X = - 1 ∣ θ) = \frac{1}{2} (1 + θ),

(20)

with parameter

θ \in [- 1, 1]

.

Bob attempts to estimate

θ

using the estimator:

\hat{θ} \overset{def}{=} \frac{1}{2^{n} c^{n}} \sum_{i = 0}^{2^{n} - 1} Y_{i} .

(21)

We will show that Bob’s estimator is unbiased,

E [\hat{θ} ∣ θ] = θ

. Note that

E [Y_{i} ∣ θ] = p (Y = 1 ∣ θ) - p (Y = - 1 ∣ θ) .

(22)

and

p (Y = - 1 ∣ θ) = p (Y = - 1 ∣ X = - 1) p (X = - 1 ∣ θ) + p (Y = - 1 ∣ X = 1) p (X = 1 ∣ θ) = \frac{1 + c^{n} θ}{2} .

(23)

From (22) and (23) together, deduce:

E [Y_{i} ∣ θ] = c^{n} θ .

(24)

and therefore,

E [\hat{θ} ∣ θ] = θ

.

As for variance, by (24):

Var [Y_{i} ∣ θ] = E [Y_{i}^{2} ∣ θ] - E {[Y_{i} ∣ θ]}^{2} = 1 - c^{2 n} θ^{2} .

(25)

Therefore:

Var [\hat{θ} ∣ θ] = \frac{1 - c^{2 n} θ^{2}}{{(2 c^{2})}^{n}} .

(26)

We have proved the second part of Theorem 1.1.

4.2. Bob’s Estimator $\hat{θ}$ is Efficient

We prove efficiency of

\hat{θ}

by calculating the Fisher information about

θ

contained in Bob’s set of samples

B

. The Cramer–Rao Theorem tells us that one over this Fisher information is a lower bound for the variance of an estimator for

θ

constructed from

B

. By showing that

\hat{θ}

saturates this bound, we will have proven that it is efficient. In the derivation that follows, we assume that

|c| < 1

by replacing c by

c c^{'}

if necessary.

We compute the Fisher information. The likelihood of

θ

given the set

B

is given by the expression:

p (B ∣ θ) = {[p (Y = - 1 ∣ θ)]}^{\sum_{i = 1}^{2^{n}} 1_{{Y_{i} = - 1}}} {[p (Y = 1 ∣ θ)]}^{\sum_{i = 1}^{2^{n}} 1_{{Y_{i} = 1}}},

(27)

where the indicator random variable of a random event A is given as:

1_{A} \overset{def}{=} \{\begin{matrix} 1, & A occurred; \\ 0, & otherwise . \end{matrix}

(28)

According to (27) the log-likelihood is given by the expression:

L (θ) \overset{def}{=} \log p (B ∣ θ) = [\sum_{i = 1}^{2^{n}} 1_{{Y_{i} = - 1}}] \log p (Y = - 1 ∣ θ) + [\sum_{i = 1}^{2^{n}} 1_{{Y_{i} = 1}}] \log p (Y = 1 ∣ θ) .

(29)

The Fisher information about

θ

contained in the set

B

is defined as:

I_{B} (θ) \overset{def}{=} E [{(\frac{\partial L (θ)}{\partial θ})}^{2}] = - E [\frac{\partial^{2} L (θ)}{\partial θ^{2}}] .

(30)

Note that:

E [\sum_{i = 1}^{2^{n}} 1_{{Y_{i} = s}}] = \sum_{i = 1}^{2^{n}} E [1_{{Y_{i} = s}}] = 2^{n} p (Y = s ∣ θ), s = - 1, 1 .

(31)

Using this, (30) reads:

I_{B} (θ) = \frac{{(2 c^{2})}^{n}}{1 - c^{2 n} θ^{2}} .

(32)

Indeed the Fisher information about

θ

in

B

as given by Equation (32) equals one over the variance of

\hat{θ}

as given by Equation (26). Thus, by the Cramer–Rao Theorem,

\hat{θ}

is an efficient estimator for

θ

. Parenthetically, note that the minimum of

I_{B} (θ)

is obtained for

θ = 0

in which case

p (X ∣ θ) = 1 / 2

and

I_{B} (0) = {(2 c^{2})}^{n}

. We have proved the final part of Theorem 1.1.

5. Uffink’s Inequality from Statistical No-Signalling

The basic protocol in Section 3 assumes all box correlators are identical in absolute value. When this assumption is relaxed, Statistical No-Signaling leads to Uffink’s inequality, which is a necessary condition for quantum mechanical Bell-CHSH correlators [22,23]. Our approach is based on evaluating the total Fisher information

I_{B} (θ)

gained by Bob in

2^{n}

trials of the experiment.

Suppose that the mean of Alice’s bits,

x_{i}

, is

θ^{'}

for even i, and

θ

otherwise. Consider now a pair of NS-boxes with correlators,

c (i, j) \overset{d e f}{=} E [a b ∣ i, j]

. The channel underlying the van Dam protocol in this case is described by

p (y_{j} = x_{j} ∣ x_{0}, x_{1}) = p (a \oplus b = i j ∣ j, i = x_{0} \oplus x_{1}) = [1 + c (x_{0} \oplus x_{1}, j)] / 2,

(33)

where

y_{j}

is Bob’s guess of Alice’s bit

x_{j}

. It now follows that

\begin{array}{l} p (y_{j} = 1 ∣ θ^{'}, θ) = \\ p (y_{j} = x_{j} ∣ x_{j} = 1, x_{1 - j} = 1) p (x_{j} = 1) p (x_{1 - j} = 1) + p (y_{j} \neq x_{j} ∣ x_{j} = 0, x_{1 - j} = 0) p (x_{j} = 0) p (x_{1 - j} = 0) + \\ p (y_{j} = x_{j} ∣ x_{j} = 1, x_{1 - j} = 0) p (x_{j} = 1) p (x_{1 - j} = 0) + p (y_{j} \neq x_{j} ∣ x_{j} = 0, x_{1 - j} = 1) p (x_{j} = 0) p (x_{1 - j} = 1) = \\ \frac{1}{2} [1 + \frac{1}{2} (c (0, j) + {(- 1)}^{j} c (1, j)) θ^{'} + \frac{1}{2} (c (0, j) - {(- 1)}^{j} c (1, j)) θ] . \end{array}

(34)

For simplicity, assume that

θ^{'} = 0

. It can now be verified that for a n-level construction in the van Dam protocol

p (y_{j_{1}, \dots, j_{n}} = 1 ∣ θ) = \frac{1}{2} [1 + c_{j_{1}} c_{j_{2}} \dots c_{j_{n}} θ],

(35)

where

c_{j} \overset{d e f}{=} (c (0, j) - {(- 1)}^{j} c (1, j)) / 2

. According to (32) the Fisher information about

θ

contained in

y_{j_{1}, \dots, j_{n}}

is

I_{j_{1}, \dots, j_{n}} (θ) = \frac{{(c_{j_{1}} \dots c_{j_{n}})}^{2}}{1 - {(c_{j_{1}} \dots c_{j_{n}})}^{2} θ^{2}} .

(36)

Assuming

|c (i, j)| < 1

, Bob’s total amount of information about

θ

in

2^{n}

trials is

I_{B} (θ) = \sum_{j_{1} = 0, 1} \dots \sum_{j_{n} = 0, 1} I_{j_{1}, \dots, j_{n}} (θ) \approx \sum_{j_{1} = 0, 1} \dots \sum_{j_{n} = 0, 1} {(c_{j_{1}} \dots c_{j_{n}})}^{2} = {[c_{0}^{2} + c_{1}^{2}]}^{n},

(37)

for large n. As before, the underlying channel asymptotically disconnects for

c_{j_{1}} \dots c_{j_{n}} \to 0

in the

n \to \infty

limit. Statistical No-Signaling dictates that in this case the variance of Bob’s estimator

\lim_{n \to \infty} Var [\hat{θ} ∣ θ] = \lim_{n \to \infty} I_{B} {(θ)}^{- 1} \geq 1

, which holds if and only if Uffink’s inequality holds [22],

c_{0}^{2} + c_{1}^{2} = \frac{1}{4} {[c (0, 0) - c (1, 0)]}^{2} + \frac{1}{4} {[c (0, 1) + c (1, 1)]}^{2} \leq 1 .

(38)

6. Relation to Information Causality

Of previous non-quantum justifications of Tsirelson’s bound, Information Causality (IC) is perhaps the closest to Statistical No-Signalling [19]. IC is also stated as a limit on communication: Information gain that Bob can reach about a previously unknown to him data set of Alice, by using all his local resources and m classical bits communicated by Alice, is at most m bits.

IC is formally a restriction on the classical channel capacity. Detecting violation of this principle therefore requires the utilization of nonlocal resources, which the authors achieve through the application of IC to the van Dam protocol, that is the same communication protocol used in this paper.

The Information Causality quantity I is defined as the Shannon mutual information of Alice’s input and Bob’s output given the value of the single bit transmitted in the van Dam protocol. IC holds if

I \leq 1

and is violated if

I > 1

. At the end of the supplementary section of [19], the following expression for the IC quantity is obtained:

I \geq \frac{1}{2 \ln (2)} {(c_{1}^{2} + c_{- 1}^{2})}^{n},

(39)

where

c_{i} \overset{d e f}{=} E [X Y ∣ X = \tilde{i}]

as in (4). In the symmetric setting,

c_{1} = c_{- 1} = c

, and for

θ = 0

, Equations (39) and (32) combine to yield:

I \geq \frac{2^{n} c^{2 n}}{2 \ln (2)} = \frac{[1 - c^{2 n} θ^{2}] I_{B} (θ)}{2 \ln (2)} .

(40)

In particular, in the

n \to \infty

limit, if

2 c^{2} > 1

then

I_{B} (θ) \to \infty

implying that

I \to \infty

. Thus, violation of Statistical No-Signaling implies violation of IC. Conversely, as (39) is an inequality, it is unknown whether Tsirelson’s bound being satisfied implies

I \leq 1

(IC for the van Dam protocol), although, by our main theorem, it does imply

I_{B} (θ) \leq 1

(Statistical No-Signaling for the van Dam protocol).

7. Conclusions

We have formulated a Statistical No-Signaling principle which dictates that no information can pass through a disconnected channel. A violation of Tsirelson’s bound, i.e. a value of

|c|

greater that

1 / \sqrt{2}

, allows us to violate Statistical No-Signalling by constructing a disconnected channel through which Bob can construct an unbiased estimator with variance 0 for Alice’s parameter

θ

. Conversely, when Tsirelson’s bound holds, then, through this channel, so does Statistical No-Signalling. Our construction thus provides a purely statistical justification for Tsirelson’s bound, independent of quantum mechanics.

Acknowledgments

The authors thank Daniel Rohrlich for useful discussions. Avishy Carmi acknowledges support from Israel Science Foundation Grant No. 1723/16.

Author Contributions

Avishy Carmi and Daniel Moskovich have both written the text and worked out the mathematical proofs in this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

Bell, J.S. On the Einstein-Podolsky-Rosen paradox. Physics 1964, 1, 195–200. [Google Scholar] [CrossRef]
Popescu, S.; Rohrlich, D. Quantum nonlocality as an axiom. Found. Phys. 1994, 24, 379–385. [Google Scholar] [CrossRef]
Popescu, S. Nonlocality beyond quantum mechanics. Nature Phys. 2014, 10, 264–270. [Google Scholar] [CrossRef]
Clauser, J.; Horne, M.; Shimony, A.; Holt, R. Proposed experiment to test local hidden—Variable theories. Phys. Rev. Lett. 1969, 23, 880–884. [Google Scholar] [CrossRef]
Cirel’son, B.S. Quantum generalizations of Bell’s inequality. Lett. Math. Phys. 1980, 4, 93–100. [Google Scholar] [CrossRef]
Giustina, M.; Versteegh, M.A.M.; Wengerowsky, S.; Handsteiner, J.; Hochrainer, A.; Phelan, K.; Steinlechner, F.; Kofler, J.; Larsson, J.-A.; Abellán, C.; et al. Significant-loophole-free test of Bell’s theorem with entangled photons. Phys. Rev. Lett. 2015, 115, 250401. [Google Scholar] [CrossRef] [PubMed]
Hensen, B.; Bernien, H.; Dréau, A.E.; Reiserer, A.; Kalb, N.; Blok, M.S.; Ruitenberg, J.; Vermeulen, R.F.L.; Schouten, R.N.; Abellán, C.; et al. Experimental loophole-free violation of a Bell inequality using entangled electron spins separated by 1.3 km. Nature 2015, 526, 682–686. [Google Scholar] [CrossRef] [PubMed]
Shalm, L.K.; Meyer-Scott, E.; Christensen, B.G.; Bierhorst, P.; Wayne, M.A.; Stevens, M.J.; Gerrits, T.; Glancy, S.; Hamel, D.R.; Allman, M.S.; et al. Strong loophole-free test of local realism. Phys. Rev. Lett. 2015, 115, 250402. [Google Scholar] [CrossRef] [PubMed]
Poh, H.S.; Joshi, S.K.; Ceré, A.; Cabello, A.; Kurtsiefer, C. Approaching Tsirelson’s bound in a photon pair experiment. Phys. Rev. Lett. 2015, 115, 180408. [Google Scholar] [CrossRef] [PubMed]
Seife, C. Do deeper principles underlie quantum uncertainty and nonlocality? Science 2005, 309, 98. [Google Scholar] [CrossRef] [PubMed][Green Version]
Shimony, A. Controllable and Uncontrollable Non-Locality. In Proceedings of the International Symposium on Foundations of Quantum Mechanics in the Light of New Technology; Kamefuchi, S., Ed.; Physical Society of Japan: Tokyo, Japan, 1984; pp. 225–230. [Google Scholar]
Shimony, A. Events and processes in the quantum world. In Quantum Concepts in Space and Time; Penrose, R., Isham, C.J., Eds.; Oxford University Press: Oxford, UK, 1986; pp. 182–203. [Google Scholar]
Aharonov, Y.; Rohrlich, D. Nonlocality and Causality. In Quantum Paradoxes: Quantum Theory for the Perplexed; Wiley-VCH: Weinheim, Germany, 2005. [Google Scholar]
Barrett, J.; Linden, N.; Massar, S.; Pironio, S.; Popescu, S.; Roberts, D. Non-local correlations as an information theoretic resource. Phys. Rev. A 2005, 71, 022101. [Google Scholar] [CrossRef]
Wolf, M.; Garcia, D.P.; Fernandez, C. Measurements incompatible in quantum theory cannot be measured jointly in any other no-signaling theory. Phys. Rev. Lett. 2009, 103, 230402. [Google Scholar] [CrossRef] [PubMed]
Oppenheim, J.; Wehner, S. The uncertainty principle determines the non-locality of quantum mechanics. Science 2010, 330, 1072–1074. [Google Scholar] [CrossRef] [PubMed]
Van Dam, W. Implausible consequences of superstrong nonlocality. Nat. Comput. 2013, 12, 9–12. [Google Scholar] [CrossRef]
Linden, N.; Popescu, S.; Short, A.J.; Winter, A. Quantum nonlocality and beyond: Limits from nonlocal computation. Phys. Rev. Lett. 2007, 99, 180502. [Google Scholar] [CrossRef] [PubMed]
Pawlowski, M.; Paterek, T.; Kaszlikowski, D.; Scarani, V.; Winter, A.; Żukowski, M. Information causality as a physical principle. Nature 2009, 461, 1101–1104. [Google Scholar] [CrossRef] [PubMed]
Rohrlich, D. PR-box correlations have no classical limit. In Quantum Theory: A Two-Time Success Story; Struppa, D.C., Tollaksen, J.M., Eds.; Springer: Berlin/Heidelberg, Germany, 2014; pp. 205–211. [Google Scholar]
Navascués, M.; Wunderlich, H. A glance beyond the quantum model. Proc. R. Soc. A 2010, 466, 881–890. [Google Scholar] [CrossRef]
Uffink, J. Quadratic Bell inequalities as tests for multipartite entanglement. Phys. Rev. Lett. 2002, 88, 230406. [Google Scholar] [CrossRef] [PubMed]
Allcock, J.; Brunner, N.; Pawlowski, M.; Scarani, V. Recovering part of the boundary between quantum and nonquantum correlations from information causality. Phys. Rev. A 2009, 80, 040103. [Google Scholar] [CrossRef]

Figure 1. The Statistical No-Signaling condition. The van Dam protocol defines an underlying channel which becomes disconnected in the

n \to \infty

limit. The upper illustration shows this channel and the Fisher information (one over the variance) of the maximum likelihood estimators for

θ

at its input and at its output. When the number of nonlocal resources increases unboundedly, the two ends of the channel become disconnected as illustrated by a vanishing bottleneck in the lower illustration. Statistical No-Signaling dictates that in this case no information can pass through. This occurs if and only if

2 c^{2} \leq 1

. The case of

2 c^{2} > 1

leads to a physically unreasonable limit where Bob can fully read off the value of Alice’s

θ

through a disconnected channel.

Figure 1. The Statistical No-Signaling condition. The van Dam protocol defines an underlying channel which becomes disconnected in the

n \to \infty

limit. The upper illustration shows this channel and the Fisher information (one over the variance) of the maximum likelihood estimators for

θ

at its input and at its output. When the number of nonlocal resources increases unboundedly, the two ends of the channel become disconnected as illustrated by a vanishing bottleneck in the lower illustration. Statistical No-Signaling dictates that in this case no information can pass through. This occurs if and only if

2 c^{2} \leq 1

. The case of

2 c^{2} > 1

leads to a physically unreasonable limit where Bob can fully read off the value of Alice’s

θ

through a disconnected channel.

Figure 2. Distributed oblivious transfer (van Dam) protocol [17]. Its basic building block is on the left, where Alice inserts

x_{0} \oplus x_{1}

into her box, receives a, and sends

x_{0} \oplus a

to Bob. Bob decides that he wants to know the value of

x_{j}

, and he feeds j into his box, which outputs b. Bob’s estimate of

x_{i}

is then

x^{(1)} \oplus b

. When there are multiple boxes, Alice concatenates (the process is called wiring). For example, with seven boxes, Alice begins with a collection of bits

x_{0}, x_{1}, \dots, x_{7}

, and she inputs

x_{2 i} \oplus x_{2 i + 1}

into box i, where

i = 0, 1, 2, 3

, receiving

a_{0}, a_{1}, a_{2}, a_{3}

correspondingly. The bits fed into the next level of boxes become

x_{i}^{(1)} \overset{def}{=} x_{2 i} \oplus a_{i}

with

i = 0, 1, 2, 3

. The final output

x^{(3)}

is sent to Bob. Bob encodes the address of the bit he wants as the binary number

j_{3} j_{2} j_{1}

—for example, if he wants

x_{2}

, then he sets

j_{3} = 0

,

j_{2} = 1

, and

j_{1} = 0

because 10 is 2 in binary. This binary encoding describes a path in his binary tree from a root to a branch, where 0 means ‘go left’ and 1 means ‘go right’. Bob inserts

j_{3}

into the lowermost box to obtain

b_{6}

. Setting

k \overset{def}{=} 5 - (1 - j_{3})

, he then inserts

j_{2}

into box k to obtain

b_{k}

. Finally, setting

l \overset{def}{=} k - (3 - j_{3}) - (1 - j_{2})

, Bob inserts

j_{1}

into box l to obtain

B_{l}

. His final estimate for

x_{j}

is

y_{j} = x^{(3)} \oplus b_{6} \oplus b_{k} \oplus b_{l}

.

Figure 2. Distributed oblivious transfer (van Dam) protocol [17]. Its basic building block is on the left, where Alice inserts

x_{0} \oplus x_{1}

into her box, receives a, and sends

x_{0} \oplus a

to Bob. Bob decides that he wants to know the value of

x_{j}

, and he feeds j into his box, which outputs b. Bob’s estimate of

x_{i}

is then

x^{(1)} \oplus b

. When there are multiple boxes, Alice concatenates (the process is called wiring). For example, with seven boxes, Alice begins with a collection of bits

x_{0}, x_{1}, \dots, x_{7}

, and she inputs

x_{2 i} \oplus x_{2 i + 1}

into box i, where

i = 0, 1, 2, 3

, receiving

a_{0}, a_{1}, a_{2}, a_{3}

correspondingly. The bits fed into the next level of boxes become

x_{i}^{(1)} \overset{def}{=} x_{2 i} \oplus a_{i}

with

i = 0, 1, 2, 3

. The final output

x^{(3)}

is sent to Bob. Bob encodes the address of the bit he wants as the binary number

j_{3} j_{2} j_{1}

—for example, if he wants

x_{2}

, then he sets

j_{3} = 0

,

j_{2} = 1

, and

j_{1} = 0

because 10 is 2 in binary. This binary encoding describes a path in his binary tree from a root to a branch, where 0 means ‘go left’ and 1 means ‘go right’. Bob inserts

j_{3}

into the lowermost box to obtain

b_{6}

. Setting

k \overset{def}{=} 5 - (1 - j_{3})

, he then inserts

j_{2}

into box k to obtain

b_{k}

. Finally, setting

l \overset{def}{=} k - (3 - j_{3}) - (1 - j_{2})

, Bob inserts

j_{1}

into box l to obtain

B_{l}

. His final estimate for

x_{j}

is

y_{j} = x^{(3)} \oplus b_{6} \oplus b_{k} \oplus b_{l}

.

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Carmi, A.; Moskovich, D. Tsirelson’s Bound Prohibits Communication through a Disconnected Channel. Entropy 2018, 20, 151. https://doi.org/10.3390/e20030151

AMA Style

Carmi A, Moskovich D. Tsirelson’s Bound Prohibits Communication through a Disconnected Channel. Entropy. 2018; 20(3):151. https://doi.org/10.3390/e20030151

Chicago/Turabian Style

Carmi, Avishy, and Daniel Moskovich. 2018. "Tsirelson’s Bound Prohibits Communication through a Disconnected Channel" Entropy 20, no. 3: 151. https://doi.org/10.3390/e20030151

APA Style

Carmi, A., & Moskovich, D. (2018). Tsirelson’s Bound Prohibits Communication through a Disconnected Channel. Entropy, 20(3), 151. https://doi.org/10.3390/e20030151

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Tsirelson’s Bound Prohibits Communication through a Disconnected Channel

Abstract

1. Introduction

1.1. Existing Principles

1.2. Tsirelson’s Bound from a Statistical No-Signaling Condition

1.3. Organization of This Paper

2. The Bipartite Bell Experiment as a Noisy Symmetric Channel

2.1. The Bell–CHSH Inequality

2.2. The Bell–CHSH Correlator c as a Channel Correlator

3. The Van Dam Protocol as a Noisy Symmetric Channel

3.1. The Van Dam Protocol

3.2. Van Dam Protocol as a Symmetric Channel

3.3. The Van Dam Channel Disconnects in the $n \to \infty$ Limit

4. Bob’s Estimator

4.1. Bob’s Estimator

4.2. Bob’s Estimator $\hat{θ}$ is Efficient

5. Uffink’s Inequality from Statistical No-Signalling

6. Relation to Information Causality

7. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Tsirelson’s Bound Prohibits Communication through a Disconnected Channel

Abstract

1. Introduction

1.1. Existing Principles

1.2. Tsirelson’s Bound from a Statistical No-Signaling Condition

1.3. Organization of This Paper

2. The Bipartite Bell Experiment as a Noisy Symmetric Channel

2.1. The Bell–CHSH Inequality

2.2. The Bell–CHSH Correlator c as a Channel Correlator

3. The Van Dam Protocol as a Noisy Symmetric Channel

3.1. The Van Dam Protocol

3.2. Van Dam Protocol as a Symmetric Channel

3.3. The Van Dam Channel Disconnects in the n → ∞ Limit

4. Bob’s Estimator

4.1. Bob’s Estimator

4.2. Bob’s Estimator θ ^ is Efficient

5. Uffink’s Inequality from Statistical No-Signalling

6. Relation to Information Causality

7. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3.3. The Van Dam Channel Disconnects in the $n \to \infty$ Limit

4.2. Bob’s Estimator $\hat{θ}$ is Efficient