1. Introduction
Quantum computers are computational devices that operate using phenomena described by quantum mechanics. They can therefore carry out operations that are not available to classical computers. The ability of a quantum computer to solve a specific task faster than any classical computer is usually referred to as quantum advantage. Although quantum algorithms that provide exponential speedup over classical ones are known, they are hard to implement in practice. Examples of such algorithms include Shor’s algorithm for factoring integers [
1], which runs in polynomial time, whereas all known classical algorithms require super-polynomial time. Modern quantum computers are far from experimentally demonstrating quantum advantage on basic problems like integer factorization.
Boson sampling [
2] is a problem that was proposed as a good candidate for demonstrating quantum advantage, since sampling from its output distribution is believed to be classically hard. A boson sampler is a linear-optical device consisting of non-classical sources of indistinguishable photons, a multichannel interferometer that mixes the photons from different sources, and photon detectors at the output channels of the interferometer. In the original proposal, the indistinguishable photons were prepared in Fock states. The problem is then to calculate the photon statistics after the interferometer, given an input state and the interferometer matrix. The relevant parameters are the number of modes
N and the total number of photons injected in the interferometer
M. Experimentally, it corresponds to performing multiple measurements of the photon counts at the outputs of such a device [
3].
Due to the technological complexity of generating Fock states, several variants of the original boson sampling problem have been proposed. They aim at improving the photon generation efficiency and increasing the scale of implementations. One such example is scattershot boson sampling, which uses many parametric down-conversion (PDC) sources to improve the single-photon generation rate. It has been implemented experimentally using a 13-mode integrated photonic chip and six PDC photon sources [
4].
Another variant is Gaussian boson sampling (GBS) [
5,
6], in which Gaussian states are injected into the interferometer instead of Fock states. Gaussian input states can be generated using PDC sources, which allows the non-classical input states to be prepared deterministically. In this variant, the relative phases of the input photons can affect the sampling distribution. Experiments have been carried out with
N = 12 [
7],
N = 100 [
8] and
[
9,
10], with up to 255 photons registered in one event. The latter implementations used PPKTP crystals as PDC sources and employed an active phase-locking mechanism to ensure a coherent superposition.
Any experimental setup, of course, differs from the idealized model considered in theoretical treatments. Bosonic samplers suffer from two fundamental types of imperfections. First, the parameters of a real device, such as the reflection coefficients of the beam splitters and the phase rotations, are never known exactly. A small change in the interferometer parameters can affect the sampling statistics drastically, so that modeling the ideal device is no longer meaningful. The second type of imperfection is photon loss. Losses occur because of imperfections in photon preparation, absorption inside the interferometer, and imperfect detectors and coupling.
There are different ways of modeling losses: for example, by introducing extra beam splitters [
11] or replacing the interferometer matrix by a combination of lossless linear optics transformations and the diagonal matrix that contains transmission coefficients [
12]. In the algorithm described in this paper, we assume that losses occur at the inputs of the interferometer; the exact loss model is described below.
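The decomposition described in [12] can be illustrated numerically: any subunitary transfer matrix factors, via a singular value decomposition, into two lossless (unitary) transformations surrounding a diagonal matrix of transmission coefficients. A minimal numpy sketch under this interpretation (the 4-mode matrix and the uniform 0.8 amplitude transmission are illustrative choices, not values from any experiment):

```python
import numpy as np

rng = np.random.default_rng(0)

# Build a random lossless interferometer (a Haar-like unitary via QR).
a = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))
q, _ = np.linalg.qr(a)

# A lossy transfer matrix: uniform 80% amplitude transmission.
m = 0.8 * q

# SVD: m = v @ diag(sigma) @ wh, with v and wh unitary (lossless optics)
# and sigma holding the per-channel transmission coefficients.
v, sigma, wh = np.linalg.svd(m)

assert np.allclose(v @ np.diag(sigma) @ wh, m)   # exact factorization
assert np.all(sigma <= 1 + 1e-12)                # physical transmissions
assert np.allclose(v.conj().T @ v, np.eye(4))    # v is unitary
```

For a uniformly lossy matrix, all singular values coincide with the overall transmission, which makes the factorization easy to verify.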
Imperfections make intermediate-scale systems, in general, easier to emulate with classical computers [
13]. It was shown [
14] that as the losses in a system increase, the complexity of the task decreases. When the number of photons
that arrive at the outputs is less than
√M, the boson sampling problem can be solved efficiently on classical computers. On the other hand, if the losses are low, the problem remains hard for classical computers [
15].
In this paper, we propose a classical algorithm for calculating the probabilities of output states in a GBS problem. The algorithm uses a Taylor series expansion, and its convergence rate depends on the parameters of the problem, namely the amount of loss in the system and the squeezing parameter of the input states. The higher the losses in the system, the fewer orders of the series are needed to approximate the probability of observing a given output state.
The work by Oh et al. [
16] used the following approach to simulating GBS: the covariance matrix of the output Gaussian state was decomposed into “quantum” and “classical” parts, where the “quantum” part was simulated using matrix product states and the “classical” part was simulated by random displacements. Thus, when the photon loss rate is high, the computational complexity of this algorithm is reduced.
The algorithm that we propose in this paper uses some similar ideas: namely, the zeroth order of the Taylor series may be considered the “classical” part that is computed quite easily, while the remaining terms are the “quantum” part that is more computationally complex. The contribution of this “quantum” part is smaller when the losses in the system are high; thus, our algorithm also has optimal conditions that depend on the magnitude of losses. We also analyze some recent GBS implementations to compare the conditions in those experiments with the optimal conditions for our algorithm.
2. Problem Specification
Let us first consider a lossless linear-optics interferometer with a transmission matrix
U:
where creation operators acting on the
i-th input and output modes are denoted
and
. Suppose the input modes are injected with single-mode squeezed states:
where we omit the state’s normalizing constant
.
The goal is to calculate the probability of detecting
photons in the first output mode,
photons in the second output mode and so on. This probability can be calculated in the following way:
where
is the density matrix of the output state.
Modeling Losses
In real-life bosonic samplers, there will always be losses. Here, we will model them by substituting
where
acts on a mode that we cannot observe, and
,
. Now, the goal is to compute the same probability (
) but taking losses into account. The input state will now be
and we now take partial trace over all loss modes when calculating the density matrix:
3. Algorithm Derivation
Let us consider a single mode:
3.1. Calculating Partial Trace
We start by applying the Hubbard–Stratonovich transformation [
17,
18]
to both exponents in the density matrix operator. This gives us the following:
Let us redefine
,
for convenience:
We can now calculate the partial trace over loss modes:
The following expression can be simplified:
The density matrix now can be written in the following way:
3.2. Switching between Probability Density Functions
We can view this integral as taking an expected value over a
-dimensional normal distribution.
and
then become normally distributed random variables with a mean vector equal to zero. Their covariance matrix has the following form:
Then, we can write
where
denotes averaging over the
-dimensional normal distribution
.
The expression is troublesome to calculate, since there are two different variables and . We want to arrive somehow at an expression with only one such variable, i.e., , which we will denote .
We now will choose normally distributed random variables
such that
and the distributions over
and
have the same moments:
We have some freedom in choosing these variables; we will set
so that
and
. Then, the covariance matrix
of
will be determined by one parameter
:
Note that
. We will later find an optimal way to choose
h. The density matrix in terms of the new variables
is
Since
and
, the distribution
can be split into a combination of distributions over
and over
. The covariance matrix
of
and
is
and the distribution
can be written as
3.3. Taylor Series Expansion
We now consider the Taylor series of the expression
, leaving only
in the exponent:
Each term in the expression will be proportional to
and since
and
, the integral over
can be written as a product of integrals over
and
:
The moments
can be calculated analytically using Wick’s probability theorem.
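Wick’s probability theorem expresses any higher moment of a zero-mean Gaussian vector as a sum over perfect pairings of products of covariances. A small illustrative implementation (recursive and exponential in the moment order, which is acceptable for the low orders used here):

```python
import numpy as np

def gaussian_moment(cov, idx):
    """E[x_{i1} x_{i2} ...] for a zero-mean Gaussian x with covariance cov,
    computed via Wick's theorem as a sum over perfect pairings."""
    if len(idx) == 0:
        return 1.0
    if len(idx) % 2 == 1:
        return 0.0  # odd moments of a zero-mean Gaussian vanish
    first, rest = idx[0], list(idx[1:])
    total = 0.0
    for k in range(len(rest)):
        # pair `first` with rest[k], recurse on the remaining indices
        total += cov[first][rest[k]] * gaussian_moment(cov, rest[:k] + rest[k + 1:])
    return total

cov = np.array([[2.0, 0.5], [0.5, 1.0]])
# E[x0^4] = 3 * Var(x0)^2 = 3 * 4 = 12
assert abs(gaussian_moment(cov, [0, 0, 0, 0]) - 12.0) < 1e-12
# E[x0^2 x1^2] = Var0 * Var1 + 2 * Cov01^2 = 2 + 0.5 = 2.5
assert abs(gaussian_moment(cov, [0, 0, 1, 1]) - 2.5) < 1e-12
```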
3.4. Choosing
The idea consists of minimizing the “perturbation parameter” so that each subsequent order of the Taylor series expansion has less impact on the expression. Since higher orders of the expansion contain higher powers of and higher moments , and these moments can be calculated via second moments and , the role of the “perturbation parameter” is played by .
Let us consider the conditions that must be satisfied by
h. Firstly,
h must satisfy
, because
and
. Secondly, since
is a covariance matrix, its eigenvalues must be non-negative. The eigenvalues of
are
,
and
. Thus,
h needs to satisfy
The minimum of is realized when .
3.5. Multimode Case
Let us apply the steps described above to the case of
N modes. We start with an input state
We construct a density matrix and take the partial trace over loss modes:
We apply the Hubbard–Stratonovich transformation
times, resulting in an integral over
:
where the integral for each variable
and
is calculated over
.
Again, we redefine
,
:
We compute partial trace over loss modes:
This expression can now be considered as taking an expected value over a
-dimensional normal distribution where all variable pairs
are independent. Every variable pair
has covariance matrix
, and we can write this expression in the following way:
For each variable pair
we now choose
in a way that is described above. Then,
We now consider the Taylor series expansion (up to the second order) of the expression in the square brackets, which we will denote
:
The creation operators
that act on the input modes can be written in terms of the operators
that act on the output modes:
We can expand the brackets in the expression for
, leaving the terms up to the second order:
When we take the product of these two expressions, most of the resulting terms will have zero expected value because of the properties of the normal distribution. Then
The integrals over
,
result in specific moments of the distribution, and the integral over
can be calculated using Monte Carlo methods. The final expression is
where by Wick’s probability theorem
.
3.6. Calculating Traces
In order to calculate
, we need to be able to calculate expressions
,
,
,
, etc., for different
. The first one can be calculated fairly easily:
Now, suppose we need to calculate
. First, we note that
where by, e.g.,
we mean
.
4. Algorithm Overview
The goal of the algorithm is to calculate the probability of a state , given , , c, s and U. We assume that the Taylor series expansion is calculated up to the desired order before computation starts. The integrals over and should also be computed in advance (this can be done analytically via Wick’s probability theorem).
We start by calculating the two-variable covariance matrix using and s. We then select in the way specified above so that it minimizes the series expansion parameter . In order to compute the integrals over , we sample for each i from a normal distribution .
We then compute , which by linearity amounts to computing traces of the form described above; for each sample, only a polynomial number of operations is needed.
Finally, we take an average over our samples and multiply by the necessary constant .
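Structurally, the sampling stage is a plain Monte Carlo estimate of an expectation over a multivariate normal distribution. A generic sketch of this structure follows; the integrand is a hypothetical stand-in for the truncated-series term, not the paper’s actual expression:

```python
import numpy as np

def mc_expectation(integrand, cov, n_samples, seed=0):
    """Estimate E[f(z)] for z ~ N(0, cov) by averaging over samples."""
    rng = np.random.default_rng(seed)
    chol = np.linalg.cholesky(cov)  # maps N(0, I) draws to N(0, cov)
    dim = cov.shape[0]
    total = 0.0
    for _ in range(n_samples):
        z = chol @ rng.standard_normal(dim)
        total += integrand(z)
    return total / n_samples

# Sanity check: E[z0^2] should approach Var(z0) = 4.
cov = np.array([[4.0, 0.0], [0.0, 1.0]])
est = mc_expectation(lambda z: z[0] ** 2, cov, n_samples=200_000)
assert abs(est - 4.0) < 0.1
```

In the actual algorithm, the integrand would evaluate the truncated Taylor series for a sampled vector, and the average is then multiplied by the overall normalizing constant.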
5. Taylor Series Convergence for Actual Experimental Conditions
We have discussed above that the role of the “perturbation parameter” in the series expansion is played by , which we can choose to be equal to . This parameter depends on the experimental conditions (i.e., the squeezing parameter of the input state and the loss level ). The smaller this parameter is, the faster the series converges. Thus, the best conditions for this algorithm are achieved when the loss level is high and the squeezing parameter is low. Let us consider actual experimental implementations of the Gaussian boson sampling problem and estimate how small this parameter is under those conditions.
Let us consider the relation between and the average number of photons per state . If the squeezing parameter is , then , while .
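For reference, for a single-mode squeezed vacuum state with squeezing parameter r, the standard relation for the mean photon number is ⟨n⟩ = sinh²(r). Assuming this relation holds for the input states, the squeezing required for a given mean photon number per mode can be inverted directly:

```python
import numpy as np

def mean_photons(r):
    # Mean photon number of a single-mode squeezed vacuum with squeezing r.
    return np.sinh(r) ** 2

def squeezing_for(n_mean):
    # Inverse of the relation above: r = arcsinh(sqrt(<n>)).
    return np.arcsinh(np.sqrt(n_mean))

# e.g. a per-mode photon number of 0.86, roughly the level in the
# experiments discussed below.
r = squeezing_for(0.86)
assert abs(mean_photons(r) - 0.86) < 1e-12
```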
In a paper by Zhong et al. [
8], 25 PPKTP crystals were used to produce 25 two-mode squeezed states, which is equivalent to 50 single-mode squeezed states. The average number of photons registered by the detectors was 43. Thus, the average amount of photons per mode
is around
;
,
. The average collection efficiency is said to be
. Then,
.
In another paper by Zhong et al. [
9], the average number of photons produced was increased to 70 at maximum pump intensity. This corresponds to
. The overall transmission rate in the experiment is said in the paper to be
and
for different settings, so we take
. This yields
.
In the most recent experiment by Deng et al. [
10], the average number of photons was increased even further, measuring states with
,
and
photons on average with different pump intensities while still producing 25 two-mode squeezed states. The efficiency of the setup is said to be
, yielding
,
and
, respectively.
To estimate the expected accuracy of the algorithm, we can assume that the numerical values of each order are approximately equal, meaning that we can write
where
denotes the sum of all the terms of the
k-th order, and
is assumed. The expression then becomes a geometric progression with common ratio
. Then, on average, the 0-th order contributes to the probability a part equal to
, while the second order contributes
, the fourth contributes
, etc.
Calculating up to the second order then discards a total contribution of , which is approximately 0.18² = 3.2%, 0.14² = 1.96%, 0.11² = 1.21% and 0.12² = 1.44% for the conditions analyzed above. When the calculation is performed up to the fourth order, the lost contribution is approximately 0.18³ ≈ 0.58%, 0.14³ ≈ 0.27%, 0.11³ ≈ 0.13% and 0.12³ ≈ 0.17%.
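Under the geometric-progression assumption, these discarded contributions follow directly from the estimated common ratios; a quick check, assuming ratios of 0.18, 0.14, 0.11 and 0.12 for the four experimental settings:

```python
# Fraction of the probability discarded when truncating a geometric
# series with common ratio eps after the second / fourth order.
for eps in [0.18, 0.14, 0.11, 0.12]:
    after_second = eps ** 2
    after_fourth = eps ** 3
    print(f"eps={eps}: ~{after_second:.2%} discarded / ~{after_fourth:.2%} discarded")

assert abs(0.18 ** 2 - 0.0324) < 1e-12   # ~3.2%
assert abs(0.18 ** 3 - 0.005832) < 1e-12 # ~0.58%
```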
The conclusion that we draw is that even in large GBS experiments which are said to demonstrate quantum advantage, the conditions are such that is fairly small, and calculating up to the fourth order is enough for the lost contribution to be below 1%.
6. Implementation Details
6.1. Contraction Precomputation
We can rewrite it as
where
is a contraction of
U with itself. It depends only on
U and can be calculated before sampling
, which reduces the number of operations required to calculate each probability sample.
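As an illustration of this kind of precomputation (the specific contraction below is a hypothetical example, not the paper’s exact tensor), a contraction of U with itself can be evaluated once with numpy’s einsum and then reused for every sample:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 6
u = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))

# Precompute once, before any sampling:
# c[i, j] = sum_k u[k, i] * conj(u[k, j])   (illustrative contraction)
c = np.einsum("ki,kj->ij", u, u.conj())

# Equivalent naive triple loop, for verification only.
c_loop = np.zeros((n, n), dtype=complex)
for i in range(n):
    for j in range(n):
        for k in range(n):
            c_loop[i, j] += u[k, i] * np.conj(u[k, j])

assert np.allclose(c, c_loop)
```

The einsum version is evaluated once per problem instance, so its cost is amortized over all probability samples.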
6.2. Factorial Fractions Precomputation
In calculating traces of the form described above, we need to calculate factorial fractions of the form , where . Since the target state is fixed, .
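Factorial fractions n!/m! with m ≤ n can be precomputed without ever forming large factorials, by multiplying only the integers in (m, n] and storing the results; a small sketch (the table bound n_max is an illustrative choice):

```python
import math

def factorial_ratio(n, m):
    """n! / m! for m <= n, without computing either factorial in full."""
    result = 1
    for k in range(m + 1, n + 1):
        result *= k
    return result

# Precompute a lookup table for all 0 <= m <= n <= n_max, since the
# target state is fixed before sampling begins.
n_max = 20
table = {(n, m): factorial_ratio(n, m)
         for n in range(n_max + 1) for m in range(n + 1)}

assert table[(5, 2)] == 60                  # 5!/2! = 120/2
assert table[(20, 0)] == math.factorial(20)
```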
6.3. Reusing
While calculating each trace, we can calculate
only once for each
sample and then reuse it, thus using fewer operations to calculate each trace. Let us denote
;
. Then,
and
7. Complexity Analysis
7.1. Precomputation
In this section, we will analyze the computational complexity of precomputation. By precomputation we mean the calculations that need to be carried out only once before
sampling and before calculating probability samples for each
. The multiplicative constant
can be calculated with
multiplication operations. For each term in the resulting sum, we will define its order to be the number of variables
and
, or, equivalently, the power of the loss parameter
c. Thus, the term
will be of the second order. Then, each term of the order
K will have a contraction of the form
where some of the
can be conjugated. This leaves at most
different ways to conjugate the factors. Each contraction has
K free indices, and calculating the sum requires
additions and
multiplications. The total number of additions is
and the number of multiplications is
, where
K is the maximum order we choose to calculate.
Calculating all for requires only around multiplications, since , , , …, .
7.2. Probability Sample Computation
Here, we will analyze the computational complexity of calculating a single probability sample given . We will assume that the terms are calculated up to some order K.
Calculating the trace
requires one multiplication of an
matrix by a
N-dimensional vector,
N exponentiation operations and
multiplication operations. This calculation needs to be performed only once for each
. Calculating any other trace of the form
requires
exponentiation operations and
multiplication operations (since factorial fractions are precomputed).
The number of terms for a given order K is times the number of different non-zero K-th order moments . The exact number is hard to calculate, but the total number of moments (including those that are zero) is . Thus, the maximum number of terms required to compute is .
Since the number of operations required to calculate each term is , the total computational complexity of calculating a probability sample for a given is .
8. Results
Below are the results of probability calculation for for different output states. The calculated probabilities are compared to exact solutions. The parameters are , . The number of samples is 4096.
These results show that for calculating a single output state probability accurately, the number of samples needs to be on the order of
. Below are the results of using fewer samples per state, but instead of comparing individual probabilities, we look at the cosine similarity between the exact and approximated probability distributions over all two-photon states,
Figure 1,
Figure 2 and
Figure 3.
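The cosine similarity used in these comparisons is the standard one, treating the two probability distributions as vectors:

```python
import numpy as np

def cosine_similarity(p, q):
    """Cosine of the angle between two probability vectors p and q."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    return float(np.dot(p, q) / (np.linalg.norm(p) * np.linalg.norm(q)))

# Illustrative vectors (not data from the paper's figures).
exact = np.array([0.50, 0.30, 0.20])
approx = np.array([0.48, 0.33, 0.19])
assert abs(cosine_similarity(exact, exact) - 1.0) < 1e-12
assert 0.99 < cosine_similarity(exact, approx) <= 1.0 + 1e-12
```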
The above graph suggests that the number of samples per state needed to approximate the distribution does not depend much on
N. It is computationally hard to check this when comparing to the exact solution, but if we assume that the cosine similarity converges to a value close to 1, we can estimate how quickly it converges. Below, we look at the cosine similarity between a distribution calculated using
H samples per state and a distribution calculated with
samples per state for different
H, where we choose
. This allows us to estimate how much the distribution changes with
new samples: if the cosine similarity is close to 1, then new samples do not alter the distribution significantly.
Figure 4 suggests more strongly that the number of samples per state required for accurate approximation is not really influenced by
N. This can be explained by the fact that the number of
-photon states increases with
N. If the number of samples per state is constant, then the total number of samples increases with
N.
Below are benchmark results that show the average precomputation time, which depends only on
N, and the time per sample, which depends on
N and the amount of photons
M in the target state,
Figure 5 and
Figure 6.
These results show that even GBS instances with many modes can be simulated on an average laptop using this algorithm.
A direct comparison of the performance of our method with other published results is problematic, since our algorithm calculates the probability of output states rather than directly sampling states from some approximate distribution. The closest algorithm is described in [
16]. However, the performance comparison is still problematic because the error bars of the two methods cannot be compared directly. Nevertheless, judging by Figure 9 from [
16], our algorithm is more memory-efficient, since it uses about 0.1 GB of memory to run in conditions where the transmission rate is 0.5 and the number of modes is 40, whereas the algorithm [
16] uses 1 TB, and the memory requirement grows exponentially with the number of modes.
9. Conclusions
In this paper, we have presented a new algorithm for the approximate calculation of the probability of observing a given output state in a Gaussian boson sampling instance. We have discussed various implementation details that help to reduce the number of operations needed to calculate each probability sample. We have also analyzed the total computational complexity, both of the calculations that need to be carried out once for each specific problem and of computing each probability sample.
This algorithm relies on the Taylor series expansion where the “perturbation” parameter is dependent on the problem conditions. The algorithm consists of calculating the terms of this Taylor series up to some finite order. For a fixed maximum order, the computational complexity of the algorithm is polynomial in N.
We have demonstrated that increasing the maximum order does increase the accuracy of the answer. We have also measured the precomputation and sampling time on a regular CPU, showing that even large instances of Gaussian boson sampling can be solved in reasonable time.
We have considered recent GBS experiments and estimated the parameters of the problem under those conditions. We conclude that the contribution of the terms discarded when the calculation is completed up to the second order is a few percent at most, and if the calculation is completed up to the fourth order, this number drops below 1%.