The Entropic Dynamics Approach to Quantum Mechanics

Caticha, Ariel

doi:10.3390/e21100943

Open AccessFeature PaperArticle

The Entropic Dynamics Approach to Quantum Mechanics

by

Ariel Caticha

Physics Department, University at Albany-SUNY, Albany, NY 12222, USA

Entropy 2019, 21(10), 943; https://doi.org/10.3390/e21100943

Submission received: 13 August 2019 / Revised: 16 September 2019 / Accepted: 18 September 2019 / Published: 26 September 2019

(This article belongs to the Special Issue Entropy in Foundations of Quantum Physics)

Download Versions Notes

Abstract

:

Entropic Dynamics (ED) is a framework in which Quantum Mechanics is derived as an application of entropic methods of inference. In ED the dynamics of the probability distribution is driven by entropy subject to constraints that are codified into a quantity later identified as the phase of the wave function. The central challenge is to specify how those constraints are themselves updated. In this paper we review and extend the ED framework in several directions. A new version of ED is introduced in which particles follow smooth differentiable Brownian trajectories (as opposed to non-differentiable Brownian paths). To construct ED we make use of the fact that the space of probabilities and phases has a natural symplectic structure (i.e., it is a phase space with Hamiltonian flows and Poisson brackets). Then, using an argument based on information geometry, a metric structure is introduced. It is shown that the ED that preserves the symplectic and metric structures—which is a Hamilton-Killing flow in phase space—is the linear Schrödinger equation. These developments allow us to discuss why wave functions are complex and the connections between the superposition principle, the single-valuedness of wave functions, and the quantization of electric charges. Finally, it is observed that Hilbert spaces are not necessary ingredients in this construction. They are a clever but merely optional trick that turns out to be convenient for practical calculations.

Keywords:

quantum mechanics; entropic dynamics; symplectic geometry; information geometry

1. Introduction

Quantum mechanics has been commonly regarded as a generalization of classical mechanics with an added element of indeterminism. The standard quantization recipe starts with a description in terms of the system’s classical coordinates and momenta

{q, p}

and then proceeds by applying a series of more or less ad hoc rules that replace the classical

{q, p}

by self-adjoint linear operators

{\hat{q}, \hat{p}}

acting on some complex Hilbert space [1]. The Hilbert space structure is given priority while the probabilistic structure is relegated to the less fundamental status of providing phenomenological rules for how to handle those mysterious physical processes called measurements. The result is a dichotomy between two separate and irreconcilable modes of wave function evolution: one is the linear and deterministic Schrödinger evolution and the other is the discontinuous and stochastic wave function collapse [2,3]. To put it bluntly, the dynamical and the probabilistic aspects of quantum theory are incompatible with each other. And furthermore, the dichotomy spreads to the interpretation of the quantum state itself [4,5,6,7,8]. It obscures the issue of whether the wave function describes the ontic state of the system or whether it describes an epistemic state about the system [9].

In the Entropic Dynamics (ED) approach these problems are resolved by placing the probabilistic aspects of QM at the forefront while the Hilbert space structure is relegated to the secondary role of a convenient calculational tool [10,11,12]. ED tackles QM as an example of entropic inference, a framework designed to handle insufficient information [13,14,15,16,17,18]. The starting point is to specify the subject matter, the ontology—are we talking about the positions of particles or the configurations of fields? Once this decision is made our inferences about these variables are driven by entropy subject to information expressed by constraints. The main effort is directed towards choosing those constraints since it is through them that the “physics” is introduced.

From the ED perspective many of the questions that seemed so urgent in other approaches are successfully evaded. For example, when quantum theory is regarded as an extension of classical mechanics any deviations from causality demand an explanation. In contrast, in the entropic approach uncertainty and probabilities are the norm. Indeterminism is just the inevitable consequence of incomplete information and no deeper explanation is needed. Instead, it is the certainty and determinism of the classical limit that require explanations. Another example of a question that has consumed an enormous effort is the problem of deriving the Born rule from a fundamental Hilbert space structure. In the ED approach this question does not arise and the burden of explanation runs in the opposite direction: how do objects such as wave functions involving complex numbers emerge in a purely probabilistic framework? Yet a third example concerns the interpretation of the wave function itself. ED offers an uncompromising and radically epistemic view of the wave function Ψ. This turns out to be extremely restrictive: in a fully epistemic interpretation there is no logical room for “quantum” probabilities obeying alternative rules of inference. Not only is the probability

{| Ψ |}^{2}

interpreted as a state of knowledge but, in addition, the epistemic significance of the phase of the wave function must be clarified and made explicit. Furthermore, it is also required that all updates of Ψ, which include both its unitary time evolution and the wave function collapse during measurement, must be obtained as a consequence of entropic and Bayesian updating rules [19,20,21,22,23,24].

There is a large literature on reconstructions of quantum mechanics (see e.g., [25,26,27,28,29,30,31] and references therein) and there are several approaches based on information theory (see e.g., [32,33,34,35,36,37,38,39,40,41,42,43,44,45,46]). What distinguishes ED is a strict adherence to Bayesian and entropic methods and a central concern with the nature of time. The issue here is that any discussion of dynamics must inevitably include a notion of time but the rules for inference do not mention time—they are totally atemporal. One can make inferences about the past just as well as about the present or the future. This means that any model of dynamics based on inference must also include assumptions about time, and those assumptions must be explicitly stated. In ED “entropic” time is a book-keeping device designed to keep track of changes. The construction of entropic time involves several ingredients. One must introduce the notion of an ‘instant’; one must show that these instants are suitably ordered; and finally, one must define a convenient measure of the duration or interval between the successive instants. It turns out that an arrow of time is generated automatically and entropic time is intrinsically directional.

This paper contains a review of previous work on ED and extends the formalism in several new directions. In [10,11,12] the Schrödinger equation was derived as a peculiar non-dissipative diffusion in which the particles perform an irregular Brownian motion that resembles the Einstein–Smoluchowski (ES) process [47]. The trajectories are continuous and non-differentiable so their velocity is undefined. Since the expected length of the path between any two points is infinite this would be a very peculiar motion indeed. Here we exhibit a new form of ED in which the Brownian motion resembles the much smoother Oernstein–Uhlenbeck (OU) process [47]. The trajectories have finite expected lengths; they are continuous and differentiable. On the other hand, although the velocities are well defined and continuous, they are not differentiable [25,48].

We had also shown that the irregular Brownian motion at the “microscopic” or sub-quantum level was not unique. One can enhance or suppress the fluctuations while still obtaining the same emergent Schrödinger behavior at the “macroscopic” or quantum level [49,50]. A similar phenomenon is also found in the smoother ED developed here. In both the ES and the OU cases the special limiting case in which fluctuations are totally suppressed turns out to be of particular interest because the particles evolve deterministically along the smooth lines of probability flow. This means that ED includes the Bohmian or causal form of quantum mechanics [51,52,53] as a limiting case.

ED consists of the entropic updating of probabilities through information supplied by constraints. The main concern is how these constraints are chosen including, in particular, how the constraints themselves are updated. In [54] an effective criterion was found by adapting Nelson’s seminal insight that QM is a non-dissipative diffusion [55]. This amounts to updating constraints in such a way that a certain energy functional is conserved. Unfortunately, this criterion, while fully satisfactory in a non-relativistic setting, fails in curved space-times where the concept of a globally conserved energy may not exist.

The second contribution in this paper is a geometric framework for updating constraints that does not rely on the notion of a conserved energy. Our framework draws inspiration from two sources: one is the fact that QM has a rich geometrical structure [56,57,58,59,60,61,62,63,64]. The authors of [56,57,58,59,60,61,62] faced the task of unveiling geometric structures that, although well hidden, are already present in the standard QM framework. Our goal runs in the opposite direction: we impose these natural geometric structures as the foundation upon which we reconstruct the QM formalism.

The other source of inspiration is the connection between QM and information geometry [17,65,66,67,68] that was originally suggested in the work of Wootters [32]. This connection has been explored in the context of quantum statistical inference [69], in the operational description of quantum measurements [37,39], and in the reconstruction of QM [43,44]. Our previous presentation in [12] has been considerably streamlined by recognizing the central importance of symmetry principles when implemented in conjunction with concepts of information geometry.

In ED, the degrees of freedom are the probability densities

ρ (x)

and certain “phase” fields Φ

(x)

that represent the constraints that control the flow of probabilities. Thus, we are concerned not just with the “configuration” space of probabilities

{ρ}

but with the larger space of probabilities and phases

{ρ, Φ}

. The latter has a natural symplectic structure, i.e.,

{ρ, Φ}

is a phase space. Imposing a dynamics that preserves this symplectic structure leads to Hamiltonian flows, Poisson brackets, and so much of the canonical formalism associated with mechanics. To single out the particular Hamiltonian flow that reproduces QM we extend the information geometry of the configuration space

{ρ}

to the full phase space. This is achieved by imposing a symmetry that is natural in a probabilistic setting: we extend the well-known spherically symmetric information geometry of the space

{ρ}

to the full phase space

{ρ, Φ}

. This construction yields a derivation of the Fubini–Study metric. A welcome by-product is that the joint presence of a symplectic and a metric structure leads to a complex structure. This is the reason QM involves complex numbers.

The dynamics that preserves the metric structure is a Killing flow. We propose that the desired geometric criterion for updating constraints is a dynamics that preserves both the symplectic and the metric structures. Thus, in the final step of our reconstruction of QM we show that the Hamiltonians that generate Hamiltonian-Killing flows lead to an entropic dynamics described by the linear Schrödinger equation.

We conclude with some comments exploring various aspects of the ED formalism. We show that despite the arrow of entropic time, the resulting ED is symmetric under time reversal. We discuss the connections between linearity, the superposition principle, the single-valuedness of wave functions, and the quantization of charge. We also discuss the classical limit and the Bohmian limit in which fluctuations are suppressed and particles follow deterministic trajectories. Finally, we discuss the introduction of Hilbert spaces. We argue that while strictly unnecessary in principle, Hilbert spaces are extremely convenient for calculational purposes.

This paper focuses on the derivation of the Schrödinger equation but the ED approach has been applied to a variety of other topics in quantum theory. These include: the quantum measurement problem [70,71]; momentum and uncertainty relations [50,72] (see also [73,74,75,76]); the Bohmian limit [49,50] and the classical limit [77]; extensions to curved spaces [78]; to relativistic fields [79,80]; and the ED of spin [81].

2. The ED of Short Steps

We deal with N particles living in a flat 3-dimensional space X with metric

δ_{a b}

. For N particles the configuration space is

X_{N} = X \times \dots \times X

. We assume that the particles have definite positions

x_{n}^{a}

and it is their unknown values that we wish to infer [82]. (The index

n = 1, \dots, N

denotes the particle and

a = 1, 2, 3

the spatial coordinates.)

In ED positions play a very special role: they define the ontic state of the system. This is in contradiction with the standard Copenhagen notion that quantum particles acquire definite positions only as a result of a measurement. For example, in the ED description of the double slit experiment the particle definitely goes through one slit or the other but one might not know which. The wave function, on the other hand, is a purely epistemic notion and, as it turns out, all other quantities, such as energy or momentum, are epistemic too. They do not reflect properties of the particles but properties of the wave function [70,71,72].

Having identified the microstates

x \in X_{N}

we tackle the dynamics. The main dynamical assumption is that the particles follow trajectories that are continuous. This represents an enormous simplification because it implies that a generic motion can be analyzed as the accumulation of many infinitesimally short steps. Therefore, the first task is to find the transition probability

P (x^{'} | x)

for a short step from an initial x to an unknown neighboring

x^{'}

and only later we will determine how such short steps accumulate to yield a finite displacement.

The probability

P (x^{'} | x)

is found by maximizing the entropy

S [P, Q] = - \int d x^{'} P (x^{'} | x) \log \frac{P (x^{'} | x)}{Q (x^{'} | x)}

(1)

relative to the joint prior

Q (x^{'} | x)

subject to constraints given below. (In multidimensional integrals such as (1) the notation

d x^{'}

stands for

d^{3 N} x^{'}

.)

The prior. The choice of prior

Q (x^{'} | x)

must reflect the state of knowledge that is common to all short steps. (It is through the constraints that the information that is specific to any particular short step will be supplied.) We adopt a prior that carries the information that the particles take infinitesimally short steps and reflects the translational and rotational invariance of the Euclidean space X but is otherwise uninformative. In particular, the prior expresses total ignorance about any correlations. Such a prior can itself be derived from the principle of maximum entropy. Indeed, maximize

S [Q] = - \int d x^{'} Q (x^{'} | x) \log \frac{Q (x^{'} | x)}{μ (x^{'})},

(2)

relative to the uniform measure

μ (x^{'})

[83], subject to normalization, and subject to the N independent constraints

〈 δ_{a b} Δ x_{n}^{a} Δ x_{n}^{b} 〉 = κ_{n}, (n = 1 \dots N),

(3)

where

κ_{n}

are small constants and

Δ x_{n}^{a} = {x^{'}}_{n}^{a} - x_{n}^{a}

. The result is a product of Gaussians,

Q (x^{'} | x) \propto \exp - \frac{1}{2} \sum_{n} α_{n} δ_{a b} Δ x_{n}^{a} Δ x_{n}^{b},

(4)

where, to reflect translational invariance and possibly non-identical particles, the Lagrange multipliers

α_{n}

are independent of x but may depend on the index n. Eventually we will let

α_{n} \to \infty

to implement infinitesimally short steps. Next we specify the constraints that are specific to each particular short step.

The drift potential constraint. In Newtonian dynamics one does not need to explain why a particle perseveres in its motion in a straight line; what demands an explanation—that is, a force—is why the particle deviates from inertial motion. In ED one does not require an explanation for why the particles move; what requires an explanation is how the motion can be both directional and highly correlated. This physical information is introduced through one constraint that acts simultaneously on all particles. The constraint involves a function

ϕ (x) = ϕ (x_{1} \dots x_{N})

on configuration space

X_{N}

that we call the “drift” potential. We impose that the displacements

Δ x_{n}^{a}

are such that the expected change of the drift potential

〈Δ ϕ〉

is constrained to be

〈Δ ϕ〉 = \sum_{n = 1}^{N} 〈 Δ x_{n}^{a} 〉 \frac{\partial ϕ}{\partial x_{n}^{a}} = κ^{'} (x),

(5)

where

κ^{'} (x)

is another small but for now unspecified function. As we shall later see this information is already sufficient to construct an interesting ED. However, to reproduce the particular dynamics that describes quantum systems we must further require that the potential

ϕ (x)

be a multi-valued function with the topological properties of an angle—

ϕ

and

ϕ + 2 π

represent the same angle [84].

The physical origin of the drift potential

ϕ (x)

is at this point unknown so how can one justify its introduction? The idea is that identifying the relevant constraints can represent significant progress even when their physical origin remains unexplained. Indeed, with the single assumption of a constraint involving a drift potential we will explain and coordinate several features of quantum mechanics such as entanglement, the existence of complex and symplectic structures, the actual form of the Hamiltonian, and the linearity of the Schrödinger equation.

The gauge constraints. The single constraint (5) already leads to a rich entropic dynamics but by imposing additional constraints we can construct even more realistic models. To incorporate the effect of an external electromagnetic field we impose that for each particle n the expected displacement

〈 Δ x_{n}^{a} 〉

will satisfy

〈 Δ x_{n}^{a} 〉 A_{a} (x_{n}) = κ_{n}^{″} for n = 1 \dots N,

(6)

where the electromagnetic vector potential

A_{a} (x_{n})

is a field that lives in the 3-dimensional physical space (

x_{n} \in X

). The strength of the coupling is given by the values of the

κ_{n}^{″}

. These quantities could be specified directly but, as is often the case in entropic inference, it is much more convenient to specify them indirectly in terms of the corresponding Lagrange multipliers.

The transition probability. An important feature of the ED model can already be discerned. The central object of the discussion so far, the transition probability

P (x^{'} | x)

, codifies information supplied through the prior and the constraints which makes no reference to anything earlier than the initial position x. Therefore ED must take the form of a Markov process.

The distribution

P (x^{'} | x)

that maximizes the entropy

S [P, Q]

in (1) relative to (4) and subject to (5), and (6), and normalization is

P (x^{'} | x) = \frac{1}{Z} \exp - \sum_{n} (\frac{α_{n}}{2} δ_{a b} Δ x_{n}^{a} Δ x_{n}^{b} - α^{'} [\partial_{n a} ϕ - β_{n} A_{a} (x_{n})] Δ x_{n}^{a})

(7)

where

α^{'}

and

β_{n}

are Lagrange multipliers. This is conveniently written as

P (x^{'} | x) = \frac{1}{Z} \exp - \sum_{n} \frac{α_{n}}{2} δ_{a b} (Δ x_{n}^{a} - Δ {\bar{x}}_{n}^{a}) (Δ x_{n}^{b} - Δ {\bar{x}}_{n}^{b}),

(8)

with a suitably modified normalization and

Δ {\bar{x}}_{n}^{a} = \frac{α^{'}}{α_{n}} [\partial_{n a} ϕ - β_{n} A_{a} (x_{n})] = 〈 Δ x_{n}^{a} 〉 .

(9)

A generic displacement is expressed as a drift plus a fluctuation,

Δ x_{n}^{a} = 〈 Δ x_{n}^{a} 〉 + Δ w_{n}^{a},

(10)

where

〈 Δ w_{n}^{a} 〉 = 0, and 〈 Δ w_{n}^{a} Δ w_{n^{'}}^{b} 〉 = \frac{1}{α_{n}} δ_{n n^{'}} δ^{a b},

(11)

The fact that the constraints (5) and (6) are not independent—both involve the same displacements

〈 Δ x_{n}^{a} 〉

—has turned out to be significant. We can already see in (7) and (9) that it leads to a gauge symmetry. As we shall later see the vector potential

A_{a}

will be interpreted as the corresponding gauge connection field and the multipliers

β_{n}

will be related to the electric charges through

β_{n} = q_{n} / ℏ c

.

3. Entropic Time

The task of iterating the short steps described by the transition probability (8) to predict motion over finite distances leads us to introduce a book-keeping parameter t, to be called time, in order to keep track of the accumulation of short steps. The construction of time involves three ingredients: (a) we must specify what we mean by an ‘instant’; (b) these instants must be ordered; and finally; (c) one must specify the interval

Δ t

between successive instants—one must define ‘duration’.

Since the foundation for any theory of time is the theory of change, i.e., the dynamics, the notion of time constructed below will reflect the inferential nature of entropic dynamics. Such a construction we will call “entropic” time [10]. Later we will return to the question of whether and how this “entropic” time is related to the “physical” time that is measured by clocks.

3.1. Time as an Ordered Sequence of Instants

ED consists of a succession of short steps. Consider, for example, the ith step which takes the system from

x = x_{i - 1}

to

x^{'} = x_{i}

. Integrating the joint probability,

P (x_{i}, x_{i - 1})

, over

x_{i - 1}

gives

P (x_{i}) = \int d x_{i - 1} P (x_{i}, x_{i - 1}) = \int d x_{i - 1} P (x_{i} | x_{i - 1}) P (x_{i - 1}) .

(12)

No physical assumptions were involved in deriving this equation; it follows directly from the laws of probability. To establish the connection to time and dynamics we will make the physical assumption that if

P (x_{i - 1})

is interpreted as the probability of different values of

x_{i - 1}

at one “instant” labelled t, then we will interpret

P (x_{i})

as the probability of values of

x_{i}

at the next “instant” labelled

t^{'}

. More explicitly, if we write

P (x_{i - 1}) = ρ_{t} (x)

and

P (x_{i}) = ρ_{t^{'}} (x^{'})

then we have

ρ_{t^{'}} (x^{'}) = \int d x P (x^{'} | x) ρ_{t} (x) .

(13)

This equation defines the notion of “instant”: if the distribution

ρ_{t} (x)

refers to one instant t, then the distribution

ρ_{t^{'}} (x^{'})

generated by

P (x^{'} | x)

defines what we mean by the “next” instant

t^{'}

. Iterating this process defines the dynamics.

This construction of time is intimately related to information and inference. An instant is an informational state that is complete in the sense that it is specified by the information—codified into the distributions

ρ_{t} (x)

and

P (x^{'} | x)

—that is sufficient for predicting the next instant. Thus, the present is defined through a sufficient amount of information such that given the present, the future is independent of the past.

In the ED framework the notions of instant and of simultaneity are intimately related to the distribution

ρ_{t} (x)

. To see how this comes about consider a single particle at the point

\vec{x} = (x^{1}, x^{2}, x^{3})

. It is implicit in the notation that

x^{1}

,

x^{2}

, and

x^{3}

occur simultaneously. When we describe a system of N particles by a single point

x = ({\vec{x}}_{1}, {\vec{x}}_{2}, \dots {\vec{x}}_{N})

in

3 N

-dimensional configuration space it is also implicitly assumed that all the

3 N

coordinate values refer to the same instant; they are simultaneous. The very idea of a point in configuration space assumes simultaneity. And furthermore, whether we deal with one particle or many, a distribution such as

ρ_{t} (x)

is meant to describe our uncertainty about the possible configurations x of the system at the given instant. Thus, a probability distribution

ρ_{t} (x)

provides a criterion of simultaneity [85].

3.2. The Arrow of Entropic Time

The notion of time constructed according to Equation (13) is intrinsically directional. There is an absolute sense in which

ρ_{t} (x)

is prior and

ρ_{t^{'}} (x^{'})

is posterior. Indeed, the same rules of probability that led us to Equation (13) can also lead us to the time-reversed evolution,

ρ_{t} (x) = \int d x^{'} P (x | x^{'}) ρ_{t^{'}} (x^{'}) .

(14)

Note, however, that there is a temporal asymmetry: while the distribution

P (x^{'} | x)

, Equation (7), is a Gaussian derived using the maximum entropy method, its time-reversed version

P (x | x^{'})

is related to

P (x^{'} | x)

by Bayes’ theorem,

P (x | x^{'}) = \frac{ρ_{t} (x)}{ρ_{t^{'}} (x^{'})} P (x^{'} | x),

(15)

which in general will not be Gaussian.

The puzzle of the arrow of time (see e.g., [86,87]) arises from the difficulty in deriving a temporal asymmetry from underlying laws of nature that are symmetric. The ED approach offers a fresh perspective on this topic because it does not assume any underlying laws of nature—whether they be symmetric or not. The asymmetry is the inevitable consequence of constructing time in a dynamics driven by entropic inference.

From the ED point of view the challenge does not consist of explaining the arrow of time—entropic time itself only flows forward—but rather in explaining how it comes about that despite the arrow of time some laws of physics, such as the Schrödinger equation, turn out to be time reversible. We will revisit this topic in Section 9.

3.3. Duration and the Sub-Quantum Motion

We have argued that the concept of time is intimately connected to the associated dynamics but at this point neither the transition probability

P (x^{'} | x)

that specifies the dynamics nor the corresponding entropic time have been fully defined yet. It remains to specify how the multipliers

α_{n}

and

α^{'}

are related to the interval

Δ t

between successive instants.

The basic criterion for this choice is convenience: duration is defined so that motion looks simple. The description of motion is simplest when it reflects the symmetry of translations in space and time. In a flat space-time this leads to an entropic time that resembles Newtonian time in that it flows “equably everywhere and everywhen.” Referring to Equations (9) and (11) we choose

α^{'}

and

α_{n}

to be independent of x and t, and we choose the ratio

α^{'} / α_{n} \propto Δ t

so that there is a well-defined drift velocity. For future convenience the proportionality constants will be expressed in terms of some particle-specific constants

m_{n}

,

\frac{α^{'}}{α_{n}} = \frac{ℏ}{m_{n}} Δ t,

(16)

where ℏ is an overall constant that fixes the units of the

m_{n}

s relative to the units of time. As we shall later see, the constants

m_{n}

will eventually be identified with the particle masses while the constant ℏ will be identified as Planck’s constant. Having specified the ratio

α^{'} / α_{n}

it remains to specify

α_{n}

(or

α^{'}

). It turns out that the choice is not unique. There is a variety of motions at the sub-quantum “microscopic” level that lead to the same quantum mechanics at the “macroscopic” level.

In previous work [10,11,12] we chose

α_{n}

proportional to

1 / Δ t

. This led to an ED in which the particles follow the highly irregular non-differentiable Brownian trajectories characteristic of an Einstein–Smoluchowski process. The first new contribution of this paper is to explore the consequences of choosing

α_{n} \propto 1 / Δ t^{3}

,

α_{n} = \frac{m_{n}}{η Δ t^{3}},

(17)

where a new constant

η

is introduced.

It is convenient to introduce a notation tailored to configuration space. Let

x^{A} = x_{n}^{a}

,

\partial_{A} = \partial / \partial x_{n}^{a}

, and

δ_{A B} = δ_{n n^{'}} δ_{a b}

, where the upper case indices

A, B, \dots

label both the particles

n, n^{'}, \dots

and their coordinates

a, b, \dots

. Then the transition probability (8) becomes

P (x^{'} | x) = \frac{1}{Z} \exp [- \frac{1}{2 η Δ t} m_{A B} (\frac{Δ x^{A}}{Δ t} - v^{A}) (\frac{Δ x^{B}}{Δ t} - v^{B})],

(18)

where we used (9) to define the drift velocity,

v^{A} = \frac{〈 Δ x^{A} 〉}{Δ t} = m^{A B} [\partial_{B} Φ - {\bar{A}}_{B}] .

(19)

The drift potential is rescaled into a new variable

Φ = ℏ ϕ

(20)

which will be called the phase. We also introduced the “mass” tensor and its inverse,

m_{A B} = m_{n} δ_{A B} = m_{n} δ_{n n^{'}} δ_{a b} and m^{A B} = \frac{1}{m_{n}} δ^{A B},

(21)

and

{\bar{A}}_{A}

is a field in configuration space with components,

{\bar{A}}_{A} (x) = ℏ β_{n} A_{a} (x_{n}),

(22)

A generic displacement is then written as a drift plus a fluctuation,

Δ x^{A} = v^{A} Δ t + Δ w^{A},

(23)

and the fluctuations

Δ w^{A}

are given by

〈 Δ w^{A} 〉 = 0 and 〈 Δ w^{A} Δ w^{B} 〉 = η m^{A B} Δ t^{3},

(24)

or

〈(\frac{Δ x^{A}}{Δ t} - v^{A}) (\frac{Δ x^{B}}{Δ t} - v^{B})〉 = η m^{A B} Δ t .

(25)

It is noteworthy that

〈 Δ x^{A} 〉 \sim O (Δ t)

and

Δ w^{A} \sim O (Δ t^{3 / 2})

. This means that for short steps the fluctuations are negligible and the dynamics is dominated by the drift. The particles follow trajectories that are indeterministic but differentiable. Since

Δ w^{A} \sim O (Δ t^{3 / 2})

the limit

V^{A} = lim_{Δ t \to 0} \frac{Δ x^{A}}{Δ t} = v^{A}

(26)

is well defined. In words: the actual velocities of the particles coincide with the expected or drift velocities. From Equation (19) we see that these velocities are continuous functions. The question of whether the velocities themselves are differentiable or not is trickier.

Consider two successive displacements

Δ x = x^{'} - x

followed by

Δ x^{'} = x^{″} - x^{'}

. The velocities are

V^{A} = \frac{Δ x^{A}}{Δ t} and V^{'}^{A} = \frac{Δ x^{'}^{A}}{Δ t} .

(27)

The change in velocity is given by a Langevin equation,

Δ V^{A} = {〈 Δ V^{A} 〉}_{x^{″} x^{'}} + Δ U^{A},

(28)

where

{〈 \cdot 〉}_{x^{″} x^{'}}

denotes taking the expectations over

x^{″}

using

P (x^{″} | x^{'})

, and then over

x^{'}

using

P (x^{'} | x)

, and

Δ U^{A}

is a fluctuation. It is straightforward to show that

{〈 Δ V^{A} 〉}_{x^{″} x^{'}} = (\partial_{t} v^{A} + v^{B} \partial_{B} v^{A}) Δ t,

(29)

so that the expected acceleration is given by the convective derivative of the velocity field along itself,

lim_{Δ t \to 0} \frac{〈 Δ V^{A} 〉}{Δ t} = (\partial_{t} + v^{B} \partial_{B}) v^{A} .

(30)

One can also show that

{〈 Δ U^{A} 〉}_{x^{″} x^{'}} = 0, and {〈 Δ U^{A} Δ U^{B} 〉}_{x^{″} x^{'}} = 2 η m^{A B} Δ t,

(31)

which means that

Δ U

is a Wiener process and we deal with a Brownian motion of the Oernstein–Uhlenbeck type.

We conclude this section with some general remarks.

On the nature of clocks. In Newtonian mechanics time is defined to simplify the dynamics. The prototype of a clock is a free particle which moves equal distances in equal times. In ED time is also defined to simplify the dynamics of free particles (for sufficiently short times all particles are free) and the prototype of a clock is a free particle too: as we see in (23) the particle’s mean displacement increases by equal amounts in equal times.

On the nature of mass. In standard quantum mechanics, “what is mass?” and “why quantum fluctuations?” are two independent mysteries. In ED the mystery is somewhat alleviated: as we see in Equation (25) mass and fluctuations are two sides of the same coin. Mass is an inverse measure of the velocity fluctuations.

The information metric of configuration space. In addition to defining the dynamics the transition probability Equation (18) serves to define the geometry of the N-particle configuration space,

X_{N}

. Since the physical single particle space

X

is described by the Euclidean metric

δ_{a b}

we can expect that the N-particle configuration space,

X_{N} = X \times \dots \times X

, will also be flat, but for non-identical particles a question might be raised about the relative scales or weights associated with each

X

factor. Information geometry provides the answer.

The fact that to each point

x \in X_{N}

there corresponds a probability distribution

P (x^{'} | x)

means that to the space

X_{N}

we can associate a statistical manifold the geometry of which (up to an overall scale factor) is uniquely determined by the information metric [17,65],

γ_{A B} = \int d x^{'} P (x^{'} | x) \frac{\partial \log P (x^{'} | x)}{\partial x^{A}} \frac{\partial \log P (x^{'} | x)}{\partial x^{B}} .

(32)

Substituting Equations (18) into (32) yields

γ_{A B} = \frac{1}{η Δ t^{3}} m_{A B} .

(33)

The divergence as

Δ t \to 0

arises because the information metric measures statistical distinguishability. As

Δ t \to 0

the distributions

P (x^{'} | x)

and

P (x^{'} | x + Δ x)

become more sharply peaked and increasingly easier to distinguish so that

γ_{A B} \to \infty

. Thus, up to a scale factor the metric of configuration space is basically the mass tensor.

The practice of describing a many-particle system as a single point in an abstract configuration space goes back to the work of H. Hertz in 1894 [88]. Historically the choice of the mass tensor as the metric of configuration space has been regarded as being convenient but of no particular significance. We can now see that the choice is not just a merely useful convention: up to an overall scale the metric follows uniquely from information geometry. Furthermore, it suggests the intriguing possibility of a deeper connection between kinetic energy and information geometry.

Invariance under gauge transformations. The fact that constraints (5) and (6) are not independent—they are both linear in the same displacements

〈 Δ x_{n}^{a} 〉

—leads to a gauge symmetry. This is evident in Equation (7) where

ϕ

and

A_{a}

appear in the combination

\partial_{n a} ϕ - β_{n} A_{a}

which is invariant under the gauge transformations,

\begin{matrix} A_{a} (x_{n}) & \to & A_{a}^{'} (x_{n}) = A_{a} (x_{n}) + \partial_{a} χ (x_{n}), \end{matrix}

(34)

\begin{matrix} ϕ (x) & \to & ϕ^{'} (x) = ϕ (x) + \sum_{n} β_{n} χ (x_{n}) . \end{matrix}

(35)

These transformations are local in 3d-space. Introducing

\bar{χ} (x) = \sum_{n} ℏ β_{n} χ (x_{n}),

(36)

they can be written in the N-particle configuration space,

\begin{matrix} {\bar{A}}_{A} (x) & \to & {\bar{A}}_{A}^{'} (x) = {\bar{A}}_{A} (x) + \partial_{A} \bar{χ} (x), \end{matrix}

(37)

\begin{matrix} Φ (x) & \to & Φ^{'} (x) = Φ (x) + \bar{χ} (x) . \end{matrix}

(38)

Interpretation: The drift potential

ϕ (x) = ϕ ({\vec{x}}_{1}, {\vec{x}}_{2}, \dots)

is assumed to be an “angle”–

ϕ (x)

and

ϕ (x) + 2 π

are meant to describe the same angle. The angle at

{\vec{x}}_{1}

depends on the values of all the other positions

{\vec{x}}_{2}, {\vec{x}}_{3}, \dots

, and the angle at

{\vec{x}}_{2}

depends on the values of all the other positions

{\vec{x}}_{1}, {\vec{x}}_{3}, \dots

, and so on. The fact that the origins from which these angles are measured can be redefined by different amounts at different places gives rise to a local gauge symmetry. To compare angles at different locations one introduces a connection field, the vector potential

A_{a} (\vec{x})

. It defines which origin at

\vec{x} + Δ \vec{x}

is the “same” as the origin at

\vec{x}

. This is implemented by imposing that as we change origins and

Φ (x)

changes to

Φ + \bar{χ}

then the connection transforms as

A_{a} \to A_{a} + \partial_{a} χ

so that the quantity

\partial_{A} Φ - {\bar{A}}_{A}

remains invariant.

A fractional Brownian motion? The choices

α_{n} \propto 1 / Δ t

and

α_{n} \propto 1 / Δ t^{3}

lead to Einstein–Smoluchowski and Oernstein–Uhlenbeck processes, respectively. For definiteness throughout the rest of this paper we will assume that the sub-quantum motion is an OU process but more general fractional Brownian motions [89] are possible. Consider

α_{n} = \frac{m_{n}}{η Δ t^{γ}},

(39)

where

γ

is some positive parameter. The corresponding transition probability (8),

P (x^{'} | x) = \frac{1}{Z} \exp [- \frac{1}{2 η Δ t^{γ}} m_{A B} (Δ x^{A} - v^{A} Δ t) (Δ x^{B} - v^{B} Δ t)],

(40)

leads to fluctuations such that

〈 Δ w^{A} 〉 = 0 and 〈 Δ w^{A} Δ w^{B} 〉 = η m^{A B} Δ t^{γ},

(41)

or

〈(\frac{Δ x^{A}}{Δ t} - v^{A}) (\frac{Δ x^{B}}{Δ t} - v^{B})〉 = η m^{A B} Δ t^{γ - 2} .

(42)

We will not pursue this topic further except to note that since

〈 Δ x^{A} 〉 \sim O (Δ t)

and

Δ w^{A} \sim O (Δ t^{γ / 2})

for

γ < 2

the sub-quantum motion is dominated by fluctuations and the trajectories are non-differentiable, while for

γ > 2

the drift dominates and velocities are well defined.

4. The Evolution Equation in Differential Form

Entropic dynamics is generated by iterating Equation (13): given the information that defines one instant, the integral Equation (13) is used to construct the next instant. As so often in physics it is more convenient to rewrite the equation of evolution in differential form. The result is

\partial_{t} ρ = - \partial_{A} (v^{A} ρ),

(43)

where

v^{A}

is given by (19). Before we proceed to its derivation we note that Equation (43) is a consequence of the fact that the particles follow continuous paths. Accordingly, we will follow standard practice and call it the continuity equation. Also note that in the OU process considered here (

γ = 3

) the current velocity—the velocity with which the probability flows in configuration space— coincides with the drift velocity (19) and with the actual velocities of the particles (26) [90].

Next we derive (43) using a technique that is well known in diffusion theory [91]. (For an alternative derivation see [92].) The result of building up a finite change from an initial time

t_{0}

to a later time t leads to the distribution

ρ (x, t) = \int d x_{0} P (x, t | x_{0}, t_{0}) ρ (x_{0}, t_{0}),

(44)

where the finite-time transition probability,

P (x, t | x_{0}, t_{0})

, is constructed by iterating the infinitesimal changes described in Equation (13),

P (x, t + Δ t | x_{0}, t_{0}) = \int d z P (x, t + Δ t | z, t) P (z, t | x_{0}, t_{0}) .

(45)

For small times

Δ t

the distribution

P (x, t + Δ t | z, t)

, given in Equation (18), is very sharply peaked at

x = z

. In fact, as

Δ t \to 0

we have

P (x, t + Δ t | z, t) \to δ (x - z)

. Such singular behavior cannot be handled directly by Taylor expanding in z about the point x. Instead one follows an indirect procedure. Multiply by a smooth test function

f (x)

and integrate over x,

\int d x P (x, t + Δ t | x_{0}, t_{0}) f (x) = \int d z [\int d x P (x, t + Δ t | z, t) f (x)] P (z, t | x_{0}, t_{0}) .

(46)

The test function

f (x)

is assumed sufficiently smooth precisely so that it can be expanded about z. Then as

Δ t \to 0

the integral in the brackets, dropping all terms of order higher than

Δ t

, is

\begin{matrix} [\dots] & = \int d x P (x, t + Δ t | z, t) (f (z) + \frac{\partial f}{\partial z^{A}} (x^{A} - z^{A}) + \dots) \\ = f (z) + v^{A} (z) Δ t \frac{\partial f}{\partial z^{A}} + \dots \end{matrix}

(47)

where we used Equation (23). Next substitute (47) into the right hand side of (46), divide by

Δ t

, and let

Δ t \to 0

. Since

f (x)

is arbitrary the result is

\partial_{t} P (x, t | x_{0}, t_{0}) = - \partial_{A} [v^{A} (x) P (x, t | x_{0}, t_{0})],

(48)

which is the continuity equation for the finite-time transition probability. Differentiating Equation (44) with respect to t, and substituting (48) completes the derivation of the continuity Equation (43).

The continuity Equation (43) can be written in another equivalent but very suggestive form involving functional derivatives. For some suitably chosen functional

\tilde{H} [ρ, Φ]

we have

\partial_{t} ρ (x) = - \partial_{A} [ρ m^{A B} (\partial_{B} Φ - {\bar{A}}_{B})] = \frac{δ \tilde{H}}{δ Φ (x)} .

(49)

It is easy to check that the appropriate functional

\tilde{H}

is

\tilde{H} [ρ, Φ] = \int d x \frac{1}{2} ρ m^{A B} (\partial_{A} Φ - {\bar{A}}_{A}) (\partial_{B} Φ - {\bar{A}}_{B}) + F [ρ],

(50)

where the unspecified functional

F [ρ]

is an integration constant [93].

The continuity Equation (49) describes a somewhat peculiar OU Brownian motion in which the probability density

ρ (x)

is driven by the non-dynamical fields Φ, and

\bar{A}

. This is an interesting ED in its own right but it is not QM. Indeed, a quantum dynamics consists in the coupled evolution of two dynamical fields: the density

ρ (x)

and the phase of the wave function. This second field can be naturally introduced into ED by allowing the phase field

Φ

in (19) to become dynamical which amounts to an ED in which the constraint (5) is continuously updated at each instant in time. Our next topic is to propose the appropriate updating criterion. It yields an ED in which the phase field Φ guides the evolution of

ρ

, and in return, the evolving

ρ

reacts back and induces a change in Φ.

5. The Epistemic Phase Space

In ED we deal with two configuration spaces. One is the ontic configuration space

X_{N} = X \times X \times \dots

of all particle positions,

x = (x_{1} \dots x_{N}) \in X_{N}

. The other is the epistemic configuration space or e-configuration space P of all normalized probabilities,

P = \{ρ |ρ (x) \geq 0; \int d x ρ (x) = 1\} .

(51)

To formulate the coupled dynamics of

ρ

and Φ we need a framework to study paths in the larger space

{ρ, Φ}

that we will call the epistemic phase space or e-phase space.

Given any manifold such as P the associated tangent and cotangent bundles, respectively

T P

and

T^{*} P

, are geometric objects that are always available to us independently of any physical considerations. Both are manifolds in their own right but the cotangent bundle

T^{*} P

—the space of all probabilities and all covectors—is of particular interest because it comes automatically endowed with a rich geometrical structure [56,57,58,59,60,61,62]. The point is that cotangent bundles are symplectic manifolds and this singles out as “natural” those dynamical laws that happen to preserve some privileged symplectic form. This observation will lead us to identify e-phase space

{ρ, Φ}

with the cotangent bundle

T^{*} P

and provides the natural criterion for updating constraints, that is, for updating the phase

Φ

[94].

5.1. Notation: Vectors, Covectors, Etc.

A point

X \in T^{*} P

will be represented as

X = (ρ (x), π (x)) = (ρ^{x}, π_{x}),

(52)

where

ρ^{x}

represents coordinates on the base manifold

P

and

π_{x}

represents some generic coordinates on the space

T^{*} P_{ρ}

that is cotangent to P at the point

ρ

. Curves in

T^{*} P

allow us to define vectors. Let

X = X (λ)

be a curve parametrized by

λ

, then the vector

\bar{V}

tangent to the curve at

X = (ρ, π)

has components

d ρ^{x} / d λ

and

d π_{x} / d λ

, and is written

\bar{V} = \frac{d}{d λ} = \int d x [\frac{d ρ^{x}}{d λ} \frac{δ}{δ ρ^{x}} + \frac{d π_{x}}{d λ} \frac{δ}{δ π_{x}}],

(53)

where

δ / δ ρ^{x}

and

δ / δ π_{x}

are the basis vectors. The directional derivative of a functional

F [X]

along the curve

X (λ)

is

\frac{d F}{d λ} = \tilde{\nabla} F [\bar{V}] = \int d x [\frac{δ F}{δ ρ^{x}} \frac{d ρ^{x}}{d λ} + \frac{δ F}{δ π_{x}} \frac{d π_{x}}{d λ}],

(54)

where

\tilde{\nabla}

is the functional gradient in

T^{*} P

, i.e., the gradient of a generic functional

F [X] = F [ρ, π]

is

\tilde{\nabla} F = \int d x [\frac{δ F}{δ ρ^{x}} \tilde{\nabla} ρ^{x} + \frac{δ F}{δ π_{x}} \tilde{\nabla} π_{x}] .

(55)

The tilde ‘˜’ serves to distinguish the functional gradient

\tilde{\nabla}

from the spatial gradient

\nabla f = \partial_{a} f \nabla x^{a}

on

R^{3}

.

The fact that the space P is constrained to normalized probabilities means that the coordinates

ρ^{x}

are not independent. This technical difficulty is handled by embedding the ∞-dimensional manifold

P

in a (∞ + 1)-dimensional manifold

P^{+ 1}

where the coordinates

ρ^{x}

are unconstrained [95]. Thus, strictly,

\tilde{\nabla} F

is a covector on

T^{*} P^{+ 1}

, i.e.,

\tilde{\nabla} F \in T^{*} {(T^{*} P^{+ 1})}_{X}

and

\tilde{\nabla} ρ^{x}

and

\tilde{\nabla} π_{x}

are the corresponding basis covectors. Nevertheless, the gradient

\tilde{\nabla} F

will yield the desired directional derivatives (54) on

T^{*} P

provided its action is restricted to vectors

\bar{V}

that are tangent to the manifold

P

. Such tangent vectors are constrained to obey

\frac{d}{d λ} \int d x ρ (x) = \int d x \frac{d ρ^{x}}{d λ} = 0 .

(56)

Instead of keeping separate track of the

ρ^{x}

and

π_{x}

coordinates it is more convenient to combine them into a single index. A point

X = (ρ, π)

will then be labelled by its coordinates

X^{I} = (X^{1 x}, X^{2 x}) = (ρ^{x}, π_{x}) .

(57)

We will use capital letters from the middle of the Latin alphabet (

I, J, K \dots

);

I = (α, x)

is a composite index where

α = 1, 2

keeps track of whether x is an upper index (

α = 1

) or a lower index (

α = 2

) [96]. Then Equations (53)–(55) are written as

\bar{V} = V^{I} \frac{δ}{δ X^{I}}, where V^{I} = \frac{d X^{I}}{d λ} = [\begin{matrix} d ρ^{x} / d λ \\ d π_{x} / d λ \end{matrix}],

(58)

\frac{d F}{d λ} = \tilde{\nabla} F [\bar{V}] = \frac{δ F}{δ X^{I}} V^{I} and \tilde{\nabla} F = \frac{δ F}{δ X^{I}} \tilde{\nabla} X^{I},

(59)

where the repeated indices indicate a summation over

α

and an integration over x.

5.2. The Symplectic Form in ED

In classical mechanics with configuration space

{q^{i}}

the Lagrangian

L (q, \dot{q})

is a function on the tangent bundle while the Hamiltonian

H (q, p)

is a function on the cotangent bundle [97,98]. A symplectic form provides a mapping from the tangent to the cotangent bundles. Given a Lagrangian the map is defined by

p_{i} = \partial L / \partial {\dot{q}}^{i}

and this automatically defines the corresponding symplectic form. In ED there is no Lagrangian so to define the symplectic map we must look elsewhere. We propose that the role played by the Lagrangian in classical mechanics will in ED be played by the continuity Equation (49).

The fact that the preservation of a symplectic structure must reproduce the continuity equation leads us to identify the phase

Φ_{x}

as the momentum canonically conjugate to

ρ^{x}

. This identification of the e-phase space

{ρ, Φ}

with

T^{*} P

is highly non-trivial. It amounts to asserting that the phase

Φ_{x}

transforms as the components of a Poincare 1-form

θ = \int d x Φ_{x} d ρ^{x},

(60)

where

d

is the exterior derivative and the corresponding symplectic 2-form

Ω = - d θ

is

Ω = \int d x d ρ^{x} \land d Φ_{x} = \int d x [\tilde{\nabla} ρ^{x} \otimes \tilde{\nabla} Φ_{x} - \tilde{\nabla} Φ_{x} \otimes \tilde{\nabla} ρ^{x}] .

(61)

By construction Ω is exact (

Ω = - d θ

) and closed (

d Ω = 0

). The action of

Ω [\cdot, \cdot]

on two vectors

\bar{V} = d / d λ

and

\bar{U} = d / d μ

is given by

Ω [\bar{V}, \bar{U}] = \int d x [V^{1 x} U^{2 x} - V^{2 x} U^{1 x}] = Ω_{I J} V^{I} U^{J},

(62)

so that the components of Ω are

Ω_{I J} = Ω_{α x, β x^{'}} = [\begin{matrix} 0 & 1 \\ - 1 & 0 \end{matrix}] δ (x, x^{'}) .

(63)

5.3. Hamiltonian Flows and Poisson Brackets

Next we reproduce the ∞-dimensional

T^{*} P

analogues of results that are standard in finite-dimensional classical mechanics [97,98]. Given a vector field

\bar{V} [X]

in e-phase space we can integrate

V^{I} [X] = d X^{I} / d λ

to find its integral curves

X^{I} = X^{I} (λ)

. We are particularly interested in those vector fields that generate flows that preserve the symplectic structure,

£_{V} Ω = 0,

(64)

where the Lie derivative is given by

{(£_{V} Ω)}_{I J} = V^{K} {\tilde{\nabla}}_{K} Ω_{I J} + Ω_{K J} {\tilde{\nabla}}_{I} V^{K} + Ω_{I K} {\tilde{\nabla}}_{J} V^{K} .

(65)

Since by Equation (63) the components

Ω_{I J}

are constant,

{\tilde{\nabla}}_{K} Ω_{I J} = 0

, we can rewrite

£_{V} Ω

as

{(£_{V} Ω)}_{I J} = {\tilde{\nabla}}_{I} (Ω_{K J} V^{K}) - {\tilde{\nabla}}_{J} (Ω_{K I} V^{K}),

(66)

which is the exterior derivative (basically, the curl) of the covector

Ω_{K I} V^{K}

. By Poincare’s lemma, requiring

£_{V} Ω = 0

(a vanishing curl) implies that

Ω_{K I} V^{K}

is the gradient of a scalar function, which we will denote

\tilde{V} [X]

,

Ω_{K I} V^{K} = {\tilde{\nabla}}_{I} \tilde{V} .

(67)

Using (63) this is more explicitly written as

\int d x [\frac{d ρ^{x}}{d λ} \tilde{\nabla} Φ_{x} - \frac{d Φ_{x}}{d λ} \tilde{\nabla} ρ^{x}] = \int d x [\frac{δ \tilde{V}}{δ ρ^{x}} \tilde{\nabla} ρ^{x} + \frac{δ \tilde{V}}{δ Φ_{x}} \tilde{\nabla} Φ_{x}],

(68)

or

\frac{d ρ^{x}}{d λ} = \frac{δ \tilde{V}}{δ Φ_{x}} and \frac{d Φ_{x}}{d λ} = - \frac{δ \tilde{V}}{δ ρ^{x}},

(69)

which we recognize as Hamilton’s equations for a Hamiltonian function

\tilde{V}

. This justifies calling

\bar{V}

the Hamiltonian vector field associated with the Hamiltonian function

\tilde{V}

.

From (62), the action of the symplectic form

Ω

on two Hamiltonian vector fields

\bar{V} = d / d λ

and

\bar{U} = d / d μ

generated respectively by

\tilde{V}

and

\tilde{U}

is

Ω [\bar{V}, \bar{U}] = \int d x [\frac{d ρ^{x}}{d λ} \frac{d Φ_{x}}{d μ} - \frac{d Φ_{x}}{d λ} \frac{d ρ^{x}}{d μ}],

(70)

which, using (69), gives

Ω [\bar{V}, \bar{U}] = \int d x [\frac{δ \tilde{V}}{δ ρ^{x}} \frac{δ \tilde{U}}{δ Φ_{x}} - \frac{δ \tilde{V}}{δ Φ_{x}} \frac{δ \tilde{U}}{δ ρ^{x}}] \overset{def}{=} {\tilde{V}, \tilde{U}},

(71)

where on the right we introduced the Poisson bracket notation.

To summarize these results: (1) The condition for a flow generated by the vector field

V^{I}

to preserve the symplectic structure,

£_{V} Ω = 0

, is that

V^{I}

be the Hamiltonian vector field associated to a Hamiltonian function

\tilde{V}

, Equation (69),

V^{I} = \frac{d X^{I}}{d λ} = {X^{I}, \tilde{V}} .

(72)

(2) The action of Ω on two Hamiltonian vector fields (71) is the Poisson bracket of the associated Hamiltonian functions,

Ω [\bar{V}, \bar{U}] = Ω_{I J} V^{I} U^{J} = {\tilde{V}, \tilde{U}} .

(73)

We conclude that the ED that preserves the symplectic structure Ω and reproduces the continuity Equation (49) is described by the Hamiltonian flow of the scalar functional

\tilde{H}

in (50). However, the full dynamics, which will obey the Hamiltonian evolution equations

\partial_{t} ρ^{x} = \frac{δ \tilde{H}}{δ Φ_{x}} and \partial_{t} Φ_{x} = - \frac{δ \tilde{H}}{δ ρ^{x}},

(74)

is not yet fully determined because the integration constant

F [ρ]

in (50) remains to be specified.

5.4. The Normalization Constraint

Since the particular flow that we will associate with time evolution is required to reproduce the continuity equation it will also preserve the normalization constraint,

\tilde{N} = 0 where \tilde{N} = 1 - |ρ| and |ρ| \overset{def}{=} \int d x ρ (x) .

(75)

Indeed, one can check that

\partial_{t} \tilde{N} = {\tilde{N}, \tilde{H}} = 0 .

(76)

The Hamiltonian flow (72) generated by

\tilde{N}

and parametrized by

α

is given by the vector field

\bar{N} = N^{I} \frac{δ}{δ X^{I}} with N^{I} = \frac{d X^{I}}{d α} = {X^{I}, \tilde{N}},

(77)

or, more explicitly,

N^{1 x} = \frac{d ρ^{x}}{d α} = 0 and N^{2 x} = \frac{d Φ_{x}}{d α} = 1 .

(78)

The conservation of

\tilde{N}

, Equation (76), implies that

\tilde{N}

is the generator of a symmetry, namely

\frac{d \tilde{H}}{d α} = {\tilde{H}, \tilde{N}} = 0 .

(79)

Integrating (78) one finds the integral curves generated by

\tilde{N}

,

ρ^{x} (α) = ρ^{x} (0) and Φ_{x} (α) = Φ_{x} (0) + α .

(80)

This shows that the symmetry generated by

\tilde{N}

is to shift the phase

Φ

by a constant

α

without otherwise changing the dynamics. This was, of course, already evident in the continuity Equation (43) with (19) but the implications are very significant. Not only does the constraint

\tilde{N} = 0

reduce by one the (infinite) number of independent

ρ^{x}

degrees of freedom but the actual number of

Φ_{x}

s is also reduced by one because for any value of

α

the phases

Φ_{x} + α

and

Φ_{x}

correspond to the same state. (This is the ED analogue of the fact that in QM states are represented by rays rather than vectors in a Hilbert space.) An immediate consequence is that two vectors

\bar{U}

and

\bar{V}

at X that differ by a vector proportional to

\bar{N}

,

\bar{U} = \bar{V} + k \bar{N},

(81)

are “physically” equivalent. In particular the vector

\bar{N}

is equivalent to zero.

The phase space of interest is

T^{*} P

but to handle the constraint

| ρ | = 1

we have been led to using coordinates that are more appropriate to the larger embedding space

T^{*} P^{+ 1}

. The price we pay for introducing one superfluous coordinate is to also introduce a superfluous momentum. We eliminate the extra coordinate by imposing the constraint

\tilde{N} = 0

. We eliminate the extra momentum by declaring it unphysical. All vectors that differ by a vector along the gauge direction

\bar{N}

are declared equivalent; they belong to the same equivalence class. The result is a global gauge symmetry.

An equivalence class can be represented by any one of its members and choosing a convenient representative amounts to fixing the gauge. As we shall see below a convenient gauge condition is to impose

\int d x ρ^{x} V^{2 x} = 0 or 〈 V^{2} 〉 = 0,

(82)

so that the representative “Tangent Gauge-Fixed” vectors (which we shall refer to as TGF vectors) will satisfy two conditions, Equations (56) and (82),

| V^{1} | = \int d x V^{1 x} = 0 and 〈 V^{2} 〉 = \int d x ρ^{x} V^{2 x} = 0 .

(83)

The first condition enforces a flow tangent to the

|ρ| = 1

surface; the second eliminates a superfluous vector component along the gauge direction

\bar{N}

.

We end this section with a comment on the symplectic form Ω which is non-degenerate on

T^{*} P^{+ 1}

but at first sight appears to be degenerate on

T^{*} P

. Indeed, we have

Ω (\bar{N}, \bar{V}) = 0

for any tangent vector

\bar{V}

. However, we must recall that

\bar{N}

is equivalent to 0. In fact, since the TGF equivalent of

\bar{N}

is 0,

Ω

is not degenerate on

T^{*} P

.

6. The Information Geometry of E-Phase Space

The construction of the ensemble Hamiltonian

\tilde{H}

—or e-Hamiltonian—is motivated as follows. The goal of dynamics is to determine the evolution of the state

(ρ_{t}, Φ_{t})

. From a given initial state

(ρ_{0}, Φ_{0})

two slightly different Hamiltonians will lead to slightly different final states, say

(ρ_{t}, Φ_{t})

or

(ρ_{t} + δ ρ_{t}, Φ_{t} + δ Φ_{t})

. Will these small changes make any difference? Can we quantify the extent to which we can distinguish between two neighboring states? This is precisely the kind of question that metrics are designed to address. It is then natural that

\tilde{H}

be in some way related to some choice of metric. But although

P

is naturally endowed with a unique information metric the space

T^{*} P

has none. Thus, our next goal is to construct a metric for

T^{*} P

.

Once a metric structure is in place we can ask: does the distance between two neighboring states—the extent to which we can distinguish them—grow, stay the same, or diminish over time? There are many possibilities here but for pragmatic (and esthetic) reasons we are led to consider the simplest form of dynamics—one that preserves the metric. This leads us to study the Hamilton flows (those that preserve the symplectic structure) that are also Killing flows (those flows that preserve the metric structure).

In ED entropic time is constructed so that time (duration) is defined by a clock provided by the system itself. This leads to require that the generator

\tilde{H}

of time translations be defined in terms of the very same clock that provides the measure of time. Thus, the third and final ingredient in the construction of

\tilde{H}

is the requirement is that the e-Hamiltonian agree with (50) to reproduce the evolution of

ρ

given by the continuity Equation (49).

In this section, our goal is to transform e-phase space

T^{*} P

from a manifold that is merely symplectic to a manifold that is both symplectic and Riemannian. The implementation of the other two requirements on

\tilde{H}

—that it generates a Hamilton–Killing flow and that it agrees with the ED continuity equation—will be tackled in Section 7 and Section 8.

6.1. The Metric on the Embedding Space $T^{*} P^{+ 1}$

The configuration space

P

is a metric space. Our goal here is to extend its metric—given by information geometry—to the full cotangent bundle,

T^{*} P

. It is convenient to first recall one derivation of the information metric. In the discrete case the statistical manifold is the k-simplex

Σ = {p = (p^{0} \dots p^{k}) : \sum_{i = 0}^{k} p^{i} = 1}

. The basic idea is to find the most general metric consistent with a certain symmetry requirement. To suggest what that symmetry might be we change to new coordinates

ξ^{i} = {(p^{i})}^{1 / 2}

. In these new coordinates the equation for the k-simplex

Σ

—the normalization condition—reads

\sum_{i = 0}^{k} {(ξ^{i})}^{2} = 1

which suggests the equation of a sphere.

We take this hint seriously and declare that the k-simplex is a k-sphere embedded in a generic

(k + 1)

-dimensional spherically symmetric space

Σ^{+ 1}

[99]. In the

ξ^{i}

coordinates the metric of

Σ^{+ 1}

is of the form

d ℓ^{2} = [a (| p |) - b (| p |)] {(\sum_{i = 0}^{k} ξ^{i} d ξ^{i})}^{2} + | p | b (| p |) \sum_{i = 0}^{k} {(d ξ^{i})}^{2},

(84)

where

a (| p |)

and

b (| p |)

are two arbitrary smooth and positive functions of

| p | = \sum_{i = 0}^{k} p^{i}

. Expressed in terms of the original

p^{i}

coordinates the metric of

Σ^{+ 1}

is

d ℓ^{2} = [a (| p |) - b (| p |)] {(\sum_{i = 0}^{k} d p^{i})}^{2} + | p | b (| p |) \sum_{i = 0}^{k} \frac{1}{p^{i}} {(d p^{i})}^{2} .

(85)

The restriction to normalized states,

| p | = 1

with displacements tangent to the simplex,

\sum_{i = 0}^{k} d p^{i} = 0

, gives the information metric induced on the k-simplex

Σ

,

d ℓ^{2} = b (1) \sum_{i = 0}^{k} \frac{1}{p^{i}} {(d p^{i})}^{2} .

(86)

The overall constant

b (1)

is not important; it amounts to a choice of the units of distance.

To extend the information metric from the k-simplex

Σ

to its cotangent bundle

T^{*} Σ

we focus on the embedding spaces

Σ^{+ 1}

and

T^{*} Σ^{+ 1}

and require that

(a): the metric on $T^{*} Σ^{+ 1}$ be compatible with the metric on $Σ^{+ 1}$ ; and
(b): that the spherical symmetry of the $(k + 1)$ -dimensional space $Σ^{+ 1}$ be enlarged to full spherical symmetry for the $2 (k + 1)$ -dimensional space $T^{*} Σ^{+ 1}$ .

The simplest way to implement (a) is to follow as closely as possible the derivation that led to (85). The fact that Φ inherits from the drift potential

ϕ

the topological structure of an angle suggests introducing new coordinates,

ξ^{i} = {(p^{i})}^{1 / 2} \cos Φ_{i} / ℏ and η^{i} = {(p^{i})}^{1 / 2} \sin Φ_{i} / ℏ .

(87)

Then the normalization condition reads

| p | = \sum_{i = 0}^{k} p^{i} = \sum_{i = 0}^{k} [{(ξ^{i})}^{2} + {(η^{i})}^{2}] = 1

(88)

which suggests the equation of a

(2 k + 1)

-sphere embedded in

2 (k + 1)

dimensions. To implement (b) we take this spherical symmetry seriously. The most general metric in the embedding space that is invariant under rotations is

\begin{matrix} d ℓ^{2} & = & [a (| p |) - b (| p |)] {[\sum_{i = 0}^{k} (ξ^{i} d ξ^{i} + η^{i} d η^{i})]}^{2} \\ + | p | b (| p |) \sum_{i = 0}^{k} [{(d ξ^{i})}^{2} + {(d η^{i})}^{2}], \end{matrix}

(89)

where the two functions

a (| p |)

and

b (| p |)

are smooth and positive but otherwise arbitrary. Therefore, changing back to the

(p^{i}, Φ_{i})

coordinates, the most general rotationally invariant metric for the embedding space

T^{*} Σ^{+ 1}

is

\begin{matrix} d ℓ^{2} & = & \frac{1}{4} [a (| p |) - b (| p |)] {[\sum_{i = 0}^{k} d p^{i}]}^{2} \\ + | p | b (| p |) \frac{1}{2 ℏ} \sum_{i = 0}^{k} [\frac{ℏ}{2 p^{i}} {(d p^{i})}^{2} + \frac{2 p^{i}}{ℏ} {(d Φ_{i})}^{2}] . \end{matrix}

(90)

Generalizing from the finite-dimensional case to the ∞-dimensional case yields the metric on the spherically symmetric space

T^{*} P^{+ 1}

,

δ {\tilde{ℓ}}^{2} = A {[\int d x δ ρ_{x}]}^{2} + B \int d x [\frac{ℏ}{2 ρ_{x}} {(δ ρ_{x})}^{2} + \frac{2 ρ_{x}}{ℏ} {(δ Φ_{x})}^{2}] .

(91)

where we set

A (| ρ |) = \frac{1}{4} [a (| ρ |) - b (| ρ |)] and B (| ρ |) = \frac{1}{2 ℏ} | ρ | b (| ρ |) .

(92)

6.2. The Metric Induced on $T^{*} P$

As we saw in Section 5.4 the normalization constraint

| ρ | = 1

induces a symmetry—points with phases differing by a constant are identified. Therefore, the e-phase space

T^{*} P

can be obtained from the spherically symmetric space

T^{*} P^{+ 1}

by the restriction

| ρ | = 1

and by identifying points

(ρ_{x}, Φ_{x})

and

(ρ_{x}, Φ_{x} + α)

that lie on the same gauge orbit, or on the same ray.

Consider two neighboring points

(ρ_{x}, Φ_{x})

and

(ρ_{x}^{'}, Φ_{x}^{'})

. The metric induced on

T^{*} P

is defined as the shortest

T^{*} P^{+ 1}

distance between

(ρ_{x}, Φ_{x})

and points on the ray defined by

(ρ_{x}^{'}, Φ_{x}^{'})

. Setting

| δ ρ | = 0

the

T^{*} P^{+ 1}

distance between

(ρ_{x}, Φ_{x})

and

(ρ_{x} + δ ρ_{x}, Φ_{x} + δ Φ_{x} + δ α)

is given by

δ {\tilde{ℓ}}^{2} = B (1) \int d x [\frac{ℏ}{2 ρ_{x}} {(δ ρ_{x})}^{2} + \frac{2 ρ_{x}}{ℏ} {(δ Φ_{x} + δ α)}^{2}] .

(93)

Let

δ {\tilde{s}}^{2} = \min_{δ α} δ {\tilde{ℓ}}^{2} .

(94)

Minimizing over

δ α

gives the metric on

T^{*} P

,

δ {\tilde{s}}^{2} = \int d x [\frac{ℏ}{2 ρ_{x}} {(δ ρ_{x})}^{2} + \frac{2 ρ_{x}}{ℏ} {(δ Φ_{x} - 〈 δ Φ 〉)}^{2}],

(95)

where we set

B (1) = 1

which amounts to a choice of units of length. This metric is known as the Fubini–Study metric.

The scalar product between two vectors

\bar{V}

and

\bar{U}

is

G (\bar{V}, \bar{U}) = \int d x [\frac{ℏ}{2 ρ_{x}} V^{1 x} U^{1 x} + \frac{2 ρ_{x}}{ℏ} (V^{2 x} - 〈 V^{2} 〉) (U^{2 x} - 〈 U^{2} 〉)] .

(96)

It is at this point that we recognize the convenience of imposing the TGF gauge condition (83): the scalar product simplifies to

G (\bar{V}, \bar{U}) = \int d x [\frac{ℏ}{2 ρ_{x}} V^{1 x} U^{1 x} + \frac{2 ρ_{x}}{ℏ} V^{2 x} U^{2 x}] .

(97)

An analogous expression can be written for the length

δ \tilde{s}

of a displacement

(δ ρ_{x}, δ Φ_{x})

,

δ {\tilde{s}}^{2} = \int d x [\frac{ℏ}{2 ρ_{x}} {(δ ρ_{x})}^{2} + \frac{2 ρ_{x}}{ℏ} {(δ Φ_{x})}^{2}],

(98)

where it is understood that

(δ ρ_{x}, δ Φ_{x})

satisfies the TGF condition

| δ ρ | = 0 and 〈 δ Φ 〉 = 0 .

(99)

In index notation the metric (98) of

T^{*} P

is written as

δ {\tilde{s}}^{2} = G_{I J} δ X^{I} δ X^{J} = \int d x d x^{'} G_{α x, β x^{'}} δ X^{x α} δ X^{x^{'} β}

(100)

where the metric tensor

G_{I J}

is

G_{I J} = G_{α x, β x^{'}} = [\begin{matrix} \frac{ℏ}{2 ρ_{x}} δ_{x x^{'}} & 0 \\ 0 & \frac{2}{ℏ} ρ_{x} δ_{x x^{'}} \end{matrix}] .

(101)

The tensor

G_{I J}

in Equation (101) can act on arbitrary vectors whether they satisfy the TGF condition or not. It is only when

G_{I J}

acts on TGF vectors that it is interpreted as a metric tensor on

T^{*} P

.

6.3. A Complex Structure

Next we contract the symplectic form

Ω_{I J}

, Equation (63), with the inverse of the metric tensor,

G^{I J} = G^{α x, β x^{'}} = [\begin{matrix} \frac{2}{ℏ} ρ_{x} δ_{x x^{'}} & 0 \\ 0 & \frac{ℏ}{2 ρ_{x}} δ_{x x^{'}} \end{matrix}] .

(102)

The result is a mixed tensor J with components

J^{I}_{J} = - G^{I K} Ω_{K J} = [\begin{matrix} 0 & - \frac{2}{ℏ} ρ_{x} δ_{x x^{'}} \\ \frac{ℏ}{2 ρ_{x}} δ_{x x^{'}} & 0 \end{matrix}] .

(103)

(The reason for introducing an additional negative sign will become clear below.) The tensor

J^{I}_{J}

maps vectors to vectors—as any mixed

(1, 1)

tensor should. What makes the tensor J special is that—as one can easily check— its action on a TGF vector

\bar{V}

yields another vector

J \bar{V}

that is also TGF and, furthermore, its square is

J^{I}_{K} J^{K}_{J} = - δ_{x x^{'}} [\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}] = - δ^{I}_{J} .

(104)

In words, when acting on vectors tangent to

T^{*} P

the action of

J^{2}

(or

Ω^{2}

) is equivalent to multiplying by

- 1

. This means that J plays the role of a complex structure.

We conclude that the cotangent bundle

T^{*} P

has a symplectic structure Ω, as all cotangent bundles do; that it can be given a Riemannian structure

G_{I J}

; and that the mixed tensor J provides it with a complex structure.

6.4. Complex Coordinates

The fact that

T^{*} P

is endowed with a complex structure suggests introducing complex coordinates,

Ψ_{x} = ρ_{x}^{1 / 2} \exp i Φ_{x} / ℏ,

(105)

so that a point

Ψ \in

T^{*} P^{+ 1}

has coordinates

Ψ^{μ x} = (\binom{Ψ^{1 x}}{Ψ^{2 x}}) = (\binom{Ψ_{x}}{i ℏ Ψ_{x}^{*}}),

(106)

where the index

μ

takes two values,

μ = 1, 2

.

We can check that the transformation from real coordinates

(ρ, Φ)

to complex coordinates

(Ψ, i ℏ Ψ^{*})

is canonical. Indeed, the action of

Ω

on two infinitesimal vectors

δ X^{I}

and

δ^{'} X^{J}

is

Ω_{I J} δ X^{I} δ^{'} X^{J} = \int d x (δ ρ_{x} δ^{'} Φ_{x} - δ Φ_{x} δ^{'} ρ_{x}),

which, when expressed in Ψ coordinates, becomes

Ω_{I J} δ X^{I} δ^{'} X^{J} = \int d x (δ Ψ δ^{'} i ℏ Ψ^{*} - δ i ℏ Ψ^{*} δ^{'} Ψ) = Ω_{μ x, ν x^{'}} δ Ψ^{μ x} δ Ψ^{ν x^{'}}

(107)

where

Ω_{μ x, ν x^{'}} = [\begin{matrix} 0 & 1 \\ - 1 & 0 \end{matrix}] δ_{x x^{'}},

(108)

retains the same form as (63).

Expressed in Ψ coordinates the Hamiltonian flow generated by the normalization constraint (75),

\tilde{N} = 0 with \tilde{N} = 1 - \int d x Ψ_{x}^{*} Ψ_{x},

(109)

and parametrized by

α

is given by the vector field

\bar{N} = - (\binom{Ψ_{x} / i ℏ}{i ℏ {(Ψ_{x} / i ℏ)}^{*}}) .

(110)

Its integral curves are

Ψ_{x} (α) = Ψ_{x} (0) e^{i α / ℏ} .

(111)

The constraint

\tilde{N} = 0

induces a gauge symmetry which leads us to restrict our attention to vectors

\bar{V} = d / d λ

satisfying two real TGF conditions (83). In

Ψ

coordinates this is replaced by the single complex TGF condition,

\int d x Ψ_{x}^{*} \frac{d Ψ_{x}}{d λ} = 0 .

(112)

In Ψ coordinates the metric on

T^{*} P

, Equation (98), becomes

δ {\tilde{s}}^{2} = - 2 i \int d x δ Ψ_{x} δ i ℏ Ψ_{x}^{*} = \int d x d x^{'} G_{μ x, ν x^{'}} δ Ψ^{μ x} δ Ψ^{ν x^{'}},

(113)

where the metric tensor and its inverse are

G_{μ x, ν x^{'}} = - i δ_{x x^{'}} [\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}] and G^{μ x, ν x^{'}} = i δ_{x x^{'}} [\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}] .

(114)

Finally, using

G^{μ x, ν x^{'}}

to raise the first index of

Ω_{ν x^{'}, γ x^{″}}

gives the Ψ components of the tensor J

J^{μ x}_{γ x^{″}} \overset{def}{=} - G^{μ x, ν x^{'}} Ω_{ν x^{'}, γ x^{″}} = [\begin{matrix} i & 0 \\ 0 & - i \end{matrix}] δ_{x x^{'}} .

(115)

7. Hamilton-Killing Flows

Our next goal will be to find those Hamiltonian flows

Q^{I}

that also happen to preserve the metric tensor, i.e., we want

Q^{I}

to be a Killing vector. The condition for

Q^{I}

is

{(£_{Q} G)}_{I J} = Q^{K} {\tilde{\nabla}}_{K} G_{I J} + G_{K J} {\tilde{\nabla}}_{I} Q^{K} + G_{I K} {\tilde{\nabla}}_{J} Q^{K} = 0 .

(116)

In complex coordinates Equation (114) gives

{\tilde{\nabla}}_{K} G_{I J} = 0

, and the Killing equation simplifies to

{(£_{Q} G)}_{I J} = G_{K J} {\tilde{\nabla}}_{I} Q^{K} + G_{I K} {\tilde{\nabla}}_{J} Q^{K} = 0,

(117)

or

{(£_{Q} G)}_{μ x, ν x^{'}} = - i [\begin{matrix} \frac{δ Q^{2 x^{'}}}{δ Ψ_{x}} + \frac{δ Q^{2 x}}{δ Ψ_{x^{'}}}; & \frac{δ Q^{1 x^{'}}}{δ Ψ_{x}} + \frac{δ Q^{2 x}}{δ i ℏ Ψ_{x^{'}}^{*}} \\ \frac{δ Q^{2 x^{'}}}{δ i ℏ Ψ_{x}^{*}} + \frac{δ Q^{1 x}}{δ Ψ_{x^{'}}}; & \frac{δ Q^{1 x^{'}}}{δ i ℏ Ψ_{x}^{*}} + \frac{δ Q^{1 x}}{δ i ℏ Ψ_{x^{'}}^{*}} \end{matrix}] = 0 .

(118)

If we further require that

Q^{I}

be a Hamiltonian flow,

£_{Q} Ω = 0

, then we substitute

Q^{1 x} = \frac{δ \tilde{Q}}{δ i ℏ Ψ_{x}^{*}} and Q^{2 x} = - \frac{δ \tilde{Q}}{δ Ψ_{x}}

(119)

into (118) to get

\frac{δ^{2} \tilde{Q}}{δ Ψ_{x} δ Ψ_{x^{'}}} = 0 and \frac{δ^{2} \tilde{Q}}{δ Ψ_{x}^{*} δ Ψ_{x^{'}}^{*}} = 0 .

(120)

Therefore, to generate a flow that preserves both G and Ω the functional

\tilde{Q} [Ψ, Ψ^{*}]

must be linear in both Ψ and

Ψ^{*}

,

\tilde{Q} [Ψ, Ψ^{*}] = \int d x d x^{'} Ψ_{x}^{*} {\hat{Q}}_{x x^{'}} Ψ_{x^{'}},

(121)

where

{\hat{Q}}_{x x^{'}}

is a possibly non-local kernel. The actual Hamilton–Killing flow is

\begin{matrix} \frac{d Ψ_{x}}{d λ} & = & Q^{1 x} = \frac{δ \tilde{Q}}{δ i ℏ Ψ_{x}^{*}} = \frac{1}{i ℏ} \int d x^{'} {\hat{Q}}_{x x^{'}} Ψ_{x^{'}}, \end{matrix}

(122)

\begin{matrix} \frac{d i ℏ Ψ_{x}^{*}}{d λ} & = & Q^{2 x} = - \frac{δ \tilde{Q}}{δ Ψ_{x}} = - \int d x^{'} Ψ_{x^{'}}^{*} {\hat{Q}}_{x x^{'}} . \end{matrix}

(123)

Taking the complex conjugate of (122) and compared to (123), shows that the kernel

{\hat{Q}}_{x x^{'}}

is Hermitian,

{\hat{Q}}_{x x^{'}}^{*} = {\hat{Q}}_{x^{'} x},

(124)

and we can check that the corresponding Hamiltonian functionals

\tilde{Q}

are real,

\tilde{Q} {[Ψ, Ψ^{*}]}^{*} = \tilde{Q} [Ψ, Ψ^{*}] .

The Hamiltonian flows that might potentially be of interest are those that generate symmetry transformations. For example, the generator of translations is total momentum. Under a spatial displacement by

ε^{a}

,

g (x) \to g_{ε} (x) = g (x - ε)

, the change in

f [ρ, Φ]

is

δ_{ε} f [ρ, Φ] = \int d x (\frac{δ f}{δ ρ_{x}} δ_{ε} ρ_{x} + \frac{δ f}{δ Φ_{x}} δ_{ε} Φ_{x}) = {f, {\tilde{P}}_{a} ε^{a}}

(125)

where

{\tilde{P}}_{a} = \int d x ρ \sum_{n} \frac{\partial Φ}{\partial x_{n}^{a}} = \int d x ρ \frac{\partial Φ}{\partial X^{a}}

(126)

is interpreted as the expectation of the total momentum, and

X^{a}

are the coordinates of the center of mass,

X^{a} = \frac{1}{M} \sum_{n} m_{n} x_{n}^{a} .

(127)

In complex coordinates,

{\tilde{P}}_{a} = \int d x Ψ^{*} (\sum_{n} \frac{ℏ}{i} \frac{\partial}{\partial x_{n}^{a}}) Ψ = \int d x Ψ^{*} (\frac{ℏ}{i} \frac{\partial}{\partial X^{a}}) Ψ,

(128)

and the corresponding kernel

{\hat{P}}_{a x x^{'}}

is

{\hat{P}}_{a x x^{'}} = δ_{x x^{'}} \sum_{n} \frac{ℏ}{i} \frac{\partial}{\partial x_{n}^{a}} = δ_{x x^{'}} \frac{ℏ}{i} \frac{\partial}{\partial X^{a}} .

(129)

8. The E-Hamiltonian

In the previous sections we supplied the symplectic e-phase space

T^{*} P

with a Riemannian metric and, as a welcome by-product, also with a complex structure. Then we showed that the condition for the simplest form of dynamics—one that preserves all the metric, symplectic, and complex structures—is a Hamilton–Killing flow generated by a Hamiltonian

\tilde{H}

that is linear in both Ψ and

Ψ^{*}

,

\tilde{H} [Ψ, Ψ^{*}] = \int d x d x^{'} Ψ_{x}^{*} {\hat{H}}_{x x^{'}} Ψ_{x^{'}} .

(130)

The last ingredient in the construction of

\tilde{H}

is that the e-Hamiltonian must agree with (50) to reproduce the entropic evolution of

ρ

given by the continuity Equation (49).

To proceed we use the identity

\frac{1}{2} ρ m^{A B} (\partial_{A} Φ - {\bar{A}}_{A}) (\partial_{B} Φ - {\bar{A}}_{B}) = \frac{ℏ^{2}}{2} m^{A B} {(D_{A} Ψ)}^{*} D_{B} Ψ - \frac{ℏ^{2}}{8 ρ^{2}} m^{A B} \partial_{A} ρ \partial_{B} ρ

(131)

where

D_{A} = \partial_{A} - \frac{i}{ℏ} {\bar{A}}_{A} and {\bar{A}}_{A} (x) = ℏ β_{n} A_{a} (x_{n}) .

(132)

Rewriting

\tilde{H} [ρ, Φ]

in (50) in terms of

Ψ

and

Ψ^{*}

we get

\tilde{H} [Ψ, Ψ^{*}] = \int d x (\frac{- ℏ^{2}}{2} m^{A B} Ψ^{*} D_{A} D_{B} Ψ) + F^{'} [ρ] .

(133)

where

F^{'} [ρ] = F [ρ] - \frac{ℏ^{2}}{8 ρ^{2}} m^{A B} \partial_{A} ρ \partial_{B} ρ .

(134)

According to (121) for

\tilde{H} [Ψ, Ψ^{*}]

to generate an HK flow we must impose that

F^{'} [ρ]

be linear in both Ψ and

Ψ^{*}

,

F^{'} [ρ] = \int d x d x^{'} Ψ_{x}^{*} {\hat{V}}_{x x^{'}} Ψ_{x^{'}}

(135)

for some Hermitian kernel

{\hat{V}}_{x x^{'}}

, but

F^{'} [ρ]

must remain independent of

Φ

,

\frac{δ F^{'} [ρ]}{δ Φ_{x}} = 0 .

(136)

Substituting

Ψ = ρ^{1 / 2} e^{i Φ / ℏ}

into (135) and using

{\hat{V}}_{x^{'} x}^{*} = {\hat{V}}_{x x^{'}}

leads to

\frac{δ F^{'}}{δ Φ_{x}} = \frac{2}{ℏ} ρ_{x}^{1 / 2} \int d x^{'} ρ_{x^{'}}^{1 / 2} Im ({\hat{V}}_{x x^{'}} e^{- i (Φ_{x} - Φ_{x^{'}}) / ℏ}) = 0

(137)

This equation must be satisfied for all choices of

ρ_{x^{'}}

, which implies

Im ({\hat{V}}_{x x^{'}} e^{- i (Φ_{x} - Φ_{x^{'}}) / ℏ}) = 0,

(138)

and also for all choices of

Φ_{x}

and

Φ_{x^{'}}

. Therefore, the kernel

{\hat{V}}_{x x^{'}}

must be local in x,

{\hat{V}}_{x x^{'}} = δ_{x x^{'}} V_{x},

(139)

where

V_{x} = V (x)

is some real function.

We conclude that the Hamiltonian that generates a Hamilton–Killing flow and agrees with the ED continuity equation must be of the form

\tilde{H} [Ψ, Ψ^{*}] = \int d x Ψ^{*} (- \frac{ℏ^{2}}{2} m^{A B} D_{A} D_{B} + V (x)) Ψ .

(140)

The evolution of Ψ is given by the Hamilton equation,

\partial_{t} Ψ_{x} = {Ψ_{x}, \tilde{H}} = \frac{δ \tilde{H}}{δ (i ℏ Ψ^{*} (x))},

(141)

which is the Schrödinger equation,

i ℏ \partial_{t} Ψ = - \frac{ℏ^{2}}{2} m^{A B} D_{A} D_{B} Ψ + V Ψ .

(142)

In more standard notation it reads

i ℏ \partial_{t} Ψ = \sum_{n} \frac{- ℏ^{2}}{2 m_{n}} δ^{a b} (\frac{\partial}{\partial x_{n}^{a}} - i β_{n} A_{a} (x_{n})) (\frac{\partial}{\partial x_{n}^{b}} - i β_{n} A_{b} (x_{n})) Ψ + V Ψ .

(143)

At this point we can finally provide the physical interpretation of the various constants introduced along the way. Since the Schrödinger Equation (143) is the tool we use to analyze experimental data we can identify ℏ with Planck’s constant,

m_{n}

will be interpreted as the particles’ masses, and the

β_{n}

are related to the particles’ electric charges

q_{n}

by

β_{n} = \frac{q_{n}}{ℏ c} .

(144)

For completeness we write the Hamiltonian in the

(ρ, Φ)

variables,

\begin{matrix} \tilde{H} [ρ, Φ] & = & \int d^{3 N} x ρ [\sum_{n} \frac{δ^{a b}}{2 m_{n}} (\frac{\partial Φ}{\partial x_{n}^{a}} - \frac{q_{n}}{c} A_{a} (x_{n})) (\frac{\partial Φ}{\partial x_{n}^{b}} - \frac{q_{n}}{c} A_{b} (x_{n})) \\ + \sum_{n} \frac{ℏ^{2}}{8 m_{n}} \frac{δ^{a b}}{ρ^{2}} \frac{\partial ρ}{\partial x_{n}^{a}} \frac{\partial ρ}{\partial x_{n}^{b}} + V (x_{1} \dots x_{n})] . \end{matrix}

(145)

The Hamilton equations for

ρ

and

Φ

are the continuity equation (49),

\partial_{t} ρ = \frac{δ \tilde{H}}{δ Φ} = - \sum_{n} \frac{\partial}{\partial x_{n}^{a}} [ρ \frac{δ^{a b}}{m_{n}} (\frac{\partial Φ}{\partial x_{n}^{b}} - \frac{q_{n}}{c} A_{b} (x_{n}))],

(146)

and the quantum analogue of the Hamilton-Jacobi equation,

\begin{matrix} \partial_{t} Φ & = & - \frac{δ \tilde{H}}{δ ρ} = \sum_{n} \frac{- δ^{a b}}{2 m_{n}} (\frac{\partial Φ}{\partial x_{n}^{a}} - \frac{q_{n}}{c} A_{a} (x_{n})) (\frac{\partial Φ}{\partial x_{n}^{b}} - \frac{q_{n}}{c} A_{b} (x_{n})) \\ + \sum_{n} \frac{ℏ^{2}}{2 m_{n}} \frac{δ^{a b}}{ρ^{1 / 2}} \frac{\partial^{2} ρ^{1 / 2}}{\partial x_{n}^{a} \partial x_{n}^{b}} - V (x_{1} \dots x_{n})] . \end{matrix}

(147)

To summarize: we have just shown that an ED that preserves both the symplectic and metric structures of the e-phase space

T^{*} P

leads to a linear Schrödinger equation. In particular, such an ED reproduces the quantum potential in (147) with the correct coefficients

ℏ^{2} / 2 m_{n}

.

9. Entropic Time, Physical Time, and Time Reversal

Now that the dynamics has been fully developed we revisit the question of time. The derivation of laws of physics as examples of inference led us to introduce the notion of entropic time which includes assumptions about the concept of instant, of simultaneity, of ordering, and of duration. It is clear that entropic time is useful but is this the actual, real, “physical” time? The answer is yes. By deriving the Schrödinger equation (from which we can obtain the classical limit) we have shown that the t that appears in the laws of physics is entropic time. Since these are the equations that we routinely use to design and calibrate our clocks we conclude that what clocks measure is entropic time. No notion of time that is in any way deeper or more “physical” is needed. Most interestingly, the entropic model automatically includes an arrow of time.

The statement that the laws of physics are invariant under time reversal has nothing to do with particles travelling backwards in time. It is instead the assertion that the laws of physics exhibit a certain symmetry. For a classical system described by coordinates q and momenta p the symmetry is the statement that if

{q_{t}, p_{t}}

happens to be one solution of Hamilton’s equations then we can construct another solution

{q_{t}^{T}, p_{t}^{T}}

where

q_{t}^{T} = q_{- t} and p_{t}^{T} = - p_{- t},

(148)

but both solutions

{q_{t}, p_{t}}

and

{q_{t}^{T}, p_{t}^{T}}

describe evolution forward in time. An alternative statement of time reversibility is the following: if there is one trajectory of the system that takes it from state

{q_{0}, p_{0}}

at time

t_{0}

to state

{q_{1}, p_{1}}

at the later time

t_{1}

, then there is another possible trajectory that takes the system from state

{q_{1}, - p_{1}}

at time

t_{0}

to state

{q_{0}, - p_{0}}

at the later time

t_{1}

. The merit of this re-statement is that it makes clear that nothing needs to travel back in time. Indeed, rather than time reversal the symmetry might be more appropriately described as momentum or motion reversal.

Since ED is a Hamiltonian dynamics one can expect that similar considerations will apply to QM and indeed they do. It is straightforward to check that given one solution

{ρ_{t} (x), Φ_{t} (x)}

that evolves forward in time, we can construct another solution

{ρ_{t}^{T} (x), Φ_{t}^{T} (x)}

that is also evolving forward in time. The reversed solution is

ρ_{t}^{T} (x) = ρ_{- t} (x) and Φ_{t}^{T} (x) = - Φ_{- t} (x) .

(149)

These transformations constitute a symmetry—e.g., the transformed

Ψ_{t}^{T} (x)

is a solution of the Schrödinger equation— provided the motion of the sources of the external potentials is also reversed, i.e., the potentials

A_{a} (\vec{x}, t)

and

V (x, t)

are transformed according to

A_{a}^{T} (\vec{x}, t) = - A_{a} (\vec{x}, - t) and V^{T} (x, t) = V (x, - t) .

(150)

Expressed in terms of wave functions the time reversal transformation is

Ψ_{t}^{T} (x) = Ψ_{- t}^{*} (x) .

(151)

The proof that this is a symmetry is straightforward; just take the complex conjugate of (143), and let

t \to - t

.

10. Linearity and the Superposition Principle

The Schrödinger equation is linear, i.e., a linear combination of solutions is a solution too. However, this mathematical linearity does not guarantee the physical linearity that is usually referred to as the superposition principle. The latter is the physical assumption that if there is one experimental setup that prepares a system in the (epistemic) state

Ψ_{1}

and there is another setup that prepares the system in the state

Ψ_{2}

then, at least in principle, it is possible to construct yet a third setup that can prepare the system in the superposition

Ψ_{3} = α_{1} Ψ_{1} + α_{2} Ψ_{2},

(152)

where

α_{1}

and

α_{2}

are arbitrary complex numbers. Mathematical linearity refers to the fact that solutions can be expressed as sums of solutions. There is no implication that any of these solutions will necessarily describe physical situations. Physical linearity on the other hand—the Superposition Principle—refers to the fact that the superposition of physical solutions is also a physical solution. The point to be emphasized is the Superposition Principle is not a principle; it is a physical hypothesis that need not be universally true.

10.1. The Single-Valuedness of Ψ

The question “Why should wave functions be single-valued?” has been around for a long time. In this section we build on and extend recent work [100] to argue that the single- or multi-valuedness of the wave functions is closely related to the question of linearity and the superposition principle. Our discussion parallels that by Schrödinger [101,102]. (See also [103,104,105,106,107,108,109,110].)

To show that the mathematical linearity of (143) is not sufficient to imply the superposition principle, we argue that even when

| Ψ_{1} |^{2} = ρ_{1}

and

| Ψ_{2} |^{2} = ρ_{2}

are probabilities it is not generally true that

| Ψ_{3} |^{2}

, Equation (152), will also be a probability. Consider moving around a closed loop Γ in configuration space. Since phases

Φ (x)

can be multi-valued the corresponding wave functions could in principle be multi-valued too. Let a generic Ψ change by a phase factor,

Ψ \to Ψ^{'} = e^{i δ} Ψ,

(153)

then the superposition

Ψ_{3}

of two wave functions

Ψ_{1}

and

Ψ_{2}

changes into

Ψ_{3} \to Ψ_{3}^{'} = α_{1} e^{i δ_{1}} Ψ_{1} + α_{2} e^{i δ_{2}} Ψ_{2} .

(154)

The problem is that even if

| Ψ_{1} |^{2} = ρ_{1}

and

| Ψ_{2} |^{2} = ρ_{2}

are single-valued (because they are probability densities), the quantity

| Ψ_{3} |^{2}

need not in general be single-valued. Indeed,

| Ψ_{3} |^{2} = | α_{1} |^{2} ρ_{1} + {| α_{2} |}^{2} ρ_{2} + 2 R e [α_{1} α_{2}^{*} Ψ_{1} Ψ_{2}^{*}],

(155)

changes into

| Ψ_{3}^{'} |^{2} = | α_{1} |^{2} ρ_{1} + {| α_{2} |}^{2} ρ_{2} + 2 R e [α_{1} α_{2}^{*} e^{i (δ_{1} - δ_{2})} Ψ_{1} Ψ_{2}^{*}],

(156)

so that in general

| Ψ_{3}^{'} |^{2} \neq {| Ψ_{3} |}^{2},

(157)

which precludes the interpretation of

| Ψ_{3} |^{2}

as a probability. That is, even when the epistemic states

Ψ_{1}

and

Ψ_{2}

describe actual physical situations, their superpositions need not.

The problem does not arise when

e^{i (δ_{1} - δ_{2})} = 1 .

(158)

If we were to group the wave functions into classes each characterized by its own

δ

then we could have a limited version of the superposition principle that applies within each class. We conclude that beyond the linearity of the Schrödinger equation we have a superselection rule that restricts the validity of the superposition principle to wave functions belong to the same

δ

-class.

To find the allowed values of

δ

we argue as follows. It is natural to assume that if

{ρ, Φ}

(at some given time

t_{0}

) is a physical state then the state with reversed momentum

{ρ, - Φ}

(at the same time

t_{0}

) is an equally reasonable physical state. Basically, the idea is that if particles can be prepared to move in one direction, then they can also be prepared to move in the opposite direction. In terms of wave functions the statement is that if

Ψ_{t_{0}}

is a physically allowed initial state, then so is

Ψ_{t_{0}}^{*}

[111]. Next we consider a generic superposition

Ψ_{3} = α_{1} Ψ + α_{2} Ψ^{*} .

(159)

Is it physically possible to construct superpositions such as (159)? The answer is that while constructing

Ψ_{3}

for an arbitrary Ψ might not be feasible in practice there is strong empirical evidence that there exist no superselection rules to prevent us from doing so in principle. Indeed, it is easy to construct superpositions of wavepackets with momentum

\vec{p}

and

- \vec{p}

, or superpositions of states with opposite angular momenta,

Y_{ℓ m}

and

Y_{ℓ, - m}

. We shall assume that in principle the superpositions (159) are physically possible.

According to Equation (153) as one moves in a closed loop Γ the wave function

Ψ_{3}

will transform into

Ψ_{3}^{'} = α_{1} e^{i δ} Ψ + α_{2} e^{- i δ} Ψ^{*},

(160)

and the condition (158) for

| Ψ_{3} |^{2}

to be single-valued is

e^{2 i δ} = 1 or e^{i δ} = \pm 1 .

(161)

Thus, we are restricted to two discrete possibilities

\pm 1

. Since the wave functions are assumed sufficiently well behaved (continuous, differentiable, etc.) we conclude that they must be either single-valued,

e^{i δ} = 1

, or double-valued,

e^{i δ} = - 1

.

We conclude that the Superposition Principle appears to be valid in a sufficiently large number of cases to be a useful rule of thumb but it is restricted to single-valued (or double-valued) wave functions. The argument above does not exclude the possibility that a multi-valued wave function might describe an actual physical situation. What the argument implies is that the Superposition Principle would not extend to such states.

10.2. Charge Quantization

Next we analyze the conditions for the electromagnetic gauge symmetry to be compatible with the superposition principle. We shall confine our attention to systems that are described by single-valued wave functions (

e^{i δ} = + 1

) [112]. The condition for the wave function to be single-valued is

Δ \frac{Φ}{ℏ} = \oint_{Γ} d ℓ^{A} \partial_{A} \frac{Φ}{ℏ} = 2 π k_{Γ},

(162)

where

k_{Γ}

is an integer that depends on the loop Γ. Under a local gauge transformation

A_{a} (\vec{x}) \to A_{a} (\vec{x}) + \partial_{a} χ (\vec{x})

(163)

the phase Φ transforms according to (38),

Φ (x) \to Φ^{'} (x) = Φ (x) + \sum_{n} \frac{q_{n}}{c} χ ({\vec{x}}_{n}) .

(164)

The requirement that the gauge symmetry and the superposition principle be compatible amounts to requiring that the gauge transformed states also be single-valued,

Δ \frac{Φ^{'}}{ℏ} = \oint_{Γ} d ℓ^{A} \partial_{A} \frac{Φ^{'}}{ℏ} = 2 π k_{Γ}^{'} .

(165)

Thus, the allowed gauge transformations are restricted to functions

χ (\vec{x})

such that

\sum_{n} \frac{q_{n}}{ℏ c} \oint_{Γ} d ℓ_{n}^{a} \partial_{n a} χ ({\vec{x}}_{n}) = 2 π Δ k_{Γ}

(166)

where

Δ k_{Γ} = k_{Γ}^{'} - k_{Γ}

is an integer. Consider now a loop

γ

in which we follow the coordinates of the nth particle around some closed path in 3-dimensional space while all the other particles are kept fixed. Then

\frac{q_{n}}{ℏ c} \oint_{γ} d ℓ_{n}^{a} \partial_{a n} χ ({\vec{x}}_{n}) = 2 π Δ k_{n γ}

(167)

where

Δ k_{n γ}

is an integer. Since the gauge function

χ (\vec{x})

is just a function in 3-dimensional space it is the same for all particles and the integral on the left is independent of n. This implies that the charge

q_{n}

divided by an integer

Δ k_{n γ}

must be independent of n which means that

q_{n}

must be an integer multiple of some basic charge

q_{0}

. We conclude that the charges

q_{n}

are quantized.

The issue of charge quantization is ultimately the issue of deciding which is the gauge group that generates electromagnetic interactions. We could for example decide to restrict the gauge transformations to single-valued gauge functions

χ (\vec{x})

so that (167) is trivially satisfied irrespective of the charges being quantized or not. Under such a restricted symmetry group the single-valued (or double-valued) nature of the wave function is unaffected by gauge transformations. If, on the other hand, the gauge functions

χ (\vec{x})

are allowed to be multi-valued, then the compatibility of the gauge transformation (163)–(164) with the superposition principle demands that charges be quantized.

The argument above cannot fix the value of the basic charge

q_{0}

because it depends on the units chosen for the vector potential

A_{a}

. Indeed since the dynamical equations show

q_{n}

and

A_{a}

appearing only in the combination

q_{n} A_{a}

we can change units by rescaling charges and potentials according to

C q_{n} = q_{n}^{'}

and

A_{a} / C = A_{a}^{'}

so that

q_{n} A_{a} = q_{n}^{'} A_{a}^{'}

. For conventional units such that the basic charge is

q_{0} = e / 3

with

α = e^{2} / ℏ c = 1 / 137

the scaling factor is

C = {(α ℏ c)}^{1 / 2} / 3 q_{0}

. A more natural set of units might be to set

q_{0} = ℏ c

so that all

β_{n}

s are integers and the gauge functions

χ (\vec{x})

are angles.

A similar conclusion—that charge quantization is a reflection of the compactness of the gauge group—can be reached following an argument due to C. N. Yang [113]. Yang’s argument assumes that a Hilbert space has been established and one has access to the unitary representations of symmetry groups. Yang considers a gauge transformation

Ψ (x) \to Ψ (x) \exp i \sum_{n} \frac{q_{n}}{c} χ ({\vec{x}}_{n}),

(168)

with

χ (\vec{x})

independent of

\vec{x}

. If the

q_{n}

s are not commensurate there is no value of

χ

(except 0) that makes (168) be the identity transformation. The gauge group—translations on the real line—would not be compact. If, on the other hand, the charges are integer multiples of a basic charge

q_{0}

, then two values of

χ

that differ by an integer multiple of

2 π c / q_{0}

give identical transformations and the gauge group is compact. In the present ED derivation, however, we deal with the space

T^{*} P

which is a complex projective space. We cannot adopt Yang’s argument because a gauge transformation

χ

independent of

\vec{x}

is already an identity transformation—it leads to an equivalent state in the same ray—and cannot therefore lead to any constraints on the allowed charges.

11. The Classical Limits and the Bohmian Limit

11.1. Classical Limits

There are two classical limits that one might wish to consider. One is the mathematical limit

ℏ \to 0

. Taking

ℏ \to 0

leaves unchanged both the velocities

v_{n}^{a}

of the particles, Equation (19), and the probability flow, Equation (146). The main effect is to suppress the quantum potential so that Equation (147) becomes the classical Hamilton-Jacobi equation. The symplectic form, Equation (63), survives unscathed but the metric and the complex structures, Equations (101) and (103), do not. However, this is not quite classical mechanics. Since the velocity fluctuations, Equation (25), remain unaffected the resulting dynamics is a non-dissipative version of the classical Oernstein–Uhlenbeck Brownian motion. To recover a deterministic classical mechanics one must also take the limit

η \to 0

.

The other classical limit arises in the more physically relevant situation where one deals with a system with a large number N of particles—for example, a speck of dust—and one wishes to study the motion of an effective macrovariable such as the center of mass (CM), Equation (127). The large N limit of ED with particles undergoing an ES Brownian motion was studied in [77]. The same argument goes through essentially unchanged for the OU Brownian motion discussed here. Skipping all details we find that because of the central limit theorem the continuity equation for

ρ_{cm} (X^{a})

and the velocity fluctuations are given by the analogues of (43) and (25) for a single particle of mass

M = \sum_{n = 1}^{N} m_{n}

,

\partial_{t} ρ_{cm} = \frac{\partial}{\partial X^{a}} (ρ_{cm} V^{a}) with V^{a} = \frac{〈 Δ X^{a} 〉}{Δ t} = \frac{1}{M} \frac{\partial Φ_{cm}}{\partial X^{a}},

(169)

〈(\frac{Δ X^{a}}{Δ t} - V^{a}) (\frac{Δ X^{b}}{Δ t} - V^{b})〉 = \frac{η Δ t}{M} .

(170)

We also find that under rather general conditions the CM motion decouples from the motion of the component particles and obeys the single particle HJ equation

- \partial_{t} Φ_{cm} = \frac{1}{2 M} {(\frac{\partial Φ_{cm}}{\partial X^{a}})}^{2} - \frac{ℏ^{2}}{2 M} \frac{\nabla^{2} ρ_{cm}^{1 / 2}}{ρ_{cm}^{1 / 2}} + V_{ext} (X) .

(171)

In the large N limit

M \sim O (N)

and we obtain a finite velocity

V^{a}

in (169) provided

Φ_{cm} \sim O (N)

. In Equation (171) we see that for a sufficiently large system the quantum potential for the CM motion vanishes. Therefore, for

N \to \infty

, the CM follows smooth trajectories described by a classical Hamilton-Jacobi equation. Furthermore, Equation (170) shows that as

N \to \infty

the velocity fluctuations vanish irrespective of the value of

η

. This is a truly deterministic classical mechanics.

An important feature of this derivation is that ℏ and

η

remain finite which means that a mesoscopic or macroscopic object will behave classically while all its component particles remain fully quantum mechanical.

11.2. The Bohmian Limit

ED models with different values of

η

lead to the same Schrödinger equation. In other words, different sub-quantum models lead to the same emergent quantum behavior. The limit of vanishing

η

deserves particular attention because the velocity fluctuations, Equation (25), are suppressed and the motion becomes deterministic. This means that ED includes the Bohmian form of quantum mechanics [51,52,53] as a special limiting case—but with the important caveat that the difference in physical interpretation remains enormous. It is only with respect to the mathematical formalism that ED includes Bohmian mechanics as a special case.

Bohmian mechanics attempts to provide an actual description of reality. In the Bohmian view the universe consists of real particles that have definite positions and their trajectories are guided by a real field, the wave function Ψ. Not only does this pilot wave live in

3 N

-dimensional configuration space but it manages to act on the particles without the particles reacting back upon it. These are peculiarities that have stood in the way of a wider acceptance of the Bohmian interpretation. In contrast, ED’s pragmatic goal is much less ambitious: to make the best possible predictions based on very incomplete information. As in Bohmian mechanics, in ED the particles also have definite positions and its formalism includes a function Φ that plays the role of a pilot wave. However, Φ is an epistemic tool for reasoning; it is not meant to represent anything real. There is no implication that the particles move the way they do because they are pushed around by a pilot wave or by some stochastic force. In fact, ED is silent on the issue of what if anything is pushing the particles. What the probability

ρ

and the phase Φ are designed to do is not to guide the particles but to guide our inferences. They guide our expectations of where and when to find the particles but they do not exert any causal influence on the particles themselves.

12. Hilbert Space

The formulation of the ED of spinless particles is now complete. We note, in particular, that the notion of Hilbert spaces turned out to be unnecessary to the formulation of quantum mechanics. As we shall see next, while strictly unnecessary in principle, the introduction of Hilbert spaces is nevertheless very convenient for calculational purposes.

A vector space. As we saw above the infinite-dimensional e-phase space—the cotangent bundle

T^{*} P

—is difficult to handle. The problem is that the natural coordinates are probabilities

ρ_{x}

which, due to the normalization constraint, are not independent. In a discrete space one could single out one of the coordinates and its conjugate momentum and then proceed to remove them. Unfortunately, with a continuum of coordinates and momenta the removal is not feasible. The solution is to embed

T^{*} P

in a larger space

T^{*} P^{+ 1}

. This move allows us to keep the natural coordinates

ρ_{x}

but there is a price: we are forced to deal with a constrained system and its attendant gauge symmetry.

We also saw that the geometry of the embedding space was not fully determined: any spherically symmetric space would serve our purposes. This is a freedom we can further exploit. For calculational purposes the linearity of the Schrödinger Equation (143) is very convenient but its usefulness is severely limited by the normalization constraint. If

Ψ_{1}

and

Ψ_{2}

are flows in

T^{*} P

then the superposition

Ψ_{3}

in (152) will also be a flow in

T^{*} P

but only if the coefficients

α_{1}

and

α_{2}

are such that

Ψ_{3}

is properly normalized. This restriction can be removed by choosing the extended embedding space

T^{*} P^{+ 1}

to be flat—just set

A = 0

and

B = 1

in Equation (91). (The fact that this space is flat is evident in the metric (89) for the discrete case.) We emphasize that this choice is not at all obligatory; it is optional.

The fact that in the flat space

T^{*} P^{+ 1}

superpositions are allowed for arbitrary constants

α_{1}

and

α_{2}

means that

T^{*} P^{+ 1}

is not just a manifold; it is also a vector space. Each point Ψ in

T^{*} P^{+ 1}

is itself a vector. Furthermore, since the vector tangent to a curve is just a difference of two vectors Ψ we see that that points on the manifold and vectors tangent to the manifold are objects of the same kind. In other words, the tangent spaces

T {[T^{*} P^{+ 1}]}_{Ψ}

are identical to the space

T^{*} P^{+ 1}

itself.

The symplectic form Ω and the metric tensor G on the extended space

T^{*} P^{+ 1}

are given by Equations (108) and (114). Since they are tensors Ω and G are meant to act on vectors but now they can also act on all points

Ψ \in T^{*} P^{+ 1}

and not just on those that happen to be normalized and gauge fixed according to (83). For example, the action of the mixed tensor J, Equation (115), on a wave function Ψ is

J^{μ x}_{ν x^{'}} Ψ^{ν x^{'}} = [\begin{matrix} i & 0 \\ 0 & - i \end{matrix}] (\binom{Ψ_{x}}{i ℏ Ψ_{x}^{*}}) = (\binom{i Ψ_{x}}{i ℏ {(i Ψ_{x})}^{*}}),

(172)

which indicates that J plays the role of multiplication by i, i.e., when acting on a point Ψ the action of J is

Ψ \overset{J}{\to} i Ψ

.

Dirac notation. We can at this point introduce the Dirac notation to represent the wave functions

Ψ_{x}

as vectors

| Ψ 〉

in a Hilbert space. The scalar product

〈 Ψ_{1} | Ψ_{2} 〉

is defined using the metric G and the symplectic form Ω,

〈 Ψ_{1} | Ψ_{2} 〉 \overset{def}{=} \frac{1}{2 ℏ} \int d x d x^{'} (Ψ_{1 x}, i ℏ Ψ_{1 x}^{*}) (G + i Ω) (\binom{Ψ_{2 x^{'}}}{i ℏ Ψ_{2 x^{'}}^{*}}) .

(173)

A straightforward calculation gives

〈 Ψ_{1} | Ψ_{2} 〉 = \int d x Ψ_{1}^{*} Ψ_{2} .

(174)

The map

Ψ_{x} \leftrightarrow | Ψ 〉

is defined by

| Ψ 〉 = \int d x | x 〉 Ψ_{x} where Ψ_{x} = 〈 x | Ψ 〉,

(175)

where, in this “position” representation, the vectors

{| x 〉}

form a basis that is orthogonal and complete,

〈 x | x^{'} 〉 = δ_{x x^{'}} and \int d x | x 〉 〈 x | = \hat{1} .

(176)

Hermitian and unitary operators. The bilinear Hamilton functionals

\tilde{Q} [Ψ, Ψ^{*}]

with kernel

\hat{Q} (x, x^{'})

in Equation (121) can now be written in terms of a Hermitian operator

\hat{Q}

and its matrix elements,

\tilde{Q} [Ψ, Ψ^{*}] = 〈 Ψ | \hat{Q} | Ψ 〉 and \hat{Q} (x, x^{'}) = 〈 x | \hat{Q} | x^{'} 〉 .

(177)

The corresponding Hamilton–Killing flows are given by

i ℏ \frac{d}{d λ} 〈 x | Ψ 〉 = 〈 x | \hat{Q} | Ψ 〉 or i ℏ \frac{d}{d λ} | Ψ 〉 = \hat{Q} | Ψ 〉 .

(178)

These flows are described by unitary transformations

| Ψ (λ) 〉 = {\hat{U}}_{Q} (λ) | Ψ (0) 〉 where {\hat{U}}_{Q} (λ) = \exp (- \frac{i}{ℏ} \hat{Q} λ) .

(179)

Commutators. The Poisson bracket of two Hamiltonian functionals

\tilde{U} [Ψ, Ψ^{*}]

and

\tilde{V} [Ψ, Ψ^{*}]

,

{\tilde{U}, \tilde{V}} = \int d x (\frac{δ \tilde{U}}{δ Ψ_{x}} \frac{δ \tilde{V}}{δ i ℏ Ψ_{x}^{*}} - \frac{δ \tilde{U}}{δ i ℏ Ψ_{x}^{*}} \frac{δ \tilde{V}}{δ Ψ_{x}}),

can be written in terms of the commutator of the associated operators, then

{\tilde{U}, \tilde{V}} = \frac{1}{i ℏ} 〈 Ψ | [\hat{U}, \hat{V}] | Ψ 〉 .

(180)

Thus, the Poisson bracket is the expectation of the commutator. This identity is much sharper than Dirac’s pioneering discovery that the quantum commutator of two q-variables is analogous to the Poisson bracket of the corresponding classical variables. Further parallels between the geometric and the Hilbert space formulation of QM can be found in [56,57,58,59,60,61,62,63,64].

13. Remarks on ED and Quantum Bayesianism

Having discussed the ED approach in some detail it is now appropriate to comment on how ED differs from the interpretations known as Quantum Bayesianism [20,21,22] and its closely related descendant QBism [23,24]; for simplicity, I shall refer to both as QB. Both ED and QB adopt an epistemic degree-of-belief concept of probability but there are important differences:

(a): QB adopts a personalistic de Finetti type of Bayesian interpretation while ED adopts an impersonal entropic Bayesian interpretation somewhat closer but not identical to Jaynes’ [15,16,17,18]. In ED, the probabilities do not reflect the subjective beliefs of any particular person. They are tools designed to assist us in those all too common situations in which are confused and due to insufficient information we do not know what to believe. The probabilities will then provide guidance as to what agents ought to believe if only they were ideally rational. More explicitly, probabilities in ED describe the objective degrees of belief of ideally rational agents who have been supplied with the maximal allowed information about a particular quantum system.
(b): ED derives or reconstructs the mathematical framework of QM—it explains where the symplectic, metric, and complex structures, including Hilbert spaces and time evolution come from. In contrast, at its current stage of development QB consists of appending a Bayesian interpretation to an already existing mathematical framework. Indeed, assumptions and concepts from quantum information are central to QB and are implicitly adopted from the start. For example, a major QB concern is the justification of the Born rule starting from the Hilbert space framework while ED starts from probabilities and its goal is to justify the construction of wave functions; the Born rule follows as a trivial consequence.
(c): ED is an application of entropic/Bayesian inference. Of course, the choices of variables and of the constraints that happen to be physically relevant are specific to our subject matter—quantum mechanics—but the inference method itself is of universal applicability. It applies to electrons just as well as to the stock market or to medical trials. In contrast, in QB the personalistic Bayesian framework is not of universal validity. For those special systems that we call ‘quantum’ the inference framework is itself modified into a new “Quantum-Bayesian coherence” in which the standard Bayesian inference must be supplemented with concepts from quantum information theory. The additional technical ingredient is a hypothetical structure called a “symmetric informationally complete positive-operator-valued measure”. In short, in QB Born’s Rule is not derived but constitutes an addition beyond the raw probability theory.
(d): QB is an anti-realist neo-Copenhagen interpretation; it accepts complementarity. (Here complementarity is taken to be the common thread that runs through all Copenhagen interpretations.) Probabilities in QB refer to the outcomes of experiments and not to ontic pre-existing values. In contrast, in ED probabilities refer to ontic positions—including the ontic positions of pointer variables. In the end, this is what solves the problem of quantum measurement (see [70,71]).

14. Some Final Remarks

We conclude with a summary of the main assumptions:

Particles have definite but unknown positions and follow continuous trajectories.
The probability of a short step is given by the method of maximum entropy subject to a drift potential constraint that introduces directionality and correlations, plus gauge constraints that account for external electromagnetic fields.
The accumulation of short steps requires a notion of time as a book-keeping device. This involves the introduction of the concept of an instant and a convenient definition of the duration between successive instants.
The e-phase space ${ρ, Φ}$ has a natural symplectic geometry that results from treating the pair $(ρ_{x}, Φ_{x})$ as canonically conjugate variables.
The information geometry of the space of probabilities is extended to the full e-phase space by imposing the latter be spherically symmetric.
The drift potential constraint is updated instant by instant in such a way as to preserve both the symplectic and metric geometries of the e-phase space.

The resulting entropic dynamics is described by the Schrödinger equation. Different sub-quantum Brownian motions all lead to the same emergent quantum mechanics. In previous work we dealt with an Einstein–Smoluchowski process; here we have explored an Oernstein–Uhlenbeck process. Other “fractional” Brownian motions might be possible but have not yet been studied.

A natural question is whether these different sub-quantum Brownian motions might have observable consequences. At this point our answer can only be tentative. To the extent that we have succeeded in deriving QM and not some other theory one should not expect deviations in the predictions for the standard experiments that are the subject of the standard quantum theory— at least not in the non-relativistic regime. As the ED program is extended to other regimes involving higher energies and/or gravity it is quite possible that those different sub-quantum motions might not be empirically equivalent.

ED achieves ontological clarity by sharply separating the ontic elements from the epistemic elements — positions of particles on one side and probabilities

ρ

and phases Φ on the other. ED is a dynamics of probabilities and not a dynamics of particles. Of course, if probabilities at one instant are large in one place and at a later time they are large in some other place one infers that the particles must have moved—but nothing in ED describes what it is that has pushed the particles around. ED is a mechanics without a mechanism.

We can elaborate on this point from a different direction. The empirical success of ED suggests that its epistemic probabilities agree with ontic features of the physical world. It is highly desirable to clarify the precise nature of this agreement. Consider, for example, a fair die. Its property of being a perfect cube is an ontic property of the die which is reflected at the epistemic level in the equal assignment of probabilities to each face of the die. In this example we see that the epistemic probabilities achieve objectivity, and therefore usefulness, by corresponding to something ontic. The situation in ED is similar except for one crucial aspect. The ED probabilities are objective, and they are empirically successful. They must therefore reflect something real. However, it is not yet known what those underlying ontic properties might possibly be. Fortunately, for the purposes of making predictions knowing those epistemic probabilities is all we need.

The trick of embedding the e-phase space

T^{*} P

in a flat vector space

T^{*} P^{+ 1}

is clever but optional. It allows one to make use of the calculational advantages of linearity. This recognition that Hilbert spaces are not fundamental is one of the significant contributions of the entropic approach to our understanding of QM. The distinction—whether Hilbert spaces are necessary in principle as opposed to merely convenient in practice—is not of purely academic interest. It can be important in the search for a quantum theory that includes gravity: Shall we follow the usual approaches to quantization that proceed by replacing classical dynamical variables by an algebra of linear operators acting on some abstract space? Or, in the spirit of an entropic dynamics, shall we search for an appropriately constrained dynamics of probabilities and information geometries? First steps towards formulating a first-principles theory along these lines are given in [114,115].

Funding

This research received no external funding

Acknowledgments

I would like to thank M. Abedi, D. Bartolomeo, C. Cafaro, N. Carrara, N. Caticha, F. Costa, S. DiFranzo, K. Earle, A. Giffin, S. Ipek, D.T. Johnson, K. Knuth, O. Lunin, S. Nawaz, P. Pessoa, M. Reginatto, C. Rodríguez, and K. Vanslette, for valuable discussions on entropic inference and entropic dynamics and for their many insights and contributions at various stages of this program.

Conflicts of Interest

The author declares no conflict of interest.

References

Dirac, P.A.M. Quantum Mechanics, 3rd ed.; Oxford University Press: Oxford, UK, 1930. [Google Scholar]
Von Neumann, J. Mathematical Foundations of Quantum Mechanics; Princeton University Press: Princeton, NJ, USA, 1955. [Google Scholar]
Bell, J. Against ‘measurement’. Phys. World 1990, 8, 33. [Google Scholar] [CrossRef]
Excellent reviews with extended references to the literature are given in e.g., [5,6,7,8].
Stapp, H.P. The Copenhagen Interpretation. Am. J. Phys. 1972, 40, 1098. [Google Scholar] [CrossRef]
Schlösshauer, M. Decoherence, the measurement problem, and interpretations of quantum mechanics. Rev. Mod. Phys. 2004, 76, 1267. [Google Scholar] [CrossRef]
Jaeger, G. Entanglement, Information, and the Interpretation of Quantum Mechanics; Springer: Berlin/Heidelberger, Germany, 2009. [Google Scholar]
Leifer, M.S. Is the Quantum State Real? An Extended Review of Ψ-ontology Theorems. Quanta 2014, 3, 67, arXiv.org:1409.1570. [Google Scholar]
Since the terms ‘ontic’ and ‘epistemic’ are not yet of widespread use outside the community of Foundations of QM, a clarification might be useful. A concept is referred as ‘ontic’ when it describes something that is supposed to be real, to exist out there independently of any observer. A concept is referred as ‘epistemic’ when it is related to the state of knowledge, opinion, or belief of an agent, albeit an ideally rational agent. Examples of epistemic quantities are probabilities and entropies. An important point is that the distinction ontic/epistemic is not the same as the distinction objective/subjective. For example, probabilities are fully epistemic—they are tools for reasoning with incomplete information— but they can lie anywhere in the spectrum from being completely subjective (two different agents can have different beliefs) to being completely objective. In QM, for example, probabilities are epistemic and objective. Indeed, at the non-relativistic level anyone who computes probabilities that disagree with QM will be led to experimental predictions that are demonstrably wrong. We will say that the wave function Ψ, which is fully epistemic and objective, represents a “physical” state when it represents information about an actual “physical” situation.
Caticha, A. Entropic Dynamics, Time, and Quantum Theory. J. Phys. A Math. Theor. 2011, 44, 225303, arXiv.org:1005.2357. [Google Scholar]
Caticha, A. Entropic Dynamics. Entropy 2015, 17, 6110–6128, arXiv.org:1509.03222. [Google Scholar] [CrossRef] [Green Version]
Caticha, A. Entropic Dynamics: Quantum Mechanics from Entropy and Information Geometry. Ann. Physik 2018, 1700408, arXiv.org:1711.02538. [Google Scholar]
The principle of maximum entropy as a method for inference can be traced to the pioneering work of E. T. Jaynes [14,15,16]. For a pedagogical overview including more modern developments see [17,18].
Jaynes, E.T. Information Theory and Statistical Mechanics I and II. Phys. Rev. 1957, 106. 108, and 171, 620. [Google Scholar] [CrossRef]
Jaynes, E.T. Papers on Probability, Statistics and Statistical Physics; Rosenkrantz, R.D., Ed.; D. Reidel: Dordrecht, The Netherlands, 1983. [Google Scholar]
Jaynes, E.T. Probability Theory: The Logic of Science; Bretthorst, G.L., Ed.; Cambridge University Press: Cambridge, UK, 2003. [Google Scholar]
Caticha, A. Entropic Inference and the Foundations of Physics. Available online: http://www.albany.edu/physics/ACaticha-EIFP-book.pdf (accessed on 20 September 2019).
Caticha, A. Towards an Informational Pragmatic Realism. Mind Mach. 2014, 24, 37, arXiv.org:1412.5644. [Google Scholar] [CrossRef]
There exist many different Bayesian interpretations of probability. In Section 13 we comment on how ED differs from the frameworks known as Quantum Bayesianism [20,21,22] and its closely related descendant QBism [23,24].
Brun, T.A.; Caves, C.M.; Schack, R. Quantum Bayes rule. Phys. Rev. A 2001, 63, 042309. [Google Scholar] [CrossRef]
Caves, C.M.; Fuchs, C.A.; Schack, R. Unknown quantum states: The quantum de Finetti representation. J. Math. Phys. 2002, 43, 4547. [Google Scholar] [CrossRef]
Caves, C.M.; Fuchs, C.A.; Schack, R. Quantum Probabilities as Bayesian Probabilities. Phys. Rev. A 2002, 65, 022305. [Google Scholar] [CrossRef]
Fuchs, C.A.; Schack, R. Quantum-Bayesian Coherence. Rev. Mod. Phys. 2013, 85, 1693. [Google Scholar] [CrossRef]
Fuchs, C.A.; Mermin, N.D.; Schack, R. An introduction to QBism with an application to the locality of quantum mechanics. Am. J. Phys. 2014, 82, 749. [Google Scholar] [CrossRef]
Nelson, E. Quantum Fluctuations; Princeton University Press: Princeton, NJ, USA, 1985. [Google Scholar]
Adler, S. Quantum Theory as an Emergent Phenomenon; Cambridge University Press: Cambridge, UK, 2004. [Google Scholar]
Smolin, L. Could quantum mechanics be an approximation to another theory? arXiv 2006. arXiv.org/abs/quant-ph/0609109. [Google Scholar]
de la Peña, L.; Cetto, A.M. The Emerging Quantum: The Physics Behind Quantum Mechanics; Springer: Berlin/Heidelberg, Germany, 2014. [Google Scholar]
Grössing, G. The Vacuum Fluctuation Theorem: Exact Schrödinger Equation via Nonequilibrium Thermodynamics. Phys. Lett. A 2008, 372, 4556, arXiv:0711.4954. [Google Scholar] [CrossRef]
Grössing, G.; Fussy, S.; Mesa Pascasio, J.; Schwabl, H. The Quantum as an Emergent System. J. Phys.Conf. Ser. 2012, 361, 012008. [Google Scholar] [CrossRef]
Hooft, G.T. The Cellular Automaton Interpretation of Quantum Mechanics; Springer: Berlin/Heidelberg, Germany, 2016. [Google Scholar]
Wootters, W.K. Statistical distance and Hilbert space. Phys. Rev. D 1981, 23, 357. [Google Scholar] [CrossRef]
Caticha, A. Consistency and Linearity in Quantum Theory. Phys. Lett. A 1998, 244, 13. [Google Scholar] [CrossRef]
Caticha, A. Consistency, Amplitudes, and Probabilities in Quantum Theory. Phys. Rev. A 1998, 57, 1572. [Google Scholar] [CrossRef]
Caticha, A. Insufficient Reason and Entropy in Quantum Theory. Found. Phys. 2000, 30, 227. [Google Scholar] [CrossRef]
Brukner, C.; Zeilinger, A. Information and Fundamental Elements of the Structure of Quantum Theory. In Time, Quantum, Information; Castell, L., Ischebeck, O., Eds.; Springer: Berlin/Heidelberg, Germany, 2003. [Google Scholar]
Mehrafarin, M. Quantum mechanics from two physical postulates. Int. J. Theor. Phys. 2005, 44, 429, arXiv:quant-ph/0402153. [Google Scholar] [CrossRef]
Spekkens, R. Evidence for the epistemic view of quantum states: A toy theory. Phys. Rev. A 2007, 75, 032110. [Google Scholar] [CrossRef] [Green Version]
Goyal, P. From Information Geometry to Quantum Theory. New J. Phys. 2010, 12, 023012. [Google Scholar] [CrossRef]
Goyal, P.; Knuth, K.; Skilling, J. Origin of complex quantum amplitudes and Feynman’s rules. Phys. Rev. A 2010, 81, 022109. [Google Scholar] [CrossRef]
Chiribella, G.; D’Ariano, G.M.; Perinotti, P. Informational derivation of quantum theory. Phys. Rev. 2011, 84, 012311. [Google Scholar] [CrossRef]
Hardy, L. Reformulating and Reconstructing Quantum Theory. arXiv 2011, arXiv:1104.2066. [Google Scholar]
Reginatto, M.; Hall, M.J.W. Quantum theory from the geometry of evolving probabilities. AIP Conf. Proc. 2012, 1443, 96, arXiv:1108.5601. [Google Scholar]
Reginatto, M.; Hall, M.J.W. Information geometry, dynamics and discrete quantum mechanics. AIP Conf. Proc. 2013, 1553, 246, arXiv:1207.6718. [Google Scholar]
Hardy, L. Reconstructing Quantum Theory. In Quantum Theory: Informational Foundations and Foils; Chiribella, G., Spekkens, R., Eds.; Springer: Berlin/Heidelberg, Germany, 2015. [Google Scholar]
D’Ariano, G.M. Physics without physics: The power of information-theoretical principles. Int. J. Theor. Phys. 2017, 56, 97. [Google Scholar] [CrossRef]
Nelson, E. Dynamical Theories of Brownian Motion, 2nd ed.; Princeton University Press: Princeton, NJ, USA, 1967; Available online: http://www.math.princeton.edu/nelson/books.html (accessed on 20 September 2019).
In both the ES and the OU processes, which were originally meant to model the actual physical Brownian motion, friction and dissipation play essential roles. In contrast, ED is non-dissipative. ED formally resembles Nelson’s stochastic mechanics [25] but the conceptual differences are significant. Nelson’s mechanics attempted an ontic interpretation of QM as an ES process driven by real stochastic classical forces while ED is a purely epistemic model that does not appeal to an underlying classical mechanics.
Bartolomeo, D.; Caticha, A. Entropic Dynamics: The Schrödinger equation and its Bohmian limit. AIP Conf. Proc. 2016, 1757, 030002, arXiv.org:1512.09084. [Google Scholar]
Bartolomeo, D.; Caticha, A. Trading drift and fluctuations in entropic dynamics: Quantum dynamics as an emergent universality class. J. Phys. Conf. Ser. 2016, 701, 012009, arXiv.org:1603.08469. [Google Scholar] [CrossRef]
Bohm, D. A suggested interpretation of the quantum theory in terms of “hidden” variables, I and II. Phys. Rev. 1952, 85, 166, 180. [Google Scholar] [CrossRef]
Bohm, D.; Hiley, B.J. The Undivided Universe—An Ontological Interpretation of Quantum Theory; Routlege: New York, NY, USA, 1993. [Google Scholar]
Holland, P.R. The Quantum Theory of Motion; Cambridge University Press: Cambridge, UK, 1993. [Google Scholar]
Caticha, A.; Bartolomeo, D.; Reginatto, M. Entropic Dynamics: From entropy and information geometry to Hamiltonians and quantum mechanics. AIP Conf. Proc. 2015, 1641, 155, arXiv.org:1412.5629. [Google Scholar]
Nelson, E. Connection between Brownian motion and quantum mechanics. Lect. Notes Phys. 1979, 100, 168. [Google Scholar]
Kibble, T.W.B. Geometrization of Quantum Mechanics. Commun. Math. Phys. 1979, 65, 189. [Google Scholar] [CrossRef]
Heslot, A. Quantum mechanics as a classical theory. Phys. Rev. 1985, 31, 1341. [Google Scholar] [CrossRef]
Anandan, J.; Aharonov, Y. Geometry of Quantum Evolution. Phys. Rev. Lett. 1990, 65, 1697. [Google Scholar] [CrossRef]
Cirelli, R.; Manià, A.; Pizzochero, L. Quantum mechanics as an infinite-dimensional Hamiltonian system with uncertainty structure: Part I and II. J. Math. Phys. 1990, 31, 2891, 2898. [Google Scholar] [CrossRef]
Abe, S. Quantum-state space metric and correlations. Phys. Rev. A 1992, 46, 1667. [Google Scholar] [CrossRef]
Hughston, L.P. Geometric aspects of quantum mechanics. In Twistor Theory; Huggett, S.A., Ed.; Marcel Dekker: New York, NY, USA, 1995. [Google Scholar]
Ashtekar, A.; Schilling, T.A. Geometrical Formulation of Quantum Mechanicss. In On Einstein’s Path; Harvey, A., Ed.; Springer: New York, NY, USA, 1998. [Google Scholar]
de Gosson, M.A.; Hiley, B.J. Imprints of the Quantum World in Classical Mechanics. Found. Phys. 2011, 41, 1415. [Google Scholar] [CrossRef]
Elze, H.T. Linear dynamics of quantum-classical hybrids. Phys. Rev. A 2012, 85, 052109. [Google Scholar] [CrossRef]
Amari, S. Differential-Geometrical Methods in Statistics; Springer: Berlin/Heidelberger, Germany, 1985. [Google Scholar]
Campbell, L.L. An extended Čencov characterization of the information metric. Proc. Am. Math. Soc. 1986, 98, 135. [Google Scholar]
Rodríguez, C.C. The metrics generated by the Kullback number. In Maximum Entropy and Bayesian Methods; Skilling, J., Ed.; Kluwer: Dordrecht, The Netherlands, 1989. [Google Scholar]
Ay, N.; Jost, J.; Vân Lê, H.; Schwachhöfer, L. Information Geometry; Springer: Berlin, Germany, 2017. [Google Scholar]
Brodie, D.J.; Hughston, L.P. Statistical Geometry in Quantum Mechanics. Philos. Trans. R. Soc. Lond. A 1998, 454, 2445, arXiv:gr-qc/9701051. [Google Scholar] [CrossRef]
Johnson, D.T.; Caticha, A. Entropic dynamics and the quantum measurement problem. AIP Conf. Proc. 2012, 1443, 104, arXiv:1108.2550. [Google Scholar]
Vanslette, K.; Caticha, A. Quantum measurement and weak values in entropic quantum dynamics. AIP Conf. Proc. 2017, 1853, 090003, arXiv:1701.00781. [Google Scholar]
Nawaz, S.; Caticha, A. Momentum and uncertainty relations in the entropic approach to quantum theory. AIP Conf. Proc. 2012, 1443, 112, arXiv:1108.2629. [Google Scholar]
These are the well-known uncertainty relations due to Heisenberg and to Schrödinger. The entropic uncertainty relations proposed by Deutsch [75,76,77] have not yet been explored within the context of ED.
Deutsch, D. Uncertainty in Quantum Measurements. Phys. Rev. Lett. 1983, 50, 631. [Google Scholar] [CrossRef]
Partovi, M.H. Entropic Formulation of Uncertainty for Quantum Measurements. Phys. Rev. Lett. 1983, 50, 1883. [Google Scholar] [CrossRef]
Maassen, H.; Uffink, J. Generalized Entropic Uncertainty Relations. Phys. Rev. Lett. 1988, 60, 1103. [Google Scholar] [CrossRef]
Demme, A.; Caticha, A. The Classical Limit of Entropic Quantum Dynamics. AIP Conf. Proc. 2017, 1853, 090001, arXiv.org:1612.01905. [Google Scholar]
Nawaz, S.; Abedi, M.; Caticha, A. Entropic Dynamics on Curved Spaces. AIP Conf. Proc. 2016, 1757, 030004, arXiv.org:1601.01708. [Google Scholar]
Ipek, S.; Caticha, A. Entropic quantization of scalar fields. AIP Conf. Proc. 2015, 1641, 345, arXiv.org:1412.5637. [Google Scholar]
Ipek, S.; Abedi, M.; Caticha, A. Entropic Dynamics: Reconstructing Quantum Field Theory in Curved Spacetime. Class. Quantum Grav. 2019, in press, arXiv:1803.07493. [Google Scholar] [CrossRef]
Caticha, A.; Carrara, N. The Entropic Dynamics of Spin. In preparation.
In this work ED is a model for the quantum mechanics of particles. The same framework can be deployed to construct models for the quantum mechanics of fields, in which case it is the fields that are ontic and have well-defined albeit unknown values [79,80].
In Cartesian coordinates μ = const and may be ignored.
The angular nature of the drift potential is explained when the ED framework is extended to particles with spin [81].
In a relativistic theory there is more freedom in the choice of instants and this translates into a greater flexibility with the notion of simultaneity. Conversely, the requirement of consistency among the different notions of simultaneity severely limits the allowed forms of relativistic ED [80].
Price, H. Time’s Arrow and Archimedes’ Point; Oxford University Press: Oxford, UK, 1996. [Google Scholar]
Zeh, H.D. The Physical Basis of the Direction of Time; Springer: Berlin, Germany, 2007. [Google Scholar]
Lanczos, C. The Variational Principles of Mechanics, 4th ed.; Dover: New York, NY, USA, 1986. [Google Scholar]
Mandelbrot, B.B.; Van Ness, J.W. Fractional Brownian motions, fractional noises, and applications. SIAM Rev. 1968, 10, 422. [Google Scholar] [CrossRef]
In the ES type of ED considered in previous papers (γ = 1) [10,11,12] the probability also satisfies a continuity equation—a Fokker-Planck equation—and the current velocity is the sum of the drift velocity plus an osmotic component

$u^{A} = - ℏ m^{A B} \partial_{B} \log ρ^{1 / 2}$

due to diffusion.
Chandrasekhar, S. Stochastic Problems in Physics and Astronomy. Rev. Mod. Phys. 1943, 15, 1. [Google Scholar] [CrossRef]
DiFranzo, S. The Entropic Dynamics Approach to the Paradigmatic Quantum Mechanical Phenomena. Ph.D. Thesis, University at Albany, Albany, NY, USA, 2018. [Google Scholar]
Equations (49) and (50) show the reason to have introduced the new variable Φ = ℏϕ. With this choice Φ will eventually be recognized as the momentum that is canonically conjugate to the generalized coordinate ρ with Hamiltonian $\tilde{H}$ .
We deal with ∞-dimensional spaces. The level of mathematical rigor in what follows is typical of theoretical physics—which is a euphemism for “from very low to none at all.” For a more sophisticated treatment, see [59,62].
At this point the act of embedding P into P⁺¹ represents no loss of generality because the embedding space P⁺¹ remains unspecified.
This allows us, among other things, the freedom to switch from ρ^x to ρ_x as convenience dictates; from now on ρ_x = ρ^x = ρ(x).
Arnold, V.I. Mathematical Methods of Classical Mechanics; Springer: Berlin/Heidelberger, Germany, 1997. [Google Scholar]
Schutz, B. Geometrical Methods of Mathematical Physics; Cambridge University Press: Cambridge, UK, 1980. [Google Scholar]
We are effectively determining the metric by imposing a symmetry, namely rotational invariance. One might be concerned that choosing this symmetry is an ad hoc assumption but the result proves to be very robust. It turns out that exactly the same metric is obtained by several other criteria that may appear more natural in the context of inference and probability. Such criteria include invariance under Markovian embeddings, the geometry of asymptotic inference, and the metrics induced by relative entropy [66,67] (see also [17]).
Carrara, N.; Caticha, A. Quantum phases in entropic Dynamics. Springer Proc. Math. Stat. 2018, 239, 1. [Google Scholar]
Schrödinger, E. The multi-valuedness of the wave function. Ann. Phys. 1938, 32, 49. [Google Scholar] [CrossRef]
Schrödinger invoked time reversal invariance which was a very legitimate move back in 1938 but today it is preferable to develop an argument which does not invoke symmetries that are already known to be violated.
The answer proposed by Pauli is also worthy of note [104,105,106]. He proposed that admissible wave functions must form a basis for representations of the transformation group that happens to be pertinent to the problem at hand. Pauli’s argument serves to discard double-valued wave functions for describing the orbital angular momentum of scalar particles. The question of single-valuedness was revived by Takabayashi [107,108] in the context of the hydrodynamical interpretation of QM, and later rephrased by Wallstrom [109,110] as an objection to Nelson’s stochastic mechanics: Are these theories equivalent to QM or do they merely reproduce a subset of its solutions? Wallstrom’s objection is that Nelson’s stochastic mechanics leads to phases and wave functions that are either both multi-valued or both single-valued. Both alternatives are unsatisfactory because on one hand QM requires single-valued wave functions, while on the other hand single-valued phases exclude states that are physically relevant (e.g., states with non-zero angular momentum).
Pauli, W. Über ein Kriterium für Ein-oder Zweiwertigkeit der Eigenfunktionen in der Wellenmechanik. Helv. Phys. Acta 1939, 12, 147. [Google Scholar]
Pauli, W. General Principles of Quantum Mechanics; Springer: Berlin, Germany, 1980. [Google Scholar]
Merzbacher, E. Single Valuedness of Wave Functions. Am. J. Phys. 1962, 30, 237. [Google Scholar] [CrossRef]
Takabayasi, T. On the Formulation of Quantum Mechanics associated with Classical Pictures. Prog. Theor. Phys. 1952, 8, 143. [Google Scholar] [CrossRef]
Takabayasi, T. Vortex, Spin and Triad for Quantum Mechanics of Spinning Particle. Prog. Theor. Phys. 1983, 70, 1. [Google Scholar] [CrossRef]
Wallstrom, T.C. On the derivation of the Schrödinger equation from stochastic mechanics. Found. Phys. Lett. 1989, 2, 113. [Google Scholar] [CrossRef]
Wallstrom, T.C. The inequivalence between the Schrödinger equation and the Madelung hydrodynamic equations. Phys. Rev. A 1994, 49, 1613. [Google Scholar] [CrossRef] [PubMed]
We make no symmetry assumptions such as parity or time reversibility. It need not be the case that there is any symmetry that relates the time evolution of $Ψ_{t_{0}}^{*}$ to that of Ψ_t0.
Double-valued wave functions with e^iδ = −1 will, of course, find use in the description of spin-1/2 particles [81].
Yang, C.N. Charge Quantization, Compactness of the Gauge Group, and Flux Quantization. Phys. Rev. D 1979, 1, 2360. [Google Scholar] [CrossRef]
Caticha, A. The information geometry of space-time. In Proceedings of the 39th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering, Garching, Germany, 30 June–5 July 2019. [Google Scholar]
Ipek, S.; Caticha, A. An entropic approach to geometrodynamics. In Proceedings of the 39th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering, Garching, Germany, 30 June–5 July 2019. [Google Scholar]

© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Caticha, A. The Entropic Dynamics Approach to Quantum Mechanics. Entropy 2019, 21, 943. https://doi.org/10.3390/e21100943

AMA Style

Caticha A. The Entropic Dynamics Approach to Quantum Mechanics. Entropy. 2019; 21(10):943. https://doi.org/10.3390/e21100943

Chicago/Turabian Style

Caticha, Ariel. 2019. "The Entropic Dynamics Approach to Quantum Mechanics" Entropy 21, no. 10: 943. https://doi.org/10.3390/e21100943

APA Style

Caticha, A. (2019). The Entropic Dynamics Approach to Quantum Mechanics. Entropy, 21(10), 943. https://doi.org/10.3390/e21100943

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

The Entropic Dynamics Approach to Quantum Mechanics

Abstract

1. Introduction

2. The ED of Short Steps

3. Entropic Time

3.1. Time as an Ordered Sequence of Instants

3.2. The Arrow of Entropic Time

3.3. Duration and the Sub-Quantum Motion

4. The Evolution Equation in Differential Form

5. The Epistemic Phase Space

5.1. Notation: Vectors, Covectors, Etc.

5.2. The Symplectic Form in ED

5.3. Hamiltonian Flows and Poisson Brackets

5.4. The Normalization Constraint

6. The Information Geometry of E-Phase Space

6.1. The Metric on the Embedding Space T * P + 1

6.2. The Metric Induced on T * P

6.3. A Complex Structure

6.4. Complex Coordinates

7. Hamilton-Killing Flows

8. The E-Hamiltonian

9. Entropic Time, Physical Time, and Time Reversal

10. Linearity and the Superposition Principle

10.1. The Single-Valuedness of Ψ

10.2. Charge Quantization

11. The Classical Limits and the Bohmian Limit

11.1. Classical Limits

11.2. The Bohmian Limit

12. Hilbert Space

13. Remarks on ED and Quantum Bayesianism

14. Some Final Remarks

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

6.1. The Metric on the Embedding Space $T^{*} P^{+ 1}$

6.2. The Metric Induced on $T^{*} P$