Abstract
Drawing from the optimal transport theory adapted to the Lorentzian setting, we propose and study the extension of the Sorkin–Woolgar causal relation onto the space of Borel probability measures on a given spacetime. We show that it retains its fundamental properties of transitivity and closedness. Furthermore, we list and prove several characterizations of this relation, including the “measure-theoretic” analogue of the characterization of in terms of time functions.
1. Introduction
One of the vital fibers of mathematical relativity is causality theory; i.e., the study of the causal properties of spacetimes. The fundamental role in this theory is played by certain binary relations on a given spacetime, designed to describe which events (e.g., points of spacetime) are causally connected, and which can never constitute links of a cause–effect chain. The two most important such relations—called and —involve pairs of events which can be connected by means of a future-directed chronological curve (which is denoted ) or a future-directed causal curve (which is often denoted ), respectively. For a concise review of causality theory, the reader is referred to [1] (see also [2,3,4,5,6] for the full exposition).
Motivated by the approach to Lorentzian noncommutative geometry developed in [7,8,9,10,11]—where spacetime usually can no longer be regarded as composed of point-like events but nevertheless possesses a causal structure—we have recently proposed and studied a natural extension of the relation ⪯ onto the space of Borel probability measures on a given spacetime [12], which relies on the notion of a coupling drawn from the optimal transport theory [13,14]. Given a pair of measures , we call a coupling of μ and ν iff the latter two measures are ω’s marginals; that is, iff and , where , denote the canonical projections. With denoting the set of all such couplings, we have put forward the following definition.
Definition 1.
Let be a spacetime. For any , we say that μ causally precedes ν (symbolically ) iff there exists a causal couplingof μ and ν, by which we mean any such that .
Let us emphasize that the quantity is well-defined, because is a Borel subset of ([12], Section 3). The name “causal coupling” was independently coined in [15].
This definition rigorously encapsulates the following common intuition: infinitesimal portions of probability distributions can propagate in spacetime only along future-directed causal curves. The thus-obtained formalism can be successfully used to model the causal time-evolution of spatially distributed physical quantities [16] and wave packets [17], and to quantify the breakdown of causality in various physical models. Although the idea of studying causality for spread objects is by no means new (e.g., [18,19,20,21,22,23,24,25]), the tools of the modern optimal transport theory adapted to the Lorentzian setting allow us to cast some new light on the previous approaches and significantly extend them. See [17] for concrete physical applications and a more detailed discussion.
Even though play the central role in mathematical relativity, these relations are not the only ones studied in causality theory, with the so-called Sorkin–Woolgar relation providing an important example. It is defined in [26] as the smallest transitive and topologically closed relation containing , and it was originally studied in the context of reformulating causality theory along topological and order-theoretic lines, which in particular would accommodate for low-regularity metrics. It turns out that is additionally antisymmetric (and hence a partial order) precisely when the spacetime is stably causal [27], in which case it possesses the following characterization in terms of time functions (i.e., continuous real-valued functions on which are strictly increasing along any future-directed causal curve) [28].
Theorem 1 (Minguzzi).
Let be a stably causal spacetime and . Then,
In his work [27], E. Minguzzi calls “one of the most important [relations] for the development of causality theory”. It is thus natural to ask whether the formalism developed in [12] for can be adapted to . Indeed, Definition 1 can be immediately modified to extend the relation (instead of ) onto (see Definition 2 below). The aim of this paper is to study the thus-obtained relation between measures. More concretely, in Section 2 we show that the Sorkin–Woolgar relation—when extended onto —retains its fundamental properties of transitivity and closedness (Proposition 1). Then, we establish its various characterizations (Theorem 2), in particular providing the “measure-theoretic” counterpart of equivalence (1). All of that generalizes and sheds new light on some of the results presented in [12], which is briefly discussed in Section 3. Finally, the Appendix A outlines the definitions and theorems from the probability theory on Polish spaces used in the article.
2. Results
From now on, the term “measure” will always stand for “Borel probability measure”.
Analogously as for the relations and , for any , one introduces the notation and . By definition, is closed, and hence for any compact , the sets are closed. More generally, however, for which is only Borel, the sets need not be Borel. Nevertheless, being projections of Borel sets, they are universally measurable, which means that for any measure , the sets are Borel up to a μ-negligible set, and therefore the quantity is well-defined [29].
As announced in the Introduction, let us put forward the following definition of the Sorkin–Woolgar relation between measures.
Definition 2.
Let be a spacetime. For any , we say that μ K-causally precedes ν (symbolically ) iff there exists a K-causal coupling of μ and ν, by which we mean any such that .
As a first result, let us observe that the defining properties of the Sorkin–Woolgar relation—closedness and transitivity—still hold after extending it onto .
Proposition 1.
The relation on is reflexive and transitive, as well as closed, by which we mean that the set is closed in endowed with the (product) narrow topology (for the definition of the narrow topology, see Appendix A).
Proof.
Reflexivity and transitivity can be shown exactly as in the proof of ([12], Theorem 11) with ⪯ and replaced by and .
As for the closedness, take any sequences satisfying for all , and suppose that they converge, respectively, to some in the narrow topology; i.e., that
What we need to prove is that . To this end, let be the K-causal coupling of and for every . We claim that the sequence has a narrowly convergent subsequence. Indeed, the sets , are relatively compact (in the narrow topology) and hence tight by the Prokhorov theorem (see Appendix A). However, this implies that the set is also tight ([14], Lemma 4.4) and hence relatively compact. Since is contained in , it possesses a subsequence (which we keep denoting as ) convergent to some .
It now remains to show that ω is a K-causal coupling of μ and ν. In order to prove that , observe first that for any
which proves that . The equality is proven analogously. Finally, in order to show that , invoke the Portmanteau theorem (see Appendix A), which toghether with the closedness of yields . ☐
We now provide several characterizations of the relation for measures, which constitutes the main result of the paper.
Theorem 2.
Let be a spacetime. For any , consider the following list of conditions.
- 1●
- .
- 2●
- For any compact subset ,
- 3●
- For any Borel subset such that ,
- 4●
- For any time function and any ,
- 5●
- For any bounded time function ,
Then, we have the following equivalences and implications . Moreover, if is causally continuous, then all above conditions are equivalent.
Observe that conditions and implicitly assume that is stably causal. Condition is a natural extension of Minguzzi’s condition (1) onto measures. Notice, however, that we assume here the causal continuity of for the equivalence to hold.
All above conditions (with the exception of ) are analogues of those studied in [12] in the context of the relation , and in fact they reduce to them for the case of causally simple spacetimes. However, the properties of allow these analogues to work in a broader class of spacetimes. Some parts of the following proof simply mimic the argumentation presented in [12], while others are significantly different, thus offering an alternative way to reach the results of the cited work. In particular, the crucial role will be played by the following fact, adapted from a more general result obtained by Suhr, cf. ([15], Theorem 2.5).
Theorem 3 (Suhr).
For any spacetime and any , the following conditions are equivalent
- (i)
- .
- (ii)
- For any Borel ,
We will also need the following lemma, the proof of which is a straightforward adaptation of that of ([12], Proposition 1).
Lemma 1.
For any spacetime and any , denote . We have the equivalence of inclusions .
Proof of Theorem 2.
By the closedness of , the set is closed, and hence Borel, for any compact C. Denoting the characteristic function of by χ, the inequality holds for all by the transitivity of . By assumption, there exists a K-causal coupling , and one has that
Let the set be as specified in , and let C be any compact subset of . Then also , and one has that
where the inequality holds by . Every Borel probability measure on a Polish space is inner regular (see Appendix A); therefore, by taking the supremum over all compacts , we obtain that
It suffices to prove that condition ii) in Theorem 3 is satisfied.
To this end, observe that the first inequality in ii) follows directly from with (up to a μ-negligible set), and the obvious inequality . In order to obtain the second inequality, notice that is in fact equivalent to its “” counterpart, namely
- 3′•
- For any Borel subset such that ,
The second inequality in condition ii) of Theorem 3 follows from with (up to a ν-negligible set) and the obvious inequality .
Take any time function (from now on, we implicitly assume that is stably causal) and any . On the strength of , we only have to show that .
To this end, let be such that there exists with . By Theorem 1, we immediately obtain that , and so as well.
Fix a bounded time function . Reasoning similarly as in the proof of ([12], Theorem 7, ), denote and and for every define the simple function
where , , and the functions denote their respective characteristic functions. On the strength of , we have that
The functions are designed so as to satisfy for all . The somewhat technical justification (which we skip here) can be found in the above-mentioned proof in [12]. This property allows us to go with n to infinity in (7) by Lebesgue’s dominant convergence theorem, which yields (5).
Fix a compact set C and define . By the closedness of and Lemma 1, is an open set with the property that .
We go along the lines of the suitably modified proof of ([12], Theorem 7, ). To begin with, let be an admissible measure, which by definition satisfies for all open and for all ([2], Definition 3.19). For any , the measure is another admissible measure. Its associated future volume function , defined for any via
is a (bounded) time function provided is causally continuous (this is where this assumption comes into play). Take now any sequence of strictly increasing functions pointwise convergent to the negative characteristic function of the interval , denoted here as (for instance, one can take ). By , for all and all , it is true that
Using the dominant convergence theorem twice, first for taking , and then , we obtain that
It is now crucial to observe that the map is positive on and zero on . Indeed, for , the set is open and nonempty by the openness of , and therefore by the very definition of an admissible measure. Conversely, if , then there exists such that , which in turn means that .
Remark 1.
If one replaces in condition the term “bounded” with “μ- and ν-integrable”, and/or the term “time” with “temporal”, “smooth time”, “smooth causal”, or “continuous causal”, the thus-obtained condition is equivalent with .
Proof.
To begin with, let τ stand for any term from the set {temporal, smooth time, time, smooth causal, continuous causal}. Let us assume that inequality (5) holds for all bounded τ functions and take any μ- and ν-integrable τ function . For every , consider the bounded function , which is also τ. By assumption,
Notice now that and that pointwise, and hence, by the dominant convergence theorem, inequality (9) becomes (5) as n approaches infinity.
The converse implication holds trivially, because every bounded τ function is μ-integrable for any .
We now move to proving that replacing “time” in with any term from the set {temporal, smooth time, time, smooth causal, continuous causal} yields a condition equivalent to . Notice that the “continuous causal” version of implies all other versions, whereas the “temporal” version of is implied by every other version. We thus only have to show that the “temporal” version implies the “continuous causal” one.
To this end, assume that is a causal function and let t be a fixed bounded time function. Then, for any is another bounded time function, which in turn can be approximated by a sequence of temporal functions such that everywhere (cf. [30], Corollary 5.4 and the subsequent comments). If we assume that the “temporal” version of holds, then we can write that for any ,
With the aid of Lebesgue’s dominant convergence theorem, we can go with m and then with n to infinity, thus obtaining . ☐
Remark 2.
In condition , one can equivalently use the closed half-lines. In other words, is equivalent to the following condition:
- 4′•
- For any time function and any ,
3. Conclusions
In this work, we have defined the natural extension of the Sorkin–Woolgar relation onto and studied its basic properties and characterizations. Since for causally simple spacetimes, many of the results obtained here reduce to those known from [12]. However, the current paper yields two new properties of the causal precedence relation ⪯ on for causally simple . Namely, by Proposition 1, the relation ⪯ on is closed, whereas by Theorem 2, conditions and provide additional characterizations of in terms of inverse images of half-lines under time functions. Finally, observe that Remark 1—making reference neither to ⪯ nor —generalizes ([12], Theorem 6).
Let us remark that all the obtained results would also hold if one, instead of , considered another well-studied causal relation introduced by Seifert [31] and often denoted . Of course, one would readily define its extension onto through the obvious replacements in Definition 2. In fact, even obtaining the proofs of the “Seifert” versions of Proposition 1 and Theorem 2 would simply amount to swapping for throughout the whole of Section 2. This is because —just like —is always transitive and topologically closed; moreover, if the underlying spacetime is stably causal, then actually [27].
There are still some natural questions concerning the extension of onto to be addressed. In future work, we shall investigate, for example, the antisymmetry of for measures, which for pointlike events is known to hold exactly in stably causal spacetimes [27]. The question is whether stable causality already guarantees that , or if rather one has to impose some stronger requirements on the causal properties of , such as causal continuity. Another idea would be to try to recast the results of [16], which pertain to (the measure-theoretic extension of) and to -metrics, into the more general setting of the Sorkin–Woolgar causality and the metrics of lower regularity.
Acknowledgments
The author wishes to thank Stefan Suhr for his clarifying comments on Theorem 3.
Conflicts of Interest
The author declares no conflict of interest.
Appendix A
For the reader’s convenience, below we outline some topological and measure-theoretic notions and results used in the paper.
Polish spaces are completely metrizable spaces possessing a countable dense subset. Their basic examples include (second countable) manifolds as well as certain spaces of continuous functions ([32], Appendix A). They turn out to have “nice” measure-theoretic properties (exemplified by the two theorems below), which render them extremely useful in probability theory [33] and optimal transport theory [13,34].
The space of all Borel probability measures on a Polish space is usually endowed with the narrow topology, defined as the coarsest topology such that for each the real-valued map is continuous. Thus, a sequence is narrowly convergent to iff for every .
It can be shown that is a Polish space as well ([33], Chapter 18, Theorem 24), and hence narrowly convergent sequences fully determine the narrow topology.
Two seminal results of the probability theory on Polish spaces used in this paper are the Portmanteau theorem and the Prokhorov theorem ([33], Chapter 18, Theorems 6 & 17, respectively). The former provides a list of equivalent conditions for a sequence to be narrowly convergent to some μ. In particular, in the paper we make use of the following characterization:
Prokhorov’s theorem, on the other hand, characterizes the relatively compact subsets of , i.e., the subsets whose closure is compact (in the narrow topology). Concretely, is relatively compact iff it is tight, that is iff
This in particular implies that any Borel probability measure on a Polish space is inner regular, i.e., for any and any Borel
References
- Minguzzi, E.; Sánchez, M. The causal hierarchy of spacetimes. In Recent Developments in Pseudo-Riemannian Geometry (ESI Lectures in Mathematics and Physics); Alekseevsky, D.V., Baum, H., Eds.; European Mathematical Society Publishing House: Scheuchzerstrasse, Switzerland, 2008; pp. 299–358. [Google Scholar]
- Beem, J.; Ehrlich, P.; Easley, K. Global Lorentzian Geometry; Monographs and Textbooks in Pure and Applied Mathematics; CRC Press: Boca Raton, FL, USA, 1996. [Google Scholar]
- Penrose, R. Techniques of Differential Topology in Relativity; CBMS–NSF Regional Conference Series in Applied Mathematics; SIAM: Philadelphia, PA, USA, 1972. [Google Scholar]
- O’Neill, B. Semi-Riemannian Geometry with Applications to Relativity; Academic Press: Cambridge, MA, USA, 1983. [Google Scholar]
- Hawking, S.W.; Ellis, G.F.R. The Large Scale Structure of Space-Time; Cambridge Books Online; Cambridge University Press: Cambridge, UK, 1973. [Google Scholar]
- Wald, R.M. General Relativity; University of Chicago Press: Chicago, IL, USA, 1984. [Google Scholar]
- Franco, N.; Eckstein, M. An algebraic formulation of causality for noncommutative geometry. Class. Quantum Gravity 2013, 30, 135007. [Google Scholar] [CrossRef]
- Franco, N. Lorentzian Approach to Noncommutative Geometry. arXiv 2011. [Google Scholar]
- Franco, N.; Eckstein, M. Exploring the causal structures of almost commutative geometries. Symmetry Integr. Geom. Meth. Appl. 2014, 10, 010. [Google Scholar] [CrossRef]
- Eckstein, M.; Franco, N. Causal structure for noncommutative geometry. In Proceedings of the 2014 Frontiers of Fundamental Physics, Marseille, France, 15–18 July 2014.
- Franco, N.; Eckstein, M. Causality in noncommutative two-sheeted space-times. J. Geom. Phys. 2015, 96, 42–58. [Google Scholar] [CrossRef]
- Eckstein, M.; Miller, T. Causality for nonlocal phenomena. arXiv 2015. [Google Scholar]
- Ambrosio, L.; Gigli, N. A User’s Guide to Optimal Transport. In Modelling and Optimisation of Flows on Networks; Piccoli, B., Rascle, M., Eds.; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
- Villani, C. Optimal Transport: Old and New; Volume 338, Grundlehren der Mathematischen Wissenschaften; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
- Suhr, S. Theory of optimal transport for Lorentzian cost functions. arXiv 2016. [Google Scholar]
- Miller, T. Polish spaces of causal curves. J. Geom. Phys. 2017, 116, 295–315. [Google Scholar] [CrossRef]
- Eckstein, M.; Miller, T. Causal evolution of wave packets. Phys. Rev. A 2017, 95, 032106. [Google Scholar] [CrossRef]
- Gerlach, B.; Gromes, D.; Petzold, J.; Rosenthal, P. Über kausales Verhalten nichtlokaler Größen und Teilchenstruktur in der Feldtheorie. Z. Phys. 1968, 208, 381–389. [Google Scholar] [CrossRef]
- Gerlach, B.; Gromes, D.; Petzold, J. Energie und Kausalität. Z. Phys. 1969, 221, 141–157. [Google Scholar] [CrossRef]
- Gromes, D. On the problem of macrocausality in field theory. Z. Phys. 1970, 236, 276–287. [Google Scholar] [CrossRef]
- Hegerfeldt, G.C. Remark on causality and particle localization. Phys. Rev. D 1974, 10, 3320–3321. [Google Scholar] [CrossRef]
- Hegerfeldt, G.C. Violation of causality in relativistic quantum theory? Phys. Rev. Lett. 1985, 54, 2395–2398. [Google Scholar] [CrossRef] [PubMed]
- Hegerfeldt, G.C. Causality problems for Fermi’s two-atom system. Phys. Rev. Lett. 1994, 72, 596–599. [Google Scholar] [CrossRef] [PubMed]
- Hegerfeldt, G.C. Particle localization and the notion of Einstein causality. In Extensions of Quantum Theory; Horzela, A., Kapuścik, E., Eds.; Apeiron: Montreal, QC, Canada, 2001; pp. 9–16. [Google Scholar]
- Hegerfeldt, G.C.; Ruijsenaars, S.N.M. Remarks on causality, localization, and spreading of wave packets. Phys. Rev. D 1980, 22, 377–384. [Google Scholar] [CrossRef]
- Sorkin, R.D.; Woolgar, E. A causal order for spacetimes with C0 Lorentzian metrics: Proof of compactness of the space of causal curves. Class. Quantum Gravity 1996, 13, 1971–1994. [Google Scholar] [CrossRef]
- Minguzzi, E. K-Causality Coincides with Stable Causality. Commun. Math. Phys. 2009, 290, 239–248. [Google Scholar] [CrossRef]
- Minguzzi, E. Time functions as utilities. Commun. Math. Phys. 2010, 298, 855–868. [Google Scholar] [CrossRef]
- Aliprantis, C.D.; Border, K. Infinite Dimensional Analysis: A Hitchhiker’s Guide; Springer: Berlin/Heidelberg, Germany, 2006. [Google Scholar]
- Chruściel, P.T.; Grant, J.D.E.; Minguzzi, E. On Differentiability of Volume Time Functions. Annales Henri Poincaré 2016, 17, 2801–2824. [Google Scholar] [CrossRef]
- Seifert, H.J. The causal boundary of space-times. Gen. Relativ. Gravitat. 1971, 1, 247–259. [Google Scholar] [CrossRef]
- Kerr, D.; Li, H. Ergodic Theory: Independence and Dichotomies; Springer Monographs in Mathematics; Springer: Berlin/Heidelberg, Germany, 2016. [Google Scholar]
- Fristedt, B.E.; Gray, L.F. A Modern Approach to Probability Theory; Probability and Its Applications; Birkhäuser: Basel, Switzerland, 1997. [Google Scholar]
- Ambrosio, L.; Gigli, N.; Savaré, G. Gradient Flows: In Metric Spaces and in the Space of Probability Measures; Birkhäuser: Basel, Switzerland, 2008. [Google Scholar]
© 2017 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( http://creativecommons.org/licenses/by/4.0/).