The Heroic Age of Probability: Kolmogorov, Doob, Lévy, Khinchin and Feller
Abstract
:1. Introduction
2. The Pre-History
3. Kolmogorov and Measure Theoretic Foundations
- A law of the iterated logarithm for sums of independent a.s. bounded random variables with increasing a.s. bound [29] (see Theorem 10.2.1 on p. 355 of Chow and Teicher [25]). This result is a significant advance beyond the law of the iterated logarithm first established by Khinchin [30] in the special case of independent Bernoulli trials, and required the introduction of methods and tools of proof completely different from those in [30], such as exponential upper and lower bounds; these methods and tools would become basic for much future work on laws of the iterated logarithm (see, e.g., Strassen [31]). The proof given by Kolmogorov of the law of the iterated logarithm seems (unavoidably) to be rather demanding; indeed, according to Stout ([32], p. 272), the proof involves “Herculean effort”.
- Sufficient conditions for a stochastic process constructed by means of the Kolmogorov consistency theorem to be almost surely sample-path continuous (the so-called Kolmogorov–Chentsov condition—see Theorem 2.2.8 on pp. 53–54 of Karatzas and Shreve [35]).
- A significant simplification in the case of finite-variance distributions of the Lévy–Khinchin representation formula which characterizes infinitely divisible distributions (see Theorem 12.1.4, p. 432 of Chow and Teicher [25]).
- The analytical study of Markov processes over continuous time and in continuous state space in the work [36], which initiated the study of Markov diffusion processes and introduced the Kolmogorov forward and backward equations. This work is completely analytical, without any reference to sample paths, being concerned with the construction of Markov transition probabilities in terms of which one can then construct the Markov diffusion process by means of the Kolmogorov consistency theorem. The work [36] would later motivate Itô to introduce stochastic integrals and stochastic differential equations to study Markov diffusion processes from a sample-path perspective.
- Ergodic properties and mean recurrence times of Markov chains with countable state space (see Theorem 8.18, p. 152 and Theorem 8.22, p. 154 of Kallenberg [37]).
- Backward equation for pure jump-type Markov processes in a measurable space (see Theorem 12.22, p. 242 of Kallenberg [37]).
4. Doob and Stochastic Processes
- The optional sampling theorem for discrete parameter submartingales (see Theorem VII.2.1 on p. 300 and Theorem VII.2.2 on p. 302 in [39]) and for continuous parameter submartingales (see Theorem VII.11.8 on pp. 376–377 in [39]); note that the term semi-martingale is used throughout [39] in place of the current submartingale.
- The convergence theorems for discrete parameter martingales (see Theorem VII.4.1 on p. 319 of [39]) and for discrete parameter submartingales (see Theorem VII.4.1s on p. 324 of [39]), both in the “forward” direction with respect to an increasing filtration. The corresponding convergence theorems in the “reverse direction”, with respect to a decreasing filtration, are Theorem VII.4.2 on p. 328 of [39] for discrete parameter martingales and Theorem VII.4.2s on p. 329 of [39] for discrete parameter submartingales. Finally, extension of all of these convergence results to the continuous parameter case is indicated on p. 354 of [39].
5. Lévy
- 1.
- Characteristic functions and limit laws (1919–1935)
- 2.
- Independent increments processes and martingales (1930–1940)
- 3.
- Brownian motion (1938–1955).
- 1. Characteristic functions and limit laws: As already noted, Laplace introduced the method of characteristic functions for discrete distributions in the Théorie Analytique of 1812, and Lyapunov extended this method to continuous distributions having a density function, in order to prove the central limit theorem. Using the power of the Lebesgue integral Lévy defined the characteristic function for a completely general distribution function as the Fourier–Stieltjes transform of the distribution function and developed its properties to the extent that “since then, only improvements of detail have been obtained” (see Loève [59], p. 2). The result is an exceptionally powerful tool which has been in constant use by probabilists ever since (although, somewhat ironically, seldom used by Lévy himself; see p. 2 of Loève [59] and p. 303 of Taylor [63]). In particular, Lévy established
- The one-to-one correspondence between distribution functions and characteristic functions, as well as an explicit inversion formula for characteristic functions (see Theorem 8.3.1, Corollary 8.3.1 and Corollary 8.3.2, pp. 269–270 of Chow and Teicher [25]).
- The relation between multiplication of characteristic functions and convolution of the corresponding distribution functions (see Theorem 8.3.2 and Corollary 8.3.3, p. 271 of Chow and Teicher [25]).
- The basic continuity theorem which relates complete convergence of a sequence of distribution functions to uniform convergence on closed bounded intervals of the corresponding characteristic functions (see Theorem 8.3.3, p. 271 of Chow and Teicher [25], which is a slight extension due to Bochner of Lévy’s continuity theorem).
- The Lévy metric on the set of all distribution functions on the real line (see (6) on p. 260 of Chow and Teicher [25] for the definition). This metrizes weak convergence of distribution functions, and the resulting metric space is separable and complete. Furthermore, the Lévy metric prefigures the Prokhorov metric on the set of all probability measures on a given complete separable metric space; the latter metric is now a standard tool in studying weak convergence of stochastic processes and is essential for the important Strassen–Dudley theorem which relates the weak proximity of probability measures to proximity in the sense of convergence in probability of corresponding marginal random variables (see Theorem 11.6.2, p. 407 of Dudley [64]).
- Necessary and sufficient conditions for the weak convergence of sums of independent random variables to the normal distribution [65] (see Theorem 5.15, p. 93 of Kallenberg [37] for a modern rendition). The question of such necessary and sufficient conditions was of particular interest to Lévy, who ever since his 1919 lectures on probability mentioned previously had been much engaged in understanding the precise origins of the normal distribution. A particularly interesting aspect of this result is that there is no mention of mean and variance anywhere, thus demonstrating a surprising “non-relevance” of these seemingly basic notions to the central limit problem when approached at a truly fundamental level. It was in the course of this work that Lévy rediscovered the central limit theorem of Lyapunov discussed earlier. These necessary and sufficient conditions for weak convergence to the normal law were obtained independently by Feller [66], as will be discussed in Section 6.
- A generalization of the classical Borel–Cantelli lemma to conditional form (see Corollary 7.20 on p. 131 of Kallenberg [37]).
- The Lévy–Khinchin representation formula, giving a complete characterization of the infinitely divisible distributions on the real line [67]; see Theorem 12.1.3 on p. 431 of Chow and Teicher [25] for a modern rendition (this representation was obtained independently by Khinchin [68], see Section 6 which follows).
- The formulation of a particularly important subset of the set of infinitely divisible distributions, namely the set of stable distributions. In a work written jointly with Khinchin [69], Lévy completely and definitively characterized the set of all stable distributions (see Theorem 12.3.2, p. 449 of Chow and Teicher [25] for a modern treatment).
- 2. Independent increments processes and martingales: Lévy initiated and single-handedly created most of the theory of independent increments processes. Among the definitive results on these established by Lévy, we mention only the following:
- An independent increments process which is null at the origin and has continuous sample paths is necessarily Gaussian (see Theorem 13.4, p. 252 of Kallenberg [37]).
- An independent increments process which is continuous in probability has a version with cádlág sample paths and without any fixed jump times (see Theorem 15.1, p. 286 of Kallenberg [37]).
- In view of the preceding discussion, it follows that, if an independent increments process fails to be Gaussian, then the process cannot have continuous sample paths and must therefore have a “jump” component. The precise characterization of the jump component as an integral with respect to a Poisson process is given by the Itô-Lévy decomposition, for which see Theorem 15.4, p. 287 of Kallenberg [37]. This decomposition was first established by Lévy on the basis of somewhat intuitive arguments, rather than a completely water-tight proof (see Remark 6 in this regard). A completely rigorous proof was given by Itô [70].
- As already noted in Section 4, the martingale concept dates as far back as Bernstein [46]. According to Taylor ([63], p. 305), the idea of a martingale was also introduced by Lévy, who noticed the following: proofs giving limit results for sequences of independent random variables which are assumed centered at expectations, that is , , often carried over fairly easily to sequences of random variables which are assumed centered at conditional expectations, in the sense that , . In this way, Lévy discovered martingale difference sequences. Building on this penetrating observation, Lévy obtained
- A central limit theorem for martingale difference sequences. A modern (and slightly generalized) rendition of this result is Theorem 9.3.1 of Chow and Teicher ([25], p. 318); notice that the proof of this theorem in [25] is just a straightforward modification of the proof of the classical Lindeberg central limit theorem for sums of independent random variables—exactly in line with Lévy’s observation.
- A maximal inequality for martingale differences which generalizes the Kolmogorov inequality for sequences of independent zero-mean random variables (as already noted, this maximal inequality was established independently by Bernstein [47]).
- The Lévy zero-one law (see (5.8) on p. 261 of Durrett [71] and the interesting discussion of the zero-one law given there).
- 3. Brownian motion: Specializing from general independent-increments processes to Brownian motion, we again find a wealth of basic ideas and results due to Lévy. These include
- The modulus of continuity for Brownian motion sample paths (see Theorem 2.9.5, p. 114 of Karatzas and Shreve [35]).
- The quadratic variation process for Brownian motion (see Theorem 13.9, p. 255 of Kallenberg [37]).
- Construction of Brownian motion (see Section 1.2, pp. 5–8 of McKean [72], or Section 2.3, pp. 56–59 of Karatzas and Shreve [35])). A completely rigorous construction of Brownian motion was first accomplished in 1923 by Wiener [73] using a Fourier series expansion with respect to the trigonometric basis in the Hilbert space (see Itô and McKean [74], pp. 21–22 for a brief summary of this construction). Lévy noticed that this construction of Brownian motion simplifies dramatically if one uses instead the Haar functions as a basis in .
- Arcsine laws for Brownian motion (see Theorem 13.16, p. 258 of Kallenberg [37]).
- Law of the first passage time of Brownian motion (see Proposition 2.6.19, p. 88 of Karatzas and Shreve [35]).
- Laws of the running maximum of Brownian motion (see Proposition 2.8.1, Proposition 2.8.2 and Proposition 2.8.8, pp. 95–97 of Karatzas and Shreve [35]).
- Martingale characterization of Brownian motion (see Theorem 3.3.16, p. 157 of Karatzas and Shreve [35]).
- Transience and escape of Brownian motion for dimensions greater than or equal to two (see Theorem 18.6, p. 354 of Kallenberg [37]).
- Brownian local time, the basic intuitive foundations of which are to be found in Lévy [75], although rigorous proofs of existence had to await Trotter [76] and Tanaka [77] (a complete history of the Brownian local time process, including a summary of the intuitive approach of Lévy, is too lengthy to be included here, and may be found in Chung [78]).
6. Khinchin and Feller
- The law of the iterated logarithm. This magisterial result, described in Chung ([56], p. 242) as “a crowning achievement in classical probability theory”, is a profound and many-sided complement to the law of large numbers and the central limit theorem. Khinchin pioneered laws of the iterated logarithm, addressing the particular case of independent Bernoulli trials in the work [30]. The law of the iterated logarithm can be regarded as an almost-sure rate of convergence for a law of large numbers, and the result of Khinchin constitutes the “ultimate refinement” in the rate of convergence for the law of large numbers in the particular case of Bernoulli trials, improving on rates previously established by Hausdorff and by Hardy and Littlewood, and being essentially a “best possible” rate of convergence (the matter is nicely discussed in Lamperti ([84], pp. 41–49)). The law of the iterated logarithm of Khinchin served as a precursor to a more general result of this kind established by Kolmogorov [29], as already noted in Section 3. Subsequently, Khinchin [85] also established the law of the iterated logarithm for Brownian motion (see Theorem 2.9.33, p. 112 of Karatzas and Shreve [35]), which itself prefigures the more recent functional law of the iterated logarithm of Strassen [31]. It is an indication of the richness and subtlety of laws of the iterated logarithm that this genre of results, inaugurated by Khinchin, continues to be of undiminished interest, as is clear from the comprehensive survey article of Bingham [86].
- A significant generalization of the weak law of large numbers in the following setting: one is given independent identically distributed non-negative random variables , , with distribution function F on a fixed probability space (observe that integrability conditions on the are not stipulated). The goal is to establish necessary and sufficient conditions for the existence of strictly positive constants , , such that one has the convergence in probability as (here ). From Khinchin [88], the necessary and sufficient condition is that the truncated mean function
- The Lévy–Khinchin formula characterizing all infinitely divisible distributions on the real line established in [68]. As already noted in Section 5, this characterization was obtained independently by Lévy [67]. The two authors apply different methods, that of Lévy being essentially probabilistic while Khinchin uses a shorter and more direct analytic approach.
- The Khinchin inequality giving upper and lower bounds on weighted sums of independent symmetric Bernoulli random variables (see Theorem 10.3.1, p. 366 of Chow and Teicher [25]). This result has several applications in both probability and analysis, and is an essential ingredient in establishing the classical Marcinkiewicz–Zygmund inequality for sums of independent random variables.
- The Bochner–Khinchin theorem, which gives necessary and sufficient conditions for a continuous complex-valued function to be the characteristic function of a distribution function (see [93]).
- The Birkhoff–Khinchin ergodic theorem, which reformulates and generalizes the individual ergodic theorem of G.D. Birkhoff [94]. This latter result essentially concerns the long-term evolution of a discrete time dynamical system. Khinchin [95] introduces the notion of a measure-preserving transformation on a probability space, and establishes a significantly more general version of the Birkhoff theorem in terms of iterations of the measure-preserving transformation (see Theorem 5.3.3, p. 39 of Shiryayev [28]).
- The Wiener–Khinchin theorem, which expresses the autocorrelation function of a wide-sense stationary second-order stochastic process as the Fourier–Stieltjes transform of a power spectral distribution function [96].
- Necessary and sufficient conditions for the weak convergence to the normal distribution of sums of independent but not necessarily identically distributed random variables [66]; as noted in Section 5, these conditions were established independently by Lévy [65]. In the same work, [66] Feller also establishes the necessity of the classical sufficiency conditions introduced by Lindeberg [97] for the central limit theorem (see Theorem 5.12, p. 91 and Theorem 5.15, p. 93 of Kallenberg [37] for a modern treatment).
- An “ultimate” extension of the Kolmogorov law of the iterated logarithm for a.s. bounded random variables to arbitrary random variables under very mild restrictions, as well as “upper class” and “lower class” characterizations of limiting properties of sums of independent random variables under various restrictions; these are addressed in the works [101,102,103,104] (an exposition of some of these developments at the graduate textbook level can be found in Section 10.10.2 of Chow and Teicher [25]).
- A thorough makeover of much of renewal theory in the works [105,106,107]. In [106], one finds, basically for the first time, a rigorous treatment of several results in the literature on renewal theory obtained by arguments which were plausible rather than completely rigorous, as well as correction of faulty results and simplified development of other results. The short work [105], which is concerned exclusively with analysis and not probability as such, develops a very subtle argument to obtain a technical result which is of great importance to renewal theory, sometimes referred to as “the cornerstone of renewal theory”. This technical result is then deployed in [107], which is a thorough and comprehensive examination of recurrence problems in probability, including for example the probability of last return to the origin of a simple symmetric random walk on the set of all integers [107] (see Proposition 9.9, p. 165 of Kallenberg [37]).
- A comprehensive study of the semigroup concept in [108,109,110]. None of these works is specifically concerned with questions of probability theory as such, but the results are certainly pertinent to probability, being essential, for instance, to the study [111] on one-dimensional diffusions. In particular, [108] addresses “paired” parabolic partial differential equations, one equation being the formal adjoint of the other, the main goal being to relate the notion of formal “adjointness” of partial differential equations to the usual functional analytic pairing of the corresponding semigroups operating on well-defined paired Banach spaces. The motivation for [108] is, of course, in the Kolmogorov backward and forward equations, although the setting in [108] is completely abstract. The work [109] is concerned with weakening the restriction of strong measurability built into the classical definition of semigroup, while [110] is devoted to an extension of the Hille–Yosida theorem to the less restrictive notion of semigroup introduced in [109].
- A comprehensive study of one-dimensional diffusions in [111], including a full characterization of the generators, and the formulation of the boundary properties in terms of the domain of the generator for such processes (an exposition of some of the central results in [111] is to be found in Chapter 23 of Kallenberg [37]).
7. Concluding Remarks
Funding
Data Availability Statement
Conflicts of Interest
References
- Ore, O. Cardano—The Gambling Scholar; Princeton University Press: Princeton, NJ, USA, 1953. [Google Scholar]
- Ore, O. Pascal and the invention of probability theory. Am. Math. Mon. (MAA) 1960, 67, 409–419. [Google Scholar] [CrossRef]
- Maistrov, L.E. Probability Theory—A Historical Sketch; Kotz, S., Ed. and Translator; Academic Press: New York, NY, USA, 1974. [Google Scholar]
- Bolthausen, E. Bernoullis gesetz der grossen zahlen. Elem. Math. 2010, 65, 134–143. [Google Scholar] [CrossRef]
- Bolthausen, E.; Wüthrich, M.V. Bernoulli’s law of large numbers. Astin Bull. 2013, 43, 73–79. [Google Scholar] [CrossRef]
- Snell, J.L. Introduction to Probability; Random House-Birkhäuser Mathematics Series; Springer: New York, NY, USA, 1988. [Google Scholar]
- Archibald, R.C. A rare pamphlet of de Moivre and some of his discoveries. Isis 1926, 8, 671–834. [Google Scholar]
- Bellhouse, D.R. Abraham de Moivre-Setting the Stage for Classical Probability and Its Applications; CRC Press: Boca Raton, FL, USA, 2011. [Google Scholar]
- Hald, A. A History of Mathematical Statistics from 1750 to 1930; Wiley-Interscience: New York, NY, USA, 1998. [Google Scholar]
- Todhunter, I. A History of the Mathematical Theory of Probability from the Time of Pascal to that of Laplace; Macmillan & Company: Cambridge, UK; London, UK, 1865. [Google Scholar]
- Mackey, G.W. Harmonic analysis as the exploitation of symmetry-a historical survey. Bull. Am. Math. Soc. 1980, 3, 543–698. [Google Scholar] [CrossRef]
- Gnedenko, B.V.; Kolmogorov, A.N. Limit Distributions for Sums of Independent Random Variables; Addison Wesley: Reading, MA, USA, 1968. [Google Scholar]
- Seneta, E. The central limit problem and linear least squares in pre-revolutionary Russia—The background. Math. Sci. 1984, 9, 37–77. [Google Scholar]
- Lyapunov, A. Nouvelle forme du théorème sur la limite des probabilité. Mém. Acad. Sci. St. Petersbourg 1901, 1–24. [Google Scholar]
- Esseen, C. Fourier analysis of distribution functions. Acta. Math. 1945, 77, 1–125. [Google Scholar] [CrossRef]
- Berry, A.C. The accuracy of the Gaussian approximation to the sum of independent variates. Trans. Am. Math. Soc. 1941, 49, 122–136. [Google Scholar] [CrossRef]
- Seneta, E. Early influences on probability and statistics in the Russian empire. Arch. Hist. Exact Sci. 1998, 53, 201–213. [Google Scholar] [CrossRef]
- Doob, J.L. William Feller and twentieth century probability. In Proceedings of the Sixth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA, USA, 16–21 June 1971; University California Press: Berkeley, CA, USA, 1972; pp. xv–xx. [Google Scholar]
- Lebesgue, H. Intégral, longeur, aire (Thèse, Univ. Paris). Ann. Mat. Pura Appl. 1902, 7, 231–359. [Google Scholar] [CrossRef]
- Fréchet, M. Sur l’intégrale d’une fonctionnelle étendue à un ensemble abstrait. Bull. Soc. Math. France 1915, 43, 248–265. [Google Scholar] [CrossRef]
- Kolmogorov, A.N. Grundbegriffe der Wahrscheinlichkeitsrechnung; (English Translation Foundations of the Theory of Probability; Chelsea: New York, NY, USA, 1950); Springer: Berlin/Heidelberg, Germany, 1933. [Google Scholar]
- Williams, D. Probability with Martingales; Cambridge Univerity Press: Cambridge, UK, 1991. [Google Scholar]
- Kendall, D.G. Obituary–Andrei Nikolaevich Kolmogorov, 1903–1987, D.G. Kendall Ed. Bull. Lond. Math. Soc. 1990, 22, 31–100. [Google Scholar] [CrossRef]
- Khinchin, A.Y.; Kolmogorov, A.N. Uber konvergenz von Reihen deren Glieder durch den Zufall bestimmt werden. Mat. Sb. 1925, 32, 668–676. [Google Scholar]
- Chow, Y.S.; Teicher, H. Probability Theory—Independence, Interchangeablilty, Martingales, 2nd ed.; Springer: New York, NY, USA, 1988. [Google Scholar]
- Kolmogorov, A.N. Ueber die Summen durch den Zufall bestimmter unabhangiger Grossen. Math. Ann. 1928, 309–319. [Google Scholar] [CrossRef]
- Kolmogorov, A.N. Sur la loi forte des grands nombres. C.R. Acad. Sci. Paris 1930, 191, 910–912. [Google Scholar]
- Shiryayev, A.N. Probability—2, 3rd ed.; Springer GTM Series no. 95; Springer: Berlin/Heidelberg, Germany, 2019. [Google Scholar]
- Kolmogorov, A.N. Ueber das Gesetz des iterierten Logarithmus. Math. Ann. 1929, 101, 126–135. [Google Scholar] [CrossRef]
- Khinchin, A.Y. Ueber einen Satz der Wahrscheinlichkeitsrechnung. Fund. Math. 1924, 6, 9–20. [Google Scholar] [CrossRef]
- Strassen, V. An invariance principle for the law of the iterated logarithm. Z. Wahrsch. Verw. Geb. 1964, 3, 211–226. [Google Scholar] [CrossRef]
- Stout, W.F. Almost Sure Convergence; Academic Press: New York, NY, USA, 1974. [Google Scholar]
- Kolmogorov, A.N. Sulla determinazione empirica di una legge di distribuzione. Giorn. Ist. Ital. Attuari 1933, 83–91. [Google Scholar]
- Dacunha-Castelle, D.; Duflo, M. Probability and Statistics Volume II; Springer: New York, NY, USA, 1986. [Google Scholar]
- Karatzas, I.; Shreve, S.E. Brownian Motion and Stochastic Calculus; Springer: New York, NY, USA, 1991. [Google Scholar]
- Kolmogorov, A.N. Ueber die analytischen Methoden in der Wahrscheinlichkeitsrechnung. Math. Ann. 1931, 104, 415–458. [Google Scholar] [CrossRef]
- Kallenberg, O. Foundations of Modern Probability, 2nd ed.; Springer: New York, NY, USA, 2002. [Google Scholar]
- Doob, J.L. Kolmogorov’s early work on convergence theory and foundations. Ann. Probab. 1989, 17, 815–821. [Google Scholar] [CrossRef]
- Doob, J.L. Stochastic Processes; Wiley-Interscience: New York, NY, USA, 1953. [Google Scholar]
- Doob, J.L. Stochastic processes depending on a continuous parameter. Trans. Am. Math. Soc. 1937, 42, 107–140. [Google Scholar] [CrossRef]
- Loève, M. Probability Theory II, 4th ed.; Springer: New York, NY, USA, 1978. [Google Scholar]
- Billingsley, P. Probability and Measure, 3rd ed.; Wiley: New York, NY, USA, 1995. [Google Scholar]
- Getoor, R.J.L. Doob: Foundations of stochastic processes and probabilistic potential theory. Ann. Prob. 2009, 37, 1647–1663. [Google Scholar] [CrossRef]
- Doob, J.L. The development of rigor in mathematical probability (1900–1950). Am. Math. Mon. 1996, 103, 586–595. [Google Scholar]
- Dellacherie, C.; Meyer, P.A. Probabilities and Potential; North Holland Mathematics Studies, no. 29; North-Holland Publishing Co.: Amsterdam, The Netherlands, 1978. [Google Scholar]
- Bernstein, S.N. Sur l’extension du théorème limite du calcul des probabilités aux sommes de quantités dépendants. Math. Ann. 1927, 97, 1–59. [Google Scholar] [CrossRef]
- Bernstein, S.N. On some modifications of Chebyshev’s inequality. Dokl. Akad. Nauk SSSR 1937, 17, 275–278. [Google Scholar]
- Ville, J. Étude Critique de la Notion du Collectif; Gauthier-Villars: Paris, France, 1939. [Google Scholar]
- Doob, J.L. Regularity properties of certain families of chance variables. Trans. Am. Math. Soc. 1940, 44, 455–486. [Google Scholar] [CrossRef]
- Doob, J.L. Continuous parameter martingales. In Proceedings of the Second Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA, USA, 31 July–12 August 1950; University California Press: Berkeley, CA, USA, 1951; pp. 269–277. [Google Scholar]
- Rogers, L.C.G.; Williams, D. Diffusions, Markov Processes, and Martingales—Volume One: Foundations, 2nd ed.; Wiley: Chichester, UK, 1994. [Google Scholar]
- Shiryayev, A.N. Probability—1, 3rd ed.; Springer GTM Series no. 95; Springer: Berlin/Heidelberg, Germany, 2016. [Google Scholar]
- Doob, J.L. Stochastic processes with an integral-valued parameter. Trans. Am. Math. Soc. 1938, 44, 87–150. [Google Scholar]
- Meyer, P.A. A decomposition theorem for supermartingales. Ill. J. Math. 1962, 193–205. [Google Scholar] [CrossRef]
- Meyer, P.A. Decomposition of supermartingales: The uniqueness theorem. Ill. J. Math. 1963, 1–17. [Google Scholar] [CrossRef]
- Chung, K.L. A Course in Probability Theory, 3rd ed.; Academic Press: New York, NY, USA, 2001. [Google Scholar]
- Sucheston, L. Book reviews of A Course in Probability Theory by K.L. Chung, and Probability by L. Breiman. Bull. Am. Soc. 1969, 75, 706–709. [Google Scholar] [CrossRef]
- Parthasarathy, K.R. Probability Measures on Metric Spaces; Academic Press: New York, NY, USA, 1967. [Google Scholar]
- Loève, M. Paul Lévy, 1886–1971. Ann. Prob. 1973, 1, 1–18. [Google Scholar] [CrossRef]
- Lévy, P. Calcul des Probabilités; Gauthier-Villars: Paris, France, 1925. [Google Scholar]
- Bingham, N.H. Studies in the history of probability and statistics XLVI. Measure into probability: From Lebesgue to Kolmogorov. Biometrika 2000, 87, 145–156. [Google Scholar] [CrossRef]
- Lévy, P. Quelques Aspects de la Pensée d’un Mathématicien; Gauthier-Villars: Paris, France, 1970. [Google Scholar]
- Taylor, S.J. Paul Lévy. Bull. Lond. Math. Soc. 1975, 7, 300–320. [Google Scholar] [CrossRef]
- Dudley, R.M. Real Analysis and Probability; Cambridge Univerity Press: Cambridge, UK, 2002. [Google Scholar]
- Lévy, P. Propriétés asymptotiques des sommes de variables aléatoires indépendantes ou enchainées. J. Math. Pures Appl. 1935, 14, 347–402. [Google Scholar]
- Feller, W. Uber den zentralen Grenzwertsatz der Wahrscheinlichkeitsrechnung I. Math. Z. 1935, 40, 521–559. [Google Scholar] [CrossRef]
- Lévy, P. Sur les intégrales dont les éléments sont des variables aléatoires indépendantes. Ann. Sc. Norm. Sup. Pisa 1934, 3, 337–366. [Google Scholar]
- Khinchin, A.Y. A new derivation of a formula of M. Paul Lévy. Bull. Moscou Univ. 1937, 1, 1–5. [Google Scholar]
- Khinchin, A.Y.; Lévy, P. Sur les lois stables. C. R. Acad. Sci. Paris 1936, 374–376. [Google Scholar]
- Itô, K. On stochastic processes (I) (infinitely divisible laws of probability). Jpn. J. Math. 1942, XVIII, 261–301. [Google Scholar]
- Durrett, R. Probability: Theory and Examples, 3rd ed.; Thomson Brooks/Cole: Belmont, CA, USA, 2005. [Google Scholar]
- McKean, H.P. Stochastic Integrals; Academic Press: New York, NY, USA, 1969. [Google Scholar]
- Wiener, N. Differential space. J. Math. Phys. 1923, 131–174. [Google Scholar] [CrossRef]
- Itô, K.; McKean, H.P. Diffusion Processes and their Sample Paths; Springer: New York, NY, USA, 1974. [Google Scholar]
- Lévy, P. Processus Stochastiques et Mouvement Brownien; Gauthier-Villars: Paris, France, 1948. [Google Scholar]
- Trotter, H.F. A property of Brownian motion paths. Ill. J. Math. 1958, 2, 425–433. [Google Scholar] [CrossRef]
- Tanaka, H. Note on continuous additive functionals of the 1-dimensional Brownian path. Z. Wahrsch. Verw. Geb. 1963, 1, 251–257. [Google Scholar] [CrossRef]
- Chung, K.L. Reminiscences of some of Paul Lévy’s ideas in Brownian motion and in Markov chains. In Seminar on Stochastic Processes 1988; Cinlar, E., Chung, K.L., Getoor, R.K., Eds.; Birkhauser: Boston, MA, USA, 1986. [Google Scholar]
- Doob, J.L. Obituary-Paul Levy. J. Appl. Prob. 1972, 9, 870–872. [Google Scholar] [CrossRef]
- Halmos, P.R. I Want to Be a Mathematician—An Automathography; Springer: New York, NY, USA, 1985. [Google Scholar]
- Stroock, D.W.; Varadhan, S.R.S. (Eds.) Kiyoshi Itô—Selected Papers; Springer: New York, NY, USA, 1986. [Google Scholar]
- Lévy, P. Théorie de l’Addition des Variables Aléatoires; Gauthier-Villars: Paris, France, 1937. [Google Scholar]
- Gnedenko, B.V. Alexander Yakovlevich Khinchin-Obituary. Theor. Prob. Appl. 1960, 5, 1–4. [Google Scholar] [CrossRef]
- Lamperti, J. Probability; Benjamin Cummings: Reading, MA, USA, 1966. [Google Scholar]
- Khinchin, A.Y. Asymptotische Gesetze der Wahrscheinlichkeitsrechnung; Ergebnisse der Mathematik; Springer: Berlin/Heidelberg, Germany, 1933. [Google Scholar]
- Bingham, N.H. Variants on the law of the iterated logarithm. Bull. Lond. Math. Soc. 1986, 18, 433–467. [Google Scholar] [CrossRef]
- Khinchin, A.Y. Sul dominio di attrazione delle legge di Gauss. Giorn. Ist. Ital. Attuari 1935, 6, 371–393. [Google Scholar]
- Khinchin, A.Y. Su una legge dei grandi numeri generalizzata. Giorn. Ist. Ital. Attuari 1936, 7, 365–377. [Google Scholar]
- Seneta, E. A tricentary history of the law of large numbers. Bernoulli 2013, 19, 1088–1121. [Google Scholar] [CrossRef]
- Khinchin, A.Y. Zur Theorie der unbeschrankt teilbaren Verteilungsgesetze. Mat. Sb. 1937, 44, 79–120. [Google Scholar]
- Sato, K.-I. Lévy Processes and Infinitely Divisible Distributions; Cambridge Univerity Press: Cambridge, UK, 1999. [Google Scholar]
- Khinchin, A.Y. Sur la croissance locale des processus stochastiques homogènes à accroissements indépendants. Izv. Acad. Nauk SSSR 1939, 3, 487–505. [Google Scholar]
- Feller, W. An Introduction to Probability Theory and Its Applications; Wiley: New York, NY, USA, 1966; Volume 2. [Google Scholar]
- Birkhoff, G.D. Proof of the ergodic theorem. Proc. Nat. Acad. Sci. USA 1932, 17, 656–660. [Google Scholar] [CrossRef]
- Khinchin, A.Y. Zu Birkhoffs Losung des Ergodenproblems. Math. Ann. 1933, 107, 485–488. [Google Scholar] [CrossRef]
- Khinchin, A.Y. Korrelationstheorie der stationaren stochastischen Prozesse. Math. Ann. 1934, 109, 604–615. [Google Scholar] [CrossRef]
- Lindeberg, J.W. Eine neue herleitung des exponentialgesetz in der wahrscheinlichkeitsrechnung. Math. Z. 1922, 15, 211–225. [Google Scholar] [CrossRef]
- Feller, W. Zur Theorie der stochastischen Prozesse. (Existenz und Eindeutigkeitssatze). Math. Ann. 1936, 113, 113–160. [Google Scholar] [CrossRef]
- Feller, W. Uber das Gesetz der grossen Zahlen. Acta Litt. Sci. Szeged 1937, 8, 191–201. [Google Scholar]
- Feller, W. Neuer Beweis fur die Kolmogoroff-P. Levysche Charakterisierung der Unbeschrankt teilbaren Verteilungsfunktion. Bull. Int. Acad. Yugosl. Cl. Sci. Math. Nat. 1939, 32, 1–8. [Google Scholar]
- Feller, W. The general form of the so-called law of the iterated logarithm. Trans. Am. Math. Soc. 1943, 54, 373–402. [Google Scholar] [CrossRef]
- Feller, W. The law of the iterated logarithm for identically distributed random variables. Ann. Math. 1946, 47, 631–638. [Google Scholar] [CrossRef]
- Feller, W. An extension of the law of the iterated logarithm to variables without variance. J. Math. Mech. 1968, 18, 343–355. [Google Scholar] [CrossRef]
- Feller, W. General analogues to the law of the iterated logarithm. Z. Wahrsch. Geb. 1969, 14, 21–26. [Google Scholar] [CrossRef]
- Erdos, P.; Feller, W.; Pollard, H. A property of power series with positive coefficients. Bull. Am. Math. Soc. 1949, 55, 201–204. [Google Scholar] [CrossRef]
- Feller, W. On the integral equation of renewal theory. Ann. Math. Stat. 1941, 12, 243–267. [Google Scholar] [CrossRef]
- Feller, W. Fluctuation theory of recurrent events. Trans. Am. Math. Soc. 1949, 67, 98–119. [Google Scholar] [CrossRef]
- Feller, W. The parabolic differential equations and the associated semi-groups of transformations. Ann. Math. 1952, 55, 468–519. [Google Scholar] [CrossRef]
- Feller, W. Semi-groups of transformations in general weak topologies. Ann. Math. 1953, 57, 287–308. [Google Scholar] [CrossRef]
- Feller, W. On the generation of unbounded semi-groups of bounded linear operators. Ann. Math. 1953, 58, 166–174. [Google Scholar] [CrossRef]
- Feller, W. Diffusion processes in one dimension. Trans. Am. Math. Soc. 1954, 77, 1–31. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Heunis, A.J. The Heroic Age of Probability: Kolmogorov, Doob, Lévy, Khinchin and Feller. Mathematics 2025, 13, 867. https://doi.org/10.3390/math13050867
Heunis AJ. The Heroic Age of Probability: Kolmogorov, Doob, Lévy, Khinchin and Feller. Mathematics. 2025; 13(5):867. https://doi.org/10.3390/math13050867
Chicago/Turabian StyleHeunis, Andrew J. 2025. "The Heroic Age of Probability: Kolmogorov, Doob, Lévy, Khinchin and Feller" Mathematics 13, no. 5: 867. https://doi.org/10.3390/math13050867
APA StyleHeunis, A. J. (2025). The Heroic Age of Probability: Kolmogorov, Doob, Lévy, Khinchin and Feller. Mathematics, 13(5), 867. https://doi.org/10.3390/math13050867