Next Article in Journal
Hierarchical Object Part Learning Using Deep Lp Smooth Symmetric Non-Negative Matrix Factorization
Previous Article in Journal
Asymmetric Dark Matter in Baryon Asymmetrical Universe
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Indeterminate Stieltjes Moment Problem: Entropy Convergence

by
Pier Luigi Novi Inverardi
* and
Aldo Tagliani
Department of Economics & Management, University of Trento, 38100 Trento, Italy
*
Author to whom correspondence should be addressed.
Symmetry 2024, 16(3), 313; https://doi.org/10.3390/sym16030313
Submission received: 8 February 2024 / Revised: 27 February 2024 / Accepted: 1 March 2024 / Published: 6 March 2024

Abstract

:
The aim of this paper is to consider the indeterminate Stieltjes moment problem together with all its probability density functions that have the positive real or the entire real axis as support. As a consequence of the concavity of the entropy function in both cases, there is one such density that has the largest entropy: we call it f hmax , the largest entropy density. We will prove that the Jaynes maximum entropy density (MaxEnt), constrained by an increasing number of integer moments, converges in entropy to the largest entropy density f hmax . Note that this kind of convergence implies convergence almost everywhere, with remarkable consequences in real applications in terms of the reliability of the results obtained by the MaxEnt approximation of the underlying unknown distribution, both for the determinate and the indeterminate case.

1. Introduction

For an absolutely continuous random variable X with the entire real axis as support and the associated Hamburger moment problem determinate, [1]—Thm.1 and [2] proved that the MaxEnt approximation converges in entropy to the unknown density f of X. Well-known necessary and sufficient conditions for the determinacy of the moment problem, as well as their geometric implications, were used to prove the above convergence result. In the present work, our goals consist of the following:
  • improve the proof of convergence in entropy provided by [1], Section 3, when the Stieltjes moment problem is determinate, listing through the use of the limit parabolic regions described below all the possible cases that may arise;
  • state and prove an analogous entropy convergence result in the case of the indeterminate Stieltjes moment problem. That extension is not immediate and additional considerations of MaxEnt distribution properties are required.
An enormous amount of material is available in the literature on the topic of the indeterminate moment problem, which has been extensively treated in the last and present centuries, employing different techniques of investigation. However, we will address this topic through a novel approach based on the notion of differential entropy and Jaynes’ MaxEnt techniques ([3]). More specifically, well-known necessary and sufficient conditions for Stieltjes moment problem determinacy/indeterminacy in terms of Hankel matrix determinants will be employed.
Such necessary and sufficient conditions, formulated through Theorems 3 and 4 below, have an interesting and enlightening geometric meaning. As a consequence, they will enable us to introduce and then exploit the notion of limit parabolic regions as a graphical tool for an easy and immediate understanding of such determinacy/indeterminacy conditions. Next, we will use these tools to prove the convergence in entropy of the MaxEnt solution of the indeterminate Stieltjes moment problem, which is the main result of the present paper.
There are several motivations to study the convergence in entropy of { f M } M = 1 to f hmax , the density with the largest entropy among those sharing the given integer moment sequence { μ k } k = 1 , μ 0 = 1 .
The original contribution of this paper is to exploit one of the characteristic properties of Shannon entropy; namely, that it is a concave functional.
Let C denote the infinite set of densities f, which are the solutions of the moment problem. It is noteworthy that in an M-infinite problem, by virtue of its concavity, the Shannon entropy induces an ordering principle in terms of entropy for the elements of C : this guarantees the existence of a unique f hmax , distinguishable from all other functions f C such that it can be considered the genuine (that is, the most non-committal in Jaynes’ spirit) MaxEnt representative of all densities in C . At the same time, it makes f hmax the candidate limit to which the MaxEnt approximations converge in entropy, analogously to what happens in the determinate case, where the maximum entropy approximations converge to the unique density compatible with the information carried by the sequence of moments { μ k } k = 0 .
Consistent with our purposes, of all the densities, f hmax is the most important because, when all known information is taken into account, the system with the largest entropy is the most likely state as it is the system in which the least amount of information has been defined ([4]).
Moreover, the convergence in entropy is a stronger mode of convergence, which in turn implies convergence in direct divergence, in L 1 , in probability and in distribution. This concatenation of convergence modes has been proved in the work by [5], Section 4.3, for the case of fractional moments. However, considering that both integer and fractional moments belong to the so-called Tchebycheff systems, these convergence modes can be extended to the case of integer moments. Combining the convergence theorem in entropy proved below with the fact that f hmax has the largest entropy H [ f hmax ] , one could find reasonable answers to several questions. Here, we limit ourselves to mentioning the following two.
  • Consider the so-called Stieltjes class S R + ( f 0 , h ) ([6], Section 3) with a center at f 0 and perturbation function h defined as follows:
    S R + ( f 0 , h ) = S ϵ ( x ) = : { f 0 ( x ) [ 1 + ϵ h ( x ) ] , x R + , ϵ [ 1 , 1 ] } .
    where f 0 ( x ) , x R + is a continuous probability density function, that is, f 0 = F , where F is the distribution function of some random variable X and h ( x ) is a non-identically zero continuous function, satisfying the condition 1 h ( x ) 1 , for all x R + . Then, S R + ( f 0 , h ) is an infinite family of densities all sharing the same moment sequence as f 0 and obviously requires knowledge of both functions f 0 and h. The latter satisfies the infinite relations R + x k f 0 ( x ) h ( x ) d x = 0 , con k = 1 , 2 , . Usually, both f 0 and h are chosen by trial and error to satisfy (1). It is well-known that convergence in entropy implies convergence almost everywhere. Consequently, it makes sense to assume f 0 f hmax since it is distinguishable (hence, identifiable) from all densities in C . Recently, h has been obtained with a precise theoretical procedure by [7], Thm. 1.2, starting from f 0 . Therefore, through the combination of the proposal to assume f 0 f hmax with the theoretical procedure of López-Garcia for the calculation of h, a complete theoretical program for the construction of Stieltjes classes solely using the given moment sequence is outlined.
    The originality of our study lies in the fact that, within the class of densities, we identify a particular one, f hmax , with the highest entropy H [ f hmax ] . This density is the limit of a sequence of MaxEnt densities with an increasing number of moments coinciding with the given one.
  • In the so-called Method of Moments, [8], Section 6, p. 540–541, proved the following.
    Theorem 1 ([9]). Let F M and F be probability distributions supported on R with finite moments of all orders, which we denote by μ k ( M ) and μ k , respectively. If F is the only distribution with the moments { μ k } k = 1 and if lim M μ k ( M ) = μ k for k = 1 , 2 , , then the sequence { F M } M = 1 converges in distribution to F; that is
    lim M F M ( x ) = F ( x )
    at any point of continuity of F ( x ) .
    The uniqueness of F ( x ) is assumed to be not only sufficient for the validity of the relation (2) (at any point of continuity) but also necessary. Theorem 7 below leads us to conjecture that the assumed moment problem determinacy condition, designed to guarantee convergence in distribution, could be sufficient but not necessary. This topic will be investigated in future research.
The paper is organized follows: Section 2 presents two crucial known results that state necessary and sufficient conditions for the existence and the determinacy of the Hamburger and Stieltjes moment problem solution in terms of Hankel matrices and their determinants; their geometrical meaning is clarified through the relationships between the elements that characterize the corresponding limit parabolic regions. Section 3 exploits these regions to reformulate the necessary and sufficient conditions for the indeterminacy of the Hamburger and Stieltjes moment problem, preparing the theoretical framework for the proof of the convergence in entropy of the MaxEnt solution in the Stieltjes indeterminate case given in Section 4. The conclusions follow in Section 5.

2. Moment Problem: Existence and Determinacy Conditions

Suppose Ω is a continuum, ( Ω , A ) a measurable space and ν a probability measure defined on A . Let X be a continuous real-valued random variable defined on the probability space ( Ω , A , ν ) with U X R as support and an absolutely continuous distribution function F and suppose that all (positive) integer moments of the random variable X
μ k = E [ X k ] = U X x k d F ( x ) , k = 0 , 1 , 2 , , μ 0 = 1
exist (or, equivalently, are finite). If F is the unique distribution having the sequence { μ k } k = 1 as its moment sequence, then F is uniquely determined by its moments; that is, we say that F is M-determinate (M-det), and the same holds for ν because there is always a (probability) measure ν behind the sequence { μ k } k = 1 . Otherwise, F is M-indeterminate (M-indet): in this case, there exists more than one distribution (potentially, an infinite number) sharing the same moment sequence { μ k } k = 1 and, consequently, F is not uniquely determined by its moments and the same conclusion holds for the probability measure ν .
We use standard terminology and notations generally accepted in the moment problem framework: if U X = R , we say { μ k } k = 1 is a Hamburger moment sequence, while if U X = R + , { μ k } k = 1 is a Stieltjes moment sequence.
Books such as those by [8,10,11] and, more recently, [12] are widely known references on this topic and contain exhaustive details on the debate over and progress made in the moment problems for a century.
The following deep analytical and non-trivial result can be obtained (see [13] for references): for a real moment sequence { μ k } k = 1 , we define the ( N + 1 ) × ( N + 1 ) -Hankel matrices as follows:
Δ N = μ i + j i , j = 0 N a n d Δ N , p = μ i + j + p i , j = 0 N , p = 1 , 2 , 3 , 4 , N N
where Δ N is the basic Hankel matrix with Δ 0 = [ μ 0 ] , while Δ N , p , p = 1 , 2 , 3 , 4 , are the shifted Hankel matrices. More precisely, Δ N , 1 = ( μ i + j + 1 ) i , j = 0 N is based on the shifted moment sequence { μ 1 , μ 2 , } generated by the measure ν 1 with d ν 1 = x d ν , while Δ N , 2 = ( μ i + j + 2 ) i , j = 0 N is based on the shifted moment sequence { μ 2 , μ 3 , } generated by the measure ν 2 with d ν 2 = x 2 d ν , and similarly for Δ N , 3 and Δ N , 4 . This is similar to the case of the basic Hankel matrix, Δ 0 , p = [ μ p ] . Further, let
D N : = det Δ N = Δ N a n d D N , p : = det Δ N , p = Δ N , p
be the determinants of the Hankel matrices, as defined in (3). It is a well-known fact that the positivity of the determinants (4) of all orders N is a necessary and sufficient condition for the existence of the moment problem solution. In other words, Hankel matrices (3) must be positive definite for all N N .
The following theorem states a well-known result:
Theorem 2
(Fundamental Theorem [14]). Suppose that F is M-indet. Then, there are infinitely many absolutely continuous distributions and infinitely many purely discrete distributions, all of them sharing the same moments of F.
In what follows, since we need both to specify some special values of μ 0 and μ 1 and at the same time think μ 1 as continuously varying, we will denote with μ 0 * and μ 1 * the values of μ 0 and μ 1 that have been preassigned.
The following Theorems 3 and 4, jointly with the geometric interpretation of the determinacy/indeterminacy conditions as developed by [15], allow us to draw the limit parabolic regions. They play a fundamental role for the aims of this paper as we will see in Section 4.2, where different determinate–indeterminate configurations of the Stieltjes moment problem (jointly with the Hamburger one) will be considered in terms of ( μ 0 , μ 1 ) .
Theorem 3
([8], Thm. 2.18, p. 69). Let ν be a measure associated with the positive definite Hamburger moment sequence { μ k } k = 0 . Then, ν is determinate if and only if at least one of the quantities D N D N 1 , 2 and D N 1 , 2 D N 2 , 4 has a limit of zero as N .
Theorem 3 has an interesting and enlightening geometric meaning. Consider the finite moment set { μ k } k = 0 M with M = 2 N , where μ 0 varies continuously whilst the remaining ones are kept fixed. The relationship
D N D N 1 , 2 = μ 0 * μ 0 , M
holds, where μ 0 , M comes from the equation D N ( μ 0 , M ) = 0 , replacing μ 0 * with μ 0 , M , so that the sequence { μ 0 , M } M = 2 is monotonic non-decreasing and, as M , converges to a limit; say, μ 0 , = lim M μ 0 , M μ 0 * . From Theorem 3, in the indeterminacy case,
μ 0 * μ 0 , > 0 .
holds. This inequality will be useful in what follows, when the limit parabolic regions are obtained. Similarly, consider once more { μ k } k = 0 M , where μ 1 varies continuously whilst the remaining ones are kept fixed. If D N , the determinant of the Hankel matrix Δ N , is regarded as a function of μ 1 , D N = D N ( μ 1 ) is a quadratic function with leading term D N 2 , 4 < 0 , so that
D N D N 2 , 4 = ( μ 1 * μ 1 , M + ) ( μ 1 * μ 1 , M )
where μ 1 , M and μ 1 , M + are solutions of the equation D N ( μ 1 , M ± ) = 0 with μ 1 , M μ 1 * μ 1 , M + . Since Δ N is symmetric and positive definite, as M increases, μ 1 , M is monotonically non-decreasing whilst μ 1 , M + is monotonically non-increasing, so that the intervals [ μ 1 , M , μ 1 , M + ] for each value of M are nested. Combining Theorem 3 with Equation (7), it follows that
D N D N 1 , 2 · D N 1 , 2 D N 2 , 4 = D N D N 2 , 4 = ( μ 1 * μ 1 , M + ) ( μ 1 * μ 1 , M )
and, as a consequence, the determinacy/indeterminacy convergence conditions can be drawn. In the H-det case,
lim n μ 1 , M = lim n μ 1 , M + = μ 1 *
while in the H-indet case,
lim M μ 1 , M < μ 1 * , lim M μ 1 , M + > μ 1 *
strictly. Equations (9) and (10) are the distinguishing features of the H-det and H-indet cases. At the end, they allow us to define the Hamburger moment problem determinacy or indeterminacy conditions only in terms of the differences μ 1 * μ 1 , or μ 1 , + μ 1 * . This aspect will play an important role in Section 3, in which the Stieltjes moment problem determinacy/indeterminacy is geometrically interpreted through the limit parabolic regions.
Theorem 4
([15], Lemma 3). Let ν be a measure associated with the positive definite Stieltjes moment sequence { μ k } k = 0 . Then, ν is determinate if and only if at least one of the quantities D N D N 1 , 2 and D N , 1 D N 1 , 3 has a limit of zero as n .
Theorem 4, like Theorem 3, has an interesting and enlightening geometric meaning (note that the quantity D N D N 1 , 2 has been previously investigated in the Hamburger moment problem). As far as D N , 1 D N 1 , 3 is concerned, consider the finite moment set { μ k } k = 1 M with M = 2 N + 1 , where μ 1 varies continuously whilst the remaining ones are kept fixed. The following relationship holds:
D N , 1 D N 1 , 3 = μ 1 * μ 1 , M
where the number μ 1 , M is the unique solution of the equation D N , 1 ( μ 1 , M ) = 0 in which μ 1 * is replaced with μ 1 , M . The numerical sequence { μ 1 , M } M = 1 is monotonic non-decreasing and, as M , convergent to a limit; say, μ 1 , μ 1 * . The indeterminacy condition is formulated as follows:
μ 1 * μ 1 , > 0
In the end, Equations (6)–(12) allow us to define the Stieltjes moment problem determinacy or indeterminacy conditions only in terms of the difference μ 0 * μ 0 , and μ 1 * μ 1 , (the latter coming from (11) and (12)). This aspect will play an important role in Section 4 where convergence in entropy of the MaxEnt approximation (14) of f will be discussed in the S-indet case.
Notice that, in order not to burden the notation in the two Hamburger and Stieltjes cases, given the first M moments { μ k } k = 1 M , we denoted with the same symbol μ 1 , M two quantities that are solutions of two distinct equations: in the Hamburger case, with M = 2 N , D N ( μ 1 , M ) = 0 ; and in the Stieltjes case, with M = 2 N or M = 2 N + 1 , D N , 1 ( μ 1 , M ) = 0 . We specify that, in the subsequent Stieltjes problem investigation, the values of μ 1 * μ 1 , M will be drawn from (11) rather than from (7), in accordance with the next Theorem 6.

3. Limit Parabolic Regions

The limit parabolic regions represent an enlightening device for giving an intuitive graphical representation of the determinacy or indeterminacy conditions for the Stieltjes or Hamburger moment problems. In this paper, we are mostly interested in the indeterminacy condition.
1.
Hamburger case
Suppose { μ k } k = 0 is a moment sequence for which the moments { μ 2 , μ 3 , } are kept fixed while the moments μ 0 and μ 1 are left to be continuously varying. Setting x = μ 0 and y = μ 1 , the whole moment sequence can be written as { x , y , μ 2 , μ 3 , } and the numbers x and y (i.e., the moments preceding μ 2 ) cannot be arbitrary.
Following [15], we set M = 2 N and consider the moments:
( μ 0 , μ 1 , μ 2 , , μ M ) = ( x , y , μ 2 , , μ M )
where, as before, μ 2 , , μ M are fixed and x = μ 0 and y = μ 1 can vary continuously. For each M, the relation D N ( x , y ) 0 defines a closed convex region as follows:
P M : = { ( x , y ) R 2 : D N ( x , y ) 0 }
which is bounded by a proper parabola with horizontal axis and vertex in the right-half plane. Since, for M = 1 , 2 , , the Hankel matrices Δ N ( x , y ) are positive semidefinite, the regions P M are nested (i.e., P M P M 1 ) and the element of interest is their intersection, P : = M = 1 P M , called a limit parabolic region. One possibility is that P is a proper closed region in the right-half plane such that P is bounded by a proper parabola and contains the initially given moments ( μ 0 , μ 1 ) . The alternative is that P degenerates, becoming just a ray.
Theorem 5 ([15], Thm 1). Let { x , y , μ k } k = 2 be a Hamburger moment sequence for the measure ν. Then, ν is indeterminate if and only if the point ( μ 0 * , μ 1 * ) is an interior point of the limit parabolic region P .
Theorem 5 is the geometric equivalent to the algebraic formulation of the Hamburger moment problem indeterminacy given in terms of Equations (6) and (10).
2.
Stieltjes case
Suppose that { μ k } k = 0 is a Stieltjes moment sequence. In this case, we develop a procedure similar to that in the Hamburger case but with some specific features. For doing this, we rely essentially on two known results. First, we recall that every Stieltjes sequence can also be considered a Hamburger sequence; hence, the existence of the limit parabolic region P is assured and defined by the two relations D N ( x , y ) 0 and D N 1 , 1 ( y ) 0 , in which we use x = μ 0 and y = μ 1 . Second, P = { ( x , y ) : x μ 0 , , y = μ 1 , } , a ray, or P is the intersection of proper limit parabolic regions in the half plain { y μ 1 , } . Taking into account Theorem 5, the following conclusion can be drawn.
Theorem 6 ([15], Thm 2). The positive definite Stieltjes moment sequence { μ k } k = 0 generates an indeterminate measure ν if and only if two conditions are satisfied:
(i) 
the point ( μ 0 * , μ 1 * ) is an interior point of the limit parabolic region P ;
(ii) 
μ 1 , < μ 1 * .
Again, Theorem 6 is geometrically equivalent to the algebraic formulation of the Stieltjes moment problem indeterminacy given in terms of Equations (6), (11) and (12).

4. Stieltjes Moment Problem

The inverse moment problem consists of recovering the unknown PDF f knowing the sequence of its moments { μ k } k = 1 ([8]). For practical purposes, only a finite number M of moments { μ k } k = 1 M is considered so that what we have is the so-called reduced (or truncated) moment problem, and it may have infinitely many solutions. Then, one needs a criterion for selecting the most appropriate probability density function among the available solutions. The strategy of selecting the least biased estimate of f can be based on the MaxEnt method ([3]).

4.1. Maxent Method

Given the first M moments { μ k } k = 1 M ,
f M ( x ) = exp j = 0 M λ j x j
provides the MaxEnt approximation ([16], p. 59) of f, supplemented by the ( M + 1 ) conditions
R + x k f M ( x ) d x = μ k , k = 0 , , M
and μ 0 = 1 , (14) being a density function with differential entropy (for short, entropy)
H [ f M ] = R + f M ( x ) ln f M ( x ) d x = k = 0 M λ k μ k ,
That is to say that f M has the largest entropy out of all densities sharing the first M moments with f. Note an important fact: the sequence { H [ f M ] } M = 1 is monotonic non-increasing.
Let
C = : f 0 : R + x k f ( x ) d x = μ k , k = 1 , 2 , .
be a convex set of (density) functions that, in the M-indet case, include infinitely many densities, which are solutions of the moment problem. The class C will play a strategic role in the proof of the entropy convergence theorem. Since, as is well known, Shannon entropy possesses many good properties, one of which is to be a concave functional, a unique density—say, f h m a x C —with the largest entropy exists, as shown by the following:
H [ f h m a x ] = max f C H [ f ] .
After a brief revision of the S-det (with H-det) case, previously covered by [1], with the aim of giving a new and more structured interpretation of the determinacy conditions in terms of limit parabolic regions, our attention will be devoted to one of the central topics in this paper: a deep discussion of the S-indet case, the associated MaxEnt solution f M and the proof of convergence in entropy of f M to the underlying unknown density f.

4.2. Limit Parabolic Regions for S-Det or S-Indet

Let us summarize the known facts: an S-det measure ν may be either H-indet or H-det, while an S-indet measure ν is H-indet. Suppose that { μ k } k = 1 M , with M = 2 N , for instance, is a moment set. Thus, we have the following cases.
  Case 1:  
ν is S-det with H-indet (see [15], Corollary, p. 417). Combining Theorems 3 and 4, it follows that
D N D N 1 , 2 c 0 > 0 and D N , 1 D N 1 , 3 0
equivalently, and both μ 0 , < μ 0 * from (6) and μ 1 , = μ 1 * (see Figure 1).
Note that both here and in the following graphs, the shaded area represents the parabolic region and the red dot indicates the fixed value ( μ 0 * , μ 1 * ) of the point ( μ 0 , μ 1 ) . Ref. [17], in a corollary on p. 481, states that if a measure ν is determinate in the Stieltjes sense and indeterminate in the Hamburger sense, then ν is a Nevanlinna-extremal measure and, in particular, is discrete with a positive mass at the origin. As a consequence, the S-det case is disregarded. In contrast, the H-indet case where infinitely many densities occur, will be investigated in Lemma (1).
  Case 2:  
ν is S-indet with H-indet.
There are infinitely many densities with support R + and R . Combining Theorems 3 and 4, both inequalities μ 0 , < μ 0 * and μ 1 , < μ 1 * follow; equivalently, from Theorem 6, the point ( μ 0 * , μ 1 * ) is inside the limit parabolic (see Figure 2).
Entropy convergence in both cases S-indet and H-indet will be investigated below.
  Case 3. 
ν is S-det with H-det.
For a given moment sequence, S-det means that there is only one measure with support [ 0 , ) compatible with it. Here, we take the opportunity to review and refine the proof previously given by [1], Section 3. Combining Theorems 3 and 4 and their geometric meaning, we find that there are four possible limit parabolic regions, all of which are feasible. More specifically, we have the following:
(a)
the two limit parabolic regions degenerate into different rays, both arising from
D N 1 , 2 D N 2 , 4 0 and D N , 1 D N 1 , 3 0
so that μ 1 , = μ 1 * and from
D N D N 1 , 2 0 or D N D N 1 , 2 c 0 > 0 ,
respectively. Hence, μ 0 , = μ 0 * or μ 0 , < μ 0 * occur;
(b)
a non-degenerate limit parabolic region arising from Case 1. Accordingly, we have S-det with H-indet after removing the mass at the origin, so that both
D N D N 1 , 2 0 and D N , 1 D N 1 , 3 0
(see Figure 3). In agreement with the results for S-det attained in Case 1, the measure is once more purely discrete with no mass at the origin and then it has to be disregarded;
(c)
a non-degenerate limit parabolic region arising from the conditions
D N D N 1 , 2 0 and D N , 1 D N 1 , 3 c 1
so that the corresponding distribution has zero mass at the origin. As a consequence μ 0 , = μ 0 * and μ 1 , < μ 1 * hold, so that the point ( μ 0 * , μ 1 * ) is on the boundary of the limit parabolic region (see Figure 4).
The nature (probability mass or density) of the unique solution can now be investigated by adding a mass δ at the origin and renormalizing. The moments sequence amounts to { 1 , μ k / ( 1 + δ ) } k = 1 . In such a case, the moment problem becomes S-indet with H-indet (Figure 2), with maximum mass concentration δ 1 + δ at the origin. As is known, in an indeterminate Hamburger problem the maximum mass concentration at the origin just coincides with δ 1 + δ , so that the initial solution is transformed into an extremal solution of the S-indet with H-indet problem. Since there exists only a discrete extremal solution of the moment problem for which this maximum is attained [8], Thm. 2.13, p. 60; [11], Corollary 7.17, p. 159), the original determinate Stieltjes solution is purely discrete with no mass at the origin, in accordance with with [18] p. 94. Thus, it has to be disregarded.
In conclusion, collecting together the just-above-limit parabolic regions (a), (b) and (c), it can be concluded that a necessary condition for a density to be associated with the unique measure S-det with H-det is that the corresponding limit parabolic region is degenerate, from which μ 1 , = μ 1 * . Under the latter condition, the entropy convergence had been proved by [1], Section 3. Note that thanks to the MaxEnt formalism, the proof provided by the authors was completely analogous to the one used in the determinate Hamburger problem. Thus, the first goal outlined in the Introduction has been reached.
Regarding the proof of entropy convergence in the two cases S-indet and S-det, comparing Case 2 with Case 3, the crucial difference is due to the two distinct conditions μ 1 , < μ 1 * and μ 1 , = μ 1 * that hold in the two cases, respectively. As a consequence, the proving procedure adopted for the S-det case (in which the relationship μ 1 , = μ 1 * plays a basic role) cannot be simply extended to the S-indet one. For the latter case, the entropy convergence is investigated below and represents the second main goal of this paper, as outlined in the Introduction.

4.3. Convergence in Entropy of the Maxent Approximation in the S-Indet Case

In analogy with the S-det case investigated by [1], Section 3, it is reasonable to conjecture that, in the S-indet case, the MaxEnt approximation f M also converges in entropy to the underlying unknown density f h m a x . Indeed, considering two distinct cases concerning the nature of the support of the density f (that is, R + and R ) and relying in both cases on the fact that the sequence { H [ f M ] } is monotonically non-increasing with a finite lower bound H [ f h m a x ] , we can prove Theorems 7 and 8, the second two main results of this paper.
However, before presenting and proving these results, we briefly recall some basic steps of the convergence in entropy proof in the determinate Hamburger problem as given by [1], Thm. 1, and in the same paper also extended to the S-det case. The determinacy condition used for the Hamburger moment problem is given by (9). As we will see later, this procedure is the essential tool for proving entropy convergence in the S-indet case too.
1.
Let ν be the unique measure with support R and density f having the given moment sequence { μ k } k = 0 . Consider the finite set { μ k } k = 1 M , with M = 2 N . If ( μ 0 , μ 2 , , μ M ) are kept fixed while μ 1 ( μ 1 , M , μ 1 * ] varies continuously, the function H [ f M ] ( μ 1 ) is monotonic increasing and concave (to ensure the monotonicity of H [ f M ] ( μ 1 ) , if needed, the interval μ 1 ( μ 1 , M , μ 1 * ] could be replaced with [ μ 1 * , μ 1 , M + ) , leaving the procedure unchanged). As μ 1 μ 1 , M , D N 0 so that f M corresponds to a purely discrete distribution with a finite number of mass points ([8], p. 5, Thm. 1.2). Then, as the differential entropy of a discrete distribution can be considered to be ([4], p 247-9), lim μ 1 μ 1 , M H [ f M ] ( μ 1 ) = holds. On the other hand, as μ 1 = μ 1 * , H [ f M ] ( μ 1 * ) H [ f ] holds. Then, since H [ f M ] ( μ 1 ) is a continuous function of μ 1 , there exists a value of μ 1 ( μ 1 , M , μ 1 * ] (say ξ M = ξ M ( M , H [ f ] ) ) such that H [ f M ] ( ξ M ) = H [ f ] . As M , from lim M μ 1 , M = μ 1 * (and then lim M ξ M = μ 1 * too) it follows that H [ f M ] ( ξ M ) = H [ f M ] ( μ 1 * ) = H [ f ] . From here, lim M H [ f M ] ( μ 1 * ) = H [ f ] , which proves the entropy convergence in the H-det case.
2.
From Theorem 6, in the S-indet case, the point ( μ 0 * , μ 1 * ) is an interior point of the limit parabolic region such that μ 1 , < μ 1 * . In accordance with the above item in this list, H [ f M ] ( μ 1 ) is monotonic increasing over μ 1 ( μ 1 , M , μ 1 * ] with
lim μ 1 μ 1 , M H [ f M ] ( μ 1 ) = and H [ f M ] ( μ 1 * ) H [ f h m a x ] .
As we have μ 1 , < μ 1 * , both scenarios are admissible as follows:
(i)
lim M H [ f M ] > H [ f h m a x ] (strictly);
(ii)
lim M H [ f M ] = H [ f h m a x ]
Following the procedure outlined in the first item of this list, no conclusion about entropy convergence can be drawn, so an alternative approach must be sought.

4.3.1. Densities Supported on R +

Now we are ready to prove the first out the two main results of this paper.
Theorem 7 (Main result I). In the indeterminate Stieltjes problem, the MaxEnt approximate sequence { f M } M = 1 supported on R + converges in entropy to f h m a x ; i.e.,
lim M H [ f M ] = H [ f h m a x ]
Proof. 
We here give the proof of Theorem 7, distinguishing the cases of M finite and M infinite.
(i)
M <
Consider a given moment set { μ k } k = 1 M , the corresponding MaxEnt density f M having moments μ k , f M : = R + x k f M ( x ) d x , k 1 , where
μ k , f M = μ k if k M R + x k f M ( x ) d x if k > M
and the additional new moment set
{ μ ˜ k } k = 1 M + K = { μ k } k = 1 M { μ k , f M } k = M + 1 K
Now, let f ˜ M + K be the MaxEnt density constrained by (18) and let { μ ˜ k , f ˜ M + K } k = 1 be the sequence of its moments in analogy with (17). The following are the reasons for introducing f ˜ M + K :
1.
the densities f M and f ˜ M + K coincide, as f M satisfies the first M + K moments { μ ˜ k } k = 1 M + K too, so that f ˜ M + K has its Lagrange multipliers λ M + 1 = = λ M + K = 0 ;
2.
f ˜ M + K gives rise to a measure ν , which is S-det, since it admits a moment-generating function (which is a sufficient condition to guarantee moment problem determinacy, the so-called Cramér criterion) for each M + K . Indeed, from (14), as x , f ˜ M + K ( x ) exp ( λ M + K x M + K ) , with λ M + K > 0 ;
3.
related to the sequence { μ ˜ k , f ˜ M + K } k = 1 , there are the numbers μ ˜ 1 , M + K , which are the unique solution of the equation D N , 1 ( μ ˜ 1 , M + K ) = 0 , whether M + K = 2 N + 1 or M + K = 2 N , where in the matrix Δ N , 1 , μ 1 * is replaced with μ ˜ 1 , M + K , as outlined in Section 2. Further, μ 1 , M < μ ˜ 1 , M + K for each K holds. As K increases, the sequence { μ ˜ 1 , M + K } K = 1 is monotonic increasing since the moment sequence { μ ˜ k , f ˜ M + K } k = 1 coincides with the one associated with f M . The latter density, as said previously, gives rise to a determinate moment problem so that, as K , μ ˜ 1 , M + K = μ 1 * , in accordance with the conclusions drawn above from Case 3. in Section 4.2.
In conclusion, with finite and fixed M and arbitrary K, MaxEnt density f ˜ M + K constrained by { μ ˜ k } k = 1 M + K fulfils the two following properties:
H [ f ˜ M + K ] = H [ f M ] with K finite μ ˜ 1 , M + K = μ 1 * with K
That is, the distribution corresponding to { μ ˜ k } k = 1 M + K has the same entropy as the distribution having density f M so that the supplementary K moments do not vary the entropy but, at the same time, they push μ ˜ 1 , M + K towards μ 1 * as K increases, forcing the moment problem to be determinate. This is a crucial condition for the convergence in entropy.
(ii)
M
From (19) with arbitrary K, the two sequences { H [ f ˜ M + K ] } M = 1 and { H [ f M ] } M = 1 fulfill the relationship
{ H [ f ˜ M + K ] } M = 1 { H [ f M ] } M = 1 inf M H [ f M ]
since the moment sequence { μ k , f M } k = M + 1 does not introduce additional information content. In conclusion, taking into account (19) and using the moment sequence { μ ˜ k } k = 1 , which directly stems from { μ k } k = 1 , we are led to the S-det case. Indeed, as M , f M and the unique f (in the S-det case) and f M and the unique f h m a x (in the S-indet case) share the same amount of information provided by the two different moment sequences { μ k } k = 1 and { μ ˜ k } k = 1 , respectively. From this it follows that μ 1 , = μ 1 * and μ ˜ 1 , = μ 1 * . Accordingly, from here on, the procedure is carried out in a manner more or less similar to that described by [1], Thm. 1, with the only difference being that f, { μ k } k = 1 M , f M , μ 1 , M and M are here replaced with f h m a x , { μ ˜ k } k = 1 M + K , f ˜ M + K , μ ˜ 1 , M + K and M , K , respectively. Combining the previous arguments, we can state the following
lim M , K H [ f ˜ M + K ] = lim M H [ f M ] = H [ f h m a x ]
The introduction of the sequence (17), and therefore of { f ˜ M + K } K = 1 in place of { f M } M = 1 , has the twofold aim of (i) not altering the entropy of f M and (ii) permitting the relationship lim M μ ˜ 1 , M + K = μ 1 * , forcing the moment problem to be determinate and bringing us back to the previously solved case S-det. Foe the previously formulated admissible scenarios lim M H [ f M ] > H [ f h m a x ] (strictly) or lim M H [ f M ] = H [ f h m a x ] , a precise answer is found.
Hence, Theorem 7 is proved. □

4.3.2. Densities Supported on R

Thanks to MaxEnt formalism, the above quantities f M and C given by (14) and (15), respectively, and previously defined on the positive support R + can likewise be reformulated for distributions having the entire real line R as support: in this case, M must be even to guarantee the existence of the MaxEnt approximation f M . We need the following preliminary result.
Lemma 1.
In the S-det with H-indet moment problem, the MaxEnt approximate sequence { f M } M = 2 with M = 2 N , N N , supported on R , converges in entropy to f h m a x ; i.e.,
lim M H [ f M ] = H [ f h m a x ]
Proof. 
Combining Theorems 3 and 4, the relation
lim N D N , 1 D N 1 , 3 = 0 or equivalently , μ 1 , = μ 1 *
follows (here, M = 2 N ). Consequently, thanks to the MaxEnt formalism, the proof for the H-indet case is similar to that mentioned above for the H-det case ([1], Thm. 1), being in both cases μ 1 , = μ 1 * . Again, the proof mimics that of the H-det case, simply replacing both f with f h m a x and H [ f ] with H [ f h m a x ] . □
Considering Lemma 1, we are now ready to prove the second of the two main results of this paper.
Theorem 8 (Main result II). In the S-indet with H-indet moment problem, the MaxEnt approximate sequence { f M } M = 2 supported on R converges in entropy to f h m a x ; i.e.,
lim M H [ f M ] = H [ f h m a x ]
Proof. 
From Theorem 6, the point ( μ 0 * , μ 1 * ) is an interior point of the limit parabolic region and then μ 1 , < μ 1 * . Here, there is no way to prove the convergence in entropy of the MaxEnt approximation f M in the S-indet with H-indet case just by applying the same technique as in the three previous cases (H-det, S-det with H-det and S-det with H-indet), and an alternative that allows us to reconnect to the previous case S-det with H-indet must be thought of. One possibility to construct a measure for S-det with H-indet, starting from the indeterminate Stieltjes moment sequence { μ k } k = 1 , is outlined by [19], p. 96, Thm. 3.3 and Appendix C3, and [11], Example 8.11, p. 183, and Exercise 6, p. 143, and we borrow it. Let { μ k } k = 1 be the original Stieltjes moment sequence and let { μ ˜ k ( c ) } k = 1 denote the sequence shifted by c. From the binomial theorem, we have
μ ˜ k ( c ) = j = 0 k k j c j μ k j , k 0
For the Hamburger problem, { μ ˜ k ( c ) } k = 1 has the following properties (translation of Hamburger moment sequence):
(a)
the solutions associated with { μ k } k = 1 and those of each { μ ˜ k ( c ) } k = 1 are in one–one correspondence;
(b)
if the density f ( x ) solves the { μ k } k = 1 problem, f ( x c ) solves the { μ ˜ k ( c ) } k = 1 one and vice versa. Specifically, to the densities f M ( x ) and f h m a x ( x ) correspond the densities f M ( x c ) and f h m a x ( x c ) , with the obvious equalities H [ f M ( x ) ] = H [ f M ( x c ) ] and H [ f h m a x ( x ) ] = H [ f h m a x ( x c ) ] ;
(c)
for any indeterminate Stieltjes problem, there is always a unique c 0 0 , so that the sequence { μ ˜ k ( c ) } k = 1 can be the following:
(i)
an indeterminate Stieltjes moment sequence for each c > c 0 ;
(ii)
a determinate Stieltjes moment sequence (but, of course, with the H-indet problem) for c = c 0 , giving rise to the so-called Friedrichs’s solution;
(iii)
not a Stieltjes moment sequence for c < c 0 .
As a consequence, from (21) with c = c 0 , we fall back into the S-det with H-indet case, investigated above in Lemma (1). Exploiting the results of this latter case, lim M H [ f M ( x c 0 ) ] = H [ f h m a x ( x c 0 ) ] occurs. Joining together all the above found equalities,
lim M H [ f M ] = lim M H [ f M ( x c 0 ) ] = H [ f h m a x ( x c 0 ) ] = H [ f h m a x ]
holds, and Theorem (8) is also proved. □

5. Conclusions

In every indeterminate Stieltjes moment problem, it follows from Theorems (7) and (8) that the MaxEnt approximation f M with support R + or R converges in entropy to f h m a x . Combining the present results with those previously obtained by two of the authors, it can be concluded that convergence in entropy has been proved in both the Stieltjes and the Hamburger problems. As far as we know, when reconstructing densities with maxentropic techniques, it is assumed that the underlying moment problem is determinate; then, the MaxEnt density f M becomes a reasonable approximation of the unique underlying density f, with moments that match the given elements of the sequence { μ k } k = 0 . The problem of determinacy is understandable, although the question of determinacy/indeterminacy is not really of much practical importance: a complete set of moments will never be attainable, so for practical purposes, only finite values of M are of interest. For theoretical purposes, however, Theorem 7 provides a justification in the case where the underlying moment problem is indeterminate, since it ensures that the MaxEnt approximations will converge in any case to the one with the largest entropy f hmax . In summary, in a process of probability density reconstruction using maxentropic techniques, the determinacy/indeterminacy question for the moment problem does not arise. Nevertheless, f M must be considered a valuable approximation of f hmax since both f M and f hmax have the same first M moments and, for finite M, H [ f M ] > H [ f hmax ] , according to the MaxEnt principle.
Recalling the known necessary and sufficient conditions for the determinacy of the moment problem, as well as their geometric meaning described by limit parabolic regions, helped us to provide the following:
  • an improvement of an earlier proof of convergence in entropy provided by the authors when the Stieltjes moment problem is determinate;
  • the statement and proof of an analogous entropy convergence result in the case of an indeterminate Stieltjes moment problem.
In terms of future research directions, it is easy to see that the entropy convergence results in the case of the indeterminate Stieltjes problem can be exploited to provide the following:
  • criteria for the determinacy/indeterminacy of the Stieltjes moment problem with entropic techniques;
  • an analogous entropy convergence criterion concerning the case of the indeterminate Hamburger problem;
  • a criterion for the determinacy/indeterminacy of the Hamburger moment problem involving entropic techniques.

Author Contributions

Conceptualization, P.L.N.I. and A.T.; methodology, P.L.N.I. and A.T.; writing—original draft preparation, A.T.; writing—review and editing, P.L.N.I. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Milev, M.; Tagliani, A. Convergence of finite moment approximations in Hamburger and Stieltjes problems. Stat. Prob. Lett. 2017, 120, 114–117. [Google Scholar] [CrossRef]
  2. Frontini, M.; Tagliani, A. Entropy-convergence in Stieltjes and Hamburger moment problem. Appl. Math. Comput. 1997, 88, 39–51. [Google Scholar] [CrossRef]
  3. Jaynes, E.T. Information theory and statistical mechanics. Phys. Rev. 1957, 106, 620. [Google Scholar] [CrossRef]
  4. Cover, T.M.; Thomas, J.A. Elements of Information Theory; Wiley-Interscience: Hoboken, NJ, USA, 2006. [Google Scholar]
  5. Novi Inverardi, P.L.; Tagliani, A. Probability Distributions Approximation via Fractional Moments and Maximum Entropy: Theoretical and Computational Aspects. Axioms 2023, 13, 28. [Google Scholar] [CrossRef]
  6. Stoyanov, J. Stieltjes Classes for Moment-Indeterminate Probability Distributions. J. Appl. Prob. 2004, 41, 281–294. [Google Scholar] [CrossRef]
  7. López-García, M. Krein condition and the Hilbert transform. Electron. Commun. Probab. 2020, 25, 71. [Google Scholar]
  8. Shohat, J.A.; Tamarkin, J.D. The Problem of Moments, Reprint of the Original 1943 Edition; Mathematical Surveys; American Mathematical Society: Providence, RI, USA, 1970; Volume 1. [Google Scholar]
  9. Fréchet, M.; Shohat, J.A. A Proof of the Generalized Second-Limit Theorem in the Theory of Probability. Trans. Am. Math. Soc. 1931, 33, 533–543. [Google Scholar] [CrossRef]
  10. Akhiezer, N.I. The Classical Moment Problem and Some Related Questions in Analysis; Oliver and Boyd: Edinburgh, UK, 1965. [Google Scholar]
  11. Schmüdgen, K. The Moment Problem; Graduate Texts in Mathematics; Springer: Berlin/Heidelberg, Germany, 2017; Volume 277. [Google Scholar]
  12. Olteanu, O. Symmetry and asymmetry in moment, functional equations and optimization problems. Symmetry 2023, 15, 1471. [Google Scholar] [CrossRef]
  13. Stoyanov, J. Krein condition in probabilistic moment problems. Bernoulli 2000, 6, 939–949. [Google Scholar] [CrossRef]
  14. Berg, C.; Christensen, J.P.R. Density questions in the classical theory of moments. Ann. l’Institut Fourier 1981, 31, 99–114. [Google Scholar] [CrossRef]
  15. Merkes, E.P.; Wetzel, M. A geometric characterization of indeterminate moment sequences. Pacific J. Math. 1976, 65, 409–419. [Google Scholar] [CrossRef]
  16. Kesavan, H.K.; Kapur, J.N. Entropy Optimization Principles with Applications; Academic Press: London, UK, 1992. [Google Scholar]
  17. Chihara, T.S. On Indeterminate Hamburger Moment Problems. Pacific J. Math. 1968, 27, 475–484. [Google Scholar] [CrossRef]
  18. Heyde, C.C. Some remarks on the moment problem (I). Quart. J. Math. 1963, 14, 91–96. [Google Scholar] [CrossRef]
  19. Simon, B. The Classical Moment Problem as a Self-Adjoint Finite Difference Operator. Adv. Math. 1998, 137, 82–203. [Google Scholar] [CrossRef]
Figure 1. Limit parabolic region for the case (S-det, H-indet).
Figure 1. Limit parabolic region for the case (S-det, H-indet).
Symmetry 16 00313 g001
Figure 2. Limit parabolic region for the case (S-indet, H-indet).
Figure 2. Limit parabolic region for the case (S-indet, H-indet).
Symmetry 16 00313 g002
Figure 3. Limit parabolic region for the case (S-det, H-det) - b).
Figure 3. Limit parabolic region for the case (S-det, H-det) - b).
Symmetry 16 00313 g003
Figure 4. Limit parabolic region for the case (S-det, H-det) - c).
Figure 4. Limit parabolic region for the case (S-det, H-det) - c).
Symmetry 16 00313 g004
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Novi Inverardi, P.L.; Tagliani, A. Indeterminate Stieltjes Moment Problem: Entropy Convergence. Symmetry 2024, 16, 313. https://doi.org/10.3390/sym16030313

AMA Style

Novi Inverardi PL, Tagliani A. Indeterminate Stieltjes Moment Problem: Entropy Convergence. Symmetry. 2024; 16(3):313. https://doi.org/10.3390/sym16030313

Chicago/Turabian Style

Novi Inverardi, Pier Luigi, and Aldo Tagliani. 2024. "Indeterminate Stieltjes Moment Problem: Entropy Convergence" Symmetry 16, no. 3: 313. https://doi.org/10.3390/sym16030313

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop