Maximum Entropy Criterion for Moment Indeterminacy of Probability Densities

Stoyanov, Jordan M.; Tagliani, Aldo; Novi Inverardi, Pier Luigi

doi:10.3390/e26020121

Open AccessFeature PaperArticle

Maximum Entropy Criterion for Moment Indeterminacy of Probability Densities

by

Jordan M. Stoyanov

^1,2,*,

Aldo Tagliani

³ and

Pier Luigi Novi Inverardi

³

¹

Institute of Mathematics & Informatics, Bulgarian Academy of Sciences, 1113 Sofia, Bulgaria

²

Faculty of Mathematical Sciences, Shandong University, Jinan 250100, China

³

Department of Economics & Management, University of Trento, 38100 Trento, Italy

^*

Author to whom correspondence should be addressed.

Entropy 2024, 26(2), 121; https://doi.org/10.3390/e26020121

Submission received: 2 January 2024 / Revised: 29 January 2024 / Accepted: 29 January 2024 / Published: 30 January 2024

(This article belongs to the Special Issue Applied Probability, Information Theory and Applications)

Download Versions Notes

Abstract

We deal with absolutely continuous probability distributions with finite all-positive integer-order moments. It is well known that any such distribution is either uniquely determined by its moments (M-determinate), or it is non-unique (M-indeterminate). In this paper, we follow the maximum entropy approach and establish a new criterion for the M-indeterminacy of distributions on the positive half-line (Stieltjes case). Useful corollaries are derived for M-indeterminate distributions on the whole real line (Hamburger case). We show how the maximum entropy is related to the symmetry property and the M-indeterminacy.

Keywords:

probability density; moments; Stieltjes and Hamburger moment problems; Hankel matrices; determinacy; indeterminacy; maximum entropy; MaxEnt criterion for M-indeterminacy

MSC:

44A60; 60E05; 62E10; 94A17

1. Introduction

When studying probability distributions, one of the challenging questions we arrive at comes from the classical moment problem. The question is whether or not a probability distribution is uniquely determined by the sequence of all moments, assuming they are finite. The answer can be given for the distribution itself; equivalently, for the associated random variable X; its distribution function F; the density

f = F^{'}

; or the bounded positive measure

μ = μ_{F}

induced by F. Thus, if the answer is positive, we call the distribution (also

X, F, f, μ

) M-determinate; otherwise, we call it M-indeterminate. (Here, ‘M’ stands for ‘Moment’.)

It is well known that if

μ

is M-indeterminate, then there are infinitely many absolutely continuous distributions, infinitely many purely discrete distributions, and infinitely many singular distributions, all having the same moments as

μ

.

It is important, from both the theoretical and applied points of view, to have criteria at hand allowing to specify/identify the determinacy or indeterminacy property of a distribution. The best is to work with conditions which are in the group ‘checkable conditions’; comments and references are given at the end of our paper. There is another group of ‘non-checkable conditions’. Here are the well-known classical necessary and sufficient conditions for the (in)determinacy of

μ

in terms of the limits of the smallest eigenvalues of sequences of Hankel matrices. Our recent review paper [1] describes the whole spectrum, called ‘a bunch’, of the fundamental results, old and recent. The reader will find in [1] details about the great contributions of T. Stieltjes, H. Hamburger, N. Akhiezer, M. Krein, C. Berg, K. Schmüdgen, M. Putinar, B. Simon, and others. Their works are widely known.

Developments over the last few decades have shown the efficiency of involving the Principle of Maximum Entropy, see, for example, [2,3,4]. We also use the terms ‘maximum entropy approach’ and ‘maximum entropy method’. For ‘maximum entropy’, we write the traditional ‘MaxEnt’.

The idea of the MaxEnt method consists in selecting a distribution which possesses maximum uncertainty, and at the same time, fulfills the restrictions imposed by the known information.

In general, it is more delicate to deal with M-indeterminate distributions, since we need, for example, to know how to find, describe and work with an infinite family of distributions all having the same moments. In any case, MaxEnt may help to shed light on this ‘dark tunnel’.

In this paper, we follow the generally accepted terminology and notations as used in probability theory. We write

X \sim F

for a random variable X whose distribution function is F, with

f = F^{'}

being the density, and specify the range

U

of values of X, the support of F, which is assumed to be unbounded. Only in this case can the ‘interesting’ property of M-indeterminacy appear. We work with the moment sequence

{m_{k}}_{k = 0}^{\infty}

, and if

U = R = (- \infty, \infty)

, this is a Hamburger case, while with

U = R_{+} = [0, \infty),

it is a Stieltjes case.

For X being an absolutely continuous random variable with strictly positive density f, we are looking for conditions, or criteria, guaranteeing the M-determinacy or M-indeterminacy of

μ

. We use the entropy (called also ‘differential entropy’), which is denoted by

h_{f}

and defined as follows:

h_{f} : = E [- \ln f (X)] = - \int_{U} (\ln f (x)) f (x) d x .

The idea is very natural: We start with the n-truncated moment set

{m_{k}}_{k = 0}^{n},

and based on it, we find the MaxEnt approximant

f_{n}

of f and study the limit of the entropy

h_{f_{n}}

of

f_{n}

as

n \to \infty .

There is a remarkable fact, namely, that there are only two possibilities for the ‘value’ of the limit

{lim}_{n \to \infty} h_{f_{n}}

; either it is a finite number, or it is ‘equal’ to

- \infty

. Depending on this limit, we decide that f is M-determinate or M-indeterminate.

It is relevant to mention one of the results proved in ([5], Theorem 1): if an absolutely continuous distribution F with density f is M-determinate, then the sequence of MaxEnt approximants is converging in entropy to f. One of our goals in this paper is to involve additional arguments allowing to show that such a result on entropy convergence can be extended to the case of M-indeterminate distributions.

The remainder of the paper is organized as follows. In Section 2, we recall briefly what we need about Hankel matrices and introduce the MaxEnt setup. In Section 3, we calculate the entropy of densities with the given n-truncated moment set, for fixed n, and also for the entire moment sequence

{m_{k}}_{k = 0}^{\infty} .

In Section 4, we provide an M-indeterminacy MaxEnt criterion in the Stieltjes case. In Section 5, we present corollaries related to the M-indeterminacy in the Hamburger case. Discussed is the question: Among a family of infinitely many densities all with the same moments, which density has the largest entropy?

2. Basics of Hankel Matrices and the MaxEnt Setup

When we tell that

{m_{k}}_{k = 0}^{\infty}

with

m_{0} = 1

is a moment sequence, it always means that there is a probability measure

μ = μ_{F}

which is ‘behind’. Thus, think of a random variable X defined in an underlying probability space

(Ω, F, P)

, taking values in a set

U \subset R

. If F is its distribution function,

F (x) : = P [X \leq x], x \in R

, then

μ = μ_{F}

is a positive Borel measure induced by F. We write just

μ .

A basic assumption is that

E [| X |^{k}] < \infty

for all

k = 1, 2, \dots .

Thus, well defined are the moments

m_{k} = E [X^{k}] = \int_{Ω} X^{k} (ω) d P = \int_{U} x^{k} d F (x) = \int_{U} x^{k} f (x) d x = \int_{U} x^{k} μ (d x), k = 1, 2, \dots,

and also the moment sequence

{m_{k}}_{k = 0}^{\infty} .

If

U = R

, we say that

{m_{k}}_{k = 0}^{\infty}

is a Hamburger moment sequence, while for

U = R_{+}

,

{m_{k}}_{k = 0}^{\infty}

is a Stieltjes moment sequence.

For any moment sequence

{m_{k}}_{k = 0}^{\infty}

, we define a few infinite sequences of Hankel matrices, namely,

{H_{n}}_{n = 1}^{\infty}

and

{H_{n, p}}_{n = 1}^{\infty}

, and their determinants, as follows:

H_{n, p} = {(m_{i + j + p})}_{i, j = 0}^{n}, D_{n, p} : = \det (H_{n, p}), p = 0 or 1 .

If

p = 0

,

H_{n} : = H_{n, 0}

is the ‘basic’ Hankel matrix,

H_{n, p},

for

p = 1

is the ‘shifted’ Hankel matrix:

H_{n, 1} = {(m_{i + j + 1})}_{i, j = 0}^{n}

is based on the ‘shifted’ moment sequence

{m_{1}, m_{2}, \dots}

generated by the measure

μ_{1}

with

d μ_{1} = x d μ

.

In what follows, we involve and use the entropy of the strictly positive density f under the constraint of knowing only the n-truncated moment set

{m_{k}}_{k = 0}^{n} .

We will see that the MaxEnt formalism allows to study in parallel both the Hamburger and the Stieltjes cases; hence, we assume that the distributions and their densities have support

U = R

or

U = R_{+}

.

Let us consider the Stieltjes case. For a density f with n-truncated moment set

{m_{k}}_{k = 0}^{n},

there is a density, say

f_{n},

satisfying two properties:

(a): The ‘first’ n moments of $f_{n}$ are exactly ${m_{k}}_{k = 0}^{n};$
(b): $f_{n}$ maximizes the Shannon entropy.

It is well known, see [2], that

f_{n} (x) = exp (- \sum_{j = 0}^{n} λ_{j} x^{j}), x \in R_{+},

where

(λ_{0}, . . ., λ_{n})

are the Lagrange multipliers satisfying the constraints

\int_{R_{+}} x^{j} f_{n} (x) d x = m_{j}, j = 0, . . ., n .

In this case, we use the simple notation

h_{f_{n}}

for the entropy of

f_{n},

and remember that

h_{f_{n}}

depends on the moments

{m_{k}}_{k = 0}^{n}

. It is easy to see that

h_{f_{n}} = - \int_{R_{+}} (\ln f_{n} (x)) f_{n} (x) d x = \sum_{j = 0}^{n} λ_{j} m_{j} .

We would like the sequence of approximants

{f_{n}}

and the entropy sequence

{h_{f_{n}}}

to be well defined for any

n = 1, 2, \dots .

It may happen, see ([6], Theorem 1), that for given f and

{m_{k}}_{k = 0}^{n}

, the desired density

f_{n}

does not exist, in which case the quantity

h_{f_{n}}

is meaningless. However, in the cited paper, the following useful relation is established (the class

D_{n}

is defined at the end of this section):

{sup}_{f \in D_{n}} h_{f} = h_{f_{n - 1}}

, even if the MaxEnt approach does not apply. Since the entropy is monotone and non-increasing as n increases, the latter equality enables us to simply set

h_{f_{n}} = h_{f_{n - 1}}

, thus filling the ‘gap’ left by the non-existing densities

f_{n}

. This justifies the assumption made in the sequel, without loss of generality, that all entries of the monotone non-increasing entropy sequence

{h_{f_{n}}}_{n = 1}^{\infty}

are well defined.

In the non-symmetric Hamburger case, once the n-truncated moment set

{m_{k}}_{k = 0}^{n}

for even n is assigned, the positivity of the Hankel determinant

D_{n, 0} = D_{n, 0} (m_{0}, \dots, m_{n})

guarantees the existence of a MaxEnt solution, see ([6], Appendix A). As a consequence, the entire entropy sequence

{h_{f_{n}}}_{n = 1}^{\infty}

is defined. In the symmetric Hamburger problem, the MaxEnt density existence is guaranteed under conditions similar to those in the Stieltjes case.

Now suppose that

{m_{k}}_{k = 0}^{n + 1}

is a moment set for which we ‘keep fixed’ (unchanged) the moments

{m_{0}, m_{1}, \dots, m_{n}},

while we treat as ‘varying continuously’ the moment

m_{n + 1}

. If letting

b : = m_{n + 1}

, then the

(n + 1)

-truncated moment set can be written as

{m_{0}, m_{1}, \dots, m_{n}, b}

. Moreover, the existence conditions for a solution of the moment problem require the Hankel determinants to be positive. This is guaranteed by imposing b to have a lower bound, say

b_{p, n + 1}^{-}

, which is the unique real number satisfying the equation

D_{n, p} (m_{p}, \dots, m_{n}, b_{p, n + 1}^{-}) = 0, for p = 0 or 1 .

As well, due to the MaxEnt machinery, see ([6], Appendix A), in the Stieltjes case, the following value of b has to be considered:

b_{n + 1}^{+} = \int_{R_{+}} x^{n + 1} f_{n} (x) d x .

Notice that in general,

b_{n + 1}^{+} \neq m_{n + 1}

. Recall that we deal with the truncated

(n + 1)

-moment set in which the parameter b stands for the

(n + 1)

th moment:

{m_{0}, m_{1}, \dots, m_{n}, b}, where b_{p, n + 1}^{-} \leq b \leq b_{n + 1}^{+} for p = 0 o r 1 .

In the Stieltjes case, we introduce the following classes of densities:

D_{n} : = \{f > 0 : \int_{R_{+}} x^{j} f (x) d x = m_{j}, j = 1, \dots, n\},

D_{\infty} : = \{f > 0 : \int_{R_{+}} x^{j} f (x) d x = m_{j}, j = 1, 2, \dots\} .

Similar notions can be introduced also in the Hamburger case, just replacing

R_{+}

with

R

.

The class

D_{n}

is a convex set for each n, and then

D_{\infty} = ⋂_{1}^{\infty} D_{n}

is also convex. We know that in the M-indeterminate case,

D_{\infty}

contains ‘infinitely many’ densities, all being solutions of the same moment problem.

For both the Hamburger and Stieltjes cases, we need to recall a few known facts which will be essentially used later.

Fact 1.

We are going to work the moment sequence

{m_{k}}_{k = 0}^{\infty}

whose underlying density f has entropy

h_{f}

such that either

h_{f}

is finite, or

h_{f} = - \infty

. To be precise, distributions with

h_{f} = + \infty

are not allowed. The reason for this is that once

{m_{k}}_{k = 0}^{\infty}

is assigned, the ‘option’

h_{f} = + \infty

is not feasible since it is well known in the MaxEnt setup that

h_{f} \leq h_{f_{2}}

, where

h_{f_{2}} = \frac{1}{2} \ln [2 π e (m_{2} - {(m_{1})}^{2})]

is finite because of Lyapunov’s inequality

m_{2} - {(m_{1})}^{2} \geq 0

(Hamburger case) and

h_{f} \leq h_{f_{1}} = 1 + \ln m_{1}

is finite for every

m_{1} > 0

(Stieltjes case).

Fact 2.

Once the moment set

{m_{k}}_{k = 0}^{n}

is given and

f_{n}

is the corresponding MaxEnt density, the entropy sequence

{h_{f_{n}}}_{n = 1}^{\infty}

is monotone non-increasing, and its limit is either finite or

- \infty .

Fact 3.

For consistency between the differential entropy of a continuous random variable and the entropy of its discretization, the differential entropy of any discrete measure which can be compared with Dirac’s deltas set is assumed to be

- \infty,

see ([3], pp. 247–249).

Fact 4.

If the density f is bounded, this is sufficient to eliminate the option

h_{f} = - \infty

. Indeed, suppose that

0 < f (x) \leq L < \infty

for all x. It follows that

- h_{f} = \int_{U} (\ln f (x)) f (x) d x < \int_{U} (\ln L) f (x) d x = \ln L \Rightarrow h_{f} \geq - \ln L > - \infty .

3. Entropy of Densities Which Are M-Indeterminate

The MaxEnt formalism allows to treat both Hamburger and Stieltjes cases in a similar way. For the sake of brevity, we confine ourselves mainly to discussions on the Stieltjes case. All arguments can then be easily extended to the Hamburger case. This possibility is one of the advantages of involving the MaxEnt machinery.

3.1. Entropy of Densities from the Class $D_{n}$

We start with the formulation and the proof of the following result.

Theorem 1.

Suppose that

{m_{k}}_{k = 0}^{\infty}, m_{0} = 1,

is the full moment sequence of a given density

f .

For fixed n, based on the n-truncated moment set

{m_{k}}_{k = 0}^{n},

we consider

f_{n},

the MaxEnt approximant of f, and let

h_{f_{n}}

be the entropy of

f_{n}

. Then, there are infinitely many densities

g \in D_{n}

whose entropy

h_{g}

is spanning an interval, namely

h_{g} \in (- \infty, h_{f_{n}}] .

(1)

Proof.

We provide arguments in both cases, Stieltjes and Hamburger.

(a) Stieltjes case. Preliminarily, for fixed

n,

let us consider

f_{n}

and the upper bound

b_{n + 1}^{+}

of its

(n + 1)

th order moment. It was mentioned that, in general,

b_{n + 1}^{+}

is different from

m_{n + 1}

. Our goal is to specify the range of values of the entropy

h_{g}

, where g is an arbitrary density from the class

D_{n}

. For this, we introduce the following suitable subclass

E_{n} \subset D_{n} :

E_{n} = \{f_{n + 1} = (f_{n + 1} | fixed m_{1}, \dots, m_{n}, b)}, where b \in (b_{p, n + 1}^{-}, b_{n + 1}^{+}]\}, p = 0 or 1 .

Notice that we rely here on the specific truncated moment set

(m_{0}, m_{1}, . . ., m_{n}, b_{p, n + 1}^{-}) \in \partial (m (D_{n + 1}))

, the boundary of the moment space. Equivalently, the elements of

E_{n}

are MaxEnt densities which are constrained by

(m_{1}, \dots, m_{n}, b)

; they belong to

D_{n}

and, primarily, they all have analytically tractable entropy. The latter property enables us to calculate the entropy of all

g \in D_{n}

by evaluating the entropy of all

g \in E_{n} .

Let us consider

f_{n + 1}

for b varying in the interval

(b_{p, n + 1}^{-}, b_{n + 1}^{+}]

and calculate the values spanned by the entropy

h_{f_{n + 1}}

.

Subcase a1. If

b = b_{n + 1}^{+}

, the right-end point, it is easy to verify that

f_{n + 1}

has a Lagrange multiplier

λ_{n + 1} = 0

and hence

f_{n + 1}

coincides with

f_{n}

; hence,

h_{f_{n + 1}} = h_{f_{n}} .

(2)

Subcase a2. If b is ‘close’ to the left-end point, i.e.,

b \to b_{p, n + 1}^{-}

, we look at the Hankel determinants

D_{n, 0}

and

D_{n, 1}

and see that either

D_{n, 0} \to 0

or

D_{n, 1} \to 0

. This implies that the underlying measure

μ

is discrete; see, for example, ([7], Theorem 1.3, p. 6). Therefore, the entropy quantity

h_{f_{n + 1}}

is approaching

- \infty

:

lim_{b \to b_{p, n + 1}^{-}} h_{f_{n + 1}} = - \infty .

(3)

It remains to mention an essential property of the entropy

h_{f_{n + 1}}

as a function of the variable b. Since

\frac{d h_{f_{n + 1}}}{d b} = λ_{n + 1} > 0,

see ([2], Equation (2.73), p. 47), or the arguments below, we have that

h_{f_{n + 1}}

is monotone increasing with respect to

b \in (b_{p, n + 1}^{-}, b_{n + 1}^{+}]

.

(b) Hamburger case. The arguments here are similar to those above. We need to replace

E_{n}

by the following one with analogous meaning of all notations:

{\tilde{E}}_{n} = {f_{n + 2} = (f_{n + 2} | fixed m_{1}, \dots, m_{n}, b_{n + 1}^{+}, b)}, where b \in (b_{0, n + 2}^{-}, b_{n + 2}^{+}] .

Here,

b_{0, n + 2}^{-}

and

b_{n + 2}^{+}

are such that

D_{n, 0} (m_{0}, \dots, m_{n + 1}, b_{0, n + 2}^{-}) = 0, b_{n + q}^{+} = \int_{R} x^{n + q} f_{n} (x) d x, q = 1, 2 .

In such a case, it is easy to see that

f_{n + 2} = (f_{n + 2} | fixed m_{1}, \dots, m_{n}, b_{n + 1}^{+}, b_{n + 2}^{+}) \equiv f_{n},

and hence, just as above, we conclude that

f_{n + 2}

satisfies the entropy relation

h_{f_{n + 2}} = h_{f_{n}}

.

Joining together (2) and (3) (with obvious extension to the Hamburger case) with

f_{n + 1}

or

f_{n + 2}

and referring to the monotone increasing of the entropy with respect to b, we conclude that indeed there are infinitely many densities

f_{n + 1}

and

f_{n + 2} \in E_{n}

whose entropy spans the interval in (1), with this property holding for all

g \in D_{n}

. Theorem 1 is proved. □

3.2. Entropy of Densities from the Class $D_{\infty}$

Among the well-known properties of Shannon’s entropy, we use its concavity as a functional, which implies that the entropy of all densities

g \in D_{\infty}

can be calculated.

We start with the Stieltjes moment sequence

{m_{k}}_{k = 0}^{\infty}

and calculate the entropy sequence

{h_{f_{n}}}_{n = 1}^{\infty}

, which is monotone, non-increasing and convergent. Similarly, for a Hamburger moment sequence

{m_{k}}_{k = 0}^{\infty}

, we calculate the entropy sequence

{h_{f_{n}}}_{n = 2}^{\infty}

, being also monotone, non-increasing and convergent.

Let us show first that there exists only one density, say,

f^{*} \in D_{\infty}

such that

f^{*}

has the largest entropy, i.e.,

h_{f^{*}} = max_{g \in D_{\infty}} h_{g} .

Indeed, the set of solutions to the S-indeterminate moment problem includes infinitely many densities, previously grouped in the convex set

D_{\infty}

. On the other hand, the continuous entropy functional

h : g \mapsto h_{g} = - \int_{U} (\ln g (x)) g (x) d x

is strictly concave and, over the convex set

D_{\infty}

,

h_{g}

attains its maximum value. Hence, we have that the optimization problem to maximize

h_{g}

over

g \in D_{\infty}

has indeed a unique solution

f^{*}

such that

h_{f^{*}} : = {max}_{g \in D_{\infty}} h_{g} .

Relying on Theorem 1, we are ready to calculate the entropy of all moment equivalent densities

g \in D_{\infty}

. We keep in mind, all densities in the class

D_{\infty}

have support

R_{+}

in the Stieltjes case and

R

in the Hamburger case.

Theorem 2.

Suppose that

{m_{k}}_{k = 0}^{\infty}

is the full Stieltjes moment sequence of a density f and it is known that f is M-indeterminate. We use

f_{n}

and

h_{f_{n}}

as before. Then, there are infinitely many densities

g \in D_{\infty}

whose entropy

h_{g}

is spanning an interval, namely,

h_{g} \in (- \infty, h_{*}], w h e r e h_{*} : = inf_{n} h_{f_{n}} .

Proof.

Note first that each

g \in D_{\infty}

satisfies

g \in ⋂_{n = 1}^{\infty} D_{n}

; hence, according to Theorem 1, g has entropy

h_{g} \in ⋂_{n = 1}^{\infty} (- \infty, h_{f_{n}}] = (- \infty, h_{*}] .

This implies that

h_{*} \geq h_{f^{*}}

(4)

which completes the proof. □

We use below, for example, S-determinate or H-determinate, meaning that a density is on

R_{+}

or on

R

, and it is M-determinate or H-determinate. This similarly applies for S-indeterminacy and H-indeterminacy.

In general, it is not easy to establish the S-determinacy, and hence S-indeterminacy, through known criteria based on necessary and sufficient conditions. The existence of the density

f^{*}

with the largest entropy, see Theorem 1, indicates that there is some similarity between the M-determinate and M-indeterminate cases. Consequently, since initially a finite set of moments is involved, the technique of density reconstruction through the MaxEnt approach can be applied without distinguishing these two cases.

With

f, {m_{1}, \dots, m_{n}}, f_{n}, h_{f_{n}}

, all as above, we give here some details.

First, if f is S-determinate and H-determinate, the sequence of approximants

{f_{n}}_{n = 1}^{\infty}

converges in entropy to a unique underlying density f, see ([5], Section 3), that is,

{h_{f_{n}}}_{n = 1}^{\infty} \to h_{f}

as

n \to \infty

with

{inf}_{n} h_{f_{n}}

all being finite. However, from the used procedure, relying on the geometrical meaning of Theorem 2.19, p. 72 in [7], it is immediate to deduce that the statement of convergence in entropy is equally extended to the case where

h_{f} = - \infty

, from which

{inf}_{n} h_{f_{n}} = - \infty .

Second, if f is S-indeterminate, the entropy sequence

{h_{f_{n}}}_{n = 1}^{\infty}

is monotone non-increasing and hence convergent with lower bound

h_{*}

, i.e.

h_{f_{n}} \to inf_{n} h_{f_{n}} \geq h_{f^{*}} .

It is useful to mention that Theorem 2 and the comments completely agree with the rationale of the MaxEnt approach: when all known information has been taken into account, a system with maximum entropy is the most probable state because it is the system in which the least amount of information has been defined.

Moreover, Theorem 2 justifies the approach of reconstruction of the density f, starting from a finite set of moments and passing to the full moment sequence, regardless of the M-determinacy or M-indeterminacy of f. In any case, that issue is not really of great practical significance. In fact, a full set of moments will ‘never’ be available; hence, for practical purposes, we deal only with finite n, which is perhaps ‘big enough’. Nevertheless,

f_{n}

is a valuable approximation of

f^{*}

since both

f_{n}

and

f^{*}

have the same first n moments and

h_{f_{n}} > h_{f^{*}} .

This fact also corresponds well to the MaxEnt rationale. Thus, the question of moment (in)determinacy of the density f is not essential for the procedure we follow.

4. Stieltjes Case: MaxEnt Criterion for M-Indeterminacy

We deal with a random variable X on

R_{+}, X \sim F, f = F^{'}

with finite all moments

{m_{k}}_{k = 1}^{\infty}

. Recall that

m_{0} = 1 .

We mentioned in the Introduction one fundamental fact: if the distribution F is M-indeterminate, then there are infinitely many distributions of any kind, all having the same moments as F.

Recall that a Stieltjes moment sequence

{m_{k}}_{k = 1}^{\infty}

can also be considered a Hamburger moment sequence, i.e., it is coming from a random variable in

R

. We always have to make a distinction between M-determinacy and M-indeterminacy by specifying that it is in the sense of Stieltjes, or in the sense of Hamburger. We use below the obvious terms, S-determinate, S-indeterminate, H-determinate and H-indeterminate, in their short forms, S-det, S-indet, H-det and H-indet. Let us list the possibilities for the distribution F:

If F is S-indet, it is also H-indet. If F is H-det, it is also S-det.
If F is H-indet, then either F is also S-indet, or, it may look a little ‘surprising’, F is S-det.

Thus, we have three cases; they will be discussed below. Relying on the results in Section 3, we provide now a MaxEnt criterion for M-indeterminacy in the Stieltjes case.

Theorem 3.

(Main result.) Let f be a probability density with finite all moments. Denote by

m : = {m_{k}}_{k = 1}^{\infty}

its full moment sequence and

m_{n} : = {m_{k}}_{k = 1}^{n}

its nth truncated set. If

m

is considered as a Stieltjes moment sequence, we write

f_{n}^{(S)}

for the MaxEnt approximant of f based on

m_{n} .

Similarly,

f_{n}^{(H)}

will stand for the MaxEnt approximant of f based on

m_{n}

if considering

m

as a Hamburger moment sequence. For the entropy, we use the notations

h_{f_{n}^{(S)}}

and

h_{f_{n}^{(H)}} .

The Stieltjes moment sequence

{m_{k}}_{k = 1}^{\infty}

corresponds to the moments of infinitely many distributions on

R_{+}

; equivalently, the distribution F is M-indeterminate, if and only if the following relation holds:

inf_{n} h_{f_{n}^{(H)}} > inf_{n} h_{f_{n}^{(S)}} > - \infty .

Proof.

First, we recall the well-known result according to which if n is odd, the estimator

f_{n}^{(H)}

does not exist. Since the entropy is monotone and non-increasing as n increases, it is proved in [6] (Section 3.2) that the entropy quantity

h_{f_{n - 1}^{(H)}}

can be associated with

f_{n}^{(H)}

in the sense that

h_{f_{n}^{(H)}} = h_{f_{n - 1}^{(H)}}

. Thus, the sequence

{h_{f_{n}^{(H)}}}

will be well defined for each n, filling up the ‘initial’ gap left by the odd moments. Furthermore, the inequality

h_{f_{n}^{(H)}} > h_{f_{n}^{(S)}}

is meaningful for any n since both

f_{n}^{(H)}

and

f_{n}^{(S)}

are based on the same constraints, the moment set

{m_{k}}_{k = 1}^{n}

, whilst

f_{n}^{(S)}

has an additional constraint, namely, the support is

U = R_{+} \subset R

.

Now we consider the three possibilities mentioned above. In brackets, we write what is F first, and what is second.

Case 1. [F is S-indet and H-indet] We refer to relation (4) from which it follows that

{inf}_{n} h_{f_{n}^{(H)}} > {inf}_{n} h_{f_{n}^{(S)}} > - \infty

holds.

Case 2. [F is H-indet and S-det] We recall that the unique solution, a measure, with positive support is a Nevanlinna extremal solution whose spectrum contains 0 and is a discrete unbounded subset of

[0, \infty),

see ([8], Remark 2.2.2, p. 178). Then, as quoted before,

{inf}_{n} h_{f_{n}^{(H)}}

is finite and

{inf}_{n} h_{f_{n}^{(S)}} = - \infty .

Case 3. [F is H-det and S-det] Clearly, one solution solely supported on

[0, \infty)

exists. As a consequence, for the limit

L : = {inf}_{n} h_{f_{n}^{(H)}} = {inf}_{n} h_{f_{n}^{(S)}},

we have that either L is finite or L is ‘equal’ to

- \infty

. If L is finite, the distribution F is absolutely continuous with either bounded or unbounded density. If

L = - \infty

, F is either absolutely continuous with unbounded density, or it is discrete.

It remains to show that the converse statements, call them Case 1c, Case 2c, and Case 3c, are also true. We show this by contradiction.

Indeed, in Case 1c, if

{inf}_{n} h_{f_{n}^{(H)}} > {inf}_{n} h_{f_{n}^{(S)}} > - \infty

holds true, then both cases ‘H-indet with S-det’ and ‘H-det with S-det’ are not possible. This is because they respectively require both

{inf}_{n} h_{f_{n}^{(S)}} = - \infty

and

{inf}_{n} h_{f_{n}^{(H)}} = {inf}_{n} h_{f_{n}^{(S)}}

. These arguments show that F is H-indet and S-indet. The arguments to prove Case 2c and Case 3c are similar. □

It is worth mentioning that the criterion for M-indeterminacy established in Theorem 3, the Stieltjes case, cannot be extended to the Hamburger case. Indeed, from both Case 2 [H-indet with S-det] and Case 3 [H-det with S-det], the condition ‘finite lower bound

{inf}_{n} h_{f_{n}^{(H)}}

’ does not distinguish the H-indeterminate case from the H-determinate case. Nevertheless, from Theorem 3, some useful corollaries concerning Stieltjes or Hamburger easily follow.

Corollary 1.

A necessary condition for the distribution F to be S-indeterminate with H-indeterminacy is for both quantities

{inf}_{n} h_{f_{n}^{(S)}}

and

{inf}_{n} h_{f_{n}^{(H)}}

to be finite.

Corollary 2.

A sufficient condition for the distribution F to be H-determinate is that the quantity

{inf}_{n} h_{f_{n}^{(H)}}

is ‘equal’ to

- \infty

.

Notice that in the Stieltjes case, Theorem 3 provides also a sufficient condition to guarantee the existence of a density, which is equivalent to the absolute continuity property of the distribution F. Similarly, such a condition can be extended to the Hamburger case.

Corollary 3.

Let

{m_{k}}_{k = 0}^{\infty}

be a strictly positive definite Stieltjes moment sequence which corresponds to the moments of exactly one distribution F. If

{inf}_{n} h_{f_{n}^{(S)}} = {inf}_{n} h_{f_{n}^{(H)}}

is finite, then F is absolutely continuous with either bounded or unbounded density.

5. Comments on M-Indeterminate Distributions on $R$

Here, we cite a result from ([9], Theorem 1 and Corollary 1, pp. 100–101); see also ([10], Examples 11.12 and 11.13) for a general family of distributions.

General Statement.

Suppose that F is a distribution function on

R

with moment sequence

{m_{0}, 0, m_{2}, 0, m_{4}, . . .}

(Hamburger case). Then, if F is M-indeterminate, symmetric and non-symmetric solutions exist.

Besides the above sources, we can also refer to the notion Stieltjes class,

S (f, h)

, introduced for any M-indeterminate distribution, see [11]. Recall that

S (f, h) = {f_{ε} (x) = f (x) [1 + ε p (x)], x \in R, ε \in [- 1, 1]},

where

f = F^{'}

is the density of the M-indeterminate distribution F, and

p (x), x \in R

, called a ‘perturbation function’, is a sign function with norm

| | p | | = 1

, satisfying the ‘vanishing moments’ property,

\int x^{k} f (x) p (x) d x = 0, k = 0, 1, 2, \dots

Another related recent work is [12].

It turns out that the MaxEnt technique enables us to make a further refinement of what we know about the symmetric solutions, also of the measures, which are M-indeterminate.

Theorem 4.

Suppose that F is an arbitrary distribution on

R

with finite moments and moment sequence

{m_{0}, 0, m_{2}, 0, m_{4}, . . .}

(Hamburger case) and that F is M-indeterminate. Then, the density

f^{*}

, see Section 3.2, with the largest entropy, is symmetric.

Proof.

Consider an arbitrary non-symmetric

g = (g (x), x \in R) \in D_{\infty}

. It is easy to verify that

\tilde{g} = (g (- x), x \in R)

is such that

\tilde{g} \in D_{\infty}

. Moreover, g and

\tilde{g}

have the same entropy, i.e.,

h_{\tilde{g}} = h_{g}

. Consider the densities

{\tilde{g}}_{*}

and

g_{*}

for which the entropies

h_{\tilde{g}}

and

h_{g}

are maximal. They are both in the set

D_{\infty}

. Combining the above general statement with the uniqueness of the MaxEnt density

g_{*}

, it follows that

{\tilde{g}}_{*} \equiv g_{*}

; hence,

g_{*}

is symmetric. □

6. Brief Conclusions

In this paper, we establish a new criterion for the M-indeterminacy of a probability density on the positive half-line (Stieltjes case) by involving the MaxEnt approach. Interesting corollaries are derived for probability densities on the whole real line (Hamburger case). The obtained results are new and they can be considered a valuable addition to the results based on two groups of conditions called ‘checkable’ or ‘uncheckable’ for either the M-determinacy or M-indeterminacy of distributions.

The recent review paper [1] contains a comprehensive description of significant results based on ‘uncheckable conditions’, including two illustrations of how to use this kind of condition as an indication for a specific property of a distribution in terms of its moments. However, from the applied point of view, most useful are the results involving ‘checkable conditions’. The reader is referred to the following sources: [10,13,14]. The property ‘M-indeterminacy’, besides its non-triviality as a mathematical phenomenon, arises in important applied areas. Among them are atmospheric studies, gravity theory and quantum mechanics, see, for example, [15,16,17]. The involvement of the MaxEnt technique may lead to challenging theoretical problems; however, the answers, when available, would shed additional light on the analysis of applied problems.

Author Contributions

The authors declare equal participation at all stages, formulation of the problems and the questions, finding methods to follow in order to provide solutions, performing all technical details, selecting and using appropriate references, multiple checking of the text. All authors have read and agreed to the published version of the manuscript.

Funding

The research received no external funding.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

Thanks to the anonymous referees for their positive comments, as well as for asking about the applied aspects of our results. To address the latter, we added a few papers to our References section. We are grateful to the MDPI Editors and Technical team for their help and advice during the preparation of the paper. JMS expresses his thanks to the organizers of IWAP-2023, Thessaloniki, Greece, for the invitation to give a talk. Also, JMS acknowledges the support to travel provided by the Bulgarian Ministry of Education and Science, Program PIKOM, no. DO 1-67/05.05.2022.

Conflicts of Interest

The authors declare no conflict of interest.

References

Novi Inverardi, P.L.; Tagliani, A.; Stoyanov, J.M. The problem of moments: A bunch of old classical results with some novelties. Symmetry 2023, 15, 1743. [Google Scholar] [CrossRef]
Kesavan, H.K.; Kapur, J.N. Entropy Optimization Principles with Applications; Academic Press: London, UK, 1992. [Google Scholar]
Cover, T.M.; Thomas, J.A. Elements of Information Theory; Wiley-Interscience: Hoboken, NJ, USA, 2006. [Google Scholar]
Gzyl, H.; Mayoral, S.; Gomez-Goncalves, E. Loss Data Analysis: The Maximum Entropy Approach, 2nd ed.; Walter de Gruyter: Berlin, Germany, 2023. [Google Scholar]
Milev, M.; Tagliani, A. Entropy convergence of finite moment approximations in Hamburger and Stieltjes problems. Stats. Probab. Lett. 2017, 120, 114–117. [Google Scholar] [CrossRef]
Novi Inverardi, P.L.; Tagliani, A. Stieltjes and Hamburger reduced moment problem when MaxEnt solution does not exist. Mathematics 2021, 9, 309. [Google Scholar] [CrossRef]
Shohat, J.A.; Tamarkin, J.D. The Problem of Moments; Math. Surveys No. 1; Amer. Math. Soc.: Providence, RI, USA, 1943. [Google Scholar]
Berg, C.; Valent, G. The Nevanlinna parametrization for some indeterminate Stieltjes moment problems associated with birth and death processes. Methods Appl. Anal. 1994, 1, 169–209. [Google Scholar] [CrossRef]
Heyde, C.C. Some remarks on the moment problem. Q. J. Math. 1963, 14, 91–96. [Google Scholar] [CrossRef]
Stoyanov, J.M. Counterexamples in Probability, 3rd ed.; Dover: New York, NY, USA, 2013. [Google Scholar]
Stoyanov, J.M. Stieltjes classes for moment-indeterminate probability distributions. J. Appl. Probab. 2004, 41, 281–294. [Google Scholar] [CrossRef]
López-Garcia, M. Operators on the vanishing moments subspace and Stieltjes classes for M-indeterminate probability distributions. Arab J. Math. Sci. 2022, 28, 229–242. [Google Scholar] [CrossRef]
Lin, G.D. Recent developments on the moment problem. J. Statist. Distrib. Appl. 2017, 4, 1–17. [Google Scholar] [CrossRef]
Stoyanov, J.M.; Lin, G.D.; Kopanov, P. New checkable conditions for moment determinacy of probability distributions. Theory Probab. Appl. 2020, 65, 497–509. [Google Scholar] [CrossRef]
McGraw, R.; Nemesure, S.; Schwarts, S.E. Properties and evolution of aerosols with size distributions having identical moments. J. Aerosol Sci. 1998, 29, 761–772. [Google Scholar] [CrossRef]
Janssen, O.; Mirbabayi, M.; Zograf, P. Gravity as an ensemble and the moment problem. J. High Energy Phys. 2021, 2021, 184. [Google Scholar] [CrossRef]
Mayato, R.S.; Loughlin, P.; Cohen, L. Generating M-indeterminate probability densities by way of quantum mechanics. J. Theor. Probab. 2022, 35, 1537–1555. [Google Scholar] [CrossRef]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Stoyanov, J.M.; Tagliani, A.; Novi Inverardi, P.L. Maximum Entropy Criterion for Moment Indeterminacy of Probability Densities. Entropy 2024, 26, 121. https://doi.org/10.3390/e26020121

AMA Style

Stoyanov JM, Tagliani A, Novi Inverardi PL. Maximum Entropy Criterion for Moment Indeterminacy of Probability Densities. Entropy. 2024; 26(2):121. https://doi.org/10.3390/e26020121

Chicago/Turabian Style

Stoyanov, Jordan M., Aldo Tagliani, and Pier Luigi Novi Inverardi. 2024. "Maximum Entropy Criterion for Moment Indeterminacy of Probability Densities" Entropy 26, no. 2: 121. https://doi.org/10.3390/e26020121

APA Style

Stoyanov, J. M., Tagliani, A., & Novi Inverardi, P. L. (2024). Maximum Entropy Criterion for Moment Indeterminacy of Probability Densities. Entropy, 26(2), 121. https://doi.org/10.3390/e26020121

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Maximum Entropy Criterion for Moment Indeterminacy of Probability Densities

Abstract

1. Introduction

2. Basics of Hankel Matrices and the MaxEnt Setup

3. Entropy of Densities Which Are M-Indeterminate

3.1. Entropy of Densities from the Class $D_{n}$

3.2. Entropy of Densities from the Class $D_{\infty}$

4. Stieltjes Case: MaxEnt Criterion for M-Indeterminacy

5. Comments on M-Indeterminate Distributions on $R$

6. Brief Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Maximum Entropy Criterion for Moment Indeterminacy of Probability Densities

Abstract

1. Introduction

2. Basics of Hankel Matrices and the MaxEnt Setup

3. Entropy of Densities Which Are M-Indeterminate

3.1. Entropy of Densities from the Class D n

3.2. Entropy of Densities from the Class D ∞

4. Stieltjes Case: MaxEnt Criterion for M-Indeterminacy

5. Comments on M-Indeterminate Distributions on R

6. Brief Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3.1. Entropy of Densities from the Class $D_{n}$

3.2. Entropy of Densities from the Class $D_{\infty}$

5. Comments on M-Indeterminate Distributions on $R$