1. Introduction: Missing Uncertainties
Assume that the available data consist of a sequence of independent observations, $x_1, \ldots, x_n$, each having the same mean. We consider the situation where the unknown variances of the observables cannot be supposed to be equal or restricted in any way. This scenario may seem unusual to the statistical community; however, according to Morris (1983, p. 49) [1], "… almost all applications involve unequal variances". The problem is to estimate the common mean, modeled as a location parameter, without traditional conditions on the standard deviations (scale parameters).
This setting appears in instances of heterogeneous research synthesis where $x_j$ represents the summary estimate of the common mean (e.g., the treatment effect) obtained by the $j$-th study. Commonly, the protocol of such studies demands that $x_j$ be accompanied by its uncertainty estimate, but sometimes these estimates are either unavailable or cannot be trusted. In many applications, the variances of systematic, laboratory-specific errors cannot be reliably estimated, and a scientist cannot place confidence in inferences made under unrealistically low noise. The existing imputation techniques (e.g., Rukhin [2], Templ [3]) may not provide justifiable uncertainty values.
The latest point of view (Spiegelhalter [4]) is that uncertainty is a subjective relationship between an observer and what is observed. The issue of underreported uncertainties, particularly those that stem from asymptotic normal theory, which presupposes large datasets, is prevalent in metrology. The challenge of reproducibility within individual centers may be exacerbated by the nature of the measuring instruments employed, leading to heterogeneous unknown uncertainties (see Possolo [5]). The most striking example is provided by "one-shot" devices in the atomic industry, which are limited to a single use.
An additional contemporary illustration is found in citizen science or crowd-sourcing projects, where participants contribute measurement results of various random phenomena, some of them using relatively imprecise instruments, such as smartphones. These measurements can range from precipitation levels to air quality and biological observations; see Hand [6] for an introduction. Despite the anticipated heterogeneity, the project organizers are faced with the task of synthesizing data in the absence of reliable uncertainty ($\sigma$) estimates.
Our investigation focuses on Bayes estimators obtained from the posterior distribution for an unknown mean, set against a non-informative, objective, or "uniform" prior distribution for both the mean and the independent variances. This line of inquiry, initiated by Rukhin [7] under the assumption of normality, grapples with the complete lack of variance information. Needless to say, this framework introduces several statistical complications. For instance, the classical maximum likelihood estimator becomes undefined, as the likelihood function reaches infinity at each data point. Nevertheless, the problem is well defined, as estimating the common mean requires determining at most $n$ parameters: the mean itself and the variance ratios, $\sigma_j^2 / \sum_k \sigma_k^2$, $j = 1, \ldots, n$, which belong to the unit simplex of dimension $n - 1$.
In Section 2.1, we investigate the Bayes estimators in a setting allowing for a group of homogeneous observations, which have the same unknown variance. Under the normality condition, these procedures turn out to have a surprisingly explicit form. In fact, each of the derived rules is a weighted average with data-dependent weights that are invariant under location–scale transformations, admitting a very clear interpretation. Approximate formulas for the variance of the considered estimators and their limiting behavior are also examined.
Section 3 contains several approaches to the distribution of the Bayes estimator. The orthogonal polynomials are discussed in Section 3.3, with recursive formulas derived in Section 3.4.
2. Non-Informative Priors and Bayes Estimators
Consider the situation where $n$ distinct independent observables $x_1, \ldots, x_n$ are drawn from a location–scale parameter family with an underlying symmetric density $p$, which has all necessary moments.
The principal interest is in the mean $\mu$, while the scale parameters $\sigma_1, \ldots, \sigma_n$ are positive nuisance parameters. For this purpose, one needs to estimate the $(n-1)$-dimensional vector of weights $w = (w_1, \ldots, w_n)$, with $w_j \geq 0$ and $\sum_j w_j = 1$. If $\hat{w} = \hat{w}(x)$ is such an estimator, then one can use $\hat{\mu} = \sum_j \hat{w}_j x_j$ as a $\mu$-statistic. Indeed, if all scale parameters are known, the best unbiased estimator of $\mu$ is the weighted means rule, $\sum_j w_j x_j$ with $w_j \propto \sigma_j^{-2}$.
Commonly, the estimated weights are taken to be location-invariant; i.e., for any real $c$,
$$\hat{w}_j(x_1 + c, \ldots, x_n + c) = \hat{w}_j(x_1, \ldots, x_n), \qquad j = 1, \ldots, n.$$
Then, the corresponding estimator $\hat{\mu}$ is (location) equivariant,
$$\hat{\mu}(x_1 + c, \ldots, x_n + c) = \hat{\mu}(x_1, \ldots, x_n) + c.$$
Most estimators used in practice are also scale-equivariant,
$$\hat{\mu}(c x_1, \ldots, c x_n) = c\,\hat{\mu}(x_1, \ldots, x_n), \qquad c > 0,$$
and this property calls for scale-invariant weights.
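As a quick numerical illustration of these invariance requirements (a minimal sketch; the particular weights below are purely illustrative and are not the estimators studied in this paper):

```python
import numpy as np

def weights(x):
    # Illustrative location- and scale-invariant weights: w_j is large when
    # x_j lies close to the sample mean; a common shift or positive rescaling
    # of the data leaves the normalized weights unchanged.
    d = np.abs(x - x.mean())
    w = 1.0 / d
    return w / w.sum()

def mu_hat(x):
    # Weighted means rule with data-dependent weights.
    return np.sum(weights(x) * x)

rng = np.random.default_rng(0)
x = rng.normal(size=7)
c, s = 3.7, 2.5
print(np.isclose(mu_hat(x + c), mu_hat(x) + c))  # location equivariance: True
print(np.isclose(mu_hat(s * x), s * mu_hat(x)))  # scale equivariance: True
```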
In the normal case, the reduction to the invariant procedures leads to an explicit form of the maximum likelihood estimators and of some Bayes procedures.
To eliminate the nuisance parameters $\sigma_1, \ldots, \sigma_n$ (or $\sigma_1^2, \ldots, \sigma_n^2$), one can use a non-informative prior, which is a classical technique. Under mild regularity conditions on the underlying density $p$, Rukhin [8] derived the Bayes estimator under the quadratic loss (the posterior mean) against the uniform (reference) prior for the mean and the scale parameters. This statistic coincides with the Bayes rule within the class of invariant procedures.
The discrete posterior distribution is supported by all data points, with probabilities
$$\lambda_j = \frac{\left[\prod_{k \neq j} |x_j - x_k|\right]^{-1}}{\sum_{i=1}^{n} \left[\prod_{k \neq i} |x_i - x_k|\right]^{-1}}, \qquad j = 1, \ldots, n. \quad (1)$$
Here and further, $\nu = n \bmod 2$ denotes the parity of the number of observations. Thus, the Bayes estimator has a very explicit form:
$$\tilde{x} = \sum_{j=1}^{n} \lambda_j x_j. \quad (2)$$
The magnitude of the probabilities (1) describes the intrinsic similarity of the observations: the weight of a data point $x_j$ is large if it is close to the bulk of the data, meaning that $\prod_{k \neq j} |x_j - x_k|$ is relatively small.
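A minimal numerical sketch of the weights (1) and of the estimator (2), in the form reconstructed above, makes this similarity interpretation concrete: an observation far from the bulk of the data receives a negligible weight.

```python
import numpy as np

def lambda_weights(x):
    # Probabilities (1): lambda_j is proportional to the reciprocal of the
    # product of distances from x_j to all the other data points.
    x = np.asarray(x, dtype=float)
    inv = np.array([1.0 / np.prod(np.abs(x[j] - np.delete(x, j)))
                    for j in range(len(x))])
    return inv / inv.sum()

def x_tilde(x):
    # The Bayes estimator (2): a weighted average of the observations.
    return float(np.sum(lambda_weights(x) * np.asarray(x, dtype=float)))

x = [9.8, 10.1, 10.0, 12.3, 10.2]
print(np.round(lambda_weights(x), 4))  # the point 12.3 gets almost no weight
print(round(x_tilde(x), 3))            # close to the bulk of the data
```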
The statistic $\tilde{x}$ also appears in approximation theory. According to the Tchebycheff interpolation formula, one has
$$Q(x) = \frac{\sum_j \frac{w_j}{x - x_j}\, Q(x_j)}{\sum_j \frac{w_j}{x - x_j}}, \qquad w_j = \prod_{k \neq j} (x_j - x_k)^{-1},$$
where $Q$ runs through polynomials of degree not exceeding $n - 1$. See Chapter 5 in Trefethen (2013) [9].
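The following sketch checks the barycentric form of the interpolation formula assumed above: with $n$ distinct nodes, it reproduces any polynomial of degree up to $n - 1$.

```python
import numpy as np

def bary_eval(xs, ys, t):
    # Barycentric interpolation with weights w_j = 1 / prod_{k != j}(x_j - x_k);
    # exact for polynomials of degree <= n - 1 (Trefethen, Ch. 5).
    w = np.array([1.0 / np.prod(xs[j] - np.delete(xs, j))
                  for j in range(len(xs))])
    return np.sum(w * ys / (t - xs)) / np.sum(w / (t - xs))

xs = np.array([0.0, 0.7, 1.3, 2.0, 3.1])         # n = 5 distinct nodes
Q = lambda u: 2 * u**4 - u**3 + 5 * u - 1        # degree n - 1 = 4
print(np.isclose(bary_eval(xs, Q(xs), 1.77), Q(1.77)))  # True
```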
The probabilities (1) have their origin in optimization problems involving the discriminant function. Borodin [10] discusses their use in statistical physics. Genest et al. [11] study the remarkable mirror symmetry (persymmetry) of the underlying Jacobi matrix.
2.1. Heterogeneity and Homogeneity
Here, normality of the observations, $x_j \sim N(\mu, \sigma_j^2)$, $j = 1, \ldots, n$, is assumed. We consider the setting where, in addition to the $x$ values, there is a possible group of distinct homogeneous data that have the same unknown standard deviation $\sigma_0$, say, $y_1, \ldots, y_m$. In the context of the citizen science projects mentioned in Section 1, the $y$ values may represent data supplied by smartphone users, while the $x$ values correspond to measurements derived by other means. In metrology applications, a known homogeneous group of laboratories employing the same techniques may participate in interlaboratory studies.
Then, one has $n + m$ independent observations and altogether $n + 2$ unknown parameters, $\mu, \sigma_0, \sigma_1, \ldots, \sigma_n$, with the main interest in $\mu$.
We start with the traditional reference prior density of the form
$$\pi(\mu, \sigma_0, \sigma_1, \ldots, \sigma_n) = \sigma_0^{-1} \prod_{j=1}^{n} \sigma_j^{-1} \quad (3)$$
relative to $d\mu \, d\sigma_0 \, d\sigma_1 \cdots d\sigma_n$. For any continuous bounded function $f$, the posterior expectation of $f(\mu)$ is obtained by integrating out the scale parameters. For any fixed small $\epsilon > 0$, provided that all data points are different, the integral over the $\epsilon$-neighborhoods of the data points dominates the integral over the rest of the real line, so that in the limit the posterior mass concentrates at the points $x_1, \ldots, x_n$.
Thus, we can formulate the first result.
Theorem 1. Under the prior (3), provided that all the data points are different, the posterior distribution of μ is discrete with finite support $\{x_1, \ldots, x_n\}$ and the probabilities
$$\lambda_j \propto \frac{a_j}{\prod_{k \neq j} |x_j - x_k|}, \qquad j = 1, \ldots, n. \quad (4)$$
Here, $a_j$ is an attenuating factor depending on $\bar{y}$ and $s^2$, the estimators of the common mean and the variance based on the homogeneous sub-sample. The Bayes estimator of μ, i.e., the posterior mean, is
$$\hat{\mu} = \sum_{j=1}^{n} \lambda_j x_j \quad (5)$$
with $\lambda_j$ defined by (4).
The probabilities (4) still describe the intrinsic similarity of the observations: the weight of a data point $x_j$ is large if it is close to the greater part of the data. The attenuating factor $a_j$ is small when $x_j$ is far from $\bar{y}$, which encourages an estimate close to $\bar{y}$. When the homogeneous data are absent, this factor is 1, and (5) coincides with (2).
In this situation, the posterior mode, the support point carrying the largest of the probabilities (4), presents the maximum likelihood estimator within the class of invariant procedures.
The prior density (3) for the mean and the variances represents the right Haar measure on the group of linear transformations. In the context of the multivariate normal model, it is known as the Geisser–Cornfield prior; see Geisser [12], Ch. 9.1. This prior is known to be an exact frequentist matching prior, yielding Fisher's fiducial distribution as the posterior (Fernandez and Steel [13], Severini et al. [14]).
Despite this fact, "the prior seems to be quite bad for correlations, predictions and other inferences involving a multivariate normal distribution" (Sun and Berger [15]). The mentioned drawbacks stem from the fact that the marginal (or prior predictive) density does not exist. A related weakness of (5) is its sensitivity to observations which are close to one another.
To mitigate these drawbacks, we now look for other prior distributions.
2.2. Conjugate Priors and Variance Formulas
A wide class of Bayes estimators of $\mu$ arises from conjugate prior densities,
$$\pi(\mu, \sigma_0, \sigma_1, \ldots, \sigma_n) \propto \sigma_0^{-a-1} \exp\!\left(-\frac{b}{2\sigma_0^2}\right) \prod_{j=1}^{n} \sigma_j^{-a-1} \exp\!\left(-\frac{b}{2\sigma_j^2}\right) \quad (6)$$
relative to $d\mu \, d\sigma_0 \, d\sigma_1 \cdots d\sigma_n$. Here, $a$ and $b$ are hyperparameters to be specified in (6), $a, b > 0$.
A slightly modified proof of Theorem 1 shows that the posterior distribution of $\mu$ under the prior (6) is proportional to
$$\prod_{j=1}^{n} \left[b + (x_j - \mu)^2\right]^{-(a+1)/2} \cdot \left[\kappa + m(\bar{y} - \mu)^2\right]^{-(m+a)/2},$$
where $\kappa = b + (m - 1)s^2$, which is treated as a constant in the following discussion. The posterior distribution in this situation is the product of $n$ $t$-densities (with $a$ degrees of freedom) and a $t$-density (with $m + a - 1$ degrees of freedom). Thus, it is a particular case of the poly-$t$ distribution, which is ubiquitous in multivariate analysis. It appears in the posterior analysis of linear models (Box and Tiao [16]) and is popular in econometrics (Bauwens [17]).
The Bayes estimator has the form
$$\hat{\mu} = \frac{\int \mu \prod_j \left[b + (x_j - \mu)^2\right]^{-(a+1)/2} \left[\kappa + m(\bar{y} - \mu)^2\right]^{-(m+a)/2} d\mu}{\int \prod_j \left[b + (x_j - \mu)^2\right]^{-(a+1)/2} \left[\kappa + m(\bar{y} - \mu)^2\right]^{-(m+a)/2} d\mu}. \quad (7)$$
If $m = 0$, i.e., there are no homogeneous observations, (7) is the classical Pitman estimator of the location parameter involving $t$-distributions with $a$ degrees of freedom. It is especially well studied for the Cauchy location/scale parameter family ($a = 1$).
If, in addition, $b \to 0$, one obtains
$$\frac{\int \mu \prod_j |x_j - \mu|^{-a-1}\, d\mu}{\int \prod_j |x_j - \mu|^{-a-1}\, d\mu},$$
which corresponds to the formal Pitman estimator of the location parameter derived from the working family $f(u) \propto |u|^{-a-1}$ employed to estimate the location parameter when the observations are normal.
Needless to say, the functions in this family are not probability densities. Moreover, they have a singularity of the third kind.
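Because the posterior reconstructed above involves only one-dimensional integrals, the estimator (7) can be evaluated by ordinary quadrature. The following sketch is a minimal implementation of that reconstruction; the example data and the integration limits are arbitrary.

```python
import numpy as np
from scipy.integrate import quad

def bayes_poly_t(x, y, a, b):
    # Posterior mean (7) under the conjugate prior (6), as reconstructed above:
    # each heterogeneous x_j contributes a t-type factor with a degrees of
    # freedom; the homogeneous group y contributes one factor with
    # m + a - 1 degrees of freedom.
    x, y = np.asarray(x, float), np.asarray(y, float)
    m, ybar = len(y), y.mean()
    kappa = b + (m - 1) * y.var(ddof=1)   # treated as a constant in the text

    def kernel(mu):
        fx = np.prod((b + (x - mu) ** 2) ** (-(a + 1) / 2))
        fy = (kappa + m * (ybar - mu) ** 2) ** (-(m + a) / 2)
        return fx * fy

    lo = min(x.min(), y.min()) - 10.0
    hi = max(x.max(), y.max()) + 10.0
    num = quad(lambda mu: mu * kernel(mu), lo, hi)[0]
    return num / quad(kernel, lo, hi)[0]

x = [9.8, 10.1, 10.0, 12.3, 10.2]   # heterogeneous observations
y = [10.4, 9.9, 10.3, 10.0]         # homogeneous sub-sample
print(round(bayes_poly_t(x, y, a=2.0, b=0.5), 3))
```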
When $a$ and $b$ are fixed positive numbers, the approximate variance of (7) can be found via the usual argument employed for $M$-estimators, i.e., solutions of a moment-type equation $\sum_j \psi_j(x_j - \mu) = 0$ (or minimizers of $\sum_j \rho_j(x_j - \mu)$). In our case, the contrast functions are
$$\rho(u) = \frac{a+1}{2}\,\log\!\left(b + u^2\right), \qquad \rho_0(u) = \frac{m+a}{2}\,\log\!\left(\kappa + m u^2\right).$$
The $M$-estimator $\hat{\mu}$ satisfies the equation
$$\sum_{j=1}^{n} \frac{(a+1)(x_j - \hat{\mu})}{b + (x_j - \hat{\mu})^2} + \frac{(m+a)\,m\,(\bar{y} - \hat{\mu})}{\kappa + m(\bar{y} - \hat{\mu})^2} = 0.$$
According to well-known results (e.g., Huber and Ronchetti [18]),
$$\mathrm{Var}(\hat{\mu}) \approx \frac{\sum_j \mathbb{E}\,\psi_j^2(x_j - \mu)}{\left[\sum_j \mathbb{E}\,\psi_j'(x_j - \mu)\right]^2}, \quad (8)$$
where $\psi = \rho'$, $\psi_0 = \rho_0'$, and the sums include the term corresponding to $\bar{y} - \mu$.
Here, $\mathbb{E}$ refers to the expectation evaluated under the normal distribution with zero mean and variance $\sigma_j^2$; the distribution of $\bar{y} - \mu$ is also normal, with variance $\sigma_0^2/m$. The main restriction on the variances is that the Central Limit Theorem for $\sum_j \psi_j(x_j - \mu)$ holds. For instance, one can employ (8) if the Liapounov condition for independent non-identically distributed summands is satisfied, i.e.,
$$\frac{\sum_j \mathbb{E}\,|\psi_j(x_j - \mu)|^3}{\left[\sum_j \mathbb{E}\,\psi_j^2(x_j - \mu)\right]^{3/2}} \to 0$$
(e.g., Lehmann [19], Theorem 2.7.3).
To simplify (8), we need the known formula for the standard normal $Z$ and positive $t$,
$$\mathbb{E}\,\frac{1}{Z^2 + t^2} = \frac{R(t)}{t},$$
where $R(t) = [1 - \Phi(t)]/\varphi(t)$ is the familiar Mills ratio (Stuart and Ord, 1994) [20]. With $t_j = \sqrt{b}/\sigma_j$, the differentiation of this identity shows that, for $t > 0$,
$$\mathbb{E}\,\frac{1}{(Z^2 + t^2)^2} = \frac{(1 - t^2)R(t) + t}{2t^3}$$
and
$$\mathbb{E}\,\frac{Z^2}{(Z^2 + t^2)^2} = \frac{(1 + t^2)R(t) - t}{2t},$$
where for the term corresponding to $\bar{y}$ one has to replace $\sigma_j$ by $\sigma_0/\sqrt{m}$.
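A quick Monte Carlo check of the first of these identities (the differentiated versions can be verified in the same way):

```python
import numpy as np
from scipy.stats import norm

def mills(t):
    # Mills ratio R(t) = (1 - Phi(t)) / phi(t) of the standard normal.
    return norm.sf(t) / norm.pdf(t)

rng = np.random.default_rng(1)
Z = rng.standard_normal(2_000_000)
for t in (0.5, 1.0, 2.0):
    mc = np.mean(1.0 / (Z**2 + t**2))    # Monte Carlo for E[1/(Z^2 + t^2)]
    print(t, round(mc, 4), round(mills(t) / t, 4))  # the two columns agree
```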
These identities allow for expressing the approximate variance (8) in terms of the Mills ratio. In the absence of the homogeneous group,
$$\mathrm{Var}(\hat{\mu}) \approx \frac{\sum_j \sigma_j^{-2}\,\dfrac{(1 + t_j^2)R(t_j) - t_j}{2t_j}}{\left[\sum_j \sigma_j^{-2}\left(1 - t_j R(t_j)\right)\right]^2}, \qquad t_j = \frac{\sqrt{b}}{\sigma_j}, \quad (9)$$
and an analogous expression holds when the homogeneous group is present. Since $R(t) < 1/t$ for $t > 0$, all terms in the denominator of (9) are positive. When all the variances are equal, (9) can be compared with the variance of
$$\bar{x} = \frac{1}{n}\sum_j x_j,$$
the best unbiased estimator of $\mu$ when all the variances are equal. If the hyperparameters in (6) are chosen so that $b$ is an adequate approximation of the common variance, then the variance of $\hat{\mu}$ is only moderately larger than that of $\bar{x}$; smaller values of $a$ lead to the larger variance of $\hat{\mu}$.
If $b = b_n \to 0$ for some sequence $b_n$, the corresponding estimator is asymptotically normal, albeit at a slower rate than $n^{-1/2}$. Therefore, there is no surprise that in the case of $b = 0$ (for which the posterior again concentrates at the data points), one has $n\,\mathrm{Var}(\hat{\mu}) \to \infty$ as $n \to \infty$. Indeed, it seems that $\hat{\mu}$ bears more resemblance to the nonparametric estimates of the location parameter, for which the convergence rate is slower than $n^{-1/2}$. Numerical experiments suggest that, in the normal case, $\mathrm{Var}(\hat{\mu})$ indeed decreases more slowly than $n^{-1}$.
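A sketch of such an experiment, assuming the weights (1) in the form reconstructed above (computed on the log scale for numerical stability): if the parametric rate $n^{-1/2}$ held, the printed products $n\,\widehat{\mathrm{Var}}$ would stabilize, while a slower rate makes them grow with $n$.

```python
import numpy as np

rng = np.random.default_rng(2)

def x_tilde(x):
    # Estimator (2) with weights (1), computed via logarithms to avoid
    # underflow in the products of pairwise distances.
    logw = np.array([-np.sum(np.log(np.abs(x[j] - np.delete(x, j))))
                     for j in range(len(x))])
    w = np.exp(logw - logw.max())
    return np.sum(w * x) / w.sum()

for n in (10, 20, 40, 80):
    est = np.array([x_tilde(rng.standard_normal(n)) for _ in range(4000)])
    print(n, round(n * est.var(), 3))   # grows if the rate is slower than n^{-1/2}
```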
We now summarize the main results of this section.
Theorem 2. Under the prior distribution (6), the posterior distribution is the product of $t$-distributions with $a$ degrees of freedom and a $t$-distribution with $m + a - 1$ degrees of freedom. The approximate variance of the Bayes estimator (7) satisfies (8), with Expression (9) via the Mills ratio.
For the remainder of this paper, we will concentrate on the estimator (5).