1. Introduction
A population quantity (for example, the average height of all men) can be estimated in many different ways. An unbiased estimator (see Definition 1) is one whose expected value equals the population quantity, that is, one that estimates the population quantity with zero bias. Examples of applications of unbiased estimation include the estimation of the average length of stay in intensive care units during the COVID-19 pandemic (Lapidus et al. [1]); the estimation of cumulative incidence incorporating antibody kinetics and epidemic recency (Takahashi et al. [2]); the estimation of background distributions for automated quantitative imaging (Silberberg and Grecco [3]); and target tracking with Doppler radar (Han et al. [4]). The Rao–Blackwell theorem is a device for generating unbiased estimators with small variances. In this note, we aim to generalize this effective theorem for finding such estimators. Fisher [5] introduced sufficient statistics in 1920; see Definition 2.
Definition 1. Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$, $\theta \in \Theta$. Let $g(\theta)$ be a parameter taking real values. An estimator $T(X)$, $T : \mathbb{R}^n \to \mathbb{R}$, of $g(\theta)$ is unbiased if and only if $E_\theta\{T(X)\} = g(\theta)$ for every $\theta \in \Theta$.
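As a simple illustration (a standard example, not specific to this note), if $E_\theta|X_1| < \infty$, the sample mean is an unbiased estimator of the population mean $\mu = E_\theta(X_1)$:
$$ E_\theta(\bar{X}) = E_\theta\Big(\frac{1}{n}\sum_{i=1}^{n} X_i\Big) = \frac{1}{n}\sum_{i=1}^{n} E_\theta(X_i) = \mu \quad \text{for every } \theta \in \Theta. $$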
Definition 2 (Sufficient statistic (Fisher [5])). Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$, $\theta \in \Theta$. A statistic $T(X)$ is said to be sufficient for θ if the conditional probability distribution of X, given the statistic $T(X) = t$, is independent of θ.

Examples of sufficient statistics can be found in the books Lehmann [6], Casella and Berger [7], and Shao [8]. We should understand sufficient statistics well in order to derive uniformly minimum variance unbiased estimators (UMVUEs); see Definition 4. Statisticians have cleverly embedded sufficient statistics into estimators, which is the main idea of the Rao–Blackwell theorem; see Rao [9] and Blackwell [10]. UMVUEs can be calculated from complete sufficient statistics (see Definition 5), leading to the Lehmann–Scheffé theorem; see Lehmann and Scheffé [11,12] and Kumar and Vaish [13]. A complete statistic is defined in Definition 3.
Definition 3. Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$, $\theta \in \Theta$. A statistic $T(X)$, $T : \mathbb{R}^n \to \mathbb{R}$, is said to be complete for θ if and only if, for any Borel-measurable function $f$ from $\mathbb{R}$ to $\mathbb{R}$, $E_\theta\{f(T(X))\} = 0$ for all $\theta \in \Theta$ implies $f(T(X)) = 0$ almost surely $P_\theta$.
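As a standard illustration of Definition 3 (our choice of family; compare also Example 1 below), let $T(X) = X$, where $X$ is Poisson-distributed with mean $\theta > 0$. Then
$$ E_\theta\{f(X)\} = \sum_{x=0}^{\infty} f(x)\, e^{-\theta}\, \frac{\theta^x}{x!} = 0 \ \text{for all } \theta > 0 \;\Longrightarrow\; \sum_{x=0}^{\infty} \frac{f(x)}{x!}\, \theta^x \equiv 0 \;\Longrightarrow\; f(x) = 0 \ \text{for all } x = 0, 1, 2, \dots, $$
since a power series that vanishes for all $\theta > 0$ has all of its coefficients equal to zero; hence $X$ is complete.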
Definition 4. Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$, $\theta \in \Theta$. An unbiased estimator $T(X)$ of $g(\theta)$ is a UMVUE if and only if $\operatorname{Var}_\theta\{T(X)\} \le \operatorname{Var}_\theta\{U(X)\}$ for every $\theta \in \Theta$ and every unbiased estimator $U(X)$ of $g(\theta)$.
Definition 5 (Complete sufficient statistic (Fisher [5])). Let $T(X)$ be a sufficient statistic for θ. If $E_\theta\{f(T(X))\} = 0$ for all $\theta \in \Theta$ implies $f(T(X)) = 0$ with probability 1, for every measurable function $f$, then $T(X)$ is said to be a complete sufficient statistic for θ.

Applications of the Rao–Blackwell and Lehmann–Scheffé theorems are still widespread. Application areas have included reliability estimation (Kumar and Vaish [13]), adaptive cluster sampling (Felix-Medina [14]), alchemical free energy calculation (Ding et al. [15]), hazardous source parameter estimation (Ristic et al. [16]) and quantum probability (Sinha [17]).
However, nonconstant parametric functions can have UMVUEs even when there is no complete sufficient statistic. Some authors address such problems through Theorem 1, focusing on problem 5.11 in Rao [9] or pages 76–77 in Lehmann and Scheffé [11,12] (both of which are essentially Example 7 below).
Theorem 1 (Lehmann–Scheffé theorem (Lehmann and Scheffé [11,12])). Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$, $\theta \in \Theta$. A necessary and sufficient condition for a statistic $\delta(X)$ with finite variance to be a UMVUE of its mean is that $\operatorname{Cov}_\theta\{\delta(X), U\} = 0$ for all $U \in \mathcal{U}_0$ and all $\theta \in \Theta$, where $\mathcal{U}_0$ denotes the set of all the unbiased estimators of 0. This theorem can be used whenever there are no complete sufficient statistics. It is a competitor to the Rao–Blackwell theorem.
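The necessity of the condition in Theorem 1 can be seen from the standard perturbation argument (a sketch in the notation above): if $\delta(X)$ is a UMVUE and $U \in \mathcal{U}_0$ has finite variance, then $\delta(X) + \lambda U$ is unbiased for the same mean for every real $\lambda$, so
$$ \operatorname{Var}_\theta(\delta + \lambda U) = \operatorname{Var}_\theta(\delta) + 2\lambda\, \operatorname{Cov}_\theta(\delta, U) + \lambda^2\, \operatorname{Var}_\theta(U) \;\ge\; \operatorname{Var}_\theta(\delta) \quad \text{for all } \lambda \in \mathbb{R}, $$
which is possible only if $\operatorname{Cov}_\theta(\delta, U) = 0$; otherwise a small $\lambda$ of the opposite sign would reduce the variance.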
This fact is hardly pointed out or explained in undergraduate or graduate textbooks; see, for example, Bondesson [18]. The motivation to introduce a new concept of sufficient statistic, called an -sufficient statistic, comes from the above discussion. We investigate the properties of -sufficient statistics and compare them with those of sufficient statistics. Then, the Rao–Blackwell theorem (RBT) and Lehmann–Scheffé theorem (LST) are generalized in a way that can solve some of the problems where UMVUEs exist but there are no complete sufficient statistics; cf. problem 5.11 in Rao [19], pages 76–77 in Lehmann [6], page 167, Example 3.7 in Shao [8], page 366, Example 10 in Rohatgi and Ehsanes [20], page 377, Section 7.6.1 in Mukhopadhyay [21], page 243 in Peña and Rohatgi [22], page 293, Section 12.4 in Roussas [23] and pages 330–331 in Mood et al. [24]. Some of the theorems are restated and proved by using the newly introduced -sufficient statistic.
Definition 6 (Ancillary statistic). Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$, $\theta \in \Theta$. A statistic $T(X)$ is said to be ancillary for θ if its distribution is the same for all $\theta \in \Theta$.
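For example (a standard location-family illustration, not one of the models used later in this note), if $X_1$ and $X_2$ are independent $N(\theta, 1)$ random variables, then $T(X) = X_1 - X_2$ is ancillary for θ, since
$$ X_1 - X_2 \sim N(0, 2) \quad \text{for every } \theta \in \Theta, $$
so the distribution of $T(X)$ carries no information about θ on its own.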
Boos and Hughes-Oliver [25] state that “If a minimal sufficient statistic is not complete, then by the suggestion of Fisherian tradition we should consider conditioning on ancillary statistics (see Definition 6) for the purposes of inference. This approach runs into problems because there are many situations where several ancillary statistics exist but there are no maximal ‘ancillaries’. Of course, when a complete sufficient statistic exists, Basu’s theorem assures us that we need not worry about conditioning on ancillary statistics since they are all independent of the complete sufficient statistic”. We suggest complete -sufficient statistics for the purposes of inference when there are no complete sufficient statistics. Theorem 1 assures us that we need not worry about ancillary statistics, since they are uncorrelated with complete -sufficient statistics.
2. The Main Contribution
If the minimal sufficient statistic is not complete, then the RBT and LST will not be of much use, as has been explicitly stated in various books and papers; see, for example, page 46, Section 2, Example 1 of Bondesson [18], page 243 of Peña and Rohatgi [22], page 293, Section 12.4 of Roussas [23], pages 330–331 of Mood et al. [24], page 343, Section 7.3 of Casella and Berger [7], page 86, Example 1.8 of Lehmann and Casella [26], Section 1 of Bahadur [27] and Section 1 of Stigler [28].
The main contribution of this note is a generalization of the RBT and LST based on the newly introduced -sufficient statistics. This enables us to obtain UMVUEs even when the minimal sufficient statistic is not complete, in which case the RBT and LST are not directly applicable.
Consider a model $(\mathcal{X}, \mathcal{A}, \mathcal{P})$. Let $\mathcal{P}$ denote the set of probability measures on the sample space $(\mathcal{X}, \mathcal{A})$. Let $P_\theta$ denote an element in $\mathcal{P}$. $P_\theta$ is the population. $X$ is a sample. Let $X : \mathcal{X} \to \mathcal{X}$, $(\mathcal{T}, \mathcal{B})$ and $T$ denote, respectively, the identity mapping, a measurable space and a measurable mapping (that is, $T^{-1}(B) \in \mathcal{A}$ for all $B \in \mathcal{B}$). $T$ is a statistic from $(\mathcal{X}, \mathcal{A})$ to $(\mathcal{T}, \mathcal{B})$, written as $T(X)$. $g(\theta)$ is referred to as a U-estimable parameter if it has an unbiased estimator.
Throughout this note, we assume that $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$, $\theta \in \Theta$. Assume also that $g(\theta)$ has an unbiased estimator. Let $\mathcal{U}_g$ denote the class of unbiased estimators of $g(\theta)$. All the considered estimators are assumed to have finite variances. The sample space used in this note is $\mathbb{R}^n$ and the elements of $\mathcal{A}$ are Borel sets. For related notation and discussions, see Shao [8].
3. Sufficient Statistics
Sufficient statistics can be used to derive maximum likelihood estimators of a population quantity; since maximum likelihood estimation is a popular estimation method, sufficient statistics are important. Sufficient statistics were defined in Definition 2. Two weaker concepts of sufficiency, tailored to a given U-estimable parameter $g(\theta)$, are introduced and discussed in the following, and some of their properties are studied in the sequel.
3.1. -Sufficient Statistic in Distribution
Definition 7. Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$, $\theta \in \Theta$. A statistic $T(X)$ is -sufficient in distribution for $g$ if, for all $U \in \mathcal{U}_g$, there is a Markov kernel $K(\cdot, \cdot)$ such that, for every $\theta \in \Theta$, $K(\cdot, T(X))$ is a version of a regular conditional distribution of $U(X)$ given $T(X)$ under $P_\theta$.
Definition 7 introduces a class of statistics that is weaker than sufficiency; this is not the main aim of this note, but such statistics could be used in the Rao–Blackwell and Lehmann–Scheffé theorems. We use this idea in Definition 8.
Example 1 (Example of Meeden [29]). Let X be Poisson-distributed with mean λ, so X belongs to the exponential family. Then, X is a complete sufficient statistic and $(-1)^X$ is the only unbiased estimator for $e^{-2\lambda}$. By Definition 7, $X + k$, for $k$ a constant, is an -sufficient statistic in distribution for $e^{-2\lambda}$. We can check that $(-1)^X$ is a UMVUE for $e^{-2\lambda}$. The estimator $(-1)^X$ is nevertheless not suitable for estimating $e^{-2\lambda}$, in the same way that, in the Bernoulli distribution with parameter p, the estimator X is not suitable for estimating p. Of course, increasing the sample size or varying the loss function remedies this deficiency.
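For this classical form of Meeden's example (as recalled above), the unbiasedness of $(-1)^X$ for $e^{-2\lambda}$ can be verified directly:
$$ E_\lambda\big[(-1)^X\big] = \sum_{x=0}^{\infty} (-1)^x\, e^{-\lambda}\, \frac{\lambda^x}{x!} = e^{-\lambda} \sum_{x=0}^{\infty} \frac{(-\lambda)^x}{x!} = e^{-\lambda}\, e^{-\lambda} = e^{-2\lambda}, $$
and uniqueness follows from the completeness of X: two unbiased estimators would differ by a function $f$ with $\sum_{x=0}^{\infty} f(x)\lambda^x / x! = 0$ for all $\lambda > 0$, which forces $f \equiv 0$.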
3.2. -Sufficient Statistic
To derive UMVUEs when there are no complete sufficient statistics, we need to introduce a new concept named an -sufficient statistic for $g$. It is defined as follows.
Definition 8. Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$, $\theta \in \Theta$. A statistic $T(X)$ is an -sufficient statistic for $g$ if, for all $U \in \mathcal{U}_g$, there is a measurable mapping $h$ such that for every $\theta \in \Theta$ we have $E_\theta\{U(X) \mid T(X)\} = h(T(X))$ almost surely $P_\theta$.
Example 2. Let X be a single observation from $P_\theta$ with the discrete distribution
$$ P_\theta(X = -1) = \theta, \qquad P_\theta(X = x) = (1 - \theta)^2 \theta^x, \quad x = 0, 1, 2, \dots, $$
where $\theta \in (0, 1)$ is unknown. The statistic $T(X) = I(X = 0)$ is an -sufficient statistic for $g(\theta) = (1 - \theta)^2$ because
$$ E_\theta\{U(X) \mid T(X)\} = T(X) \qquad (1) $$
almost surely $P_\theta$ for every $U \in \mathcal{U}_g$ and $\theta \in (0, 1)$. The expectations needed for the left hand side of (1) are
$$ E_\theta\{U(X)\, I(X = 0)\} = (1 - \theta)^2 \quad \text{and} \quad E_\theta\{U(X)\, I(X \ne 0)\} = 0, $$
and X is a minimal sufficient statistic for θ because of the LST. However, X is not complete since $E_\theta(X) = 0$ for every θ while $X$ is not almost surely equal to 0. Also, $T(X)$ is not an -sufficient statistic in distribution for $g$ since the conditional distribution of $U(X)$ given $T(X)$ depends on θ.
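As a consistency check (with the distribution displayed in Example 2), Theorem 1 already identifies $I(X = 0)$ as the UMVUE of $(1 - \theta)^2$: every unbiased estimator of zero is a multiple of $X$ (as shown in the derivation that follows), $E_\theta(X) = 0$, and
$$ \operatorname{Cov}_\theta\big(I(X = 0),\, cX\big) = c\, E_\theta\big[X\, I(X = 0)\big] - c\, E_\theta\big[I(X = 0)\big]\, E_\theta(X) = 0 - 0 = 0 $$
for every $\theta \in (0, 1)$ and every constant $c$.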
To obtain all of the unbiased estimators of 0, see the following derivation. For every $\theta \in (0, 1)$, we have
$$ E_\theta\{U(X)\} = \theta\, U(-1) + (1 - \theta)^2 \sum_{x=0}^{\infty} \theta^x U(x). $$
Then, for any $U \in \mathcal{U}_0$,
$$ \theta\, U(-1) + (1 - 2\theta + \theta^2) \sum_{x=0}^{\infty} \theta^x U(x) = 0 \quad \text{for all } \theta \in (0, 1). $$
Comparing power series coefficients, we have
$$ U(0) = 0, \qquad U(1) + U(-1) = 0, \qquad U(x) - 2U(x - 1) + U(x - 2) = 0, \quad x = 2, 3, \dots, $$
or
$$ U(x) = c\, x, \qquad x = -1, 0, 1, 2, \dots, $$
where $c = U(1)$; that is, $\mathcal{U}_0 = \{cX : c \in \mathbb{R}\}$, and consequently every unbiased estimator of $(1 - \theta)^2$ has the form $I(X = 0) + cX$. Some properties of -sufficient statistics are given in Theorem 2.
Theorem 2. Let $X = (X_1, \dots, X_n)$ be as above. Consider
- (i) a sufficient statistic for $P_\theta$ (or θ),
- (ii) an -sufficient statistic in distribution for $g$,
- (iii) an -sufficient statistic for $g$.
Then, we have
- (a) any sufficient statistic for $P_\theta$ is an -sufficient statistic in distribution for $g$;
- (b) any -sufficient statistic in distribution for $g$ is an -sufficient statistic for $g$;
- (c) any sufficient statistic for $P_\theta$ is an -sufficient statistic for $g$.
Proof. (a) follows because the conditional distribution of the sample given a sufficient statistic does not depend on θ, and hence neither does the conditional distribution of any unbiased estimator. (b) follows because, if the conditional distribution of an unbiased estimator given an -sufficient statistic in distribution does not depend on θ, then neither does its conditional expectation. (c) follows because the conditional distribution of the sample given a sufficient statistic does not depend on θ (or by combining (a) and (b)). □
Remark 1. In general, none of the three parts of Theorem 2 admits a converse (see Examples 1 and 2).
It is clear from Theorem 2 and Remark 1 that the class of -sufficient statistics for $g$ contains the class of sufficient statistics for $P_\theta$. We can also conclude from Theorem 2 that jointly sufficient statistics are -sufficient statistics.
Proposition 1. Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$. If an unbiased estimator $\delta(X)$ of $g(\theta)$ is unique, then $\delta(X)$ is an -sufficient statistic for $g$.
Proof. Obviously, $E_\theta\{\delta(X) \mid \delta(X)\} = \delta(X)$ almost surely $P_\theta$, and $\delta(X)$ is the only element of $\mathcal{U}_g$, so the condition in the definition of -sufficiency holds; cf. Casella and Berger [7] and Shao [8]. □
Proposition 2. Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$. Let $T(X)$ be an -sufficient statistic for $g$ such that $T(X) = h(S(X))$ for $S(X)$, another statistic, and $h$, a one-to-one measurable function. Then, $S(X)$ is an -sufficient statistic for $g$.
Proof. Let $U(X)$ denote an unbiased estimator of $g(\theta)$. Then, we have $E_\theta\{U(X) \mid S(X)\} = E_\theta\{U(X) \mid T(X)\}$ almost surely $P_\theta$, because $h$ is one-to-one and so $S(X)$ and $T(X)$ generate the same σ-field, which shows that $E_\theta\{U(X) \mid S(X)\}$ is independent of θ; cf. Casella and Berger [7]. □
Remark 2. Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$. Let $T(X)$ be an -sufficient statistic for $g$ and $S(X)$ another statistic such that $S(X) = h(T(X))$ for a measurable function $h$. We might expect $S(X)$ to be an -sufficient statistic for $g$, but, in general, it is not. Consider Example 2 again: let $T(X) = X$ and let $S(X)$ take the values 1, 0 and 2 for $X = -1$, $X = 0$ and $X \ge 1$, respectively. Then, one can verify that (i) $T(X)$ is an -sufficient statistic, (ii) $S(X)$ is a function of $T(X)$, but (iii) $S(X)$ is not an -sufficient statistic.
4. A Generalization of RBT and LST
We now apply the RBT with an arbitrary -sufficient statistic for $g$ to obtain a better estimator.
Theorem 3. Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$. Let $T(X)$ be an -sufficient statistic for $g$. Let $U(X)$ be an unbiased estimator of a U-estimable $g(\theta)$, and let the loss function $L(\theta, \delta)$ be a strictly convex function of δ. Then, if $U(X)$ has finite expectation and risk $R(\theta, U) < \infty$, the estimator $\eta(T(X)) := E\{U(X) \mid T(X)\}$ belongs to $\mathcal{U}_g$, and its risk satisfies $R(\theta, \eta(T)) < R(\theta, U)$ unless $U(X) = \eta(T(X))$ almost surely $P_\theta$.
Proof. Since L is convex, by Jensen's inequality,
$$ L\big(\theta, \eta(T(X))\big) = L\big(\theta, E\{U(X) \mid T(X)\}\big) \le E\big\{L(\theta, U(X)) \mid T(X)\big\} $$
and, taking expectations,
$$ R\big(\theta, \eta(T)\big) \le R(\theta, U), $$
with strict inequality unless $U(X) = \eta(T(X))$ almost surely $P_\theta$, because L is strictly convex. Hence, the result follows from Definition 8, which guarantees that $\eta(T(X))$ does not depend on θ and is therefore a genuine (unbiased) estimator; see Lehmann and Casella [26] for details. □
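As a purely numerical sketch of the variance reduction behind Theorem 3 (here in its classical Rao–Blackwell form with a sufficient statistic; the Poisson model, the estimand $e^{-\lambda}$, and all names in the code are our illustrative assumptions, not taken from this note), the crude unbiased estimator $I(X_1 = 0)$ of $P_\lambda(X_1 = 0) = e^{-\lambda}$ is conditioned on the sufficient statistic $T = \sum_{i=1}^{n} X_i$, which gives $E\{I(X_1 = 0) \mid T\} = ((n - 1)/n)^T$ because $X_1 \mid T = t$ is Binomial$(t, 1/n)$.

```python
import numpy as np

rng = np.random.default_rng(0)
n, lam, reps = 10, 1.3, 200_000            # sample size, Poisson mean, Monte Carlo replications

x = rng.poisson(lam, size=(reps, n))        # reps independent Poisson samples of size n
t = x.sum(axis=1)                           # sufficient statistic T = sum of the sample

crude = (x[:, 0] == 0).astype(float)        # unbiased but crude estimator I(X_1 = 0)
rao_blackwell = ((n - 1) / n) ** t          # E[I(X_1 = 0) | T] = ((n - 1)/n)^T

print("target e^{-lam}      :", np.exp(-lam))
print("crude mean, variance :", crude.mean(), crude.var())
print("RB    mean, variance :", rao_blackwell.mean(), rao_blackwell.var())
```

Both estimators have the same mean up to Monte Carlo error, while the conditioned estimator has a much smaller variance, as Theorem 3 predicts for the strictly convex squared-error loss.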
We now reexpress Lemma 1.10 in Lehmann and Casella [
26] within the new framework.
Lemma 1. Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$. Let $T(X)$ be a complete -sufficient statistic for $g$. Then, every U-estimable $g(\theta)$ has one and only one unbiased estimator that is a function of $T(X)$. Of course, uniqueness here means that any two such functions agree almost surely $P_\theta$ for every $\theta \in \Theta$.
Proof. If $\delta_1(T(X))$ and $\delta_2(T(X))$ are both unbiased estimators of $g(\theta)$, then $E_\theta\{\delta_1(T(X)) - \delta_2(T(X))\} = 0$ for every $\theta \in \Theta$. By the completeness property, $\delta_1(T(X)) = \delta_2(T(X))$ with probability one; see Lehmann and Casella [26] for details. □
The generalization of the LST (Lehmann and Scheffé [11], Theorem 5.1) by using a complete -sufficient statistic for $g$ is as follows.
Theorem 4. Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$. Suppose that $T(X)$ is a complete -sufficient statistic for $g$. Then, we have the following:
- (i) For every U-estimable $g(\theta)$, there exists an unbiased estimator that uniformly minimizes the risk for any loss function $L(\theta, \delta)$ that is convex in δ; therefore, the estimator is a UMVUE of $g(\theta)$.
- (ii) The UMVU estimator of (i) is the unique unbiased estimator that is a function of $T(X)$; it has minimum risk, provided its risk is finite and $L(\theta, \delta)$ is strictly convex in δ.
Proof. (i) If $U(X)$ is unbiased, by Theorem 3, we can consider the estimator $E\{U(X) \mid T(X)\}$ of $g(\theta)$, whose risk is no greater than the risk of $U(X)$. (ii) If $\delta(X)$ is another estimator with minimum risk, then $E\{\delta(X) \mid T(X)\}$ must have less risk by Theorem 3, which would be impossible. Thus, by Lemma 1, $\delta(X) = E\{\delta(X) \mid T(X)\}$ almost surely $P_\theta$. □
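For instance, in Example 2, the statistic $T(X) = I(X = 0)$ is a complete -sufficient statistic for $g(\theta) = (1 - \theta)^2$ (a worked check under the distribution displayed there): it is -sufficient by (1), and it is complete because
$$ E_\theta\{f(T(X))\} = f(1)(1 - \theta)^2 + f(0)\big\{1 - (1 - \theta)^2\big\} = 0 \ \text{for all } \theta \in (0, 1) \;\Longrightarrow\; f(0) = f(1) = 0, $$
since $(1 - \theta)^2$ takes more than one value on $(0, 1)$. Theorem 4 therefore returns $E\{U(X) \mid T(X)\} = T(X) = I(X = 0)$ as the UMVUE of $(1 - \theta)^2$, even though no complete sufficient statistic exists in this model.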
Theorem 5. Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$, $\theta \in \Theta$. Let $\delta(X)$ be an unbiased estimator for $g(\theta)$ and $T(X)$ an -sufficient statistic for $g$ such that $\delta(X) = h(T(X))$ for a measurable function $h$. Then, a necessary and sufficient condition for $\delta(X)$ to be a UMVUE of $g(\theta)$ is that $\operatorname{Cov}_\theta\{\delta(X), E(U \mid T(X))\} = 0$ for all $U \in \mathcal{U}_0$ and $\theta \in \Theta$, where $\mathcal{U}_0$ denotes the set of all unbiased estimators of 0.
Proof. Suppose that $\operatorname{Cov}_\theta\{\delta(X), E(U \mid T(X))\} = 0$ for all $U \in \mathcal{U}_0$ and $\theta \in \Theta$. The result follows because $E(U \mid T(X)) \in \mathcal{U}_0$ and
$$ \operatorname{Cov}_\theta\{\delta(X), U\} = E_\theta\{\delta(X)\, U\} = E_\theta\big[\delta(X)\, E(U \mid T(X))\big] = \operatorname{Cov}_\theta\{\delta(X), E(U \mid T(X))\} = 0, $$
where $U$ is an unbiased estimator of 0, so that $\delta(X)$ is a UMVUE by Theorem 1. Here, $E(U \mid T(X))$ is a statistic since $E_\theta\{\delta(X) - U \mid T(X)\} = \delta(X) - E_\theta\{U \mid T(X)\}$ is independent of θ (because $\delta(X) - U \in \mathcal{U}_g$ and $T(X)$ is -sufficient for $g$) and $\delta(X) = h(T(X))$. The converse follows because, if $\delta(X)$ is a UMVUE, then by Theorem 1
$$ \operatorname{Cov}_\theta\{\delta(X), V\} = 0 \quad \text{for every } V \in \mathcal{U}_0 \text{ and } \theta \in \Theta, $$
and in particular for $V = E(U \mid T(X))$. □
Theorem 6. Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$, $\theta \in \Theta$. Let $T(X)$ be an -sufficient statistic for $g$. In addition, suppose that for every unbiased estimator $\delta(X)$ for $g(\theta)$ there is a measurable function $h$ such that $E\{\delta(X) \mid T(X)\} = h(T(X))$. Then, $h(T(X))$ is a UMVUE if $E\{U \mid T(X)\} = 0$ almost surely $P_\theta$ for every $U \in \mathcal{U}_0$ and $\theta \in \Theta$.
Proof. For $U \in \mathcal{U}_0$, we have $\operatorname{Cov}_\theta\{h(T(X)), U\} = E_\theta\big[h(T(X))\, E\{U \mid T(X)\}\big] = 0$ since $E\{U \mid T(X)\} = 0$ almost surely $P_\theta$. Since $h(T(X)) \in \mathcal{U}_g$, $h(T(X))$ is a UMVUE by the LST. □
5. Complete -Sufficient Statistic
We are interested in finding an -sufficient statistic with the simplest structure. A minimal -sufficient statistic is an -sufficient statistic that is a function of any other -sufficient statistic.
Definition 9 (Minimal -sufficient statistics). Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$. Let $T(X)$ be an -sufficient statistic for $g$. The statistic $T(X)$ is a minimal -sufficient statistic for $g$ if and only if, for any other statistic $S(X)$ that is -sufficient for $g$, there exists a measurable function ψ such that $T(X) = \psi(S(X))$ almost surely $P_\theta$.
Theorem 7. Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$, $\theta \in \Theta$. Let $T(X)$ be a complete sufficient statistic for $P_\theta$ (or θ) such that $h(T(X))$, with $h$ one-to-one, has mean $g(\theta)$. Then, any -sufficient statistic for $g$ is a sufficient statistic for $P_\theta$ (or θ).
Proof. Let $S(X)$ be an -sufficient statistic for $g$. By Theorem 3, $\operatorname{Var}_\theta\big(E\{h(T(X)) \mid S(X)\}\big) \le \operatorname{Var}_\theta\big(h(T(X))\big)$. Since $h(T(X))$ is a UMVUE, $E\{h(T(X)) \mid S(X)\} = h(T(X))$ almost surely $P_\theta$, because there can be no better estimator. So, since $h$ is one-to-one, we can find a measurable function $\psi$ such that $T(X) = \psi(S(X))$ almost surely $P_\theta$. Hence, $S(X)$ is a sufficient statistic. □
Thus, we can apply -sufficient statistics for $g$ in case complete sufficient statistics do not exist. Intuitively, an -sufficient statistic with the completeness property will be a minimal -sufficient statistic. The following theorem, a version of Bahadur's theorem (see Bahadur [27]), states an important property of minimal -sufficient statistics.
Theorem 8. Let $X = (X_1, \dots, X_n)$, where $X_1, \dots, X_n$ are random variables from an unknown population $P_\theta$. If $T(X)$, with $E_\theta\{T(X)\} = g(\theta)$, is a complete -sufficient statistic for $g$, then $T(X)$ is a minimal -sufficient statistic for $g$.
Proof. Let $S(X)$ be an -sufficient statistic for $g$. By Theorem 3, $E\{T(X) \mid S(X)\} = T(X)$ almost surely $P_\theta$ since $T(X)$ is a UMVUE and cannot be improved upon; hence $T(X)$ is a measurable function of $S(X)$ and is therefore minimal. □
We now show that complete -sufficient statistics may not exist.
Example 3 (Complete -sufficient statistics may not exist). Let X be a random variable with , and then X is not complete. , are all of the unbiased estimators of zero. Since X is sufficient, X is an -sufficient statistic for θ (see Theorem 2, part a). However, a complete -sufficient statistic for θ does not exist. Otherwise, for every and some , we would have almost surely , where is assumed to be a complete -sufficient statistic for θ, but this cannot hold since is not a UMVUE for θ. Since , there is no such that . , are all of the unbiased estimators.
7. Conclusions
Sufficient statistics are of central concern to statisticians. They play a fundamental role in the Rao–Blackwell and Lehmann–Scheffé theorems. By Theorem 2, every sufficient statistic is an -sufficient statistic. The class of -sufficient statistics contains all of the sufficient statistics and also some statistics that are not necessarily sufficient. Consequently, the factorization theorem and its corollaries do not hold in general for -sufficient statistics. The concepts closest to -sufficient statistics are those of "partial sufficiency" and "sufficient subspace"; however, they are slightly different.
When a complete sufficient statistic is lacking, there may sometimes be nonconstant parametric functions that can be UMVU-estimated. This fact is seldom pointed out and exemplified in undergraduate and graduate textbooks. In this note, we have shown how the concept of -sufficient statistics can be used to obtain UMVUEs in these contexts.
More research based on the concept of -sufficiency is under investigation. The topics include