1. Introduction
Let
and
be discrete probability distributions on non-negative integers. Distribution
is a partial-sums distribution (a descendant) of
(a parent) if
depends on a weighted sum of probabilities
. Several partial-sums distributions can be found in scientific literature (in the following,
c is a normalization constant). The simplest one, with constant weights,
was mentioned in [
1] and further analyzed in [
2]. A partial-sums distribution with reciprocal weights,
was studied in [
3]. Many pairs of distributions which are parents and descendants with respect to (
2) can be found in [
4]. The third one, defined by
was introduced in [
5] as a generalization of the Bradford law (see, e.g., [
6]). Partial-sums distributions are also mentioned in a subchapter of the comprehensive monograph on discrete univariate probability distributions [
7] (pp. 508–512). Models based on partial summations (although not always under this name) find their applications in different research areas, such as actuarial mathematics (time to ruin) [
8,
9,
10,
11], economy (income under-reporting) [
12], inventory management (a decision rule for inventory resupply) [
13], reliability theory (the residual life of a component in a system) [
14], and linguistics (rank-frequency distributions) [
15,
16].
Partial summations were generalized in [
17], where formulas
generalizing (
1) and (
2), and
generalizing (
3), were introduced. In the following, we will consider only partial summation (
4).
Depending on the choice of function
from (
4), some constraints on the parent distribution may be necessary, e.g., the mean of the parent distribution must be finite under (
1), see [
18]. In [
19], it was shown that all pairs of discrete distributions are connected by a summation (
4), i.e., for any pair of discrete probability distributions
and
there is a partial summation (
4) such that
is the parent of
. Limit properties of partial-sums distributions were derived in [
20,
21,
22].
One of interesting problems related to partial summations is the question which distributions are invariant (i.e., the parent and the descendant are identical;
,
) under particular summations (all distributions mentioned in this paper can be found in [
23]). This question was first answered for special cases. Thus, [
18] presents the proof that the geometric distribution is invariant under (
1), [
12] shows that the Yule distribution is invariant under (
2), and according to [
5], the Salvia–Bolinger distribution is invariant under (
3). The general solution of the problem—the necessary and sufficient condition for a distribution to be invariant under summation (
4)—can be found in [
17]. Let
be a function given by
Distribution
is invariant under (
4) if and only if
Thus, e.g., for the geometric distribution with
, we have
and
(see [
18]), and for the Poisson distribution with
it holds
and
(see [
17] (p. 407)). There is one-to-one correspondence between discrete distributions defined on non-negative integers and functions
from (
7). These functions can, therefore, be considered new characteristics of discrete distributions.
The paper is organized as follows: In
Section 2, parametrized partial summations are introduced, in which the parent distribution and function
from (
7) can have different parameters.
Section 3 shows that there are two patterns of behavior—some parent distributions remain invariant even if their parameters differ from the parameters of the partial summations, while other produce new distributions (with two parameters) under these circumstances. Finally,
Section 4 concludes the paper, also presenting some ideas for future research.
2. Parametrized Partial Summations
For the sake of simplicity, in the following considerations we limit ourselves to parent distributions with one parameter only. Denote
a the parameter of distribution
. In order to emphasize the role of the parameter, in (
4) we can use the notation
and
. The descendant distribution from (
4) is uniquely determined by its parent
and by function
, hence it also depends solely on one parameter
a and can be denoted
. The invariance condition (
7) can be rewritten accordingly as
Using this notation, for function
from (
8) we have
Consider a modification of summation (
9), namely,
with
where the formula for weight function
from (
8) is kept, but parameter
a is replaced with
b;
c is a proper constant (with respect to
x) which ensures that
from (
10) sums to 1.
We now pay attention to constant
c from (
10). Summation (
10) can be written as a series of equations
or, equivalently,
The right sides of the equations can be rewritten as
Summing both sides of all equations above we obtain
and, consequently,
Notice that if parent distribution
in (
10) has finite mean
, the normalization constant
c from (
2) can be rewritten as
see also Lemma III.1 in [
17] (p. 408). Obviously, the normalization constant given by (
2) or (
12) must be positive.
Example 1. Let be the Poisson distribution with parameter a, i.e., , . We have and substituting the Poisson probabilities to (11) we obtain and . According to (12), the normalization constant is For the Poisson parent and , summation (10) yields The descendant is a new discrete partial-sums distribution with two parameters a and b, e.g., We only note that, as must be positive, and both a and b are parameters of two (in general different) Poisson distributions in this example, it must hold . For , partial summation (13) yields, in agreement with Theorem II.1 in [17] (pp. 404–405), the descendant distribution which is identical with its parent (i.e., the Poisson distribution with parameter a in this case). At the first sight, descendant distributions resulting from partial summation (
10) seem to depend on two parameters if
. However, it is not always true, as Example 2 below demonstrates.
Example 2. If is the geometric distribution with parameter a, i.e., , , the mean of the parent is . Substituting the geometric probabilities to (11) we obtain , , and Thus, the descendant of the geometric parent under partial summation (10) is It means that if the parent distribution is geometric and if in (10) satisfies (11), there is the unique summation (10)—the one which leaves the geometric distribution unchanged. The choice of parameter
b in (
10) is irrelevant in Example 2, as this second parameter is eliminated by the normalization constant given by (
12). The descendant distribution, being identical with its parent, has only one parameter
a.
3. Sensitive and Resistant Distributions
As was shown in
Section 2, parametrized partial summations (
10) divide one-parametric discrete probability distributions into two families. The first one consists of distributions which are sensitive to a change of the parameter in partial summation (
8). Partial summations (
10), in which the formula for function (
8) is kept but its parameter was changed, see (
11), create new distributions with two parameters
a and
b (e.g., the Poisson distribution is sensitive to the change, see Example 1). Distributions belonging to the other family are resistant to the change of the parameter. They are invariant under partial summation (
10) regardless of the value of parameter
b which does not play any role (e.g., the geometric distribution is resistant, see Example 2), as it is eliminated by normalization constant (
2). The following theorem presents the necessary and sufficient condition for a distribution to belong to the resistant family (distributions which do not satisfy the condition belong to the sensitive family). We use the notation from
Section 1 and
Section 2.
Theorem 1. Discrete probability distribution with one parameter a belongs to the family of resistant distributions if and only if Proof. In [
17], it is proved that distribution
is invariant with respect to summation (
4) if and only if
, see (
6) and (
7), i.e.,
Summation (
10) yields a proper distribution if and only if normalization constant
c satisfies (
2), i.e.,
Comparing weights by which probabilities
are multiplied in sums in (
15) and in (
16), and taking into account that (
15) is true for all non-negative
x, relation (
14)—and thereby also Theorem 1—is proved. □
Analogously to (
12), relation (
14) can be replaced with
for distributions with a finite mean. It can be easily shown that the geometric distribution satisfies relation (
17), but the relation is not valid for the Poisson distribution (see Examples 1 and 2, respectively).
Relation (
14), and similarly (
17), can be reformulated as
see (
6) for the last equality. Obviously, the left side of (
18) does not depend on
j, which must be true also for the right side. We thus have a necessary condition for a distribution to be resistant under partial summation (
10) regardless of parameter
b. For such a distribution, fraction
depends only on parameters
a and
b, while it is independent of
j. In addition, if the necessary condition is satisfied, combining (
2) and (
18) one obtains a simpler formula for normalization constant
c, namely,
Example 3. The logarithmic distribution (shifted to the left by 1 in order to be defined on all non-negative integers, see [17]) has probabilitieswith parameter . Asthe necessary condition (19) is not satisfied, which means that the logarithmic distribution belongs to the sensitive family. Again, it is easy to see that the necessary condition (
19) is satisfied by the geometric distribution (Example 2), whereas it is not true for the Poisson distribution (Example 1).
Example 4. The Salvia–Bolinger distribution with parameter has probability mass functionwith parameter . The necessary condition (19) is satisfied, as it holds According to [17], we havesee (8), and the particular form of the partial summation from (10) is Due to (20), it holdswhich means that parameter b from (21) is eliminated and, regardless of its choice, it does not have an impact on the resulting distribution. In other words, there is only one partial summation (
21) with the Salvia–Bolinger distribution being the parent distribution, namely the one which leaves the Salvia–Bolinger distribution unaltered.
4. Conclusions
The paper presents new theoretical results on partial-sums discrete probability distributions (see
Section 2 and
Section 3). Parametrized partial summations are introduced in (
10). We showed that discrete probability distributions with one parameter can be divided into two families. Sensitive distributions lose that invariance under a parameter change in partial summation. On the other hand, resistant distributions eliminate the parameter change and remain invariant. A criterion was derived, based on which one can decide into which of the two families a distribution belongs.
It seems that the sensitive family is much richer than the resistant one. To date, only two distributions (geometric and Salvia–Bolinger, see Examples 2 and 4) are known to be resistant. It remains an open question whether the resistant family is limited to these two distributions (e.g., many distributions from the Kemp–Dacey hypergeometric family, see [
23] (pp. 508–512), were scrutinized in [
24] (pp. 24–29), but no other resistant distributions were found). We note that the geometric and the Salvia–Bolinger distributions are invariant under the most simple special cases of partial summations (
4) and (
5), respectively. However, a link between a special position of (
4) and (
5) among partial summations, on the one hand, and of the geometric and the Salvia–Bolinger distributions among discrete probability distributions, on the other, is still at the level of a conjecture.
The results achieved in this paper provide also some new impulses towards future research. Here we limited ourselves to univariate discrete probability distributions with one parameter only. A similar study can be conducted for distributions with two or more parameters, and for multivariate discrete distributions (bivariate partial-sums distributions were first mentioned in [
25], they were studied more in detail in [
26]). Finally, as was shown in Example 1, parametrized partial summations applied to sensitive distributions yield descendants which are new, so far unknown distributions, see, e.g., (
13). Properties of these new distributions, such as their basic characteristics and their relation to their parents, will be the topic of future research. These distributions can potentially be applied as models of real-life data.