Next Article in Journal
Correction: Nedelcu et al. Low-Resolution Precoding for Multi-Antenna Downlink Channels and OFDM. Entropy 2022, 24, 504
Next Article in Special Issue
Time Series of Counts under Censoring: A Bayesian Approach
Previous Article in Journal
Decision Fault Tree Learning and Differential Lyapunov Optimal Control for Path Tracking
Previous Article in Special Issue
A Modified Multiplicative Thinning-Based INARCH Model: Properties, Saddlepoint Maximum Likelihood Estimation, and Application
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Two Features of the GINAR(1) Process and Their Impact on the Run-Length Performance of Geometric Control Charts

by
Manuel Cabral Morais
Department of Mathematics & CEMAT (Center for Computational and Stochastic Mathematics), Instituto Superior Técnico, Universidade de Lisboa, 1049-001 Lisbon, Portugal
Entropy 2023, 25(3), 444; https://doi.org/10.3390/e25030444
Submission received: 7 February 2023 / Revised: 28 February 2023 / Accepted: 1 March 2023 / Published: 2 March 2023
(This article belongs to the Special Issue Discrete-Valued Time Series)

Abstract

:
The geometric first-order integer-valued autoregressive process (GINAR(1)) can be particularly useful to model relevant discrete-valued time series, namely in statistical process control. We resort to stochastic ordering to prove that the GINAR(1) process is a discrete-time Markov chain governed by a totally positive order 2 ( T P 2 ) transition matrix.Stochastic ordering is also used to compare transition matrices referring to pairs of GINAR(1) processes with different values of the marginal mean. We assess and illustrate the implications of these two stochastic ordering results, namely on the properties of the run length of geometric charts for monitoring GINAR(1) counts.

1. Introduction

The INAR(1) and GINAR(1) processes were originally proposed by McKenzie [1,2]; the latter model was soon after discussed in more detail by Alzaid and Al-Osh [3]. They rely on the binomial thinning operation due to Steutel and van Harn [4] which is defined below.
Definition 1. 
Let X be a non-negative integer-valued r.v. with range N 0 = { 0 , 1 , } and ρ a scalar in ( 0 , 1 ) . Then the binomial thinning operation on X results in the r.v.
ρ X = t = 1 X Y t ,
where ∘ represents the binomial thinning operator; { Y t : t N } is a sequence of i.i.d. Bernoulli r.v. with parameter ρ; { Y t : t N } is independent of X.
We usually refer to ρ X as the r.v. that arises from X by binomial thinning. Furthermore, we define 0 X = 0 and 1 X = X .
Now that we have defined the binomial thinning operation, a sort of scalar multiplication counterpart in the integer-valued setting, the reader is reminded of the definition of McKenzie’s GINAR(1) process and its main properties.
Definition 2. 
Let ρ , p ( 0 , 1 ) . Then { X t : t N 0 } is said to be a GINAR(1) process if X t is written in the form
X t = ρ X t 1 + B t × G t ,
where { B t : i N } and { G t : i N } are independent sequences of i.i.d. Bernoulli r.v. with parameter ( 1 ρ ) and of i.i.d. geometric r.v. with parameter p, respectively; the sequence of innovations { ε t = B t × G t : t t N } and { X 0 , , X t 2 , X t 1 } are independent; all thinning operations are performed independently of each other and of { ε t : t N } ; and all the thinning operations at time t are independent of { X 0 , , X t 2 , X t 1 } .
According to McKenzie [2] and Alzaid and Al-Osh [3], if X 0 g e o m e t r i c ( p ) then { X t : t N 0 } is a stationary AR(1) process with g e o m e t r i c ( p ) marginal distribution.
McKenzie [2] also adds that { X t : t N 0 } is a DTMC with TPM, P ( p , ρ ) = [ p i j ( p , ρ ) ] i , j N 0 = [ P ( X t = j X t 1 = i ) ] i , j N 0 , where
p i j ( p , ρ ) = m = 0 min { i , j } i m ρ m ( 1 ρ ) i m × ( 1 ρ ) ( 1 p ) j m p + i j ρ j ( 1 ρ ) i j × ρ × I N 0 ( i j ) , i , j N 0 ,
where I N 0 represents the indicator function of the set of non-negative integers. These entries can be obtained by taking advantage of a few facts: ( ρ X t 1 X t 1 = 0 ) = 0 with probability 1; ( ρ X t 1 X t 1 = i ) b i n o m i a l ( i , ρ ) , for i N ; the p.f. of the innovations, ε t = B t × G t , is equal to
P ( ε t = j ) = P ( B t = 0   o r   G t = 0 ) = p ( 1 ρ ) + ρ , j = 0 P ( B t = 1 , G t = j ) = ( 1 ρ ) ( 1 p ) j p , j N .
The autocorrelation function of the GINAR(1) process is equal to
c o r r ( X t , X t + k ) = ρ k , k , t N 0 .
We ought to point out that the GINAR(1) process is a particular case of the generalized geometric INAR(1) or GGINAR(1) process, introduced by (Al-Osh and Aly [5], Section 3). Moreover, autocorrelated geometric counts can also be modeled by the new geometric INAR(1) or NGINAR(1) process, proposed by Ristić et al. [6] and relying on the negative binomial thinning operator. Finally, the NGINAR(1) process is a special instance of the ZMGINAR(1) process, the zero-modified geometric first-order integer-valued autoregressive, introduced and thoroughly described by Barreto-Souza [7].
The remainder of the paper is organized as follows. In Section 2, we shall prove that P has two important features stated in the two following theorems.
Theorem 1. 
The TPM P ( p , ρ ) of a GINAR(1) process is totally positive of order 2,
P ( p , ρ ) T P 2 ,
i.e., all the 2 × 2 minors of the P ( p , ρ ) are non-negative.
Theorem 2. 
Let: { X t ( p , ρ ) : t N 0 } and { X t ( p , ρ ) : t N 0 } be two independent GINAR(1) processes, with parameters ( p , ρ ) and ( p , ρ ) ; P ( p , ρ ) = [ p i j ( p , ρ ) ] i , j { 0 , 1 , , n } and P ( p , ρ ) = [ p i j ( p , ρ ) ] i , j { 0 , 1 , , n } be their corresponding TPM. Then P ( p , ρ ) is stochastically smaller than P ( p , ρ ) in the usual (or in the Kalmykov order) sense,
P ( p , ρ ) s t P ( p , ρ ) ,
if 0 ρ / ( ρ + 1 ) < p p < 1 , that is,
j = l n p i j ( p , ρ ) j = l n p m j ( p , ρ ) , i , l , m { 0 , 1 , , n } , i m ,
in case 1 / ρ > E [ X t ( p , ρ ) ] E [ X t ( p , ρ ) ] > 0 .
In Section 3, we discuss and illustrate the impact of (6) and (7) on the run length of an upper one-sided geometric chart for monitoring GINAR(1) processes. In Section 4, we sum up our findings and briefly refer to related and future work.

2. Proving the Two Features of the GINAR(1) Process

Demonstrating that the 2 × 2 minors of the TPM of a GINAR(1) process are all non-negative is not simple, due to the aspect of the transition probabilities defined in (3). However, by adopting the reasoning of (Morais and Pacheco [8] Section 2) and resorting to some auxiliary definitions and lemmas in Appendix A.1, we can prove (6).
Proof of Theorem 1. 
Note that
( X t + 1 ( p , ρ ) X t ( p , ρ ) = i ) = ρ X t 1 ( p , ρ ) + ε t = s t B ( i , ρ ) + B G ( p , ρ ) ,
where: B ( 0 , ρ ) = s t 0 ; B ( i , ρ ) b i n o m i a l ( i , ρ ) , i N ; B G ( p , ρ ) a r.v. with p.f. given by (4); B ( i , ρ ) and B G ( p , ρ ) are two independent r.v.
In accordance to Lemmas A1 and A3, B ( i ) stochastically increases with i in the likelihood ratio sense and B ( i , ρ ) , B G ( p , ρ ) P F 2 . Hence, we can invoke the closure of the stochastic order l r (see Definition A1) under the sum of independent P F 2 r.v. (see (Shaked and Shanthikumar [9] p. 46, Theorem 1.C.9) or Karlin and Proschan [10]) to conclude that
B ( i , ρ ) + B G ( p , ρ ) l r B ( i + 1 , ρ ) + B G ( p , ρ ) X t + 1 ( p , ρ ) X t ( p , ρ ) = i l r X t + 1 ( p , ρ ) X t ( p , ρ ) = i + 1 ,
for i N 0 , i.e., P T P 2 or P is a stochastically monotone TPM in the likelihood ratio sense ( P M l r ) , according to Definition A2. □
The next proof refers to a stochastic ordering between the TPM that govern two DTMC with the same state space, thus associated with what Kulkarni [11] (pp. 148–149) terms the Kalmykov-dominance or Kalmykov order(see Kalmykov [12] Theorem 2).
Proof of Theorem 2. 
Result (7) can be shown to hold by successively capitalizing on: Lemmas A1 and A2; the closure of l r under the sum of independent P F 2 r.v.; P ( p , ρ ) T P 2 ; and X l r Y implies that the r.v. X is stochastically smaller than the r.v. Y in the usual sense, in short X s t Y (see Shaked and Shanthikumar [9] p. 42, Theorem 1.C.1). Then, for i , m { 0 , 1 , , n } , i m , and 0 ρ / ( ρ + 1 ) < p p < 1 :
( X t + 1 ( p , ρ ) X t ( p , ρ ) = i ) = s t B ( i , ρ ) + B G ( p , ρ ) l r B ( i + 1 , ρ ) + B G ( p , ρ ) = s t ( X t + 1 ( p , ρ ) X t ( p , ρ ) = i ) l r ( X t + 1 ( p , ρ ) X t ( p , ρ ) = m ) ( X t + 1 ( p , ρ ) X t ( p , ρ ) = i ) s t ( X t + 1 ( p , ρ ) X t ( p , ρ ) = m ) j = l n p i j ( p , ρ ) j = l n p m j ( p , ρ ) , l { 0 , 1 , , n } ,
i.e., P ( p , ρ ) s t P ( p , ρ ) if 1 / ρ > 1 / p 1 = E [ X t ( p , ρ ) ] E [ X t ( p , ρ ) ] = 1 / p 1 > 0 . □

3. Practical Implications in Statistical Process Control

Time series of counts arise naturally in several applications, namely the manufacturing industry, health care, service industry, insurance, and network analysis. Using control charts for monitoring the underlying count processes is essential to swiftly detect changes in such processes and start preventive or corrective actions (see Weiß [13]). For an overview of control charts for count processes, we refer the reader to Weiß [14].
As noted by Ristić et al. [6], counts with geometric marginal distributions play a major role in several areas, for instance reliability, medicine, and precipitation modeling. These counts may refer to the number of machines waiting for maintenance, congenital malformations, or thunderstorms in a day.
In statistical process control, the GINAR(1) process can be used to model, for example, the cumulative counts of conforming items between two nonconforming items when these successive counts are no longer independent, say because the observations are generated by automated high-frequency sampling.
The literature review reveals that no charts have been proposed for monitoring GINAR(1) or GGINAR(1) counts. However, Li et al. [15] proposed a combined jumps chart, a cumulative sum (CUSUM) chart, and a combined exponentially weighted moving average (EWMA) chart for monitoring the NGINAR(1) counts. Furthermore, Li et al. [16] described upper and lower one-sided CUSUM charts for monitoring the mean of ZMGINAR(1) counts.
Let us consider that the following quality control chart is being used to detect decreases in the parameter p of the GINAR(1) process.
Definition 3. 
Let { X t : t N 0 } be a GINAR(1) process. The upper one-sided geometric chart makes use of the set of control statistics { X t : t N } and triggers a signal at time t ( t N ) if X t > U , where U is a fixed upper control limit (UCL) in N 0 .
We should bear in mind that the control statistic X t becomes stochastically smaller in the usual sense as p increases (see Lemma A4). Consequently and as suggested by (Xie et al. [17] p. 42), it is clear that when an observed value of X t exceeds the UCL of the chart, this should be taken as a sign that the p has decreased, that is, an indication of a potential increase in the process mean ( 1 p ) / p .
The performance of the upper one-sided geometric chart is about to be assessed in terms of the run length (RL), the random number of samples collected before a signal is triggered by this control chart. Consequently, the following first passage time of the stochastic process { X t : t N 0 } , under the condition that X 0 = u { 0 , 1 , , U } , is a vital performance measure of this chart for monitoring a GINAR(1) process:
R L u R L u ( U ) = min { t N : X t > U X 0 = u } ,
where u is a fixed initial value in the set { 0 , 1 , , U } .
U is chosen in such a way that false alarms are rather infrequent and increases in the process mean ( 1 p ) / p (i.e., decreases in p) are detected as quickly as possible. Hence, we should be dealing with a large in-control RL and smaller out-of-control run lengths.

3.1. Significance of P T P 2

By invoking the first part of Theorem 3.1 of Assaf et al. [18], we can state that the T P 2 character of the TPM of the GINAR(1) process leads to the following result.
Corollary 1. 
Let { X t : t N 0 } be a GINAR(1) process. Then
R L 0 = min { t N : X t > U X 0 = 0 } P F 2 ,
i.e., [ P R L 0 ( x + 1 ) ] 2 P R L 0 ( x ) × P R L 0 ( x + 2 ) , for x N 0 .
Corollary 1 implies that R L 0 has an increasing hazard rate ( R L 0 I H R ) , that is, λ R L 0 ( m ) = P ( R L 0 = m ) / P ( R L 0 m ) is a nondecreasing function of m N (see Kijima [19] p. 118, Theorem 3.7(ii)). R L 0 I H R means that signaling, given that no observation has previously exceeded the UCL, becomes more likely as we proceed with the collection of observations provided that X 0 = 0 .
Note, however, that R L u may not be IHR, for u { 1 , , U } . In fact, the second part of Theorem 3.1 of Assaf et al. [18] allows us to state that the p.f. P R L U ( l + n ) is T P 2 in l and n ( l , n N 0 ) , i.e., P R L U ( l + n ) × P R L U ( l + n ) P R L U ( l + n ) × P R L U ( l + n ) , for l , n N 0 ( l < l , n < n ) . As a consequence, [ P R L U ( x + 1 ) ] 2 P R L U ( x ) × P R L U ( x + 2 ) , for x N 0 , thus we can add that R L U has an decreasing hazard rate ( R L U D H R ) .
The next corollary translates the stochastic influence of an increase in the initial value u and can be shown to be valid by capitalizing on (Karlin [20] pp. 42–43, Theorem 2.1).
Corollary 2. 
Let { X t : t N 0 } be a GINAR(1) process. Then, for u , u { 0 , 1 , , U } ,
R L u l r R L u , u u .
Let us denote the upper one-sided geometric chart with X 0 = u (resp. X 0 = u ) by Scheme 1 (resp. Scheme 2). Then (10) can be interpreted as follows: the odds of Scheme 1 signaling at sample m against Scheme 2 triggering a signal at the same sample decreases as m increases (see [21] p. 5).
Result (10) seems quite evident; nevertheless, it would not be valid if the GINAR(1) process was not governed by a T P 2 TPM.

3.2. Other Comparisons of Run Lengths

The stochastic inequality P ( p , ρ ) s t P ( p , ρ ) , for 0 ρ / ( ρ + 1 ) < p p < 1 , allows us to stochastically compare two GINAR(1) processes. As a matter of fact, by invoking Lemma A4 and Theorem 6.B.32 of (Shaked and Shanthikumar [9] p. 282), we can state the next result.
Corollary 3. 
Let { X t ( p , ρ ) : t N 0 } and { X t ( p , ρ ) : t N 0 } two GINAR(1) processes. If 0 ρ / ( ρ + 1 ) < p p < 1 and the initial states are deterministic X 0 = u X 0 ( p ) = u or random, say X 0 ( p , ρ ) g e o m e t r i c ( p ) s t X 0 ( p , ρ ) g e o m e t r i c ( p ) , then
{ X t ( p , ρ ) : t N 0 } s t { X t ( p , ρ ) : t N 0 } .
From (11) we can infer from (11) that X 1 ( p , ρ ) s t X 1 ( p , ρ ) .
The next lemma plays a vital role in the comparison of run lengths and is taken from (Shaked and Shanthikumar [9] p. 283).
Lemma 1. 
If two stochastic processes { X t : t T } and { Y t : t T } satisfy { X t : t T } s t { Y t : t T } then
inf { t T : Y t > U } s t inf { t T : X t > U } .
Lemma 1 states what could be considered obvious: if we are dealing with two ordered stochastic processes in the usual sense, the larger stochastic process in the usual sense exceeds the critical level U stochastically sooner also in the usual sense.
By combining Corollary 3 and Lemma 1, we can provide a stochastic flavor to the influence of an increase in p not only on R L u but also on another important RL:
R L X 1 = min { t N : X t > U X 1 } ,
which we coin as overall run length, following (Weiß [22] Section 20.2.2). R L X 1 refers to a first passage time of the stochastic process { X t : t N } under the condition that the initial state coincides with the r.v. X 1 . In point of fact, it is reasonable to resort to this performance measure because in practice we do not know X 0 , hence it is plausible to rely, for example, on X 1 g e o m e t r i c ( p ) .
Corollary 4. 
The following stochastic ordering results hold for the run lengths of the upper one-sided geometric chart for monitoring GINAR(1) processes:
R L u ( p , ρ ) s t R L u ( p , ρ )
R L X 1 ( p , ρ ) ( p , ρ ) s t R L X 1 ( p , ρ ) ( p , ρ ) ,
for u u and 0 ρ / ( ρ + 1 ) < p p < 1 .
Note that we could have also invoked (14) and the closure of the usual stochastic order s t under mixtures (see Shaked and Shanthikumar [9] p. 6, Theorem 1.A.3.(d)) to prove (15).
Results (14) and (15) mean that the upper one-sided geometric chart for the GINAR(1) process stochastically increases its detection speed (in the usual sense) as the downward shift in p becomes more extreme. This stochastic ordering result parallels with the notion of a sequentially repeated uniformly powerful test.

3.3. An Illustration

Ristić et al. [6] found that an NGINAR(1) model with estimated parameters p ^ 0 = 1 / ( 1 + 0.5872 ) = 0.63 and ρ ^ 0 = 0.1650 adequately described the monthly counts of sex offenses reported in the 21st police car beat in Pittsburgh. This data set comprises 144 observations, starting in January 1990 and ending in December 2001.
Note that the GINAR(1) and NGINAR(1) processes share the same geometric marginal distribution; and, as far as the offense data set is concerned, the value of the Akaike information criterion (AIC) for the NGINAR(1) and GINAR(1) models are very close, namely 302.67 and 303.74 , respectively, as (Ristić et al. [6] Table 2) attest. Hence, we are going to consider the upper one-sided geometric chart from Definition 3 with p 0 = 0.63 and ρ 0 = 0.1650 for monitoring such counts.
An UCL equal to U = 5 and an initial state u = 0 (resp. u = U ) yield an in-control ARL of E [ R L 0 ( p 0 , ρ 0 ) ] 393.7 (resp. E [ R L U ( p 0 , ρ 0 ) ] 391.4 ). These and other RL-related performance measures used in this subsection are described in Appendix A.2.
The plots of the hazard rate function in Figure 1 give additional insights into the RL performance of the geometric chart as we proceed with the sampling and to the impact of the adoption of a head start. Indeed, it illustrates two results that follow from Corollary 1: R L 0 ( p 0 , ρ 0 ) I H R and R L U ( p 0 , ρ 0 ) D H R . This last result suggests that the false-alarm rate conveniently decreases in the first samples when we adopt a head start ( u = U > 0 ) .
According to Brook and Evans [23], the limiting form of the p.f. of the RL is geometric-like with parameter 1 ξ ( p , ρ ) , where ξ ( p , ρ ) is the maximum real eigenvalue of Q ( p , ρ ) = [ p i j ( p , ρ ) ] i , j { 0 , 1 , , U } , regardless of the initial value u of the control statistic X t . Therefore, it comes as no surprise that the values of the hazard rate functions of R L 0 ( p 0 , ρ 0 ) and R L U ( p 0 , ρ 0 ) converge to
lim m + λ R L 0 ( p 0 , ρ 0 ) ( m ) = lim m + λ R L U ( p 0 , ρ 0 ) ( m ) = 1 ξ ( p , ρ ) 0.002541 ,
as suggested by Figure 1.
Furthermore, the hazard rate function of R L 0 ( p 0 , ρ 0 ) is pointwise below the one of R L U ( p 0 , ρ 0 ) because Corollary 2 establishes that R L U ( p 0 , ρ 0 ) l r R L 0 ( p 0 , ρ 0 ) and this result in turn implies R L U ( p 0 , ρ 0 ) h r R L 0 ( p 0 , ρ 0 ) , that is, λ R L U ( p 0 , ρ 0 ) ( m ) λ R L 0 ( p 0 , ρ 0 ) ( m ) , for m N (see Definition A4).
We now illustrate the first result of Corollary 4 and also of a consequence of its second result: R L 0 ( p , ρ ) s t R L 0 ( p , ρ ) , for 0 ρ / ( ρ + 1 ) < p p < 1 ; E [ R L X 1 ( p , ρ ) ( p , ρ ) ] is an increasing function of p in the interval, ( ρ / ( ρ + 1 ) , 1 ) .
In the left panel of Figure 2, we plotted the survival functions of R L 0 ( 0.9 p 0 , ρ 0 ) and R L 0 ( p 0 , ρ 0 ) .
Since R L 0 ( 0.9 p 0 , ρ ) s t R L 0 ( p 0 , ρ ) , the plot of survival function of R L 0 ( 0.9 p 0 , ρ ) is pointwise below the one of R L 0 ( p 0 , ρ ) , as Figure 2 plainly demonstrates. Hence, the number of samples taken until the detection of a 10 % decrease in p by the upper one-sided geometric chart is indeed stochastically smaller than the number of samples we collect until this chart emits a false alarm.
The right panel of Figure 2 refers to the overall ARL function, E [ R L X 1 ( p , ρ 0 ) ] , for ρ / ( ρ + 1 ) < p p 0 . It increases with p in this particular interval from E [ R L X 1 ( ρ 0 / ( ρ 0 + 1 ) , ρ 0 ) ] 8.3 to E [ R L X 1 ( p 0 , ρ 0 ) ] 393.5 . We ought to note that it increases further when we take p ( p 0 , 1 ) , therefore the upper one-sided geometric chart cannot detect increases in p in an expedient manner, as we have anticipated.
We wrote a program for Mathematica 10.3 (Wolfram [24]) to produce all the graphs and results in this subsection.

4. Concluding Remarks

As expertly put by Montgomery and Mastrangelo [25], the independence assumption is often violated in practice. As a consequence, we often deal with discrete-valued time series, namely when we are dealing with very high sampling rates, as suggested by Weiß and Testik [26], and Rakitzis et al. [27].
In this paper, we considered the GINAR(1) count process, resorted to stochastic ordering to prove two features of its TPM, and discussed the implications of these two traits on RL-related performance measures of an upper one-sided geometric control chart that accounts for the autocorrelated character of such process.
For example: the T P 2 character of the TPM of the GINAR(1) process implies an IHR behaviour of the run length R L 0 of that same chart; the run length R L u and the overall run length R L X 1 stochastically increase in the usual sense in the interval ( ρ 0 / ( ρ 0 + 1 ) , 1 ) .
These features of the GINAR(1) process and the associated results are comparable to the ones derived by (Morais [21] Section 3.2) and Morais and Pacheco [8,28].
It is important to note that the notion of stochastically monotone matrices in the usual sense was introduced by Daley [29] for real-valued discrete-time Markov chains. Moreover, Karlin [20] implicitly states that a T P 2 TPM possesses a monotone likelihood ratio property and, thus, virtually defines stochastically monotone Markov chains in the likelihood ratio sense. Furthermore, the comparison of counting processes and queues in the usual sense can be traced back, for instance, to Whitt [30] and the multivariate likelihood ratio order of random vectors (or T P 2 order) is discussed, for example, by (Shaked and Shanthikumar [9] pp. 298–305).
Coincidentally, the stochastic order in the likelihood sense for stochastic processes or TPM has not been defined up to now, as far as we have investigated. For this reason and the fact that the l r order is not closed under mixtures (see Shaked and Shanthikumar [31] p. 33), we did not state or prove the l r analogue of the two results in Corollary 4.
We also failed to prove that P ( p , ρ ) s t P ( p , ρ ) , for 0 ρ ρ < 1 , because of two opposing stochastic behaviors of the summands ( ρ X t X t = i ) and ε t + 1 : the r.v. binomial ( i , ρ ) (resp. B G ( p , ρ ) ) stochastically increases (resp. decreases) with ρ in the likelihood ratio sense. Had we proven that result, we could have concluded that the larger the upward shifts in the autocorrelation parameter, the longer it takes the upper one-sided geometric chart to detect such a change in ρ .
It would be pertinent to investigate the stochastic properties of the RL and overall RL of lower one-sided geometric charts for detecting increases in the parameter p of a GINAR(1) process.
Another possibility of further work which certainly deserves some consideration is to investigate the extension of Theorems 1 and 2 to the NGINAR(1) process, the novel geometric INAR(1) process proposed by Guerrero et al. [32], or the new INAR(1) process with Poisson binomial-exponential 2 innovations studied by Zhang et al. [33], and assess the impact of these two results in the RL performance of upper one-sided charts for monitoring such autocorrelated geometric counts.
We ought to mention that deriving results similar to (6) and (7) seems to be very unlikely for the mixed generalized Poisson INAR process [34]. This follows from the fact that the generalized Poisson distribution has not a P F 2 p.f.

Funding

The author acknowledges the financial support of the Portuguese FCT—Fundação para a Ciência e a Tecnologia, through the projects UIDB/04621/2020 and UIDP/04621/2020 of CEMAT/IST-ID (Center for Computational and Stochastic Mathematics), Instituto Superior Técnico, Universidade de Lisboa.

Acknowledgments

We are grateful to the three reviewers who selflessly devoted their time to scrutinizing this work and offered pertinent comments that led to an improved version of the original manuscript. The author would also like to thank Christian H. Weiß for the opportunity to celebrate the vital role of “Discrete-valued Time Series” in this special issue of “Entropy”.

Conflicts of Interest

The author declares no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
c.d.f.cumulative distribution function
DHRdecreasing hazard rate
DTMCdiscrete-time Markov chain
GGINAR(1)generalized geometric first-order integer-valued autoregressive process
GINAR(1)geometric first-order integer-valued autoregressive process
i.i.d.independent and identically distributed
IHRincreasing hazard rate
INAR(1)first-order integer-valued autoregressive process
NGINAR(1)new geometric first-order integer-valued autoregressive process
p.f.probability function
P F 2 Pólya frequency of order 2
RLrun length
r.v.random variable
T P 2 totally positive of order 2
TPMtransition probability matrix
UCLupper control limit
ZMGINAR(1)zero-modified geometric first-order integer-valued autoregressive process

Appendix A

Appendix A.1. Auxiliary Definitions and Lemmas

This appendix has the sole purpose of providing a few notions and results that are crucial to prove (6), (7), and some of the implications of these two stochastic ordering results.
The notions of stochastically smaller in the likelihood ratio sense, stochastically monotone in the likelihood ratio sense, and Pólya frequency of order 2 p.f. are taken or follow from (Shaked and Shanthikumar [9] p. 42), (Kijima [19] pp. 129–131), and (Kijima [19] p. 106) (respectively).
Definition A1. 
Let X and Y be two non-negative integer r.v., with p.f. P X and P Y . Then X is said to be stochastically smaller than Y in the likelihood ratio sense if
P X ( x ) P Y ( x ) i s   a   n o n i n c r e a s i n g   f u n c t i o n   o f   x ,
over the union of the supports of the r.v. X and Y. In shorthand notation, X l r Y .
Lemma A1. 
Let B ( i , ρ ) b i n o m i a l ( i , ρ ) . Then
B ( i , ρ ) l r B ( i + 1 , ρ ) , i N 0 .
B ( i , ρ ) stochastically increases with i in the likelihood ratio sense because the ratio
P B ( i , ρ ) ( x ) P B ( i + 1 , ρ ) ( x ) = i + 1 x ( 1 p ) ( i + 1 )
is a nonincreasing function of x { 0 , 1 , , i + 1 } .
Lemma A2. 
Let B G ( p , ρ ) be a r.v. with p.f. given by (4). Then
B G ( p , ρ ) l r B G ( p , ρ ) , ρ ρ + 1 < p p < 1 .
Note that
r B G ( x ) = P B G ( p , ρ ) ( x ) P B G ( p , ρ ) ( x ) = p ( 1 ρ ) + ρ p ( 1 ρ ) + ρ , x = 0 p p × 1 p 1 p x , x N .
Since ( 1 p ) / ( 1 p ) 1 , r B G ( x ) is a nonincreasing function of x N . We still have to verify that r B G ( 0 ) r B G ( 1 ) : this inequality is valid if f ( p ) = [ p ( 1 ρ ) + ρ ] / [ p ( 1 p ] = 1 / ( 1 p ) + ρ / p has a positive derivative, i.e., if 1 / ( 1 p ) 2 ρ / p 2 > 0 or, equivalently, p > ρ / ( ρ + 1 ) . Hence B G ( p , ρ ) stochastically decreases with p in the likelihood ratio sense as long as 0 ρ / ( ρ + 1 ) < p p < 1 .
Definition A2. 
Let { X t : t N 0 } be an irreducible DTMC with TPM P . Then { X t : t N 0 } is said to be stochastically monotone in the likelihood ratio sense if
X t + 1 X t = i l r X t + 1 X t = i + 1 ,
for all i. In this case, we write P M l r or P T P 2 .
Definition A3. 
Let X be a non-negative r.v. with probability function (p.f.) P X ( x ) . If
P X ( x ) P X ( x + 1 ) i s   n o n d e c r e a s i n g   i n   x N 0 ,
i.e., [ P X ( x + 1 ) ] 2 P X ( x ) × P X ( x + 2 ) , x N 0 , then X is said to have a Pólya frequency of order 2 ( P F 2 ) p.f. and we write X P F 2 .
Lemma A3. 
If B ( i , ρ ) b i n o m i a l ( i , ρ ) and B G ( p , ρ ) is a r.v. with p.f. given by (4) then B ( i , ρ ) , B G ( p , ρ ) P F 2 .
We have
P B ( i , ρ ) ( x ) P B ( i , ρ ) ( x + 1 ) = 1 p p × x + 1 i x , x = 0 , 1 , , i , P B G ( p , ρ ) ( x ) P B G ( p , ρ ) ( x + 1 ) = p ( 1 ρ ) + ρ ( 1 ρ ) ( 1 p ) p , x = 0 1 1 p , x N .
Since [ p ( 1 ρ ) + ρ ] / [ ( 1 ρ ) ( 1 p ) p ] 1 / ( 1 p ) ρ 1 , we can state that these two ratios are certainly nondecreasing functions of x over the support of the corresponding p.f.
The concepts of stochastically smaller in the usual sense in the univariate and multivariate cases and stochastically smaller in the hazard rate sense in the univariate case can be found in (Shaked and Shanthikumar [9] pp. 3, 17, 266), whereas on p. 281 of this same reference the stochastic ordering of stochastic processes in the usual sense is defined.
Definition A4. 
Let X and Y be two non-negative integer r.v., with p.f. P X and P Y and c.d.f. F X and F Y . Then: X is said to be stochastically smaller than Y in the usual sense ( X s t Y ) if
F ¯ X ( x ) = 1 F X ( x ) 1 F Y ( x ) = F ¯ Y ( x ) , x N 0 ;
X is said to be stochastically smaller than Y in the hazard rate sense ( X h r Y ) in case
λ X ( x ) = P X ( x ) F ¯ X ( x 1 ) P Y ( x ) F ¯ Y ( x 1 ) = λ Y ( x ) , x N 0 .
The stochastic orders l r , h r , and s t can be related: X l r Y X h r Y X s t Y (see Shaked and Shanthikumar [9] pp. 18, 43, Theorems 1.B.1, 1.C.1). Moreover, X s t Y E ( X ) E ( Y ) provided that these expectations exist.
Lemma A4. 
Let G ( p ) be an r.v. with geometric distribution with parameter p. Then
G ( p ) s t G ( p ) , 0 < p p < 1 .
Equation (A8) follows in a straightforward manner: P G ( p ) ( x ) = ( 1 p ) x p , for x N 0 ; thus, F ¯ G ( p ) ( x ) = ( 1 p ) x + 1 ( 1 p ) x + 1 = F ¯ G ( p ) ( x ) , for x N 0 , when 0 < p p < 1 .
Let x ̲ = ( x 1 , , x m ) and y ̲ = ( y 1 , , y m ) be two vectors in R m ; then we write x ̲ y ̲ if x t y t , for t = 1 , , m . Additionally, recall that U R m is called an upper set if y ̲ U whenever x ̲ y ̲ and x ̲ U (see Shaked and Shanthikumar [9] p. 266).
Definition A5. 
Let X ̲ and Y ̲ be two m dimensional random vectors. Then X ̲ is said to be smaller than Y ̲ in the usual sense if
P ( X ̲ U ) P ( Y ̲ U ) ,
for every upper set U in R m . We write X ̲ s t Y ̲ .
Definition A6. 
Let { X t : t N 0 } and { Y t : t N 0 } be two discrete-time stochastic processes with a common state space S . Then { X t : t N 0 } is said to be stochastically smaller than { Y t : t N 0 } in the usual sense if
( X t 1 , , X t m ) s t ( Y t 1 , , Y t m ) ,
for every m N and ( t 1 , , t m ) N 0 m . In this case, we write { X t : t N 0 } s t { Y t : t N 0 } .
As a consequence of Definition A6, { X t : t N 0 } s t { Y t : t N 0 } implies that X t s t Y t , for all t N 0 .

Appendix A.2. Run Length Related Performance Measures

The run length of the upper one-sided geometric chart, R L u R L u ( p , ρ ) , is the first passage time to the set of states { U + 1 , U + 2 , } , where u { 0 , 1 , U } . Thus, we can use the Markov chain approach proposed by Brook and Evans [23] and provide the expected value of R L u ,
E ( R L u ) = e ̲ u × ( I Q ) 1 × 1 ̲ ,
where: e ̲ u is the ( u + 1 ) th vector of the orthogonal basis for R ( U + 1 ) ; I represents an identity matrix with rank ( U + 1 ) ; Q Q ( p , ρ ) = [ p i j ( p , ρ ) ] i , j { 0 , 1 , , U } is the sub-stochastic matrix that governs the transitions between the states in { 0 , 1 , , U } , with entries given by (3); 1 ̲ is a column-vector with ( U + 1 ) ones.
We can also add the survival and hazard rate functions of R L u are equal to
F ¯ R L u ( m ) = e ̲ u × Q m × 1 ̲ ,
λ R L u ( m ) = P ( R L u = m ) P ( R L u m ) = F ¯ R L u ( m 1 ) F ¯ R L u ( m ) F ¯ R L u ( m 1 ) ,
for m N .
The overall ARL of the upper one-sided geometric chart is given by
E ( R L X 1 ) = 1 + u = 0 U A R L u × P ( X 1 = u )
(see (Weiß [22] Section 20.2.2) or Weiß and Testik [35]).

References

  1. McKenzie, E. Some simple models for discrete variate time series. Water Resour. Bull. 1985, 21, 645–650. [Google Scholar] [CrossRef]
  2. McKenzie, E. Autoregressive moving-average processes with negative-binomial and geometric marginal distributions. Adv. Appl. Probab. 1986, 18, 679–705. [Google Scholar] [CrossRef]
  3. Alzaid, A.; Al-Osh, M. First-order integer-valued autoregressive (INAR (1)) process: Distributional and regression properties. Stat. Neerl. 1988, 42, 53–61. [Google Scholar] [CrossRef]
  4. Steutel, F.W.; van Harn, K. Discrete analogues of self-decomposability and stability. Ann. Probab. 1979, 7, 893–899. [Google Scholar] [CrossRef]
  5. Al-Osh, M.A.; Aly, E.-E.A.A. First order autoregressive time series with negative binomial and geometric marginals. Commun. Stat.–Theory Methods 1992, 21, 2483–2492. [Google Scholar] [CrossRef]
  6. Ristić, M.M.; Bakouch, H.S.; Nastić, A.S. A new geometric first-order integer-valued autoregressive (NGINAR(1)) process. J. Stat. Plan. Inference 2009, 139, 2218–2226. [Google Scholar] [CrossRef]
  7. Barreto-Souza, W. Zero-Modified geometric INAR(1) process for modelling count time series with deflation or inflation of zeros. J. Time Ser. Anal. 2015, 36, 839–852. [Google Scholar] [CrossRef]
  8. Morais, M.C.; Pacheco, A. On hitting times for Markov time series of counts with applications to quality control. REVSTAT–Stat. J. 2016, 4, 455–479. [Google Scholar]
  9. Shaked, M.; Shanthikumar, J.G. Stochastic Orders; Springer: New York, NY, USA, 2007. [Google Scholar]
  10. Karlin, S.; Proschan, F. Pólya type distributions of convolutions. Ann. Math. Stat. 1960, 31, 721–736. [Google Scholar] [CrossRef]
  11. Kulkarni, V.G. Modeling and Analysis of Stochastic Systems; Chapman and Hall: London, UK, 1995. [Google Scholar]
  12. Kalmykov, G.I. On the partial ordering of one-dimensional Markov processes. Theory Probab. Its Appl. 1962, 7, 456–459. [Google Scholar] [CrossRef]
  13. Weiß, C.H. SPC methods for time-dependent processes of counts—A literature review. Cogent Math. 2015, 2, 1–11. [Google Scholar] [CrossRef]
  14. Weiß, C.H. An Introduction to Discrete-Valued Time Series; Wiley: Hoboken, NJ, USA, 2018. [Google Scholar]
  15. Li, C.; Wang, D.; Zhu, F. Effective control charts for monitoring the NGINAR(1) process. Qual. Reliab. Eng. Int. 2016, 32, 877–888. [Google Scholar] [CrossRef]
  16. Li, C.; Wang, D.; Sun, J. Control charts based on dependent count data with deflation or inflation of zeros. J. Stat. Comput. Simul. 2019, 89, 3273–3289. [Google Scholar] [CrossRef]
  17. Xie, M.; Goh, T.N.; Kuralmani, V. Statistical Models and Control Charts for High-Quality Processes; Springer Science+Business Media, LLC: New York, NY, USA, 2002. [Google Scholar]
  18. Assaf, D.; Shaked, M.; Shanthikumar, J.G. First-passage times with PFr densities. J. Appl. Probab. 1985, 22, 185–196. [Google Scholar] [CrossRef]
  19. Kijima, M. Markov Processes for Stochastic Modeling; Chapman and Hall: London, UK, 1997. [Google Scholar]
  20. Karlin, S. Total positivity, absorption probabilities and applications. Trans. Am. Math. Soc. 1964, 11, 33–107. [Google Scholar] [CrossRef]
  21. Morais, M.J.C. Stochastic Ordering in the Performance Analysis of Quality Control Schemes. Ph.D. Thesis, Universidade Técnica de Lisboa, Lisbon, Portugal, 2002. [Google Scholar]
  22. Weiß, C.H. Categorical Time Series Analysis and Applications in Statistical Quality Control. Ph.D. Thesis, Fakultät für Mathematik und Informatik der Universität Würzburg, Würzburg, Germany, 2009. [Google Scholar]
  23. Brook, D.; Evans, D.A. An approach to the probability distribution of CUSUM run length. Biometrika 1972, 59, 539–549. [Google Scholar] [CrossRef]
  24. Wolfram Research, Inc. Mathematica; Version 10.3; Wolfram Research, Inc.: Champaign, IL, USA, 2015. [Google Scholar]
  25. Montgomery, D.C.; Mastrangelo, C.M. Some statistical process control methods for autocorrelated data. J. Qual. Technol. 1991, 23, 179–193. [Google Scholar] [CrossRef]
  26. Weiß, C.H.; Testik, M.C. The Poisson INAR(1) CUSUM chart under overdispersion and estimation error. IIE Trans. 2011, 43, 805–818. [Google Scholar] [CrossRef]
  27. Rakitzis, A.C.; Weiß, C.H.; Castagliola, P. Control charts for monitoring correlated counts with a finite range. Appl. Stoch. Model. Bus. Ind. 2017, 33, 733–749. [Google Scholar] [CrossRef]
  28. Morais, M.C.; Pacheco, A. On stochastic ordering and control charts for traffic intensity. Seq. Anal. 2016, 35, 536–559. [Google Scholar] [CrossRef]
  29. Daley, D.J. Stochastically monotone Markov chains. Z. Wahrscheinlichkeitstheorie Werwandte Geb. 1968, 10, 305–317. [Google Scholar] [CrossRef]
  30. Whitt, W. Comparing counting processes and queues. Adv. Appl. Probab. 1981, 13, 207–220. [Google Scholar] [CrossRef]
  31. Shaked, M.; Shanthikumar, J.G. Stochastic Orders and Their Applications; Academic Press: San Diego, CA, USA, 1994. [Google Scholar]
  32. Guerrero, M.B.; Barreto-Souza, W.; Ombao, H. Integer-valued autoregressive processes with prespecified marginal and innovation distributions: A novel perspective. Stoch. Models 2022, 38, 70–90. [Google Scholar] [CrossRef]
  33. Zhang, J.; Zhu, F.; Khan, N.M. A new INAR model based on Poisson-BE2 innovations. Commun. Stat. Theory Methods 2022. [Google Scholar] [CrossRef]
  34. Huang, J.; Zhu, F.; Deng, D. A mixed generalized Poisson INAR model with applications. J. Stat. Comput. Simul. 2022. [Google Scholar] [CrossRef]
  35. Weiß, C.H.; Testik, M.C. CUSUM monitoring of first-order integer-valued autoregressive processes of Poisson counts. J. Qual. Technol. 2009, 41, 389–400. [Google Scholar] [CrossRef]
Figure 1. Hazard rate functions of R L 0 ( p 0 , ρ 0 ) and R L U ( p 0 , ρ 0 ) .
Figure 1. Hazard rate functions of R L 0 ( p 0 , ρ 0 ) and R L U ( p 0 , ρ 0 ) .
Entropy 25 00444 g001
Figure 2. Survival function of R L 0 ( p , ρ 0 ) , for p = 0.9 p 0 and p = p 0 (black and gray solid lines); overall ARL function, E [ R L X 1 ( p , ρ 0 ) ] , for ρ / ( ρ + 1 ) < p p 0 .
Figure 2. Survival function of R L 0 ( p , ρ 0 ) , for p = 0.9 p 0 and p = p 0 (black and gray solid lines); overall ARL function, E [ R L X 1 ( p , ρ 0 ) ] , for ρ / ( ρ + 1 ) < p p 0 .
Entropy 25 00444 g002
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Morais, M.C. Two Features of the GINAR(1) Process and Their Impact on the Run-Length Performance of Geometric Control Charts. Entropy 2023, 25, 444. https://doi.org/10.3390/e25030444

AMA Style

Morais MC. Two Features of the GINAR(1) Process and Their Impact on the Run-Length Performance of Geometric Control Charts. Entropy. 2023; 25(3):444. https://doi.org/10.3390/e25030444

Chicago/Turabian Style

Morais, Manuel Cabral. 2023. "Two Features of the GINAR(1) Process and Their Impact on the Run-Length Performance of Geometric Control Charts" Entropy 25, no. 3: 444. https://doi.org/10.3390/e25030444

APA Style

Morais, M. C. (2023). Two Features of the GINAR(1) Process and Their Impact on the Run-Length Performance of Geometric Control Charts. Entropy, 25(3), 444. https://doi.org/10.3390/e25030444

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop