Next Article in Journal
Cnoidal Wave-Induced Residual Liquefaction in Loosely Deposited Seabed under Coastal Shallow Water
Previous Article in Journal
Object Detection in Autonomous Driving Scenarios Based on an Improved Faster-RCNN
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On the Subrange and Its Application to the R-Chart

1
School of Economics & Management, Nanjing University of Science and Technology, Nanjing 210094, China
2
College of Economics and Management, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
3
Department of Industrial Engineering, Pusan National University, Busan 46241, Korea
*
Author to whom correspondence should be addressed.
Appl. Sci. 2021, 11(24), 11632; https://doi.org/10.3390/app112411632
Submission received: 13 October 2021 / Revised: 24 November 2021 / Accepted: 2 December 2021 / Published: 8 December 2021
(This article belongs to the Topic Industrial Engineering and Management)

Abstract

:
The conventional sample range is widely used for the construction of an R-chart. In an R-chart, the sample range estimates the standard deviation, especially in the case of a small sample size. It is well known that the performance of the sample range degrades in the case of a large sample size. In this paper, we investigate the sample subrange as an alternative to the range. This subrange includes the range as a special case. We recognize that we can improve the performance of estimating the standard deviation by using the subrange, especially in the case of a large sample size. Note that the original sample range is biased. Thus, the correction factor is used to make it unbiased. Likewise, the original subrange is also biased. In this paper, we provide the correction factor for the subrange. To compare the sample subranges with different trims to the conventional sample range or the sample standard deviation, we provide the theoretical relative efficiency and its values, which can be used to select the best trim of the subrange with the sense of maximizing the relative efficiency. For a practical guideline, we also provide a simple formula for the best trim amount, which is obtained by the least-squares method. It is worth noting that the breakdown point of the conventional sample range is always zero, while that of the sample subrange increases proportionally to a trim amount. As an application of the proposed method, we illustrate how to incorporate it into the construction of the R-chart.

1. Introduction

The control chart is a widely used and powerful graphical tool in quality control that is used to measure, monitor, and control a process over time. Usually, the control charts are in pairs. For example, an X ¯ -chart monitors the average of the manufacturing process while an R -chart monitors the variation of the process [1]. Generally, there are two phases for constructing control charts [2]. In Phase-I the goal is to obtain reliable control limits from the process data. Then, in Phase-II monitor the process by comparing the statistical properties of the future observation to the control limits, which are achieved in Phase-I [3,4]. The performance of the control charts constructed in Phase-I will determine the performance of the results in Phase-II. Thus, the data quality in Phase-I plays an important role in statistical process control (SPC). However, for X ¯ R charts, the sample mean and range are susceptible to the outliers, which is also called data contamination. Thus, the conventional control charts may be invalidated in the case of data contamination. To solve this problem, we use a robust estimator to construct control charts in Phase-I.
Robust statistics can provide good performance when there is the presence of outliers and departures from the model assumption. In robust design, when the collected data are contaminated, the robust estimators are employed to reduce or even avoid the influence of outliers on the results [5,6,7,8,9]. In statistical process control, Park et al. [10] proposed the use of robust scale estimators (e.g., median absolute deviation (MAD)) [11] and Shamos [12]) used an alternative to the standard deviation to construct the control charts in Phase–I when the data is contaminated. Shaweish and Saeed modified the S-control chart based on the decile mean standard deviation, which possessed a better performance when the sample size was small in most realistic applications [13]. When the process data in Phase-I diverge from normality and outliers are present, the robust S -chart based on robust scale estimators is more preferable than a traditional S -chart [10,13]. The reweighted robust standard deviation estimator is adopted to modify the S -chart by Mutlu and Alakent [14]. To improve robustness and be more efficient, in the case of the presence of outliers in the data set, the trimmed mean of mean, standard deviation, and range are adopted to construct the control charts [15,16,17]. Shaweish et al. [18] and Riaz and Saghir [19] pointed out that the control chart based on the robust scale estimator provides good performance not only without outliers, but also within outliers in the data set. Cumulative sum (CUSUM) control charts for special causes are also under the assumption of normal distribution. Given this, many attentions are paid to the robust CUSUM chart when the ideal assumption is violated [20,21,22]. Considering the multivariate non-normality problems, the distribution-free method [23] and Bayesian seemingly unrelated regression model [24] can give us some inspirations.
The trimmed mean (also called truncated mean) is the mean of the part after removing the x % largest and smallest observations [25]. For example, to reduce and mitigate the influence of bias in subjective judgements, the highest and lowest scores are discarded and the middle three are summed for a five-judge panel [26,27] for many Olympics games (e.g., diving and gymnastics etc.).
However, there is seldom research about the R -chart under data contamination in the literature. This motivates us to develop a robust method for control charts when the data is contaminated. In this paper, we propose a robust scale estimator subrange as an alternative to the sample range—as an estimator of the standard deviation to improve the control chart in the case of data contamination. The calculation of subrange is very similar to the trimmed mean. However, the subrange is not unbiased.
As we know, order statistics are widely used in nonparametric inferences and robust procedures. Order statistics theories can also be used to compute simple statistics such as median, the empirical cumulative distribution function (cdf), and the sample range [28,29,30]. The range of a sample dispersion tells us how far apart the largest and smallest values are. The traditional sample range is the distance between the largest and the smallest observation values from a sample. However, estimating the standard deviation by means of the sample range is not unbiased. Thus, it is necessary to use the unbiasing factor d 2 for the original sample range to estimate the standard deviation properly, which is a function of the sample size [31]. The method is widely seen in quality control literature [32,33].
Based on the joint pdf, the distribution of the sample range has been extensively studied [34,35,36]. However, in the earlier literature, many papers focus on the sample range. If there is an outlier in a sample, obviously the sample range will be easily influenced by even a single outlier. Given this, we can say that the sample range clearly has a zero-breakdown point, which is a popular measure for the robustness property of a statistical estimator. The breakdown point of a statistical estimator is the fraction of data that can be given arbitrary values without making the estimator arbitrarily bad [10,36]. The sample range is almost as efficient as the sample standard deviation for a small sample under the normal distribution, which is generally assumed in many engineering applications. Howefer, Patnaik [37] pointed out that when the sample size is larger than 20, even a small departure from normality in the tail of the parental distribution has a considerable effect on the distribution of the sample range. In order to make a good performance of the sample range for a large sample, Cadwell [38] proposed a method of dividing the sample into a few subgroups and used the average of the range of these subgroups.
Since the sample subrange depends on the trim amount, there may be several subranges to consider under different trims. Therefore, natural questions arise: which of the subranges is preferred? Then, by what criteria, shall we make the choice? Serfling [39] and Lehmann [40] proposed that the smaller the variance in its sampling distribution, the more “efficient” is that estimator. The relative efficiency (RE) value is considered in order to decide how to choose the optimal trim amount of the subrange by comparing the variance of the sample. Firstly, we give the distribution of the sample subrange and provide the unbiasing factors, which depend on the sample size and the number of trims to make the estimator unbiased. Next, we provide a criterion that is based on the RE value to choose the optimal trims in the sample. Lastly, we consider the breakdown point of the subrange. As we know, the inter-quartile range (IQR) is widely used to estimate the standard deviation when the sample has an outlier or extreme values. The proposed subrange includes the IQR as a special case. As well, the subrange is robust to outliers. The contribution of this article is as follows.
We give the distribution of the sample subrange and provide the unbiasing factor for the subrange of a sample through a Monte Carlo simulation. We assume the data are a normal distribution in order to calculate the unbiasing factor and the distribution for the subrange because the commonly used control charts are under the independent and normality assumption [1,12,41,42].
The RE values are calculated by using two different kinds of baseline estimators, from which we discover that the results coincide with choosing the best performance of subranges. Some interesting points are found through comparing the breakdown points of a range and a subrange, by which we conclude that the subrange has a positive breakdown point of k / n , and the asymptotic breakdown point of the subrange can be increased to around 1 / 2 in theory.
In previous studies, researchers paid little attention to the sample range due to its limitations, such as its narrow application and susceptibility to outliers. In this paper, we improve the range by extending its application and improving its breakdown point. Then we use the subrange to construct a control chart for monitoring the changes in the progress when the data are contaminated. Through the Monte Carlo simulation, we investigate the properties of the subrange. We offer unbiasing factors and the relative efficiency values for the sample size of 50 or less, firstly.
In this paper, the sample subrange is proposed as an alternative to the sample range as an estimate of the standard deviation. A correction factor for the subrange is provided in order to get unbiasedness. An application of the proposed method is illustrated by incorporating it into the construction of the R -chart. In Section 2, calculations are carried out for a random sample from a standard normal distribution, and then from a non-standard normal distribution. In Section 3, the central limit theorem is used to construct control charts by adopting the subrange. Some discussions about the relative efficiency value and the breakdown point of the subrange are provided in Section 4. Additionally, concluding remarks are given in Section 4.

2. The Distribution of the Subrange

Let X 1 , X 2 , X n be a random sample with continuous cdf F ( x ) and pdf f ( x ) . Let X ( 1 ) , X ( 2 ) , X ( n ) be the order statistics of a random sample. Then the joint pdf of X ( i ) and X ( j ) for 1 i < j n is given by
f ( i , j ) ( u , v ) = n ! ( i 1 ) ! ( j i 1 ) ! ( n j ) ! f ( u ) f ( v ) [ F ( u ) ] i 1 [ F ( v ) F ( u ) ] j i 1 [ 1 F ( v ) ] n j ,
for < u < v < . For more details on the above result, refer to Casella and Berger [28], and Hogg et al. [30].
We consider the symmetrically trimmed subrange. That is, we exclude the k smallest and k largest values. Thus, the symmetrically trimmed subrange is defined as R [ k ] = X ( n k ) X ( k + 1 ) where k = 0 , 1 , 2 , , n / 2 1 and n 2 . Here x is the largest integer less than or equal to x . Note that R [ k ] becomes the regular range with k = 0 .
We assume that a random sample is from a normal distribution with mean μ and standard deviation σ . It is well known that the range is not unbiased to estimate σ . By dividing R by the unbiasing factor d 2 , we can easily make this R unbiased. The values of d 2 are provided in the quality control literature—for more details refer to Shewhart [32] and Oakland [33].
In this paper, we will provide the unbiasing factor which makes R [ k ] unbiased for σ . We denote this factor by d 2 ( n , k ) . It should be noted that the factor d 2 depends on the sample size n and the number of trims k in the sample.
For notational convenience, we denote the joint pdf of X ( k + 1 ) and X ( n k ) by f ( u , v ) Then we have
f k ( u , v ) = n ! ( k ! ) 2 ( n 2 k 2 ) ! f ( u ) f ( v ) [ F ( u ) ] k [ 1 F ( v ) ] k [ F ( v ) F ( u ) ] n 2 k 2 .
Let Z 1 , Z 2 , Z n be a random sample from a standard normal distribution with pdf ϕ ( z ) and cdf Φ ( z ) . For notational convenience, we denote W 1 = Z ( k + 1 ) and W 2 = Z ( n k ) . Using (1), we have the joint pdf of W 1 and W 2 .
f k ( w 1 , w 2 ) = n ! ( k ! ) 2 ( n 2 k 2 ) ! ϕ ( w 1 ) ϕ ( w 2 ) [ Φ ( w 1 ) ] k [ 1 Φ ( w 2 ) ] k [ Φ ( w 2 ) Φ ( w 1 ) ] n 2 k 2 .
The goal is to derive the distribution of the subrange, W 2 W 1 = Z ( n k ) Z ( k + 1 ) where the Z i are from the standard normal distribution, N ( 0 , 1 ) . Next, we consider the new random variables given by Y 1 = W 1 and Y 2 = W 2 W 1 . Notice that the random variable Y 2 is the subrange. The inverse transformations are easily obtained by w 1 = y 1 and w 1 = y 2 + y 1 . Then, using the bivariate transformations, the joint pdf of Y 1 and Y 2 , denoted by g k ( y 1 , y 2 ) is given by
g k ( y 1 , y 2 ) = n ! ( k ! ) 2 ( n 2 k 2 ) ! ϕ ( y 1 ) ϕ ( y 1 + y 2 ) Φ ( y 1 ) k [ 1 Φ ( y 1 + y 2 ) ] k [ Φ ( y 1 + y 2 ) Φ ( y 1 ) ] n 2 k 2 | J | ,
where < y 1 < , y 2 > 0 and J is the determinant of the Jacobian matrix given by
J = det [ w 1 y 1 w 1 y 2 w 2 y 1 w 2 y 2 ] = det [ 1 0 1 1 ] = 1
For more details, see Casella and Berger [28]. Then the pdf Y 2 = R [ k ] = Z ( n k ) Z ( k + 1 ) is just the marginal pdf of Y 2 , which is given by
g [ k ] ( y 2 ) = n ! ( k ! ) 2 ( n 2 k 2 ) ! ϕ ( y 1 ) ϕ ( y 1 + y 2 ) Φ ( y 1 ) k [ 1 Φ ( y 1 + y 2 ) ] k [ Φ ( y 1 + y 2 ) Φ ( y 1 ) ] n 2 k 2 d y 1
where k = 0 , 1 , 2 , , n / 2 1 and n 2 again.

2.1. The Unbiasing Factors for the Subrange

Using the pdf of the range in (2), we can obtain the l -th moment of the range by calculating the expectation as follows:
E ( Y 2 l ) = 0 y 2 l g [ k ] ( y 2 ) d y 2
We denote
d 2 ( n , k ) = E ( Y 2 ) .
Let X i be a sample from the normal distribution N ( μ , σ 2 ) . Since σ is a scale parameter, we have X i = σ Z i + μ , where Z i is a sample from the standard normal distribution N ( 0 , 1 ) . Then we have E ( R [ k ] ) = E ( X ( n k ) X ( k + 1 ) ) = d 2 ( n , k ) σ . Thus, we can obtain the unbiased scale estimator of σ using the subrange which is given by
σ [ k ] = R [ k ] d 2 ( n , k ) .
Note that the unbiasing factor, d 2 in the quality control literature is obtained by d 2 = d 2 ( n , 0 ) , so that σ 0 = R / d 2 . Next, we denote
d 3 ( n , k ) = SD ( Y 2 ) = E ( Y 2 2 ) E ( Y 2 ) 2 .
Then, we have SD ( R [ k ] ) = d 3 ( n , k ) σ . Also, the unbiasing factor d 3 in the quality control literature is also given by d 3 = d 3 ( n , 0 ) .
In Table A1 and Table A2, we provide the values of d 2 ( n , k ) and d 3 ( n , k ) , respectively. For brevity, we provide the values for n = 2 , 3 , 50 and k = 0 , 1 , , 9 in the tables. The R language program is provided in the online supplement. Subrange. Available online: https://github.com/jin-yuyu/subrange.git (accessed on 1 December 2021).

2.2. The Relative Efficiency of the Subrange

We proposed the R -chart based on the subrange. Given this, a natural question is which subrange should be selected? This is essentially the same as how to choose k . We suggest the choice of k based on the RE value. In the statistics literature, the RE is defined as
RE ( θ 2 | θ 1 ) = Var ( θ 1 ) Var ( θ 2 ) × 100 % ,
where θ is often a reference or baseline estimator. For more details, see Serfling [39] and Lehmann [40]. We consider the RE of the unbiased scale estimator σ [ k ] using σ [ 0 ] as a baseline estimator, which is given by
RE ( σ [ k ] | σ [ 0 ] ) = Var ( σ [ 0 ] ) Var ( σ [ k ] ) × 100 % = ( d 3 ( n , 0 ) 2 / d 2 ( n , 0 ) 2 ) / ( d 3 ( n , k ) 2 / d 2 ( n , k ) 2 ) × 100 % .
We calculated the values of RE ( σ [ k ] | σ [ 0 ] ) in Table A3 for n = 2 , 3 , 50 and k = 0 , 1 , , 9 . It is easily seen that the choice of k = 0 gives the best performance for n = 2 , 3 , , 17 , and the choice of k = 1 does for n = 18 , 19 , , 31 etc. In Table 1, we also summarize the best choice of k which provides the maximum RE value. We also obtain the simple regression line using the least-squares method. The fitted line equation is given by
k = 0.65806 + 0.06926 n .
Thus, when the sample size n is very large, the value of k can be approximately selected using the above-fitted value. For example, when n = 500 , we have k 34 .
Using the unbiased estimator S n / c 4 ( n ) of σ as a baseline, we also calculated the RE of the unbiased scale estimator σ [ k ] again. Here, c 4 ( n ) is given by
c 4 ( n ) = 2 n 1 Γ ( n 2 ) Γ ( n 2 1 2 ) .
It should be noted that E [ S n 2 ] = σ 2 and E [ S n ] = c 4 ( n ) σ . Thus, we have Var ( S n / c 4 ( n ) ) = { 1 / c 4 ( n ) 2 1 } σ 2 and
RE ( σ [ k ] | S n c 4 ( n ) ) = Var ( S n c 4 ( n ) ) Var ( σ [ k ] ) × 100 % = 1 / c 4 ( n ) 2 1 d 3 ( n , k ) 2 / d 2 ( n , k ) 2 × 100 % .
We also calculated the values of RE ( σ [ k ] | S n c 4 ( n ) ) in Table A3 for n = 2 , 3 , 50 and k = 0 , 1 , , 9 . It should be noted that RE ( σ [ k ] | S n c 4 ( n ) ) = 100 % when n = 2 . We investigated this specific case in more detail as follows. When n = 2 , k should be zero. Thus, in this case, it is immediately from (2) that the pdf of Y 2 = R [ 0 ] = Z 2 Z 1 is given by
g [ 0 ] = 2 ϕ ( y 1 ) ϕ ( y 1 + y 2 ) d y 1 = 2 1 2 π exp ( y 1 2 2 ) 1 2 π exp ( ( y 1 + y 2 ) 2 2 ) d y 1 = 1 π exp ( y 2 2 4 ) g * ( y 1 ) d y 1 .
where g * ( y 1 ) = 1 2 π 1 / 2 exp ( 1 2 ( y 1 + y 2 / 2 ) 2 1 / 2 ) .
Science g * ( y 1 ) is the pdf of the normal distribution with N ( y 2 / 2 , 1 / 2 ) , we have
g [ 0 ] = 1 π exp ( y 2 2 4 ) .
which results in
d 2 ( 2 , 0 ) = 0 y 2 g [ 0 ] d y 2 = 0 1 π y 2 exp ( y 2 2 4 ) d y 2 = 1 π [ 2 exp ( y 2 2 4 ) ] 0 = 2 π .
Then we have
σ [ 0 ] = R [ 0 ] d 2 ( 2 , 0 ) = π 2 ( X ( 2 ) X ( 1 ) ) .
On the other hand, we have
c 4 ( 2 ) = 2 Γ ( 2 ) Γ ( 1 / 2 ) = 2 π
Since S 2 2 = 1 2 1 i = 1 2 ( X i X ¯ 2 ) 2 = ( X 1 X 2 ) 2 / 2 , we have
S 2 c 4 ( 2 ) = | X 1 X 2 | / 2 2 / π = π 2 | X 1 X 2 | = π 2 ( X ( 2 ) X ( 1 ) )
It is immediate from (8) and (9) that
σ [ 0 ] = S 2 c 4 ( 2 )
Thus, as aforementioned, we have RE ( σ [ k ] | S n c 4 ( n ) ) = 100 % when n = 2 (also k = 0 in this case, as mentioned earlier).

2.3. The Breakdown Points

Another way of choosing k is considering the finite-sample breakdown point, denoted by ε n . This is the maximum proportion of arbitrarily extreme observations where an estimator results in a reasonable value. For more detail see [6,10]. This finite-sample breakdown point is generally a function of sample size n . The proposed estimator, based on the subrange, has additional merit because of a positive breakdown point. It is clear that σ [ k ] has a positive breakdown point of k / n for k > 0 , σ ^ [ 0 ] has a zero one. Thus, we can choose k based on the value k / n .
It should be noted that the maximum attainable value of the asymptotic breakdown point is given as follows. Since the maximum value of k is n / 2 1 , the maximum finite-sample breakdown point available is given by
ε n = n / 2 1 n .
Since n / 2 can be expressed by n / 2 = n / 2 δ with 0 δ < 1 , we have
ε n = 1 2 δ + 1 n .
Thus, taking the limit of ε n as n , we have the asymptotic breakdown point
ε = lim n ε n = 1 2 ,
which is the maximum attainable asymptotic breakdown point.
It is likely that the RE value tends to decrease as the finite-sample breakdown point increases. Thus, when we choose k , we need to consider the RE value and the finite-sample breakdown point as well.

3. The Construction of Control Charts

In this section, we provide the method for incorporating the proposed methods into constructing the control charts. We use the proposed subrange as an alternative of the range and standard deviation when there are outliers in the data of Phase I. We provided the X ¯ -chart first, and then the R -chart.
For the case of the X ¯ -chart, it is immediately from the central limit theorem that we have
X ¯ E ( X ¯ ) SE ( X ¯ ) ~ N ( 0 , 1 ) ,
where X ¯ -is the sample mean from a sample of size n , and SE ( ) is the standard deviation from a sample of size n . Solving the below for X ¯
( X ¯ E ( X ¯ ) ) / SE ( X ¯ ) = ± 3 ,
we can construct the CL ± 3 SE control limits.
E ( X ¯ ) ± 3 SE ( X ¯ ) = μ ± 3 σ n .
Then we have UCL = μ + 3 σ n , CL = μ , and LCL = μ 3 σ n . Since μ and σ are unknown in practice, we need to estimate them. Suppose that there are m samples of size n . The most widely used estimator of μ is X ¯ ¯ where
X ¯ ¯ = X ¯ 1 + X ¯ 2 + + X ¯ m m .
The scale σ can be estimated by using the subrange. Let R [ k ] , i be the subrange with k trims from the i th sample. Then, as shown in (4), R [ k ] , i / d 2 ( n , k ) is an unbiased scale estimator of σ . Thus, it is quite reasonable to use R ¯ [ k ] / d 2 ( n , k ) for the scale estimator, where
R ¯ [ k ] = R [ k ] , 1 + R [ k ] , 2 + + R [ k ] , m m .
With the estimators of μ and σ , we have the following control limits:
UCL = X ¯ ¯ + 3 R ¯ [ k ] d 2 ( n , k ) n CL = X ¯ ¯ LCL = X ¯ ¯ 3 R ¯ [ k ] d 2 ( n , k ) n .
It should be noted that when k = 0 , we have R ¯ [ k ] = R ¯ and d 2 ( n , k ) = d 2 . The above control limits then become
UCL = X ¯ ¯ + 3 R ¯ d 2 n CL = X ¯ ¯ LCL = X ¯ ¯ 3 R ¯ d 2 n .
which is the traditional X ¯ -chart provided in the quality control literature.
For the case of the R -chart, we briefly reviewed the conventional R -chart and proposed a new R [ l ] -chart. Solving the below
R E ( R ) SE ( R ) = ± 3
for R , we can construct the CL ± 3 SE control limits given by
E ( R ) ± 3 SE ( R ) = d 2 σ ± 3 d 3 σ .
Then we have UCL = d 2 σ + 3 d 3 σ , CL = d 2 σ and LCL = d 2 σ 3 d 3 σ .
Analogous to the above construction, we can calculate the R [ l ] -chart with the limits given by
E ( R [ l ] ) ± 3 SE ( R [ l ] ) = d 2 ( n , l ) σ ± 3 d 3 ( n , l ) σ
In practice, the scale σ is unknown. Thus, we need to estimate σ using R ¯ [ k ] / d 2 ( n , k ) , as above. We then have
UCL = d 2 ( n , l ) R ¯ [ k ] d 2 ( n , k ) + 3 d 3 ( n , l ) R ¯ [ k ] d 2 ( n , k ) CL = d 2 ( n , l ) R ¯ [ k ] d 2 ( n , k ) LCL = d 2 ( n , l ) R ¯ [ k ] d 2 ( n , k ) 3 d 3 ( n , l ) R ¯ [ k ] d 2 ( n , k ) .
When l = 0 and k = 0 , we have the following limits which are essentially the same as the conventional R -chart
UCL = R ¯ [ 0 ] + 3 d 3 ( n , 0 ) R ¯ [ 0 ] d 2 ( n , 0 ) CL = R ¯ [ 0 ] LCL = R ¯ [ 0 ] 3 d 3 ( n , 0 ) R ¯ [ 0 ] d 2 ( n , 0 )
Also, if we select k = l , we can simplify the above R ¯ [ l ] -chart
UCL = R ¯ [ l ] + 3 d 3 ( n , l ) R ¯ [ l ] d 2 ( n , l ) CL = R ¯ [ l ] LCL = R ¯ [ l ] 3 d 3 ( n , l ) R ¯ [ l ] d 2 ( n , l )
Then a natural question is how to choose k for the R [ l ] -chart? The following proposition provides how to select k for the best performance
Proposition 1.
Let X 1 , X 2 , X n be a random sample from the normal distribution with scale σ , we then have
Var ( R ¯ [ k ] / d 2 ( n , k ) ) Var ( R ¯ [ k * ] / d 2 ( n , k * ) ) ,
where k * is the choice of k , RE ( σ ^ [ k ] | σ ^ [ 0 ] )  in (7), for k = 0 , 1 , , n / 2 1 .
Proof of Proposition 1.
It is immediately from (5) that we can obtain Var ( R ¯ [ k ] / d 2 ( n , k ) ) = d 3 ( n , k ) 2 σ 2 and Var ( R ¯ [ k * ] / d 2 ( n , k * ) ) = d 3 ( n , k * ) 2 σ 2 .
Using the above, Var ( R ¯ [ k ] / d 2 ( n , k ) ) / Var ( R ¯ [ k * ] / d 2 ( n , k * ) ) can be formulated as
d 3 ( n , 0 ) 2 / d 2 ( n , 0 ) 2 d 3 ( n , k * ) 2 / d 2 ( n , k * ) 2 / d 3 ( n , 0 ) 2 / d 2 ( n , 0 ) 2 d 3 ( n , k ) 2 / d 2 ( n , k ) 2 .
It is easily seen from (7) that k * provides the maximum RE value so that we have
Var ( R ¯ [ k ] / d 2 ( n , k ) ) / Var ( R ¯ [ k * ] / d 2 ( n , k * ) ) = RE ( σ [ k * ] | σ [ 0 ] ) / RE ( σ [ k ] | σ [ 0 ] ) 1 .
Thus, we get Var ( R ¯ [ k ] / d 2 ( n , k ) ) Var ( R ¯ [ k * ] / d 2 ( n , k * ) ) , which completes the proof. □

4. Discussion

As mentioned above, the choice of the trim k * is based on the relative efficiency under different sample sizes. From Table 1, we can see that when the sample size is 18 n 31 , the trim k * = 1 is the best choice. This means that the subrange is robust in the case of an outlier in the data, under the trim k * when its set to be 1. Given this, the subrange performs better than the traditional range. However, we also can see that when the sample size is 18 n 31 , the largest breakdown point is 1 / 18 5.6 % . This breakdown point of the subrange is much smaller than the MAD is. The proposed subrange is very useful for constructing a control chart when the data are contaminated in Phase-I. Although the breakdown point of the subrange is not satisfactory, it is robust under data contamination.

5. Conclusions

In this paper, we proposed a method of estimating the scale estimator using the subrange. Using this, we can construct the X ¯ and R charts, which are widely used in the manufacturing process. By using the proposed control charts based on the subrange, we can gain statistical efficiency along with a robustness property.
We also provided the method of choosing the trim amount of the subrange, in the sense of gaining more efficiency compared to the conventional range. For the R program used in the paper, one can refer to the URL. Subrange. Available online: https://github.com/jin-yuyu/subrange.git (accessed on 1 December 2021). The simulation results are shown in the Appendix. Table A1 and Table A2 give the unbiasing factors for the sample size of 50 or less. Table A3 and Table A4 give the information on relative efficiency values for the sample size of 50 or less—in terms of the relative efficiency values.
Note that the subrange has a positive breakdown point, while the breakdown point of the conventional range is always zero. The proposed subrange in this paper is robust. It can also be used when the sample size is larger (I.e., greater than 25).
However, for the X ¯ chart, the centerline is the sample mean, which is also the sensitivity to outliers. For future research, we suggest using a robust location estimator (e.g., median and Hodges-Lehmann [43]) to construct the X ¯ -chart. As discussed in the last section, selecting suitable or proper criteria to improve the breakdown may be an interesting topic.

Author Contributions

Conceptualization, E.X. and C.P.; methodology and software, C.P.; formal analysis, L.O.; funding acquisition, Y.M. All authors have read and agreed to the published version of the manuscript.

Funding

The work of was supported by the National Natural Science Foundation of China, grants number NSFC-71871119/NSFC-71811540414/NSFC-71702072, the National Research Foundation of Korea (NRF), grant funded by the Korea government (No. NRF-2017R1A2B4004169).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Unbiasing factor, d 2 ( n , k ) .
Table A1. Unbiasing factor, d 2 ( n , k ) .
n k = 0 k = 1 k = 2 k = 3 k = 4 k = 5 k = 6 k = 7 k = 8 k = 9
21.1284
31.6926
42.05880.5940
52.32590.9900
62.53441.28350.4031
72.70441.51470.7054
82.84721.70440.94560.305
92.97001.86461.14390.5491
103.07752.00271.31210.75150.2453
113.17292.12381.45770.9240.4498
123.25852.23151.58571.07370.62450.2052
133.33602.32821.69971.20570.77670.3810
143.40682.41581.80231.32350.91110.53460.1763
153.47182.49591.89541.42981.03140.67060.3306
163.5322.56951.98051.52631.14000.79240.46750.1546
173.58792.63762.05891.61481.23890.90270.59040.2920
183.64012.70082.13151.69631.32961.00320.70170.41550.1376
193.68902.75992.19891.77171.41321.09540.80330.52750.2614
203.73492.81522.26191.84201.49081.18060.89670.62990.37390.1240
213.77832.86722.32091.90761.56301.25960.9830.72410.47680.2367
223.81942.91632.37651.96921.63051.33331.06310.81120.57160.3399
233.85832.96272.42892.02711.69391.40231.13790.89220.65930.4351
243.89533.00682.47852.08181.75361.46711.20800.96780.74090.5233
253.93063.04862.52552.13361.81001.52811.27381.03870.81720.6054
263.96433.08852.57022.18271.86341.58581.33591.10530.88870.6821
273.99653.12652.61282.22941.91411.64041.39451.16820.95600.7541
284.02743.16292.65352.27391.96231.69231.45021.22771.01950.8219
294.05703.19782.69242.31642.00831.74171.50301.28411.07960.8860
304.08553.23122.72962.35712.05221.78881.55331.33771.13670.9466
314.11293.26332.76542.39612.09421.83381.60131.38881.19091.0041
324.13933.29422.79972.43342.13441.87681.64721.43751.24261.0589
334.16483.3242.83272.46942.1731.91811.69111.48411.29191.1110
344.18943.35272.86462.50392.21021.95771.73321.52871.33911.1609
354.21323.38052.89522.53722.24591.99581.77361.57151.38431.2085
364.23623.40722.92492.56932.28032.03251.81251.61261.42761.2542
374.25863.43322.95352.60032.31352.06781.84991.65211.46931.2980
384.28023.45832.98122.63032.34562.10191.88601.69021.50941.3402
394.30123.48263.0082.65932.37662.13481.92081.72691.5481.3807
404.32163.50623.03402.68742.40662.16661.95441.76231.58521.4198
414.34143.52923.05932.71462.43562.19741.98701.79651.62111.4574
424.36063.55143.08382.74102.46382.22732.01841.82961.65581.4938
434.37943.57313.10752.76662.49112.25622.04891.86161.68941.5290
444.39763.59413.13072.79152.51762.28432.07851.89271.72191.5630
454.41543.61473.15322.81572.54342.31152.10721.92281.75351.5959
464.43283.63463.17512.83922.56842.33802.13501.95201.78401.6278
474.44973.65413.19642.86222.59282.36372.16211.98041.81371.6588
484.46623.67313.21722.88452.61652.38882.18842.00791.84251.6888
494.48243.69163.23752.90622.63972.41322.21402.03471.87051.7180
504.49813.70973.25732.92752.66222.43692.23902.06081.89771.7464
Table A2. Unbiasing factor, d 3 ( n , k ) .
Table A2. Unbiasing factor, d 3 ( n , k ) .
n k = 0 k = 1 k = 2 k = 3 k = 4 k = 5 k = 6 k = 7 k = 8 k = 9
20.8525
30.8884
40.87980.4990
50.86410.5685
60.8480.58940.3548
70.83320.59460.4245
80.81980.59360.45380.2757
90.80780.58990.46720.3403
100.79710.58510.47320.37150.2256
110.78730.57980.47530.38840.2844
120.77850.57450.47530.39790.31530.191
130.77040.56920.47400.40320.33350.2445
140.76300.56420.47200.40600.34490.27430.1656
150.75620.55930.46950.40710.35200.29290.2146
160.74990.55460.46690.40730.35650.30500.24290.1462
170.74410.55020.46410.40670.35930.31330.26130.1912
180.73860.54600.46130.40570.36090.31890.27380.21810.1309
190.73350.54200.45850.40440.36170.32280.28270.23600.1724
200.72870.53830.45570.40290.36190.32540.28900.24870.19790.1185
210.72420.53460.45300.40130.36170.32710.29360.25780.21530.1571
220.71990.53120.45030.39950.36110.32820.29690.26460.22780.1812
230.71590.52800.44780.39780.36030.32870.29930.26960.23710.1980
240.71210.52480.44530.39600.35940.32890.30100.27350.24410.2103
250.70840.52190.44280.39420.35830.32880.30220.27640.24950.2196
260.70500.51900.44050.39240.35720.32850.30290.27850.25370.2267
270.70170.51630.43820.39060.35600.32800.30330.28010.25690.2323
280.69860.51360.43600.38890.35480.32740.30350.28130.25950.2368
290.69560.51110.43390.38720.35350.32670.30350.28220.26150.2403
300.69270.50870.43190.38550.35230.32590.30330.28270.26300.2431
310.68990.50640.42990.38380.35100.32500.30300.28300.26410.2453
320.68730.50420.42800.38220.34970.32420.30250.28320.26500.2471
330.68470.50200.42610.38060.34840.32320.30200.28320.26560.2485
340.68220.49990.42430.37910.34720.32230.30150.28310.26610.2497
350.67990.49790.42260.37760.34600.32140.30090.28290.26630.2505
360.67760.49600.42090.37610.34470.32040.30020.28260.26650.2512
370.67540.49410.41920.37470.34350.31940.29950.28220.26650.2517
380.67330.49230.41760.37330.34230.31850.29880.28180.26640.2521
390.67120.49050.41610.37200.34120.31750.29810.28130.26630.2523
400.66920.48880.41460.37060.34000.31660.29740.28080.26610.2524
410.66730.48720.41310.36940.33890.31560.29660.28030.26580.2524
420.66540.48560.41170.36810.33780.31470.29590.27980.26550.2524
430.66360.48400.41030.36690.33670.31370.29510.27920.26520.2523
440.66180.48250.40900.36570.33560.31280.29430.27860.26480.2522
450.66010.48100.40770.36450.33460.31190.29360.27800.26440.2520
460.65840.47960.40640.36330.33360.31100.29280.27740.26390.2517
470.65680.47820.40510.36220.33260.31010.29210.27680.26350.2514
480.65520.47680.40390.36110.33160.30930.29130.27610.26300.2511
490.65360.47550.40270.36000.33060.30840.29060.27560.26250.2508
500.65210.47420.40160.35900.32970.30760.28980.27500.26200.2505
Table A3. Relative efficiency, RE ( σ [ k ] | σ [ 0 ] ) × 100 % .
Table A3. Relative efficiency, RE ( σ [ k ] | σ [ 0 ] ) × 100 % .
n k = 0 k = 1 k = 2 k = 3 k = 4 k = 5 k = 6 k = 7 k = 8 k = 9
2100
3100
410025.9
510041.9
610053.114.5
710061.626.2
810068.436.010.1
910073.944.319.3
1010078.651.627.57.9
1110082.657.934.915.4
1210086.163.541.622.46.6
1310089.268.647.728.913.0
1410092.073.253.335.019.15.7
1510094.577.358.540.724.911.3
1610096.781.163.346.130.416.75.0
1710098.884.667.851.135.722.010.0
18100100.787.972.055.940.727.014.94.5
19100102.590.975.960.345.531.919.79.1
20100104.193.879.664.650.136.624.413.64.2
21100105.796.483.068.654.541.229.018.08.3
22100107.198.986.372.458.745.533.422.412.5
23100108.4101.389.476.162.749.837.726.616.6
24100109.7103.592.479.666.553.841.930.820.7
25100110.9105.795.282.970.257.745.934.824.7
26100112.0107.797.986.173.761.549.838.828.6
27100113.1109.6100.489.177.165.253.642.732.5
28100114.1111.4102.992.080.468.757.346.436.3
29100115.0113.2105.294.883.672.160.950.140.0
30100116.0114.8107.597.686.675.464.453.743.6
31100116.8116.4109.6100.289.678.667.757.247.1
32100117.7118.0111.7102.792.481.771.060.650.6
33100118.5119.4113.7105.195.284.774.263.954.0
34100119.3120.9115.7107.597.887.677.367.257.3
35100120.0122.2117.6109.7100.490.580.470.360.6
36100120.7123.6119.4111.9103.093.383.373.463.8
37100121.4124.8121.1114.1105.495.986.276.566.9
38100122.1126.1122.8116.2107.898.689.079.469.9
39100122.7127.3124.5118.2110.1101.191.782.372.9
40100123.4128.4126.1120.1112.3103.694.485.175.9
41100124.0129.5127.6122.0114.5106.097.087.978.7
42100124.6130.6129.1123.9116.6108.499.690.681.5
43100125.1131.7130.6125.7118.7110.7102.193.284.3
44100125.7132.7132.0127.4120.7112.9104.595.887.0
45100126.2133.7133.4129.1122.7115.1106.998.389.7
46100126.7134.7134.7130.8124.7117.3109.2100.892.3
47100127.2135.6136.0132.4126.5119.4111.5103.294.8
48100127.7136.5137.3134.0128.4121.4113.8105.697.3
49100128.2137.4138.6135.5130.2123.5115.9108.099.8
50100128.6138.3139.8137.1131.9125.4118.1110.3102.2
Table A4. Relative efficiency, RE ( σ [ k ] | S n / c 4 ( n ) ) × 100 % .
Table A4. Relative efficiency, RE ( σ [ k ] | S n / c 4 ( n ) ) × 100 % .
n k = 0 k = 1 k = 2 k = 3 k = 4 k = 5 k = 6 k = 7 k = 8 k = 9
2100.00
399.19
497.5225.24
595.4839.97
693.3049.5513.48
791.1256.1423.88
889.0060.8432.059.03
986.9564.2738.5616.75
1084.9966.8043.8323.336.74
1183.1368.6748.1428.9712.80
1281.3670.0751.6933.8218.215.36
1379.6871.0954.6538.0023.0410.32
1478.0971.8357.1241.6427.3414.884.44
1576.5772.3559.2044.8031.1919.058.62
1675.1372.6960.9547.5734.6322.8612.543.78
1773.7672.8962.4450.0037.7126.3416.197.40
1872.4672.9863.7052.1440.4829.5219.5810.833.30
1971.2172.9864.7654.0342.9732.4222.7314.066.47
2070.0272.9165.6755.7045.2335.0925.6517.109.512.92
2168.8872.7866.4357.1947.2637.5328.3619.9612.415.75
2267.7972.5967.0858.5149.1139.7630.8822.6415.168.48
2366.7572.3767.6259.6850.7841.8233.2125.1617.7711.09
2465.7572.1168.0760.7352.3143.7235.3827.5220.2413.60
2564.7871.8268.4561.6653.6945.4637.4029.7322.5715.99
2663.8671.5268.7662.4954.9647.0739.2831.8124.7818.28
2762.9771.1969.0063.2456.1148.5641.0333.7626.8720.45
2862.1170.8669.2063.9057.1749.9442.6635.5928.8522.52
2961.2970.5169.3564.4958.1351.2144.1937.3130.7224.49
3060.4970.1569.4665.0159.0152.3945.6138.9332.4926.36
3159.7269.7869.5365.4859.8253.4946.9440.4534.1628.15
3258.9769.4169.5765.9060.5654.5048.1941.8935.7429.85
3358.2569.0369.5866.2661.2455.4449.3643.2437.2431.46
3457.5668.6569.5766.5961.8656.3250.4544.5138.6633.00
3556.8868.2769.5366.8762.4357.1351.4845.7140.0134.46
3656.2367.8969.4867.1262.9557.8952.4446.8541.2935.86
3755.6067.5169.4167.3463.4358.6053.3447.9242.5037.19
3854.9867.1369.3267.5363.8659.2654.1948.9343.6638.46
3954.3866.7669.2167.6964.2659.8754.9949.8944.7539.67
4053.8066.3869.1067.8264.6360.4455.7450.8045.7940.82
4153.2466.0068.9767.9464.9660.9756.4451.6646.7841.92
4252.6965.6368.8368.0365.2761.4657.1152.4747.7242.97
4352.1665.2668.6968.1065.5461.9357.7353.2448.6143.97
4451.6464.8968.5368.1665.8062.3558.3253.9649.4744.93
4551.1464.5368.3768.2066.0362.7558.8754.6650.2845.85
4650.6464.1768.2068.2266.2363.1359.3955.3151.0546.72
4750.1663.8168.0268.2466.4263.4859.8955.9351.7947.56
4849.6963.4667.8468.2366.5963.8060.3556.5552.4948.36
4949.2463.1167.6668.2266.7464.1060.7957.0853.1649.13
5048.7962.7667.4768.2066.8764.3861.2057.6153.8049.86

References

  1. Tesfay, Y.Y. Process Control Charts. In Developing Structured Procedural and Methodological Engineering Designs; Springer: Cham, Switzerland, 2021. [Google Scholar] [CrossRef]
  2. Montgomery, D.C. Statistical Quality Control: An Modern Introduction, 7th ed.; John Wiley & Sons: Singapore, 2013. [Google Scholar]
  3. Benneyan, J.C. Statistical quality control methods in infection control and hospital epidemiology, part I introduction and basic theory. Infect. Control. Hosp. Epidemiol. 1998, 19, 194–214. [Google Scholar] [CrossRef]
  4. Birnbaum, D.; Benneyan, J.C. Statistical quality control methods in infection control and hospital epidemiology, part II: Chart use, statistical properties, and research issues. Infect. Control. Hosp. Epidemiol. 1998, 19, 265–283. [Google Scholar] [CrossRef]
  5. Gao, X.; Huang, G.; Zhao, Q.; Cao, C.; Jiang, H. Robust optimization model for medical staff rebalancing problem with data contamination during COVID-19 pandemic. Int. J. Prod. Res. 2021, 1, 1–30. [Google Scholar] [CrossRef]
  6. Ouyang, L.; Park, C.; Byun, J.H.; Leeds, M. Robust Design in the Case of Data Contamination and Model Departure. Statistical Quality Technologies; Springer: Cham, Switzerland, 2019; pp. 347–373. [Google Scholar]
  7. Gao, X.; Cui, C. A note on the warehouse location problem with data contamination. RAIRO Rech. Opérationnelle 2021, 55, 1113. [Google Scholar] [CrossRef]
  8. Park, C.; Leeds, M. A highly efficient robust design under data contamination. Comput. Ind. Eng. 2016, 93, 131–142. [Google Scholar] [CrossRef]
  9. Goetz, S.; Roth, M.; Schleich, B. Early Robust Design—Its Effect on Parameter and Tolerance Optimization. Appl. Sci. 2021, 11, 9407. [Google Scholar] [CrossRef]
  10. Park, C.; Kim, H.; Wang, M. Investigation of finite-sample properties of robust location and scale estimators. Commun. Stat. Simul. Comput. 2021, 1, 1–27. [Google Scholar] [CrossRef]
  11. Hampel, F.R. The influence curve and its role in robust estimation. J. Am. Stat. Assoc. 1974, 69, 383–393. [Google Scholar] [CrossRef]
  12. Shamos, M.I. Geometry and statistics: Problems at the interface. In Algorithms and Complexity: New Directions and Recent Results; Traub, J.F., Ed.; Academic Press: New York, NY, USA, 1976; pp. 251–280. [Google Scholar]
  13. Abu-Shawiesh, M.O.A.; Saeed, N. Monitoring process variability using decile mean standard deviation. Qual. Reliab. Eng. Int. 2021, 1, 2996. [Google Scholar] [CrossRef]
  14. Mutlu, E.C.; Alakent, B. Revisiting reweighted robust standard deviation estimators for univariate Shewhart S-charts. Qual. Reliab. Eng. Int. 2019, 35, 995–1009. [Google Scholar] [CrossRef]
  15. Langenberg, P.; Iglewicz, B. Trimmed mean X and R charts. J. Qual. Technol. 1986, 18, 152–161. [Google Scholar] [CrossRef]
  16. Dey, K.K.; Dhara, K.; Karmakar, B.; Sengupta, S. Univariate and data-depth based multivariate control charts using trimmed mean and winsorized standard deviation. arXiv 2012, arXiv:1211.4262. [Google Scholar]
  17. Sindhumol, M.R.; Srinivasan, M.M.; Gallo, M. Robust control charts based on modified trimmed standard deviation and Gini’s mean difference. J. Appl. Quant. 2016, 11, 18–30. [Google Scholar]
  18. Abu-Shawiesh, M.O.A. A simple robust control chart based on MAD. J. Math. Stat. 2008, 4, 102–107. [Google Scholar]
  19. Riaz, M.; Saghir, A. A mean deviation-based approach to monitor process variability. J. Stat. Comput. Simul. 2009, 79, 1173–1193. [Google Scholar] [CrossRef]
  20. Schoonhoven, M.; Nazir, H.Z.; Riaz, M.; Does, R.J.M.M. Robust location estimators for the control chart. J. Qual. Technol. 2021, 43, 363–379. [Google Scholar] [CrossRef]
  21. Nazir, H.Z.; Riaz, M.; Does, R.J.; Abbas, N. Robust CUSUM control charting. Qual. Eng. 2013, 25, 211–224. [Google Scholar] [CrossRef]
  22. Riaz, M. A dispersion control chart. Commun. Stat. Simul. Comput. 2008, 37, 1239–1261. [Google Scholar] [CrossRef]
  23. Marchant, C.; Leiva, V.; Cysneiros, F.J.A.; Liu, S. Robust multivariate control charts based on Birnbaum–Saunders distributions. J. Stat. Comput. Simul. 2018, 88, 182–202. [Google Scholar] [CrossRef]
  24. Ouyang, L.; Zhu, S.; Ye, K.; Park, C.; Wang, M. Robust Bayesian hierarchical modeling and inference using scale mixtures of normal distributions. IISE Trans. 2021, 1, 1–13. [Google Scholar] [CrossRef]
  25. Tukey, J.W.; McLaughlin, D.H. Less Vulnerable Confidence and Significance Procedures for Location Based on a Single Sample: Trimming/Winsorization 1. Sankhyā Indian J. Stat. Ser. A 1963, 25, 331–352. Available online: http://www.jstor.org/stable/25049278 (accessed on 1 December 2021).
  26. Osório, C.; António, M. Performance Evaluation: Subjectivity, Bias and Judgement Style in Sport Data. Bias and Judgement Style in Sport Data. 24 January 2019. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3321838 (accessed on 1 December 2021).
  27. Bialik, C. Removing Judges’ Bias Is Olympic-Size Challenge. The Wall Street Journal. Available online: https://www.wsj.com/articles/SB10000872396390443477104577551253521597214 (accessed on 7 September 2014).
  28. George, C.; Berger, R.L. Statistical Inference; Pacific Grove: Duxbury, CA, USA, 2002; Volume 2. [Google Scholar]
  29. David, H.A.; Nagaraja, H.N. Order Statistics; John Wiley & Sons: Hoboken, NJ, USA, 2004. [Google Scholar]
  30. Hogg, R.V.; Tanis, E.A.; Zimmerman, D.L. Probability and Statistical Inference; Pearson/Prentice Hall: Upper Saddle River, NJ, USA, 2010. [Google Scholar]
  31. Tippett, L.H.C. On the extreme individuals and the range of samples taken from a normal population. Biometrika 1925, 17, 364–387. [Google Scholar] [CrossRef]
  32. Shewhart, W.A. Economic Control of Quality of Manufactured Product; Macmillan: London, UK, 1931. [Google Scholar]
  33. Oakland, R. Quality, Processes and Control. Statistical Process Control; Routledge: London, UK, 2018; pp. 3–23. [Google Scholar]
  34. Siotani, M. Order statistics for discrete case with a numerical application to the binomial distribution. Ann. Inst. Stat. Math. 1956, 8, 95–104. [Google Scholar] [CrossRef]
  35. Abdel-Aty, S.H. Ordered variables in discontinuous distributions. Stat. Neerl. 1954, 8, 61–82. [Google Scholar] [CrossRef]
  36. Hartley, H.O.; David, H.A. Universal bounds for mean range and extreme observation. Ann. Math. Stat. 1954, 25, 85–99. [Google Scholar] [CrossRef]
  37. Patnaik, P.B. The use of mean range as an estimator of variance in statistical tests. Biometrika 1950, 37, 78–87. [Google Scholar] [CrossRef]
  38. Cadwell, J.H. The distribution of quasi-ranges in samples from a normal population. Ann. Math. Stat. 1953, 24, 603–613. [Google Scholar] [CrossRef]
  39. Serfling, R. Asymptotic Relative Efficiency in Estimation. Int. Encycl. Stat. Sci. 2011, 23, 68–72. [Google Scholar]
  40. Lehmann, E.L. Elements of Large-Sample Theory; Springer: Amsterdam, The Netherlands, 2004. [Google Scholar]
  41. Yu, M.; Zhou, Y.; Tsung, F. Robust Online Detection in Serially Correlated Directed Network. arXiv Prepr. 2021, arXiv:2111.02653. [Google Scholar]
  42. Nidsunkid, S.; Lao, M.; Thaithanan, J. The impact of a random vector with variables from normal and non-normal distributions on multivariate control charts. Qual. Reliab. Eng. Int. 2021, 1, 1–25. [Google Scholar] [CrossRef]
  43. Hodges, J.L., Jr.; Lehmann, E.L. Estimates of location based on rank tests. Ann. Math. Stat. 1963, 34, 598–611. [Google Scholar] [CrossRef]
Table 1. The number of trimmings, k * , with the best relative efficiency.
Table 1. The number of trimmings, k * , with the best relative efficiency.
2 n 17 18 n 31 32 n 45 46 n 60 61 n 74
k * = 0 k * = 1 k * = 2 k * = 3 k * = 4
75 n 88 89 n 103 104 n 118 119 n 132 133 n 146
k * = 5 k * = 6 k * = 7 k * = 8 k * = 9
147 n 162 163 n 175 176 n 188 189 n 203 204 n 217
k * = 10 k * = 11 k * = 12 k * = 13 k * = 14
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Xie, E.; Ma, Y.; Ouyang, L.; Park, C. On the Subrange and Its Application to the R-Chart. Appl. Sci. 2021, 11, 11632. https://doi.org/10.3390/app112411632

AMA Style

Xie E, Ma Y, Ouyang L, Park C. On the Subrange and Its Application to the R-Chart. Applied Sciences. 2021; 11(24):11632. https://doi.org/10.3390/app112411632

Chicago/Turabian Style

Xie, En, Yizhong Ma, Linhan Ouyang, and Chanseok Park. 2021. "On the Subrange and Its Application to the R-Chart" Applied Sciences 11, no. 24: 11632. https://doi.org/10.3390/app112411632

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop