1. Introduction
Control charts are widely used in statistical process control (SPC) for monitoring and detecting out-of-control processes. The research on constructing control charts for monitoring normal processes has been extensively studied. Most control charts are designed to monitor either the process mean or the process variability, but it is usually desirable to simultaneously monitor the process mean and the process variability because both may change at the same time. A change in the standard deviation usually leads to out-of-control signals on the mean chart. When the distribution of quality characteristics is normal, the Shewhart
-chart [
1] is one of the most commonly used control charting techniques for monitoring the process mean, while the Shewhart
S-chart is commonly used to monitor the process variability. However, in many manufacturing applications, the quality variable typically follows a positively skewed distribution, such as the lognormal distribution. For example, the percent viscosity increase (PVI) of an engine oil after it has been put to an accelerated aging test for a specific period of time is a critical quality dimension of engine oil in the automotive industry. Engineering experience indicates that the PVI follows a lognormal distribution. In this case, it is very important to simultaneously monitor the mean and the standard deviation of the PVI based on a lognormal distribution.
In general, the implementation of a control chart is done in two stages, also known as Phase I control and Phase II monitoring. In Phase I control, in order to evaluate the variation of the process over time, assess the process stability, and estimate the in-control process parameters, one collects and analyzes certain amounts of historical data. In Phase II monitoring, one collects data sequentially and monitors the process in real time to quickly detect changes in the process parameters.
In the literature, there have been several studies on constructing control charts for monitoring the lognormal mean or the lognormal standard deviation. In monitoring the lognormal mean, a modified control chart using the sample ratio was proposed by Morrison [
2]. A control chart for monitoring the “geometric midrange” of a lognormal distribution was developed by Ferrell [
3]. A control chart for sequentially testing the arithmetic mean of a lognormal distribution was constructed by Joffe and Sichel [
4]. A simple heuristic method for constructing the
- and
R- charts using the weighted variance (WV) method with no assumption on the form of the distribution was proposed by Bai and Choi [
5]. Castagliola [
6] proposed a new
control chart devoted to the monitoring of skewed populations. Huang et al. [
7] discussed the control charts for the lognormal mean based on the confidence intervals of the lognormal mean. In monitoring the standard deviation, Abu-Shawiesh [
8] presented a simple approach for robustly estimating the process standard deviation based on the median absolute deviation. Adekeye and Azubuike [
9] derived the limits for control charts using the median absolute deviation for monitoring non-normal processes. Adekeye [
10] proposed modified control limits based on the median absolute deviation. Huang et al. [
11] proposed a control chart for monitoring the standard deviation of a lognormal process based on an approximate confidence interval of the lognormal standard deviation. Karagöz [
12] proposed an asymmetric control limit for a range chart under a non-normal distributed process. Liao and Pearm [
13] presented a modified weighted standard deviation index for the capability of a lognormal process. Shaheen et al. [
14] presented a monitoring control chart based on lognormal process variation using a repetitive sampling scheme. Omar et al. [
15] proposed an efficient approach for monitoring a positively skewed process. The control charts for jointly monitoring the mean and the standard deviation of a lognormal distribution are not as well established as those for a normal distribution. McCracken and Chakraborti [
16] gave an overview of control charts for joint monitoring of the mean and variance. Yang [
17] proposed a single-average loss control chart to monitor a process’s mean and variability. Chen and Lu [
18] proposed a new sum-of-squares exponentially weighted moving average (SSEWMA) chart using auxiliary information—called the AIB-SSEWMA chart—for jointly monitoring the process mean and variability .
In this study, we discuss three combined
- and
S-charts for jointly monitoring the mean and the standard deviation of a lognormal process: (1) The first combined charts are the conventional combined Shewhart
- and
S-charts. (2) The second combined charts are constructed based on the median absolute deviation method. (3) The third combined charts are the combined lognormal
- and
S-charts based on the methodologies studied in Huang et al. [
7] and Huang et al. [
11], respectively. The performances of these combined control charts are evaluated and compared in terms of the average run length (ARL), where the run length is defined as the number of samples taken before the first out-of-control signal alerts on a control chart [
19].
The rest of this paper is organized as follows. The aforementioned combined
- and
S-charts for jointly monitoring the lognormal mean and standard deviation are discussed in
Section 2.
Section 3 is devoted to assessing the performance of the combined
- and
S-charts. A real example from the automotive industry is given in
Section 4 to demonstrate how the aforementioned combined
- and
S-charts can be used in practice. Concluding remarks are given in
Section 5.
2. The Methodologies
In this section, we discuss three combined
- and
S-charts for jointly monitoring the lognormal mean and the standard deviation. Let
,
,
, be
m samples, each with size
n, following the lognormal distribution with parameters
and
, with a probability density function
and, consequently,
, which follow a normal distribution with the mean,
, and variance,
. Let
and
denote the mean and the standard deviation of the lognormal distribution such that
and
.
2.1. The Combined Shewhart - and S-Charts
The Shewhart
- and
S-charts are based on the assumption that the distribution of the quality characteristic is normal. The upper control limit (UCL) and lower control limit (LCL) of the combined Shewhart
- and
S-charts are given by
and
respectively, where
and
are multipliers chosen to satisfy a specific in-control chart performance and
[
1,
19].
If the parameters
and
are unknown, they can be estimated by
and
using data obtained from Phase I control data. Let
and
be the sample mean and sample standard deviation of the
ith sample,
, that is,
and
, respectively. The sample grand mean is
, and the average of the
m standard deviations is
. Then, the parameters
and
are estimated by
and
, respectively. Therefore, the control limits of the combined Shewhart
- and
S-charts are estimated by
and
respectively, where
and
are multipliers that depend on
n and the desired in-control average run length (ARL
).
When Phase II joint monitoring begins, independent samples, each of size n, are repeatedly taken from the process. Assume that the true in-control parameter and are equal to and , respectively, for each sample, ; one calculates and and plots and against the sampling sequence, respectively. An out-of-control signal is detected when is below or above , or when is below or above .
2.2. The Combined Median Absolute Deviation - and S-Charts
The median absolute deviation (MAD), which measures the deviation of the data from the sample median, was first studied by Hampel [
20]. It is a more robust scale estimator than the sample standard deviation and is often used as an initial value for computing more efficient and robust estimators. The MAD for a random sample,
, is defined as follows:
where
b is a constant used to make the estimator consistent for the parameter of interest and
is the sample median of
,
. If the sample observations are normally distributed, the constant
b is equal to 1.4826, and the statistic
is an unbiased estimator of the standard deviation (Rousseeuw and Croux ) [
21], where
is a function of the sample size
n; the values of
were derived and tabulated in Abu-Shawiesh [
8].
Based on the conventional Shewhart principle, when the parameters
and
are unknown, they can be estimated by
and
using data obtained from Phase I control data. Let
be the average median absolute deviation, where MAD
is the median absolute deviation of the
ith sample,
. The parameter
and
can be estimated by
and
, respectively. Hence, the control limits of the combined
- and
S-charts based on MAD are estimated by
and
respectively, where
and
are multipliers that depend on
n and the desired ARL
.
When Phase II joint monitoring begins, independent samples, each of size n, are repeatedly taken from the process. For each sample, , one calculates and and plots and against the sampling sequence, respectively. An out-of-control signal is detected when is below or above , or when is below or above .
2.3. The Combined Lognormal - and S-Charts
Based on the conventional Shewhart
-chart, if the parameters
and
are unknown, they can be replaced by the estimators
and
obtained from Phase I in-control data. According to Huang et al. [
7], let
and
be the sample mean and the sample standard deviation of the
ith sample,
, respectively. The grand sample mean of
Y is
, and the average of the
m standard deviations is
. The parameters
and
can be estimated by
and
, respectively. Therefore, the control limits of the lognormal
-chart are estimated using
where
is set to satisfy a desired ARL
.
According to Huang, et al. [
11], there are two cases for the standard deviation. For the first case of
, the control limits of the lognormal
S-chart are estimated using
where
is a multiplier that depends on
n and the desired ARL
. For the second case of
, the control limits of the lognormal
S-chart are estimated using
When Phase II joint monitoring begins, independent samples, each of size n, are repeatedly taken from the process. For each sample, , in the case of , one calculates and plots and against the sampling sequence, where and . An out-of-control signal is detected if the plotting statistic is below or above , or when the plotting statistic falls below or above . For the case of , one computes and plots and against the sampling sequence. An out-of-control signal is detected if the plotting statistic is below or above , or when the plotting statistic falls below or above .
Remark 1. The population standard deviation σ is usually unknown and needs to be estimated in practice. It can be estimated by utilizing data collected from Phase I control, when the process was in control. Based on the estimate, one can then decide whether to use Case I or Case II to construct the lognormal S-chart.
3. Chart Performance Evaluations and Comparisons
In this section, we conduct a simulation study to compare the performance of the combined lognormal - and S-charts with the combined MAD - and S-charts and the conventional Shewhart - and S-charts in terms of the ARL.
3.1. Simulated Settings
Since , we set such that the mean of the lognormal distribution remains unchanged. Let be the value of the in-control parameter. As discussed earlier, the control limits need to be determined using Phase I observations and will depend on , the subgroup size n, and the number of Phase I samples m. Therefore, we approximate the control limits of the three combined - and S-charts using simulations for various values of and combinations of and 100, as well as and 10. The value of is set to be between 0.2 and 2.0, with an increment of 0.2. The multipliers of the three combined control charts are calibrated to have an overall ARL that is approximately equal to 370. Note that the considered ARL for the three combined control charts is conditional on the estimated UCL and LCL. Here, we describe how the simulation is conducted for the combined lognormal - and S-charts, as it is similar for the other two combined charts. Given , m, and n, the following steps are carried out:
Step 1: Choose a value of
and a value of
, and generate
m independent samples of
n observations each from a lognormal distribution with a mean of 1 and a standard deviation of
. Compute the
and
, and calculate the
and
using either Equation (
1) if
or Equation (
2) if
.
Step 2: Repeatedly generate samples of n observations each from a lognormal distribution with a mean of 1 and standard deviation of . For each sample, calculate the two plotting statistics for the -chart and S-chart, respectively. Then, evaluate whether the plotting statistic for the -chart exceeds or goes below ; next, evaluate whether the plotting statistic for the S-chart exceeds or goes below . Stop when it does, and denote the number of samples generated by RL.
Step 3: Repeat Step 2 times, resulting in RL, . Calculate the simulated in-control average run length .
Step 4: If , stop. If , return to Step 1. Choose a larger if and a smaller if .
The multipliers of the three combined
- and
S-charts are given in
Table 1 and
Table 2 for different values of
. Note that the multipliers of these three combined
- and
S-charts increase when
(
) increases.
Denote the out-of-control process parameters by
and
, where
. Given
, we simulated the ARL
for the three combined charts in the same way as in Steps 2 and 3, which were mentioned earlier. The out-of-control ARLs (ARL
s) of the three combined
- and
S-charts are summarized in
Table 3 and
Table 4 (
) and
Table 5 and
Table 6 (
).
3.2. Discussion of Results
According to the assessment of the numerical results summarized in
Table 3,
Table 4,
Table 5 and
Table 6, the combined lognormal
- and
S-charts perform better than the other two combined
- and
S-charts when
. Nevertheless, the ARL
s of the combined lognormal
- and
S-charts are larger than those of the other two combined
- and
S-charts when
. Note that smaller values of
correspond to smaller values of
, under which the lognormal distribution is more symmetric. Therefore, when
is small, which means that the data are more symmetric, the combined lognormal
- and
S-charts are less effective than the combined Shewhart
- and
S-charts and the combined MAD
- and
S-charts. On the other hand, as
becomes larger, the lognormal distribution becomes more skewed, thus making the combined lognormal
- and
S-charts more effective in detecting changes in
and
than the other two combined control charts.
4. Example from the Automotive Industry
In this section, we present a real example to illustrate the applicability of the combined lognormal
- and
S-charts. The ASTM D7320 Ref Oil Data were provided by the Test Monitoring Center [
22]. In order to assess the engine oil quality, especially for new vehicles, the percent viscosity increase (PVI), which follows a lognormal distribution, needs to be tested. In this dataset, the quality of three reference oils—Ref Oils 434, 435, and 438—needs to be tested. We collected 50 samples, each of size 10, for each of these three reference oils, and used them to construct the Phase I control charts.
The multipliers of the three combined
- and
S-charts were calibrated to have a Type I error approximately equal to 0.0027 in order to have a fair comparison. Summarized in
Table 7 are the upper and lower control limits of the three combined
- and
S-charts, the estimated means, and the estimated standard deviations of the PVI for the three reference oils.
Figure A1,
Figure A2 and
Figure A3 show the three combined
- and
S-charts for Ref Oils 434, 435, and 438, respectively.
Neither the combined Shewhart
- and
S-charts nor the combined MAD
- and
S-charts showed any out-of-control samples for Ref Oil 434 (
Figure A1). On the other hand, sample 9 was outside the control limits of the lognormal
S-chart. Consequently, the control limits were recalculated without sample 9 for the Phase II joint monitoring of the combined lognormal
- and
S-charts for Ref Oil 434. Similarly, there were no out-of-control samples in the combined Shewhart
- and
S-charts or the combined MAD
- and
S-charts for Ref Oil 435 (
Figure A2), while sample 15 was out of control on the lognormal
S-chart. Hence, the control limits of the combined lognormal
- and
S-charts for Ref Oil 435 were recalculated without sample 15 for the Phase II joint monitoring. As for Ref Oil 438 (
Figure A3), none of the three combined
-charts and
S-charts showed any out-of-control samples.
The sample mean and the sample standard deviation of the PVI calculated based on the Phase I samples were used as the “true mean" and the “true standard deviation", respectively, in the simulation process because the true population mean and population standard deviation were unknown. The mean and the standard deviation of the PVI were respectively assumed to be 154.747 and 127.376, 175.846 and 66.963, and 94.647 and 20.828 for Ref Oils 434, 435, and 438, respectively. For Phase II monitoring, 20 new samples with a subgroup size of 10 were generated when
changed from
to
and
changed from
to
. Note that the 20 new samples were computer simulated, not obtained from additional samples using the ASTM test method. All combined
- and
S-charts were tuned to produce an overall ARL
that was approximately equal to 370. The resulting control charts for Ref Oils 434, 435, and 438 are shown in
Figure A4,
Figure A5 and
Figure A6, respectively.
For Ref Oil 434 (
Figure A4), the combined lognormal
- and
S-charts detected out-of-control signals at samples 5 and 4 in the
- and
S-charts, respectively, while the combined Shewhart
- and
S-charts triggered at samples 8 and 11 in the
- and
S-charts, respectively. In addition, out-of-control signals were detected at sample 8 in both the MAD
- and
S-charts. For Ref Oil 435 (
Figure A5), out-of-control signals were detected on sample 6 in both the Shewhart
- and
S-charts and both the MAD
- and
S-charts. At the same time, out-of-control signals also appeared at sample 6 in both the lognormal
- and
S-charts. As for Ref Oil 438 (
Figure A6), the combined Shewhart
- and
S-charts detected out-of-control signals at samples 6 and 2 in the
- and
S-charts, respectively, while the combined MAD
- and
S-charts detected at sample 6 in both charts. In addition, out-of-control signals were detected at sample 6 in both the lognormal
- and
S-charts.
5. Conclusions
In this study, we discuss the construction of three combined - and S-charts for jointly monitoring the mean and the standard deviation of the lognormal distribution. The simulation studies show that the combined lognormal - and S-charts have good performance when the underlying lognormal distribution is more skewed. The practical application of the combined lognormal - and S-charts is also demonstrated in a real example.
The numerical results of the current work indicate that for skewed non-normal processes, it is possible to construct more effective control charts for monitoring the process mean, process variability, or both based on the actual process distribution. This is at least the case for lognormal processes. It would be worth it to investigate how to construct more effective control charts for other skewed non-normal processes.
Remark 2. As for other skewed non-normal processes, the process mean or process variability may need to be approximated first, and these can be estimated using Phase I in-control data. Based on the conventional Shewhart - and S-charts, one can construct the control limits of - and S-charts for monitoring the process mean or process variability under other skewed non-normal processes.