Next Article in Journal
Path Planning of Unmanned Helicopter in Complex Dynamic Environment Based on State-Coded Deep Q-Network
Next Article in Special Issue
Cornish–Fisher-Based Control Charts Inclusive of Skewness and Kurtosis Measures for Monitoring the Mean of a Process
Previous Article in Journal
Third-Order Tensor Decorrelation Based on 3D FO-HKLT with Adaptive Directional Vectorization
Previous Article in Special Issue
Asymmetric Laplace Distribution Models for Financial Data: VaR and CVaR
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Confidence Interval, Prediction Interval and Tolerance Interval for the Skew Normal Distribution: A Pivotal Approach

1
The School of Cyberspace Security, Xi’an University of Posts and Telecommunications, Xi’an 710121, China
2
The Special Education School of Jinzhong City, Jinzhong 030600, China
3
College of Big Data and Internet, Shenzhen Technology University, Shenzhen 518118, China
4
Department of Applied Mathematics, Xi’an University of Technology, Xi’an 710048, China
*
Author to whom correspondence should be addressed.
Symmetry 2022, 14(5), 855; https://doi.org/10.3390/sym14050855
Submission received: 22 March 2022 / Revised: 14 April 2022 / Accepted: 14 April 2022 / Published: 21 April 2022

Abstract

:
The class of skew normal distributions, introduced by Azzalini (1985), which is an asymmetric distribution and allows the presence of skewness. In this paper, we propose the pivotal quantity approach to construct the confidence interval for the mean, prediction interval for the mean of the future sample, and tolerance interval for the quantile. The fiducial distribution is also studied. Moreover, the performances of all the proposed confidence intervals are investigated through the Monte Carlo simulation. The pivotal quantity is a common method for calculating confidence intervals, which is used to construct confidence intervals in this paper. And the convergence of the obtained confidence interval is illustrated by the figures. Finally, a real data is used to explain proposed intervals in real life.

1. Introduction

Statistics consists of three types of statistical inferences: Bayesian inference, frequency inferences and fiducial inferences. Fiducial inference was originally proposed by Fisher [1] and was aimed to overcome the deficiency in Bayesian framework when there was little or no parameter information in prior distribution. From Fisher’s point of view, fiducial inference simply changed the logical identity of the parameter. Fiducial confidence interval is a kind of confidence interval based on fiducial statistical theory. It also treats the unknown population parameter as a random variable. Based on the fiducial inference principle, Li et al. [2] illustrated the usefulness of the fiducial inferences method; Wang et al. [3] have given construction method of prediction intervals for the normal distribution, exponential distribution, and gamma distribution; Veronese and Melilli [4] developed a simple and direct method to define the fiducial distribution of real exponential families; Krishnamoorthy and Wang [5] obtained the fiducial confidence limits and prediction limits of the gamma distribution; Hoang-Nguyen-Thuy et al. [6] have given the fiducial estimation method of location scale distribution family, and listed several distributions for analysis.
Confidence intervals are used to describe parameters that have some uncertainty due to sampling error. There are many methods to construct the three confidence intervals. The pivotal quantities approach is commonly used to calculate confidence intervals and the approach based on pivotal quantities allows finding exact test or confidence interval for the values of the parameter. The pivotal quantity method has been used by many scholars to obtain confidence interval. For example, the pivotal quantity used in Chen [7] for constructing confidence intervals was adjusted to improve the performance of the confidence intervals. Seo [8] provided the exact confidence intervals for unknown parameters and exacted predictive intervals for the future upper record values by providing some pivotal quantities in the two-parameter Rayleigh distribution. Johnson [9] mentioned a prediction interval covered a future observation from a random process in repeated sampling, and was typically constructed by identifying a pivotal quantity that was also an ancillary statistic.
In many real-world problems, the data did not satisfy the conditions of symmetry and the assumptions of normality are violated. The class of skew normal distributions is an extension of the normal distribution, allowing for the presence of skewness, see Azzalini [10]. Since then, the skew normal distributions have been studied in a number of important areas, see Azzalini [11] for details. Scholars have studied the confidence interval of a parameter in the skew normal distribution such as Mameli [12] analyzing the approximate confidence interval of skewness parameter under large sample by using Fisher’s transformation; Wang et al. [13] gave three confidence intervals for location parameters in skew normal distribution family with known coefficient of variation and skewness; Wang et al. [14] studied the confidence interval of skewness parameter under skew normal distribution.
Based on our knowledge, the confidence interval of the mean, the prediction interval of future sample mean, tolerance interval of the quantile, and the fiducial distribution for the skew normal distribution are seldom studied. In this paper, we will use the pivotal approach to construct the confidence intervals for the skew normal distribution. All experiments are implemented using R software. The rest of the paper is organized as follows. Some basic properties and pivotal quantities of the skew normal distribution are introduced in Section 2. The confidence interval of the mean for skew normal distribution is constructed by pivotal quantity method, and the simulation experiment is carried out in Section 3. The prediction interval of the future sample mean and one-side tolerance limit are studied in Section 4 and Section 5. The fiducial distribution for the probabilities of the skew normal distribution is discussed in Section 6. All proposed intervals are illustrated using an actual data in Section 7. Some conclusions are given in Section 8.

2. Point Estimates and Pivotal Quantities

According to Azzalini [15], the probability density function (pdf) of the skew normal distribution is given as follows
f ( x | μ , σ , λ ) = 2 σ ϕ x μ σ Φ λ x μ σ ,
where μ is the location parameter, σ is the scale parameter, λ is the skewness parameter, and ϕ and Φ are the probability density function and cumulative distribution function of the normal distribution, respectively. We denote it by S N ( μ , σ 2 , λ ) . Moreover, the effect of λ on the skew normal distribution will be graphically shown in Figure 1.
The expressions for the expectation, variance and skewness of the SN( μ , σ 2 , λ ) are
E ( X ) = μ + 2 / π σ δ ,
V a r ( X ) = 1 2 π δ 2 σ 2 , S k e w n e s s ( X ) = 4 π 2 ( 2 / π δ ) 3 1 2 π δ 2 3 2 ,
where δ = λ 1 + λ 2 .
According to the properties of the skew normal distribution, λ can be obtained from the skewness of the sample, which do not depend on μ and σ 2 . Let X 1 , X 2 , , X n be a sample from S N ( μ , σ 2 , λ ) , and denote γ to be the skewness of the sample, we have
λ ^ = 2 γ 4 π 1 3 2 π 1 + 1 + π 2 2 γ 4 π 2 3 .
Meanwhile, let μ ^ and σ ^ be the MLE estimator of μ and σ based on the sample, which can be obtained by the followings
i = 1 n X i μ ^ σ ^ λ i = 1 n ϕ λ X i μ ^ σ ^ Φ λ X i μ ^ σ ^ = 0 , n + i = 1 n ( X i μ ^ ) 2 σ ^ 2 λ i = 1 n ϕ λ X i μ ^ σ ^ ( X i μ ^ ) σ ^ Φ λ X i μ ^ σ ^ = 0 .
The more details for MLE estimator of μ , σ and λ can be found in Figueiredo and Gomes [16].
Based on the Equation (1), we know that Z i = X i μ σ S N ( 0 , 1 , λ ) , and
X i μ ^ σ ^ = Z i ( μ ^ μ ) / σ σ ^ / σ = Z i μ ^ * σ ^ * .
According to the arguments from Lawless [17] and Krishnamoorthy et al. [18], we have
μ ^ μ σ μ ^ * a n d σ ^ σ σ ^ * ,
where the notation ∼ means distributed as, and μ ^ * and σ ^ * are equivalent estimators for μ = 0 and σ = 1 of S N ( μ , σ 2 , λ ) .

3. Confidence Interval of the Mean by Pivotal Quantity

In this section, we study the confidence intervals of the mean of S N ( μ , σ 2 , λ ) through the pivot quantity approach. According to the Equations (2) and (3), we have
μ + σ δ 2 π μ ^ σ ^ = μ μ ^ σ ^ + δ 2 π σ σ ^ = δ 2 π μ ^ * σ ^ * .
Let u n = δ ^ 2 π μ ^ * σ ^ * , and u n ; α denotes the 100 α percentile of u n , then the 100 ( 1 α ) % confidence interval for the mean is
μ ^ + u n ; α / 2 σ ^ , μ ^ + u n ; 1 α / 2 σ ^ .
Without loss of generality, we choose μ = 0 , σ = 1 , and the values of λ are chosen as −2, 1, 0.05, respectively. The percentiles for calculating 100 ( 1 α ) % confidence intervals based on different sample sizes are given in Table A1 with the Monte Carlo experiments. Figure 2 shows the percentiles of computing 5 % , 95 % , 2.5 % , 97.5 % , 1 % and 99 % confidence intervals for the mean based on the results obtained from Table A1.
It is observed from Figure 2 that despite the different confidence levels, the percentages gradually decrease and tend to zero as the sample size increases.
From Table A1, we can find that the distance between the upper and lower percentiles of the u n decrease as the sample size increases. To evaluate the performances of the proposed confidence interval constructed by the pivotal quantity approach, we carried out simulation studies with the same values of λ given in Table A1. The coverage probabilities (CP), average length (AL), and associated standard deviations (SD) were calculated based on the R software. For each of the generated sets, we used the R code with N = 10,000 runs to compute confidence intervals. The percentage of these 10,000 confidence intervals that include the actual mean value is an estimate of the CP. The AL and SD are estimated similarly. The corresponding results for sample sizes of n ranging from 5 to 100 and different values of α are displayed in the following Table A2. See the Appendix. The values of AL in Table A2 are displayed in Figure 3.
From Table A2, we can see that all the CP can reach the corresponding confidence levels. With the increase of sample size, both the mean length and the standard deviations of the interval decrease. Figure 3 illustrate our conclusion more visually. The convergence based on n for different λ would be clearer seen.

4. Prediction Intervals for the Mean of a Future Sample

A prediction interval is a statistical interval that contains future random variables with a specific probability, which works on estimating the range of the samples in the future according to the samples in the past or present. Hahn [19] and Kaminsky [20] have expounded the prediction interval of normal distribution and exponential distribution respectively. In the following, we aim to find a prediction interval for the mean value of the future data, with sample size m, from S N ( μ , σ 2 , λ ) .
Let Y ¯ denote the mean of future sample of size m from S N ( μ , σ 2 , λ ) . To find a prediction interval for Y ¯ , we denote the quantity w n that
w n = Y ¯ μ ^ σ ^ Y ¯ * μ ^ * σ ^ * ,
where Y ¯ * is the mean of a sample of size m from the S N ( 0 , 1 , λ ^ ) . Therefore, the 100 ( 1 α ) % prediction intervals for a future sample mean Y ¯ is
μ ^ + P α / 2 σ ^ , μ ^ + P 1 α / 2 σ ^ ,
where P α is 100 α percentile of w n . In the real life, future data is not easy to be collected due to various factors. So here we only consider the case m less than n, and the 95 % prediction interval based on the Monte Carlo simulation experiment are obtained in Table A3.
The values of width in Table A3 are plotted in Figure 4.
As can be seen from Table A3, the predicted interval length decreases with the increasing of m and n. Figure 4 also visually illustrates the above conclusion. And we can conclude that 95 % of the mean of the future data is ( μ ^ 0.2918 σ ^ , μ ^ + 1.5078 σ ^ ) , when the current sample size is 20 and the future sample size is 5, where μ ^ and σ ^ can be estimated from the sample size 20.

5. One-Sided Tolerance Interval Limits

In many practical applications such as medical treatment, environment and engineering, people hope to find an interval estimate based on the sample, which can capture at least a proportion p in the sample population with confidence ν . This statistical interval is called the tolerance interval. This type of interval estimation is called p content- ν coverage tolerance interval or ( p , ν ) tolerance interval for short. Proschan [21] has studied the tolerance interval of normal distribution. Krishnamoorthy et al. [18] have discussed the prediction and tolerance intervals of the Rayleigh distribution with two-parameter. Hoang-Nguyen-Thuy et al. [22] have given the calculation method of tolerance interval of location scale distribution family. Therefore, it is necessary to study the tolerance interval of skew normal distribution when many data in life tend to show some skewness compared with normal distribution.
Let 0.5 p 1 and Q p ( μ , σ , λ ) to be the 100 p percentile of the distribution S N ( μ , σ 2 , λ ) , then
Q p ( μ , σ , λ ) μ ^ σ ^ = ( Q p ( μ , σ , λ ) μ ) / σ ( μ ^ μ ) / σ σ ^ / σ Q p ( 0 , 1 , λ ^ ) μ ^ * σ ^ * = q p ,
where q p can be used to set confidence bound on Q p ( μ , σ , λ ) . If q p , ν is the 100 ν percentile of q p , the one-sided tolerance interval is
μ ^ + q 1 p , 1 ν σ ^ , μ ^ + q p , θ σ ^ .
In the following, we calculated the q p , ν and q 1 p , 1 ν for ν = 0.95 with different sample sizes of n and values of p. The simulations are based on S N ( 0 , 1 , 1 ) and results are shown in Table A4. The values of q p , ν and q 1 p , 1 ν are described in Figure 5.
From Table A4 and Figure 5, we can see that the lower bound of one-side tolerance interval increases, the upper bound of one-side tolerance interval decreases, and the interval length of one-side tolerance interval decreases as the sample size increases, which means that the larger the sample size is, the smaller and more precise the intervals are.
Table A5 selects several sample sizes for simulation and gives the CP and AL of the tolerance confidence interval. Repeat 10,000 times to get the SD of the length of the tolerance confidence intervals. The values of AL in Table A5 are displayed in Figure 6.
From Table A5, we can see that the coverage of tolerance confidence interval can reach 95 % , the AL and SD of tolerance confidence interval decrease with the increase of sample size.

6. Fiducial Distribution of Skew Normal Distribution

In practical application, it is often necessary to know the probability that the sample is larger than a certain critical value. When analyzing survival data, we need to get the probability that the patient’s survival time after illness is greater than a certain value t, that is P ( x > t ) . For example, in mechanical manufacturing, it is necessary to know the probability of the parts manufactured in the tolerance range, which can be obtained by using fiducial inference. Krishnamoorthy [23] has shown that the fiducial method is a useful tool for solving the frequency characteristics of many complex problems. The application of the fiducial method to the concrete distribution has also been studied by many scholars. O’Reilly [24] studied the fiducial distribution of exponential distribution, and Hoang-Nguyen-Thuy [6] obtained the fiducial distribution of position scale distribution family. In this section, we study the fiducial distribution for the probabilities of S N ( μ , σ 2 , λ ) .
Given X S N ( μ , σ , λ ) and t , let P t = P ( X t | μ , σ , λ ) = F X ( t | μ , σ , λ ) be the cumulative distribution function (cdf) of S N ( μ , σ 2 , λ ) . Consider the testing
H 0 : P t = P 0 V S H a : P t > P 0 ,
where P 0 is a specific value between (0,1). The hypothesis above are equivalent to
H 0 : P X μ σ t μ σ = F z t μ σ | 0 , 1 , λ t = μ + σ F 1 ( P 0 | 0 , 1 , λ ) ,
and
H a : t > μ + σ F 1 ( P 0 | 0 , 1 , λ ) .
For given level α and observed value ( μ ^ 0 , σ ^ 0 ) of ( μ , σ ), the H 0 is rejected if,
P F 1 ( P 0 | 0 , 1 , λ ) μ ^ * σ ^ * < t μ ^ 0 σ ^ 0 < α P P 0 < F * μ ^ * + σ ^ * σ ^ 0 ( t μ 0 ) < α ,
where F * is the CDF of S N ( 0 , 1 , λ ) .
Therefore, the fiducial distribution of F X ( t | μ , σ , λ ) is given by
Q P t = F * μ ^ * + σ ^ * σ ^ 0 ( t μ ^ 0 ) ,
and the 100 ( 1 α ) % fiducial confidence interval for P t is formed by the lower and upper 100 α / 2 percentiles of Q P t .
Furthermore, let P L U = P ( L X U ) = P ( X U ) P ( X L ) and replace the parameters with their fiducial quantities. We can obtain a fiducial quantity of P L U as
Q P L U = Q P U Q P L = F * μ ^ * + σ ^ * σ ^ 0 ( U μ ^ 0 ) F * μ ^ * + σ ^ * σ ^ 0 ( L μ ^ 0 ) .
Therefore, the 100 ( 1 α ) % confidence interval for P L U is Q P L U , α / 2 , Q P L U , 1 α / 2 , where Q P L U , α is the 100 α percentile of Q P L U .
To verify the validity of the constructed fiducial confidence interval, the following simulation is performed. Let α l and α r denote the left and right-tail error probability, such that α l + α r = α . We choose α l = α r = α / 2 in the study. Table 1 shows the CP and AL of fiducial confidence intervals simulated by Monte Carlo method. The SD of the AL was obtained by repeating the simulation experiment 10,000 times. Without loss of generality, the following simulations are based on S N ( 0 , 1 , 1 ) .
As can be seen from the simulation in Table 1, the coverage rate increases, AL and SD decrease with the increase of sample size.

7. Application

Corn seed quality is an important factor to determine corn yield and it is easy to suffer mechanical damage when threshing. Mancera-Rico et al. [25] conducted an experiment to measure the mechanical damage suffered by maize seeds, and they considered that maize seeds contained different levels of moisture and endosperm were compressed until rupture occurred. We choose one of the variables, stain, which has the same function as Mancera-Rico et al. [25] stated. The data set was presented in Table 2 contains 90 observations, and strain (mm) were measured on maize seeds containing flour endosperm and 8 % water.
Using the sn package in R software, the estimators of parameter for the skew normal distribution are μ ^ = 0.1814 , σ ^ = 0.1020 , and λ ^ = 1.1901 , respectively. Next, we will work on this data to illustrate our proposed methods. Based on the equations in Section 2, the estimators of the corresponding parameter for the skew normal distribution are μ ^ = 0.1531 , σ ^ = 0.1229 , and λ ^ = 2.5025 , respectively.
Furthermore, the Kolmogorov-Smirnov (K-S) test, the Anderson-Darling (A-D) goodness-of-fit tests, as well as the p-value (pval) are reported in Table 3. The K-S statistic (based on the MLE of the parameter μ ^ = 0.1814 , σ ^ = 0.1020 , and λ ^ = 1.1901 ) is 0.0531 and the corresponding p-value is 0.9613. The K-S statistic (based on the our method of the parameter μ ^ = 0.1531 , σ ^ = 0.1229 , and λ ^ = 2.5025 ) is 0.0424 and the corresponding p-value is 0.9969. Therefore, the data set is reasonably fitted for the skew normal distribution. The fitting curves of the probability densities from these two methods are also displayed in the Figure 7.
Furthermore, we use the strain data in Table 2 to study the construction of different proposed statistical intervals. The 95% confidence interval for the mean strain (MCI) of corn, 95 % prediction interval for the mean strain (MPI) in a future sample of size m = 20 , and one-sided tolerance limited with p = 0.975 , ν = 0.95 (TL) are given in Table 4.
From Table 4, we can find that the 95 % confidence interval for the mean of corn is (0.2276, 0.2621). In other words, there is 95% chance that the average strain of corn seed after extrusion will be between 0.2276 mm and 0.2621 mm. We also notice that the 95 % prediction confidence interval for the 20 samples in the future is (0.2320, 0.2665). It means that 95 % chance that the average strain of a corn seed will be between 0.2320 mm and 0.2665 mm. In addition, we study the one-sided tolerance interval about the strain of the corn seed, and found that the upper and lower tolerance limits are 0.4989 mm and 0.0688 mm respectively. This means that at least 97.5 % of the corn that will change at least 0.4989 mm has a confidence 95 % .
Finally, we study the probability of strain for the corn seed in a certain length, such as L = 0.1 , U = 0.45 . Based on Equation (4), we known
Q P L U = F * μ ^ * + σ ^ * 0.1229 ( 0.45 0.1531 ) F * μ ^ * + σ ^ * 0.1229 ( 0.1 0.1531 ) ,
where μ ^ * and σ ^ * are estimates of S N ( 0 , 1 , λ ^ ) . With the same procedure instructed in Section 6, the lower and upper 2.5th percentiles of Q P L U are calculated as 0.5833 and 0.7263. Thus, the 95% confidence interval for P ( 0.1 X 0.45 ) is (0.5833, 0.7263), which means 58.33–72.63% of corn seeds have changed 0.1 mm to 0.45 mm in length with a confidence 95%.

8. Concluding Remarks

Secondly, we propose the confidence interval of the mean, the prediction interval of the future sample mean, and one-side tolerance limit for the skew normal distribution based on the pivotal quantity approach. We discuss that the estimator of the skewness parameter λ can be obtained without depending on μ and σ , and obtain the method to estimate the parameter λ , which is simpler than the traditional MLE method. Monte Carlo random simulation experiments are carried out for all the obtained intervals. The simulation experiments show that the CPs of the confidence intervals reach the corresponding confidence levels. Moreover, the mean lengths and standard deviations of the intervals decrease as the sample size increases, and the lengths of the prediction intervals decrease as m and n increase. In addition, we study the fiducial distribution of the skew normal distribution, and the pivotal approach provides a good idea to study the mean of one sample. In the end, we employ our proposed methods on the real data, which conclude our proposed methods can provide effective and useful information. It can be used as an extension of traditional methods to better solve specific problems in practice.
In fact, the proposed estimation method has some limitations, especially for the λ 0 , which are also discussed by Azzalini [11]. In the future, the estimation method for solving the problem when λ closes to 0 can be studied. Meanwhile, we will keep working on these confidence intervals with the fiducial approach and do some comparisons between skew normal distribution and other skewed distributions, such as, lognormal distribution, skew-t distribution, and skew-Cauchy distribution. Furthermore, these three different intervals with the pivotal quantity approach based on the skew slash distribution, which proposed by Tian et al. [26], will be conducted to enrich the research work on the asymmetric data.

Author Contributions

X.Q. and W.T.: Conceptualization, Methodology, Validation, Investigation, Resources, Supervision, Project Administration, Visualization, Writing review and editing; X.Q., H.L. and Y.Y.: Software, Formal analysis, Data curation, Writing—original draft preparation, Visualization. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Informed Consent Statement

Not applicable.

Data Availability Statement

Datasets are provided in the paper.

Acknowledgments

The authors would like to thank the Editor and two anonymous referees for their careful reading of this article and for their constructive suggestions, which considerably improved this article.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Percentiles for computing 95 % , 97.5 % and 99 % confidence intervals for the mean.
Table A1. Percentiles for computing 95 % , 97.5 % and 99 % confidence intervals for the mean.
λ n5%95%2.5%97.5%1%99%
−25−0.27771.3049−0.52591.6273−0.79922.0208
6−0.16661.2251−0.41851.4106−0.67971.6698
7−0.11121.1800−0.29151.3090−0.49531.5542
9−0.04021.0680−0.22901.1891−0.35811.3687
10−0.02831.0308−0.16381.1597−0.33591.3243
12−0.00421.0027−0.10671.0969−0.29261.2365
150.04710.9596−0.08481.0461−0.18621.1543
200.08170.9283−0.01830.9984−0.14261.0830
500.17880.83850.08350.8913−0.00030.9053
1000.26920.80060.17170.82800.08300.8649
15−0.28471.4305−0.53851.5272−1.02931.9166
6−0.16511.2116−0.35371.4457−0.66801.7314
7−0.10501.1774−0.27791.3189−0.52991.5005
9−0.05121.0820−0.23411.1842−0.38311.3218
10−0.05231.0577−0.18951.1975−0.36261.2757
12−0.00271.0100−0.12951.0892−0.27801.2587
150.02980.9377−0.10261.0603−0.23091.1159
200.04710.9030−0.05560.9701−0.20231.0582
500.10460.78370.02570.8274−0.05900.8659
1000.10550.69960.03030.7351−0.02000.7846
0.055−0.27771.3786−0.52161.5586−0.84331.8587
6−0.17991.1889−0.34611.3726−0.68211.5816
7−0.11071.1825−0.32481.2880−0.56361.5984
9−0.06001.0671−0.20211.2250−0.37981.3478
10−0.04611.0461−0.17321.1317−0.38011.2893
12−0.00110.9813−0.12511.1021−0.32061.1831
150.02900.9477−0.08471.0221−0.18111.1478
200.03030.8857−0.02660.9575−0.19031.0357
500.08000.75430.02330.8203−0.06630.8512
1000.09960.67640.03520.7210−0.02620.7573
Table A2. CP and AL(SD) of 95%, 97.5% and 99% confidence intervals for mean.
Table A2. CP and AL(SD) of 95%, 97.5% and 99% confidence intervals for mean.
λ n95% (CP)AL (SD)97.5% (CP)AL (SD)99% (CP)AL (SD)
250.94631.7926 (0.7109)0.97232.2516 (0.8903)0.98513.2492 (1.3394)
60.95671.6827 (0.5977)0.96981.7316 (0.6173)0.98592.5215 (0.9335)
100.95661.1255 (0.3108)0.97641.3098 (0.3771)0.99541.7042 (0.4815)
200.95390.8409 (0.1828)0.98661.0676 (0.2354)0.99121.0727 (0.2448)
500.96470.6023 (0.1072)0.98090.6876 (0.1182)0.99270.8605 (0.1497)
1000.95790.5048 (0.0704)0.97640.5870 (0.0828)0.99080.6809 (0.0915)
150.95922.0615 (0.7752)0.97272.5247 (1.0354)0.99663.7686 (1.5299)
60.95991.8600 (0.6249)0.97532.2248 (0.7519)0.99552.8911 (0.9883)
100.95061.3376 (0.3507)0.98241.5818 (0.4343)0.99672.2160 (0.6313)
200.96880.9907 (0.1969)0.97811.1634 (0.2364)0.99691.4784 (0.2986)
500.95560.7147 (0.1096)0.97480.7930 (0.1201)0.99201.0239 (0.1566)
1000.96100.6266 (0.0839)0.97800.6732 (0.0832)0.99400.8248 (0.1056)
0.0550.95422.6378 (1.0244)0.98073.5027 (1.3501)0.99194.6244 (1.7167)
60.96952.2843 (0.7763)0.98182.4948 (0.8453)0.99213.8161 (1.3642)
100.96361.6428 (0.4366)0.97911.9899 (0.5264)0.99342.4142 (0.6480)
200.95891.2151 (0.2390)0.98851.3537 (0.2675)0.99581.6762 (0.3292)
500.95270.8795 (0.1307)0.97480.9390 (0.1374)0.99101.2342 (0.1876)
1000.96800.7027 (0.0820)0.97500.7964 (0.0943)0.99500.9969 (0.1186)
Table A3. Lower and upper percentiles for computing 95% PI for the mean of the future sample.
Table A3. Lower and upper percentiles for computing 95% PI for the mean of the future sample.
n = 20 n = 30 n = 50
mLUwidthmLUwidthmLUwidth
1−1.11822.49373.61191−0.95882.29333.25211−0.97892.22303.2019
2−0.52691.96182.48873−0.38751.55911.94662−0.57271.70312.2758
3−0.41541.65162.06705−0.18421.51521.69944−0.22171.42191.6436
4−0.39231.53251.92487−0.14681.33451.48136−0.09461.35411.4487
5−0.29181.50781.79969−0.09571.28061.376310−0.02821.14811.1763
6−0.24431.40771.652011−0.08631.19411.2804140.00931.11661.1073
7−0.16531.37861.543913−0.02711.15041.1775200.03181.06951.0377
8−0.17081.30041.471215−0.00051.15781.1573250.07381.00570.9319
9−0.06591.28571.3516170.02201.15151.1295300.12861.00860.8800
10−0.07171.26631.3380200.08161.10251.0209500.13200.98270.8507
Table A4. q p , ν and q 1 p , 1 ν for computing one sided tolerance limits with ν = 0.95 .
Table A4. q p , ν and q 1 p , 1 ν for computing one sided tolerance limits with ν = 0.95 .
q p , ν q 1 p , 1 ν
n/p0.700.900.950.9750.990.700.900.950.9750.99
5−0.8757−2.11352.9571−4.1657−5.50482.59104.23755.29846.29657.7219
6−0.7468−1.7410−2.3376−3.4057−4.72832.25273.61494.62165.55716.5805
7−0.4968−1.5049−2.1431−3.2633−3.95561.99493.39713.99975.13136.0606
8−0.4793−1.3428−1.8418−2.8103−3.33641.87413.10533.69724.80005.3883
9−0.3614−1.2859−1.7308−2.6282−3.30151.85502.95283.49504.65615.1384
10−0.4069−1.1523−1.5725−2.5213−2.90531.83692.79353.47704.48944.8346
15−0.2218−0.9547−1.4110−2.1069−2.60461.59042.51732.93893.84504.4227
20−0.1727−0.8567−1.2687−1.9871−2.51271.47902.36262.74733.57454.0068
30−0.1052−0.7806−1.1493−1.7982−2.21571.36222.19372.59133.30063.7451
50−0.0481−0.6849−1.0436−1.7050−2.07441.25972.00532.39963.06943.4490
75−0.0342−0.6817−1.0312−1.6251−1.97941.20891.93472.30212.95223.3199
100−0.0162−0.6757−0.9957−1.6056−1.97471.17211.88772.24882.88793.2148
200−0.0016−0.6560−0.9786−1.5643−1.91111.10451.79662.15202.74703.0836
Table A5. CP and AL(SD) of tolerance confidence intervals.
Table A5. CP and AL(SD) of tolerance confidence intervals.
p = 0.95 p = 0.975 p = 0.99
nCPAL (SD)CPAL (SD)CPAL (SD)
50.95807.7085 (2.8979)0.96229.8706 (3.7216)0.963212.2723 (4.6524)
60.95926.6142 (2.2054)0.96108.5191 (2.8111)0.964610.7338 (3.6711)
70.95555.8302 (1.8119)0.96048.0063 (2.4752)0.96269.5267 (2.9277)
100.95684.8681 (1.1886)0.96206.8025 (1.7521)0.96017.4704 (1.8516)
200.95553.9673 (0.6805)0.95925.4877 (0.9281)0.95936.3928 (1.0884)
500.95363.4139 (0.3558)0.95434.7411 (0.5072)0.95865.4889 (0.5626)
2000.95273.1305 (0.1613)0.95404.3032 (0.2228)0.95604.9850 (0.2567)

References

  1. Fisher, R.A. Inverse probability. In Mathematical Proceedings of the Cambridge Philosophical Society; Cambridge University Press: Cambridge, UK, 1930; Volume 26, pp. 528–535. [Google Scholar]
  2. Li, X.; Wang, J.; Liang, H. Comparison of several means: A fiducial based approach. Comput. Stat. Data Anal. 2011, 55, 1993–2002. [Google Scholar] [CrossRef]
  3. Wang, C.M.; Hannig, J.; Iyer, H.K. Fiducial prediction intervals. J. Stat. Plan. Inference 2012, 142, 1980–1990. [Google Scholar] [CrossRef]
  4. Veronese, P.; Melilli, E. Fiducial and confidence distributions for real exponential families. Scand. J. Stat. 2015, 42, 471–484. [Google Scholar] [CrossRef]
  5. Krishnamoorthy, K.; Wang, X. Fiducial confidence limits and prediction limits for a gamma distribution: Censored and uncensored cases. Environmetrics 2016, 27, 479–493. [Google Scholar] [CrossRef]
  6. Hoang-Nguyen-Thuy, N.; Krishnamoorthy, K. Estimation of the probability content in a specified interval using fiducial approach. J. Appl. Stat. 2021, 48, 1541–1558. [Google Scholar] [CrossRef]
  7. Chen, Z. Exact confidence interval for the shape parameter of a log-logistic distribution. J. Stat. Comput. Simul. 1997, 56, 193–211. [Google Scholar] [CrossRef]
  8. Seo, J.I.; Jeon, J.W.; Kang, S.B. Exact interval inference for the two-parameter Rayleigh distribution based on the upper record values. J. Probab. Stat. 2016, 2016, 8246390. [Google Scholar] [CrossRef]
  9. Johnson, G.S. Tolerance and prediction intervals for non-normal models. arXiv 2020, arXiv:2011.11583. [Google Scholar]
  10. Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
  11. Azzalini, A. The Skew-Normal and Related Families; Cambridge University Press: Cambridge, UK, 2013; Volume 3. [Google Scholar]
  12. Mameli, V.; Musio, M.; Sauleau, E.; Biggeri, A. Large sample confidence intervals for the skewness parameter of the skew-normal distribution based on Fisher’s transformation. J. Appl. Stat. 2012, 39, 1693–1702. [Google Scholar] [CrossRef]
  13. Wang, Z.; Wang, C.; Wang, T. Estimation of Location Parameter in the Skew Normal Setting with Known Coefficient of Variation and Skewness. Int. J. Intell. Technol. Appl. Stat. 2016, 9, 191–208. [Google Scholar]
  14. Wang, C.; Wang, T.; Trafimow, D.; Myuz, H.A. Desired sample size for estimating the skewness under skew normal settings. In International Conference of the Thailand Econometrics Society; Springer: Cham, Switzerland, 2019; pp. 152–162. [Google Scholar]
  15. Azzalini, A. Further results on a class of distributions which includes the normal ones. Statistica 1986, 46, 199–208. [Google Scholar]
  16. Figueiredo, F.; Gomes, M.I. The skew-normal distribution in SPC. REVSTAT–Statistical J. 2013, 11, 83–104. [Google Scholar]
  17. Lawless, J.F. Statistical Models and Methods for Lifetime Data; John Wiley Sons: Hoboken, NJ, USA, 2011; Volume 362. [Google Scholar]
  18. Krishnamoorthy, K.; Waguespack, D.; Hoang-Nguyen-Thuy, N. Confidence interval, prediction interval and tolerance limits for a two-parameter Rayleigh distribution. J. Appl. Stat. 2019, 47, 160–175. [Google Scholar] [CrossRef]
  19. Hahn, G.J. Factors for calculating two-sided prediction intervals for samples from a normal distribution. J. Am. Stat. Assoc. 1969, 64, 878–888. [Google Scholar] [CrossRef]
  20. Kaminsky, K.S.; Nelson, P.I. Prediction intervals for the exponential distribution using subsets of the data. Technometrics 1974, 16, 57–59. [Google Scholar] [CrossRef]
  21. Proschan, F. Confidence and tolerance intervals for the normal distribution. J. Am. Stat. Assoc. 1953, 48, 550–564. [Google Scholar] [CrossRef]
  22. Hoang-Nguyen-Thuy, N.; Krishnamoorthy, K. A method for computing tolerance intervals for a location-scale family of distributions. Comput. Stat. 2021, 36, 1065–1092. [Google Scholar] [CrossRef]
  23. Krishnamoorthy, K.; Mathew, T. Statistical methods for establishing equivalency of several sampling devices. J. Occup. Environ. Hyg. 2007, 5, 15–21. [Google Scholar] [CrossRef]
  24. O’Reilly, F.; Rueda, R. Fiducial inferences for the truncated exponential distribution. Commun. Stat.-Theory Methods 2007, 36, 2207–2212. [Google Scholar] [CrossRef]
  25. Mancera-Rico, A.; Garca, G.; Zavaleta, H.A. Fracture resistance and physiological quality of corn seeds under axial compression. Rev. Mex Cienc. Agrcolas 2016, 7, 45–57. [Google Scholar]
  26. Tian, W.; Wang, T.; Gupta, A.K. A new family of multivariate skew slash distribution. Commun. Stat.-Theory Methods 2018, 47, 5812–5824. [Google Scholar] [CrossRef]
Figure 1. PDF graph for different the values of λ .
Figure 1. PDF graph for different the values of λ .
Symmetry 14 00855 g001
Figure 2. Percentiles for computing 95 % , 97.5 % and 99 % confidence intervals for the mean.
Figure 2. Percentiles for computing 95 % , 97.5 % and 99 % confidence intervals for the mean.
Symmetry 14 00855 g002aSymmetry 14 00855 g002b
Figure 3. Percentiles for computing 95 % , 97.5 % and 99 % confidence intervals for the mean.
Figure 3. Percentiles for computing 95 % , 97.5 % and 99 % confidence intervals for the mean.
Symmetry 14 00855 g003
Figure 4. The values of width for computing 95% PI for the mean of the future sample.
Figure 4. The values of width for computing 95% PI for the mean of the future sample.
Symmetry 14 00855 g004
Figure 5. q p , ν and q 1 p , 1 ν for computing one sided tolerance limits with ν = 0.95 .
Figure 5. q p , ν and q 1 p , 1 ν for computing one sided tolerance limits with ν = 0.95 .
Symmetry 14 00855 g005
Figure 6. Percentiles for computing 95 % , 97.5 % and 99 % confidence intervals for the mean.
Figure 6. Percentiles for computing 95 % , 97.5 % and 99 % confidence intervals for the mean.
Symmetry 14 00855 g006
Figure 7. Histogram and PDF fit.
Figure 7. Histogram and PDF fit.
Symmetry 14 00855 g007
Table 1. CP and AL(SD) of 95% fiducial confidence intervals for P ( L < X < U ) .
Table 1. CP and AL(SD) of 95% fiducial confidence intervals for P ( L < X < U ) .
n = 10 n = 20 n = 30 n = 50
LUCPAL (SD)CPAL (SD)CPAL (SD)CPAL (SD)
−0.20.20.94120.1940 (0.0088)0.95300.1499 (0.0050)0.95590.1292 (0.0045)0.95640.1060 (0.0028)
−0.50.50.94770.4542 (0.0152)0.95580.3450 (0.0105)0.95140.2896 (0.0076)0.95560.2396 (0.0075)
−110.94340.6148 (0.0136)0.95040.4893 (0.0124)0.95280.4153 (0.0111)0.95670.3519 (0.0103)
−220.94710.4530 (0.0172)0.94850.3409 (0.0133)0.95070.2684 (0.0118)0.95370.1657 (0.0089)
−0.20.40.94250.2877 (0.0117)0.95300.2329 (0.0089)0.95490.1996 (0.0059)0.95670.1657 (0.0050)
−0.30.50.94710.4098 (0.0139)0.95060.2935 (0.0098)0.94540.2578 (0.0070)0.96580.2238 (0.0060)
−1.50.80.94770.5718 (0.0166)0.95210.4158 (0.0140)0.95420.3593 (0.0103)0.95850.2842 (0.0086)
−2.310.94220.4613 (0.0138)0.94460.3344 (0.0102)0.95060.2616 (0.0079)0.95450.2159 (0.0069)
0.10.30.94270.1151 (0.0042)0.95000.0824 (0.0031)0.95560.0656 (0.0024)0.95870.0604 (0.0017)
0.51.50.94460.3757 (0.0189)0.95450.2969 (0.0109)0.95280.2502 (0.0093)0.95460.2253 (0.0073)
0.630.94830.4716 (0.0165)0.95450.3622 (0.0144)0.95590.3096 (0.0125)0.95770.2752 (0.0095)
−0.4−0.10.95150.1332 (0.0056)0.95470.1016 (0.0031)0.95220.0808 (0.0025)0.95570.0637 (0.0021)
−2−10.94820.2810 (0.0047)0.95030.2345 (0.0052)0.95490.2241 (0.0045)0.95550.2031 (0.0036)
−2.5−0.30.94600.5189 (0.0153)0.94960.4297 (0.0133)0.95690.3678 (0.0117)0.95960.3521 (0.0103)
Table 2. Strain of maize seeds.
Table 2. Strain of maize seeds.
0.2930.2740.2800.2620.2700.2840.1770.4630.2570.2120.1920.3360.220
0.3710.2080.2870.3430.2610.2460.2760.2620.2690.2260.3310.2670.231
0.3290.2460.4650.1680.1640.1700.1930.2700.2420.3690.2420.2060.227
0.2260.3070.3250.1660.1180.1450.2250.2100.1300.1030.2320.2570.099
0.2490.1160.1830.3550.1470.1280.1930.2370.1280.1860.4480.1600.282
0.1970.4000.2130.1960.2720.3860.2130.1650.2150.1920.1470.1260.186
0.3220.2010.4150.2230.2870.3310.2340.1300.3180.3220.1850.357
Table 3. Goodness of fit test for data sets.
Table 3. Goodness of fit test for data sets.
μ ^ σ ^ λ ^
Our method0.15310.12292.5025
K-S D = 0.0424 p v a l = 0.9969
A-D A n = 0.1484 p v a l = 0.9987
μ ^ σ ^ λ ^
MLE0.18140.10201.1901
K-S D = 0.0531 p v a l = 0.9613
A-D A n = 0.3517 p v a l = 0.8946
Table 4. Confidence interval, prediction interval and tolerance interval of the data.
Table 4. Confidence interval, prediction interval and tolerance interval of the data.
Lower PercentilesUpper PercentilesInterval
MCI0.60640.8867(0.2276, 0.2621)
MPI0.64200.9223(0.2320, 0.2665)
TL−0.68562.8137(0.0688, 0.4989)
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Qi, X.; Li, H.; Tian, W.; Yang, Y. Confidence Interval, Prediction Interval and Tolerance Interval for the Skew Normal Distribution: A Pivotal Approach. Symmetry 2022, 14, 855. https://doi.org/10.3390/sym14050855

AMA Style

Qi X, Li H, Tian W, Yang Y. Confidence Interval, Prediction Interval and Tolerance Interval for the Skew Normal Distribution: A Pivotal Approach. Symmetry. 2022; 14(5):855. https://doi.org/10.3390/sym14050855

Chicago/Turabian Style

Qi, Xinlei, Huihui Li, Weizhong Tian, and Yaoting Yang. 2022. "Confidence Interval, Prediction Interval and Tolerance Interval for the Skew Normal Distribution: A Pivotal Approach" Symmetry 14, no. 5: 855. https://doi.org/10.3390/sym14050855

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop