Abstract
We consider third-order likelihood inferences for the parameters, quantiles and reliability function of the logistic distribution. This theory involves the conditioning and marginalization of the likelihood function. The logistic distribution is a symmetric distribution which is closely related to normal distributions, and which has several applications because of its mathematical tractability and the availability of a closed-form cumulative distribution function. The performance of the third-order techniques is investigated and compared with the first-order techniques using simulations. The results show that the third-order techniques are far more accurate than the usual first-order inference procedures. This results in more accurate inferences about the functions of the parameters of the distribution, which leads to more precise conclusions about the phenomenon modeled by the logistic distribution.
Keywords:
likelihood ratio statistic; third order inference; logistic distribution; ancillary directions; conditioning; Barndorff-Nielsen’s formula MSC:
Primary 62F05; Secondary 62F03
1. Introduction
The logistic distribution is a continuous distribution with many important applications in various fields, including logistic regression, logit models, neural networks, and finance. The logistic distribution is symmetrical, and it is very close in shape to the normal distribution, but it has heavier tails. The symmetry of the logistic distribution allows it to provide a better model than the normal distribution for applications such as extreme events. The logistic model has been applied earlier as a growth model in human populations and in some biological organisms. See [1] for more details and relevant references. Balakrishnan [2] provided a detailed account of this distribution as well.
Inference procedures for the parameters of the logistic distribution and the related quantities have received considerable attention in the literature. For example, the construction of confidence intervals for the parameters of this distribution was discussed by [3], who determined the necessary percentage points of the pivotal quantities through Monte Carlo simulations. Several other authors have worked on various aspects of this distribution and the related log-logistic distribution, including [4,5] among others.
Inferences about the functions of the parameters of the logistic distribution are the key focus of several scientific investigations. The functions could be a distribution quantile or a reliability function, which are needed in industrial testing to determine the reliability of products and warranty periods (see [6] for detailed examples on the importance of these and other functions in reliability studies). The inferences are usually based on the maximum likelihood estimator and the related likelihood quantities. These inferences are justified by the asymptotic properties of likelihood inference procedures, and require large samples for their validity and accuracy. Recent relevant research on the maximum likelihood estimation in various models of interest includes [4,5,7]. In this paper, we shall derive higher-order likelihood inference procedures for the parameters, quantiles, and reliability function of this distribution. The methods are based on the third-order inference procedures developed by [8]. The third-order inference procedures usually need smaller sample sizes to achieve the desired accuracy and validity, which leads to a reduction in the cost and time of the scientific investigation. The methods are developed in Section 2. A simulation study is conducted in Section 3 to investigate the performance of the third-order inference procedures and to make sure that they achieve the desired objectives. The results and conclusions are given in the final section.
2. Likelihood Inference
Likelihood theory and higher-order refinements have received considerable attention in the literature. Barndorff-Nielsen and Cox [9] and Severini [10] have discussed this topic in detail. One of the most important lines of research is the likelihood ratio statistic and its refinements and modifications. Fraser and Reid [11] anf Fraser et al. [8] have developed third-order refinements to the likelihood ratio statistic. These modifications were further investigated for several statistical models, including the Weibull distribution [12] and lognormal regression [13], among others. It appears that little attention has been paid so far to developing third-order likelihood inferences for the logistic distribution and the related functions of parameters. Since this model has important applications in various fields, this work is an attempt to fill this gap.
Let be a vector of length observations from a continuous statistical model with joint density . Let be a parameter vector of length . Consider the inference for a scalar parameter of interest . Third-order inference procedures use modifications of the first-order statistic given by:
where is the MLE of .
Let be the observed information matrix and let be its inverse. Let be the element in the inverse of the information matrix corresponding to the scalar parameter , and let be the gradient of the interest parameter evaluated at the MLE , and then is obtained from:
The signed square root of the likelihood ratio statistic is defined as:
where is the restricted MLE of for a given . Both and have p-values that are accurate to the first order . Fraser et al. [8] proposed simple and widely applicable formulas to find a quantity , such that the resulting approximations to the p-value have third-order accuracy . The approach used by [8] for the construction of involves dimension reduction from to by conditioning it on an approximate ancillary statistic. The resulting model is then approximated by a member of the exponential family, called the tangent exponential model (see [9]). The canonical parameter of the tangent exponential model is obtained by the differentiation of the log-likelihood function with respect to the data (the sample space derivative). The nuisance parameter is then eliminated by marginalization.
Skovgaard [14] and Fraser an Reid [11] showed that only the second-order ancillary is required to achieve third-order accuracy. Let be the observed data point and let be the corresponding maximum likelihood estimate. Consider a vector ’ of pivotal quantities denoted by , and the array of “ancillary directions” is obtained from the pivotal by:
where the expression is calculated for a fixed . The second-order ancillary generated at the data point is free of the data point of the second order. The local canonical parameter of the tangent exponential model is given by:
The quantity given by:
which is a standardized maximum likelihood departure in the parameterization , where:
where is the constrained maximum likelihood value based on the data point , and the gradient of with respect to is calculated as:
In the next section, we will apply this methodology to the logistic distribution, where we will consider inferences about the parameters, quantiles, and reliability function.
3. Third-Order Likelihood Inference in the Logistic Distribution
The probability density function of the logistic distribution with location parameter and scale parameter is given by:
The corresponding cumulative distribution function is given by:
Let be a random sample of size from this distribution. The likelihood function of is given by:
The corresponding log-likelihood function is given by:
To obtain the maximum likelihood estimator and the associated quantities, and to develop the third-order inference techniques, we need the following first- and second-order partial derivatives:
The array of ancillary directions , obtained from (4), is given by:
Define the Lagrangian function as follows (see [12]):
The first-order conditions are:
The tilted log-likelihood function is defined as:
where is the value of the Lagrange multiplier. Now we will consider some specific cases.
3.1. Inference about the Scale Parameter
We have ; therefore, the Lagrangian and its derivatives become:
It follows that , where , and the tilted likelihood becomes:
We have:
This implies that:
Recall from (5) that the local canonical parameter is given by:
For the logistic distribution, consider:
Therefore:
It follows that:
Using (5), the components of the local canonical parameter are given by:
We also need to find:
where:
Now let , and in the present case we have . We obtain:
An estimate of the variance of is given by:
The signed likelihood departure is given by:
The maximum likelihood departure is given by:
The Lugannani and Rice [15] approximation of the tail probability is given by:
where the functions and denote the probability density function and the cumulative distribution function of the standard normal distribution. The Barndorff-Nielsen [16] formula for the tail probability is given by:
3.2. Inference about the Location Parameter
We have , and therefore:
with , , and .
Equating these equations to zero and solving simultaneously, it follows that , and the tilted likelihood becomes:
We have:
This implies that:
We also have .
3.3. Inference about the Quantiles
The quantile of the logistic distribution, , is given by the solution of the equation:
Therefore, the interest parameter is of the form , where .
We have:
It follows that:
The tilted likelihood becomes:
With:
This implies that:
The remaining quantities are the same as case 1 with .
3.4. Inference about the Reliability Function
The reliability function of the logistic distribution at time is given by:
Note that . Therefore, the interest parameter is reduced to . The Lagrangian and its derivatives for this situation are given by:
By equating the derivatives to zero and solving, we obtain:
It follows that the tilted likelihood becomes:
We have:
The remaining quantities are the same as case 1 with .
4. Simulation Study
A simulation study was conducted to investigate the performance of the intervals described above. We used sample sizes of 10, 20, 30, 40, 50, 70, and 100. For the quantile case, we used . For the reliability function, we used , where is the inverse cumulative distribution function of the standard logistic distribution. We used , for the sample generation. For each combination of the simulation indices, we generated 10,000 samples. For each sample, we computed the Wald interval and the LRT interval, in addition to the intervals based on the Barndorff-Nielsen and Lugannani and Rice modifications of the LRT that are based on the likelihood departure derived by [8]. The confidence coefficient or probability content is taken as 0.90, 0.95, and 0.99. The results of our simulations are given in Table 1, Table 2, Table 3 and Table 4.
Table 1.
Coverage Probabilities for the Confidence Intervals for .
Table 2.
Coverage Probabilities for the Confidence Intervals for .
Table 3.
Coverage Probabilities for the Confidence Intervals for Quantiles .
Table 4.
Coverage Probabilities for the Confidence Intervals for the Reliability Function .
5. Results and Conclusions
The results for the confidence intervals for the location parameter are given in Table 1. It appears that the LR intervals need at least a sample size of 20 for the confidence coefficients of 90% and 95%, and sample sizes of at least 30 for intervals with a confidence coefficient of 99%, to achieve reasonable accuracy in terms of the observed coverage probability. The Wald intervals need even larger sample sizes to have a reasonable performance. Wald intervals tend to be generally conservative for small sample sizes, while the LR interval tends to be anti-conservative. On the other hand, the third-order confidence intervals BN and LGR have a satisfactory performance, even for sample sizes as small as 10. The performances of the BN interval and the LGR interval are quite similar.
A similar pattern is observed for intervals for the scale parameter . When the sample size is small, such as 10 or 20, the Wald intervals tend to be highly anti-conservative. The refined likelihood ratio intervals (BN and LGR) have a quite satisfactory performance, even for samples of size 10, and for all values of under study. A similar performance is observed for quantiles. However, for the reliability function, all intervals have a satisfactory performance, while the BN and LGR intervals have the closest coverage probability to the nominal one.
It appears that the third-order procedures are very effective in improving the coverage probability of the LR intervals, and they give very accurate results, even for samples as small as 10 observations only. This is very desirable in situations where it is difficult or costly to obtain large samples.
The work in this paper achieved its goal of developing more accurate likelihood inference procedures that need smaller sample sizes for its validity, resulting in more precise inferences about the quantities of practical importance in applications, such as the distribution quantiles and the reliability function. Other functions of importance can be treated using the same general formulation followed in this paper. The consequences of applying third-order methods are a reduction in the cost and the time needed for obtaining the samples, while still obtaining precise and accurate conclusions about the phenomenon under study.
Funding
Open access funding provided by the Qatar National Library.
Acknowledgments
The author would like to thank the referees for their thoughtful comments and suggestions that improved the paper.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions, 2nd ed.; John Wiley and Sons: New York, NY, USA, 1995; Volume 2. [Google Scholar]
- Balakrishnan, N. Handbook of the Logistic Distribution; Chapman and Hall: London, UK, 1992. [Google Scholar]
- Antle, C.; Klimko, L.; Harkness, W. Confidence intervals for the parameters of the logistic distribution. Biometrika 1970, 57, 397–402. [Google Scholar] [CrossRef]
- Asgharzadeh, A.; Valiollahi, R.; Mousa, A. Point and interval estimation for the logistic distribution based on record data. SORT-Stat. Oper. Res. Trans. 2016, 40, 89–112. [Google Scholar]
- He, X.; Chen, W.; Qian, W. Maximum likelihood estimators of the parameters of the log-logistic distribution. Stat. Pap. 2020, 61, 1875–1892. [Google Scholar] [CrossRef]
- Meeker, W.; Escobar, L. Statistical Methods for Reliability Data; Wiley Inter-Science: Hoboken, NJ, USA, 1998. [Google Scholar]
- Chen, W.X.; Long, C.X.; Yang, R.; Yao, D.S. Maximum likelihood estimator of the location parameter under moving extremes ranked set sampling design. Acta Math. Appl. Sin. (Engl. Ser.) 2021, 37, 101–108. [Google Scholar] [CrossRef]
- Fraser, D.A.S.; Reid, N.; Wu, J. A simple general formula for tail probabilities for frequentist and Bayesian inference. Biometrika 1999, 86, 249–264. [Google Scholar] [CrossRef]
- Barndorff-Nielsen, O.E.; Cox, D.R. Inference and Asymptotics; Chapman and Hall: London, UK, 1994. [Google Scholar]
- Severini, T. Likelihood Methods in Statistics; Oxford University Press: London, UK, 2000. [Google Scholar]
- Fraser, D.A.S.; Reid, N. Ancillaries and third-order significance. Utilitas Math. 1995, 7, 33–53. [Google Scholar]
- Rekkas, M.; Wong, A. Third-order inference for the Weibull distribution. Comput. Stat. Data Anal. 2005, 49, 499–525. [Google Scholar] [CrossRef]
- Tarng, C. Third-order likelihood-based inference for the log-normal regression model. J. Appl. Stat. 2014, 41, 1976–1988. [Google Scholar] [CrossRef]
- Skovgaard, I.M. Successive improvements of the order of ancillarity. Biometrika 1986, 73, 516–519. [Google Scholar] [CrossRef]
- Lugannani, R.; Rice, S. Saddlepoint approximation for the distribution function of the sum of independent variables. Adv. Appl. Probab. 1980, 12, 475–490. [Google Scholar] [CrossRef]
- Barndorff-Nielsen, O.E. Modified signed log likelihood ratio. Biometrika 1991, 78, 557–563. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).