Abstract
Count data containing both excess zeros and ones occur in many fields, and the zero-and-one inflated distribution is suitable for analyzing them. Herein, we construct confidence intervals (CIs) for the parameters of the zero-and-one inflated cosine geometric (ZOICG) distribution constructed by using five methods: a Wald CI based on the maximum likelihood estimate, equal-tailed Bayesian CIs based on the uniform or Jeffreys prior, and the highest posterior density intervals based on the uniform or Jeffreys prior. Their efficiencies were compared in terms of their coverage probabilities and average lengths via a simulation study. The results show that the highest posterior density intervals based on the uniform prior performed the best in most cases. The number of new daily COVID-19-related deaths in Luxembourg in 2020 involving data with a high proportion of zeros and ones were analyzed. It was found that the ZOICG model was appropriate for this scenario.
Keywords:
zero-and-one inflated cosine geometric distribution; Bayesian estimation; Metropolis–Hasting algorithm; confidence intervals MSC:
62F25
1. Introduction
Overdispersed count data with excess zeros frequently occur in various fields such as natural science (e.g., the number of torrential rainfall incidences at the Daegu and Busan rain gauge stations in South Korea [1]), medical science (e.g., the DMFT (decayed, missing, and filled teeth) index in dentistry [2], the number of falls by people with Parkinson’s disease [3], and the number of daily COVID-19 deaths in Thailand [4]), and insurance (e.g., the frequency of health insurance claims [5]). Although the Poisson model is widely used to analyze such discrete data, the strong assumption of the equality of the mean and variance implies that it is inadequate for modeling data where the variance is larger than the mean, which is termed “overdispersion” [6] and can arise in various ways.
One common source of overdispersion is when the observed counts contain excess zeros, which has motivated the creation of modified count models such as the zero-inflated (ZI) and hurdle models. These two classes can be considered finite mixture models of two subpopulations (components): (1) the observations contain zeros with a probability , and (2) the observations occur with a probability from the baseline distribution of a ZI model and the zero-truncated probability mass function (pmf) (the nonzero part) of a hurdle model. One of the most popular ZI models is the ZI Poisson (ZIP) model, where the Poisson distribution is the baseline distribution [7].
However, overdispersion arises when large numbers of zeros and ones occur simultaneously. Since the ZI model is not an appropriate for model such data, it has been extended to the zero-and-one-inflated (ZOI) model combined with several other distributions to produce the ZOI Poisson (ZOIP), ZOI geometric, ZOI negative binomial-beta exponential (ZOINB-BE), and ZOIP-Lindley mixed distributions. Zhang et al. [8] explored the ZOIP distribution extended from the ZIP distribution [9]. Five equivalent stochastic representations for a ZOIP random variable were presented, and their important distributional properties were derived. Tang et al. [10] provided a ZOIP model and analyzed two real datasets of Legionnaires’ disease cases in Singapore and accidental deaths in Detroit, USA, both of which contained high proportions of zeros and ones. They used the data augmentation method to obtain the maximum likelihood estimators (MLEs) via the EM algorithm and Bayesian estimations via Gibbs sampling. By obtaining the lowest Akaike information criterion (AIC), deviance information criterion (DIC), and widely applicable information criterion (WAIC) values for the ZOIP, they reported that it was more appropriate for these datasets than the ZIP model. Liu et al. [11] analyzed two real datasets of Legionnaires’ disease cases in Singapore and accidental deaths in Detroit, USA by using the ZOIP model with both the maximum likelihood and Bayesian estimation. They found that the ZOIP model was more appropriate than the ZIP model for analyzing these two real datasets. Liu et al. [12] studied the number of daily accidental deaths in 1994, which were available from the NMMAPS database, by using a ZOIP regression model and investigated the maximum likelihood and Bayesian estimation, the expectation maximization algorithm, the generalized expectation maximization algorithm, and Gibbs’ sampling to estimate its parameters. They found that the ZOIP regression model was more appropriate than the ZIP model for analyzing this accidental deaths dataset, as it attained lower AIC, Bayesian information criterion (BIC), DIC, and WAIC values. Xiao et al. [13] constructed a ZOI geometric distribution regression model and introduced Pólya-gamma latent variables in the Bayesian inference, and they found that it was more suitable for analyzing a doctoral dissertation dataset than the ZOIP regression model. Jornsatian and Bodhisuwan [14] presented a ZOI negative binomial-beta exponential (ZOINB-BE) distribution, investigated some of its important properties, and analyzed three real datasets: the number of visits to the doctor in Germany in 1998 (the COUNT package in the R programming suite), the number of accidental injuries in the US in 2001 [15,16], and the number of monthly crimes in Greece from 1982 to 1993 [17,18]. They found that the ZOINB-BE distribution was most appropriate to fit these data by attaining the lowest log-likelihood, AIC, mean absolute error, and root mean squared error values. Tajuddin et al. [19] introduced the ZOIP–Lindley distribution and developed MLEs and method-of-moment estimators for its parameters. They found that it was the most appropriate for analyzing two datasets: the number of criminal acts [20] and the number of stillbirths in New Zealand white rabbits [21]. Mohammadi et al. [22] introduced a zero-and-one inflated INAR(1) process with a Poisson–Lindley distribution and analyzed the number of abortions of animals reported monthly, which contained large proportions of zeros and ones. They found that the proposed model had the best fit based on the AIC, BIC, log-likelihood, and root mean square differences between the observation and prediction (RMS) criteria.
Aside from these mixed distributions, another one is the two-parameter discrete cosine geometric (CG) distribution, which was proposed by Chesneau [23]. This belongs to the family of weighted geometric distributions, with its pmf given by
where
If , then we can obtain , and X is a standard geometric distribution. A weighted geometric distribution makes the CG distribution more flexible than the standard geometric distribution. The former is better for analyzing overdispersed data than the Poisson, geometric, negative binomial (NB), and weighted NB distributions. Junnumtuam et al. [24] extended it and proposed the ZICG distribution (with CG as the baseline distribution), and they reported that it is appropriate for fitting overdispersed count data containing excess zeros, such as the number of daily positive COVID-19 cases at the Tokyo 2020 Olympic Games.
Since studies on ZOI count data have gained much research interest, and the CG distribution is appropriate for overdispersed data, we can extend it to the novel, four-parameter, discrete ZOICG distribution and derive some of its statistical properties such as the pmf, moment-generating function (mgf), mean, variance, and Fisher information. Moreover, its parameters are estimated by deriving their confidence intervals (CIs). There are several examples of applying CIs to analyze ZI and ZOI count data. Liu et al. [11] considered both point and interval estimation for the parameters of a ZOIP model and compared Bayesian estimation using either the Jeffreys or reference prior with the MLE method via Monte Carlo simulation. The results indicated that the Bayesian estimates performed slightly better when the sample size was small or moderate. Tian et el. [25] proposed CIs for the mean of a zero-and-one inflated population by using the jackknife empirical likelihood and adjusted jackknife empirical likelihood methods. Wald CIs were constructed for the parameters in the Bernoulli component of the ZIP and hurdle models in [26] and for the ZIP mean in [27].
This motivated us to study the confidence intervals for the parameters of the ZOICG distribution to estimate the overdispersed data, which contain a large proportion of zeros and ones with a high index of dispersion, by using five methods: a Wald CI based on the MLE, equal-tailed Bayesian CIs based on the uniform or Jeffreys prior, and highest posterior density (HPD) intervals based on the uniform or Jeffreys prior. Furthermore, real data containing excess zeros and ones (the number of new daily COVID-19 deaths in Luxembourg in 2020) were used to investigate their efficacies.
2. The ZOICG Distribution
A random variable Y in a ZOICG model can be derived as follows:
where Z is a Bernoulli random variable with a success probability , is a Bernoulli random variable with a success probability , X follows a CG distribution with parameters p and , and and X are mutually independent. Hence, the relationship between Y and becomes
which follows a ZOICG distribution with parameters and . This can be verified by
Hence, the pmf of the nonnegative integer-valued random variable Y is
where , and .
2.1. Moment-Generating Function
The moment-generating function of can be derived by using [8] as follows:
where and . Thus
2.2. Maximum Likelihood Estimation
Alternatively, we can develop the second form of the ZOICG model by using the following transformation of the first form with a similar method to that in [11]. The respective probabilities of Y being zeros and ones are
Hence, the pmf from Equation (6) becomes
where , and .
Based on a random sample of a size n from the ZOICG model in Equation (15), we define the likelihood function as follows:
where , and . The mean and variance of Y are respectively given by
According to the likelihood function in Equation (16), the log-likelihood function is given by
Hence, the MLEs of and are
In addition, the MLEs of p and as well as and are the respective solutions to the following equations:
which can be solved numerically according to the Newton-type algorithm using the nlm function in [28]. Furthermore, parameters and in the ZOICG model in Equation (6) can be expressed by using . Based on the one-to-one transformation in Equation (14), the MLEs of and are respectively given by
2.3. The Fisher Information Matrix
The observed data log-likelihood function of the ZOICG model in Equation (15) can be expressed as
The score vector is denoted by
where
The second derivatives are given by
Since
Thus, the expected Fisher information matrix can be calculated as , where
Moreover, we have
Thus the expected Fisher information matrix of for one observation, when using a similar method to that in [11], is
where
and
which implies that the two-group parameters and are independent, and this is useful information for applying the Bayesian inference in the next section.
3. Bayesian Inference
In this section, suitable priors for the ZOICG model in Equation (15) are derived by using the Bayesian framework.
3.1. Jeffreys Prior
Since Jeffreys prior [29,30] is proportional to the square root of the determinant of the Fisher information matrix, that for can be derived as follows:
where , , , and .
3.2. The Uniform Prior
The uniform prior on (0,1) is widely used as the prior distribution in Bayesian analysis, and that for is given by
3.3. The Posterior Distribution
The joint posterior distributions of using the Jeffreys and uniform priors can be respectively expressed as
Bayesian inference is based on the posterior distribution of the parameter interest obtained analytically or via Markov chain Monte Carlo methods. In particular, the 95% Bayesian CI for the parameter interest is simply from the 2.5 percentile to the 97.5 percentile of the posterior distribution. For an asymmetric posterior (the usual case), the HPD interval is not the same as the Bayesian CI from the 2.5 percentile to the 97.5 percentile [31], and so in practice, their efficiencies are compared via their coverage probabilities (CPs) and average lengths (ALs). According to the joint posterior distribution of (Equations (29) and (30)), the marginal posterior distribution of using the Jeffreys and uniform priors are respectively given by
and
These are Dirichlet distributions with shape parameters and , respectively. The marginal posterior distribution of p using the Jeffreys and uniform priors are respectively given by
and
Similarly, the marginal posterior distribution of using the Jeffreys and uniform priors are respectively given by
and
Since it is evident that the closed forms of the marginal posterior distributions of p and cannot be evaluated, various general-purpose simulation methods can be used to evaluate Equations (33)–(36) without requiring the complete closed-form expression. One popular method, the Metropolis–Hastings algorithm, was applied in the present study. It was used to generate a large sequence of sampled values from the posterior distribution without fully specifying it, which then allowed us to estimate it [32].
The random walk Metropolis–Hastings algorithm is a special case of the Metropolis–Hastings algorithm that can be used to simulate the proposal ,, where is a random perturbation of that is simulated from a chosen probability distribution (e.g., a normal distribution with a mean of zero and a specified variance). Proposal is taken to be the next value in the chain with an acceptance probability that depends on the prior and data distributions (i.e., the likelihood function). The acceptance probability is the ratio of two posterior distributions, which can be calculated as follows:
If proposal is not taken, then is set to as the next value in the chain. This process is specified in Algorithm 1.
| Algorithm 1 The random walk Metropolis–Hastings algorithm. |
|
3.4. The Bayesian CIs
In Bayesian analysis, interval estimates are based on the quantiles of the posterior distribution of a parameter. The interpretation of a credible interval is that it has a probability of containing the parameter because it is based on its posterior distribution. The most popular type of credible interval is an equal-tailed interval that is limited by the probabilities to the left of the lower limit () and the right of the upper limit (). Thus, can be obtained as follows:
where and are the and quantiles of the function g, respectively.
Equal-tailed intervals are relatively easier to calculate than other ones, and they perform best when the posterior distribution is fairly symmetric. Since the marginal posterior distributions of p and are dependent, and their closed forms do not exist, the Metropolis–Hastings steps within a Gibbs sampling algorithm can be used to estimate them [33]. Hence, it was applied to generate a large sequence of sampled values from the marginal posterior distribution in Equations (33)–(36). The procedure to generate samples and construct the Bayesian CIs for parameters p and is detailed in Algorithm 2.
3.5. The HPD Intervals
When the posterior is not symmetric, it is best to use the HPD interval calculated by finding its lower and upper limits corresponding to the HPD region (i.e., the most probable region). The resulting interval is generally the most narrow possible interval that contains of the posterior density. It can be obtained by using the Markov chain Monte Carlo method [34] and only requires a sample generated from the marginal posterior distributions of parameters p and . In the simulation and computation, the HPD intervals were computed by using the HDInterval package version 0.2.2 [35] in the R programming suite, as covered in Algorithm 3.
| Algorithm 2 Establishing the Bayesian CIs for parameters p and . |
|
| Algorithm 3 Establishing the HPD interval. |
|
4. Simulation Study
The performance of the ZOICG model in Equation (15) was measured by using Monte Carlo simulation. The sample size was set to n = 30, 50, or 100 with the proviso that . The probability of real zeros () was set to 0.1 or 0.2. The probability of real ones () was set to 0.1, 0.2, or 0.3 while was set to (0.5,0.7) or (0.9,1.5), and the nominal confidence level was set to 0.95. All of the simulations were run 3000 times, and the samples were generated by using the Metropolis–Hastings algorithm with 10,000 samples and 3000 burn-ins. The criteria for comparing the efficiencies of the CIs were their CPs and ALs. For a particular scenario, the CI with a CP close to or greater than the nominal level of 0.95 and the shortest AL performed the best.
The CP and AL results are reported in Table 1 and Table 2, respectively. For the interval estimation of the parameter , the CPs of all five methods were close to the nominal level of 0.95, especially when the sample size was large. For the interval estimation of the parameter , an equal-tailed, two-sided Bayesian CI based on the uniform prior provided CPs equal to or more than 0.95 in almost all cases, but when the sample size was large (), the CPs for all of the methods were similarly close to the nominal level of 0.95. For the interval estimation of parameter p, when () was (0.5,0.7), only the equal-tailed two-sided Bayesian CI and HPD interval based on the uniform prior performed well (CP > 0.95), with the HPD interval based on the uniform prior having the shortest AL. However, when () was (0.9,1.5), all five methods performed similarly. For the interval estimation of , when () was (0.5,0.7), all four Bayesian methods performed well (CP > 0.95), with the HPD interval based on the uniform prior providing the shortest AL. When () was (0.9,1.5), none of the methods produced CPs equal to or more than 0.95, with the Wald CI providing the worst performance. In general, the ALs decreased when the sample size was increased and when () was increased from (0.5,0.7) to (0.9,1.5).
Table 1.
Coverage probability of parameter estimation for model in Equation (15).
Table 2.
Average length of parameter estimation for model in Equation (15).
5. The Efficacies of the Methods with Real Data
The number of new COVID-19 cases and deaths reported each day by country is available from the European Centre for Disease Prevention and Control (ECDC; accessed on 13 September 2022 https://www.ecdc.europa.eu/en/publications-data/data-daily-new-cases-covid-19-eueea-country). In this empirical study, the number of new COVID-19 deaths per day in Luxembourg from 24 February 2020 to 31 December 2020, which contained 312 days, 167 days with 0 deaths, and 47 days with 1 death, are presented in Table 3 and Figure 1. From the descriptive statistics for the dataset (Table 4), the mean and variance were 1.6314 and 5.9570, respectively, which were used to calculate the index of dispersion (the variance divided by the mean) to be 3.6514. Since the index of dispersion was larger than one, this dataset was clearly overdispersed. The appropriate model was checked by comparing the AIC and corrected AIC (AICc) values of nine distributions: ZOICG, ZICG, CG, ZIP, ZIG, ZINB, Poisson, geometric, and NB (Table 5). Those of the ZOICG were very similar (1038.372 and 1038.502) and the lowest, thereby inferring that it provided the best fit for the data.
Table 3.
The number of COVID-19 daily new deaths in Luxembourg in 2020.
Figure 1.
The number of daily new deaths in Luxembourg for 2020.
Table 4.
Descriptive statistics.
Table 5.
The log likelihood, AIC, and AICc values of different models.
The CIs for the parameters of the ZOICG model obtained by using the five methods are provided in Table 6. The lower and upper bounds for parameters and provided by all of the methods were similar, with the lower bound for the parameter p via the Wald CI being slightly lower and the Wald-based CI for the parameter being much lower than the others.
Table 6.
Estimation of the number of COVID-19 daily new deaths in Luxembourg in 2020.
6. Discussion
In this paper, a novel four-parameter discrete distribution called the ZOICG distribution is proposed, and its statistical properties are derived. As the model in Equation (6) has a complex pmf, it is more difficult to calculate parameters and by using the MLEs than by reparameterizing, and so the ZOICG model in Equation (15) is needed. This model requires that parameters and are independent of parameters p and , which means that the proportion of data containing zeros and ones should be treated separately from the rest for the convenience of estimation. From the simulation results, it is clear that all of the methods could detect zeros and ones ( and ) with coverage probabilities close to the nominal level. The MLEs of parameters p and did not have a closed form, so the Newton-type algorithm was applied to solve them numerically. However, the application of this algorithm was not appropriate, especially for parameter . Hence, Bayesian analysis was required in this study. However, the derivations of the Bayesian estimates for parameters p and were still complex and difficult to estimate, since their marginal posterior distributions did not have a closed form. Hence, the random walk Metropolis–Hastings steps within a Gibbs sampling algorithm were applied to generate samples for these two parameters.
The simulation results show that the Bayesian methods performed better than the Wald CI based on the MLE. Moreover, the Bayesian method based on the uniform prior was more efficient than the one based on the Jeffreys prior. Since the optimization method is sometimes unsuitable for a model that is complex, the Wald-based CI provided the worst estimates. The efficacy of the proposed ZOICG model for analyzing real data containing excess zeros and ones (new daily COVID-19 deaths in Luxembourg in 2020) was excellent, and so it is recommended in these circumstances. In future research, some characteristics and properties of the ZOICG distribution and statistical methods for estimating the parameters of this model should be further investigated.
7. Conclusions
The proposed ZOICG distribution with reparameterization is useful for analyzing datasets with excess zeros and ones. It can detect the proportion of zeros and ones by using five methods that provide CPs close to the nominal level. However, the other parameters of the model (p and ) are difficult to estimate, since their MLEs and their marginal posterior distributions do not have closed forms. From the simulation results, the Wald CI provided unsuitable CPs for the parameter under all scenarios tested. Hence, the Metropolis–Hastings steps within a Gibbs sampling algorithm were used to estimate them. The results show that the Bayesian method based on the uniform prior performed better than the others in most situations, and so it is recommended for estimating the CIs for the parameters of the ZOICG model.
In the application study, we showed that the proposed ZOICG distribution is appropriate for analyzing datasets with excess zeros and ones, such as the number of new daily COVID-19 deaths in Luxembourg in 2020. Thus, the ZOICG model is suitable for analyzing overdispersed count data, especially those with a high index of dispersion. Furthermore, the proposed model performed well with the Bayesian-derived CI based on a uniform prior in estimating the parameters of the ZOICG model in both the simulation and application studies. Other statistical parameters, such as the mean, will be investigated in future research because they play an important role in statistical inference. There have not been many research articles published on CIs for the mean of a zero-and-one inflated population [25]. Therefore, we will investigate this in the near future.
Author Contributions
Conceptualization, S.-A.N. and S.N.; methodology, S.J.; software, S.J.; validation, S.J., S.-A.N., and S.N.; formal analysis, S.J.; investigation, S.J.; resources, S.N.; data curation, S.J.; writing—original draft preparation, S.J.; writing—review and editing, S.-A.N. and S.N.; visualization, S.N.; supervision, S.-A.N.; project administration, S.-A.N.; funding acquisition, S.-A.N. All authors have read and agreed to the published version of the manuscript.
Funding
This research received funding support from the National Science, Research and Innovation Fund (NSRF), and King Mongkut’s University of Technology North Bangkok (KMUTNB) (Grant No. KMUTNB-FF-65-22).
Data Availability Statement
The numbers of COVID-19 new cases and deaths reported per day and per country in the EU and EEA by the European Centre for Disease Prevention and Control (ECDC) are available at https://www.ecdc.europa.eu/en/publications-data/data-daily-new-cases-covid-19-eueea-country (accessed on 13 September 2022).
Acknowledgments
The authors would like to thank the academic editor and the referees for their helpful comments on our manuscripts. The first author acknowledges the generous financial support from the Science Achievement Scholarship of Thailand (SAST).
Conflicts of Interest
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
Abbreviations
The following abbreviations are used in this manuscript:
| CI | Confidence interval |
| CP | Coverage probability |
| AL | Average length |
| B.uniform | The dual-tail Bayesian confidence interval with the uniform prior |
| B.Jeffreys | The dual-tail Bayesian confidence interval with the Jeffreys prior |
| HPD.uniform | The highest posterior density interval with the uniform prior |
| HPD.jeffreys | The highest posterior density interval with the Jeffreys prior |
| ZICG | Zero-inflated cosine geometric |
| ZOICG | Zero-and-one inflated cosine geometric |
| ZIP | Zero-inflated Poisson |
| ZIG | Zero-inflated geometric |
| ZINB | Zero-inflated negative binomial |
References
- Lee, C.; Kim, S. Applicability of zero-inflated models to fit the torrential rainfall count data with extra zeros in south korea. Water 2017, 9, 123. [Google Scholar] [CrossRef]
- Böhning, D.; Dietz, E.; Schlattmann, P.; Mendonca, L.; Kirchner, U. The zero-inflated poisson model and the decayed, missing and filled teeth index in dental epidemiology. J. R. Stat. Soc. Ser. A Stat. Soc. 1999, 162, 195–209. [Google Scholar] [CrossRef]
- Ashburn, A.; Fazakarley, L.; Ballinger, C.; Pickering, R.; McLellan, L.D.; Fitton, C. A randomised controlled trial of a home based exercise programme to reduce the risk of falling among people with parkinsonas disease. J. Neurol. Neurosurg. Psychiatry 2006, 78, 678–684. [Google Scholar] [CrossRef] [PubMed]
- Junnumtuam, S.; Niwitpong, S.-A.; Niwitpong, S. The bayesian confidence interval for coefficient of variation of zero-inflated poisson distribution with application to daily covid-19 deaths in Thailand. Emerg. Sci. J. 2021, 5, 62–76. [Google Scholar] [CrossRef]
- Kusuma, R.D.; Purwono, Y. Zero-inflated poisson regression analysis on frequency of health insurance claim pt.xyz. In Proceedings of the 12th International Conference on Business and Management Research, Bali, Indonesia, 7–8 November 2018. [Google Scholar]
- Simonoff, J.S. Analyzing Categorical Data; Springer Texts in Statistics; Springer: Berlin/Heidelberg, Germany, 2003. [Google Scholar]
- Lambert, D. Zero-inflated poisson regression, with an application to defects in manufacturing. Technometrics 1992, 34, 1–14. [Google Scholar] [CrossRef]
- Zhang, C.; Tian, G.; Ng, K. Properties of the zero-and-one inflated Poisson distribution and likelihood-based inference methods. Stat. Interface 2016, 9, 11–32. [Google Scholar] [CrossRef]
- Melkersson, M.; Olsson, C. Is Visiting the Dentist a Good Habit? Analyzing Count Data with Excess Zeros and Excess Ones; Umea Economic Studies; Umea University: Umea, Sweden, 1999. [Google Scholar]
- Tang, Y.; Liu, W.; Xu, A. Statistical inference for Zero-and-one-inflated Poisson models. Stat. Theory Relat. Fields 2017, 1, 216–226. [Google Scholar] [CrossRef]
- Liu, W.; Tang, Y.; Xu, A. A zero-and-one inflated Poisson model and its application. Stat. Interface 2018, 11, 339–351. [Google Scholar] [CrossRef]
- Liu, W.; Tang, Y.; Xu, A. Zero-and-one-inflated Poisson regression model. Stat. Pap. 2019, 62, 915–934. [Google Scholar] [CrossRef]
- Xiao, X.; Tang, Y.; Xu, A.; Wang, G. Bayesian inference for zero-and-one-inflated geometric distribution regression model using Pólya-Gamma latent variables. Commun. Stat. Theory Methods 2020, 49, 3730–3743. [Google Scholar] [CrossRef]
- Jornsatian, C.; Bodhisuwan, W. Zero-one inflated negative binomial–beta exponential distribution for count data with many zeros and ones. Commun. Stat. Theory Methods 2021, 1–5. [Google Scholar] [CrossRef]
- Kadane, J.B.; Krishnan, R.; Shmueli, G. A data disclosure policy for count data based on the com-Poisson distribution. Manag. Sci. 2006, 52, 1610–1617. [Google Scholar] [CrossRef]
- Low, Y.C.; Ong, S.H.; Gupta, R.C. Generalized Sichel distribution and associated inference. J. Stat. Theory Appl. 2017, 16, 322–336. [Google Scholar] [CrossRef]
- Karlis, D. EM algorithm for mixed Poisson and other discrete distributions. ASTIN Bull. 2005, 35, 3–24. [Google Scholar] [CrossRef][Green Version]
- Gomez-Deniz, E.; Calderın-Ojeda, E. The Poisson-conjugate Lindley mixture distribution. Commun. Stat. Theory Methods 2006, 45, 2857–2872. [Google Scholar] [CrossRef]
- Tajuddin, R.R.; Ismail, N.; Ibrahim, K.; Bakar, S.A. A New Zero–One-Inflated Poisson–Lindley Distribution for Modelling Overdispersed Count Data. Bull. Malays. Math. Sci. Soc. 2022, 45, 21–35. [Google Scholar] [CrossRef]
- Diekmann, A. Ein einfaches stochastisches modell zur analyse von haufigkeitsverteilungen abweichenden verhaltens. Z. Soziol. 1981, 10, 319–325. [Google Scholar] [CrossRef]
- Morgan, B.J.T.; Palmer, K.J.; Ridout, M.S. Negative score test statistics. Am. Stat. 2007, 61, 285–288. [Google Scholar] [CrossRef]
- Mohammadi, Z.; Sajjadnia, Z.; Bakouch, H.S.; Sharafi, M. Zero-and-one inflated Poisson–Lindley INAR(1) process for modelling count time series with extra zeros and ones. J. Stat. Comput. Simul. 2022, 92, 2018–2040. [Google Scholar] [CrossRef]
- Chesneau, C.; Bakouch, H.; Hussain, T.; Para, B. The cosine geometric distribution with count data modeling. J. Appl. Stat. 2021, 48, 124–137. [Google Scholar] [CrossRef]
- Junnumtuam, S.; Niwitpong, S.-A.; Niwitpong, S. Bayesian Computation for the Parameters of a Zero-inflated Cosine Geometric Distribution with Application to COVID-19 Pandemic Data. CMES Comput. Model. Eng. Sci. 2022, in press. [Google Scholar] [CrossRef]
- Tian, W.; Liu, T.; Ning, W. Jackknife empirical likelihood for the mean of a zero-and-one inflated population. Commun. Stat. Theory Methods 2022, 1–15. [Google Scholar] [CrossRef]
- Srisuradetchai, P.; Junnumtuam, S. Wald confidence intervals for the parameter in a bernoulli component of zero-inflated poisson and zero-altered poisson models with different link functions. Sci. Technol. Asia 2020, 25, 1–14. [Google Scholar]
- Waguespack, D.; Krishnamoorthy, K.; Lee, M. Tests and confidence intervals for the mean of a zero-inflated poisson distribution. Am. J. Math. Manag. Sci. 2020, 39, 383–390. [Google Scholar] [CrossRef]
- R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2022; Available online: https://www.R-project.org/ (accessed on 15 June 2022).
- Jeffreys, H. An Invariant Form for the Prior Probability in Estimation Problems. Proc. R. Soc. Lond. Ser. A Math. Phys. Sci. 1946, 186, 453–461. [Google Scholar]
- Jeffreys, H. Theory of Probability, 3rd ed.; Oxford University Press: Oxford, UK, 1961. [Google Scholar]
- Cameron, A.C.; Trivedi, P.K. Regression Analysis of Count Data; Cambridge University Press: New York, NY, USA, 2013. [Google Scholar]
- Bilder, C.R.; Loughin, T.M. Analysis of Categorical Data with R; CRC Press: Boca Raton, FL, USA, 2014; p. 468. [Google Scholar]
- Givens, G.H.; Hoeting, J.A. Computational Statistics; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2005. [Google Scholar]
- Chen, M.-H.; Shao, Q.-M. Monte carlo estimation of bayesian credible and hpd intervals. J. Comput. Graph. Stat. 1999, 8, 69–92. [Google Scholar]
- Meredith, M.; Kruschke, J. HDInterval: Highest (Posterior) Density Intervals. R package version 0.2.2. 2020. Available online: https://CRAN.R-project.org/package=HDInterval (accessed on 1 July 2022).
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
