1. Introduction
The trapezoidal distribution has been applied to various risk analysis problems [
1,
2]. However, studies on trapezoidal distributions remain relatively limited. Van Dorp and Kotz [
3] introduced the Generalized Trapezoidal (GT) distribution, which models phenomena involving growth, stability, and physical or mental decline. Kacker and Lawrence [
4] proposed the Type-B trapezoidal distribution, analyzing key properties such as the probability density function (PDF), cumulative distribution function (CDF), inverse CDF, moment-generating function (MGF), expected value, and variance. Additionally, Lawrence et al. [
5] explored cases where the marginal distribution follows a trapezoidal form in greater detail.
Trapezoidal distributions are important in describing real-world phenomena that go through three stages, such as biological growth, economic cycles, and reliability assessments. These distributions capture the general shape of a process but are constrained by their linear structure. The second stage assumes complete stability, which may not accurately reflect real scenarios where gradual increases or decreases occur.
Despite their applicability, previous studies on trapezoidal distributions are limited in number and scope. Existing research has primarily focused on defining their properties rather than developing estimation methods or expanding their flexibility. Recognizing this gap, our study aims to introduce the Fragmented Kumar-Trapez (FKT) distribution, which enhances the traditional trapezoidal model by incorporating additional parameters. These parameters allow for a more versatile representation of different three-stage phenomena, accommodating nonlinear transitions in the stability stage.
Furthermore, this study presents rigorous estimation techniques for the FKT distribution, utilizing Maximum Likelihood and Bayesian methods. By evaluating the efficiency and accuracy of these methods through simulation studies, we demonstrate the practical advantages of the proposed model. This contribution extends the applicability of trapezoidal-based distributions, offering a more refined statistical tool for analyzing complex three-stage processes.
Our interest in trapezoidal distributions and their modifications stems mainly from the idea that many physical processes in nature, the human body, and mind over time reflect the shape of the trapezoidal distribution. Therefore, in this research, we strived to develop the trapezoidal distribution into a more flexible distribution and present it as one of the trapezoidal distributions.
The trapezoidal distribution has been used to detect cancer [
6,
7]. It is represented by three stages: the first stage, the growth stage; the second stage, the stability stage; and the third stage, the declining stage (decay or fading). It takes the form of a quadrilateral trapezoid with two parallel sides and two non-parallel sides, which tend to be suitable for data that show a fairly rapid growth and settlement period and then somewhat rapid disappearance [
8]. These stages are all linear. The first stage increases with a positive slope, the settlement period is fixed, and the last stage decreases with a negative slope, as shown in
Figure 1.
The PDF of the trapezoidal distribution is defined as follows:
and the CDF of the trapezoidal distribution is defined as follows:
where
,
,
,
.
In this article, the trapezoidal distribution is developed to another distribution using the Kumaraswamy [
9] method, based on cumulative distribution function (CDF):
, where
, the CDF for the trapezoidal distribution.
In the rest of this article, the Fragmented Kumar-Trapez distribution with some mathematical properties and reliability measures is addressed in
Section 2. The ideas of estimating parameters for the proposed distribution and for the trapezoidal distribution are addressed in
Section 3.
Section 4 reports the results of an MCMC simulation study. In
Section 5, an illustrated example is provided to demonstrate the applicability of the proposed distribution to real-world data. Finally,
Section 6 summarizes the conclusions and suggests future research directions.
2. Fragmented Kumar-Trapezoidal Distribution
Depending on the Kumaraswamy method, using the probability density function (PDF) in Equation (1) and the cumulative distribution function (CDF) in Equation (2), the PDF of the Fragmented Kumaraswamy Trapezoidal (FKT) distribution is defined as follows:
where
.
It has been proven that the function in Equation (3) is a probability density function that is as follows:
The FKT distribution will often be abbreviated
, and its behavior is studied at different values of
with fixed
and with different values of
with fixed
. It took multiple shapes at different values of the parameters
such as: left and right triangular shape at
, pyramid shape at
,
, twisted curve shape to the right at
,
, and has taken the moderate distribution shape at
. Based on those forms, the FKT distribution can be represented by three stages, see
Figure 2 and
Figure 3. These stages differed by changing the values of
and
. Each of these three stages can be seen in three stages (growth—growth and fading—fading) as can be seen in
Figure 4.
2.1. Reliability Measures
If
is a random variable that follows FKT distribution, then CDF, survival function
, hazard function
, cumulative hazard function
of the random variable
are defined as follows:
The random numbers of the FKT distribution are as follows:
where
Equation (4) does not have a closed form solution, so
will be created as uniform random variables from
, and then solve for
in order to create random numbers from the FKT distribution. From Equation (4), the quantile
of the FKT distribution can be obtained.
Figure 5,
Figure 6 and
Figure 7 show the shape of the cumulative distribution function (CDF), the shape of the survival function, and the shape of the risk function for the FKT distribution.
2.2. Arithmetic Mean, Variance, and Random Number Generation
The mean of the FKT distribution is defined as follows:
The 2nd row moment of the FKT distribution is defined as follows:
where
.
Depending on the mean and the 2nd row moment, the variance of the FKT distribution is defined as follows:
where
and
an incomplete beta function defined as follows:
The quantile function
of the FKT distribution is defined as follows:
2.3. Inferences of Fragmented Kumaraswamy Trapezoid Parameters
The parameters of the FKT distribution are related to three different analytical expressions appearing in the definition of the structure of the FKT distribution. The derivation of the ML estimators for the parameters seems to be similar but more complicated compared to the one presented in Johnson and Kotz [
10]. They dealt with two different analytical expressions of the structure of the standard triangular distribution. For more details, see [
11]. Let
an independently and identically distributed (iid) random sample of size
and the order statistics be
, then the likelihood and the log likelihood function of the STK PDF with parameters
are as follows:
Above are implicitly defined by , and .
The ML estimator of
and
maximizing the likelihood in Equation (6) is
, where
such that,
with
.
By taking the partial derivative of
with respect to
, the closed form of
is obtained as follows:
Unfortunately, to obtain
, the computation of
cannot be performed by solving the normal equations for
then it will be solving numerically, such that:
Let
, the Fisher information (FI) matrix of
and
, then the observed FI matrix is as follows:
where
Therefore, the estimated variance–covariance matrix of
and
is as follows:
The asymptotic distributions of and are used to construct the asymptotic CIs of and . Thus, a confidence intervals of and are as follows: and .
3. Bayesian Estimates
The Bayesian method is based on the assumption that the parameter is a random variable with a probability distribution that expresses our previous information about the parameter . This distribution is called the prior distribution because it depends on the researcher’s beliefs and previous experience with the parameter before the data were collected or before the sample was obtained. There are prior distributions and non-prior distributions. To apply the Bayesian method, we must have the posterior distribution, which is represented by the function that contains all the information contained in the likelihood function, in addition to the information about the parameter . That is, we are looking for a function that includes all the information about the parameter after obtaining the sample. The posterior distribution of the parameter is a conditional probability distribution of the parameter on the condition of obtaining the sample. That is, it describes the degree of our belief in the different values of the parameter after obtaining the sample.
The posterior distribution in Bayesian analysis replaces the likelihood function, in that it includes all the information about the parameter
before obtaining the sample and the information contained in the sample about the parameter
. Therefore, if we want to estimate the parameter
, we can use the same method that we followed with the likelihood function. That is, we can estimate the parameter
with the value that maximizes the posterior distribution. Since the posterior distribution is a probability distribution, the distribution can be summarized by one of the measures of central tendency, that is, we can estimate the parameter by its expected value or the median of the posterior distribution [
8]. Therefore, the parameters of the Kumaraswamy trapezoid distribution will be estimated assuming that the prior distribution accompanying the parameter
is the exponential distribution with the parameter
, the gamma distribution with two parameters
, and the uniform distribution.
3.1. Bayesian Estimates of the Shape Parameter
The likelihood function in Equation (5) can be re-written as follows:
Case 1:
Assume that the prior of
is exponential
, i.e.,
. Then, the posterior density function of
is as follows:
Therefore, the Bayesian estimator
of the shape parameter
is as follows:
where
Case 2:
Assume that the prior of
is gamma
, i.e.,
. Then, the posterior density function of
is as follows:
Therefore, the Bayesian estimator
of the shape parameter
is as follows:
where
Case 3:
Assume that the prior of
is uniform
, i.e.,
. Then, the posterior density function of
is as follows:
Therefore, the Bayesian estimator
of the shape parameter
is as follows:
where
3.2. Bayesian Estimates of the Shape Parameter
Case 1:
Assume that the prior of
is exponential
, i.e.,
. Then, the posterior density function of
is as follows:
Therefore, the Bayesian estimator
of the shape parameter
is as follows:
Case 2:
Assume that the prior of
is Weibull
, i.e.,
. Then, the posterior density function of
is as follows:
Therefore, the Bayesian estimator
of the shape parameter
is as follows:
Case 3:
Assume that the prior of
is uniform
, i.e.,
. Then, the posterior density function of
is as follows:
Therefore, the Bayesian estimator
of the shape parameter
is as follows:
Given the complexity of evaluating the integrals in Equations (14) and (15) for Bayesian estimation of the unknown parameters, we employ numerical simulations via Markov Chain Monte Carlo (MCMC) integration. Specifically, we utilize the WinBUGS 1.4 software to estimate the expected values of the unknown parameters. The MCMC algorithm is particularly effective for addressing challenges associated with posterior distributions, especially in cases where explicit solutions to posterior integrals are difficult to obtain [
12]. This algorithm constructs a Markov chain that converges to a stationary distribution, allowing for the generation of samples from the posterior distribution
. A key technique within MCMC is Gibbs sampling, which iteratively draws conditional samples, ultimately enabling statistical inference about the parameters based on the generated samples [
13].
The Gibbs sampling method is one of the simplest and most widely used techniques within the MCMC framework. The Gibbs sampling procedure follows these steps:
Let represent the posterior distribution of the unknown parameter vector . Given an initial set of parameter values , the sampling process proceeds as follows:
Step 1: The parameter is sampled from the conditional distribution .
Step 2: The parameter is sampled from the conditional distribution .
Step k: The parameter is sampled from the conditional distribution .
These steps (from Step 1 to Step
) are repeated
times, generating a Markov chain:
When
is sufficiently large, the sampled values are considered simulated observations from the marginal posterior distribution of the parameter. The sample mean of these values serves as the Bayesian estimate of the parameter [
14].
4. Simulation Study
This section illustrates the application of the various theoretical results developed in the previous sections on basis of generated data from FKT distribution as well as to assess the performance of SKT-parameters estimation for eleven different simulated random samples of sizes ranging from 10 to 20,000 data units by ML and Bayesian methods. The computations are performed using WinBUGS 1.4, MatheCad15, and R programs. MLEs for the unknown parameter
were obtained for five different random samples, and WinBUGS 1.4 program was used to perform a Monte Carlo. The iterations continue until the Monte Carlo standard error value is less than 5% [
15].
Table 1 summarizes the calculations of the ML estimates (MLEs) of the shape parameters with some measurements. The estimations of
are estimated using Equations (7)–(9) at the initial values
c0,
d0, and for more explained, graphs of
and
defined by Equations (6)–(8) as shown in
Figure 8. It shows that, for example at
, the maximum value of
is the value corresponding to
, that is,
, and this leads us to that
and accordingly
. The maximum value of
is the value corresponding to
, that is,
, and this leads us to that
and accordingly
, see
Figure 8 and so on for the remaining samples.
As can be seen from
Table 2 and
Table 3 that the estimates of both parameters are reasonable for all considered random samples. The results show the stability of these estimated around the actual parameter values by which the data have been simulated, especially for the large sample sizes. It is clear from the results that the parameters’ estimates are reasonable for all the random samples studied. The results show the stability of the values of these estimators, especially for large sample sizes. Note that the value of the Monte Carlo standard error is small and decreases as the sample size increases.
Monte Carlo simulation was conducted to simulate Bayesian estimates of the unknown parameter (
) using the WinBUGS 1.4 program at different sample sizes. The generated data and the estimated values of (
), and the estimated values of the parameters of the range of the random variable for the KT distribution (
) and three prior distributions (exponential–gamma–uniform) were used at each sample size to obtain Bayesian estimates of the unknown parameter (
). The Markov chain began by relying on different initial values for the unknown parameter (
) and three prior distributions, as shown in
Table 4. The Markov chain continues for a number
of iterations at different sample sizes.
Table 4 shows the number of iterations that were run for each Markov chain. The iterations continue until the Monte Carlo standard error value is less than 5% [
15].
It is clear from the results of
Table 4 that the parameter estimates are good for the random samples studied. The results show the stability of the parameter estimates through the simulated data, especially for large sample sizes, as we note that the value of the Monte Carlo standard error is small and decreases as the sample size increases. Noted also that Bayesian estimates of the parameters in the case of the prior exponential distribution were the best compared to the estimates in the cases of Weibull and uniform prior distributions. Also, Bayesian estimates in the cases of Weibull and exponential prior distributions were the best as they provided the least error compared to the maximum likelihood method. While in the case of a uniform prior distribution, the results showed that the maximum likelihood method is the best as it provided the least error compared to the Bayesian method.
The following plots are examples of the Weibull prior at :
- (i)
The trace plots for 20,000 iterations are presented in
Figure 9.
- (ii)
Chains for which convergence looks reasonable. Gelman–Rubin convergence statistic, R, is introduced.
Figure 10 shows that the Gelman–Rubin statistic has converged, i.e., R is close to one [
16].
- (iii)
The shape of the posterior density of the parameter
is shown in
Figure 11.
- (iv)
The accuracy of the posterior estimate is calculated in terms of Monte Carlo standard error (MC error) of the mean according to Gelman et al. [
8]. The simulation is run until the MC error for each node is less than 0.05 of the sample standard deviation. Therefore, the rule of MC error has been achieved.
Table 3 and
Table 4 show that the MC error for each node is less than 0.05 of the sample standard deviation.
- (v)
Quantiles: The running mean with 95% confidence intervals plotted against the iteration number is shown in
Figure 12.
It is clear from the results of
Table 5 that the parameter estimates are good for the random samples studied, and the results show the stability of the parameter estimates, especially for large sample sizes. As we note that the value of the MC error is small and decreases as the sample size increases. Also, we found that the Bayesian estimates of the parameter in the case of the prior exponential distribution were the best compared to the estimates in the cases of the prior Weibull and the prior uniform distributions. In addition, the Bayes estimates in the cases of the prior Weibull and the prior exponential distributions were the best as they provided the minimum MC error compared to the maximum likelihood method. While in the case of a prior uniform distribution, the results showed that the maximum likelihood method is the best as it provided the minimum MC error compared to the Bayes method in the case of a prior uniform distribution, see
Figure 13.
5. Illustrated Example
The data were taken from Lawless [
17]. These data represent the ages of a random sample of 10 pieces of industrial equipment, measured in months. The data are as follows: 81, 72, 70, 60, 41, 31, 31, 30, 29, 21. The results showed that the data fit the Kumaraswamy trapezoid distribution using the Kolmogorov–Smirnov test, where the test value was (0.27434), and it is less than the critical value (0.54179) at a significance level (α = 0.01). The ML of the random variable range parameters of the Kumaraswamy trapezoidal distribution was estimated using the initial values
, and the calculations for
and
are summarized in
Table 6.
We noted from the results of
Table 6 that the maximum value of
correspond to the value
, that is,
, and this leads us to that
, and therefore,
. Also, the maximum value of
correspond to the value
, that is,
, and this leads us to that
, and therefore,
, and this is clear from
Figure 14.
Figure 15 shows that the Kumaraswamy trapezoidal function is considered more flexible than the trapezoidal distribution. As the Kumaraswamy trapezoidal function took the form of a model that goes through three stages (implementation and propagation stage—slow decline stage—fading stage) and is not limited to linear shapes as is the case in the trapezoidal function in
Figure 1.
It is clear from the results of
Table 7 that the parameter estimates are good for the real data, as we note that the value of the Monte Carlo standard error is small. The trace plots for 50,000 iterations are presented in
Figure 16.
Figure 17 shows that the Gelman–Rubin statistics are believed to have converged, and
Figure 18 shows the shape of the posterior density of the parameter
. Also, plotting the running mean with running 95% confidence intervals against iteration number is shown in
Figure 19.
6. Conclusions
In this article, a new form of trapezoidal distribution is discussed, which apparently did not find any interest of previous studies on such distributions. This article is the first that discusses and deals with this distribution. Trapezoidal distribution was developed to the Fragmented Kumar-Trapez (FKT) distribution, its parameters are related to three different analytical expressions appearing in the definition of the structure of the KT distribution. Some mathematical properties, reliability measures, and maximum likelihood and Bayesian FKT parameters estimation are presented.
It is noted that trapezoidal distribution takes the form of a quadrilateral trapezoid with two parallel sides and two non-parallel sides and is represented by three stages, all are linear: the first stage increases with a positive slope, the settlement period is fixed, and the last stage decreases with a negative slope.
On the other hand, the new FKT distribution appears to be more flexible than the trapezoidal distribution; it took multiple shapes at different values of the parameters (left and right triangular shape—pyramid shape—twisted curve shape to the right—moderate distribution shape). Based on those forms, the KT distribution can be represented by three stages, differing by changing the values of . So, it is reflected in the shape of a phenomenon that can be represented by three stages that are not limited to linear shapes but rather showed growth and dissolution, convex, nonlinear, or concave. To assess the performance of FKT parameters estimation, generated data were created from FKT distribution for different simulated random samples. The results show that the estimates of FKT parameters are reasonable for all considered random samples, with stability of these estimated around the actual parameter values as the sample sizeincreases.
The results also showed that the parameter estimates are good, and that the value of the MC error is small and decreases as the sample size increases. It also turns out that the Bayesian estimates in the case of the prior exponential distribution are better compared to the estimates in the case of the prior Weibull distribution and the prior uniform distribution. Also, the Bayesian estimates in the case of the prior Weibull distribution and the prior exponential distribution were the best, as they provided minimum MC error compared to the maximum likelihood method. While in the case of the prior uniform distribution, the results showed that the maximum likelihood method is the best as it provided minimum MC error compared to the Bayes method in the case of the prior uniform distribution.
Through the results of this article, it can be recommended to expand the use of different estimation methods and make comparisons between them; conduct further research to develop similar distributions whose probability density function is partitioned into two parts and three or more; also, complete the derivation of other properties of the Kumaraswamy trapezoidal distribution. In addition, we recommend using the general idea of Johnson and Kotz [
10] to estimate the parameters of the range of the random variable for segmented distributions. Finally, we recommend conducting further research and developing similar distributions, especially those whose probability density function is divided into two or more parts.