Next Article in Journal
Operatorial Formulation of a Model of Spatially Distributed Competing Populations
Previous Article in Journal
On the Use of a Non-Constant Non-Affine or Slip Parameter in Polymer Rheology Constitutive Modeling
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On the Hybrid Normal Distribution and Its Application in Fiber Data on the Strength of Glass

by
Damodaran Santhamani Shibu
1,
Soman Latha Nitin
1,
Christophe Chesneau
2,*,
Muhammed Rasheed Irshad
3,
Sobhanam Padmini Shibin
4 and
Radhakumari Maya
1
1
Department of Statistics, University College, Thiruvananthapuram 695 034, India
2
LMNO, UFR de Sciences, Université de Caen Basse-Normandie, F-14032 Caen, France
3
Department of Statistics, Cochin University of Science and Technology, Cochin 682 022, India
4
National Center for Aquatic Animal Hospital, Lakeside Campus, Cochin University of Science and Technology, Cochin 682 022, India
*
Author to whom correspondence should be addressed.
Dynamics 2022, 2(4), 399-413; https://doi.org/10.3390/dynamics2040023
Submission received: 10 October 2022 / Revised: 15 November 2022 / Accepted: 16 November 2022 / Published: 18 November 2022

Abstract

:
The hybrid normal (HN) distribution is a new generalization of the normal distribution that we introduce and study in this article. Its mathematical foundation is based on the logarithmically transformed version of the famous hybrid log-normal (HLN) distribution, which is an unexplored direction in the literature. We emphasize the applicability of the HN distribution with the aim of fitting versatile data, such as, in this paper, fiber data on the strength of glass. In particular, the unknown parameters are estimated using both Bayesian and maximum likelihood estimation approaches, with Bayesian estimation carried out using the MCMC approach. A thorough simulation study is performed to determine the flexibility of the estimates’ performance. The glass fiber data are then analyzed, with an assessment of several existing distributions from the literature used to demonstrate how the HN distribution is relevant in this regard.

1. Introduction

The majority of real-life datasets resulting from complex processes are infrequently characterized and predicted adequately by standard fundamental distributions in practice. The choice of a flexible distribution for data analysis is crucial, as the perfection of results from statistical analysis greatly relies on the model that is adopted. In order to obtain statistical outputs of higher quality and greater accuracy, it is essential to identify more connected distributions. Several continuous distributions are readily accessible in the statistical literature for modelling real data. Among them, the skewed distributions have more strategic significance, and are frequently used in many areas of applied research. For this reason, numerous families of skewed distributions have been proposed. Azzalini (1985) (see [1]) introduced a method for introducing skewness to the normal (N) distribution. It should be noted that while the majority of skewed distributions have the positive real values set as the support, the selected based distribution has the full real line as the support. Subsequently, many authors have continued to develop this concept; for example, see [2,3].
In the same respect, we cite Coles (2001) [4], Castillo et al., (2005) [5], and Ferrari and Pinheiro (2012) [6], all of whom emphasize the Gumbel distribution (defined on the real line) for analyzing real datasets. As a fair competitor, [7] developed the log-Dagum (LD) distribution, which is derived by transforming the Dagum distribution logarithmically. The LD distribution has the feature of having leptokurtic shapes.
In order to create vast classes of standard distributions and pertinent statistical procedures that can be used as models for a variety of real-world problems, much effort has been put forth throughout the years. Real-world phenomena need to be introduced to new statistical distributions that are defined on the entire real line and have bimodal behaviour for the probability density function (pdf). As a consequence, in this paper, we offer a novel distribution called the hybrid N (HN) distribution, which can be described as a generalized version of the well-known N distribution. It can additionally be represented as the logarithmically transformed version of the hybrid log-N (HLN) distribution, as defined by the cumulative distribution function (cdf) provided in Equation (1). It has the interesting feature of having a bimodal pdf. The leading motivations behind the introduction of this new distribution are:
(i)
to develop a new generalization for the well-known N distribution, which continues to pique the interest of statisticians across the board;
(ii)
as mentioned, to investigate the characteristics of the HLN distribution’s logarithmically transformed version, which has surprisingly not received much attention in the statistical literature.
The practical viewpoint is considered as well; we aim to demonstrate how the HN distribution can be concretely used as a model for dealing with the analysis of important data, and show that it outperforms comparable well-known models. We examine an experimental dataset of glass fiber strength from the National Physical Laboratory in England as an application in this research. Because the HN distribution is an extended version of the well-known N distribution, which has numerous applications in physics, including various dynamical systems (see [8,9]), it is quite conceivable to employ the HN distribution for further physical applications.
The remainder of the present article is divided into several sections, and is structured as follows. We introduce the HN distribution and discuss its special cases and moments in Section 2. The various functions and moments related to reliability measures are discussed in Section 3. In Section 4, the unknown parameters of the new model are estimated using efficient procedures, namely, the maximum likelihood (ML) and Bayesian estimation procedures. A simulation study is conducted in Section 5 to analyze the performance and flexibility of the ML estimates (MLEs). Additionally, Section 6 presents a parametric bootstrap simulation technique utilizing these estimates. Section 7 illustrates the application of the HN model based on a real-life dataset. The final concluding remarks are presented in Section 8.

2. The Hybrid Normal Distribution

For fitting data on occupational radiation exposure, Kumazawa and Numakunai [10] introduced a new distribution in 1981 called the HLN distribution. An absolutely continuous random variable (acrv) Y is said to follow an HLN distribution if its cdf is indicated as
H Y ( y ) = 0 y 1 2 π α u + σ e 1 2 ( α log u + σ u + μ ) 2 d u ,
and the pdf is provided by
h Y ( y ) = 1 2 π α y + σ e 1 2 ( α log y + σ y + μ ) 2 ,
where y > 0 , α 0 , σ 0 , α + σ > 0 , and μ R . For σ = 0 , the pdf in Equation (2) reduces to the well-known two-parameter log-N distribution denoted as LN ( μ α 1 , α 2 ) . Although the HLN distribution might seem simple, there do not seem to be many references to it in the statistical literature. A scalar function was defined in [11], showing a hybrid pdf made up of p N and q log-N variants. The hybrid multivariate stochastic differential equation for stand basal area and volume models in a forest stand is presented in [12].
Here, the proposed three-parameter distribution is referred to as the hybrid N distribution, denoted as HN ( α , μ , σ ) or simply HN.
Definition 1.
If an acrv Y follows the HLN distribution as provided by its cdf and pdf in Equations (1) and (2), respectively, then the acrv X = log Y is said to follow the HN distribution. The cdf of X is specified by
F ( x ) = x 1 2 π ( α + σ e u ) e 1 2 ( α u + σ e u + μ ) 2 d u ,
which can be written as
F ( x ) = Φ ( α x + σ e x + μ ) ,
where Φ ( x ) is the cdf of the standard N (SN) distribution, conventionally denoted as N ( 0 , 1 ) . Then, the pdf of the acrv X is provided by
f ( x ) = 1 2 π ( α + σ e x ) e 1 2 ( α x + σ e x + μ ) 2 , x R ,
where α 0 , σ 0 , α + σ > 0 , and μ R .
Two extremely alluring characteristics instantly follow from Definition 1, which are as follows:
  • Property A: When σ = 0 , the pdf in Equation (4) reduces to the pdf of the N distribution with parameters μ α 1 and α 2 , i.e., N ( μ α 1 , α 2 ) , where μ is not the typical location parameter of the N distribution.
  • Property B: If the acrv X follows the HN distribution, then Y = e X has the HLN distribution.
Plots of the corresponding cdf and pdf of the HN distribution are shown in Figure 1 and Figure 2. It is possible to see from Figure 2 that the pdf is bimodal in nature, with a possible spike, which is an intriguing feature of this distribution.
Now, the moment generating function of the HN distribution is indicated as
M ( t ) = I * ( t ) 2 π ,
where
I * ( t ) = ( α + σ e x ) e t x 1 2 ( α x + σ e x + μ ) 2 d x ,
which is well-defined for all t R .
Theorem 1.
If X has the HN ( α , μ , σ ) distribution, then Y = | X | follows the half-HN distribution (HHN) with parameters α , μ , and σ, with the cdf and pdf provided by
G ( y ) = Φ ( α y + σ e y + μ ) Φ ( α y + σ e y + μ ) ,
and
g ( y ) = ( α + σ e y ) ϕ ( α y + σ e y + μ ) + ( α + σ e y ) ϕ ( α y + σ e y + μ ) ,
respectively, where y > 0 and ϕ ( x ) is the pdf of the SN distribution.
Proof. 
For any y > 0 , the cdf of Y is specified by
G ( y ) = P ( Y y ) = P ( | X | y ) = P ( y X y ) = F ( y ) F ( y ) = Φ ( α y + σ e y + μ ) Φ ( α y + σ e y + μ ) .
The pdf of the HHN distribution is provided by
g ( y ) = d d y G ( y ) = d d y Φ ( α y + σ e y + μ ) d d y Φ ( α y + σ e y + μ ) = ( α + σ e y ) ϕ ( α y + σ e y + μ ) + ( α + σ e y ) ϕ ( α y + σ e y + μ ) ,
which ends the proof.    □
It is significant to mention that the HHN distribution is not addressed in the current statistical literature. In a subsequent publication, we intend to investigate the various mathematical properties of this distribution.

3. Reliability Measures

Identification of a system’s important components and estimation of the effects of component failures are the fundamental goals of systems reliability analysis. Therefore, it is essential to derive the functions of the HN distribution’s reliability measures.
Assume that X is an acrv with pdf f ( x ) and cdf F ( x ) . Then, the general formulas for the survival, hazard rate, cumulative hazard rate and reversed hazard rate functions are S ( x ) = 1 F ( x ) , h ( x ) = f ( x ) / S ( x ) , R ( x ) = log [ S ( x ) ] , and r ( x ) = f ( x ) / F ( x ) , respectively.

3.1. Survival, Hazard Rate, Cumulative Hazard Rate, and Reversed Hazard Rate Functions

The survival function of the HN distribution is indicated as
S ( x ) = 1 Φ ( α x + σ e x + μ ) ,
where x R (as hereinafter).
Thus, the hazard rate function of the HN distribution is provided as
h ( x ) = ( α + σ e x ) e 1 2 ( α x + σ e x + μ ) 2 2 π [ 1 Φ ( α x + σ e x + μ ) ] .
The plots in Figure 3 portray the increasing nature of the hazard rate function.
The cumulative hazard rate function of the HN distribution is specified by
R ( x ) = log [ 1 Φ ( α x + σ e x + μ ) ] ,
while the reversed hazard rate function of the HN distribution is expressed as
r ( x ) = ( α + σ e x ) e 1 2 ( α x + σ e x + μ ) 2 2 π Φ ( α x + σ e x + μ ) .

3.2. Conditional Moments

For studying the reliability of a system, it is of great interest to know the conditional moments that are important tools for prediction purposes. If X is an acrv that follows the HN distribution, then its nth conditional moment is provided by
E ( X n | X > t ) = 1 2 π S ( t ) I 1 ( n , t ) ,
where
I 1 ( n , t ) = t x n ( α + σ e x ) e 1 2 ( α x + σ e x + μ ) 2 d x .
A direct application is discussed below. In reliability engineering, biomedical research, and survival analysis, the vitality function is crucial. A component must have low relative vitality to age quickly, whereas high vitality implies comparatively gradual aging over the course of the given period. The vitality function of an acrv X that follows the HN distribution is expressed by the conditional moments provided in Equation (14) for n = 1 , that is,
V ( t ) = E ( X | X > t ) = I 1 ( 1 , t ) 2 π S ( t ) ,
where I 1 ( n , t ) is presented in Equation (15). For more details on the vitality function, see [13].

3.3. Geometric Vitality Function

The geometric mean of the residual lifetime serves as the foundation for the geometric vitality function. If X is an acrv which represents the lifetime of a component, then the geometric mean of the component that lives up to time t is represented by log G ( t ) = E ( log X | X > t ) , provided that E ( log X ) exists. Hence, the geometric vitality function for a non-negative acrv X with pdf f ( t ) is defined as
log G ( t ) = 1 S ( t ) t ( log x ) f ( x ) d x ,
where S ( t ) = P ( X > t ) represents the survival function. For more information on the geometric vitality function, see [14]. Now, the geometric vitality function of an acrv X that follows the HN distribution is provided by
log G ( t ) = I 2 ( t ) 2 π S ( t ) ,
where
I 2 ( t ) = t ( log x ) ( α + σ e x ) e 1 2 ( α x + σ e x + μ ) 2 d x .

3.4. Moments of Residual Life

The rth order moment of the residual life of an acrv X that follows the HN distribution is obtained as follows:
μ r ( t ) = E [ ( X t ) r | X > t ] = 1 S ( t ) t i = 0 r r i ( t ) r i x i f ( x ) d x = 1 2 π S ( t ) i = 0 r r i ( t ) r i I 1 ( i , t ) ,
where I 1 ( n , t ) is provided in Equation (15).
As a special case, the mean residual life (MRL) function is expressed as
μ 1 ( t ) = V ( t ) t = I 1 ( 1 , t ) 2 π S ( t ) t .
Similarly, the second moment of the residual life time is provided by
μ 2 ( t ) = t 2 2 t V ( t ) + I 1 ( 2 , t ) 2 π S ( t ) = t 2 2 t I 1 ( 1 , t ) 2 π S ( t ) + I 1 ( 2 , t ) 2 π S ( t ) .
Using μ 1 ( t ) and μ 2 ( t ) , it is possible to calculate the variance of the residual life function.

4. Estimation of the Parameters

In this section, we go over how to estimate the HN distribution’s parameters using the ML and Bayesian approaches, the two most widely used techniques.

4.1. Maximum Likelihood Estimation

Let x 1 , x 2 , , x n represent the observed values of a random sample formed of acrvs, say, X 1 , X 2 , , X n , taken from the HN distribution. The log-likelihood function for the parameter vector Θ = ( α , μ , σ ) T is provided by
L n = n log 2 π + i = 1 n log ( α + σ e x i ) 1 2 i = 1 n ( α x i + σ e x i + μ ) 2 .
By maximising Equation (23) with regard to the parameters, the MLEs can be obtained more quickly in practice. Hereafter, we denote by μ ^ , α ^ , and σ ^ the MLEs of μ , α , and σ , respectively. From a computational viewpoint, the score vector function is
U = L n α , L n μ , L n σ T ,
where
L n μ = i = 1 n ( α x i + σ e x i + μ ) ,
L n α = i = 1 n 1 α + σ e x i i = 1 n x i ( α x i + σ e x i + μ )
and
L n σ = i = 1 n e x i α + σ e x i i = 1 n e x i ( α x i + σ e x i + μ ) .
By computationally solving the nonlinear system of equations U = ( 0 , 0 , 0 ) T , it is possible to derive the MLEs. In this study, we obtain the MLEs numerically using the fitdistrplus package in R software (see [15]). The following link provides more information about this package: (https://CRAN.R-project.org/package=fitdistrplus).
We now depict the asymptotic confidence intervals (CIs) for the parameters. On taking the second partial derivatives of Equations (25)–(27) at the MLEs, the observed Hessian matrix of the HN distribution can be obtained, and is specified by
H ^ = 2 L n α 2 2 L n α μ 2 L n α σ 2 L n μ α 2 L n μ 2 2 L n μ σ 2 L n σ α 2 L n σ μ 2 L n σ 2 ( α , μ , σ ) = ( α ^ , μ ^ , σ ^ ) .
Based on the above, the observed Fisher’s information matrix is J ^ = H ^ , from which we can derive the estimated dispersion matrix as
Ξ ^ = J ^ 1 = Ξ ^ 11 Ξ ^ 12 Ξ ^ 13 Ξ ^ 21 Ξ ^ 22 Ξ ^ 23 Ξ ^ 31 Ξ ^ 32 Ξ ^ 33 ,
and Ξ ^ i j = Ξ ^ j i for i j = 1 , 2 , 3 . The asymptotic N distribution of the random versions of the MLEs is guaranteed by the current general theory, with technical regular assumptions. More precisely, the random version of Θ ^ = ( α ^ , μ ^ , σ ^ ) T follows the multivariate N distribution N 3 ( Θ , Ξ ^ ) . Thus, we obtain 100 × ( 1 δ ) % asymptotic CIs of the parameters using the following formulas:
I α = α ^ υ δ / 2 Ξ ^ 11 , I μ = μ ^ υ δ / 2 Ξ ^ 22 , I σ = σ ^ υ δ / 2 Ξ ^ 33 ,
where υ δ is the upper δ th percentile of the SN distribution.

4.2. MCMC Estimation Using Bayesian Approach

In this subsection, we implement the MCMC estimation using the Bayesian estimation procedure on the HN distribution parameters. Each parameter demands a prior pdf to accomplish this. For this, we use two different variants of the almost improper uniform priors, as when using the almost improper uniform priors the comparison with MLEs is not confounded by the choice of proper prior distributions. Thus, we choose Uniform (−1000, 1000) ( U ( 1000 , 1000 ) ) for the parameter μ , and U ( 0 , 1000 ) for the other two parameters. With this configuration, the joint posterior pdf is described by
ψ ( μ , α , σ | x 1 , x 2 , , x n ) L n ψ ( μ ) ψ ( α ) ψ ( σ ) ,
where L n = e L n is the likelihood function of the HN distribution, where ψ ( μ ) , ψ ( α ) and ψ ( σ ) are the almost improper prior pdfs of the corresponding parameters. From Equation (28), there is obviously no analytical way to determine the MCMC Bayesian estimates. To perform numerical computations for MCMC Bayesian estimation, the LaplacesDemon package of the R software, which provides a comprehensive environment for Bayesian inference, is used. For more information and examples about this package, the following link is recommended: https://cran.r-project.org/package=LaplacesDemon (accessed on 14 September 2022).

5. Simulation Study

In this section, we execute simulation tests to evaluate the long-term performance of the MLEs of the HN distribution parameters. Several finite sample sizes are considered. More precisely, we generate samples of sizes n = 25 , 50 , 100 , 250 , and 500 from the HN distribution for two sets of parameter values.

5.1. Simulation Study for the MLEs

Here, the iteration is conducted 500 times, and the mean values of the biases, root mean squared errors (RMSEs), 95 % (asymptotic) coverage probabilities (CPs), and ALs of the 95 % (asymptotic) CIs corresponding to each of the parameter estimates for every replication are calculated with respect to the corresponding sample sizes. The results corresponding to each of the parameter sets are reported in Table 1 and Table 2.
It can be seen that the RMSEs and ALs corresponding to each estimate decrease as the sample size increases. Additionally, for each parameter the CPs are relatively close to expected value of 0.95 . This attests to the MLEs of the HN distribution perform consistently.

5.2. Simulation Study for MCMC Bayesian Estimates

Here, we consider the prior distributions for the HN distribution parameters provided in SubSection 4.2. Iteration is conducted 10,001 times. For each parameter set of the respective sample sizes, the posterior summary results, such as mean, standard deviation (SD), Monte Carlo error (MCE), ALs of the 95 % (asymptotic) CIs, and median, are presented in Table 3 and Table 4.
It can be seen that the SD, MCE, and AL decline as the sample size rises, which indicates that the MCMC Bayesian estimates of the HN distribution parameters perform consistently.

6. Bootstrap CIs

In this section, we approximate the distribution of the MLEs of the HN model parameters using the parametric bootstrap method. Then, we can estimate CIs for each parameter for the fitted HN distribution using the bootstrap distribution. Using the dataset comprised of x 1 , x 2 , , x n , let the MLE of θ { α , μ , σ } be θ ^ . In our setting, the bootstrap stands for a method for estimating the distribution of θ ^ by drawing a random sample ( θ 1 * , θ 2 * , , θ B * ) for θ based on B random samples that are drawn with replacement from x 1 , x 2 , , x n . The sample of bootstrapping, i.e., ( θ 1 * , θ 2 * , , θ B * ) , can be used to construct bootstrap CIs for each HN distribution parameter.
Thus, using the following formulas, we calculate the 100 × ( 1 δ ) % bootstrap CIs for the parameters as
I α , b o o t = α ^ z δ / 2 s e ^ α , b o o t , I μ , b o o t = μ ^ z δ / 2 s e ^ μ , b o o t , I σ , b o o t = σ ^ z δ / 2 s e ^ σ , b o o t ,
where z δ represents the δ th percentile of the bootstrap sample; for θ { α , μ , σ } ,
s e ^ θ , b o o t = 1 B b = 1 B θ b * 1 B b = 1 B θ b * 2 .

7. Applications and Empirical Study

The purpose of this section is to illustrate the empirical significance of the HN distribution. We consider a real dataset from the area of subject physics. The data consist of a sample from an experimental dataset of the strength of glass fibers with a length 15 cm from the National Physical Laboratory in England. This dataset, usually called the glass fiber (GF) dataset, can be found in [16]. The summary statistics of the data are provided in Table 5. We use the R software for numerical evaluations.
We study the empirical hazard rate function of the dataset using the concept of total time on test (TTT) plot. The TTT plot is a graph that is primarily used to distinguish between various ageing types indicated by hazard rate shapes. For more details, see [17]. Figure 4 indicates that the dataset has an increasing hazard rate shape for its empirical hazard rate function. Therefore, the HN distribution represents a credible choice for this dataset.
We demonstrate the potentiality of the HN distribution by comparing it with the Laplace, logistic, Gumbel, and N distributions, all of which are defined on the entire real line. Using the statistical tools of the negative log-likelihood ( log L ), Kolmogorov–Smirnov statistics (KS), Cramér–von Mises ( W * ), Anderson–Darling ( A * ), the values of the Akaike information criterion (AIC), and Bayesian information criterion (BIC), we compare the competing models with the suggested model. The corresponding MLEs and goodness-of-fit (GOF) statistics are presented in Table 6.
From this table, it can be seen that the GOF statistics values of the HN distribution are lower than those of the examined distributions. In light of this, we can draw the conclusion that the suggested HN model provides a better fit than those from the compared distributions.
The plots of the empirical cdfs, empirical pdfs, and the theoretical probabilities against empirical ones (P–P plot) for the real dataset (for the HN model only for the P–P plot) are displayed in Figure 5.
This graphic displays attractive curves for the fitted and empirical functions. In light of this, we can draw the conclusion that the HN distribution is the most appropriate for this dataset among the alternative distributions.
Now, the empirical Hessian matrix is obtained as follows:
H ^ = 65.5553 51.9800 185.5325 51.980 46.000 147.2665 185.5325 147.2665 528.4451 ,
and the associated estimated dispersion matrix is
Ξ ^ = 2.4932 0.1388 0.8367 0.1388 0.2093 0.0096 0.8367 0.0096 0.2983 .
In order to complete the above analysis, Table 7 provides the 95 % (asymptotic) CIs for the HN distribution parameters.
On the basis of the GF dataset outlined above, we now concentrate on estimating the HN distribution parameters using the MCMC Bayesian technique. The investigation was carried out using the MH algorithm of the MCMC method with respect to 1000 iterations in the perspective of Bayesian estimation. All three MCMC Bayesian estimates of the HN distribution parameters are included in Table 8 for comparison with the MLEs. Numerical computations for MCMC Bayesian estimation were performed using R software.
We now focus on the 95 % bootstrap CIs for the HN distribution parameters using the computed MLEs. To this end, we generate 1001 samples of the same size as the GF dataset using the HN distribution, with the true values of parameters chosen as the corresponding MLEs. For each obtained sample, we calculate the MLEs α ^ b * , μ ^ b * , and σ ^ b * for b { 1 , 2 , , 1001 } . Thus, for the parameters α , μ , and σ , Table 9 shows the median and 95 % bootstrap CIs.
In order to identify any potential structural correlation between the parameters, it is noteworthy to examine the joint distribution of the bootstrapped values in the matrix of scatter plots. Figure 6 demonstrates the matrix scatterplots of the bootstrapped HN parameter values, which reflect the joint uncertainty distribution with respect to the fitted parameters.

8. Concluding Remarks

In this study, we suggest a new distribution that generalizes the well-known normal distribution and is defined on the entire real line. We investigate the mathematical and statistical features of the new model, which we refer to as the hybrid normal (HN) distribution. With regard to the HN distribution, we present explicit expressions for several reliability metrics. The hazard rate function of the HN distribution possesses increasing shaped graphical representation. In terms of inference, the observed information matrix is presented together with the estimation of the model parameters using the maximum likelihood and Bayesian estimation methods, with Bayesian estimation performed using the MCMC approach. In addition, we use the parametric bootstrap method to obtain the model parameter confidence intervals. Goodness-of-fit tests are applied to a real dataset concerning the strength of glass fibers with a length of 15 cm from the National Physical Laboratory in England in order to demonstrate the new model’s real-world applicability. In comparison to the other analyzed models, the new model consistently offers a superior fit. We expect that the proposed model can be used more frequently to represent actual datasets in a variety of fields, including physics, engineering, economics, hydrology, survival analysis, and others.

Author Contributions

Conceptualization, D.S.S., S.L.N., C.C., M.R.I., S.P.S. and R.M.; methodology, D.S.S., S.L.N., C.C., M.R.I., S.P.S. and R.M.; validation, D.S.S., S.L.N., C.C., M.R.I., S.P.S. and R.M.; formal analysis, D.S.S., S.L.N., C.C., M.R.I., S.P.S. and R.M.; investigation, D.S.S., S.L.N., C.C., M.R.I., S.P.S. and R.M.; writing—original draft preparation, D.S.S., S.L.N., C.C., M.R.I., S.P.S. and R.M.; writing—review and editing, D.S.S., S.L.N., C.C., M.R.I., S.P.S. and R.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data are contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
  2. Wahed, A.; Ali, M. The skew-logistic distribution. J. Statist. Res. 2001, 35, 71–80. [Google Scholar]
  3. Genton, M. Skew-Elliptical Distributions and Their Applications: A Journey Beyond Normality; CRC Press: Boca Raton, FL, USA, 2004; pp. 1–396. [Google Scholar]
  4. Coles, S. An Introduction to Statistical Modeling of Extreme Values; Springer: Berlin/Heidelberg, Germany, 2001; Volume 208. [Google Scholar] [CrossRef]
  5. Castillo, E.; Hadi, A.; Balakrishnan, N.; Sarabia, J. Extreme Value and Related Models with Applications in Engineering and Science; Wiley: Hoboken, NJ, USA, 2005. [Google Scholar]
  6. Ferrari, S.; Pinheiro, E.C. Small-sample likelihood inference in extreme-value regression models. J. Stat. Comput. Simul. 2012, 84, 582–595. [Google Scholar] [CrossRef]
  7. Domma, F.; Perri, P.F. Some developments on the log-Dagum distribution. Stat. Methods Appl. 2009, 18, 205–220. [Google Scholar] [CrossRef]
  8. Landauer, R. The physical nature of information. Phys. Lett. A 1996, 217, 188–193. [Google Scholar] [CrossRef]
  9. Wang, X.; Wang, B.; Zhang, Y.; Suo, Y.; Jia, P.; Huang, F. Dispersion of mechanical properties of high-strength glass fibre composites in hygrothermal environment. Polymers 2022, 14, 3514. [Google Scholar] [CrossRef]
  10. Kumazawa, S.; Numakunai, T. A New Theoretical analysis of occupational dose distributions Indicating the Effect of Dose Limits. Health Phys. 1981, 41, 465–475. [Google Scholar] [CrossRef]
  11. Fletcher, S.J.; Zupanski, M. A hybrid multivariate normal and lognormal distribution for data assimilation. Atmos. Sci. Lett. 2006, 7, 43–46. [Google Scholar] [CrossRef]
  12. Rupšys, P.; Narmontas, M.; Petrauskas, E. A multivariate hybrid stochastic differential equation model for whole-stand dynamics. Mathematics 2020, 8, 2230. [Google Scholar] [CrossRef]
  13. Kupka, J.; Loo, S. The hazard and vitality measures of ageing. J. Appl. Probab. 1989, 26, 532–542. [Google Scholar] [CrossRef]
  14. Nair, K.; Rajesh, G. Geometric vitality function and its applications to reliability. IAPQR Trans. 2000, 25, 1–8. [Google Scholar]
  15. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2022; Available online: https://www.R-project.org/ (accessed on 14 September 2022).
  16. Smith, R.L.; Naylor, J.C. A comparison of maximum likelihood and Bayesian estimators for the three-parameter Weibull distribution. J. R. Stat. Soc. Ser. (Appl. Stat.) 1987, 36, 358–369. [Google Scholar] [CrossRef]
  17. Aarset, M.V. How to identify a bathtub hazard rate. IEEE Trans. Reliab. 1987, R-36, 106–108. [Google Scholar] [CrossRef]
Figure 1. Cdf plots of the HN distribution.
Figure 1. Cdf plots of the HN distribution.
Dynamics 02 00023 g001
Figure 2. Pdf plots of the HN distribution: (a) bimodal with a dominant bell shape, (b) bimodal with a bell-shape and a spike shape, and (c) original skewness shapes.
Figure 2. Pdf plots of the HN distribution: (a) bimodal with a dominant bell shape, (b) bimodal with a bell-shape and a spike shape, and (c) original skewness shapes.
Dynamics 02 00023 g002
Figure 3. Hazard rate function plots of the HN distribution: (a) moving σ and (b) combined action of α and σ .
Figure 3. Hazard rate function plots of the HN distribution: (a) moving σ and (b) combined action of α and σ .
Dynamics 02 00023 g003
Figure 4. The TTT plot of the strength of the GF dataset.
Figure 4. The TTT plot of the strength of the GF dataset.
Dynamics 02 00023 g004
Figure 5. Empirical plots on the strength of the GF dataset: (a) empirical versus theoretical cdfs, (b) histogram and fitted pdfs, and (c) P–P plot.
Figure 5. Empirical plots on the strength of the GF dataset: (a) empirical versus theoretical cdfs, (b) histogram and fitted pdfs, and (c) P–P plot.
Dynamics 02 00023 g005
Figure 6. Matrix scatterplots of the bootstrapped HN parameter values due to strength of the GF dataset.
Figure 6. Matrix scatterplots of the bootstrapped HN parameter values due to strength of the GF dataset.
Dynamics 02 00023 g006
Table 1. The simulation results of the MLEs for the set ( α = 0.2 , μ = 0.9 , σ = 0.6 ).
Table 1. The simulation results of the MLEs for the set ( α = 0.2 , μ = 0.9 , σ = 0.6 ).
ParametersnMLEBiasRMSECPAL
α 250.20870.00870.03940.9660.1548
500.20450.00450.02730.9480.1063
1000.20220.00220.01780.9640.0742
2500.20150.00150.01200.9420.0466
5000.20070.00070.00860.9440.0328
μ 250.92370.02370.31060.9581.2256
500.91730.01730.22400.9360.8528
1000.90510.00510.15320.9420.5976
2500.90390.00390.09420.9640.3757
5000.8998-0.000180.06500.9660.2651
σ 250.78690.18690.57330.9541.5415
500.67540.07540.28420.9560.9509
1000.63760.03760.16950.9500.6353
2500.61330.01330.10170.9440.3889
5000.60670.00670.06830.9500.2717
Table 2. The simulation results of the MLEs for the set ( α = 1.1 , μ = 0.2 , σ = 0.05 ).
Table 2. The simulation results of the MLEs for the set ( α = 1.1 , μ = 0.2 , σ = 0.05 ).
ParametersnMLEBiasRMSECPAL
α 251.0490−0.05100.24320.9681.1842
501.0689−0.03110.16810.9640.7897
1001.0803−0.01970.11330.9880.5403
2501.0894−0.01060.07530.9740.3350
5001.0935−0.00650.05660.9680.2327
μ 250.0443−0.15570.38370.9661.5773
500.1190−0.08100.25660.9661.0372
1000.1509−0.04910.17270.9620.7024
2500.1746−0.02540.10500.9680.4336
5000.1864−0.01360.07490.9640.2997
σ 250.19160.14160.29480.9781.0386
500.12500.07500.17070.9780.6460
1000.09130.04130.10250.9840.4234
2500.07200.02200.06390.9720.2570
5000.05960.00960.04390.9680.1743
Table 3. Posterior summary results for ( α = 0.2 , μ = 0.9 , σ = 0.6 ).
Table 3. Posterior summary results for ( α = 0.2 , μ = 0.9 , σ = 0.6 ).
ParametersnMeanSDMCEALMedian
α 250.29010.05110.00840.18910.2866
500.29400.04290.00720.16070.2883
1000.21090.02150.00340.08370.2115
2500.21080.01060.00320.03570.2133
5000.20900.00520.00080.01440.2038
μ 251.02480.28680.04401.21011.0270
501.06880.25190.03890.90151.0579
1000.80760.15920.02510.52980.7742
2500.91860.10710.02010.36150.9101
5000.96510.03840.00460.09840.9855
σ 250.47470.29090.05921.10740.4390
500.49460.24430.05280.84780.5202
1000.74440.20180.04290.80190.7700
2500.55790.07410.02350.29610.5930
5000.65330.05970.01560.18310.5656
Table 4. Posterior summary results for ( α = 1.1 , μ = 0.2 , σ = 0.05 ).
Table 4. Posterior summary results for ( α = 1.1 , μ = 0.2 , σ = 0.05 ).
ParametersnMeanSDMCEALMedian
α 251.23240.20870.02550.81821.2660
501.10250.14810.01770.58101.0935
1001.05280.09170.01590.34201.0744
2501.10090.08610.01320.27511.1148
5001.09930.05900.00580.14721.1162
μ 250.22780.25690.04941.14990.2437
500.17670.16900.03050.81050.2195
1000.25300.14820.02310.67990.2523
2500.21390.09100.01410.34270.2457
5000.21510.08380.00850.25580.1958
σ 250.04760.13300.02770.34820.03409
500.03160.06590.01290.19010.0272
1000.04690.05410.01210.17610.0237
2500.07410.05090.01180.09090.0676
5000.03000.03310.01080.06930.0120
Table 5. Descriptive statistics of the strength of the GF dataset.
Table 5. Descriptive statistics of the strength of the GF dataset.
StatisticnMinFirst QuartileMedianMeanThird QuartileMax
Values460.370.95751.161.131.33751.61
Table 6. MLEs and GOF statistics results of the strength of the GF dataset.
Table 6. MLEs and GOF statistics results of the strength of the GF dataset.
EstimatesLaplaceLogisticGumbelNHybrid N
α ^ ----0.0142
μ ^ 1.16091.15040.98651.1300−4.0688
σ ^ 0.20950.14980.31110.26841.2659
log L 5.99964.461313.49784.77002.2719
KS0.08640.07340.14970.09280.0619
W *0.06480.04110.29470.07420.0187
A *0.51210.37901.98810.52670.1397
AIC15.999312.922530.995513.540010.5439
BIC19.656616.579834.652817.197316.0298
Table 7. The 95% (asymptotic) CIs of the HN distribution parameters based on the strength of the GF dataset.
Table 7. The 95% (asymptotic) CIs of the HN distribution parameters based on the strength of the GF dataset.
ParameterLowerUpper
α −3.08063.1090
μ −4.9655−3.1720
σ 0.19542.3364
Table 8. ML and MCMC Bayesian estimates of the HN distribution parameters on the strength of the GF dataset.
Table 8. ML and MCMC Bayesian estimates of the HN distribution parameters on the strength of the GF dataset.
ParameterMLMCMC Bayesian
α 0.01420.0135
μ −4.0688−4.1753
σ 1.26591.2977
Table 9. The median and 95% bootstrap CIs for the HN distribution parameters on the strength of the GF dataset.
Table 9. The median and 95% bootstrap CIs for the HN distribution parameters on the strength of the GF dataset.
ParameterMedianBootstrap CI
Strength of the α 0.2340(0.0001, 4.3993)
GF μ −4.2647(−5.6048, −3.4066)
dataset σ 1.1516(0.0001, 1.5551)
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Shibu, D.S.; Nitin, S.L.; Chesneau, C.; Irshad, M.R.; Shibin, S.P.; Maya, R. On the Hybrid Normal Distribution and Its Application in Fiber Data on the Strength of Glass. Dynamics 2022, 2, 399-413. https://doi.org/10.3390/dynamics2040023

AMA Style

Shibu DS, Nitin SL, Chesneau C, Irshad MR, Shibin SP, Maya R. On the Hybrid Normal Distribution and Its Application in Fiber Data on the Strength of Glass. Dynamics. 2022; 2(4):399-413. https://doi.org/10.3390/dynamics2040023

Chicago/Turabian Style

Shibu, Damodaran Santhamani, Soman Latha Nitin, Christophe Chesneau, Muhammed Rasheed Irshad, Sobhanam Padmini Shibin, and Radhakumari Maya. 2022. "On the Hybrid Normal Distribution and Its Application in Fiber Data on the Strength of Glass" Dynamics 2, no. 4: 399-413. https://doi.org/10.3390/dynamics2040023

Article Metrics

Back to TopTop