Article

Properties and Limiting Forms of the Multivariate Extended Skew-Normal and Skew-Student Distributions

by Christopher J. Adcock 1,2
1 Sheffield University Management School, University of Sheffield, Sheffield S10 1FL, UK
2 UCD Michael Smurfit Graduate Business School, University College Dublin, Carysfort Avenue, Blackrock, D04 V1W8 Dublin, Ireland
Stats 2022, 5(1), 270-311; https://doi.org/10.3390/stats5010017
Submission received: 9 November 2021 / Revised: 22 February 2022 / Accepted: 2 March 2022 / Published: 9 March 2022
(This article belongs to the Special Issue Multivariate Statistics and Applications)

Abstract

This paper is concerned with the multivariate extended skew-normal [MESN] and multivariate extended skew-Student [MEST] distributions, that is, distributions in which the location parameters of the underlying truncated distributions are not zero. The extra parameter leads to greater variability in the moments and critical values, thus providing greater flexibility for empirical work. The paper reports various theoretical properties of the extended distributions, notably the limiting forms as the magnitude of the extension parameter, denoted $\tau$ in this paper, increases without limit. In particular, it is shown that as $\tau \to -\infty$, the limiting forms of the MESN and MEST distributions are different. The effect of the difference is exemplified by a study of stock market crashes. A second example is a short study of the extent to which the extended skew-normal distribution can be approximated by the skew-Student.

1. Introduction

The skew-normal distribution was introduced in [1] and the skew-Student in [2]. These two distributions share the property that they may be derived formally. There are several methods of derivation, of which probably the best known is to consider the bivariate normal distribution of X and Y, each with zero mean, unit variance, and correlation $\rho$. The skew-normal distribution then arises by considering the distribution of X conditional on $Y < 0$ [or $Y > 0$]. A second method of construction is to consider a random variable $X + \lambda U$, where U has a standard normal distribution truncated from below at zero, written $U \sim TN(0, 1; 0+)$, where $TN(\mu, \sigma^2; x+)$ denotes a normally distributed variable with mean $\mu$ and standard deviation $\sigma$ truncated from below at x, and $\lambda \in \mathbb{R}$. There are similar and equally well-known constructions for the skew-Student and for the multivariate versions of these distributions. That the conditioning variable Y is required to be less than (greater than) zero and that U follows a standard normal distribution truncated from below at zero are, however, limitations. This is for four principal reasons. First, using negative (positive) values of Y to determine whether or not X is observed is self-evidently a limitation. Depending on the application, the appropriate threshold or truncation point for Y might take any nonzero value, as might the value of the mean of its underlying normal distribution. For example, the recent paper [3] refers to early work by [4]. The latter was concerned with the scores from admission examinations: in such a case, the mean of Y would surely be greater than zero, as would the truncation point. Similarly, there is often no reason a priori for the underlying mean of U to be zero.
Second, empirical evidence reported in the financial economics literature suggests that in the absence of truncation from below at zero, the distribution of the unobserved variable U, denoted $N(\tau, 1)$ in this paper, exhibits nonzero values of $\tau$ (see, for example, [5] or [6]). In the first method of derivation above, the corresponding conditioning event would be that $Y < \tau$ ($Y > \tau$). Such distributions are referred to in the literature (see [7,8]) as extended skew-normal or extended skew-Student. The importance of nonzero values of $\tau$ also arises in stochastic frontier analysis, commonly referred to as SFA. SFA models are used to measure the efficiency of manufacturing companies and organizations such as banks. There is a detailed review of SFA models and methods in [9]. In its basic form, SFA employs linear regression models in which the unobserved residual has two components, commonly written as $\epsilon - \nu$. The first term, $\epsilon$, is a standard $N(0, \sigma_\epsilon^2)$ variate. The second term, $\nu$, is a non-negative variate assumed to have an $N(0, \sigma_\nu^2)$ distribution, which is truncated from below at zero; that is, a half-normal distribution. The expected value of $\nu$, which is nonzero, measures inefficiency. With these assumptions, the residual $\epsilon - \nu$ has a skew-normal distribution. A somewhat different model was introduced by [10], in which the half-normal variable is replaced by one which has an exponential distribution. The paper by [11] shows that in the limit as $\tau \to -\infty$, and with suitable choice of other model parameters, the extended skew-normal distribution encompasses both the half-normal and exponential distributions for the inefficiency term $\nu$. Use of the extended version of the distribution offers greater flexibility in modeling inefficiency: the distribution of the inefficiency variable may exhibit a nonzero mode or may decay steeply.
Nonzero and negative values of $\tau$ also arise in the study of stock market crashes. The standard model in financial economics is that returns on risky financial assets follow a multivariate normal distribution. Under this assumption, formally, the basic model of portfolio theory is to consider the conditional distribution of asset returns given a specified return on a market index. The resulting conditional distribution is multivariate normal, leading in essence to regression models for the return on individual assets. In the same manner, a market crash may be studied by considering the distribution of asset returns given that the return on the market index is less than a specified negative value. The resulting distribution is multivariate extended skew-normal. For market crashes, the value of the parameter denoted $\tau$ is both negative and of substantial magnitude. Analogous results arise if it is assumed that returns follow a multivariate Student distribution. In both the normal and Student cases, the distributions that arise as $\tau \to -\infty$ are of interest, one reason being that the limiting properties are different.
Third, use of extended versions of the skew-normal or skew-Student gives greater variability in the moments and critical values of the distributions. For empirical applications, this offers the possibility of better model fit. For some applications, the implied flexibility in the formal foundations may offer insights into the underlying data generation process. Last, in the multivariate case, conditional distributions are in general of the extended type. Thus, for applications where conditional distributions play a role, extended versions are important if not unavoidable. The formal derivation of a skew-normal regression model as in [5] offers an example of this.
Extended versions of the skew-normal and skew-Student distributions have explicit advantages for some purposes. They offer the potential for greater flexibility in empirical work and, in addition, methodological advantages in some cases. The main aim of this paper is to present properties of the multivariate extended skew-normal (MESN) and multivariate extended skew-Student (MEST) distributions. The results demonstrate the differences from the standard versions. The paper also studies limiting cases of the distributions as the magnitude of the extension parameter τ increases without limit, extending a result reported in [11]. As the paper shows, these limiting cases are of interest from a theoretical point of view and offer insights for some applications.
The methodological results are illustrated by two applications. First, there is a study of the effect of a stock market crash. The results are different depending on whether the underlying distributions are multivariate normal or Student. The study presented here is theoretical, but its results can inform the development of econometric models of stock returns. Second, some researchers in this area of statistics have suggested informally that the skew-Student could be used as an alternative to the extended skew-normal. For a specified univariate application, it would be straightforward to estimate the parameters of both distributions and then make an informed choice using a test of fit or, for example, consideration of the tails of the distribution. Such an alternative may be attractive, but the suggestion could equally well be made in reverse: the extended skew-normal could be an alternative to the skew-Student. A general investigation of the similarity of the two distributions, particularly for multivariate cases, would be a major task and beyond the scope of this paper. To inform further research into this issue, this paper contains a short study designed to investigate this conjecture.
The structure of this paper is as follows. In Section 2 and Section 3, results for the MESN and MEST distributions, respectively, are presented. The results in these two sections are based on the extended versions of the second method of construction referred to above. Section 4 is concerned with the first method of construction, sometimes referred to in the literature as a hidden truncation model. This section contains the illustrative example of the effect of a stock market crash. The example shows that different behavior arises depending on the choice of model. Section 5 describes a brief investigation into the use of the skew-Student as an alternative to the extended skew-normal. Section 6 offers some concluding remarks. The abbreviations (E)SN and (E)ST are used for the univariate (extended) skew-normal and (extended) skew-Student distributions, respectively, with MSN and MST for the multivariate versions. Examples and graphs are based on univariate distributions, with most numerical results rounded to four decimal places. Notation not defined explicitly in the text is that in common use.

2. Multivariate Extended Skew-Normal Distribution

The multivariate skew-normal distribution was introduced by [12]. The multivariate extended skew-normal distribution, MESN, with an additional parameter, was first described in [13] and, independently, in [8,14]. Following the notation in the third of these papers, the distribution of an n-vector X that follows this distribution is denoted $MESN_n(\mu, \Sigma, \lambda, \tau)$. The authors of reference [13] derive the MESN distribution as a hidden truncation model. The authors of reference [8] present a direct derivation and link it to results in [7], who show that conditional distributions are in general of the extended type. The authors of reference [14] derive it as the convolution $X = U + \lambda V$, where the random vector U has the multivariate normal distribution $N_n(\mu, \Sigma)$ and the scalar random variable V is independently normally distributed as $N(\tau, 1)$ truncated from below at 0, denoted $V \sim TN(\tau, 1; 0+)$. The basic properties of the MESN distribution are described in this section using the notation in [14]. The probability density function of the distribution of X is
$$
f_X(x) = \phi_n(x; \mu + \lambda\tau, \Sigma + \lambda\lambda^T)\,\frac{\Phi\{\tau\omega + \lambda^T\Sigma^{-1}(x - \mu - \lambda\tau)/\omega\}}{\Phi(\tau)}, \tag{1}
$$
where
$$
\omega = \sqrt{1 + \lambda^T\Sigma^{-1}\lambda}. \tag{2}
$$
$\phi_n(x; \mu, \Sigma)$ is the probability density function of an n-vector X, which has a multivariate normal distribution with mean vector $\mu$ and covariance matrix $\Sigma$, evaluated at x. $\Phi(z)$ is the standard normal distribution function evaluated at z, with $\phi(z)$ denoting the corresponding density function. The distribution is denoted $X \sim MESN_n(\mu, \Sigma, \lambda, \tau)$. The moment generating function of X is
$$
M_X(t) = \exp\{(\mu + \lambda\tau)^T t + t^T(\Sigma + \lambda\lambda^T)t/2\}\,\Phi(\tau + \lambda^T t)/\Phi(\tau). \tag{3}
$$
The mean vector and covariance matrix of the MESN distribution are, respectively,
$$
E(X) = \mu + \lambda\{\tau + \xi_1(\tau)\} = \alpha, \quad \mathrm{cov}(X) = \Sigma + \lambda\lambda^T\{1 + \xi_2(\tau)\} = \Omega, \tag{4}
$$
where the function $\xi_k(z)$ is defined as
$$
\xi_k(z) = \partial^k \log\Phi(z)/\partial z^k. \tag{5}
$$
Note that the covariance matrix may also be written
$$
\mathrm{cov}(X) = \Sigma + \lambda\lambda^T\{1 - \tau\xi_1(\tau) - \xi_1(\tau)^2\}, \tag{6}
$$
a form that is referred to in Section 3.3. Coskewness and cokurtosis, defined here as the fourth cumulant, are given by
$$
\lambda_i\lambda_j\lambda_k\,\xi_3(\tau); \quad \lambda_i\lambda_j\lambda_k\lambda_l\,\xi_4(\tau), \tag{7}
$$
respectively.
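As a numerical sanity check on the density and the first two moments above, the following minimal sketch evaluates the univariate case, in which the covariance matrix reduces to a scalar $\sigma^2$. The function names and the parameter values ($\mu = 0$, $\sigma = 1$, $\lambda = 2$, $\tau = -1$) are illustrative assumptions and are not taken from the paper; the integration uses a simple trapezoidal rule rather than any method used by the author.

```python
import math


def norm_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def norm_cdf(z):
    """Standard normal distribution function; erfc keeps the lower tail accurate."""
    return 0.5 * math.erfc(-z / math.sqrt(2.0))


def esn_pdf(x, mu, sigma, lam, tau):
    """Univariate case of the MESN density, with Sigma = sigma**2."""
    omega = math.sqrt(1.0 + lam * lam / sigma**2)
    scale = math.sqrt(sigma**2 + lam * lam)  # sd of the symmetric normal factor
    d = x - mu - lam * tau
    skew_arg = tau * omega + lam * d / (sigma**2 * omega)
    return norm_pdf(d / scale) / scale * norm_cdf(skew_arg) / norm_cdf(tau)


def esn_moments_formula(mu, sigma, lam, tau):
    """Mean and variance from the closed-form expressions in xi_1(tau)."""
    xi1 = norm_pdf(tau) / norm_cdf(tau)
    mean = mu + lam * (tau + xi1)
    var = sigma**2 + lam * lam * (1.0 - tau * xi1 - xi1 * xi1)
    return mean, var


def esn_moments_numeric(mu, sigma, lam, tau, n=40000, width=12.0):
    """Total mass, mean and variance by the trapezoidal rule."""
    scale = math.sqrt(sigma**2 + lam * lam)
    lo = mu + lam * tau - width * scale
    h = 2.0 * width * scale / n
    m0 = m1 = m2 = 0.0
    for i in range(n + 1):
        x = lo + i * h
        w = 0.5 if i in (0, n) else 1.0
        f = w * esn_pdf(x, mu, sigma, lam, tau)
        m0 += f
        m1 += f * x
        m2 += f * x * x
    m0, m1, m2 = m0 * h, m1 * h, m2 * h
    return m0, m1, m2 - m1 * m1


# Hypothetical parameter values chosen only for illustration.
mass, mean_num, var_num = esn_moments_numeric(0.0, 1.0, 2.0, -1.0)
mean_f, var_f = esn_moments_formula(0.0, 1.0, 2.0, -1.0)
print(mass, mean_num, mean_f, var_num, var_f)
```

If the reconstruction of the density is right, the total mass should be close to one and the numerical mean and variance should agree with the closed-form values.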
For the skew-normal distribution itself, the mean of the underlying truncated normal variable, denoted V, equals $\sqrt{2/\pi}$. Rounded to four decimal places, this value is shown in panel 2, column 1 of Table 1 in the row named “mean”. When $|\tau| \le 1$, the minimum and maximum values of the mean are 0.5251 and 1.2876, respectively, as shown in panels 1 and 3. The corresponding results for the higher moments are shown in the other rows of column 1 of the table. Columns 2 and 3 show the analogous results when $|\tau| \le 5$ and $\le 30$, respectively. Thus, as well as arising automatically under conditioning, the extended version of the skew-normal provides for more flexibility in the moments of the distribution.
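The three values of the mean quoted above can be reproduced from the expression $E(V) = \tau + \phi(\tau)/\Phi(\tau)$. The short sketch below does this with the Python standard library; the function name `truncated_normal_mean` is an illustrative choice.

```python
import math


def norm_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def norm_cdf(z):
    """Standard normal distribution function."""
    return 0.5 * math.erfc(-z / math.sqrt(2.0))


def truncated_normal_mean(tau):
    """E(V) = tau + xi_1(tau) for V ~ TN(tau, 1; 0+)."""
    return tau + norm_pdf(tau) / norm_cdf(tau)


for tau in (-1.0, 0.0, 1.0):
    print(tau, round(truncated_normal_mean(tau), 4))
# -1.0 0.5251
# 0.0 0.7979
# 1.0 1.2876
```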
In their Lemma 2, reference [11] report an apparently known result concerning the limiting distribution of X as $\tau \to -\infty$. The lemma is reported here for convenience.
Lemma 1 ([11]). Let X be distributed as $MESN_n(\mu, \Sigma, \lambda, \tau)$. As $\tau \to -\infty$, the distribution of X tends to the multivariate normal distribution $N_n(\mu, \Sigma)$.
An implication of this result, described in more detail below, is that as $\tau < 0$ increases in magnitude, V has approximately an exponential distribution with parameter $1/|\tau|$, that is, with mean and standard deviation both equal to $1/|\tau|$. As $\tau \to \infty$, the distribution of X tends to a multivariate normal with an unbounded mean vector $\mu + \lambda\tau$ but a finite covariance matrix $\Sigma + \lambda\lambda^T$.
The remainder of this section of the paper presents a number of properties of the MESN distribution. Figure 1 shows two sets of examples of the density function of the (univariate) extended skew-normal distribution for $\tau = \pm 30, \pm 15, \pm 5, \pm 2.5$, and 0. In the left-hand set, the nonzero values of the extension parameter $\tau$ are negative. In the right-hand set, the signs of $\tau$ are reversed. In both sets, $\mu = 0$, $\sigma = 2$, and $\lambda = 5$. Both sets demonstrate that asymmetry disappears progressively as $|\tau|$ increases and exhibit the properties reported in Lemma 1 and the text that follows it.
Papers by [7,15] show that a suitable linear transformation reduces the MSN distribution to a canonical form. Corresponding representations may be derived for the extended version of the distribution and, as shown below, for the extended skew-Student. These representations depend on the following standard result.
Lemma 2. Let $I_n$ denote an $n \times n$ unit matrix, $\psi$ an n-vector and $0_n$ an n-vector of zeros. The eigenvalues of the matrix $I_n + \psi\psi^T$ are (i) $1 + \psi^T\psi$ and (ii) 1 repeated $n - 1$ times. The corresponding eigenvectors are (i) $\psi/\sqrt{\psi^T\psi}$ and (ii) an $n \times (n - 1)$ orthogonal matrix $T_0$ which satisfies $\psi^T T_0 = 0_{n-1}^T$.
This is used to establish the following:
Proposition 1. Let $X \sim MESN_n(\mu, \Sigma, \lambda, \tau)$ and let $Y = \Sigma^{-1/2}(X - \mu)$, where $\Sigma^{1/2}$ is a left square-root matrix of $\Sigma$. Then, $Y \sim MESN_n(0_n, I_n, \psi, \tau)$, $\psi = \Sigma^{-1/2}\lambda$, and there exists an orthogonal transformation of Y
$$
T^T Y = (Z_0, Z^T)^T,
$$
such that $Z_0$ and $Z$ are independently distributed, $Z_0 \sim ESN(0, 1, \sqrt{\psi^T\psi}, \tau)$ and $Z \sim N_{n-1}(0_{n-1}, I_{n-1})$.
Note that from Lemma 1, as $\tau \to -\infty$, the limiting distribution of $Z_0$ is the standard normal.
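The eigen-structure in Lemma 2, on which the canonical form rests, is easy to confirm numerically. The sketch below is a small pure-Python check for $n = 3$; the vector `psi` and the orthogonal vector `orth` are hypothetical values chosen for illustration, and the helper names are not from the paper.

```python
import math


def dot(u, v):
    """Inner product of two vectors given as lists."""
    return sum(a * b for a, b in zip(u, v))


def identity_plus_outer(psi):
    """Return the matrix I_n + psi psi^T as a list of rows."""
    n = len(psi)
    return [[(1.0 if i == j else 0.0) + psi[i] * psi[j] for j in range(n)]
            for i in range(n)]


def is_eigenpair(matrix, vec, value, tol=1e-12):
    """Check that matrix @ vec equals value * vec up to a tolerance."""
    image = [dot(row, vec) for row in matrix]
    return all(abs(a - value * b) <= tol for a, b in zip(image, vec))


psi = [1.0, -2.0, 0.5]            # hypothetical shape vector
a = identity_plus_outer(psi)

# (i) the normalised psi is an eigenvector with eigenvalue 1 + psi^T psi
lead = [p / math.sqrt(dot(psi, psi)) for p in psi]
print(is_eigenpair(a, lead, 1.0 + dot(psi, psi)))

# (ii) any vector orthogonal to psi is an eigenvector with eigenvalue 1
orth = [2.0, 1.0, 0.0]            # psi . orth = 2 - 2 + 0 = 0
print(is_eigenpair(a, orth, 1.0))
```

Both checks should print `True`, reflecting that $(I_n + \psi\psi^T)v = v + \psi(\psi^T v)$ for any vector $v$.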

2.1. The Truncated Normal Distribution and Its Approximations

The probability density function of the distribution of the truncated normal variable $V \sim TN(\tau, 1; 0+)$ is
$$
f_V(v) = \phi_1(v; \tau, 1)/\Phi(\tau). \tag{8}
$$
The moment-generating function (MGF), originally reported in [16], is
$$
M_V(t) = e^{\tau t + t^2/2}\,\Phi(\tau + t)/\Phi(\tau), \tag{9}
$$
with the MGF valid for all $t \in \mathbb{R}$. Following on from [17], numerous authors present results for the moments of the truncated normal distribution and generalizations thereof. These include [18,19,20,21,22] and, recently, [23], among others. For values of $\tau$ that are less than zero, the asymptotic expansion of $\Phi(\tau)$ from page 932 of [24] is
$$
\phi(\tau)\left\{\frac{1}{|\tau|} + \sum_{j=1}^{m} \frac{(-1)^j\,\Gamma(2j)}{\Gamma(j)\,2^{j-1}\,|\tau|^{2j+1}}\right\} + R_m(|\tau|) \simeq \phi(\tau)\,D_m(|\tau|), \tag{10}
$$
where $D_m(|\tau|)$ denotes the sum in braces. With suitable choices of m and values of $\tau$, the remainder term $R_m$,
$$
R_m(|\tau|) = (-1)^{m+1}\,\frac{\Gamma(2m)\,(2m+1)}{\Gamma(m)\,2^{m-1}} \int_{-\infty}^{\tau} \frac{\phi(x)}{x^{2(m+1)}}\,dx, \tag{11}
$$
may be ignored. In this case, the moment-generating function of V is
$$
M_V(t) \simeq D_m(|\tau|)^{-1}\left\{\frac{1}{|\tau|\,(1 - t/|\tau|)} + \sum_{j=1}^{m} \frac{(-1)^j\,\Gamma(2j)}{\Gamma(j)\,2^{j-1}\,|\tau|^{2j+1}\,(1 - t/|\tau|)^{2j+1}}\right\}. \tag{12}
$$
This leads to a distribution for which the corresponding density function is a weighted average of gamma densities
$$
f_V(v) = D_m(|\tau|)^{-1}\left\{\frac{g(v; 1/|\tau|, 1)}{|\tau|} + \sum_{j=1}^{m} \frac{(-1)^j\,\Gamma(2j)\,g(v; 1/|\tau|, 2j+1)}{\Gamma(j)\,2^{j-1}\,|\tau|^{2j+1}}\right\}, \tag{13}
$$
where $g(\cdot)$ denotes the density function of the gamma distribution
$$
g(x; \alpha, \nu) = \frac{x^{\nu-1} e^{-x/\alpha}}{\alpha^{\nu}\,\Gamma(\nu)}; \quad \alpha, \nu > 0. \tag{14}
$$
For sufficiently large values of $|\tau|$, terms after the first may be ignored, giving an exponential distribution with density function
$$
f_V(v) = |\tau|\,e^{-|\tau| v}. \tag{15}
$$
When used to form the convolution $X = U + \lambda V$, the distribution at (15) leads to the skew-normal exponential distribution described in [11] but originally due to [10]. Figure 2 shows sketches of the truncated normal density function for $\tau = -3$, $-10$ and $-30$. The steepness of decay increases with $|\tau|$. Figure 3 shows the truncated normal density function for $\tau = -3$, together with the corresponding exponential density function and an approximation based on the density at Equation (13) with $m = 2$. As Figure 3 indicates, the three density functions are visually similar. In particular, there is little difference between the truncated normal density and the three-term mixture based on Equation (13).
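The comparison described for Figure 3 can be sketched numerically. The code below evaluates the truncated normal density, the limiting exponential density, and the gamma mixture with $m = 2$ at $\tau = -3$. It is an illustration under the reading of the expansion coefficients given in this section, namely $(-1)^j \Gamma(2j)/\{\Gamma(j) 2^{j-1} |\tau|^{2j+1}\}$; the function names are assumptions and nothing here reproduces the author's own computations.

```python
import math


def norm_pdf(z):
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def norm_cdf(z):
    return 0.5 * math.erfc(-z / math.sqrt(2.0))


def truncated_normal_pdf(v, tau):
    """Density of V ~ TN(tau, 1; 0+) for v >= 0."""
    return norm_pdf(v - tau) / norm_cdf(tau)


def exponential_pdf(v, tau):
    """Limiting exponential density with rate |tau|."""
    a = abs(tau)
    return a * math.exp(-a * v)


def gamma_pdf(x, alpha, nu):
    """Gamma density with scale alpha and shape nu, for x >= 0."""
    if x == 0.0:
        return 1.0 / alpha if nu == 1 else 0.0
    log_f = (nu - 1) * math.log(x) - x / alpha - nu * math.log(alpha) - math.lgamma(nu)
    return math.exp(log_f)


def expansion_weights(abs_tau, m):
    """Unnormalised mixture weights; their sum is D_m(|tau|)."""
    weights = [1.0 / abs_tau]
    for j in range(1, m + 1):
        coeff = math.gamma(2 * j) / (math.gamma(j) * 2 ** (j - 1))
        weights.append((-1) ** j * coeff / abs_tau ** (2 * j + 1))
    return weights


def mixture_pdf(v, tau, m):
    """Weighted average of gamma densities with shapes 1, 3, ..., 2m + 1."""
    a = abs(tau)
    weights = expansion_weights(a, m)
    shapes = [1] + [2 * j + 1 for j in range(1, m + 1)]
    total = sum(w * gamma_pdf(v, 1.0 / a, s) for w, s in zip(weights, shapes))
    return total / sum(weights)


for v in (0.0, 0.5, 1.0):
    print(v, truncated_normal_pdf(v, -3.0), exponential_pdf(v, -3.0), mixture_pdf(v, -3.0, 2))
```

Note that some of the mixture weights are negative, so the mixture is a signed combination of gamma densities rather than a mixture in the usual probabilistic sense; the weights nevertheless sum to one after division by $D_m(|\tau|)$.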

2.2. Moments of the Truncated Normal Distribution

Expressions for moments of the truncated normal distribution are reported in [21], as well as in references cited above in Section 2.1. In the notation of the present paper, from Equation (9), the mean and variance of the truncated normal distribution are, respectively,
$$
E(V) = \tau + \xi_1(\tau), \quad \mathrm{var}(V) = 1 + \xi_2(\tau) = 1 - \tau\xi_1(\tau) - \xi_1(\tau)^2 = \beta_1. \tag{16}
$$
Skewness and kurtosis, defined here as the fourth cumulant, are respectively
$$
\kappa_3 = \xi_3(\tau) = \xi_1(\tau)\{\tau^2 - 1 + 3\tau\xi_1(\tau) + 2\xi_1(\tau)^2\} \tag{17}
$$
and
$$
\kappa_4 = \xi_4(\tau) = \xi_1(\tau)(3\tau - \tau^3) + \xi_1(\tau)^2(4 - 7\tau^2) - 12\tau\xi_1(\tau)^3 - 6\xi_1(\tau)^4. \tag{18}
$$
Kurtosis, the fourth moment about the mean and denoted by $\bar{\kappa}_4$, is
$$
\bar{\kappa}_4 = \xi_4(\tau) + 3\{1 + \xi_2(\tau)\}^2. \tag{19}
$$
Expressed in terms of $\xi_1(\tau)$, this is
$$
\bar{\kappa}_4 = 3 - \xi_1(\tau)(3\tau + \tau^3) - \xi_1(\tau)^2(2 + 4\tau^2) - 6\tau\xi_1(\tau)^3 - 3\xi_1(\tau)^4. \tag{20}
$$
Note that, from [25], $\kappa_3 \ge 0$ for all $\tau \in \mathbb{R}$. Using the first term of the asymptotic expansion for $\Phi(\tau)$ for $\tau \ll 0$, under which V has the exponential distribution at Equation (15), leads to the following expressions for the first four derivatives of $\log\Phi(\tau)$:
$$
\xi_1(\tau) \simeq -\tau - 1/\tau, \quad \xi_2(\tau) \simeq 1/\tau^2 - 1, \quad \xi_3(\tau) \simeq -2/\tau^3, \quad \xi_4(\tau) \simeq 6/\tau^4, \tag{21}
$$
where in this paper the notation $\simeq$ is taken to mean that the ratio of the two functions tends to unity as, in this case, $\tau \to -\infty$. These results give the same expressions for the first four moments as those computed from the exponential distribution at Equation (15). Table 2 shows the computed values of the first four moments of the truncated normal distribution, the limiting exponential distribution at Equation (15), and the mixture distribution based on Equation (13) with $m = 2$. Values are shown for $\tau = -3$, $-10$, and $-30$. In the table, kurtosis is the fourth moment about the mean, that is, $\bar{\kappa}_4$. As the table shows, the differences between the exact and approximate results are small and decline as $|\tau|$ increases. Whether a given approximation may be used as a practical alternative to the truncated normal will depend on the magnitude of $\tau$ and the application in question.
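A comparison in the spirit of Table 2 can be sketched by evaluating the exact cumulants in terms of $\xi_1(\tau)$ alongside those of the limiting exponential distribution, $1/|\tau|$, $1/\tau^2$, $2/|\tau|^3$ and $6/\tau^4$. The function names below are illustrative. The closed forms involve heavy cancellation when $|\tau|$ is large, so this naive double-precision sketch is only intended for moderate values such as $\tau = -3$ and $\tau = -10$.

```python
import math


def xi1(tau):
    """xi_1(tau) = phi(tau) / Phi(tau)."""
    pdf = math.exp(-0.5 * tau * tau) / math.sqrt(2.0 * math.pi)
    cdf = 0.5 * math.erfc(-tau / math.sqrt(2.0))
    return pdf / cdf


def truncated_normal_cumulants(tau):
    """Mean, variance, third and fourth cumulants of V ~ TN(tau, 1; 0+)."""
    x = xi1(tau)
    mean = tau + x
    var = 1.0 - tau * x - x * x
    k3 = x * (tau * tau - 1.0 + 3.0 * tau * x + 2.0 * x * x)
    k4 = (x * (3.0 * tau - tau**3) + x * x * (4.0 - 7.0 * tau * tau)
          - 12.0 * tau * x**3 - 6.0 * x**4)
    return mean, var, k3, k4


def exponential_cumulants(tau):
    """First four cumulants of the exponential distribution with rate |tau|."""
    a = abs(tau)
    return 1.0 / a, 1.0 / a**2, 2.0 / a**3, 6.0 / a**4


for tau in (-3.0, -10.0):
    print(tau, truncated_normal_cumulants(tau), exponential_cumulants(tau))
```

For larger $|\tau|$ a more careful evaluation, for example working with series for $\xi_1(\tau) + \tau$ directly, would be needed to avoid loss of precision.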

2.3. Standardized Form of the Extended Skew-Normal Distribution

Additional insights into the MESN distribution may be obtained by standardization. If $\Omega^{1/2}$ denotes a left square-root matrix of $\Omega$, the random n-vector Z now defined as
$$
Z = \Omega^{-1/2}(X - \alpha) \tag{22}
$$
satisfies $E(Z) = 0_n$ and $\mathrm{cov}(Z) = I_n$. The distribution of Z has the density function
$$
f_Z(z) = |\Omega|^{1/2}\,\phi_n\{\Omega^{1/2} z; -\lambda\xi_1(\tau), \Sigma + \lambda\lambda^T\}\,\Phi(\Delta_{SSN})/\Phi(\tau), \tag{23}
$$
where $\phi_n(\cdot)$ and $\omega$ are as defined for Equation (1) and
$$
\Delta_{SSN} = \tau\omega + \lambda^T\Sigma^{-1}\{\Omega^{1/2} z + \lambda\xi_1(\tau)\}/\omega. \tag{24}
$$
For the standardized form of the extended skew-normal distribution, coskewness and cokurtosis (also defined here in terms of the fourth cumulant) are given by
$$
\tilde{\lambda}_i\tilde{\lambda}_j\tilde{\lambda}_k\,\xi_3(\tau); \quad \tilde{\lambda}_i\tilde{\lambda}_j\tilde{\lambda}_k\tilde{\lambda}_l\,\xi_4(\tau), \tag{25}
$$
respectively, where $\tilde{\lambda}_i$ is the standardized value of the skewness or shape parameter defined as
$$
\tilde{\lambda}_i = \frac{\lambda_i}{[\sigma_i^2 + \lambda_i^2\{1 + \xi_2(\tau)\}]^{1/2}}. \tag{26}
$$
Both coskewness and cokurtosis tend to zero as $\tau \to \pm\infty$, in which case the limiting distribution of Z is the standard multivariate normal. A suitable transformation similar to that in Proposition 1 shows that the standardized MESN distribution may be expressed in canonical form, similar once again to that described in [7] and [15].
Proposition 2. Let $X \sim MESN_n(\mu, \Sigma, \lambda, \tau)$ and let $Y = \Omega^{-1/2}(X - \alpha)$, where $\Omega^{1/2}$ is a left square-root matrix of $\Omega$, and let
$$
Y = (Y_0, \tilde{Y}^T)^T.
$$
Then $\tilde{Y} \sim N_{n-1}(0_{n-1}, I_{n-1})$ and $Y_0 \sim ESN(\mu_c, \sigma_c^2, \psi_c, \tau)$ are independently distributed with
$$
\mu_c = -\psi_c\{\tau + \xi_1(\tau)\}; \quad \sigma_c^2 = \frac{1}{1 + \psi^T\psi\,\beta_1}; \quad \psi_c = \sqrt{\frac{\psi^T\psi}{1 + \psi^T\psi\,\beta_1}},
$$
and $\beta_1$ as defined at Equation (16) and $\psi$ in Proposition 1. Note that, as in Proposition 1, as $\tau \to -\infty$ the limiting distribution of $Y_0$ is the standard normal.
Figure 4 shows two sets of standardized extended skew-normal density functions. In both sets $\mu = 0$, $\sigma = 1$ and $\lambda = 5$. In the left-hand set, values of the extension parameter $\tau$ are set to −30, −15, −5, −2.5 and 0. In the right-hand set, the signs of $\tau$ are reversed. Both sets of densities illustrate that for $\tau \ne 0$ little asymmetry is apparent even when the shape parameter $\lambda$ is substantial; in this case five times greater than the scale parameter $\sigma$. Of the values of $\tau$ shown in the figure, only $\tau = 0$ leads to a density function with a discernible amount of asymmetry. Figure 5 shows two more sets of the skew-normal density functions. The panel on the left shows extended skew-normal density functions with $\mu = 0$, $\sigma = 1$ and $\lambda = 5$. The values of $\tau$ are −5, −2.5, −1, 0, 1, 2.5 and 5. The panel on the right-hand side consists of the corresponding densities standardized to have mean equal to zero and variance equal to one. The X-scales are the same in each panel. As Figure 5 shows, the skewness apparent for the extended skew-normal distributions reduces and largely disappears under standardization. There are analogous results for negative values of $\lambda$.
Table 3 shows a selection of moments for the extended skew-normal distribution and the corresponding standardized form for values of $\tau$ that are less than or equal to zero. Values of $\tau$ and $\lambda$ are as shown in the table. Values of the location and scale parameter are $\mu = 0$ and $\sigma = 1$ and are used in all numerical results. As the table shows, when $\tau \le -10$ the values of standardized skewness and kurtosis are numerically close to 0 and 3 respectively, thus supporting the result of Lemma 1. For λ = 0 and 1 there is evidence to support normality for τ < 1. Asymmetry is most evident when $\tau$ is zero or close to it. Table 4 shows the corresponding selection for positive values of $\tau$. The panel corresponding to $\tau = 0$ is repeated for ease of reading. The table indicates normality for $\tau \ge 5$. Asymmetry is evident when $\tau \le 1$. The panel with $\tau = 2.5$ has values of $\kappa_4$ that are negative.

3. Multivariate Extended Skew-Student Distribution

The multivariate extended skew-Student distribution, MEST, is an extension of the multivariate skew-Student distribution originally introduced by [2]. The extended version is reported in [26] and later in both [27,28]. Following [14], the former derives it as the convolution $X = U + \lambda V$, where the random vector $(U^T, V)^T$ of length $n + 1$ has a multivariate Student distribution with location parameter vector $(\mu^T, \tau)^T$ and scale matrix
$$
\begin{pmatrix} \Sigma & 0_n \\ 0_n^T & 1 \end{pmatrix}, \tag{27}
$$
with V truncated from below at zero. Consistent with the notation in Section 1, this is denoted $V \sim TT_\nu(\tau, 1; 0+)$, where $TT_\nu(\mu, \sigma^2; x+)$ denotes a Student’s t variable with location parameter $\mu$ and scale $\sigma$ truncated from below at x. The marginal distribution of U has the symmetric density function reported in Section 3.2 of [27] and independently in [28]. The probability density function of the distribution of X is
$$
f_X(x) = t_{\nu,n}(x; \mu + \lambda\tau, \Sigma + \lambda\lambda^T)\,T_{\nu+n}(\Delta_{ST})/T_\nu(\tau), \tag{28}
$$
where
$$
\Delta_{ST} = \sqrt{\nu + n}\;\frac{\tau\omega + \lambda^T\Sigma^{-1}(x - \mu - \lambda\tau)/\omega}{\sqrt{\nu + (x - \mu - \lambda\tau)^T(\Sigma + \lambda\lambda^T)^{-1}(x - \mu - \lambda\tau)}}, \tag{29}
$$
and where $\omega$ is as defined at Equation (2). $t_{\nu,n}(x; \mu, \Sigma)$ is the probability density function of an n-vector X which has a multivariate Student distribution with location parameter vector $\mu$ and scale matrix $\Sigma$, evaluated at x. $T_\nu(z)$ is the distribution function of a Student’s t variable with $\nu$ degrees of freedom evaluated at z and $t_\nu(z)$ is the corresponding density function. This distribution is denoted $X \sim MEST_n(\mu, \Sigma, \lambda, \tau; \nu)$. As in Section 2, this section of the paper presents basic properties of the MEST distribution. Similar to Table 1, unreported results show that nonzero values of $\tau$ make a substantial difference to the moments of the distribution. As $\tau \to \infty$, the limiting distribution of X is multivariate Student.
Proposition 3. Let $X \sim MEST_n(\mu, \Sigma, \lambda, \tau; \nu)$. The limiting distribution as $\tau \to \infty$ is multivariate Student with location parameter $\mu + \lambda\tau$, scale matrix $\Sigma + \lambda\lambda^T$, and $\nu$ degrees of freedom; denoted $X \sim MVT_n(\mu + \lambda\tau, \Sigma + \lambda\lambda^T; \nu)$.
The proof of this result uses the scale mixture representation reported in Lemma 3 of [29]. This result is consistent with the analogous property of the MESN distribution reported in Section 2. As shown later in this section, however, the limiting distribution of X as $\tau \to -\infty$ in the MEST case is different from that for the MESN.
Figure 6 shows sketches of the extended skew-Student density function for $\lambda = 0$ and $\nu = 3$. The left-hand panel shows density functions with negative values of $\tau$ ranging from $-30$ to $-1$. The right-hand side shows densities with positive values of $\tau$ ranging from 0 to 30. This symmetric density function is that reported in both [27,28]. Two notable features are, first, the similarity of the density function for increasing positive values of $\tau$, but, second, the increasing spread of the density function as $|\tau|$ increases for negative values of $\tau$. For $\lambda = 5$, the left-hand panel of Figure 7 shows density functions with the same negative values of $\tau$. The right-hand panel shows densities with $\tau$ ranging from 0 to 20. In both of these figures, $\mu = 0$ and $\sigma = 1$. In the right-hand panel of Figure 7, the density function is qualitatively similar to the corresponding skew-normal distribution: asymmetry disappears with increasing values of $\tau$, and the location parameter increases, but the spread does not. For negative values of $\tau$, the spread increases and asymmetry decreases with increasing values of $|\tau|$. To support the sketches in the figures, the moments of the extended skew-Student distribution are reported in Section 3.3 below.
A canonical form of the MEST distribution may be derived using an approach that is essentially the same as that in Proposition 1.
Proposition 4. Let $X \sim MEST_n(\mu, \Sigma, \lambda, \tau; \nu)$ and let $Y = \Sigma^{-1/2}(X - \mu)$, where $\Sigma^{1/2}$ is a left square-root matrix of $\Sigma$. Then $Y \sim MEST_n(0_n, I_n, \psi, \tau; \nu)$, $\psi = \Sigma^{-1/2}\lambda$, and there exists an orthogonal transformation of Y
$$
T^T Y = \tilde{Z} = (Z_0, Z^T)^T,
$$
such that the density function of $\tilde{Z}$ is
$$
f(z_0, z) = \frac{K_{\nu,n}}{\sqrt{1 + \psi_0^2}}\,\{1 + Q_c(\tilde{z})/\nu\}^{-(\nu+n)/2}\,\frac{T_{\nu+n}(\Delta_{ST,c})}{T_\nu(\tau)}; \quad \psi_0 = \sqrt{\psi^T\psi},
$$
where $K_{\nu,n}$ is the normalizing constant for an n-variate multivariate Student distribution with $\nu$ degrees of freedom,
$$
Q_c(\tilde{z}) = z^T z + \frac{(z_0 - \psi_0\tau)^2}{1 + \psi_0^2},
$$
and
$$
\Delta_{ST,c} = \sqrt{\nu + n}\;\frac{\tau\omega_c + \psi_0(z_0 - \psi_0\tau)/\omega_c}{\sqrt{\nu + Q_c(\tilde{z})}}; \quad \omega_c = \sqrt{1 + \psi_0^2}.
$$
Equivalently, $\tilde{Z} \sim MEST_n(0_n, I_n, \psi_n, \tau; \nu)$ where $\psi_n = (\psi_0, 0_{n-1}^T)^T$.
Standard manipulations show that $Z_0 \sim EST(0, 1, \psi_0, \tau; \nu)$ and that the marginal distribution of $Z$ has the symmetric Student-like density function reported in Section 3.2 of [27].

3.1. The Truncated Student’s t Distribution

Similar to the extended skew-normal, the properties of the extended skew-Student distribution are substantially affected by those of the truncated form of Student’s t. The density function of the truncated Student’s t variable V is
$$
f_V(v) = t_{\nu,1}(v; \tau, 1)/T_\nu(\tau). \tag{30}
$$
Figure 8 shows sketches of the truncated Student t density function for $\tau = -35$, together with two approximating beta type-2 density functions as described below in Lemma 5. The degrees of freedom $\nu$ are 5 and 20, respectively. For a fixed value of $\tau$, the figure illustrates the increasing severity of decay as $\nu$ increases. It is notable that for ν 5 , the truncated Student t is well approximated by the beta type-2 densities.

3.2. Moments of the Truncated Student’s t Distribution

Moments of the truncated distribution at Equation (30) may be evaluated directly. Note that expressions for the moments of a doubly truncated t distribution may be found in [30]. As reported in [27], for $\nu > 1$ and $\nu > 2$, respectively, the mean and variance of this distribution are
$$
E(V) = \tau + \xi_\nu(\tau), \quad \mathrm{var}(V) = \eta_\nu(\tau) - \xi_\nu(\tau)^2, \tag{31}
$$
where
$$
\xi_\nu(\tau) = \frac{\nu\,(1 + \tau^2/\nu)\,t_\nu(\tau)}{(\nu - 1)\,T_\nu(\tau)}, \quad \eta_\nu(\tau) = -\tau\xi_\nu(\tau) + \frac{\nu\,T_{\nu-2}\{\tau\sqrt{(\nu - 2)/\nu}\}}{(\nu - 2)\,T_\nu(\tau)}. \tag{32}
$$
The following result, derived using integration by parts, leads to a more useful representation of $\eta_\nu(\tau)$.
Lemma 3. For $\nu > 2$, the following result holds
$$
\frac{\tau\,t_\nu(\tau)\,(1 + \tau^2/\nu)}{(\nu - 1)\,T_\nu(\tau)} + \frac{T_{\nu-2}\{\tau\sqrt{(\nu - 2)/\nu}\}}{T_\nu(\tau)} = 1.
$$
Using this result, for $\nu > 2$, the functions $\eta_\nu(\tau)$ and $\xi_\nu(\tau)$ are related by the identity
$$
\eta_\nu(\tau) = \frac{\nu}{\nu - 2} - \frac{\tau(\nu - 1)\,\xi_\nu(\tau)}{\nu - 2}. \tag{33}
$$
Equation (33) allows the variance to be written as
$$
\mathrm{var}(V) = \frac{\nu}{\nu - 2} - \frac{\tau(\nu - 1)\,\xi_\nu(\tau)}{\nu - 2} - \xi_\nu(\tau)^2. \tag{34}
$$
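Lemma 3 and the equivalence of the two expressions for $\eta_\nu(\tau)$ can be checked numerically. The sketch below uses only the Python standard library: the Student's t distribution function is obtained by Simpson's rule after the substitution $w = a + s/(1 - s)$, which is an implementation choice made here for illustration and not a method from the paper. The helper names are assumptions, and the quadrature helper requires the degrees of freedom passed to it to exceed one, so the check is run for $\nu > 3$.

```python
import math


def t_pdf(x, nu):
    """Density of Student's t with nu degrees of freedom."""
    log_c = math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2) - 0.5 * math.log(nu * math.pi)
    return math.exp(log_c - 0.5 * (nu + 1) * math.log1p(x * x / nu))


def t_upper_tail(a, nu, n=4000):
    """P(W > a) for a >= 0 and nu > 1, by Simpson's rule with w = a + s / (1 - s)."""
    def g(s):
        if s >= 1.0:
            return 0.0  # the transformed integrand vanishes at s = 1 when nu > 1
        return t_pdf(a + s / (1.0 - s), nu) / (1.0 - s) ** 2
    h = 1.0 / n
    total = g(0.0) + g(1.0)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * g(i * h)
    return total * h / 3.0


def t_cdf(x, nu):
    """Distribution function T_nu(x), using symmetry about zero."""
    tail = t_upper_tail(abs(x), nu)
    return tail if x < 0 else 1.0 - tail


def xi_nu(tau, nu):
    """xi_nu(tau) as in the first expression of Equation (32)."""
    return nu * (1.0 + tau * tau / nu) * t_pdf(tau, nu) / ((nu - 1.0) * t_cdf(tau, nu))


def eta_nu_direct(tau, nu):
    """eta_nu(tau) written with T_{nu-2}, the second expression of Equation (32)."""
    ratio = t_cdf(tau * math.sqrt((nu - 2.0) / nu), nu - 2.0) / t_cdf(tau, nu)
    return -tau * xi_nu(tau, nu) + nu * ratio / (nu - 2.0)


def eta_nu_identity(tau, nu):
    """eta_nu(tau) written in terms of xi_nu(tau) only, Equation (33)."""
    return nu / (nu - 2.0) - tau * (nu - 1.0) * xi_nu(tau, nu) / (nu - 2.0)


def lemma3_lhs(tau, nu):
    """Left-hand side of the identity in Lemma 3; should equal one."""
    big_t = t_cdf(tau, nu)
    first = tau * t_pdf(tau, nu) * (1.0 + tau * tau / nu) / ((nu - 1.0) * big_t)
    second = t_cdf(tau * math.sqrt((nu - 2.0) / nu), nu - 2.0) / big_t
    return first + second


for tau, nu in ((-2.0, 5.0), (1.5, 5.0), (-0.5, 8.0)):  # hypothetical test points
    print(tau, nu, lemma3_lhs(tau, nu), eta_nu_direct(tau, nu), eta_nu_identity(tau, nu))
```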
Note that $\lim_{\nu \to \infty} \xi_\nu(\tau) = \xi_1(\tau)$ is sufficient to show that the limiting values in Equation (31) equal those for the truncated normal at Equation (16). For $\nu > 3$ and $\nu > 4$, skewness and kurtosis (the fourth moment about the mean), respectively, are
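$$
\kappa_3(V) = \xi_\nu(\tau)\left\{\frac{\nu - 1}{\nu - 3}\,\tau^2 - \frac{\nu(\nu - 5)}{(\nu - 2)(\nu - 3)} + \frac{3(\nu - 1)}{\nu - 2}\,\tau\xi_\nu(\tau) + 2\xi_\nu(\tau)^2\right\} \tag{35}
$$
and
$$
\bar{\kappa}_4(V) = \frac{3\nu^2}{(\nu - 2)(\nu - 4)} + K_1\xi_\nu(\tau) + K_2\xi_\nu(\tau)^2 + K_3\tau\xi_\nu(\tau)^3 - 3\xi_\nu(\tau)^4, \tag{36}
$$
where
$$
K_1 = -\frac{\nu - 1}{\nu - 4}\left\{\tau^3 + \frac{3\nu\tau}{\nu - 2}\right\}; \quad K_2 = -\left\{\frac{2\nu(\nu + 1)}{(\nu - 2)(\nu - 3)} + \frac{4(\nu - 1)\tau^2}{\nu - 3}\right\}, \tag{37}
$$
and
$$
K_3 = -\frac{6(\nu - 1)}{\nu - 2}. \tag{38}
$$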
As already noted above, reference [25] showed that the skewness of the truncated normal distribution is non-negative for all values of $\tau$. The following shows that the same result holds for the truncated Student distribution.
Proposition 5. Let $V \sim TT_\nu(\tau, 1; 0+)$. For $\nu > 3$, the following result holds: $\kappa_3(V) \ge 0$.
The proof is by contradiction. First, note that since $\xi_\nu(\tau) \ge 0$, the sign of $\kappa_3(V)$ is determined by the sign of the expression in braces in Equation (35). This quadratic function of $\xi_\nu(\tau)$ has roots
$$
-\frac{3(\nu - 1)\tau}{4(\nu - 2)} \pm \frac{1}{4}\sqrt{\left\{\frac{(\nu - 1)\tau}{\nu - 2}\right\}^2 \frac{(\nu + 1)(\nu - 5)}{(\nu - 1)(\nu - 3)} + \frac{8\nu(\nu - 5)}{(\nu - 2)(\nu - 3)}}.
$$
Since the coefficient of $\xi_\nu(\tau)^2$ is positive, the function is negative between the roots, which is a contradiction.
Note that as $\nu \to \infty$, Proposition 5 also establishes Sampford’s result, and note that the expressions for the first four moments tend to those for the truncated normal distribution at Equations (16), (17), and (19).
Computation of limiting expressions for the moments as τ requires a result that is analogous to the well-known asymptotic expression for normal distribution reported in [24]. Such a result was first reported in [31]. As it does not appear to be well known, it is summarized below in the notation of this paper.
Lemma 4
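([31]). For values of $\tau$ that are less than zero, the asymptotic expansion of $T_\nu(\tau)$ is
$$t_\nu(\tau)\left(1+\tau^2/\nu\right)\left\{\frac{1}{|\tau|}+\sum_{j=1}^{m}\frac{(-1)^j\,\Gamma(2j)\,a_j}{\Gamma(j)\,2^{j-1}\,|\tau|^{2j+1}}\right\}+R_{m,\nu}(|\tau|);\quad a_j=\frac{\nu^j\,\Gamma(\nu/2+1)}{2^j\,\Gamma(\nu/2+1+j)}.$$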
Noting that with suitable choices of m and values of τ , the remainder term R m .
R m , ν | τ | = 1 m + 1 Γ 2 m 2 m + 1 a m + 1 Γ m 2 m 1 τ t ν x 1 + τ 2 / ν x 2 m + 1 d x ,
may be ignored.
Using the first two terms in the expansion in Lemma 4 for $\tau<0$ and $\nu>1$ gives
$$\xi_\nu(\tau)\simeq\frac{\nu|\tau|}{\nu-1}+\frac{\nu^2}{(\nu-1)(\nu+2)|\tau|},$$
from which the asymptotic expected value is
$$E(V)=\tau+\xi_\nu(\tau)\simeq\frac{|\tau|}{\nu-1}+\frac{\nu^2}{(\nu-1)(\nu+2)|\tau|};\quad\tau<0.$$
For $\nu>2$, the corresponding expression for the asymptotic variance is
$$\frac{\nu|\tau|^2}{(\nu-1)^2(\nu-2)}+\frac{2\nu(\nu^2-\nu+1)}{(\nu-1)^2(\nu-2)(\nu+2)}+\frac{\nu^3(\nu^3-4\nu^2+2\nu-2)}{(\nu-1)^2(\nu-2)(\nu+2)^2(\nu+4)|\tau|^2}.$$
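The quality of these approximations can be examined with a short numerical comparison. The sketch below is illustrative only. It assumes that $\xi_\nu(\tau)=\nu(1+\tau^2/\nu)\,t_\nu(\tau)/\{(\nu-1)T_\nu(\tau)\}$ and computes the lower tail $T_\nu(\tau)$ for $\tau<0$ by Simpson's rule after the substitution $u=|\tau|/s$. The function names are hypothetical.

```python
import math

def t_const(nu):
    """Normalizing constant of the Student's t density."""
    return math.exp(math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2)
                    - 0.5 * math.log(nu * math.pi))

def t_lower_tail(tau, nu, n=2000):
    """T_nu(tau) for tau < 0 and nu > 1, integrating over s in [0, 1] with u = |tau| / s."""
    a, c = abs(tau), t_const(nu)

    def g(s):
        # Transformed integrand: t_nu(a / s) * a / s**2, written without dividing by s.
        return c * a * s ** (nu - 1) * (s * s + a * a / nu) ** (-(nu + 1) / 2)

    h = 1.0 / n
    total = g(0.0) + g(1.0)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(i * h)
    return total * h / 3

def xi_exact(tau, nu):
    """Assumed form of xi_nu(tau), valid here for tau < 0."""
    pdf = t_const(nu) * (1 + tau * tau / nu) ** (-(nu + 1) / 2)
    return nu / (nu - 1) * (1 + tau * tau / nu) * pdf / t_lower_tail(tau, nu)

def xi_asymptotic(tau, nu):
    a = abs(tau)
    return nu * a / (nu - 1) + nu**2 / ((nu - 1) * (nu + 2) * a)

def var_exact(tau, nu):
    x = xi_exact(tau, nu)
    return nu / (nu - 2) - tau * (nu - 1) * x / (nu - 2) - x * x

def var_asymptotic(tau, nu):
    a2 = tau * tau
    d = (nu - 1) ** 2 * (nu - 2)
    return (nu * a2 / d
            + 2 * nu * (nu**2 - nu + 1) / (d * (nu + 2))
            + nu**3 * (nu**3 - 4 * nu**2 + 2 * nu - 2)
            / (d * (nu + 2) ** 2 * (nu + 4) * a2))
```

Calling `xi_exact(-20.0, 5)` and `xi_asymptotic(-20.0, 5)`, or the two variance functions, gives values that agree closely, and the agreement improves as $|\tau|$ grows.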
Thus, for fixed finite degrees of freedom $\nu>2$, the expected value and variance increase without limit as $\tau\to-\infty$. As $\nu\to\infty$, the expected value and variance tend to $1/|\tau|$ and $1/|\tau|^2$, respectively, the results for the truncated normal distribution. The corresponding expressions for skewness and kurtosis are omitted in view of their complexity. However, if just the terms proportional to $|\tau|^3$ are considered, then for $\nu>3$ as $\tau\to-\infty$, asymptotic skewness is
2 ν | τ | 3 ν 1 ν 2 ν 1 2 ν ν 2 + 1 ν 2 ν 3 .
Similarly, for $\nu>4$, asymptotic kurtosis is proportional to $|\tau|^4$. Table 5 shows a selection of moments from the truncated Student’s t distribution. As $\tau$ increases above zero, the distribution increasingly resembles Student’s t, as demonstrated by the values in the bottom panel of the table. The top panel, corresponding to $\tau=-35$, shows the increasing values of the moments. The analog of the limiting exponential distribution that arises in the normal case described in Section 2.1 is as follows.
Lemma 5.
Let V T T ν τ ˜ , 1 ; 0 + . For τ < < 0 , as the ratio | τ | / ν increases without limit, the asymptotic distribution of Y = V / | τ | is β I I 1 , ν , that is, with density function.
$$f_Y(y)=\frac{\nu}{(1+y)^{\nu+1}}.$$
The proof of this lemma is in Appendix A. An asymptotically equivalent result is that the variable Y ˜ = V / ν + | τ | 2 is also distributed as β I I 1 , ν .
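Lemma 5 can be illustrated numerically. Since $V$ is a Student variable with location $\tau$ truncated below at zero, the exact survivor function of $Y=V/|\tau|$ for $\tau<0$ is $T_\nu\{-|\tau|(1+y)\}/T_\nu(-|\tau|)$, which may be compared with the $\beta_{II}(1,\nu)$ survivor function $(1+y)^{-\nu}$. The sketch below is a check under these assumptions, not the proof in Appendix A. The function names are hypothetical.

```python
import math

def t_lower_tail(tau, nu, n=2000):
    """T_nu(tau) for tau < 0 and nu > 1, by Simpson's rule after substituting u = |tau| / s."""
    a = abs(tau)
    c = math.exp(math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2)
                 - 0.5 * math.log(nu * math.pi))

    def g(s):
        return c * a * s ** (nu - 1) * (s * s + a * a / nu) ** (-(nu + 1) / 2)

    h = 1.0 / n
    total = g(0.0) + g(1.0)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(i * h)
    return total * h / 3

def survivor_exact(y, tau, nu):
    """P(V / |tau| > y) for the truncated Student variable V, with tau < 0."""
    a = abs(tau)
    return t_lower_tail(-a * (1 + y), nu) / t_lower_tail(-a, nu)

def survivor_beta2(y, nu):
    """Survivor function of the limiting density nu / (1 + y)**(nu + 1)."""
    return (1 + y) ** (-nu)
```

With, say, `tau = -50.0` and `nu = 4`, the two survivor functions agree to within a fraction of a percent over a range of values of `y`.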
It is straightforward to show that the conditional distribution of $X$ given $V=v$ follows a multivariate Student distribution with $\nu+1$ degrees of freedom, location parameter vector $\mu+\lambda v$, and scale matrix
$$\frac{\nu}{\nu+1}\left\{1+\frac{(v-\tau)^2}{\nu}\right\}\Sigma.$$
Use of this distribution in conjunction with the asymptotic distribution of $V$ in Equation (45) for $\tau<0$ does not lead to tractable results that are analogous to those in Section 2.

3.3. Moments of the MEST Distribution

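For $\nu>1$ and $\nu>2$, respectively, the mean vector and covariance matrix of the MEST distribution are
$$E(X)=\mu+\lambda\{\tau+\xi_\nu(\tau)\}=\alpha_\nu,$$
and
$$\mathrm{cov}(X)=\frac{\nu\{1+\eta_\nu(\tau)/\nu\}}{\nu-1}\,\Sigma+\lambda\lambda^T\{\eta_\nu(\tau)-\xi_\nu(\tau)^2\}=\Omega_\nu.$$
Using the identity at Equation (33) allows the covariance matrix to be written as
$$\left\{\frac{\nu}{\nu-2}-\frac{\tau}{\nu-2}\,\xi_\nu(\tau)\right\}\Sigma+\lambda\lambda^T\left\{\frac{\nu}{\nu-2}-\frac{\nu-1}{\nu-2}\,\tau\,\xi_\nu(\tau)-\xi_\nu(\tau)^2\right\}.$$
The similarity of the coefficient of $\lambda\lambda^T$ to the corresponding term in Equation (6) may be noted. The coefficient of $\Sigma$ provides the inequality $\nu-\tau\,\xi_\nu(\tau)\ge 0$.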
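As a small illustration, the two forms of the covariance matrix can be coded side by side. In the sketch below, the value of $\xi_\nu(\tau)$ is passed in as a number, $\eta_\nu(\tau)$ is obtained from the identity at Equation (33), and matrices are plain nested lists. The function names and the numerical inputs are hypothetical.

```python
def eta_from_xi(xi, tau, nu):
    """eta_nu(tau) from the identity at Equation (33); requires nu > 2."""
    return nu / (nu - 2) - tau * (nu - 1) * xi / (nu - 2)

def mest_mean(mu, lam, tau, xi):
    """Mean vector mu + lambda * (tau + xi)."""
    return [m + l * (tau + xi) for m, l in zip(mu, lam)]

def mest_cov(sigma, lam, tau, nu, xi):
    """Covariance matrix in its first form, using eta explicitly."""
    eta = eta_from_xi(xi, tau, nu)
    a = nu * (1 + eta / nu) / (nu - 1)   # coefficient of Sigma
    b = eta - xi**2                      # coefficient of lambda lambda^T
    n = len(lam)
    return [[a * sigma[i][j] + b * lam[i] * lam[j] for j in range(n)]
            for i in range(n)]

def mest_cov_alt(sigma, lam, tau, nu, xi):
    """Covariance matrix after substituting the identity at Equation (33)."""
    a = (nu - tau * xi) / (nu - 2)
    b = nu / (nu - 2) - (nu - 1) * tau * xi / (nu - 2) - xi**2
    n = len(lam)
    return [[a * sigma[i][j] + b * lam[i] * lam[j] for j in range(n)]
            for i in range(n)]

# Illustrative inputs: xi = 2.873 is only an approximate value for tau = -2, nu = 5.
sigma = [[1.0, 0.3], [0.3, 2.0]]
lam = [0.5, -1.0]
cov_a = mest_cov(sigma, lam, tau=-2.0, nu=5, xi=2.873)
cov_b = mest_cov_alt(sigma, lam, tau=-2.0, nu=5, xi=2.873)
```

The two matrices `cov_a` and `cov_b` coincide up to rounding, since the second form is an algebraic rearrangement of the first.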
The skewness of a single variable X i in X with scale denoted by σ may be expressed in terms of the moments of V the truncated Student’s t variable, specifically Equations (34) and (35), and is given by
κ 3 X i = 6 λ i σ 2 ν 1 v a r V + λ i λ i 2 + 3 σ 2 ν 1 κ 3 V ,
Defining the constants
K 4 = 3 ν 2 σ 4 ν 1 ν 3 , K 5 = 6 λ 2 σ 2 ν ν 1 , χ = 1 + ξ ν τ 2 ν , ϑ = 1 + 3 ξ ν τ 2 ν
The kurtosis of X i is given by
κ ¯ 4 = K 4 χ 2 + 2 K 4 ν ϑ + K 5 χ v a r V + 2 ξ ν τ ν 2 K 4 ν + K 5 κ 3 V + K 4 + K 5 ν + λ 4 κ ¯ 4 V
The corresponding expressions for coskewness and cokurtosis are omitted. A selection of moments of the extended skew-Student is shown in Table 6, Table 7, Table 8 and Table 9. Table 6 [Table 7] shows results for $\tau\le 0$ [$\tau\ge 0$] for $\lambda=0$. The panel for $\tau=0$ is repeated for convenience and corresponds to Student’s t distribution. The lower panels of Table 6 show the increasing magnitude of variance and kurtosis as $|\tau|$ increases, even for $\lambda=0$. Table 8 and Table 9 show the corresponding results for $\lambda=5$. Note that in Table 8, some large results are shown to two decimal places only to preserve the formatting.

3.4. Standardized Forms of the MEST Distribution

As in Section 2.3, further insights into the extended skew-Student distribution may be obtained by standardization. If $\Omega_\nu^{1/2}$ denotes a left square root matrix of $\Omega_\nu$, the random vector $Z$ now defined as
$$Z=\Omega_\nu^{-1/2}(X-\alpha_\nu)$$
satisfies $E(Z)=0_n$ and $\mathrm{cov}(Z)=I_n$. The distribution of $Z$ has the density function
f Z z = | Ω ν | 1 / 2 t ν , n Ω ν 1 / 2 z , λ ξ ν τ , + λ λ T T ν + n Δ S S T T ν τ ,
where t ν , n . is as defined for Equation (28), ω is as defined for Equation (1) and
Δ S S T = ν + n τ ω + λ T 1 Ω ν 1 / 2 z + λ ξ ν τ / ω ν + Ω ν 1 / 2 z + λ ξ ν τ T + λ λ T 1 Ω ν 1 / 2 z + λ ξ ν τ ,
The distribution at Equation (54) has a canonical form. First, define
β 0 , ν = ν ν 1 1 + η ν τ ν ; β 1 , ν = η ν τ ξ ν τ 2 β 0 ,
partition Z into a scalar Z 0 and an n 1 -vector Z 1 and let Q Z be the quadratic form
Q z = β 0 , ν z 1 T z 1 + z ˜ 0 2 ; z ˜ 0 = β 0 , ν 1 + β 1 , ν ψ 0 2 z 0 + ψ 0 ξ ν τ 1 + ψ 0 2 ; ψ 0 = ψ T ψ ,
where ψ is as defined in Proposition 1. Methods similar to those used in that proposition give the following result.
Proposition 6.
Let X M E S T n ( μ , , λ , τ ; ν ) and Z = Ω ν 1 / 2 X α ν , where Ω ν 1 / 2 is a left square-root matrix of Ω ν . Then Z M E S T n ( μ c , ν , c , ν , λ c , ν , τ ; ν ) where
μ c , ν = λ c , ν τ + ξ ν τ ; λ c , ν = λ ν , 0 n 1 T T ; λ ν = ψ 0 β 0 , ν ( 1 + β 1 , ν ψ 0 2 ) ,
and
c , ν = σ ν 2 0 n 1 T 0 n 1 T I n 1 / β 0 , ν , σ ν 2 = 1 / β 0 , ν ( 1 + β 1 , ν ψ 0 2 ) .
The density function of Z is
f Z z = β 0 n 1 + β 1 ψ 0 2 1 + ψ 0 2 K ν , n 1 + Q z ν ν + n / 2 T ν + n Δ C S S T T ν τ ,
where
Δ C S S T = ν + n τ 1 + ψ T ψ + ψ T ψ z ˜ 0 ν + Q z .
As Equations (58) and (59) show, under the canonical representation, the asymmetry in the density function is attributable solely to the scalar variable $\tilde{Z}_0$. The marginal distribution of $Z_1$ is symmetric and of the same type reported in Section 3.2 of [27]. Examples of the EST and standardized EST density functions are shown in Figure 9 for τ = 30 and 5 and ν = 10, 20, and 100. In the upper (lower) row, λ = 0 [5]. The X-scales are the same in each panel. The graphs confirm results from Table 8 and Table 9, namely that the degree of asymmetry is reduced under standardization. Examples of contour plots for the bivariate EST and standardized EST distributions are shown in Figure 10.
To investigate the behavior of the distribution as $\tau\to-\infty$ for fixed $\nu$, consider the scalar variable $Z_0$, which has the marginal distribution $EST(\mu_\nu,\sigma_\nu^2,\lambda_\nu,\tau;\nu)$, where $\mu_\nu=-\lambda_\nu\{\tau+\xi_\nu(\tau)\}$. For $\nu>2$, define
A = ν ( ν 2 ) ψ 0 ν 1 + ψ 0 2 ; B = ν 1 + ψ 0 2 / ( ν 1 ) ( ν 1 ) ( ν 2 ) ( 1 + ψ 0 2 ) .
As $\tau\to-\infty$ for fixed $\nu$, the asymptotic density function of $Z_0$ is
f z 0 = | τ | B K ν , 1 1 + z 0 + A 2 | τ | 2 B / ν ( ν + 1 ) / 2 T ν + 1 Δ E S T T ν τ ,
where
Δ E S T = ν + 1 ν τ 1 + ψ 0 2 + ψ 0 z 0 + A | τ | B 1 + z 0 + A 2 | τ | 2 B / ν .
This leads to the following result:
Proposition 7.
For ν > 2 , as τ / ν , the distribution of Z 0 has the asymptotic density function
f z 0 = ν B ν / 2 | z 0 + A | ν + 1 T ν + 1 ν + 1 ± ψ 0 1 + ψ 0 2 B 1 / 2 | z 0 + A | ; z 0 A ,
with the sign of ψ 0 determined by the sign of z 0 + A , and
f A = 1 + ψ 0 2 ν 3 / 2 K ν + 1 , 1 1 + ψ 0 2 ν / 2 + 1 ( ν 1 ) ( ν 2 ) ( ν + 1 ) .
The result in this proposition requires the asymptotic expression for the distribution function of Student’s t. As noted above, such a result was first provided by [31] and is summarized in Lemma 4. Comparative examples of the exact and asymptotic EST density functions are shown in Figure 11. The implication of Proposition 7 is that as $\tau\to-\infty$, the standardized distribution is qualitatively similar to the corresponding form for the extended skew-normal in that dependence on $\tau$ disappears. For nonzero values of $\lambda$ or $\psi_0$, however, the distribution remains asymmetric. It is important to note, though, that unlike the MESN, dependence on $\tau$ as it tends to $-\infty$ does not disappear in the nonstandardized MEST case. In addition to Proposition 7, recall from results in Section 3.2 and Section 3.3 that for finite degrees of freedom, the location parameter vector depends on $|\tau|$ and the covariance matrix on $|\tau|^2$.

4. Hidden Truncation Models

In their simple form, hidden truncation models are concerned with the bivariate normal distribution of X , Y in situations in which X is observed if Y is greater than (less than) a given threshold, here denoted τ ˜ . The procedure is commonly referred to as selective sampling. The resulting conditional distribution is that of X | Y τ ˜ . Such a construction is reported in a more general form in [12] for the case in which the scalar X is replaced by a random vector X . The phrase hidden truncation models is more often associated with the [13] in which they refer to an earlier work [32]. In selective sampling situations, it seems self-evident that the threshold τ ˜ will depend on the application in question. This is clearly implied in Section 2 of [13] in which they denote the threshold by α and report the resulting distribution of X conditional on Y α , which is the extended skew-normal. The extended version of the skew-normal is also described in [33]. In the introduction to a sole-authored later paper, [34], Y is assumed to exceed its expected value. This case is more in keeping with the skew-normal literature, which does not generally employ the extended version of the distribution. Subsequent sections of [34], however, are inter alia concerned with extended versions of the skew-normal and other distributions.
The aim of this section is to present limiting forms of the extended skew-normal and skew-Student distributions when they are derived as hidden truncation models. Consistent with the results in Section 2 and Section 3, the limiting distributions exhibit different properties. The distributions of the hidden truncated variable Y and the observed vector X differ markedly depending on whether the underlying form is normal or Student’s t. In selective sampling, limiting forms of the distributions arise when the notional observation on the conditioning variable Y is required to be in one of the tails of its distribution. To illustrate the differences between the hidden truncation skew-normal and skew-Student distributions, either extended or not, this section contains a table of critical values corresponding to a probability of 0.025. Critical values corresponding to other probabilities are available on request. In addition to these general results, Section 4.4 describes an application to stock market crashes, in which the truncated variable is not only material to the resulting distribution but is also observed.

4.1. Hidden Truncation under the Normal Distribution

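It is assumed that the n-vector $X$ and a scalar variable denoted $Y$ have a multivariate normal distribution
$$\begin{pmatrix}X\\ Y\end{pmatrix}\sim N_{n+1}\left\{\begin{pmatrix}\mu_X\\ \mu_Y\end{pmatrix},\begin{pmatrix}\Sigma_X&\delta\\ \delta^T&\sigma_Y^2\end{pmatrix}\right\}.$$
The conditional distribution of $X$, given that $Y\le\tilde{\tau}$, has the probability density function
$$f_{X|Y\le\tilde{\tau}}(x\,|\,Y\le\tilde{\tau})=\phi_n(x;\mu_X,\Sigma_X)\,\frac{\Phi\left[\{\tau\sigma_Y-\delta^T\Sigma_X^{-1}(x-\mu_X)\}/\omega_Y\right]}{\Phi(\tau)},$$
where
$$\omega_Y=\sqrt{\sigma_Y^2-\delta^T\Sigma_X^{-1}\delta},\quad\tau=(\tilde{\tau}-\mu_Y)/\sigma_Y.$$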
The moment-generating function of the conditional distribution of $X$ given $Y\le\tilde{\tau}$ is
$$M_X(t)=e^{\mu_X^Tt+t^T\Sigma_Xt/2}\,\frac{\Phi(\tau-\delta^Tt/\sigma_Y)}{\Phi(\tau)},$$
and that of $Y$ given $Y\le\tilde{\tau}$ is
$$M_Y(s)=e^{\mu_Ys+s^2\sigma_Y^2/2}\,\frac{\Phi(\tau-\sigma_Ys)}{\Phi(\tau)}.$$
Noting the similarity to the MGF of the truncated variable denoted $V$ in Section 2.1, it follows that
$$E(Y\,|\,Y\le\tilde{\tau})=\mu_Y-\sigma_Y\,\xi_1(\tau),\quad\mathrm{var}(Y\,|\,Y\le\tilde{\tau})=\sigma_Y^2\{1+\xi_2(\tau)\}.$$
As $\tilde{\tau}\to-\infty$, the variable $Y$ given that it is less than or equal to $\tilde{\tau}$ becomes deterministic in the sense that its expected value is asymptotically equal to $\tilde{\tau}$, but its variance and all higher moments are asymptotically equal to zero. The conditional expected value and covariance matrix of $X$ are, respectively,
$$E(X\,|\,Y\le\tilde{\tau})=\mu_X-(\delta/\sigma_Y)\,\xi_1(\tau),\quad\mathrm{cov}(X\,|\,Y\le\tilde{\tau})=\Sigma_X+(\delta\delta^T/\sigma_Y^2)\,\xi_2(\tau).$$
As $\tilde{\tau}\to-\infty$, the vector of expected values and the covariance matrix become
$$E(X\,|\,Y\le\tilde{\tau})\simeq E(X\,|\,Y=\tilde{\tau}),\quad\mathrm{cov}(X\,|\,Y\le\tilde{\tau})\simeq\mathrm{cov}(X\,|\,Y=\tilde{\tau}).$$
It is interesting to note that element $i$ of the vector of expected values decreases or increases depending upon whether $\delta_i$ is positive or negative. The joint moment-generating function of $X$ and $Y$ conditional on $Y\le\tilde{\tau}$ is
$$M_{X,Y|Y\le\tilde{\tau}}(t,s)=e^{\mu_X^Tt+t^T\Sigma_Xt/2+t^T\delta s+\mu_Ys+s^2\sigma_Y^2/2}\,\frac{\Phi(\tau-\delta^Tt/\sigma_Y-\sigma_Ys)}{\Phi(\tau)},$$
from which
$$\mathrm{cov}(X,Y\,|\,Y\le\tilde{\tau})=\delta\{1+\xi_2(\tau)\}.$$
Using similar arguments to those for Lemma 1, as $\tilde{\tau}\to-\infty$, the covariances all tend to zero as expected.
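The limiting behavior described in this subsection can be seen in a few lines of code. The sketch below assumes the definitions $\xi_1(\tau)=\phi(\tau)/\Phi(\tau)$ and $\xi_2(\tau)=-\xi_1(\tau)\{\tau+\xi_1(\tau)\}$, the first and second derivatives of $\log\Phi(\tau)$; these definitions come from earlier in the paper and are restated here as an assumption. The function names are hypothetical.

```python
import math

def xi1(tau):
    """phi(tau) / Phi(tau), using erfc so that the lower tail stays accurate."""
    phi = math.exp(-0.5 * tau * tau) / math.sqrt(2 * math.pi)
    big_phi = 0.5 * math.erfc(-tau / math.sqrt(2))
    return phi / big_phi

def xi2(tau):
    """Assumed second derivative of log Phi(tau)."""
    x = xi1(tau)
    return -x * (tau + x)

def truncated_y_moments(mu_y, sigma_y, tau_tilde):
    """Mean and variance of Y given Y <= tau_tilde."""
    tau = (tau_tilde - mu_y) / sigma_y
    return mu_y - sigma_y * xi1(tau), sigma_y**2 * (1 + xi2(tau))

def conditional_x_moments(mu_x, sigma_x, delta, mu_y, sigma_y, tau_tilde):
    """Mean vector and covariance matrix of X given Y <= tau_tilde (nested lists)."""
    tau = (tau_tilde - mu_y) / sigma_y
    x1, x2 = xi1(tau), xi2(tau)
    n = len(mu_x)
    mean = [mu_x[i] - delta[i] / sigma_y * x1 for i in range(n)]
    cov = [[sigma_x[i][j] + delta[i] * delta[j] / sigma_y**2 * x2
            for j in range(n)] for i in range(n)]
    return mean, cov
```

For a threshold eight standard deviations below the mean, for instance `truncated_y_moments(0.0, 1.0, -8.0)`, the conditional mean of $Y$ is close to the threshold and the conditional variance is close to zero, while the conditional covariance matrix of $X$ is close to $\Sigma_X-\delta\delta^T/\sigma_Y^2$.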

4.2. Hidden Truncation under Student’s t Distribution

It is now assumed that the n-vector X and a scalar variable Y have a multivariate Student distribution with ν degrees of freedom. The conditional distribution of X , given that Y τ ˜ , has the probability density function
f X | Y τ ˜ x | Y τ ˜ = t n , ν x , μ , T ν + n τ σ Y δ T 1 x μ / ( ω Y Ψ ) T ν τ
where ω Y and τ are as defined above and
Ψ = ν + x μ T 1 x μ / ν + n .
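The conditional mean and variance of $Y$ are
$$E(Y\,|\,Y\le\tilde{\tau})=\mu_Y-\sigma_Y\,\xi_\nu(\tau),\quad\mathrm{var}(Y\,|\,Y\le\tilde{\tau})=\sigma_Y^2\{\eta_\nu(\tau)-\xi_\nu(\tau)^2\},$$
where $\xi_\nu(\tau)$ and $\eta_\nu(\tau)$ are defined at Equation (32). As $\tilde{\tau}\to-\infty$, the asymptotic expected value and variance are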
E Y | Y τ ˜ μ Y ν 1 ν | τ ˜ | ν 1 , v a r Y | Y τ ˜ ν τ 2 ν 1 2 ν 2 .
For finite and fixed degrees of freedom, and ignoring $\mu_Y$ for ease of exposition, the conditional expected value is uplifted through multiplication by $\nu/(\nu-1)$, that is, the effect is most pronounced when the degrees of freedom are small. The asymptotic variance increases with $|\tau|^2$, that is, potentially without limit. The conditional expected value and covariance matrix of $X$ are
$$E(X\,|\,Y\le\tilde{\tau})=\mu_X-(\delta/\sigma_Y)\,\xi_\nu(\tau),$$
and
$$\mathrm{cov}(X\,|\,Y\le\tilde{\tau})=\frac{\nu}{\nu-1}\left\{1+\frac{\eta_\nu(\tau)}{\nu}\right\}\Sigma_{X,C}+\frac{\delta\delta^T}{\sigma_Y^2}\{\eta_\nu(\tau)-\xi_\nu(\tau)^2\},$$
where
$$\Sigma_{X,C}=\Sigma_X-\delta\delta^T/\sigma_Y^2.$$
As $\tilde{\tau}\to-\infty$, the vector of expected values and the covariance matrix become
$$E(X\,|\,Y\le\tilde{\tau})\simeq E(X\,|\,Y=\tilde{\tau})+\frac{\delta(\tilde{\tau}-\mu_Y)}{(\nu-1)\sigma_Y^2},$$
and
$$\mathrm{cov}(X\,|\,Y\le\tilde{\tau})\simeq\frac{\nu}{\nu-1}\left\{\frac{\nu-1}{\nu-2}+\frac{\tau^2}{\nu-2}\right\}\Sigma_{X,C}+\frac{\delta\delta^T}{\sigma_Y^2}\,\frac{\nu\tau^2}{(\nu-1)^2(\nu-2)}.$$
That is, for finite degrees of freedom, both expected values and the covariance matrix increase in magnitude without limit as $\tilde{\tau}\to-\infty$. Similar to Equation (71), the conditional expected value of element $i$ of $X$ will increase without limit if the corresponding value of $\delta_i$ is negative and is unaffected if it equals zero.
Comparing the normal and Student hidden truncation models, the vectors of expected values are mainly determined by $\tau$. Differences will be marked only if the degrees of freedom are small. The covariance matrices differ substantially: in the Student case for fixed $\nu$, the covariance matrix increases without limit as $\tilde{\tau}\to-\infty$. For a given finite value of $\tau\ll 0$, the increase in the elements of the covariance matrix decreases with increasing $\nu$. The conditional covariance between $X$ and $Y$ is
c o v X , Y | Y τ ˜ = δ σ Y η ν τ ξ ν τ 2 .
Standard manipulations using Equation (83) show that the conditional correlation between a typical element $i$ of $X$ and $Y$ is asymptotically equal to
$$\left\{1+\sigma_Y^2\,\sigma_{X,C,i}^2\,(\nu-1)/\delta_i^2\right\}^{-1/2},$$
which tends to zero as $\nu\to\infty$.

4.3. Hidden Truncation with Extended Distributions

Table 10 shows critical values corresponding to a probability of 0.025 for the univariate versions of the distributions at Equations (66) and (75) for a range of values of τ, ρ, and ν. Table entries are computed numerically and displayed to two decimal places. In Panel 4, corresponding to the standard case τ = 0, the first row, ρ = 0, yields the critical values for Student’s t distribution with 5, 10, 20, 50, and 100 degrees of freedom and the standard normal distribution. The other rows in the same panel correspond to ρ = 0.2, 0.4, 0.6, and 0.8. As the panel shows, the critical values range from 1.96 to 3.15. In Panels 1 to 3, for which τ takes negative values, the range is greater and increases with the magnitude of τ. In Panels 5 to 7, with positive values of τ, the critical values closely approximate those of Student’s t and the normal distribution, as expected. In each panel, the rows corresponding to ρ = 0 are the critical values of the nonstandard symmetric Student-like distribution reported in both [27,28]. The distribution of X and Y and the threshold $\tilde{\tau}$ have a non-negligible effect on critical values; that is, for many applications, extended versions of the distributions may be preferred.

4.4. Stock Market Crashes

The basic empirical model for the returns on stocks is a regression in which the single explanatory variable is the contemporaneous return on a suitable market index, such as the UK’s FTSE100 or the USA’s S&P 500. The model is generally referred to as the market model. It is the operational version of the capital asset pricing model, universally referred to as the CAPM, of [35,36,37]. Numerous other regression setups are in widespread use, but all maintain a close connection to the market model. More formally, it is assumed that the n-vector of asset returns R and the contemporaneous return on the market index R m have a multivariate normal distribution
$$\begin{pmatrix}R\\ R_m\end{pmatrix}\sim N_{n+1}\left\{\begin{pmatrix}\mu\\ \mu_m\end{pmatrix},\begin{pmatrix}\Sigma_R&\delta\\ \delta^T&\sigma_m^2\end{pmatrix}\right\},$$
where $\delta=\beta\sigma_m^2$. An element $R_i$ of $R$ may denote the return on an individual stock or a portfolio of stocks. The market model is then the conditional distribution of $R$ given that $R_m=r_m$, that is,
$$R\,|\,R_m=r_m\sim N_n\{\mu+\beta(r_m-\mu_m),\Sigma_{RC}\};\quad\Sigma_{RC}=\Sigma_R-\sigma_m^2\beta\beta^T,$$
or, if the market model is written in familiar regression-style notation,
$$R=\mu+\beta(R_m-\mu_m)+\epsilon.$$
The results with an underlying Student distribution are similar. For $\nu>1$, the conditional mean is the same, but for $\nu>2$, the conditional covariance matrix now depends on $r_m$ as follows
$$\mathrm{cov}(R\,|\,R_m=r_m)=\frac{\nu}{\nu-1}\left\{1+\frac{(r_m-\mu_m)^2}{\nu\sigma_m^2}\right\}\Sigma_{RC}.$$
That is, the conditional variance is inflated by a factor that is proportional to the squared deviation of r m from its expected value.
In this subsection, the effect of a market crash is considered. A detailed coverage of the statistical and empirical properties of crashes is beyond the scope of this paper, but some theoretical insights into crashes may be derived using the skew-normal and skew-Student distributions. Specifically, the standard conditioning event $R_m=r_m$ is changed to $R_m\le\tilde{\tau}$. This characterizes a crash when $\tilde{\tau}$ is both negative and of large magnitude. Comparison of Equation (85) with (65) and (66) shows that the resulting conditional distribution of $R$ is extended skew-normal or extended skew-Student. For underlying normal returns, the conditional mean and variance of market returns are, respectively,
$$E(R_m\,|\,R_m\le\tilde{\tau})=\mu_m-\sigma_m\,\xi_1(\tau);\quad\mathrm{var}(R_m\,|\,R_m\le\tilde{\tau})=\sigma_m^2\{1+\xi_2(\tau)\},$$
where $\tau=(\tilde{\tau}-\mu_m)/\sigma_m$. Similar to the results in Section 4.1, in the limit, as $\tilde{\tau}\to-\infty$, market return becomes nonstochastic with (expected) value equal to $\tilde{\tau}$.
The corresponding results for the conditional mean vector and covariance matrix of asset returns $R$ are
$$E(R\,|\,R_m\le\tilde{\tau})=\mu-\sigma_m\beta\,\xi_1(\tau)\simeq\mu+\beta(\tilde{\tau}-\mu_m),$$
and
$$\mathrm{cov}(R\,|\,R_m\le\tilde{\tau})=\Sigma_{RC}+\sigma_m^2\beta\beta^T\{1+\xi_2(\tau)\}\simeq\Sigma_{RC}.$$
In a crash, the conditional expected return on asset $i$ decreases or increases without limit depending on the sign of $\beta_i$, but there is no effect if $\beta_i=0$. The conditional covariance matrix is asymptotically equal to $\mathrm{cov}(R\,|\,R_m=\tilde{\tau})$, the conventional case defined at Equation (86). With underlying Student returns, for $\nu>2$, the conditional mean and variance of market returns are, respectively,
$$E(R_m\,|\,R_m\le\tilde{\tau})=\mu_m-\sigma_m\,\xi_\nu(\tau);\quad\mathrm{var}(R_m\,|\,R_m\le\tilde{\tau})=\sigma_m^2\{\eta_\nu(\tau)-\xi_\nu(\tau)^2\}.$$
Using the results at Equation (78), it follows that the expected value of market return in a crash is negative and increases pro rata to the standardized crash size. Unlike the results based on an underlying normal distribution, the conditional variance is proportional to the square of the standardized crash size; for given ν , the variance increases without limit. A sketch of the conditional distribution of index returns under normal and Student’s t distributions with five degrees of freedom and corresponding to a five-standard-deviation crash is shown in Figure 12. As the sketch shows, the Student’s t tail is longer and fatter than that of the normal.
The corresponding result for the conditional expected return vector is
$$E(R\,|\,R_m\le\tilde{\tau})=\mu_R-\sigma_m\beta\,\xi_\nu(\tau).$$
As above, the conditional expected return for asset $i$ will increase or decrease without limit depending on the sign of $\beta_i$ but is unchanged if it equals zero. Using Equation (80), the conditional covariance matrix is
$$\mathrm{cov}(R\,|\,R_m\le\tilde{\tau})=\frac{\nu}{\nu-1}\left\{1+\frac{\eta_\nu(\tau)}{\nu}\right\}\Sigma_{RC}+\sigma_m^2\beta\beta^T\{\eta_\nu(\tau)-\xi_\nu(\tau)^2\},$$
which, in keeping with Equation (83), may also increase without limit. Noting that $\eta_\nu(\tau)=E\{(R_m-\mu_m)^2/\sigma_m^2\,|\,R_m\le\tilde{\tau}\}$, the similarities between Equations (94) and (88) are clear.
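The contrast between the two models in a crash can be made concrete with a short calculation of the conditional mean and variance of the market return. The sketch below is illustrative and rests on stated assumptions: $\xi_1(\tau)=\phi(\tau)/\Phi(\tau)$, $\xi_2(\tau)=-\xi_1(\tau)\{\tau+\xi_1(\tau)\}$, and $\xi_\nu(\tau)=\nu(1+\tau^2/\nu)\,t_\nu(\tau)/\{(\nu-1)T_\nu(\tau)\}$, with $\eta_\nu(\tau)$ taken from the identity at Equation (33). The Student tail is integrated by Simpson's rule, and the code is valid only for thresholds below $\mu_m$. The function names are hypothetical.

```python
import math

def norm_xi1(tau):
    """phi(tau) / Phi(tau) for the standard normal distribution."""
    phi = math.exp(-0.5 * tau * tau) / math.sqrt(2 * math.pi)
    return phi / (0.5 * math.erfc(-tau / math.sqrt(2)))

def t_const(nu):
    """Normalizing constant of the Student's t density."""
    return math.exp(math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2)
                    - 0.5 * math.log(nu * math.pi))

def t_lower_tail(tau, nu, n=2000):
    """T_nu(tau) for tau < 0 and nu > 1, by Simpson's rule after substituting u = |tau| / s."""
    a, c = abs(tau), t_const(nu)

    def g(s):
        return c * a * s ** (nu - 1) * (s * s + a * a / nu) ** (-(nu + 1) / 2)

    h = 1.0 / n
    total = g(0.0) + g(1.0)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(i * h)
    return total * h / 3

def student_xi(tau, nu):
    """Assumed form of xi_nu(tau), for tau < 0."""
    pdf = t_const(nu) * (1 + tau * tau / nu) ** (-(nu + 1) / 2)
    return nu / (nu - 1) * (1 + tau * tau / nu) * pdf / t_lower_tail(tau, nu)

def crash_moments_normal(mu_m, sigma_m, tau_tilde):
    """Mean and variance of R_m given R_m <= tau_tilde, normal case."""
    tau = (tau_tilde - mu_m) / sigma_m
    x = norm_xi1(tau)
    return mu_m - sigma_m * x, sigma_m**2 * (1 - x * (tau + x))

def crash_moments_student(mu_m, sigma_m, tau_tilde, nu):
    """Mean and variance of R_m given R_m <= tau_tilde, Student case with nu > 2."""
    tau = (tau_tilde - mu_m) / sigma_m
    x = student_xi(tau, nu)
    eta = nu / (nu - 2) - tau * (nu - 1) * x / (nu - 2)
    return mu_m - sigma_m * x, sigma_m**2 * (eta - x * x)
```

For a five-standard-deviation crash with five degrees of freedom, that is `crash_moments_normal(0.0, 1.0, -5.0)` against `crash_moments_student(0.0, 1.0, -5.0, 5)`, the normal conditional variance is a small fraction of $\sigma_m^2$, whereas the Student conditional variance is several times $\sigma_m^2$.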

5. Extended Skew-Normal versus Skew-Student

The literature concerning the skew-normal and skew-Student distributions is more abundant than that for the corresponding extended versions. It has been conjectured by some researchers in the area, albeit informally, that the skew-Student could be used as an alternative to the extended skew-normal distribution. To some extent, such a suggestion is motivated naturally by the similarities in the shapes of some of the respective density functions. Somewhat more formally, use of the skew-Student could be regarded as being closer in spirit to the original skew-normal literature. For univariate distributions, and from the perspective of empirical work, this is an issue that is more concerned with parameter estimation and tests of fit. That is, for a given data set, does the extended skew-normal or the skew-Student offer better fit? For multivariate distributions, the issue is the same in principle, although the details are more complex. It is of course also the case that the extended skew-normal might be preferred to the skew-Student. For example, for the former, all moments exist, which may be a consideration for some applications. Conditional distributions are in general of the extended type. For multivariate applications in which conditioning is a requirement, methodological issues could imply that extended versions of the distribution are more appropriate. That is, an MESN or even MEST distribution may be preferable to the MST.
To construct an approximation, at least two types of method suggest themselves. Given a specified extended skew-normal distribution, one method would be to minimize a suitable measure of the distance between the two density functions. Several measures of distance could be considered. Denoting the two density functions by $f_{ESN}(x)$ and $f_{ST}(x)$ and assuming that the parameters of the former (latter) are given, the parameters of the latter (former) could be chosen by minimizing
$$\int\{f_{ESN}(x)-f_{ST}(x)\}^2\,dx.$$
Numerous variations on this theme could be constructed, for example, using a different norm or minimizing the divergence between the ESN and ST density functions using the Kullback–Leibler divergence measure [38] or the Hellinger distance ([39]). A second approach could be to seek to match the first four moments of the two distributions. It is clear that a comprehensive study of this conjecture, particularly bearing in mind multivariate distributions, would be a substantial undertaking. This section of the paper describes an initial investigation into the approximation of the univariate extended skew-normal distribution by the skew-Student, which may inform more comprehensive studies to be carried out in the future. The section is in two parts. In the first part, a theoretical investigation based on population moments is reported. In the second part, a study is reported in which simulated data from a number of specified extended skew-normal distributions are used to estimate the parameters of both models.
There are three technical points to note. First, the choice of an approximating skew-Student distribution is informed by the limiting forms of the extended skew-normal. From Lemma 1, as $\tau\to-\infty$, the limiting form of the ESN distribution is $N_n(\mu,\Sigma)$, which is the limiting form of a skew-Student distribution with $\lambda=0$, that is, Student’s t, as $\nu\to\infty$. As also reported in Section 2, a similar result holds as $\tau\to\infty$. The implication is that using ST distributions to approximate the ESN is appropriate for values of $|\tau|$ that are not too large. Second, motivated again by similarities in the shape of the density function, an ESN distribution may be approximated by the SN itself. Third, there are combinations of the parameters $\lambda$ and $\tau$ for which approximation by moment matching is infeasible. To illustrate this, consider an approximation of a univariate ESN with parameters $\mu$, $\sigma^2$, $\lambda$ and $\tau$ by an SN with parameters $\mu_0$, $\sigma_0^2$ and $\lambda_0$. Equating skewness shows that a real value of the ratio $\sigma_0^2/\lambda_0^2$ requires that
$$\xi_3(0)^{2/3}\left\{\sigma^2/\lambda^2+1+\xi_2(\tau)\right\}>\xi_3(\tau)^{2/3}\left\{1+\xi_2(0)\right\},$$
and simple computations show that the inequality does not always hold.
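The feasibility condition can be evaluated directly. The sketch below assumes that $\xi_1$, $\xi_2$ and $\xi_3$ are the first three derivatives of $\log\Phi(\tau)$, so that $\xi_2(\tau)=-\xi_1(\tau)\{\tau+\xi_1(\tau)\}$ and $\xi_3(\tau)=\xi_1(\tau)\{\tau^2-1+3\tau\xi_1(\tau)+2\xi_1(\tau)^2\}$; these definitions belong to an earlier part of the paper and are restated here as an assumption. The function names are hypothetical, and $\lambda$ must be nonzero.

```python
import math

def xi1(tau):
    """phi(tau) / Phi(tau), using erfc so that the lower tail stays accurate."""
    phi = math.exp(-0.5 * tau * tau) / math.sqrt(2 * math.pi)
    return phi / (0.5 * math.erfc(-tau / math.sqrt(2)))

def xi2(tau):
    """Assumed second derivative of log Phi(tau)."""
    x = xi1(tau)
    return -x * (tau + x)

def xi3(tau):
    """Assumed third derivative of log Phi(tau)."""
    x = xi1(tau)
    return x * (tau * tau - 1 + 3 * tau * x + 2 * x * x)

def sn_match_feasible(lam, tau, sigma2=1.0):
    """True if the skewness of ESN(., sigma2, lam, tau) can be matched by an SN."""
    lhs = xi3(0.0) ** (2 / 3) * (sigma2 / lam**2 + 1 + xi2(tau))
    rhs = xi3(tau) ** (2 / 3) * (1 + xi2(0.0))
    return lhs > rhs
```

With $\sigma^2=1$, the call `sn_match_feasible(1.0, 0.0)` satisfies the inequality, whereas `sn_match_feasible(10.0, -3.0)` does not: for a large shape parameter and a negative extension parameter, the skewness of the ESN exceeds the largest skewness attainable by the SN.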

5.1. Moment Matching Study

The study in this paper considers the approximation of an ESN distribution by an ST. As above, for the ESN, $\mu=0$ and $\sigma^2=1$. The extension parameter $\tau$ takes 11 values in the range $[-20,20]$. As skewness is asymmetric in the shape parameter, $\lambda$ takes 9 values in the range $[-10,0]$. For practical reasons, the derived value of $\nu$ is restricted to be an integer. For a given pair $(\lambda,\tau)$, the approximating values of $\nu$ and $\lambda_0$ are derived by minimizing the absolute difference in standardized skewness. This is done by grid search. The other parameters are computed by equating the expected value and variance of the two distributions. For $(\lambda,\tau)$ pairs for which a moment matching approximation exists, the divergence between the ESN and ST density functions is computed using the Kullback–Leibler divergence measure [38]. The values of this divergence measure are ranked from best to worst, with the parameters corresponding to the best ten and worst ten shown in Table 11. The first two columns of each panel show the values of $\lambda$ and $\tau$. The next three columns show the computed values of $\mu$, $\sigma^2$, and $\lambda$ for the approximating ST distribution, with values rounded to four decimal places. Computed values of $\nu$ that were equal to 1000 or greater were replaced by $\infty$, that is, the approximating distribution is effectively skew-normal. Table 12 shows the corresponding values of the moments. As the Best 10 panel shows, the differences in the first four moments are negligible. For the Worst 10 panel, differences in mean, variance, and skewness are also negligible because of the method of construction. Unlike the results in the upper panel, there are differences in kurtosis. Table 13 shows the corresponding critical values, displayed in eight columns. These show critical values at p-values of 0.5%, 2.5%, 95.5%, and 99.5% in ESN/ST pairs. Values are shown rounded to two decimal places and were computed numerically.
As the table shows, for the Best 10 approximations, the differences are negligible. For the Worst 10, the differences are more pronounced. To illustrate the effect of the moment matching procedure, Figure 13 shows ESN and ST density functions for which the ST approximation is the worst according to the Kullback–Leibler divergence measure.
The results in Table 11, Table 12 and Table 13 provide support to the implications of Equation (95), namely that the method of approximations works well for values of | τ | that are not too large. An interesting result is that for numerous parameter combinations, the extended skew-normal distribution may be well approximated by a skew-normal. The usefulness of the results in the Worst 10 panels will depend on the application. In some applications, accurate critical values are not necessary, but in others, they are. There are other methods of measuring the divergence between two density functions. Two well-known ones are Hellinger distance ([39]) and Jensen–Shannon divergence ([40]), both of which constitute topics for future investigation.

5.2. Simulation Study

The simulation study uses the same sets of values of $\mu$, $\sigma^2$, $\lambda$, and $\tau$. For each combination of the parameters, 100 samples of size 100 from an extended skew-normal distribution were drawn. The parameters were estimated by maximum likelihood for the ESN and ST distributions. In addition, motivated by the results in Table 11, the parameters of the skew-normal distribution were also estimated. Summaries of the results are shown in Table 14, Table 15 and Table 16. Table 14 shows the value of the log-likelihood function for each parameter combination computed at its estimated maximum, averaged over the 100 samples and over values of $\tau$. The table has four columns, with the first showing values of $\log L$ based on parameter values inferred from sample moments. As columns 2 through 4 of the table show, the value of $\log L$ varies little with the choice of underlying distribution. For this relatively small sample size, if the value of $\log L$ were the sole criterion for model selection, it would be difficult to discriminate between the three distributions.
For each parameter combination shown in Table 15 and Table 16, the entries are averages of the 100 samples. Table 15 shows the root mean square error in the moments for the three distributions and for 35 selected combinations of $(\lambda,\tau)$. As the table shows, the lowest root mean square error occurs under the ESN for 30 of the $(\lambda,\tau)$ combinations. Root mean square error is computed as the square root of the average squared difference between the population moments and the average of the estimated moments, which are based on the MLE parameter estimates for each distribution. The population moments included in the calculations are mean, variance, skewness, and kurtosis. Table 16 shows the corresponding errors in the critical values. Root mean square error is computed as the square root of the average squared difference between the population critical values and the average of the estimated values based on MLE parameter estimates for each distribution. The critical values are computed at nominal percent probabilities equal to 0.05, 0.5, 2.5, 5.0, 95.0, 97.5, 99.5, and 99.95. The lowest root mean square error occurs under the ESN for 28 of the parameter combinations. In both Table 15 and Table 16, the root mean square error is generally the largest under the ST distribution.

6. Concluding Remarks

This paper presents results that demonstrate the properties of both the multivariate extended skew-normal and extended skew-Student distributions as the value of the extension parameter $\tau$ changes. In general, for given values of location, scale, and shape or skewness, nonzero values of $\tau$ lead to greater variability in both the moments and critical values. In turn, this offers greater flexibility in empirical applications of these distributions. From a theoretical perspective, increasing values of $|\tau|$ lead to more fundamental changes in both distributions. As $\tau$ increases without limit, the asymptotic distributions are multivariate normal and multivariate Student, respectively. The respective vectors of expected values of both distributions are dependent on $\tau$ and are unbounded. The covariance matrices, however, remain finite. Skewness disappears for both distributions. By contrast, as $\tau\to-\infty$, more substantial changes take place in the distributions. Most notable is that for the MESN distribution, dependence on $\tau$ vanishes, but for the MEST in general, it does not. In the case of the MESN, the limiting distribution is multivariate normal. For the MEST distribution with finite degrees of freedom, asymmetry remains. For fixed $\tau$, the extent of asymmetry decreases as the degrees of freedom increase. For fixed degrees of freedom, as $\tau\to-\infty$, the vector of expected values and the covariance matrix are both unbounded.
To illustrate the potential of the MESN and MEST distributions, two applications are described. First, the effect of a stock market crash is studied assuming underlying multivariate normal and multivariate Student distributions. A crash, in which the return on a market index is less than a given negative threshold, results in multivariate extended skew-normal and multivariate extended skew-Student distributions. Under an underlying multivariate normal distribution, as the crash size increases without limit, the return on a stock market index becomes nonstochastic. In short, the market plummets: actual return equals expected return. Under an underlying multivariate Student distribution, expected return is broadly the same, but variability increases without limit. The market decline is noisy. There are analogous results for the returns on individual stocks. In particular, with underlying normality, the conditional covariance matrix remains finite, whereas under an underlying Student distribution, it does not. A detailed investigation of the implications and suitability of these models is beyond the scope of this particular paper, but it is reasonable to posit that the results offer support to the view that an underlying Student distribution is a more realistic model than the normal. Given that stock market collapses have in the past been of relatively short duration, the results also imply that for financial applications the models change. The methods described may be applied in principle to stock market booms. It may also be noted that if an inefficiency variable were to be constructed, SFA could be treated in the same way.
Second, the conjecture that the skew-Student could be used instead of the extended skew-normal is an interesting one. Given the similarity in the shapes of the density functions for many combinations of parameters, this conjecture suggests that there is the possibility of flexible model choice. A general investigation of this conjecture would be a substantial task. The exercise reported in this paper is intended to offer evidence to motivate further research. The short study reported in this paper, part theoretical and part based on simulation, suggests that a given ESN distribution should be treated as such. However, the results also suggest that the ESN could be well-approximated by the skew-normal in some circumstances, but in general not by the skew-Student.

Funding

This research received no funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

Thanks are due to the reviewers of the paper for comments which have led to both improved presentation and content.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. Asymptotic Distribution of a Truncated Student’s t Variable

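Let $X \sim TT_{\nu}(\tau, 1; 0+)$. For $\tau < 0$, the density function is
$$f(x) = \frac{K_{\nu}\left\{1 + (x + |\tau|)^{2}/\nu\right\}^{-(\nu+1)/2}}{T_{\nu}(\tau)}, \quad x \ge 0.$$
The term in braces raised to its power, now denoted $k(x)$, may be expanded by the binomial theorem to give
$$k(x) = \sum_{j=0}^{\infty} \frac{(-2x|\tau|)^{j}\, a_{j}}{\nu^{j}\left(1 + x^{2}/\nu + |\tau|^{2}/\nu\right)^{(\nu+1)/2 + j}}; \quad a_{j} = \frac{\Gamma\{(\nu+1)/2 + j\}}{\Gamma\{(\nu+1)/2\}\, j!},$$
where the $a_{j}$ are the generalized binomial coefficients. Writing $1 + x^{2}/\nu + |\tau|^{2}/\nu$ as
$$1 + x^{2}/\nu + |\tau|^{2}/\nu = \left(1 + |\tau|^{2}/\nu\right)\left\{1 + \frac{x^{2}/\nu}{1 + |\tau|^{2}/\nu}\right\}$$
allows the $j$th term to be written as
$$A_{j} = \frac{(-2x|\tau|)^{j}\, a_{j}}{\nu^{j}\left(1 + |\tau|^{2}/\nu\right)^{(\nu+1)/2 + j}} \left\{1 + \frac{x^{2}/\nu}{1 + |\tau|^{2}/\nu}\right\}^{-\{(\nu+1)/2 + j\}}.$$
Noting that
$$\lim_{\tau \to -\infty} \frac{|\tau|/\sqrt{\nu}}{\sqrt{1 + |\tau|^{2}/\nu}} = 1$$
gives
$$A_{j} \simeq \left(1 + |\tau|^{2}/\nu\right)^{-(\nu+1)/2} a_{j}\, (-2)^{j} \left\{\frac{x^{2}/\nu}{1 + |\tau|^{2}/\nu}\right\}^{j/2} \left\{1 + \frac{x^{2}/\nu}{1 + |\tau|^{2}/\nu}\right\}^{-\{(\nu+1)/2 + j\}}.$$
Some further algebra, namely writing $y = (x/\sqrt{\nu})/\sqrt{1 + |\tau|^{2}/\nu}$ and summing the binomial series $\sum_{j} a_{j}\{-2y/(1+y^{2})\}^{j} = \{1 + 2y/(1+y^{2})\}^{-(\nu+1)/2}$, gives
$$k(x) \simeq \left(1 + |\tau|^{2}/\nu\right)^{-(\nu+1)/2} \left\{1 + \frac{x/\sqrt{\nu}}{\sqrt{1 + |\tau|^{2}/\nu}}\right\}^{-(\nu+1)}.$$
Using the first term in the asymptotic expansion for $T_{\nu}(\tau)$ gives
$$f(x) \simeq |\tau| \left(1 + |\tau|^{2}/\nu\right)^{-1} \left\{1 + \frac{x/\sqrt{\nu}}{\sqrt{1 + |\tau|^{2}/\nu}}\right\}^{-(\nu+1)} \simeq \frac{\nu}{|\tau|} \left(1 + \frac{x}{|\tau|}\right)^{-(\nu+1)}.$$
That is, as $\tau \to -\infty$, the variable $X/|\tau|$ has a beta type-2 distribution with parameters 1 and $\nu$, written $X/|\tau| \sim \beta_{II}(1, \nu)$. Noting the above limit also leads to the alternative but asymptotically equivalent representation
$$\frac{X/\sqrt{\nu}}{\sqrt{1 + |\tau|^{2}/\nu}} \sim \beta_{II}(1, \nu).$$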
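The beta type-2 limit can be compared with the exact truncated Student moments reported in Table 5. The sketch below uses the standard mean α/(β − 1) and variance α(α + β − 1)/{(β − 2)(β − 1)²} of a beta type-2 (beta prime) variable with α = 1 and β = ν, scaled either by |τ| or by √(ν + τ²), the scale implied by the alternative representation. The function name is hypothetical and ν > 2 is assumed so that the variance exists.

```python
import math

def beta2_scaled_moments(scale, nu):
    """Approximate mean and variance of X when X / scale ~ beta type-2 (1, nu)."""
    if nu <= 2:
        raise ValueError("the variance of a beta type-2 (1, nu) variable needs nu > 2")
    mean = scale / (nu - 1.0)
    var = scale ** 2 * nu / ((nu - 1.0) ** 2 * (nu - 2.0))
    return mean, var

tau, nu = -35.0, 5.0
# Scale |tau|, from X / |tau| ~ beta type-2 (1, nu).
print(beta2_scaled_moments(abs(tau), nu))
# Scale sqrt(nu) * sqrt(1 + tau^2 / nu) = sqrt(nu + tau^2), the alternative form.
print(beta2_scaled_moments(math.sqrt(nu + tau * tau), nu))
```

For ν = 5 and |τ| = 35, Table 5 reports an exact mean of 8.7755 and variance of 128.2293; both approximations are within about half of one percent of these values, with the alternative scaling slightly closer.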

References

  1. Azzalini, A. A Class of Distributions which Includes The Normal Ones. Scand. J. Stat. 1985, 12, 171–178.
  2. Azzalini, A.; Capitanio, A. Distributions Generated by Perturbation of Symmetry With Emphasis on a Multivariate Skew t Distribution. J. R. Stat. Soc. Ser. B 2003, 65, 367–389.
  3. Azzalini, A. An overview on the progeny of the skew-normal family—A personal perspective. J. Multivar. Anal. 2021, in press.
  4. Birnbaum, Z.W. Effect of linear truncation on a multinormal population. Ann. Math. Stat. 1950, 21, 272–279.
  5. Adcock, C.J. Capital Asset Pricing for UK Stocks Under the Multivariate Skew-Normal Distribution. In Skew Elliptical Distributions and Their Applications: A Journey beyond Normality; Genton, M., Ed.; Chapman and Hall: London, UK, 2004; pp. 191–204.
  6. Adcock, C.J. Stein’s Lemma For Skew-Normal Distributions: A Comment and an Example. J. Appl. Probab. Stat. 2013, 8, 58–64.
  7. Azzalini, A.; Capitanio, A. Statistical Applications of The Multivariate Skew Normal Distribution. J. R. Stat. Soc. Ser. B 1999, 61, 579–602.
  8. Capitanio, A.; Azzalini, A.; Stanghellini, E. Graphical Models for Skew-Normal Variates. Scand. J. Stat. 2003, 30, 129–144.
  9. Kumbhakar, S.C.; Parmeter, C.F.; Zelenyuk, V. Stochastic Frontier Analysis: Foundations and Advances I. In Handbook of Production Economics; Ray, S.C., Chambers, R., Kumbhakar, S.C., Eds.; Springer: Singapore, 2020.
  10. Aigner, D.J.; Lovell, C.K.; Schmidt, P. Formulation and Estimation of Stochastic Production Function Model. J. Econom. 1977, 12, 21–37.
  11. Adcock, C.J.; Shutes, K. On the Multivariate Extended Skew-Normal, Normal-exponential and Normal-gamma Distributions. J. Stat. Theory Pract. 2012, 6, 636–664.
  12. Azzalini, A.; Dalla-Valle, A. The Multivariate Skew Normal Distribution. Biometrika 1996, 83, 715–726.
  13. Arnold, B.C.; Beaver, R.J. Hidden Truncation Models. Sankhya Ser. A 2000, 62, 22–35.
  14. Adcock, C.J.; Shutes, K. Portfolio Selection Based on The Multivariate Skew-Normal Distribution. In Financial Modelling; Skulimowski, A., Ed.; Progress & Business Publishers: Krakow, Poland, 1999.
  15. Capitanio, A. On The Canonical Form of Scale Mixtures of Skew-Normal Distributions. Statistica 2020, 80, 145–160.
  16. Tallis, G.M. The moment generating function of the truncated multi-normal distribution. J. R. Stat. Soc. Ser. B 1961, 23, 223–229.
  17. Fisher, R.A. The moments of the distribution of normal samples of measures of departure from normality. Proc. R. Soc. Lond. 1930, 130, 16–28.
  18. Barr, D.R.; Sherrill, E.T. Mean and Variance of Truncated Normal Distributions. Am. Stat. 1979, 53, 357–361.
  19. Nadarajah, S.; Kotz, S. Moments of truncated t and F distributions. Port. Econ. J. 2008, 7, 63–73.
  20. Genç, A.d. Moments of truncated normal/independent variables. Stat. Pap. 2013, 54, 741–764.
  21. Horrace, W.C. Moments of the truncated normal distribution. J. Product. Anal. 2015, 43, 133–138.
  22. Ogasawara, H. A non-recursive formula for various moments of the multivariate normal distribution with sectional truncation. J. Multivar. Anal. 2021, 183, 104729.
  23. Loux, T.; Davy, O. Adjusting Published Estimates for Exploratory Biases Using the Truncated Normal Distribution. Am. Stat. 2021, 75, 294–299.
  24. Abramowitz, M.; Stegun, I. Handbook of Mathematical Functions; Dover: Mineola, NY, USA, 1965.
  25. Sampford, M.R. Some Inequalities on Mill’s ratio and Related Functions. Ann. Math. Stat. 1953, 24, 130–132.
  26. Adcock, C.J. Asset Pricing and Portfolio Selection based on The Multivariate Skew-Student Distribution. In Proceedings of the Non-Linear Asset Pricing Workshop, Paris, France, 16–18 April 2002.
  27. Adcock, C.J. Asset Pricing and Portfolio Selection Based on the Multivariate Extended Skew-Student-t Distribution. Ann. Oper. Res. 2010, 176, 221–234.
  28. Arellano-Valle, R.B.; Genton, M.G. Multivariate Extended Skew-t Distributions and Related Families. Metron 2010, 68, 201–234.
  29. Adcock, C.J. Mean-variance-skewness efficient surfaces, Stein’s lemma and the multivariate extended skew-Student Distribution. Eur. J. Oper. Res. 2014, 234, 392–401.
  30. Kim, H.J. Moments of truncated Student-t distribution. J. Korean Stat. Soc. 2008, 37, 81–87.
  31. Soms, A.P. An Asymptotic Expansion for the Tail Area of the t-Distribution. J. Am. Stat. Assoc. 1976, 71, 728–730.
  32. Arnold, B.; Beaver, R.J.; Groeneveld, R.A.; Meeker, W.Q. The non truncated marginal of a truncated bivariate normal distribution. Psychometrika 1993, 58, 471–478.
  33. Arnold, B.; Beaver, R.J. Skewed multivariate models related to hidden truncation and/or selective reporting. Test 2002, 11, 7–54.
  34. Arnold, B.C. Flexible univariate and multivariate models based on hidden truncation. J. Stat. Plan. Inference 2009, 139, 3741–3749.
  35. Sharpe, W.F. Capital Asset Prices: A Theory of Market Equilibrium Under Conditions of Risk. J. Financ. 1964, 19, 425–442.
  36. Lintner, J. The Valuation of Risky Assets and The Selection of Risky Investments in Stock Portfolios and Capital Budgets. Rev. Econ. Stat. 1965, 47, 13–37.
  37. Mossin, J. Equilibrium in a Capital Asset Market. Econometrica 1966, 34, 768–783.
  38. Kullback, S.; Leibler, R.A. On information and sufficiency. Ann. Math. Stat. 1951, 22, 79–86.
  39. Hellinger, E. Neue Begründung der Theorie quadratischer Formen von unendlichvielen Veränderlichen. J. für Die Reine Angew. Math. 1909, 137, 210–271.
  40. Menéndez, M.L.; Pardo, J.A.; Pardo, L.; Pardo, M.C. The Jensen-Shannon Divergence. J. Frankl. Inst. 1997, 334B, 307–318.
Figure 1. Extended skew-normal density functions; τ = 0, ±2.5, ±5.0, ±15.0, and ±30.0.
Figure 2. Truncated normal density function τ = 3, 10, 30.
Figure 3. Truncated normal density function τ = 3 and approximations.
Figure 4. Standardized Extended Skew-Normal Density Functions.
Figure 5. Skew-Normal Standardized Skew-Normal Density Functions.
Figure 6. Extended skew-Student density functions, ν = 3, λ = 0.
Figure 7. Extended skew-Student density functions, ν = 3, λ = 5.
Figure 8. Truncated Student t density functions, τ = 35, ν = 5, 20.
Figure 9. Extended and standardized skew-Student density functions, τ = 35, 10.
Figure 10. Contour plots of the extended skew-normal and skew-Student density functions, ν = 5, 100, τ = 1.
Figure 11. Asymptotic and exact skew-Student density functions, ν = 5, τ = 30.
Figure 12. Comparison of the conditional distribution of standardized market returns.
Figure 13. Example of an extended skew-normal and approximating skew-Student density functions.
Table 1. Moments of the truncated normal distribution.

Panel 1: Min
| | \|τ\| ≤ 1 | \|τ\| ≤ 5 | \|τ\| ≤ 30 |
|---|---|---|---|
| mean | 0.5251 | 0.1865 | 0.0333 |
| variance | 0.1991 | 0.0327 | 0.0011 |
| skewness | 0.1169 | 0.0000 | 0.0000 |
| kurtosis | 0.1981 | 0.0083 | 0.0000 |
| kappa4 | 0.0005 | −0.1879 | −0.1879 |
| standardized-skewness | 0.5918 | 0.0000 | 0.0000 |
| standardized-kurtosis | 3.0014 | 2.7609 | 2.7609 |

Panel 2: SN
| | \|τ\| ≤ 1 | \|τ\| ≤ 5 | \|τ\| ≤ 30 |
|---|---|---|---|
| mean | 0.7979 | 0.7979 | 0.7979 |
| variance | 0.3634 | 0.3634 | 0.3634 |
| skewness | 0.2180 | 0.2180 | 0.2180 |
| kurtosis | 0.5109 | 0.5109 | 0.5109 |
| kappa4 | 0.1148 | 0.1148 | 0.1148 |
| standardized-skewness | 0.9953 | 0.9953 | 0.9953 |
| standardized-kurtosis | 3.8692 | 3.8692 | 3.8692 |

Panel 3: Max
| | \|τ\| ≤ 1 | \|τ\| ≤ 5 | \|τ\| ≤ 30 |
|---|---|---|---|
| mean | 1.2876 | 5.0000 | 30.0000 |
| variance | 0.6297 | 1.0000 | 1.0000 |
| skewness | 0.2957 | 0.2957 | 0.2957 |
| kurtosis | 1.1901 | 2.9998 | 3.0000 |
| kappa4 | 0.1148 | 0.1148 | 0.1148 |
| standardized-skewness | 1.3162 | 1.8311 | 1.9935 |
| standardized-kurtosis | 4.9974 | 7.7592 | 8.9472 |

Kurtosis is the fourth moment about the mean.
Table 2. Moments of the truncated normal distribution and its approximations.

Panel 1: τ = −3
| | Exact | Exp-1 | Exp-3 |
|---|---|---|---|
| mean | 0.2831 | 0.3333 | 0.2694 |
| variance | 0.0706 | 0.1111 | 0.0553 |
| skewness | 0.0315 | 0.0741 | 0.0149 |
| kurtosis | 0.0339 | 0.1111 | 0.0195 |

Panel 2: τ = −10
| | Exact | Exp-1 | Exp-3 |
|---|---|---|---|
| mean | 0.0981 | 0.1000 | 0.0980 |
| variance | 0.0094 | 0.0100 | 0.0094 |
| skewness | 0.0018 | 0.0020 | 0.0018 |
| kurtosis | 0.0008 | 0.0009 | 0.0008 |

Panel 3: τ = −30
| | Exact | Exp-1 | Exp-3 |
|---|---|---|---|
| mean | 0.0333 | 0.0333 | 0.0333 |
| variance | 0.0011 | 0.0011 | 0.0011 |
| skewness | 0.0001 | 0.0001 | 0.0001 |
| kurtosis | 0.0000 | 0.0000 | 0.0000 |

Kurtosis is the fourth moment about the mean. The abbreviations 'Exact', 'Exp-1', and 'Exp-3' are as described in Figure 3.
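Table 3. Moments of the extended skew-normal distributions, τ ≤ 0.

Panel 1: τ = 0
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 0.7979 | 1.3634 | 0.2180 | 5.6912 | 0.1148 | 0.1369 | 3.0617 |
| 2.5 | 1.9947 | 3.2711 | 3.4065 | 36.584 | 4.4832 | 0.5758 | 3.4190 |
| 5 | 3.9894 | 10.0845 | 27.2517 | 376.8234 | 71.7317 | 0.8510 | 3.7053 |

Panel 2: τ = −0.5
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 0.6411 | 1.2685 | 0.1626 | 4.9301 | 0.1030 | 0.1138 | 3.0640 |
| 2.5 | 1.6027 | 2.6780 | 2.5407 | 25.539 | 4.0239 | 0.5797 | 3.5611 |
| 5 | 3.2054 | 7.7120 | 20.3255 | 242.8077 | 64.3824 | 0.9491 | 4.0825 |

Panel 3: τ = −1
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 0.5251 | 1.1991 | 0.1169 | 4.3927 | 0.0792 | 0.0891 | 3.0551 |
| 2.5 | 1.3128 | 2.2444 | 1.8270 | 18.2042 | 3.0928 | 0.5434 | 3.6140 |
| 5 | 2.6257 | 5.9774 | 14.6164 | 156.6738 | 49.4844 | 1.0002 | 4.3850 |

Panel 4: τ = −2.5
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 0.3227 | 1.0890 | 0.0429 | 3.5848 | 0.0272 | 0.0377 | 3.0230 |
| 2.5 | 0.8069 | 1.5561 | 0.6700 | 8.3283 | 1.0641 | 0.3452 | 3.4395 |
| 5 | 1.6137 | 3.2243 | 5.3598 | 48.2146 | 17.0254 | 0.9257 | 4.6376 |

Panel 5: τ = −5
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 0.1865 | 1.0327 | 0.0108 | 3.2045 | 0.0051 | 0.0103 | 3.0048 |
| 2.5 | 0.4663 | 1.2044 | 0.1692 | 4.5501 | 0.1987 | 0.1280 | 3.1370 |
| 5 | 0.9325 | 1.8174 | 1.3532 | 13.0888 | 3.1799 | 0.5523 | 3.9627 |

Panel 6: τ = −10
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 0.0981 | 1.0094 | 0.0018 | 3.0574 | 0.0005 | 0.0018 | 3.0005 |
| 2.5 | 0.2452 | 1.0590 | 0.0279 | 3.3841 | 0.0194 | 0.0256 | 3.0173 |
| 5 | 0.4905 | 1.2361 | 0.2233 | 4.8952 | 0.3112 | 0.1625 | 3.2036 |

Panel 7: τ = −20
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 0.0498 | 1.0025 | 0.0002 | 3.0148 | 0.0000 | 0.0002 | 3.0000 |
| 2.5 | 0.1244 | 1.0154 | 0.0038 | 3.0945 | 0.0014 | 0.0037 | 3.0014 |
| 5 | 0.2488 | 1.0616 | 0.0303 | 3.4032 | 0.0223 | 0.0277 | 3.0198 |

Panel 8: τ = −30
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 0.0333 | 1.0011 | 0.0001 | 3.0066 | 0.0000 | 0.0001 | 3.0000 |
| 2.5 | 0.0831 | 1.0069 | 0.0011 | 3.0418 | 0.0003 | 0.0011 | 3.0003 |
| 5 | 0.1663 | 1.0276 | 0.0091 | 3.1723 | 0.0045 | 0.0088 | 3.0043 |

Mean, variance, skewness, kurtosis (κ̄4), and fourth cumulant (κ4) are denoted mn, vr, sk, ku, and k4, respectively. ssk and sku denote skewness and kurtosis for the standardized distributions.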
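Table 4. Moments of the extended skew-normal distributions, τ ≥ 0.

Panel 1: τ = 0
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 0.7979 | 1.3634 | 0.2180 | 5.6912 | 0.1148 | 0.1369 | 3.0617 |
| 2.5 | 1.9947 | 3.2711 | 3.4065 | 36.584 | 4.4832 | 0.5758 | 3.4190 |
| 5 | 3.9894 | 10.0845 | 27.2517 | 376.8234 | 71.7317 | 0.8510 | 3.7053 |

Panel 2: τ = 0.5
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 1.0092 | 1.4862 | 0.2710 | 6.7143 | 0.0882 | 0.1496 | 3.0399 |
| 2.5 | 2.5229 | 4.0386 | 4.2342 | 52.3748 | 3.4441 | 0.5217 | 3.2112 |
| 5 | 5.0458 | 13.1544 | 33.8738 | 574.2185 | 55.1049 | 0.7100 | 3.3185 |

Panel 3: τ = 1
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 1.2876 | 1.6297 | 0.2957 | 7.9682 | 0.0005 | 0.1421 | 3.0002 |
| 2.5 | 3.2190 | 4.9355 | 4.6206 | 73.1000 | 0.0214 | 0.4214 | 3.0009 |
| 5 | 6.4380 | 16.7422 | 36.9648 | 841.2418 | 0.3423 | 0.5396 | 3.0012 |

Panel 4: τ = 2.5
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 2.5176 | 1.9556 | 0.0949 | 11.3172 | −0.1558 | 0.0347 | 2.9593 |
| 2.5 | 6.2941 | 6.9725 | 1.4835 | 139.7583 | −6.0874 | 0.0806 | 2.8748 |
| 5 | 12.5882 | 24.8899 | 11.8678 | 1761.1161 | −97.3990 | 0.0956 | 2.8428 |

Panel 5: τ = 5
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 5.0000 | 2.0000 | 0.0000 | 11.9997 | −0.0002 | 0.0000 | 3.0000 |
| 2.5 | 12.5000 | 7.2500 | 0.0006 | 157.6791 | −0.0064 | 0.0000 | 2.9999 |
| 5 | 25.0000 | 25.9998 | 0.0045 | 2027.8688 | −0.1022 | 0.0000 | 2.9998 |

Panel 6: τ = 10
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 10.0000 | 2.0000 | 0.0000 | 12.0000 | 0.0000 | 0.0000 | 3.0000 |
| 2.5 | 25.0000 | 7.2500 | 0.0000 | 157.6875 | 0.0000 | 0.0000 | 3.0000 |
| 5 | 50.0000 | 26.0000 | 0.0000 | 2028.0000 | 0.0000 | 0.0000 | 3.0000 |

Panel 7: τ = 20
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 20.0000 | 2.0000 | 0.0000 | 12.0000 | 0.0000 | 0.0000 | 3.0000 |
| 2.5 | 50.0000 | 7.2500 | 0.0000 | 157.6875 | 0.0000 | 0.0000 | 3.0000 |
| 5 | 100.0000 | 26.0000 | 0.0000 | 2028.0000 | 0.0000 | 0.0000 | 3.0000 |

Panel 8: τ = 30
| λ | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 0 | 0.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |
| 1 | 30.0000 | 2.0000 | 0.0000 | 12.0000 | 0.0000 | 0.0000 | 3.0000 |
| 2.5 | 75.0000 | 7.2500 | 0.0000 | 157.6875 | 0.0000 | 0.0000 | 3.0000 |
| 5 | 150.0000 | 26.0000 | 0.0000 | 2028.0000 | 0.0000 | 0.0000 | 3.0000 |

Mean, variance, skewness, kurtosis (κ̄4), and fourth cumulant (κ4) are denoted mn, vr, sk, ku, and k4, respectively. ssk and sku denote skewness and kurtosis for the standardized distributions.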
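Table 5. Moments of the Truncated Student’s t Distribution.

Panel 1: τ = −35
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 8.7755 | 128.2293 | 6742.2338 | 1211054.896 | 1161726.616 | 4.6433 | 73.6528 |
| 10 | 3.9153 | 19.1388 | 235.0122 | 6511.211 | 5412.3346 | 2.8069 | 17.776 |
| 100 | 0.3818 | 0.1485 | 0.1177 | 0.2087 | 0.1426 | 2.0568 | 9.4645 |
| 500 | 0.0986 | 0.0097 | 0.0019 | 0.0009 | 0.0006 | 2.0072 | 9.0581 |
| 1000 | 0.0635 | 0.004 | 0.0005 | 0.0001 | 0.0001 | 2.0012 | 9.0094 |
| Inf | 0.0285 | 0.0008 | 0.0000 | 0.0000 | 0.0000 | 1.9951 | 8.9623 |

Panel 2: τ = −10
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 2.5885 | 11.0436 | 168.6874 | 8789.5951 | 8423.7139 | 4.5964 | 72.0692 |
| 10 | 1.2025 | 1.7818 | 6.5701 | 54.6947 | 45.1698 | 2.7623 | 17.227 |
| 100 | 0.1982 | 0.0394 | 0.0157 | 0.0141 | 0.0094 | 2.0083 | 9.0738 |
| 500 | 0.1180 | 0.0137 | 0.0031 | 0.0016 | 0.0011 | 1.9582 | 8.6753 |
| 1000 | 0.1080 | 0.0115 | 0.0024 | 0.0011 | 0.0007 | 1.9521 | 8.6276 |
| Inf | 0.0981 | 0.0094 | 0.0018 | 0.0008 | 0.0005 | 1.9460 | 8.5804 |

Panel 3: τ = −1
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.8144 | 0.7937 | 2.4071 | 24.5591 | 22.6692 | 3.4042 | 38.9844 |
| 10 | 0.6519 | 0.3797 | 0.4544 | 1.3549 | 0.9224 | 1.9422 | 9.3977 |
| 100 | 0.5365 | 0.2117 | 0.1328 | 0.2349 | 0.1005 | 1.3628 | 5.2408 |
| 500 | 0.5274 | 0.2016 | 0.1199 | 0.2049 | 0.0830 | 1.3253 | 5.0441 |
| 1000 | 0.5263 | 0.2003 | 0.1184 | 0.2015 | 0.0811 | 1.3208 | 5.0206 |
| Inf | 0.5251 | 0.1991 | 0.1169 | 0.1981 | 0.0792 | 1.3162 | 4.9974 |

Panel 4: τ = 0
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.9490 | 0.7660 | 1.7094 | 13.5603 | 11.7998 | 2.5496 | 23.1085 |
| 10 | 0.8647 | 0.5023 | 0.5210 | 1.6356 | 0.8786 | 1.4634 | 6.4821 |
| 100 | 0.8039 | 0.3741 | 0.2357 | 0.5623 | 0.1424 | 1.0303 | 4.0175 |
| 500 | 0.7991 | 0.3655 | 0.2214 | 0.5206 | 0.1199 | 1.0021 | 3.8977 |
| 1000 | 0.7985 | 0.3644 | 0.2197 | 0.5157 | 0.1173 | 0.9987 | 3.8834 |
| Inf | 0.7979 | 0.3634 | 0.2180 | 0.5109 | 0.1148 | 0.9953 | 3.8692 |

Panel 5: τ = 1
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 1.4026 | 0.9677 | 1.5843 | 11.8173 | 9.008 | 1.6643 | 12.6197 |
| 10 | 1.3394 | 0.7530 | 0.6003 | 2.4830 | 0.7821 | 0.9188 | 4.3795 |
| 100 | 1.2924 | 0.6396 | 0.3153 | 1.2603 | 0.0331 | 0.6163 | 3.0809 |
| 500 | 1.2885 | 0.6316 | 0.2995 | 1.2035 | 0.0067 | 0.5966 | 3.0167 |
| 1000 | 1.2881 | 0.6307 | 0.2976 | 1.1968 | 0.0036 | 0.5942 | 3.009 |
| Inf | 1.2876 | 0.6297 | 0.2957 | 1.1901 | 0.0005 | 0.5918 | 3.0014 |

Panel 6: τ = 10
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 10.0011 | 1.6523 | 0.2153 | 20.4795 | 12.2891 | 0.1014 | 7.5012 |
| 10 | 10.0000 | 1.2499 | 0.0011 | 6.2361 | 1.5494 | 0.0008 | 3.9918 |
| 100 | 10.0000 | 1.0204 | 0.0000 | 3.1888 | 0.0651 | 0.0000 | 3.0625 |
| 500 | 10.0000 | 1.0040 | 0.0000 | 3.0363 | 0.0122 | 0.0000 | 3.0121 |
| 1000 | 10.0000 | 1.0020 | 0.0000 | 3.0181 | 0.0060 | 0.0000 | 3.0060 |
| Inf | 10.0000 | 1.0000 | 0.0000 | 3.0000 | 0.0000 | 0.0000 | 3.0000 |

Mean, variance, skewness, kurtosis (κ̄4), and fourth cumulant (κ4) are denoted mn, vr, sk, ku, and k4, respectively. ssk and sku denote skewness and kurtosis for the standardized distributions.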
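Table 6. Extended skew-Student moments, λ = 0, τ ≤ 0.

Panel 1: τ = 0
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.0000 | 1.6667 | 0.0000 | 25.0000 | 16.6667 | 0.0000 | 9.0000 |
| 10 | 0.0000 | 1.2500 | 0.0000 | 6.2500 | 1.5625 | 0.0000 | 4.0000 |
| 100 | 0.0000 | 1.0204 | 0.0000 | 3.1888 | 0.0651 | 0.0000 | 3.0625 |
| 1000 | 0.0000 | 1.0020 | 0.0000 | 3.0181 | 0.0060 | 0.0000 | 3.0060 |

Panel 2: τ = −0.5
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.0000 | 1.8914 | 0.0000 | 33.6814 | 22.9489 | 0.0000 | 9.4148 |
| 10 | 0.0000 | 1.3270 | 0.0000 | 7.0862 | 1.8033 | 0.0000 | 4.0240 |
| 100 | 0.0000 | 1.0263 | 0.0000 | 3.2257 | 0.0660 | 0.0000 | 3.0626 |
| 1000 | 0.0000 | 1.0026 | 0.0000 | 3.0215 | 0.0061 | 0.0000 | 3.0060 |

Panel 3: τ = −1
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.0000 | 2.2715 | 0.0000 | 50.4023 | 34.9234 | 0.0000 | 9.7686 |
| 10 | 0.0000 | 1.4565 | 0.0000 | 8.5803 | 2.2163 | 0.0000 | 4.0448 |
| 100 | 0.0000 | 1.0361 | 0.0000 | 3.2878 | 0.0673 | 0.0000 | 3.0627 |
| 1000 | 0.0000 | 1.0035 | 0.0000 | 3.0273 | 0.0061 | 0.0000 | 3.0060 |

Panel 4: τ = −2.5
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.0000 | 4.5335 | 0.0000 | 184.0710 | 122.4118 | 0.0000 | 8.9559 |
| 10 | 0.0000 | 2.2167 | 0.0000 | 19.9427 | 5.2015 | 0.0000 | 4.0586 |
| 100 | 0.0000 | 1.0930 | 0.0000 | 3.6613 | 0.0771 | 0.0000 | 3.0646 |
| 1000 | 0.0000 | 1.0091 | 0.0000 | 3.0610 | 0.0063 | 0.0000 | 3.0062 |

Panel 5: τ = −5
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.0000 | 12.3704 | 0.0000 | 1630.5664 | 1171.4828 | 0.0000 | 10.6554 |
| 10 | 0.0000 | 4.8325 | 0.0000 | 95.8128 | 25.7533 | 0.0000 | 4.1028 |
| 100 | 0.0000 | 1.2875 | 0.0000 | 5.0778 | 0.1045 | 0.0000 | 3.0631 |
| 1000 | 0.0000 | 1.0280 | 0.0000 | 3.1768 | 0.0064 | 0.0000 | 3.0060 |

Panel 6: τ = −10
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.0000 | 43.6281 | 0.0000 | 20,481.2363 | 14,770.9983 | 0.0000 | 10.7603 |
| 10 | 0.0000 | 15.2531 | 0.0000 | 956.4646 | 258.4921 | 0.0000 | 4.1110 |
| 100 | 0.0000 | 2.0610 | 0.0000 | 13.0118 | 0.2682 | 0.0000 | 3.0631 |
| 1000 | 0.0000 | 1.1033 | 0.0000 | 3.6591 | 0.0073 | 0.0000 | 3.0060 |

Panel 7: τ = −20
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.0000 | 168.6302 | 0.0000 | 306,820.5795 | 221,512.1022 | 0.0000 | 10.7898 |
| 10 | 0.0000 | 56.9209 | 0.0000 | 13,327.5059 | 3607.5334 | 0.0000 | 4.1134 |
| 100 | 0.0000 | 5.1533 | 0.0000 | 81.3467 | 1.6764 | 0.0000 | 3.0631 |
| 1000 | 0.0000 | 1.4042 | 0.0000 | 5.9272 | 0.0119 | 0.0000 | 3.0060 |

Mean, variance, skewness, kurtosis (κ̄4), and fourth cumulant (κ4) are denoted mn, vr, sk, ku, and k4, respectively. ssk and sku denote skewness and kurtosis for the standardized distributions.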
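Table 7. Extended skew-Student moments, λ = 0, τ ≥ 0.

Panel 1: τ = 0
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.0000 | 1.6667 | 0.0000 | 25 | 16.6667 | 0.0000 | 9.0000 |
| 10 | 0.0000 | 1.2500 | 0.0000 | 6.2500 | 1.5625 | 0.0000 | 4.0000 |
| 100 | 0.0000 | 1.0204 | 0.0000 | 3.1888 | 0.0651 | 0.0000 | 3.0625 |
| 1000 | 0.0000 | 1.0020 | 0.0000 | 3.0181 | 0.0060 | 0.0000 | 3.0060 |

Panel 2: τ = 0.5
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.0000 | 1.5613 | 0.0000 | 20.9306 | 13.6175 | 0.0000 | 8.5862 |
| 10 | 0.0000 | 1.2148 | 0.0000 | 5.8673 | 1.4405 | 0.0000 | 3.9762 |
| 100 | 0.0000 | 1.0178 | 0.0000 | 3.1723 | 0.0646 | 0.0000 | 3.0624 |
| 1000 | 0.0000 | 1.0017 | 0.0000 | 3.0165 | 0.006 | 0.0000 | 3.0060 |

Panel 3: τ = 1
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.0000 | 1.5325 | 0.0000 | 19.3630 | 12.3178 | 0.0000 | 8.2452 |
| 10 | 0.0000 | 1.2076 | 0.0000 | 5.7712 | 1.3965 | 0.0000 | 3.9577 |
| 100 | 0.0000 | 1.0174 | 0.0000 | 3.1699 | 0.0645 | 0.0000 | 3.0623 |
| 1000 | 0.0000 | 1.0017 | 0.0000 | 3.0163 | 0.006 | 0.0000 | 3.0060 |

Panel 4: τ = 2.5
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.0000 | 1.5864 | 0.0000 | 19.0707 | 11.5210 | 0.0000 | 7.5781 |
| 10 | 0.0000 | 1.2346 | 0.0000 | 5.9154 | 1.3430 | 0.0000 | 3.8812 |
| 100 | 0.0000 | 1.0199 | 0.0000 | 3.1808 | 0.0603 | 0.0000 | 3.0580 |
| 1000 | 0.0000 | 1.0020 | 0.0000 | 3.0174 | 0.0056 | 0.0000 | 3.0056 |

Panel 5: τ = 5
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.0000 | 1.6447 | 0.0000 | 21.698 | 13.5834 | 0.0000 | 8.0218 |
| 10 | 0.0000 | 1.2490 | 0.0000 | 6.2259 | 1.5456 | 0.0000 | 3.9907 |
| 100 | 0.0000 | 1.0204 | 0.0000 | 3.1888 | 0.0651 | 0.0000 | 3.0625 |
| 1000 | 0.0000 | 1.0020 | 0.0000 | 3.0181 | 0.0060 | 0.0000 | 3.0060 |

Panel 6: τ = 10
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.0000 | 1.6631 | 0.0000 | 23.2514 | 14.9539 | 0.0000 | 8.4066 |
| 10 | 0.0000 | 1.2500 | 0.0000 | 6.2492 | 1.5618 | 0.0000 | 3.9996 |
| 100 | 0.0000 | 1.0204 | 0.0000 | 3.1888 | 0.0651 | 0.0000 | 3.0625 |
| 1000 | 0.0000 | 1.0020 | 0.0000 | 3.0181 | 0.0060 | 0.0000 | 3.0060 |

Panel 7: τ = 20
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 0.0000 | 1.6662 | 0.0000 | 24.114 | 15.7855 | 0.0000 | 8.6861 |
| 10 | 0.0000 | 1.2500 | 0.0000 | 6.2500 | 1.5625 | 0.0000 | 4.0000 |
| 100 | 0.0000 | 1.0204 | 0.0000 | 3.1888 | 0.0651 | 0.0000 | 3.0625 |
| 1000 | 0.0000 | 1.0020 | 0.0000 | 3.0181 | 0.0060 | 0.0000 | 3.0060 |

Mean, variance, skewness, kurtosis (κ̄4), and fourth cumulant (κ4) are denoted mn, vr, sk, ku, and k4, respectively. ssk and sku denote skewness and kurtosis for the standardized distributions.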
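Table 8. Extended skew-Student moments, λ = 5, τ ≤ 0.

Panel 1: τ = 0
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 4.7451 | 20.8175 | 225.8345 | 8893.0351 | 7592.9283 | 2.3776 | 20.5207 |
| 10 | 4.3234 | 13.808 | 67.6648 | 1136.224 | 564.2428 | 1.3188 | 5.9594 |
| 100 | 4.0197 | 10.373 | 29.6176 | 412.2307 | 89.4359 | 0.8865 | 3.8312 |
| 1000 | 3.9924 | 10.1127 | 27.4779 | 380.1627 | 73.3625 | 0.8544 | 3.7174 |

Panel 2: τ = −0.5
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 4.2428 | 20.5686 | 255.6472 | 11,315.5682 | 10,046.3666 | 2.7405 | 26.7465 |
| 10 | 3.6615 | 11.9424 | 61.6937 | 1002.0177 | 574.1549 | 1.4949 | 7.0257 |
| 100 | 3.2465 | 8.0272 | 22.5251 | 273.4801 | 80.1702 | 0.9904 | 4.2442 |
| 1000 | 3.2095 | 7.7427 | 20.5341 | 245.6695 | 65.8204 | 0.9531 | 4.0979 |

Panel 3: τ = −1
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 4.0722 | 22.1142 | 315.8725 | 16,158.4142 | 14,691.3031 | 3.0374 | 33.0413 |
| 10 | 3.2593 | 10.9491 | 58.8277 | 963.236 | 603.5869 | 1.6237 | 8.0348 |
| 100 | 2.6826 | 6.3294 | 16.6811 | 183.5893 | 63.4062 | 1.0476 | 4.5827 |
| 1000 | 2.6313 | 6.0116 | 14.8101 | 159.1484 | 50.7318 | 1.0048 | 4.4038 |

Panel 4: τ = −2.5
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 4.7013 | 37.0039 | 572.3678 | 40,066.9291 | 35,959.0737 | 2.5428 | 29.2612 |
| 10 | 2.9672 | 11.7404 | 64.0527 | 1183.1857 | 769.673 | 1.5923 | 8.5839 |
| 100 | 1.7351 | 3.7196 | 9.2931 | 91.7766 | 50.2701 | 1.2954 | 6.6334 |
| 1000 | 1.6257 | 3.2714 | 7.5897 | 71.2506 | 39.1445 | 1.2827 | 6.6577 |

Panel 5: τ = −5
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 7.1113 | 93.2773 | 3369.5085 | 468,082.58 | 441,980.58 | 3.7403 | 53.7985 |
| 10 | 3.6601 | 20.7462 | 172.2246 | 4829.2945 | 3538.0859 | 1.8226 | 11.2204 |
| 100 | 1.1786 | 2.6214 | 2.9360 | 30.3655 | 9.7500 | 0.6918 | 4.4188 |
| 1000 | 0.9569 | 1.8905 | 1.4728 | 14.3196 | 3.5974 | 0.5666 | 4.0065 |

Panel 6: τ = −10
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 12.9423 | 319.7115 | 21,801.33 | 5,806,861.72 | 5,500,215 | 3.8137 | 56.81 |
| 10 | 6.0126 | 59.7887 | 838.1579 | 41709.0825 | 30985.0048 | 1.8130 | 11.6679 |
| 100 | 0.9912 | 3.0218 | 1.9762 | 34.4648 | 7.0701 | 0.3762 | 3.7743 |
| 1000 | 0.5401 | 1.3638 | 0.3004 | 6.2757 | 0.6959 | 0.1886 | 3.3742 |

Panel 7: τ = −20
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 25.2227 | 1225.9337 | 164,426.37 | 86,687,913 | 82,179,173 | 3.8306 | 57.6799 |
| 10 | 11.3418 | 217.1219 | 5771.12 | 555,743 | 414,318 | 1.8039 | 11.7887 |
| 100 | 1.2565 | 6.7553 | 4.1809 | 157.0877 | 20.1859 | 0.2381 | 3.4423 |
| 1000 | 0.3486 | 1.5238 | 0.0842 | 7.0835 | 0.1174 | 0.0447 | 3.0506 |

Mean, variance, skewness, kurtosis (κ̄4), and fourth cumulant (κ4) are denoted mn, vr, sk, ku, and k4, respectively. ssk and sku denote skewness and kurtosis for the standardized distributions. Some large results in the bottom panel are shown to two decimal places only to preserve the formatting.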
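Table 9. Extended skew-Student moments, λ = 5, τ ≥ 0.

Panel 1: τ = 0
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 4.7451 | 20.8175 | 225.8345 | 8893.0351 | 7592.9283 | 2.3776 | 20.5207 |
| 10 | 4.3234 | 13.8080 | 67.6648 | 1136.224 | 564.2428 | 1.3188 | 5.9594 |
| 100 | 4.0197 | 10.3730 | 29.6176 | 412.2307 | 89.4359 | 0.8865 | 3.8312 |
| 1000 | 3.9924 | 10.1127 | 27.4779 | 380.1627 | 73.3625 | 0.8544 | 3.7174 |

Panel 2: τ = 0.5
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 5.6607 | 22.7023 | 214.9792 | 7908.2282 | 6362.0488 | 1.9874 | 15.3441 |
| 10 | 5.3196 | 16.5847 | 74.6338 | 1370.39 | 545.2293 | 1.1050 | 4.9823 |
| 100 | 5.0707 | 13.4268 | 36.4146 | 615.2068 | 74.3678 | 0.7401 | 3.4125 |
| 1000 | 5.0483 | 13.1811 | 34.1180 | 578.1129 | 56.8906 | 0.7129 | 3.3274 |

Panel 3: τ = 1
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 7.0132 | 25.7246 | 211.2398 | 7728.9782 | 5743.7070 | 1.6190 | 11.6795 |
| 10 | 6.6970 | 20.0320 | 78.5498 | 1695.5477 | 491.7080 | 0.8761 | 4.2253 |
| 100 | 6.4618 | 17.0068 | 39.6479 | 888.1371 | 20.4383 | 0.5653 | 3.0707 |
| 1000 | 6.4404 | 16.7681 | 37.2234 | 845.719 | 2.2097 | 0.5421 | 3.0079 |

Panel 4: τ = 2.5
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 12.9818 | 34.9913 | 193.6584 | 8374.6553 | 4701.4769 | 0.9356 | 6.8398 |
| 10 | 12.7471 | 28.9489 | 65.1275 | 2496.4765 | −17.6434 | 0.4181 | 2.9789 |
| 100 | 12.6007 | 25.2486 | 25.7804 | 1552.3035 | −360.1776 | 0.2032 | 2.4350 |
| 1000 | 12.5894 | 24.9254 | 23.3124 | 1497.7373 | −366.0907 | 0.1873 | 2.4107 |

Panel 5: τ = 5
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 25.066 | 41.1056 | 97.3139 | 11,095.1977 | 6026.176 | 0.3693 | 6.5665 |
| 10 | 25.0077 | 32.2824 | 10.2403 | 3921.4378 | 794.9862 | 0.0558 | 3.7628 |
| 100 | 25.0000 | 26.5298 | 0.3289 | 2150.2441 | 38.7529 | 0.0024 | 3.0551 |
| 1000 | 25.0000 | 26.0519 | 0.0354 | 2039.6168 | 3.5148 | 0.0003 | 3.0052 |

Panel 6: τ = 10
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 50.0054 | 42.971 | 40.1073 | 13,286.3928 | 7746.8656 | 0.1424 | 7.1954 |
| 10 | 50.0000 | 32.4975 | 4.3103 | 4122.5494 | 954.2898 | 0.0233 | 3.9036 |
| 100 | 50.0000 | 26.5306 | 0.3092 | 2150.8291 | 39.2089 | 0.0023 | 3.0557 |
| 1000 | 50.0000 | 26.0521 | 0.0301 | 2039.7723 | 3.6359 | 0.0002 | 3.0054 |

Panel 7: τ = 20
| ν | mn | vr | sk | ku | k4 | ssk | sku |
|---|---|---|---|---|---|---|---|
| 5 | 100.0004 | 43.2846 | 19.9348 | 14,666.6315 | 9045.9524 | 0.0700 | 7.8282 |
| 10 | 100.0000 | 32.5000 | 4.1682 | 4131.0685 | 962.3212 | 0.02250 | 3.9111 |
| 100 | 100.0000 | 26.5306 | 0.3092 | 2150.8291 | 39.2089 | 0.0023 | 3.0557 |
| 1000 | 100.0000 | 26.0521 | 0.0301 | 2039.7723 | 3.6359 | 0.0002 | 3.0054 |

Mean, variance, skewness, kurtosis (κ̄4), and fourth cumulant (κ4) are denoted mn, vr, sk, ku, and k4, respectively. ssk and sku denote skewness and kurtosis for the standardized distributions.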
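Table 10. Extended skew-normal and skew-Student critical values, p = 0.025.

Panel 1: τ = −5
| ρ | ν = 3 | 5 | 10 | 20 | 50 | 100 | 500 | Inf |
|---|---|---|---|---|---|---|---|---|
| 0 | −11.63 | −7.00 | −4.39 | −3.21 | −2.49 | −2.23 | −2.02 | −1.96 |
| 0.2 | −13.56 | −8.35 | −5.50 | −4.25 | −3.50 | −3.24 | −3.02 | −2.96 |
| 0.4 | −15.19 | −9.47 | −6.45 | −5.17 | −4.41 | −4.15 | −3.93 | −3.88 |
| 0.6 | −16.51 | −10.36 | −7.22 | −5.94 | −5.20 | −4.95 | −4.75 | −4.70 |
| 0.8 | −17.44 | −10.94 | −7.74 | −6.49 | −5.81 | −5.59 | −5.41 | −5.37 |

Panel 2: τ = −2.5
| ρ | ν = 3 | 5 | 10 | 20 | 50 | 100 | 500 | Inf |
|---|---|---|---|---|---|---|---|---|
| 0 | −6.58 | −4.24 | −2.97 | −2.44 | −2.15 | −2.06 | −1.98 | −1.96 |
| 0.2 | −7.62 | −4.97 | −3.57 | −3.00 | −2.69 | −2.59 | −2.51 | −2.49 |
| 0.4 | −8.50 | −5.57 | −4.07 | −3.47 | −3.15 | −3.05 | −2.96 | −2.94 |
| 0.6 | −9.20 | −6.04 | −4.46 | −3.84 | −3.51 | −3.41 | −3.33 | −3.31 |
| 0.8 | −9.70 | −6.34 | −4.69 | −4.07 | −3.75 | −3.65 | −3.57 | −3.55 |

Panel 3: τ = −1
| ρ | ν = 3 | 5 | 10 | 20 | 50 | 100 | 500 | Inf |
|---|---|---|---|---|---|---|---|---|
| 0 | −4.12 | −3.00 | −2.41 | −2.17 | −2.04 | −2.00 | −1.97 | −1.96 |
| 0.2 | −4.69 | −3.40 | −2.73 | −2.47 | −2.32 | −2.28 | −2.24 | −2.24 |
| 0.4 | −5.16 | −3.72 | −2.98 | −2.69 | −2.54 | −2.49 | −2.46 | −2.45 |
| 0.6 | −5.53 | −3.96 | −3.16 | −2.85 | −2.69 | −2.64 | −2.6 | −2.59 |
| 0.8 | −5.79 | −4.10 | −3.25 | −2.93 | −2.76 | −2.7 | −2.66 | −2.65 |

Panel 4: τ = 0
| ρ | ν = 3 | 5 | 10 | 20 | 50 | 100 | 500 | Inf |
|---|---|---|---|---|---|---|---|---|
| 0 | −3.18 | −2.57 | −2.23 | −2.09 | −2.01 | −1.99 | −1.97 | −1.96 |
| 0.2 | −3.52 | −2.80 | −2.40 | −2.24 | −2.15 | −2.12 | −2.10 | −2.10 |
| 0.4 | −3.79 | −2.97 | −2.53 | −2.35 | −2.25 | −2.22 | −2.19 | −2.19 |
| 0.6 | −3.99 | −3.09 | −2.60 | −2.40 | −2.30 | −2.26 | −2.24 | −2.23 |
| 0.8 | −4.13 | −3.15 | −2.63 | −2.42 | −2.31 | −2.28 | −2.25 | −2.24 |

Panel 5: τ = 1
| ρ | ν = 3 | 5 | 10 | 20 | 50 | 100 | 500 | Inf |
|---|---|---|---|---|---|---|---|---|
| 0 | −2.93 | −2.47 | −2.19 | −2.07 | −2.00 | −1.98 | −1.97 | −1.96 |
| 0.2 | −3.12 | −2.58 | −2.27 | −2.13 | −2.05 | −2.03 | −2.01 | −2.01 |
| 0.4 | −3.27 | −2.66 | −2.31 | −2.16 | −2.08 | −2.05 | −2.03 | −2.03 |
| 0.6 | −3.38 | −2.71 | −2.33 | −2.17 | −2.09 | −2.06 | −2.04 | −2.03 |
| 0.8 | −3.45 | −2.74 | −2.34 | −2.18 | −2.09 | −2.06 | −2.04 | −2.03 |

Panel 6: τ = 2.5
| ρ | ν = 3 | 5 | 10 | 20 | 50 | 100 | 500 | Inf |
|---|---|---|---|---|---|---|---|---|
| 0 | −3.00 | −2.51 | −2.22 | −2.08 | −2.01 | −1.99 | −1.97 | −1.96 |
| 0.2 | −3.09 | −2.55 | −2.23 | −2.09 | −2.01 | −1.99 | −1.97 | −1.96 |
| 0.4 | −3.16 | −2.58 | −2.24 | −2.09 | −2.01 | −1.99 | −1.97 | −1.96 |
| 0.6 | −3.20 | −2.59 | −2.24 | −2.09 | −2.01 | −1.99 | −1.97 | −1.96 |
| 0.8 | −3.23 | −2.59 | −2.24 | −2.09 | −2.01 | −1.99 | −1.97 | −1.96 |

Panel 7: τ = 5
| ρ | ν = 3 | 5 | 10 | 20 | 50 | 100 | 500 | Inf |
|---|---|---|---|---|---|---|---|---|
| 0 | −3.11 | −2.56 | −2.23 | −2.09 | −2.01 | −1.99 | −1.97 | −1.96 |
| 0.2 | −3.14 | −2.57 | −2.23 | −2.09 | −2.01 | −1.99 | −1.97 | −1.96 |
| 0.4 | −3.16 | −2.57 | −2.23 | −2.09 | −2.01 | −1.99 | −1.97 | −1.96 |
| 0.6 | −3.18 | −2.57 | −2.23 | −2.09 | −2.01 | −1.99 | −1.97 | −1.96 |
| 0.8 | −3.19 | −2.57 | −2.23 | −2.09 | −2.01 | −1.99 | −1.97 | −1.96 |

The table values correspond to a probability of 0.025 for the hidden truncation models at Equation (66) and Equation (75). Table entries are computed numerically and displayed to two decimal places.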
Table 11. Divergence between ESN and ST density functions.

Panel 1: Kullback–Leibler divergence: Best 10
| λ | τ | μ [ST] | σ² [ST] | λ [ST] | ν |
|---|---|---|---|---|---|
| −0.5 | −10.0 | −0.0490 | 1.0024 | 0.0000 | ∞ |
| −0.5 | 0.0 | 0.0000 | 0.9999 | −0.5000 | ∞ |
| −1.0 | 0.0 | 0.0000 | 0.9998 | −1.0000 | ∞ |
| −1.5 | 0.0 | 0.0000 | 0.9998 | −1.5000 | ∞ |
| −1.0 | 10.0 | −10.0000 | 2.0000 | 0.0000 | ∞ |
| −0.5 | 10.0 | −5.0000 | 1.2500 | 0.0000 | ∞ |
| −2.0 | 10.0 | −20.0000 | 5.0000 | 0.0000 | ∞ |
| −3.0 | 10.0 | −30.0000 | 10.0000 | 0.0000 | ∞ |
| −5.0 | 10.0 | −50.0000 | 26.0000 | 0.0000 | ∞ |
| −1.5 | 10.0 | −15.0000 | 3.2500 | 0.0000 | ∞ |

Panel 2: Worst 10
| λ | τ | μ [ST] | σ² [ST] | λ [ST] | ν |
|---|---|---|---|---|---|
| −2.0 | 0.5 | −0.9652 | 2.2017 | −1.3000 | 50 |
| −2.5 | 0.5 | −0.9208 | 2.5358 | −2.0000 | 190 |
| −3.0 | 0.5 | −1.0313 | 3.0917 | −2.5000 | ∞ |
| −2.5 | 1.0 | −2.0121 | 4.0001 | −1.5000 | 90 |
| −2.5 | −0.5 | 0.3925 | 0.4047 | −2.5000 | ∞ |
| −3.0 | 1.0 | −2.2658 | 5.1990 | −2.0000 | ∞ |
| −5.0 | −5.0 | 0.6725 | 0.3260 | −2.0000 | 130 |
| −5.0 | 1.0 | −4.8342 | 15.0568 | −2.0000 | 150 |
| −5.0 | 0.5 | −2.6469 | 9.7981 | −3.0000 | 340 |
| −3.0 | −0.5 | 0.4740 | 0.1265 | −3.0000 | 500 |

The values of the Kullback–Leibler divergence measure are ranked from best to worst, with the parameters corresponding to the best ten and worst ten shown in the two panels. The first two columns of each panel show the values of λ and τ. The next three columns show the computed values of μ, σ², and λ for the approximating ST distribution, rounded to four decimal places. Computed values of ν equal to 1000 or greater were replaced by ∞, that is, the approximating distribution is effectively skew-normal.
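A Kullback–Leibler divergence of the kind ranked in Table 11 can be approximated by numerical integration. The sketch below is an illustration only: it assumes the standardized univariate ESN density implied by the representation X = U + λV (U standard normal, V normal with mean τ truncated to V > 0), uses a plain trapezoidal rule, and compares the ESN with a normal density rather than a fitted skew-Student. All function names are hypothetical.

```python
import math

def norm_pdf(z):
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)

def norm_cdf(z):
    return 0.5 * math.erfc(-z / math.sqrt(2.0))

def esn_pdf(x, lam, tau):
    """Assumed standardized ESN density for X = U + lam * V."""
    s = math.sqrt(1.0 + lam * lam)
    return (norm_pdf((x - lam * tau) / s) / s
            * norm_cdf((tau + lam * x) / s) / norm_cdf(tau))

def normal_pdf(x, mu, var):
    sd = math.sqrt(var)
    return norm_pdf((x - mu) / sd) / sd

def kl_divergence(p, q, lo, hi, n=4000):
    """Trapezoidal approximation to the integral of p * log(p / q) on [lo, hi]."""
    h = (hi - lo) / n
    total = 0.0
    for i in range(n + 1):
        x = lo + i * h
        px, qx = p(x), q(x)
        if px > 0.0 and qx > 0.0:          # skip points where a density underflows
            weight = 0.5 if i in (0, n) else 1.0
            total += weight * px * math.log(px / qx)
    return total * h

# lambda = -1, tau = 10: Table 11 pairs this ESN with mu = -10, sigma^2 = 2, lambda[ST] = 0.
near_normal = kl_divergence(lambda x: esn_pdf(x, -1.0, 10.0),
                            lambda x: normal_pdf(x, -10.0, 2.0), -30.0, 10.0)
# lambda = -2, tau = 0.5: a normal with the ESN mean and variance from Table 12.
skewed = kl_divergence(lambda x: esn_pdf(x, -2.0, 0.5),
                       lambda x: normal_pdf(x, -2.0183, 2.9447), -20.0, 15.0)
print(near_normal, skewed)
```

The first divergence is essentially zero, which is consistent with the τ = 10 cases appearing among the best ten in Table 11; the second is clearly positive because a symmetric density cannot capture the remaining skewness.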
Table 12. ESN and ST Moments.
Table 12. ESN and ST Moments.
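Panel 1: Kullback–Leibler divergence: Best 10

| Mean | Variance | Skewness | Kurtosis | Mean (ST) | Variance (ST) | Skewness (ST) | Kurtosis (ST) |
| --- | --- | --- | --- | --- | --- | --- | --- |
| −0.0490 | 1.0024 | −0.0002 | 3.0142 | −0.0490 | 1.0024 | −0.0002 | 3.0142 |
| −0.3989 | 1.0908 | −0.0273 | 3.5770 | −0.3989 | 1.0908 | −0.0275 | 3.5773 |
| −0.7979 | 1.3634 | −0.2180 | 5.6912 | −0.7979 | 1.3634 | −0.2186 | 5.6917 |
| −1.1968 | 1.8176 | −0.7358 | 10.4921 | −1.1968 | 1.8176 | −0.7371 | 10.4930 |
| −10.0000 | 2.0000 | 0.0000 | 12.0000 | −10.0000 | 2.0000 | 0.0000 | 12.0000 |
| −5.0000 | 1.2500 | 0.0000 | 4.6875 | −5.0000 | 1.2500 | 0.0000 | 4.6875 |
| −20.0000 | 5.0000 | 0.0000 | 75.0000 | −20.0000 | 5.0000 | 0.0000 | 75.0000 |
| −30.0000 | 10.0000 | 0.0000 | 300.0000 | −30.0000 | 10.0000 | 0.0000 | 300.0000 |
| −50.0000 | 26.0000 | 0.0000 | 2028.0000 | −50.0000 | 26.0000 | 0.0000 | 2028.0000 |
| −15.0000 | 3.2500 | 0.0000 | 31.6875 | −15.0000 | 3.2500 | 0.0000 | 31.6875 |

Panel 2: Worst 10

| Mean | Variance | Skewness | Kurtosis | Mean (ST) | Variance (ST) | Skewness (ST) | Kurtosis (ST) |
| --- | --- | --- | --- | --- | --- | --- | --- |
| −2.0183 | 2.9447 | −2.1679 | 27.4245 | −2.0183 | 2.9447 | −2.1678 | 28.132 |
| −2.5229 | 4.0386 | −4.2342 | 52.3748 | −2.5229 | 4.0386 | −4.2200 | 54.5273 |
| −3.0275 | 5.3756 | −7.3167 | 93.8321 | −3.0275 | 5.3756 | −7.3482 | 99.0404 |
| −3.2190 | 4.9355 | −4.6206 | 73.1000 | −3.2190 | 4.9355 | −4.5951 | 79.0081 |
| −1.6027 | 2.6780 | −2.5407 | 25.539 | −1.6027 | 2.6780 | −2.5411 | 24.5375 |
| −3.8628 | 6.6672 | −7.9844 | 133.3981 | −3.8628 | 6.6672 | −8.0281 | 147.2278 |
| −0.9325 | 1.8174 | −1.3532 | 13.0888 | −0.9325 | 1.8174 | −1.3493 | 11.1028 |
| −6.4380 | 16.7422 | −36.9648 | 841.2418 | −6.4380 | 16.7422 | −36.8354 | 940.0303 |
| −5.0458 | 13.1544 | −33.8738 | 574.2185 | −5.0458 | 13.1544 | −33.8859 | 612.2122 |
| −1.9232 | 3.4163 | −4.3903 | 43.3578 | −1.9232 | 3.4163 | −4.3974 | 41.1866 |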
This table shows values of the moments corresponding to the parameters reported in Table 11.
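The ESN columns can be cross-checked in closed form. The sketch below is an illustration under stated assumptions rather than the paper's own derivation: it assumes the hidden-truncation form X = U + λV with μ = 0 and σ = 1, where U is standard normal and V is a N(τ, 1) variable truncated below at zero, and it reads the Skewness and Kurtosis columns as third and fourth central moments, which is what the tabulated values appear to be. The function names are hypothetical, and the equations earlier in the paper remain the authoritative definitions.

```python
import math

def norm_pdf(x):
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def norm_cdf(x):
    # erfc keeps accuracy for large negative x, e.g. tau = -10
    return 0.5 * math.erfc(-x / math.sqrt(2.0))

def esn_central_moments(lam, tau):
    """Mean, variance, third and fourth central moments of X = U + lam * V.

    Assumes U ~ N(0, 1) and V ~ N(tau, 1) truncated to V > 0, independent.
    """
    a = -tau                          # W = V - tau is N(0, 1) truncated to W > a
    h = norm_pdf(tau) / norm_cdf(tau)
    # Raw moments of W from E[W^k] = (k - 1) E[W^(k-2)] + a^(k-1) h
    m1 = h
    m2 = 1.0 + a * h
    m3 = (a * a + 2.0) * h
    m4 = 3.0 + (a ** 3 + 3.0 * a) * h
    # Central moments of V (the shift by tau does not change them)
    var_v = m2 - m1 ** 2
    mu3_v = m3 - 3.0 * m1 * m2 + 2.0 * m1 ** 3
    mu4_v = m4 - 4.0 * m1 * m3 + 6.0 * m1 ** 2 * m2 - 3.0 * m1 ** 4
    # Combine with the independent standard normal U
    mean = lam * (tau + m1)
    var = 1.0 + lam ** 2 * var_v
    mu3 = lam ** 3 * mu3_v
    mu4 = 3.0 + 6.0 * lam ** 2 * var_v + lam ** 4 * mu4_v
    return mean, var, mu3, mu4

moments_worst = esn_central_moments(-2.0, 0.5)   # first row of Panel 2
moments_best = esn_central_moments(-1.0, 10.0)   # a tau = 10 row of Panel 1
```

Under these assumptions, λ = −2 and τ = 0.5 should closely reproduce the ESN entries −2.0183, 2.9447, −2.1679, and 27.4245 in the first row of Panel 2, while λ = −1 and τ = 10 gives moments that are numerically those of N(−10, 2).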
Table 13. ESN vs. ST critical values.
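Panel 1: Kullback–Leibler divergence: Best 10

| 0.5% ESN | 0.5% ST | 2.5% ESN | 2.5% ST | 97.5% ESN | 97.5% ST | 99.5% ESN | 99.5% ST |
| --- | --- | --- | --- | --- | --- | --- | --- |
| −2.65 | −2.65 | −2.05 | −2.05 | 1.90 | 1.90 | 2.50 | 2.50 |
| −3.15 | −3.15 | −2.50 | −2.50 | 1.60 | 1.60 | 2.25 | 2.25 |
| −4.00 | −4.00 | −3.20 | −3.20 | 1.40 | 1.40 | 2.05 | 2.05 |
| −5.10 | −5.10 | −4.05 | −4.05 | 1.25 | 1.25 | 1.90 | 1.90 |
| −13.65 | −13.65 | −12.80 | −12.80 | −7.25 | −7.25 | −6.40 | −6.40 |
| −7.90 | −7.90 | −7.20 | −7.20 | −2.85 | −2.85 | −2.15 | −2.15 |
| −25.80 | −25.80 | −24.40 | −24.40 | −15.65 | −15.65 | −14.25 | −14.25 |
| −38.15 | −38.15 | −36.20 | −36.20 | −23.85 | −23.85 | −21.90 | −21.90 |
| −63.15 | −63.15 | −60.00 | −60.00 | −40.05 | −40.05 | −36.90 | −36.90 |
| −19.65 | −19.65 | −18.55 | −18.55 | −11.50 | −11.50 | −10.40 | −10.40 |

Panel 2: Worst 10

| 0.5% ESN | 0.5% ST | 2.5% ESN | 2.5% ST | 97.5% ESN | 97.5% ST | 99.5% ESN | 99.5% ST |
| --- | --- | --- | --- | --- | --- | --- | --- |
| −7.05 | −6.80 | −5.75 | −5.55 | 0.90 | 1.20 | 1.65 | 2.25 |
| −8.55 | −8.20 | −6.95 | −6.70 | 0.80 | 1.20 | 1.60 | 2.25 |
| −10.05 | −9.65 | −8.20 | −7.90 | 0.70 | 1.20 | 1.50 | 2.40 |
| −9.60 | −9.20 | −8.00 | −7.70 | 0.55 | 1.05 | 1.35 | 2.35 |
| −6.75 | −6.90 | −5.30 | −5.40 | 1.15 | 0.85 | 1.85 | 1.35 |
| −11.35 | −10.80 | −9.45 | −9.10 | 0.40 | 1.05 | 1.25 | 2.55 |
| −5.30 | −5.30 | −3.95 | −4.05 | 1.40 | 1.15 | 2.10 | 1.60 |
| −18.45 | −17.20 | −15.40 | −14.55 | 0.00 | 1.50 | 1.00 | 4.00 |
| −16.30 | −14.90 | −13.30 | −12.40 | 0.35 | 1.85 | 1.25 | 3.90 |
| −7.90 | −8.05 | −6.20 | −6.35 | 1.05 | 0.55 | 1.80 | 0.90 |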
The table shows critical values at probability levels of 0.5%, 2.5%, 97.5%, and 99.5%, respectively, in ESN/ST pairs. Values are shown rounded to two decimal places and were computed numerically using Simpson's rule.
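A critical value of this kind can be obtained by integrating the density with Simpson's rule and inverting the resulting distribution function. The sketch below is one way to do that and is not the code used for the table. It assumes the standardized ESN density implied by the hidden-truncation form X = U + λV (μ = 0, σ = 1, V a N(τ, 1) variable truncated below at zero), inverts the distribution function by bisection, and uses an integration range of 12 scale units either side of λτ as a heuristic. The function names are hypothetical.

```python
import math

def norm_pdf(x):
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def norm_cdf(x):
    return 0.5 * math.erfc(-x / math.sqrt(2.0))

def esn_pdf(x, lam, tau):
    """Density of X = U + lam * V under the assumed hidden-truncation form."""
    s = math.sqrt(1.0 + lam * lam)
    z = x - lam * tau
    return (norm_pdf(z / s) / s
            * norm_cdf(tau * s + lam * z / s) / norm_cdf(tau))

def simpson(f, a, b, n=2000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(a + i * h)
    return total * h / 3.0

def esn_quantile(p, lam, tau, tol=1e-6):
    """Solve P(X <= q) = p for q by bisection on the Simpson-integrated CDF."""
    s = math.sqrt(1.0 + lam * lam)
    left = lam * tau - 12.0 * s       # heuristic lower limit of integration
    lo, hi = left, lam * tau + 12.0 * s
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if simpson(lambda x: esn_pdf(x, lam, tau), left, mid) < p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

q_025 = esn_quantile(0.025, -1.0, 10.0)
```

For λ = −1 and τ = 10 the assumed density is numerically indistinguishable from N(−10, 2), so the 2.5% point should be close to −10 − 1.96√2 ≈ −12.77.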
Table 14. Summary of estimated log-likelihood function.
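| λ | Sample | MLE (ESN) | MLE (ST) | MLE (SN) |
| --- | --- | --- | --- | --- |
| −2.5 | −162.1227 | −143.8078 | −143.7856 | −144.2421 |
| −1 | −166.7300 | −147.7064 | −148.1087 | −148.2171 |
| −0.5 | −173.5951 | −154.3467 | −154.7982 | −155.0793 |
| 0 | −181.6436 | −162.3594 | −162.5675 | −163.1677 |
| 0.5 | −190.3665 | −171.9357 | −172.3604 | −172.6343 |
| 1 | −199.0348 | −180.9025 | −180.4694 | −181.2470 |
| 2.5 | −199.5184 | −181.6458 | −181.8682 | −182.0044 |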
The table shows the value of the log-likelihood function for each parameter combination computed at its estimated maximum, averaged over the 100 samples and over values of τ. The Sample column shows values of logL based on parameter values inferred from sample moments. As the three MLE columns show, the value of logL varies little with the choice of underlying distribution.
Table 15. Root mean square errors in the moments.
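Panel 1: λ = −2.5

| τ | MLE (ESN) | MLE (ST) | MLE (SN) |
| --- | --- | --- | --- |
| −5.0 | 0.0173 | 0.2864 | 0.0733 |
| −2.5 | 0.3679 | 0.2720 | 0.3626 |
| −1.0 | 0.9311 | 0.6039 | 0.9170 |
| 0.0 | 0.6527 | 5.3254 | 0.6702 |
| 1.0 | 3.0086 | 8.1842 | 5.8937 |
| 2.5 | 5.8728 | 8.7935 | 6.9017 |
| 5.0 | 3.8259 | 6.4790 | 5.7127 |

Panel 2: λ = −1

| τ | MLE (ESN) | MLE (ST) | MLE (SN) |
| --- | --- | --- | --- |
| −5.0 | 0.0878 | 0.2736 | 0.1137 |
| −2.5 | 0.0851 | 0.3863 | 0.1398 |
| −1.0 | 0.0630 | 0.2508 | 0.0838 |
| 0.0 | 0.0528 | 0.3052 | 0.0915 |
| 1.0 | 0.2190 | 0.5452 | 0.3016 |
| 2.5 | 0.4006 | 1.0383 | 0.4772 |
| 5.0 | 0.3251 | 0.6928 | 0.3406 |

Panel 3: λ = 0

| τ | MLE (ESN) | MLE (ST) | MLE (SN) |
| --- | --- | --- | --- |
| −5.0 | 0.0751 | 0.3443 | 0.1084 |
| −2.5 | 0.1038 | 0.3784 | 0.1294 |
| −1.0 | 0.0797 | 0.4221 | 0.1050 |
| 0.0 | 0.0979 | 0.4147 | 0.1140 |
| 1.0 | 0.0898 | 0.4213 | 0.1257 |
| 2.5 | 0.1162 | 0.5211 | 0.1340 |
| 5.0 | 0.0752 | 0.2855 | 0.1242 |

Panel 4: λ = 1

| τ | MLE (ESN) | MLE (ST) | MLE (SN) |
| --- | --- | --- | --- |
| −5.0 | 0.0709 | 0.3994 | 0.0769 |
| −2.5 | 0.0651 | 0.3919 | 0.1001 |
| −1.0 | 0.0664 | 0.4552 | 0.1297 |
| 0.0 | 0.1606 | 0.5150 | 0.1971 |
| 1.0 | 0.3868 | 0.8329 | 0.3099 |
| 2.5 | 0.3001 | 1.0357 | 0.3504 |
| 5.0 | 0.2648 | 1.4494 | 0.3207 |

Panel 5: λ = 2.5

| τ | MLE (ESN) | MLE (ST) | MLE (SN) |
| --- | --- | --- | --- |
| −5.0 | 0.1321 | 0.5672 | 0.1594 |
| −2.5 | 0.4829 | 0.9696 | 0.8214 |
| −1.0 | 0.7027 | 2.3434 | 2.1031 |
| 0.0 | 0.7355 | 3.7407 | 2.8966 |
| 1.0 | 3.3669 | 9.8534 | 2.1806 |
| 2.5 | 4.9614 | 14.3890 | 5.0747 |
| 5.0 | 1.9213 | 15.8112 | 2.5600 |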
Root mean square error is computed as the square root of the average squared difference between the population moments and the average of the estimated moments based on MLE parameter estimates for each distribution. The population moments included in the calculations are mean, variance, skewness, and kurtosis.
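The calculation described in this note can be sketched as follows. This is one reading of the definition, not the paper's code: it assumes the estimates are first averaged across samples for each quantity, and the squared differences from the population values are then averaged across the quantities (here mean, variance, skewness, and kurtosis). The function name and the numbers in the usage lines are made up for illustration.

```python
import math

def rmse_of_average(population, estimates):
    """RMSE between population values and the across-sample average estimate.

    population: list of k population values (e.g. four moments).
    estimates:  list of samples, each a list of k estimated values.
    """
    k = len(population)
    n = len(estimates)
    # Average each quantity over the samples
    avg = [sum(sample[j] for sample in estimates) / n for j in range(k)]
    # Mean squared difference over the k quantities, then square root
    mse = sum((avg[j] - population[j]) ** 2 for j in range(k)) / k
    return math.sqrt(mse)

# Hypothetical inputs: two samples, two quantities
example_rmse = rmse_of_average([0.0, 0.0], [[1.0, 1.0], [3.0, 3.0]])
```

Averaging the estimates before differencing means the measure reflects bias in the average estimate rather than sample-to-sample variability.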
Table 16. Root mean square errors in the critical values.
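Panel 1: λ = −2.5

| τ | MLE (ESN) | MLE (ST) | MLE (SN) |
| --- | --- | --- | --- |
| −5.0 | 0.0812 | 0.059 | 0.0932 |
| −2.5 | 0.2294 | 0.2106 | 0.2017 |
| −1.0 | 0.2158 | 0.2109 | 0.2074 |
| 0.0 | 0.0407 | 0.342 | 0.0361 |
| 1.0 | 0.1459 | 0.5914 | 0.2343 |
| 2.5 | 0.2971 | 0.465 | 0.2715 |
| 5.0 | 0.0668 | 0.1735 | 0.3859 |

Panel 2: λ = −1.0

| τ | MLE (ESN) | MLE (ST) | MLE (SN) |
| --- | --- | --- | --- |
| −5.0 | 0.0336 | 0.0765 | 0.1295 |
| −2.5 | 0.0492 | 0.0676 | 0.1692 |
| −1.0 | 0.0610 | 0.0457 | 0.1172 |
| 0.0 | 0.0547 | 0.0892 | 0.0849 |
| 1.0 | 0.0715 | 0.1072 | 0.1381 |
| 2.5 | 0.0436 | 0.1564 | 0.1600 |
| 5.0 | 0.0331 | 0.0808 | 0.1662 |

Panel 3: λ = 0

| τ | MLE (ESN) | MLE (ST) | MLE (SN) |
| --- | --- | --- | --- |
| −5.0 | 0.0131 | 0.0830 | 0.1360 |
| −2.5 | 0.0365 | 0.0956 | 0.1347 |
| −1.0 | 0.0185 | 0.1062 | 0.1427 |
| 0.0 | 0.0260 | 0.0786 | 0.1465 |
| 1.0 | 0.0190 | 0.1070 | 0.1452 |
| 2.5 | 0.0445 | 0.1282 | 0.1184 |
| 5.0 | 0.0486 | 0.0824 | 0.1750 |

Panel 4: λ = 1

| τ | MLE (ESN) | MLE (ST) | MLE (SN) |
| --- | --- | --- | --- |
| −5.0 | 0.0335 | 0.0782 | 0.1309 |
| −2.5 | 0.0243 | 0.0481 | 0.1753 |
| −1.0 | 0.0516 | 0.0807 | 0.2266 |
| 0.0 | 0.0649 | 0.1004 | 0.2440 |
| 1.0 | 0.0506 | 0.0925 | 0.2430 |
| 2.5 | 0.0468 | 0.1542 | 0.2263 |
| 5.0 | 0.0203 | 0.1498 | 0.1910 |

Panel 5: λ = 2.5

| τ | MLE (ESN) | MLE (ST) | MLE (SN) |
| --- | --- | --- | --- |
| −5.0 | 0.0544 | 0.0516 | 0.2556 |
| −2.5 | 0.2504 | 0.2543 | 0.5273 |
| −1.0 | 0.2045 | 0.2328 | 0.7658 |
| 0.0 | 0.0512 | 0.2558 | 0.8942 |
| 1.0 | 0.1020 | 0.5654 | 0.9505 |
| 2.5 | 0.2790 | 0.5385 | 0.6304 |
| 5.0 | 0.1155 | 0.3089 | 0.3715 |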
Root mean square error is computed as the square root of the average squared difference between the population critical values and the average of the estimated values based on MLE parameter estimates for each distribution. The critical values are computed at nominal percent probabilities equal to 0.05, 0.5, 2.5, 5.0, 95.0, 97.5, 99.5, and 99.95.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Adcock, C.J. Properties and Limiting Forms of the Multivariate Extended Skew-Normal and Skew-Student Distributions. Stats 2022, 5, 270-311. https://doi.org/10.3390/stats5010017