Properties and Limiting Forms of the Multivariate Extended Skew-Normal and Skew-Student Distributions

Adcock, Christopher J.

doi:10.3390/stats5010017

Open AccessArticle

Properties and Limiting Forms of the Multivariate Extended Skew-Normal and Skew-Student Distributions

by

Christopher J. Adcock

^1,2

¹

Sheffield University Management School, University of Sheffield, Sheffield S10 1FL, UK

²

UCD Michael Smurfit Graduate Business School, University College Dublin, Carysfort Avenue, Blackrock, D04 V1W8 Dublin, Ireland

Stats 2022, 5(1), 270-311; https://doi.org/10.3390/stats5010017

Submission received: 9 November 2021 / Revised: 22 February 2022 / Accepted: 2 March 2022 / Published: 9 March 2022

(This article belongs to the Special Issue Multivariate Statistics and Applications)

Download

Browse Figures

Versions Notes

Abstract

:

This paper is concerned with the multivariate extended skew-normal [MESN] and multivariate extended skew-Student [MEST] distributions, that is, distributions in which the location parameters of the underlying truncated distributions are not zero. The extra parameter leads to greater variability in the moments and critical values, thus providing greater flexibility for empirical work. It is reported in this paper that various theoretical properties of the extended distributions, notably the limiting forms as the magnitude of the extension parameter, denoted

τ

in this paper, increases without limit. In particular, it is shown that as

τ \to - \infty

, the limiting forms of the MESN and MEST distributions are different. The effect of the difference is exemplified by a study of stockmarket crashes. A second example is a short study of the extent to which the extended skew-normal distribution can be approximated by the skew-Student.

Keywords:

hidden truncation models; market model; multivariate extended skew-normal distribution; multivariate extended skew-Student distribution; stock market crashes

JEL Classification:

C18; G01; G10; G12

1. Introduction

The skew-normal distribution was introduced in [1] and the skew-Student in [2]. These two distributions share the property that they may be derived formally. There are several methods of derivation of which probably the best known is to consider the bivariate normal distribution of X and Y each with zero mean, unit variance, and correlation

ρ

. The skew-normal distribution then arises by then considering the distribution of X conditional on

Y < 0

[or

Y > 0

]. A second method of construction is to consider a random variable

X + λ U

where U has a standard normal distribution truncated from below at zero, written

U \sim T N {(0, 1; 0)}^{+}

, where

T N {(μ, σ^{2}; x)}^{+}

denotes a normally distributed variable with mean

μ

and standard deviation

σ

truncated from below at x and

λ \in R

. There are similar and equally well-known constructions for the skew-Student and for the multivariate versions of these distributions. That the conditioning variable Y is required to be less than (greater than) zero and that U follows a standard normal distribution truncated from below at zero are, however, limitations. This is for four principal reasons. First, using negative (positive) values of Y to determine whether or not X is observed is self-evidently a limitation. Depending on the application, the appropriate threshold or truncation point for Y might take any nonzero value, as might the value of the mean of its underlying normal distribution. For example, in his recent paper [3] refers to early work by [4]. The latter was concerned with the scores from admission examinations: in such a case, the mean of Y would surely be greater than zero, as would the truncation point. Similarly, there is often no reason a priori for the underlying mean of U to be zero.

Second, empirical evidence reported in the financial economics literature suggests that in the absence of truncation from below at zero, the distribution of the unobserved variable U, denoted

N (τ, 1)

in this paper, exhibits nonzero values of

τ

(see, for example, [5] or [6]). In the first method of derivation above, the corresponding conditioning event would that

Y < τ

(

Y > τ

). Such distributions are referred to in the literature (see [7,8]), as extended skew-normal or extended skew-Student. The importance of nonzero values of

τ

also arises in stochastic frontier analysis, commonly referred to as SFA. SFA models are used to measure the efficiency of manufacturing companies and organizations such as banks. There is a detailed review of SFA models and methods in [9]. In its basic form, SFA employs linear regression models in which the unobserved residual has two components, commonly written as

ϵ - ν

. The first term,

ϵ

, is a standard

N (0, σ_{ϵ}^{2})

variate. The second term,

ν

, is a non-negative variate assumed to have an

N (0, σ_{ν}^{2})

distribution, which is truncated from below at zero; that is, a half normal distribution. The expected value of

ν

, which is nonzero, measures inefficiency. With these assumptions, the residual

ϵ - ν

has a skew-normal distribution. A somewhat different model was introduced by [10] in which the half normal variable is replaced one which has an exponential distribution. The paper by [11] shows that under the limit as

τ \to - \infty

, and with suitable choice of other model parameters, the extended skew-normal distribution encompasses both the half-normal and exponential distributions for the inefficiency term

ν

. Use of the extended version of the distribution offers greater flexibility in modeling inefficiency: the distribution of the inefficiency variable may exhibit a nonzero mode or may decay steeply.

Nonzero and negative values

τ

also arise in the study of stock market crashes. The standard model in financial economics is that returns on risky financial assets follow a multivariate normal distribution. Under this assumption, formally, the basic model of portfolio theory is to consider the conditional distribution of asset returns given a specified return on a market index. The resulting conditional distribution is multivariate normal, leading in essence to regression models for the return on individual assets. In the same manner, a market crash may be studied by considering the distribution of asset returns given that the return on the market index is less than a specified negative value. The resulting distribution is multivariate extended skew-normal. For market crashes, the value of the parameter denoted

τ

is both negative and of substantial magnitude. Analogous results arise if it is assumed that returns follow a multivariate Student distribution. In both the normal and Student cases, the distributions that arise as

τ \to - \infty

are of interest, one reason being that the limiting properties are different.

Third, use of extended versions of the skew-normal or skew-Student gives greater variability in the moments and critical values of the distributions. For empirical applications, this offers the possibility of better model fit. For some applications, the implied flexibility in the formal foundations may offer insights into the underlying data generation process. Last, in the multivariate case, conditional distributions are always in general of the extended type. Thus, for applications where conditional distributions play a role, extended versions are important if not unavoidable. The formal derivation of a skew-normal regression model as in [5] offers an example of this.

Extended versions of the skew-normal and skew-Student distributions have explicit advantages for some purposes. They offer the potential for greater flexibility in empirical work and, in addition, methodological advantages in some cases. The main aim of this paper is to present properties of the multivariate extended skew-normal (MESN) and multivariate extended skew-Student (MEST) distributions. The results demonstrate the differences from the standard versions. The paper also studies limiting cases of the distributions as the magnitude of the extension parameter

τ

increases without limit, extending a result reported in [11]. As the paper shows, these limiting cases are of interest from a theoretical point of view and offer insights for some applications.

The methodological results are illustrated by two applications. First, there is a study of the effect of a stock market crash. The results are different depending on whether the underlying distributions are multivariate normal or Student. The study presented here is theoretical, but its results can inform the development of econometric models of stock returns. Second, some researchers in this area of statistics have suggested informally that the skew-Student could be used as an alternative to the extended skew-normal. For a specified univariate application, it would be straightforward to estimate the parameters of both distributions and then make an informed choice using a test of fit or, for example, consideration of the tails of the distribution. Such an alternative may be attractive, but the suggestion could equally well be made in reverse: the extended skew-normal could be an alternative to the skew-Student. A general investigation of the similarity of the two distributions, particularly for multivariate cases, would be a major task and beyond the scope of this paper. To inform further research into this issue, this paper contains a short study designed to investigate this conjecture.

The structure of this paper is as follows. In Section 2 and Section 3, results for the MESN and MEST distributions, respectively, are presented. The results in these two sections are based on the extended versions of the second method of construction referred to above. Section 4 is concerned with the first method of construction, sometimes referred to in the literature as a hidden truncation model. This section contains the illustrative example of the effect of a stock market crash. The example shows that different behavior arises depending on the choice of model. Section 5 describes a brief investigation into the use of the skew-Student as an alternative to the extended skew-normal. Section 6 offers some concluding remarks. The abbreviations (E)SN and (E)ST are used for the univariate (extended) skew-normal and (extended) skew-Student distributions, respectively, with MSN and MST for the multivariate versions. Examples and graphs are based on univariate distributions, with most numerical results rounded to four decimal places. Notation not defined explicitly in the text is that in common use.

2. Multivariate Extended Skew-Normal Distribution

The multivariate skew-normal distribution was introduced by [12]. The multivariate extended skew-normal distribution, MESN, with an additional parameter, was first described in [13], independently by [8,14]. Following the notation in the third of these papers, the distribution of an n-vector

X

that follows this distribution is denoted

M E S N_{n} (μ, \sum, λ, τ)

. The authors of reference [13] derive the MESN distribution as a hidden truncation model. The authors of reference [8] present a direct derivation and link it to results in [7], who show that conditional distributions are in general of the extended type. The authors of reference [14] derive it as the convolution

X = U + λ V

, where the random vector

U

has the multivariate normal distribution

N_{n} (μ, \sum)

and the scalar random variable V is independently normally distributed as

N (τ, 1)

truncated from below at 0, denoted

V \sim T N {(τ, 1; 0)}^{+}

. The basic properties of the MESN distribution are described in this section using the notation in [14]. The probability density function of the distribution of

X

is

f_{X} (x) = ϕ_{n} (x, μ + λ τ, \sum + λ λ^{T}) \frac{Φ \{τ ω + λ^{T} \sum^{- 1} (x - μ - λ τ) / ω\}}{Φ (τ)},

(1)

where

ω = \sqrt{1 + λ^{T} \sum^{- 1} λ} .

(2)

ϕ_{n} (x, μ, \sum)

is the probability density function of an n-vector

X

, which has a multivariate normal distribution with mean vector

μ

and covariance matrix

\sum

evaluated at

x

.

Φ (z)

is the standard normal distribution function evaluated at z, with

ϕ (z)

denoting the corresponding density function. The distribution is denoted

X \sim M E S N_{n} (μ, \sum, λ, τ)

. The moment generating function of

X

is

M_{X} (t) = e x p \{{(μ + λ)}^{T} t + t^{T} (\sum + λ λ^{T}) t / 2\} Φ (τ + λ^{T} t) / Φ (τ) .

(3)

The mean vector and covariance matrix of the

M E S N

distribution are, respectively

E (X) = μ + λ \{τ + ξ_{1} (τ)\} = α, c o v (X) = \sum + λ λ^{T} \{1 + ξ_{2} (τ)\} = Ω,

(4)

where the function

ξ_{k} (z)

is defined as

ξ_{k} (z) = \partial^{k} l o g Φ (z) / \partial z^{k} .

(5)

Note that the covariance matrix may also be written

c o v (X) = \sum + λ λ^{T} \{1 - τ ξ_{1} (τ) - ξ_{1} {(τ)}^{2}\},

(6)

a form that is referred to in Section 3.3. Coskewness and cokurtosis, defined here as the 4th cumulant, are given by

λ_{i} λ_{j} λ_{k} ξ_{3} (τ); λ_{i} λ_{j} λ_{k} λ_{l} ξ_{4} (τ),

(7)

respectively.

For the skew-normal distribution itself, the mean of the underlying truncated normal variable denoted V equals

\sqrt{2 / π}

. Rounded to four decimal places, this value is shown in panel 2, column 1 of Table 1 in the row named “mean”. When

| τ | \leq 1

the minimum and maximum values of the mean are 0.5251 and 1.2876, respectively, as shown in panels 1 and 3. The corresponding results for the higher moments are shown in in the other rows of column 1 of the table. Columns 2 and 3 shown the analogous results when

| τ | \leq 5

and ≤30, respectively. Thus, as well as arising automatically under conditioning, the extended version of the skew-normal provides for more flexibility in the moments of the distribution.

In their Lemma 2, reference [11] report an apparently known result concerning the limiting distribution of

X

as

τ \to - \infty

. The lemma is reported here for convenience.

Lemma 1

([11]). Let

X

be distributed as

M E S N_{n} (μ, \sum, λ, τ)

. As

τ \to - \infty

, the distribution of

X

tends to the multivariate normal distribution

N_{n} (μ, \sum)

.

An implication of this result, described in more detail below, is that as

τ < 0

increases in magnitude, V has an exponential distribution with parameter

1 / |τ|

, that is, with mean and standard deviation both equal to

1 / |τ|

. As

τ \to \infty

, the distribution of

X

tends to a multivariate normal with an unbounded mean vector

μ + λ τ

but a finite covariance matrix

\sum + λ λ^{T}

.

The remainder of this section of the paper presents a number of properties of the MESN distribution. Figure 1 shows two sets of examples of the density function of the (univariate) extended skew-normal distribution for

τ = \pm 30, \pm 15, \pm 5, \pm 2.5

, and 0. In the left-hand set, the nonzero values of the extension parameter

τ

are negative. In the right-hand set, the signs of the

τ

are reversed. In both sets,

μ = 0, σ = 2

, and

λ = 5

. Both sets demonstrate that asymmetry disappears progressively as

| τ |

increases and exhibit the properties reported in Lemma 1 and the text that follows it.

Papers by [7,15] show that a suitable linear transformation reduces the MSN distribution to a canonical form. Corresponding representations may be derived for the extended version of the distribution and, as shown below, for the extended skew-Student. These representations depend on the following standard result.

Lemma 2.

Let

I_{n}

denote an

n \times n

unit matrix,

ψ

an n-vector and

0_{n}

an n-vector of zeros. The eigenvalues of the matrix

I_{n} + ψ ψ^{T}

are (i)

1 + ψ^{T} ψ

and (ii) 1 repeated

n - 1

times. The corresponding eigenvectors are (i)

ψ / \sqrt{ψ^{T} ψ}

and (ii) an

n \times (n - 1)

orthogonal matrix

T_{0}

which satisfies

ψ^{T} T_{0} = 0_{n - 1}^{T}

.

This is used to establish the following:

Proposition 1.

Let

X \sim M E S N_{n} (μ, \sum, λ, τ)

and let

Y = \sum^{- 1 / 2} (X - μ)

, where

\sum^{1 / 2}

is a left square-root matrix of Σ. Then,

Y \sim M E S N_{n} (0_{n}, I_{n}, ψ, τ)

,

ψ = \sum^{- 1 / 2} λ

, and there exists an orthogonal transformation of

Y

T^{T} Y = {(Z_{0}, Z^{T})}^{T},

such that

Z_{0}

and

Z

are independently distributed,

Z_{0} \sim E S N (0, 1, \sqrt{ψ^{T} ψ}, τ)

and

Z \sim N_{n - 1} (0_{n - 1}, I_{n - 1})

.

Note that from Lemma 1, as

τ \to - \infty

, the limiting distribution of

Z_{0}

is the standard normal.

2.1. The Truncated Normal Distribution and Its Approximations

The probability density function of the distribution of the truncated normal variable

V \sim T N {(τ, 1; 0)}^{+}

is

f_{V} (v) = \frac{ϕ_{1} (v, τ, 1)}{Φ (τ)} .

(8)

The moment-generating function (MGF), originally reported in [16], is

M_{V} (t) = e^{τ t + t^{2} / 2} \frac{Φ (τ + t)}{Φ (τ)},

(9)

with the MGF valid for all

t \in R

. Following on from [17], numerous authors present results for the moments of the truncated normal distribution and generalizations thereof. These include [18,19,20,21,22] and, recently, [23], among others. For values of

τ

that are less than zero, the asymptotic expansion of

Φ (τ)

from page 932 of [24] is

ϕ (τ) \{\frac{1}{| τ |} + \sum_{j = 1}^{m} \frac{{(- 1)}^{j} Γ (2 j)}{Γ (j) 2^{j - 1} {| τ |}^{2 j + 1}} + R_{m} (| τ |)\} ≃ ϕ (τ) \{D_{m} (| τ |)\} .

(10)

Noting that with suitable choices of m and values of

τ

, the remainder term

R_{m} (.)

R_{m} (| τ |) = \frac{{(- 1)}^{m + 1} Γ (2 m) (2 m + 1)}{Γ (m) 2^{m - 1}} \int_{- \infty}^{τ} \frac{ϕ (x)}{x^{2 m + 1}} d x,

(11)

may be ignored. In this case, the moment-generating function of V is

M_{V} (t) ≃ D_{m} {(| τ |)}^{- 1} \{\frac{1}{| τ | (1 - t / | τ |)} + \sum_{j = 1}^{m} \frac{{(- 1)}^{j} Γ (2 j)}{Γ (j) 2^{j - 1} {| τ |}^{2 j + 1} {(1 - t / | τ |)}^{2 j + 1}}\} .

(12)

This leads to a distribution for which the corresponding density function is a weighted average of gamma densities

f_{v} (v) = D_{m} {(| τ |)}^{- 1} \{\frac{g (v, 1 / | τ |, 1)}{| τ |} + \sum_{j = 1}^{m} \frac{{(- 1)}^{j} Γ (2 j) g (v, 1 / | τ |, 2 j + 1)}{Γ (j) 2^{j - 1} {| τ |}^{2 j + 1}}\},

(13)

where

g (.)

denotes the density function of the gamma distribution

g (x, α, ν) = \frac{x^{ν - 1} e^{- x / α}}{α^{ν} Γ (ν)}; α, ν > 0 .

(14)

For sufficiently large values of

| τ |

, terms after the first may be ignored, giving an exponential distribution with density function

f_{V} (v) = | τ | e^{- | τ | v} .

(15)

When used to to form the convolution

X = U + λ V

, the distribution at (15) leads to the skew-normal exponential distribution described in [11] but originally due to [10]. Figure 2 shows sketches of the truncated normal density function for

τ = - 3, - 10

and

- 30

. The steepness of decay increases with

| τ |

. Figure 3 shows the truncated normal density function for

τ = - 3

, together with the corresponding exponential density function and and approximation based on the density at Equation (13) with

m = 2

. As Figure 3 indicates, the three density functions are visually similar. In particular, there is little difference between the truncated normal density and the three term mixture based on Equation (13).

2.2. Moments of the Truncated Normal Distribution

Expressions for moments of the truncated normal distribution are reported in [21], as well as in references cited above in Section 2.1. In the notation of the present paper, from Equation (9), the mean and variance of the truncated normal distribution are, respectively,

E (V) = τ + ξ_{1} (τ), v a r (V) = 1 + ξ_{2} (τ) = 1 - τ ξ_{1} (τ) - ξ_{1} {(τ)}^{2} = β_{1} .

(16)

Skewness and kurtosis, defined here as the fourth cumulant, are respectively

κ_{3} = ξ_{3} (τ) = ξ_{1} (τ) \{τ^{2} - 1 + 3 τ ξ_{1} (τ) + 2 ξ_{1} {(τ)}^{2}\} .

(17)

and

κ_{4} = ξ_{4} (τ) = ξ_{1} (τ) (3 τ - τ^{3}) + ξ_{1} {(τ)}^{2} (4 - 7 τ^{2}) - 12 τ ξ_{1} {(τ)}^{3} - 6 ξ_{1} {(τ)}^{4} .

(18)

Kurtosis, the fourth moment about the mean and denoted by

{\bar{κ}}_{4}

, is

{\bar{κ}}_{4} = ξ_{4} (τ) + 3 {\{1 + ξ_{2} (τ)\}}^{2} .

(19)

Expressed in terms of

ξ_{1} (τ)

, this is

{\bar{κ}}_{4} = 3 - ξ_{1} (τ) (3 τ + τ^{3}) - ξ_{1} {(τ)}^{2} (2 + 4 τ^{2}) - 6 τ ξ_{1} {(τ)}^{3} - 3 ξ_{1} {(τ)}^{4} .

(20)

Note that, from [25],

κ_{3} \geq 0

for all

τ \in R

. Using the first term of the asymptotic expansion for

Φ (τ)

for

τ ≪ 0

, under which V has the exponential distribution at Equation (15), leads to the following expressions for the first four derivatives of

l o g Φ (τ)

.

ξ_{1} (τ) ≃ |τ| + 1 / |τ|, ξ_{2} (τ) ≃ 1 / τ^{2} - 1, ξ_{3} (τ) ≃ 2 / |τ^{3}|, ξ_{4} (τ) ≃ 6 / |τ^{4}|,

(21)

where in this paper the notation ≃ is taken to mean that the ratio of the two functions tends to unity as, in this case,

τ \to - \infty

. These results give the same expressions for the first four moments as those computed from the exponential distribution at Equation (15). Table 2 shows the computed values of the first four moments of the truncated normal distribution, the limiting exponential distribution at Equation (15), and the mixture distribution based on Equation (13) with

m = 2

. Values are shown for

τ = - 3, - 10

, and

- 30

. In the table, kurtosis is the fourth moment about the mean, that is,

{\bar{κ}}_{4}

. As the table shows, the differences between the exact and approximate results are small and decline as

| τ |

increases. Whether a given approximation may be used as a practical alternative to the truncated normal will depend on the magnitude of

τ

and the application in question.

2.3. Standardized Form of the Extended Skew-Normal Distribution

Additional insights into the MESN distribution may be obtained by standardization. If

Ω^{1 / 2}

denotes a left square root matrix of

Ω

, the random n-vector

Z

now defined as

Z = Ω^{- 1 / 2} (X - α) .

(22)

satisfies

E (Z) = 0_{n}

and

c o v (Z) = I_{n}

. The distribution of

Z

has the density function

f_{Z} (z) = {| Ω |}^{1 / 2} ϕ_{n} (Ω^{1 / 2} z, - λ ξ_{1} (τ), \sum + λ λ^{T}) \frac{Φ \{Δ_{S S N}\}}{Φ (τ)},

(23)

where

ϕ_{n} (.)

and

ω

are as defined for Equation (1) and

Δ_{S S N} = τ ω + λ^{T} \sum^{- 1} (Ω^{1 / 2} z + λ ξ_{1} (τ)) / ω .

(24)

For the standardized form of extended skew-normal distribution, coskewness and cokurtosis (also defined here in terms of the fourth cumulant) are given by

{\tilde{λ}}_{i} {\tilde{λ}}_{j} {\tilde{λ}}_{k} ξ_{3} (τ); {\tilde{λ}}_{i} {\tilde{λ}}_{j} {\tilde{λ}}_{k} {\tilde{λ}}_{l} ξ_{4} (τ),

(25)

respectively, where

{\tilde{λ}}_{i}

is the standardized value of the skewness or shape parameter defined as

{\tilde{λ}}_{i} = \frac{λ_{i}}{{[σ_{i}^{2} + λ_{i}^{2} \{1 + ξ_{2} (τ)\}]}^{1 / 2}} .

(26)

Both coskewness and cokurtosis tend to zero as

τ \to - \infty

, in which case the limiting distribution of

Z

is the standard multivariate normal. A suitable transformation similar to that in Proposition 1 shows that the standardized MESN distribution may be expressed in canonical form similar once again to that described in [7] and [15].

Proposition 2.

Let

X \sim M E S N_{n} (μ, \sum, λ, τ)

and let

Y = Ω^{- 1 / 2} (X - α)

, where

Ω^{1 / 2}

is a left square-root matrix of Ω and let

Y = {(Y_{0}, {\tilde{Y}}^{T})}^{T} .

Then

\tilde{Y} \sim N_{n - 1} (0_{n - 1}, I_{n - 1})

and

Y_{0} \sim E S N (μ_{c}, σ_{c}^{2}, ψ_{c}, τ)

are independently distributed with

μ_{c} = - ψ_{c} \{τ + ξ_{1} (τ)\}; σ_{c}^{2} = \frac{1}{(1 + ψ^{T} ψ β_{1})}; ψ_{c} = \frac{\sqrt{ψ^{T} ψ}}{\sqrt{(1 + ψ^{T} ψ β_{1})}},

and

β_{1}

as defined at Equation (16) and

ψ

in Proposition 1. Note that as in Proposition 1 as

τ \to - \infty

the limiting distribution of

Z_{0}

is the standard normal.

Figure 4 shows two sets of standardized extended skew-normal density functions. In both sets

μ = 0

,

σ = 1

and

λ = 5

. In the left-hand set, values of the extension parameter

τ

are set to −30, −15, −5, −2.5 and 0. In the right-hand set, the signs of the

τ

are reversed. Both sets of densities illustrate that for

τ \neq 0

little asymmetry is apparent even when the shape parameter

λ

is substantial; in this case five times greater than the scale parameter

σ

. Of the values of

τ

shown in the figure, only

τ = 0

leads to a density function with a discernible amount of asymmetry. Figure 5 shows two more sets of the skew-normal density functions. The panel on the left shows extended skew-normal density functions with

μ = 0, σ = 1

and

λ = 5

. The values of

τ

are −5, −2.5, −1, 0, 1, 2.5 and 5. The panel on the right-hand side consists of the corresponding densities standardized to have mean equal to zero and variance equal to one. The X-scales are the same in each panel. As Figure 5 shows, the skewness apparent for the extended skew-normal distributions reduces and largely disappears under standardization. There are analogous results for negative values of

λ

.

Table 3 shows a selection of moments for the extended skew-normal distribution and the corresponding standardized form for values of

τ

that are less than or equal to zero. Values of

τ

and

λ

are as shown in the table. Values of the location and scale parameter are

μ = 0

and

σ = 1

and are used in all numerical results. As the table shows, when

τ \leq - 10

the values of standardized skewness and kurtosis are numerically close to 0 and 3 respectively, thus supporting the result of Lemma 1. For

λ = 0

and 1 there is evidence to support normality for

τ < - 1

. Asymmetry is most evident when

τ

is zero or close to it. Table 4 shows the corresponding selection for positive values of

τ

. The panel corresponding to

τ = 0

is repeated for ease of reading. The table indicates normality for

τ \geq 5

. Asymmetry is evident when

τ \leq 1

. The panel with

τ = 2.5

has values of

κ_{4}

that are negative.

3. Multivariate Extended Skew-Student Distribution

The multivariate extended skew-Student distribution, MEST], is an extension of the multivariate skew-Student distribution originally introduced by [2]. The extended version is reported in [26] and later in both [27,28]. Following [14], the former derives it as the convolution

X = U + λ V

, where the random vector

(U^{T}, V)

of length

n + 1

has a multivariate Student distribution with location parameter vector

(μ^{T}, τ)

and scale matrix

\begin{matrix} [\begin{matrix} \sum & 0_{n} \\ 0_{n}^{T} & 1 \end{matrix}], \end{matrix}

(27)

with V truncated from below at zero. Consistent with the notation in Section 1, this is denoted

V \sim T T_{ν} {(τ, 1; 0)}^{+}

, where

T T_{ν} {(μ, σ^{2}; x)}^{+}

denotes a Student’s t variable with location parameter

μ

and scale

σ

truncated from below at x. The marginal distribution of

U

has the symmetric density function reported Section 3.2 of [27] and independently in [28]. The probability density function of the distribution of

X

is

f_{X} (x) = t_{ν, n} (x, μ + λ τ, \sum + λ λ^{T}) \frac{T_{ν + n} (Δ_{S T})}{T_{ν} (τ)},

(28)

where

Δ_{S T} = \frac{\sqrt{ν + n} \{τ ω + λ^{T} \sum^{- 1} (x - μ - λ τ) / ω\}}{\sqrt{ν + {(x - μ - λ τ)}^{T} {(\sum + λ λ^{T})}^{- 1} (x - μ - λ τ)}},

(29)

and where

ω

is as defined at Equation (2).

t_{ν, n} (x, μ, \sum)

is the probability density function of an n-vector

X

which has a multivariate Student distribution with location parameter vector

μ

and scale matrix

\sum

evaluated at

x

.

T_{ν} (z)

is the distribution function of a Student’s t variable with

ν

degrees of freedom evaluated at z and

t_{ν} (z)

is the corresponding density function. This distribution is denoted

X \sim M E S T_{n} (μ, \sum, λ, τ; ν)

. As in Section 2, this section of the paper presents basic properties of the MEST distribution. Similar to Table 1, unreported results show that nonzero values of

τ

make a substantial difference to the moments of the distribution. As

τ \to \infty

, the limiting distribution of

X

is multivariate Student.

Proposition 3.

Let

X \sim M E S T_{n} (μ, \sum, λ, τ; ν)

. The limiting distribution as

τ \to \infty

is multivariate Student with location parameter

μ + λ τ

, scale matrix

\sum + λ λ^{T}

, and ν degrees of freedom; denoted

X \sim M V T_{n} (μ + λ τ, \sum + λ λ^{T}; ν)

.

The proof of this result uses the scale mixture representation reported in Lemma 3 of [29]. This result is consistent with the analogous property of the MESN distribution reported in Section 2. As shown later in this section, however, the limiting distribution of

X

as

τ \to - \infty

in the MEST case is different from that for the MESN.

Figure 6 shows sketches of the extended skew-Student density function for

λ = 0

and

ν = 3

. The left-hand panel shows density functions with negative values of

τ

ranging from

- 30

to

- 1

.The right-hand side shows densities with positive values of

τ

ranging from 0 to 30. This symmetric density function is that reported in both [27,28]. Two notable features are, first, the similarity of the density function for increasing positive values of

τ

, but, second, the increasing spread of the density function as

| τ |

increases for negative values of

τ

. For

λ = 5

, the left-hand panel of Figure 7 shows density functions with the same negative values of

τ

. The right-hand panel shows densities with

τ

ranging from 0 to 20. In both of these figures,

μ = 0

and

σ = 1

. In the right-hand panel of Figure 7, the density function is qualitatively similar to the corresponding skew-normal distribution: asymmetry disappears with increasing values of

τ

, and the location parameter increases, but the spread does not. For negative values of

τ

, the spread increases and asymmetry decreases with increasing values of

| τ |

. To support the sketches in the figures, the moments of the extended skew-Student distribution are reported in Section 3.3 below.

A canonical form of the MEST distribution may be derived using an approach that is essentially the same as that in Proposition 1.

Proposition 4.

Let

X \sim M E S T_{n} (μ, \sum, λ, τ; ν)

and let

Y = \sum^{- 1 / 2} (X - μ)

, where

\sum^{1 / 2}

is a left square-root matrix of Σ. Then

Y \sim M E S T_{n} (0_{n}, I_{n}, ψ, τ; ν)

,

ψ = \sum^{- 1 / 2} λ

, and there exists an orthogonal transformation of

Y

T^{T} Y = \tilde{Z} = {(Z_{0}, Z^{T})}^{T},

such that the density function of

\tilde{Z}

is

f (z_{0}, z) = \frac{K_{ν, n}}{\sqrt{1 + ψ_{0}^{2}} {\{1 + Q_{c} (\tilde{z}) / ν\}}^{(ν + n) / 2}} \frac{T_{ν + n} (Δ_{S T, c})}{T_{ν} (τ)}; ψ_{0} = \sqrt{ψ^{T} ψ},

where

K_{ν, n}

is the normalizing constant for an n-variate multivariate Student distribution with ν degrees of freedom,

Q_{c} (\tilde{z}) = z^{T} z^{T} + \frac{{(z_{0} - ψ_{0} τ)}^{2}}{(1 + ψ_{0}^{2})},

and

Δ_{S T, c} = \frac{\sqrt{ν + n} \{τ ω_{c} + ψ_{0} (z_{0} - ψ_{0} τ) / ω_{c}\}}{\sqrt{ν + Q_{c} (\tilde{z})}}; ω_{c} = \sqrt{1 + ψ_{0}^{2}} .

Equivalently,

\tilde{Z} \sim M E S T_{n} (0_{n}, I_{n}, ψ_{n}, τ; ν)

where

ψ_{n} = {(ψ_{0}, 0_{n - 1}^{T})}^{T}

.

Standard manipulations show that

Z_{0} \sim E S T (0, 1, ψ_{0}, τ; ν)

and that the marginal distribution of

Z

has the symmetric Student-like density function reported in Section 3.2 of [27].

3.1. The Truncated Student’s t Distribution

Similar to the extended skew-normal, the properties of the extended skew-Student distribution are substantially affected by those of the truncated form of Student’s t. The density function of the truncated Student’s t variable v is

f_{V} (v) = \frac{t_{ν, 1} (v, τ, 1)}{T_{ν} (τ)} .

(30)

Figure 8 shows sketches of the truncated Student t density function for

τ = - 35

, together with two approximating beta type-2 density functions as described below in Lemma 5. The degrees of freedom

ν

are 5 and 20, respectively. For a fixed value of

τ

, the figure illustrates the increasing severity of decay as

ν

increases. It is notable that for

ν \geq 5

, the truncated Student t is well approximated by the beta type-2 densities.

3.2. Moments of the Truncated Student’s t Distribution

Moments of the truncated distribution at Equation (30) may be evaluated directly. Note that expressions for the moments of a doubly truncated t distribution may be found in [30]. As reported in [27], for

ν > 1

and

ν > 2

, respectively, the mean and variance of this distribution are

E (V) = τ + ξ_{ν} (τ), v a r (V) = η_{ν} (τ) - ξ_{ν} {(τ)}^{2},

(31)

where

ξ_{ν} (τ) = \frac{ν (1 + τ^{2} / ν) t_{ν} (τ)}{(ν - 1) T_{ν} (τ)}, η_{ν} (τ) = - τ ξ_{ν} (τ) + \frac{ν T_{ν - 2} (τ \sqrt{(ν - 2) / ν})}{(ν - 2) T_{ν} (τ)} .

(32)

The following result, derived using integration by parts, leads to a more useful representation of

η_{ν} (τ)

.

Lemma 3.

For

ν > 2

, the following result holds

\frac{τ t_{ν} (τ) (1 + τ^{2} / ν)}{(ν - 1) T_{ν} (τ)} + \frac{T_{ν - 2} (τ \sqrt{(ν - 2) / ν})}{T_{ν} (τ)} = 1 .

Using this result, for

ν > 2

, the functions

η_{ν} (τ)

and

ξ_{ν} (τ)

are related by the identity

η_{ν} (τ) = \frac{ν}{ν - 2} - \frac{τ (ν - 1) ξ_{ν} (τ)}{ν - 2} .

(33)

Equation (33) allows the variance to be written as

v a r (V) = \frac{ν}{ν - 2} - \frac{τ (ν - 1) ξ_{ν} (τ)}{ν - 2} - ξ_{ν} {(τ)}^{2},

(34)

Note that

l i m_{ν \to \infty} ξ_{ν} (τ) = ξ_{1} (τ)

is sufficient to show that the limiting values in Equation (31) equal those for the truncated normal at Equation (16). For

ν > 3

and

ν > 4

, skewess and kurtosis (the fourth moment about the mean), respectively, are

κ_{3} (V) = ξ_{ν} (τ) \{\frac{(ν - 1)}{(ν - 3)} τ^{2} - \frac{ν (ν - 5)}{(ν - 2) (ν - 3)} + \frac{3 (ν - 1)}{(ν - 2)} τ ξ_{ν} (τ) + 2 ξ_{ν} {(τ)}^{2}\}

(35)

and

{\bar{κ}}_{4} (V) = \frac{3 ν^{2}}{(ν - 2) (ν - 4)} + K_{1} ξ_{ν} (τ) + K_{2} ξ_{ν} {(τ)}^{2} + K_{3} τ ξ_{ν} {(τ)}^{3} - 3 ξ_{ν} {(τ)}^{4},

(36)

where

K_{1} = - \frac{(ν - 1)}{(ν - 4)} \{τ^{3} + \frac{3 ν τ}{(ν - 2)}\}; K_{2} = - \{\frac{2 ν (ν + 1)}{(ν - 2) (ν - 3)} + \frac{4 (ν - 1) τ^{2}}{(ν - 3)}\},

(37)

and

K_{3} = - \frac{6 (ν - 1)}{(ν - 2)} .

(38)

As already noted above, reference [25] showed that the skewness of the truncated normal distribution is non-negative for all values of

τ

. The following shows that the same result holds for the truncated Student distribution.

Proposition 5.

Let

V \sim T T_{ν} {(τ, 1; 0)}^{+}

. For

ν > 3

, the following result holds:

κ_{3} (V) \geq 0

.

The proof is by contradiction. First, note that since

ξ_{ν} (τ) \geq 0

, the sign of

κ_{3} (V)

is determined by the sign of the expression in

\{.\}

in Equation (35). This quadratic function of

ξ_{ν} (τ)

has roots

\frac{\frac{- 3 (ν - 1) τ}{(ν - 2)}}{4} \pm \frac{\sqrt{{\{\frac{(ν - 1) τ}{(ν - 2)}\}}^{2} \{\frac{(ν + 1) (ν - 5)}{(ν - 1) (ν - 3)}\} + \frac{8 ν (ν - 5)}{(ν - 2) (ν - 3)}}}{4} .

Since the coefficient

ξ_{ν} {(τ)}^{2}

is positive, the function is negative between the roots, which is a contradiction.

Note that as

ν \to \infty

, Proposition 5 also establishes Sampford’s result, and note that the expressions for the first four moments tend to those for the truncated normal distribution at Equations (16), (17), and (19).

Computation of limiting expressions for the moments as

τ \to - \infty

requires a result that is analogous to the well-known asymptotic expression for normal distribution reported in [24]. Such a result was first reported in [31]. As it does not appear to be well known, it is summarized below in the notation of this paper.

Lemma 4

([31]). For values of

τ

that are less than zero, the asymptotic expansion of

T_{ν} (τ)

is

t_{ν} (τ) (1 + τ^{2} / ν) \{\frac{1}{| τ |} + \sum_{j = 1}^{m} \frac{{(- 1)}^{j} Γ (2 j) a_{j}}{Γ (j) 2^{j - 1} {| τ |}^{2 j + 1}} + R_{m, ν} (| τ |)\}; a_{j} = \frac{ν^{j} Γ (\frac{ν}{2} + 1)}{2^{j} Γ (\frac{ν}{2} + 1 + j)} .

(39)

Noting that with suitable choices of m and values of

τ

, the remainder term

R_{m} (.)

R_{m, ν} (| τ |) = \frac{{(- 1)}^{m + 1} Γ (2 m) (2 m + 1) a_{m + 1}}{Γ (m) 2^{m - 1}} \int_{- \infty}^{τ} \frac{t_{ν} (x) (1 + τ^{2} / ν)}{x^{2 m + 1}} d x,

(40)

may be ignored.

Using the first two terms in the expansion in Lemma 4 for

τ < 0

and

ν > 1

gives

ξ_{ν} (τ) ≃ \frac{ν | τ |}{(ν - 1)} + \frac{ν^{2}}{(ν - 1) (ν + 2) | τ |} .

(41)

from which the asymptotic expected value is

E (V) = τ + ξ_{ν} (τ) ≃ \frac{| τ |}{(ν - 1)} + \frac{ν^{2}}{(ν - 1) (ν + 2) | τ |}; τ < 0 .

(42)

For

ν > 2

, the corresponding expression for the asymptotic variance is

\frac{{ν | τ |}^{2}}{{(ν - 1)}^{2} (ν - 2)} + \frac{2 ν (ν^{2} - ν + 1)}{{(ν - 1)}^{2} (ν - 2) (ν + 2)} + \frac{ν^{3} (ν^{3} - 4 ν^{2} + 2 ν - 2)}{{(ν - 1)}^{2} (ν - 2) {(ν + 2)}^{2} (ν + 4) {| τ |}^{2}} .

(43)

Thus, for fixed finite degrees of freedom

ν > 2

, the expected value and variance increase without limit as

τ \to - \infty

. As

ν \to \infty

, the expected value and variance tend to

1 / | τ |

and

1 / {| τ |}^{2}

, respectively, the results for the truncated normal distribution. The corresponding expressions for skewness and kurtosis are omitted in view of their complexity. However, if just the terms proportional to

{| τ |}^{3}

are considered, then for

ν > 3

as

τ \to - \infty

, asymptotic skewness is

\frac{{2 ν | τ |}^{3}}{(ν - 1)} \{\frac{ν^{2}}{{(ν - 1)}^{2}} - \frac{ν}{(ν - 2)} + \frac{1}{(ν - 2) (ν - 3)}\} .

(44)

Similarly for

ν > 4

, asymptotic kurtosis is proportional to

{| τ |}^{4}

. Table 5 shows a selection of moments from the truncated Student’s t distribution. As

τ

increases above zero, the distribution increasingly resembles Student’s t as demonstrated by the values in the bottom panel of the table. The top panel corresponding to

τ = - 35

shows the increasing values of the moments. The analog of the limiting exponential distribution that arises in the normal case described in Section 2.1 is as follows.

Lemma 5.

Let

V \sim T T_{ν} {(\tilde{τ}, 1; 0)}^{+}

. For

τ < < 0

, as the ratio

| τ | / \sqrt{ν}

increases without limit, the asymptotic distribution of

Y = V / | τ |

is

β_{I I} (1, ν)

, that is, with density function.

f_{Y} (y) = \frac{ν}{{(1 + y)}^{ν + 1}} .

(45)

The proof of this lemma is in Appendix A. An asymptotically equivalent result is that the variable

\tilde{Y} = V / \sqrt{ν + {| τ |}^{2}}

is also distributed as

β_{I I} (1, ν)

.

It is straightforward to show that the conditional distribution of

X

given

V = v

follows a multivariate Student distribution with

ν + 1

degrees of freedom, location parameter vector

μ + λ v

, and scale matrix

\frac{ν}{ν + 1} \sum \{1 + \frac{{(v - τ)}^{2}}{ν}\} .

(46)

Use of this distribution in conjunction with the asymptotic distribution of V in Equation (45), for

τ < 0

does not lead to tractable results that are analogous to those in Section 2.

3.3. Moments of the MEST Distribution

For

ν > 1

and

ν > 2

, respectively, the mean vector and covariance matrix of the MEST distribution are

E (X) = μ + λ \{τ + ξ_{ν} (τ)\} = α_{ν},

(47)

and

c o v (X) = ν {1 + η_{ν} (τ) / ν} \sum / (ν - 1) + λ λ^{T} \{η_{ν} (τ) - ξ_{ν} {(τ)}^{2}\} = Ω_{ν} .

(48)

Using the identity at Equation (33) allows the covariance matrix to be written as

\{\frac{ν}{(ν - 2)} - \frac{τ}{(ν - 2)} ξ_{ν} (τ)\} \sum + λ λ^{T} \{\frac{ν}{(ν - 2)} - \frac{(ν - 1)}{(ν - 2)} τ ξ_{ν} (τ) - ξ_{ν} {(τ)}^{2}\} .

(49)

The similarity of the coeffcient of

λ λ^{T}

to the corresponding term in Equation (6) may be noted. The coefficient of

\sum

provides the inequality

ν - τ ξ_{ν} (τ) \geq 0

.

The skewness of a single variable

X_{i}

in

X

with scale denoted by

σ

may be expressed in terms of the moments of V the truncated Student’s t variable, specifically Equations (34) and (35), and is given by

κ_{3} (X_{i}) = \frac{6 λ_{i} σ^{2}}{(ν - 1)} v a r (V) + λ_{i} \{λ_{i}^{2} + \frac{3 σ^{2}}{(ν - 1)}\} κ_{3} (V),

(50)

Defining the constants

K_{4} = \frac{3 ν^{2} σ^{4}}{(ν - 1) (ν - 3)}, K_{5} = \frac{6 λ^{2} σ^{2} ν}{(ν - 1)}, χ = \{1 + \frac{ξ_{ν} {(τ)}^{2}}{ν}\}, ϑ = \{1 + \frac{3 ξ_{ν} {(τ)}^{2}}{ν}\}

(51)

The kurtosis of

X_{i}

is given by

\begin{matrix} {\bar{κ}}_{4} = & K_{4} χ^{2} + [\frac{2 K_{4}}{ν} ϑ + K_{5} χ] v a r (V) + \frac{2 ξ_{ν} (τ)}{ν} (\frac{2 K_{4}}{ν} + K_{5}) κ_{3} (V) \\ + (\frac{K_{4} + K_{5}}{ν} + λ^{4}) {\bar{κ}}_{4} (V) \end{matrix}

(52)

The corresponding expressions for coskewness and cokurtosis are omitted. A selection of moments of the extended skew-Student is shown in Table 6, Table 7, Table 8 and Table 9. Table 6 [7] shows results for

τ \leq 0

[

τ \geq 0

] for

λ = 0

. The panel for

τ = 0

is repeated for convenience and corresponds to Student’s t distribution. The lower panels of Table 6 show the increasing magnitude of variance and kurtosis as

| τ |

increases, even for

λ = 0

. Table 8 and Table 9 show the corresponding results for

λ = 5

. Note that in Table 8, some large results are shown to two decimal places only to preserve the formatting.

3.4. Standardized Forms of the MEST Distribution

As in Section 2.3, further insights into the extended skew-Student distribution may be obtained by standardization. If

Ω_{ν}^{1 / 2}

denotes a left square root matrix of

Ω_{ν}

, the random vector

Z

now defined as

Z = Ω_{ν}^{- 1 / 2} (X - α_{ν}) .

(53)

satisfies

E (Z) = 0_{n}

and

c o v (Z) = I_{n}

. The distribution of

Z

has the density function

f_{Z} (z) = {| Ω_{ν} |}^{1 / 2} t_{ν, n} (Ω_{ν}^{1 / 2} z, - λ ξ_{ν} (τ), \sum + λ λ^{T}) \frac{T_{ν + n} (Δ_{S S T})}{T_{ν} (τ)},

(54)

where

t_{ν, n} (.)

is as defined for Equation (28),

ω

is as defined for Equation (1) and

Δ_{S S T} = \frac{\sqrt{ν + n} [τ ω + λ^{T} \sum^{- 1} \{Ω_{ν}^{1 / 2} z + λ ξ_{ν} (τ)\} / ω]}{\sqrt{ν + {\{Ω_{ν}^{1 / 2} z + λ ξ_{ν} (τ)\}}^{T} {(\sum + λ λ^{T})}^{- 1} \{Ω_{ν}^{1 / 2} z + λ ξ_{ν} (τ)\}}},

(55)

The distribution at Equation (54) has a canonical form. First, define

β_{0, ν} = \frac{ν}{(ν - 1)} \{1 + \frac{η_{ν} (τ)}{ν}\}; β_{1, ν} = \frac{\{η_{ν} (τ) - ξ_{ν} {(τ)}^{2}\}}{β_{0}},

(56)

partition

Z

into a scalar

Z_{0}

and an

(n - 1)

-vector

Z_{1}

and let

Q (Z)

be the quadratic form

Q (z) = β_{0, ν} z_{1}^{T} z_{1} + {\tilde{z}}_{0}^{2}; {\tilde{z}}_{0} = \frac{\{\sqrt{β_{0, ν}} \sqrt{1 + β_{1, ν} ψ_{0}^{2}} z_{0} + ψ_{0} ξ_{ν} (τ)\}}{\sqrt{1 + ψ_{0}^{2}}}; ψ_{0} = \sqrt{ψ^{T} ψ},

(57)

where

ψ

is as defined in Proposition 1. Methods similar to those used in that proposition gives the following result.

Proposition 6.

Let

X \sim M E S T_{n} (μ, \sum, λ, τ; ν)

and

Z = Ω_{ν}^{- 1 / 2} (X - α_{ν})

, where

Ω_{ν}^{1 / 2}

is a left square-root matrix of

Ω_{ν}

. Then

Z \sim M E S T_{n} (μ_{c, ν}, \sum_{c, ν}, λ_{c, ν}, τ; ν)

where

μ_{c, ν} = - λ_{c, ν} \{τ + ξ_{ν} (τ)\}; λ_{c, ν} = {\{λ_{ν}, 0_{n - 1}^{T}\}}^{T}; λ_{ν} = \frac{ψ_{0}}{\sqrt{β_{0, ν} (1 + β_{1, ν} ψ_{0}^{2})}},

and

\begin{matrix} \sum_{c, ν} = [\begin{matrix} σ_{ν}^{2} & 0_{n - 1}^{T} \\ 0_{n - 1}^{T} & I_{n - 1} / β_{0, ν} \end{matrix}], σ_{ν}^{2} = 1 / \{β_{0, ν} (1 + β_{1, ν} ψ_{0}^{2})\} . \end{matrix}

The density function of

Z

is

f_{Z} (z) = \sqrt{\frac{β_{0}^{n} (1 + β_{1} ψ_{0}^{2})}{(1 + ψ_{0}^{2})}} \frac{K_{ν, n}}{{\{1 + \frac{Q (z)}{ν}\}}^{(ν + n) / 2}} \frac{T_{ν + n} (Δ_{C S S T})}{T_{ν} (τ)},

(58)

where

Δ_{C S S T} = \frac{\sqrt{ν + n} (τ \sqrt{1 + ψ^{T} ψ} + \sqrt{ψ^{T} ψ} {\tilde{z}}_{0})}{\sqrt{ν + Q (z)}} .

(59)

As Equations (58) and (59) show, under the canonical representation, the asymmetry in the density function is attributable solely to the scalar variable

{\tilde{Z}}_{0}

. The marginal distribution of

Z_{1}

is symmetric and of the same type reported Section 3.2 of [27]. Examples of the EST and standardized EST density functions are shown in Figure 9 for

τ = - 30

and

- 5

and

ν = 10, 20

, and 100. In the upper (lower) row,

λ = 0 [5]

. The X-scales are the same in each panel. The graphs confirm results from Table 8 and Table 09, namely that the degree of asymmetry is reduced under standardization. Examples of contour plots for the bivariate EST and standardized EST distributions are shown in Figure 10.

To investigate the behavior of the distribution as

τ \to - \infty

for fixed

ν

, consider the scalar variable

Z_{0}

, which has the marginal distribution

E S T (μ_{ν}, σ_{ν}^{2}, λ_{ν}, τ; ν)

where

μ_{ν} = - λ_{ν} \{τ + ξ_{ν} (τ)\}

. For

ν > 2

, define

A = \frac{\sqrt{ν (ν - 2)} ψ_{0}}{\sqrt{ν - 1 + ψ_{0}^{2}}}; B = \frac{ν \{1 + ψ_{0}^{2} / (ν - 1)\}}{(ν - 1) (ν - 2) (1 + ψ_{0}^{2})} .

(60)

As

τ \to - \infty

for fixed

ν

, the asymptotic density function of

Z_{0}

is

f (z_{0}) = \frac{| τ | \sqrt{B} K_{ν, 1}}{{\{1 + {(z_{0} + A)}^{2} {| τ |}^{2} B / ν\}}^{(ν + 1) / 2}} \frac{T_{ν + 1} (Δ_{E S T})}{T_{ν} (τ)},

(61)

where

Δ_{E S T} = \sqrt{\frac{ν + 1}{ν}} \frac{\{τ \sqrt{1 + ψ_{0}^{2}} + ψ_{0} (z_{0} + A) | τ | \sqrt{B}\}}{\sqrt{1 + {(z_{0} + A)}^{2} {| τ |}^{2} B / ν}} .

(62)

This leads to the following result:

Proposition 7.

For

ν > 2

, as

τ / \sqrt{ν} \to - \infty

, the distribution of

Z_{0}

has the asymptotic density function

f (z_{0}) = \frac{ν}{B^{ν / 2} {| z_{0} + A |}^{ν + 1}} T_{ν + 1} \{\sqrt{ν + 1} (\pm ψ_{0} - \frac{\sqrt{1 + ψ_{0}^{2}}}{B^{1 / 2} | z_{0} + A |})\}; z_{0} \neq - A,

(63)

with the sign of

ψ_{0}

determined by the sign of

z_{0} + A

, and

f (- A) = \frac{\sqrt{1 + ψ_{0}^{2}} ν^{3 / 2} K_{ν + 1, 1}}{{(1 + ψ_{0}^{2})}^{ν / 2 + 1} \sqrt{(ν - 1) (ν - 2) (ν + 1)}} .

(64)

The result in this proposition requires the asymptotic expression for the distribution function of Student’s t. As noted above, such a result was first provided by [31] and is summarized in Lemma 4. Comparative examples of the exact and asymptotic EST density functions are shown in Figure 11. The implication of Proposition 7 is that as

τ \to - \infty

, the standardized distribution is qualitatively similar to the corresponding form for the extended skew-normal in that dependence on

τ

disappears. For nonzero values of

λ

or

ψ_{0}

, however, the distribution remains asymmetric. It is important to note though that, unlike the MESN, dependence on

τ

as it tends to

- \infty

does not disappear in the nonstandardized MEST case. In addition to Proposition 7, recall from results in Section 3.2 and Section 3.3 that for finite degrees of freedom, the location parameter vector depends on

| τ |

and the covariance matrix on

{| τ |}^{2}

.

4. Hidden Truncation Models

In their simple form, hidden truncation models are concerned with the bivariate normal distribution of

(X, Y)

in situations in which X is observed if Y is greater than (less than) a given threshold, here denoted

\tilde{τ}

. The procedure is commonly referred to as selective sampling. The resulting conditional distribution is that of

X | Y \geq (\leq) \tilde{τ}

. Such a construction is reported in a more general form in [12] for the case in which the scalar X is replaced by a random vector

X

. The phrase hidden truncation models is more often associated with the [13] in which they refer to an earlier work [32]. In selective sampling situations, it seems self-evident that the threshold

\tilde{τ}

will depend on the application in question. This is clearly implied in Section 2 of [13] in which they denote the threshold by

α

and report the resulting distribution of X conditional on

Y \geq α

, which is the extended skew-normal. The extended version of the skew-normal is also described in [33]. In the introduction to a sole-authored later paper, [34], Y is assumed to exceed its expected value. This case is more in keeping with the skew-normal literature, which does not generally employ the extended version of the distribution. Subsequent sections of [34], however, are inter alia concerned with extended versions of the skew-normal and other distributions.

The aim of this section is to present limiting forms of the extended skew-normal and skew-Student distributions when they are derived as hidden truncation models. Consistent with the results in Section 2 and Section 3, the limiting distributions exhibit different properties. The distributions of the hidden truncated variable Y and the observed vector

X

differ markedly depending on whether the underlying form is normal or Student’s t. In selective sampling, limiting forms of the distributions arise when the notional observation on the conditioning variable Y is required to be in one of the tails of its distribution. To illustrate the differences between the hidden truncation skew-normal and skew-Student distributions, either extended or not, this section contains a table of critical values corresponding to a probability of 0.025. Critical values corresponding to other probabilities are available on request. In addition to these general results, Section 4.4 describes an application to stock market crashes, in which the truncated variable is not only material to the resulting distribution but is also observed.

4.1. Hidden Truncation under the Normal Distribution

It is assumed that the n-vector

X

and a scalar variable denoted Y have a multivariate normal distribution

\begin{matrix} [\begin{matrix} X \\ Y \end{matrix}] \sim N_{(n + 1)} ([\begin{matrix} μ_{X} \\ μ_{Y} \end{matrix}], [\begin{matrix} \sum_{X} & δ \\ δ^{T} & σ_{Y}^{2} \end{matrix}]), \end{matrix}

(65)

The conditional distribution of

X

, given that

Y \leq \tilde{τ}

, has the probability density function

\begin{matrix} f_{X | Y \leq \tilde{τ}} (x | Y \leq \tilde{τ}) = ϕ_{n} (x, μ_{X}, \sum_{X}) \frac{Φ [\{τ σ_{Y} - δ^{T} \sum_{X}^{- 1} (x - μ_{X})\} / ω_{Y}]}{Φ (τ)}, \end{matrix}

(66)

where

ω_{Y} = \sqrt{σ_{Y}^{2} - δ^{T} \sum^{- 1} δ}, τ = (\tilde{τ} - μ_{Y}) / σ_{Y} .

(67)

The moment-generating function of the conditional distribution of

X

given

Y \leq \tilde{τ}

is

M_{X} (t) = e^{μ_{x}^{T} t + t^{T} \sum_{X} t / 2} \frac{Φ (τ - δ^{T} t / σ_{Y})}{Φ (τ)},

(68)

and that of Y is given

Y \leq \tilde{τ}

M_{Y} (s) = e^{μ_{Y} s + s^{2} σ_{Y}^{2} / 2} \frac{Φ (τ - σ_{Y} s)}{Φ (τ)} .

(69)

Noting the similarity to the MGF of the truncated variable denoted V in Section 2.1, it follows that

E (Y | Y \leq \tilde{τ}) = μ_{Y} - σ_{Y} ξ_{1} (τ), v a r (Y | Y \leq \tilde{τ}) = σ_{Y}^{2} \{1 + ξ_{2} (τ)\},

(70)

As

\tilde{τ} \to - \infty

, the variable Y given that it is less than or equal to

\tilde{τ}

becomes deterministic in the sense that its expected value is asymptotically equal to

\tilde{τ}

, but its variance and all higher moments are asymptotically equal to zero. The conditional expected return and covariance matrix of

X

are, respectively,

E (X | Y \leq \tilde{τ}) = μ_{X} - (δ / σ_{Y}) ξ_{1} (τ), c o v (X | Y \leq \tilde{τ}) = \sum_{X} + (δ δ^{T} / σ_{Y}^{2}) ξ_{2} (τ) .

(71)

As

\tilde{τ} \to - \infty

, the vector of expected values and the covariance matrix become

E (X | Y \leq \tilde{τ}) ≃ E (X | Y = \tilde{τ}), c o v (X | Y \leq \tilde{τ}) ≃ c o v (X | Y = \tilde{τ}) .

(72)

It is interesting to note that element i of the vector of expected values decreases or increases depending upon whether

δ_{i}

is positive or negative. The joint moment-generating function of

X

and Y conditional on

Y \leq \tilde{τ}

is

M_{X, Y | Y \leq \tilde{τ}} (t, s) = e^{μ_{x}^{T} t + t^{T} \sum_{X} t / 2 + t^{T} δ s + μ_{Y} s + s^{2} σ_{Y}^{2} / 2} \frac{Φ (τ - δ^{T} t / σ_{Y} - σ_{Y} s)}{Φ (τ)},

(73)

from which

c o v (X, Y | Y \leq \tilde{τ}) = δ \{1 + ξ_{2} (τ)\} .

(74)

Using similar arguments to those for Lemma 1, as

\tilde{τ} \to - \infty

, the covariances all tend to zero as expected.

4.2. Hidden Truncation under Student’s t Distribution

It is now assumed that the n-vector

X

and a scalar variable Y have a multivariate Student distribution with

ν

degrees of freedom. The conditional distribution of

X

, given that

Y \leq \tilde{τ}

, has the probability density function

f_{X | Y \leq \tilde{τ}} (x | Y \leq \tilde{τ}) = t_{n, ν} (x, μ, \sum) \frac{T_{ν + n} [\{τ σ_{Y} - δ^{T} \sum^{- 1} (x - μ)\} / (ω_{Y} Ψ)]}{T_{ν} (τ)}

(75)

where

ω_{Y}

and

τ

are as defined above and

Ψ = \sqrt{ν + {(x - μ)}^{T} \sum^{- 1} (x - μ)} / \sqrt{ν + n} .

(76)

The conditional mean and variance of Y are

E (Y | Y \leq \tilde{τ}) = μ_{Y} - σ_{Y} ξ_{ν} (τ), v a r (Y | Y \leq \tilde{τ}) = σ_{Y}^{2} \{η_{ν} (τ) - ξ_{ν} {(τ)}^{2}\},

(77)

where

ξ_{ν} (τ)

and

η_{ν} (τ)

are defined at Equation (32). As

\tilde{τ} \to - \infty

, the asymptotic expected value and variance are

E (Y | Y \leq \tilde{τ}) ≃ - \frac{μ_{Y}}{(ν - 1)} - \frac{ν | \tilde{τ} |}{(ν - 1)}, v a r (Y | Y \leq \tilde{τ}) ≃ \frac{ν τ^{2}}{{(ν - 1)}^{2} (ν - 2)} .

(78)

For finite and fixed degrees of freedom, and ignoring

μ_{Y}

for ease of exposition, the conditional expected value is uplifted through multiplication by

ν / (ν - 1)

, that is, the effect is most pronounced when the degrees of freedom are small. The asymptotic variance increases with

{| τ |}^{2}

, that is, potentially without limit. The conditional expected return and covariance matrix of

X

are

E (X | Y \leq \tilde{τ}) = μ_{X} - (δ / σ_{Y}) ξ_{ν} (τ),

(79)

and

c o v (X | Y \leq \tilde{τ}) = \frac{ν}{(ν - 1)} \{1 + \frac{η_{ν} (τ)}{ν}\} \sum_{X, C} + \frac{δ δ^{T}}{σ_{Y}^{2}} \{η_{ν} (τ) - ξ_{ν} {(τ)}^{2}\}

(80)

where

\sum_{X, C} = \sum_{X} - (δ δ^{T} / σ_{Y}^{2}) .

(81)

As

\tilde{τ} \to - \infty

, the vector of expected values and the covariance matrix become

E (X | Y \leq \tilde{τ}) ≃ E (X | Y = \tilde{τ}) + \frac{δ (\tilde{τ} - μ_{Y})}{(ν - 1) σ_{Y}^{2}},

(82)

and

c o v (X | Y \leq \tilde{τ}) ≃ \frac{ν}{(ν - 1)} \{\frac{(ν - 1)}{(ν - 2)} + \frac{τ^{2}}{(ν - 2)}\} \sum_{X, C} + \frac{δ δ^{T}}{σ_{Y}^{2}} \{\frac{ν τ^{2}}{{(ν - 1)}^{2} (ν - 2)}\} .

(83)

That is, for finite degrees of freedom, both expected values and the covariance matrix increase in magnitude without limit as

\tilde{τ} \to - \infty

. Similar to Equation (71), the conditional expected value of element i of

X

will increase without limit if the corresponding value of

δ_{i}

is negative and is unaffected if it equals zero.

Comparing the normal and Student hidden truncation models, the vectors of expected values are mainly determined by

τ

. Differences will be marked only if the degrees of freedom are small. The covariance matrices differ substantially: in the Student case for fixed

ν

, the covariance matrix increases without limit as

\tilde{τ} \to - \infty

. For a given finite value of

τ < < 0

, the increase in the elements of the covariance matrix decreases with increasing

ν

. The conditional covariance between

X

and Y is

c o v (X, Y | Y \leq \tilde{τ}) = \frac{δ}{σ_{Y}} \{η_{ν} (τ) - ξ_{ν} {(τ)}^{2}\} .

(84)

Standard manipulations using Equation (83) show that the conditional correlation between a typical element i of

X

and Y is asymptotically equal to

{\{1 + σ_{Y}^{2} σ_{X, C, i}^{2} (ν - 1) / δ_{i}^{2}\}}^{- 1 / 2},

which tends to zero as

ν \to \infty

.

4.3. Hidden Truncation with Extended Distributions

Table 10 shows critical values corresponding to a probability of 0.025 for the univariate versions of distributions at Equations (66) and (75) for a range of values of

τ

,

ρ

, and

ν

. Table entries are computed numerically, displayed to two decimal places. In Panel 4, corresponding to the standard case

τ = 0

, the first row,

ρ = 0

yields the critical values for Student’s t distribution with

5, 10, 20, 50

, and 100 degrees of freedom and the standard normal distribution. The other rows in the same panel correspond to

ρ = 0.2, 0.4, 0.6

, and

0.8

. As the panel shows the critical values range from

- 1.96

to

- 3.15

. In Panels 1 to 3, for which

τ

takes negative values, the range is greater and increases with the magnitude of

τ

. In panels 5 to 7, with positive values of

τ

, the critical values closely approximate those of Student’s t and the normal distribution as expected. In each panel, the rows corresponding to

ρ = 0

are the critical values of the nonstandard symmetric Student-like distribution reported in both [27,28]. The effect of the distribution of

X

and Y and the threshold

\tilde{τ}

has a non-negligible effect on critical values, that is, for many applications, extended versions of the distributions may be preferred.

4.4. Stock Market Crashes

The basic empirical model for the returns on stocks is a regression in which the single explanatory variable is the contemporaneous return on a suitable market index, such as the UK’s FTSE100 or the USA’s S&P 500. The model is generally referred to as the market model. It is the operational version of the capital asset pricing model, universally referred to as the CAPM, of [35,36,37]. Numerous other regression setups are in widespread use, but all maintain a close connection to the market model. More formally, it is assumed that the n-vector of asset returns

R

and the contemporaneous return on the market index

R_{m}

have a multivariate normal distribution

\begin{matrix} [\begin{matrix} R \\ R_{m} \end{matrix}] \sim N_{(n + 1)} ([\begin{matrix} μ \\ μ_{m} \end{matrix}], [\begin{matrix} \sum_{R} & δ \\ δ^{T} & σ_{m}^{2} \end{matrix}]), \end{matrix}

(85)

where

δ = β σ_{m}^{2}

. An element

R_{i}

of

R

may denote the return on an individual stock or a portfolio of stocks. The market model is then the conditional distribution of

R

given that R_m = r_m that is

R | R_{m} = r_{m} \sim N_{n} (μ + β (r_{m} - μ_{m}), \sum_{R C}); \sum_{R C} = \sum_{R} - σ_{m}^{2} β β^{T} .

(86)

or, if the market model is written in familiar regression style notation

R = μ + β (R_{m} - μ_{m}) + ϵ;

(87)

The results with an underlying Student distribution are similar. For

ν > 1

, the conditional mean is the same, but for

ν > 2

, the conditional covariance matrix now depends on

r_{m}

as follows

c o v (R | R_{m} = r_{m}) = \frac{ν}{(ν - 1)} \{1 + \frac{{(r_{m} - μ_{m})}^{2}}{ν σ_{m}^{2}}\} \sum_{R C} .

(88)

That is, the conditional variance is inflated by a factor that is proportional to the squared deviation of

r_{m}

from its expected value.

In this subsection, the effect of a market crash is considered. A detailed coverage of the statistical and empirical properties of crashes is beyond the scope of this paper, but some theoretical insights into crashes may be derived using the skew-normal and skew-Student distributions. Specifically, the standard conditioning event

R_{m} = r_{m}

is changed to

R_{m} \leq \tilde{τ}

. This characterizes a crash when

\tilde{τ}

is both negative and of large magnitude. Comparison of Equation (85) with (65) and (66) shows that the resulting conditional distribution of

R

is extended skew-normal or extended skew-Student. For underlying normal returns, the conditional mean and variance of market returns are, respectively,

E (R_{m} | R_{m} \leq \tilde{τ}) = μ_{m} - σ_{m} ξ_{1} (τ); v a r (R_{m} | R_{m} \leq \tilde{τ}) = σ_{m}^{2} \{1 + ξ_{2} (τ)\},

(89)

where

τ = (\tilde{τ} - μ_{m}) / σ_{m}

. Similar to the results in Section 4.1, in the limit, as

\tilde{τ} \to - \infty

, market return becomes nonstochastic with (expected) value equal to

\tilde{τ}

.

The corresponding results for the conditional mean vector and covariance matrix of asset returns

R

are

E (R | R_{m} \leq \tilde{τ}) = μ - σ_{m} β ξ_{1} (τ) ≃ μ + β (\tilde{τ} - μ_{m}),

(90)

and

c o v (R | R_{m} \leq \tilde{τ}) = \sum_{R C} + σ_{m}^{2} β β^{T} \{1 + ξ_{2} (τ)\} ≃ \sum_{R C} .

(91)

In a crash, the conditional expected return on asset i decreases or increases without limit depending on the sign of

β_{i}

, but there is no effect if

β_{i} = 0

. The conditional covariance matrix is asymptotically equal to

c o v (R | R_{m} = \tilde{τ})

, the conventional case defined at Equation (86). With underlying Student returns, for

ν > 2

, the conditional mean and variance of market returns are, respectively,

E (R_{m} | R_{m} \leq \tilde{τ}) = μ_{m} - σ_{m} ξ_{ν} (τ); v a r (R_{m} | R_{m} \leq \tilde{τ}) = σ_{m}^{2} \{η_{ν} (τ) - ξ_{ν} {(τ)}^{2}\} .

(92)

Using the results at Equation (78), it follows that the expected value of market return in a crash is negative and increases pro rata to the standardized crash size. Unlike the results based on an underlying normal distribution, the conditional variance is proportional to the square of the standardized crash size; for given

ν

, the variance increases without limit. A sketch of the conditional distribution of index returns under normal and Student’s t distributions with five degrees of freedom and corresponding to a five-standard-deviation crash is shown in Figure 12. As the sketch shows, the Student’s t tail is longer and fatter than that of the normal.

The corresponding results for the conditional mean expected return vector is

E (R | R_{m} \leq \tilde{τ}) = μ_{R} - σ_{m} β ξ_{ν} (τ) .

(93)

As above, the conditional expected return for asset i will increase or decrease without limit depending on the sign of

β_{i}

but is unchanged if it equals zero. Using Equation (80), the conditional covariance matrix is

c o v (R | R_{m} \leq \tilde{τ}) = \frac{ν}{(ν - 1)} \{1 + \frac{η_{ν} (τ)}{ν}\} \sum_{R C} + σ_{m}^{2} β β^{T} \{η_{ν} (τ) - ξ_{ν} {(τ)}^{2}\},

(94)

which, in keeping with Equation (83), may also increase without limit. Noting that

η_{ν} (τ) = E \{{(R_{m} - μ_{m})}^{2} / σ_{m}^{2} | R_{m} \leq \tilde{τ}\}

, the similarities between Equation (94) and (88) are clear.

5. Extended Skew-Normal versus Skew-Student

The literature concerning the skew-normal and skew-Student distributions is more abundant that that for the corresponding extended versions. It has been conjectured by some researchers in the area, albeit informally, that the skew-Student could be used as an alternative to the extended skew-normal distribution. To some extent, such a suggestion is motivated naturally by the similarities in the shapes of some of the respective density functions. Somewhat more formally, use of the skew-Student could be regarded as being closer in spirit to the original skew-normal literature. For univariate distributions, and from the perspective of empirical work, this is an issue that is more concerned with parameter estimation and tests of fit. That is, for a given data set, does the extended skew-normal or the skew-Student offer better fit? For multivariate distributions, the issue is the same in principle, although the details are more complex. It is of course also the case that the extended skew-normal might be preferred to the skew-Student. For example, for the former, all moments exist, which may be a consideration for some applications. Conditional distributions are in general of the extended type. For multivariate applications in which conditioning is a requirement, methodological issues could imply that extended versions of the distribution are more appropriate. That is, an MESN or even MEST distribution may be preferable to the MST.

To construct an approximation, at least two types of method suggest themselves. Given a specified extended skew-normal distribution, one method would be to minimize a suitable measure of the distance between the two density functions. Several measures of distance could be considered. Denoting the two density functions by

f_{E S N} (x)

and

f_{S T} (x)

and assuming that the parameters of the former (latter) are given, the parameters of the latter (former) could be chosen by minimizing

\int_{- \infty}^{\infty} {\{f_{E S N} (x) - f_{S T} (x)\}}^{2} d x .

Numerous variations on this theme could be constructed, for example, using a different norm or minimizing the divergence between the ESN and ST density functions using the Kullback–Leibler divergence measure [38] or the Hellinger distance ([39]). A second approach could be to seek to match the first four moments of the two distributions. It is clear that a comprehensive study of this conjecture, particularly bearing in mind multivariate distributions, would be a substantial undertaking. In this section of the paper, an initial investigation into the approximation of the univariate extended skew-normal distribution by the skew-Student, which may inform more comprehensive studies to be carrried out in the future, is described. The section is in two parts. In the first section, a theoretical investigation based on population moments is reported. In the second part, a study in which simulated data from a number of specified extended skew-normal distributions is used to estimate the parameters of both models is reported.

There are three technical points to note. First, the choice of an approximating skew-Student distribution is informed by the limiting forms of the extended skew-normal. From Lemma 1, as

τ \to - \infty

, the limiting form of the ESN distribution is

N_{n} (μ, \sum)

, which is the limiting form of a skew-Student distribution with

λ = 0

, that is, Student’s t, as

ν \to \infty

. As also reported in Section 2, a similar result holds as

τ \to \infty

. The implication is that using ST distributions to approximate the ESN is appropriate for values of

| τ |

that are not too large. Second, motivated again by similarities in the shape of the density function, an ESN distribution may be approximated by the SN itself. Third, there are combinations of the parameters

λ

and

τ

for which approximation by moment matching are infeasible. To illustrate this, consider an approximation of a univariate ESN with parameters

μ

,

σ^{2}

,

λ

and

τ

by an SN with parameters

μ_{0}

,

σ_{0}^{2}

and

λ_{0}

. Equating skewness shows that a real value of the ratio

σ_{0}^{2} / λ_{0}^{2}

requires that

{\{ξ_{3} (0)\}}^{2 / 3} [σ^{2} / λ^{2} + \{1 + ξ_{2} (τ)\}] > {\{ξ_{3} (τ)\}}^{2 / 3} \{1 + ξ_{2} (0)\},

(95)

and that simple computations show that the inequality does not always hold.

5.1. Moment Matching Study

The study in this paper considers the approximation of an ESN distribution by an ST. As above, for the ESN,

μ = 0

and

σ^{2} = 1

. The extension parameter

τ

takes 11 values in the range

[- 20, 20]

. As skewness is asymmetric in the shape parameter;

λ

takes 9 values in the range

[- 10, 0]

. For practical reasons, the derived value of

ν

is restricted to be an integer. For a given pair

(λ, τ)

, the approximating values of

ν

and

λ_{0}

are derived by minimizing the absolute difference in standardized skewness. This is done by grid search. The other parameters are computed by equating the expected value and variance of the two distributions. For

(λ, τ)

, pairs for which a moment matching approximation exists, the divergence between the ESN and ST density functions is computed using the Kullback–Leibler divergence measure [38]. The values of this divergence measure are ranked from best to worst, with the parameters corresponding to the best ten and worst ten shown in Table 11. The first two columns of each panel show the values of

λ

and

τ

. The next three columns show the computed values of

μ

,

σ^{2}

, and

λ

for the approximating ST distribution, with values rounded to four decimal places. Computed values of

ν

that were equal to 1000 or greater were replaced by ∞, that is, the approximating distribution is effectively skew-normal. Table 12 shows the corresponding values of the moments. As the Best 10 panel shows, the differences in the first four moments are negligible. For the Worst 10 panel, differences in mean, variance, and skewness are also negligible because of the method of construction. Unlike the results in the upper panel, there are differences in kurtosis. Table 13 shows the corresponding critical values, displayed in eight columns. These show critical values at p-values of 0.5%, 2.5%, 95.5%, and 99.5% in ESN/ST pairs. Values are shown corrected to two decimal places and were computed numerically. As the table shows, for the Best 10 approximations, the differences are negligible. For the Worst 10, the differences are more pronounced. To illustrate the effect of the moment matching procedure, Figure 13 shows ESN and ST density functions for which the ST approximation is the worst according to the Kullback–Leibler divergence measure.

The results in Table 11, Table 12 and Table 13 provide support to the implications of Equation (95), namely that the method of approximations works well for values of

| τ |

that are not too large. An interesting result is that for numerous parameter combinations, the extended skew-normal distribution may be well approximated by a skew-normal. The usefulness of the results in the Worst 10 panels will depend on the application. In some applications, accurate critical values are not necessary, but in others, they are. There are other methods of measuring the divergence between two density functions. Two well-known ones are Hellinger distance ([39]) and Jensen–Shannon divergence ([40]), both of which constitute topics for future investigation.

5.2. Simulation Study

The simulation study uses the same sets of values of

μ

,

σ^{2}

,

λ

, and

τ

. For each combination of the parameters, 100 samples of size 100 from an extended skew-normal distribution were drawn. The parameters were estimated by maximum likelihood for the ESN and ST distributions. In addition, motivated by the results in Table 11, the parameters of the skew-normal distribution were also estimated. Summaries of the results are shwon in Table 14, Table 15 and Table 16. Table 14 shows the value of the log-likelihood function for each parameter combination computed at its estimated maximum, averaged over the 100 samples and over values of

τ

. The table has four columns, with the first showing values of

l o g L

based on parameter values inferred from sample moments. As columns 2 through 4 of the table show, the value of

l o g L

varies little with the choice of underlying distribution. For this relatively small sample size, if the value of

L o g L

were the sole criterion for model selection, it would be difficult to discriminate between the three distributions.

For each parameter combination shown in Table 15 and Table 16, the entries are averages of the 100 samples. Table 15 shows the root mean-square error in the moments for the three distributions and for 35 selected combinations of

(λ, τ)

. As the table shows, the lowest root mean-square error occurs under the ESN for 30 of the

(λ, τ)

combinations. Root mean square error is computed as the square root of the average squared difference between the population moments and the average of the estimated moments based on parameters based on MLE for each distrbution. The population moments included in the calculations are mean, variance, skewness, and kurtosis. Table 16 shows the corresponding errors in the critical values. Root mean square error is computed as the square root of the average squared difference between the population critical values and the average of the estimated values based on MLE parameter estimates for each distribution. The critical values are computed at nominal percent probabilities equal to 0.05, 0.5, 2.5, 5.0, 95.0, 97.5, 99.5, and 99.95. The lowest root mean square error occurs under the ESN for 28 of the parameter combinations. In both Table 15 and Table 16, the root mean square error is generally the largest under the ST distribution.

6. Concluding Remarks

In this paper, results that demonstrate the properties of both the multivariate extended skew-normal and extended skew-Student distributions as the value of the extension parameter

τ

changes are presented. In general, for given value of location, scale, and shape or skewness, nonzero values of

τ

lead to greater variability in both the moments and critical values. In turn, this offers greater flexibility in empirical applications of these distributions. From a theoretical perspective, increasing values of

| τ |

leads to more fundamental changes in both distributions. As

τ

increases without limit, the asymptotic distributions are multivariate normal and multivariate Student, respectively. The respective vectors of expected values of both distributions are dependent on

τ

and are unbounded. The covariance matrices, however, remain finite. Skewness disappears for both distributions. By contrast, as

τ \to - \infty

, more substantial changes take place in the distributions. Most notable is that for the MESN distribution dependence on

τ

vanishes, but for the MEST in general, it does not. In the case of the MESN, the limiting distribution is multivariate normal. For the MEST distribution with finite degrees of freedom, asymmetry remains. For fixed

τ

, the extent of asymmetry decreases as the degrees of freedom increase. For fixed degrees of freedom, as

τ \to - \infty

, the vector of expected values and the covariance matrix are both unbounded.

To illustrate the potential of the MESN and MEST distributions, two applications are described. First, the effect of a stock market crash is studied assuming underlying multivariate normal and multivariate Student distributions. A crash, in which the return on a market index is less than a given negative threshold, results in multivariate extended skew-normal and multivariate extended skew-Student distributions. Under an underlying multivariate normal distribution, as the crash size increases without limit, the return on a stock market index becomes nonstochastic. In short, the market plummets: actual return equals expected return. Under an underlying multivariate Student distribution, expected return is broadly the same, but variability increases without limit. The market decline is noisy. There are analogous results for the returns on individual stocks. In particular, with underlying normality, the conditional covariance matrix remains finite, whereas under an underlying Student distribution, it does not. A detailed investigation of the implications and suitability of these models is beyond the scope of this particular paper, but it is reasonable to posit that the results offer support to the view that an underlying Student distribution is a more realistic model than the normal. Given that stock market collapses have in the past been of relatively short duration, the results also imply that for financial applications the models change. The methods described may be applied in principle to stock market booms. It may also be noted that if an inefficiency variable were to be constructed, SFA analysis could be treated in the same way.

Second, the conjecture that the skew-Student could be used instead of the extended skew-normal is an interesting one. Given the similarity in the shapes of the density functions for many combinations of parameters, this conjecture suggests that there is the possibility of flexible model choice. A general investigation of this conjecture would be a substantial task. The exercise reported in this paper is intended to offer evidence to motivate further research. The short study reported in this paper, part theoretical and part based on simulation, suggests that a given ESN distribution should be treated as such. However, the results also suggest that the ESN could be well-approximated by the skew-normal in some circumstances, but in general not by the skew-Student.

Funding

This research received no funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

Thanks are due to the reviewers of the paper for comments which have led to both improved presentation and content.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. Asymptotic Distribution of a Truncated Student’s t Variable

Let

V \sim T T_{ν} {(τ, 1; 0)}^{+}

. For

τ < 0

, the density function is

f (x) = \frac{K_{ν} {\{1 + \frac{{(x + | τ |)}^{2}}{ν}\}}^{- (ν + 1) / 2}}{T_{ν} (τ)},

The term in {}

s

, now denoted

k (x)

, may be expanded by the binomial theorem to give

k (x) = \sum_{j = 0}^{\infty} \frac{{(- 2 x | τ |)}^{j} a_{j}}{ν^{j}} {(1 + x^{2} / ν + {| τ |}^{2} / ν)}^{- (ν + 1) / 2 + j}; a_{j} = \frac{Γ \{(ν + 1 + j) / 2\}}{Γ \{(ν + 1) / 2\} j!},

Writing

(1 + x^{2} / ν + {| τ |}^{2} / ν)

as

(1 + x^{2} / ν + {| τ |}^{2} / ν) = (1 + {| τ |}^{2} / ν) \{1 + \frac{x^{2} / ν}{(1 + {| τ |}^{2} / ν)}\},

allows the jth term to be written as

A_{j} = \frac{{(- 2 x | τ |)}^{j} a_{j}}{ν^{j} {(1 + {| τ |}^{2} / ν)}^{^{(ν + 1) / 2 + j}}} {\{1 + \frac{x^{2} / ν}{(1 + {| τ |}^{2} / ν)}\}}^{- (ν + 1) / 2 + j} .

Note that

l i m_{τ \to - \infty} \frac{| τ | / \sqrt{ν}}{\sqrt{1 + {| τ |}^{2} / ν}} = 1

gives

A_{j} \to {(1 + {| τ |}^{2} / ν)}^{- (ν + 1) / 2} a_{j} {(- 2)}^{j} {\{\frac{x^{2} / ν}{(1 + {| τ |}^{2} / ν)}\}}^{j / 2} {\{1 + \frac{x^{2} / ν}{(1 + {| τ |}^{2} / ν)}\}}^{- (ν + 1) / 2 + j} .

Some further algebra gives

k (x) = {(1 + {| τ |}^{2} / ν)}^{- (ν + 1) / 2} {(1 + \frac{x / \sqrt{ν}}{\sqrt{1 + {| τ |}^{2} / ν}})}^{- (ν + 1)} .

Using the first term in the asymptotic expansion for

T_{ν} (τ)

gives

f (x) = | τ | {(1 + {| τ |}^{2} / ν)}^{- 1} {(1 + \frac{x / \sqrt{ν}}{\sqrt{1 + {| τ |}^{2} / ν}})}^{- (ν + 1)} ≃ \frac{ν / | τ |}{{(1 + x / | τ |)}^{(ν + 1)}};

That is, as

τ \to - \infty

, the variable

X / | τ |

has a beta type-2 distribution with parameters 1 and

ν

, and

X / | τ | \sim β_{I I} (1, ν)

. Noting the above limit also leads to the alternative but asymptotically equivalent representation

(X / \sqrt{ν}) / \sqrt{1 + {| τ |}^{2} / ν} \sim β_{I I} (1, ν) .

References

Azzalini, A. A Class of Distributions which Includes The Normal Ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
Azzalini, A.; Capitanio, A. Distributions Generated by Perturbation of Symmetry With Emphasis on a Multivariate Skew t Distribution. J. R. Stat. Soc. Ser. B 2003, 65, 367–389. [Google Scholar] [CrossRef]
Azzalini, A. An overview on the progeny of the skew-normal family—A personal perspective. J. Multivar. Anal. 2021, in press. [Google Scholar] [CrossRef]
Birnbaum, Z.W. Effect of linear truncation on a multinormal population. Ann. Math. Stat. 1950, 21, 272–279. [Google Scholar] [CrossRef]
Adcock, C.J. Capital Asset Pricing for UK Stocks Under the Multivariate Skew-Normal Distribution. In Skew Elliptical Distributions and Their Applications: A Journey beyond Normality; Genton, M., Ed.; Chapman and Hall: London, UK, 2004; pp. 191–204. [Google Scholar]
Adcock, C.J. Stein’s Lemma For Skew-Normal Distributions: A Comment and an Example. J. Appl. Probab. Stat. 2013, 8, 58–64. [Google Scholar]
Azzalini, A.; Capitanio, A. Statistical Applications of The Multivariate Skew Normal Distribution. J. R. Stat. Soc. Ser. B 1999, 61, 579–602. [Google Scholar] [CrossRef]
Capitanio, A.; Azzalini, A.; Stanghellini, E. Graphical Models for Skew-Normal Variates. Scand. J. Stat. 2003, 30, 129–144. [Google Scholar] [CrossRef]
Kumbhakar, S.C.; Parmeter, C.F.; Zelenyuk, V. Stochastic Frontier Analysis: Foundations and Advances I. In Handbook of Production Economics; Ray, S.C., Chambers, R., Kumbhakar, S.C., Eds.; Springer: Singapore, 2020. [Google Scholar]
Aigner, D.J.; Lovell, C.K.; Schmidt, P. Formulation and Estimation of Stochastic Production Function Model. J. Econom. 1977, 12, 21–37. [Google Scholar] [CrossRef]
Adcock, C.J.; Shutes, K. On the Multivariate Extended Skew-Normal, Normal-exponential and Normal-gamma Distributions. J. Stat. Theory Pract. 2012, 6, 636–664. [Google Scholar] [CrossRef]
Azzalini, A.; Dalla-Valle, A. The Multivariate Skew Normal Distribution. Biometrika 1996, 83, 715–726. [Google Scholar] [CrossRef]
Arnold, B.C.; Beaver, R.J. Hidden Truncation Models. Sankhya Ser. A 2000, 62, 22–35. [Google Scholar]
Adcock, C.J.; Shutes, K. Portfolio Selection Based on The Multivariate Skew-Normal Distribution. In Financial Modelling; Skulimowski, A., Ed.; Progress & Business Publishers: Krakow, Poland, 1999. [Google Scholar]
Capitanio, A. On The Canonical Form of Scale Mixtures of Skew-Normal Distributions. Statistica 2020, 80, 145–160. [Google Scholar]
Tallis, G.M. The moment generating function of the truncated multi-normal distribution. J. R. Stat. Soc. Ser. B 1961, 23, 223–229. [Google Scholar] [CrossRef]
Fisher, R.A. The moments of the distribution of normal samples of measures of departure from normality. Proc. R. Soc. Lond. 1930, 130, 16–28. [Google Scholar]
Barr, D.R.; Sherrill, E.T. Mean and Variance of Truncated Normal Distributions. Am. Stat. 1979, 53, 357–361. [Google Scholar]
Nadarajah, S.; Kotz, S. Moments of truncated t and F distributions. Port. Econ. J. 2008, 7, 63–73. [Google Scholar] [CrossRef]
Genç, A.d. Moments of truncated normal/independent variables. Stat. Pap. 2013, 54, 741–764. [Google Scholar] [CrossRef]
Horrace, W.C. Moments of the truncated normal distribution. J. Product. Anal. 2015, 43, 133–138. [Google Scholar] [CrossRef]
Ogasawara, H. A non-recursive formula for various moments of the multivariate normal distribution with sectional truncation. J. Multivar. Anal. 2021, 183, 104729. [Google Scholar] [CrossRef]
Loux, T.; Davy, O. Adjusting Published Estimates for Exploratory Biases Using the Truncated Normal Distribution. Am. Stat. 2021, 75, 294–299. [Google Scholar] [CrossRef]
Abramowitz, M.; Stegun, I. Handbook of Mathematical Functions; Dover: Mineola, NY, USA, 1965. [Google Scholar]
Sampford, M.R. Some Inequalities on Mill’s ratio and Related Functions. Ann. Math. Stat. 1953, 24, 130–132. [Google Scholar] [CrossRef]
Adcock, C.J. Asset Pricing and Portfolio Selection based on The Multivariate Skew-Student Distribution. In Proceedings of the Non-Linear Asset Pricing Workshop, Paris, France, 16–18 April 2002. [Google Scholar]
Adcock, C.J. Asset Pricing and Portfolio Selection Based on the Multivariate Extended Skew-Student-t Distribution. Ann. Oper. Res. 2010, 176, 221–234. [Google Scholar] [CrossRef] [Green Version]
Arellano-Valle, R.B.; Genton, M.G. Multivariate Extended Skew-t Distributions and Related Families. Metron 2010, 68, 201–234. [Google Scholar] [CrossRef]
Adcock, C.J. Mean-variance-skewness efficient surfaces, Stein’s lemma and the multivariate extended skew-Student Distribution. Eur. J. Oper. Res. 2014, 234, 392–401. [Google Scholar] [CrossRef]
Kim, H.J. Moments of truncated Student-t distribution. J. Korean Stat. Soc. 2008, 37, 81–87. [Google Scholar] [CrossRef]
Soms, A.P. An Asymptotic Expansion for the Tail Area of the t-Distribution. J. Am. Stat. Assoc. 1976, 71, 728–730. [Google Scholar] [CrossRef]
Arnold, B.; Beaver, R.J.; Groeneveld, R.A.; Meeker, W.Q. The non truncated marginal of a truncated bivariate normal distribution. Psychometrika 1993, 58, 471–478. [Google Scholar] [CrossRef]
Arnold, B.; Beaver, R.J. Skewed multivariate models related to hidden truncation and/or selective reporting. Test 2002, 11, 7–54. [Google Scholar] [CrossRef]
Arnold, B.C. Flexible univariate and multivariate models based on hidden truncation. J. Stat. Plan. Inference 2009, 139, 3741–3749. [Google Scholar] [CrossRef]
Sharpe, W.F. Capital Asset Prices: A Theory of Market Equilibrium Under Conditions of Risk. J. Financ. 1964, 19, 425–442. [Google Scholar]
Lintner, J. The Valuation of Risky Assets and The Selection of Risky Investments in Stock Portfolios and Capital Budgets. Rev. Econ. Stat. 1965, 47, 13–37. [Google Scholar] [CrossRef]
Mossin, J. Equilibrium in a Capital Asset Market. Econometrica 1966, 34, 768–783. [Google Scholar] [CrossRef]
Kullback, S.; Leibler, R.A. On information and sufficiency. Ann. Math. Stat. 1951, 22, 79–86. [Google Scholar] [CrossRef]
Hellinger, E. Neue Begründung der Theorie quadratischer Formen von unendlichvielen Veränderlichen. J. für Die Reine Angew. Math. 1909, 137, 210–271. [Google Scholar] [CrossRef]
Menéndez, M.L.; Pardo, J.A.; Pardo, L.; Pardo, M.C. The Jensen-Shannon Divergence. J. Frankl. Inst. 1997, 334B, 307–318. [Google Scholar] [CrossRef]

Figure 1. Extended skew-normal density functions;

τ = 0, \pm 2.5, \pm 5.0, \pm 15.0

, and

\pm 30.0

.

Figure 1. Extended skew-normal density functions;

τ = 0, \pm 2.5, \pm 5.0, \pm 15.0

, and

\pm 30.0

.

Figure 2. Truncated normal density function

τ = - 3, - 10, - 30

.

Figure 2. Truncated normal density function

τ = - 3, - 10, - 30

.

Figure 3. Truncated normal density function

τ = - 3

and approximations.

Figure 3. Truncated normal density function

τ = - 3

and approximations.

Figure 4. Standardized Extended Skew-Normal Density Functions.

Figure 5. Skew-Normal Standardized Skew-Normal Density Functions.

Figure 6. Extended skew-Student density functions,

ν = 3

,

λ = 0

.

Figure 6. Extended skew-Student density functions,

ν = 3

,

λ = 0

.

Figure 7. Extended skew-Student density functions,

ν = 3

,

λ = 5

.

Figure 7. Extended skew-Student density functions,

ν = 3

,

λ = 5

.

Figure 8. Truncated Student t density functions,

τ = - 35, ν = 5, 20

.

Figure 8. Truncated Student t density functions,

τ = - 35, ν = 5, 20

.

Figure 9. Extended and standardized skew-Student density functions,

τ = - 35, - 10

.

Figure 9. Extended and standardized skew-Student density functions,

τ = - 35, - 10

.

Figure 10. Contour plots of the extended skew-normal and skew-Student density functions,

ν

= 5, 100,

τ = 1

.

Figure 10. Contour plots of the extended skew-normal and skew-Student density functions,

ν

= 5, 100,

τ = 1

.

Figure 11. Asymptotic and exact skew-Student density functions,

ν = 5, τ = - 30

.

Figure 11. Asymptotic and exact skew-Student density functions,

ν = 5, τ = - 30

.

Figure 12. Comparison of the conditional distribution of standardized market returns.

Figure 13. Example of an extended skew-normal and approximating skew-Student density functions.

Table 1. Moments of the truncated normal distribution.

Panel 1: Min
	$\| τ \| \leq 1$	$\| τ \| \leq 5$	$\| τ \| \leq 30$
mean	0.5251	0.1865	0.0333
variance	0.1991	0.0327	0.0011
skewness	0.1169	0.0000	0.0000
kurtosis	0.1981	0.0083	0.0000
kappa4	0.0005	−0.1879	−0.1879
standardized-skewness	0.5918	0.0000	0.0000
standardized-kurtosis	3.0014	2.7609	2.7609
Panel 2: SN
	$\| τ \| \leq 1$	$\| τ \| \leq 5$	$\| τ \| \leq 30$
mean	0.7979	0.7979	0.7979
variance	0.3634	0.3634	0.3634
skewness	0.2180	0.2180	0.2180
kurtosis	0.5109	0.5109	0.5109
kappa4	0.1148	0.1148	0.1148
standardized-skewness	0.9953	0.9953	0.9953
standardized-kurtosis	3.8692	3.8692	3.8692
Panel 3: Max
	$\| τ \| \leq 1$	$\| τ \| \leq 5$	$\| τ \| \leq 30$
mean	1.2876	5.0000	30.0000
variance	0.6297	1.0000	1.0000
skewness	0.2957	0.2957	0.2957
kurtosis	1.1901	2.9998	3.0000
kappa4	0.1148	0.1148	0.1148
standardized-skewness	1.3162	1.8311	1.9935
standardized-kurtosis	4.9974	7.7592	8.9472

Kurtosis is the fourth moment about the mean.

Table 2. Moments of the truncated normal distribution and its approximations.

Panel 1: $τ = - 3$
	Exact	Exp-1	Exp-3
mean	0.2831	0.3333	0.2694
variance	0.0706	0.1111	0.0553
skewness	0.0315	0.0741	0.0149
kurtosis	0.0339	0.1111	0.0195
Panel 2: $τ = - 10$
	Exact	Exp-1	Exp-3
mean	0.0981	0.1000	0.0980
variance	0.0094	0.0100	0.0094
skewness	0.0018	0.0020	0.0018
kurtosis	0.0008	0.0009	0.0008
Panel 3: $τ = - 30$
	Exact	Exp-1	Exp-3
mean	0.0333	0.0333	0.0333
variance	0.0011	0.0011	0.0011
skewness	0.0001	0.0001	0.0001
kurtosis	0.0000	0.0000	0.0000

Kurtosis is the fourth moment about the mean. The abbreviations ‘Exact‘, ‘Exp-1‘, and ‘Exp-3‘ are as described in Figure 3.

Table 3. Moments of the extended skew-normal distributions,

τ \leq 0

.

Table 3. Moments of the extended skew-normal distributions,

τ \leq 0

.

Panel 1: $τ = 0$
	mn	vr	sk	ku	k4	ssk	sku
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	0.7979	1.3634	0.2180	5.6912	0.1148	0.1369	3.0617
2.5	1.9947	3.2711	3.4065	36.584	4.4832	0.5758	3.4190
5	3.9894	10.0845	27.2517	376.8234	71.7317	0.8510	3.7053
Panel 2: $τ = - 0.5$
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	0.6411	1.2685	0.1626	4.9301	0.1030	0.1138	3.0640
2.5	1.6027	2.6780	2.5407	25.539	4.0239	0.5797	3.5611
5	3.2054	7.7120	20.3255	242.8077	64.3824	0.9491	4.0825
Panel 3: $τ = - 1$
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	0.5251	1.1991	0.1169	4.3927	0.0792	0.0891	3.0551
2.5	1.3128	2.2444	1.8270	18.2042	3.0928	0.5434	3.6140
5	2.6257	5.9774	14.6164	156.6738	49.4844	1.0002	4.3850
Panel 4: $τ = - 2.5$
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	0.3227	1.0890	0.0429	3.5848	0.0272	0.0377	3.0230
2.5	0.8069	1.5561	0.6700	8.3283	1.0641	0.3452	3.4395
5	1.6137	3.2243	5.3598	48.2146	17.0254	0.9257	4.6376
Panel 5: $τ = - 5$
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	0.1865	1.0327	0.0108	3.2045	0.0051	0.0103	3.0048
2.5	0.4663	1.2044	0.1692	4.5501	0.1987	0.1280	3.1370
5	0.9325	1.8174	1.3532	13.0888	3.1799	0.5523	3.9627
Panel 6: $τ = - 10$
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	0.0981	1.0094	0.0018	3.0574	0.0005	0.0018	3.0005
2.5	0.2452	1.0590	0.0279	3.3841	0.0194	0.0256	3.0173
5	0.4905	1.2361	0.2233	4.8952	0.3112	0.1625	3.2036
Panel 7: $τ = - 20$
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	0.0498	1.0025	0.0002	3.0148	0.0000	0.0002	3.0000
2.5	0.1244	1.0154	0.0038	3.0945	0.0014	0.0037	3.0014
5	0.2488	1.0616	0.0303	3.4032	0.0223	0.0277	3.0198
Panel 8: $τ = - 30$
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	0.0333	1.0011	0.0001	3.0066	0.0000	0.0001	3.0000
2.5	0.0831	1.0069	0.0011	3.0418	0.0003	0.0011	3.0003
5	0.1663	1.0276	0.0091	3.1723	0.0045	0.0088	3.0043

Mean, variance, skewness, kurtosis (

{\bar{κ}}_{4}

) and 4th cumulant (

κ_{4}

) are denoted mn, vr, sk, ku & k4 respectively. ssk and sku denote skewness and kurtosis for the standardized distributions.

Table 4. Moments of the extended skew-normal distributions

τ \geq 0

.

Table 4. Moments of the extended skew-normal distributions

τ \geq 0

.

Panel 1: $τ = 0$
	mn	vr	sk	ku	k4	ssk	sku
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	0.7979	1.3634	0.2180	5.6912	0.1148	0.1369	3.0617
2.5	1.9947	3.2711	3.4065	36.584	4.4832	0.5758	3.4190
5	3.9894	10.0845	27.2517	376.8234	71.7317	0.8510	3.7053
Panel 2: $τ = 0.5$
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	1.0092	1.4862	0.2710	6.7143	0.0882	0.1496	3.0399
2.5	2.5229	4.0386	4.2342	52.3748	3.4441	0.5217	3.2112
5	5.0458	13.1544	33.8738	574.2185	55.1049	0.7100	3.3185
Panel 3: $τ = 1$
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	1.2876	1.6297	0.2957	7.9682	0.0005	0.1421	3.0002
2.5	3.2190	4.9355	4.6206	73.1000	0.0214	0.4214	3.0009
5	6.4380	16.7422	36.9648	841.2418	0.3423	0.5396	3.0012
Panel 4: $τ = 2.5$
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	2.5176	1.9556	0.0949	11.3172	−0.1558	0.0347	2.9593
2.5	6.2941	6.9725	1.4835	139.7583	−6.0874	0.0806	2.8748
5	12.5882	24.8899	11.8678	1761.1161	−97.3990	0.0956	2.8428
Panel 5: $τ = 5$
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	5.0000	2.0000	0.0000	11.9997	−0.0002	0.0000	3.0000
2.5	12.5000	7.2500	0.0006	157.6791	−0.0064	0.0000	2.9999
5	25.0000	25.9998	0.0045	2027.8688	−0.1022	0.0000	2.9998
Panel 6: $τ = 10$
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	10.0000	2.0000	0.0000	12.0000	0.0000	0.0000	3.0000
2.5	25.0000	7.2500	0.0000	157.6875	0.0000	0.0000	3.0000
5	50.0000	26.0000	0.0000	2028.0000	0.0000	0.0000	3.0000
Panel 7: $τ = 20$
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	20.0000	2.0000	0.0000	12.0000	0.0000	0.0000	3.0000
2.5	50.0000	7.2500	0.0000	157.6875	0.0000	0.0000	3.0000
5	100.0000	26.0000	0.0000	2028.0000	0.0000	0.0000	3.0000
Panel 8: $τ = 30$
$λ = 0$	0.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000
1	30.0000	2.0000	0.0000	12.0000	0.0000	0.0000	3.0000
2.5	75.0000	7.2500	0.0000	157.6875	0.0000	0.0000	3.0000
5	150.0000	26.0000	0.0000	2028.0000	0.0000	0.0000	3.0000

Mean, variance, skewness, kurtosis (

{\bar{κ}}_{4}

), and fourth cumulant (

κ_{4}

) are denoted mn, vr, sk, ku, and k4, respectively. ssk and sku denote skewness and kurtosis for the standardized distributions.

Table 5. Moments of the Truncated Student’s t Distribution.

Panel 1: $τ = - 35$
	mn	vr	sk	ku	k4	ssk	sku
$ν = 5$	8.7755	128.2293	6742.2338	1211054.896	1161726.616	4.6433	73.6528
10	3.9153	19.1388	235.0122	6511.211	5412.3346	2.8069	17.776
100	0.3818	0.1485	0.1177	0.2087	0.1426	2.0568	9.4645
500	0.0986	0.0097	0.0019	0.0009	0.0006	2.0072	9.0581
1000	0.0635	0.004	0.0005	0.0001	0.0001	2.0012	9.0094
Inf	0.0285	0.0008	0.0000	0.0000	0.0000	1.9951	8.9623
Panel 2: $τ = - 10$
$ν = 5$	2.5885	11.0436	168.6874	8789.5951	8423.7139	4.5964	72.0692
10	1.2025	1.7818	6.5701	54.6947	45.1698	2.7623	17.227
100	0.1982	0.0394	0.0157	0.0141	0.0094	2.0083	9.0738
500	0.1180	0.0137	0.0031	0.0016	0.0011	1.9582	8.6753
1000	0.1080	0.0115	0.0024	0.0011	0.0007	1.9521	8.6276
Inf	0.0981	0.0094	0.0018	0.0008	0.0005	1.9460	8.5804
Panel 3: $τ = - 1$
$ν = 5$	0.8144	0.7937	2.4071	24.5591	22.6692	3.4042	38.9844
10	0.6519	0.3797	0.4544	1.3549	0.9224	1.9422	9.3977
100	0.5365	0.2117	0.1328	0.2349	0.1005	1.3628	5.2408
500	0.5274	0.2016	0.1199	0.2049	0.0830	1.3253	5.0441
1000	0.5263	0.2003	0.1184	0.2015	0.0811	1.3208	5.0206
Inf	0.5251	0.1991	0.1169	0.1981	0.0792	1.3162	4.9974
Panel 4: $τ = 0$
$ν = 5$	0.9490	0.7660	1.7094	13.5603	11.7998	2.5496	23.1085
10	0.8647	0.5023	0.5210	1.6356	0.8786	1.4634	6.4821
100	0.8039	0.3741	0.2357	0.5623	0.1424	1.0303	4.0175
500	0.7991	0.3655	0.2214	0.5206	0.1199	1.0021	3.8977
1000	0.7985	0.3644	0.2197	0.5157	0.1173	0.9987	3.8834
Inf	0.7979	0.3634	0.2180	0.5109	0.1148	0.9953	3.8692
Panel 5: $τ = 1$
$ν = 5$	1.4026	0.9677	1.5843	11.8173	9.008	1.6643	12.6197
10	1.3394	0.7530	0.6003	2.4830	0.7821	0.9188	4.3795
100	1.2924	0.6396	0.3153	1.2603	0.0331	0.6163	3.0809
500	1.2885	0.6316	0.2995	1.2035	0.0067	0.5966	3.0167
1000	1.2881	0.6307	0.2976	1.1968	0.0036	0.5942	3.009
Inf	1.2876	0.6297	0.2957	1.1901	0.0005	0.5918	3.0014
Panel 6: $τ = 10$
$ν = 5$	10.0011	1.6523	0.2153	20.4795	12.2891	0.1014	7.5012
10	10.0000	1.2499	0.0011	6.2361	1.5494	0.0008	3.9918
100	10.0000	1.0204	0.0000	3.1888	0.0651	0.0000	3.0625
500	10.0000	1.0040	0.0000	3.0363	0.0122	0.0000	3.0121
1000	10.0000	1.0020	0.0000	3.0181	0.0060	0.0000	3.0060
Inf	10.0000	1.0000	0.0000	3.0000	0.0000	0.0000	3.0000

Mean, variance, skewness, kurtosis (

{\bar{κ}}_{4}

), and fourth cumulant (

κ_{4}

) are denoted mn, vr, sk, ku, and k4, respectively. ssk and sku denote skewness and kurtosis for the standardized distributions.

Table 6. Extended skew-Student moments,

λ = 0

,

τ \leq 0

.

Table 6. Extended skew-Student moments,

λ = 0

,

τ \leq 0

.

Panel 1: $τ = 0$
	mn	vr	sk	ku	k4	ssk	sku
$ν = 5$	0.0000	1.6667	0.0000	25.0000	16.6667	0.0000	9.0000
10	0.0000	1.2500	0.0000	6.2500	1.5625	0.0000	4.0000
100	0.0000	1.0204	0.0000	3.1888	0.0651	0.0000	3.0625
1000	0.0000	1.0020	0.0000	3.0181	0.0060	0.0000	3.0060
Panel 2: $τ = - 0.5$
$ν = 5$	0.0000	1.8914	0.0000	33.6814	22.9489	0.0000	9.4148
10	0.0000	1.3270	0.0000	7.0862	1.8033	0.0000	4.0240
100	0.0000	1.0263	0.0000	3.2257	0.0660	0.0000	3.0626
1000	0.0000	1.0026	0.0000	3.0215	0.0061	0.0000	3.0060
Panel 3: $τ = - 1$
$ν = 5$	0.0000	2.2715	0.0000	50.4023	34.9234	0.0000	9.7686
10	0.0000	1.4565	0.0000	8.5803	2.2163	0.0000	4.0448
100	0.0000	1.0361	0.0000	3.2878	0.0673	0.0000	3.0627
1000	0.0000	1.0035	0.0000	3.0273	0.0061	0.0000	3.0060
Panel 4: $τ = - 2.5$
$ν = 5$	0.0000	4.5335	0.0000	184.0710	122.4118	0.0000	8.9559
10	0.0000	2.2167	0.0000	19.9427	5.2015	0.0000	4.0586
100	0.0000	1.0930	0.0000	3.6613	0.0771	0.0000	3.0646
1000	0.0000	1.0091	0.0000	3.0610	0.0063	0.0000	3.0062
Panel 5: $τ = - 5$
$ν = 5$	0.0000	12.3704	0.0000	1630.5664	1171.4828	0.0000	10.6554
10	0.0000	4.8325	0.0000	95.8128	25.7533	0.0000	4.1028
100	0.0000	1.2875	0.0000	5.0778	0.1045	0.0000	3.0631
1000	0.0000	1.0280	0.0000	3.1768	0.0064	0.0000	3.0060
Panel 6: $τ = - 10$
$ν = 5$	0.0000	43.6281	0.0000	20,481.2363	14,770.9983	0.0000	10.7603
10	0.0000	15.2531	0.0000	956.4646	258.4921	0.0000	4.1110
100	0.0000	2.0610	0.0000	13.0118	0.2682	0.0000	3.0631
1000	0.0000	1.1033	0.0000	3.6591	0.0073	0.0000	3.0060
Panel 7: $τ = - 20$
$ν = 5$	0.0000	168.6302	0.0000	306,820.5795	221,512.1022	0.0000	10.7898
10	0.0000	56.9209	0.0000	13,327.5059	3607.5334	0.0000	4.1134
100	0.0000	5.1533	0.0000	81.3467	1.6764	0.0000	3.0631
1000	0.0000	1.4042	0.0000	5.9272	0.0119	0.0000	3.0060

Mean, variance, skewness, kurtosis (

{\bar{κ}}_{4}

), and fourth cumulant (

κ_{4}

) are denoted mn, vr, sk, ku, and k4, respectively. ssk and sku denote skewness and kurtosis for the standardized distributions.

Table 7. Extended skew-Student moments,

λ = 0

,

τ \geq 0

.

Table 7. Extended skew-Student moments,

λ = 0

,

τ \geq 0

.

Panel 1: $τ =$ 0
	mn	vr	sk	ku	k4	ssk	sku
$ν = 5$	0.0000	1.6667	0.0000	25	16.6667	0.0000	9.0000
10	0.0000	1.2500	0.0000	6.2500	1.5625	0.0000	4.0000
100	0.0000	1.0204	0.0000	3.1888	0.0651	0.0000	3.0625
1000	0.0000	1.0020	0.0000	3.0181	0.0060	0.0000	3.0060
Panel 2: $τ =$ 0.5
$ν = 5$	0.0000	1.5613	0.0000	20.9306	13.6175	0.0000	8.5862
10	0.0000	1.2148	0.0000	5.8673	1.4405	0.0000	3.9762
100	0.0000	1.0178	0.0000	3.1723	0.0646	0.0000	3.0624
1000	0.0000	1.0017	0.0000	3.0165	0.006	0.0000	3.0060
Panel 3: $τ =$ 1
$ν = 5$	0.0000	1.5325	0.0000	19.3630	12.3178	0.0000	8.2452
10	0.0000	1.2076	0.0000	5.7712	1.3965	0.0000	3.9577
100	0.0000	1.0174	0.0000	3.1699	0.0645	0.0000	3.0623
1000	0.0000	1.0017	0.0000	3.0163	0.006	0.0000	3.0060
Panel 4: $τ =$ 2.5
$ν = 5$	0.0000	1.5864	0.0000	19.0707	11.5210	0.0000	7.5781
10	0.0000	1.2346	0.0000	5.9154	1.3430	0.0000	3.8812
100	0.0000	1.0199	0.0000	3.1808	0.0603	0.0000	3.0580
1000	0.0000	1.0020	0.0000	3.0174	0.0056	0.0000	3.0056
Panel 5: $τ =$ 5
$ν = 5$	0.0000	1.6447	0.0000	21.698	13.5834	0.0000	8.0218
10	0.0000	1.2490	0.0000	6.2259	1.5456	0.0000	3.9907
100	0.0000	1.0204	0.0000	3.1888	0.0651	0.0000	3.0625
1000	0.0000	1.0020	0.0000	3.0181	0.0060	0.0000	3.0060
Panel 6: $τ =$ 10
$ν = 5$	0.0000	1.6631	0.0000	23.2514	14.9539	0.0000	8.4066
10	0.0000	1.2500	0.0000	6.2492	1.5618	0.0000	3.9996
100	0.0000	1.0204	0.0000	3.1888	0.0651	0.0000	3.0625
1000	0.0000	1.0020	0.0000	3.0181	0.0060	0.0000	3.0060
Panel 7: $τ =$ 20
$ν = 5$	0.0000	1.6662	0.0000	24.114	15.7855	0.0000	8.6861
10	0.0000	1.2500	0.0000	6.2500	1.5625	0.0000	4.0000
100	0.0000	1.0204	0.0000	3.1888	0.0651	0.0000	3.0625
1000	0.0000	1.0020	0.0000	3.0181	0.0060	0.0000	3.0060

Mean, variance, skewness, kurtosis (

{\bar{κ}}_{4}

), and fourth cumulant (

κ_{4}

) are denoted mn, vr, sk, ku, and k4, respectively. ssk and sku denote skewness and kurtosis for the standardized distributions.

Table 8. Extended skew-Student moments,

λ = 5

,

τ \leq 0

.

Table 8. Extended skew-Student moments,

λ = 5

,

τ \leq 0

.

Panel 1: $τ = 0$
	mn	vr	sk	ku	k4	ssk	sku
$ν = 5$	4.7451	20.8175	225.8345	8893.0351	7592.9283	2.3776	20.5207
10	4.3234	13.808	67.6648	1136.224	564.2428	1.3188	5.9594
100	4.0197	10.373	29.6176	412.2307	89.4359	0.8865	3.8312
1000	3.9924	10.1127	27.4779	380.1627	73.3625	0.8544	3.7174
Panel 2: $τ = - 0.5$
$ν = 5$	4.2428	20.5686	255.6472	11,315.5682	10,046.3666	2.7405	26.7465
10	3.6615	11.9424	61.6937	1002.0177	574.1549	1.4949	7.0257
100	3.2465	8.0272	22.5251	273.4801	80.1702	0.9904	4.2442
1000	3.2095	7.7427	20.5341	245.6695	65.8204	0.9531	4.0979
Panel 3: $τ = - 1$
$ν = 5$	4.0722	22.1142	315.8725	16,158.4142	14,691.3031	3.0374	33.0413
10	3.2593	10.9491	58.8277	963.236	603.5869	1.6237	8.0348
100	2.6826	6.3294	16.6811	183.5893	63.4062	1.0476	4.5827
1000	2.6313	6.0116	14.8101	159.1484	50.7318	1.0048	4.4038
Panel 4: $τ = - 2.5$
$ν = 5$	4.7013	37.0039	572.3678	40,066.9291	35,959.0737	2.5428	29.2612
10	2.9672	11.7404	64.0527	1183.1857	769.673	1.5923	8.5839
100	1.7351	3.7196	9.2931	91.7766	50.2701	1.2954	6.6334
1000	1.6257	3.2714	7.5897	71.2506	39.1445	1.2827	6.6577
Panel 5: $τ = - 5$
$ν = 5$	7.1113	93.2773	3369.5085	468,082.58	441,980.58	3.7403	53.7985
10	3.6601	20.7462	172.2246	4829.2945	3538.0859	1.8226	11.2204
100	1.1786	2.6214	2.9360	30.3655	9.7500	0.6918	4.4188
1000	0.9569	1.8905	1.4728	14.3196	3.5974	0.5666	4.0065
Panel 6: $τ = - 10$
$ν = 5$	12.9423	319.7115	21,801.33	5,806,861.72	5,500,215	3.8137	56.81
10	6.0126	59.7887	838.1579	41709.0825	30985.0048	1.8130	11.6679
100	0.9912	3.0218	1.9762	34.4648	7.0701	0.3762	3.7743
1000	0.5401	1.3638	0.3004	6.2757	0.6959	0.1886	3.3742
Panel 7: $τ = - 20$
$ν = 5$	25.2227	1225.9337	164,426.37	86,687,913	82,179,173	3.8306	57.6799
10	11.3418	217.1219	5771.12	555,743	414,318	1.8039	11.7887
100	1.2565	6.7553	4.1809	157.0877	20.1859	0.2381	3.4423
1000	0.3486	1.5238	0.0842	7.0835	0.1174	0.0447	3.0506

Mean, variance, skewness, kurtosis (

{\bar{κ}}_{4}

), and fourth cumulant (

κ_{4}

) are denoted mn, vr, sk, ku, and k4, respectively. ssk and sku denote skewness and kurtosis for the standardized distributions. Some large results in the bottom panel are shown to two decimal places only to preserve the formatting.

Table 9. Extended skew-Student moments,

λ = 5

,

τ \geq 0

.

Table 9. Extended skew-Student moments,

λ = 5

,

τ \geq 0

.

Panel 1: $τ = 0$
	mn	vr	sk	ku	k4	ssk	sku
$ν = 5$	4.7451	20.8175	225.8345	8893.0351	7592.9283	2.3776	20.5207
10	4.3234	13.8080	67.6648	1136.224	564.2428	1.3188	5.9594
100	4.0197	10.3730	29.6176	412.2307	89.4359	0.8865	3.8312
1000	3.9924	10.1127	27.4779	380.1627	73.3625	0.8544	3.7174
Panel 2: $τ = 0.5$
$ν = 5$	5.6607	22.7023	214.9792	7908.2282	6362.0488	1.9874	15.3441
10	5.3196	16.5847	74.6338	1370.39	545.2293	1.1050	4.9823
100	5.0707	13.4268	36.4146	615.2068	74.3678	0.7401	3.4125
1000	5.0483	13.1811	34.1180	578.1129	56.8906	0.7129	3.3274
Panel 3: $τ = 1$
$ν = 5$	7.0132	25.7246	211.2398	7728.9782	5743.7070	1.6190	11.6795
10	6.6970	20.0320	78.5498	1695.5477	491.7080	0.8761	4.2253
100	6.4618	17.0068	39.6479	888.1371	20.4383	0.5653	3.0707
1000	6.4404	16.7681	37.2234	845.719	2.2097	0.5421	3.0079
Panel 4: $τ = 2.5$
$ν = 5$	12.9818	34.9913	193.6584	8374.6553	4701.4769	0.9356	6.8398
10	12.7471	28.9489	65.1275	2496.4765	-17.6434	0.4181	2.9789
100	12.6007	25.2486	25.7804	1552.3035	-360.1776	0.2032	2.4350
1000	12.5894	24.9254	23.3124	1497.7373	-366.0907	0.1873	2.4107
Panel 5: $τ = 5$
$ν = 5$	25.066	41.1056	97.3139	11,095.1977	6026.176	0.3693	6.5665
10	25.0077	32.2824	10.2403	3921.4378	794.9862	0.0558	3.7628
100	25.0000	26.5298	0.3289	2150.2441	38.7529	0.0024	3.0551
1000	25.0000	26.0519	0.0354	2039.6168	3.5148	0.0003	3.0052
Panel 6: $τ = 10$
$ν = 5$	50.0054	42.971	40.1073	13,286.3928	7746.8656	0.1424	7.1954
10	50.0000	32.4975	4.3103	4122.5494	954.2898	0.0233	3.9036
100	50.0000	26.5306	0.3092	2150.8291	39.2089	0.0023	3.0557
1000	50.0000	26.0521	0.0301	2039.7723	3.6359	0.0002	3.0054
Panel 7: $τ = 20$
$ν = 5$	100.0004	43.2846	19.9348	14,666.6315	9045.9524	0.0700	7.8282
10	100.0000	32.5000	4.1682	4131.0685	962.3212	0.02250	3.9111
100	100.0000	26.5306	0.3092	2150.8291	39.2089	0.0023	3.0557
1000	100.0000	26.0521	0.0301	2039.7723	3.6359	0.0002	3.0054

Mean, variance, skewness, kurtosis (

{\bar{κ}}_{4}

), and fourth cumulant (

κ_{4}

) are denoted mn, vr, sk, ku, and k4, respectively. ssk and sku denote skewness and kurtosis for the standardized distributions.

Table 10. Extended skew-normal and skew-Student critical values, p = 0.025.

Panel 1: $τ = - 5$
	$ν$ = 3	5	10	20	50	100	500	Inf
$ρ = 0$	−11.63	−7.00	−4.39	−3.21	−2.49	−2.23	−2.02	−1.96
0.2	−13.56	−8.35	−5.50	−4.25	−3.50	−3.24	−3.02	−2.96
0.4	−15.19	−9.47	−6.45	−5.17	−4.41	−4.15	−3.93	−3.88
0.6	−16.51	−10.36	−7.22	−5.94	−5.20	−4.95	−4.75	−4.70
0.8	−17.44	−10.94	−7.74	−6.49	−5.81	−5.59	−5.41	−5.37
Panel 2: $τ = - 2.5$
$ρ = 0$	−6.58	−4.24	−2.97	−2.44	−2.15	−2.06	−1.98	−1.96
0.2	−7.62	−4.97	−3.57	−3.00	−2.69	−2.59	−2.51	−2.49
0.4	−8.50	−5.57	−4.07	−3.47	−3.15	−3.05	−2.96	−2.94
0.6	−9.20	−6.04	−4.46	−3.84	−3.51	−3.41	−3.33	−3.31
0.8	−9.70	−6.34	−4.69	−4.07	−3.75	−3.65	−3.57	−3.55
Panel 3: $τ = - 1$
$ρ = 0$	−4.12	−3.00	−2.41	−2.17	−2.04	−2.00	−1.97	−1.96
0.2	−4.69	−3.40	−2.73	−2.47	−2.32	−2.28	−2.24	−2.24
0.4	−5.16	−3.72	−2.98	−2.69	−2.54	−2.49	−2.46	−2.45
0.6	−5.53	−3.96	−3.16	−2.85	−2.69	−2.64	−2.6	−2.59
0.8	−5.79	−4.10	−3.25	−2.93	−2.76	−2.7	−2.66	−2.65
Panel 4: $τ = 0$
$ρ = 0$	−3.18	−2.57	−2.23	−2.09	−2.01	−1.99	−1.97	−1.96
0.2	−3.52	−2.80	−2.40	−2.24	−2.15	−2.12	−2.10	−2.10
0.4	−3.79	−2.97	−2.53	−2.35	−2.25	−2.22	−2.19	−2.19
0.6	−3.99	−3.09	−2.60	−2.40	−2.30	−2.26	−2.24	−2.23
0.8	−4.13	−3.15	−2.63	−2.42	−2.31	−2.28	−2.25	−2.24
Panel 5: $τ = 1$
$ρ = 0$	−2.93	−2.47	−2.19	−2.07	−2.00	−1.98	−1.97	−1.96
0.2	−3.12	−2.58	−2.27	−2.13	−2.05	−2.03	−2.01	−2.01
0.4	−3.27	−2.66	−2.31	−2.16	−2.08	−2.05	−2.03	−2.03
0.6	−3.38	−2.71	−2.33	−2.17	−2.09	−2.06	−2.04	−2.03
0.8	−3.45	−2.74	−2.34	−2.18	−2.09	−2.06	−2.04	−2.03
Panel 6: $τ = 2.5$
$ρ = 0$	−3.00	−2.51	−2.22	−2.08	−2.01	−1.99	−1.97	−1.96
0.2	−3.09	−2.55	−2.23	−2.09	−2.01	−1.99	−1.97	−1.96
0.4	−3.16	−2.58	−2.24	−2.09	−2.01	−1.99	−1.97	−1.96
0.6	−3.20	−2.59	−2.24	−2.09	−2.01	−1.99	−1.97	−1.96
0.8	−3.23	−2.59	−2.24	−2.09	−2.01	−1.99	−1.97	−1.96
Panel 7: $τ = 5$
$ρ = 0$	−3.11	−2.56	−2.23	−2.09	−2.01	−1.99	−1.97	−1.96
0.2	−3.14	−2.57	−2.23	−2.09	−2.01	−1.99	−1.97	−1.96
0.4	−3.16	−2.57	−2.23	−2.09	−2.01	−1.99	−1.97	−1.96
0.6	−3.18	−2.57	−2.23	−2.09	−2.01	−1.99	−1.97	−1.96
0.8	−3.19	−2.57	−2.23	−2.09	−2.01	−1.99	−1.97	−1.96

The table values correspond to a probability of 0.025 for the hidden truncation models at Equation (66) and Equation (75). Table entries are computed numerically and displayed to two decimal places.

Table 11. Divergence between ESN ST density functions.

Panel 1: Kullback–Leibler divergence: Best 10
	$λ$	$τ$	$μ$ [ST]	$σ^{2}$ [ST]	$λ$ [ST]	$ν$
	−0.5	−10.0	−0.0490	1.0024	0.0000	∞
	−0.5	0.0	0.0000	0.9999	−0.5000	∞
	−1.0	0.0	0.0000	0.9998	−1.0000	∞
	−1.5	0.0	0.0000	0.9998	−1.5000	∞
	−1.0	10.0	−10.0000	2.0000	0.0000	∞
	−0.5	10.0	−5.0000	1.2500	0.0000	∞
	−2.0	10.0	−20.0000	5.0000	0.0000	∞
	−3.0	10.0	−30.0000	10.0000	0.0000	∞
	−5.0	10.0	−50.0000	26.0000	0.0000	∞
	−1.5	10.0	−15.0000	3.2500	0.0000	∞
Panel 2: Worst 10
	−2.0	0.5	−0.9652	2.2017	−1.3000	50
	−2.5	0.5	−0.9208	2.5358	−2.0000	190
	−3.0	0.5	−1.0313	3.0917	−2.5000	∞
	−2.5	1.0	−2.0121	4.0001	−1.5000	90
	−2.5	−0.5	0.3925	0.4047	−2.5000	∞
	−3.0	1.0	−2.2658	5.1990	−2.0000	∞
	−5.0	−5.0	0.6725	0.3260	−2.0000	130
	−5.0	1.0	−4.8342	15.0568	−2.0000	150
	−5.0	0.5	−2.6469	9.7981	−3.0000	340
	−3.0	−0.5	0.4740	0.1265	−3.0000	500

The values of the Kullback–Leibler divergence measure are ranked from best to worst, with the parameters corresponding to the best ten and worst ten shown in the two panels. The first two columns of each panel show the values of λ and τ. The next three columns show the computed value of μ, σ², and λ for the approximating ST distribution, rounded to four decimal places. Computed values of ν equal to 1000 or greater were replaced by ∓, that is, the approximating distribution is effectively skew-normal.

Table 12. ESN and ST Moments.

Panel 1: Kullback–Leibler divergence: Best 10
	Mean	Variance	Skewness	Kurtosis	Mean (ST)	Variance (ST)	Skewness (ST)	Kurtosis (ST)
	−0.0490	1.0024	−0.0002	3.0142	−0.0490	1.0024	−0.0002	3.0142
	−0.3989	1.0908	−0.0273	3.5770	−0.3989	1.0908	−0.0275	3.5773
	−0.7979	1.3634	−0.2180	5.6912	−0.7979	1.3634	−0.2186	5.6917
	−1.1968	1.8176	−0.7358	10.4921	−1.1968	1.8176	−0.7371	10.4930
	−10.0000	2.0000	0.0000	12.0000	−10.0000	2.0000	0.0000	12.0000
	−5.0000	1.2500	0.0000	4.6875	−5.0000	1.2500	0.0000	4.6875
	−20.0000	5.0000	0.0000	75.0000	−20.0000	5.0000	0.0000	75.0000
	−30.0000	10.0000	0.0000	300.0000	−30.0000	10.0000	0.0000	300.0000
	−50.0000	26.0000	0.0000	2028.0000	−50.0000	26.0000	0.0000	2028.0000
	−15.0000	3.2500	0.0000	31.6875	−15.0000	3.2500	0.0000	31.6875
Panel 2: Worst 10
	−2.0183	2.9447	−2.1679	27.4245	−2.0183	2.9447	−2.1678	28.132
	−2.5229	4.0386	−4.2342	52.3748	−2.5229	4.0386	−4.2200	54.5273
	−3.0275	5.3756	−7.3167	93.8321	−3.0275	5.3756	−7.3482	99.0404
	−3.2190	4.9355	−4.6206	73.1000	−3.2190	4.9355	−4.5951	79.0081
	−1.6027	2.6780	−2.5407	25.539	−1.6027	2.6780	−2.5411	24.5375
	−3.8628	6.6672	−7.9844	133.3981	−3.8628	6.6672	−8.0281	147.2278
	−0.9325	1.8174	−1.3532	13.0888	−0.9325	1.8174	−1.3493	11.1028
	−6.4380	16.7422	−36.9648	841.2418	−6.4380	16.7422	−36.8354	940.0303
	−5.0458	13.1544	−33.8738	574.2185	−5.0458	13.1544	−33.8859	612.2122
	−1.9232	3.4163	−4.3903	43.3578	−1.9232	3.4163	−4.3974	41.1866

This table shows values of the moments corresponding to the parameters reported in Table 11.

Table 13. ESN vs. ST critical values.

Panel 1: Kullback–Leibler divergence: Best 10
	0.5%ESN	ST	2.5%ESN	ST	97.5%ESN	ST	99.5%ESN	ST
	−2.65	−2.65	−2.05	−2.05	1.90	1.90	2.50	2.50
	−3.15	−3.15	−2.50	−2.50	1.60	1.60	2.25	2.25
	−4.00	−4.00	−3.20	−3.20	1.40	1.40	2.05	2.05
	−5.10	−5.10	−4.05	−4.05	1.25	1.25	1.90	1.90
	−13.65	−13.65	−12.80	−12.80	−7.25	−7.25	−6.40	−6.40
	−7.90	−7.90	−7.20	−7.20	−2.85	−2.85	−2.15	−2.15
	−25.80	−25.80	−24.40	−24.40	−15.65	−15.65	−14.25	−14.25
	−38.15	−38.15	−36.20	−36.20	−23.85	−23.85	−21.90	−21.90
Panel 2: Worst 10
	−63.15	−63.15	−60.00	−60.00	−40.05	−40.05	−36.90	−36.90
	−19.65	−19.65	−18.55	−18.55	−11.50	−11.50	−10.40	−10.40
	−7.05	−6.80	−5.75	−5.55	0.90	1.20	1.65	2.25
	−8.55	−8.20	−6.95	−6.70	0.80	1.20	1.60	2.25
	−10.05	−9.65	−8.20	−7.90	0.70	1.20	1.50	2.40
	−9.60	−9.20	−8.00	−7.70	0.55	1.05	1.35	2.35
	−6.75	−6.90	−5.30	−5.40	1.15	0.85	1.85	1.35
	−11.35	−10.80	−9.45	−9.10	0.40	1.05	1.25	2.55
	−5.30	−5.30	−3.95	−4.05	1.40	1.15	2.10	1.60
	−18.45	−17.20	−15.40	−14.55	0.00	1.50	1.00	4.00
	−16.30	−14.90	−13.30	−12.40	0.35	1.85	1.25	3.90
	−7.90	−8.05	−6.20	−6.35	1.05	0.55	1.80	0.90

The table shows critical values at p-values of 0.5%, 2.5%, 95.5%, and 99.5%, respectively, in ESN/ST pairs. Values are shown corrected to two decimal places and were computed numerically using Simpson‘s rule.

Table 14. Summary of estimated log-likelihood function.

	Sample	MLE (ESN)	MLE (ST)	MLE (SN)
−2.5	−162.1227	−143.8078	−143.7856	−144.2421
−1	−166.7300	−147.7064	−148.1087	−148.2171
−0.5	−173.5951	−154.3467	−154.7982	−155.0793
0	−181.6436	−162.3594	−162.5675	−163.1677
0.5	−190.3665	−171.9357	−172.3604	−172.6343
1	−199.0348	−180.9025	−180.4694	−181.2470
2.5	−199.5184	−181.6458	−181.8682	−182.0044

The table shows the value of the log-likelihood function for each parameter combination computed at its estimated maximum, averaged over the 100 samples and over values of τ. Column 1 shows values of logL based on parameter values inferred from sample moments. As columns 2 through 4, the value of logL varies little with the choice of underlying distribution.

Table 15. Root mean square errors in the moments.

Panel 1: $λ = - 2.5$
	MLE (ESN)	MLE (ST)	MLE (SN)
$τ =$ −5.0	0.0173	0.2864	0.0733
−2.5	0.3679	0.2720	0.3626
−1.0	0.9311	0.6039	0.9170
0.0	0.6527	5.3254	0.6702
1.0	3.0086	8.1842	5.8937
2.5	5.8728	8.7935	6.9017
5.0	3.8259	6.4790	5.7127
Panel 2: $- 1$
−5.0	0.0878	0.2736	0.1137
−2.5	0.0851	0.3863	0.1398
−1.0	0.0630	0.2508	0.0838
0.0	0.0528	0.3052	0.0915
1.0	0.2190	0.5452	0.3016
2.5	0.4006	1.0383	0.4772
5.0	0.3251	0.6928	0.3406
Panel 3: 0
−5.0	0.0751	0.3443	0.1084
−2.5	0.1038	0.3784	0.1294
−1.0	0.0797	0.4221	0.1050
0.0	0.0979	0.4147	0.114 0
1.0	0.0898	0.4213	0.1257
2.5	0.1162	0.5211	0.1340
5.0	0.0752	0.2855	0.1242
Panel 4: 1
−5.0	0.0709	0.3994	0.0769
−2.5	0.0651	0.3919	0.1001
−1.0	0.0664	0.4552	0.1297
0.0	0.1606	0.5150	0.1971
1.0	0.3868	0.8329	0.3099
2.5	0.3001	1.0357	0.3504
5.0	0.2648	1.4494	0.3207
Panel 5: 2.5
−5.0	0.1321	0.5672	0.1594
−2.5	0.4829	0.9696	0.8214
−1.0	0.7027	2.3434	2.1031
0.0	0.7355	3.7407	2.8966
1.0	3.3669	9.8534	2.1806
2.5	4.9614	14.3890	5.0747
5.0	1.9213	15.8112	2.5600

Root mean square error is computed as the square root of the average squared difference between the population moments and the average of the estimated moments based on MLE parameter estimates for each distribution. The population moments included in the calculations are mean, variance, skewness, and kurtosis.

Table 16. Root mean square errors in the critical values.

Panel 1: $λ = - 2.5$
	MLE (ESN)	MLE (ST)	MLE (SN)
$τ =$ −5.0	0.0812	0.059	0.0932
−2.5	0.2294	0.2106	0.2017
−1.0	0.2158	0.2109	0.2074
0.0	0.0407	0.342	0.0361
1.0	0.1459	0.5914	0.2343
2.5	0.2971	0.465	0.2715
5.0	0.0668	0.1735	0.3859
Panel 2: $- 1.0$
−5.0	0.0336	0.0765	0.1295
−2.5	0.0492	0.0676	0.1692
−1.0	0.0610	0.0457	0.1172
0.0	0.0547	0.0892	0.0849
1.0	0.0715	0.1072	0.1381
2.5	0.0436	0.1564	0.1600
5.0	0.0331	0.0808	0.1662
Panel 3: 0
−5.0	0.0131	0.0830	0.1360
−2.5	0.0365	0.0956	0.1347
−1.0	0.0185	0.1062	0.1427
0.0	0.0260	0.0786	0.1465
1.0	0.0190	0.1070	0.1452
2.5	0.0445	0.1282	0.1184
5.0	0.0486	0.0824	0.1750
Panel 4: 1
−5.0	0.0335	0.0782	0.1309
−2.5	0.0243	0.0481	0.1753
−1.0	0.0516	0.0807	0.2266
0.0	0.0649	0.1004	0.2440
1.0	0.0506	0.0925	0.2430
2.5	0.0468	0.1542	0.2263
5.0	0.0203	0.1498	0.1910
Panel 5: 2.5
−5.0	0.0544	0.0516	0.2556
−2.5	0.2504	0.2543	0.5273
−1.0	0.2045	0.2328	0.7658
0.0	0.0512	0.2558	0.8942
1.0	0.1020	0.5654	0.9505
2.5	0.2790	0.5385	0.6304
5.0	0.1155	0.3089	0.3715

Root mean square error is computed as the square root of the average squared difference between the population critical values and the average of the estimated values based on MLE parameter estimates for each distribution. The critical values are computed at nominal percent probabilities equal to 0.05, 0.5, 2.5, 5.0, 95.0, 97.5, 99.5, and 99.95.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Adcock, C.J. Properties and Limiting Forms of the Multivariate Extended Skew-Normal and Skew-Student Distributions. Stats 2022, 5, 270-311. https://doi.org/10.3390/stats5010017

AMA Style

Adcock CJ. Properties and Limiting Forms of the Multivariate Extended Skew-Normal and Skew-Student Distributions. Stats. 2022; 5(1):270-311. https://doi.org/10.3390/stats5010017

Chicago/Turabian Style

Adcock, Christopher J. 2022. "Properties and Limiting Forms of the Multivariate Extended Skew-Normal and Skew-Student Distributions" Stats 5, no. 1: 270-311. https://doi.org/10.3390/stats5010017

APA Style

Adcock, C. J. (2022). Properties and Limiting Forms of the Multivariate Extended Skew-Normal and Skew-Student Distributions. Stats, 5(1), 270-311. https://doi.org/10.3390/stats5010017

Article Menu

Properties and Limiting Forms of the Multivariate Extended Skew-Normal and Skew-Student Distributions

Abstract

1. Introduction

2. Multivariate Extended Skew-Normal Distribution

2.1. The Truncated Normal Distribution and Its Approximations

2.2. Moments of the Truncated Normal Distribution

2.3. Standardized Form of the Extended Skew-Normal Distribution

3. Multivariate Extended Skew-Student Distribution

3.1. The Truncated Student’s t Distribution

3.2. Moments of the Truncated Student’s t Distribution

3.3. Moments of the MEST Distribution

3.4. Standardized Forms of the MEST Distribution

4. Hidden Truncation Models

4.1. Hidden Truncation under the Normal Distribution

4.2. Hidden Truncation under Student’s t Distribution

4.3. Hidden Truncation with Extended Distributions

4.4. Stock Market Crashes

5. Extended Skew-Normal versus Skew-Student

5.1. Moment Matching Study

5.2. Simulation Study

6. Concluding Remarks

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Asymptotic Distribution of a Truncated Student’s t Variable

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI