Article

Testing Multivariate Normality Based on F-Representative Points

Sirao Wang, Jiajuan Liang, Min Zhou and Huajun Ye

1 Faculty of Science and Technology, BNU-HKBU United International College, Zhuhai 519087, China
2 Department of Mathematics, Hong Kong Baptist University, Hong Kong, China
3 Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, BNU-HKBU United International College, Zhuhai 519087, China
* Author to whom correspondence should be addressed.
Mathematics 2022, 10(22), 4300; https://doi.org/10.3390/math10224300
Submission received: 11 July 2022 / Revised: 2 September 2022 / Accepted: 14 November 2022 / Published: 16 November 2022
(This article belongs to the Special Issue Distribution Theory and Application)

Abstract: The multivariate normal distribution is a common assumption in many statistical models and methodologies for high-dimensional data analysis, and the exploration of approaches to testing multivariate normality never stops. Because of the characteristics of the multivariate normal distribution, most existing approaches to testing multivariate normality have their own advantages and drawbacks in power performance. These approaches can be classified into two types: multivariate and univariate. Using the characterization of the multivariate normal distribution through the Mahalanobis distance, we propose an approach to testing multivariate normality that is based on representative points of the simple univariate F-distribution and the traditional chi-square statistic. This approach provides a new way of improving the traditional chi-square test for goodness-of-fit. A limited Monte Carlo study shows a considerable power improvement of the representative-point-based chi-square test over the traditional one. An illustration of testing goodness-of-fit for three well-known datasets gives results consistent with those from classical methods.

1. Introduction

The effectiveness of many statistical methodologies relies on the multivariate normal assumption in high-dimensional data analysis. There are various approaches to testing multivariate normality (MVN for short) in the literature (Mardia [1,2,3]; Koziol [4,5]; Mudholkar et al. [6]; Liang and Bentler [7]; Liang et al. [8]; Henze [9]; Mecklin and Mundfrom [10]; Thulin [11]; Szekely and Rizzo [12]; Tenreiro [13]; Kim and Park [14]; and Enomoto et al. [15]). As commented in some review papers (Andrews et al. [16]; Gnanadesikan [17]; Looney [18]), it is quite difficult for a single method to beat all others. To understand the power performance of some frequently cited methods for testing MVN, Monte Carlo studies have been conducted to compare these MVN tests (Royston [19,20,21]; Horswell and Looney [22]; Romeu and Ozturk [23]; Young et al. [24]; Beirlant et al. [25]; Mecklin [26]; Mecklin and Mundfrom [27]). In general, a new test is compared with a particular category of tests in the literature against a number of alternative distributions, since it would be unreasonable to test every possible deviation from normality (Mecklin and Mundfrom [10]). For example, Ward [28] compared the power of Mardia’s skewness and kurtosis tests, the Malkovich–Afifi test, Hawkins’ test, the Mardia–Foster test, and Ward’s extension of the Kolmogorov–Smirnov and Anderson–Darling tests. It was found that none of these tests performed well against the multivariate t-distribution, which is a mild deviation from normality. Ahn [29] described a squared jackknife distance and recommended the use of an F-distribution probability plot for checking multivariate normality. Liang et al. [8] constructed projection tests for MVN based on affine invariant statistics; their tests remain effective even for high dimensions with small sample sizes. Henze [9] stated that some desirable properties, such as affine invariance and consistency, are usually required for testing MVN.
There are various approaches to constructing tests for MVN. The idea of statistical representative points or principal points (Fang and He [30]; Flury [31]) provides a new approach to constructing goodness-of-fit tests. A set of representative points (RPs for simplicity) is a given number of discrete points selected from a continuous distribution so as to minimize the expected squared distance between the continuous random variable and the set of discrete points. In this way, RPs retain as much information about the original population as possible. The idea of representative points was first used by Cox [32] and Max [33] for quantization of the univariate normal distribution. Fang [34] applied this concept to the national project of clothing standardization in China and obtained desirable outcomes. Flury [31] applied a concept similar to that of Fang [34] to determine the optimal sizes of new gas masks and coined the terminology “principal points”. The theoretical foundation, the algorithms for computing RPs, and some associated applications have been developed over the past three decades (Flury [35]; Tarpey [36]; Fang, Zhou, and Wang [37]; Fang, He, and Yang [38]).
In this paper, we propose a new approach to constructing MVN tests based on the RPs of the univariate F-distribution and Pearson’s chi-square test. The new test builds on the squared jackknife distances [29] between each observation and the sample mean, which are affine invariant statistics. As a result, we can avoid estimating the unknown parameters $(\mu, \Sigma)$ of the d-variate normal distribution $N_d(\mu, \Sigma)$. In Section 2, an MVN test based on RPs of the F-distribution is described. A Monte Carlo study is carried out to investigate the performance of the proposed test in Section 3. Section 4 illustrates the application of the new test to three real examples, with computational results from other classical MVN tests. The concluding remarks are summarized in Section 5. All data analysis is performed using the R software (R Development Core Team, 2009).

2. The MVN Test Based on the F-Representative Points

2.1. A Brief Review on Affine Invariance

A test for MVN is usually expected to have the property of affine invariance. The consequence of this property is that the null distribution of the test statistic does not depend on the unknown mean $\mu$ and covariance matrix $\Sigma$ of the multivariate normal distribution $N_d(\mu, \Sigma)$. Henze ([9], Proposition 2.1) pointed out that any affine invariant test for MVN is a function of the sample Mahalanobis distances (M-distances for short) and angles. The M-distances are defined as follows.
Let $\{x_1, \ldots, x_n\}$ be an i.i.d. (independent identically distributed) sample from a continuous distribution $P_X$. $\mathcal{N}_d$ stands for the family of d-dimensional normal distributions $N_d(\mu, \Sigma)$ with mean vector $\mu \in \mathbb{R}^d$ and a nonsingular covariance matrix $\Sigma$. The problem of assessing MVN for the sample is to test the hypothesis
$$H_d : P_X \in \mathcal{N}_d \qquad (1)$$
against general alternatives. Any affine invariant statistic $T_n(x_1, \ldots, x_n)$ should satisfy the condition
$$T_n(Ax_1 + b, \ldots, Ax_n + b) = T_n(x_1, \ldots, x_n) \qquad (2)$$
for any $b \in \mathbb{R}^d$ and nonsingular matrix $A \in \mathbb{R}^{d \times d}$. Denote the sample mean and the sample covariance matrix by
$$\bar{x} = \frac{1}{n}\sum_{j=1}^{n} x_j \quad \text{and} \quad S_n = \frac{1}{n}\sum_{j=1}^{n} (x_j - \bar{x})(x_j - \bar{x})', \qquad (3)$$
respectively. The sample M-distance between two observations $x_j$ and $x_k$ is defined by
$$D_{n,jk} = (x_j - \bar{x})' S_n^{-1} (x_k - \bar{x}), \qquad j, k = 1, \ldots, n. \qquad (4)$$
In particular, when $j = k$, denote
$$D_{n,j}^2 = D_{n,jj} = (x_j - \bar{x})' S_n^{-1} (x_j - \bar{x}). \qquad (5)$$
It is known that any test for MVN based on the M-distances (4) satisfies (2), and so its null distribution does not depend on the normal parameters $\mu$ and $\Sigma$. In the following subsections, we construct a test for MVN that is a function of the M-distances (4).
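As a numerical illustration of the invariance property (2) (our own sketch in R, not code from the paper; all function names are hypothetical), the squared M-distances in (5) can be computed and checked under a random affine map:

```r
## Squared M-distances D_{n,j}^2 in (5); invariant under x -> A x + b by (2).
mdist2 <- function(X) {
  X  <- as.matrix(X)
  Xc <- sweep(X, 2, colMeans(X))        # x_j - xbar
  Sn <- crossprod(Xc) / nrow(X)         # S_n with divisor n, as in (3)
  rowSums((Xc %*% solve(Sn)) * Xc)      # D_{n,j}^2, j = 1, ..., n
}

set.seed(1)
X <- matrix(rnorm(50 * 3), 50, 3)
A <- matrix(rnorm(9), 3, 3)             # almost surely nonsingular
b <- rnorm(3)
Y <- t(A %*% t(X) + b)                  # affine image A x_j + b, row by row
max(abs(mdist2(X) - mdist2(Y)))         # numerically zero, illustrating (2)
```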

2.2. The Jackknife Distance

Define the jackknife mean and the jackknife covariance matrix by
$$\bar{x}_{(i)} = \frac{1}{n-1}\sum_{j \neq i} x_j \quad \text{and} \quad S_{(i)} = \frac{1}{n-2}\sum_{j \neq i} (x_j - \bar{x}_{(i)})(x_j - \bar{x}_{(i)})', \qquad (6)$$
respectively. The squared jackknife distance from the i-th observation $x_i$ to the sample mean is defined by
$$D_i^2 = (x_i - \bar{x})' S_{(i)}^{-1} (x_i - \bar{x}), \qquad i = 1, \ldots, n. \qquad (7)$$
Ahn [29] employed the squared jackknife distance (7) to construct an F-probability plot for assessing MVN of the i.i.d. sample $\{x_1, \ldots, x_n\}$. It is pointed out that the jackknife covariance matrix $S_{(i)}$ is related to the usual sample covariance matrix $S_n$ in (3) by
$$\frac{1}{n-2}\, S_{(i)}^{-1} = S^{-1} + \gamma_i\, S^{-1}(x_i - \bar{x})(x_i - \bar{x})' S^{-1}, \qquad (8)$$
where
$$S = \sum_{j=1}^{n} (x_j - \bar{x})(x_j - \bar{x})' = n S_n, \qquad \gamma_i = \frac{n}{(n-1) - n\,(x_i - \bar{x})' S^{-1} (x_i - \bar{x})}. \qquad (9)$$
The following monotonic relation between the squared jackknife distance and the usual chi-squared distance is given in [29]:
$$D_i^2 = \frac{(n-1)(n-2)\,\tilde{D}_i^2}{(n-1) - n\,\tilde{D}_i^2}, \qquad \tilde{D}_i^2 = (x_i - \bar{x})' S^{-1} (x_i - \bar{x}). \qquad (10)$$
It is obvious that $\tilde{D}_i^2$ is related to the M-distance $D_{n,i}^2$ in (5) by $D_{n,i}^2 = n \tilde{D}_i^2$. Equations (8)–(10) provide a simple computational method for obtaining the squared jackknife distances $\{D_i^2 : i = 1, \ldots, n\}$ that avoids computing the n inverse matrices $S_{(i)}^{-1}$ in (6). The following theorem can be easily derived from [29].
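As an illustration of this shortcut (a minimal sketch in R with our own function names, not the authors' code), the distances in (7) and the adjusted distances $d_i^2$ of Theorem 1 below can be computed with a single matrix inverse:

```r
## Squared jackknife distances D_i^2 in (7), computed through (10): only the
## single inverse of S is needed instead of the n inverses S_(i)^{-1} in (6).
jack_dist2 <- function(X) {
  X   <- as.matrix(X)
  n   <- nrow(X)
  Xc  <- sweep(X, 2, colMeans(X))                  # x_i - xbar
  S   <- crossprod(Xc)                             # S = n * S_n, as in (9)
  Dt2 <- rowSums((Xc %*% solve(S)) * Xc)           # chi-squared distances in (10)
  (n - 1) * (n - 2) * Dt2 / ((n - 1) - n * Dt2)    # D_i^2 via (10)
}

## Adjusted distances of Theorem 1: d_i^2 ~ F(d, n-1-d) under normality, (11).
f_dist2 <- function(X) {
  n <- nrow(X); d <- ncol(X)
  n * (n - 1 - d) / ((n - 1) * (n - 2) * d) * jack_dist2(X)
}
```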
Theorem 1.
Under hypothesis (1), the following two assertions are true:
(1) the adjusted jackknife distance has an exact F-distribution:
$$d_i^2 = \frac{n(n-1-d)}{(n-1)(n-2)d}\, D_i^2 \sim F(d, n-1-d); \qquad (11)$$
(2) $\{d_i^2 : i = 1, \ldots, n\}$ are asymptotically independent.
Proof of Theorem 1.
The exact F-distribution for each $d_i^2$ in (11) is given in Theorem 1 of Ahn [29]. The asymptotic independence of $\{d_i^2 : i = 1, \ldots, n\}$ can be verified as follows. By the strong law of large numbers,
$$\bar{x} \xrightarrow{a.s.} \mu \quad (n \to \infty),$$
where “$\xrightarrow{a.s.}$” means “converges almost surely”. $S_n$ can be written as
$$S_n = \frac{1}{n}\sum_{i=1}^{n} (x_i - \mu)(x_i - \mu)' - (\bar{x} - \mu)(\bar{x} - \mu)'.$$
Because
$$\frac{1}{n}\sum_{i=1}^{n} (x_i - \mu)(x_i - \mu)'$$
is the sample mean of the i.i.d. terms $(x_i - \mu)(x_i - \mu)'$, the strong law of large numbers (Feller [39], page 238, Theorem 1) gives
$$\frac{1}{n}\sum_{i=1}^{n} (x_i - \mu)(x_i - \mu)' \xrightarrow{a.s.} E\{(x_1 - \mu)(x_1 - \mu)'\} = \Sigma.$$
According to the continuous mapping theorem (Van der Vaart [40], page 7), it follows that
$$(\bar{x} - \mu)(\bar{x} - \mu)' \xrightarrow{a.s.} 0, \qquad S_n \xrightarrow{a.s.} \Sigma \quad (n \to \infty),$$
and
$$n\tilde{D}_i^2 \xrightarrow{a.s.} (x_i - \mu)'\Sigma^{-1}(x_i - \mu), \qquad n\tilde{D}_j^2 \xrightarrow{a.s.} (x_j - \mu)'\Sigma^{-1}(x_j - \mu) \quad (n \to \infty)$$
for any given $i \neq j$. The independence between $x_i$ and $x_j$ results in the asymptotic independence between $n\tilde{D}_i^2$ and $n\tilde{D}_j^2$ for $i \neq j$. Because $d_i^2$ is a continuous function of $n\tilde{D}_i^2$ for $i = 1, \ldots, n$, using the continuous mapping theorem again, we obtain the asymptotic independence of $\{d_i^2 : i = 1, \ldots, n\}$ in (11). This completes the proof.    □

2.3. The Chi-Squared Test Based on the F-Representative Points

The above Theorem 1 provides an approach to testing hypothesis (1). Instead of testing (1), we can test
$$H_0 : \{d_i^2 : i = 1, \ldots, n\} \text{ in (11) is a sample from } F(d, n-1-d) \qquad (12)$$
against the alternative that $H_0$ is not true. It is obvious that if hypothesis (12) is rejected, hypothesis (1) is also rejected, but the converse is not true. A test for hypothesis (12) is called a necessary test [41] for the normality of the original data in the literature. Testing hypothesis (12) can be carried out by the classical chi-squared test for general goodness of fit. It is well known that the classical chi-squared test faces the problem of choosing classification cells for the observed sample data (Sturges [42]; Mann and Wald [43]; Williams [44]; Dahiya and Gurland [45]; Mineo [46]; Harrison [47]; Kallenberg [48,49]; Oosterhoff [50]; Quine and Robinson [51]; D’Agostino and Stephens [52]; Koehler and Gann [53]; Bogdan [54]). A classification with equiprobable cells is a common choice in the literature. Because the representative points minimize a quadratic loss function (see Appendix A), we conjecture that classification cells based on representative points may perform better than the simple equiprobable classification. Therefore, we propose to use the F-representative points to construct the classification cells, expecting to improve the performance of the classical chi-squared test. A Monte Carlo study is carried out in the next section to verify the performance of this approach.
The F-representative points are a set of points $\{F_1, \ldots, F_m\}$ (for a selected number of points m) that minimize the quadratic loss function
$$\phi(x_1, \ldots, x_m) = \int_0^{\infty} \min_{1 \le i \le m} (x_i - x)^2 f_F(x; d, n-1-d)\, dx, \qquad (13)$$
where $f_F(x; d, n-1-d)$ stands for the density function of the F-distribution with degrees of freedom $(d, n-1-d)$; that is,
$$\phi(F_1, \ldots, F_m) = \min\{\phi(x_1, \ldots, x_m) : 0 < x_1 < \cdots < x_m < \infty\}.$$
Define the intervals
$$I_1 = \Big(0, \tfrac{F_1 + F_2}{2}\Big], \quad I_2 = \Big(\tfrac{F_1 + F_2}{2}, \tfrac{F_2 + F_3}{2}\Big], \quad \ldots, \quad I_{m-1} = \Big(\tfrac{F_{m-2} + F_{m-1}}{2}, \tfrac{F_{m-1} + F_m}{2}\Big], \quad I_m = \Big(\tfrac{F_{m-1} + F_m}{2}, +\infty\Big) \qquad (14)$$
and the probabilities
$$p_i = \int_{I_i} f_F(x; d, n-1-d)\, dx, \qquad i = 1, \ldots, m. \qquad (15)$$
According to Fang and He [30], $\{p_1, \ldots, p_m\}$ can be considered a set of “representative probabilities” of the F-distribution $F(d, n-1-d)$.
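A minimal R sketch of (14) and (15), assuming a vector rp of m representative points of $F(d, n-1-d)$ is already available (e.g., from the algorithm in Appendix A); the function name is ours:

```r
## Representative probabilities p_1, ..., p_m in (15): the F(d, n-1-d) masses
## of the intervals I_1, ..., I_m in (14), whose endpoints are midpoints of rp.
rep_probs <- function(rp, d, n) {
  cuts <- c(0, (head(rp, -1) + tail(rp, -1)) / 2, Inf)  # endpoints of I_1, ..., I_m
  diff(pf(cuts, df1 = d, df2 = n - 1 - d))              # p_i, i = 1, ..., m
}
```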
Based on Theorem 1 above, a test for hypothesis (1) can (for a large sample size n) be approximately reduced to testing (12) with the classification intervals defined by (14). The $\chi^2$-statistic is computed as
$$\chi_R^2 = \sum_{i=1}^{m} \frac{(n_i - np_i)^2}{np_i} \xrightarrow{d} \chi^2(m-1) \quad (n \to \infty, \text{ under hypothesis (1)}), \qquad (16)$$
where $n_i$ is the frequency of the transformed, approximately i.i.d. sample points $\{d_i^2 : i = 1, \ldots, n\}$ computed by (11) that fall in the interval $I_i$ in (14). The approximate p-value is computed by
$$P(\chi_R^2, \nu) = K \int_{\chi_R^2}^{\infty} z^{\nu/2 - 1} \exp\Big(-\frac{z}{2}\Big)\, dz, \quad \text{with } \nu = m - 1, \; K = \Big[2^{\nu/2}\, \Gamma\Big(\frac{\nu}{2}\Big)\Big]^{-1}. \qquad (17)$$
The Monte Carlo study in the next section will examine how good the chi-square approximation in (16) is by simulating the empirical type I error rates and the power of the test against some selected sets of alternative distributions.
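Putting the pieces together, a hedged R sketch of the test (16)–(17) might look as follows; it assumes the illustrative helpers f_dist2() and rep_probs() from the sketches above and a vector rp of m F-representative points:

```r
## The representative-points chi-squared test chi_R^2 in (16) with p-value (17).
chisq_rp_test <- function(X, rp) {
  n <- nrow(X); d <- ncol(X); m <- length(rp)
  p    <- rep_probs(rp, d, n)                           # p_i in (15)
  cuts <- c(0, (head(rp, -1) + tail(rp, -1)) / 2, Inf)  # intervals I_i in (14)
  ni   <- tabulate(cut(f_dist2(X), cuts), nbins = m)    # cell frequencies n_i
  stat <- sum((ni - n * p)^2 / (n * p))                 # chi_R^2 in (16)
  c(statistic = stat,
    p.value = pchisq(stat, df = m - 1, lower.tail = FALSE))  # upper tail, (17)
}
```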

3. A Monte Carlo Study

In order to compare the $\chi^2$-test (16) based on the “representative probabilities” $\{p_1, \ldots, p_m\}$ in (15) with the traditional chi-squared test, we choose equiprobable cells for the traditional test. For a selected number of cells m, define the $m-1$ interval endpoints $a_1 < \cdots < a_{m-1}$ by
$$P(F(d, n-1-d) < a_1) = \frac{1}{m}; \quad P(a_{i-1} < F(d, n-1-d) < a_i) = \frac{1}{m}, \; i = 2, \ldots, m-1; \quad P(F(d, n-1-d) > a_{m-1}) = \frac{1}{m}. \qquad (18)$$
Denote by $\chi_T^2$ the traditional chi-squared test based on the interval endpoints (18), which also has the approximate null distribution $\chi^2(m-1)$:
$$\chi_T^2 = \sum_{i=1}^{m} \frac{(N_i - n/m)^2}{n/m}, \qquad (19)$$
where $N_i$ is the frequency of the approximately i.i.d. transformed sample points $\{d_i^2 : i = 1, \ldots, n\}$ that fall in the i-th interval.
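A corresponding sketch of the traditional competitor (18)–(19), with cells equiprobable under $F(d, n-1-d)$ (again assuming the illustrative helper f_dist2() above):

```r
## The traditional equiprobable chi-squared test chi_T^2 in (19), cells (18).
chisq_eq_test <- function(X, m) {
  n <- nrow(X); d <- ncol(X)
  cuts <- c(0, qf((1:(m - 1)) / m, df1 = d, df2 = n - 1 - d), Inf)  # a_1, ..., a_{m-1}
  Ni   <- tabulate(cut(f_dist2(X), cuts), nbins = m)    # cell frequencies N_i
  stat <- sum((Ni - n / m)^2 / (n / m))                 # chi_T^2 in (19)
  c(statistic = stat,
    p.value = pchisq(stat, df = m - 1, lower.tail = FALSE))
}
```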

3.1. A Comparison between Empirical Type I Error Rates

Because the chi-squared tests based on the transformed sample points $\{d_i^2 : i = 1, \ldots, n\}$ given by (11) are affine invariant under any nonsingular linear transformation of the original i.i.d. sample $\{x_1, \ldots, x_n\}$, we only need to generate samples from the d-dimensional standard normal $N_d(0, I_d)$ ($I_d$ stands for the $d \times d$ identity matrix). The simulation results under 2000 replications for each case are summarized in Table 1. Both statistics $\chi_R^2$ and $\chi_T^2$ control their empirical type I error rates reasonably well at the significance level 0.05. In most cases, $\chi_T^2$ is more conservative than $\chi_R^2$, especially for smaller sample sizes. The two tests perform similarly at significance levels 0.01 and 0.10; those simulation outcomes are not presented here to save space.

3.2. A Power Comparison

The following alternative distributions are selected.
(1) The multivariate t-distribution, with density of the form
$$f_t(x) = C\Big(1 + \frac{\|x\|^2}{v}\Big)^{-\frac{d+v}{2}}, \quad v > 0,$$
where $\|\cdot\|$ stands for the Euclidean norm of a vector and C is a normalizing constant; the degrees of freedom are set to $v = 5$;
(2) The $\beta$-generalized normal distribution $N_d(0, I_d, 1/2)$ with $\beta = 1/2$, which has a density of the form (Goodman and Kotz [55])
$$f(x_1, \ldots, x_d) = \frac{\beta^d r^{d/\beta}}{2^d \Gamma^d(1/\beta)} \exp\Big(-r \sum_{i=1}^{d} |x_i|^{\beta}\Big), \quad (x_1, \ldots, x_d) \in \mathbb{R}^d,$$
where $r > 0$ is a parameter; we let $r = 1/2$ in the simulation and denote the distribution by $\beta$g-normal;
(3) The shifted i.i.d. $\chi^2(1)$ distribution with i.i.d. marginals, each marginal having the same distribution as the random variable $Y = X - E(X)$, where $X \sim \chi^2(1)$, the univariate chi-squared distribution with 1 degree of freedom, and $E(X) = 1$;
(4) The distribution $N(0,1) + \chi^2(2)$, which consists of $[d/2]$ i.i.d. normal $N(0,1)$ marginals and $d - [d/2]$ i.i.d. $\chi^2(2)$ marginals, where $[d/2]$ stands for the integer part of $d/2$;
(5) The shifted i.i.d. $\exp(1)$ distribution with i.i.d. marginals, each marginal having the same distribution as the random variable $Y = X - E(X)$, where $X \sim \exp(1)$, the univariate exponential distribution.
For each of these alternative distributions, we choose the sample sizes $n = 20, \ldots, 400$. Based on simulations with 10,000 replications, the power values versus the sample size n for the statistics $\chi_R^2$ in (16) and $\chi_T^2$ in (19) are plotted in Figure 1, Figure 2, Figure 3, Figure 4 and Figure 5. As can be seen, the proposed statistic $\chi_R^2$ consistently outperforms the classical $\chi_T^2$ across the different alternative distributions. There is a clear trend that the chi-squared statistic $\chi_R^2$ based on F-representative points can be quite successful in improving the performance of the traditional chi-squared statistic $\chi_T^2$. To see the power performance of $\chi_R^2$ against other frequently used tests in the literature, we choose a category of tailor-made MVN tests (Mardia [3]; Henze and Wagner [56]; Szekely and Rizzo [57]) and two types of alternative distributions (symmetric and non-symmetric) in Figure 6 and Figure 7, where “hz” stands for Henze and Wagner’s statistic and “energy” for Szekely and Rizzo’s test. The simple comparison in Figure 6 and Figure 7 shows that the $\chi_R^2$-test based on F-representative points is comparable to the selected frequently used tests in the literature. More power comparisons based on three additional alternative distributions are given in Appendix C. This gives some confidence in using the $\chi_R^2$-test as a supplement to enhance the application of some existing tests.
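For readers who wish to reproduce a single cell of such an experiment, the following R sketch (our own illustration, not the authors' simulation code) estimates the power of the representative-points test against the multivariate t-distribution with $v = 5$; note that rp must be recomputed for each pair $(n, d)$, because $F(d, n-1-d)$ depends on n:

```r
## Empirical power of chi_R^2 against the multivariate t_v alternative.
power_mvt <- function(n, d, rp, B = 1000, v = 5, alpha = 0.05) {
  rej <- replicate(B, {
    Z <- matrix(rnorm(n * d), n, d)
    X <- Z / sqrt(rchisq(n, v) / v)      # row i: z_i / sqrt(w_i / v) ~ mult. t_v
    chisq_rp_test(X, rp)["p.value"] < alpha
  })
  mean(rej)                              # rejection rate = empirical power
}
```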

4. Illustrative Examples

Example 1—Ramus Bone Data. Elston and Grizzle [58] collected data on the ramus height (in millimeters) of 20 boys, each measured at ages 8, 8.5, 9, and 9.5 years. The objective of the original study was to establish a normal growth curve for use by orthodontists. The data have a few interesting features. Timm [59] showed that the recorded heights of the ramus bone appear marginally to be normally distributed but show a certain departure from multivariate normality at $\alpha = 0.05$. For testing the multivariate normality of the measurements at ages 8, 8.5, 9, and 9.5, the methods proposed by Zhou and Shao [60] gave p-values of 0.002, 0.054, <0.001, and <0.001.
Example 2—Fisher’s Iris Data. Fisher’s Iris dataset was used by R.A. Fisher in his pioneering 1936 study of discriminant analysis [61]. In this well-known example of multivariate data, three varieties of iris flowers (setosa, versicolor, and virginica) are measured, giving a total of 150 observations (50 per variety) on four variables: sepal length, sepal width, petal length, and petal width. Considering the multivariate normality of the four measurements, Srivastava and Mudholkar [62] concluded that the assumption of multivariate normality may not be appropriate for any of the varieties of Iris. Furthermore, Shao and Zhou [63] rejected multivariate normality for this dataset using different tests, with all p-values < 2% at level $\alpha = 5\%$, which is also consistent with the findings in Small [64].
Example 3—Rao’s Cork Data. The dataset collected by Rao [65] consists of the weights of cork borings, reflecting the thickness of the bark deposit, taken from the north (N), east (E), south (S), and west (W) directions of 28 cork trees. The original problem was to investigate whether the bark deposit varies in thickness across the four directions. Regarding the multivariate normality assumption, Srivastava and Hui [66] rejected the hypothesis using tests with p-values of 0.01 and 0.037. Moreover, Mudholkar, McDermott, and Srivastava [6] also doubted the multivariate normality assumption based on their p-value of 0.0302. Srivastava and Mudholkar [62] applied the simplified Fisher combination statistic and obtained a p-value of 0.006, which also implies a certain departure from multivariate normality.
To assess the assumption of multivariate normality for these datasets, the p-values under different m (the number of representative points) from the two chi-squared tests $\chi_R^2$ in (16) and $\chi_T^2$ in (19) are summarized in Table 2 below. Meanwhile, we use the classical skewness, kurtosis, and univariate Shapiro–Wilk tests [67] to assess the marginal normality of each of the four variables in each example. Table 3 also gives the p-values of the skewness and kurtosis statistics from Mardia’s MVN test. The variables in the two tables are:
X_8 = boys’ ramus height at age 8 in Example 1
X_8.5 = boys’ ramus height at age 8.5 in Example 1
X_9 = boys’ ramus height at age 9 in Example 1
X_9.5 = boys’ ramus height at age 9.5 in Example 1
X_sl = sepal length in Example 2
X_sw = sepal width in Example 2
X_pl = petal length in Example 2
X_pw = petal width in Example 2
X_N = the weight of cork borings taken from the north in Example 3
X_E = the weight of cork borings taken from the east in Example 3
X_S = the weight of cork borings taken from the south in Example 3
X_W = the weight of cork borings taken from the west in Example 3
The p-values from the representative-points chi-squared test $\chi_R^2$ in Table 2 clearly indicate a significant departure from joint multivariate normality for the four variables in each example, whereas the p-values from the traditional chi-squared test $\chi_T^2$ only partially imply a certain departure from joint multivariate normality. The potential departure from joint multivariate normality is also supported by Mardia’s multivariate skewness test and by most of the univariate normality tests (UVN tests in Table 3) based on Shapiro–Wilk’s statistic. Because the results from the $\chi_R^2$-test are nearly all consistent with those from well-performing tests in the literature, the representative-point-based $\chi_R^2$-test appears to agree with some well-known existing tests more closely than the traditional $\chi_T^2$-test does.

5. Concluding Remarks

Testing multivariate normality is a long-lasting research direction in the area of testing goodness-of-fit. There have been various approaches to constructing goodness-of-fit tests in the literature. The representative-points approach in this paper provides a different angle for modifying the classical Pearson–Fisher chi-squared test for multivariate probability distributions such as the multivariate normal. The limited Monte Carlo study in this paper demonstrates a remarkable power improvement of the representative-points chi-squared test over the traditional equiprobable chi-squared test. Although it is difficult to prove theoretically why the representative-points chi-squared test improves upon the traditional equiprobable one, we provide an implementable way to extend the application of the theory of statistical representative points. More importantly, the simple power comparison with other multivariate normality tests in Figure 6 and Figure 7 implies that the representative-points-based chi-squared test can have power performance competitive with some well-known existing tests in the literature. It should be pointed out that the representative-points-based chi-square test in this paper is only a “necessary test” [68] for multivariate normality: if the null hypothesis is rejected by $\chi_R^2$ in (16), one can definitely conclude a departure from multivariate normality; however, if the test fails to reject the null hypothesis, there is no guarantee of multivariate normality. This implies that the test in this paper possesses the same weakness as many existing tests for multivariate normality. The representative-points approach to constructing goodness-of-fit tests sheds some additional light on the rich literature on testing multivariate normality. This paper also provides some ideas for improving other existing tests (e.g., Malkovich and Afifi [69]; McAssey [70]) for multivariate normality by using the jackknife distance and statistical representative points.

Author Contributions

Conceptualization, J.L., H.Y., and M.Z.; methodology, J.L., H.Y., M.Z. and S.W.; software, S.W.; validation, J.L.; writing—original draft preparation, S.W.; writing—review and editing, J.L., H.Y. and M.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially supported by the Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, BNU-HKBU United International College (UIC), project code 2022B1212010006 and in part by Guangdong Higher Education Upgrading Plan (2021-2025) R0400001-22 and a UIC New Faculty Start-up Research Fund R72021106.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors thank the Editor, Associate Editor and referees for their constructive comments leading to significant improvement of this paper.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

    The following abbreviations are used in this manuscript:
i.i.d.    independent identically distributed
M-distance    Mahalanobis distance
MVN    multivariate normality
MSE    mean square error
RPs    representative points
UVN    univariate normality

Appendix A. An Algorithm for MSE-RPs of the F-Distribution

The method of mean square error representative points (MSE-RPs) selects k representative points that best approximate a distribution in the sense of minimal mean square error, computed here by a high-precision algorithm. Let $F_k := \{F_1, F_2, \ldots, F_k\}$ be a k-point principal set for the F-distribution with probability density function $f_F(x)$, where $-\infty < F_1 < F_2 < \cdots < F_k < \infty$. The MSE function, often referred to as the cost or distortion error of the representative points, is
$$\mathrm{MSE}(F_1, F_2, \ldots, F_k) = \int_{-\infty}^{+\infty} \min_{1 \le i \le k} (x - F_i)^2 f_F(x)\, dx = \sum_{i=1}^{k} \int_{e_i}^{e_{i+1}} (x - F_i)^2 f_F(x)\, dx,$$
where $e_1 = -\infty$, $e_i = (F_i + F_{i-1})/2$ for $i = 2, \ldots, k$, and $e_{k+1} = +\infty$. In general, $(e_1, e_{k+1})$ are the endpoints of the domain of the random variable X. To minimize the MSE function, we set the partial derivatives with respect to each $F_i$ to zero; the representative points are the roots of the following equations:
$$\int_{-\infty}^{\frac{F_1+F_2}{2}} (x - F_1) f_F(x)\, dx = 0; \qquad \int_{\frac{F_{i-1}+F_i}{2}}^{\frac{F_i+F_{i+1}}{2}} (x - F_i) f_F(x)\, dx = 0, \; i = 2, \ldots, k-1; \qquad \int_{\frac{F_{k-1}+F_k}{2}}^{+\infty} (x - F_k) f_F(x)\, dx = 0.$$
Based on the idea of Chakraborty et al. [71], the following algorithm is used. Suppose that D is the domain of the probability density function $f_F(x)$, with limiting values
$$c := \inf(D), \qquad d := \sup(D),$$
and
$$M_{F_i}(F_k) := \begin{cases} \big[c, \frac{F_1+F_2}{2}\big) & \text{if } i = 1, \\ \big[\frac{F_{i-1}+F_i}{2}, \frac{F_i+F_{i+1}}{2}\big) & \text{if } 2 \le i \le k-1, \\ \big[\frac{F_{k-1}+F_k}{2}, d\big) & \text{if } i = k, \end{cases}$$
where $M_{F_i}(F_k)$ represents the Voronoi region of $F_i$ with respect to the set $F_k$ for all $1 \le i \le k$; here $c = 0$ and $d = \infty$ for the F-distribution. Since the principal points are the conditional expected values of their own Voronoi regions, we have
$$F_i = E[X : X \in M_{F_i}(F_k)]$$
for all $1 \le i \le k$. For the sake of clarity, we denote the endpoints of the regions $M_{F_j}(F_k)$ by
$$m_j := \begin{cases} c & \text{if } j = 0, \\ \frac{F_j + F_{j+1}}{2} & \text{if } 1 \le j \le k-1, \\ d & \text{if } j = k, \end{cases}$$
which depend continuously on the array $F_k$. We seek to solve, numerically, the set of k nonlinear equations
$$F_j = E[X : X \in M_{F_j}(F_k)] = \frac{e(M_{F_j}(F_k))}{P(M_{F_j}(F_k))} \quad \text{for } j = 1, 2, \ldots, k,$$
where the unconditional expected value function $e(M_{F_j}(F_k))$ and the probability function $P(M_{F_j}(F_k))$ are defined by
$$e(M_{F_j}(F_k)) := \int_{m_{j-1}}^{m_j} x f_F(x)\, dx \quad \text{and} \quad P(M_{F_j}(F_k)) := \int_{m_{j-1}}^{m_j} f_F(x)\, dx.$$
Solving the nonlinear system is equivalent to finding a root of the function $g: \mathbb{R}^k \to \mathbb{R}^k$ whose j-th entry is defined as the difference
$$g_j(F) := F_j \int_{m_{j-1}}^{m_j} f_F(x)\, dx - \int_{m_{j-1}}^{m_j} x f_F(x)\, dx \quad \text{for } j = 1, 2, \ldots, k.$$
We can apply Newton’s algorithm for computing the roots of nonlinear systems to obtain high-precision numerical solutions. Given an initial vector $F^{0} \in \mathbb{R}^k$, the Newton iteration for finding the root of $g(F)$ takes the form
$$F^{new} = F^{old} - J(F^{old})^{-1} g(F^{old}),$$
where $J: \mathbb{R}^k \to \mathbb{R}^{k \times k}$ is the Jacobian matrix, whose entries are defined as $J_{j,i} := \partial g_j / \partial F_i$. The iteration is continued until the residual $\|g(F^{new})\| \le 10^{-12}$ is sufficiently small. Note that $g_j(F)$, for $j = 1, 2, \ldots, k$, depends only on $F_{j-1}$, $F_j$, and $F_{j+1}$, indicating that the matrix $J(F)$ is always tridiagonal. Let $\Delta_j$ denote the distance between consecutive points:
$$\Delta_j := F_{j+1} - F_j \quad \text{for } j = 1, 2, \ldots, k-1.$$
Then the diagonal entries are given by
$$J_{j,j}(F) := \frac{\partial g_j}{\partial F_j} = \int_{m_{j-1}}^{m_j} f_F(x)\, dx - f_F(m_{j-1}) \frac{\Delta_{j-1}}{4} - f_F(m_j) \frac{\Delta_j}{4},$$
where we define $\Delta_0 = \Delta_k = 0$ for the sake of simplicity. In addition to being tridiagonal, the Jacobian is also symmetric:
$$J_{j+1,j} := \partial g_{j+1}/\partial F_j = \partial g_j/\partial F_{j+1} =: J_{j,j+1}.$$
The off-diagonal entries are given by
$$J_{j+1,j} = J_{j,j+1} = -f_F(m_j) \frac{\Delta_j}{4} \quad \text{for } j = 1, 2, \ldots, k-1.$$

Appendix B. R Code for Computing F-Representative Points

(In the published article, the R code of this appendix is typeset as two images.)
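Because the code appears only as images in the published version, we sketch below an R reconstruction of the Newton algorithm described in Appendix A; this is our own implementation under stated assumptions (numerical quadrature via integrate(), near-equiprobable quantiles as starting values, and a tolerance slightly looser than the quoted 10^-12 to allow for quadrature error), not the authors' original code:

```r
## k representative points of the F(d1, d2) distribution by Newton's method.
## Assumes k >= 2 and d2 > 2 (so the mean of F(d1, d2) exists); names are ours.
frep_points <- function(k, d1, d2, tol = 1e-10, maxit = 200) {
  stopifnot(k >= 2)
  f  <- function(x) df(x, d1, d2)                 # density f_F(x)
  Fp <- qf((1:k - 0.5) / k, d1, d2)               # starting values
  for (iter in 1:maxit) {
    e <- c(0, (Fp[-k] + Fp[-1]) / 2, Inf)         # endpoints m_0, ..., m_k
    P <- diff(pf(e, d1, d2))                      # cell probabilities P(M_j)
    E <- sapply(1:k, function(j)                  # cell integrals of x f_F(x)
           integrate(function(x) x * f(x), e[j], e[j + 1])$value)
    g <- Fp * P - E                               # g_j(F) from Appendix A
    if (max(abs(g)) < tol) break
    Dlt <- diff(Fp)                               # Delta_j = F_{j+1} - F_j
    fb  <- f(e[2:k]) * Dlt / 4                    # f_F(m_j) * Delta_j / 4
    dia <- P - c(0, fb) - c(fb, 0)                # diagonal (Delta_0 = Delta_k = 0)
    J   <- diag(dia)
    for (j in 1:(k - 1)) J[j, j + 1] <- J[j + 1, j] <- -fb[j]  # off-diagonal
    Fp <- Fp - solve(J, g)                        # Newton step
  }
  Fp
}

## Example: m = 20 representative points of F(d, n-1-d) for d = 4, n = 50:
## rp <- frep_points(20, 4, 45)
```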

Appendix C. Additional Power Comparisons

Figure A1. Power (α = 0.05) comparison for N(0,1) + χ²(2).
Figure A2. Power (α = 0.05) comparison for shifted i.i.d. χ²(1).
Figure A3. Power (α = 0.05) comparison for shifted i.i.d. exp(1).

References

1. Mardia, K.V. Measures of multivariate skewness and kurtosis with applications. Biometrika 1970, 57, 519–530.
2. Mardia, K.V. Applications of some measures of multivariate skewness and kurtosis for testing normality and robustness studies. Sankhyā Ser. A 1974, 36, 115–128.
3. Mardia, K.V. Tests of univariate and multivariate normality. Handb. Stat. 1980, 1, 297–320.
4. Koziol, J.A. A class of invariant procedures for assessing multivariate normality. Biometrika 1982, 69, 423–427.
5. Koziol, J.A. Assessing multivariate normality: A compendium. Commun. Stat. Theory Methods 1986, 15, 2763–2783.
6. Mudholkar, G.S.; McDermott, M.; Srivastava, D.K. A test of p-variate normality. Biometrika 1992, 79, 850–854.
7. Liang, J.; Bentler, P.M. A t-distribution plot to detect non-multinormality. Comput. Stat. Data Anal. 1995, 30, 31–44.
8. Liang, J.; Li, R.; Fang, H.; Fang, K.T. Testing multinormality based on low-dimensional projection. J. Stat. Plan. Inference 2000, 86, 129–141.
9. Henze, N. Invariant tests for multivariate normality: A critical review. Stat. Papers 2002, 43, 467–507.
10. Mecklin, C.J.; Mundfrom, D.J. An appraisal and bibliography of tests for multivariate normality. Int. Stat. Rev. 2004, 72, 123–138.
11. Thulin, M. Tests for multivariate normality based on canonical correlations. Stat. Meth. Appl. 2014, 23, 189–208.
12. Szekely, G.J.; Rizzo, M.L. Energy statistics: A class of statistics based on distances. J. Stat. Plan. Inference 2013, 143, 1249–1272.
13. Tenreiro, C. A new test for multivariate normality by combining extreme and nonextreme BHEP tests. Commun. Stat. Simul. Comput. 2017, 46, 1746–1759.
14. Kim, I.; Park, S. Likelihood ratio test for multivariate normality. Commun. Stat. Theory Meth. 2018, 47, 1923–1934.
15. Enomoto, R.; Hanusz, Z.; Hara, A.; Seo, T. Multivariate normality test using normalizing transformation for Mardia’s multivariate kurtosis. Commun. Stat. Simul. Comput. 2020, 49, 684–698.
16. Andrews, D.F.; Gnanadesikan, R.; Warner, J.L. Methods for assessing multivariate normality. Proc. Int. Symp. Multivar. Anal. 1973, 3, 95–116.
17. Gnanadesikan, R. Methods for Statistical Data Analysis of Multivariate Observations; Wiley: New York, NY, USA, 1977.
18. Looney, S.W. How to use tests for univariate normality to assess multivariate normality. Am. Stat. 1995, 39, 75–79.
19. Royston, J.P. Some techniques for assessing multivariate normality based on the Shapiro–Wilk W. Appl. Stat. 1983, 32, 121–133.
20. Royston, J.P. Approximating the Shapiro–Wilk W-test for non-normality. Stat. Comput. 1992, 2, 117–119.
21. Royston, J.P. Remark AS R94: A remark on Algorithm AS 181: The W test for normality. Appl. Stat. 1995, 44, 547–551.
22. Horswell, R.L.; Looney, S.W. A comparison of tests for multivariate normality that are based on measures of multivariate skewness and kurtosis. Stat. Comput. Simul. 1992, 42, 21–38.
23. Romeu, J.L.; Ozturk, A. A comparative study of goodness-of-fit tests for multivariate normality. J. Multivar. Anal. 1993, 46, 309–334.
24. Young, D.M.; Seaman, S.L.; Seaman, J.W. A comparison of six test statistics for detecting multivariate nonnormality which utilize the multivariate squared-radii statistic. Texas J. Sci. 1995, 47, 21–38.
25. Beirlant, J.; Mason, D.M.; Vynckier, C. Goodness-of-fit analysis for multivariate normality based on generalized quantiles. Comput. Stat. Data Anal. 1999, 30, 119–142.
26. Mecklin, C.J. A Comparison of the Power of Classical and Newer Tests of Multivariate Normality. Ph.D. Thesis, University of Northern Colorado, Greeley, CO, USA, 2000.
27. Mecklin, C.J.; Mundfrom, D.J. A Monte Carlo comparison of the Type I and Type II error rates of tests of multivariate normality. J. Stat. Comput. Simul. 2005, 75, 93–107.
28. Ward, P.J. Goodness-of-Fit Tests for Multivariate Normality. Ph.D. Thesis, University of Alabama, Tuscaloosa, AL, USA, 1988.
29. Ahn, S.K. F-probability plot and its applications to multivariate normality. Commun. Stat. Theory Methods 1992, 21, 997–1023.
30. Fang, K.T.; He, S.D. The Problem of Selecting a Given Number of Representative Points in a Normal Population and a Generalized Mill’s Ratio; Technical Report; U.S. Army Research Office Contract DAAG 29-82-K-0156; Department of Statistics, Stanford University: Stanford, CA, USA, 1982.
31. Flury, B. Estimation of principal points. Appl. Stat. 1993, 42, 139–151.
32. Cox, D.R. Note on grouping. J. Am. Stat. Assoc. 1957, 52, 543–547.
33. Max, J. Quantizing for minimum distortion. IEEE Trans. Inf. Theory 1960, 6, 7–12.
34. Fang, K.T. Application of the theory of the conditional distribution for the standardization of clothes. Acta Math. Appl. Sin. 1976, 2, 62–74. (In Chinese)
35. Flury, B. Principal points. Biometrika 1990, 77, 33–41.
36. Tarpey, T. Self-consistency algorithms. J. Comput. Graph. Stat. 1999, 8, 889–905.
37. Fang, K.; Zhou, M.; Wang, W. Applications of the representative points in statistical simulations. Sci. China Math. 2014, 57, 2609–2620. (In Chinese)
38. Fang, K.; He, P.; Yang, J. Set of representative points of statistical distributions and their applications. Sci. Sin. Math. 2020, 50, 1–20. (In Chinese)
39. Feller, W. An Introduction to Probability Theory and Its Applications; Wiley: New York, NY, USA, 1970; Volume 2.
40. Van der Vaart, A.W. Asymptotic Statistics; Cambridge University Press: New York, NY, USA, 1998.
41. Al-Labadi, L.; Fazeli Asl, F.; Saberi, Z. A necessary Bayesian nonparametric test for assessing multivariate normality. Math. Methods Stat. 2021, 30, 64–81.
42. Sturges, H. The choice of a class-interval. J. Am. Stat. Assoc. 1926, 21, 65–66.
43. Mann, H.; Wald, A. On the choice of the number of class intervals in the application of the chi square test. Ann. Math. Stat. 1942, 13, 306–317.
44. Williams, C.A. On the choice of the number and width of classes for the chi-square test of goodness of fit. J. Am. Stat. Assoc. 1950, 45, 77–86.
45. Dahiya, R.C.; Gurland, J. How many classes in the Pearson chi-square test? J. Am. Stat. Assoc. 1973, 68, 707–712.
46. Mineo, A. A new grouping method for the right evaluation of the chi-square test of goodness-of-fit. Scand. J. Stat. 1979, 6, 145–153.
47. Harrison, R.H. Choosing the optimum number of classes in the chi-square test for arbitrary power levels. Indian J. Stat. 1985, 47, 319–324.
48. Kallenberg, W. On moderate and large deviations in multinomial distributions. Ann. Stat. 1985, 13, 1554–1580.
49. Kallenberg, W.; Oosterhoff, J.; Schriever, B. The number of classes in chi-squared goodness-of-fit tests. J. Am. Stat. Assoc. 1985, 80, 959–968.
50. Oosterhoff, J. The choice of cells in chi-square tests. Stat. Neerl. 1985, 39, 115–128.
51. Quine, M.; Robinson, J. Efficiencies of chi-square and likelihood ratio goodness-of-fit tests. Ann. Stat. 1985, 13, 727–742.
52. D’Agostino, R.B.; Stephens, M.A. Goodness-of-Fit Techniques; Statistics: Textbooks and Monographs; Marcel Dekker: New York, NY, USA, 1986.
53. Koehler, K.; Gann, F. Chi-squared goodness-of-fit tests: Cell selection and power. Commun. Stat. Simul. Comput. 1990, 19, 1265–1278.
54. Bogdan, M. Data driven version of Pearson’s chi-square test for uniformity. J. Stat. Comput. Simul. 1995, 52, 217–237.
55. Goodman, I.R.; Kotz, S. Multivariate θ-generalized normal distributions. J. Multivar. Anal. 1973, 3, 204–219.
56. Henze, N.; Wagner, T. A new approach to the BHEP tests for multivariate normality. J. Multivar. Anal. 1997, 62, 1–23.
57. Szekely, G.J.; Rizzo, M.L. The energy of data. Annu. Rev. Stat. Appl. 2017, 4, 447–479.
58. Elston, R.C.; Grizzle, J.E. Estimation of time-response curves and their confidence bands. Biometrics 1962, 18, 148–159.
59. Timm, N.H. Applied Multivariate Analysis; Springer: New York, NY, USA, 2002.
60. Zhou, M.; Shao, Y. A powerful test for multivariate normality. J. Appl. Stat. 2014, 41, 351–363.
61. Fisher, R.A. The use of multiple measurements in taxonomic problems. Ann. Eugen. 1936, 7, 179–188.
62. Srivastava, D.K.; Mudholkar, G.S. Goodness-of-fit tests for univariate and multivariate normal models. In Handbook of Statistics 22: Statistics in Industry; Elsevier: Amsterdam, The Netherlands, 2003.
63. Shao, Y.; Zhou, M. A characterization of multivariate normality through univariate projections. J. Multivar. Anal. 2010, 101, 2637–2640.
64. Small, N. Marginal skewness and kurtosis in testing multivariate normality. Appl. Stat. 1980, 29, 85–87.
65. Rao, C.R. Tests of significance in multivariate analysis. Biometrika 1948, 35, 58–79.
66. Srivastava, M.S.; Hui, T.K. On assessing multivariate normality based on Shapiro–Wilk W statistic. Stat. Prob. Lett. 1987, 5, 15–18.
67. Shapiro, S.S.; Wilk, M.B. An analysis of variance test for normality (complete samples). Biometrika 1965, 52, 591–611.
68. Batsidis, A.; Martin, N.; Pardo, L.; Zografos, K. A necessary power divergence type family of tests of multivariate normality. Commun. Stat. Simul. Comput. 2013, 42, 2253–2271.
69. Malkovich, J.F.; Afifi, A.A. On tests for multivariate normality. J. Am. Stat. Assoc. 1973, 68, 176–179.
70. McAssey, M.P. An empirical goodness-of-fit test for multivariate distributions. J. Appl. Stat. 2013, 40, 1120–1131.
71. Chakraborty, S.; Roychowdhury, M.K.; Sifuentes, J. High precision numerical computation of principal points for univariate distributions. Sankhyā B 2021, 83 (Suppl. 2), 558–584.
Figure 1. Power (α = 0.05) in multivariate t distribution.
Figure 2. Power (α = 0.05) in β g-normal distribution.
Figure 3. Power (α = 0.05) in distribution N(0,1) + χ²(2).
Figure 4. Power (α = 0.05) in shifted i.i.d. χ²(1).
Figure 5. Power (α = 0.05) in shifted i.i.d. exp(1).
Figure 6. Power (α = 0.05) comparison in multivariate t distribution.
Figure 7. Power (α = 0.05) comparison in β g-normal distribution.
Table 1. Empirical type I error rates (α = 0.05).

n      m     Test    d = 3    d = 5    d = 10   d = 15   d = 20
20     10    χ_R²    0.056    0.060    0.064    0.056    0.072
             χ_T²    0.040    0.022    0.032    0.030    0.024
       20    χ_R²    0.050    0.070    0.061    0.064    0.062
             χ_T²    0.038    0.026    0.038    0.038    0.022
       30    χ_R²    0.062    0.070    0.064    0.074    0.078
             χ_T²    0.042    0.032    0.036    0.038    0.028
50     10    χ_R²    0.032    0.035    0.048    0.046    0.035
             χ_T²    0.027    0.030    0.039    0.038    0.033
       20    χ_R²    0.049    0.059    0.059    0.065    0.064
             χ_T²    0.037    0.036    0.034    0.036    0.035
       30    χ_R²    0.057    0.074    0.072    0.070    0.082
             χ_T²    0.038    0.037    0.033    0.037    0.040
100    10    χ_R²    0.032    0.034    0.048    0.046    0.035
             χ_T²    0.025    0.031    0.034    0.025    0.036
       20    χ_R²    0.062    0.067    0.044    0.048    0.054
             χ_T²    0.041    0.025    0.038    0.032    0.031
       30    χ_R²    0.059    0.060    0.065    0.060    0.062
             χ_T²    0.040    0.036    0.043    0.036    0.032
200    10    χ_R²    0.041    0.038    0.022    0.028    0.030
             χ_T²    0.034    0.038    0.025    0.030    0.031
       20    χ_R²    0.057    0.046    0.041    0.043    0.041
             χ_T²    0.036    0.038    0.032    0.040    0.040
       30    χ_R²    0.072    0.069    0.051    0.039    0.049
             χ_T²    0.042    0.046    0.039    0.034    0.039
400    10    χ_R²    0.027    0.034    0.034    0.038    0.037
             χ_T²    0.036    0.032    0.032    0.037    0.036
       20    χ_R²    0.043    0.041    0.037    0.045    0.047
             χ_T²    0.040    0.033    0.039    0.032    0.033
       30    χ_R²    0.058    0.054    0.046    0.050    0.044
             χ_T²    0.042    0.033    0.041    0.036    0.037
1000   10    χ_R²    0.034    0.038    0.036    0.050    0.042
             χ_T²    0.032    0.038    0.028    0.036    0.044
       20    χ_R²    0.038    0.044    0.042    0.042    0.040
             χ_T²    0.044    0.054    0.050    0.060    0.028
       30    χ_R²    0.034    0.046    0.036    0.054    0.042
             χ_T²    0.032    0.046    0.026    0.046    0.038
Table 2. p-values from the two chi-squared tests.

Variables                    Test    m = 10    m = 20         m = 30
X_8, X_8.5, X_9, X_9.5       χ_R²    0.0016    1.661 × 10⁻⁶   0.0005
                             χ_T²    0.0669    0.1302         0.0839
X_sl, X_sw, X_pl, X_pw       χ_R²    0.0066    0.0199         0.0076
                             χ_T²    0.0179    0.0397         0.0979
X_N, X_E, X_S, X_W           χ_R²    0.0004    < 10⁻¹⁰        < 10⁻¹⁰
                             χ_T²    0.7110    0.3610         0.7566
Table 3. Skewness and kurtosis for each measurement and feature.

Variables   Skewness   Kurtosis   UVN Test ¹   Mardia’s MVN Test ²
X_8          0.3069    −1.0682    0.3360       Sk: 0.0093
X_8.5        0.3111    −0.8932    0.6020       Ku: 0.1125
X_9          0.0645    −1.2076    0.5016
X_9.5        0.0648    −1.4332    0.0905
X_sl         0.3118    −0.5736    0.0102       Sk: 4.7570 × 10⁻⁷
X_sw         0.3158     0.1810    0.1012       Ku: 0.8180
X_pl        −0.2721    −1.3955    < 0.0001
X_pw        −0.1019    −1.3361    < 0.0001
X_N          0.8088    −0.3535    0.0179       Sk: 7.6249 × 10⁻⁹
X_E          0.8322    −0.2680    0.0135       Ku: 0.0076
X_S         −0.1289    −0.3767    0.0361
X_W          0.4417    −0.7800    0.1185

¹ p-values from univariate Shapiro–Wilk tests. ² p-values from Mardia’s multivariate skewness and kurtosis tests.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
