1. Introduction
The study of high-dimensional random orthogonal and unitary matrices can be traced to a famous paper of É. Borel [1] in which the following result is proved: Let $x_1$ denote the first coordinate of $x$, an $n$-dimensional random vector that is uniformly distributed on the unit sphere in $\mathbb{R}^n$; then, as $n \to \infty$, the random variable $n^{1/2} x_1$ converges in distribution to $Z$, a standard normal random variable.
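Borel's theorem is easy to observe numerically. The following sketch (Python with NumPy, which we assume available; it is an illustration, not part of the formal development) samples points uniformly on the sphere by normalizing Gaussian vectors and checks that $n^{1/2}x_1$ has approximately zero mean and unit variance.

```python
import numpy as np

rng = np.random.default_rng(0)

def uniform_on_sphere(n, size, rng):
    """Draw `size` points uniformly on the unit sphere in R^n
    by normalizing standard Gaussian vectors."""
    g = rng.standard_normal((size, n))
    return g / np.linalg.norm(g, axis=1, keepdims=True)

n, size = 1000, 20000
x1 = uniform_on_sphere(n, size, rng)[:, 0]
scaled = np.sqrt(n) * x1          # Borel: sqrt(n) * x_1 is approximately N(0,1)

print(scaled.mean(), scaled.var())
```

The empirical mean and variance are close to $0$ and $1$, as Borel's theorem predicts for large $n$.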
Subsequent to Borel’s paper, there has ensued a literature of substantial size. We mention, as only a few of the papers in this area, the articles of Weingarten [2], Diaconis and Freedman [3], Diaconis, Eaton and Lauritzen [4], Diaconis and Shahshahani [5], Johansson [6], Rains [7], Diaconis and Evans [8], D’Aristotile, Diaconis and Newman [9], Pastur and Vasilchuk [10], Collins and Śniady [11], Meckes [12], Fulman [13], and Jiang [14]. A reader interested in exploring the field further may obtain from those papers many references to the area.
In a survey of the literature, we were especially intrigued by a result of D’Aristotile, Diaconis and Newman [9]. We denote by $O(n)$ the group of $n \times n$ orthogonal matrices, and by the uniform distribution on $O(n)$ we mean the Haar measure, normalized to be a probability distribution. Further, we let $\mathcal{N}(0,1)$ denote the standard normal distribution. Then the result is as follows:

Theorem 1.1. (D’Aristotile et al. [9]) Let $\{A_n\}$ be a sequence of real matrices such that $A_n$ is $n \times n$ and $\operatorname{tr}(A_n A_n') = n$, and let $H_n$ be a random orthogonal matrix that is uniformly distributed on $O(n)$. Then $\operatorname{tr}(A_n H_n)$ converges in distribution to $\mathcal{N}(0,1)$ as $n \to \infty$.

The proof given by D’Aristotile et al. [9] is based on classical probabilistic methods involving tightness. Their result was later studied by Meckes [12], who obtained a bound on the distance, in the total variation metric on the set of probability distributions, between the distribution of $\operatorname{tr}(A_n H_n)$ and the standard normal distribution; as a consequence, Meckes obtained an explicit formula for the rate of convergence to normality.
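For readers who wish to experiment, Theorem 1.1 is easy to illustrate by simulation. The sketch below (Python/NumPy; the helper name `haar_orthogonal` is our own) samples Haar-distributed orthogonal matrices via the QR decomposition of a Gaussian matrix with the standard sign correction, and takes $A_n = I_n$, which satisfies the normalization $\operatorname{tr}(A_nA_n') = n$ assumed in the theorem, so that $\operatorname{tr}(A_nH_n) = \operatorname{tr}(H_n)$.

```python
import numpy as np

rng = np.random.default_rng(1)

def haar_orthogonal(n, rng):
    """Sample from the Haar (uniform) distribution on O(n) via the QR
    decomposition of a Gaussian matrix, with the standard sign fix."""
    q, r = np.linalg.qr(rng.standard_normal((n, n)))
    return q * np.sign(np.diag(r))

n, reps = 60, 3000
# A_n = I_n gives tr(A_n H_n) = tr(H_n), which should be nearly N(0,1).
samples = np.array([np.trace(haar_orthogonal(n, rng)) for _ in range(reps)])
print(samples.mean(), samples.var())
```

The sample mean and variance are close to $0$ and $1$, consistent with the theorem.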
It was particularly striking to us that, throughout the existing literature on high-dimensional random matrices from the classical compact matrix groups, the theory of generalized hypergeometric functions of matrix argument appears not to have played an explicit role. We found this absence intriguing because it has been known since the work of Herz [15] that the characteristic function of a uniformly distributed random orthogonal matrix can be expressed in terms of the Bessel functions of matrix argument; indeed, a primary motivation for the invention of those Bessel functions was the study of random matrices which are uniformly distributed on $O(n)$.
In this paper, we provide a heuristic derivation of Theorem 1.1. To that end, we will present such features of the theory of the zonal polynomials and of a generalized hypergeometric function of matrix argument as are necessary to make the paper self-contained. It is also noteworthy that the approach given here applies with ease, mutatis mutandis, to cases in which the matrix $H_n$ is uniformly distributed on the unitary group or the symplectic group, and to cases in which $H_n$ is a rectangular random matrix uniformly distributed on a Stiefel manifold corresponding to one of the classical compact matrix groups. In short, the theory of the generalized hypergeometric functions of matrix argument lends itself readily to the study of linear functions of high-dimensional random matrices from the classical compact matrix groups.
Conversely, the study of high-dimensional orthogonal and unitary matrices also yields new results for the Bessel functions of matrix argument. By application of a result of Johansson [6], we will obtain an upper bound on the distance, in the supremum norm on $\mathbb{R}$, between a certain generalized hypergeometric function of scalar matrix argument and the Gaussian quantity, $e^{-t^2/2}$, $t \in \mathbb{R}$.
2. Zonal Polynomials and a Generalized Hypergeometric Function of Matrix Argument
Throughout the paper, we denote the determinant and trace of a square matrix $A$ by $\det A$ and $\operatorname{tr} A$, respectively. We also denote by $I_n$ the identity matrix of order $n$. We denote by $E$ the generic operation of expectation with respect to a probability distribution which, on all occasions, will be explicit from the context.
A partition $\lambda = (\lambda_1, \lambda_2, \ldots)$ is a vector of non-negative integers that are weakly decreasing: $\lambda_1 \ge \lambda_2 \ge \cdots$. The entries $\lambda_j$ are called the parts of $\lambda$; the length of $\lambda$ is the number of non-zero parts; and the weight of $\lambda$ is $|\lambda| = \lambda_1 + \lambda_2 + \cdots$.
The set of partitions of a given weight may be ordered lexicographically: If $\lambda$ and $\mu$ are partitions of the same weight then we write $\lambda < \mu$ if $\lambda_j < \mu_j$ for the first index $j$ such that corresponding parts are unequal.
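These combinatorial notions are easy to experiment with. The sketch below (Python; `partitions` is our own helper, not notation from the paper) generates all partitions of weight $k$ in decreasing lexicographic order; since Python tuples compare lexicographically, tuple comparison coincides with the ordering just defined for partitions of equal weight.

```python
def partitions(k, max_part=None):
    """Generate all partitions of weight k, in decreasing lexicographic
    order, each as a tuple of weakly decreasing positive parts."""
    if max_part is None:
        max_part = k
    if k == 0:
        yield ()
        return
    for first in range(min(k, max_part), 0, -1):
        for rest in partitions(k - first, first):
            yield (first,) + rest

parts4 = list(partitions(4))
print(parts4)   # (4,), (3,1), (2,2), (2,1,1), (1,1,1,1)
```

For example, $(2,2) > (2,1,1)$ because the parts first differ at the second index, where $2 > 1$.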
We shall encounter in the sequel the quantity,
$$\rho_\lambda = \sum_{j \ge 1} \lambda_j (\lambda_j - j). \qquad (2.1)$$
Perhaps coincidentally, the term $\rho_\lambda$ has appeared before now in the theory of zonal polynomials. James [16], in proving that the zonal polynomial $C_\lambda$ is an eigenfunction of the Laplace–Beltrami operator on the cone of positive definite matrices, shows that $\rho_\lambda$ appears in the expression for the corresponding eigenvalue; see also Muirhead [17] (p. 229, Equation (5)) and Richards [18].
We will also need the following extremal property of $\rho_\lambda$.

Lemma 2.1. Among all partitions $\lambda$ of weight $k$, the quantity $\rho_\lambda$ is maximized at the lexicographically maximal partition $(k)$ and minimized at the lexicographically minimal partition $(1,\ldots,1)$. In particular,
$$-\tfrac{1}{2}k(k-1) \le \rho_\lambda \le k(k-1) \qquad (2.2)$$
for all partitions $\lambda$ of weight $k$.

Proof. Since the parts of $\lambda$ are weakly decreasing, $\lambda_j - j \le \lambda_1 - 1 \le k - 1$ for every $j$; therefore
$$\rho_\lambda = \sum_{j \ge 1} \lambda_j(\lambda_j - j) \le (k-1)\sum_{j \ge 1}\lambda_j = k(k-1),$$
with equality when $\lambda = (k)$. On the other hand, writing $\rho_\lambda = \sum_j \lambda_j^2 - \sum_j j\lambda_j$, we have $\sum_j \lambda_j^2 \ge \sum_j \lambda_j = k$, while $\sum_j j\lambda_j \le \sum_{j=1}^{k} j = \tfrac12 k(k+1)$, the latter sum being largest for the partition $(1,\ldots,1)$; therefore
$$\rho_\lambda \ge k - \tfrac12 k(k+1) = -\tfrac12 k(k-1),$$
with equality when $\lambda = (1,\ldots,1)$. Thus, we obtain Equation (2.2). □
For $a \in \mathbb{C}$ and any nonnegative integer $j$, the rising factorial, $(a)_j$, is defined as
$$(a)_j = a(a+1)(a+2)\cdots(a+j-1), \qquad (a)_0 = 1. \qquad (2.3)$$
Corresponding to each partition $\lambda$, the partitional rising factorial, $(a)_\lambda$, is defined as
$$(a)_\lambda = \prod_{j \ge 1} \left(a - \tfrac{1}{2}(j-1)\right)_{\lambda_j}. \qquad (2.4)$$
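The two definitions translate directly into code (Python; the function names are our own, and we assume the normalization of Equation (2.4), which is the standard one for the real, zonal case).

```python
def rising(a, j):
    """Classical rising factorial (a)_j = a (a+1) ... (a+j-1), with (a)_0 = 1."""
    out = 1.0
    for i in range(j):
        out *= a + i
    return out

def rising_partition(a, lam):
    """Partitional rising factorial (a)_lambda = prod_j (a - (j-1)/2)_{lambda_j}."""
    out = 1.0
    for j, p in enumerate(lam, start=1):
        out *= rising(a - (j - 1) / 2, p)
    return out

# For a one-part partition (k), (a)_{(k)} reduces to the classical (a)_k.
print(rising_partition(2.5, (3,)), rising(2.5, 3))
print(rising_partition(2.5, (2, 1)))   # (2.5)_2 * (2.0)_1 = 8.75 * 2 = 17.5
```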
Let $S$ be a real symmetric $n \times n$ matrix. For each partition $\lambda$, we denote by $C_\lambda(S)$ the zonal polynomial of the matrix $S$. A complete description of the zonal polynomials may be obtained from James [19], Muirhead [17], or Gross and Richards [20]. Noting that the present paper deals directly with aspects of integration over the orthogonal group $O(n)$, we remark that a direct definition of the zonal polynomials may be obtained as follows: For any symmetric $n \times n$ matrix $S$, and for $j = 1, \ldots, n$, denote by $S_j$ the principal minor of order $j$ of $S$. Let
$$\phi_\lambda(S) = \prod_{j=1}^{n} (\det S_j)^{\lambda_j - \lambda_{j+1}}, \qquad (2.5)$$
with $\lambda_{n+1} = 0$, be the power function corresponding to the partition $\lambda$. Denote by $dH$ the Haar measure on $O(n)$, normalized to be a probability measure. Then $C_\lambda(S)$, the zonal polynomial corresponding to the partition $\lambda$, may be defined by
$$C_\lambda(S) = c_\lambda \int_{O(n)} \phi_\lambda(H'SH)\, dH, \qquad (2.6)$$
where the normalizing constants $c_\lambda$ are positive and are chosen uniquely so that
$$\sum_{|\lambda| = k} C_\lambda(S) = (\operatorname{tr} S)^k \qquad (2.7)$$
for all $k$.
Integral representations of the type given in Equation (2.6) have played a crucial role in earlier studies of central limit theorems for positive definite random matrices (Richards [22]).
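The normalization (2.7) can be verified directly in low degree. For weight $k = 2$ the two zonal polynomials have the classical closed forms $C_{(2)}(S) = \frac13\left[(\operatorname{tr} S)^2 + 2\operatorname{tr}(S^2)\right]$ and $C_{(1,1)}(S) = \frac23\left[(\operatorname{tr} S)^2 - \operatorname{tr}(S^2)\right]$; these explicit values are taken from standard tables (e.g., Muirhead [17]), not from the present paper. The sketch below (Python/NumPy) checks that they sum to $(\operatorname{tr} S)^2$ for a random symmetric matrix.

```python
import numpy as np

def C2(S):
    """C_(2)(S) = ((tr S)^2 + 2 tr(S^2)) / 3, from standard zonal tables."""
    t1, t2 = np.trace(S), np.trace(S @ S)
    return (t1**2 + 2 * t2) / 3

def C11(S):
    """C_(1,1)(S) = 2((tr S)^2 - tr(S^2)) / 3, from standard zonal tables."""
    t1, t2 = np.trace(S), np.trace(S @ S)
    return 2 * (t1**2 - t2) / 3

rng = np.random.default_rng(2)
G = rng.standard_normal((4, 4))
S = G @ G.T                        # a random symmetric positive definite matrix
lhs = C2(S) + C11(S)
rhs = np.trace(S) ** 2             # normalization (2.7) with k = 2
print(lhs, rhs)
```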
We now introduce a generalized hypergeometric function of matrix argument. Let $b \in \mathbb{C}$ be such that $\tfrac12(j-1) - b$ is not a non-negative integer for all $j = 1, \ldots, n$. For any symmetric $n \times n$ matrix $S$, we define a generalized hypergeometric function of matrix argument,
$${}_0F_1(b; S) = \sum_{k=0}^{\infty} \frac{1}{k!} \sum_{|\lambda| = k} \frac{C_\lambda(S)}{(b)_\lambda}, \qquad (2.8)$$
where the inner summation is over all partitions $\lambda$ of weight $k$.
By a result of Gross and Richards [20] (Theorem 6.3), the series in Equation (2.8) converges absolutely for all $S$. With $b = \tfrac{n}{2}$, it is a result of Herz [15] (p. 423; see also James [19]) that for any $n \times m$ real matrix $A$, there holds the integral formula,
$$\int e^{\operatorname{tr}(A'H)}\, dH = {}_0F_1\!\left(\tfrac{n}{2};\, \tfrac{1}{4}A'A\right), \qquad (2.9)$$
the integral being taken with respect to the uniform distribution on the set of $n \times m$ matrices $H$ with orthonormal columns (for $m = n$, the orthogonal group $O(n)$). This result generalizes a well-known formula that expresses a classical Bessel function as an integral over the unit circle; for this reason, the function ${}_0F_1$ also is viewed as a Bessel function of matrix argument.
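The classical identity alluded to here is $\frac{1}{2\pi}\int_0^{2\pi} e^{\mathrm{i}a\cos\theta}\,d\theta = J_0(a) = {}_0F_1\!\left(1; -\tfrac{a^2}{4}\right)$, the $1 \times 1$ case of Equation (2.9). A minimal numerical check (Python/NumPy; `f01_scalar` is our own helper for the scalar series):

```python
import numpy as np

def f01_scalar(b, x, terms=60):
    """Scalar hypergeometric 0F1(; b; x) = sum_k x^k / ((b)_k k!)."""
    total, term = 0.0, 1.0
    for k in range(terms):
        total += term
        term *= x / ((b + k) * (k + 1))   # ratio of consecutive series terms
    return total

a = 1.7
theta = np.linspace(0.0, 2 * np.pi, 4096, endpoint=False)
# Average of e^{i a cos(theta)} over the unit circle equals J_0(a).
integral = np.exp(1j * a * np.cos(theta)).real.mean()
series = f01_scalar(1.0, -a**2 / 4)
print(integral, series)
```

The two quantities agree to high precision, since the equispaced average of a smooth periodic integrand is spectrally accurate.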
3. The Case of the Stiefel Manifold
We regard this section as preparatory for the ensuing new approach to Theorem 1.1, for the method of hypergeometric functions of matrix argument very easily yields the high-dimensional asymptotic behavior of random matrices taking values in Stiefel manifolds.
Denote by $V_{m,n}$ the Stiefel manifold of all $m$-tuples of orthonormal $n$-dimensional vectors; equivalently, $V_{m,n}$ is the set of $n \times m$ real matrices $H$ such that $H'H = I_m$. As a homogeneous space, $V_{m,n} \cong O(n)/O(n-m)$, hence is compact. An explicit description of the unique $O(n)$-invariant uniform distribution on $V_{m,n}$ is given by Herz [15]. The following result is both a generalization of Borel’s result for the unit sphere and an analog of Theorem 1.1 for the Stiefel manifold.
Theorem 3.1. Let $m$ be a fixed positive integer, and let $\{A_n\}$ be a sequence of real matrices such that $A_n$ is $n \times m$ and $\operatorname{tr}(A_n'A_n) = n$. For each $n \ge m$, let $H_n$ be a random matrix that is uniformly distributed on $V_{m,n}$. Then $\operatorname{tr}(A_n'H_n)$ converges in distribution to $\mathcal{N}(0,1)$ as $n \to \infty$.
To see how this result is obtained, we apply Equation (2.9) to obtain, for $t \in \mathbb{R}$,
$$E\, e^{\mathrm{i}t \operatorname{tr}(A_n'H_n)} = {}_0F_1\!\left(\tfrac{n}{2};\, -\tfrac{t^2}{4}\, A_n'A_n\right).$$
Because $A_n'A_n$ is an $m \times m$ matrix then, by Equation (2.6), $C_\lambda(A_n'A_n) = 0$ if $\lambda$ has length greater than $m$; therefore, in this case, the zonal polynomial expansion involves partitions of length at most $m$ only.

By Equation (2.4), we obtain for any partition $\lambda$ of weight $k$,
$$\left(\tfrac{n}{2}\right)_\lambda = \prod_{j=1}^{m} \left(\tfrac{n}{2} - \tfrac{1}{2}(j-1)\right)_{\lambda_j} \sim \left(\tfrac{n}{2}\right)^k \qquad (3.1)$$
for large $n$, and so, applying also Equation (2.7) and the condition $\operatorname{tr}(A_n'A_n) = n$, we obtain
$${}_0F_1\!\left(\tfrac{n}{2};\, -\tfrac{t^2}{4}A_n'A_n\right) = \sum_{k=0}^{\infty} \frac{(-t^2/4)^k}{k!} \sum_{|\lambda|=k} \frac{C_\lambda(A_n'A_n)}{(\tfrac{n}{2})_\lambda} \sim \sum_{k=0}^{\infty} \frac{(-t^2/4)^k}{k!} \cdot \frac{n^k}{(n/2)^k}.$$
Thus, as $n \to \infty$,
$${}_0F_1\!\left(\tfrac{n}{2};\, -\tfrac{t^2}{4}A_n'A_n\right) \to \sum_{k=0}^{\infty} \frac{(-t^2/2)^k}{k!} = e^{-t^2/2},$$
which establishes that $\operatorname{tr}(A_n'H_n)$ converges in distribution to $\mathcal{N}(0,1)$.
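In the scalar case $m = 1$ with $A_n'A_n = n$, the limit just derived reads ${}_0F_1\!\left(\tfrac{n}{2}; -\tfrac{n t^2}{4}\right) \to e^{-t^2/2}$, which can be observed numerically (Python; `f01_scalar` is our own series helper):

```python
import numpy as np

def f01_scalar(b, x, terms=200):
    """Scalar 0F1(; b; x) computed from its power series."""
    total, term = 0.0, 1.0
    for k in range(terms):
        total += term
        term *= x / ((b + k) * (k + 1))
    return total

t = 1.3
# m = 1 and A_n'A_n = n: the characteristic function is 0F1(n/2; -n t^2/4).
approx = {n: f01_scalar(n / 2, -n * t**2 / 4) for n in (10, 100, 1000, 10000)}
target = np.exp(-t**2 / 2)
for n, val in approx.items():
    print(n, val, abs(val - target))
```

The error visibly shrinks (roughly like $1/n$) as $n$ grows.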
For general $A_n$ satisfying the hypotheses of Theorem 3.1, the argument given above also leads to the conclusion,
$$\lim_{n \to \infty} E\, e^{\mathrm{i}t \operatorname{tr}(A_n'H_n)} = e^{-t^2/2}, \qquad t \in \mathbb{R}.$$
We deduce, by applying the standard Cramér–Wold device, that for large $n$ the entries of the matrix $n^{1/2} H_n$ are asymptotically multivariate normally distributed with mean $0$ and identity covariance matrix. We note also that a similar conclusion may be obtained for the results to follow.
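A simulation of Theorem 3.1 (Python/NumPy; `haar_stiefel` is our own helper). We take $A_n = \sqrt{n/m}\,\begin{pmatrix} I_m \\ 0 \end{pmatrix}$, which satisfies $\operatorname{tr}(A_n'A_n) = n$ as the theorem assumes:

```python
import numpy as np

rng = np.random.default_rng(3)

def haar_stiefel(n, m, rng):
    """Uniform random n x m matrix with orthonormal columns (Stiefel
    manifold V_{m,n}), via reduced QR of a Gaussian matrix with sign fix."""
    q, r = np.linalg.qr(rng.standard_normal((n, m)))
    return q * np.sign(np.diag(r))

n, m, reps = 200, 3, 3000
A = np.sqrt(n / m) * np.eye(n, m)       # tr(A'A) = n, as in Theorem 3.1
samples = np.array([np.trace(A.T @ haar_stiefel(n, m, rng)) for _ in range(reps)])
print(samples.mean(), samples.var())
```

The empirical distribution of $\operatorname{tr}(A_n'H_n)$ is close to standard normal, as the theorem predicts.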
4. The Case of the Orthogonal Group
We now present a new approach to Theorem 1.1. In this setting, $A_n$ is an $n \times n$ real matrix satisfying the condition $\operatorname{tr}(A_nA_n') = n$, and the random matrix $H_n$ is uniformly distributed on $O(n)$. Then, for $t \in \mathbb{R}$, we again apply Equation (2.9) to deduce that the characteristic function of the random variable $\operatorname{tr}(A_nH_n)$ is
$$E\, e^{\mathrm{i}t \operatorname{tr}(A_nH_n)} = {}_0F_1\!\left(\tfrac{n}{2};\, -\tfrac{t^2}{4}\, A_nA_n'\right).$$
On expanding the ${}_0F_1$ function in a series of zonal polynomials, we obtain a generating function for the moments of the random variable $\operatorname{tr}(A_nH_n)$:
$$E\, e^{\mathrm{i}t \operatorname{tr}(A_nH_n)} = \sum_{k=0}^{\infty} \frac{(-t^2/4)^k}{k!} \sum_{|\lambda|=k} \frac{C_\lambda(A_nA_n')}{(\tfrac{n}{2})_\lambda}.$$
On comparing the coefficients of like powers of $t$ we deduce that, for $k = 0, 1, 2, \ldots$,
$$E\left[\operatorname{tr}(A_nH_n)\right]^{2k+1} = 0 \quad\text{and}\quad E\left[\operatorname{tr}(A_nH_n)\right]^{2k} = \frac{(2k)!}{4^k\, k!} \sum_{|\lambda|=k} \frac{C_\lambda(A_nA_n')}{(\tfrac{n}{2})_\lambda}. \qquad (4.1)$$
We now examine the asymptotic behavior of the $k$th moment of $\operatorname{tr}(A_nH_n)$ as $n \to \infty$. For a partition $\lambda$ of weight $k$, the same argument used at Equation (3.1), refined by a Taylor–Maclaurin expansion, yields
$$\left(\tfrac{n}{2}\right)_\lambda = \left(\tfrac{n}{2}\right)^k \left(1 + \frac{\rho_\lambda}{n} + O(n^{-2})\right)$$
as $n \to \infty$, where $\rho_\lambda$ is the quantity first encountered at Equation (2.1). Substituting this result into Equation (4.1), we obtain
$$E\left[\operatorname{tr}(A_nH_n)\right]^{2k} = \frac{(2k)!}{4^k\, k!} \left(\tfrac{n}{2}\right)^{-k} \sum_{|\lambda|=k} C_\lambda(A_nA_n') \left(1 - \frac{\rho_\lambda}{n} + O(n^{-2})\right).$$
On applying Equation (2.7) and the condition $\operatorname{tr}(A_nA_n') = n$, the leading term equals $(2k)!/(2^k k!)$. By Equation (2.6), it follows that $C_\lambda(A_nA_n') \ge 0$; hence, by applying Equation (2.2), we obtain
$$\left|\sum_{|\lambda|=k} \frac{\rho_\lambda}{n}\, C_\lambda(A_nA_n')\right| \le \frac{k(k-1)}{n} \sum_{|\lambda|=k} C_\lambda(A_nA_n') = \frac{k(k-1)}{n}\, n^k,$$
so that, after division by $n^k$, the error term is at most $k(k-1)/n$, an upper bound whose numerator is not dependent on $n$. Therefore, we conclude that for fixed $k$,
$$E\left[\operatorname{tr}(A_nH_n)\right]^{2k} \to \frac{(2k)!}{2^k\, k!} = (2k-1)!!$$
as $n \to \infty$, where $(2k-1)!!$ is the $(2k)$th moment of the standard normal distribution; the odd moments vanish identically. Finally, we apply the moment problem (Loève [23], p. 185) to deduce that $\operatorname{tr}(A_nH_n)$ converges in distribution to $\mathcal{N}(0,1)$.
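The moment computation above predicts, for example, $E[\operatorname{tr}(A_nH_n)]^2 \to 1$ and $E[\operatorname{tr}(A_nH_n)]^4 \to 3$. A Monte Carlo sketch with $A_n = I_n$ (Python/NumPy; `haar_orthogonal` is our own helper, and the tolerances below are loose sampling tolerances):

```python
import numpy as np

rng = np.random.default_rng(4)

def haar_orthogonal(n, rng):
    """Haar-distributed element of O(n) via QR with the sign fix."""
    q, r = np.linalg.qr(rng.standard_normal((n, n)))
    return q * np.sign(np.diag(r))

n, reps = 40, 4000
tr = np.array([np.trace(haar_orthogonal(n, rng)) for _ in range(reps)])
m2, m4 = (tr**2).mean(), (tr**4).mean()
# Moments of N(0,1): E Z^2 = 1, E Z^4 = 3; the moment problem then
# identifies the limiting distribution as standard normal.
print(m2, m4)
```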
We remark that the condition $\operatorname{tr}(A_nA_n') = n$ can be weakened to require only that $n^{-1}\operatorname{tr}(A_nA_n') \to 1$, with a sufficiently fast rate of convergence, as $n \to \infty$.
It is also interesting to discover that the study of high-dimensional random orthogonal matrices yields a new inequality for the generalized hypergeometric function, ${}_0F_1$, of scalar matrix argument.
Proposition 4.1. There exist positive constants $c$ and $d$ such that, for all $t \in \mathbb{R}$ and all $n$,
$$\left|{}_0F_1\!\left(\tfrac{n}{2};\, -\tfrac{t^2}{4}\, I_n\right) - e^{-t^2/2}\right| \le c\, e^{-dn}.$$

Proof. Define the random variable $Y = \operatorname{tr}(H_n)$, where $H_n$ is uniformly distributed on $O(n)$. Denote by $f_Y$ and $\phi$ the probability density functions of $Y$ and the $\mathcal{N}(0,1)$ random variable, respectively. By Johansson [6], Theorem 3.7(b), there exist positive constants $c$ and $d$ such that
$$\int_{-\infty}^{\infty} \left|f_Y(y) - \phi(y)\right| dy \le c\, e^{-dn}$$
for all $n$. Therefore, for $t \in \mathbb{R}$,
$$\left|{}_0F_1\!\left(\tfrac{n}{2};\, -\tfrac{t^2}{4}\, I_n\right) - e^{-t^2/2}\right| = \left|\int_{-\infty}^{\infty} e^{\mathrm{i}ty} \left(f_Y(y) - \phi(y)\right) dy\right| \le \int_{-\infty}^{\infty} \left|f_Y(y) - \phi(y)\right| dy \le c\, e^{-dn}.$$
The proof is complete. □
5. The Case of the Unitary Group
As we noted in the introduction, the method used in Section 4 produces similar results in the case of the unitary and symplectic groups. We shall present the details in the unitary case; as regards the symplectic case, which we leave to the reader, we note that the necessary details on the zonal polynomials and generalized hypergeometric function may be obtained from the paper of Gross and Richards [20].
In the sequel, we denote by $A^*$ the adjoint of a complex matrix $A$: $A^* = \bar{A}'$. We also denote by $U(n)$ the group of $n \times n$ unitary matrices. The analog of Theorem 1.1 in the unitary case, due to Meckes [12], is the following:
Theorem 5.1. (Meckes [12]) Let $\{A_n\}$ be a sequence of complex matrices such that $A_n$ is $n \times n$ and $\operatorname{tr}(A_nA_n^*) = n$ for all $n$. Let $U_n$ be a random unitary matrix which is uniformly distributed on $U(n)$. Then $\operatorname{tr}(A_nU_n)$ converges in distribution to a standard complex normal random variable as $n \to \infty$.
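As with the orthogonal case, Theorem 5.1 can be explored by simulation (Python/NumPy; `haar_unitary` is our own helper). We take $A_n = I_n$; under the normalization assumed here, the limiting standard complex normal $Z$ has $E|Z|^2 = 1$, with real and imaginary parts each of variance $1/2$.

```python
import numpy as np

rng = np.random.default_rng(5)

def haar_unitary(n, rng):
    """Haar-distributed element of U(n): QR of a complex Gaussian matrix,
    with the standard diagonal phase correction."""
    g = (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))) / np.sqrt(2)
    q, r = np.linalg.qr(g)
    d = np.diag(r)
    return q * (d / np.abs(d))

n, reps = 50, 3000
# A_n = I_n satisfies tr(A_n A_n^*) = n, so tr(A_n U_n) = tr(U_n).
samples = np.array([np.trace(haar_unitary(n, rng)) for _ in range(reps)])
print((np.abs(samples)**2).mean(), samples.real.var(), samples.imag.var())
```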
In this setting, we will need the analogs of the partitional rising factorial, the zonal polynomial, and the generalized hypergeometric function of matrix argument that pertain to the “complex” case; see James [19] or Gross and Richards [20,21]. Specifically, the partitional rising factorial is now defined as
$$[a]_\lambda = \prod_{j \ge 1} (a - j + 1)_{\lambda_j}, \qquad (5.1)$$
where each $(a - j + 1)_{\lambda_j}$ is a classical rising factorial as defined in Equation (2.3); the zonal polynomial is defined for any Hermitian $n \times n$ matrix $S$ as
$$\widetilde{C}_\lambda(S) = \widetilde{c}_\lambda \int_{U(n)} \phi_\lambda(U^*SU)\, dU,$$
where the power function $\phi_\lambda$ is defined in Equation (2.5), and the normalizing constants $\widetilde{c}_\lambda$ are positive and are chosen uniquely so that
$$\sum_{|\lambda|=k} \widetilde{C}_\lambda(S) = (\operatorname{tr} S)^k;$$
and for any $b \in \mathbb{C}$ such that $(j-1) - b$ is not a non-negative integer for all $j = 1, \ldots, n$, the generalized hypergeometric function of matrix argument is defined as
$${}_0\widetilde{F}_1(b; S) = \sum_{k=0}^{\infty} \frac{1}{k!} \sum_{|\lambda|=k} \frac{\widetilde{C}_\lambda(S)}{[b]_\lambda}.$$
Similar to the orthogonal case, the characteristic function of the random variable $\operatorname{tr}(A_nU_n)$ is
$$E\, e^{\mathrm{i}\,\Re[\bar{t}\operatorname{tr}(A_nU_n)]} = {}_0\widetilde{F}_1\!\left(n;\, -\tfrac{|t|^2}{4}\, A_nA_n^*\right), \qquad t \in \mathbb{C},$$
where ${}_0\widetilde{F}_1$ is a generalized hypergeometric function of Hermitian matrix argument. By expanding the ${}_0\widetilde{F}_1$ function in a series of complex zonal polynomials, we obtain a generating function for the moments of the random variable $\operatorname{tr}(A_nU_n)$:
$$E\, e^{\mathrm{i}\,\Re[\bar{t}\operatorname{tr}(A_nU_n)]} = \sum_{k=0}^{\infty} \frac{(-|t|^2/4)^k}{k!} \sum_{|\lambda|=k} \frac{\widetilde{C}_\lambda(A_nA_n^*)}{[n]_\lambda}.$$
By comparing like powers of $t$, we deduce that, for $k = 0, 1, 2, \ldots$,
$$E\left|\operatorname{tr}(A_nU_n)\right|^{2k} = k! \sum_{|\lambda|=k} \frac{\widetilde{C}_\lambda(A_nA_n^*)}{[n]_\lambda}.$$
By Equation (5.1),
$$[n]_\lambda = n^k \left(1 + \frac{\sigma_\lambda}{n} + O(n^{-2})\right)$$
as $n \to \infty$, where
$$\sigma_\lambda = \tfrac12 \sum_{j \ge 1} \lambda_j(\lambda_j - 2j + 1).$$
By means of an argument similar to that given in the proof of Lemma 2.1, the coefficients $\sigma_\lambda$ satisfy the bound $|\sigma_\lambda| \le k(k-1)$ for all partitions $\lambda$ of weight $k$; therefore,
$$\left|\sum_{|\lambda|=k} \frac{\sigma_\lambda}{n}\, \widetilde{C}_\lambda(A_nA_n^*)\right| \le \frac{k(k-1)}{n} \sum_{|\lambda|=k} \widetilde{C}_\lambda(A_nA_n^*) = \frac{k(k-1)}{n}\, n^k,$$
so we obtain, after division by $n^k$, an error term bounded by $k(k-1)/n$, whose numerator is not dependent on $n$. Therefore,
$$E\left|\operatorname{tr}(A_nU_n)\right|^{2k} = k!\left(1 + O(n^{-1})\right).$$
We conclude that for fixed $k$, $E\left|\operatorname{tr}(A_nU_n)\right|^{2k} \to k!$ as $n \to \infty$, where $k!$ is the $(2k)$th absolute moment of the standard complex normal distribution. Finally, we apply the moment problem to deduce that $\operatorname{tr}(A_nU_n)$ converges in distribution to a standard complex normal random variable.
We can also obtain an upper bound on the difference between the ${}_0\widetilde{F}_1$ function of scalar matrix argument and the corresponding Gaussian quantity, $e^{-t^2/4}$. The proof is similar to that of Proposition 4.1 and rests on an inequality of Johansson [6], Theorem 2.6(b).

Proposition 5.2. There exist positive constants $c$ and $d$ such that, for all $t \in \mathbb{R}$ and all $n$,
$$\left|{}_0\widetilde{F}_1\!\left(n;\, -\tfrac{t^2}{4}\, I_n\right) - e^{-t^2/4}\right| \le c\, e^{-dn}.$$