Article

On the Limiting Distribution of the Spectra of Random Block Matrices

by
Alexander N. Tikhomirov
Institute of Physics and Mathematics of FRC “Komi Science Center of Ural Branch of RAS”, Syktyvkar 167982, Russia
Mathematics 2025, 13(13), 2056; https://doi.org/10.3390/math13132056
Submission received: 31 March 2025 / Revised: 14 May 2025 / Accepted: 2 June 2025 / Published: 20 June 2025

Abstract

The behavior of the spectra of symmetric block-type random matrices with symmetric blocks of high dimensionality is considered in this paper. Under minimal conditions regarding the distributions of matrix block elements (Lindeberg conditions), the universality of the limiting empirical distribution function of block-type random matrices is shown.

1. Introduction

In the theory of high-dimensional random matrices, one of the main objects to study is the empirical spectral distribution function of high-dimensional random matrices and its approximation. The most studied matrices are ensembles of symmetric real or Hermitian matrices with independent elements (semicircular law) and sample covariance matrices (Marchenko–Pastur distribution law). We consider ensembles of random matrices of a rather wide class—matrices with a block structure. The study of the spectra of block-type random matrices began at the end of the last century and the beginning of the 2000s. One of the first works in this field was Girko’s paper [1]. He considered matrices of increasing dimensions with block elements of fixed dimensions and proved various laws for the limiting spectral distribution of such matrices. In the first decade of the 21st century, a number of papers appeared focused on the study of the spectra of block matrices with blocks of increasing dimensions. See, for example, the works [2,3,4,5,6]. The study of such matrix models is gaining particular popularity in connection with various applications in machine learning and neural networks. See, for instance, [7,8].
One such example is communication channels with multiple transmitting and receiving antennas, the so-called MIMO (Multi-Input, Multi-Output) channels. The channel matrix has the form
$$\mathbf{A} = \begin{pmatrix} \mathbf{A}_1 & \mathbf{A}_2 & \cdots & \mathbf{A}_L & \mathbf{O} & \cdots & \mathbf{O} \\ \mathbf{O} & \mathbf{A}_1 & \mathbf{A}_2 & \cdots & \mathbf{A}_L & \cdots & \mathbf{O} \\ \vdots & \ddots & \ddots & & & \ddots & \vdots \\ \mathbf{O} & \cdots & \mathbf{O} & \mathbf{A}_1 & \mathbf{A}_2 & \cdots & \mathbf{A}_L \end{pmatrix},$$
where $\mathbf{A}_1, \mathbf{A}_2, \dots, \mathbf{A}_L$ are matrices of size $n \times n$ and $\mathbf{A}_\nu = (h^{(\nu)}_{ij})_{i,j=1}^{n}$, where $h^{(\nu)}_{ij}$ reflects the channel’s effect on the signal transmitted from antenna $j$ of the transmitter and received at antenna $i$ of the receiver.
In what follows, we denote matrices and vectors in bold. To calculate the capacity of a channel, one needs to know the distribution of the eigenvalues of the following matrix:
$$\mathbf{H} = \begin{pmatrix} \mathbf{O} & \mathbf{A} \\ \mathbf{A}^* & \mathbf{O} \end{pmatrix},$$
where $\mathbf{O}$ denotes a matrix with zero entries and $\mathbf{A}^*$ denotes the Hermitian conjugate of $\mathbf{A}$. For details, see, e.g., [4]. More examples of block matrix applications can be found in spectral graph theory: adjacency matrices and Laplacian matrices of random multipartite graphs are examples of block-type random matrices; see [9,10,11]. Block random matrix models are also applicable to the study of products of random matrices via so-called linearization. The simplest example is sample covariance matrices, where instead of matrices of the form $\mathbf{X}\mathbf{X}^*$, matrices of the form $\mathbf{H} = \begin{pmatrix} \mathbf{O} & \mathbf{X} \\ \mathbf{X}^* & \mathbf{O} \end{pmatrix}$ are studied; see [12] and the references therein for more details. The asymptotics of the spectra of products of random matrices are studied, for example, when analyzing the asymptotic information capacity of communication channels; see, e.g., [13,14]. Block matrices play a fundamental role in neural networks, particularly in large-scale deep learning, optimization, and structured architectures. Their applications span distributed training, sparse computation, attention mechanisms, and optimization theory, making them essential for modern machine learning systems. Recent work, such as that by An et al. [15], has explored nonconvex optimization techniques for analyzing spectral distributions in random matrices, which have implications for neural network training dynamics.
The development of methods for studying the spectra of classical random matrix ensembles (Wigner random matrices, sample covariance matrices) in recent years has led to increased interest in studying the spectra of random matrices with block structures. See, for example, [6,16,17,18,19,20,21,22,23,24,25,26] and references therein.

1.1. List of Notations

We denote matrices by capital letters in bold, (column) vectors by lowercase letters in bold, and matrix elements by the same letters as matrices (as a rule) but in regular font.
The symbol $\mathbf{I}$ denotes the identity matrix; when it is necessary to specify the dimensions, we provide a lower index; e.g., $\mathbf{I}_{n_1}$ denotes the identity matrix of order $n_1$.
With $\mathbf{R}_A$, we denote the resolvent matrix of a matrix $\mathbf{A}$, i.e., $\mathbf{R}_A = (\mathbf{A} - z\mathbf{I})^{-1}$.
With $\mathbf{e}_j$, $j = 1, \dots, k$, we denote the standard basis vectors in $\mathbb{R}^k$ (the $j$-th component equals one, all the others are zeros).
With $\mathbf{E}^{(ij)}$, we denote the matrix defined as $\mathbf{E}^{(ij)} = \mathbf{e}_i \mathbf{e}_j^T$; i.e., all the elements of $\mathbf{E}^{(ij)}$ are zeros, except for $E^{(ij)}_{ij} = 1$.
For any vector $\mathbf{a}$ and any matrix $\mathbf{A}$, we denote the transposed vector and the transposed matrix by $\mathbf{a}^T$ and $\mathbf{A}^T$, respectively. For a complex matrix $\mathbf{A}$, we denote the Hermitian conjugate of $\mathbf{A}$ by $\mathbf{A}^*$, i.e., $\mathbf{A}^* = (\overline{\mathbf{A}})^T$.
We denote the indicator function by $\mathbb{I}\{\cdot\}$.

1.2. Model Representation

Consider the following model of block random matrices. Let $\mathbf{X}^{(i,j)}$, $i,j = 1,\dots,k$, be a family of $n \times n$ random matrices. We will assume that for any fixed pair of indices $(i,j)$, $i,j = 1,\dots,k$, the elements of the matrix $\mathbf{X}^{(i,j)}$ are independent (up to symmetry), have mean zero, and have finite variance, i.e.,
$$\mathbb{E}\,X^{(i,j)}_{pq} = 0 \quad\text{and}\quad \mathbb{E}\,(X^{(i,j)}_{pq})^2 = (\sigma^{(i,j)}_{pq})^2 < \infty.$$
We will also assume that, for $(i,j) \ne (i_1,j_1)$, the matrices $\mathbf{X}^{(i,j)}$ and $\mathbf{X}^{(i_1,j_1)}$ either coincide or are independent.
Let the symbol ⊗ denote the Kronecker product. Consider a matrix of the form
$$\mathbf{W}_X = \sum_{i,j=1}^{k} \mathbf{E}^{(i,j)} \otimes \mathbf{X}^{(i,j)}.$$
We find it more convenient to change the representation of (2). Since we have assumed that some of the matrices $\mathbf{X}^{(i,j)}$ may coincide, we define the so-called “generating matrices” $\mathbf{Z}^{(1)}, \dots, \mathbf{Z}^{(m)}$ (with $m \le k^2$), which are random and independent, such that for any pair of indices $(i,j)$, $i,j = 1,\dots,k$, there exists some $l = l(i,j)$, $1 \le l \le m$, with $\mathbf{X}^{(i,j)} = \mathbf{Z}^{(l)}$. Suppose, for $l = 1,\dots,m$,
$$\mathbb{A}_l = \{(i,j),\ i,j = 1,\dots,k:\ \mathbf{X}^{(i,j)} = \mathbf{Z}^{(l)}\}.$$
Then the matrix $\mathbf{W}_X$ can be rewritten as
$$\mathbf{W}_X = \sum_{l=1}^{m} \sum_{(i,j)\in\mathbb{A}_l} \mathbf{E}^{(i,j)} \otimes \mathbf{Z}^{(l)}.$$
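To make this block representation concrete, here is a small sketch (illustrative code, not part of the paper; the function and variable names are hypothetical) that assembles a block matrix from generating blocks $\mathbf{Z}^{(l)}$ and index sets $\mathbb{A}_l$ via Kronecker products:

```python
import numpy as np

def build_block_matrix(index_sets, generators, k, n):
    """Assemble W = sum_l sum_{(i,j) in A_l} E^{(ij)} (x) Z^{(l)}
    from n x n generating blocks Z^{(l)} and 1-based index sets A_l."""
    W = np.zeros((k * n, k * n))
    for A_l, Z_l in zip(index_sets, generators):
        for (i, j) in A_l:
            E_ij = np.zeros((k, k))
            E_ij[i - 1, j - 1] = 1.0        # E^{(ij)} = e_i e_j^T
            W += np.kron(E_ij, Z_l)         # place the block Z^{(l)} at position (i, j)
    return W

# Toy example: k = 2, one symmetric generating block on both diagonal positions.
rng = np.random.default_rng(0)
n, k = 3, 2
Z = rng.standard_normal((n, n)); Z = (Z + Z.T) / 2
W = build_block_matrix([[(1, 1), (2, 2)]], [Z], k, n)
assert np.allclose(W, W.T)
```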

1.3. Toeplitz and Hankel Random Matrices

The Toeplitz and Hankel random matrices occupy a special place in the study of structured matrices. In [3], the existence of a limiting spectral distribution for random Toeplitz and Hankel matrices was proven. In particular, it was proven that the limiting distribution of the Toeplitz and Hankel matrices has unbounded support and differs from the normal distribution (the variance is 1, but the fourth moment is $8/3$). Oraby in [5] showed that for circulant block-type Toeplitz matrices with random blocks of high dimensionality, the limiting spectral distribution is a mixture of semicircular laws with variances depending on the dimensionality of the original Toeplitz matrix. In [27], Oraby’s result was obtained under weaker conditions on the distribution of the block elements. In [28], it was shown that for a palindromic Toeplitz matrix with Wigner blocks, the limiting spectral distribution is normal. In [18,29], symmetric block Toeplitz and Hankel matrices were considered. Let the generating matrices be $\mathbf{Z}^{(1)}, \dots, \mathbf{Z}^{(k)}$. Define the index sets as $\mathbb{A}_l = \{(i,j):\ |i-j| = l-1\}$, for $l = 1,\dots,k$. The symmetric block Toeplitz matrix then has the form
$$\mathbf{W} = \sum_{l=1}^{k} \sum_{(i,j)\in\mathbb{A}_l} \mathbf{E}^{(ij)} \otimes \mathbf{Z}^{(l)} = \begin{pmatrix} \mathbf{Z}^{(1)} & \mathbf{Z}^{(2)} & \cdots & \mathbf{Z}^{(k-1)} & \mathbf{Z}^{(k)} \\ \mathbf{Z}^{(2)} & \mathbf{Z}^{(1)} & \cdots & \mathbf{Z}^{(k-2)} & \mathbf{Z}^{(k-1)} \\ \vdots & \vdots & \ddots & \vdots & \vdots \\ \mathbf{Z}^{(k)} & \mathbf{Z}^{(k-1)} & \cdots & \mathbf{Z}^{(2)} & \mathbf{Z}^{(1)} \end{pmatrix}.$$
Hankel block matrices and circulant block matrices can be represented in a similar way.

1.3.1. Hankel Matrices

In a Hankel matrix, the number of distinct blocks is $m = 2k-1$, and the blocks are equal along each anti-diagonal (the diagonals perpendicular to the main one): $\mathbb{A}_l = \{(i,j):\ i+j = l+1\}$, $l = 1,\dots,2k-1$:
$$\mathbf{W}_X = \begin{pmatrix} \mathbf{Z}^{(1)} & \mathbf{Z}^{(2)} & \mathbf{Z}^{(3)} & \cdots & \mathbf{Z}^{(k-1)} & \mathbf{Z}^{(k)} \\ \mathbf{Z}^{(2)} & \mathbf{Z}^{(3)} & \mathbf{Z}^{(4)} & \cdots & \mathbf{Z}^{(k)} & \mathbf{Z}^{(k+1)} \\ \mathbf{Z}^{(3)} & \mathbf{Z}^{(4)} & & & & \vdots \\ \vdots & & & \ddots & & \vdots \\ \mathbf{Z}^{(k-1)} & \mathbf{Z}^{(k)} & \cdots & & \mathbf{Z}^{(2k-3)} & \mathbf{Z}^{(2k-2)} \\ \mathbf{Z}^{(k)} & \mathbf{Z}^{(k+1)} & \cdots & & \mathbf{Z}^{(2k-2)} & \mathbf{Z}^{(2k-1)} \end{pmatrix}.$$

1.3.2. Circulant Matrices

In a symmetric circulant matrix, the number of distinct blocks is
$$m = \begin{cases} s+1, & \text{if } k = 2s, \\ s, & \text{if } k = 2s-1. \end{cases}$$
This is a special case of Toeplitz matrices with equal blocks on the diagonals, $\mathbb{A}_l = \{(i,j):\ |i-j| = l-1 \text{ or } |i-j| = k-l+1\}$, $l = 1,\dots,m$:
$$\mathbf{W}_X = \begin{pmatrix} \mathbf{Z}^{(1)} & \mathbf{Z}^{(2)} & \cdots & \mathbf{Z}^{(3)} & \mathbf{Z}^{(2)} \\ \mathbf{Z}^{(2)} & \mathbf{Z}^{(1)} & \cdots & \mathbf{Z}^{(4)} & \mathbf{Z}^{(3)} \\ \vdots & \vdots & \ddots & \vdots & \vdots \\ \mathbf{Z}^{(3)} & \mathbf{Z}^{(4)} & \cdots & \mathbf{Z}^{(1)} & \mathbf{Z}^{(2)} \\ \mathbf{Z}^{(2)} & \mathbf{Z}^{(3)} & \cdots & \mathbf{Z}^{(2)} & \mathbf{Z}^{(1)} \end{pmatrix}.$$
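The three structured families above differ only in their index sets $\mathbb{A}_l$. A short sketch (again with hypothetical helper names) generates these sets; the blocks can then be assembled with a builder like the one sketched above:

```python
def toeplitz_sets(k):
    # A_l = {(i, j) : |i - j| = l - 1}, l = 1, ..., k
    return [[(i, j) for i in range(1, k + 1) for j in range(1, k + 1)
             if abs(i - j) == l - 1] for l in range(1, k + 1)]

def hankel_sets(k):
    # A_l = {(i, j) : i + j = l + 1}, l = 1, ..., 2k - 1
    return [[(i, j) for i in range(1, k + 1) for j in range(1, k + 1)
             if i + j == l + 1] for l in range(1, 2 * k)]

def circulant_sets(k):
    # A_l = {(i, j) : |i - j| = l - 1 or |i - j| = k - l + 1}, l = 1, ..., m
    m = k // 2 + 1 if k % 2 == 0 else (k + 1) // 2
    return [[(i, j) for i in range(1, k + 1) for j in range(1, k + 1)
             if abs(i - j) in (l - 1, k - l + 1)] for l in range(1, m + 1)]
```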
The spectral analysis of random block matrices such as Toeplitz or Hankel matrices helps in understanding convergence in deep learning optimization; see, e.g., Xia et al. [8].

2. Main Result

Let $\mathbf{V}^{(1)}, \mathbf{V}^{(2)}, \dots, \mathbf{V}^{(m)}$ be Hermitian random matrices of size $n \times n$ whose elements are independent (up to symmetry). Using the index sets introduced above in (3), we define the following matrices:
$$\mathbf{Y}^{(ij)} = \mathbf{V}^{(l)} \quad\text{for } (i,j) \in \mathbb{A}_l,\ l = 1,\dots,m, \qquad \mathbf{W}_Y = \sum_{i,j=1}^{k} \mathbf{E}^{(ij)} \otimes \mathbf{Y}^{(ij)} = \sum_{l=1}^{m} \sum_{(i,j)\in\mathbb{A}_l} \mathbf{E}^{(ij)} \otimes \mathbf{V}^{(l)}.$$
Assume that the random variables $Y^{(ij)}_{pq}$ with any fixed $i,j = 1,\dots,k$, $i \le j$, are independent for $p,q = 1,\dots,n$, $p \le q$, and have the same first two moments as the random variables $X^{(ij)}_{pq}$; that is,
$$\mathbb{E}\,Y^{(ij)}_{pq} = 0 \quad\text{and}\quad \mathbb{E}\,(Y^{(ij)}_{pq})^2 = (\sigma^{(ij)}_{pq})^2.$$
Denote the eigenvalues of the matrix $\frac{1}{\sqrt{nk}}\mathbf{W}_X$ by $\lambda_1, \dots, \lambda_{nk}$ and the eigenvalues of $\frac{1}{\sqrt{nk}}\mathbf{W}_Y$ by $\mu_1, \dots, \mu_{nk}$. We define the empirical spectral distribution functions
$$F_n^X(x) = \frac{1}{nk}\sum_{j=1}^{nk} \mathbb{I}\{\lambda_j < x\}$$
and
$$F_n^Y(x) = \frac{1}{nk}\sum_{j=1}^{nk} \mathbb{I}\{\mu_j < x\},$$
where $\mathbb{I}\{\cdot\}$ stands for the event indicator.
The empirical spectral distributions represent one of the main subjects of investigation in random matrix theory. We are interested in conditions on the distributions of $X^{(ij)}_{rs}$ and $Y^{(ij)}_{rs}$ that would guarantee that the distributions $F_n^X(x)$ and $F_n^Y(x)$ become infinitely close in the limit of large $n$. We prove this closeness under Lindeberg’s condition (see (7)). Lindeberg’s condition means that the contribution to the total variance of the sum of $n^2$ random variables coming from the values of individual random variables exceeding the level $\tau\sqrt{n}$ is negligibly small compared to $n^2$.
This condition is the optimal moment condition in many limit theorems, both for sums of independent random variables (the central limit theorem for instance) and in random matrix theory (Wigner’s semicircular law, as shown in [30], and the Marchenko–Pastur law for the empirical spectral distribution of sample covariance matrices; see [31]).
In what follows, for $(i,j) \in \mathbb{A}_l$, we shall write
$$\sigma^{(ij)}_{pq} = \sigma^{(l)}_{pq}.$$
Theorem 1. 
We assume that there exists a constant $C_0$ such that for all $n \ge 1$,
$$\frac{1}{(nk)^2}\sum_{l=1}^{m}\sum_{p,q=1}^{n} (\sigma^{(l)}_{pq})^2 \le C_0.$$
Suppose that for all $i,j = 1,\dots,k$, the random matrices $\mathbf{X}^{(ij)}$ and $\mathbf{Y}^{(ij)}$ satisfy Lindeberg’s condition; i.e., for any $\tau > 0$,
$$L_n(\tau) := \max\Big\{\frac{1}{n^2}\sum_{r,s=1}^{n}\mathbb{E}\,|X^{(ij)}_{rs}|^2\,\mathbb{I}\{|X^{(ij)}_{rs}| > \tau\sqrt{n}\},\ \frac{1}{n^2}\sum_{r,s=1}^{n}\mathbb{E}\,|Y^{(ij)}_{rs}|^2\,\mathbb{I}\{|Y^{(ij)}_{rs}| > \tau\sqrt{n}\}\Big\} \xrightarrow[n\to\infty]{} 0.$$
Then
$$F_n^X(x) - F_n^Y(x) \xrightarrow[n\to\infty]{} 0 \quad\text{in probability}.$$
Remark 1. 
Note that if for some $\delta > 0$ there exists a constant $\mu_{2+\delta} > 0$ such that
$$\sup_{i,j,p,q \ge 1} \max\big\{\mathbb{E}\,|Y^{(i,j)}_{pq}|^{2+\delta},\ \mathbb{E}\,|X^{(i,j)}_{pq}|^{2+\delta}\big\} \le \mu_{2+\delta},$$
then Lindeberg’s condition is satisfied. This follows from Markov’s inequality. To find the limiting distribution of $F_n^X(x)$, we can compute it for $F_n^Y(x)$, where the $Y^{(ij)}_{pq}$ are Gaussian, for instance.
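Remark 1 suggests a direct experiment: build the same block structure once with heavy-tailed (but finite-variance) entries and once with Gaussian entries, and compare the two empirical spectral distribution functions. A minimal sketch for a $3 \times 3$ block Toeplitz structure (illustrative code; all names are hypothetical):

```python
import numpy as np

rng = np.random.default_rng(1)
n, k = 200, 3

def sample_blocks(dist):
    """k symmetric n x n generating blocks with unit-variance entries."""
    blocks = []
    for _ in range(k):
        Z = dist((n, n))
        blocks.append(np.triu(Z) + np.triu(Z, 1).T)   # symmetrize a block
    return blocks

def toeplitz_matrix(blocks):
    W = np.zeros((k * n, k * n))
    for i in range(k):
        for j in range(k):
            W[i*n:(i+1)*n, j*n:(j+1)*n] = blocks[abs(i - j)]
    return W / np.sqrt(n * k)                          # normalization 1/sqrt(nk)

student = lambda size: rng.standard_t(df=5, size=size) / np.sqrt(5 / 3)  # unit variance
gauss = lambda size: rng.standard_normal(size)

ev_X = np.linalg.eigvalsh(toeplitz_matrix(sample_blocks(student)))
ev_Y = np.linalg.eigvalsh(toeplitz_matrix(sample_blocks(gauss)))
grid = np.linspace(-3, 3, 601)
F_X = (ev_X[None, :] < grid[:, None]).mean(axis=1)     # F_n^X on the grid
F_Y = (ev_Y[None, :] < grid[:, None]).mean(axis=1)     # F_n^Y on the grid
print("sup |F_n^X - F_n^Y| =", np.abs(F_X - F_Y).max())  # small for large n
```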
Remark 2. 
Figure 1, Figure 2, Figure 3 and Figure 4 show histograms of the distribution of the eigenvalues of the Hankel (Figure 1) and Toeplitz (Figure 2) matrices with different distributions of block elements. We consider distributions with polynomially decreasing tails (density tails of order $|x|^{-\kappa}$, where $\kappa = 100$ in Figure 1A and Figure 2A and $\kappa = 7/2$ in Figure 1B and Figure 2B, respectively). Figure 3 and Figure 4 show the histograms of the Toeplitz (Figure 3) and Hankel (Figure 4) matrices with block elements distributed according to Student’s law with five degrees of freedom (Figure 3A and Figure 4A) and with block elements distributed according to the normal law (Figure 3B and Figure 4B).
It was proven in [5] that the limiting distribution $G(x) = \lim_{n\to\infty} F_n^X(x)$ exists when the elements of the matrices $\mathbf{Z}^{(l)}$, $l = 1,\dots,m$, are i.i.d. and $\mathbb{E}\,|Z^{(l)}_{jk}|^4 < \infty$. Since we can choose matrices with identically distributed Gaussian elements as the sequence of matrices $\mathbf{Y}^{(l)}$, we immediately obtain the following result as a consequence of Theorem 1.
Theorem 2. 
Let $(\sigma^{(i,j)}_{pq})^2 = \sigma^2$. Suppose that for all $i,j = 1,\dots,k$, Lindeberg’s condition for the random variables $X^{(ij)}_{pq}$ is satisfied; i.e., for any $\tau > 0$,
$$L_n(\tau) := \frac{1}{n^2}\sum_{r,s=1}^{n}\mathbb{E}\,|X^{(ij)}_{rs}|^2\,\mathbb{I}\{|X^{(ij)}_{rs}| > \tau\sqrt{n}\} \xrightarrow[n\to\infty]{} 0.$$
Then there exists a distribution function $G(x)$, depending only on the structure of the block matrix, such that
$$\lim_{n\to\infty} F_n^X(x) = G(x).$$
The proof follows immediately from Theorem 1 above and Theorem 1 in [5].

Rectangular Blocks

Consider a block random matrix model with rectangular blocks. In this case, the matrices $\mathbf{X}^{(ij)}$ satisfy the relation $\mathbf{X}^{(ij)} = (\mathbf{X}^{(ji)})^*$ (where, for any matrix $\mathbf{A}$, its Hermitian conjugate $\overline{\mathbf{A}}^T$ is denoted by $\mathbf{A}^*$). The matrix $\mathbf{X}^{(ij)}$ has dimensions $N_i \times N_j$. Consider the class of generating matrices $\mathbf{H}_1, \dots, \mathbf{H}_k, \mathbf{Z}_1, \mathbf{Z}_2, \dots, \mathbf{Z}_m, \mathbf{Z}_1^*, \mathbf{Z}_2^*, \dots, \mathbf{Z}_m^*$, where the diagonal blocks $\mathbf{H}_1, \dots, \mathbf{H}_k$ are Hermitian matrices. The sets $\mathbb{A}_l$ and $\mathbb{A}_l^*$ are defined by the equations
$$\mathbb{A}_l = \{(i,j),\ i,j = 1,\dots,k:\ \mathbf{X}^{(ij)} = \mathbf{Z}_l\}, \qquad \mathbb{A}_l^* = \{(i,j),\ i,j = 1,\dots,k:\ \mathbf{X}^{(ij)} = \mathbf{Z}_l^*\}.$$
Clearly, $(i,j) \in \mathbb{A}_l$ if and only if $(j,i) \in \mathbb{A}_l^*$. We then have the following representation:
$$\mathbf{W}_X = \sum_{i=1}^{k} \mathbf{e}_i\mathbf{e}_i^T \otimes \mathbf{H}_i + \sum_{l=1}^{m}\Big(\sum_{(i,j)\in\mathbb{A}_l} \mathbf{e}_i\mathbf{e}_j^T \otimes \mathbf{Z}^{(l)} + \sum_{(i,j)\in\mathbb{A}_l^*} \mathbf{e}_i\mathbf{e}_j^T \otimes \mathbf{Z}^{(l)*}\Big).$$
Since we consider the case where the matrix $\mathbf{W}_X$ is Hermitian, i.e., $\mathbf{W}_X = \mathbf{W}_X^*$, we have a number of restrictions on the matrix block dimensions. For example, the matrix
$$\mathbf{W}_X = \begin{pmatrix} \mathbf{H}_1 & \mathbf{X}^{(12)} & \mathbf{X}^{(13)} \\ (\mathbf{X}^{(12)})^* & \mathbf{H}_2 & \mathbf{X}^{(23)} \\ (\mathbf{X}^{(13)})^* & (\mathbf{X}^{(23)})^* & \mathbf{H}_3 \end{pmatrix}$$
can be represented as
$$\mathbf{W}_X = \big(\mathbf{e}_1\mathbf{e}_1^T \otimes \mathbf{H}_1 + \mathbf{e}_2\mathbf{e}_2^T \otimes \mathbf{H}_2 + \mathbf{e}_3\mathbf{e}_3^T \otimes \mathbf{H}_3\big) + (\mathbf{e}_1\mathbf{e}_2^T) \otimes \mathbf{X}^{(12)} + (\mathbf{e}_2\mathbf{e}_1^T) \otimes (\mathbf{X}^{(12)})^* + (\mathbf{e}_1\mathbf{e}_3^T) \otimes \mathbf{X}^{(13)} + (\mathbf{e}_3\mathbf{e}_1^T) \otimes (\mathbf{X}^{(13)})^* + (\mathbf{e}_2\mathbf{e}_3^T) \otimes \mathbf{X}^{(23)} + (\mathbf{e}_3\mathbf{e}_2^T) \otimes (\mathbf{X}^{(23)})^*.$$
The matrices $\mathbf{H}^{(i)}$ are independent Hermitian matrices of size $N_i \times N_i$. Assume that the elements of $\mathbf{H}^{(i)} = (h^{(i)}_{pq})_{p,q=1}^{N_i}$ are independent (except for $\mathbf{H}^{(i)}$ being Hermitian) and
$$\mathbb{E}\,h^{(i)}_{pq} = 0, \qquad \mathbb{E}\,|h^{(i)}_{pq}|^2 = [\sigma^{(i,0)}_{pq}]^2 < \infty.$$
For the matrices $\mathbf{Z}^{(1)}, \mathbf{Z}^{(2)}, \dots, \mathbf{Z}^{(m)}$, we also assume that the elements are independent and
$$\mathbb{E}\,[\mathbf{Z}^{(l)}]_{pq} = 0, \qquad \mathbb{E}\,|[\mathbf{Z}^{(l)}]_{pq}|^2 = [\sigma^{(l)}_{pq}]^2 < \infty.$$
We denote
$$n = N_1 + N_2 + \dots + N_k.$$
In these notations, the matrix $\mathbf{W}_X$ has dimensions $n \times n$.
Now, let $\mathbf{D}^{(1)}, \dots, \mathbf{D}^{(k)}$ and $\mathbf{Y}^{(1)}, \dots, \mathbf{Y}^{(m)}$ be random matrices whose dimensions coincide with those of $\mathbf{H}^{(1)}, \dots, \mathbf{H}^{(k)}$ and $\mathbf{Z}^{(1)}, \mathbf{Z}^{(2)}, \dots, \mathbf{Z}^{(m)}$ and whose elements are independent and have first two moments coinciding with those of the elements of the above-mentioned matrices. Denote by $\mathbf{W}_Y$ the matrix obtained from $\mathbf{W}_X$ by replacing the blocks $\mathbf{H}_1, \dots, \mathbf{H}_k, \mathbf{Z}^{(1)}, \mathbf{Z}^{(2)}, \dots, \mathbf{Z}^{(m)}$ with the blocks $\mathbf{D}^{(1)}, \dots, \mathbf{D}^{(k)}, \mathbf{Y}^{(1)}, \dots, \mathbf{Y}^{(m)}$, respectively. We denote the eigenvalues of $\mathbf{W}_X$ by $\lambda_1 \ge \lambda_2 \ge \dots \ge \lambda_n$ in decreasing order. We then have the empirical spectral distribution function
$$F_n^X(x) = \frac{1}{n}\sum_{j=1}^{n} \mathbb{I}\{\lambda_j \le x\}.$$
We denote the eigenvalues of the matrix $\mathbf{W}_Y$ by $\mu_1 \ge \mu_2 \ge \dots \ge \mu_n$, and again, we have the empirical spectral distribution function
$$F_n^Y(x) = \frac{1}{n}\sum_{j=1}^{n} \mathbb{I}\{\mu_j \le x\}.$$
Theorem 3. 
Assume that there exists a constant $C_0$ such that for all $n \ge 1$,
$$\frac{1}{(nk)^2}\sum_{l=1}^{m}\sum_{(i,j)\in\mathbb{A}_l}\sum_{p=1}^{N_i}\sum_{q=1}^{N_j} (\sigma^{(i,j)}_{pq})^2 + \frac{1}{n^2}\sum_{l=1}^{k}\sum_{p,q=1}^{N_l} (\sigma^{(l,0)}_{pq})^2 \le C_0.$$
Suppose that for all $i,j = 1,\dots,k$, Lindeberg’s condition is satisfied; i.e., for any $\tau > 0$,
$$L_n(\tau) := \frac{1}{n^2}\sum_{l=1}^{m}\sum_{(i,j)\in\mathbb{A}_l}\sum_{r=1}^{N_i}\sum_{s=1}^{N_j}\mathbb{E}\,|X^{(ij)}_{rs}|^2\,\mathbb{I}\{|X^{(ij)}_{rs}| > \tau\sqrt{n}\} + \frac{1}{n^2}\sum_{l=1}^{k}\sum_{p,q=1}^{N_l}\mathbb{E}\,|h^{(l)}_{pq}|^2\,\mathbb{I}\{|h^{(l)}_{pq}| > \tau\sqrt{n}\} \xrightarrow[n\to\infty]{} 0.$$
Then
$$F_n^X(x) - F_n^Y(x) \xrightarrow[n\to\infty]{} 0 \quad\text{in probability}.$$
Remark 3. 
Figure 5 and Figure 6 show the histograms of the spectra of block random matrices with rectangular blocks and different distributions of the block elements. The figures illustrate that the limiting spectral distribution of block random matrices with rectangular blocks does not depend on the distributions of the block elements.
For many block-type random matrices (both with square blocks and with rectangular blocks), the limiting empirical spectral distributions are known in the case when the block elements are Gaussian and the variances $[\sigma^{(ij)}_{pq}]^2$ satisfy certain conditions (e.g., they are all equal). Theorems 1 and 3 show that the same limiting distributions hold for blocks with arbitrary element distributions, as long as Lindeberg’s conditions are satisfied. We can consider several examples.
We first consider two simple ones: the Hermitian matrix $\mathbf{X} = \frac{1}{\sqrt{n}}(X_{pq})_{p,q=1}^{n}$ and a rectangular matrix $\widehat{\mathbf{X}}$ of dimensions $n \times m$, whose elements are independent. Consider the following block matrices:
$$\mathbf{W} = \begin{pmatrix} \mathbf{O} & \mathbf{X} & \mathbf{O} \\ \mathbf{X} & \mathbf{O} & \mathbf{X} \\ \mathbf{O} & \mathbf{X} & \mathbf{O} \end{pmatrix}, \qquad \widehat{\mathbf{W}} = \begin{pmatrix} \mathbf{O} & \widehat{\mathbf{X}}^* & \mathbf{O} \\ \widehat{\mathbf{X}} & \mathbf{O} & \widehat{\mathbf{X}} \\ \mathbf{O} & \widehat{\mathbf{X}}^* & \mathbf{O} \end{pmatrix}.$$
It is well known that if the random variables $X_{ij}$ are standard Gaussian and identically distributed, then the empirical spectral distribution function of the matrix $\mathbf{X}$ converges to the standard semicircular law, i.e., the limiting density is
$$p(x) = \frac{1}{2\pi}\sqrt{4 - x^2}\,\mathbb{I}\{|x| \le 2\}.$$
It is easy to see that
$$\det(\mathbf{W} - z\mathbf{I}) = (-z)^n \det(2\mathbf{X}^2 - z^2\mathbf{I}).$$
It follows that the limiting empirical spectral distribution is a mixture of an atom of mass $\frac13$ at zero and a distribution with density
$$p_1(x) = \frac{1}{4\pi}\sqrt{8 - x^2}\,\mathbb{I}\{|x| \le 2\sqrt{2}\}$$
and mass $\frac23$, so that the limiting distribution can be written as
$$\widetilde{p}(x) = \frac13\,\delta_0(x) + \frac23\,p_1(x),$$
where $\delta_0(x)$ is the Dirac delta function. From Theorem 1, if the random variables $X_{pq}$, $p,q = 1,\dots,n$, satisfy $\mathbb{E}\,|X_{pq}|^2 = 1$ and Lindeberg’s condition, then the limiting empirical spectral distribution of $\mathbf{W}$ has the form $\widetilde{p}(x)$. (Matrices of this type have also been considered in [12].) In a similar way, we can consider the case of rectangular blocks. Assume that $\lim_{n\to\infty}\frac{n}{m} = y \in (0,1)$. Applying the formula for the determinant of a block matrix, we obtain
$$\det(\widehat{\mathbf{W}} - z\mathbf{I}) = (-z)^n \det(2\widehat{\mathbf{X}}^*\widehat{\mathbf{X}} - z^2\mathbf{I}).$$
From here, one easily sees that the eigenvalues of $\widehat{\mathbf{W}}$ are zero with multiplicity $2m - n$ and $\pm\sqrt{2}\,s_j$, $j = 1,\dots,n$, where the $s_j$ are the singular values of $\widehat{\mathbf{X}}$. For example, in the case where the $\widehat{X}_{ij}$ are Gaussian with unit variance, the relation
$$\frac{1}{2n}\sum_{j=1}^{n}\big(\mathbb{I}\{s_j \le x\} + \mathbb{I}\{-s_j \le x\}\big) \xrightarrow[n\to\infty]{} G_y(x)$$
holds, where $G_y(x)$ is the distribution function of the symmetrized Marchenko–Pastur distribution with the parameter $y$, i.e., a distribution with density
$$g_y(x) = \frac{1}{2\pi y |x|}\sqrt{(x^2 - a^2)(b^2 - x^2)}\,\mathbb{I}\{a \le |x| \le b\},$$
where $a = 1 - \sqrt{y}$ and $b = 1 + \sqrt{y}$. From Theorem 3, we obtain the following: if $\mathbb{E}\,|\widehat{X}_{ij}|^2 = 1$ and Lindeberg’s condition is satisfied, then the limiting spectral distribution of the matrix $\widehat{\mathbf{W}}$ will have the form
$$\widehat{p}(x) = \frac{2-y}{2+y}\,\delta_0(x) + \frac{2y}{2+y}\cdot\frac{1}{\sqrt{2}}\,g_y\big(x/\sqrt{2}\big).$$
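A quick numerical sanity check of the first example (an illustrative sketch, not from the paper): by the determinant identity, the spectrum of $\mathbf{W}$ consists of zero with multiplicity $n$ together with $\pm\sqrt{2}$ times the eigenvalues of $\mathbf{X}$.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 150
X = rng.standard_normal((n, n))
X = (X + X.T) / np.sqrt(2 * n)           # symmetric matrix with semicircle normalization
O = np.zeros((n, n))
W = np.block([[O, X, O],
              [X, O, X],
              [O, X, O]])

mu = np.linalg.eigvalsh(X)
predicted = np.sort(np.concatenate([np.zeros(n), np.sqrt(2) * mu, -np.sqrt(2) * mu]))
assert np.allclose(np.sort(np.linalg.eigvalsh(W)), predicted, atol=1e-8)
```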
Results similar to Theorem 3 were obtained in [32]. In contrast to our paper, in [32], the proof was carried out using the method of moments, and instead of condition (6), the condition
$$\sup_{j,l,p,q} (\sigma^{(jl)}_{pq})^2 \le C_0$$
was considered.
Furthermore, in [32], it was assumed that all blocks $\mathbf{X}^{(ij)}$, $i,j = 1,\dots,k$, were independent, which rules out the case of coincident blocks, such as in the case of Hankel or Toeplitz matrices. Finally, in [32], it was assumed that $\sigma^{(ij)}_{pq} = \sigma_{pq}$; i.e., the variance profile is the same within all blocks. One cannot, for example, apply the results of [32] to the matrix $\mathbf{W} = \begin{pmatrix} \mathbf{O} & \mathbf{X} \\ \mathbf{X} & \mathbf{O} \end{pmatrix}$, where $\mathbf{X}$ is a Wigner matrix (a Hermitian matrix with independent elements (up to Hermitian symmetry) of equal variance) whose limiting spectral distribution is the semicircular law (cf. Corollary 2 in [32]).
Based on Theorems 1 and 3 and the results of Theorems 2 and 4 in [12], we can formulate some theorems on the convergence of the empirical spectral distribution functions of block-type random matrices. For this purpose, we need one more notation. We denote the covariance of the random elements of the matrices $\mathbf{X}^{(ij)}$ and $\mathbf{X}^{(pq)}$ by $\sigma_{i,j;p,q}$. By virtue of our agreement that the matrices $\mathbf{X}^{(ij)}$ and $\mathbf{X}^{(pq)}$ are either independent or coincident, the values of the function $\sigma_{i,j;p,q}$ are either 0 or 1. It is obvious that
$$\sigma_{i,j;p,q} = \sigma_{p,q;i,j}.$$
Let $\mathcal{M}_k(\mathbb{C})$ be the set of $k \times k$ matrices with complex elements. We define the mapping $\eta: \mathcal{M}_k(\mathbb{C}) \to \mathcal{M}_k(\mathbb{C})$ as follows. For a matrix $\mathbf{D} = (d_{ij})_{i,j=1}^{k} \in \mathcal{M}_k(\mathbb{C})$, we have $\eta(\mathbf{D}) = ([\eta(\mathbf{D})]_{ij})_{i,j=1}^{k}$ with
$$[\eta(\mathbf{D})]_{ij} = \frac{1}{k}\sum_{p,q=1}^{k} \sigma_{i,j;p,q}\, d_{pq}.$$
Theorem 4. 
Suppose that the random variables $X^{(ij)}_{pq}$, where $i,j = 1,\dots,k$ and $p,q = 1,\dots,n$, satisfy the following conditions:
(1) $\mathbb{E}\,X^{(ij)}_{pq} = 0$ and $\mathbb{E}\,|X^{(ij)}_{pq}|^2 = \sigma^2$;
(2) Lindeberg’s condition: for any $\tau > 0$, we have
$$L_n(\tau) := \frac{1}{n^2}\sum_{i,j=1}^{k}\sum_{p,q=1}^{n}\mathbb{E}\,|X^{(ij)}_{pq}|^2\,\mathbb{I}\{|X^{(ij)}_{pq}| > \tau\sqrt{n}\} \to 0 \quad\text{when } n \to \infty.$$
Then there exists a distribution function $F(x)$ whose Stieltjes transform $S(z)$ satisfies the equality $S(z) = \operatorname{Tr}\mathbf{G}(z)/k$, where $\mathbf{G}(z)$ is an analytic function taking values in the space of square matrices of dimensions $k \times k$, defined in the upper complex half-plane and uniquely determined by the properties
$$\lim_{|z|\to\infty,\ \operatorname{Im} z > 0} z\,\mathbf{G}(z) = \mathbf{I}_k$$
and
$$z\,\mathbf{G}(z) = \mathbf{I}_k + \eta(\mathbf{G}(z))\,\mathbf{G}(z).$$
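The system defining $\mathbf{G}(z)$ can be solved numerically by rewriting $z\mathbf{G} = \mathbf{I}_k + \eta(\mathbf{G})\mathbf{G}$ as $\mathbf{G} = (z\mathbf{I}_k - \eta(\mathbf{G}))^{-1}$ and iterating with damping. The sketch below is illustrative (all names are hypothetical); for $k = 1$ and $\sigma \equiv 1$ it reproduces the semicircular density:

```python
import numpy as np

def eta(G, sigma):
    """[eta(G)]_{ij} = (1/k) * sum_{p,q} sigma[i,j,p,q] * G[p,q]."""
    return np.einsum('ijpq,pq->ij', sigma, G) / G.shape[0]

def solve_G(z, sigma, iters=5000, damping=0.5):
    k = sigma.shape[0]
    G = np.eye(k, dtype=complex) / z                   # consistent with z*G(z) -> I_k
    for _ in range(iters):
        G_new = np.linalg.inv(z * np.eye(k) - eta(G, sigma))
        G = damping * G_new + (1 - damping) * G        # damped fixed-point step
    return G

# k = 1, sigma = 1: S(z) = G(z) solves z*G = 1 + G^2 (semicircular law).
sigma = np.ones((1, 1, 1, 1))
u = 0.5
S = solve_G(u + 1e-3j, sigma)[0, 0]
print(-S.imag / np.pi)                  # approximate density at u
print(np.sqrt(4 - u**2) / (2 * np.pi))  # exact semicircular density, both ~ 0.308
```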

3. Proof of the Main Result

We only need to show the convergence of the corresponding Stieltjes transforms on any subset of the upper complex half-plane with a non-empty interior. Consider the resolvent $\mathbf{R}_A(z)$ of a symmetric matrix $\mathbf{A}$. It is defined for all $z = u + iv$ with $v > 0$. We only need to prove that in some region $G \subset \mathbb{C}^+$ with a non-empty interior, we have the convergence
$$\frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_X}(z) - \frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_Y}(z) \to 0 \quad\text{for } n \to \infty$$
in probability.
We shall divide the proof into three parts. First, we show that we can replace the random variables with so-called truncated random variables. Then, we prove that for the truncated quantities, the difference between the expectations of the Stieltjes transforms of the matrices $\mathbf{W}_X$ and $\mathbf{W}_Y$ tends to 0. Finally, we show that the variance of the Stieltjes transforms tends to 0 (Girko’s lemma). The convergence of the expectations and the convergence of the variances to zero entail the convergence in probability.

3.1. Truncation

As a first step in the proof, we reduce the problem to truncated random variables. For some $\tau > 0$, we introduce the quantities
$$\widetilde{X}^{(l)}_{pq} = X^{(l)}_{pq}\,\mathbb{I}\{|X^{(l)}_{pq}| \le \tau\sqrt{n}\}.$$
Next, consider their centered versions
$$\widehat{X}^{(l)}_{pq} = \widetilde{X}^{(l)}_{pq} - \mathbb{E}\,\widetilde{X}^{(l)}_{pq}$$
and the normalized versions
$$\breve{X}^{(l)}_{pq} = \frac{\sigma^{(l)}_{pq}}{\widehat{\sigma}^{(l)}_{pq}}\,\widehat{X}^{(l)}_{pq},$$
where $[\widehat{\sigma}^{(l)}_{pq}]^2 = \mathbb{E}\,[\widehat{X}^{(l)}_{pq}]^2$. Note that
$$\mathbb{E}\,\breve{X}^{(l)}_{pq} = 0, \qquad \mathbb{E}\,[\breve{X}^{(l)}_{pq}]^2 = [\sigma^{(l)}_{pq}]^2.$$
Consider the matrices $\widetilde{\mathbf{W}}_X$, $\widehat{\mathbf{W}}_X$, and $\breve{\mathbf{W}}_X$ obtained from $\mathbf{W}_X$ by replacing $X^{(l)}_{pq}$ with $\widetilde{X}^{(l)}_{pq}$, $\widehat{X}^{(l)}_{pq}$, and $\breve{X}^{(l)}_{pq}$, respectively. Also, consider the corresponding resolvent matrices $\widetilde{\mathbf{R}}_{W_X}(z)$, $\widehat{\mathbf{R}}_{W_X}(z)$, and $\breve{\mathbf{R}}_{W_X}(z)$.
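The truncation steps above translate directly into code. The sketch below is illustrative and replaces the exact moments $\mathbb{E}\,\widetilde{X}^{(l)}_{pq}$ and $\widehat{\sigma}^{(l)}_{pq}$ with empirical ones, assuming i.i.d. entries:

```python
import numpy as np

def truncate_center_rescale(X, tau):
    """Truncation, centering, and rescaling of matrix entries, as above.
    Expectations are replaced with empirical moments (i.i.d. entries assumed)."""
    n = X.shape[0]
    X_tilde = X * (np.abs(X) <= tau * np.sqrt(n))  # truncate at level tau*sqrt(n)
    X_hat = X_tilde - X_tilde.mean()               # center
    return (X.std() / X_hat.std()) * X_hat         # rescale back to the original variance

rng = np.random.default_rng(3)
X = rng.standard_t(df=5, size=(500, 500))
X_breve = truncate_center_rescale(X, tau=0.2)
print(abs(X_breve.mean()), X_breve.std())          # mean ~ 0, std matches X.std()
```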
Lemma 1. 
Under the conditions of Theorem 1, we have
$$\lim_{n\to\infty}\Big(\frac{1}{nk}\operatorname{Tr}\breve{\mathbf{R}}_{W_X} - \frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_X}\Big) = 0 \quad\textit{in probability}.$$
Proof. 
We start by estimating the quantity
$$B_1 := \frac{1}{nk}\operatorname{Tr}\widetilde{\mathbf{R}}_{W_X}(z) - \frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_X}(z).$$
The following inequality holds:
$$\mathbb{E}\,\|\mathbf{W}_X - \widetilde{\mathbf{W}}_X\|_2^2 \le \frac{1}{nk}\sum_{l=1}^{m}\sum_{(i,j)\in\mathbb{A}_l}\sum_{p,q=1}^{n}\mathbb{E}\,|X^{(ij)}_{pq}|^2\,\mathbb{I}\{|X^{(ij)}_{pq}| > \tau\sqrt{n}\}.$$
Then, applying Cauchy’s inequality, it is easy to obtain
$$\mathbb{E}\,|B_1| \le \frac{1}{nk}\,\mathbb{E}\,\Big\{\Big[\sum_{l,r=1}^{nk}\Big|\sum_{j=1}^{nk}[\mathbf{R}(z)]_{jl}\,[\widetilde{\mathbf{R}}(z)]_{jr}\Big|^2\Big]^{\frac12}\Big[\sum_{l,r=1}^{nk}|W_{lr} - \widetilde{W}_{lr}|^2\Big]^{\frac12}\Big\}.$$
It is then not hard to see that
$$\sum_{l,r=1}^{nk}\Big|\sum_{j=1}^{nk}[\mathbf{R}(z)]_{jl}\,[\widetilde{\mathbf{R}}(z)]_{jr}\Big|^2 = \sum_{l,r=1}^{nk}\big|[\mathbf{R}(z)\widetilde{\mathbf{R}}(z)]_{lr}\big|^2 = \|\mathbf{R}(z)\widetilde{\mathbf{R}}(z)\|_2^2 \le \frac{nk}{v^4}.$$
Combining inequalities (24) and (25), we obtain the following:
$$\mathbb{E}\,|B_1| \le \frac{1}{v^2}\Big[\frac{C}{n^2}\sum_{l=1}^{m}\sum_{p,q=1}^{n}\mathbb{E}\,(X^{(l)}_{pq})^2\,\mathbb{I}\{|X^{(l)}_{pq}| > \tau\sqrt{n}\}\Big]^{\frac12} \le \frac{C}{v^2}\,L_n^{\frac12}(\tau).$$
We obtain a similar estimate for the quantity $B_2$:
$$\mathbb{E}\,|B_2| := \mathbb{E}\,\Big|\frac{1}{nk}\operatorname{Tr}\widetilde{\mathbf{R}}_X(z) - \frac{1}{nk}\operatorname{Tr}\widehat{\mathbf{R}}_X(z)\Big| \le \frac{C}{v^2}\,L_n^{\frac12}(\tau).$$
Let us now estimate the quantity
$$B_3 := \frac{1}{nk}\operatorname{Tr}\breve{\mathbf{R}}_X(z) - \frac{1}{nk}\operatorname{Tr}\widehat{\mathbf{R}}_X(z).$$
We have
$$\mathbb{E}\,|B_3| \le \frac{1}{nk}\,\mathbb{E}\,\big\|\widehat{\mathbf{R}}_X(z)\,\breve{\mathbf{R}}_X(z)\big\|_2\,\big\|\breve{\mathbf{W}} - \widehat{\mathbf{W}}\big\|_2 \le \frac{1}{nk\,v^2}\,\mathbb{E}^{\frac12}\Big[\sum_{l=1}^{m}\sum_{(i,j)\in\mathbb{A}_l}\sum_{p,q=1}^{n}\big|\widehat{X}^{(ij)}_{pq} - \breve{X}^{(ij)}_{pq}\big|^2\Big].$$
Obviously,
$$\sum_{l=1}^{m}\sum_{(i,j)\in\mathbb{A}_l}\sum_{p,q=1}^{n}\mathbb{E}\,\big|\widehat{X}^{(ij)}_{pq} - \breve{X}^{(ij)}_{pq}\big|^2 \le C\sum_{l=1}^{m}\sum_{(i,j)\in\mathbb{A}_l}\sum_{p,q=1}^{n}\big|(\widehat{\sigma}^{(ij)}_{pq})^2 - (\sigma^{(ij)}_{pq})^2\big| \le C\,(nk)^2\,L_n(\tau).$$
Substituting the last estimate into (29), we obtain
$$\mathbb{E}\,|B_3| \le \frac{C}{v^2}\,L_n^{\frac12}(\tau),$$
and therefore,
$$\mathbb{E}\,\Big|\frac{1}{nk}\operatorname{Tr}\breve{\mathbf{R}}_{W_X}(z) - \frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_X}(z)\Big| \le \frac{C}{v^2}\,L_n^{\frac12}(\tau).$$
Lemma 1 has been proven. □
Remark 4. 
Since the function $L_n(\tau)$ is monotone in $\tau$ and $L_n(\tau) \to 0$ as $n \to \infty$ for any fixed $\tau$, there exists a sequence $\tau_n$ such that $\tau_n \to 0$ and $L_n(\tau_n) \to 0$ as $n \to \infty$. In what follows, we take $\tau = \tau_n \to 0$ such that $L_n(\tau_n) \to 0$ as $n \to \infty$. Without loss of generality, we can assume that the random variables $X^{(ij)}_{pq}$, $i,j = 1,\dots,k$, $p,q = 1,\dots,n$, and $Y^{(ij)}_{pq}$, $i,j = 1,\dots,k$, $p,q = 1,\dots,n$, satisfy the condition
$$\max\big\{|Y^{(ij)}_{pq}|,\ |X^{(ij)}_{pq}|\big\} \le \tau_n\sqrt{n}.$$

3.2. Special Representation for the Difference Between Two Resolvents

For any $\alpha \in [0, \pi/2]$, consider the random matrix
$$\mathbf{Z}^{(l)}(\alpha) = \mathbf{X}^{(l)}\cos\alpha + \mathbf{Y}^{(l)}\sin\alpha.$$
We denote
$$\mathbf{W}_X(\alpha) := \frac{1}{\sqrt{nk}}\sum_{l=1}^{m}\sum_{(i,j)\in\mathbb{A}_l}\mathbf{E}^{(i,j)} \otimes \mathbf{Z}^{(l)}(\alpha), \qquad \mathbf{R}_{W_X}(z,\alpha) = (\mathbf{W}_X(\alpha) - z\mathbf{I})^{-1}.$$
In what follows, we denote the elements of $\mathbf{W}_X(\alpha) = (W_{jk})$ and $\mathbf{Z}^{(l)}(\alpha) = (Z^{(l)}_{jk})$ simply by $W_{jk}$ and $Z^{(l)}_{jk}$, omitting $\alpha$ and $X$ from the notations (unless this is ambiguous). Obviously, we have
$$\frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_Y}(z) - \frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_X}(z) = \frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_X}\big(z,\tfrac{\pi}{2}\big) - \frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_X}(z,0) = \frac{1}{nk}\int_0^{\pi/2}\frac{\partial\operatorname{Tr}\mathbf{R}_{W_X}(z,\alpha)}{\partial\alpha}\,d\alpha = \frac{1}{nk}\sum_{j=1}^{nk}\int_0^{\pi/2}\frac{\partial[\mathbf{R}_{W_X}]_{jj}(z,\alpha)}{\partial\alpha}\,d\alpha.$$
We can now write
$$\frac{\partial[\mathbf{R}_{W_X}]_{jj}(z,\alpha)}{\partial\alpha} = \sum_{l=1}^{m}\sum_{p,q=1}^{n}\frac{\partial[\mathbf{R}_{W_X}]_{jj}(z,\alpha)}{\partial Z^{(l)}_{pq}}\,\frac{\partial Z^{(l)}_{pq}}{\partial\alpha}.$$
Note that for any invertible matrix A , the following differentiation formula is valid:
$$\frac{\partial A^{-1}_{jk}}{\partial A_{pq}} = -[\mathbf{A}^{-1}]_{jp}\,[\mathbf{A}^{-1}]_{qk}$$
(see, for example, [33]). Applying (32) to the resolvent $\mathbf{R}_{W_X}(z,\alpha)$, we obtain
$$\frac{\partial[\mathbf{R}_{W_X}]_{jj}(z,\alpha)}{\partial Z^{(l)}_{pq}} = -\frac{1}{\sqrt{nk}}\sum_{(r,s)\in\mathbb{A}_l}(2 - \delta_{pq})\,[\mathbf{R}_{W_X}(z,\alpha)]_{j,(r-1)n+p}\,[\mathbf{R}_{W_X}(z,\alpha)]_{j,(s-1)n+q},$$
where $\delta_{pq}$ stands for the Kronecker symbol. Also, note that
$$\frac{\partial Z^{(l)}_{pq}}{\partial\alpha} = -X^{(l)}_{pq}\sin\alpha + Y^{(l)}_{pq}\cos\alpha =: \widetilde{Z}^{(l)}_{pq}.$$
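Before assembling these pieces, the differentiation formula (32) itself is easy to validate numerically; the following sketch (illustrative only) compares it against a finite difference for a single-entry perturbation:

```python
import numpy as np

rng = np.random.default_rng(4)
n, eps = 6, 1e-7
A = rng.standard_normal((n, n)) + 3 * np.eye(n)   # a well-conditioned test matrix
R = np.linalg.inv(A)

p, q, j, k = 1, 4, 0, 2
E_pq = np.zeros((n, n)); E_pq[p, q] = 1.0
R_eps = np.linalg.inv(A + eps * E_pq)

fd = (R_eps[j, k] - R[j, k]) / eps                # finite-difference derivative
closed = -R[j, p] * R[q, k]                       # formula (32)
assert abs(fd - closed) < 1e-5
```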
Summing up the above formulas, we can write
$$\mathbb{E}\Big(\frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_Y}(z) - \frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_X}(z)\Big) = -\frac{1}{nk\sqrt{nk}}\sum_{j=1}^{nk}\sum_{l=1}^{m}\sum_{p,q=1}^{n}(2 - \delta_{pq})\int_0^{\pi/2}\mathbb{E}\,\widetilde{Z}^{(l)}_{pq}(\alpha)\sum_{(r,s)\in\mathbb{A}_l}[\mathbf{R}_{W_X}(z,\alpha)]_{j,(r-1)n+p}\,[\mathbf{R}_{W_X}(z,\alpha)]_{j,(s-1)n+q}\,d\alpha.$$
In what follows, we are going to estimate the values
$$D^{(l,r,s)}_{pq} = \mathbb{E}\,\widetilde{Z}^{(l)}_{pq}\,[\mathbf{R}_{W_X}(z,\alpha)]_{j,(r-1)n+p}\,[\mathbf{R}_{W_X}(z,\alpha)]_{j,(s-1)n+q}$$
for $l = 1,\dots,m$; $(r,s) \in \mathbb{A}_l$; and $p,q = 1,\dots,n$. We introduce the matrices $\mathbf{Z}^{(l,p,q)} = \mathbf{Z}^{(l)} - Z^{(l)}_{pq}(\mathbf{E}^{(p,q)} + \mathbf{E}^{(q,p)})$ (i.e., the matrix $\mathbf{Z}^{(l,p,q)}$ coincides with $\mathbf{Z}^{(l)}$ except that the entries $Z^{(l)}_{pq}$ and $Z^{(l)}_{qp}$ are replaced with zeros). In the same way, we define the matrices $\mathbf{W}^{(l,p,q)}(\alpha)$ and $\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)$. The simple resolvent equality $\mathbf{R}_{A_1} - \mathbf{R}_{A_2} = \mathbf{R}_{A_1}(\mathbf{A}_2 - \mathbf{A}_1)\mathbf{R}_{A_2}$ shows that
$$\mathbf{R}_{W_X}(z,\alpha) = \mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha) - \mathbf{R}_{W_X}(z,\alpha)\big(\mathbf{W}(\alpha) - \mathbf{W}^{(l,p,q)}(\alpha)\big)\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha).$$
Let $\mathbf{u}_j$, $j = 1,\dots,n$, denote the standard basis (column) vectors of the space $\mathbb{R}^n$. It is easy to see that we can then write
$$\mathbf{W}(\alpha) - \mathbf{W}^{(l,p,q)}(\alpha) = \frac{1}{\sqrt{nk}}\,Z^{(l)}_{pq}\sum_{(r,s)\in\mathbb{A}_l}\mathbf{E}^{(r,s)} \otimes (\mathbf{u}_p\mathbf{u}_q^T + \mathbf{u}_q\mathbf{u}_p^T).$$
Now, let $\mathbf{v}_j$, $j = 1,\dots,kn$, be the corresponding basis vectors of $\mathbb{R}^{kn}$. Then we can write
$$\mathbf{E}^{(r,s)} \otimes (\mathbf{u}_p\mathbf{u}_q^T + \mathbf{u}_q\mathbf{u}_p^T) = \mathbf{v}_{(r-1)n+p}\mathbf{v}_{(s-1)n+q}^T + \mathbf{v}_{(r-1)n+q}\mathbf{v}_{(s-1)n+p}^T.$$
And, further,
$$[\mathbf{R}_{W_X}(z,\alpha)]_{j,(r-1)n+p} = [\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r-1)n+p} - \frac{Z^{(l)}_{pq}}{\sqrt{nk}}\sum_{(r_1,s_1)\in\mathbb{A}_l}\mathbf{v}_j^T\mathbf{R}_{W_X}(z,\alpha)\big(\mathbf{v}_{(r_1-1)n+p}\mathbf{v}_{(s_1-1)n+q}^T + \mathbf{v}_{(r_1-1)n+q}\mathbf{v}_{(s_1-1)n+p}^T\big)\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\,\mathbf{v}_{(r-1)n+p},$$
$$[\mathbf{R}_{W_X}(z,\alpha)]_{j,(s-1)n+q} = [\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(s-1)n+q} - \frac{Z^{(l)}_{pq}}{\sqrt{nk}}\sum_{(r_1,s_1)\in\mathbb{A}_l}\mathbf{v}_j^T\mathbf{R}_{W_X}(z,\alpha)\big(\mathbf{v}_{(r_1-1)n+p}\mathbf{v}_{(s_1-1)n+q}^T + \mathbf{v}_{(r_1-1)n+q}\mathbf{v}_{(s_1-1)n+p}^T\big)\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\,\mathbf{v}_{(s-1)n+q}.$$
Now applying (34) to the matrix $\mathbf{R}_{W_X}$ on the right-hand side of (36), we arrive at the following equality:
$$[\mathbf{R}_{W_X}(z,\alpha)]_{j,(r-1)n+p} = [\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r-1)n+p} - \frac{Z^{(l)}_{pq}}{\sqrt{nk}}\sum_{(r_1,s_1)\in\mathbb{A}_l}\mathbf{v}_j^T\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\big(\mathbf{v}_{(r_1-1)n+p}\mathbf{v}_{(s_1-1)n+q}^T + \mathbf{v}_{(r_1-1)n+q}\mathbf{v}_{(s_1-1)n+p}^T\big)\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\,\mathbf{v}_{(r-1)n+p} + \frac{(Z^{(l)}_{pq})^2}{nk}\,[T^{(l,p,q)}_1]_{j,r,s},$$
$$[\mathbf{R}_{W_X}(z,\alpha)]_{j,(s-1)n+q} = [\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(s-1)n+q} - \frac{Z^{(l)}_{pq}}{\sqrt{nk}}\sum_{(r_1,s_1)\in\mathbb{A}_l}\mathbf{v}_j^T\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\big(\mathbf{v}_{(r_1-1)n+p}\mathbf{v}_{(s_1-1)n+q}^T + \mathbf{v}_{(r_1-1)n+q}\mathbf{v}_{(s_1-1)n+p}^T\big)\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\,\mathbf{v}_{(s-1)n+q} + \frac{(Z^{(l)}_{pq})^2}{nk}\,[T^{(l,p,q)}_2]_{j,r,s},$$
where
$$[T^{(l,p,q)}_1]_{j,r,s} = \sum_{(r_1,s_1)\in\mathbb{A}_l}\sum_{(r_2,s_2)\in\mathbb{A}_l}\mathbf{v}_j^T\mathbf{R}_{W_X}(z,\alpha)\big(\mathbf{v}_{(r_1-1)n+p}\mathbf{v}_{(s_1-1)n+q}^T + \mathbf{v}_{(r_1-1)n+q}\mathbf{v}_{(s_1-1)n+p}^T\big)\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\big(\mathbf{v}_{(r_2-1)n+p}\mathbf{v}_{(s_2-1)n+q}^T + \mathbf{v}_{(r_2-1)n+q}\mathbf{v}_{(s_2-1)n+p}^T\big)\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\,\mathbf{v}_{(r-1)n+p},$$
$$[T^{(l,p,q)}_2]_{j,r,s} = \sum_{(r_1,s_1)\in\mathbb{A}_l}\sum_{(r_2,s_2)\in\mathbb{A}_l}\mathbf{v}_j^T\mathbf{R}_{W_X}(z,\alpha)\big(\mathbf{v}_{(r_1-1)n+p}\mathbf{v}_{(s_1-1)n+q}^T + \mathbf{v}_{(r_1-1)n+q}\mathbf{v}_{(s_1-1)n+p}^T\big)\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\big(\mathbf{v}_{(r_2-1)n+p}\mathbf{v}_{(s_2-1)n+q}^T + \mathbf{v}_{(r_2-1)n+q}\mathbf{v}_{(s_2-1)n+p}^T\big)\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\,\mathbf{v}_{(s-1)n+q}.$$
Note that
$$\mathbb{E}\,\widetilde{Z}^{(l)}_{pq}(\alpha)\,Z^{(l)}_{pq}(\alpha) = \mathbb{E}\big((Y^{(l)}_{pq})^2 - (X^{(l)}_{pq})^2\big)\sin\alpha\cos\alpha = 0.$$
By the independence of the quantities $X^{(l)}_{pq}$ and $Y^{(l)}_{pq}$ from the matrices $\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)$, we obtain
$$\mathbb{E}\,\widetilde{Z}^{(l)}_{pq}(\alpha)\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(s-1)n+q} = 0,$$
$$\mathbb{E}\,\widetilde{Z}^{(l)}_{pq}(\alpha)\,Z^{(l)}_{pq}(\alpha)\,\mathbf{v}_j^T\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\big(\mathbf{v}_{(r_1-1)n+p}\mathbf{v}_{(s_1-1)n+q}^T + \mathbf{v}_{(r_1-1)n+q}\mathbf{v}_{(s_1-1)n+p}^T\big)\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\,\mathbf{v}_{(r-1)n+p} = 0.$$
Equalities (33), (36), and (38) imply that
$$\mathbb{E}\Big(\frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_Y}(z) - \frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_X}(z)\Big) = \int_0^{\pi/2}(Q_1 + \dots + Q_6)\,d\alpha,$$
where
$$Q_1 = \frac{1}{(nk)^2\sqrt{nk}}\sum_{j=1}^{nk}\sum_{l=1}^{m}\sum_{p,q=1}^{n}\sum_{(r,s)\in\mathbb{A}_l}\sum_{(r_1,s_1)\in\mathbb{A}_l}\sum_{(r_2,s_2)\in\mathbb{A}_l}(2-\delta_{pq})\,\mathbb{E}\,\widetilde{Z}^{(l)}_{pq}(\alpha)\,(Z^{(l)}_{pq})^2\;\mathbf{v}_j^T\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\big(\mathbf{v}_{(r_1-1)n+p}\mathbf{v}_{(s_1-1)n+q}^T + \mathbf{v}_{(r_1-1)n+q}\mathbf{v}_{(s_1-1)n+p}^T\big)\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\,\mathbf{v}_{(r-1)n+p}\;\times\;\mathbf{v}_j^T\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\big(\mathbf{v}_{(r_2-1)n+p}\mathbf{v}_{(s_2-1)n+q}^T + \mathbf{v}_{(r_2-1)n+q}\mathbf{v}_{(s_2-1)n+p}^T\big)\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\,\mathbf{v}_{(s-1)n+q},$$
$$Q_2 = \frac{1}{(nk)^3}\sum_{j=1}^{nk}\sum_{l=1}^{m}\sum_{p,q=1}^{n}\sum_{(r,s)\in\mathbb{A}_l}\sum_{(r_1,s_1)\in\mathbb{A}_l}(2-\delta_{pq})\,\mathbb{E}\,\widetilde{Z}^{(l)}_{pq}(\alpha)\,(Z^{(l)}_{pq})^3\;\mathbf{v}_j^T\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\big(\mathbf{v}_{(r_1-1)n+p}\mathbf{v}_{(s_1-1)n+q}^T + \mathbf{v}_{(r_1-1)n+q}\mathbf{v}_{(s_1-1)n+p}^T\big)\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\,\mathbf{v}_{(r-1)n+p}\;[T^{(l,p,q)}_2]_{j,r,s},$$
$$Q_3 = \frac{1}{(nk)^3}\sum_{j=1}^{nk}\sum_{l=1}^{m}\sum_{p,q=1}^{n}\sum_{(r,s)\in\mathbb{A}_l}\sum_{(r_1,s_1)\in\mathbb{A}_l}(2-\delta_{pq})\,\mathbb{E}\,\widetilde{Z}^{(l)}_{pq}(\alpha)\,(Z^{(l)}_{pq})^3\;[T^{(l,p,q)}_1]_{j,r,s}\;\mathbf{v}_j^T\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\big(\mathbf{v}_{(r_1-1)n+p}\mathbf{v}_{(s_1-1)n+q}^T + \mathbf{v}_{(r_1-1)n+q}\mathbf{v}_{(s_1-1)n+p}^T\big)\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)\,\mathbf{v}_{(s-1)n+q},$$
$$Q_4 = \frac{1}{(nk)^3\sqrt{nk}}\sum_{j=1}^{nk}\sum_{l=1}^{m}\sum_{p,q=1}^{n}\sum_{(r,s)\in\mathbb{A}_l}(2-\delta_{pq})\,\mathbb{E}\,\widetilde{Z}^{(l)}_{pq}(\alpha)\,(Z^{(l)}_{pq})^4\,[T^{(l,p,q)}_1]_{j,r,s}\,[T^{(l,p,q)}_2]_{j,r,s},$$
$$Q_5 = \frac{1}{(nk)^2\sqrt{nk}}\sum_{j=1}^{nk}\sum_{l=1}^{m}\sum_{p,q=1}^{n}\sum_{(r,s)\in\mathbb{A}_l}(2-\delta_{pq})\,\mathbb{E}\,\widetilde{Z}^{(l)}_{pq}(\alpha)\,(Z^{(l)}_{pq})^2\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r-1)n+p}\,[T^{(l,p,q)}_2]_{j,r,s},$$
$$Q_6 = \frac{1}{(nk)^2\sqrt{nk}}\sum_{j=1}^{nk}\sum_{l=1}^{m}\sum_{p,q=1}^{n}\sum_{(r,s)\in\mathbb{A}_l}(2-\delta_{pq})\,\mathbb{E}\,\widetilde{Z}^{(l)}_{pq}(\alpha)\,(Z^{(l)}_{pq})^2\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(s-1)n+q}\,[T^{(l,p,q)}_1]_{j,r,s}.$$

3.3. Estimations of Quantities Q 1 Q 6

3.3.1. Estimation of Q 1

We represent Q 1 in the form
$$Q_1 = Q_{11} + \dots + Q_{14},$$
where
$$Q_{11} = \frac{1}{(nk)^2\sqrt{nk}}\sum_{j=1}^{nk}\sum_{l=1}^{m}\sum_{p,q=1}^{n}\sum_{(r,s)\in\mathbb{A}_l}\sum_{(r_1,s_1)\in\mathbb{A}_l}\sum_{(r_2,s_2)\in\mathbb{A}_l}(2-\delta_{pq})\,\mathbb{E}\,\widetilde{Z}^{(l)}_{pq}\,(Z^{(l)}_{pq})^2\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_1-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_1-1)n+q,(r-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_2-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_2-1)n+q,(s-1)n+q},$$
$$Q_{12} = \frac{1}{(nk)^2\sqrt{nk}}\sum_{j=1}^{nk}\sum_{l=1}^{m}\sum_{p,q=1}^{n}\sum_{(r,s)\in\mathbb{A}_l}\sum_{(r_1,s_1)\in\mathbb{A}_l}\sum_{(r_2,s_2)\in\mathbb{A}_l}(2-\delta_{pq})\,\mathbb{E}\,\widetilde{Z}^{(l)}_{pq}\,(Z^{(l)}_{pq})^2\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_1-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_1-1)n+q,(r-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_2-1)n+q}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_2-1)n+p,(s-1)n+q},$$
$$Q_{13} = \frac{1}{(nk)^2\sqrt{nk}}\sum_{j=1}^{nk}\sum_{l=1}^{m}\sum_{p,q=1}^{n}\sum_{(r,s)\in\mathbb{A}_l}\sum_{(r_1,s_1)\in\mathbb{A}_l}\sum_{(r_2,s_2)\in\mathbb{A}_l}(2-\delta_{pq})\,\mathbb{E}\,\widetilde{Z}^{(l)}_{pq}\,(Z^{(l)}_{pq})^2\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_1-1)n+q}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_1-1)n+p,(r-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_2-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_2-1)n+q,(s-1)n+q},$$
$$Q_{14} = \frac{1}{(nk)^2\sqrt{nk}}\sum_{j=1}^{nk}\sum_{l=1}^{m}\sum_{p,q=1}^{n}\sum_{(r,s)\in\mathbb{A}_l}\sum_{(r_1,s_1)\in\mathbb{A}_l}\sum_{(r_2,s_2)\in\mathbb{A}_l}(2-\delta_{pq})\,\mathbb{E}\,\widetilde{Z}^{(l)}_{pq}\,(Z^{(l)}_{pq})^2\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_1-1)n+q}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_1-1)n+p,(r-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_2-1)n+q}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_2-1)n+p,(s-1)n+q}.$$
There is an obvious inequality for ν = 2 , 3 , 4 :
$$\frac{1}{n^{(\nu-1)/2}}\,\mathbb{E}\,\big|\widetilde{Z}^{(l)}_{pq}\,(Z^{(l)}_{pq})^{\nu}\big| \le C\,\tau^{\nu-1}\,(\sigma^{(l)}_{pq})^2.$$
This follows from condition (30).
All the quantities $Q_{1\nu}$, $\nu = 1,\dots,4$, are estimated similarly. Let us first consider the estimation of $Q_{11}$. Note that the number of elements in the sets $\mathbb{A}_l$, $l = 1,\dots,m$, is independent of $n$ and does not exceed $k^2$. Given the independence of $X^{(l)}_{pq}$ and $Y^{(l)}_{pq}$ from the matrices $\mathbf{R}^{(l,p,q)}_{W_X}$, we can write
$$Q_{11} = \frac{1}{(nk)^2\sqrt{nk}}\sum_{j=1}^{nk}\sum_{l=1}^{m}\sum_{p,q=1}^{n}\sum_{(r,s)\in\mathbb{A}_l}\sum_{(r_1,s_1)\in\mathbb{A}_l}\sum_{(r_2,s_2)\in\mathbb{A}_l}(2-\delta_{pq})\,\mathbb{E}\,\widetilde{Z}^{(l)}_{pq}\,(Z^{(l)}_{pq})^2\;\mathbb{E}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_1-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_1-1)n+q,(r-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_2-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_2-1)n+q,(s-1)n+q}.$$
Applying inequality (40), we get
$$|Q_{11}| \le \frac{C\tau}{(nk)^2}\sum_{j=1}^{nk}\sum_{l=1}^{m}\sum_{p,q=1}^{n}\sum_{(r,s)\in\mathbb{A}_l}\sum_{(r_1,s_1)\in\mathbb{A}_l}\sum_{(r_2,s_2)\in\mathbb{A}_l}(\sigma^{(l)}_{pq})^2\;\mathbb{E}\,\big|[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_1-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_1-1)n+q,(r-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_2-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_2-1)n+q,(s-1)n+q}\big|.$$
The following inequality is valid:
$$\sum_{j=1}^{nk}\big|[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_1-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_2-1)n+p}\big| \le \Big(\sum_{j=1}^{nk}\big|[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_1-1)n+p}\big|^2\Big)^{\frac12}\Big(\sum_{j=1}^{nk}\big|[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_2-1)n+p}\big|^2\Big)^{\frac12} \le \frac{1}{v^2}.$$
Here, we apply Cauchy’s inequality and use the fact that for any resolvent matrix $\mathbf{R}_A = (\mathbf{A} - z\mathbf{I})^{-1}$ constructed from a symmetric real or Hermitian matrix $\mathbf{A} = (A_{ij})_{i,j=1}^{n}$, the operator norm at the point $z = u + iv$ satisfies $\|\mathbf{R}_A\| \le \frac{1}{v}$, and hence the sum of the squares of the elements of any row does not exceed $\frac{1}{v^2}$; i.e., for any $i = 1,\dots,n$,
$$\sum_{j=1}^{n}|[\mathbf{R}_A]_{ij}|^2 \le \frac{1}{v^2}.$$
In addition,
$$\max\big\{\big|[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_1-1)n+q,(r-1)n+p}\big|,\ \big|[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_2-1)n+q,(s-1)n+q}\big|\big\} \le \frac{1}{v}.$$
Given these inequalities, we arrive at the following estimation:
$$|Q_{11}| \le \frac{C\tau}{v^4}\cdot\frac{1}{(nk)^2}\sum_{l=1}^{m}\sum_{p,q=1}^{n}(\sigma^{(l)}_{pq})^2.$$
Similarly, we obtain estimates for ν = 2 , 3 , 4 :
$$|Q_{1\nu}| \le \frac{C\tau}{v^4}\cdot\frac{1}{(nk)^2}\sum_{l=1}^{m}\sum_{p,q=1}^{n}(\sigma^{(l)}_{pq})^2.$$
Hence, from condition (6),
$$|Q_1| \le \frac{C\tau}{v^4}.$$
The estimates of $Q_2$, $Q_3$, and $Q_4$ are slightly different due to the dependence between the values $X^{(l)}_{pq}$ and $Y^{(l)}_{pq}$ and the matrices $\mathbf{R}_{W_X}$. But this problem is easily circumvented: all estimates associated with the elements of the resolvent hold uniformly over the entire probability space on which the random variables $X^{(l)}_{pq}$ and $Y^{(l)}_{pq}$ are defined.

3.3.2. Estimation of Q 2

To estimate $Q_2$, we note that there exists a constant $C$ such that
$$\big|[T^{(l,p,q)}_2]_{j,r,s}\big| \le \frac{C}{v^3}\sum_{(r_1,s_1)\in\mathbb{A}_l}\big(\big|[\mathbf{R}_{W_X}(z,\alpha)]_{j,(r_1-1)n+p}\big| + \big|[\mathbf{R}_{W_X}(z,\alpha)]_{j,(r_1-1)n+q}\big|\big).$$
Applying this inequality, we obtain
$$|Q_2| \le \frac{C}{v^4\,(nk)^3}\sum_{l=1}^{m}\sum_{p,q=1}^{n}\sum_{(r,s)\in\mathbb{A}_l}\sum_{(r_1,s_1)\in\mathbb{A}_l}\mathbb{E}\,\big|\widetilde{Z}^{(l)}_{pq}(\alpha)\,(Z^{(l)}_{pq})^3\big|\sum_{j=1}^{nk}\big|[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{j,(r_1-1)n+p}\big|\,\big|[\mathbf{R}_{W_X}(z,\alpha)]_{j,(r-1)n+p}\big|.$$
Using inequalities (42) and (43), we obtain
$$|Q_2| \le \frac{C}{v^6\,(nk)^3}\sum_{l=1}^{m}\sum_{p,q=1}^{n}\mathbb{E}\,\big|\widetilde{Z}^{(l)}_{pq}(\alpha)\,(Z^{(l)}_{pq})^3\big|.$$
Inequalities (40) and (16) and the last inequality together imply that
$$|Q_2| \le \frac{C\tau^2}{v^5}.$$

3.3.3. Estimation of Q 3

Let us rewrite the definition of the quantity $[T^{(l,p,q)}_1]_{j,r,s}$, taking into account equality (35):
$$[T^{(l,p,q)}_1]_{j,r,s} = [T^{(l,p,q)}_{11}]_{j,r,s} + \dots + [T^{(l,p,q)}_{14}]_{j,r,s},$$
where
$$[T^{(l,p,q)}_{11}]_{j,r,s} = \sum_{(r_1,s_1)\in\mathbb{A}_l}\sum_{(r_2,s_2)\in\mathbb{A}_l}[\mathbf{R}_{W_X}(z,\alpha)]_{j,(r_1-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_1-1)n+q,(r_2-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_2-1)n+q,(r-1)n+p},$$
$$[T^{(l,p,q)}_{12}]_{j,r,s} = \sum_{(r_1,s_1)\in\mathbb{A}_l}\sum_{(r_2,s_2)\in\mathbb{A}_l}[\mathbf{R}_{W_X}(z,\alpha)]_{j,(r_1-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_1-1)n+q,(r_2-1)n+q}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_2-1)n+p,(r-1)n+p},$$
$$[T^{(l,p,q)}_{13}]_{j,r,s} = \sum_{(r_1,s_1)\in\mathbb{A}_l}\sum_{(r_2,s_2)\in\mathbb{A}_l}[\mathbf{R}_{W_X}(z,\alpha)]_{j,(r_1-1)n+q}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_1-1)n+p,(r_2-1)n+p}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_2-1)n+q,(r-1)n+p},$$
$$[T^{(l,p,q)}_{14}]_{j,r,s} = \sum_{(r_1,s_1)\in\mathbb{A}_l}\sum_{(r_2,s_2)\in\mathbb{A}_l}[\mathbf{R}_{W_X}(z,\alpha)]_{j,(r_1-1)n+q}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_1-1)n+p,(r_2-1)n+q}\,[\mathbf{R}^{(l,p,q)}_{W_X}(z,\alpha)]_{(s_2-1)n+p,(r-1)n+p}.$$
Similarly to inequality (49), we obtain the estimate
$$\max\big\{|[T^{(l,p,q)}_{11}]_{j,r,s}|,\ |[T^{(l,p,q)}_{12}]_{j,r,s}|,\ |[T^{(l,p,q)}_{13}]_{j,r,s}|,\ |[T^{(l,p,q)}_{14}]_{j,r,s}|\big\} \le \frac{C}{v^2}\sum_{(r_1,s_1)\in\mathbb{A}_l}\big(\big|[\mathbf{R}_{W_X}(z,\alpha)]_{j,(r_1-1)n+p}\big| + \big|[\mathbf{R}_{W_X}(z,\alpha)]_{j,(r_1-1)n+q}\big|\big).$$
Inequalities (50), (41), and (42) together imply that
$$|Q_3| \le \frac{C\tau^2}{(nk)^2\,v^5}\sum_{l=1}^{m}\sum_{p,q=1}^{n}(\sigma^{(l)}_{pq})^2 \le \frac{C\tau^2}{v^5}.$$

3.3.4. Estimation of Q 4

Applying inequalities (50), (41), (42), and (46), we obtain the estimate
$$|Q_4| \le \frac{C\tau^3}{v^6}.$$

3.3.5. Estimation of Q 5

We use inequalities (41), (42), and (46) again. We obtain
$$|Q_5| \le \frac{C\tau}{v^4}.$$

3.3.6. Estimation of Q 6

Similarly to Q 5 , we apply inequalities (41), (42), and (45). We obtain
$$|Q_6| \le \frac{C\tau}{v^4}.$$
Estimations (45), (49), and (51)–(54) together with the representation in (39) imply that
$$\Big|\frac{1}{nk}\,\mathbb{E}\operatorname{Tr}\mathbf{R}_{W_X}(z) - \frac{1}{nk}\,\mathbb{E}\operatorname{Tr}\mathbf{R}_{W_Y}(z)\Big| \le \frac{C\tau}{v^4} + \frac{C\tau^3}{v^6} + \frac{C\tau^2}{v^5}.$$
Without loss of generality, we can assume that τ < 1 . The final estimate can be written as
$$\Big|\frac{1}{nk}\,\mathbb{E}\operatorname{Tr}\mathbf{R}_{W_X}(z) - \frac{1}{nk}\,\mathbb{E}\operatorname{Tr}\mathbf{R}_{W_Y}(z)\Big| \le C\tau,$$
where the constant C depends on v, k, and m. Considering the transition to truncated values, the final estimate is as follows:
$$\Big|\frac{1}{nk}\,\mathbb{E}\operatorname{Tr}\mathbf{R}_{W_X}(z) - \frac{1}{nk}\,\mathbb{E}\operatorname{Tr}\mathbf{R}_{W_Y}(z)\Big| \le C\big(\tau + L_n^{\frac12}(\tau)\big).$$
According to Remark 4, we can choose a sequence $\tau_n$ such that $\tau_n \to 0$ and $L_n(\tau_n) \to 0$ as $n \to \infty$. It follows that
$$\lim_{n\to\infty}\Big(\mathbb{E}\,\frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_X}(z) - \mathbb{E}\,\frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_Y}(z)\Big) = 0.$$
To complete the proof, we give the following lemma.

3.4. Girko’s Lemma

Lemma 2. 
Under the conditions of Theorem 1, the following inequality holds:
$$\mathbb{E}\,\Big|\frac{1}{nk}\big(\operatorname{Tr}\mathbf{R}_{W_X} - \mathbb{E}\operatorname{Tr}\mathbf{R}_{W_X}\big)\Big|^2 \le \frac{k^2}{n\,v^2}.$$
Proof. 
We prove this lemma using the method proposed by Girko (with a slight modification). For $l = 1,\dots,m$ and $j = 0,1,\dots,n$, we define the $\sigma$-algebras $\mathcal{M}^{(l,j)} = \sigma\{X^{(s)}_{pq},\ s < l;\ X^{(l)}_{pq},\ p,q \le j\}$ (with $\mathcal{M}^{(l,0)} = \mathcal{M}^{(l-1,n)}$). Note that $\mathcal{M}^{(1,0)} = \{\emptyset, \Omega\}$ is the trivial $\sigma$-algebra and $\mathcal{M}^{(m,n)}$ is the $\sigma$-algebra with respect to which all the random variables $X^{(l)}_{pq}$, $l = 1,\dots,m$, $1 \le p, q \le n$, are measurable. It is obvious that for any fixed $l = 1,\dots,m$, the $\sigma$-algebras $\mathcal{M}^{(l,j)}$ are non-decreasing in $j = 1,\dots,n$. Suppose that
$$\eta_{lj} = \frac{1}{nk}\Big(\mathbb{E}\{\operatorname{Tr}\mathbf{R}_{W_X}(z)\,|\,\mathcal{M}^{(l,j)}\} - \mathbb{E}\{\operatorname{Tr}\mathbf{R}_{W_X}(z)\,|\,\mathcal{M}^{(l,j-1)}\}\Big).$$
For fixed $l = 1,\dots,m$, the random variables $\eta_{lj}$ are uncorrelated (they form a martingale difference sequence). It is easy to see that
$$\frac{1}{nk}\big(\operatorname{Tr}\mathbf{R}_{W_X}(z) - \mathbb{E}\operatorname{Tr}\mathbf{R}_{W_X}(z)\big) = \sum_{l=1}^{m}\sum_{j=1}^{n}\eta_{lj}.$$
The Cauchy inequality implies that
$$\mathbb{E}\,\Big|\frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_X}(z) - \frac{1}{nk}\,\mathbb{E}\operatorname{Tr}\mathbf{R}_{W_X}(z)\Big|^2 \le m\sum_{l=1}^{m}\mathbb{E}\,\Big|\sum_{j=1}^{n}\eta_{lj}\Big|^2 = m\sum_{l=1}^{m}\sum_{j=1}^{n}\mathbb{E}\,|\eta_{lj}|^2.$$
For any fixed $l = 1,\dots,m$ and $j = 1,\dots,n$, define the matrix $\mathbf{W}^{(l,j)}$ obtained from the matrix $\mathbf{W}$ by removing, for every $(p,q) \in \mathbb{A}_l$, the row with number $(p-1)n + j$ and the column with number $(q-1)n + j$. Note that
$$\mathbb{E}\{\operatorname{Tr}\mathbf{R}^{(l,j)}_{W_X}\,|\,\mathcal{M}^{(l,j)}\} - \mathbb{E}\{\operatorname{Tr}\mathbf{R}^{(l,j)}_{W_X}\,|\,\mathcal{M}^{(l,j-1)}\} = 0.$$
Hence, we have the representation
$$\eta_{lj} = \frac{1}{nk}\Big(\mathbb{E}\{\operatorname{Tr}\mathbf{R}_{W_X}(z) - \operatorname{Tr}\mathbf{R}^{(l,j)}_{W_X}\,|\,\mathcal{M}^{(l,j)}\} - \mathbb{E}\{\operatorname{Tr}\mathbf{R}_{W_X}(z) - \operatorname{Tr}\mathbf{R}^{(l,j)}_{W_X}\,|\,\mathcal{M}^{(l,j-1)}\}\Big).$$
Now, we use the inequality
$$\big|\operatorname{Tr}\mathbf{R}_{W_X}(z) - \operatorname{Tr}\mathbf{R}^{(l,j)}_{W_X}\big| \le \frac{|\mathbb{A}_l|}{v} \le \frac{k^2}{v},$$
which, together with (59), gives the estimate
$$\mathbb{E}\,\Big|\frac{1}{nk}\operatorname{Tr}\mathbf{R}_{W_X}(z) - \frac{1}{nk}\,\mathbb{E}\operatorname{Tr}\mathbf{R}_{W_X}(z)\Big|^2 \le \frac{k^2}{n\,v^2}.$$
The lemma has been proven. □
Lemma 2, Remark 4, and relation (58) now complete the proof of the theorem.
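The decay of the variance in Lemma 2 can be observed numerically; a small Monte Carlo sketch for the Wigner case $k = 1$ (illustrative code, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(5)
z = 0.3 + 1.0j

def normalized_trace_resolvent(n):
    X = rng.standard_normal((n, n))
    W = (X + X.T) / np.sqrt(2 * n)                       # normalized Wigner matrix
    return np.trace(np.linalg.inv(W - z * np.eye(n))) / n

for n in (100, 200, 400):
    samples = np.array([normalized_trace_resolvent(n) for _ in range(50)])
    var = np.mean(np.abs(samples - samples.mean())**2)
    print(n, var)                                        # decays at least like 1/n
```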

4. Conclusions

Block-type random matrices play an important role in many studies, and special attention is paid to the limiting distribution of the spectra of such matrices. This is a difficult problem, but for special distributions of the matrix elements, it can often be solved. The universality of the limiting spectral distribution (its independence from the distribution of the matrix elements) allows us to choose matrices with elements whose distribution has certain good properties, for which special methods can be used, for example, Gaussian distributions. However, this is not always possible. For example, even in the Gaussian case, there is no exact description of the limiting spectral distribution for the Toeplitz matrix. It is known that it has unbounded support and is non-Gaussian (the second moment is 1 and the fourth moment is $8/3$); see, for instance, [3].
Many important problems of random matrix theory remain unsolved for block-type random matrices: first of all, the study of the distribution of the eigenvalues in the local regime (convergence rate, rigidity, delocalization of eigenvectors, and so on), as well as the distribution of the extreme eigenvalues.

Funding

This work was carried out within the framework of the project “Mathematical problems of the theory of stochastic and deterministic systems, including high-dimensional systems” (project no. 122040600066-5).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the author.

Conflicts of Interest

The author declares no conflict of interest.

References

1. Girko, V. Random block matrix density and SS-law. Random Oper. Stoch. Equ. 2000, 8, 189–194.
2. Bolla, M. Distribution of the eigenvalues of random block-matrices. Linear Algebra Its Appl. 2004, 377, 219–240.
3. Bryc, W.; Dembo, A.; Jiang, T. Spectral measure of large random Hankel, Markov and Toeplitz matrices. Ann. Probab. 2006, 34, 1–38.
4. Far, R.; Oraby, T.; Bryc, W.; Speicher, R. Spectra of large block matrices. Eur. J. Pure Appl. Math. 2024, 17, 2550–2561.
5. Oraby, T. The spectral laws of Hermitian block-matrices with large blocks. Electron. Commun. Probab. 2007, 12, 465–476.
6. Kologlu, M.; Kopp, G.S.; Miller, S.J. The limiting spectral measure for ensembles of symmetric block circulant matrices. arXiv 2010, arXiv:1008.4812.
7. Granziol, D.; Zohren, S.; Roberts, S. Learning rates as a function of batch size: A random matrix theory approach to neural network training. J. Mach. Learn. Res. 2022, 23, 1–65.
8. Xia, J.; Li, S.; Yang, Z.; Jaimoukha, I.; Gunduz, D. Meta-learning based alternating minimization algorithm for non-convex optimization. IEEE Trans. Neural Netw. Learn. Syst. 2023, 34, 5366–5380.
9. El Karoui, N. Graph connection Laplacian and random matrices with random blocks. Inf. Inference 2015, 4, 1–42.
10. Nadutkina, A.V.; Tikhomirov, A.N.; Timushev, D.A. Marchenko–Pastur law for the spectrum of a random weighted bipartite graph. Sib. Adv. Math. 2024, 34, 146–153.
11. Avrachenkov, K.; Cottatellucci, L.; Kadavankandy, A. Spectral properties of random matrices for stochastic block model. In Proceedings of the 13th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt), Mumbai, India, 25–29 May 2015; pp. 537–544.
12. Speicher, R. Free probability and random matrices. arXiv 2014, arXiv:1404.3393.
13. Fawaz, N.; Zarifi, K.; Debbah, M.; Gesbert, D. Asymptotic capacity and optimal precoding in MIMO multi-hop relay networks. IEEE Trans. Inf. Theory 2011, 57, 2050–2069.
14. Müller, R. On the asymptotic eigenvalue distribution of concatenated vector-valued fading channels. IEEE Trans. Inf. Theory 2002, 48, 2086–2091.
15. An, X.; Du, L.; Jiang, F.; Zhang, Y.; Deng, Z.; Kurths, J. A few-shot identification method for stochastic dynamical systems based on residual multipeaks adaptive sampling. Chaos 2024, 34, 073118.
16. Pfaffel, O.; Schlemm, E. Limiting spectral distribution of a new random matrix model with dependence across rows and columns. Linear Algebra Its Appl. 2012, 436, 2966–2979.
17. Aljadeff, J.; Renfrew, D.; Stern, M. Eigenvalues of block structured asymmetric random matrices. J. Math. Phys. 2015, 56, 103502.
18. Basu, R.; Bose, A.; Ganguly, S.; Hazra, R.S. Limiting spectral distribution of block matrices with Toeplitz block structure. arXiv 2011, arXiv:1111.1901.
19. Beckwith, O.; Luo, V.; Miller, S.J.; Shen, K.; Triantafillou, N. Distribution of eigenvalues of weighted structured matrix ensembles. arXiv 2015, arXiv:1112.3719.
20. Blackwell, K.; Borade, N.; Vi, C.D.; Luntzlara, N.; Ma, R.; Miller, S.J.; Wang, M.; Xu, W. Distribution of eigenvalues of random real symmetric block matrices. arXiv 2019, arXiv:1908.03834.
21. Cicuta, G.M.; Pernici, M. Sparse random block matrices. J. Phys. A Math. Theor. 2022, 55, 175202.
22. Dette, H.; Reuther, B. Random block matrices and matrix orthogonal polynomials. J. Theor. Probab. 2010, 23, 378–400.
23. Guhlich, M.; Nagel, J.; Dette, H. Random block matrices generalizing the classical Jacobi and Laguerre ensembles. J. Multivar. Anal. 2010, 101, 1884–1897.
24. Dunn, T.; Fleischmann, H.L.; Jackson, F.; Khunger, S.; Miller, S.J.; Reifenberg, L.; Shashkov, A.; Willis, S. Limiting spectral distributions of families of block matrix ensembles. arXiv 2022, arXiv:2109.01464.
25. Bogomolny, E.; Giraud, O. Statistical properties of structured random matrices. arXiv 2021, arXiv:2012.14322.
26. Krueger, T.; Renfrew, D. Singularity degree of structured random matrices. Ann. Inst. Henri Poincaré Probab. Stat. 2025, 61, 1416–1442.
27. Tikhomirov, A.N.; Timushev, D.A.; Gulyaeva, S.T. Limit theorems for spectra of circulant block matrices with large random blocks. Mathematics 2024, 12, 2291.
28. Blackwell, K.; Borade, N.; Bose, A.; Vi, C.D.; Luntzlara, N.; Ma, R.; Miller, S.J.; Mukherjee, S.S.; Wang, M.; Xu, W. Distribution of eigenvalues of matrix ensembles arising from Wigner and palindromic Toeplitz blocks. arXiv 2021, arXiv:2102.05839.
29. Li, Y.; Liu, D.; Wang, Z. Limit distributions of eigenvalues for random block Toeplitz and Hankel matrices. J. Theor. Probab. 2011, 24, 1063–1086.
30. Chin, C.W. Necessary and sufficient conditions for convergence to the semicircle distribution. Random Matrices Theory Appl. 2023, 12, 2250045.
31. Dong, Z.; Yao, J. Necessary and sufficient conditions for the Marčenko–Pastur law for sample correlation matrices. Stat. Probab. Lett. 2025, 221, 110377.
32. Ding, X. Spectral analysis of large block random matrices with rectangular blocks. Lith. Math. J. 2014, 54, 115–126.
33. Khorunzhy, A.M.; Khoruzhenko, B.A.; Pastur, L.A. On asymptotic properties of large random matrices with independent entries. J. Math. Phys. 1996, 37, 5033–5060.
Figure 1. The histograms of $5 \times 5$ Hankel block matrices with block entries distributed according to the polynomial densities $p(x) = C(1+|x|)^{-100}$ (A) and $p(x) = C(1+|x|)^{-7/2}$ (B).
Figure 2. The histograms of $5 \times 5$ Toeplitz block matrices with block entries distributed according to the polynomial densities $p(x) = C(1+|x|)^{-100}$ (A) and $p(x) = C(1+|x|)^{-7/2}$ (B).
Figure 3. The histograms of $5 \times 5$ Toeplitz block matrices with block entries distributed according to Student’s distribution with $df = 5$ degrees of freedom (A) and a standard normal distribution (B).
Figure 4. The histograms of $5 \times 5$ Hankel block matrices with block entries distributed according to Student’s distribution with $df = 5$ degrees of freedom (A) and a standard normal distribution (B).
Figure 5. The histogram of $3 \times 3$ block matrices with rectangular blocks, $N_1 = 5000$, $N_2 = 3333$, $N_3 = 1666$. Student’s distribution with $df = 3$ degrees of freedom, with different variances for different blocks.
Figure 6. The histograms of $3 \times 3$ block matrices with rectangular blocks, $N_1 = 5000$, $N_2 = 3333$, and $N_3 = 1666$. Student’s distribution with $df = 5$ degrees of freedom (A) and normal distribution (B), with different variances for different blocks.

