Next Article in Journal
On Using the BMCSL Equation of State to Renormalize the Onsager Theory Approach to Modeling Hard Prolate Spheroidal Liquid Crystal Mixtures
Next Article in Special Issue
Using the Semantic Information G Measure to Explain and Extend Rate-Distortion Functions and Maximum Entropy Distributions
Previous Article in Journal
Clifford Wavelet Entropy for Fetal ECG Extraction
Previous Article in Special Issue
Designing Bivariate Auto-Regressive Timeseries with Controlled Granger Causality
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Upper Bounds for the Capacity for Severely Fading MIMO Channels under a Scale Mixture Assumption

by
Johannes T. Ferreira
1,2
1
Department of Statistics, University of Pretoria, Pretoria 0002, South Africa
2
Centre of Excellence in Mathematical and Statistical Science, University of the Witwatersrand, Johannesburg 2000, South Africa
Entropy 2021, 23(7), 845; https://doi.org/10.3390/e23070845
Submission received: 23 May 2021 / Revised: 23 June 2021 / Accepted: 28 June 2021 / Published: 30 June 2021
(This article belongs to the Special Issue Measures of Information)

Abstract

:
A cornerstone in the modeling of wireless communication is MIMO systems, where a complex matrix variate normal assumption is often made for the underlying distribution of the propagation matrix. A popular measure of information, namely capacity, is often investigated for the performance of MIMO designs. This paper derives upper bounds for this measure of information for the case of two transmitting antennae and an arbitrary number of receiving antennae when the propagation matrix is assumed to follow a scale mixture of complex matrix variate normal distribution. Furthermore, noncentrality is assumed to account for LOS scenarios within the MIMO environment. The insight of this paper illustrates the theoretical form of capacity under these key assumptions and paves the way for considerations of alternative distributional choices for the channel propagation matrix in potential cases of severe fading, when the assumption of normality may not be realistic.

1. Introduction

1.1. MIMO Channels and Capacity

Research in random matrix theory has seen unprecedented growth in not only theoretical advances over the last two decades, but also coupled with a variety of developments in fields of application including wireless communication. In fact, the development in this current era of growing need for continuous and more reliable communication motivates a sustained research interest in mathematical, and specifically statistical representation and theoretical frameworks to understand and extend global communication needs. A major consideration within wireless communication is the multiple-input-multiple-output (MIMO) design. MIMO designs are widely studied and employed in practice as their multiple-antenna design exhibits high spectral efficiency and increased capacity for the transfer of information via wireless signals passing through these systems. MIMO has a fundamental probabilistic foundation, and is intertwined with statistical distribution theory. Therefore, statistical distribution insight is essential for further development, refinement, and investigation of MIMO systems and the interconnected world we live and operate in.
In wireless communication, this design is characterized as y = Hx + e where y , e C n × 1 , x C p × 1 , and H C n × p . In MIMO terms, x and y denotes the transmit- and receiver signal vectors respectively, e denotes a noise vector, and n denotes the number of transmitter antennae and p denotes the number of receiver antennae (often denoted by n t and n r respectively—however, "usual" statistical notation of n and p will be retained). Here, C n × p denotes the space of all complex matrices of dimension n × p and C * p × p denotes the space of all Hermitian positive definite complex matrices of dimension p × p . H is called the propagation matrix, and is usually considered to be complex matrix variate normally distributed with probability density function (pdf)
f H = 1 π n p det Φ p det Σ n e t r Φ 1 H M Σ 1 H M H
denoted H C N n × p M , Φ Σ , where M C n × p , Φ C * n × n , Σ C * p × p with mean E H = M and c o v H = Φ Σ . X H , X 1 , and det ( X ) denotes the Hermitian transpose, the inverse, and the determinant of the complex matrix X respectively, ⊗ denotes the Kronecker product, e t r · denotes the exponential trace exp t r · , and E · and c o v · denotes the expectation and covariance operator.
MIMO systems have different measures of information that describes the efficacy and quality of transmitted information in the design: mutual information being an important characteristic. In particular, the ergodic mutual information of the MIMO channel is evaluated as (see [1])
C = E H log det I p + ρ p H H H = p E H log 1 + ρ p λ 1
where ρ denotes the signal-to-noise ratio, and I p denotes the identity matrix of dimension p. The measure of information (2) is often described as the capacity of the MIMO system (also called Shannon capacity, see [2]), and relies on the distribution of H H H - or what is known as the complex (noncentral) Wishart matrix when H is considered CN distributed. Here, λ 1 denotes the largest eigenvalue of H H H , and its distribution is essential in the evaluation of (2). This assumption for H is restrictive in the practical case when severe fading is observed, and it is the main focus of this paper to alleviate this restriction via the platform of scale mixture of complex matrix variate normals for H .

1.2. Scale Mixture of (Complex) Matrix Variate Normals and the Resulting Wishart

Normality remains a common and useful assumption within statistical theory- and application. However, data often display characteristics that do not align with the implied requirements of normality: characteristics such as heavier tails, even if the data still respects characteristics such as symmetry. The alleviation of this normal assumption has been the focus of much research. Two important research foci in this regard are Gaussian mixtures (see [3,4] for examples in the MIMO context) which can account for multimodality and skewness, and scale mixtures which can account for heavier-than-normal tails in the data. Scale mixtures are often considered for relaxing the normal assumption, and is a particular focus area of this paper. Pioneering work has been undertaken by [5,6,7] and has brought forth an important wave of research since due to the elegance and practicality of this broader elliptical class of models. In the complex variate realm, [8] capitalized on this mixture representation of complex normal variates. However, limited work has been done to introduce the platform of scale mixture of normal distributions within a MIMO context. Within MIMO, a normal assumption is made, but evidence exists that there are practical considerations from fieldwork which supports the argument that a departure from normality does not seem far-fetched [9,10,11,12]. In this light, the consideration of a scale mixture of (complex) matrix variate normals (SMCN) for the candidacy of H makes a meaningful contribution, as the scale mixture class has different distributional members which may very well suitably adapt to the practitioners need [9,13]. These members include the usual normal-, t-, contaminated normal-, and slash distribution among others—all of which provide a heavier-than-normal tailed alternative for potential practical considerations of H . In fact, [11] illustrates an SMCN assumption for a zero mean H in the MIMO context specifically where superior capacity performance is observed for an underlying complex matrix variate t model for H , and [10,12] demonstrates the added value of the scale mixture approach when H exhibits a nonzero mean; specifically focusing on rank-1 noncentrality and the condition number of the quadratic form, respectively. The data-driven consideration of the scale mixture approach in the MIMO environment is therefore well-motivated, and a valuable theoretical consideration.
The variable H has an SMCN distribution if it has pdf
f ( H ) = 1 det Φ p det Σ n h t r Φ 1 H M Σ 1 H M H = R + f C N n × p M , Φ t 1 Σ H | t k t d t
denoted by H S M C N n × p M , Φ Σ , h where f C N n × p M , Φ t 1 Σ denotes the CN pdf as in (1), h · denotes the generator function, and k t denotes the weight function (inherently dependent on h · ). The advantage of this mixture representation is that the variable H S M C N n × p M , Φ Σ , h retains key statistical characteristics E H = M and c o v H = Φ Σ . There are two aspects of (3) that warrants some comments for this paper: first, the assumption of a nonzero mean (in this case M ) which often causes a challenging theoretical environment in which to investigate properties of the model. Secondly, the large covariance structure which causes further theoretical challenges, where the trace operator in the argument of h · oftentimes restrict computable forms. Key characteristics under these challenges (such as pdfs of the joint eigenvalues, and thus capacity) are often derived in mathematically elegant yet computationally challenging forms of zonal- and Hayakawa polynomials (see [2], for example). This limits the practical consideration, and this paper will make additional assumptions (similar to [10,14,15]), such as Φ = I n to circumvent the cumbersome potential computational implementation of the results.

1.3. Main Contribution of this Paper

The main contribution in this study is assuming this SMCN distributional form (3) for H , compared to the usual complex normal assumption (1). This results in the distribution of S = H H H C * p × p (see [10]), necessary for deriving (2), with pdf
f S = det S n p Γ ˜ p n det Σ n R + t n p e t r t Σ 1 S + Δ 0 F ˜ 1 n ; t 2 Δ Σ 1 S k t d t
denoted by S S M C W p n , M , I n Σ where S C * p × p , Δ = Σ 1 M H M indicates the noncentral matrix parameter, and Γ ˜ p · denotes the p dimensional complex multivariate gamma function. The function r F ˜ q a 1 , . . . , a r ; b 1 , . . . , b q , X denotes the hypergeometric function of Hermitian matrix argument with r upper and q lower parameters (see [2,16] for further details) where r = 0 and q = 1 in (4) specifically. The distribution in (4) is called a scale mixture of complex Wishart (SMCW) and will be referred to as Case 1, where the consideration of the parameter Δ implies noncentrality. [17] (and subsequently [12]) also considered an uncorrelated SMCW model within the MIMO environment. This consideration simplifies the theoretical framework, but is practically motivated for spatial (in)dependence of transmitters for when antennae are spaced far enough apart geographically so that the assumption of independence is not unreasonable. Consider thus the transformation W = Σ 1 S C * p × p with Jacobian J S W = det Σ p , then (see [12])
f u n c W = det W n p Γ ˜ p n R + t n p e t r t W + Δ 0 F ˜ 1 n ; t 2 Δ W k t d t
and will be referred to as Case 2.
The second main consideration of this paper is assuming a nonzero mean matrix for H . This is a valid and oftentimes essential assumption for a MIMO system when there is a direct line-of-sight (LOS) component between transmitters and receivers (see [10]). This noncentrality is also a fundamental component in expressions left in zonal- and Hayakawa polynomial form. To this effect, expressions which are effectively an upper bound for the capacity are derived under the SMCN assumption, assuming noncentrality for H . Together, these two considerations will give insight into the (comparatively) little researched phenomena into severely fading channels and accounting for LOS between transmitters and receivers (see [1] as well as the Appendix A).
For observing the capacity, the joint pdf of the eigenvalues of (4) and (5) are derived; however, the form of these pdfs are challenging to work with for arbitrary n and p. The assumptions of this paper as well as the derivation of an upper bound for Case 1 and 2 will ease this investigation to allow clearer insight into the conceptual behavior in the case of a SMCN assumption H . The case when p = 2 will be of particular illustrative interest, for both Cases 1 and 2. This way, a contribution is also made to allow for arbitrary n: arbitrary degrees of freedom for the S M C W distributions under consideration in this paper, and arbitrary number of transmitter antennae in the MIMO environment.
The outline of this paper is as follows: in Section 2, the joint pdfs of the eigenvalues for both Case 1 and Case 2 are derived, and a specific upper bound for these densities are derived. These expressions are used in Section 3 to derive an upper bound for the capacity for Case 1 and Case 2. A particular exact expression for Case 2 is also given. Section 4 contains a discussion of obtained results and some potential areas for further work.

2. Eigenvalue Pdfs and an Upper Bound

As the eigenvalues of S (and W ) respectively play an essential role in understanding the capacity of a MIMO system subjected to this alternate distributional consideration for H , determining the joint pdf of the eigenvalues of S (and W ) are of interest. Suppose that λ 1 > λ 2 > . . . > λ p > 0 denote the real ordered eigenvalues of S , and let Λ = d i a g λ 1 , λ 2 , . . . , λ p . Additionally, let μ 1 > μ 2 > . . . > μ p > 0 denote the real ordered eigenvalues of Δ , and let Ω = d i a g μ 1 , μ 2 , . . . , μ p . The joint pdf of Λ is then given by [18])
f Λ = π p p 1 Γ ˜ p p k < l p λ k λ l 2 E U p f E Λ E H d E
where the pdf f · in the integral in (6) denotes the pdf of S , and U p denotes the unitary space of order p (see [18]). For Case 1 (see (4)) the joint pdf of Λ is then given by [10] (Note that k < l p λ k λ l is often referred to as the Vandermonde determinant.)
f Λ = π p p 1 det Λ n p Γ ˜ p p Γ ˜ p n det Σ n k < l p λ k λ l 2 R + t n p e t r t Δ × E U p e t r t Σ 1 E Λ E H 0 F ˜ 1 n ; t 2 Δ Σ 1 E Λ E H d E k t d t .
This equation, in terms of an integral over the unitary space, leads to a nonpractical representation of the joint pdf of the eigenvalues. Ref. [2] proved the following upper bound for X C n × p :
0 F ˜ 1 n ; X H X 0 F ˜ 0 X H X n = e t r X H X n
and illustrated the tightness of this upper bound. Thus, to circumvent the theoretical challenge in (8) we apply (8) to obtain an upper bound for this joint pdf, and use the splitting formula of [18]:
f Λ π p p 1 det Λ n p Γ ˜ p p Γ ˜ p n det Σ n k < l p λ k λ l 2 R + t n p e t r t Δ × E U p e t r t Σ 1 E Λ E H 0 F ˜ 0 t 2 Δ Σ 1 E Λ E H n d E k t d t = π p p 1 det Λ n p Γ ˜ p p Γ ˜ p n det Σ n k < l p λ k λ l 2 R + t n p e t r t Δ × E U p e t r t Σ 1 t 2 Δ Σ 1 n E Λ E H d E k t d t = π p p 1 det Λ n p Γ ˜ p p Γ ˜ p n det Σ n k < l p λ k λ l 2 R + t n p e t r t Δ 0 F ˜ 0 p Ψ , Λ k t d t
where Ψ = t Σ 1 t 2 Δ Σ 1 n and 0 F ˜ 0 p Ψ , Λ denotes the hypergeometric of double complex matrix argument [18]. For Case 2 (see (5)), in a similar way as above, this pdf (denoted by f u n c ( · ) for Case 2) is given by
f u n c Λ = π p p 1 det Λ n p Γ ˜ p p Γ ˜ p n k < l p λ k λ l 2 R + t n p e t r t Δ × E U p e t r t E Λ E H 0 F ˜ 1 n ; t 2 Δ E Λ E H d E k t d t π p p 1 det Λ n p Γ ˜ p p Γ ˜ p n k < l p λ k λ l 2 R + t n p e t r t Δ × E U p e t r t E Λ E H 0 F ˜ 0 t 2 Δ E Λ E H n d E k t d t = π p p 1 det Λ n p Γ ˜ p p Γ ˜ p n k < l p λ k λ l 2 R + t n p e t r t Δ 0 F ˜ 0 p Ψ * , Λ k t d t
where Ψ * = t I p t 2 Δ n . A useful representation of 0 F ˜ 0 p Ψ , Λ in (10) (and 0 F ˜ 0 p Ψ * , Λ in (11)) is given by [16]:
0 F ˜ 0 p Ψ , Λ = Γ ˜ p p det exp ψ i λ j π p p 1 2 k < l p λ k λ l k < l p ψ k ψ l
where ψ 1 > ψ 2 > . . . > ψ p > 0 denotes the real ordered eigenvalues of Ψ (and ψ 1 , * > ψ 2 , * > . . . > ψ p , * > 0 denotes the real ordered eigenvalues of Ψ * ). Using (11) and obtaining the unordered joint pdf of the eigenvalues (see [19]) leaves
f λ 1 , λ 2 , . . . , λ p π p p 1 det Λ n p p ! Γ ˜ p p Γ ˜ p n det Σ n k < l p λ k λ l 2 R + t n p e t r t Δ × Γ ˜ p p det exp ψ i λ j π p p 1 2 k < l p λ k λ l k < l p ψ k ψ l k t d t = π p p 1 2 det Λ n p p ! Γ ˜ p n det Σ n k < l p λ k λ l × R + t n p e t r t Δ k < l p ψ k ψ l det exp ψ i λ j k t d t
and
f u n c λ 1 , λ 2 , . . . , λ p π p p 1 det Λ n p p ! Γ ˜ p p Γ ˜ p n k < l p λ k λ l 2 R + t n p e t r t Δ × Γ ˜ p p det exp ψ i , * λ j π p p 1 2 k < l p λ k λ l k < l p ψ k , * ψ l , * k t d t = π p p 1 2 det Λ n p p ! Γ ˜ p n k < l p λ k λ l × R + t n p e t r t Δ k < l p ψ k , * ψ l , * det exp ψ i , * λ j k t d t .
There are clear theoretical challenges in using (8) or (10), or even (10) or (11) in the quest to determine explicit expressions for the capacity (see (2)). Using (13) and (14), an upper bound can now be obtained for the capacity using the marginal distribution for λ 1 in each respective case.

3. Capacity for the Case p = 2

In this section, expressions for an upper bound for the capacity in (2) when assuming an SMCN for H is derived when p = 2 for Case 1 and 2. An exact expression for C is also derived for Case 2.

3.1. Approximation for Case 1 and Case 2

Considering Case 1 (13) and setting p = 2 , see that
det exp ψ i λ j = exp ψ 1 λ 1 exp ψ 1 λ 2 exp ψ 2 λ 1 exp ψ 2 λ 2 = exp ψ 1 λ 1 ψ 2 λ 2 exp ψ 1 λ 2 ψ 2 λ 1 .
Therefore, by substituting (15) into (13) and determining (2):
C R + 2 log 1 + ρ 2 λ 1 R + π λ 1 λ 2 n 2 λ 1 λ 2 2 ! Γ ˜ 2 n det Σ n × R + t 2 n e t r t Δ ψ 1 ψ 2 exp ψ 1 λ 1 ψ 2 λ 2 exp ψ 1 λ 2 ψ 2 λ 1 k t d t d λ 2 d λ 1 = 1 Γ n Γ n 1 det Σ n R + log 1 + ρ 2 λ 1 R + t 2 n e t r t Δ ψ 1 ψ 2 × R + λ 1 n 1 λ 2 n 2 λ 1 n 2 λ 2 n 1 × exp ψ 1 λ 1 ψ 2 λ 2 exp ψ 1 λ 2 ψ 2 λ 1 d λ 2 k t d t d λ 1 .
Using the gamma integral relation (see [20], p. 337), see that
R + λ 1 n 1 λ 2 n 2 λ 1 n 2 λ 2 n 1 exp ψ 1 λ 1 ψ 2 λ 2 exp ψ 1 λ 2 ψ 2 λ 1 d λ 2 = λ 1 n 1 exp ψ 1 λ 1 ψ 2 n 1 Γ n 1 λ 1 n 1 exp ψ 2 λ 1 ψ 1 n 1 Γ n 1 λ 1 n 2 exp ψ 1 λ 1 ψ 2 n Γ n + λ 1 n 2 exp ψ 2 λ 1 ψ 1 n Γ n .
Substituting (16) into (16) leaves
C 1 Γ n Γ n 1 det Σ n R + R + log 1 + ρ 2 λ 1 t 2 n e t r t Δ ψ 1 ψ 2 × ( λ 1 n 1 exp ψ 1 λ 1 ψ 2 n 1 Γ n 1 λ 1 n 1 exp ψ 2 λ 1 ψ 1 n 1 Γ n 1 λ 1 n 2 exp ψ 1 λ 1 ψ 2 n Γ n + λ 1 n 2 exp ψ 2 λ 1 ψ 1 n Γ n ) k t d t d λ 1 = 1 det Σ n [ R + t 2 n e t r t Δ ψ 2 n 1 Γ n ψ 1 ψ 2 R + log 1 + ρ 2 λ 1 λ 1 n 1 exp ψ 1 λ 1 d λ 1 k t d t R + t 2 n e t r t Δ ψ 1 n 1 Γ n ψ 1 ψ 2 R + log 1 + ρ 2 λ 1 λ 1 n 1 exp ψ 2 λ 1 d λ 1 k t d t R + t 2 n e t r t Δ ψ 2 n Γ n 1 ψ 1 ψ 2 R + log 1 + ρ 2 λ 1 λ 1 n 2 exp ψ 1 λ 1 d λ 1 k t d t + R + t 2 n e t r t Δ ψ 1 n Γ n 1 ψ 1 ψ 2 R + log 1 + ρ 2 λ 1 λ 1 n 2 exp ψ 2 λ 1 d λ 1 k t d t ] .
For Case 2 (14), the result would follow in a similar manner:
C R + t 2 n e t r t Δ ψ 2 , * n 1 Γ n ψ 1 , * ψ 2 , * R + log 1 + ρ 2 λ 1 λ 1 n 1 exp ψ 1 , * λ 1 d λ 1 k t d t R + t 2 n e t r t Δ ψ 1 , * n 1 Γ n ψ 1 , * ψ 2 , * R + log 1 + ρ 2 λ 1 λ 1 n 1 exp ψ 2 , * λ 1 d λ 1 k t d t R + t 2 n e t r t Δ ψ 2 , * n Γ n 1 ψ 1 , * ψ 2 , * R + log 1 + ρ 2 λ 1 λ 1 n 2 exp ψ 1 , * λ 1 d λ 1 k t d t + R + t 2 n e t r t Δ ψ 1 , * n Γ n 1 ψ 1 , * ψ 2 , * R + log 1 + ρ 2 λ 1 λ 1 n 2 exp ψ 2 , * λ 1 d λ 1 k t d t .
Both these approximations for Case 1 and 2 illustrate the elegance which the scale mixture platform provides in a theoretical sense, and under the usual CN assumption (with corresponding weight k ( t ) ) the results reflect those obtained by [2]. In the zero mean case, i.e., when M = 0 and thus Λ = 0 , the results of [11] are closely reflected. A computational challenge remains due to the dependence of these expressions on ψ 1 , ψ 2 , , ψ p and ψ 1 , * , ψ 2 , * , , ψ p , * , the eigenvalues of Ψ and Ψ * respectively since both these matrices are functions of the mixing variable t.

3.2. Exact Expression for Case 2

An exact expression not consisting of zonal- or Hayakawa polynomials for the capacity (2) for Case 1 is not feasible, due to the intractability of the function 0 F ˜ 1 n ; t 2 Δ Σ 1 E Λ E H in (8). However, an exact expression can be derived for Case 2. For this expression, the computable form for the hypergeometric function of two complex matrix arguments from [21] is exploited. Thus, from (10):
f u n c Λ = π p p 1 det Λ n p Γ ˜ p p Γ ˜ p n k < l p λ k λ l 2 R + t n p e t r t Δ + Λ × E U p 0 F ˜ 1 n ; t 2 Δ E Λ E H d E k t d t = π p p 1 det Λ n p Γ ˜ p p Γ ˜ p n k < l p λ k λ l 2 R + t n p e t r t Δ + Λ × 0 F ˜ 1 n ; t Ω , t Λ k t d t .
Ref. [21] enables us to express the function 0 F ˜ 1 n ; t Ω , t Λ as follows:
0 F ˜ 1 n ; t Ω , t Λ = det 0 F 1 n p + 1 , t 2 μ i λ j π t p p 1 2 k < l p λ k λ l k < l p μ k μ l Γ ˜ p p Γ ˜ p n n p ! p
where 0 F 1 · ; · denotes the confluent hypergeometric function of scalar argument and det 0 F 1 · ; · denotes the determinant of a matrix with this confluent hypergeometric function of scalar argument as entries. Please note that 0 F 1 s + 1 ; x = s ! x s 2 I s 2 x and that I s 2 x = k = 0 1 k ! Γ s + k + 1 2 x 2 s + 2 k where Γ · denotes the usual gamma function (see [20], p. 919, eq. 8.445). Therefore
0 F 1 n p + 1 , t 2 μ i λ j = n p ! t 2 μ i λ j n p 2 k = 0 1 k ! Γ n p + k + 1 t μ i λ j n p + 2 k .
Using these expressions, it is theoretically possible to numerically solve for the capacity for any p by taking the expectation of (2) using the above for the pdf. Of course, this would prove computationally intensive, as the determinant of a (potentially) high dimensional matrix is an expensive operator—in addition to this, the entries of this determinant follow from (19), and would still require the marginal distribution of λ 1 to be numerically determined to evaluate (2). For particular insight into the theoretical framework in this case, assume that p = 2 :
det 0 F 1 n 1 , t 2 μ i λ j = 0 F 1 n 1 , t 2 μ 1 λ 1 0 F 1 n 1 , t 2 μ 1 λ 2 0 F 1 n 1 , t 2 μ 2 λ 1 0 F 1 n 1 , t 2 μ 2 λ 2 = 0 F 1 n 1 , t 2 μ 1 λ 1 0 F 1 n 1 , t 2 μ 2 λ 2 0 F 1 n 1 , t 2 μ 1 λ 2 0 F 1 n 1 , t 2 μ 2 λ 1 = n 2 ! t 2 μ 1 λ 1 n 2 2 k = 0 1 k ! Γ n 2 + k + 1 t μ 1 λ 1 n 2 + 2 k n 2 ! t 2 μ 2 λ 2 n 2 2 × m = 0 1 m ! Γ n 2 + m + 1 t μ 2 λ 2 n 2 + 2 m n 2 ! t 2 μ 1 λ 2 n 2 2 k = 0 1 k ! Γ n 2 + k + 1 t μ 1 λ 2 n 2 + 2 k × n 2 ! t 2 μ 2 λ 1 n 2 2 m = 0 1 m ! Γ n 2 + m + 1 t μ 2 λ 1 n 2 + 2 m = n 2 ! 2 k = 0 m = 0 t 2 k + 2 m k ! m ! Γ n + k 1 Γ n + m 1 μ 1 k μ 2 m λ 1 k λ 2 m λ 2 k λ 1 m .
By substituting (21) into (18) leaves:
0 F ˜ 1 n ; t Ω , t Λ = det 0 F 1 n p + 1 , t 2 μ i λ j π t λ 1 λ 2 μ 1 μ 2 π 2 π 2 Γ n Γ n 1 n 2 ! 2 = π 2 π 2 Γ n Γ n 1 n 2 ! 2 π t λ 1 λ 2 μ 1 μ 2 n 2 ! 2 × k = 0 m = 0 t 2 k + 2 m μ 1 k μ 2 m λ 1 k λ 2 m λ 2 k λ 1 m k ! m ! Γ n + k 1 Γ n + m 1 = π 3 Γ n Γ n 1 λ 1 λ 2 μ 1 μ 2 k = 0 m = 0 t 2 k + 2 m μ 1 k μ 2 m λ 1 k λ 2 m λ 2 k λ 1 m k ! m ! Γ n + k 1 Γ n + m 1
Thus, after considering (22), (18) is shown to be:
f u n c λ 1 , λ 2 = π 2 λ 1 λ 2 n 2 Γ ˜ 2 2 Γ ˜ 2 n λ 1 λ 2 2 R + t 2 n e t r t Δ + Λ π 3 Γ n Γ n 1 λ 1 λ 2 μ 1 μ 2 × k = 0 m = 0 t 2 k + 2 m μ 1 k μ 2 m λ 1 k λ 2 m λ 2 k λ 1 m k ! m ! Γ n + k 1 Γ n + m 1 k t d t = 1 μ 1 μ 2 k = 0 m = 0 μ 1 k μ 2 m λ 1 n 1 λ 2 n 2 λ 1 n 2 λ 2 n 1 λ 1 k λ 2 m λ 2 k λ 1 m k ! m ! Γ n + k 1 Γ n + m 1 × R + t 2 n + 2 k + 2 m e t r t Δ + Λ k t d t .
This pdf (23) can also be viewed as a weighted infinite sum with Poisson probabilities:
f u n c λ 1 , λ 2 = λ 1 n 1 λ 2 n 2 λ 1 n 2 λ 2 n 1 μ 1 μ 2 R + k = 0 m = 0 t 2 n + k + m e t r t Λ λ 1 k λ 2 m λ 2 k λ 1 m Γ n + k 1 Γ n + m 1 × t μ 1 k exp t μ 1 k ! t μ 2 m exp t μ 2 m ! k t d t
with Poisson parameters t μ 1 and t μ 2 . The noncentral aspect of this representation thus corresponds with similarly illustrated noncentral representations: in this case by relying on the noncentral matrix eigenvalues μ 1 and μ 2 as the parameters of Poisson probabilities, which has been observed across the literature: see for example [22,23], and more recently, [24] in the MIMO environment.
For evaluating the capacity (2), see that
C = R + 2 log 1 + ρ 2 λ 1 1 2 ! μ 1 μ 2 k = 0 m = 0 μ 1 k μ 2 m k ! m ! Γ n + k 1 Γ n + m 1 × R + λ 1 n 1 λ 2 n 2 λ 1 n 2 λ 2 n 1 λ 1 k λ 2 m λ 2 k λ 1 m × R + t 2 n + 2 k + 2 m e t r t Δ + Λ k t d t d λ 2 d λ 1 = 1 μ 1 μ 2 k = 0 m = 0 μ 1 k μ 2 m k ! m ! Γ n + k 1 Γ n + m 1 R + t 2 n + 2 k + 2 m e t r t Δ × R + log 1 + ρ 2 λ 1 exp t λ 1 × R + λ 1 n 1 λ 2 n 2 λ 1 n 2 λ 2 n 1 λ 1 k λ 2 m λ 2 k λ 1 m exp t λ 2 d λ 2 d λ 1 k t d t
where using the gamma integral relation again leaves
C = 1 μ 1 μ 2 k = 0 m = 0 μ 1 k μ 2 m k ! m ! Γ n + k 1 Γ n + m 1 R + t n + 2 k + 2 m e t r t Δ × [ t m 1 Γ n + m 1 R + log 1 + ρ 2 λ 1 exp t λ 1 λ 1 n + k 1 d λ 1 t k 1 Γ n + k 1 R + log 1 + ρ 2 λ 1 exp t λ 1 λ 1 n + m 1 d λ 1 t m Γ n + m R + log 1 + ρ 2 λ 1 exp t λ 1 λ 1 n + k 2 d λ 1 + t k Γ n + k R + log 1 + ρ 2 λ 1 exp t λ 1 λ 1 n + m 2 d λ 1 ] k t d t .
This exact expression for the capacity provides meaningful theoretical insight to the form of the capacity when considering uncorrelated MIMO channels subject to an SMCN propagation channel H , and has the results of [2,12] as a special case for corresponding choices of k ( t ) . The view of (24) solidifies the noncentral consideration for this bivariate case and acts as a "benchmark" of the noncentral representation which is often encountered in the literature.

4. Conclusions

In this paper, a motivated argument was presented for the introduction of an SMCN distribution for the underlying model of the propagation matrix H in a MIMO system environment. Not only does this assumption relieve the restriction of normality for the practitioner, but in this case a nonzero mean is also assumed as part of this model. Capacity, an essential measure of information within the MIMO context, relies on the distribution of H H H , and this paper investigates the eigenvalue relations of this SMCW distribution, as well as its uncorrelated counterpart. An upper bound for the capacity for both these cases are derived for the case when p = 2 , which in a MIMO context implies a two transmitter environment and is often employed for dual branch MIMO systems in practice, while considering arbitrary number of receivers n. This theoretical framework of the scale mixture assumption for the propagation matrix and the mathematical effect of this assumption in theoretically understanding and quantifying the capacity of such a system enjoys specific conceptual insight.
For higher orders of p the complexity of the calculation may be well reached in computational terms, but in a theoretical sense the representations become challenging—even more so because of the nonzero mean assumption: it is essential to note that ψ 1 , ψ 2 , , ψ p and ψ 1 , * , ψ 2 , * , , ψ p , * are eigenvalues of complex matrices inherently dependent on the scale mixture variable t. These challenges are mostly due to the potential high dimensionality of an arbitrary p, and the provision of Δ via Ψ or Ψ * in accounting for LOS in a practical consideration. Perhaps this does not facilitate convenient immediate computation for the capacity, but it does give us insight into the theoretical framework of the capacity when assuming H S M C N n × p M , Φ Σ , h which remains a meaningful and popular choice for the departure from normality within the statistical arena. In future, one might consider the further investigation of higher orders of p, the computational complexity involved with it, and the derivations and interpretations of other measures of information such as types of condition numbers and outage probabilities for models under the key assumptions made in this paper. A refreshing contribution to the literature could also be made by considering complex matrix variate Gaussian mixtures for the underlying distribution, as described in Section 1.

Funding

This work forms part of the grant RDP296/2021 based at the University of Pretoria, South Africa, and is partially supported by the Department of Statistics, University of Pretoria, South Africa.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The support of the StatDisT group and the Department of Statistics at the University of Pretoria is acknowledged. Special thanks to D.A. Burger and T. Botha for constructive discussions during the development of this paper. Thank you also to the two reviewers for constructive and meaningful insight into this paper.

Conflicts of Interest

The author declares no conflict of interest.

Abbreviations

MIMOmultiple-input-multiple-output
LOSline-of-sight
CNcomplex matrix variate normal
SMCNscale mixture of complex matrix variate normal
SMCWscale mixture of complex Wishart
pdfprobability density function

Appendix A

The statistical characterization of the fading (or, degradation) of the signal between transmitters and receivers in the case of a nonzero M case is described by the Rician distribution. This distribution has its direct roots in the usual CN assumption for H ; thus it is necessary to report the equivalent fading distribution in the case where H is assumed to be an SMCN distribution. Suppose R is the fading variable; then, under the SMCN assumption, the pdf is given by
f r = R + r σ 2 t 1 exp r 2 + s 2 2 σ 2 t 1 I 0 r s σ 2 t 1 k t d t , r > 0
where s 2 , σ 2 > 0 and I v . denotes the modified Bessel function of the first kind [20]. Note in particular that when s 2 = 0 , then (A1) reduces to the Rayleigh type distribution (or in other words, a Rayleigh type fading scenario) when the assumption of H is that of a zero mean SMCN distribution (see [11]). It is essential to observe that for the specific case when H is a CN candidate (for the specific choice of k t ) then (A1) reflects the usual Rician fading scenario–see also [10,12]. In addition to this, it is valuable to note the recent consideration of the Rayleigh type model emanating from a scale mixture of normal approach in the work of [25].

References

  1. Choi, S.H. Severly Fading MIMO Channels. Unpublished. Master’s Thesis, University of Canterbury, Christchurch, New Zealand, 2007. [Google Scholar]
  2. Ratnarajah, T.; Vaillancourt, R.; Alvo, M. Complex random matrices and Rician channel capacity. Probl. Inf. Transm. 2005, 41, 1–22. [Google Scholar] [CrossRef] [Green Version]
  3. Selim, B.; Alhussein, O.; Muhaidat, S.; Karagiannidis, G.K.; Liang, J. Modeling and analysis of wireless channels via the mixture of Gaussian distribution. IEEE Trans. Veh. Technol. 2015, 65, 8309–8321. [Google Scholar] [CrossRef] [Green Version]
  4. Alhussein, O.; Selim, B.; Assaf, T.; Muhaidat, S.; Liang, J.; Karagiannidis, G.K. A generalized mixture of Gaussians for fading channels. In Proceedings of the 2015 IEEE 81st Vehicular Technology Conference (VTC Spring), Piscataway, NJ, USA, 1–14 May 2015; pp. 1–6. [Google Scholar]
  5. Chu, K.C. Estimation and decision for linear systems with elliptical random processes. IEEE Trans. Autom. Control. 1973, 18, 499–505. [Google Scholar]
  6. Andrews, D.F.; Mallows, C.L. Scale mixtures of normal distributions. J. R. Stat. Soc. Ser. (Methodol.) 1974, 36, 99–102. [Google Scholar] [CrossRef]
  7. West, M. On scale mixtures of normal distributions. Biometrika 1987, 74, 646–648. [Google Scholar] [CrossRef]
  8. Provost, S.B.; Cheong, Y.H. The distribution of Hermitian quadratic forms in elliptically contoured random vectors. J. Stat. Plan. Inference 2002, 102, 303–316. [Google Scholar] [CrossRef]
  9. Choi, S.H.; Smith, P.; Allen, B.; Malik, W.Q.; Shafi, M. Severely fading MIMO channels: Models and mutual information. In Proceedings of the 2007 IEEE International Conference on Communications, Glasgow, Scotland, 24–28 June 2007; pp. 4628–4633. [Google Scholar]
  10. Ferreira, J.T.; Bekker, A. A unified complex noncentral Wishart type distribution inspired by massive MIMO systems. J. Stat. Distrib. Appl. 2019, 6, 1–19. [Google Scholar] [CrossRef]
  11. Ferreira, J.T.; Bekker, A.; Arashi, M. Advances in Wishart-type modelling of channel capacity. Revstat-Stat. J. 2020, 18, 237–256. [Google Scholar]
  12. Ferreira, J.T.; Bekker, A. Weighted condition number distributions emanating from complex noncentral Wishart type matrices. In Computational and Methodological Statistics and Biostatistics; Springer: Berlin/Heidelberg, Germany, 2020; pp. 99–119. [Google Scholar]
  13. Ollila, E.; Eriksson, J.; Koivunen, V. Complex elliptically symmetric random variables—generation, characterization, and circularity tests. IEEE Trans. Signal Process. 2010, 59, 58–69. [Google Scholar] [CrossRef]
  14. Kang, M.; Alouini, M.S. Capacity of MIMO Rician channels. IEEE Trans. Wirel. Commun. 2006, 5, 112–122. [Google Scholar] [CrossRef]
  15. McKay, M.R.; Collings, I.B. General capacity bounds for spatially correlated Rician MIMO channels. IEEE Trans. Inf. Theory 2005, 51, 3121–3145. [Google Scholar] [CrossRef]
  16. Khatri, C. Non-central distributions of ith largest characteristic roots of three matrices concerning complex multivariate normal populations. Ann. Inst. Stat. Math. 1969, 21, 23–32. [Google Scholar] [CrossRef]
  17. Matthaiou, M.; Laurenson, D.I.; Wang, C.X. On analytical derivations of the condition number distributions of dual non-central Wishart matrices. IEEE Trans. Wirel. Commun. 2009, 8, 1212–1217. [Google Scholar] [CrossRef] [Green Version]
  18. James, A.T. Distributions of matrix variates and latent roots derived from normal samples. Ann. Math. Stat. 1964, 35, 475–501. [Google Scholar] [CrossRef]
  19. Bain, L.J.; Engelhardt, M. Introduction to Probability and Mathematical Statistics; Brooks/Cole: Pacific Grove, CA, USA, 1987. [Google Scholar]
  20. Gradshteyn, I.S.; Ryzhik, I.M. Table of Integrals, Series, and Products; Academic Press: Cambridge, MA, USA, 2014. [Google Scholar]
  21. Gross, K.I.; Richards, D.S.P. Total positivity, spherical series, and hypergeometric functions of matrix argument. J. Approx. Theory 1989, 59, 224–246. [Google Scholar] [CrossRef] [Green Version]
  22. Patnaik, P. The non-central χ2-and F-distribution and their applications. Biometrika 1949, 36, 202–232. [Google Scholar] [CrossRef] [PubMed]
  23. Ferreira, J.T.; Bekker, A.; Arashi, M. Bivariate noncentral distributions: An approach via the compounding method. S. Afr. Stat. J. 2016, 50, 103–122. [Google Scholar]
  24. Bekker, A.; Ferreira, J.T. Bivariate gamma type distributions for modeling wireless performance metrics. Stat. Optim. Inf. Comput. 2018, 6, 335–353. [Google Scholar] [CrossRef]
  25. Saeedzarandi, M.; Nezamabadi-pour, H.; Jamalizadeh, A. Dual-Tree complex wavelet coefficient magnitude modeling using scale mixtures of Rayleigh distribution for image denoising. Circuits Syst. Signal Process. 2020, 39, 2968–2993. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Ferreira, J.T. Upper Bounds for the Capacity for Severely Fading MIMO Channels under a Scale Mixture Assumption. Entropy 2021, 23, 845. https://doi.org/10.3390/e23070845

AMA Style

Ferreira JT. Upper Bounds for the Capacity for Severely Fading MIMO Channels under a Scale Mixture Assumption. Entropy. 2021; 23(7):845. https://doi.org/10.3390/e23070845

Chicago/Turabian Style

Ferreira, Johannes T. 2021. "Upper Bounds for the Capacity for Severely Fading MIMO Channels under a Scale Mixture Assumption" Entropy 23, no. 7: 845. https://doi.org/10.3390/e23070845

APA Style

Ferreira, J. T. (2021). Upper Bounds for the Capacity for Severely Fading MIMO Channels under a Scale Mixture Assumption. Entropy, 23(7), 845. https://doi.org/10.3390/e23070845

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop