High-Precision Iterative Preconditioned Gauss–Seidel Detection Algorithm for Massive MIMO Systems

Ahmad, Mushtaq; Zhang, Xiaofei; Khoso, Imran A.; Shi, Xinlei; Qian, Yang

doi:10.3390/electronics11223806


by Mushtaq Ahmad, Xiaofei Zhang *, Imran A. Khoso, Xinlei Shi and Yang Qian

College of Electronic and Information Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China

* Author to whom correspondence should be addressed.

Electronics 2022, 11(22), 3806; https://doi.org/10.3390/electronics11223806

Submission received: 30 September 2022 / Revised: 27 October 2022 / Accepted: 14 November 2022 / Published: 19 November 2022

(This article belongs to the Special Issue Massive MIMO Technology for 5G and Beyond)


Abstract:

Signal detection is a serious challenge for uplink massive multiple-input multiple-output (MIMO) systems. The traditional linear minimum-mean-squared error (MMSE) detector achieves good detection performance for such systems, but involves matrix inversion, which is computationally expensive due to the large number of antennas. Thus, several iterative methods such as Gauss–Seidel (GS) have been studied to avoid the direct matrix inversion required in the MMSE. In this paper, we improve the GS iteration in order to enhance the detection performance of massive MIMO systems with a large loading factor. By exploiting the property of massive MIMO systems, we introduce a novel initialization strategy that gives the proposed algorithm a quick start. While maintaining the same accuracy of the designed detector, the computing load is further reduced by approximating the initialization. In addition, an effective preconditioner is proposed that transforms the original GS iteration into a new one that has the same solution, but a faster convergence rate. Numerical results show that the proposed algorithm is superior to state-of-the-art detectors in terms of both complexity and performance. Moreover, it exhibits error performance identical to that of the linear MMSE with complexity one order lower.

Keywords:

massive MIMO; linear MMSE; signal detection; iterative methods; low-complexity

1. Introduction

Wireless communications technology has lately seen remarkable growth in terms of supporting a large number of mobile users and offering high throughput, with the next generation of cellular networks trying to support exceptional data rates [1], large IoT networks [2], and massive machine-to-machine communications [3]. Moreover, modern wireless communication systems require high reliability, high energy and spectral efficiency, and high transmission capacity. In order to meet these demands, massive multiple-input multiple-output (MIMO) technology has been applied in the Fifth-Generation (5G) cellular network to manage the limited spectrum resource. It has been considered a key technology to meet the requirements of high data rates for 5G and beyond wireless systems [4,5,6]. In massive MIMO, a large number of antennas at the base station (BS) are employed to serve a relatively small number of user terminals. By equipping a large number of antennas, more degrees of freedom can be obtained in the wireless channel to simultaneously accommodate more information data, which offers greatly improved energy efficiency and spectral efficiency and provides better reliability compared with conventional small-scale MIMO systems [7,8,9,10]. However, these advantages of massive MIMO systems (over small-scale MIMO) come at the cost of a considerably increased computational burden at the BS. The numerous data symbols transmitted by different user terminals undergo multipath propagation, and undesired copies of the data symbols coming from different directions of arrival [11,12] and with different delays are combined with the direct signal at the receiver side, which corrupts the received symbols. Thus, one of the most computationally intensive tasks is the detection of symbols in the uplink (users transmit to the BS), since the presence of many antennas calls for detection techniques that scale favorably to higher dimensions. The receiver at the BS observes a linear superposition of the independently transmitted information bits, and the task of the signal detection technique is to separate those transmitted information bits. As the number of antennas at the BS grows, the complexity of the detection process increases exponentially. Therefore, the detection process becomes very complex in massive MIMO systems.

The traditional maximum likelihood (ML) detection method [13] can obtain optimality via minimizing the probability of detection error, but computational cost scales exponentially with the number of transmit antennas, which is, hence, computationally prohibitive for large multi-antenna systems. The k-best detection method [14] and the sphere decoding (SD) method [15] are two variants of ML detectors, which balance error performance and computational complexity by controlling the number of nodes in every search phase. Nonetheless, the QR decomposition in these nonlinear detection schemes can lead to low parallelism and high computational cost because of the inclusion of matrix operations such as element elimination. Therefore, to cope with the complexity issue, researchers have considered suboptimal linear detection methods, such as the linear minimum-mean-squared error (MMSE) detector [16], which is computationally less expensive, and it has shown good performance for massive MIMO systems, in particular for a favorable propagation environment and a large loading ratio (M/K) [4], where M and K denote the number of receive and transmit antennas, respectively. It is considered near-optimal for massive MIMO and occupies the benchmark place for most linear iterative detectors. However, the linear MMSE detector needs to compute the inverse of a matrix, and the complexity of matrix inversion increases cubically with the number of users. In other words, the matrices involved in the MMSE become large in dimension for large MIMO systems, and as a result, obtaining the inverse of such high-dimensional matrices is computationally expensive as it increases the cost of receiver development and introduces a considerable delay in processing.

To cut the overhead of high-dimensional matrix inversion while achieving near-MMSE error rate performance, recent works have looked into approximate or implicit matrix inversion methods. The Neumann-series-based detection, which replaces the matrix inversion by either matrix–matrix multiplications or matrix vector multiplications, was developed in [17,18,19]. It reduces the complexity to some extent, but for

N_{i t e r} \geq 3

(N_iter denotes the number of iterations), it has even higher complexity than the exact matrix inversion method. Newton-iteration (NI)-based detection [20] was proposed to speed up the convergence. Nevertheless, the main disadvantage of the NI method is the same as that of the Neumann method: its computational complexity exceeds that of exact matrix inversion when more iterations are used. To further reduce the computational cost, numerous implicit methods such as the Gauss–Seidel (GS) detector [21,22,23,24], Jacobi method [25], Richardson iteration [26,27], accelerated over-relaxation (AOR) [28,29], symmetric successive over-relaxation (SSOR) [30], the Lanczos-method-based detector [31], and the conjugate gradient detector [32] have been introduced. These methods compute the estimates of the transmitted symbols without ever computing the matrix inverse. Moreover, in [33], an efficient initialization technique for uplink massive MIMO linear detection methods was developed. The main purpose was to overcome the problem of computing the Gram matrix and the matched-filter output vector in a pre-processing phase. Although the aforementioned iterative detection schemes are able to realize near-MMSE performance with relatively fewer computations, they do not consistently maintain a good error rate performance when the number of users grows. Thus, it is very important to develop new detection algorithms to realize a practical receiver for the massive MIMO system with acceptable computational complexity.

1.1. Contributions

The main objective of this research is to solve the linear MMSE detection problem for massive MIMO systems using a low-complexity iterative algorithm. To this end, an enhanced version of the GS-based MMSE algorithm is proposed, which replaces the direct matrix inversion by matrix–vector multiplications. Therefore, unlike the complexity of traditional MMSE method, which is approximately proportional to the cube of the number of users, the complexity of the proposed algorithm is approximately proportional to the square of the number of users for the worst-case scenario. We analyze the property of massive MIMO, and based on that analysis, a novel initializer is proposed. It is then approximated by exploiting the channel hardening property of massive MIMO systems to further reduce the computational load. The proposed initializer achieves the error performance of the conventional diagonal-based initializer with significantly reduced computations. In order to further accelerate the convergence rate of the proposed algorithm, we introduce an efficient preconditioner, which reduces the condition number of the coefficient matrix. The preconditioner converts the original linear system into an equivalent one with the same solution, but a better convergence rate. Computational complexity analysis is presented and simulation results are provided to numerically validate the superiority of the proposed detection algorithm.

Our results demonstrate that the proposed approach outperforms the conventional GS-based detectors for large loading factors and substantially reduces the complexity of the linear MMSE without sacrificing the error performance.

1.2. Paper Outline

The remainder of the article is structured as follows: Section 2 details the massive MIMO system model and discusses linear MMSE detection. In Section 3, a low-complexity approach for estimating the transmitted information is presented. Additionally, Section 3 describes the proposed initial solution and develops an efficient preconditioning technique. Simulation results and the analysis of the results are demonstrated in Section 4. Additionally, this section computes and analyzes the computational complexity of the proposed approach and compares it with the traditional massive MIMO detection approaches. Finally, the conclusions are drawn in Section 5.

1.3. Notation

Throughout this article, lowercase and bold uppercase letters denote column vectors and matrices, respectively. The K × K identity matrix is represented by I_K. We denote the inverse and Hermitian transpose, respectively, by (.)⁻¹ and (.)^H. The vector a in the ith iteration is denoted by

a^{(i)}

. a_n is the nth entry of vector a, and for the element in the nth row and mth column of matrix A, we use A_n,m.

2. Massive MIMO System Model and Signal Detection

We considered an uncoded uplink massive MIMO system with M active antennas at the BS serving K single-antenna users simultaneously, as shown in Figure 1. Usually, the number of antennas at the BS is much larger than the number of users in massive MIMO systems. Suppose the transmitted symbol sent from the mth user is denoted as

x_{m} (1 \leq m \leq K)

, which comprises

J (= \log_{2} Q)

bits per symbol, which is generated from a Q-ary constellation

\mathcal{X}

with

\sum_{x \in \mathcal{X}} x = 0

and

\sum_{x \in \mathcal{X}} {| x |}^{2} = Q

. The generated data are mapped and then demultiplexed into K separate independent bit streams, which results in the transmitted vector. For different users, the bit streams are intended to be simultaneously sent to the BS. Thus, the transmitted data stream vector of K users is denoted by

x = {[x_{1}, x_{2}, \dots, x_{K}]}^{T}

. Then, the standard input–output relation to model a MIMO wireless channel can be expressed as [34]

y = H x + n,

(1)

where

y = {[y_{1}, y_{2}, \dots, y_{M}]}^{T}

is the M × 1 received symbol vector and

H = [h_{1}, h_{2}, \dots, h_{K}]

denotes the M × K flat Rayleigh fading channel matrix, where the

m (1 \leq m \leq K) th

column vector

h_{m}

designates the channel response between the mth transmit antenna and all receive antennas. Moreover, n denotes the additive white Gaussian noise (AWGN) vector with independent zero-mean components of variance σ². In this case, the average received signal-to-noise ratio (SNR) can be computed as K/σ². For simplicity, we assumed that the channel state information is perfectly known at the receiver.

In multiuser uplink large MIMO systems, the information bits transmitted by the various users to the BS overlap and result in multiuser interference at the receiver. The multiuser signal detector performs the task of estimating the transmitted signal vector at the BS from the noisy received signal vector. The signal estimate at the BS under the linear MMSE criterion is given as [35]

\hat{x} = {(H^{H} H + \frac{σ^{2}}{E_{x}} I_{K})}^{- 1} H^{H} y = A^{- 1} \hat{y},

(2)

where

E_{x}

is the average symbol energy and A is the regularized Gram matrix (or MMSE filtering matrix), which can be described as

A = H^{H} H + \frac{σ^{2}}{E_{x}} I_{K} = G + \frac{σ^{2}}{E_{x}} I_{K},

(3)

where G is the Gram matrix. The vector

\hat{y}

in (2) denotes the matched filter vector, and it is given by

\hat{y} = H^{H} y .

(4)

The underlying idea of the linear MMSE detector (2) is to invert the effect of the MIMO channel matrix. The matrix inversion involved in the MMSE detector makes it challenging since it entails cubic computational complexity with respect to the number of user terminals, which eventually restricts its possible application in future large wireless systems such as beyond-5G and Sixth-Generation (6G) systems. It can easily be observed that finding the solution of the linear MMSE problem amounts to solving a set of linear equations of the form Ax = b. Hence, numerous alternative methods that do not require the matrix inversion, such as the GS method, have successfully been utilized for massive MIMO detection.
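As a point of reference, Equations (2)–(4) can be sketched in a few lines of NumPy. The function name `mmse_detect`, the BPSK symbols, and the chosen dimensions and noise level are assumptions made for illustration only; the sketch solves the linear system rather than forming the inverse explicitly.

```python
import numpy as np

def mmse_detect(H, y, sigma2, Ex=1.0):
    """Linear MMSE estimate following Eqs. (2)-(4)."""
    K = H.shape[1]
    G = H.conj().T @ H                    # Gram matrix G = H^H H
    A = G + (sigma2 / Ex) * np.eye(K)     # regularized Gram matrix, Eq. (3)
    y_hat = H.conj().T @ y                # matched-filter output, Eq. (4)
    return np.linalg.solve(A, y_hat)      # solves A x = y_hat, Eq. (2)

# Illustrative setup (assumed values, not the paper's simulation settings).
rng = np.random.default_rng(0)
M, K, sigma2 = 128, 16, 0.1
H = (rng.standard_normal((M, K)) + 1j * rng.standard_normal((M, K))) / np.sqrt(2)
x = rng.choice([-1.0, 1.0], size=K) + 0j  # BPSK symbols with unit energy
n = np.sqrt(sigma2 / 2) * (rng.standard_normal(M) + 1j * rng.standard_normal(M))
y = H @ x + n                             # input-output relation, Eq. (1)
x_hat = mmse_detect(H, y, sigma2)
print(np.round(x_hat.real, 2))
```

The call to `np.linalg.solve` still costs on the order of K³ operations, which is the cost the iterative detectors discussed in this paper aim to avoid.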

3. Proposed Algorithm

It was discussed in Section 2 that, for massive MIMO systems, the linear MMSE detection algorithm can realize good performance. However, a large number of users and antennas at the BS increases the computational burden of MIMO detection by orders of magnitude. Unlike conventional MIMO, in massive MIMO systems, the channel hardening phenomenon can be exploited due to the large number of antennas to cancel the characteristics of small-scale fading [4]. In this phenomenon, as the number of antennas increases, the variance of the mutual information of the MIMO channel grows very slowly relative to its mean or even shrinks [36]. As the numbers of transmit and receive antennas increase while their ratio is kept unchanged, the singular-value distribution of the MIMO channel matrix becomes less sensitive to the actual distribution of the entries of the channel matrix [36], which is due to the Marchenko–Pastur theorem [37]. Channel hardening can be observed in a system when [38]

\frac{∥ h_{m k} ∥^{2}}{E [∥ h_{m k} ∥^{2}]} \to 1,

(5)

almost surely as M → ∞. Equation (5) states that the gain

∥ h_{m k} ∥^{2}

of an arbitrary fading channel

h_{m k}

is close to its mean value when there are many antennas.

The interesting characteristic of this phenomenon is that it becomes more dominant when the number of receive antennas is much greater than the number of transmit antennas. Furthermore, the MMSE filtering matrix is Hermitian positive definite for massive MIMO systems, and it was shown in [4] that each entry of the diagonal component converges to a fixed value M. This is due to the fact that, when the number of receive antennas is very large compared to the number of users, the channel matrix is asymptotically orthogonal [4]. Since the components of H are i.i.d. random variables, H has full column rank, so Hd = 0 holds only for the zero vector d, and for an arbitrary non-zero vector f,

{(H f)}^{H} H f = f^{H} (H^{H} H) f > 0 .

(6)

Moreover,

G = H^{H} H = {(H^{H} H)}^{H} = G^{H} .

(7)

Hence, the MMSE filtering matrix is positive definite and Hermitian. Using these properties of large MIMO systems, iterative approaches can be applied to compute the approximate solution with significantly lower complexity. Consequently, various approximate iterative detection algorithms with low complexity are being developed or improved to achieve near-optimal error rate performance. Among these detection algorithms, GS-based detection [21] achieves good detection accuracy. We further improved the performance of the conventional GS and reduced the computational complexity in this work.
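The properties in (6) and (7), together with the tendency of the Gram matrix to approach a scaled identity, can be checked numerically. The following is a minimal sketch under assumed settings (i.i.d. complex Gaussian entries with unit variance, K = 8, and a helper named `gram_deviation` introduced only for this illustration).

```python
import numpy as np

def gram_deviation(M, K, rng):
    """Return max |G/M - I| and the smallest eigenvalue of G = H^H H."""
    H = (rng.standard_normal((M, K)) + 1j * rng.standard_normal((M, K))) / np.sqrt(2)
    G = H.conj().T @ H
    assert np.allclose(G, G.conj().T)        # Hermitian, Eq. (7)
    dev = np.max(np.abs(G / M - np.eye(K)))  # distance of G/M from the identity
    lam_min = np.linalg.eigvalsh(G).min()    # positive value means positive definite, Eq. (6)
    return dev, lam_min

rng = np.random.default_rng(1)
K = 8
results = {M: gram_deviation(M, K, rng) for M in (16, 128, 1024)}
for M, (dev, lam_min) in results.items():
    print(f"M={M:5d}  max|G/M - I|={dev:.3f}  lambda_min={lam_min:.2f}")
```

For a fixed number of users, the deviation of G/M from the identity is expected to shrink as M grows, which is the behavior that the initializer in Section 3.1 relies on.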

Consider the linear system:

A u = b,

(8)

where A is a square matrix and u and b are K × 1 vectors. Equation (8) is equivalent to the MMSE problem (2). Since it is computationally expensive to solve (8) directly, we applied the GS method to solve it iteratively. As previously mentioned, the MMSE filtering matrix A is Hermitian positive definite for massive MIMO systems, and we can decompose it as

A = D + L + L^{H},

(9)

where D, L, and

L^{H}

, respectively, stand for the diagonal part and the strictly lower and strictly upper triangular parts of A. Then, to reconstruct the transmitted signal vector, the GS iteration can be expressed as

\begin{matrix} x_{i}^{(k)} = \frac{1}{A_{i i}} ({\hat{y}}_{i} - \sum_{j < i} A_{i j} x_{j}^{(k)} - \sum_{j > i} A_{i j} x_{j}^{(k - 1)}), \\ i = 1, 2, \dots, K, \end{matrix}

(10)

where

x^{(0)}

is an arbitrary initial vector,

A_{i j}

denotes the entry of A in the ith row and jth column, and

{\hat{y}}_{i}

,

x_{i}^{(k)}

, and

x_{i}^{(k - 1)}

represent the ith entry of the matched-filter output vector

\hat{y}

and of the estimated transmitted symbol vectors

x^{(k)}

and

x^{(k - 1)}

, respectively. The GS method was applied in [21] to detect the signal vector. A new initializer and efficient preconditioning technique are proposed in this section to make the GS method applicable in practical massive MIMO scenarios.
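The GS update in (10) can be written as a short NumPy routine. This is a sketch under assumed test data (a 128 × 16 Gaussian channel, an all-ones transmit vector, and a zero initial vector); the function name `gauss_seidel` is chosen here for illustration.

```python
import numpy as np

def gauss_seidel(A, b, x0, n_iter):
    """Gauss-Seidel sweeps for A x = b, following Eq. (10)."""
    x = np.array(x0, dtype=complex)      # copy so the caller's x0 is untouched
    K = b.shape[0]
    for _ in range(n_iter):
        for i in range(K):
            # x[:i] already holds iteration-k values, x[i+1:] still holds k-1 values
            acc = A[i, :i] @ x[:i] + A[i, i + 1:] @ x[i + 1:]
            x[i] = (b[i] - acc) / A[i, i]
    return x

# Illustrative data (assumed values).
rng = np.random.default_rng(2)
M, K, sigma2 = 128, 16, 0.1
H = (rng.standard_normal((M, K)) + 1j * rng.standard_normal((M, K))) / np.sqrt(2)
A = H.conj().T @ H + sigma2 * np.eye(K)  # Hermitian positive definite
b = H.conj().T @ (H @ np.ones(K) + 0.1 * rng.standard_normal(M))
x_ref = np.linalg.solve(A, b)
for n_iter in (1, 2, 4, 100):
    x_gs = gauss_seidel(A, b, np.zeros(K), n_iter)
    print(n_iter, np.linalg.norm(x_gs - x_ref) / np.linalg.norm(x_ref))
```

Each sweep uses only matrix–vector style operations on the rows of A, so no matrix inverse is formed at any point.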

3.1. Proposed Initialization

A proper initialization can lead to faster convergence and affects both the detection accuracy and the complexity of the final solution. Iterative methods usually use a zero vector as the initialization, which requires more iterations to reach the final estimate. The computational cost of each iteration in massive MIMO systems is very high due to the large number of antenna elements. In addition, the conventional GS detector uses the diagonal component as the initialization. Though it obtains better results, this comes at the cost of an increased computational burden. Hence, finding the optimal solution with fewer iterations is crucial to implementing massive MIMO.

According to random matrix theory, i.e., the Marchenko–Pastur theorem, when the entries of the channel matrix H are independent and identically distributed with zero mean and the numbers of rows and columns tend to infinity, i.e.,

M, K \to \infty

, while their ratio tends to a constant (

M / K \to constant

), the off-diagonal entries of the matrix

H^{H} H

tend to zero, and its diagonal entries tend to a certain constant. Thus, for massive MIMO systems, all diagonal components of

H^{H} H

are positive and [4]

H^{H} H \approx M I,

(11)

and the eigenvalues of A converge to a fixed deterministic distribution [4]. Inspired by this, the matrix A can be approximated as

A_{i, j} = \begin{cases} λ_{max}, & i = j, \\ 0, & i \neq j, \end{cases}

(12)

where

λ_{m a x}

is the maximum eigenvalue of the MMSE filtering matrix. Equation (12) shows that each diagonal entry of A is approximately equal to

λ_{m a x}

. Based on the above analysis, we propose a low-complexity initialization given as follows:

x^{(0)} = \frac{1}{λ_{m a x}} \hat{y} .

(13)

The proposed initialization technique significantly accelerates the convergence rate compared to conventional zero vector initialization and achieves the desired detection performance with few iterations, which reduces the complexity of the proposed detector significantly. Note, however, that the proposed initializer depends on

λ_{m a x}

, which is difficult to determine in practice. However, since the elements of H are i.i.d. complex Gaussian random variables,

H^{H} H

is a complex central Wishart matrix. Hence, as M increases, the largest eigenvalue of A converges to a deterministic value [4]:

{\hat{λ}}_{max} = M {(1 + \sqrt{\frac{K}{M}})}^{2},

(14)

and from (14), it can be noted that the proposed initializer only depends on the system parameters.
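The initializer in (13) can be illustrated numerically. The sketch below uses the K/M form of the eigenvalue estimate that appears in Line 14 of Algorithm 1; the function name `initial_solution`, the BPSK symbols, and the dimensions are assumptions for illustration. It compares the estimate with the true largest eigenvalue and compares the distance to the MMSE solution against that of a zero-vector start.

```python
import numpy as np

def initial_solution(y_hat, M, K):
    """x^(0) = y_hat / lam_hat with lam_hat = M (1 + sqrt(K/M))^2."""
    lam_hat = M * (1.0 + np.sqrt(K / M)) ** 2
    return y_hat / lam_hat, lam_hat

# Illustrative setup (assumed values).
rng = np.random.default_rng(3)
M, K, sigma2 = 128, 16, 0.1
H = (rng.standard_normal((M, K)) + 1j * rng.standard_normal((M, K))) / np.sqrt(2)
x = rng.choice([-1.0, 1.0], size=K) + 0j
y = H @ x + np.sqrt(sigma2 / 2) * (rng.standard_normal(M) + 1j * rng.standard_normal(M))

A = H.conj().T @ H + sigma2 * np.eye(K)
y_hat = H.conj().T @ y
x0, lam_hat = initial_solution(y_hat, M, K)

lam_true = np.linalg.eigvalsh(A).max()    # exact largest eigenvalue, for comparison only
x_mmse = np.linalg.solve(A, y_hat)
err_zero = np.linalg.norm(x_mmse)         # distance of the zero vector from the MMSE solution
err_init = np.linalg.norm(x_mmse - x0)    # distance of the proposed start
print(f"lam_hat={lam_hat:.1f}  lam_true={lam_true:.1f}")
print(f"error (zero start)={err_zero:.3f}  error (proposed start)={err_init:.3f}")
```

Only M and K enter the computation of `lam_hat`, so the starting vector costs one scalar division per entry of the matched-filter output.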

3.2. Proposed Preconditioning Technique

Compared to direct approaches, iterative approaches often need fewer operations, especially when an approximate solution provides good accuracy. However, iterative techniques can converge slowly, and preconditioning is then necessary in order to achieve convergence within a few iterations. For challenging problems in scientific computation, it is generally known that preconditioning is the most important ingredient in the design of efficient solvers. Preconditioning techniques transform the system into another system (the preconditioned system) that has more favorable properties for the iterative solution. The rate of convergence of many iterative methods depends inversely on the condition number of the coefficient matrix. If the spectral condition number is large, the asymptotic approximation demonstrates that the convergence is slow. If the spectral condition number is of moderate size, a moderate convergence speed results. If, however, the spectral condition number is very close to 1, we have very fast convergence [39]. In general, preconditioning, when applied to an iterative method, improves the spectral properties of the coefficient matrix, i.e., reduces the condition number, thereby speeding up the convergence of the iterative method [40]. Reducing the condition number and, hence, increasing the convergence rate by applying a preconditioning technique has been shown to be computationally feasible. Since the preconditioner acts on the spectral radius of the iteration matrix, it is useful to choose an optimal preconditioner for a given linear system, that is, a preconditioner that is able to achieve the required convergence with fewer iterations.

If we split the matrix A = M − N with a nonsingular matrix M, then the basic iterative method based on (8) can be expressed as

M x^{(k + 1)} = N x^{(k)} + b,

(15)

We can also write (15) as follows:

x^{(k + 1)} = B x^{(k)} + c,

(16)

where

B = M^{- 1} N

and

c = M^{- 1} b

.

We assume, for simplicity, that the matrix A has unit diagonal entries, and let

A = I - L - U,

(17)

where −L and −U are the strictly lower and strictly upper triangular parts of A, respectively. Then, the iteration matrix of the classical GS scheme is given by

B = {(I - L)}^{- 1} U

.

In order to improve the convergence properties of the classical GS detection method, we transformed the original linear system (8) into the preconditioned linear system by multiplying both sides with a nonsingular matrix T:

T A x = T b,

(18)

Let

TA = M_{T} - N_{T}

be a regular splitting of TA; then, the basic iterative method based on (18) can be defined as

x^{(k + 1)} = B_{T} x^{(k)} + c_{T},

(19)

where

B_{T} = M_{T}^{- 1} N_{T}

and

c_{T} = M_{T}^{- 1} b_{T}

, with b_T = Tb. The spectral radius of the iteration matrix for the preconditioned system should be smaller than that for the original system. Thus, the matrix T should be constructed in such a way that it meets the above requirement and is easy to implement. We propose a simple and efficient preconditioning mechanism for the GS detection given as follows [41]:

T = I + R,

(20)

where the matrix R is defined by

R = [\begin{matrix} 0 & 0 & \dots & 0 & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ 0 & 0 & \dots & 0 & 0 \\ - a_{n 1} & - a_{n 2} & \dots & - a_{n, n - 1} & 0 \end{matrix}] .

Thus, we obtain

A_{T} x = (I + R) A x = (I + R) (I - L - U) x = (I - L - U + R - RL - RU) x,

(21)

whenever

\sum_{j = l + 1}^{i} a_{l j} a_{j l} \neq 1, l = 1, 2, \dots, i - 1,

(22)

{(I - L + R - RL - RU)}^{- 1}

exists, and for

A_{T}

, the GS iteration matrix

B_{T}

is defined by

B_{T} = {(I - L + R - RL - RU)}^{- 1} U .

(23)

For the preconditioned system, the nonsingular matrix

M = I - L + R - RL - RU

and the matrix N = U. Then, the estimation of the transmitted signals based on the proposed preconditioning can be expressed as

\begin{matrix} x_{i}^{(k)} = \frac{1}{A_{T i i}} ({\hat{y}}_{T i} - \sum_{j < i} A_{T i j} x_{j}^{(k)} - \sum_{j > i} A_{T i j} x_{j}^{(k - 1)}), \\ i = 1, 2, \dots, K, \quad k = 1, 2, \dots, N_{iter}, \end{matrix}

(24)

where

{\hat{y}}_{T} = (I + R) \hat{y}

. Algorithm 1 summarizes the proposed detector. The proposed detection algorithm converges much faster than conventional iterative methods, as verified through the numerical simulations in Section 4.

Algorithm 1: Proposed algorithm.

1 Input: H, y, M, K, N_iter, E_x, σ²

2 Preconditioning:

3 A = H^{H} H + \frac{σ^{2}}{E_{x}} I_{K}

4 \hat{y} = H^{H} y

5 D = diag(A)

6 s = D^{- 1} \hat{y}

7 R = - D^{- 1} A

8 R (1 : K - 1, :) = 0; R (K, K) = 0

9 I_{K} = K \times K identity matrix

10 T = I_{K} + R

11 {\hat{y}}_{T} = T s

12 A_{T} = T D^{- 1} A

13 Initialization:

14 {\hat{λ}}_{max} = M {(1 + \sqrt{\frac{K}{M}})}^{2}

15 x^{(0)} = \frac{1}{{\hat{λ}}_{max}} \hat{y}

16 Iteration:

17 for k = 1, \dots, N_{iter} do

18   for i = 1, \dots, K do

19     x_{i}^{(k)} = \frac{1}{A_{T i i}} ({\hat{y}}_{T i} - \sum_{j < i} A_{T i j} x_{j}^{(k)} - \sum_{j > i} A_{T i j} x_{j}^{(k - 1)})

20   end for

21 end for

22 Output: Detected signal \hat{x} = x^{(N_{iter})}
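The steps of Algorithm 1 can be sketched in NumPy as follows. This is one possible reading rather than the authors' implementation: the last row of R is taken with the negative sign used in the definition of R under Equation (20), the function name `preconditioned_gs_detect` is hypothetical, and the BPSK test data and dimensions are assumptions for illustration.

```python
import numpy as np

def preconditioned_gs_detect(H, y, sigma2, Ex, n_iter):
    """Preconditioned Gauss-Seidel detection, following the steps of Algorithm 1."""
    M, K = H.shape
    A = H.conj().T @ H + (sigma2 / Ex) * np.eye(K)  # MMSE filtering matrix
    y_hat = H.conj().T @ y                          # matched-filter output
    d = np.diag(A).real                             # diagonal of A (real for Hermitian A)
    s = y_hat / d                                   # D^{-1} y_hat
    A_n = A / d[:, None]                            # D^{-1} A, unit diagonal
    T = np.eye(K, dtype=complex)                    # T = I + R
    T[K - 1, :K - 1] = -A_n[K - 1, :K - 1]          # only the last row of R is non-zero
    y_T = T @ s
    A_T = T @ A_n
    lam_hat = M * (1.0 + np.sqrt(K / M)) ** 2       # eigenvalue estimate for the initializer
    x = y_hat / lam_hat                             # x^(0)
    for _ in range(n_iter):
        for i in range(K):
            acc = A_T[i, :i] @ x[:i] + A_T[i, i + 1:] @ x[i + 1:]
            x[i] = (y_T[i] - acc) / A_T[i, i]
    return x

# Illustrative setup (assumed values).
rng = np.random.default_rng(4)
M, K, sigma2, Ex = 128, 16, 0.1, 1.0
H = (rng.standard_normal((M, K)) + 1j * rng.standard_normal((M, K))) / np.sqrt(2)
x_true = rng.choice([-1.0, 1.0], size=K) + 0j
noise = np.sqrt(sigma2 / 2) * (rng.standard_normal(M) + 1j * rng.standard_normal(M))
y = H @ x_true + noise
x_mmse = np.linalg.solve(H.conj().T @ H + (sigma2 / Ex) * np.eye(K), H.conj().T @ y)
for n_iter in (0, 1, 2, 4):
    x_k = preconditioned_gs_detect(H, y, sigma2, Ex, n_iter)
    print(n_iter, np.linalg.norm(x_k - x_mmse) / np.linalg.norm(x_mmse))
```

Because T and D are nonsingular, the preconditioned system has the same solution as the original MMSE system, so the printed relative distance to the exact MMSE estimate is a direct measure of convergence.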

4. Numerical Results

In this section, we evaluate the performance in terms of the symbol error rate (SER) of different detection algorithms in an uplink massive MIMO wireless communication system. To verify the validity of the proposed linear detection algorithm, we compared its performance with conventional GS [21] and preconditioned GS (CP-GS) [24] detectors. In addition, the proposed algorithm was also compared with state-of-the-art iterative detection methods such as Jacobi [25], Neumann series [17], second-order Richardson method (SORM) [27], and AOR [28]. The linear MMSE exact matrix inversion method was included as a benchmark. We assumed perfect knowledge of the channel state information is available at the receiver, and the channel matrices were generated using i.i.d. flat Rayleigh fading channel model. For a fair comparison, different system configurations employing higher-order modulation techniques such as 16-QAM and 64-QAM were considered. Table 1 summarizes the simulation model parameters.

4.1. Comparison of Different Initializers

In this subsection, the simulation results of the proposed algorithm employing the zero vector initial solution, diagonal initial solution, and proposed initial solution are provided. A massive MIMO system with 128 antennas at the BS and 16 users was considered. With the 64-QAM modulation technique, Figure 2 demonstrates that the proposed algorithm shows the worst performance with zero vector initialization. Moreover, it is observed that the proposed initialization reveals degraded detection results compared to the diagonal initial solution for k = 1 and k = 2. However, the developed algorithm with the proposed initializer shows a similar performance as that of the diagonal initialization for k = 3. Note that the diagonal initial solution requires relatively more computations. Hence, the proposed initial method is the best choice, which achieves the detection accuracy of the diagonal initial solution with reduced computations.

4.2. Error Rate Performance

We first demonstrate the error performance of various detection techniques for a system with 256 antennas at the BS and 32 users in Figure 3. The 64-QAM modulation scheme was applied for this simulation. It can be easily observed from the plot that the performance of all iterative algorithms improved as the number of iterations increased. The CP-GS exhibited degraded performance compared to the conventional GS-based detector. Note, moreover, that the designed detection algorithm is superior to the aforementioned iterative schemes in terms of the error rate in the considered massive MIMO scenario. Furthermore, it achieved almost identical accuracy to that of the linear MMSE with k = 4.

In Figure 4, we increased the number of users and kept the same number of antennas at the BS as in Figure 3, to study the error rate performance of the proposed algorithm and the existing GS and CP-GS detection methods. The considered antenna configuration was M × K = 256 × 64 with the 16-QAM modulation technique. It can be clearly seen from the figure that all methods performed well for the given system settings. The GS performed better than CP-GS, and the main reason for the relatively better performance of GS is the diagonal initial solution. However, the plot clearly shows that the proposed approach achieved significantly better performance than the aforementioned detectors. For the proposed algorithm, the SNR required to obtain an SER of 10⁻⁴ was 17.2 dB, whereas for the benchmark method, it was 17.03 dB. Thus, the performance difference between them was only 0.17 dB.

Next, we compared the proposed algorithm with state-of-the-art iterative detection methods to further validate the superiority of the designed approach. For an M × K = 128 × 16 massive MIMO system, Figure 5 reveals that the Jacobi- and Neumann-series-based detectors showed degraded performance. In contrast to Jacobi and Neumann, the AOR exhibited better performance. Note further that the detection performance of the SORM method improved with the number of iterations and achieved good results compared to Jacobi, Neumann, and AOR for the given iterations. However, the proposed algorithm achieved a lower error than all aforementioned iterative methods for the same iteration count. Moreover, it realized the MMSE performance with only four iterations.

In Figure 6, we study the SER performance of the proposed scheme as a function of the number of user terminals and compare it with recently reported GS- and CP-GS-based detection algorithms. In this case, 128 antennas were considered at the BS and the 16-QAM modulation technique was employed. One can see that the performance of the proposed algorithm, GS, and CP-GS was almost the same as that of the linear MMSE as the number of user terminals grew up to 25. However, for more than 25 user terminals, there existed a gap between the iterative detectors and the linear MMSE. The CP-GS exhibited degraded results compared to GS and the proposed algorithm. Further, it can be observed that the proposed detector converged faster than the conventional GS detector.

To study the numerical stability of the proposed technique, we provide results of the error performance as a function of the number of iterations. Figure 7 demonstrates the results for M = 128 antennas at the BS with 16 user terminals utilizing the 64-QAM modulation scheme. For various SNR values, the figure shows that, after a few iterations, the performance became stable. We can observe that fewer iterations are required to attain stability at smaller SNR values. Thus, the proposed method is numerically stable and a few iterations are sufficient to achieve the desired performance.

Figure 8 shows the simulation results for the SER performance against the number of iterations. In this case, we compared the proposed technique with other state-of-the-art techniques for a system with 128 antennas at the BS and 32 users employing 16-QAM modulation with the SNR set to 16 dB. It can be noted that the Jacobi and Neumann methods achieved a high error floor. The main reason that the Jacobi method attained a high error floor is the damping factor, which is only applicable for a certain antenna scenario. Further observe that the AOR also showed slow convergence, and its error performance was degraded in this case, which is due to its sensitivity to acceleration and relaxation parameters. The convergence rate of the GS-based detectors was better than all other iterative detectors. It can be observed that the proposed algorithm and the conventional GS obtained almost the same performance for higher iterations, and the proposed algorithm had a better SER than GS up to three iterations. However, in all above-provided performance results, the proposed method exhibited the fastest convergence compared to all mentioned iterative methods and achieved desired results only within a few iterations. Thus, it can be concluded that the proposed detector is superior to all compared iterative detectors in terms of convergence and error performance.

4.3. Complexity Analysis and Comparison

The computational complexity of estimating the signal, measured by the required number of multiplications, is analyzed in this section. We first computed the complexity involved in each step of the proposed detection approach, then compared it with the conventional GS-based detectors. Since $A$ and $\hat{y}$ must be computed by all methods, we calculated only the complexity of the subsequent steps. The complexity of iterative methods mainly depends on the number of iterations. One can see from (24) that $K - 1$ multiplications were required to obtain $x_{i}^{(k)}$ for each $i$ and $k$. Since there were $K$ elements in the vector $x^{(k)}$, the GS iteration required $k (K^{2} - K)$ complex multiplications overall. The computations of the initializer originate from solving (13). It can be easily found that it involves $K + 3$ multiplications to obtain $x^{(0)}$. Finally, we calculated the complexity involved in the preconditioning step. For this step, we first needed to compute $x$ and $R$, that is, Lines 6 and 7 in Algorithm 1. It can be easily observed that $K^{2} + K$ multiplications are needed to obtain $R$. Similarly, $K$ multiplications are required to obtain the vector $x$. Next, it is required to compute ${\hat{y}}_{T}$ and $A_{T}$, that is, Steps 10 and 11 in Algorithm 1. Computing ${\hat{y}}_{T}$ involves the multiplication of $T$ and the vector $x$. Since we used only $K - 1$ elements of the last row of matrix $T$ in the proposed preconditioning, it requires $K - 1$ multiplications to obtain ${\hat{y}}_{T}$. The computation of $A_{T}$ involves the multiplication of matrix $T$ and matrix $R$. Although this is a matrix-matrix multiplication, only $K - 1$ elements of $T$ were used, as mentioned above. Therefore, its computational complexity is $K^{2} - K$. Thus, the total complexity of the proposed algorithm for $k$ iterations is $k (K^{2} - K) + 2 (K^{2} + K) - 1$. Note that the proposed detector has one-order-less computational complexity than the linear MMSE. Next, we compared the complexity of the proposed method with recently reported iterative methods.
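The counts above can be checked with a minimal Python sketch. It is only an illustration under stated assumptions: the function names `gauss_seidel` and `preconditioning_cost` are hypothetical, the test system is a small real-valued matrix with made-up entries (the paper counts complex multiplications), and the division by the diagonal entry is assumed not to be counted as a multiplication. The sketch counts one multiplication per off-diagonal product in a plain Gauss–Seidel sweep and sums the four preconditioning terms quoted in the text. Those four terms alone add up to 2(K² + K) − 1, so the K + 3 initializer multiplications appear to be left out of the stated total.

```python
def gauss_seidel(A, y, x0, iterations):
    """Run Gauss-Seidel sweeps on A x = y; return (x, multiplication count)."""
    K = len(y)
    x = list(x0)
    mults = 0
    for _ in range(iterations):
        for i in range(K):
            s = y[i]
            for j in range(K):
                if j != i:
                    s -= A[i][j] * x[j]  # uses already-updated x[j] for j < i
                    mults += 1
            x[i] = s / A[i][i]  # assumption: the diagonal division is not counted
    return x, mults


def preconditioning_cost(K):
    """Sum the four preconditioning terms quoted in the text."""
    cost_R = K * K + K      # obtaining R
    cost_x = K              # obtaining the vector x
    cost_y_T = K - 1        # obtaining y_T
    cost_A_T = K * K - K    # obtaining A_T
    return cost_R + cost_x + cost_y_T + cost_A_T


# Illustrative 3 x 3 diagonally dominant system (hypothetical values).
A = [[4.0, 1.0, 0.5],
     [1.0, 5.0, 1.0],
     [0.5, 1.0, 6.0]]
y = [1.0, 2.0, 3.0]
K, k = len(y), 3

x, mults = gauss_seidel(A, y, [0.0] * K, k)
print("GS multiplications:", mults, "formula:", k * (K * K - K))
print("preconditioning:", preconditioning_cost(K), "formula:", 2 * (K * K + K) - 1)
```

For this toy system with K = 3 and k = 3, the counted GS multiplications and k(K² − K) both equal 18, and the preconditioning sum and 2(K² + K) − 1 both equal 23.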

First, the complexity of the proposed algorithm and the GS-based detection schemes is compared in Figure 9. The complexity of the benchmark linear MMSE is also included. The figure shows that the linear MMSE has the highest complexity (cubic in the number of users) and the conventional GS detector has the lowest complexity among all detectors. The proposed algorithm and CP-GS exhibit similar complexity, as can be seen from the plot. However, the convergence rate of the proposed algorithm is much faster than that of the GS- and CP-GS-based detectors, as has already been demonstrated. Thus, the proposed algorithm requires fewer iterations to achieve the MMSE performance compared to the other methods, which ultimately reduces the number of computations of the proposed detector. In addition, GS uses the diagonal-matrix-based initial solution, which has higher complexity than the proposed approximate-eigenvalue-based initial solution.

In addition, the computational complexity of state-of-the-art detectors is provided in Table 2. The complexity of the Neumann series is included for k ≥ 3, since it usually needs more than three iterations to achieve the desired performance. It can be seen that it has cubic complexity for k ≥ 3, which is very high for massive MIMO systems. The complexity of the proposed algorithm is slightly higher than that of other state-of-the-art detectors such as Jacobi, SORM, and AOR. However, the proposed detector performs much better than these methods, as was shown in the previous subsections. Furthermore, it obtains the desired detection results with fewer iterations, whereas the aforementioned detection schemes need more iterations to obtain the same performance, which further reduces the computations of the proposed algorithm. In summary, the proposed detector obtains the best trade-off between computational complexity and error rate performance among the discussed iterative detection schemes.
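To make the comparison concrete, the expressions in Table 2 can be evaluated for a given number of users K and iteration count k. The following is a minimal sketch that transcribes the table entries as written. The function name `complexity` and the choice K = 32, k = 3 are illustrative assumptions, not settings prescribed by the paper.

```python
def complexity(K, k):
    """Multiplication counts from Table 2 for K users and k iterations."""
    counts = {
        "GS": (k + 1) * K**2 + 4 * K,
        "CP-GS": (k + 2) * K**2 + (2 - k) * K - 1,
        "Jacobi": k * (2 * K**2 - K) + 2 * K,
        "SORM": k * (K**2 + 2 * K + 7),
        # k(3K^2 + 7K) is always even, so integer division is exact here.
        "AOR": k * (3 * K**2 + 7 * K) // 2,
        "Proposed": k * (K**2 - K) + 2 * (K**2 + K) - 1,
    }
    if k >= 3:
        # Table 2 lists the Neumann series expression only for k >= 3.
        counts["Neumann Series"] = (k - 2) * K**3 + 3 * K**2
    return counts


# Illustrative evaluation point (assumed values, not taken from the paper).
for name, value in complexity(32, 3).items():
    print(f"{name:15s} {value}")
```

Expanding the CP-GS and Proposed entries shows that they are the same polynomial in K and k, which matches the observation that the two detectors exhibit similar complexity in Figure 9.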

5. Conclusions

We considered an approximated linear MMSE detection for massive MIMO uplink systems. We presented a low-complexity matrix-inverse-free signal detection algorithm based on the GS method. By taking full advantage of the special property of massive MIMO systems, a novel initialization method was proposed and then approximated. It was shown that the new initializer achieves the same detection results as the conventional diagonal initial solution with a decreased computational burden. Moreover, it significantly outperforms the traditional zero-vector initializer. To approach the MMSE accuracy with a small number of iterations, an effective preconditioning method was designed. The complexity–performance tradeoff of the proposed detector was also illustrated and compared with the recently reported detection schemes. The analysis and simulation results demonstrated that the proposed method, although having a much lower computational complexity, can achieve SER performance similar to that of the linear MMSE and obtains better performance than existing iterative detectors.

Author Contributions

Methodology, M.A.; Software, M.A. and I.A.K.; Validation, X.Z.; Formal analysis, M.A. and X.S.; Investigation, M.A.; Resources, X.Z., X.S. and Y.Q.; Writing—original draft, M.A.; Writing—review & editing, I.A.K.; Visualization, Y.Q.; Supervision, X.Z.; Funding acquisition, X.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the National Natural Science Foundation of China (62101250), the Natural Science Foundation of Jiangsu Province (BK20210281), the Jiangsu Key Research and Development Project (BE2020101), the National Key Research and Development Project Grant (2020YFB1807602), the National Science Foundation of China (61971217, 61971218, 61631020, 61601167), the Jiangsu NSF Grant (BK20200444), the Jiangsu Planned Projects for Postdoctoral Research Funds (2020Z013), the Postgraduate Research & Practice Innovation Program of Jiangsu Province (KYCX21_0215), and the China Postdoctoral Science Foundation (2020M681585).

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Qi, Y.; Hunukumbure, M.; Nekovee, M.; Lorca, J.V.; Sgardoni, V. Quantifying data rate and bandwidth requirements for immersive 5G experience. In Proceedings of the 2016 IEEE International Conference on Communications Workshops (ICC), Kuala Lumpur, Malaysia, 23–27 May 2016; pp. 455–461.
2. Narayanan, S.; Tsolkas, D.; Passas, N.; Merakos, L. Nb-iot: A Candidate Technology for Massive IOT in the 5G Era. In Proceedings of the 2018 IEEE 23rd International Workshop on Computer Aided Modeling and Design of Communication Links and Networks (CAMAD), Barcelona, Spain, 17–19 September 2018; pp. 1–6.
3. Kalalas, C.; Alonso-Zarate, J. Massive Connectivity in 5G and Beyond: Technical Enablers for the Energy and Automotive Verticals. In Proceedings of the 2020 2nd 6G Wireless Summit, 6G SUMMIT, Levi, Finland, 17–20 March 2020; pp. 1–5.
4. Rusek, F.; Persson, D.; Lau, B.K.; Larsson, E.G.; Marzetta, T.L.; Edfors, O.; Tufvesson, F. Scaling up MIMO: Opportunities and Challenges with Very Large Arrays. IEEE Signal Proces. Mag. 2013, 30, 40–60.
5. Lu, L.; Li, G.Y.; Swindlehurst, A.L.; Ashikhmin, A.; Zhang, R. An Overview of Massive MIMO: Benefits and Challenges. IEEE J. Sel. Top. Signal Proc. 2014, 8, 742–748.
6. Larsson, E.G.; Edfors, O.; Tufvesson, F.; Marzetta, T.L. Massive MIMO for Next Generation Wireless Systems. IEEE Commun. Mag. 2014, 52, 186–195.
7. Feng, H.; Zhao, X.; Li, Z.; Xing, S. A Novel Iterative Discrete Estimation Algorithm for Low-Complexity Signal Detection in Uplink Massive MIMO Systems. Electronics 2019, 8, 980.
8. Marzetta, T.L. Noncooperative Cellular Wireless with Unlimited Numbers of Base Station Antennas. IEEE Trans. Wirel. Commun. 2010, 9, 3590–3600.
9. Ngo, H.Q.; Larsson, E.G.; Marzetta, T.L. Energy and Spectral Efficiency of Very Large Multiuser MIMO Systems. IEEE Trans. Commun. 2013, 61, 1436–1449.
10. Hoydis, J.; Ten Brink, S.; Debbah, M. Massive MIMO in the UL/DL of Cellular Networks: How Many Antennas Do We Need? IEEE J. Sel. Areas Commun. 2013, 31, 160–171.
11. Zhang, X.; Xu, L.; Xu, L.; Xu, D. Direction of departure (DOD) and direction of arrival (DOA) estimation in MIMO radar with reduced-dimension MUSIC. IEEE Commun. Lett. 2010, 14, 1161–1163.
12. Zhang, X.; Xu, D. Angle estimation in bistatic MIMO radar using improved reduced dimension Capon algorithm. J. Syst. Eng. Elec. 2013, 24, 84–89.
13. Damen, M.O.; HE, G.; Caire, G. On Maximum-Likelihood Detection and the Search for the Closest Lattice Point. IEEE Trans. Inf. Theory 2003, 49, 2389–2402.
14. Shahabuddin, S.; Silven, O.; Juntti, M. Programmable asips for multimode mimo transceiver. J. Signal Process. Syst. 2018, 90, 1369–1381.
15. Albreem, M.A.; Ismail, N.A.H.B. A review: Detection techniques for LTE system. Telecommun. Syst. 2016, 63, 153–168.
16. Altamirano, C.D.; Minango, J.; De Almeida, C.; Orozco, N. On the Asymptotic BER of MMSE Detector in Massive MIMO Systems. In Proceedings of the International Conference on Applied Technologies, Quito, Ecuador, 3–5 December 2019; pp. 57–68.
17. Wu, M.; Yin, B.; Wang, G.; Dick, C.; Cavallaro, J.R.; Studer, C. Large-Scale MIMO Detection for 3GPP LTE: Algorithms and FPGA Implementations. IEEE J. Sel. Top. Signal Process. 2014, 8, 916–929.
18. Zhang, X.; Zeng, H.; Ji, B.; Zhang, G. Low-Complexity Implicit Detection for Massive MIMO Using Neumann Series. IEEE Trans. Veh. Technol. 2022, 71, 9044–9049.
19. Albreem, M.A. Approximate Matrix Inversion Methods for Massive MIMO Detectors. In Proceedings of the 2019 IEEE 23rd International Symposium on Consumer Technologies (ISCT), Selangor, Malaysia, 2–4 December 2019; pp. 87–92.
20. Tang, C.; Liu, C.; Yuan, L.; Xing, Z. High precision low complexity matrix inversion based on Newton iteration for data detection in the massive MIMO. IEEE Commun. Lett. 2016, 20, 490–493.
21. Dai, L.; Gao, X.; Su, X.; Han, S.; I, C.L.; Wang, Z. Low-Complexity Soft-Output Signal Detection Based on Gauss–Seidel Method for Uplink Multiuser Large-Scale MIMO Systems. IEEE Trans. Veh. Technol. 2015, 64, 4839–4845.
22. Khoso, I.A.; Zhang, X.; Shaikh, A.H.; Sahito, F.; Dayo, Z.A. Improved Gauss–Seidel detector for large-scale MIMO systems. IET Commun. 2022, 16, 291–302.
23. Zhang, C.; Wu, Z.; Studer, C.; Zhang, Z.; You, X. Efficient Soft-Output Gauss–Seidel Data Detector for Massive MIMO Systems. IEEE Trans. Circuits Syst. I Regul. Pap. 2021, 68, 5049–5060.
24. Wu, Z.; Ge, L.; You, X.; Zhang, C. Efficient near-MMSE detector for large-scale MIMO systems. In Proceedings of the IEEE Workshop on Signal Processing Systems, Lorient, France, 3–5 October 2017; pp. 1–6.
25. Minango, J.; de Almeida, C.; Daniel Altamirano, C. Low-complexity MMSE detector for massive MIMO systems based on Damped Jacobi method. In Proceedings of the 2017 IEEE 28th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC), Montreal, QC, Canada, 8–13 October 2017; pp. 1–5.
26. Khoso, I.A.; Zhang, X.; Dai, X.; Ahmed, A.; Dayo, Z.A. Joint steepest descent and non-stationary Richardson method for low-complexity detection in massive MIMO systems. Trans. Emerg. Telecommun. Technol. 2022, 33, e4566.
27. Khoso, I.A.; Zhang, X.; Shaikh, A.H. Low-complexity signal detection for large-scale MIMO systems with second-order Richardson method. Electron. Lett. 2020, 56, 467–469.
28. Zhang, Z.; Dai, X.; Dong, Y.; Wang, X.; Liu, T. A low-complexity signal detection utilizing AOR method for massive MIMO systems. China Commun. 2017, 41, 269–278.
29. Tu, Y.-P.; Chen, C.-Y.; Lin, K.-H. An Efficient Two-Stage Receiver Base on AOR Iterative Algorithm and Chebyshev Acceleration for Uplink Multiuser Massive-MIMO OFDM Systems. Electronics 2022, 11, 92.
30. Chataut, R.; Akl, R.; Dey, U.K.; Robaei, M. SSOR Preconditioned Gauss–Seidel Detection and Its Hardware Architecture for 5G and beyond Massive MIMO Networks. Electronics 2021, 10, 578.
31. Jin, J.; Xue, Y.; Ueng, Y.-L.; You, X.; Zhang, C. A split pre-conditioned conjugate gradient method for massive mimo detection. In Proceedings of the 2017 IEEE International Workshop on Signal Processing Systems (SiPS), Lorient, France, 3–5 October 2017; pp. 1–6.
32. Jing, X.; Li, A.; Liu, H. A low-complexity Lanczos-algorithm-based detector with soft-output for multiuser massive MIMO systems. Digit. Signal Process. 2017, 69, 41–49.
33. Seidel, P.; Gregorek, D.; Paul, S.; Rust, J. Efficient initialization of iterative linear massive MIMO uplink detectors by binary Jacobi synthesis. In Proceedings of the WSA 2019, 23rd International ITG Workshop on Smart Antennas, Vienna, Austria, 24–26 April 2019; pp. 1–5.
34. Gesbert, D.; Shafi, M.; Shiu, D.; Smith, P.J.; Naguib, A. From Theory to Practice: An Overview of MIMO Space-Time Coded Wireless Systems. IEEE J. Sel. Areas Commun. 2003, 21, 281–302.
35. Zhang, M.; Kim, S. Evaluation of MMSE-based iterative soft detection schemes for coded massive MIMO system. IEEE Access 2018, 7, 10166–10175.
36. Narasimhan, T.L.; Chockalingam, A. Channel hardening-exploiting message passing (CHEMP) receiver in large-scale MIMO systems. IEEE J. Sel. Top. Signal Process. 2014, 8, 847–860.
37. Marchenko, V.A.; Pastur, L.A. Distribution of eigenvalues for some sets of random matrices. Math. USSR Sbornik 1967, 1, 457–483.
38. Zhang, Q.; Jin, S.; Wong, K.K.; Zhu, H.; Matthaiou, M. Power scaling of uplink massive MIMO systems with arbitrary-rank channel means. IEEE J. Sel. Topics Signal Process. 2014, 8, 966–981.
39. Hackbusch, W. Iterative Solution of Large Sparse Systems of Equations, 2nd ed.; Springer: Geneva, Switzerland, 2016.
40. Evans, D.J. The use of pre-conditioning in iterative methods for solving linear equations with symmetric positive definite matrices. IMA J. Appl. Math. 1967, 4, 295–314.
41. Niki, H.; Harada, K.; Morimoto, M.; Sakakihara, M. The survey of preconditioners used for accelerating the rate of convergence in the Gauss–Seidel method. J. Comput. Appl. Math. 2004, 164, 587–600.

Figure 1. Uplink massive MIMO system with a BS employing M antennas and simultaneously serving K users.

Figure 2. Performance of the proposed algorithm for an M × K = 128 × 16 system with the 64-QAM modulation scheme under different initialization methods.

Figure 3. SER performance versus SNR of the proposed algorithm and the GS- and CP-GS-based algorithms for a 256 × 32 antenna system employing the 64-QAM modulation scheme.

Figure 4. SER as a function of SNR for a system deploying M = 256 antennas at the BS and 64 users with the 16-QAM modulation scheme.

Figure 5. SER performance comparison of the proposed algorithm and other recently proposed methods for an M × K = 128 × 16 antenna system employing 64-QAM modulation.

Figure 6. SER as a function of the number of users for a system deploying M = 128 antennas at the BS with the 16-QAM modulation technique.

Figure 7. SER versus the number of iterations of the proposed detector at different values of SNR for an M × K = 128 × 16 massive MIMO system employing the 64-QAM modulation scheme.

Figure 8. SER performance as a function of the number of iterations for an M × K = 128 × 32 massive MIMO system at 16 dB SNR using 16-QAM modulation.

Figure 9. Complexity comparison of various detection algorithms against the number of users.

Table 1. Summary of the simulated model parameters.

Parameter	Value
Number of antennas at BS	M ∈ {128, 256}
Number of users	K ∈ {16, 32, 64, 5:100}
User antennas	Single-antenna users
SNR range	SNR ∈ {8:2:18} dB
Average SNR per receive antenna	KE_x/N₀
Number of realizations in the Monte Carlo simulations	25 × 10³
Number of iterations for iterative detectors	1 to 6
Channel	MIMO
Channel model	Uncorrelated Rayleigh fading
Channel availability	Perfectly known at the receiver
Modulation type	16-QAM, 64-QAM
Transmission	Uncoded

Table 2. Computational complexity.

Detector	Complexity
GS [21]	(k + 1)K² + 4K
CP-GS [24]	(k + 2)K² + (2 − k)K − 1
Jacobi [25]	k(2K² − K) + 2K
Neumann Series [17]	(k − 2)K³ + 3K² (k ≥ 3)
SORM [27]	k(K² + 2K + 7)
AOR [28]	k(3K² + 7K)/2
Proposed	k(K² − K) + 2(K² + K) − 1

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
