1. Introduction
Multiple input multiple output (
MIMO) systems have received significant attention, owing to the rapid development of high-speed broadband wireless communication systems employing multiple transmit and receive antennas [
1,
2,
3,
4,
5,
6,
7,
8,
9,
10,
11,
12]. MIMO systems have many advantages such as spatial diversity and high throughput without increasing bandwidth. Thus, MIMO systems are employed in modern and next generation wireless standards [
1].
Meanwhile, there are many challenging issues to overcome from disadvantages owing to increasing of the number of antennas. For example, multi-user MIMO systems have been studied in [
2,
3,
4,
5,
6,
7,
8]. In [
2] and [
3], the authors tried to maximize signal to interference plus noise ratio by employing the precoding schemes to minimize interference among multiple users. In [
4], an adaptive antennas algorithm at the mobile and base stations has been proposed. In [
6], they proposed a near optimal scheduling algorithm by managing radio resource for multi-user MIMO systems.
Meanwhile, studies to solve the problems of mutual coupling between multiple antennas are receiving much attention [
9,
10,
11,
12]. In [
9], the authors designed a compact dual band-notched ultra wideband MIMO antenna in order to achieve high isolation. In [
10], the authors presented a slot structure perpendicular to current on the surface of patches to overcome the mutual coupling problems. In [
11], the authors tried to reduce the mutual coupling between closely packed antenna elements using defected ground structure.
As one of the most promising among the challenging issues for MIMO systems, the complexity reduction of MIMO detection algorithms has recently been studied [
13,
14,
15,
16,
17,
18,
19,
20,
21,
22]. The optimal detector for MIMO system is the maximum likelihood detection (MLD) algorithm. However, MLD requires exponential complexity regarding the number of transmit antennas [
1]. Recently, several algorithms achieving near
-MLD performance have been proposed for MIMO systems [
16,
17,
18,
19,
20,
21]. The tree search based QR decomposition (QRD)
-M algorithm and sphere decoding (SD) achieve near
-MLD performance, while requiring substantially low complexity in comparison with MLD.
The fixed QRD
-M algorithm compromises the complexity and performance by selecting only
M survivor paths among all possible paths at each layer of the tree search [
16,
17]. The metric computational complexity is determined by the product of
M and the number of the candidate symbols in the next detection layer,
S, which has been conventionally set to be the number of all constellation points
C. To achieve near
-MLD performance for this algorithm, the value of
M and
S should be large enough for the selected paths to include the correct path. This still requires high computational complexity. Recently, in [
18], a low complexity QRD
-M algorithm was proposed by reducing the redundant metric computation complexity with identical performance at the first and the last detection layer. However, this algorithm still has inefficient computation complexity because
M and
S are fixed regardless of the instantaneous channel condition. If
M or
S is adaptively controlled according to the instantaneous channel condition, the computational complexity should be effectively decreased compared with the conventional fixed QRD
-M algorithm.
Recently, adaptive QRD
-M algorithms based on average noise power have been proposed [
19,
20,
21]. They can be classified as two kinds of adaptation algorithms. One is the adaptation of
M and the other one is the adaptation of
S. The conventional adaptive QRD
-M algorithms control the number of survivor paths,
M, according to average noise power [
19,
20]. On the other hand, the conventional adaptive search area QRD
-M algorithm adaptively changes
S according to SNR [
21]. We will call the QRD
-M algorithm with the adaptation of
S the adaptive search area QRD
-M algorithm in other words. The simulation results reveal that adaptive search area QRD
-M algorithm enables us to more drastically reduce the computational complexity than the conventional adaptive QRD
-M algorithm. This implies that reducing the search area (candidate path space size) is safer than reducing the number of survivor paths in order to reduce the computation complexity without losing the correct path. However, the adaptive search area QRD
-M algorithm in [
21] still has a problem. This algorithm cannot adaptively change the search area,
S, against the instantaneous variation of the channel condition, because this algorithm controls
S based on the average noise power.
In this paper, we propose an adaptive search area QRD-M algorithm that overcomes the complexity reduction limit of the conventional adaptive search area QRD-M algorithm. The proposed algorithm accomplishes dynamic adaptation of S by observing the diagonal terms of the channel upper triangle matrix after QR decomposition and Euclidean distance between the received signal vector and temporarily estimated symbol vector by QRD. These two observations are found to form a good indicator of the instantaneous channel condition because they are the simple functions of the instantaneous noise sample power as well as the instantaneous channel fading magnitudes and correlation. First, we propose a simple, but efficient instantaneous channel condition indicator formula using these two observations, and then we show that, based on the proposed channel indicator, a very tight adaptive setting of search area is possible without performance loss. In addition, it is found that the performance of the QRD-M algorithm degrades when the diagonal terms of the channel upper triangle matrix instantaneously decrease. To overcome this problem, the proposed algorithm employs ratio of each diagonal term and total diagonal terms. Moreover, the proposed algorithm further decreases redundant complexity by considering the location of initial detection symbol in constellation. By doing so, the proposed algorithm effectively achieves performance near to the maximum likelihood detection algorithm, while maintaining the overall average computation complexity much lower than that of the conventional QRD-M systems. Especially, the proposed algorithm achieves reduction of 76% and 26% computational complexity with low signal to noise ratio (SNR) and high SNR, respectively, compared with the adaptive QRD-M algorithm based on noise power. Moreover, the simulation results show that the proposed algorithm achieves both low complexity and lower symbol error rate compared with the fixed QRD-M algorithms. As a result, the proposed algorithm efficiently decreases the computation complexity without performance degradation compared with the conventional adaptive search area QRD-M algorithm.
The remainder of the paper is organized as follows.
Section 2 describes the system model considered in this paper, and
Section 3 and
Section 4 explain the conventional QRD
-M algorithm and the adaptive search area QRD
-M algorithm. The proposed low complexity QRD
-M algorithm is introduced in
Section 5. In
Section 6, the performance and complexity of the proposed algorithm are evaluated through computer simulations. Finally, we present the conclusion in
Section 7.
3. Conventional Fixed QRD-M Algorithm
The conventional fixed QRD
-M algorithm is based on the classical
M-algorithm [
16,
17]. At each detection layer, instead of deciding the transmitted symbol, QRD
-M algorithm keeps
M reliable survivor paths. First, performing QR-decomposition on
H, we obtain
where
is a
L × L unitary matrix and
is
N × N upper triangular matrix. Noting that,
, where
I is the identity matrix and
is the conjugate transpose of
. After pre
-Multiplying received signal by
, we can rewrite (1) as follows:
where
is
and
is
. It follows from (4) that
where
is the
i-th element of vector
, and
rj,i is the element of the
j-th row and the
i-th column of
R and
rj,i = 0 when
j >
i.
From (5), the signal detection can be realized through a tree searching process from
sN to
s1, where
sN and
s1 are the last and the first element of
s, respectively. In the conventional QRD
-M algorithm, only
M-paths will survive in each detection layer according to the accumulated Euclidean distances on different paths. In the first detection layer (
N-th row in
R), path metrics
are calculated as follows:
where
is the
m-th candidate of
. The accumulated Euclidean distance of the
m-th path in the
i-th detection layer (= (
N− i + 1)th row in
R) is calculated as follows:
where
is the candidate of
on the
m-th path.
Figure 2 shows an example of the conventional QRD
-M algorithm (
M = 4) with
N = L = 4, and quadrature phase shift keying (QPSK). The number of layers is equal to
N, and
M nodes are maintained at each layer. The minimum path metric is chosen as an estimation symbol vector,
at the last layer.
At the final detection layer (i = N), the candidate vector that corresponds to the minimum path metric value is chosen as the detection signal set.
As the detection process spends its computation time mainly on metric computations, we take the number of metric computations as the complexity measure. If we denote
Tfixed as the number of metric computations of the conventional QRD
-M algorithm, it is calculated as follows:
Recently, a reduced complexity QRD
-M algorithm has been proposed in [
18]. They avoid the redundant computation at the first and the final layers with identical performance. The number of metric computations of this reduced algorithm,
Treduced is calculated as follows:
From (8) and (9), we note that the metric computational complexity of the conventional QRD-M algorithm is roughly determined by the product of M, C, and N, and this is much smaller than that of MLD, which is given as CN. However, if we want to make the performance of the conventional QRD-M algorithm approach that of MLD, we need a considerably large value of M for the worst case design.
4. SNR-Based Adaptive Search Area QRD-M Algorithm
The adaptive search area QRD
-M algorithm has been proposed in order to further reduce the complexity of the conventional fixed QRD
-M algorithms [
21]. In this algorithm, the number of extended nodes per survivor path
S is adaptively changed according to average noise power instead of all constellation points
C, at step 4 in the previous section. That is, the
S closest constellation points from the soft decision symbol by QRD are chosen at each layer. The soft decision by QRD at the
i-th detection layer is denoted by
and is calculated as follows:
Figure 3 shows an example of the closest order of the constellation points with 16 QAM plane from
. The numbers in parentheses mean the closest order from
.
The probability that the correct path’s metric is not included among the smallest
path metrics can be approximated by the probability of event
A that the correct hypothesis falls outside of the square box that centers at
, and encloses
constellation points. Assuming the constellation is sufficiently large such that the continuous approximation [
21] holds and the effects of points at the edge of the constellation are negligible, the side of such a box is
, where
is
S at the
i-th detection layer and
and
are the minimum distances of the transmitted and received constellations, respectively (
). Setting
to achieve the specified performance level, where
is pre-specified target level and using the assumption that each entry of
w is an i.i.d. Gaussian r.v. with zero mean and variance,
, we have
where
. With moderate to high SNRs, the second-order term of the
Q-function in (11) can be ignored. Furthermore, let
denote the largest allowable value of
that is set by the implementation constraints and the size of constellation, that is,
. It follows that the minimum integer value for
that satisfies (11) and the constraints in complexity and constellation size is found to be as follows:
where
is the smallest integer that is greater than or equal to
. Finally, to enable set partitioning,
is set to
where
is the operation that rounds up to the nearest integer that is a power of 2. The number of considered candidate symbols,
can be set to 1, 2, 4, 8, and 16 with 16 QAM.
By adaptively controlling the size of the search area based on SNR, the computational complexity of this algorithm can be significant decreased compared with the conventional fixed QRD-M algorithm, which is set to S = C as the search area. However, this algorithm cannot adaptively change against the instantaneous variation of the channel condition as this algorithm controls based on the average SNR does not reflect the instantaneous channel information, such as the instantaneous channel fading and instantaneous noise sample power. Therefore, we have to accept either the increasing complexity or performance degradation, which is the main shortcoming of this algorithm.
5. Proposed Adaptive Search Area QRD-M Algorithm
In this section, we illustrate the proposed adaptive search area QRD
-M algorithm, which controls
S using the instantaneous channel condition instead of average noise power of the conventional adaptive search area QRD
-M algorithm. The proposed algorithm is an advanced version of [
22]. Compared with [
22], the proposed algorithm employs additional channel indicator, that is, accumulated Euclidean distance at each layer in order to improve the performance and the average diagonal terms to reflect channel conditions of all the layers. The proposed algorithm uses diagonal elements of
R and Euclidean distance between the received symbol vector and the temporal QRD estimation symbol vector as a simple channel reliability indicator.
As one of channel indicators of the proposed scheme, we employ the ratio of power of diagonal term of channel matrix
R at the
i-th detection layer, as shown in [
17]. This indicator is denoted by
and is calculated as follows:
In the denominator in (14), the diagonal term is normalized by the average diagonal terms. This is because we intend to make this indicator dependent on the variable channel matrix not on the SNR in order to get rid of the signal scaling effect and measure the relative channel state with respect to all of the detection layers.
As another instantaneous channel indicator, we additively employ Euclidean distance between the temporarily estimated symbol by QRD
and the received signal
at
i-th detection layer. The accumulated Euclidean distance between
and
is denoted by
and is calculated as follows:
If
is the correct detection symbol, there remains only noise power in (15). This implies that Euclidean distance vector
reflects the instantaneous noise sample power. Let
be a instantaneous noise sample power indicator at
i-th detection layer, and we calculate it as follows:
If
decreases owing to the high SNR, then
becomes large, and vice versa. Finally, by multiplying two channel indicators
and
, the total channel condition indicator at
i-th detection layer
is calculated as follows:
As SNR increases, increases and decreases, and thus increases. According to calculated , the number of candidate symbols at each layer is adaptively controlled.
Meanwhile, the proposed algorithm employs four
-Mode adaptation according to
.
Figure 4 shows an example of four
-Mode adaptation of the proposed algorithm based on
with 16QAM at
i-th detection layer. In
Figure 4,
is the hard decision symbol by QRD and by comparing
to three predetermined thresholds
,
, and
.
Depending on which range belongs to, we determine how many neighboring symbols of we will include in the candidate symbol set. The proposed algorithm’s adaptation criterion is given as follows:
If , we decide that the channel condition is excellent, and thus the estimated symbol is very close to the correct symbol set mode I, where = 1.
If <<, we decide that the channel condition is good, and thus the estimated symbol is near to the correct symbol set mode II, where = 4.
If << , we decide the channel condition is normal, and thus we have to increase the range of search area set mode III, where = 9.
If < , we decide the channel condition is bad, and thus the estimated symbol might be far off from the correct symbol set mode IV, where = 16.
As the threshold values
,
, and
determine the computation complexity and detection performance, it is crucial to properly set their values so that the probability of the selected
neighboring candidate symbols including the correct symbol is high, but the margin should be tight enough to avoid extra computation. Let
be the path metric order of the correct path at
i-th detection layer. For example, for the case when the correct path’s metric is the second minimum at the third detection layer after sorting the metrics for all constellation symbols, we denote the metric order of the correct path as
= 2. This implies that the path metric order
can reflect the channel condition. In order to properly set the threshold values,
,
, and
on the channel condition,
Figure 5a,b illustrate the cumulative distribution function (CDF) of
at
i = 1 and 2 for the cases when
= 1, 2, 4, 5, 9, and 10, respectively, with
N = L = 4, 16 QAM, and
Eb/
N0 = 20 dB. We can observe that the distributions of
are clearly different according to the order of the correct path’s metric orders. The probabilities of
> 1.2 at
i = 1 and 2 are nearly zero when
. The probabilities of
> 0.4 at
i = 1 and 2 are nearly zero when
. This feature enables us to efficiently restrict the size of search area. If
> 1.2, it is highly probable that
, and thus we can safely set mode I, that is,
= 1. So, we take
= 1.2. If
> 0.4, the correct path’s metric is hardly larger than the fourth minimum, and thus we can safely set mode II, that is,
= 4. So, we set
= 0.4. In a similar way, we set the threshold value between mode III and IV,
= 0.1 because the correct path’s metric is hardly larger than the ninth minimum for
> 0.1. While it is clear that
is distributed in the small valued region for the case of not good channel condition, that is,
, the channel indicator,
, for the case of good channel condition, that is,
, is rather widely distributed, including a small valued region. So, their distributions are not as clearly disjointed as desired. This implies that we cannot help a tradeoff between complexity and performance in determining the thresholds,
,
, and
. As these thresholds get smaller, we can have more chance to have a small
, and thus we can reduce the complexity. At the same time, the possibility of bad channel condition becomes non-negligible, and thus we may lose the correct path if we take just a few search regions. The effort to minimize the computational complexity is meaningful only when the required symbol error rate (SER) performance is guaranteed. So, in our criterion, we first try to meet the SER performance, and then take the threshold as small as possible within the range where the SER performance is satisfied. By this criterion, predetermined values are set to
Thus, we adaptively set
at each detection layer and effectively decrease the metric computation complexity, while achieving almost the same performance as the conventional QRD
-M algorithm with large
M.
Figure 6 illustrates examples of the special cases of the proposed algorithm at Mode II and III when
is located at the edge in the constellation. From this figure, we can expect the proposed algorithm can additively reduce the computation complexity. In
Figure 6a,b, the number of neighboring points of
is just 1 or 2, instead of 4 in Mode II. Similarly, in
Figure 6c,d, the number of neighboring points of
is just 4 or 6, instead of 9 in Mode III.
Figure 7 illustrates an example of the proposed algorithm’s search tree hierarchy for the case when
= 9,
= 4, and
= 16 with
N = L = 4, and 16 QAM. At the first detection layer, the redundant complexity is reduced by employing the algorithm [
18], and thus
is set to 9. Then, at the second detection layer, based on (19),
is set to 4, and thus only four candidate symbols are considered. At the third detection layer, owing to the bad channel state,
is set to 16. Hence, all candidate symbols are considered at this layer. Finally, at the fourth detection layer, the nearest candidate symbol is chosen based on measurement reflecting good channel condition. The simulation results in the following section show that the average complexity of the proposed algorithm is much lower than that of conventional adaptive search area QRD
-M conditioned on the same error performance.
6. Simulation Results
This section shows the simulation results to check the performance gain of the proposed QRD
-M algorithm. We consider the cases of
N = L = 3 and
N = L = 4 with 16 QAM and the predetermined values are set to [
,
,
] = [0.1,0.4,1.2]. For performance comparison, we consider the conventional fixed QRD
-M algorithm with
M = 4 and 16, and the conventional adaptive search area QRD
-M algorithm with
M = 16 and MLD. For comparison with the conventional fixed QRD
-M algorithm, we select the reduced QRD
-M algorithm [
18] instead of the conventional fixed QRD
-M algorithm because the overall computation complexity is reduced with identical performance. In the sequel, we simply call the full constellation search area QRD
-M algorithm ‘reduced QRD
-M’. Commonly, in all of the algorithms we consider for comparison, we rearranged the detection order by the norms of the each row in the pseudo inverse of
H in order to minimize the error propagation effect as done in the ordered detection in successive interference cancellation [
23]. As the detection process spends its computation time mainly on metric computations, we take the number of metric computations as the complexity measure. At the last detection layer, we do not contain the metric computation complexity according to [
18] and [
21].
The SER performances with
N = L = 3 and
N = L = 4 are shown in
Figure 8 and
Figure 9, respectively. The proposed algorithm achieves near MLD performance and almost the same performance as the reduced QRD
-M algorithm [
18] and the conventional adaptive search area QRD
-M algorithm based on noise power. However, the conventional fixed QRD
-M algorithm with
M = 4 significantly deviates from MLD performance owing to the lack of candidate symbols.
Figure 10 shows the relative frequencies of the conventional adaptive search area QRD
-M algorithm and proposed algorithm with 16 QAM for
N =
L = 4 at 15 dB and 26 dB, respectively. In
Figure 10a, the frequencies of
are larger than 40% at the first and second detection layer and the frequencies of
are near to zero at all layers. In
Figure 10b, in the case of the proposed algorithm, the frequencies of
are less than 10% at the second and third detection layer, and the frequencies of
are larger than 30% at the first and the third detection layer. These results show that the indicator of the proposed algorithm reflects the instantaneous channel condition compared with [
21] in a low SNR environment. In
Figure 10c, the frequencies of
of the conventional search area QRD
-M algorithm are larger than 70% at all layers, and the frequencies of
are very low at all layers. From these results, the computation complexity of the conventional adaptive search area QRD
-M algorithm significantly decreases compared with the conventional fixed QRD
-M algorithm, which achieves the same performance. In
Figure 10d, the frequencies of
in the proposed algorithm are larger than 95% at the second and third detection layer. On the other hand, the frequencies of
= 4, 9, and 16 are nearly zero. From these results, we can expect that the proposed algorithm can significantly reduce the computation complexity compared with the conventional adaptive search area QRD
-M algorithm as well as the conventional fixed QRD
-M algorithm. This is because the frequencies of
and 16 are the most dominant factors to determine the computational complexity of the systems. These results show that the measurement of the proposed algorithm reflects the instantaneous channel condition compared with [
21] even in high SNR conditions.
Figure 11 and
Figure 12 show the metric computations’ complexity. We can observe that, as SNR increases, the computation complexities of the conventional adaptive search area QRD
-M algorithm and the proposed algorithm are significantly reduced compared with the reduced QRD
-M (
M = 4 and 16). In
Figure 11, as expected before, the proposed algorithm achieves more than about 93%, 96.7%, 98.2%, and 99% of the reduction of the metric computation compared with the reduced QRD
-M algorithm (
M = 16) at 15 dB, 20 dB, 23 dB, and 26 dB, respectively, with
N = L = 3. Even compared with the conventional adaptive search area QRD
-M algorithm, the proposed algorithm achieves more than about 85%, 72%, 63%, and 46% of the reduction of the metric computation.
Similarly, in
Figure 12, the metric computation complexity of the proposed algorithm is significantly decreased compared with the conventional QRD
-M algorithms in SNR regions lager than 12 dB with
N = L = 4. The proposed algorithm requires 13.87%, 4.99%, 2.41%, and 1.33% metric computations over the reduced QRD
-M (
M = 16), which achieve the same performance at 15 dB, 20 dB, 23 dB, and 26 dB, respectively. In high SNR (
Es/
N0 = 23 dB, 26 dB), the proposed algorithm achieves more than about 90% and 94.5% of the reduction of the metric computation, even compared with the reduced QRD
-M with
M = 4. In the same SER performance condition, the proposed algorithm achieves more than about 76%, 67%, 52%, and 26% of the reduction of the metric computations compared with the conventional adaptive search area QRD
-M algorithm at 15 dB, 20 dB, 23 dB, and 26 dB, respectively. This is because the channel indicator that is used in the proposed algorithm reflects instantaneous channel information about the instantaneous noise power as well as the instantaneous channel fading. On the other hand, noise power that is used as the adaptation indicator in the conventional adaptive search area QRD
-M algorithm does not contain the instantaneous channel information.