3.2. Mathematical Preliminaries
Considering an FD DF relay, the following rates are achievable [37], where, depending on how the IRS is applied to the system, the following three sets of rates are possible. First,
where
when the IRS is used to cancel the self-interference. Second,
where
and
if the IRS is deployed to assist the source–relay channel, and finally,
where
and
if the IRS is utilized to enhance the rate of the relay–destination channel. Notice that, assuming the RSI remains uncanceled, a transceiver that is robust against the worst-case RSI channel is required, which is formulated as the following optimization problem
in which the throughput rate with respect to the worst-case RSI channel is maximized. The two constraints
and
represent the transmit power budgets at the source and the relay, respectively. In constraint (13c),
represents the RSI or channel estimation error bound corresponding to
. Notice that
represents the sum of the squared singular values of
, i.e., its squared Frobenius norm. It should be noted that a bounded matrix norm is the most common way of modeling the uncertainty of a matrix [38,39]. In practice,
can be found using stochastic methods when the distribution of the channel error is known; otherwise, one may estimate it using a sample average approximation method. Finally, constraints (13d) are due to the unit-modulus limitation of the IRS elements.
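Where the error distribution is known, such a bound can be estimated numerically. The following minimal sketch assumes a complex Gaussian error model (the function name, dimensions, and error model are illustrative, not part of the system model above): it draws error samples, measures their squared Frobenius norms (the sum of squared singular values), and returns an empirical quantile as the bound.

```python
import numpy as np

def estimate_error_bound(n_rx, n_tx, error_std, n_samples=10_000, quantile=0.95):
    """Sample-average estimate of the RSI/estimation-error bound: draw error
    matrices, compute their squared Frobenius norms (the sum of squared
    singular values), and return an empirical quantile as the bound."""
    rng = np.random.default_rng(0)
    norms = np.empty(n_samples)
    for k in range(n_samples):
        # Complex Gaussian error model: an illustrative assumption.
        err = (rng.normal(scale=error_std, size=(n_rx, n_tx))
               + 1j * rng.normal(scale=error_std, size=(n_rx, n_tx)))
        norms[k] = np.linalg.norm(err, 'fro') ** 2
    return np.quantile(norms, quantile)

# Example: bound for a 4x4 RSI error with per-entry standard deviation 0.1.
print(f"estimated bound: {estimate_error_bound(4, 4, 0.1):.4f}")
```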
Problem (13) is non-convex and hard to solve. As a result, for each of the above-mentioned scenarios, we propose a simplified version of the optimization problem and solve that instead. Note that, since we are interested in the throughput corresponding to the worst-case RSI, any simplification of the optimization problem must be in favor of the RSI and the interference. First, we analyse the performance of the system when the IRS helps the relay cancel the RSI. Consequently, we show that problem (13) can be simplified to the following optimization problem
where
and where
denotes the vector of all non-zero elements of its input matrix. We can equivalently write
as
where ∗ denotes the column-wise Khatri–Rao product, defined below,
and where
is the i-th column of A, and ⊗ denotes the Kronecker product. See Appendix A for the proof.
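As a concrete illustration of this product, the sketch below builds the column-wise Khatri–Rao product from per-column Kronecker products and numerically checks the standard vectorization identity vec(A diag(x) B) = (B^T ∗ A) x, which is the kind of step that turns a diagonal phase matrix into a linear unknown; all matrices here are arbitrary test data.

```python
import numpy as np

def khatri_rao(A, B):
    """Column-wise Khatri-Rao product: the k-th column is kron(A[:, k], B[:, k])."""
    assert A.shape[1] == B.shape[1]
    return np.stack([np.kron(A[:, k], B[:, k]) for k in range(A.shape[1])], axis=1)

rng = np.random.default_rng(1)
m, n, p = 3, 4, 2
A = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))
B = rng.standard_normal((n, p)) + 1j * rng.standard_normal((n, p))
x = rng.standard_normal(n) + 1j * rng.standard_normal(n)

# vec() in column-major order; the identity vec(A diag(x) B) = (B^T * A) x.
lhs = (A @ np.diag(x) @ B).flatten(order='F')
rhs = khatri_rao(B.T, A) @ x
print(np.allclose(lhs, rhs))  # True
```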
One can show that
. As mentioned before, problem (14) is a simplification of problem (13). This means that every achievable rate inside the feasible set of (14) is also inside the feasible set of (13). (Notice that the reverse is not necessarily true; i.e., an achievable rate that is feasible for (13) is not necessarily feasible for (14). However, since we look for achievable rates, we can still use this method.) The reason is that in problem (13), the minimization over the RSI happens only once, and the worst-case RSI simultaneously tries to cancel the effect of the best IRS configuration and the best covariance matrices. In (14), the RSI first does its worst damage to the performance of the best IRS configuration, and then performs another optimization to impose the worst power allocation against the best covariance matrices. (This will become clearer later, when the geometrical representation of the problem is given.) In what follows, we provide our proposed ways to deal with the optimization problems (14) and (15), respectively.
Theorem 1. For the optimization problem (14), one can show that .
Proof. We begin the proof with an intuitive example and then extend it to the more general case. Assume that
and
. Then we have
In addition, consider the following optimization problem
Here, notice that
is a linear map from a three-dimensional space into a two-dimensional one. A simple example of such a mapping is shown in Figure 2, which illustrates a mapping from a three-dimensional to a two-dimensional space. The left shape shows the feasible set of an IRS with three elements in a real-valued space. The cube corresponds to the case of
, i.e., constraints
, while the sphere shows the constraint
, which corresponds to
. On the right, the images of the two aforementioned regions under the mapping f are shown as an example. It can be seen that the image of the first set of constraints (the hexagon) covers the whole area of the image of the second one (the ellipse). One important observation is that, since the mapping is a linear function, we have f(S1) ⊆ f(S2) whenever S1 ⊆ S2, where S1 and S2 are two arbitrary sets and f is the mapping.
In general, as the number of IRS elements or the dimensions of
increase, the image of the hypercube becomes more and more complicated, and finding the optimal distance becomes more difficult. However, there is an upper bound on this distance. As shown in Figure 3, if, instead of the cube, we limit the feasible set of the IRS elements to the sphere inscribed in the cube, i.e., replace (18b) with (19b), the solution to the problem becomes
. It turns out that finding
is very simple since, by definition, we have
, and we also know that
. Therefore, we can conclude
. Finally, we use one last upper bound to make the original problem even easier to solve. Note that if, instead of the ellipse, we consider the circle inscribed in it, we will have
. As a result, we have
where
. It is worth mentioning that the geometrical representation of the optimization problem (13) is different because there, since the RSI seeks the worst realization against the IRS configuration and the covariance matrices simultaneously, the RSI cannot freely span the whole circle. This is due to the fact that some regions of the circle might not be a good choice for the RSI when designing against the covariance matrices. However, if the best realization of the RSI against the covariance matrices also provides the best RSI against the IRS configuration, the solutions to (14) and (13) will coincide. Eventually, instead of optimization problem (13), one can solve optimization problem (14); the solution to the new problem is guaranteed to be achievable by the original problem as well.
Notice that one can readily extend this interpretation to the complex domain, as the set described by constraint (19b) will still be a subset of that described by constraints (18b). It remains to show that the geometrical proof generalizes to arbitrarily large dimensions; that is, the channel dimensions and the number of IRS elements can be any natural numbers. Interestingly, it is enough to show that the geometrical argument based on
norms and the Euclidean distance carries over to higher dimensions. This proof is given in Appendix A, where it is shown that
. □
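The containment argument can also be checked numerically. The sketch below uses an arbitrary random linear map standing in for f, maps the vertices of the cube [-1, 1]^3 (whose image is the hexagon-like polygon) and samples of the inscribed unit sphere (whose image is the filled ellipse), and verifies that every mapped sphere point lies inside the convex hull of the mapped cube vertices.

```python
import numpy as np
from scipy.spatial import Delaunay

rng = np.random.default_rng(2)
F = rng.standard_normal((2, 3))   # arbitrary linear map standing in for f

# Vertices of the cube [-1, 1]^3: its linear image is the hexagon-like polygon.
corners = np.array([[sx, sy, sz] for sx in (-1, 1)
                    for sy in (-1, 1) for sz in (-1, 1)], dtype=float)
cube_img = corners @ F.T

# Samples of the inscribed unit sphere: its image is the filled ellipse.
dirs = rng.standard_normal((2000, 3))
sphere = dirs / np.linalg.norm(dirs, axis=1, keepdims=True)
sphere_img = sphere @ F.T

# Every mapped sphere point must lie in the convex hull of the mapped vertices.
hull = Delaunay(cube_img)
print(bool((hull.find_simplex(sphere_img) >= 0).all()))  # True
```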
Next, we solve problem (14). Solving this problem is hard in general as it is non-convex. Hence, we first use the following lemma and theorem, where it is shown that for every possible choice of and , there exists at least one set of simultaneously diagonalizable matrices , and that are solutions to problem (14).
Lemma 1. For two positive semi-definite and positive definite matrices and with eigenvalues and , respectively, the following inequalities hold,
Proof. Consider Fiedler's inequality, given by [40],
Furthermore, given
as a positive definite matrix, the following hold,
Now, dividing both sides of (22) by
, one readily obtains (21). □
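For reference, one commonly used form of Fiedler's determinant inequality states that, for Hermitian positive semi-definite A and B with eigenvalues α_i and β_i sorted in decreasing order, ∏_i(α_i + β_i) ≤ det(A + B) ≤ ∏_i(α_i + β_{n-i+1}); this is an assumed statement of the bound cited from [40], which the sketch below checks on random test matrices.

```python
import numpy as np

rng = np.random.default_rng(3)

def random_psd(n):
    """Random Hermitian positive semi-definite test matrix."""
    X = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
    return X @ X.conj().T

n = 4
for _ in range(2000):
    A, B = random_psd(n), random_psd(n)
    a = np.sort(np.linalg.eigvalsh(A))[::-1]   # decreasing order
    b = np.sort(np.linalg.eigvalsh(B))[::-1]
    d = np.linalg.det(A + B).real
    lower = np.prod(a + b)          # same-order pairing of eigenvalues
    upper = np.prod(a + b[::-1])    # opposite-order pairing
    assert d >= lower * (1 - 1e-9) and d <= upper * (1 + 1e-9)
print("Fiedler sandwich held on all samples")
```

Equality at the lower bound occurs when the two matrices are diagonalizable over a common basis with identically ordered eigenvalues, which matches the equality condition stated next.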
Note that the inequalities in (21) hold with equality if and only if
and
are diagonalizable over a common basis. Using the result of Lemma 1,
can be lower-bounded as
In addition, it holds that
. Hence, we obtain
Note that the inequality holds with equality whenever
and
share a common basis. Next, we use the following inequality
The above inequality holds true since .
Now, instead of carrying out the minimization over the left-hand side (LHS) of (26), we can first minimize the right-hand side (RHS) of (27) to find an achievable rate. Similarly, for
we have
Remark 1. Given the equality , one can generally apply the multiplication rule of determinants, i.e., . Further, using the properties of determinants, we can also conclude where is a random permutation of i, which indicates that there is no need for to be in decreasing order. However, one cannot generally conclude , unless and share a common basis.
As a result of Remark 1, in general, we cannot rewrite (26) in terms of , , and . However, if we show that for every choice of , there exists a matrix with the following properties: (1) ; (2) ; and (3) ; then we can use instead and rewrite (26) in terms of and to simplify the problem. The first property implies that both and have exactly the same impact on the capacity. Hence, if we find a that is a solution to problem (14), its corresponding will also be a solution. The second property means that, unlike , actually shares a common basis with . The last property implies that is at least as good as in terms of power consumption. Observe that if we show that for every feasible there exists at least one such , then we can solve problem (14) in a much easier way. The reason is that, in such a case, instead of searching for the optimal over the whole feasible set, we can search for the optimal . Unlike , finding does not require a complete search over the whole feasible set, since shares a common basis with . Therefore, we can limit our search to the portion of the feasible set in which the matrices have eigendirections identical to those of . Similarly, if we show that for every choice of there exists at least one for which the three conditions , and hold, we can simplify our search to finding instead of . In the next theorem, we show that such and exist.
Theorem 2. For all matrices and , there exists at least one matrix that satisfies the following conditions, where is a random permutation of i, which indicates that there is no need for to be in decreasing order. For the sake of simplicity, we use the following notation in the rest of the paper,
Now, using Theorem 2 alongside Lemma 1, we infer that, with no loss of generality, instead of optimizing over matrices, one can carry out the optimization over eigenvalues to find the optimal value of the RHS of (27). Then we have
Note that the two additional constraints (37d) and (37e) need to be satisfied due to the conditions of Lemma 1 (i.e., the eigenvalues have to be in decreasing order). Interestingly, these two additional constraints are affine. The above optimization problem can be further simplified using the following lemma,
Lemma 2. The objective function of the optimization problem (37) is optimized when the constraints (37a) and (37c) are satisfied with equality.
Proof. Intuitively, since the objective function is an increasing function of each element of
and a decreasing function of each element of
, the constraints are met with equality at convergence. See Appendix C for the proof. □
3.3. Algorithm Description
In this subsection, our proposed algorithm is presented. In short, it works as follows. First, based on the task of the IRS in the system, we compute the effect of the IRS on the RSI, the source–relay, and/or the relay–destination channel links. After that, we design the transmit signals at the source and the relay with the objective of maximizing the throughput. In the rest of this subsection, a detailed explanation of the algorithm is given. First, we need to solve the optimization problem (37). It can be readily shown that
is a monotonically increasing function of
. Furthermore, one can show that
is an increasing function with respect to
and a decreasing function with respect to
and
(see Appendix D). Consequently, the worst-case RSI chooses a strategy that reduces the spectral efficiency, while the relay and the source counter this strategy to improve the robustness of the system. That means that, on the one hand, the RSI hurts the stronger eigendirections of the received signal space more than the weaker ones; on the other hand, the source adaptively counters this strategy by smart eigendirection selection. This process clearly complicates the optimization problem at the source–relay hop. Unlike the source–relay hop, the resource allocation problem at the relay–destination hop is rather easy: since there is only one maximization at this hop, we can find the sum capacity simply by using the well-known water-filling algorithm.
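A minimal sketch of the standard water-filling allocation used at this hop, assuming per-eigenmode channel gains and a total power budget (the names and the bisection-based solver are illustrative):

```python
import numpy as np

def water_filling(gains, total_power, iters=100):
    """Maximize sum(log2(1 + g_i * p_i)) s.t. sum(p_i) <= P by bisecting on
    the water level mu, with p_i = max(mu - 1/g_i, 0)."""
    lo, hi = 0.0, total_power + 1.0 / gains.min()
    for _ in range(iters):
        mu = 0.5 * (lo + hi)
        if np.maximum(mu - 1.0 / gains, 0.0).sum() > total_power:
            hi = mu
        else:
            lo = mu
    return np.maximum(lo - 1.0 / gains, 0.0)

gains = np.array([2.0, 1.0, 0.5, 0.1])   # illustrative eigenmode gains
p = water_filling(gains, total_power=4.0)
print(p, p.sum())                         # stronger modes are filled first
```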
Observe that although finding each
and
separately is a convex problem, problem (37) as a whole is not convex. Therefore, in this paper, we find the optimal
while keeping
fixed. Then, we use the resulting
to find the optimal
, and again use the newly obtained
to find the optimal
. This iterative process is repeated until convergence. Our simulations showed that the algorithm converges very quickly, and only in rare cases does it take more than 20 iterations to converge. This is mainly because inequalities (37d) and (37e) restrict the eigenvalues to vary only up to a certain limit, which in turn makes the outputs more stable. Figure 4 depicts a typical histogram of the number of iterations. As can be seen, fewer than
of the cases failed to converge within 50 iterations.
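The alternating procedure just described can be sketched generically as follows; the two solver callbacks are hypothetical stand-ins for the convex subproblems of (37), and the toy usage only demonstrates the convergence check:

```python
import numpy as np

def alternating_opt(opt_p_given_q, opt_q_given_p, p0, q0, tol=1e-6, max_iter=50):
    """Generic alternating loop: fix one block of eigenvalues, optimize the
    other, and repeat until the iterates stabilize."""
    p, q = p0, q0
    for it in range(max_iter):
        p_new = opt_p_given_q(q)       # convex subproblem 1 (e.g., water-filling)
        q_new = opt_q_given_p(p_new)   # convex subproblem 2 (worst-case RSI step)
        if np.linalg.norm(p_new - p) + np.linalg.norm(q_new - q) < tol:
            return p_new, q_new, it + 1
        p, q = p_new, q_new
    return p, q, max_iter

# Toy usage with contractive stand-in subproblems (fixed point p = 4/3, q = 2/3):
f_p = lambda q: 0.5 * q + 1.0
f_q = lambda p: 0.5 * p
p, q, iters = alternating_opt(f_p, f_q, np.zeros(3), np.zeros(3))
print(p, q, iters)
```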
Notice that the optimum values of the transmission power on the relay hop may not sum to . The reason is that is a monotonically decreasing function of , and as we are interested in , with we will have . Therefore, it is in our interest to keep as low as possible in order to increase as much as possible. Analogously, in the case of we have , which can be increased by increasing the total transmit power of the relay. As a result, the well-known bisection method can be used to find the optimal rate where we have , unless the case happens even when the maximum allowed power is used at the relay transmitter. In such a case, the relay–destination link becomes the bottleneck.
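This power-balancing step can be sketched as a bisection on the relay transmit power until the two hop rates meet; rate_hop1 and rate_hop2 below are hypothetical monotone rate functions standing in for the expressions above (hop 1 decreasing in the relay power through the RSI, hop 2 increasing).

```python
import math

def balance_relay_power(rate_hop1, rate_hop2, p_max, tol=1e-6):
    """Bisection on the relay transmit power P_r so that the source-relay rate
    (decreasing in P_r) meets the relay-destination rate (increasing in P_r).
    If hop 2 is still the smaller rate at P_r = p_max, the relay-destination
    link is the bottleneck and full power is used."""
    if rate_hop2(p_max) <= rate_hop1(p_max):
        return p_max                      # bottleneck case: use maximum power
    lo, hi = 0.0, p_max
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if rate_hop2(mid) < rate_hop1(mid):
            lo = mid                      # hop 2 still slower: raise relay power
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Toy usage with monotone stand-in rate curves:
r1 = lambda p: math.log2(1 + 10.0 / (1.0 + 0.5 * p))  # decreasing in p
r2 = lambda p: math.log2(1 + 2.0 * p)                  # increasing in p
print(balance_relay_power(r1, r2, p_max=10.0))         # crossing near 2.317
```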
Now we focus on how to find
. In order to find the sum rate of the source–relay hop, we assume that we are already given
, the vector of relay input powers that maximizes the sum rate at the relay–destination hop. The next step is to carry out the minimization over
and the maximization over
. One approach is to solve the problem iteratively. With this method, one first finds the optimal
by solving the maximization part of (37) under the assumption that the optimal
is given; then, having the optimal
, the minimization part of (37) can be solved efficiently. This process continues until the convergence of
and/or
. The maximization part is performed using the water-filling method. However, the additional conditions
should be taken into account. For instance, if the optimal value of
turns out to be zero, then we must have
for all
, irrespective of their SNR.
Figure 5 depicts two different examples of the multi-level water-filling algorithm. As can be seen, first, a regular water-filling algorithm is considered, where each subchannel has
as its channel gain. After finding the water level in this way, we need to impose
. These additional restrictions act like caps on top of the water and create a multi-level water-filling profile, which can be interpreted as a cave.
Figure 5a shows the case where these caps do not force any subchannel to zero power. However, Figure 5b shows the case where subchannel
has to be zero as a result of the cap imposed by the additional constraints (37d). In this case, we have
, and as a result,
. Thus, this condition forces all other subchannels (i.e.,
) to receive no power. Algorithm 1 provides the details of the multi-level water-filling. For the minimization part, a Lagrangian multiplier is used. We have
Calculating
, we arrive at
where
is the water level.
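A minimal sketch of such a capped, multi-level water-filling, assuming per-subchannel gains and upper caps derived from the ordering constraints (the cap values, names, and the zero-propagation rule are illustrative renderings of the description above):

```python
import numpy as np

def capped_water_filling(gains, caps, total_power, iters=100):
    """Water-filling with per-subchannel caps p_i <= c_i: bisection on the
    water level, with each power clipped to its cap, producing the
    multi-level ("cave") profile described above."""
    lo, hi = 0.0, total_power + 1.0 / gains.min()
    for _ in range(iters):
        mu = 0.5 * (lo + hi)
        if np.clip(mu - 1.0 / gains, 0.0, caps).sum() > total_power:
            hi = mu
        else:
            lo = mu
    p = np.clip(lo - 1.0 / gains, 0.0, caps)
    # Ordering rule: once a subchannel is capped at zero, all weaker
    # (later) subchannels receive no power either.
    p[np.cumsum(caps == 0) > 0] = 0.0
    return p

gains = np.array([2.0, 1.5, 1.0, 0.5])
caps = np.array([1.5, 1.0, 1.0, 0.8])   # illustrative caps from (37d)
print(capped_water_filling(gains, caps, total_power=3.0))
```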
Similarly to the maximization case, there are additional constraints
that must be considered during the minimization. However, it can be shown that if the constraints
and
are met, then the constraint
becomes redundant; see Appendix E for the proof. A summary of the algorithm for finding the achievable rate is given in Algorithm 2. Next, we deal with the optimization for the cases where the IRS is utilized to help either the source–relay or the relay–destination channel. In such cases, the optimization over the covariance matrices remains the same as in the above-mentioned case. In addition, the optimization of the IRS elements can be performed using eigenvalue decomposition and the algorithm introduced in [8]. Notice that for the case in which the IRS assists the source–relay link, the term
in (27) should be replaced with
, and for the case where the IRS helps the relay–destination link, the term
in (28) should be replaced with
. The pseudo code for these scenarios is given in Algorithm 3.
Algorithm 1 The optimal
1: Find power allocation using the water-filling algorithm
2: while is large do
3: Define
4: for i do
5: Calculate
6: if then
7:
8:
9: end if
10: end for
11:
12: end while
Algorithm 2 Robust Transceiver Design for the FD scenario, the first case
1: Define , ,
2: while is large do
3: Determine , s.t.
4: Define and and
5: Set
6: while is large do
7: Obtain , using Equation (39)
8: Obtain , using Algorithm 1
9:
10: end while
11: Calculate and
12: if then
13:
14: else if then
15:
16: end if
17:
18: end while
Algorithm 3 Robust Transceiver Design for the FD scenario, the second and third cases
1: Define , ,
2: Define if the IRS is being used to help the source–relay link, or if the IRS is being used to help the relay–destination link
3: while is large do
4: Determine , s.t.
5: Define and and
6: if then
7:
8: Find the optimum , using Algorithm 1 in [8]
9: else if then
10:
11: Find the optimum , using Algorithm 1 in [8]
12: end if
13: Obtain , using Algorithm 1
14:
15: end while
16: Calculate and
17: if then
18:
19: else if then
20:
21: end if
22: