A Novel Two-Step Channel Estimation Method for RIS-Assisted mmWave Systems

Yu, Jiarun

doi:10.3390/s24165362

Open AccessArticle

A Novel Two-Step Channel Estimation Method for RIS-Assisted mmWave Systems

by

Jiarun Yu

School of Information and Communications Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China

Sensors 2024, 24(16), 5362; https://doi.org/10.3390/s24165362

Submission received: 2 June 2024 / Revised: 2 August 2024 / Accepted: 16 August 2024 / Published: 19 August 2024

(This article belongs to the Topic Advanced Array Signal Processing for B5G/6G: Models, Algorithms, and Applications)

Download

Browse Figures

Versions Notes

Abstract

:

In this work, we resolve the cascaded channel estimation problem and the reflected channel estimation problem for the reconfigurable intelligent surface (RIS)-assisted millimeter-wave (mmWave) systems. The novel two-step method contains modified multiple population genetic algorithm (MMPGA), least squares (LS), residual network (ResNet), and multi-task regression model. In the first step, the proposed MMPGA-LS optimizes the crossover strategy and mutation strategy. Besides, the ResNet achieves cascaded channel estimation by learning the relationship between the cascaded channel obtained by the MMPGA-LS and the channel of the user (UE)-RIS-base station (BS). Then, the proposed multi-task-ResNet (MTRnet) is introduced for the reflected channel estimation. Relying on the output of ResNet, the MTRnet with multiple output layers estimates the coefficients of reflected channels and reconstructs the channel of UE-RIS and RIS-BS. Remarkably, the proposed MTRnet is capable of using a lower optimization model to estimate multiple reflected channels compared with the classical neural network with the single output layer. A series of experimental results validate the superiority of the proposed method in terms of a lower norm mean square error (NMSE). Besides, the proposed method also obtains a low NMSE in the RIS with the formulation of the uniform planar array.

Keywords:

channel estimation; RIS-assisted mmWave systems; MMPGA; ResNet; multi-task regression model; MTRnet

1. Introduction

Intelligent reflecting surfaces (IRS), also denoted as reconfigurable intelligent surfaces (RISs), have the potential to improve the system performance of the 6G network [1]. Generally speaking, RIS was one kind of meta-surface composed of a vast number of passive reflecting elements, which could be controlled in real time to dynamically alter the amplitude and/or phase of the reflecting signal, thus collaboratively enabling smart reconfiguration of the radio propagation environment [2]. However, channel estimation in passive RIS-assisted millimeter-wave (mmWave) systems is challenging. It is because the passive RIS is unable to perform signal processing, and the large number of reflecting elements in the RIS leads to high complexity [3,4].

During the past decade, some methodologies have been used to address channel estimation. The authors of [5] proposed a tensor modeling approach aiming at reducing the channel estimation error. This channel estimation problem was translated into recovering multiple rand-1 matrix approximation sub-problems [5]. Authors of [6] investigated the direction-of-departure (DoD) and direction-of-arrival (DoA) estimation in a bistatic multiple input multiple output (MIMO) radar, in which a backward IRS was exploited to receive the echoes reflected by the targets from the NLOS viewpoint. Wei et al. [7] applied the least-squares (LS) channel estimation solution for the channel of the user (UE)-base station (BS). However, the channel estimation performance in [7] was sensitive to the additional Gaussian white noise. Compressed sensing methods in [8,9] transformed the channel estimation into a sparse signal recovery problem. The channel estimation method based on compressed sensing required traversing the dictionary matrix to attain the low norm mean square error (NMSE). In [10], authors developed an improved version of the differential evolution (DE) algorithm for cell-free MIMO systems assisted by RIS. By configuring phase shift vectors for the RIS-assisted reflected channel estimation, Byun et al. [11] improved the channel estimation accuracy. The evolution algorithm used in [11] paid attention to the improvement from the mutation operation and neglected the influence due to a random children selection in the crossover strategy. The convex optimization approach [12] and hybrid evolution method [13] reduced the error between the actual signal and the reconstructed signal via optimizing the corresponding channel matrix.

As a popular approach for improving communication systems performance, some researchers have introduced deep learning for the channel estimation problem [14,15,16]. In [14,15], the authors proposed a convolutional-neural-network (CNN)-based channel estimation method. The method in [14] required the RIS to process the transmitted signal. Therefore, this method could not be applied to the passive RIS system. A conditional generative adversarial network (cGAN) was designed to estimate the cascaded channel with the received signals as conditional information [16]. A deep-learning-based approach with the traditional orthogonal matching pursuit followed by the residual network was introduced for the cascaded uplink channel estimation problem [17]. However, the methods in [16,17] did not achieve the channel estimation of RIS-BS and UE-RIS. Without the information about the directive channel of UE-BS, a deep-learning-based channel estimation method in [18] did not estimate the reflected channels of RIS-BS and UE-RIS effectively.

To further reduce the channel estimation NMSE, some methods integrated deep learning and other methods [19,20,21]. In [19], Wang et al. proposed a channel estimation scheme based on an LS solution for estimating the cascaded channel. Differently, the authors of [20] modeled the channel estimation as a denoising problem and developed a versatile deep residual learning-based cascaded channel estimation framework. Besides, the channel estimation method adopted a CNN-based deep residual network to learn the mapping between the noisy channel matrix and the denoised channel matrix [21]. The optimized network architectures in [19,20,21] did not estimate the reflected channels of RIS-BS and UE-RIS simultaneously. Wang et al. proposed a machine learning-based CS channel estimation method for wireless communication [22]. In [23], authors propose a channel estimation method for the passive RIS-assisted systems. The authors of [24] performed two stages by following atomic norm minimization to recover the channel parameters. In [25], authors proposed a strategy for joint target and user assignment, power allocation, and subchannel allocation (JCAPASA) in the RIS-assisted systems. The framework used in [26] integrated the CNN and Lagrange optimization algorithms, which aimed at achieving cascaded channel estimation. The method in [26] required the additional optimization of Lagrange factors to obtain the low channel estimation NMSE.

Most of the above-mentioned methods mainly focused on cascaded channel estimation and did not simultaneously achieve the channel estimation of RIS-BS and UE-RIS without considering the UE-to-BS communication. To overcome this shortcoming, we propose a novel two-step channel estimation method for the RIS-assisted mmWave systems. The scope of this work is to fill in the gap in the literature on reflected channel estimation with the use of deep learning. The residual network (ResNet) with the cross-layers operation [27] further improves the non-linear processing ability relative to some common neural networks. Compared with the single regression model, the multi-task model [28] has stronger learning ability. With the multiple output layers, the multi-task solved many sub-problems simultaneously. Therefore, we introduce a neural network integrating the multi-task regression model and ResNet for the channel estimation problem. Remarkably, the two-step method integrates the proposed modified multiple population genetic algorithm (MMPGA), LS estimator, ResNet, and multi-task-ResNet (MTRnet). The main contributions of this paper are summarized as follows:

In addition to the cascaded channel estimation, we further estimate the reflected channels of RIS-BS and UE-RIS. Remarkably, a novel two-step channel estimation method using MMPGA, LS estimator, ResNet, and MTRnet is introduced for the RIS-assisted mmWave systems.
The MMPGA-LS-ResNet is proposed to estimate the cascaded channel of UE-RIS-BS. The MMPGA-LS optimizes the crossover strategy and mutation strategy compared with the common evolution algorithm. As a result, the proposed MMPGA-LS is capable of reducing the estimation error. Then, ResNet is applied to further reduce the cascaded channel error. Relying on the designed network architecture, including the multiple cross-layer operations and layers, the proposed ResNet learns the relationship between the output of MMPGA-LS and the channel of UE-RIS-BS effectively.
Furthermore, the proposed MTRnet is introduced for estimating the reflected channels of RIS-UE and UE-RIS. Compared with the single regression model, the MTRnet integrates the multi-task learning model and ResNet. As a result, the proposed MTRnet with multiple output layers achieves the reflected channel estimation within fewer optimization models compared with that based on the single regression model.
A series of experimental results have validated the superiority of the novel two-step channel estimation method. For the cascaded channel estimation performance, the MMPGA-LS achieves a lower NMSE compared with a genetic algorithm (GA) [29] and particle swarm algorithm (PSO) [30]. Besides, the proposed ResNet also obtains a lower NMSE compared with convolutional recurrent neural network (CRNN) [17] and CNN [15]. Additionally, the proposed MTRnet based on the multi-task learning ability still outperforms some single-learning models in terms of a lower NMSE. Besides, the proposed method also obtains a low NMSE in the RIS with the formulation of the uniform planar array.

The rest of this paper is organized into the following parts: In Section 2, the problem of channel estimation is introduced. In Section 3, the proposed MMPGA-ResNet-MTRnet-based method is described in detail. In Section 4, the proposed two-step method is utilized for the simulations of channel estimation. The numerical results compared with other algorithms are presented to validate the effectiveness of the proposed method. The conclusions are given in Section 5.

2. Channel Estimation System Model

In the uplink RIS-assisted mmWave communication systems, there is no point-to-point communication from UE to BS. Considering the N-elements BS with the formulation of a uniform linear array and M-elements RIS with the formulation of a uniform linear array [31], the received signal at the BS is given [32,33]

y = H_{1} Ψ H_{2} s + n,

(1)

where

H_{1} \in C^{N \times M}

denotes the channel of RIS-BS,

Ψ = diag [b_{1} e^{j ψ_{1}}, b_{2} e^{j ψ_{2}}, \dots, b_{M} e^{j ψ_{M}}] \in C^{M \times M}

represents the reflecting matrix, and

ψ_{m}

is distributed in the interval

[- π / 2, + π / 2]

. The channel of UE-RIS is denoted as

H_{2} \in C^{M \times 1}

, the transmitted pilot signal sequence with the length

L_{s}

is

s

[34], the Gaussian white noise with mean 0 and variance

ξ_{n}^{2}

is

n

.

In the RIS with the formulation of ULA,

H_{ULA, 1}

is expressed as

H_{ULA, 1} = \sqrt{\frac{N M}{P}} \sum_{p = 1}^{P} α_{1, p} a_{BS, Rx} (θ_{p}) a_{RIS, Tx}^{H} (ω_{p}),

(2)

where the number of multipaths is P,

α_{1, p}

is the complex gain,

a_{BS, Rx} (θ_{p}) C^{N \times 1}

denotes the steering vector at the BS side,

a_{RIS, Tx} (ω_{p}) C^{M \times 1}

represents the steering vector from the departure direction at the RIS side,

θ_{p}

means the physical direction-of-arrival (DoA) at the BS side,

ω_{p}

is the direction-of-departure (DoD) at the RIS side, and

{(\cdot)}^{H}

expresses the conjugate transport operation,

α_{1} = {α_{1, p}}_{p = 1, \dots, P}

. The multipaths in the systems contain the single line-of-sight (LOS) path and

P - 1

non-line-of-sight (NLOS) paths.

a_{BS, Rx} (θ_{p}) = \frac{1}{\sqrt{N}} {[e^{j \frac{2 π}{λ} d n \sin (θ_{p})}]}^{T}, n = 0, \dots, N - 1,

(3)

where

λ

means the wavelength of the barrier frequency,

d = \frac{λ}{2}

is the spacing between adjacent elements, and

{(\cdot)}^{T}

presents the transport operation. The steering vector

a_{RIS, Tx} (ω_{p})

is expressed as

a_{RIS, Tx} (ω_{p}) = \frac{1}{\sqrt{M}} {[e^{j \frac{2 π}{λ} d m \sin (ω_{p})}]}^{T}, m = 0, \dots, M - 1 .

(4)

H_{ULA, 2}

is given as

H_{ULA, 2} = \sqrt{\frac{M}{P}} \sum_{p = 1}^{P} α_{2, p} a_{RIS, Rx} (ϕ_{p}),

(5)

where

α_{2, p}

is the complex gain; the steering vector at the arrival direction of RIS side is represented as

a_{RIS, Rx} (ϕ_{p})

;

ϕ_{p}

is DoA at the RIS side;

α_{2} = {α_{2, p}}_{p = 1, \dots, P}

a_{RIS, Rx} (ϕ_{p}) = \frac{1}{\sqrt{M}} {[e^{j \frac{2 π}{λ} d m \sin (ϕ_{p)}}]}^{T}, m = 0, \dots, M - 1 .

(6)

In the RIS with the formulation of UPA,

H_{UPA, 1}

is expressed as

H_{UPA, 1} = \sqrt{\frac{N M}{P}} \sum_{p = 1}^{P} α_{1, p} a_{BS, Rx} (θ_{p}) a_{RIS, Tx}^{H} (β_{1, p}, δ_{1, p}),

(7)

a_{RIS, Tx} (β_{p}, δ_{p}) = a_{Tx, x} (β_{1, p}, δ_{1, p}) \otimes a_{Tx, y} (β_{1, p}, δ_{1, p}),

(8)

a_{Tx, x} (β_{1, p}, δ_{1, p}) = \frac{1}{\sqrt{M_{x}}} {[e^{j \frac{2 π}{λ} d m_{x} \sin (β_{1, p}) \sin (δ_{1, p})}]}^{T}, m_{x} = 0, \dots, M_{x} - 1,

(9)

a_{Tx, y} (β_{1, p}, δ_{1, p}) = \frac{1}{\sqrt{M_{y}}} {[e^{j \frac{2 π}{λ} d m_{y} \sin (β_{1, p}) \cos (δ_{1, p})}]}^{T}, m_{y} = 0, \dots, M_{y} - 1,

(10)

where

M = M_{x} M_{y}

.

H_{UPA, 2}

is given as

H_{UPA, 2} = \sqrt{\frac{M}{P}} \sum_{p = 1}^{P} α_{2, p} a_{RIS, Rx} (β_{2, p}, δ_{2, p}),

(11)

a_{RIS, Rx} (β_{p}, δ_{p}) = a_{Rx, x} (β_{2, p}, δ_{2, p}) \otimes a_{Rx, y} (β_{2, p}, δ_{2, p}),

(12)

a_{Rx, x} (β_{2, p}, δ_{2, p}) = \frac{1}{\sqrt{M_{x}}} {[e^{j \frac{2 π}{λ} d m_{x} \sin (β_{2, p}) \sin (δ_{2, p})}]}^{T}, m_{x} = 0, \dots, M_{x} - 1,

(13)

a_{Rx, y} (β_{1, p}, δ_{1, p}) = \frac{1}{\sqrt{M_{y}}} {[e^{j \frac{2 π}{λ} d m_{y} \sin (β_{2, p}) \cos (δ_{2, p})}]}^{T}, m_{y} = 0, \dots, M_{y} - 1,

(14)

where

β_{1, p}, β_{2, p}

stand for the elevation angle,

δ_{1, p}, δ_{2, p}

mean the azimuth angle.

In the passive RIS system, we select the reflecting elements randomly, where

b_{m} \in {0, 1},

m = 0, \dots, M - 1

. According to [16], the cascaded channel

H

is given as follows:

H = H_{1} Ψ H_{2} .

(15)

According to Equations (1)–(15), there exists a relationship between

(s, H_{1}, H_{2})

, and

y

, which is written as

f^{- 1} (s, y) = H .

(16)

f^{- 1} (H) = {α, θ, ω, ϕ}, f (α, θ, ω, ϕ) \to {H_{1}, H_{2}} .

(17)

The resolvable problem in this paper is expressed as

f^{- 1} (s, y) \to {H_{1}, H_{2}} .

(18)

3. The Novel Two-Step Channel Estimation Method

The proposed channel estimation method contains the MMPGA-LS-ResNet-based cascaded channel estimation and the MTRnet-based reflected channel estimation. In the first step, the MMPGA-LS executes the population initialization, classification, crossover, adaptive mutation, and reservation strategies. Relying on the generation of MMPGA-LS, the proposed ResNet further improves cascaded channel estimation performance. Based on the predicted cascaded channel, MTRnet with multiple output layers simultaneously estimates the channel coefficients (DoAs, DoDs, and channel gains) in the second step. As a result, the proposed method reconstructs the reflected channels of RIS-BS and UE-RIS.

3.1. MMPGA-LS-ResNet-Based Cascaded Channel Estimation

In this subsection, the MMPGA-LS-ResNet-based cascaded channel estimation method is introduced for the RIS-assisted mmWave systems. The MMPGA-LS, with its improved crossover strategy and mutation strategy, initially estimates the cascaded channel. Then, the proposed ResNet learns the non-linear relationship between the cascaded channel obtained by the MMPGA-LS and the channel of the UE-RIS-BS. The ResNet aims at further reducing the channel estimation NMSE.

The proposed MMPGA-LS reduces the channel estimation error by optimizing the reflecting phases. Based on the population initialization, the MMPGA-LS classifies them via fitness ranking. The MMPGA-LS makes good use of the best one, corresponding to the highest fitness in the crossover strategy. Then, the adaptive mutation strategy flexibly adjusts the mutation factor according to fitness. Based on the generation of the mutation, the proposed method preserves the partial children with higher fitness. After using the LS estimator, the proposed method obtains

H_{MMPGA, LS}

. Figure 1 represents the flowcharts of the proposed MMPGA-LS.

3.1.1. Population Initialization

We assume that the initial population

κ \in R^{Q \times M_{acti}}

contains Q children, where

M_{acti}

denotes the number of active elements in the RIS.

κ

is given as below

κ = {[κ_{1}, κ_{2}, \dots, κ_{Q}]}^{T},

(19)

where

κ_{q} = [κ_{q, 1}, κ_{q, 2}, \dots, κ_{q, M_{acti}}] \in R^{1 \times M_{acti}}

,

κ_{q, m_{acti}} = f_{deci} ({g_{q, m_{acti}}}

,

g_{q, m_{acti}} = [g_{q, m_{acti}, 1}, \dots, g_{q, m_{acti}, l}, \dots, g_{q, m_{acti}, L_{c}}] \in Z^{1 \times L_{c}}

,

g_{q, m_{acti}, l} \in {0, 1}

,

L_{c}

stands for the length of a binary-gene sequence, and

f_{deci} (\cdot)

means a decimal-transportation function.

f_{deci} (g_{q, m_{acti}}) = l b + \frac{\sum_{l = 1}^{L_{c}} 2^{g_{q, m_{acti}, l}}}{2^{L_{c}}} (u p - l b), l = 1, \dots, L_{c},

(20)

where

l b

is the lower bound,

u p

means the upper bound,

l b \leq κ_{q, m_{acti}} \leq u p

. The whole gene population is defined as

G_{init} \in Z^{Q \times L_{c} \times M_{acti}}

.

g_{q, m_{acti}, l}

selects 0 or 1 randomly.

3.1.2. Population Classification

After using

κ_{q}

to obtain

f_{LS, MMSE} (H_{LS}, H)

, the corresponding fitness is defined as

f i t = \frac{1}{f_{LS, NMSE} (H_{LS}, H)},

(21)

f_{LS, NMSE} (H_{LS}, H) = E {\frac{| | H_{LS} - H {| |}_{F}^{2}}{{| | H | |}_{F}^{2}}},

(22)

where

E {\cdot}

expresses the expectation operation;

| | \cdot {| |}_{F}^{2}

means the square of the Frobenius norm.

According to the descending order criterion, the fitness set

F i t

is divided into

F i t_{fa} \in R^{Q_{1} \times 1}

,

F i t_{mo} \in R^{Q_{2} \times 1}

, and

F i t_{su} \in R^{Q_{3} \times 1}

, where

Q = Q_{1} + Q_{2} + Q_{3}

. The father population corresponding to

F i t_{fa}

is defined as

κ_{fa} \in R^{Q_{1} \times M_{acti}}

, and its gene population is given as

G_{fa} \in Z^{Q_{1} \times L_{c} \times M_{acti}}

. The mother-population corresponding to

F i t_{mo}

is defined as

κ_{mo} \in R^{Q_{2} \times M_{acti}}

, and its gene population is given as

G_{mo} \in Z^{Q_{2} \times L_{c} \times M_{acti}}

. The sub-population corresponding to

F i t_{su}

is defined as

κ_{su} \in R^{Q_{3} \times M_{acti}}

, and its gene population is given as

G_{su} \in Z^{Q_{3} \times L_{c} \times M_{acti}}

.

3.1.3. Crossover

Figure 2 illustrates the proposed crossover strategy. The MMPGA-LS generates a crossover probability

μ (0 \leq μ \leq 1)

and compares it with

μ_{c}

. The execution of the crossover strategy satisfies a condition, where

μ \leq μ_{c}

. Relying on

F i t

, the best one

κ_{best} \in R^{1 \times M_{acti}}

is selected as

κ_{best} \leftarrow f (max {F i t_{q}}), q = 1, 2, \dots, Q .

(23)

Then new gene populations are generated via

g_{best}

,

G_{mo}

,

G_{su}

, and an index

η

of the crossover position

\begin{matrix} g_{2 q_{2} - 1, m_{a c t i}} = {[g_{best} (1, 1 : η), g_{mo, q_{2}, m_{acti}} (q_{2}, η + 1 : L_{c})]}^{T}, \\ g_{2 q_{2}, m_{a c t i}} = {[g_{mo, q_{2}, m_{acti}} (q_{2}, 1 : η), g_{best} (1, η + 1 : L_{c})]}^{T}, \end{matrix}

(24)

where

q_{2} = 1, 2, \dots, Q_{2}

,

\begin{matrix} g_{2 q_{3} - 1, m_{a c t i}} = {[g_{best} (1, 1 : η), g_{su, q_{3}, m_{acti}} (q_{3}, η + 1 : L_{c})]}^{T}, \\ g_{2 q_{3}, m_{a c t i}} = {[g_{su, q_{3}, m_{acti}} (q_{3}, 1 : η), g_{best} (1, η + 1 : L_{c})]}^{T}, \end{matrix}

(25)

where

q_{3} = 1, 2, \dots, Q_{3}

. A new gene population

G_{cross} \in Z^{Q_{4} \times L_{c} \times M_{acti}}

is formulated according to Equations (24) and (25), where

Q_{4} = 2 Q_{2} + 2 Q_{3}

.

With the substitution of

G_{cross}

, the corresponding fitness

F i t_{cross} \in R^{Q_{4} \times 1}

is obtained. MMPGA-LS abandons the worst one corresponding to the lowest fitness in

G_{cross}

, reserves

κ_{best}

, and formulates

κ_{cross} \in R^{Q_{4} \times M_{acti}}

.

3.1.4. Adaptive Mutation

Figure 3 represents the flowcharts of the adaptive mutation strategy. The proposed method randomly generates a mutation probability

φ (0 \leq φ \leq 1)

and compares it with

ζ

. The condition of the adaptive mutation strategy satisfies

φ \leq ζ

. Based on the output of the crossover strategy, three random number sets

{Ra}_{1}, {Ra}_{2}, {Ra}_{3} \in Z^{Q_{4} \times 1}

and mutation scale factors

{ξ_{q_{4}}}_{q_{4} = 1, 2, \dots, Q_{4}} \in R^{Q_{4} \times 1}

,

κ_{muata} \in R^{Q_{4} \times M_{acti}}

are given as

κ_{muata, q_{4}} = κ_{cross, q_{4}} + ξ_{q_{4}} (κ_{cross, {Ra}_{1, q_{4}}} - κ_{cross, q_{4}}) + ξ_{q_{4}} (κ_{cross, {Ra}_{2, q_{4}}} - κ_{cross, {Ra}_{3, q_{4}}}) .

(26)

Sustain

κ_{muata}

to construct the corresponding fitness

F i t_{muata} \in R^{Q_{4} \times 1}

. Compared with the fixed mutation factor, the proposed MMPGA-LS adjusts

{ξ_{q_{4}, t}}

.

ξ_{q_{4}, t + 1} = \frac{1}{1 + 0.99 t} ξ_{q_{4}, t}, F i t_{muata, q_{4}, t} > F i t_{muata, q_{4}, t - 1} a n d F i t_{muata, q_{4}, t - 1} > F i t_{muata, q_{4}, t - 2},

(27)

where t denotes the number of the current iteration.

3.1.5. Population Reservation

Relying on

F i t_{muata, t}

, the proposed method selects

κ_{new, t} \in R^{Q \times M_{acti}}

with higher fitness.

G_{new, t} \in Z^{Q \times L_{c} \times M_{acti}}

is formulated via the binary transportation about

κ_{new, t}

. For an example of

κ_{new, t}

, the relationship between

κ_{q, t, m_{acti}}

and

g_{q, t, m_{acti}}

is expressed as

g_{q, t, m_{acti}} = f_{bina} ((\frac{(κ_{q, t, m_{acti}} - l b) 2^{L_{c}}}{u p - l b}) ⌋),

(28)

where

f_{bina} (\cdot)

denotes the binary transportation function, and

(\cdot) ⌋

is an operation of the integral down. The proposed method replaces

κ

with

κ_{new, t}

.

The proposed MMPGA-LS stops the iteration until

t > T

, where T is a number of the total iteration. Collecting the best one in each iteration, we get

κ_{iter} \in R^{T \times M_{acti}}

and its corresponding fitness set

F i t_{iter} \in R^{T \times 1}

. Based on

F i t_{iter}

,

κ^{*} \in R^{1 \times M_{acti}}

is given as

κ^{*} \leftarrow f (max {F i t_{iter}}_{t}), t = 1, \dots, T .

(29)

After using

κ^{*}

, the cascaded channel

H_{MMPGA, LS}

is obtained via the LS algorithm [35]. To evaluate the performance obtained by the proposed MMPGA-LS, the error function is defined as

f (H_{MMPGA, LS}, H) = E {\frac{| | H_{MMPGA, LS} - H {| |}_{F}^{2}}{{| | H | |}_{F}^{2}}} .

(30)

3.1.6. ResNet

Based on the output of the proposed MMPGA-LS, the proposed ResNet further reduces the cascaded channel estimation error. The dataset used in the network collects the real part

Re {H_{MMPGA, LS}}

and the imaginary part

Im {H_{MMPGA, LS}}

of

H_{MMPGA, LS}

to construct

H_{4} \in R^{N_{1} \times N_{2}}

, where

N_{1} N_{2} = 2 N

.

H_{4} = [\begin{matrix} Re {{[H_{MMPGA, LS}]}_{1, 1}} & Im {{[H_{MMPGA, LS}]}_{1, 1}} & \dots \\ ⋮ & ⋱ & ⋮ \\ \dots & Re {{[H_{MMPGA, LS}]}_{N, 1}} & Im {{[H_{MMPGA, LS}]}_{N, 1}} \end{matrix}] .

(31)

The corresponding operation between the input

H_{4}

and the output

{Re {H^{'}}, Im {H^{'}}}

is expressed as

f_{L_{1}}^{- 1} (f_{L_{1} - 1}^{- 1} \dots (f_{1}^{- 1} (H_{4}))) = {Re {H^{'}}, Im {H^{'}}},

(32)

where

L_{1}

denotes the total layers of ResNet. Figure 4 represents some primary layers of the proposed ResNet. The flowcharts of the cascaded channel estimation based on MMPGA-ResNet are summarized below.

Initialize $κ$ ;
Classify $κ$ based on fitness ranking;
Execute crossover operation shown in Figure 2;
Execute adaption mutation operation shown in Figure 3;
Reserve $κ_{new}$ with the higher fitness;
Select the best one $κ^{*}$ with the highest fitness;
Substitute $κ^{*}$ into Equations (1)–(6) and LS algorithm;
$H_{4} \leftarrow f (H_{MMPGA, LS})$ based on Equation (31);
Substitute $H_{4}$ into the proposed ResNet shown in Figure 4;
Train the network parameters in the ResNet;
Attain the optimization model of ResNet;
$H^{'} \leftarrow f (H_{4})$ .

Figure 4. The primary layers of the proposed ResNet.

The first convolution layer extracts the information in

H_{4}

. For each filter used in the convolution layer, the mathematical expression about the input

X \in R^{N_{x} \times N_{y}} (0 < N_{x} < N_{1}, 0 < N_{y} < N_{2})

and a 2D convolution kernel with the dimension of

N_{x} \times N_{y}

is expressed as

(W^{[l_{1}]} * X) = \sum_{i = 1}^{N_{x}} \sum_{j = 1}^{N_{y}} W_{i, j}^{[l_{1}]} X_{i, j},

(33)

where

W^{[l_{1}]}

means the weight of a kernel in the

l_{1}

th layer. The corresponding bias b is added to

(W^{[l_{1}]} * X)

, where b is an element of

b \in R^{F_{c} \times 1}

, and

F_{c}

is the number of filters. The convolution layer selects the filter with a size of

3 \times 3

. Following each convolution layer, the activation function selects the LeakyReLU function, which is expressed as

y = \{\begin{matrix} x, x \geq 0 \\ ι x, x < 0 \end{matrix} .

(34)

Furthermore, the network utilizes the batch-normalization operation to avoid the over-fit [36]. In

{f^{- 1}}_{i = 2, 6, 11, 15}

, we select 64 filters. The convolution layers in

{f^{- 1}}_{i = 3, 7, 12}

deploy 128 filters to further process the data from the current input. The operations in

{f^{- 1}}_{i = 4, 8, 13}

select 256 filters. The convolution layers in

{f^{- 1}}_{i = 5, 10}

use 32 filters. Following

{f^{- 1}}_{4}

, the ResNet processes the corresponding output in parallel. In

{f^{- 1}}_{9}

, the proposed ResNet implements the cross-layers operation by adding the outputs of

{f^{- 1}}_{4}

and

{f^{- 1}}_{8}

. The cross-layers operation is also done in

{f^{- 1}}_{14}

by adding the outputs of

{f^{- 1}}_{4}

and

{f^{- 1}}_{13}

. The classical InceptionNets [37] also uses the cross-layers operation to improve learning ability.

The flattening operation used in

{f^{- 1}}_{16}

transforms the 2D matrix obtained by the last convolution layer into a column scalar. Finally, the hidden layer, with multiple neurons in the output layer, processes the column scalar. The corresponding mathematical operation between the current input

x

and output

z

is given as

z = W_{hidd} x + b_{hidd} .

(35)

In the output layer, the proposed ResNet predicts the real and imaginary parts of the cascaded channel. Subsequently,

(Re {H^{'}}, Im {H^{'}})

reformulates the predicted channel

H^{'}

, which is expressed as

H^{'} = Re {H^{'}} + j Im {H^{'}}

.

To evaluate the estimation performance achieved by the ResNet, the NMSE function is used as the error function.

f_{NMSE} (H^{'}, H) = E {\frac{| | H^{'} - H {| |}_{F}^{2}}{{| | H | |}_{F}^{2}}} .,

(36)

Based on the gradient descent algorithm, learning rate

r_{1}

,

f_{NMSE} (H^{'}, H)

, and momentum factors, the ResNet updates the prediction. Table 1 represents the configuration of some primary layers in the ResNet.

3.2. MTRnet-Based Reflected Channel Estimation

Based on the output of the ResNet, the proposed MTRnet achieves the reflected channel estimation in the second step. The mapping between input

(Re {H^{'}}, Im {H^{'}})

and output

(θ^{'}, ω^{'}, ϕ^{'}, α_{BS}^{'}, α_{RIS}^{'})

is expressed as

f_{L_{2}}^{- 1} (f_{L_{2} - 1}^{- 1} \dots (f_{1}^{- 1} (Re {H^{'}}, Im {H^{'}})) = {θ^{'}, ω^{'}, ϕ^{'}, α_{BS}^{'}, α_{RIS}^{'}},

(37)

where

L_{2}

means the total layers of the MTRnet. Based on Equations (2)–(6), there exists a relationship between

(θ^{'}, ω^{'}, ϕ^{'}, α_{BS}^{'}, α_{RIS}^{'})

and

(H_{1}^{'}, H_{2}^{'})

. Figure 5 represents the network structure of the proposed MTRnet.

The MTRnet mainly contains a sharing part and multiple sub-tasks. The sharing part implements some convolution layers. Except for the output layer, the convolution layers in the sharing part are the same as the ResNet. Considering the 2D convolution kernel,

(Re {H^{'}}, Im {H^{'}})

is firstly reshaped into

H_{5} \in R^{N_{1} \times N_{2}}

.

H_{5} = [\begin{matrix} Re {{[H^{'}]}_{1, 1}} & Im {{[H^{'}]}_{1, 1}} & \dots \\ ⋮ & ⋱ & ⋮ \\ \dots & Re {{[H^{'}]}_{N, 1}} & Im {{[H^{'}]}_{N, 1}} \end{matrix}] .

(38)

The MTRnet selects the output of the sharing part as the input for all sub-tasks. Following the last convolution-activation-batch-normalization layers in the sharing part, five sub-tasks further process the current input simultaneously. Remarkably, each sub-task has its own exclusive training parameters. In sub-task I and sub-task II, the corresponding output layers both select P neurons to generate

{θ^{'}, ω^{'}}

. Meanwhile, the third sub-task with

2 P

neurons learns the mapping between the current input and

(Re {α_{1}^{'}}, Im {α_{1}^{'}})

.

α_{1}^{'} = Re {α_{1}^{'}} + j Im {α_{1}^{'}}

. The fourth sub-task with P neurons achieves the prediction of

ϕ^{'}

. The output layer in the fifth sub-task utilizes

2 P

neurons to generate

(Re {α_{2}^{'}}, Im {α_{2}^{'}})

and formulate

α_{2}^{'}

. The Tanh activation function is used as the activation function in the second network, which is expressed as

f_{Tanh} (x) = \frac{e^{x} - e^{- x}}{e^{x} + e^{- x}}

. The proposed multi-task regression network selects the mean square error function and Adam optimizer to update the network parameters. Table 2 shows the configuration of network parameters in the five sub-tasks.

Finally, the proposed method achieves the reflected channel estimation through the mapping between outputs obtained by the multi-task regression network and

(H_{1}^{'}, H_{2}^{'})

. It is clear that

H_{1}^{'} \leftarrow f (θ^{'}, ω^{'}, α_{1}^{'})

and

H_{2}^{'} \leftarrow f (ϕ^{'}, α_{2}^{'})

. To evaluate the estimation performance in the reflected channels, we also select the NMSE function.

f_{NMSE} (H_{1}^{'}, H_{1}) = E {\frac{| | H_{1}^{'} - H_{1} {| |}_{F}^{2}}{| | H_{1} {| |}_{F}^{2}}},

(39)

f_{NMSE} (H_{2}^{'}, H_{2}) = E {\frac{| | H_{2}^{'} - H_{2} {| |}_{F}^{2}}{| | H_{2} {| |}_{F}^{2}}} .

(40)

3.3. Implementation of the Novel Two-Step Method

The proposed method is decomposed into two steps, including the MMPGA-LS-ResNet-based cascaded channel estimation and the MTRnet-based reflected channel estimation. In the cascaded channel estimation, the proposed MMPGA-LS first generates the population

κ, G_{init}

. Then, the proposed MMPGA attains

κ^{*}

via the population classification, crossover operation, adaptive mutation operation, and population reservation. Based on

κ^{*}

and the LS estimator,

H_{4}

is formulated from

H_{MMPGA, LS}

. Furthermore, the proposed method achieves the cascaded channel estimation via learning the relationship between the cascaded channel obtained by MMPGA-LS and the channel of UE-RIS-BS.

H_{4}

is used as the input of the proposed ResNet. The corresponding output in the ResNet is expressed as

H^{'}

. As a strong de-noise ability, the ResNet is capable of further reducing the cascaded channel estimation error. The output of the ResNet is reshaped into

H_{5}

and used as the input to the proposed MTRnet. The MTRnet using a multi-task regression model and ResNet estimates the channels of RIS-BS and UE-RIS simultaneously. As a result, the channel parameters

(α_{1}^{'}, θ^{'}, ω^{'}, α_{2}^{'}, ϕ^{'})

are used as the output of the MTRnet. Finally, the proposed method reconstructs the reflected channels based on the output of MTRnet.

The flowcharts of the proposed two-step method are shown in Figure 6, which can be summarized as follows:

Generate population $κ, G_{init}$ ;
Attain $κ^{*}$ via the proposed MMPGA;
Obtain $H_{MMPGA, LS}$ by LS estimator;
$H_{4} \leftarrow f (H_{MMPGA, LS})$ ;
Substitute $H_{4}$ into the proposed ResNet;
Generate $H^{'}$ via the optimization model of ResNet;
$H_{5} \leftarrow f (H^{'})$ ;
Substitute $H_{5}$ into the proposed MTRnet;
Achieve $(α_{1}^{'}, θ^{'}, ω^{'}, α_{2}^{'}, ϕ^{'})$ via the optimization model of MTRnet;
$H_{1}^{'} \leftarrow f (θ^{'}, ω^{'}, α_{1}^{'})$ , $H_{2}^{'} \leftarrow f (ϕ^{'}, α_{2}^{'})$ .

Figure 6. The flowcharts of the novel two-step method.

In Figure 6, the light green part represents the MMPGA-LS. The portion with light blue stand for the ResNet. The beige colored part of Figure 6 denotes the MTRnet.

With much discussion of the cascaded channel estimation, the proposed MMPGA-LS-ResNet can be summarized as follows:

Initialize population;
Classify $κ, G_{init}$ via the descending order criterion of fitness;
Execute the crossover operation based on Equations (24) and (25);
Adopt the adaptive mutation strategy based on Equations (26) and (27);
Select $κ_{new, t}$ with higher fitness;
Replace $κ$ with $κ_{new, t}$ ;
Replace $G_{init}$ with $G_{new}$ ;
Estimate $H_{MMPGA, LS}$ ;
Construct $H_{4}$ ;
Predict $H^{'}$ via the optimization model of ResNet.

As a result,

H^{'}

is exported and used as the input to the MTRnet. Then, after much discussion of reflected channel estimation, the MTRnet-based method can be summarized as follows:

Construct $H_{5}$ ;
Predict $H_{1}^{'}$ and $H_{2}^{'}$ by the optimization model of MTRnet;
Evaluate the channel estimation performance based on Equations (39) and (40).

4. Simulation Results and Discussion

In this section, a series of results validate the superiority of the proposed method. The SNR regimes contain

{0, 5, 10, 15, 20}

dB. In each SNR, the network uses a dataset with a length of 110,000. The length of the training dataset is 90000, and the validation dataset contains the dataset with a length of 10,000. The remaining data belong to the test dataset. Throughout the simulations, the RIS implements the formulation of the uniform linear array with

M = 32

elements and

M_{acti} = 4

. The BS uses the uniform linear array with

N = 16

elements,

L_{s} \in {16, 32, 64}

,

P = 4

, and the half-wavelength spacing. Besides, we select

Q = 100, L_{c} = 20, Q_{1} = 20, Q_{2} = 20, Q_{3} = 60, μ_{1} = 0.8, ξ = 0.25, T = 200

. Keras 2.2 is used to implement the proposed neural networks. The networks are running on Python 3.5, cuda 10.0, cuDNN 7.6, and GPU 8G. MMPGA-LS is compared with GA [29] and PSO [30] in terms of cascaded channel estimation NMSE. For the reflected channel estimation performance, CNN [15] and CRNN [17] are compared with the proposed method in terms of the NMSE.

4.1. Comparisons with Reported Methods

Table 3 summarizes the computation complexity of different methods. The computation in the proposed MMPGA is mainly concentrated on initialization, crossover, and mutation operations. The computation complexity of the initialization is about the population

Q^{2}

and the complexity

O (N L_{s}^{4})

of the LS algorithm, which is expressed as

O (M_{acti} Q^{2} N L_{s}^{4})

. The computational complexity of the crossover operation is proportional to

Q_{4}

and the LS algorithm, which is shown as

O (M_{acti} Q_{4} N L_{s}^{4})

. The computational complexity of the mutation is proportional to

Q_{4}

L_{c}

and

O (N L_{s}^{4})

, and its computational complexity is denoted as

O (M_{acti} Q_{4} L_{c} N L_{s}^{4})

. As a result, the complexity of the proposed MMPGA is written as

O (T M_{acti} N L_{s}^{4} (Q^{2} + Q_{4} + Q_{4} L_{c}))

. The computational complexity of PSO [30] is

O (2 M_{acti} T Q N L_{s}^{4})

. The computational complexity of GA [29] stands for

O (T M_{acti} N L_{s}^{4} (Q^{2} + 2 Q))

. The training parameters are widely used for evaluating the computational complexity of the neuron network. In the convolution layer, the training parameters are expressed as

(N_{x} N_{y} C_{in} + 1) C_{out}

, where

C_{in}

means the filter numbers in the current layer and

C_{out}

are the filter numbers in the next layer [38]. The training parameters in the hidden layer are denoted as

(D_{in} + 1) D_{out}

, where

D_{in}

stands for the neurons number in the current layer and

D_{out}

is denoted as the neurons number in the next layer [38]. According to Table 1 and Figure 5, the computational complexity of the proposed neuron networks is expressed as

\sum_{l_{c} = 1}^{L_{conv} - 1} (N_{x} N_{y} C_{in} + 1) C_{out} + \sum_{l_{d} = 1}^{L_{hidd} - 1} (D_{in} + 1) D_{out}

. The computational complexity of CRNN [17] stands for

\sum_{l_{c} = 1}^{L_{CRNN, conv} - 1} (P + 3) (N_{CRNN, x} N_{CRNN, y} C_{CRNN, in} + 1) C_{CRNN, out} + \sum_{l_{d} = 1}^{L_{CRNN, hidd} - 1} (D_{CRNN, in} + 1) D_{CRNN, out}

, where

(N_{CRNN, x}, N_{CRNN, y})

means the 2D dimension of filters,

L_{CRNN, conv}

is the number of the convolution layers, and

L_{CRNN, hidd}

stands for the number of hidden layers. The computational complexity of CNN [15] is

\sum_{l_{c} = 1}^{L_{CNN, conv} - 1} (P + 3) (N_{CNN, x} N_{CNN, y} C_{CNN, in} + 1) C_{CNN, out} + \sum_{l_{d} = 1}^{L_{CNN, hidd} - 1} (D_{CNN, in} + 1)

.

Figure 7 represents the cascaded channel estimation performance obtained by different heuristic algorithms, including GA [29], PSO [30], and the proposed MMPGA-LS. As shown in Figure 7, the proposed MMPGA-LS achieves a lower channel estimation compared with that achieved by GA [29] and PSO [30]. In SNR 20 dB, the MMPGA-LS obtains the NMSE of

0.0205

, which is lower than

0.0272

achieved by PSO [30] and

0.0387

achieved by GA [29]. Compared with GA [29], the proposed MMPGA-LS abandons the random operation in the crossover operation and adaptively changes the factors in the mutation operation. As a result, the proposed MMPGA-LS is capable of further reducing the channel estimation error. It is concluded from Figure 7 that the proposed MMPGA-LS outperforms GA [29] and PSO [30] in terms of a lower NMSE.

In the next sub-simulation, we compare the cascaded channel estimation performance achieved by the traditional algorithm and some existing deep-learning-based methods. Figure 8a compares the channel estimation NMSE in the LS algorithm [35] and the proposed ResNet. The ResNet, with its strong de-noise ability, effectively suppresses the inference from the noise. Relying on the curves plotted in Figure 8a, the proposed ResNet obtains a lower NMSE across a range of SNR regimes. In Figure 8b, we compare the cascaded channel estimation NMSE obtained by CRNN [17], CNN [15], and the proposed ResNet. It is clear that the deep-learning-based methods significantly reduce the NMSE compared with the LS algorithm [35]. In SNR 20 dB, the ResNet achieves NMSE 0.0052, which is reduced by 32.47% relative to CRNN [17] and 36.25% relative to CNN [15]. The proposed ResNet with the cross-layers operation explores the relationship between different layers and has a stronger learning ability compared with CRNN [17] and CNN [15]. Therefore, the proposed ResNet can further reduce the cascaded channel estimation NMSE. It is concluded from Figure 8 that the proposed ResNet is superior to the LS algorithm [35], CRNN [17], and CNN [15].

Figure 9 compares the reflected channel estimation performance obtained by different deep learning models, including CRNN [17], CNN [15], and the proposed MTRnet. Figure 9a evaluates the channel estimation NMSE of RIS-BS. As the single output layer, CRNN [17] and CNN [15] both use multiple optimization models to achieve this channel estimation. Remarkably, the proposed MTRnet with multiple output layers only requires one model. In SNR 20 dB, the proposed MTRnet obtains an NMSE of

7.4688 \times 10^{- 5}

, which is lower than

7.5237 \times 10^{- 4}

achieved by CRNN [17] and

1.5689 \times 10^{- 3}

achieved by CNN [15]. Figure 9b exhibits the channel estimation NMSE of UE-RIS. The proposed MTRnet also obtains a lower NMSE compared with CRNN [17] and CNN [15]. In SNR 0 dB, the proposed MTRnet obtains an NMSE of

0.1587

, which is lower than

0.4415

achieved by CRNN [17] and

0.6581

achieved by CNN [15]. It is observed from Figure 9 that the proposed MTRnet can achieve the lower reflected channel estimation NMSE simultaneously.

4.2. Numerical Results of The Proposed Two-Step Mehod

Figure 10 demonstrates the cascaded channel estimation NMSE obtained by the proposed MMPGA-LS. In Figure 10a, we investigate the performance comparison with three crossover strategies. In crossover I, one randomly selected child of the father population, mother population, and sub-population execute the crossover operation. Differently, children in the father population and mother population are paired in descending order of fitness and perform crossover operations in turn. Besides, one randomly selected child of the father population and sub-population execute the crossover operation in the crossover II. As shown in Figure 10a, the proposed crossover strategy achieves a lower NMSE compared with crossover I and crossover II. The NMSE obtained by the crossover II is minor. The crossover I achieves the highest NMSE. The proposed crossover strategy takes advantage of the best one with the highest fitness and is conducive to reducing the cascaded channel estimation error. In SNR 20 dB, the proposed crossover strategy obtains an NMSE of

0.0456

, which is lower than

0.05579

achieved by crossover II and

0.06886

achieved by crossover I. Figure 10b compares the cascaded channel estimation NMSE in different mutation strategies. In mutation I, this mutation is based on the binary children, and a random position corresponding to each gene is changed to 0/1. Mutation II uses the decimal children and a fixed scale factor. The cascaded channel estimation performance in mutation I is sensitive to the length and requires a sufficiently long sequence to achieve a low NMSE. The proposed mutation strategy can adjust the scale factor set according to different fitness levels. As a result, the proposed mutation strategy reduces cascaded channel estimation NMSE compared with mutation I and mutation II. In SNR 20 dB, the proposed mutation strategy obtains an NMSE of

0.0205

, which is lower than

0.0324

achieved by mutation II and

0.0453

achieved by mutation I.

Figure 11 represents the cascaded channel estimation performance achieved by the proposed ResNet. In Figure 11a, we compare the cascaded channel estimation NMSE in the training dataset and test dataset. As shown in Figure 11a, the proposed ResNet achieves a lower NMSE in the training dataset compared with that in the test dataset. Figure 11b investigates the impact of different learning rates on the cascaded channel estimation performance, including

r_{1} \in {0.0001, 0.00001, 0.000001}

. The learning rate is one of the key parameters in neural network optimization and has an important influence on the learning ability of the ResNet. The ResNet achieves the lowest NMSE in

r_{1} = 0.00001

. The NMSE in

r_{1} = 0.000001

is minor. Based on the curves plotted in Figure 11b, the proposed ResNet selects

r_{1} = 0.00001

in terms of a lower cascaded channel estimation NMSE. In Figure 11c, we investigate the impact of different lengths of signal sequence on the cascaded channel estimation performance, including

L_{s} \in {16, 32, 64}

. In SNR 20 dB, the proposed ResNet obtains an NMSE of

0.0052

in

L_{s} = 32

, which is higher than

0.0034

in

L_{s} = 64

and lower than

0.01

in

L_{s} = 16

. Relying on the result shown in Figure 11c, the cascaded channel estimation NMSE reduces as the length of the signal sequence increases.

Figure 12 shows the reflected channel estimation performance achieved by the proposed MTRnet. Figure 12a,b represent the reflected channel estimation NMSE of RIS-BS. As shown in Figure 12a, the MTRnet with

r_{2} = 0.0001

attains the lowest NSME within the same iteration. In SNR 20 dB, the proposed ResNet with

r_{2} = 0.0001

obtains an NMSE of

7.4690 \times 10^{- 5}

, which is lower than

9.8017 \times 10^{- 5}

in

r_{2} = 0.00001

and

1.9822 \times 10^{- 4}

in

r_{2} = 0.00001

, and

7.2888 \times 10^{- 4}

in

r_{2} = 0.001

. Therefore,

r_{2} = 0.0001

is applied for the next sub-simulations. In Figure 12b, we investigate the impact of different lengths of signal sequence on the cascaded channel estimation performance of RIS-BS. The reflected channel estimation NMSE versus the growth of signal sequence reduces. Figure 12c,d represent the reflected channel estimation NMSE of UE-RIS. Figure 12c represents the impact of different learning rates on the reflected channel estimation performance of UE-RIS, including

r_{2} \in {0.001, 0.0001, 0.00001, 0.000001}

. Based on the result plotted in Figure 12c, the proposed MRRnet with

r_{2} = 0.0001

also attains the lowest NMSE in the reflected channel of UR-RIS. As a result, the proposed MTRnet selects the learning rate

r_{2} = 0.0001

. Figure 12d illustrates the reflected channel estimation performance in

L_{s} \in {16, 32, 64}

. As shown in Figure 12d, the proposed MTRnet achieves the highest NMSE in

L_{s} = 16

. In SNR 20 dB, the proposed ResNet obtains an NMSE of

1.4192 \times 10^{- 5}

in

L_{s} = 64

, which is lower than

5.5447 \times 10^{- 5}

in

L_{s} = 32

and

9.9193 \times 10^{- 5}

in

L_{s} = 16

. It is concluded from Figure 11c and Figure 12b,d that the channel estimation NMSE reduces as the length of the signal sequence grows.

4.3. Discussion of the Proposed Method

In the cascaded channel estimation problem, the proposed MMPGA-LS is capable of generating more children compared to GA [29], which does contribute to reducing the cascaded channel estimation NMSE. Besides, MMPGA-LS does not select one randomly and makes good use of the best gene during the crossover operation. The corresponding result shown in Figure 10a has validated the effectiveness of the proposed crossover strategy. Furthermore, the MMPGA-LS refers to the fitness and adjusts the scale factor during the adaptive mutation operation compared with the fixed factor in common evolution algorithms. Therefore, the proposed MMPGA-LS is capable of achieving a lower NMSE. The proposed ResNet with the cross-layers operation and key parameter optimization has a stronger non-linear processing ability compared with CRNN [17] and CNN [15]. As a result, the proposed ResNet with the designed network architecture can attain a lower cascaded channel estimation NMSE compared with CRNN [17] and CNN [15].

For the reflected channels estimation, the proposed MTRnet, integrating the multi-task regression model and ResNet, is introduced. The MTRnet with multiple output layers has a smaller number of network trainings. This is because the MTRnet abandons repetitive network training compared with the single regression model. However, CRNN [17] and CNN [15] with the single output layer require multiple optimization models to estimate the reflected channels. Additionally, the proposed MTRnet obtains a lower reflected channel estimation NMSE compared with CRNN [17] and CNN [15].

Figure 13 presents the convergence of the proposed method in terms of the cascaded channel estimation performance. As shown in Figure 13a, the NMSE versus the growth of iterations decreases. Besides, the error obtained by the proposed MMPGA slightly decreases in later iterations. In Figure 13b, the proposed ResNet also represents the same tendency as the MMPGA. It is concluded from Figure 13, the proposed method has good convergence on the cascaded channel estimation. The convergence of reflected channel estimation performance is shown in Figure 14. As observed from Figure 14a, the channel estimation error decreases as the neuron network with the gradient descent optimization trains. At NMSE

7.47 \times 10^{- 5}

, the channel estimation performance of RIS-BS obtained by the MTRnet decreases slowly. Figure 14b exhibits the convergence of UE-RIS channel estimation performance. The tendency of reflected channel estimation performance degradation can also be seen in Figure 14b. It is clear from Figure 14 that the proposed method has good convergence in terms of the reflected estimation NMSE.

The robustness of the proposed method is shown in Figure 15 and Figure 16, respectively. Figure 15 displays the robustness of the MMPGA and ResNet, where

P \in {3, 4, 5}

. The proposed method selects

P = 4

as the baseline. As shown in Figure 15a, the NMSE in

P = 3

obtained by the MMPGA is lower than that in

P = 4

. This is because the cascaded channel in

P = 3

is associated with fewer channel parameters. Therefore, the channel parameters in

P = 3

can be simply seen as a subset of those in

P = 4

. As a result, the proposed method is capable of obtaining a lower NMSE in

P = 3

compared with that in

P = 4

. In

SNR = 20

dB, the MMPGA achieves an NMSE of 0.0146, which is lower than 0.0205 in

P = 4

. Figure 15b represents the robustness of the proposed ResNet in terms of the cascaded channel estimation performance. The model is trained in

P = 4

and tested in

P \in {3, 5}

. The NMSE in

P = 5

is slightly higher than that in

P = 4

. The proposed ResNet has less robustness in

P = 5

. Due to the increased channel parameters, the ResNet can adjust the network architecture to obtain a lower NMSE. The robustness of the MTRnet is also shown in Figure 16. Figure 16a evaluates the robustness of the proposed MTRnet in terms of the RIS-BS channel. The proposed method has good robustness in

P = 3

. Besides, Figure 16b shows the robustness of the proposed MTRnet in terms of the UE-RIS channel. The proposed method has less robustness in

P = 5

.

Figure 17 displays the cascaded channel estimation under varying levels of interference, where

K \in {1, 2, 3}

, K means the number of UEs. The proposed method selects

K = 1

as the baseline. Based on the curves plotted in Figure 17a, the cascaded channel estimation NMSE versus the growth of interference level decreased. This is because the cascaded channel performance deteriorates with the interference among the multipath signals, including multi-UEs. Figure 17b evaluates the robustness of the proposed ResNet in terms of varying levels of interference. In

SNR = 20

dB, the proposed ResNet obtains the lowest NMSE in

K = 1

and the highest NMSE in

K = 3

. The NMSE in

K = 2

is minor. Figure 18 summarizes the reflected channel estimation performance under varying levels of interference. The tendency shown in Figure 17 is also represented in Figure 18. Figure 18a focuses on the reflected channel of RIS-BS. Differently, Figure 18b represents the UE-RIS channel estimation performance in terms of varying levels of interference. The proposed MTRnet achieves an NMSE of

9.79 \times 10^{- 5}

in

K = 2

, which is slightly higher than

5.54 \times 10^{- 5}

in

K = 1

and lower than 0.00016 in

K = 3

. According to Figure 17 and Figure 18, the proposed method can achieve a low NMSE under varying levels of interference.

Figure 19 and Figure 20 compare the channel estimation performance based on different models in the RIS with the formulation of UPA, where

M_{x} = M_{y} = 8

. As shown in Figure 19a, the proposed MMPGA also outperforms PSO [30] and GA [29] in terms of a lower NMSE. Based on the results plotted in Figure 19b, the proposed ResNet obtains the lowest NMSE across a range of SNR regimes. The NMSE obtained by CRNN [17] is minor. Figure 20 displays the reflected channel estimation performance in UPA. Relying on the results shown in Figure 20a,b, the proposed MTRnet simultaneously obtains a lower NMSE compared with that achieved by CRNN [17] and CNN [15].

5. Conclusions

In this paper, we proposed a novel two-step channel estimation method for RIS-assisted mmWave systems. In the first step, the proposed MMPGA-LS-ResNet is introduced for cascaded channel estimation. The MMPGA-LS is capable of reducing the NMSE compared with some existing methods. Furthermore, the proposed ResNet, with its strong non-linear processing ability, further reduces the cascaded channel estimation NMSE. Based on the output of ResNet, the proposed MTRnet, integrating multi-task regression model and ResNet, can estimate multiple reflected channels simultaneously. Remarkably, the MTRnet has a lower number of optimization models compared with CRNN [17] and CNN [15]. Besides, the proposed MTRnet outperforms CRNN [17] and CNN [15] in terms of lower NMSE. The future work will focus on the active RIS-mmWace systems and optimization of neural networks.

Funding

This work was funded by Beijing University of Posts and Telecommunications-China Mobile Research Institute Joint Innovation Center.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The author declares no conflicts of interest.

References

Ji, B.; Han, Y.; Liu, S.; Tao, F.; Zhang, G.; Fu, Z.; Li, C. Several key technologies for 6G: Challenges and opportunities. IEEE Commun. Stand. Mag. 2021, 5, 44–51. [Google Scholar] [CrossRef]
Yuan, X.; Zhang, Y.-J.A.; Shi, Y.; Yan, W.; Liu, H. Reconfigurable-intelligent-surface empowered wireless communications: Challenges and opportunities. IEEE Wirel. Commun. 2021, 28, 136–143. [Google Scholar] [CrossRef]
Guan, X.; Wu, Q.; Zhang, R. Anchor-assisted channel estimation for intelligent reflecting surface aided multiuser communication. IEEE Trans. Wirel. Commun. 2022, 21, 3764–3778. [Google Scholar] [CrossRef]
Cui, M.; Wu, Z.; Lu, Y.; Wei, X.; Dai, L. Near-Field MIMO Communications for 6G: Fundamentals, Challenges, Potentials, and Future Directions. IEEE Commun. Mag. 2023, 61, 40–46. [Google Scholar] [CrossRef]
Araújo, G.T.D.; Almeida, A.L.F.D.; Boyer, R. Channel estimation for intelligent reflecting surface assisted MIMO systems: A tensor modeling approach. IEEE J. Sel. Top. Signal Process. 2021, 15, 789–802. [Google Scholar] [CrossRef]
Wen, F.; Shi, J.; Lin, Y.; Gui, G.; Yuen, C.; Sari, H. Joint DOD and DOA Estimation for NLOS Target using IRS-aided Bistatic MIMO Radar. IEEE Trans. Veh. Technol. 2021, 1–6. [Google Scholar] [CrossRef]
Wei, Y.; Zhao, M.-M.; Zhao, M.-J.; Cai, Y. Channel estimation for IRS-aided multiuser communications with reduced error propagation. IEEE Trans. Wirel. Commun. 2022, 21, 2725–2741. [Google Scholar] [CrossRef]
Zhou, G.; Pan, C.; Ren, H.; Popovski, P.; Swindlehurst, A.L. Channel estimation for RIS-aided multiuser millimeter-wave systems. IEEE Trans. Signal Process. 2022, 70, 1478–1492. [Google Scholar] [CrossRef]
Wang, P.; Fang, J.; Duan, H.; Li, H. Compressed channel estimation for intelligent reflecting surface-assisted millimeter wave systems. IEEE Signal Process. Lett. 2020, 27, 905–909. [Google Scholar] [CrossRef]
Chien, T.V.; Le, C.V.; Binh, H.T.T.; Ngo, H.Q.; Chatzinotas, S. Phase Shift Design for RIS-Aided Cell-Free Massive MIMO With Improved Differential Evolution. IEEE Trans. Veh. Technol. 2023, 12, 1499–1503. [Google Scholar] [CrossRef]
Byun, Y.; Kim, H.; Kim, S.; Shim, B. Channel Estimation and Phase Shift Control for UAV-Carried RIS Communication Systems. IEEE Trans. Veh. Technol. 2023, 72, 13695–13700. [Google Scholar] [CrossRef]
Lin, T.; Yu, X.; Zhu, Y.; Schober, R. Channel estimation for IRS-assisted millimeter-wave MIMO systems: Sparsity-inspired approaches. IEEE Trans. Commun. 2022, 70, 4078–4092. [Google Scholar] [CrossRef]
Chen, Z.; Tang, J.; Zhang, X.Y.; So, D.K.C.; Jin, S.; Wong, K.-K. Hybrid evolutionary-based sparse channel estimation for IRS-assisted mmWave MIMO systems. IEEE Trans. Wirel. Commun. 2022, 21, 1586–1601. [Google Scholar] [CrossRef]
Gao, T.; He, M. Two-Stage Channel Estimation Using Convolutional Neural Networks for IRS-Assisted mmWave Systems. IEEE Syst. J. 2023, 17, 3183–3191. [Google Scholar] [CrossRef]
Liu, S.; Lei, M.; Zhao, M.-J. Deep learning based channel estimation for intelligent reflecting surface aided MISO-OFDM system. In Proceedings of the 2020 IEEE 92nd Vehicular Technology Conference (VTC2020-Fall), Victoria, BC, Canada, 18 November–16 December 2020; pp. 1–5. [Google Scholar]
Ye, M.; Zhang, H.; Wang, J.-B. Channel Estimation for Intelligent Reflecting Surface Aided Wireless Communications Using Conditional GAN. IEEE Commun. Lett. 2022, 26, 2340–2344. [Google Scholar] [CrossRef]
Mao, Z.; Liu, X.; Peng, M. Channel estimation for intelligent reflecting surface assisted massive MIMO systems—A deep learning approach. IEEE Commun. Lett. 2022, 26, 798–802. [Google Scholar] [CrossRef]
Liu, Y.; Al-Nahhal, I.; Dobre, O.A.; Wang, F. Deep-Learning Channel Estimation for IRS-Assisted Integrated Sensing and Communication System. IEEE Trans. Veh. Technol. 2022, 72, 6181–6193. [Google Scholar] [CrossRef]
Wang, Y.; Lu, H.; Sun, H. Channel estimation in IRS-enhanced mmWave system with super-resolution network. IEEE Commun. Lett. 2021, 25, 2599–2603. [Google Scholar] [CrossRef]
Liu, C.; Liu, X.; Ng, D.W.K.; Yuan, J. Deep residual learning for channel estimation in intelligent reflecting surface-assisted multiuser communications. IEEE Trans. Wirel. Commun. 2021, 21, 898–912. [Google Scholar] [CrossRef]
Liu, C.; Liu, X.; Ng, D.W.K.; Yuan, J. Deep residual network empowered channel estimation for IRS-assisted multi-user communication systems. In Proceedings of the ICC 2021-IEEE International Conference on Communications, Montreal, QC, Canada, 14–23 June 2021; pp. 1–7. [Google Scholar]
Wang, H.; Memon, F.H.; Wang, X.P.; Li, X.; Zhao, N.; Dev, K. Machine learning-enabled MIMO-FBMC communication channel parameter estimation in IIoT: A distributed CS approach. Digit. Commun. Netw. 2023, 9, 306–312. [Google Scholar] [CrossRef]
Choi, J.; Cho, H.J. A Joint Optimization of Pilot and Phase Shifts in Uplink Channel Estimation for Hybrid RIS-Aided Multi-User Communication Systems. IEEE Trans. Veh. Technol. 2024, 73, 5197–5212. [Google Scholar] [CrossRef]
Schroeder, R.; He, J.; Brante, G.; Juntti, M. Two-Stage Channel Estimation for Hybrid RIS Assisted MIMO Systems. IEEE Trans. Veh. Technol. 2022, 70, 4793–4806. [Google Scholar] [CrossRef]
Zhang, H.; Liu, W.; Zhang, Q.; Liu, B. Joint Customer Assignment, Power Allocation, and Subchannel Allocation in a UAV-Based Joint Radar and Communication Network. IEEE Internet Things J. 2024, 1. [Google Scholar] [CrossRef]
Chen, Z.; Tang, J.; Zhang, X.Y.; Wu, Q.; Wang, Y.; So, D.K.; Jin, S.; Wong, K.K. Offset Learning Based Channel Estimation for Intelligent Reflecting Surface-Assisted Indoor Communication. IEEE J. Sel. Top. Signal Process. 2022, 16, 41–55. [Google Scholar] [CrossRef]
Zhang, K.; Sun, M.; Han, T.X.; Yuan, X.; Guo, L.; Liu, T. Residual networks of residual networks: Multilevel residual networks. IEEE Trans. Circuits Syst. Video Technol. 2018, 28, 1303–1314. [Google Scholar] [CrossRef]
Ranjan, R.; Patel, V.M.; Chellappa, R. HyperFace: A deep multi task learning framework for face detection, landmark localization, pose estimation, and gender recognition. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 41, 121–135. [Google Scholar] [CrossRef]
Ren, H.; Liu, X.; Pan, C.; Peng, Z.; Wang, J. Performance analysis for RIS-aided secure massive MIMO systems with statistical CSI. IEEE Wirel. Commun. Lett. 2023, 12, 124–128. [Google Scholar] [CrossRef]
Wang, R.; Wen, X.; Xu, F.; Ye, Z.; Cao, H.; Hu, Z.; Yuan, X. Joint particle swarm optimization of power and phase shift for IRS-aided D2D underlaying cellular systems. Sensors 2023, 23, 5266. [Google Scholar] [CrossRef] [PubMed]
Noh, S.; Yu, H.; Sung, Y. Training signal design for sparse channel estimation in intelligent reflecting surface-assisted millimeter-wave communication. IEEE Trans. Wirel. Commun. 2021, 21, 2399–2413. [Google Scholar] [CrossRef]
Rappaport, T.S.; Ben-Dor, E.; Murdock, J.N.; Qiao, Y. 38 GHz and 60 GHz angle-dependent propagation for cellular & peer-to-peer wireless communications. In Proceedings of the 2012 IEEE International Conference on Communications (ICC), Ottawa, ON, Canada, 10–15 June 2012; pp. 4568–4573. [Google Scholar]
Amadori, P.V.; Masouros, C. Low RF-complexity millimeter-wave beamspace-MIMO systems by beam selection. IEEE Trans. Commun. 2015, 63, 2212–2223. [Google Scholar] [CrossRef]
Guo, H.; Lau, V.K.N. Uplink Cascaded Channel Estimation for Intelligent Reflecting Surface Assisted Multiuser MISO Systems. IEEE Trans. Signal Process. 2022, 70, 3964–3977. [Google Scholar] [CrossRef]
Pan, C.; Zhou, G.; Zhi, K.; Hong, S.; Wu, T.; Pan, Y.; Ren, H.; Di Renzo, M.; Swindlehurst, A.L.; Zhang, R.; et al. An overview of signal processing techniques for RIS/IRS-aided wireless systems. IEEE J. Sel. Topics Signal Process. 2022, 16, 883–917. [Google Scholar] [CrossRef]
Ioffe, S.; Szegedy, C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv 2015, arXiv:1502.03167. [Google Scholar]
Zhong, J.-L.; Pun, C.-M. An end-to-end dense-InceptionNet for imagecopy-move forgery detection. IEEE Trans. Inf. Forensics Secur. 2020, 15, 2134–2146. [Google Scholar] [CrossRef]
Miethig, B.; Huangfu, Y.; Dong, J.; Tjong, J.; Mohrenschildt, M.V.; Habibi, S. A Novel Method for Approximating Object Location Error in Bounding Box Detection Algorithms Using a Monocular Camera. IEEE Trans. Veh. Technol 2021, 70, 8682–8691. [Google Scholar] [CrossRef]

Figure 1. The flowcharts of proposed MMPGA-LS.

Figure 2. The flowcharts of crossover strategy.

Figure 3. The flowcharts of the adaptive mutation strategy.

Figure 5. Network structure of the proposed MTRnet.

Figure 7. Cascaded channel estimation performance obtained by different evolution algorithms.

Figure 8. Cascaded channel estimation performance comparison by different models. (a) Cascaded channel estimation performance comparison between the deep learning and model-driven. (b) Cascaded channel estimation performance comparison among different deep learning models.

Figure 9. Reflected channel estimation performance comparison in different deep learning models (a) Channel estimation NMSE of RIS-BS. (b) Channel estimation NMSE of UE-RIS.

Figure 10. Cascaded channel estimation performance of the proposed MMPGA-LS. (a) Performance comparison in different crossover strategies. (b) Performance comparison in different mutation strategies.

Figure 11. Cascaded channel estimation performance of the proposed ResNet. (a) Cascaded channel estimation performance in different datasets. (b) Cascaded channel estimation performance at different learning rates. (c) Cascaded channel estimation performance in different lengths of signal sequence.

Figure 12. Reflected channel estimation is achieved by the proposed MTRnet. (a) Reflected channel estimation of RIS-BS at different learning rates. (b) Reflected channel estimation of RIS-BS at different lengths of signal sequence. (c) Reflected channel estimation of UE-RIS at different learning rates. (d) Reflected channel estimation of UE-RIS at different lengths of signal sequence.

Figure 13. Cascaded channel estimation performance. (a) Convergence of the proposed MMPGA. (b) Convergence of the proposed ResNet.

Figure 14. Convergence of reflected channel estimation performance. (a) IRS-BS channel estimation. (b) UE-RIS channel estimation.

Figure 15. Cascaded channel estimation performance. (a) Robustness of the proposed MMPGA. (b) Robustness of the proposed ResNet.

Figure 16. Robustness of reflected channels estimation performance. (a) IRS-BS channel estimation. (b) UE-RIS channel estimation.

Figure 17. Cascaded channel estimation performance under varying levels of interference. (a) MMPGA. (b) ResNet.

Figure 18. Reflected channel estimation performance under varying levels of interference. (a) IRS-BS channel estimation. (b) UE-RIS channel estimation.

Figure 19. Cascaded channel estimation performance in UPA. (a) Comparison of different heuristic algorithms. (b) Comparison of different learning models.

Figure 20. Reflected channels estimation performance comparison in UPA. (a) IRS-BS channel estimation. (b) UE-RIS channel estimation.

Table 1. Configuration of some primary layers in the proposed ResNet.

${{f^{- 1}}}_{i}$	Layers	Description
$i = 2, 6, 11, 15$	Convolution	$64 : 3 \times 3$
$i = 3, 7, 12$	Convolution	$128 : 3 \times 3$
$i = 4, 8, 13$	Convolution	$256 : 3 \times 3$
$i = 5, 10$	Convolution	$32 : 3 \times 3$
$i = 9, 14$	Cross-layers	-

Table 2. Configuration of network parameters in the five sub-tasks.

Sub-Tasks	Layers	Description
Sub-task I	hidden	P neurons
Sub-task II	hidden	P neurons
Sub-task III	hidden	$2 P$ neurons
Sub-task IV	hidden	P neurons
Sub-task V	hidden	$2 P$ neurons

Table 3. Computational complexity comparison of different methods.

Method	Complexity
MMPGA	$O (T M_{acti} N L_{s}^{4} (Q^{2} + Q_{4} + Q_{4} L_{c}))$
PSO [30]	$O (2 M_{acti} T Q N L_{s}^{4})$
GA [29]	$O (T M_{acti} N L_{s}^{4} (Q^{2} + 2 Q))$
ResNet+MTRnet	$\sum_{l_{c} = 1}^{L_{conv} - 1} (N_{x} N_{y} C_{in} + 1) C_{out} + \sum_{l_{d} = 1}^{L_{hidd} - 1} (D_{in} + 1) D_{out}$
CRNN [17]	$\sum_{l_{c} = 1}^{L_{CRNN, conv} - 1} (P + 3) (N_{CRNN, x} N_{CRNN, y} C_{CRNN, in} + 1) C_{CRNN, out} +$ ld=1LCRNN,hidd-1(DCRNN,in+1)DCRNN,out
CNN [15]	$\sum_{l_{c} = 1}^{L_{CNN, conv} - 1} (P + 3) (N_{CNN, x} N_{CNN, y} C_{CNN, in} + 1) C_{CNN, out} +$ ld=1LCNN,hidd-1(DCNN,in+1)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yu, J. A Novel Two-Step Channel Estimation Method for RIS-Assisted mmWave Systems. Sensors 2024, 24, 5362. https://doi.org/10.3390/s24165362

AMA Style

Yu J. A Novel Two-Step Channel Estimation Method for RIS-Assisted mmWave Systems. Sensors. 2024; 24(16):5362. https://doi.org/10.3390/s24165362

Chicago/Turabian Style

Yu, Jiarun. 2024. "A Novel Two-Step Channel Estimation Method for RIS-Assisted mmWave Systems" Sensors 24, no. 16: 5362. https://doi.org/10.3390/s24165362

APA Style

Yu, J. (2024). A Novel Two-Step Channel Estimation Method for RIS-Assisted mmWave Systems. Sensors, 24(16), 5362. https://doi.org/10.3390/s24165362

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Novel Two-Step Channel Estimation Method for RIS-Assisted mmWave Systems

Abstract

1. Introduction

2. Channel Estimation System Model

3. The Novel Two-Step Channel Estimation Method

3.1. MMPGA-LS-ResNet-Based Cascaded Channel Estimation

3.1.1. Population Initialization

3.1.2. Population Classification

3.1.3. Crossover

3.1.4. Adaptive Mutation

3.1.5. Population Reservation

3.1.6. ResNet

3.2. MTRnet-Based Reflected Channel Estimation

3.3. Implementation of the Novel Two-Step Method

4. Simulation Results and Discussion

4.1. Comparisons with Reported Methods

4.2. Numerical Results of The Proposed Two-Step Mehod

4.3. Discussion of the Proposed Method

5. Conclusions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI