Demod-CNN: A Robust Deep Learning Approach for Intelligent Reflecting Surface-Assisted Multiuser MIMO Communication

Sejan, Mohammad Abrar Shakil; Rahman, Md Habibur; Song, Hyoung-Kyu

doi:10.3390/s22165971

Open AccessCommunication

Demod-CNN: A Robust Deep Learning Approach for Intelligent Reflecting Surface-Assisted Multiuser MIMO Communication

by

Mohammad Abrar Shakil Sejan

^1,2,†

,

Md Habibur Rahman

^1,2,†

and

Hyoung-Kyu Song

^1,2,*

¹

Department of Information and Communication Engineering, Sejong University, Seoul 05006, Korea

²

Department of Convergence Engineering for Intelligent Drone, Sejong University, Seoul 05006, Korea

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Sensors 2022, 22(16), 5971; https://doi.org/10.3390/s22165971

Submission received: 8 July 2022 / Revised: 6 August 2022 / Accepted: 8 August 2022 / Published: 10 August 2022

(This article belongs to the Section Communications)

Download

Browse Figures

Versions Notes

Abstract

:

The intelligent reflecting surface (IRS) is a novel and innovative communication technology that aims at the control of the wireless environment. The IRS is considered as a promising technology for sixth-generation wireless communication. In the last few years, machine learning has emerged as a powerful tool for solving complex problems in diverse application areas. In this paper, we propose a convolutional neural network (CNN)-based demodulation technique called Demod-CNN in IRS-based wireless communication for multiple users. A multiple-input multiple-output based orthogonal multiple frequency division multiplexing system is considered for channel modeling. The received signal data are used for training and testing the model. The simulation results show that the proposed model performs better than the conventional demodulation technique.

Keywords:

intelligent reflecting surface; demodulation; convolutional neural network; MIMO; OFDM

1. Introduction

The intelligent reflecting surface (IRS) is considered as a novel smart radio technology for wireless communication systems [1]. The IRS is a two-dimensional metasurface of electromagnetic material that is composed of a large array of passive scattering elements with a specially designed physical structure [2]. Each element in the IRS can be controlled in a software-defined manner to change the electromagnetic proprieties (e.g., phase shift, amplitude) of the reflecting RF signal. This provides a unique advantage for non-line-of-sight communication to achieve a strong signal reception using the IRS [3]. One of the important properties of the IRS is that it can be implemented with low hardware cost and energy consumption for high gain beamforming [4,5,6]. Thus, it is desired to use the IRS in implementing millimeter wave communication for the sixth generation (6G). In addition, the IRS can be a potential candidate to enhance wireless sensor network (WSN) communication as it has the following advantages [7]: the IRS is composed of a nearly passive device electromagnetic material that is low cost, the IRS can reconfigure the wireless propagation environment to work as an alternative link when a direct link is not available, the IRS is energy-efficient compared to traditional relaying schemes, and the IRS can be implemented as full-duplex communication. Each of these features is desirable for WSN systems.

Multiple-input-multiple-output (MIMO)-based communication has been widely adopted in wireless communication [8,9]. MIMO-based communication has also been introduced to IRS-based communication [10,11]. Machine learning (ML)-based methods have also gained significant attention from researchers [12,13]. ML techniques have become very popular in solving complex problems for enhancing wireless communication systems without explicit programming [14]. A part of ML, deep learning (DL)-based schemes are also effective in predicting the channel estimation, beamforming optimization, and conducting phase optimization for IRS-based communication. The convolutional neural network (CNN) is a well known DL architecture that can provide effective solutions for different problem domains. In [15], authors proposed a denoising-convolution-neural-network-based channel estimation model for MIMO IRS. The transmitted complex data are received as a noisy image to increase the channel accuracy. The study in [16] proposed a twin CNN for the estimation of the direct channel and the cascade channel for IRS. The CNN network is trained with pilot data and can tolerate a user location of about 4 degrees. Another study in [17] used a CNN-based approach to estimate milimeter wave channel estimation. In addition to channel estimation, CNN can be applied in performance analysis such as bit error rate (BER) or symbol error rate (SER) for IRS-based communication.

Motivated from above, in this paper we propose a CNN-based ML architecture to improve the BER and SER performance of the MIMO-based IRS system. Orthogonal frequency division multiplexing (OFDM) based pilot symbols are transmitted through a frequency selective fading channel. Both IRS-based cascade channel model and direct channel are considered for communication. A CNN-based demodulation technique was proposed named Demod-CNN (Demodulation using CNN) to improve the channel performance in the case of MIMO communication. The proposed CNN model is trained with received complex data to map into bits for the channel in different signal-to-noise ratio (SNR) values. The trained model was tested with different datasets for performance evaluation.

The contributions of this study can be listed as:

An IRS-based MIMO channel configuration system is considered for wireless communication to test machine learning assisted demodulation.
A CNN-based demodulation technique Demod-CNN is proposed to demodulate the received signal.

Notations: The lower-case and upper-case boldface letter

h

and

H

represent a vector and matrix, respectively;

H^{H}

denotes the conjugate transpose matrix of

H

; diag

(x)

denotes the diagonal matrix having vector

x

on its diagonal; and ⊗ is the Kronecker product.

2. System Model

2.1. OFDM Communication

OFDM is a multi carrier modulation technique applied in wireless communication for high spectral efficiency and good performance against frequency selective fading. The modulation process for OFDM starts by converting the binary inputs into phase shift Keying (PSK) or quadrature amplitude modulation for the purpose of mapping D parallel streams. O is the number of subcarriers in the OFDM system. If the value of O is increased, the performance of the model is degraded. Thus, the value of O should be chosen optimally depending on the required system. Let

X_{i} [q]

be the i-th transmit symbol for the q-th subcarrier, where

i = 0, 1, 2, \dots, \infty

and

q = 0, 1, 2, \dots, O - 1

. Next, the symbols are converted to the frequency domain from the time domain by using inverse fast fourier transform (IFFT). To avoid inter symbol interference, a cyclic prefix is added in the signal. One single symbol duration is considered as

T_{s}

, so for Q, symbol transmission requires

T_{s y m} = Q T_{s}

. Thus, for the i-th symbol and the q-th subcarrier, the OFDM signal

Υ_{i, q} (t)

is written as follows:

Υ_{i, q} (t) = \{\begin{matrix} e^{j 2 π f_{q} (t - i T_{s y m})}, & 0 < t \leq T_{s y m} \\ 0, & e l s e w h e r e, \end{matrix}

(1)

where

f_{q}

is the center frequency for the q-th subcarrier. The continuous time domain signal can be expressed as follows [18]:

X_{i} (t) = \sum_{i = 0}^{\infty} \sum_{q = 0}^{O - 1} X_{i} [q] e^{j 2 π f_{q} (t - i T s y m)} .

(2)

The transmitted symbol can be considered as follows:

X_{i} (n) = \sum_{q = 0}^{O - 1} X_{i} [q] e^{j 2 π q n / N} for n = 0, 1, \dots, O - 1,

(3)

where

f_{q} = q / T_{s y m}

.

2.2. IRS Based Communication

In this section, we describe the IRS network architecture and other related consideration. Figure 1 shows a communication scenario with the IRS where the base station (BS) has M uniform planner array (UPA) antenna and each user (UE) has a single antenna. The number of IRS elements is considered as N. The received signal via IRS for each UE can be expressed as follows [19]:

y_{i} = H_{u i}^{H} Ψ H_{b} x + n,

(4)

where

y_{i}

is the received signal at the i-th UE, the transmitted signal is

x \in C^{M \times 1}

,

H_{b} \in C^{N \times M}

is the channel matrix between BS and IRS,

H_{u i} \in C^{1 \times N}

is the channel matrix from IRS to user i, and

n \sim CN (0, σ^{2})

is the additive white Gaussian noise at the i-th user.

Ψ

is the diagonal matrix where

Ψ = d i a g (ξ) \in C^{N \times N}

represents the phase shift values of IRS elements. Each element can be defined as

ξ = [η_{1} e^{j ω_{1}}, η_{2} e^{j ω_{2}}, \dots, η_{N} e^{j ω_{N}}] \in C^{N \times 1}

, where

η_{n} \in [0, 1]

denotes the amplitude and

ω_{n} \in [0, 2 π]

is the phase shift coefficient of the n-th reflective element. Thus,

Ψ

can be written as:

Ψ = [\begin{matrix} η_{1} e^{j ω_{1}} & \dots & \dots & \dots \\ \dots & η_{2} e^{j ω_{2}} & \dots & \dots \\ ⋮ & ⋮ & \dots & ⋮ \\ \dots & \dots & \dots & η_{N} e^{j ω_{N}} \end{matrix}] .

(5)

For easy calculation, the constant amplitude coefficient is considered as

ω_{n} = 1

. The total channel with direct communication link can be represented by [20]:

y_{i t} = H_{u i}^{H} Ψ H_{b} x + H_{d} x + n,

(6)

where

H_{d} \in C^{M \times 1}

is the direct channel between BS and UE. The channel matrices

H_{u i}

,

H_{b}

, and

H_{d}

follow the Rayleigh fading distribution, and each column k is modeled as:

\begin{matrix} h_{u i (1, k)} = κ {\hat{h}}_{u i (1, k)} \end{matrix}

(7a)

\begin{matrix} h_{b (1, k)} = κ {\hat{h}}_{b (1, k)} \end{matrix}

(7b)

\begin{matrix} h_{d (1, k)} = κ {\hat{h}}_{d (1, k)}, \end{matrix}

(7c)

where

κ

is the path loss factor and

{\hat{h}}_{u i (1, k)}

,

{\hat{h}}_{b (1, k)}

, and

{\hat{h}}_{d (1, k)}

are

\sim CN (0, σ^{2})

. In addition, we consider the Saleh–Valenzuela channel model [8], which is applicable for a multipath propagation environment. The general theory of the Saleh–Valenzuela model for mmWave communication can be modeled as follows:

h = \sqrt{\frac{N}{L}} \sum_{l = 0}^{L} α_{l} a (γ_{l}^{H}, ϕ_{l}^{H}),

(8)

where

h

is the channel vector,

α_{l}

is the complex gain of the l-th path, L is the total number of paths,

γ_{l}^{H}

is the azimuth angle of departure, and

ϕ_{l}^{H}

is the elevation angle of departure and

a (γ_{l}^{H}, ϕ_{l}^{H})

is the array response vector. For a typical

N_{1} \times N_{2}

UPA, the array response vector written as follows [21]:

a (γ, ϕ) = \frac{1}{\sqrt{N}} [e^{- j 2 π d sin (γ) cos (ϕ) n_{1} / λ}] \otimes [e^{- j 2 π d sin (ϕ) n_{2} / λ}],

(9)

where

n_{1} = [0, 1, \dots, N_{1} - 1]

and

n_{2} = [0, 1, \dots, N_{2} - 1]

,

λ

is the carrier wavelength, and d is the antenna spacing fulfilling the condition

d = λ / 2

. The BS-IRS channel

H_{b}

can be expressed as follows:

H_{b} = \sqrt{\frac{M N}{L_{1}}} \sum_{l_{1} = 1}^{L_{1}} β_{l_{1}} b (γ_{l_{1}}^{H_{r}}, ϕ_{l_{1}}^{H_{r}}) a^{H} (γ_{l_{1}}^{H_{t}}, ϕ_{l_{1}}^{H_{t}}),

(10)

where

L_{1}

represents the number of paths between the BS and the i-th UE,

β_{l}

represents the complex gain of the paths,

b (γ_{l}^{H_{r}} ϕ_{l}^{H_{r}})

represents the steering vector related to the IRS,

a (γ_{l}^{H_{t}}, ϕ_{l}^{H_{t}})

is the steering vector related to BS for the l-th path.

Next, the channel between the IRS and UE can be defined as follows:

H_{u i}^{H} = \sqrt{\frac{N}{L_{2}}} \sum_{l_{2} = 1}^{L_{2}} β_{l_{2}} a^{H} (γ_{l_{2}}^{H_{t}}, ϕ_{l_{2}}^{H_{t}}),

(11)

where

L_{2}

is the number of paths between the IRS and the i-th user,

β_{l 2}

is the complex gain of paths,

γ_{l 2}^{H_{t}}

and

ϕ_{l 2}^{H_{t}}

are the azimuth and elevation angle of departure of the signal, and

a (γ_{l_{2}}^{H_{t}}, ϕ_{l_{2}}^{H_{t}})

is the steering vector. The cascade channel for BS to UE is as follows:

\begin{matrix} H_{c a} = \sqrt{\frac{M N}{L_{1} L_{1}}} \sum_{l_{1} = 1}^{L_{1}} \sum_{l_{2} = 1}^{L_{2}} β_{l_{1}} β_{l_{2}} d i a g (a^{H} (γ_{l_{2}}^{H_{t}}, ϕ_{l_{2}}^{H_{t}})) \\ b (γ_{l_{1}}^{H_{r}}, ϕ_{l_{1}}^{H_{r}}) a^{H} (γ_{l_{1}}^{H_{t}}, ϕ_{l_{1}}^{H_{t}}) . \end{matrix}

(12)

The channel matrix

H_{u i}^{H} Ψ H_{b}

can be expressed as follows:

H_{u i}^{H} Ψ H_{b} = H_{u i}^{H} d i a g (ξ) H_{b} = ξ^{T} d i a g (H_{u i}^{H}) H_{b} .

(13)

Then, the following equation is obtained:

H_{c a} ξ^{T} = H_{u i}^{H} Ψ H_{b} .

(14)

The total received signal

y_{i t}

can be written as follows:

y_{i t} = (H_{c a} ξ^{T} + H_{d}) x + n .

(15)

2.3. Deep Learning Model

Figure 2 shows the architectural design of the proposed CNN network. For successive training with the generated dataset, in this study, we have considered 1-D CNN model architecture to evaluate BER and SER for multiuser MIMO signals. The complex signal was first separated into real and imaginary parts. Then, the two numerical values along with the label are fed to the CNN model. In the proposed model, the input layer is fed into the OFDM data symbol, where the input size is equal to the number of features of input data. The size of the input features is considered as 2 × 2 × 2 = 8. The input size depends upon the number of users and the number of total antennas used in the system. The next two layers are connected by a convolutional 1D layer, ReLU activation function, and normalization layers. The first convolutional layer consists of a 3 × 3 filter size and total 32 numbers of filters are used. In the second convolutional layer, a 3 × 3 filter size, and 64 filters are used. The convolutional layer comprises a rectangular grid of neurons, where every neuron receives inputs from a rectangular part of the earlier layer. To reduce the output of the convolutional layers to a single vector, a global average pooling 1D layer is used. In the fully connected layer, each neuron is connected to all neurons in the previous layer and gathers all the features and internal information combined by the prior layers. In every time step of the CNN model, the fully connected layer works individually. We utilize the softmax activation function to derive the outputs for the final layer. In the terminal layer, we use the classification layer to map the output to vector probabilities and specify a fully connected layer with an output size matching the number of classes. The convolution operation for 1D forward propagation can be expressed as follows: [22]:

ζ_{k}^{v} = b_{k}^{v} + \sum_{p = 1}^{N_{v - 1}} conv 1 D (w_{p k}^{v - 1}, s_{p}^{v - 1}),

(16)

where the input is

ζ_{k}^{v}

, the bias of the k-th neuron at layer v represents

b_{k}^{v}

, the output of the p-th neuron at layer

(v - 1)

is defined as

s_{p}^{v - 1}

, and the kernel from the p-th neuron at layer

(v - 1)

to the k-th neuron at layer v presents

w_{p k}^{v - 1}

. Hence, the dimension of the output arrays

s_{p}^{v - 1}

is greater than the dimension of the input array

ζ_{k}^{v}

.

To normalize the b-th

ζ_{k}^{v}

, we use the batch normalization layer [23] with the batch size B, which can be expressed as follows:

μ_{k} = \frac{1}{P V B} \sum_{p = 1}^{P} \sum_{v = 1}^{V} \sum_{b = 1}^{B} {({ζ_{k}^{v}}_{p})}_{b},

(17)

σ_{k}^{2} = \frac{1}{P V B} \sum_{p = 1}^{P} \sum_{v = 1}^{V} \sum_{b = 1}^{B} ({({ζ_{k}^{v}}_{p})}_{b} - μ_{k})^{2},

(18)

{\hat{ζ}}_{k}^{v} = \frac{ζ_{k}^{v} - μ}{\sqrt{σ^{2} + ε}},

(19)

ζ_{k}^{v} = α {\hat{ζ}}_{k}^{v} + ϱ,

(20)

where

μ

and

σ^{2}

present the mean and variance of

ζ_{k}^{v}

respectively. P and V denote the size of the tensors for calculating the mean and variance.

ε

is a small constant, which is negligible. In the training process,

α

and

ϱ

are the learnable parameters, which will be updated. It can be noted that the learnable parameters

α

and

ϱ

denote the whole weights in convolutional layers, batch normalization layer, and fully-connected layer, respectively.

The activation function considered as the Relu function can be written as follows:

f (x) = \{\begin{matrix} 0 & f o r x < 0, \\ x & f o r x \geq 0 . \end{matrix}

(21)

To activate the output signal

ζ_{k}^{v}

, herein, we utilize the Relu activation function, which can be formulated as follows:

{ζ_{k}^{v}}_{R} = f ({ζ_{k}^{v}}_{R}) .

(22)

In addition, to determine the possibility

ζ_{k}

of each category, we use the Softmax layer which can be expressed as follows:

S o f t m a x (ζ_{k}) = \frac{exp (ζ_{f c k})}{\sum_{z = 1}^{Z} exp (ζ_{f c z})} .

(23)

Finally, the mean-squared error (

M S E

) defines the loss function for the overall network, which is presented as follows:

M S E = \sum_{i = 1}^{N_{ϑ}} {(ζ_{k} - y_{i t})}^{2},

(24)

where

N_{ϑ}

represents the number of class labels. Figure 3 shows the overall flowchart for the proposed system.

The computational complexity of the proposed model can be specified as

O (P_{s} \times F_{s} (i_{c} \times f_{c} \times n_{c}) \times 2)

, where

P_{s}

is the number of received input packets,

F_{s}

is the OFDM block size,

i_{c}

is the input size of CNN,

f_{c}

is the filter size of CNN, and

n_{c}

is the neuron size of CNN. In contrast, the conventional OFDM system has the complexity

O (M_{s})

, where

M_{s}

is the modulation order [24]. The conventional OFDM is computationally efficient because it only uses IFFT and FFT. The complexity of the proposed model is higher than the conventional OFDM, but the performance is improved.

3. Simulation Setup

For the simulation, it is assumed that the number of BS antennas is M = 2 and the number of IRS elements is N = 512. It is assumed that the number of horizontal and vertical IRS elements are 32 and 16. Two users receive the data from the BS to UE = 2. The number of paths between BS and IRS is two, and the IRS to the i-th UE is four. For direct BS to UE channel, two paths are considered. In this simulation, mutltiuser interference is not considered, and it is assumed that each UE has same channel properties. Then, the BS can demodulate different UE signals successfully.We first determine the BER using the conventional technique following the same channel configuration for IRS-based communication. The simulation parameters are listed in Table 1. Additive white Gaussian noise is added to the transmission to simulate the channel noise environment. For the downlink channel, the three-channel matrix is considered: BS-IRS, IRS-UE, and BS-UE. The total

10^{6}

packets are sent each time. One hundred and twenty-eight quadrature phase-shift keying (QPSK) symbols are generated using OFDM to estimate the channel error rate. To reduce the inter-symbol interference, a length of 32 cyclic prefix is added. For BER calculation, the data received by both users are considered simultaneously.

The CNN-based model is trained by using the label and data generated by the QPSK symbol. The labeling is performed by combining the data from two transmitting antennas. As each antenna can generate 4 unique symbols, thus

4 \times 4 = 16

labels are created for two data stream combinations. At the receiver, the received symbols are separated into real and imaginary parts along with the label. During the training, we capture data at the SNR 25 dB level to generate an offline input training set. We use 100,000 data sets for model development, of which (4/5) are used for training and (1/5) is used for validation. At

99.91 %

model accuracy, we stop the training process and save the model for inference. The training of the model less than 80,000 data sets does not produce the optimal performance. To achieve the highest accuracy, the model needs to have 80,000 data sets. However, 50,000 data sets can be employed for training, and in this case, the accuracy of the model is reduced. It is obvious that training with more data sets will require more time for training. Figure 4 shows the training and validation progress of the proposed model for 50 epochs. The upper left image shows the training accuracy, while the upper right image shows the validation accuracy. An aging lower left curve shows the training loss, and the lower right images shows the validation loss. It can be seen from the Figure 4 that after 20 epochs the model shows a stable performance, which indicates the model has learned the parameters from the data. If the number of antennas, users, IRS elements, OFDM parameters, or multi-paths is changed, the retraining of the model is required. In the future, we will try to build a model that can adopt these changes with minimum training.

4. Results and Discussion

In this section, the simulation results of the proposed model in terms of BER and SER are discussed. The BER and SER represent the average error rate for the two UEs. Both UEs data are first demodulated separately, and the error rate is calculated by

error_rate = (u s e r 1 + u s e r 2) / 2

. For BER,

error_rate

refers to the wrong demodulated bit at the receiver. For SER,

error_rate

refers to the wrong classification of the received symbol. In Figure 5, the BER curve is compared with the CNN-based demodulation against the conventional demodulation system. It is evident from the figure that CNN-based demodulation performs better than the traditional one. At SNR 0 dB, the model provides less accurate results than the conventional system because of the low SNR value. However, as the SNR values increase, the model performance improves. From SNR 5 dB, the model outperforms the conventional demodulation system. The BER for CNN-based demodulation has a similar trend to the conventional system up to 12 dB; after this point, BER improves dramatically for CNN-based demodulation. This indicates the efficient demodulation capability of the proposed CNN model. In Figure 6, SER versus SNR is plotted for the proposed model. The model shows better SER compared to the conventional demodulation scheme. After the SER 15 dB, the difference becomes clearer toward the higher SNR. Thus, the proposed CNN model can be used for improving BER and SER for IRS-based wireless communication.

5. Conclusions

In this paper, we have proposed a CNN-based model “Demod-CNN” OFDM demodulation system for IRS-aided multiuser MIMO communication. In the proposed model, we designed the input layer to receive the OFDM signal by separating real and imaginary parts of the symbol. It is then classified to recognize the transmitted bits by the classification layer. The simulation results provide that the proposed Demod-CNN can outperform the conventional demodulation system. In the future, we want to expand our model for multi-IRS systems to achieve massive MIMO communication.

Author Contributions

Conceptualization, M.A.S.S. and M.H.R.; methodology, M.A.S.S.; software, M.A.S.S. and M.H.R.; validation, M.A.S.S. and M.H.R.; formal analysis, M.A.S.S. and M.H.R.; investigation, M.A.S.S. and M.H.R.; resources, H.-K.S.; data curation, M.A.S.S. and M.H.R.; writing—original draft preparation, M.A.S.S. and M.H.R.; writing—review and editing, M.A.S.S., M.H.R., and H.-K.S.; visualization, M.A.S.S. and M.H.R.; supervision, H.-K.S.; project administration, H.-K.S.; and funding acquisition, H.-K.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2022-2018-0-01423), supervised by the IITP (Institute for Information & Communications Technology Planning & Evaluation) and in part by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2020R1A6A1A03038540).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Basar, E.; Di Renzo, M.; De Rosny, J.; Debbah, M.; Alouini, M.S.; Zhang, R. Wireless communications through reconfigurable intelligent surfaces. IEEE Access 2019, 7, 116753–116773. [Google Scholar] [CrossRef]
Gong, S.; Lu, X.; Hoang, D.T.; Niyato, D.; Shu, L.; Kim, D.I.; Liang, Y.C. Toward smart wireless communications via intelligent reflecting surfaces: A contemporary survey. IEEE Commun. Surv. Tutor. 2020, 22, 2283–2314. [Google Scholar] [CrossRef]
Wu, Q.; Zhang, S.; Zheng, B.; You, C.; Zhang, R. Intelligent reflecting surface-aided wireless communications: A tutorial. IEEE Trans. Commun. 2021, 69, 3313–3351. [Google Scholar] [CrossRef]
Wei, X.; Shen, D.; Dai, L. Channel estimation for RIS assisted wireless communications—Part I: Fundamentals, solutions, and future opportunities. IEEE Commun. Lett. 2021, 25, 1398–1402. [Google Scholar] [CrossRef]
Jung, J.S.; Park, C.Y.; Oh, J.H.; Song, H.K. Intelligent Reflecting Surface for Spectral Efficiency Maximization in the Multi-User MISO Communication Systems. IEEE Access 2021, 9, 134695–134702. [Google Scholar] [CrossRef]
Lin, Z.; Niu, H.; An, K.; Wang, Y.; Zheng, G.; Chatzinotas, S.; Hu, Y. Refracting RIS Aided Hybrid Satellite-Terrestrial Relay Networks: Joint Beamforming Design and Optimization. IEEE Trans. Aerosp. Electron. Syst. 2022, 58, 3717–3724. [Google Scholar] [CrossRef]
Liu, Y.; Liu, X.; Mu, X.; Hou, T.; Xu, J.; Di Renzo, M.; Al-Dhahir, N. Reconfigurable Intelligent Surfaces: Principles and Opportunities. IEEE Commun. Surv. Tutor. 2021, 23, 1546–1577. [Google Scholar] [CrossRef]
Busari, S.A.; Huq, K.M.S.; Mumtaz, S.; Dai, L.; Rodriguez, J. Millimeter-Wave Massive MIMO Communication for Future Wireless Systems: A Survey. IEEE Commun. Surv. Tutor. 2017, 20, 836–869. [Google Scholar] [CrossRef]
An, K.; Lin, M.; Ouyang, J.; Zhu, W.; Member, S. Satellite Terrestrial Networks. IEEE J. Sel. Areas Commun. 2016, 34, 3025–3037. [Google Scholar] [CrossRef]
Zhang, J.; Liu, J.; Ma, S.; Wen, C.K.; Jin, S. Large System Achievable Rate Analysis of RIS-Assisted MIMO Wireless Communication with Statistical CSIT. IEEE Trans. Wirel. Commun. 2021, 20, 5572–5585. [Google Scholar] [CrossRef]
Zhang, S.; Zhang, R. Capacity Characterization for Intelligent Reflecting Surface Aided MIMO Communication. IEEE J. Sel. Areas Commun. 2020, 38, 1823–1838. [Google Scholar] [CrossRef]
Sejan, M.A.S.; Rahman, M.H.; Shin, B.-S.; Oh, J.-H.; You, Y.-H.; Song, H.-K. Machine Learning for Intelligent-Reflecting-Surface-Based Wireless Communication towards 6G: A Review. Sensors 2022, 22, 5405. [Google Scholar] [CrossRef] [PubMed]
Zhang, J.; Su, Q.; Tang, B.; Wang, C.; Li, Y. DPSNet: Multitask Learning Using Geometry Reasoning for Scene Depth and Semantics. IEEE Trans. Neural Netw. Learn. Syst. 2021, 1–12. [Google Scholar] [CrossRef] [PubMed]
Sun, Y.; Peng, M.; Zhou, Y.; Huang, Y.; Mao, S. Application of machine learning in wireless networks: Key techniques and open issues. IEEE Commun. Surv. Tutor. 2019, 21, 3072–3108. [Google Scholar] [CrossRef] [Green Version]
Liu, S.; Gao, Z.; Zhang, J.; Di Renzo, M.; Alouini, M.S. Deep denoising neural network assisted compressive channel estimation for mmWave intelligent reflecting surfaces. IEEE Trans. Veh. Technol. 2020, 69, 9223–9228. [Google Scholar] [CrossRef]
Elbir, A.M.; Papazafeiropoulos, A.; Kourtessis, P.; Chatzinotas, S. Deep channel learning for large intelligent surfaces aided mm-wave massive MIMO systems. IEEE Wirel. Commun. Lett. 2020, 9, 1447–1451. [Google Scholar] [CrossRef]
Shtaiwi, E.; Zhang, H.; Abdelhadi, A.; Han, Z. RIS-Assisted mmWave Channel Estimation Using Convolutional Neural Networks. In Proceedings of the 2021 IEEE Wireless Communications and Networking Conference Workshops (WCNCW), Nanjing, China, 29 March 2021; pp. 1–6. [Google Scholar] [CrossRef]
Coleri, S.; Ergen, M.; Puri, A.; Bahai, A. Channel estimation techniques based on pilot arrangement in OFDM systems. IEEE Trans. Broadcast. 2002, 48, 223–229. [Google Scholar] [CrossRef] [Green Version]
Shin, B.S.; Oh, J.H.; You, Y.H.; Hwang, D.D.; Song, H.K. Limited Channel Feedback Scheme for Reconfigurable Intelligent Surface Assisted MU-MIMO Wireless Communication Systems. IEEE Access 2022, 10, 50288–50297. [Google Scholar] [CrossRef]
Taha, A.; Alrabeiah, M.; Alkhateeb, A. Enabling Large Intelligent Surfaces With Compressive Sensing and Deep Learning. IEEE Access 2021, 9, 44304–44321. [Google Scholar] [CrossRef]
Wei, X.; Shen, D.; Dai, L. Channel Estimation for RIS Assisted Wireless Communications—Part II: An Improved Solution Based on Double-Structured Sparsity. IEEE Commun. Lett. 2021, 25, 1403–1407. [Google Scholar] [CrossRef]
Kiranyaz, S.; Avci, O.; Abdeljaber, O.; Ince, T.; Gabbouj, M.; Inman, D.J. 1D convolutional neural networks and applications: A survey. Mech. Syst. Signal Process. 2021, 151, 107398. [Google Scholar] [CrossRef]
Ioffe, S.; Szegedy, C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In Proceedings of the 32nd International Conference on Machine Learning (ICML), Lille, France, 6–11 July 2015; Volume 37, pp. 448–456. [Google Scholar] [CrossRef]
Jaradat, A.M.; Hamamreh, J.M.; Arslan, H. Modulation Options for OFDM-Based Waveforms: Classification, Comparison, and Future Directions. IEEE Access 2019, 7, 17263–17278. [Google Scholar] [CrossRef]

Figure 1. Intelligent reflecting surface in the multiuser MIMO communication system.

Figure 2. Deep learning architecture using CNN for IRS aided communication system.

Figure 3. Flowchart for the proposed system workflow.

Figure 4. Proposed Demod-CNN model training and validation progress for 50 epochs.

Figure 5. BER performance comparison of the proposed Demod-CNN and conventional technique.

Figure 6. SER performance comparison of the proposed Demod-CNN and conventional technique.

Table 1. Simulation parameters.

Parameters	Value
IRS elements	32 × 16
Transmitting antenna	2
Number of user	2
Number of subcarrier	128
Modulation	QPSK
Number of epoch	100
Minibatch size	200
Input size	8
Learning rate	0.01
Optimizer	ADAM
Noise	AWGN

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Sejan, M.A.S.; Rahman, M.H.; Song, H.-K. Demod-CNN: A Robust Deep Learning Approach for Intelligent Reflecting Surface-Assisted Multiuser MIMO Communication. Sensors 2022, 22, 5971. https://doi.org/10.3390/s22165971

AMA Style

Sejan MAS, Rahman MH, Song H-K. Demod-CNN: A Robust Deep Learning Approach for Intelligent Reflecting Surface-Assisted Multiuser MIMO Communication. Sensors. 2022; 22(16):5971. https://doi.org/10.3390/s22165971

Chicago/Turabian Style

Sejan, Mohammad Abrar Shakil, Md Habibur Rahman, and Hyoung-Kyu Song. 2022. "Demod-CNN: A Robust Deep Learning Approach for Intelligent Reflecting Surface-Assisted Multiuser MIMO Communication" Sensors 22, no. 16: 5971. https://doi.org/10.3390/s22165971

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Demod-CNN: A Robust Deep Learning Approach for Intelligent Reflecting Surface-Assisted Multiuser MIMO Communication

Abstract

1. Introduction

2. System Model

2.1. OFDM Communication

2.2. IRS Based Communication

2.3. Deep Learning Model

3. Simulation Setup

4. Results and Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI