A Joint Channel Estimation and Compression Method Based on GAN in 6G Communication Systems

Du, Ying; Li, Yang; Xu, Mingfeng; Jiang, Jiamo; Wang, Weidong

doi:10.3390/app13042319

Open AccessArticle

A Joint Channel Estimation and Compression Method Based on GAN in 6G Communication Systems

by

Ying Du

^1,2

,

Yang Li

²

,

Mingfeng Xu

^2,*,

Jiamo Jiang

²

and

Weidong Wang

¹

Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei 230052, China

²

Mobile Communications Innovation Center, China Academy of Information and Communications Technology, Beijing 100191, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2023, 13(4), 2319; https://doi.org/10.3390/app13042319

Submission received: 20 October 2022 / Revised: 25 January 2023 / Accepted: 7 February 2023 / Published: 10 February 2023

(This article belongs to the Special Issue Beyond 5G and 6G Communication Systems)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Due to the increasing popularity of communication devices and vehicles, the channel environment becomes more and more complex, which makes conventional channel estimation methods further increase the pilot overhead to maintain estimation performance. However, it declines the throughput of communication networks. In this paper, we provide a novel two-stage based channel estimation method by using generative adversarial networks (GANs) to handle this problem in orthogonal frequency division multiplexing (OFDM) systems. Specifically, the first stage aims to learn the mapping from a low-dimensional latent variable to the real channel sample. During the second stage, an iterative algorithm method is designed to find the optimal latent variable by matching the pilot channels of a real channel and generated channel. Then, the data channels are recovered based on the learned mapping relationship between the latent variable and the real channel sample. The simulation results show that our proposed method can achieve a performance gain of more than 2 dB with a pilot reduction by

75 %

when SNR is 10 dB, by comparing with the widely used Wiener filter interpolation method. In addition, as the low-dimensional latent variable can be obtained simultaneously, it can also be used for reducing the feedback overhead.

Keywords:

channel estimation; generative adversarial networks

1. Introduction

Channel estimation is a fundamental issue to be addressed in wireless communication systems since its accuracy has a significant impact on the recovery of the received signals as well as the management of interference suppression and wireless resource allocation and other tasks [1]. According to whether prior information is used, the channel estimation methods can be divided into three categories, which are the blind estimation method, the pilot-based estimation method, and the semi-blind estimation method, respectively. In particular, blind estimation methods acquire channel state information (CSI) from the structure and statistics of the received signals, while pilot-based estimation methods allocate a part of wireless resource to transmit known signals to obtain CSI. To enable high precision channel estimation performance, the latter has been widely used [2].

For a future beyond 5G and 6G communication systems, it is more challenging to acquire channel estimation results with high accuracy since many higher mobility and denser-connection scenarios will appear [3]. As introduced in [4], the accuracy of channel estimation can benefit from the increase in the number of pilots. To support the estimation accuracy to satisfy the requirements of service in a complex channel environment, typical conventional channel estimation methods, such as the least square (LS) estimator and the minimum mean square error (MMSE) estimator, have to add many more pilots. However, it declines the spectral efficiency of communication systems due to more wireless resources being allocated to transmit pilot signals. In addition, both methods have their own shortcomings [5]. Specifically, the LS estimator cannot be used to estimate data channels directly, while the MMSE estimator needs to know the statistical characteristics of both pilot and data channels in advance and consumes extra computational resources to perform the matrix inversion operation.

To overcome these bottlenecks, some deep learning-based channel estimation methods have been developed. According to whether the conventional channel estimation method is combined, the main idea of these methods can be classified in the following two categories. On the one side, there are some works focusing on designing neural networks as the denoising module embedded into existing conventional estimation methods. In [6], the authors employed two convolutional neural networks (CNNs) connected sequentially to denoise and smooth the channel matrix obtained by using conventional interpolation methods. A scheme that denoises the received signals based on CNN model at first and then estimates channel coefficients by using conventional estimation methods was proposed in [7]. On the other side, some studies are dedicated to learning the correlation between pilot channels and data channels by using neural networks. In [8], a CNN model was trained to learn the time–frequency correlation so that the complete channel matrix can be obtained by feeding pilot channels into the model. Moreover, Ref. [9] designed a CNN model to learn the time–frequency–spatial correlation in massive multiple-input–multiple-output (MIMO) systems. However, it is worth pointing out that although the performance can be improved significantly by comparing with conventional methods, most of the designed structure of neural networks is very dependent on the specific pilot configuration. It means that the model needs to be retrained once the pilot configuration changes, which cannot provide adaptive deployment.

As a newly emerging neural network structure, GAN has shown powerful performance in generating synthetic samples following real data distribution [10], which has been applied in many aspects, including image generation [11], image restoration [12], dataset extension [13], communication networks [14] and others. In the wireless communication domain, the GAN-based channel modeling in complex channel environment has attracted a lot of attention. In particular, Ref. [15] has studied the feasibility of using the conditional GAN model to replace the modeling channel transfer function. Therefore, a more precise parametric backpropagation can be guaranteed during the training process of an end-to-end communication system. By employing the federated learning framework [16,17], a distributed conditional GAN framework was proposed in [18] to enable multiple users to train a global model collaboratively. In terms of the channel estimation problem, some GAN-based schemes have been proposed to enhance estimation accuracy and reduce pilot overhead. In [19,20], GAN was employed to learn the distribution of channel correlation matrix in vehicular millimeter wave systems and frequency division duplex massive MIMO systems, respectively. Moreover, Ref. [21] provided a scheme aiming to learn the gradient of any point in high-dimensional channel space, in which any channel to be estimated can be recovered by following the learned gradient direction. However, consuming a large amount of storage resource is the shortcoming of this scheme. In addition, Ref. [22] proposed a virtual pilot generation-based scheme, where GAN is used to learn the correlation between pilot signals. By combining real pilots with generated virtual pilots, the channel estimation accuracy can be improved without extra cost of wireless resource. Furthermore, GAN has been used to learn the correlation between elements of channel matrix directly in [23]. With the aid of compressed sensing, the channel can be estimated without a significant loss of accuracy when more than a half of the number of pilots is reduced.

Although the above GAN-based works have explored the ability of GAN to learn the distribution of multiorder statistics for wireless channels in depth, they mainly focus on low-speed mobile users and neglect to be compatible with high-speed mobile users. In this paper, a novel two-stage channel estimation method based on GAN is proposed to guarantee flexible channel estimation under any pilot configuration and the performance evaluation is extended into the medium- and high-speed mobile scenario for OFDM systems. In particular, a GAN model is designed to learn the correlation between elements of the channel matrix in the first stage. During the second stage, a channel recovery method is given to achieve channel estimation via using the pretrained GAN model. In particular, our proposed scheme can estimate data channels directly via the given pilots without generating extra virtual pilots at first and then using the conventional interpolation method as introduced in [22], which facilitates the efficiency of channel estimation. Moreover, compared with [23], our proposed scheme does not need to find a measurement matrix that is necessary for compressed sensing. The main contribution of this paper is summarized as follows:

Firstly, a novel GAN-based channel estimation method is proposed, in which the coefficients over all data channels are obtained by matching the coefficients of real channels and generated channels at pilot positions. Meanwhile, the compressed low-dimensional latent variable is obtained simultaneously, which can be used to support the CSI feedback service with a low communication overhead.
Secondly, the simulation results are provided to show the performance gain obtained by using our proposed method. In particular, by comparing with the conventional Wiener filtering interpolation method, our proposed method can improve the accuracy performance by more than 2 dB with a $75 %$ reduction of the number pilots when the signal-to-noise (SNR) is 10 dB. It shows the potential to reduce the pilots overhead drastically. The achieved compression ratio in this experiment is 2.4%.

2. System Model and Problem Formulation

In this section, the wireless system model in consideration is provided at first, followed by the illustration of the studied channel estimation problem.

2.1. System Model

Consider the channel estimation scenario in the orthogonal frequency division multiplexing (OFDM) system over the frequency selective fast fading channel, and the structure of an OFDM block with

N_{s}

symbols in the time domain and

N_{f}

subcarriers in the frequency domain is depicted in Figure 1. In the case of frequency selective fast fading, the channel environment keeps constant within each element during transmission but varies among elements. Then the received OFDM symbols

y (i, j)

at the receiver can be expressed as

y (i, j) = h (i, j) s (i, j) + w (i, j), 1 \leq i \leq N_{s}, 1 \leq j \leq N_{f},

(1)

where

s (i, j)

denotes the transmitted symbols at the i-th symbol and j-th subcarrier,

h (i, j)

denotes the channel coefficient of the wireless link between transceiver and receiver, and

w (i, j)

is the additive white Gaussian noise with 0 mean and

σ^{2}

variance.

As shown in Figure 1, an OFDM block contains two types of elements, named pilot element and data element, respectively. In particular, prior known symbols are transmitted in the pilot elements to obtain the partial instantaneous channel state of the entire OFDM block, while the data symbols are delivered in the data elements. Define

α_{p} (i, j)

as the position indicator of pilot elements,

α_{p} (i, j) = 0

or 1. Specifically, if

α_{p} (i, j) = 1

, it indicates that the OFDM element located at i-th symbol and j-th subcarrier is used to transmit pilot symbols; otherwise, it means that the OFDM element belongs to the data element set. Then

y (i, j)

in (1) can be divided into two parts, which are the received pilot symbols

y_{p} (i, j)

and data symbols

y_{d} (i, j)

. Their expressions can be written as:

\begin{matrix} y_{p} (i, j) & = α_{p} (i, j) y (i, j), \end{matrix}

(2a)

\begin{matrix} y_{d} (i, j) & = (1 - α_{p} (i, j)) y (i, j) . \end{matrix}

(2b)

Similarly,

h (i, j)

,

s (i, j)

and

w (i, j)

can also be divided into the pilot and data terms with the same form of (2a) and (2b), respectively.

2.2. Problem Illustration

Before concentrating on the studied problem, the conventional channel estimation procedure of data elements is summarized. Generally, it consists of the following two steps:

Firstly, the channel coefficients of pilot elements are determined via using the received pilot symbols $y_{p} (i, j)$ and the transmitted pilot symbols $s_{p} (i, j) = α_{p} (i, j) s (i, j)$ . The LS estimator [24] is an efficient method to solve this problem, and its optimal solution can be written as

${\hat{h}}_{p} (i, j) = y_{p} (i, j) / s_{p} (i, j) .$

(3)

Note that if the block fading channel is considered within an OFDM block, the channel coefficients of data elements can be estimated directly by averaging all estimated results of pilot channels since the channel coefficient remains constant during the whole OFDM block transmission. However, in the case of frequency selective fast fading assumption, the channel with fast variation makes the above scheme impractical. To acquire the channel coefficients of data elements more precise, the following step needs to be implemented.
Secondly, based on the estimated results of pilot channels ${\hat{h}}_{p} (i, j)$ , the remaining channel coefficients of data elements can be obtained by using two dimensional interpolation methods, which include nearest neighbor interpolation, linear interpolation, bicubic interpolation and other methods [25]. In addition, the MMSE estimator [26] is a competitive method to achieve high accuracy estimation with extra cost of computation complexity. Its optimal solution can be found by multiplying a filtering matrix $A_{MMSE}$ with the pilot channel coefficient matrix ${\hat{h}}_{p}$ , which is denoted as ${\hat{h}}_{d} = A_{MMSE} {\hat{h}}_{p}$ . By minimizing the gap between ${\hat{h}}_{d}$ and the ideal channel coefficient $h_{ideal}$ , the optimal $A_{MMSE}$ can be derived as [6]

$A_{MMSE} = R_{h_{d}, h_{p}} {(R_{h_{p}, h_{p}} + σ^{2} {(s s^{H})}^{- 1})}^{- 1},$

(4)

where $R_{h_{d}, h_{p}} = E {h_{d} h_{p}^{H}}$ denotes the cross-correlation matrix between $h_{d}$ and $h_{p}$ , $R_{h_{p}, h_{p}} = E {h_{p} h_{p}^{H}}$ denotes the self-correlation matrix of $h_{p}$ , ${(\cdot)}^{H}$ denotes the conjugate transpose operator, and ${(\cdot)}^{- 1}$ denotes the inversion operator.

The expression of

A_{MMSE}

given by (4) shows that the usage of MMSE estimator relies on the acquirement of a complete channel correlation matrix, which is challenging to be determined in real time. Besides, the calculation of the matrix inversion leads to a large cost of computational consumption. To avoid these disadvantages, we provide a novel GAN-based channel estimation scheme in this paper. The key idea is to employ GAN to learn the correlation between elements within an OFDM block. Next, the data channel coefficients can be recovered by using the learned correlation between pilot and data channels instead of calculating

A_{MMSE}

. The optimization problem can be formulated as

P_{1} = min E \{∥ h_{ideal} - H (M, {\hat{h}}_{p}) ∥^{2}\},

(5)

where

M

denotes the trained GAN model, and

H (\cdot)

denotes the interpolation function that uses

{\hat{h}}_{p}

with

M

to infer

{\hat{h}}_{d}

.

3. A GAN Based Channel Estimation Scheme

In this section, a novel channel estimation method based on GAN is provided. The pipeline of our proposed channel estimation scheme is illustrated in Figure 2. As shown in Figure 2, the procedure contains two stages, which are the model training and model usage stages, respectively. During the model training stage, a designed GAN model is trained to be capable of producing synthetic OFDM channel samples that are similar to the real OFDM channel samples. In particular, the model training stage can be conducted offline or online. For the offline training case, the model can be trained with pre-collected channel samples so that it can be applied directly during communications. In the case of online training, the model is trained with samples collected in the current channel environment, which makes the learned characteristic distribution of channels more in line with the current scenario. Hence, there is a trade-off about deployment latency and synthetic channel samples similarity between offline and online modes. The selection of an appropriate training mode is customized according to different requirements of services. In addition, the joint utilization of both modes is feasible, which can be done by training a preset model offline and fine-tuning it online. During the model usage stage, the already trained GAN model combined with the estimated pilot channel coefficients

{\hat{h}}_{p}

is imported into a designed interpolation function

H

to obtain the data channel coefficients

{\hat{h}}_{d}

. The details of these two stages are presented in the following two subsections.

3.1. Model Training Stage: Training A GAN Model to Capture the Distribution of Real Channel

In this subsection, the procedure of generating synthetic channel samples with high similarity to real channel samples by training a GAN model is illustrated in detail. Firstly, the basic concept of GAN is provided as follows.

3.1.1. Basic Framework of GAN

As shown in Figure 3, the general structure of GAN is depicted. It consists of two independent neural networks, which are named generator G and discriminator D [10], respectively. In particular, the generator aims to learn the mapping relationship from a latent variable z in low-dimensional space to the real data samples in high-dimensional space. In the ideal case, the trained generator can produce a large variety of different synthetic channel samples that follow the real data distribution. The key goal of discriminator is to distinguish whether the input data come from the real dataset. Generally, to achieve this goal, the output layer of the discriminator network is designed to be a sigmoid function such that a probability value

p_{s}

labeling the input data can be obtained,

p_{s} \in [0, 1]

. When

p_{s}

approaches 1, it means that the input data sample is most likely to be a real data sample, while when

p_{s}

approaches 0, it means that the input data sample, with high probability, belongs to the synthetic sample set.

Since the generator tries to generate data samples capable of confusing the discriminator while the discriminator aims to distinguish synthetic data samples produced by the generator from the real data samples, the key objectives of the generator and discriminator are in conflict. Therefore, a min-max game problem for training the GAN model can be formulated, which is expressed as [10]

P_{2} = min_{θ_{g}} max_{θ_{d}} E_{x \sim p_{r}} {log D (x; θ_{d})} + E_{z \sim p_{z}} {log (1 - D (G (z; θ_{g}); θ_{d}))},

(6)

where

θ_{g}

and

θ_{d}

denote the parameters of generator and discriminator, respectively,

G (\cdot)

and

D (\cdot)

are the output of the generator and discriminator, respectively, and

p_{z}

and

p_{r}

are the probability distribution of the latent variable z and real dataset, respectively.

3.1.2. Training Procedure of a GAN Model

The specific procedure of training a GAN model that generates synthetic channel samples is depicted in Figure 4. As shown in this figure, the training of the generator and discriminator are conducted iteratively. The training procedure can be divided into two cases based on the types of data samples fed into the discriminator. In particular, when real channel data samples are inputted, only the parameters of discriminator need to be updated. The discriminator tries to maximize the output probability

p_{s} (x)

so that every real sample can be identified well, and its training objective can be expressed as

max_{θ_{d}} O_{d, r} = E_{x \sim p_{r}} {log D (x; θ_{d})} .

(7)

Moreover, when synthetic channel data samples generated by the generator are fed into the discriminator, both the parameters of the generator and discriminator are ready to be updated. Specifically, in terms of the discriminator, its training objective is to minimize the output probability

p_{s} (x)

so that the synthetic samples can be distinguished, which is written as

min_{θ_{d}} O_{d, s} = E_{z \sim p_{z}} {log D (G (z; θ_{g}); θ_{d})} .

(8)

As for the generator, its training objective is to maximize the output probability

p_{s} (G (z; θ_{g}))

so that the produced synthetic samples can deceive the discriminator to regard themselves as real samples, which is expressed as

max_{θ_{g}} O_{g} = E_{z \sim p_{z}} {log D (G (z; θ_{g}); θ_{d})} .

(9)

Based on the training objectives

O_{d, r}

in (7),

O_{d, s}

in (8) and

O_{g}

in (9), the procedure of training a GAN model to generate synthetic channel samples is summarized in Algorithm 1. In Algorithm 1, the generator is trained once when the parameters of the discriminator have finished R times updates, which is of benefit for balancing the performance between the generator and discriminator during the training procedure and avoiding the mode collapse issue [27]. The parameters can be updated via using gradient descent methods. In addition, the widely used cross entropy function is usually selected as the loss function for training the GAN model. After the GAN model is converged, stable synthetic channel samples can be generated.

Algorithm 1 GAN-based synthetic channel samples generation.

Initialization: $θ_{d} (0)$ , $θ_{g} (0)$ , the maximum number of iterations K, the ratio of training rounds between the discriminator and generator r, batch size $N_{B}$ , learning rate $λ$ , the form of loss function $F_{T} (\cdot)$ , and the convergence threshold $δ$ .
Repeat: For k-th iterations, $1 \leq k \leq K$
- Sample $N_{B}$ real channel samples;
- Generate $N_{B}$ synthetic channel samples by feeding $N_{B}$ sampled latent variables z independently into the generator;
- Feed real samples and synthetic samples into the discriminator, then calculate the loss function with respect to real samples as $L_{d, r} = F_{T} (O_{d, r})$ , and calculate the loss function with respect to synthetic samples as $L_{d, s} = F_{T} (O_{d, s})$ ;
- Update $θ_{d} (k)$ by following $θ_{d} (k) = θ_{d} (k - 1) + \frac{1}{N_{B}} \sum_{n = 1}^{N_{B}} λ \nabla_{θ_{d}} (L_{d, r} - L_{d, s})$ ;
  Ifk mod r $= 0$ , do generator training:
- Generate another $N_{B}$ synthetic channel samples and feed into discriminator;
- Calculate the loss function $L_{g} = F_{T} (O_{g})$ ;
- Update $θ_{g}$ by following $θ_{g} (⌊ k / r ⌋) = θ_{g} (⌊ k / r ⌋ - 1) - λ \nabla_{θ_{g}} L_{g}$ .
Termination: When $k > K$ or $∥ θ_{d} (k) - θ_{d} (k - 1) ∥ \leq δ$ and $∥ θ_{g} (⌊ k / r ⌋) - θ_{g} (⌊ k / r ⌋ - 1) ∥ \leq δ$ .
Output: Trained discriminator with parameters $θ_{d}^{*}$ and generator with parameters $θ_{g}^{*}$ .

3.2. Model Usage Stage: Using Generator Model to Achieve Data Channel Estimation

In this subsection, the trained GAN model is used to estimate the data channels with the pilot channels known. The channel recovery problem is similar to the image completion problem in the computer vision domain [12]. This problem can be solved based on the premise condition that any channel images following the real data distribution can be generated if the generator is well trained. Since there is a one-to-one correspondence between the latent variable and the channel matrix, there is potential to find the target entire channel with the partial pilot channels information known by adjusting the latent variable z. Thus, the original channel recovery problem

P_{1}

given by (5) can be transformed into the search problem of an optimal latent variable

z^{*}

, which is expressed as

P_{3} = arg min_{z^{*}} \frac{1}{N_{p}} \sum_{i = 1}^{N_{s}} \sum_{j = 1}^{N_{f}} ∥ {\hat{h}}_{p} (i, j) - α_{p} (i, j) G (z; θ_{g}) ∥, 1 \leq i \leq N_{s}, 1 \leq j \leq N_{f},

(10)

where

N_{p} = \sum_{i = 1}^{N_{s}} \sum_{j = 1}^{N_{f}} α_{p} (i, j)

.

The procedure of recovering data channels to solve

P_{3}

is presented in Figure 5. Note that only the generator model is involved in channel estimation. During the searching process, the trained generator is used without parameter modification. The entire process can be regarded as the interpolation function

H

defined in (5). As shown in Figure 5, the comparison between the real channels and the generated channels over all pilot positions is conducted to determine the gap between each other. Afterward, the value of latent variable z can be updated by using gradient descent methods, which makes the new output generated channels at pilot positions closer to the real ones. Once the gap is minimized, an optimal latent variable

z^{*}

that reconstructs the pilot channels faithfully can be found. Hence, the channels over all data positions will be recovered well due to the learned strong correlations among elements. The corresponding algorithm is provided in Algorithm 2.

Algorithm 2 Generator based Channel Estimation.

Initialization: Trained generator model $θ_{g}^{*}$ , latent variable $z (0)$ , learning rate $η$ , the form of loss function $F_{u} (\cdot)$ , the maximum number of iterations L and the convergence threshold $ϵ$ .
Input: The position indicator of pilot channels $α_{p} (i, j)$ , $1 \leq i \leq N_{s}$ , $1 \leq j \leq N_{f}$ .
- Obtain the estimated channel coefficients of all pilot elements ${\hat{h}}_{p} (i, j)$ based on (3);
  Repeat: For l-th iterations, $1 \leq l \leq L$
- Generate a synthetic channel sample $G (z (l - 1); θ_{g}^{*})$ with latent variable $z (l - 1)$ ;
- Calculate the gap $Δ_{p} (l - 1)$ between ${\hat{h}}_{p}$ and $G (z (l - 1); θ_{g}^{*})$ over all pilot channels;
- Feed $Δ_{p} (l - 1)$ into the loss function $F_{u} (\cdot)$ ;
- Update $z (l)$ by following $z (l) = z (l - 1) - η \nabla_{z} F_{u} (Δ_{p} (l - 1))$ .
Termination: When $l > L$ or $∥ z (l) - z (l - 1) ∥ \leq ϵ$ .
Output: The recovered entire channel matrix $G (z^{*}; θ_{g}^{*})$ .

In Algorithm 2, the optional loss function

F_{u} (\cdot)

has a wide range, including

L_{1}

-norm and

L_{2}

-norm and others. When the updating process of latent variable z is terminated, the final version of recovered entire channel matrix

G (z^{*}; θ_{g}^{*})

with both pilot and data channels can be obtained. Note that the proposed algorithm is available for supporting the implement of any pilot configuration without retraining the neural network since there is no reliance on pilot information during the training process of the GAN model. Therefore, the proposed scheme does not require additional training cost if the pilot configuration changes, which is beneficial for flexible deployment.

4. Simulation Settings and Results

In this section, simulation results are provided to show the performance gain of our proposed GAN-based channel estimation scheme. In particular, the OFDM channel dataset containing

N = 20, 000

samples is generated by following the TDL-C channel model [28], which characterizes the Rayleigh fading channel environment in the urban macrocell scenario under the non-line of sight (NLOS) condition. The ratio of the training samples to the total samples is set as

0.8

. Power normalization is operated for all samples. The number of subcarriers and symbols within each OFDM block are set as

N_{f} = 48

and

N_{s} = 14

, respectively. The carrier frequency is set as

f = 3.5

GHz, and the bandwidth of each subcarrier is set as

B_{f} = 30

kHz. The time interval of an OFDM block is set as

T = 0.5

ms. To avoid the inter symbol interference, the cyclic prefix accounting for

6 %

of the symbol length is added to the front of each symbol. In addition, to ensure that the maximal propagation delay is not larger than the duration of cyclic prefix, the root mean square (RMS) delay spread is considered to be set as

T_{d} = 300

ns. Moreover, the scenario that the terminal user moving with high speed is also considered, in which two speed cases are simulated, which are

v_{1} = 150

km/h and

v_{2} = 300

km/h, respectively.

As shown in Figure 6, three types of pilot configurations with different numbers of pilots are considered in the evaluation of the channel estimation performance. To reduce the cost of pilots and ensure the efficiency of data transmission, the maximum number of pilot elements is limited to 96. In particular, the pilot positions are designed on the same columns for all three configuration schemes, which are the 3, 6, 9, and 12-th columns. The aim of configuring the sparse pilot pattern is to capture the fast-changing trend of the channel environment. The only difference between these three schemes is the row positions of pilots. Specifically, in the cases of pilot configurations (a), (b) and (c), the pilots are inserted every two rows, every four rows, and every eight rows, respectively.

To evaluate the accuracy of the channel estimation, the performance metric needs to be determined at first. The normalized mean square error (NMSE) that has also been used in [6,7,23] is selected as the metric in this paper, and its definition can be expressed as

{NMSE}_{h} = \frac{1}{N} \sum_{n = 1}^{N} \frac{∥ h_{ideal, n} - G (z_{n}^{*}; θ_{g}^{*}) ∥^{2}}{∥ h_{ideal, n} ∥^{2}},

(11)

where n denotes the index of test sample.

Next, the architecture of the GAN model used in this experiment is shown in detail. The subsequent subsection gives the simulation results.

4.1. GAN Model Architecture

The architectures of discriminator and generator are illustrated in Figure 7a,b, respectively. As shown in this figure, both models are designed as fully connected networks with five linear layers. In the case of the discriminator, the input channel sample is flattened into a vector from the original matrix. Considering that the coefficient of the channel is with a complex form, an efficient way to handle this is to split it into the real and imaginary parts. In this experiment, the real part and the imaginary part are treated as two different samples. Hence, the input layer is still designed with

N_{s} \times N_{f} = 672

neurons. The hidden layer consists of three downsampling layers, which contain 512, 256 and 64 neurons, respectively. The first four layers are each followed by a LeakyReLU active function and a LayerNorm normalized function. Finally, the output layer contains only one neuron followed by a sigmoid function, which outputs a probability

p_{s}

to judge the authenticity of the input samples.

In the case of the generator, the input latent variable z is designed as a standard Gaussian vector with

N_{z} = 16

dimensions. To evaluate the compression degree of the original channel matrix by the generator, the compression ratio is defined as the ratio of the dimension of z to the dimension of h, which is expressed as

ζ = \frac{N_{z}}{N_{s} \times N_{f}} .

(12)

Based on (12), the compression ratio in this experiment is

ζ = 2.4 %

. It means that the proposed method can also be used for CSI feedback services to reduce the communication overhead drastically. The hidden layer consists of three upsampling layers that are symmetric to the downsampling layers of the discriminator. Similar active and normalized functions are inserted between the first four layers. In the end, the output layer contains 672 neurons connected with the Tanh activation function.

Other hyperparameters are summarized as follows. The ratio of training rounds between discriminator and generator is set as

r = 2

. To generate synthetic channel samples with high quality, it is recommended to train the GAN model with a large number of epoches [23,29]. Thus, the parameters of the GAN model are updated by using the RMSprop optimizer with learning rate

λ = 0.0001

for up to 1000 epoches. The loss function

F_{T}

is set as the binary cross entropy function. In addition, the loss function

F_{u}

in Algorithm 2 is set as the

L_{1}

-norm function.

4.2. Simulation Results

In Figure 8, the comparisons between the magnitude of channel coefficients estimated by our proposed GAN based method and those of the ground truth channel samples in both medium- and high-speed mobile scenarios are provided. In particular, the pilot configuration setting follows the scheme shown in Figure 6a. As shown in this figure, the channel changes more dramatically in the high-speed scenario than that in the medium-speed scenario. The channel variation trends of the estimated results in both scenarios coincide well with those of the ground truth results from both the two-dimensional and three-dimensional perspectives. Moreover, Figure 8b,d show that the estimated results are only with a limited error, which verifies the feasibility of our proposed method.

As shown in Figure 9, the evaluation for the accuracy performance of our proposed GAN-based channel estimation method is provided, where two mobile scenarios with speeds of 150 km/h and 300 km/h are considered in Figure 9a,b, respectively. In addition, the pilot configuration settings are given in Figure 6. In particular, the widely used conventional Wiener filtering interpolation method [30], a method that can achieve the MMSE criteria when SNR is high enough, is selected as the benchmark. The estimated data channels after Wiener filtering can be expressed as

{\hat{h}}_{d, w} = A_{w} {\hat{h}}_{p},

(13)

where

A_{w} = R_{h_{d}, h_{p}} R_{h_{p}, h_{p}}^{- 1}

denotes the filtering matrix, which is pre-estimated in the link-level simulation platform. In the mobile scenario with speed of 150 km/h shown in Figure 9a, compared with the conventional method, the GAN-based methods always outperform, even with a fewer number of pilots. However, when SNR is low, neither the conventional method nor the proposed GAN-based method can achieve good performance due to the existence of non-negligible error for the channel coefficients at pilot positions estimated by the LS method. The maximum gain can be obtained when SNR is 10 dB. Specifically, the performance gain is

3.4

dB,

3.0

dB and

2.7

dB when the number of pilot is 96, 48 and 24, respectively. It shows that the proposed method can reduce the pilot overhead by

75 %

with only

0.7

dB performance loss. As the SNR increases, the performance of the proposed method will converge to a stable level due to the learning ability being limited by the experimental model structure. By re-designing a more complicated network structure, the error will converge to a smaller value.

Moreover, in the mobile scenario with a speed of 300 km/h shown in Figure 9b, the performance curves follow a similar trend with those in the mobile scenario with a speed of 150 km/h. In particular, the achieved accuracy performance is lower than that in Figure 9a due to the more variable channel environment. The maximum gain also appears in the case that SNR is 10 dB, where the performance gains under 96, 48 and 24 pilots conditions are

3.4

dB,

2.8

dB and

2.4

dB, respectively. It shows that the estimation error of the proposed method increases 1 dB when

75 %

pilots are removed. It can be observed that as the number of pilots decreases, the performance gain decays faster in the high speed scenario than that in the medium-speed scenario; as a result, the accurate acquisition of channel correlation in a complex channel environment needs sufficient pilots.

5. Conclusions

In this paper, we studied the feasibility of using the GAN model to address the channel estimation problem in the scenarios where the channel varies dramatically toward future 6G communication systems. Specifically, the entire estimation process contains two stages, which are the model training stage and the model usage stage. In the model training stage, a pre-designed GAN model is used to learn the channel distribution. In the model usage stage, based on the learned correlation between elements in channel matrix, the data channel coefficients are recovered via matching the generated pilot channel coefficients by the trained GAN model and real pilot channel coefficients by the LS method. Hence, the proposed GAN-based scheme can support the channel estimation for any pilot configuration. Simulation results show that our proposed method can improve the estimation accuracy with a large reduction in the pilots overhead in both medium- and high-speed mobile scenarios. Meanwhile, the corresponding channel matrix is compressed to a low dimension, which is also useful for dramatically reducing the feedback overhead in the CSI feedback service. For future works, the pilot channels’ denoised scheme can be considered to combine with the proposed method to further improve the channel estimation performance, especially in the low SNR cases.

Author Contributions

Y.D. and J.J. conceived and designed the research. Y.L. and M.X. implemented the simulations. Y.D. and M.X. drafted the manuscript. W.W. helped organize the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Key R&D Program of China (Grant No.2020YFB1807100).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank the editors and reviewers for their efforts to help the publication of this work.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:

GAN	Generative Adversarial Networks
OFDM	Orthogonal Frequency-Division Multiplexing
CSI	Channel State Information
LS	Least Squares
MMSE	Minimum Mean Square Error
CNN	Convolutional Neural Networks
MIMO	Multiple-Input–Multiple-Output
SNR	Signal-to-Noise
NLOS	Non-line of Sight
NMSE	Normalized Mean Square Error

References

Liu, Y.; Tan, Z.; Hu, H.; Cimini, L.J.; Li, G.Y. Channel estimation for OFDM. IEEE Commun. Surv. Tutorials 2014, 16, 1891–1908. [Google Scholar] [CrossRef]
Kao, Y.; Wu, K. A low-complexity channel estimation based on a least-squares algorithm in OFDM systmes. Appl. Sci. 2022, 12, 4258. [Google Scholar] [CrossRef]
Wang, Z.; Du, Y.; Wei, K.; Han, K.; Xu, X.; Wei, G.; Tong, W.; Zhu, P.; Ma, J.; Wang, J.; et al. Vision, application scenarios, key technology trends for 6G mobile communications. Sci. China Inf. Sci. 2022, 65, 151301. [Google Scholar] [CrossRef]
Kay, S.M. Fundamentals of Statistical Signal Processing: Estimation Theory; Prentice-Hall: Hoboken, NJ, USA, 1993. [Google Scholar]
Guo, Y.; Qin, Z.; Dobre, O.A. Federated generative adversarial networks based channel estimation. In Proceedings of the 2022 IEEE International Conference on Communications Workshops (ICC Workshops), Seoul, Republic of Korea, 16–20 May 2022. [Google Scholar]
Soltani, M.; Pourahmadi, V.; Mirzaei, A.; Sheikhzadeh, H. Deep learning-based channel estimation. IEEE Commun. Lett. 2019, 23, 652–655. [Google Scholar] [CrossRef]
Balevi, E.; Doshi, A.; Andrews, J.G. Massive MIMO channel estimation with an untrained deep neural network. IEEE Trans. Wirel. Commun. 2020, 19, 2079–2090. [Google Scholar] [CrossRef]
Safari, M.S.; Pourahmadi, V.; Sodagari, S. Deep UL2DL: Data-driven channel knowledge transfer from uplink to downlink. IEEE Open J. Veh. Technol. 2019, 1, 29–44. [Google Scholar] [CrossRef]
Dong, P.; Zhang, H.; Li, G.Y.; Gaspar, I.S.; NaderiAlizadeh, N. Deep CNN based channel estimation for mmWave massive MIMO systems. IEEE J. Sel. Top. Signal Process. 2019, 13, 989–1000. [Google Scholar] [CrossRef]
Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial networks. Commun. ACM 2020, 63, 139–144. [Google Scholar] [CrossRef]
Yang, G.; Li, C.; Liu, X.; Fang, G. A THz passive image generation method based on generative adversarial networks. Appl. Sci. 2022, 12, 1976. [Google Scholar] [CrossRef]
Pathak, D.; Krahenbuhl, P.; Donahue, J.; Darrell, T.; Efros, A.A. Context encoders: Feature learning by inpainting. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016. [Google Scholar]
Abedi, M.; Hempel, L.; Sadeghi, S.; Kirsten, T. GAN-based approaches for generating structured data in the medical domain. Appl. Sci. 2022, 12, 7075. [Google Scholar] [CrossRef]
Ayanoglu, E.; Davaslioglu, K.; Sagduyu, Y.E. Machine learning in NextG networks via generative adversarial networks. IEEE Trans. Cogn. Commun. Netw. 2022, 8, 480–501. [Google Scholar] [CrossRef]
Ye, H.; Li, G.Y.; Juang, B.-H.F.; Sivanesan, K. Channel agnostic end-to-end learning based communication systems with conditional GAN. In Proceedings of the 2018 IEEE Globecom Workshops (GC Workshops), Abu Dhabi, United Arab Emirates, 9–13 December 2018. [Google Scholar]
Feng, C.; Zhao, Z.; Wang, Y.; Quek, T.Q.S.; Peng, M. On the design of federated learning in the mobile edge computing systems. IEEE Trans. Commun. 2021, 69, 5902–5916. [Google Scholar] [CrossRef]
Zhao, Z.; Feng, C.; Yang, H.H.; Luo, X. Federated-learning-enabled intelligent fog radio access networks: Fundamental theory, key techniques, and future trends. IEEE Wirel. Commun. 2020, 27, 22–28. [Google Scholar] [CrossRef]
Zhang, Q.; Ferdowsi, A.; Saad, W.; Bennis, M. Distributed conditional generative adversarial networks (GANs) for data-driven milimeter wave communications in UAN networks. IEEE Trans. Wirel. Commun. 2021, 21, 1438–1452. [Google Scholar] [CrossRef]
Li, X.; Alkhateeb, A.; Tepedelenlioglu, C. Generative adversarial estimation of channel covariance in vehicular millimeter wave systems. In Proceedings of the 2018 52nd Asilomar Conference on Signals, Systems, and Computers (ACSSC), Pacific Grove, CA, USA, 28–31 October 2018. [Google Scholar]
Banerjee, B.; Elliott, R.C.; Krzymien, W.A.; Farmanbar, H. Towards FDD massive MIMO: Downlink channel covariance matrix estimation using conditional generative adversarial networks. In Proceedings of the 2021 32nd Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), Helsinki, Finland, 13–16 September 2021. [Google Scholar]
Arvinte, M.; Tamir, J.I. Score-based generative models for robust channel estimation. In Proceedings of the 2022 IEEE Wireless Communications and Networking Conference (WCNC), Austin, TX, USA, 10–13 April 2022. [Google Scholar]
Hu, T.; Hunag, Y.; Zhu, Q.; Wu, Q. Channel estimation enhancement with generative adversarial networks. IEEE Trans. Cogn. Commun. Netw. 2020, 7, 145–156. [Google Scholar] [CrossRef]
Balevi, E.; Andrews, J.G. Wideband channel estimation with a generative adversarial network. IEEE Trans. Wirel. Commun. 2021, 20, 3049–3060. [Google Scholar] [CrossRef]
Simko, M.; Mehlfhrer, C.; Wrulich, M.; Rupp, M. Doubly dispersive channel estimation with scalable complexity. In Proceedings of the 2010 International ITG Workshop on Smart Antennas (WSA), Bremen, Germany, 23–24 February 2010; pp. 251–256. [Google Scholar]
Dong, C.; Loy, C.C.; He, K.; Tang, X. Image super-resolution using deep convolutional networks. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 38, 295–307. [Google Scholar] [CrossRef] [PubMed]
Omar, S.; Ancora, A.; Slock, D.T.M. Performance analysis of general pilot-aided linear channel estimation in LTE OFDMA systems with application to simplified mmse schemes. In Proceedings of the 2018 IEEE 19th International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), Cannes, France, 15–18 September 2008. [Google Scholar]
Arjovsky, M.; Chintala, S.; Bottou, L. Wasserstein generative adversarial networks. In Proceedings of the 34th International Conference on Machine Learning (PMLR), Sydney, Australia, 6–11 August 2017. [Google Scholar]
Study on Channel Model for Frequencies from 0.5 to 100 GHz (Rel 14); 3GPP TS 38.901. 5G; 3GPP FTP Server. 2017. Available online: https://www.etsi.org/deliver/etsi_tr/138900_138999/138901/14.00.00_60/tr_138901v140000p.pdf (accessed on 15 October 2022).
Xiao, H.; Tian, W.; Liu, W.; Shen, J. ChannelGAN: Deep learning-based channel modeling and generating. IEEE Wirel. Commun. Lett. 2022, 11, 650–654. [Google Scholar] [CrossRef]
Cavers, J.K. An analysis of pilot symbol assisted modulation for reyleigh fading channels (mobile radio). IEEE Trans. Veh. Technol. 1991, 40, 686–693. [Google Scholar] [CrossRef]

Figure 1. The structure of an OFDM block.

Figure 2. The procedure of the proposed GAN-based channel estimation scheme.

Figure 3. The structure of GAN.

Figure 4. Training procedure of a GAN model.

Figure 5. Procedure of recovering data channels with trained generator.

Figure 6. Pilot configuration schemes.

Figure 7. Illustration of used GAN model architecture.

Figure 8. Comparison of GAN-based estimated channel samples and ideal channel samples (ground truth results) when SNR is set as 20 dB. In (a,b), the speed of terminal user is set as 150 km/h, while that in (c,d) is set as 300 km/h. Moreover, (a,c) show the channel magnitude from a two-dimensional perspective, while (b,d) show that from a three-dimensional perspective.

Figure 9. NMSE evaluation of our proposed GAN-based channel estimation scheme.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Du, Y.; Li, Y.; Xu, M.; Jiang, J.; Wang, W. A Joint Channel Estimation and Compression Method Based on GAN in 6G Communication Systems. Appl. Sci. 2023, 13, 2319. https://doi.org/10.3390/app13042319

AMA Style

Du Y, Li Y, Xu M, Jiang J, Wang W. A Joint Channel Estimation and Compression Method Based on GAN in 6G Communication Systems. Applied Sciences. 2023; 13(4):2319. https://doi.org/10.3390/app13042319

Chicago/Turabian Style

Du, Ying, Yang Li, Mingfeng Xu, Jiamo Jiang, and Weidong Wang. 2023. "A Joint Channel Estimation and Compression Method Based on GAN in 6G Communication Systems" Applied Sciences 13, no. 4: 2319. https://doi.org/10.3390/app13042319

APA Style

Du, Y., Li, Y., Xu, M., Jiang, J., & Wang, W. (2023). A Joint Channel Estimation and Compression Method Based on GAN in 6G Communication Systems. Applied Sciences, 13(4), 2319. https://doi.org/10.3390/app13042319

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Joint Channel Estimation and Compression Method Based on GAN in 6G Communication Systems

Abstract

1. Introduction

2. System Model and Problem Formulation

2.1. System Model

2.2. Problem Illustration

3. A GAN Based Channel Estimation Scheme

3.1. Model Training Stage: Training A GAN Model to Capture the Distribution of Real Channel

3.1.1. Basic Framework of GAN

3.1.2. Training Procedure of a GAN Model

3.2. Model Usage Stage: Using Generator Model to Achieve Data Channel Estimation

4. Simulation Settings and Results

4.1. GAN Model Architecture

4.2. Simulation Results

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI