Next Article in Journal
A New Truncated Lindley-Generated Family of Distributions: Properties, Regression Analysis, and Applications
Next Article in Special Issue
A Transformer-Based Channel Estimation Method for OTFS Systems
Previous Article in Journal
Nonlinear Fokker–Planck Equations, H-Theorem and Generalized Entropy of a Composed System
Previous Article in Special Issue
Superimposed Perfect Binary Array-Aided Channel Estimation for OTFS Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Novel Joint Channel Estimation and Symbol Detection Receiver for Orthogonal Time Frequency Space in Vehicular Networks

1
Department of Electronic and Electrical Engineering, Southern University of Science and Technology, Shenzhen 518055, China
2
Information Hub, The Hong Kong University of Science and Technology (Guangzhou), Guangzhou 511458, China
*
Author to whom correspondence should be addressed.
Entropy 2023, 25(9), 1358; https://doi.org/10.3390/e25091358
Submission received: 26 June 2023 / Revised: 11 August 2023 / Accepted: 13 August 2023 / Published: 20 September 2023
(This article belongs to the Special Issue Delay-Doppler Domain Communications for Future Wireless Networks)

Abstract

:
A vehicular network embodies a specialized variant of wireless network systems, characterized by its capability to facilitate inter-vehicular communication and connectivity with the encompassing infrastructure. With the rapid development of wireless communication technology, high-speed and reliable communication has become increasingly important in vehicular networks. It has been demonstrated that orthogonal time frequency space (OTFS) modulation proves effective in addressing the challenges posed by high-mobility environments, as it transforms the time-varying channels into the delay-Doppler domain. Motivated by this, in this paper, we focus on the theme of integrated sensing and communication (ISAC)-assisted OTFS receiver design, which aims to perform sensing channel estimation and communication symbol detection. Specifically, the estimation of the sensing channel is accomplished through the utilization of a deep residual denoising network (DRDN), while the communication symbol detection is performed by orthogonal approximate message passing (OAMP) processing. The numerical results demonstrate that the proposed ISAC system exhibits superior performance and robustness compared to traditional methods, with a lower complexity as well. The proposed system has great potential for future applications in wireless communication systems, especially in challenging scenarios with high mobility and interference.

1. Introduction

A vehicular network is a type of wireless network that connects vehicles to each other and to the infrastructure around them. It constitutes a paramount technological substrate for intelligent transportation systems (ITS), smart urban environments, and autonomous vehicular navigation systems [1]. Vehicular networks furnish a comprehensive spectrum of onboard data utilities, encompassing enhanced road safety measures, uninterrupted navigational support, sophisticated traffic management, optimized driving comfort, and integrated infotainment provisions [2]. These networks operate as mobile ad hoc networks (MANETs) and employ various technologies such as Wi-Fi, Bluetooth, and dedicated short-range communications (DSRC) to facilitate communication among vehicles, roadside units (RSUs), traffic lights, and other entities. Meanwhile, it has gained extensive usage in providing real-time traffic information, navigation assistance, and entertainment services to drivers and passengers, as well as in monitoring traffic conditions and detecting accidents [3,4]. In addition to efficient communication, vehicular networks also require highly accurate sensing capabilities, typically achieved through radar systems. The ability to accurately sense both the spatial positioning and velocity of vehicles is a critical requirement for effectuating collision avoidance and empowering viable real-time vehicular safety applications. To address this requirement, integrated sensing and communication (ISAC) technology has been applied [5]. By incorporating ISAC technology into the vehicular network, vehicles are capable of simultaneously performing communication and sensing tasks. Furthermore, the sharing of hardware architecture between the sensing and communication systems enabled by ISAC technology significantly reduces hardware deployment costs, a highly advantageous feature in the context of vehicular networks [6].
Recently, orthogonal time frequency space (OTFS) modulation has been gaining attention due to its ability to provide more reliable communications than OFDM, especially in high-mobility environments [7]. Contrary to its OFDM-based ISAC counterpart, the OTFS-aided ISAC system harmoniously integrates the transmission of signals and the respective channel responses within a singular domain for both functionalities, concurrently displaying robustness against delay and Doppler spreads [8]. Moreover, the RSU has the capability to harness the OTFS-ISAC signals to disseminate downlink information to vehicles and concurrently infer their sensor data predicated on the reflective echoes. As such, the OTFS has recently attracted considerable interest as a viable alternative for ISAC systems [5,9,10,11].
As mentioned above, the sensing information of the environment can be obtained from the OTFS channel matrix without additional signaling costs. Accordingly, it is essential for the RSU and users to estimate channel information with low latency accurately. The authors in [12] proposed a message passing algorithm (MPA)-based channel estimation based on the hidden Markov model, which effectively addressed the effects of fractional Doppler for OTFS systems. For the purpose of radar sensing, Ref. [13] proposed a two-dimensional (2D) correlation-based algorithm for OTFS sensing, which overcame the channel estimation in the DD domain and enhanced the estimation accuracy. Furthermore, an off-grid channel estimation method was adopted for estimating effective channel response in the DD domain, which facilitated a reduction in estimation accuracy degradation associated with fractional Doppler. In some intricate scenarios such as underwater acoustic communication with a t-distribution noise, model-based approaches may fall short in addressing channel estimation. Due to the robust data-driven capabilities of deep learning, it is frequently employed to tackle previously intractable issues [14,15,16]. For instance, the authors in [15] employed a deep residual convolution neural network to deal with noise in RIS channels. In [16], the authors leveraged reinforcement learning to address MIMO channel estimation, addressing the computational complexity of algorithms and the time-varying nature of the channel. Inspired by the successful application of deep learning, we employed an innovative neural-network-based technique for OTFS channel estimation.
The rigorous exploitation of the time–frequency diversity facilitated by the wireless channel, associated with the OTFS modulation, generally necessitates a higher level of detection complexity compared to its conventional counterpart, the OFDM modulation. This amplified complexity arises from the distinctiveness of the delay-Doppler (DD) domain channel. In this context, the intercepted signal could be interpreted as an overlay of transmitted signals, each being power-diminished, phase-altered, and subjected to both delay and Doppler shifts, relative to each discernible path of the wireless channel. In order to confront this complexity escalation, a widely accepted solution is the implementation of a message-passing algorithm (MPA) for OTFS detection. Nevertheless, this algorithm may impose significant computational demands. To decrease the detection complexity, a Gaussian approximation technique has been introduced to address the intersymbol interference (ISI) in the DD domain [17]. Recently, a novel extension to the maximum a posterior (MAP), termed the hybrid MAP and parallel interference cancellation (PIC) detection, has been proposed [18]. In this innovative solution, the Gaussian approximation is selectively applied to portions of the DD domain ISI, based on the corresponding path attenuation levels. In a further initiative to streamline receiver complexity, a variational Bayes (VB) methodology was proposed as a surrogate to the optimal MAP detection [19]. Moreover, [20] introduced low-complexity detectors, capable of achieving commendable error rate performance by leveraging the block-diagonal characteristic of the time-domain effective channel matrix, using an SVD-based orthogonal linear estimator.
In this paper, we design a novel joint channel estimation and symbol detection framework for the OTFS-aided ISAC in uplink communications to reduce the signaling overhead, focusing on the dual tasks of channel estimation and symbol detection within vehicular networks. The RSU can leverage the OTFS-ISAC signals to garner estimates of temporal delays, Doppler shifts, and angles related to vehicles encompassed within its communication sphere. Specifically, for channel estimation, a deep neural network (DNN)-based estimator is employed to achieve accurate and robust channel estimation for some complex scenarios where the noise cannot be solved by model-based methods. On the other hand, we adopt a low-complexity algorithm named orthogonal approximate message passing (OAMP) for the symbol detection. To the best of our knowledge, this is the first joint channel estimation and detection scheme based on deep learning designed for uplink communication in OTFS-assisted ISAC scenarios. To further illustrate our work, the contributions are listed as follows:
  • We propose an integrated OTFS-ISAC system that leverages a novel deep residual denoising network and OAMP algorithm for joint channel estimation and symbol detection. Specifically, we design a DNN-based denoising module, incorporating an element-by-element subtraction operation that concurrently exploits the spatial attributes of noise-infected channel matrices as well as the additive character of the perturbation. In addition, a subnetwork that can generate thresholds is utilized to eliminate irrelevant features, thereby enhancing the estimation accuracy.
  • We employ the OAMP detector to carry out the OTFS symbol detection, as it has the potential for MMSE optimality and exhibits excellent detection performance.
  • We demonstrate the effectiveness of the proposed system through simulations and compare its performance with traditional communication systems. The proposed system shows superior performance in challenging environments such as a high Doppler frequency and delay spread, making it a promising solution for future wireless communication systems.
The subsequent sections of this paper are structured as follows. In Section 2, the system model is presented, encompassing the OTFS modulation, the communication model, and the sensing model. In Section 3, the ISAC-based OTFS transmitter design is presented. Section 4 summarizes the experimental results, and Section 5 concludes the paper.

2. System Model

As depicted in Figure 1, we consider a classical vehicular network where a roadside unit (RSU) is employed to serve P vehicles. In particular, the RSU is equipped with a uniform linear array (ULA) consisting of N t transmit antennas and a separate ULA with N r receive antennas. Assuming there is a sufficient distance between the transmit and receive arrays, the echoes will not interfere with the downlink communication. In addition, the ULAs of the RSU are positioned parallel to the road, resulting in identical angle-of-arrival (AoA) and angle-of-departure (AoD) values for each ULA. We then model the vehicles as point targets and assume that each vehicle is equipped with a single antenna for communicating with the RSU. In the following, we present the signal models of OTFS-assisted ISAC systems.

2.1. The Modulation of OTFS Signal

The system diagram of the OTFS modulation is presented in Figure 2. Let us denote M and N as the number of subcarriers and the number of time slots for each OTFS frame, respectively. The N M information symbols are taken from a modulation alphabet A = { a 1 , , a Q } comprising Q elements. Then, these symbols are arranged in a 2D matrix X C N × M with entries X [ k , l ] , where k { 0 , 1 , , N 1 } and l { 0 , 1 , , M 1 } , respectively. The transmitter proceeds to map the DD domain symbols X [ k , l ] to N M samples X tf on the time-frequency grid by using the inverse symplectic fast Fourier transform (ISFFT), which is expressed as
X tf [ n , m ] = 1 M N k = 0 N 1 l = 0 M 1 X [ k , l ] e j 2 π n k N m l M ,
where n { 0 , 1 , , N 1 } , m { 0 , 1 , , M 1 } , and X tf is the transmitted samples matrix in the time–frequency domain. Based on the TF domain transmitted symbols, we can adopt the conventional OFDM modulator to convert the 2D samples X tf into a continuous-time waveform x ( t ) with the aid of a transmit waveform g t x ( t ) given by
x ( t ) = n = 0 N 1 m = 0 M 1 X tf [ n , m ] g t x ( t n T ) e j 2 π m Δ f ( t n T ) ,
where Δ f represents the frequency spacing between any adjacent subcarriers. Note that Equation (2) is the Heisenberg transform in Figure 2, and the Winger transform is the inverse transformation of the Heisenberg transform [21]. By utilizing OTFS signals, communication and sensing can be enhanced more efficiently without requiring additional hardware devices and signal processing.

2.2. Communication Signal

During the downlink transmission phase, the RSU can utilize an antenna array for creating either a wide beam or an omnidirectional signal, which enables the detection of all relevant targets within the range. During the tracking mode, the RSU can generate multiple beams to facilitate information transmission and target tracking. The multibeam signal with a beamforming matrix W C N t × P can be expressed as
x ˜ ( t ) = W x ( t ) ,
where x ( t ) = [ x 1 ( t ) , , x P ( t ) ] is the transmitted signal to all targets. The ith column of W is denoted as w i = p i N t a N t θ ˜ i , fulfilling both power allocation and directional steering functions. Specifically, the term p i represents the power allocation factor for the ith column of W , and the column vector a N t ( θ ˜ i ) represents the steering vector pointing to the desired direction θ i , which is given by
a N t θ i = 1 , e j π sin θ i , , e j N t 1 π sin θ i T .
Thanks to the asymptotic orthogonality of the massive antenna array, the ith target’s communication channel is predominantly line-of-sight (LoS) after transmitting the beamformer, which is expressed as
C DD i ( τ , ν ) = h i a N u θ i a N t H θ i δ τ τ i δ ν ν i ,
where h i = c 4 π f c d i 2 represents the channel gain. Here, the term c represents the signal propagation speed, f c is the carrier frequency, d i is the range distance to the ith target, and τ i and ν i represent the delay and Doppler shift, respectively. The received signal can be written as
y i ( t ) = h i u i H a N u θ i a N t H θ i w i x i t τ i e j 2 π ν i t τ i + z i ( t ) ,
where u i C N u × 1 represents the received beamformer, and z i ( t ) denotes the noise signal in the time domain. With a received pulse shaping filter g r x ( t ) , the expression for the received OTFS signal in the DD domain can be obtained by the Winger transform and the symplectic Fourier transform (SFFT), i.e.,
Y [ l , k ] = h i u i H a N u θ i a N t H θ i f i X ( l l i ) M , ( k k i ) N + Z [ l , k ] ,
where the integers k i = ν i N T and l i = τ i M Δ f , and Z [ l , k ] denotes the independent white Gaussian noise sample with a power spectral density of N 0 .

2.3. Sensing Signal

In the downlink transmission, the information about the environment can be acquired from the OTFS channel matrix, which is given by
H ( t , τ ) = i = 1 K γ i a N r θ i a N t H θ i δ τ η i e j 2 π v i t ,
where γ i , η i , and ν i represent the reflected coefficient, delay, and Doppler shift corresponding to the ith target, respectively. The received sensing echoes at the RSU can then be expressed as:
r ( t ) = i = 1 K γ i a N r θ i a N t H θ i x ˜ t η i e j 2 π v i t + n ( t ) ,
where n ( t ) denotes the additional measurement noise. Recall that the steering vectors with different angular values are asymptotically orthogonal for massive MIMO receive antenna arrays [8], i.e., a N r H ( θ i ) a N r ( θ i 0 ) 0 for θ i θ i 0 . This implies that we can omit the interference resulted from different targets in the sensing echoes and the RSU is capable of differentiating various targets based on their angles of arrival (AoAs). Therefore, one can extract the sensing echo from the ith target from r ( t ) using a receive beamformer b i = a N r H ( θ ˜ i ) , which can be written as:
r i ( t ) = γ i b i H a N r θ i a N t H θ i f i s i t η i e j 2 π v i t + n ( t ) .
It is noteworthy that the angular parameter θ i can also be inferred when processing the receive beamformer by comparing the gains obtained from different beam directions. Therefore, Equation (9) can be recast as:
r i ( t ) = G a x i t η i e j 2 π v i t + n i ( t ) ,
where G a denotes the composite antenna array gain. As a step forward, by means of an ideal receive filter and operating OTFS demodulation, we have the input–output relationship in the DD domain given by:
R [ l , k ] = G a k = 0 N 1 l = 0 M 1 H l , k · X l l M , k k N + n [ l , k ] ,
where H [ l , k ] represents the gain of the ith target at the DD grid (bin) with indices l 0 and k 0 corresponding to the delay of l M Δ f and Doppler of k N T .

2.4. JCESD for OTFS-Based Vehicular Networks

In this subsection, we first present the OTFS-based ISAC system framework as illustrated in Figure 3. At a time slot, the users receive the y ( t ) from the RSU via beam assignment, while the RSU receives the reflected signal r ( t ) from the vehicles. Given the received echo, the RSU performs OTFS demodulation to obtain the DD domain signal R ( l , k ) as that establishes high-quality communication service quality, and the RSU needs to obtain information about the vehicles’ azimuth angles. As mentioned before, the vehicle status information can be acquired from the outcomes of the OTFS channel estimation C D D ( τ , ν ) , thus the RSU is capable of utilizing its antenna arrays to create “pencil-like” beams, enabling a precise alignment with user locations. On the other hand, based on the received signal, users estimate the channel matrix, containing sensing information about the surrounding environment. More importantly, the receiver must accurately and efficiently demodulate symbol information based on the channel information. For this purpose, a low-complexity OAMP detection technique is employed.

3. The Joint Channel Estimation and Symbol Detection

To ensure precise CSI for sensing and communication, we introduce a deep learning (DL)-based framework for the OTFS-assisted ISAC system, as illustrated in Figure 4. In this section, we first present an embedded pilot-aided scheme, which involves incorporating pilot symbols into the transmitted symbol matrix to facilitate the preprocessing of the channel estimation. Next, we convert the OTFS channel estimation problem into an issue about sparse image denoising. In particular, we utilize a DL-based framework for accurate channel estimation. Finally, upon acquiring the precise OTFS channel matrix, the RSU performs the OAMP detection to demodulate symbols effectively.

3.1. Pilot Placement

Pilot symbols can be inserted into the transmitted signal to aid the channel estimation, which is applicable in the OTFS-assisted ISAC system, as shown in Figure 5. In this paper, we consider the case of integer delay and Doppler frequencies. Let x p = X d d [ l p , k p ] represent the pilot symbol.
The entries of the DD domain matrix satisfy [22]
X d d [ l , k ] = x p l = l p , k = k p , 0 l [ l p l m a x , l p + l m a x ] , k [ k p 2 k m a x , k p + 2 k m a x ] , data symbol otherwise . ,
where l m a x and k m a x denote the spacing between the data symbol and the pilot symbol, which also corresponds to the maximum delay and Doppler shift of the target. Define X d and X p as the data matrix and pilot matrix which satisfy
X d [ l , k ] = 0 l [ l p l m a x , l p + l m a x ] k [ k p 2 k m a x , k p + 2 k m a x ] , data symbol otherwise .
and
X p [ l , k ] = x p l = l p , k = k p , 0 otherwise . .
Subsequently, Equation (7) can be reformulated as
Y d d [ l , k ] = H d d [ l , k ] X d d [ l , k ] + Z d d [ l , k ] = H d d [ l , k ] X d [ l , k ] + X p [ l , k ] + Z d d [ l , k ] ,
where ⊛ denotes the cyclic convolution operation. In order to obtain the CSI by using the pilot symbol, we introduce a received pilot matrix Y p C M × N , which satisfies
Y p [ l , k ] = H d d [ l , k ] X p [ l , k ] + Z d d [ l , k ] .
Since X p is known by the RSU, and Y p can be obtained by setting some entries of Y d d to 0, the channel estimation problem in the proposed framework can be formulated as a sparse image denoising problem, represented by:
Y p = H p + Z d d ,
where
H p [ l , k ] = H d d [ l , k ] X p [ l , k ] .
Here, the matrix H p [ l , k ] incorporates the OTFS effective channel H d d [ l , k ] , where each element is characterized by a Bernoulli–Gaussian distribution instead of a Gaussian PDF. Consequently, employing a Bayesian general linear model is unfeasible for the data model, e.g., a linear minimum mean square error (LMMSE). Moreover, the noise component Z d d is not limited to a Gaussian distribution but can include arbitrary noise. As a result, obtaining an explicit expression for a Bayesian estimator within the model-driven approach becomes intractable. In contrast, we employ a data-driven approach to introduce convolutional neural networks (CNNs)-based channel estimation framework for OTFS systems in the subsequent section.

3.2. The Architecture of the DL Network

Note that CNNs possess a favorable capability to extract features from matrices containing noisy observations. In addition, the subtraction architecture employed in deep neural networks (DNNs) facilitates the exploration of the additive characteristics of the noise [23,24]. Based on the CNNs and residual subtraction architecture [15], we developed a deep residual denoising network (DRDN) to effectively learn and eliminate the residual noise from received symbols while eliminating irrelevant features. Figure 6 illustrates the general structure of the DRDN. It comprises several components, including an input transformation layer, D denoising blocks, and an adaptive soft threshold layer. The hyperparameters associated with the DRDN are presented in Table 1. In the subsequent sections, we provide an introduction to each layer of the network.
Y = Y p Y p , H = H p H p ,
where Y C 2 k max × l max × 2 and H C M × N × 2 are the real-valued received signals and the channel coefficients, respectively. Here, subscripts and represent the real and imaginary components of the input variable, respectively. Therefore, it is possible to devise an efficient denoiser using a DRDN approach to enhance the OTFS channel estimation.
Denoising Block: Convolutional Neural Networks (CNNs) are widely employed in numerous DL applications [25,26] due to their remarkable feature extraction capabilities. As illustrated in Figure 6, the proposed DRDN model follows a general architecture. The denoising performance is gradually enhanced by leveraging the D denoising blocks with identical structures. Each denoising block comprises a residual subnetwork and an elementwise subtraction operation. The residual subnetwork consists of L layers that employ three types of functions. In the first layer, the “Conv+BN+ReLU” composition is employed to extract the spatial characteristics of the channel matrix. This composition involves applying convolution (Conv) and rectified linear unit (ReLU) operations. Batch normalization (BN) is incorporated between the Conv and ReLU operations to enhance network stability and expedite training. The second composition, with L 2 layers, employs the same structure but employs a kernel size suitable for nonlinear transformations among the features. The ultimate layer employs a solitary convolution operation to generate the noise matrix for subsequent elementwise subtraction. Specifically, the denoised channel matrix is obtained through the process of subtracting each element of the residual subnetwork from the corresponding element of the original input.
T N = I i = 0 D 1 S i = I i = 0 D 1 f θ i ( I i ) .
Here, f θ i , T N , and I denote the function of the ith DB, the output, and input of DB. The residual term, referred to as the residual noise, is denoted as S i .
DL-based threshold layer: At the last layer, a soft shrinkage function is designed to represent the sparse features, which is inspired by sparse code in image denoising [27]. To choose an appropriate threshold, we designed an adaptive thresholding layer, where the threshold is determined by a subnetwork. Figure 7 illustrates the proposed adaptive thresholding module, which begins by applying a nonlinear operation to the feature map T N obtained from the denoising module. The subnet f 1 ( · ) is adopted to obtain the coarse estimation λ t . In particular, the global average pooling (GAP) is used to acquire C × 1 × 1 dimensional output. Subsequently, λ t is propagated through a two-layer fully connected network to calculate the scaling parameter α using the sigmoid function, i.e., α = sigmoid ( λ t ) . This scaling parameter α represents the threshold scaling factor, which is used to compute the final threshold given by
λ = α λ t
The final channel estimation is performed by comparing each element in T N with the threshold λ . If an element is greater than λ , it is retained; otherwise, it is set to 0, effectively performing a filtering operation. Mathematically, it can be expressed as follows
η ( λ , T N ) = sign ( T N ) max { | T N | λ , 0 } ,
where η is the adaptive threshold filtering function, sign ( · ) denotes the sign function, and its value is 1 when the value of x is greater than zero and −1 when it is less than 0.
Output layer: At the end of the neural network, the effective area Y p ˜ C 2 k max × l max needs to be converted to complete channel matrix H d d C M × N , and is then applied to the sensing and communication. In summary, this paper introduces the DRDN architecture in channel estimation to improve the denoising performance. The architecture incorporates D denoising blocks, which systematically eliminate complex noise in unmodeled scenarios. Additionally, a DL-based soft shrinkage operation is employed to accurately determine the delay and Doppler parameters, which in turn facilitate effective target sensing and OTFS symbol detection.

3.3. Estimation of Neural Network

Utilizing the DRDN architecture as a foundation, we subsequently develop a channel estimation scheme which comprises two distinct phases: offline training and online estimation.
Offline training: In the offline training phase, a large quantity of data are used to train the network, eventually resulting in a trained model. The training data are denoted by
( Y ˜ , H ) = Y ˜ ( 1 ) , H ( 1 ) , , Y ˜ N , H N ,
where Y ˜ i , H i , i { 1 , 2 , N } , is the ith training example of ( Y ˜ , H ) . Moreover, Y ˜ i C 2 k max × l max , H i C 2 k max × l max are the input of the DRDN and the label. According to the mean square error (MSE) criterion, the cost function of the offline training phase can be formulated as
J M S E ( θ ) = 1 N i = 1 N f θ ( Y i ˜ ) H i 2 .
Based on this formulation, the DRDN model can utilize the backpropagation (BP) algorithm to achieve effective training. Specifically, the loss function is equivalent to the MMSE estimator when N [25]. That is, the performance of the DRDN estimator tends to converge to the optimal estimator as the size of the training data increases sufficiently.
Online estimation: In the online estimation, the initial coarse channel estimation result, denoted as Y ˜ t e s t , is obtained from the received symbols. Subsequently, the data are sent into the DRDN estimator, and the process of online estimation can be represented as follows:
H e s t = h θ ( f θ ( Y ˜ t e s t ) ) ,
where h θ denotes the hard-shrink operation. The developed channel estimation method based on the DRDN is a universal approach that enhances system performance by achieving superior channel estimation.

3.4. Communication Symbol Detection

In this section, we provide a brief introduction to the OAMP algorithm that we utilized to perform the OTFS symbol detection in the ISAC system, where the estimated effective channel matrix from the DRDN is utilized.
Now, let us develop the OTFS detection problem within the perspective of OAMP. To the best of our knowledge, the OTFS system can be considered as a linear system under a linear constraint Γ as well as a nonlinear constraint Φ , given by:
Linear constraint Γ : y = H ^ eff x + n ˜ ,
Nonlinear constraint Φ : X [ m , n ] A , m , n ,
where H ^ eff C M N × M N is the estimated effective channel matrix, n ˜ represents the additive white Gaussian noise with half power spectral density N 0 , and A is the constellation alphabet (e.g., QPSK) with cardinality | A | . Our objective is to find the minimum mean square error (MMSE) estimation of x , i.e.,
x ^ = E { x | y , H ^ eff , Γ , Φ } .
The aforementioned problem can be solved by OAMP with the aid of an iterative method that includes an orthogonal linear estimator (LE) γ and an orthogonal nonlinear estimator (NLE) ϕ . We now invoke the OAMP following the standard procedures shown in Ma et al. [28] as:
Orthogonal LE γ : x ^ γ ϕ = γ ( x ¯ ϕ γ ) ,
Orthogonal NLE ϕ : x ¯ ϕ γ = ϕ ( x ^ γ ϕ ) .
The core of OAMP is that both the LE and NLE should be designed orthogonally [28]. Hence, the correlation problem, arising from the errors of the iterative process, can be solved thanks to the orthogonality of LE and NLE. Furthermore, the orthogonality can also make the iterative process gradually and steadily converge to the MMSE. It should be noted that both the orthogonality LE and NLE hold if and only if the following constraints are satisfied for iteration l 0 :
E ( ξ l γ ϕ ) H ξ l ϕ γ = 0 , E ( ξ l ϕ γ ) H ξ l + 1 γ ϕ = 0 ,
where ξ l γ ϕ and ξ l ϕ γ represent the associated Gram–Schmidt (GS) errors, which are written as
x ^ l γ ϕ = α l γ ϕ x + ξ l γ ϕ ,
x ¯ l ϕ γ = α l ϕ γ x + ξ l ϕ γ ,
with α = 1 N E { x ^ T x } and E { x T ξ } = 0 . The optimal OAMP is then constructed for both LE and NLE using the GS orthogonalization (GSO) given by:
Orthogonal LE γ : x ^ l γ ϕ = γ ( x ¯ l ϕ γ ) = γ ^ ( x ¯ l ϕ γ ) B γ ^ x ¯ l ϕ γ , Orthogonal NLE ϕ : x ¯ l + 1 ϕ γ = ϕ ( x ^ l γ ϕ ) = ϕ ^ ( x ^ l γ ϕ ) B ϕ ^ x ^ l γ ϕ ,
where γ ^ is a linear MMSE (LMMSE) estimator, ϕ ^ represents an MMSE detector, e.g., a symbolwise constellation demapper, B γ ^ and B ϕ ^ denote the respective generalized signal orthogonalization (GSO) coefficients. These coefficients are designed to fulfill the orthogonality conditions stated in Equation (30). For instance, the value of B γ ^ for the LMMSE estimator is expressed as
B γ ^ = v γ ^ / v x ¯ ,
where v γ ^ = 1 N E { || γ ^ ( x ¯ l ϕ γ ) x || 2 } and v x ¯ = 1 N E { || ξ l ϕ γ || 2 } . The computational complexity of OAMP consists of the complexity of the orthogonal LE and orthogonal NLE, where the orthogonal LE contains the LMMSE and orthogonalization operations, whose complexities are O ( ( M N ) 3 ) and O ( M N ) , respectively, and the nonorthogonal NLE mainly contains the constellation demapper and orthogonalization operations, both of which have a complexity of O ( M N ) . Hence, the overall complexity of OAMP is O ( ( M N ) 3 + M N ) .

4. Simulation Result

In this section, we substantiate the efficiency of the proposed algorithm based on the DRDN through a comprehensive analysis of simulation outcomes. We also provide details of the simulation setups.

4.1. Simulation Setups

In this study, the OTFS frame was configured with the parameters N = 32 and M = 32, indicating the presence of 32 time slots and 32 subcarriers in the time–frequency (TF) domain, as [3]. According to [29], the carrier frequency can be set to 3 GHz, while the subcarrier spacing was maintained at 7.5 kHz. To characterize the channel, we considered a total number of paths denoted as P, which was set to six. Due to the limitation of the size of the OTFS data frame, the maximum Doppler shift and maximum delay shift were normalized and denoted by k max = 4 and l max = 5 , respectively. The channel gain was modeled by a complex Gaussian distribution with zero mean and a variance of 1 P [30]. Finally, we employed quadrature amplitude modulation (QAM) for efficient bit mapping in our system. For the neural network training, we used the Monte Carlo method to generate 2 × 10 4 samples. The learning rate was set as 0.01 , the batch size was B = 256 , and a weight decay = 0.001 was adopted to deal with overfitting.

4.2. Sensing Channel Estimation

To evaluate the performance of the channel estimation, a comparison of the proposed DL-based method with four baseline algorithms (i.e., the LS, the LMMSE, the orthogonal match pursuit (OMP) [31], and the threshold-based method) was made. Note that the threshold-based method is the optimal algorithm in scenarios with a Gaussian white noise when the noise variance is known at the receiver [22]. In addition, the evaluation metric employed was the normalized mean squared error (NMSE), which is defined as NMSE = 10 log 10 || H ^ H || 2 2 || H || 2 2 , where H ^ and H are the estimated and ground truths, respectively.
As shown in Figure 8, we assessed the proposed methods in scenarios involving a correlated noise [32] and a noise following a t-distribution [23], which are prevalent in real-world settings. It can be observed that a performance gap exists between the LS estimator and the LMMSE estimator, as seen in Figure 8a. This disparity arises because the LS estimator treats the channel as an unknown constant with deterministically defined attributes. In contrast, the LMMSE estimator involves the computation of the noise covariance matrix, leading to a performance improvement of approximately 3 dB compared to the LS method in correlated-noise scenarios. The OMP method frequently employed in compressed sensing algorithms outperforms both the LS and LMMSE algorithms, owing to its consideration of the inherent sparsity within the recovery channel. Nevertheless, the performance of this iterative algorithm experiences a significant degradation due to the stringent convergence conditions and the influence of noise. In comparison to the previous methods, the threshold-based methods perform better when the noise covariance is known at the receiver. Under sparse channel conditions, most errors come from grid points other than the main path. By setting an appropriate threshold, these noises can be effectively removed, resulting in a better performance and robustness compared to the LS and LMMSE methods. In comparison to the traditional LS, LMMSE, and threshold-based methods, deep learning neural networks can more accurately estimate channel characteristics and have a better robustness. By incorporating training into the network design, the absolute value of the residual information can weaken the impact of the noise to a certain extent. The proposed deep learning neural network outperforms existing threshold-based and linear detection methods, improving the performance by approximately 8 dB compared to the OMP method and approximately 2 dB compared to the threshold-based method. As depicted in Figure 8a, in the scenario of a t-distribution noise, it is noticed that the proposed DRDN demonstrates a significant enhancement in estimation performance, outperforming all other considered algorithms. This is due to the inability of the model-based approach to incorporate prior knowledge from the data. In contrast, the DRDN can substantially enhance channel estimation performance due to its robust data-driven capability and the incorporation of sparse prior information.
Figure 9 presents comparisons of the NMSE and BER achieved by the proposed joint channel estimation and symbol detection scheme, as well as by several benchmarks, including the LS, LMMSE, MP, and threshold-based methods. As illustrated in Figure 9, both threshold-based methods and DNN-based methods have superior channel estimation performance compared to traditional LS and LMMSE methods. Note that the LMMSE-based method considers the variance of the noise, resulting in a performance approximately 5 dB higher than the LS method. In comparison with the threshold-based and LMMSE methods, the DRDN method performs better, with a performance approximately 7 dB higher than the OMP algorithm. In the Gaussian noise scenario, the threshold method is the optimal solution when the noise level is ideally acquired at the receiver. In practice, the DRDN has a higher adaptability and generalization ability in complex noise scenarios. The DRDN can accurately complete noise reduction tasks, effectively distinguishing the main channel, and restore the channel.
On the other hand, the results show that when the estimated channel information from the DRDN is used as input, the proposed scheme achieves superior performance compared to the other detectors. Specifically, the OAMP detector exhibits the best performance, while the performance of the MP detector is close to that of OAMP. Furthermore, the results in Figure 9 demonstrate the robustness of OAMP, which maintains good performance even at lower SNR ( E b / N 0 ) and with imperfect channel estimation. Overall, these findings confirm the effectiveness of the proposed joint channel estimation and symbol detection scheme.

5. Conclusions

In this paper, we focused on the theme of ISAC-assisted OTFS receiver design, which aims to perform channel estimation for sensing purposes and detect communication symbols. In order to achieve an effective estimation of the channel, we proposed the utilization of a novel DRDN within our framework. The DRDN was meticulously designed, incorporating a denoising block based on CNNs that encompassed an elementwise subtraction structure. This unique architectural feature enabled the network to exploit both the spatial characteristics of noisy channel matrices and the inherent additive properties of the noise simultaneously. By capitalizing on the robust feature extraction and denoising capabilities of CNNs, our proposed DRDN method showcased an exceptional accuracy in channel estimation, surpassing alternative approaches in the field. The CNN-based channel estimation approach for OTFS had a good potential with promising directions for future work, including solving the fractional delay-Doppler cases and a novel network structure that may require a smaller number of trainable parameters. For the OTFS symbol detection, we utilized the OAMP detector, which had the potential for MMSE optimality and exhibited excellent detection performance. Through simulations, we demonstrated the effectiveness of the proposed system and compared its performance with traditional communication systems. The proposed system exhibited superior performance in challenging environments such as a high Doppler frequency and delay spread, making it a promising solution for future wireless communication systems.

Author Contributions

Conceptualization, X.Z. and Z.L.; Methodology, X.Z. and H.W.; Writing—original draft, X.Z.; Writing—review & editing, Z.Y. and J.W.; Supervision, W.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported in part by National Natural Science Foundation of China under Grant 62101232, and in part by the Guangdong Provincial Natural Science Foundation under Grant 2022A1515011257.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Liu, X.; Xu, B.; Wang, X.; Zheng, K.; Chi, K.; Tian, X. Impacts of sensing energy and data availability on throughput of energy harvesting cognitive radio networks. IEEE Trans. Veh. Technol. 2022, 72, 747–759. [Google Scholar] [CrossRef]
  2. Yuan, W.; Li, S.; Wei, Z.; Jiang, J.; Zhang, H.; Fan, P. A Survey on Orthogonal Time Frequency Space: New Delay Doppler Communications Paradigm in 6G era. arXiv 2022, arXiv:2211.12955. [Google Scholar] [CrossRef]
  3. Wei, Z.; Yuan, W.; Li, S.; Yuan, J.; Ng, D.W.K. Off-grid channel estimation with sparse bayesian learning for OTFS systems. IEEE Trans. Wirel. Commun. 2022, 21, 7407–7426. [Google Scholar] [CrossRef]
  4. Li, S.; Yuan, W.; Liu, C.; Wei, Z.; Yuan, J.; Bai, B.; Ng, D.W.K. A novel ISAC transmission framework based on spatially-spread orthogonal time frequency space modulation. IEEE J. Sel. Areas Commun. 2022, 40, 1854–1872. [Google Scholar] [CrossRef]
  5. Liu, F.; Cui, Y.; Masouros, C.; Xu, J.; Han, T.X.; Eldar, Y.C.; Buzzi, S. Integrated sensing and communications: Towards dual-functional wireless networks for 6G and beyond. IEEE J. Sel. Areas Commun. 2022, 40, 1728–1767. [Google Scholar] [CrossRef]
  6. Yuan, W.; Wei, Z.; Li, S.; Yuan, J.; Ng, D.W.K. Integrated sensing and communication-assisted orthogonal time frequency space transmission for vehicular networks. IEEE J. Sel. Top. Signal Process. 2021, 15, 1515–1528. [Google Scholar] [CrossRef]
  7. Hadani, R.; Monk, A. OTFS: A new generation of modulation addressing the challenges of 5G. arXiv 2018, arXiv:1802.02623. [Google Scholar]
  8. Yuan, W.; Li, S.; Wei, Z.; Yuan, J.; Ng, D.W.K. Bypassing channel estimation for OTFS transmission: An integrated sensing and communication solution. In Proceedings of the 2021 IEEE Wireless Communications and Networking Conference Workshops (WCNCW), Nanjing, China, 29 March 2021; pp. 1–5. [Google Scholar]
  9. Xiong, Y.; Liu, F.; Cui, Y.; Yuan, W.; Han, T.X. Flowing the information from Shannon to Fisher: Towards the fundamental tradeoff in ISAC. In Proceedings of the GLOBECOM 2022—2022 IEEE Global Communications Conference, Nanjing, China, 29 March 2021; pp. 5601–5606. [Google Scholar]
  10. Dong, F.; Wang, W.; Hu, Z.; Hui, T. Low-complexity beamformer design for joint radar and communications systems. IEEE Commun. Lett. 2020, 25, 259–263. [Google Scholar] [CrossRef]
  11. Meng, X.; Liu, F.; Lu, S.; Chepuri, S.P.; Masouros, C. RIS-assisted integrated sensing and communications: A subspace rotation approach. arXiv 2022, arXiv:2210.13987. [Google Scholar]
  12. Li, Z.; Yuan, W.; Zhou, L. UAMP-Based Channel Estimation for OTFS in the Presence of the Fractional Doppler with HMM Prior. In Proceedings of the 2022 IEEE/CIC International Conference on Communications in China (ICCC Workshops), Foshan, China, 11–13 August 2022; pp. 304–308. [Google Scholar]
  13. Zhang, K.; Yuan, W.; Li, S.; Liu, F.; Gao, F.; Fan, P.; Cai, Y. Radar Sensing via OTFS Signaling: A Delay Doppler Signal Processing Perspective. arXiv 2023, arXiv:2301.09909. [Google Scholar]
  14. Zheng, K.; Jia, X.; Chi, K.; Liu, X. DDPG-based joint time and energy management in ambient backscatter-assisted hybrid underlay CRNs. IEEE Trans. Commun. 2022, 71, 441–456. [Google Scholar] [CrossRef]
  15. Liu, C.; Liu, X.; Ng, D.W.K.; Yuan, J. Deep residual learning for channel estimation in intelligent reflecting surface-assisted multi-user communications. IEEE Trans. Wirel. Commun. 2021, 21, 898–912. [Google Scholar] [CrossRef]
  16. Kim, T.K.; Min, M. Reinforcement Learning-Aided Channel Estimator in Time-Varying MIMO Systems. Sensors 2023, 23, 5689. [Google Scholar] [CrossRef] [PubMed]
  17. Raviteja, P.; Phan, K.T.; Hong, Y.; Viterbo, E. Interference Cancellation and Iterative Detection for Orthogonal Time Frequency Space Modulation. IEEE Trans. Wirel. Commun. 2018, 17, 6501–6515. [Google Scholar] [CrossRef]
  18. Li, S.; Yuan, W.; Wei, Z.; Yuan, J.; Bai, B.; Ng, D.W.K.; Xie, Y. Hybrid MAP and PIC detection for OTFS modulation. IEEE Trans. Veh. Technol. 2021, 70, 7193–7198. [Google Scholar] [CrossRef]
  19. Yuan, W.; Wei, Z.; Yuan, J.; Ng, D.W.K. A Simple Variational Bayes Detector for Orthogonal Time Frequency Space (OTFS) Modulation. IEEE Trans. Veh. Technol. 2020, 69, 7976–7980. [Google Scholar] [CrossRef]
  20. Wen, H.; Yuan, W.; Wu, N.; Wen, J. A Low-Complexity Cross-Domain OAMP Detector for OTFS: (Invited Paper). In Proceedings of the 2022 International Symposium on Wireless Communication Systems (ISWCS), Hangzhou, China, 19–22 October 2022; pp. 1–6. [Google Scholar] [CrossRef]
  21. Yuan, Z.; Liu, F.; Yuan, W.; Guo, Q.; Wang, Z.; Yuan, J. Iterative Detection for Orthogonal Time Frequency Space Modulation With Unitary Approximate Message Passing. IEEE Trans. Wirel. Commun. 2022, 21, 714–725. [Google Scholar] [CrossRef]
  22. Raviteja, P.; Phan, K.T.; Hong, Y. Embedded pilot-aided channel estimation for OTFS in delay–Doppler channels. IEEE Trans. Veh. Technol. 2019, 68, 4906–4917. [Google Scholar] [CrossRef]
  23. Zhang, X.; Yuan, W.; Liu, C. Deep Residual Learning for OTFS Channel Estimation with Arbitrary Noise. In Proceedings of the 2022 IEEE/CIC International Conference on Communications in China (ICCC Workshops), Foshan, China, 11–13 August 2022; pp. 320–324. [Google Scholar]
  24. Zhang, X.; Yuan, W.; Liu, C.; Liu, F.; Wen, M. Deep Learning with a Self-Adaptive Threshold for OTFS Channel Estimation. In Proceedings of the 2022 International Symposium on Wireless Communication Systems (ISWCS), Hangzhou, China, 19–22 October 2022; pp. 1–5. [Google Scholar]
  25. Liu, C.; Wei, Z.; Ng, D.W.K.; Yuan, J.; Liang, Y.C. Deep transfer learning for signal detection in ambient backscatter communications. IEEE Trans. Wirel. Commun. 2020, 20, 1624–1638. [Google Scholar] [CrossRef]
  26. Liu, X.; Liu, C.; Li, Y.; Vucetic, B.; Ng, D.W.K. Deep Residual Learning-Assisted Channel Estimation in Ambient Backscatter Communications. IEEE Wirel. Commun. Lett. 2021, 10, 339–343. [Google Scholar] [CrossRef]
  27. Hyvärinen, A.; Hoyer, P.; Oja, E. Sparse code shrinkage: Denoising by nonlinear maximum likelihood estimation. Adv. Neural Inf. Process. Syst. 1998, 11. [Google Scholar]
  28. Ma, J.; Ping, L. Orthogonal AMP. IEEE Access 2017, 5, 2020–2033. [Google Scholar] [CrossRef]
  29. Li, S.; Yuan, J.; Yuan, W.; Wei, Z.; Bai, B.; Ng, D.W.K. Performance analysis of coded OTFS systems over high-mobility channels. IEEE Trans. Wirel. Commun. 2021, 20, 6033–6048. [Google Scholar] [CrossRef]
  30. Liu, C.; Li, S.; Yuan, W.; Liu, X.; Ng, D.W.K. Predictive precoder design for OTFS-enabled URLLC: A deep learning approach. IEEE J. Sel. Areas Commun. 2023, 41, 2245–2260. [Google Scholar] [CrossRef]
  31. Li, Q.; Gong, Y.; Meng, F.; Li, Z.; Miao, L.; Xu, Z. Residual learning based channel estimation for OTFS system. In Proceedings of the 2022 IEEE/CIC International Conference on Communications in China (ICCC Workshops), Foshan, China, 11–13 August 2022; pp. 275–280. [Google Scholar]
  32. Moustakas, A.L.; Simon, S.H.; Sengupta, A.M. MIMO capacity through correlated channels in the presence of correlated interferers and noise: A (not so) large N analysis. IEEE Trans. Inf. Theory 2003, 49, 2545–2561. [Google Scholar] [CrossRef]
Figure 1. The ISAC-OTFS vehicular network.
Figure 1. The ISAC-OTFS vehicular network.
Entropy 25 01358 g001
Figure 2. OTFS modulation.
Figure 2. OTFS modulation.
Entropy 25 01358 g002
Figure 3. JCESD for OTFS-Based vehicular networks.
Figure 3. JCESD for OTFS-Based vehicular networks.
Entropy 25 01358 g003
Figure 4. The framework for JCESD.
Figure 4. The framework for JCESD.
Entropy 25 01358 g004
Figure 5. The embedded pilot scheme.
Figure 5. The embedded pilot scheme.
Entropy 25 01358 g005
Figure 6. Architecture of DRDN.
Figure 6. Architecture of DRDN.
Entropy 25 01358 g006
Figure 7. The subnet for the adaptive threshold.
Figure 7. The subnet for the adaptive threshold.
Entropy 25 01358 g007
Figure 8. NMSE comparisons with the proposed schemes and benchmarks with correlated noise (a) and with t-distribution noise (b).
Figure 8. NMSE comparisons with the proposed schemes and benchmarks with correlated noise (a) and with t-distribution noise (b).
Entropy 25 01358 g008
Figure 9. Joint channel estimation and symbol detection (a,b). (a) NMSE comparisons with the proposed schemes and benchmarks with Gaussian noise. (b) BER comparisons with the proposed schemes and benchmarks.
Figure 9. Joint channel estimation and symbol detection (a,b). (a) NMSE comparisons with the proposed schemes and benchmarks with Gaussian noise. (b) BER comparisons with the proposed schemes and benchmarks.
Entropy 25 01358 g009
Table 1. Hyperparameters of the DRDN.
Table 1. Hyperparameters of the DRDN.
Input layer: real-valued matrix with dimension 2 k max × l max
Denoising Module: D denoising blocks share the same construction
LayersOperationFilter size
1Conv + BN + ReLU 128 × ( 3 × 2 × 2 )
2~ L 1 Conv + BN + ReLU 128 × ( 3 × 2 × 128 )
3Conv 2 × ( 3 × 2 × 128 )
Subnetwork: generate the threshold array
Module NameOperationParameters
f 1 ( · ) Conv + BN + ReLU 32 × ( 3 × 2 × 2 )
f 2 ( · ) FC + BN + ReLU 2 × 1 × 1
Output layer: recovery channel matrix of size M × N × 2
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, X.; Wen, H.; Yan, Z.; Yuan, W.; Wu, J.; Li, Z. A Novel Joint Channel Estimation and Symbol Detection Receiver for Orthogonal Time Frequency Space in Vehicular Networks. Entropy 2023, 25, 1358. https://doi.org/10.3390/e25091358

AMA Style

Zhang X, Wen H, Yan Z, Yuan W, Wu J, Li Z. A Novel Joint Channel Estimation and Symbol Detection Receiver for Orthogonal Time Frequency Space in Vehicular Networks. Entropy. 2023; 25(9):1358. https://doi.org/10.3390/e25091358

Chicago/Turabian Style

Zhang, Xiaoqi, Haifeng Wen, Ziyu Yan, Weijie Yuan, Jun Wu, and Zhongjie Li. 2023. "A Novel Joint Channel Estimation and Symbol Detection Receiver for Orthogonal Time Frequency Space in Vehicular Networks" Entropy 25, no. 9: 1358. https://doi.org/10.3390/e25091358

APA Style

Zhang, X., Wen, H., Yan, Z., Yuan, W., Wu, J., & Li, Z. (2023). A Novel Joint Channel Estimation and Symbol Detection Receiver for Orthogonal Time Frequency Space in Vehicular Networks. Entropy, 25(9), 1358. https://doi.org/10.3390/e25091358

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop