Bayesian Learning-Based Clustered-Sparse Channel Estimation for Time-Varying Underwater Acoustic OFDM Communication

Wang, Shuaijun; Liu, Mingliu; Li, Deshi

doi:10.3390/s21144889

Open AccessArticle

Bayesian Learning-Based Clustered-Sparse Channel Estimation for Time-Varying Underwater Acoustic OFDM Communication^†

by

Shuaijun Wang

¹

,

Mingliu Liu

¹

and

Deshi Li

^1,2,*

¹

Electronic Information School, Wuhan University, Wuhan 430072, China

²

Collaborative Innovation Center of Geospatial Technology, 129 Luoyu Road, Wuhan 430072, China

^*

Author to whom correspondence should be addressed.

^†

This paper is an extended version of our paper published in Wang, S.; Li, D.; Liu, M.; Huang, W.; Chen, H.; Cen, Y. Clustered-Sparse Bayesian Learning for Channel Estimation in Underwater Acoustic OFDM Systems. In Proceedings of the 2020 International Conference on Wireless Communications and Signal Processing (WCSP), Nanjing, China, 21–23 October 2020; pp. 546–551.

Sensors 2021, 21(14), 4889; https://doi.org/10.3390/s21144889

Submission received: 17 June 2021 / Revised: 14 July 2021 / Accepted: 15 July 2021 / Published: 18 July 2021

(This article belongs to the Special Issue Underwater Acoustics Modelling and Control)

Download

Browse Figures

Versions Notes

Abstract

:

Orthogonal frequency division multiplexing (OFDM) has been widely adopted in underwater acoustic (UWA) communication due to its good anti-multipath performance and high spectral efficiency. For UWA-OFDM systems, channel state information (CSI) is essential for channel equalization and adaptive transmission, which can significantly affect the reliability and throughput. However, the time-varying UWA channel is difficult to estimate because of excessive delay spread and complex noise distribution. To this end, a novel Bayesian learning-based channel estimation architecture is proposed for UWA-OFDM systems. A clustered-sparse channel distribution model and a noise-resistant channel measurement model are constructed, and the model hyperparameters are iteratively optimized to obtain accurate Bayesian channel estimation. Accordingly, to obtain the clustered-sparse distribution, a partition-based clustered-sparse Bayesian learning (PB-CSBL) algorithm was designed. In order to lessen the effect of strong colored noise, a noise-corrected clustered-sparse channel estimation (NC-CSCE) algorithm was proposed to improve the estimation accuracy. Numerical simulations and lake trials are conducted to verify the effectiveness of the algorithms. Results show that the proposed algorithms achieve higher channel estimation accuracy and lower bit error rate (BER).

Keywords:

clustered-sparse channel estimation; Bayesian learning; underwater acoustic communication; orthogonal frequency division multiplexing

1. Introduction

Underwater acoustic (UWA) sensor networks can be applied to a variety of underwater activities, including environment monitoring, resource exploration and equipment navigation. Reliable high-rate communication is significant for massive data transmission between sensor nodes [1]. However, the UWA channel is widely known as one of the most difficult communication media, and high-rate communication with high reliability is quite challenging. Since the sound attenuation rapidly increases with frequency, the bandwidth available for communication is extremely limited. Due to the reflection and refraction of sound in the ocean, the multipath spread in a medium-range shallow water channel can extend over tens or even hundreds of milliseconds [2], resulting in severe frequency-selective signal distortion. Due to the dynamic environment and moving transceivers in ocean, the UWA channel is time-varying. In particular, the normalized frequency offset factor induced by Doppler can be on the order of

10^{- 3}

compared with

10^{- 7}

for high-speed mobile radio channels [2].

Due to high spectral efficiency and good anti-multipath performance, orthogonal frequency division multiplexing (OFDM) and some OFDM-based modulation techniques are widely adopted for UWA communications [3,4,5,6]. In UWA-OFDM systems, accurate channel state information (CSI) is vital for channel equalization and adaptive transmission [7,8], which can significantly affect the reliability and throughput. However, it is difficult to estimate the time-varying channel impulse responses (CIRs) with excessive delay spread. To overcome this problem, some structural information such as sparsity can be exploited to improve channel estimation performance. The paths of the UWA channel tend to be clustered-sparse, which carry comprehensive structural information [9,10,11,12,13]. As shown in Figure 1, there are many weak paths centering around the eigen-paths associated with the medium refractions and surface/bottom bounces, which form the clustered-sparse structure. As different clusters are usually derived from different transmission routes, the channel coherence time usually varies across clusters. Thus, the path variation for different clusters can reflect the channel variation characteristics [14], and can be used for predicting the time-varying UWA channel [15]. However, the fading process of the UWA channel may randomly vary [2], which makes it difficult to accurately obtain the path variation for different clusters between adjacent OFDM blocks. To obtain the clustered-sparse structure of the UWA channel, we proposed a clustered-sparse Bayesian channel estimation method in [16], which is the preliminary version of this work (This work extends the conference paper [16]) by (i) adding a Bayesian learning-based channel estimation architecture; (ii) adding a noise-corrected clustered-sparse channel estimation algorithm to lessen the effect of strong colored noise; (iii) giving the detailed derivation and performance analysis of the proposed algorithms; and (iv) presenting more complete performance results related to modulation order, model hyperparameters and noise distribution).

In addition to the time-varying CIRs with excessive delay spread, the complex distribution of noise in the UWA channel also makes the channel estimation difficult. Generally, there are two kinds of channel noise, including the ambient noise and site-specific noise. The former type is presented in the background of the quiet deep sea, and exhibits a colored characteristic when approximated by Gaussian distribution, while for the site-specific noise, significant non-Gaussian components should be carefully considered [2]. Some experiments have confirmed that the noise is non-uniform across different locations, times and frequencies [17]. As a popular noise model in most UWA-OFDM systems, additive white Gaussian noise (AWGN) is straightforward for modeling and solving the noise distribution of the UWA channel, but it deviates from the actual underwater noise. Obviously, the deviation of the noise model means the estimation error of the noise distribution, which will lead to the degradation of channel estimation and signal detection performance.

To tackle the challenge of estimating the time-varying UWA channel with excessive delay spread and complex noise distribution, we adopted Bayesian learning to obtain the multipath and noise distribution characteristics of the UWA channel, based on which Bayesian channel estimation is derived to improve the performance of pilot-based channel estimation. The main contributions of this work are summarized as follows:

To estimate the time-varying multipath channel with colored noise, we propose a novel Bayesian learning-based channel estimation architecture for UWA-OFDM systems. Specifically, a clustered-sparse channel distribution model is constructed to characterize the delay power spectrum and temporal correlation of each cluster in the multipath channel, and a noise-resistant channel measurement model is constructed to reduce the noise disturbance. By learning the model hyperparameters, the Bayesian channel estimation based on the two models can be iteratively optimized.
To obtain the clustered-sparse distribution, we propose a partition-based clustered-sparse Bayesian learning (PB-CSBL) algorithm. Through the cluster partition, different clusters can learn different channel correlation coefficients, and thus the inter-cluster interference of the multipath channel can be suppressed.
To lessen the effect of strong colored noise, we propose a noise-corrected clustered-sparse channel estimation (NC-CSCE) algorithm. Based on the iterative symbol decision and noise correction, the more accurate hyperparameters of the models can be obtained, which can improve the accuracy of the Bayesian channel estimation.

The rest of this paper is organized as follows. Section 2 discusses the related works. In Section 3, the system architecture and channel models are constructed. Then, the Bayesian learning-based clustered-sparse channel estimation method is designed in Section 4. In Section 5, the noise-corrected clustered-sparse channel estimation method is presented. The evaluation and result analysis are shown in Section 6. In Section 7, conclusions are given.

2. Related Works

The UWA channel is severely band-limited, and exhibits time-varying excessive delay spread. To this end, the sparse structure of the UWA channel can be exploited to improve the estimation performance, as most channel energy of the UWA channel is concentrated on a few paths [18,19]. The paths of the UWA channel also tend to be clustered-sparse [20], which has been frequently observed based on field experiments at different locations [9,10,11,12,13]. A detailed investigation on the variation of the significant paths in each cluster is presented in [21]. Based on the observation of channel sparsity, compressed sensing (CS)-based sparse channel estimation methods [22,23,24] have been studied, such as the greedy matching pursuit (MP) and orthogonal matching pursuit (OMP), which were adopted in [18,25], aiming to estimate the UWA channel impulse response (CIR) and the channel delay-Doppler-spread function, respectively. In order to reduce the convergence error of the OMP algorithm, the compressed sampling matching pursuit (CoSaMP) algorithm is proposed in [26]. However, CoSaMP-based channel estimators require knowledge of the channel sparsity level, which is difficult to obtain accurately in advance in practical applications. The joint sparsity among adjacent OFDM blocks was exploited by the simultaneous OMP (SOMP) [27] for higher estimation accuracy, but only stable path delays were considered and the temporal correlation of path gains was ignored. Typical convex optimization algorithms including basis pursuit (BP) and basis pursuit denoising (BPDN) tackle the NP-hard

l_{0}

norm problem by transforming the

l_{0}

norm to a convex

l_{1}

norm. The BP algorithm was applied to estimate the path delays and Doppler scale factors of the UWA channel in [25], and the BPDN algorithm was proposed to estimate the slowly time-varying UWA channels in [28]. However, the global minimum of the cost function in BP or BPDN is not necessarily consistent with the sparsest solution. An empirical channel variation model, in which different clusters vary independently, was proposed in [14], and demonstrated that adaptation with multiple clusters outperforms that with one cluster for UWA channel estimation.

In recent years, Bayesian learning (BL) has been used to improve the performance of channel estimation. Compared with the CS-based methods that provide point estimation, Bayesian techniques can give the interval estimation of channel response and have a desirable property of preventing structural errors [29]. By using BL-based methods, the high accuracy of channel estimation was achieved in OFDM systems [30,31,32,33]. The temporal multiple sparse Bayesian learning (TMSBL)-based estimation method was further proposed in [32] to estimate the sparse channels by taking advantage of the channel coherence between consecutive OFDM blocks, and it can achieve better performance in strong correlated channels and maintains robustness in weak temporal correlated channels. Due to the high computational complexity of BL, some accelerated algorithms were proposed, such as the approximate message passing (AMP)-based method in [34]. More recently, some researchers utilized deep learning (DL)-based channel estimators for OFDM systems [35,36,37], which showed better channel estimation or equalization performance for nonlinear channel distortion. However, how to go about improving the generalization performance of pre-training DL models and reduce the complexity of training and inferring remain challenging.

3. System Architecture and Channel Models

In this section, the system architecture of Bayesian learning-based channel estimation is presented first. Then, the noise-resistant channel measurement model and the clustered-sparse channel distribution model are constructed.

3.1. Bayesian Learning-Based Channel Estimation Architecture

A typical UWA-OFDM communication scenario was shown at the top of Figure 2. The receiver can obtain the transmitted OFDM signal through dynamic multipaths, and the received signal may be affected by random colored noise. At the bottom of Figure 2, the system architecture of Bayesian learning-based channel estimation is given. To characterize the delay power spectrum and temporal correlation of each cluster in the UWA channel, the clustered-sparse channel distribution model is constructed and the model hyperparameters can be optimized by the iterative partition-based cluster evolution. Demodulated signal and pilot symbols were utilized to measure the instantaneous channel response, and the iterative noise measurement and data detection can improve the anti-noise performance of the channel measurement model. Based on the channel distribution model and the channel measurement model, the Bayesian channel estimation can be obtained.

Important notations used in following sections are given by:

Uppercase and lowercase bold symbols are reserved for matrices and vectors, respectively. Particularly, $I_{M}$ denotes the identity matrix with size $M \times M$ . When the dimension is evident from the context, for simplicity, we only used $I$ ;
$A^{T}$ and $A^{H}$ represent the transpose and conjugate transpose of $A$ , respectively;
$A$ ⊗ $B$ denotes the Kronecker product of the two matrices $A$ and $B$ ;
$v e c (A)$ denotes the vectorization of $A$ formed by stacking its columns into a single column vector;
$T r (A)$ denotes the trace of $A$ ;
$T o e p l i t z (a_{1}, a_{2}, \dots, a_{N})$ denotes a Toeplitz matrix taking $a_{1}, \dots, a_{N}$ as first row;
$d i a g (a)$ or $d i a g (a_{1}, a_{2}, \dots, a_{N})$ denotes a diagonal matrix with principal diagonal elements being $a_{1}, a_{2}, \dots, a_{N}$ in turn; $d i a g (A_{1}, A_{2}, \dots, A_{N})$ denotes a block diagonal matrix with principal diagonal blocks being the square matrices $A_{1}, \dots, A_{N}$ in turn;
If $A$ is a square matrix, $P r d i a g (A)$ denotes a diagonal matrix with principal diagonal elements being the principal diagonal elements of $A$ in turn.
If some terms in a cost function do not contribute to the subsequent optimization of the parameters, ∝ is used to indicate that these terms have been dropped.

3.2. Noise-Resistant Channel Measurement Model

Considering the OFDM system with cyclic prefix (CP) and K subcarriers, the non-overlapping sets of

K_{d}

data subcarriers

x_{d}

,

K_{p}

pilot subcarriers

x_{p}

and

K_{n}

null subcarriers

x_{n}

satisfy

K = K_{d} + K_{p} + K_{n}

. Let T denotes the OFDM symbol period without CP,

T_{c p}

denotes the length of CP and

T_{s} = T + T_{c p}

is the whole OFDM symbol period. The kth subcarrier is at the frequency:

f_{k} = f_{c} + k / T, k = - K / 2, \dots, K / 2 - 1,

(1)

where

f_{c}

is the carrier frequency. Assuming that there are M OFDM blocks in one frame, a passband waveform of the mth OFDM block at time

t^{'} \in [- T_{c p} + m T_{s}, T + m T_{s}]

is then given by

{\tilde{x}}^{(m)} (t) = 2 R e {\frac{1}{\sqrt{K}} \sum_{k = - K / 2}^{K / 2 - 1} x_{k}^{(m)} e^{j 2 π f_{k} t} q (t)},

(2)

where

m = 1, 2, \dots, M

,

t = t' - m T_{s}

,

x_{k}^{(m)}

is defined as a transmitted symbol on the kth subcarrier at the mth OFDM block and q(t) is the pulse shaping filter:

q (t) = \{\begin{matrix} 1 & , t \in [- T_{c p}, T], \\ 0 & , o t h e r w i s e . \end{matrix}

(3)

The UWA channel is a typical sparse multipath channel. For horizontal shallow water multi-path transmission with a range much greater than depth, each path has almost the same Doppler factor [38]. Thus, we assume that all the paths have the equal Doppler scale factor a, and the Doppler effect can be almost eliminated through effective Doppler estimation and compensation. Furthermore, the path delay remains stable across several consecutive OFDM blocks, and the path gains and Doppler scale factors are constant during one OFDM block, but vary from block to block. The time-varying channel response during the mth OFDM block can be expressed as

h^{(m)} (τ, t) = \sum_{l = 0}^{L - 1} A_{l}^{(m)} δ [τ - (τ_{l} - a^{(m)} t)],

(4)

where L is the path number of the multipath channel, which may be distributed in more than one cluster.

A_{l}

and

τ_{l}

denote the gain and delay of the lth path.

Through the time-varying channel, the received passband signal during the mth OFDM block can be expressed as

{\tilde{y}}^{(m)} (t) = \sum_{l = 0}^{L - 1} A_{l}^{(m)} {\tilde{x}}^{(m)} [(1 + a^{(m)}) t - τ_{l}] + {\tilde{w}}^{(m)} (t),

(5)

where

\tilde{w} (t)

is the additive noise. After synchronization, a popular two-step approach [39] can be adopted to mitigate the Doppler effect, which takes a coarse Doppler estimation and resampling as the first step and then performs the fine Doppler shift compensation using the null subcarriers. Moreover, CP can also be used for Doppler shift estimation [40]. Performing CP-OFDM demodulation, the output

K \times 1

vector of the mth OFDM block can be expressed as

y^{(m)} = X^{(m)} F h^{(m)} + w^{(m)},

(6)

where

X^{(m)}

is an

K \times K

diagonal matrix whose diagonal entries are the K transmitted symbols,

F

is an

K \times L

discrete Fourier transform (DFT) matrix and

w^{(m)}

is an

K \times 1

additive noise vector.

h^{(m)}

is the discretized CIR vector as

h^{(m)} = [h_{1}^{(m)}, h_{2}^{(m)}, \dots, h_{L}^{(m)}]

.

For M consecutive OFDM blocks, the transmission model of all

K \times M

symbols can be expressed as

Y = [Φ^{(1)} h^{(1)}, Φ^{(2)} h^{(2)}, \dots, Φ^{(M)} h^{(M)}] + W,

(7)

where

Y = [y^{(1)}, y^{(2)}, \dots, y^{(M)}]

,

W = [w^{(1)}, w^{(2)}, \dots, w^{(M)}]

, and

H = [h^{(1)}, h^{(2)}, \dots, h^{(M)}]

.

Φ^{(m)}

is defined as

Φ^{(m)} ≜ X^{(m)} F

. Considering the comb-type pilot arrangement, the transmission model of

K_{p} \times M

pilot symbols can be expressed as

Y_{P} = Φ_{P} H + W_{P},

(8)

where

Y_{P} = [y_{p}^{(1)}, y_{p}^{(2)}, \dots, y_{p}^{(M)}]

and

W_{P} = [w_{p}^{(1)}, w_{p}^{(2)}, \dots, w_{p}^{(M)}]

are the submatrixs of

Y

and

W

at the location of the pilot subcarriers, respectively.

Φ_{P} ≜ X_{P} F_{P}

is the known dictionary matrix where

X_{P}

is the

K_{p} \times K_{p}

diagonal matrix with the known pilots along its diagonal and

F_{P}

is the

K_{p} \times L

DFT matrix.

By vectorizing

H

as

h = v e c (H^{T})

, a pilot-based channel measurement model can be constructed according to (8), as

y_{p} = U_{P} h + w_{p},

(9)

where

y_{p} = v e c (Y_{P}^{T})

,

w_{p} = v e c (W_{P}^{T})

and

U_{P} = Φ_{P} \otimes I_{M}

. AWGN can be adopted to approximate the additive noise

w_{p}

. However, the attenuation and noise of the UWA channel may vary over the signal bandwidth. When a small number of pilot symbols are used to estimate the UWA channel with strong colored noise, the accuracy of the pilot-based channel measurement will decrease severely. To improve the performance of channel measurement, a noise-resistant channel measurement model is constructed according to (7) and it can be expressed as

y = U h + w,

(10)

where

y = v e c (Y^{T})

,

w = v e c (W^{T})

and

U

is the

K M \times L M

matrix as

U = [\begin{matrix} d i a g {Φ^{(1)} [1, 1], \dots, Φ^{(M)} [1, 1]} & \dots & d i a g {Φ^{(1)} [1, L], \dots, Φ^{(M)} [1, L]} \\ ⋮ & ⋱ & ⋮ \\ d i a g {Φ^{(1)} [K, 1], \dots, Φ^{(M)} [K, 1]} & \dots & d i a g {Φ^{(1)} [K, L], \dots, Φ^{(M)} [K, L]} \end{matrix}],

(11)

where

Φ^{(m)} [i, j]

denotes the entry in the ith row and the jth column of

Φ^{(m)}

. All subcarriers are used for channel measurement in (10), as the unknown data symbols can be obtained by data detection or decoding. Assuming that the power distribution over all subcarriers is non-uniform,

λ = [λ_{- K / 2}, λ_{- K / 2 + 1}, \dots, λ_{K / 2 - 1}]

,

w

in (10) satisfies:

w \sim CN (0, Λ),

(12)

where

Λ = d i a g (λ) \otimes I_{M}

.

3.3. Clustered-Sparse Channel Distribution Model

According to the measurement equation in (9) or (10), the channel vector

h

can be estimated. However, the UWA channel usually has an excessive multipath delay spread. With so many undetermined channel taps, the prior clustered-sparse structure of the UWA channel can be exploited to obtain the sparse solution of the channel vector. First, we assume all that the L paths (i.e., L rows) in

H

are mutually independent, which can form C clusters. There are

L_{d}

paths in the dth cluster and these clusters do not overlap, each of which occupies dense path delays. With the same time correlation characteristics, the dth cluster satisfies a first-order auto-regressive (AR) model, given by

H_{d} [i, j + 1] = β_{d} H_{d} [i, j] + \sqrt{1 - | β_{d} |^{2}} V_{d} [i, j],

(13)

where

β_{d}

is the temporal correlation coefficient of the dth cluster.

H_{d}

is a

L_{d} \times M

submatrix of

H

,

i = 1, 2, \dots, L_{d}

and

j = 1, 2, \dots, M - 1

are the row and column indices of

H_{d}

, respectively.

V_{d}

is the model noise matrix with complex Gaussian distribution and assumed that

v e c (V_{d}^{T}) \sim CN (0, Γ_{d} \otimes I_{M})

, where:

Γ_{d} = d i a g (γ_{d, 1}, γ_{d, 2}, \dots, γ_{d, L_{d}})

(14)

is a positive semi-definite diagonal real-valued matrix. Although a higher-order AR model may express better model approximation performance, it would lead to higher complexity. More importantly, a higher-order AR model means a higher overfitting risk, especially when there are only a few OFDM blocks in one frame. Hence, a first-order AR process was adopted to model the temporal correlation characteristic for each cluster of CIRs.

By vectorizing

H_{d}

as

h_{d} = v e c (H_{d}^{T})

, the parameteric Bayesian prior of

h_{d}

can be written by

p (h_{d}; β_{d}, γ_{d}) = CN (0, Γ_{d} \otimes B_{d}),

(15)

where

γ_{d} = [γ_{d, 1}, γ_{d, 2}, \dots, γ_{d, L_{d}}]

controls the sparsity of the

L_{d}

paths in the dth cluster. When

T r (Γ_{d}) \to 0

, the associated

h_{d} \to 0

. In other words,

Γ_{d}

reflects the probability density of the dth cluster.

B_{d}

is a Toeplitz matrix with the following form:

B_{d} = T o e p l i t z (1, β_{d}, \dots, β_{d}^{M - 1}),

(16)

and it reflects the variation characteristics of the dth cluster.

Γ_{d}

and

B_{d}

jointly determine the cluster distribution of the dth cluster. Accordingly, the prior probability density function of

h

can be written as

p (h; γ_{1}, γ_{2}, \dots, γ_{C}, β_{1}, β_{2}, \dots, β_{C}) = CN (0, \sum_{0}),

(17)

where

\sum_{0} = d i a g (Γ_{1} \otimes B_{1}, Γ_{2} \otimes B_{2}, \dots, Γ_{C} \otimes B_{C})

.

4. Bayesian Learning-Based Clustered-Sparse Channel Estimation

In this section, the parametric form of Bayesian channel estimation is given first, and followed by the cluster distribution learning. Then, the partition-based clustered-sparse Bayesian learning algorithm is presented to obtain the clustered-sparse distribution and Bayesian channel estimate. At last, the complexity and performance analysis of this algorithm is given.

4.1. Bayesian Channel Estimation

To simplify the implementation complexity of Bayesian channel estimation, only known pilot symbols are used for channel measurement. As pilot symbols are sparsely distributed, it is difficult to estimate the non-uniform power spectrum at the location of pilot subcarriers. Therefore, the additive noise in (9) is approximated as white Gaussian noise with noise variance

λ

in this section. According to (9) and (17), using the Bayes rule, the posterior density of

h

can be obtained as

p (h | y_{p}; λ, γ_{1}, γ_{2} \dots, γ_{C}, β_{1}, β_{2}, \dots β_{C}) = CN (μ_{h}, \sum_{h}),

(18)

which is the complex Gaussian distribution with the posterior mean and covariance matrix:

μ_{h} = λ^{- 1} \sum_{h} U_{P}^{H} y_{p} = \sum_{0} U_{P}^{H} {(λ I + U_{P} \sum_{0} U_{P}^{H})}^{- 1} y_{p},

(19)

\sum_{h} = {(\sum_{0}^{- 1} + λ^{- 1} U_{P}^{H} U_{P})}^{- 1} = \sum_{0} - \sum_{0} U_{P}^{H} {(λ I + U_{P} \sum_{0} U_{P}^{H})}^{- 1} U_{P} \sum_{0} .

(20)

Obviously, the maximum posterior (MAP) estimates

{\hat{h}}_{M A P} ≜ μ_{h}

and the covariance matrix

\sum_{h}

are related to the hyperparameters

[λ, γ_{1}, γ_{2}, \dots, γ_{C}, β_{1}, β_{2}, \dots, β_{C}]

. Using

K_{n} \times M

null subcarriers

Y_{N} = [y_{n}^{(1)}, \dots, y_{n}^{(M)}]

,

λ

can be estimated as

λ = \frac{1}{K} E [∥ y^{(m)} ∥_{2}^{2}] \approx \frac{1}{M K_{n}} \sum_{j = 1}^{M} \sum_{i = 1}^{K_{n}} {| Y_{N} [i, j] |}^{2},

(21)

where i and j are the row and column indexes of

Y_{N}

, respectively. The remaining hyperparameters

[γ_{1}, γ_{2}, \dots, γ_{C}, β_{1}, β_{2}, \dots, β_{C}]

are related to the cluster distribution information.

4.2. Cluster Distribution Learning

The hyperparameters

θ ≜ [γ_{1}, γ_{2}, \dots, γ_{C}, β_{1}, β_{2}, \dots, β_{C}]

can be estimated by maximizing the marginal likelihood function

p (y_{p}; θ)

. This is equivalent to minimizing

- l o g p (y_{p}; θ)

, giving the effective cost function:

L (θ) = y_{p}^{H} \sum_{y} p^{- 1} (θ) y_{p} + l o g | \sum_{y} (θ) |,

(22)

where

\sum_{y} p (θ) = λ I + U_{P} \sum_{0} (θ) U_{P}^{H}

. The above problem cannot be solved in closed form, and the expectation maximization (EM) method [41] can be employed to solve it iteratively. For unknown values of hyperparameters governing the prior density,

h

is considered as the nuisance variable and

θ

is estimated by maximizing:

Q (θ | θ^{(o l d)}) = E_{h | y_{p}; θ^{(o l d)}} [l o g p (y_{p}, h; θ)] = E_{h | y_{p}; θ^{(o l d)}} [l o g p (y_{p} | h; λ)] + E_{h | y_{p}; θ^{(o l d)}} [l o g p (h; θ)],

(23)

where

θ^{(o l d)}

denotes the estimated hyperparameters in the previous iteration. Ignoring the first term unrelated to

θ

, the Q Function (23) can be simplified to:

\begin{matrix} Q (β_{1}, β_{2}, \dots, β_{C}, & γ_{1}, γ_{2}, \dots, γ_{C}) = E_{h | y_{p}; θ^{(o l d)}} \{l o g p [h; B_{1}, B_{2}, \dots, B_{C}, Γ_{1}, Γ_{2}, \dots, Γ_{C}]\} \\ \propto & - \frac{1}{2} \sum_{d = 1}^{C} [l o g (| Γ_{d} |^{M} | B_{d} |^{L_{d}}) + h_{d}^{H} (Γ_{d}^{- 1} \otimes B_{d}^{- 1}) h_{d}] \\ \propto & - \frac{1}{2} \sum_{d = 1}^{C} \{M l o g (| Γ_{d} |) + L_{d} l o g (| B_{d} |) + T r [(Γ_{d}^{- 1} \otimes B_{d}^{- 1}) (\sum_{h}^{d} + μ_{h}^{d} {(μ_{h}^{d})}^{H})]\} \end{matrix},

(24)

where

h_{d} \in C^{M L_{d} \times 1}

,

μ_{h}^{d} \in C^{M L_{d} \times 1}

and

\sum_{h}^{d} \in C^{M L_{d} \times M L_{d}}

denote the corresponding dth cluster in

h \in C^{M L \times 1}

,

μ_{h} \in C^{M L \times 1}

and

\sum_{h} \in C^{M L \times M L}

, respectively.

μ_{h}

and

\sum_{h}

are evaluated according to (19) and (20), given the estimated hyperparameters

θ^{(o l d)}

and the noise variance

λ

. As shown in (14) and (16),

Γ_{d}

and

B_{d}

are determined by the hyperparameters

γ_{d}

and

β_{d}

, respectively.

The partial derivative of (24) with respect to

γ_{d, i}

(d = 1, 2, \dots, C; i = 1, 2 \dots, L_{d})

is given by

\frac{\partial Q}{\partial γ_{d, i}} = - \frac{M}{2 γ_{d, i}} + \frac{1}{2 γ_{d, i}^{2}} T r [B_{d}^{- 1} (\sum_{h}^{d, i} + μ_{h}^{d, i} {(μ_{h}^{d, i})}^{H})],

(25)

where

μ_{h}^{d, i} \in C^{M \times 1}

denotes the corresponding ith path in

μ_{h}^{d}

, and

\sum_{h}^{d, i} \in C^{M \times M}

is the corresponding principal diagonal block for the ith path in

\sum_{h}^{d}

. So the learning rule for

γ_{d, i}

can be given by

γ_{d, i} = \frac{1}{M} T r [B_{d}^{- 1} (\sum_{h}^{d, i} + μ_{h}^{d, i} {(μ_{h}^{d, i})}^{H})] .

(26)

The gradient of (24) over

B_{d}

(d = 1, 2, \dots, C)

is:

\frac{\partial Q}{\partial B_{d}} = - \frac{L_{d}}{2} B_{d}^{- 1} + \frac{1}{2} \sum_{i = 1}^{L_{d}} \frac{1}{γ_{d, i}} B_{d}^{- 1} (\sum_{h}^{d, i} + μ_{h}^{d, i} {(μ_{h}^{d, i})}^{H}) B_{d}^{- 1} .

(27)

Thus, the learning rule for

B_{d}

can be derived as

B_{d} = \frac{1}{L_{d}} \sum_{i = 1}^{L_{d}} \frac{\sum_{h}^{d, i} + μ_{h}^{d, i} {(μ_{h}^{d, i})}^{H}}{γ_{d, i}} .

(28)

here, we constrain

B_{d}

to have the form as shown in (16), and thus

β_{d}

can be empirically calculated as

β_{d} ≜ \frac{α_{1}}{α_{0}}

where

α_{0}

(resp.

α_{1}

) is the average of the elements along the main diagonal (resp. the main sub-diagonal) of the matrix

B_{d}

in (28).

4.3. Partition-Based Clustered-Sparse Bayesian Learning Algorithm

When the cluster partition information is known, the hyperparameters

θ

, the posterior mean

μ_{h}

and the posterior covariance

\sum_{h}

can be iteratively obtained through the EM method. In other words, cluster partition and cluster distribution jointly determine the clustered-sparse channel distribution. However, it is usually difficult to obtain the cluster partition in advance. As shown in Figure 1, several significant clusters with high energy in CIRs are scattered over a long time spread, and usually do not overlap each other. As shown in (28), the boundary information of these clusters is important for calculating

B_{d}

, because different clusters have different coherence times (i.e., different AR coefficients). In particular, some weak clusters that can be ignored will cause the hyperparameter estimation error of significant clusters when these clusters are not distinguished. Therefore, the PB-CSBL algorithm is proposed to update cluster partition by pruning the paths with very small energy (i.e.,

γ_{d, i} \to 0

). As shown in Algorithm 1, the PB-CSBL algorithm mainly includes three sub-parts: Channel Estimation, Cluster Partition, Cluster Evolution, and they will loop until the iteration stop condition is reached. In the sub-part of Cluster Partition, only the dense paths with high energy are retained to form clusters and the paths with negligible energy are pruned. For a cluster with some pruned paths, it can further split into more clusters according to the positions of the pruned paths. To reduce the impact of an inaccurate clustering structure at the beginning of the iteration, the maximum possible number of discrete paths in one cluster is given. In the sub-part of Cluster Evolution, the derived cluster distribution learning rules are included. A good initial estimation of the unknown hyperparameters is significant for achieving the global maximum instead of a local maximum. For practical implementation, it was found that an initial estimation given by

Γ_{d} = I_{L_{d}}

and

B_{d} = I_{M}

was sufficient for the proposed PB-CSBL algorithm.

Algorithm 1: Partition-Based Clustered-Sparse Bayesian Learning Algorithm

Input: the received pilot signal

y_{p}

; the dictionary matrix

U_{P}

; the noise variance

λ

;

the length of discrete paths L; the maximum number of iterations

r_{m a x}

;

the maximum number of discrete paths in one cluster

L C_{m a x}

; the threshold for

prunning small hyperparameters

γ_{t h}

; the threshold to stop the whole

algorithm

ϵ

.

Initialize: the list of path power

γ_{l i s t} = 1_{L}

; the number of clusters

C = 1

;

the list of cluster structure

CLS

:

Γ_{1} = I_{L}

,

B_{1} = I_{M}

and the delay range of

the 1st cluster

R_{d} = [0, L - 1]

; the iteration counter

r = 0

.

Channel Estimation:

1:

\sum_{0} \leftarrow d i a g (Γ_{1} \otimes B_{1}, Γ_{2} \otimes B_{2}, \dots, Γ_{C} \otimes B_{C})

.

2:

\sum_{h} = \sum_{0} - \sum_{0} U_{P}^{H} {(λ I + U_{P} \sum_{0} U_{P}^{H})}^{- 1} U_{P} \sum_{0}

.

3:

μ_{h} \leftarrow λ^{- 1} \sum_{h} U_{P}^{H} y_{p}

.

Cluster Evolution:

4: for

d = 1, 2, \dots, C

do

5: for

i = 1, 2, \dots, L_{d}

do

6:

γ_{d, i} \leftarrow \frac{1}{M} T r [B_{d}^{- 1} (\sum_{h}^{d, i} + μ_{h}^{d, i} {(μ_{h}^{d, i})}^{H})]

, and update

γ_{l i s t}

with

γ_{d, i}

.

7: end for

8:

L_{d}^{'} \leftarrow m i n (L_{d}, L C_{m a x})

.

9:

B_{d} \leftarrow \frac{1}{L_{d}^{'}} \sum_{i = 1}^{L_{d}^{'}} \frac{\sum_{h}^{d, i} + μ_{h}^{d, i} {(μ_{h}^{d, i})}^{H}}{γ_{d, i}}

using the

L_{d}^{'}

most significant continuous paths.

10:

β_{d} \leftarrow \frac{α_{1}}{α_{0}}

, where

α_{1}

and

α_{0}

can be obtained through

B_{d}

.

11: end for

Cluster Partition:

12:

p^{(0)} \leftarrow

FindIndex

(0 < γ_{l i s t} < γ_{t h})

.

13: if

p^{(0)}

is not empty then

14:

γ_{l i s t}^{(o l d)} \leftarrow γ_{l i s t}

and

γ_{l i s t} [p^{(0)}] \leftarrow 0

.

15:

CLS \leftarrow

Split(

CLS

,

p^{(0)}

), where

CLS

is splitted into C clusters

according to

p^{(0)}

.

16:

p^{(1)} \leftarrow

FindIndex

(γ_{l i s t} \geq γ_{t h})

and

U_{P} \leftarrow U_{P} [:, p^{(1)} \otimes (1 : M)]

.

17: end if

18: for

d = 1, 2, \dots, C

do

19: Update

R_{d}

,

Γ_{d}

and

B_{d}

in

CLS

according to

p^{(0)}

,

γ_{l i s t}

and

β_{d}

, respectively.

20: end for

Check stopping conditions:

21:

r \leftarrow r + 1

.

22: return the sub-part of

Channel Estimation

until

r \geq r_{m a x}

or

∥ γ_{l i s t} - γ_{l i s t}^{(o l d)} ∥_{2}^{2} < ϵ

.

Output: the pruned channel

μ_{h}

with the covariance matrix

\sum_{h}

and the

cluster list

CLS

.

4.4. Complexity and Performance Analysis

Considering the excessive delay spread of the UWA channel and the purpose of saving pilot overhead, we assumed

K_{d} \geq L \geq K_{p} \geq M

. The time complexity of the proposed PB-CSBL algorithm is compared with the least squares (LS) [39], OMP [25], SOMP [27] and TMSBL [32]. In order to avoid involving matrix inversion, pilot subcarriers are equispaced in the LS method. For fair comparison, the residual Doppler expansion is ignored when evaluating the time complexity of the OMP method. The comparison of time complexity is shown in Table 1, where

\bar{N}

denotes the average iteration number. It can be seen that the time complexity of the PB-CSBL algorithm is higher than that of the LS, OMP and SOMP. This is due to the fact that the temporal correlation of CIRs is ignored in the LS and OMP. Although the stability of path delays is exploited in the SOMP, the correlation of path gains is discarded. Therefore, LS, OMP and SOMP may exhibit worse channel estimation performance compared to the PB-CSBL algorithm. As for the TMSBL with

O (L K_{p}^{2})

per iteration, it has lower time complexity than the proposed PB-CSBL with

O (L K_{p}^{2} M^{3})

per iteration. This is mainly because it adopts the measure in [41], assuming that there is no temporal correlation among consecutive CIRs or no additive noise in received signal, but this may bring some performance loss, especially for slow-varying or high-noise channels. Therefore, when M is small, we do not adopt the simplification in [41]. The computational complexity of the PB-CSBL algorithm can be further reduced by implementing a first-order algorithm in E-step [34], but the implementation details are omitted here.

According to (20), the mean square error (MSE) bound of the sparse vector

h

can be expressed as

E [∥ h - \hat{h} ∥_{2}^{2}] \geq T r (\sum_{h}) = T r [\sum_{0} - \sum_{0} U_{P}^{H} {(λ I + U_{P} \sum_{0} U_{P}^{H})}^{- 1} U_{P} \sum_{0}],

(29)

where

\hat{h}

is the estimate of

h

, and its estimation accuracy is related to the hyperparameter vector

θ

obtained by maximizing the marginal likelihood function. Thus, the lower bound of MSE is reached (i.e., equality holds) only when the perfect hyperparameters are obtained.

Assume that all L paths are in one cluster (i.e.,

C = 1

) and consider two special cases of

θ

:

(1): The temporal correlation coefficient $β = 0$ .
In this case, there is no temporal correlation among CIRs. It is assumed that each transmitted symbol is normalized to unit power. Then, the MSE bound of $h^{(m)}$ can be expressed by

$E [∥ h^{(m)} - {\hat{h}}^{(m)} ∥_{2}^{2}] \geq T r [{(Γ^{- 1} + λ^{- 1} Φ_{P}^{H} Φ_{P})}^{- 1}] = \sum_{l = 1}^{L} {(\frac{K_{p}}{λ} + \frac{1}{γ_{l}})}^{- 1},$

(30)

where $Γ = d i a g (γ)$ , $γ = [γ_{1}, γ_{2}, \dots, γ_{L}]$ and $γ_{l}$ controls the variance of the lth channel coefficient.
(2): The temporal correlation coefficient $β = 1$
In this case, the CIRs are time-invariant across M OFDM blocks. Thus, the additive noise variance $λ$ can be reduced to $λ / M$ , and the MSE bound of $h^{(m)}$ can be expressed by

$E [∥ h^{(m)} - {\hat{h}}^{(m)} ∥_{2}^{2}] \geq \sum_{l = 1}^{L} {(\frac{M K_{p}}{λ} + \frac{1}{γ_{l}})}^{- 1} .$

(31)

As shown in the case of the high temporal correlation (i.e.,

β = 1

), the anti-noise performance of channel estimation can be improved. As for the low temporal correlation among CIRs (i.e.,

β = 0

), the anti-noise performance is not significantly improved. However, with more reference information, the better hyperparameter vector

γ

can be obtained. Therefore, when multiple OFDM blocks are utilized simultaneously, higher channel estimation accuracy can be achieved.

5. Noise-Corrected Clustered-Sparse Channel Estimation

In this section, the parametric form of noise-resistant Bayesian channel estimation is given first. Then, the data detection and noise measurement are derived. Then, the noise-corrected clustered-sparse channel estimation algorithm is presented. At last, the complexity and performance analysis of this algorithm is given.

5.1. Noise-Resistant Bayesian Channel Estimation

As the number of pilots

K_{p}

decreases, the precision of channel estimation using the PB-CSBL algorithm will rapidly deteriorate. This is primarily because the algorithm does not utilize all the information available. Specifically, although both pilot and data symbols are transmitted, only the

K_{p} \times M

matrix

Y_{P}

corresponding to the pilot subcarriers in (8) is used for estimating the CIRs. For the matrix

Y

in (7), the remaining

K_{d} \times M

observations are typically not used as they contain the unknown data symbols. If the unknown data symbols are perfectly obtained, using the Bayes rule, the posterior mean of CIRs based on the prior probability in (17) and the channel measurement equation in (10) can be written by

{\bar{μ}}_{h} = \sum_{0} U^{H} {(Λ + U \sum_{0} U^{H})}^{- 1} y,

(32)

with the posterior covariance matrix:

{\sum^{¯}}_{h} = \sum_{0} - \sum_{0} U^{H} {(Λ + U \sum_{0} U^{H})}^{- 1} U \sum_{0} .

(33)

Moreover, AWGN is assumed to simplify implementation complexity in the PB-CSBL algorithm, as the noise power distribution

λ = [λ_{- K / 2}, λ_{- K / 2 + 1}, \dots, λ_{K / 2 - 1}]

depends on the observation of all subcarriers. However, in a complex UWA environment, the noise power may vary significantly over the signal bandwidth. Using decided data symbols and corrected colored noise, the high estimation accuracy can be achieved by the MAP estimate in (32).

5.2. Data Detection and Noise Measurement

To obtain the MAP estimate in (32), the hyperparameters related to the cluster distribution,

θ ≜ [γ_{1}, γ_{2}, \dots, γ_{C}, β_{1}, β_{2}, \dots, β_{C}]

, can be iteratively updated, as shown in the proposed PB-CSBL algorithm, however, the detection of data signal

x_{d}^{(m)}

and the noise distribution

λ

are still unresolved. Using the notations

x_{d} ≜ [{(x_{d}^{(1)})}^{T}, {(x_{d}^{(2)})}^{T}, \dots, {(x_{d}^{(M)})}^{T}]

,

φ ≜ [λ, x_{d}]

, and

ζ ≜ [θ, φ]

, the unknown hyperparameters set

ζ

can be estimated by maximizing the marginal likelihood function

p (y; ζ)

. This is equivalent to minimizing

- l o g p (y; ζ)

, giving the effective cost function:

L (ζ) = y^{H} \sum_{y}^{- 1} (ζ) y + l o g | \sum_{y} (ζ) |,

(34)

where

\sum_{y} (ζ) = Λ (λ) + U (x_{d}) \sum_{0} (θ) U^{H} (x_{d})

. Similar to the cluster distribution learning, the expectation maximization (EM) method [41] can be employed to solve (34) iteratively. While

h

is considered as the nuisance variable,

ζ

can be estimated by maximizing:

Q (ζ | ζ^{(o l d)}) = E_{h | y; ζ^{(o l d)}} [l o g p (y, h; ζ)] = E_{h | y; ζ^{(o l d)}} [l o g p (y | h; φ)] + E_{h | y; ζ^{(o l d)}} [l o g p (h; θ)],

(35)

where

ζ^{(o l d)}

denotes the estimated hyperparameters in the previous iteration. As shown in (35), the joint maximization simplifies into two independent maximizations over

θ

and

φ

. On optimizing the latter function in (35) with respect to

θ

, one obtains the learning rules about

θ

as (26) and (28). Maximizing the former over

x_{d}^{(m)}

yields the minimum mean squared error (MMSE) equalizer:

{\hat{x}}_{d}^{(m)} = {[d i a g (λ_{d}) C_{X}^{- 1} + P r d i a g (F_{D} Ξ_{h}^{(m)} F_{D}^{H})]}^{- 1} d i a g [{({\bar{μ}}_{h}^{(m)})}^{H} F_{D}^{H}] y_{d}^{(m)},

(36)

where

y_{d}^{(m)}

and

λ_{d}

are the subvectors of

y^{(m)}

and

λ

at the location of data subcarriers, respectively.

C_{X}

is the covariance matrix of

x_{d}^{(m)}

,

F_{D}

is the

K_{d} \times L

DFT matrix at the location of data subcarriers.

Ξ_{h}^{(m)}

is defined as

Ξ_{h}^{(m)} ≜ {\bar{μ}}_{h}^{(m)} {({\bar{μ}}_{h}^{(m)})}^{H} + {\sum^{¯}}_{h}^{(m)}

, where

{\bar{μ}}_{h}^{(m)} \in C^{L \times 1}

and

{\sum^{¯}}_{h}^{(m)} \in C^{L \times L}

denote the posterior mean and covariance matrix of the mth OFDM block, respectively. The decisions on the data symbols,

x_{d}^{(m)}

, are obtained by mapping the soft decisions,

{\hat{x}}_{d}^{(m)}

, to the closest point on the constellation, meaning that:

x_{d}^{(m)} = Q u a n t i z a t i o n ({\hat{x}}_{d}^{(m)}) .

(37)

Although the noise distribution is non-uniform, the noise powers of adjacent subcarriers are usually close (especially when the number of subcarriers is large). Then, we divide all subcarriers into non-overlapping Q groups, so that

K / Q

subcarriers can be grouped into the same one. Thus, the average noise power of the qth group can be calculated as

λ_{g}^{(q)} = \frac{Q}{K M} T r [y^{(q)} {(y^{(q)})}^{H} - U^{(q)} Ξ_{h} {(U^{(q)})}^{H}],

(38)

where

Ξ_{h}

is defined as

Ξ_{h} ≜ {\bar{μ}}_{h} {({\bar{μ}}_{h})}^{H} + {\sum^{¯}}_{h}

.

U \in C^{K M \times L M}

can be calculated according to (11), as unknown data symbols are estimated by (36) and (37).

U^{(q)} \in C^{\frac{K M}{Q} \times L M}

is the submatrix of

U

and denotes the qth group.

y^{(q)}

denotes the qth group of

y

. After

λ_{g}^{(1)}, λ_{g}^{(2)}, \dots, λ_{g}^{(Q)}

are calculated according to (38), the non-uniform noise power distribution across all subcarriers,

λ = [λ_{- K / 2}, λ_{- K / 2 + 1}, \dots, λ_{K / 2 - 1}]

, can be obtained by upsampling the Q average noise powers.

5.3. Noise-Corrected Clustered-Sparse Channel Estimation Algorithm

Based on the noise-resistant Bayesian channel estimation, noise measurement and data detection, we can develop the PB-CSBL algorithm into the NC-CSCE Algorithm. As shown in Algorithm 2, the sub-parts of Cluster Partition and Cluster Evolution have the similar process to the PB-CSBL algorithm. With the aid of Data Detection and Noise Measurement, the estimation accuracy in Channel Estimation is improved. Conversely, the improvement of channel estimation also ensures the high accuracy of data detection and noise measurement results. To achieve the global maximum instead of a local maximum, the NC-CSCE algorithm requires a good initial estimate with respect to the unknown cluster distribution, data signal and noise distribution. However, it is obviously difficult to initialize so many hyperparameters at the same time. Therefore, the PB-CSBL algorithm is utilized to initialize

{\bar{μ}}_{h}

and

{\sum^{¯}}_{h}

instead of initializing these hyperparameters. Once

{\bar{μ}}_{h}

and

{\sum^{¯}}_{h}

are initialized, these hyperparameters can be calculated as shown in Algorithm 2.

5.4. Complexity and Performance Analysis

The NC-CSCE algorithm utilizes all available subcarriers to achieve better channel estimation performance. Therefore, compared with the proposed PB-CSBL algorithm with

O (L K_{p}^{2} M^{3})

per iteration, the NC-CSCE algorithm has higher time complexity, i.e.,

O ({(K_{d} + K_{p})}^{3} M^{3})

per iteration. Similar to the PB-CSBL algorithm, the complexity of the NC-CSCE algorithm can be further simplified by using the techniques in [34,41], however, these simplifications may lead to the performance degradation of channel estimation because of the errors introduced by the approximations. The details of the simplifications are omitted here.

Algorithm 2: Noise-Corrected Clustered-Sparse Channel Estimation Algorithm

Input: the received signal

y

; the noise variance

λ

; the length of discreted paths L;

the maximum number of iterations

r_{m a x}

; the maximun number of discreted

paths in one cluster

L C_{m a x}

; the threshold for prunning small

hyperparameters

γ_{t h}

; the threshold to stop the whole algorithm

ϵ

.

Initialize: The PB-CSBL algorithm is utilized to obtain the posterior mean

{\bar{μ}}_{h} \leftarrow μ_{h}

and covariance matrix

{\sum^{¯}}_{h} \leftarrow \sum_{h}

; the noise vector

λ = λ I_{K}

; the list of path

power

γ_{l i s t} = 1_{L}

; the iteration counter

r = 0

.

Cluster Evolution:

1: for

d = 1, 2, \dots, C

do

2: for

i = 1, 2, \dots, L_{d}

do

3:

γ_{d, i} \leftarrow \frac{1}{M} T r [B_{d}^{- 1} ({\sum^{¯}}_{h}^{d, i} + {\bar{μ}}_{h}^{d, i} {({\bar{μ}}_{h}^{d, i})}^{H})]

, and update

γ_{l i s t}

with

γ_{d, i}

.

4: end for

5:

L_{d}^{'} \leftarrow m i n (L_{d}, L C_{m a x})

.

6:

B_{d} \leftarrow \frac{1}{L_{d}^{'}} \sum_{i = 1}^{L_{d}^{'}} \frac{{\sum^{¯}}_{h}^{d, i} + {\bar{μ}}_{h}^{d, i} {({\bar{μ}}_{h}^{d, i})}^{H}}{γ_{d, i}}

using the

L_{d}^{'}

most significant continuous paths.

7:

β_{d} \leftarrow \frac{α_{1}}{α_{0}}

, where

α_{1}

and

α_{0}

can be obtained through

B_{d}

.

8: end for

Cluster Partition:

9:

p^{(0)} \leftarrow

FindIndex

(0 < γ_{l i s t} < γ_{t h})

.

10: if

p^{(0)}

is not empty then

11:

γ_{l i s t}^{(o l d)} \leftarrow γ_{l i s t}

and

γ_{l i s t} [p^{(0)}] \leftarrow 0

.

12:

CLS \leftarrow

Split(

CLS

,

p^{(0)}

), where

CLS

is splitted into C clusters according

to

p^{(0)}

.

13: end if

14: for

d = 1, 2, \dots, C

do

15: Update

R_{d}

,

Γ_{d}

and

B_{d}

in

CLS

according to

p^{(0)}

,

γ_{l i s t}

and

β_{d}

, respectively.

16: end for

Data Detection:

17: Refer to (36) and (37).

Noise Measurement:

18: Update the noise vector

λ

according to (38).

Channel Estimation:

19:

\sum_{0} \leftarrow d i a g (Γ_{1} \otimes B_{1}, Γ_{2} \otimes B_{2}, \dots, Γ_{C} \otimes B_{C})

.

20: Obtain

U

according to (11).

21:

p^{(1)} \leftarrow

FindIndex

(γ_{l i s t} \geq γ_{t h})

and

U \leftarrow U [:, p^{(1)} \otimes (1 : M)]

.

22: Refer to (32) and (33).

Check stopping conditions:

23:

r \leftarrow r + 1

.

24: return the sub-part of

Cluster Evolution

until

r \geq r_{m a x}

or

∥ γ_{l i s t} - γ_{l i s t}^{(o l d)} ∥_{2}^{2} < ϵ

.

Output: the pruned channel

{\bar{μ}}_{h}

with the covariance matrix

{\sum^{¯}}_{h}

, the cluster list

CLS

,

the noise vector

λ

, and the transmitted data symbols

x_{d}^{(m)}

for

m = 1, 2, \dots, M

.

Based on the noise-resistant Bayesian channel estimation in the NC-CSCE algorithm, the MSE bound of the sparse vector

h

can be expressed as

E [∥ h - \hat{\bar{h}} ∥_{2}^{2}] \geq T r ({\sum^{¯}}_{h}) = T r [\sum_{0} - \sum_{0} U^{H} {(Λ + U \sum_{0} U^{H})}^{- 1} U \sum_{0}],

(39)

where

\hat{\bar{h}}

is the estimate of

h

. The MSE bound is related to the hyperparameters of the cluster distribution, data symbols and noise distribution. Only when the perfect hyperparameters are obtained is the lower bound of MSE reached. Similarly to analyzing the PB-CSBL algorithm, we assume that all L paths are in one cluster (i.e.,

C = 1

) and consider the two special cases, namely the temporal correlation coefficient

β = 0

and

β = 1

. For ease of analysis, the noise power keeps constant

\bar{λ}

over all subcarriers. Furthermore, each transmitted symbol is normalized to unit power. Then, the MSE bounds of

h^{(m)}

can be evaluated by

E [∥ h^{(m)} - {\hat{\bar{h}}}^{(m)} ∥_{2}^{2}] \geq \sum_{l = 1}^{L} {(\frac{K_{p} + K_{d}}{\bar{λ}} + \frac{1}{γ_{l}})}^{- 1},

(40)

E [∥ h^{(m)} - {\hat{\bar{h}}}^{(m)} ∥_{2}^{2}] \geq \sum_{l = 1}^{L} {(\frac{M (K_{p} + K_{d})}{\bar{λ}} + \frac{1}{γ_{l}})}^{- 1},

(41)

respectively. In addition to what has been discussed when analyzing the PB-CSBL algorithm, Equations (40) and (41) also prove that the NC-CSCE algorithm utilizing pilot symbols and unknown data symbols can achieve a smaller MSE compared with the PB-CSBL algorithm.

6. Evaluation and Result Analysis

In this section, to evaluate the performance of the proposed algorithms, numerical simulations and a lake trial were conducted. The parameter settings of CP-OFDM in the simulations and lake trial are shown in Table 2. The bandwidth, carrier frequency and sampling frequency of the lake trial are closely related to the transducers used in the trial, and these values are set smaller in the simulation system to increase the simulation speed. Because of the high cost of the lake trial, the CP length, the number of pilot subcarriers and the number of null subcarriers are designed to be larger than the simulation system to ensure the availability of trial data. In the simulation system (the specifications of our computer are as follows: Intel(R) Core(TM) i5-6400 CPU 2.7 GHz (4 cores), 16 GB RAM, 1 TB memory with Windows and MATLAB installed), it is assumed that the UWA channel has 12 randomly generated paths, and that these paths remain fixed within one OFDM frame. The maximum path delay is less than CP length, and the amplitudes of paths are Rayleigh distributed with the average power decreasing exponentially with delay. The Doppler scale factor

a \in (0, 10^{- 3})

is randomly chosen. Moreover, there is no channel code module in the simulation system.

6.1. Simulation Results

Firstly, the proposed PB-CSBL and NC-CSCE algorithms were compared with the LS, OMP, SOMP and TMSBL for channel estimation. All paths are distributed in three non-overlapping clusters, and these clusters are set with three different temporal correlation coefficients ranges over multiple OFDM blocks, which are

[0.1, 0.3]

,

[0.5, 0.7]

,

[0.8, 1.0]

. The additive noise is assumed to be white Gaussian noise, and thus all subcarriers are divided into the same group (i.e.,

Q = 1

) for the proposed NC-CSCE algorithm. The transmitted data bits are mapped to QPSK or 16-QAM constellation symbols. The performance metrics used in the numerical simulations are the MSE of CIRs and BER of data bits, which are verified in 500 simulations for each value of SNR.

Figure 3a,b show the MSE and BER performance of different channel estimation methods, respectively. LS performs worst, because the channel sparsity is not considered for this underdetermined system. Based on the joint estimation of multiple OFDM blocks, the SOMP makes full use of the strong temporal correlation of path delays and achieves better performance than the OMP. The delay and correlation of path gains are analyzed in the TMSBL, which makes it better than the SOMP. However, the proposed PB-CSBL performs even better, as it can learn different temporal correlation coefficients and reduce the interference among clusters. The proposed NC-CSCE algorithm utilizes all received pilot and data symbols for the channel estimation, which achieves the lowest BER close to the perfect CSI. When 16-QAM mapping is adopted, Figure 4 proves that the proposed algorithms still maintain good estimation performance. However, in the low SNR region from 0 to 10 dB, the NC-CSCE algorithm does not perform well. This is mainly because high-order modulation at low SNR is prone to symbol decision errors, which may be exacerbated in the iterative process of the NC-CSCE algorithm.

Then, to verify the estimation performance of the proposed algorithms under extreme temporal correlation coefficients (i.e.,

β = 0

and

β = 1

), the proposed algorithms are compared with their theoretical lower bounds. Multipaths are assumed to be sparse and share the same correlation coefficient. Then, under QPSK mapping and white Gaussian noise, the mean square error performance is presented in Figure 5. When

β = 0

, the MSE of the proposed algorithms is close to the theoretical lower bound in a high SNR region. A significant gap exists in the low SNR region, especially for the NC-CSCE algorithm. This is caused by imperfect hyperparameter estimation, as the lower bound is only reached when the perfect hyperparameters are obtained. When

β = 1

, the gap is more obvious in the whole SNR region because of the hyperparameter estimation error. However, the MSE performance with

β = 1

is better than the MSE performance with

β = 0

in a low SNR region. In short, a larger temporal correlation coefficient means stronger resistance to additive noise, however, the hyperparameter estimation error, such as the error of

γ

, is often larger than that of small temporal correlation coefficients in the high SNR region. Figure 6a,b show the MSE of the

γ

estimated by the proposed PB-CSBL and NC-CSCE algorithms, respectively. The MSE is evaluated under different SNRs and temporal correlation coefficients. As can be seen, a larger temporal correlation coefficient achieves a smaller MSE in the low SNR region, while a smaller temporal correlation coefficient reduces the MSE value in the high SNR region.

Finally, to evaluate the estimation performance of the proposed NC-CSCE algorithm under colored noise, we compared the NC-CSCE algorithm with the TMSBL and PB-CSBL algorithms. The group number of subcarriers in NC-CSCE is adjusted to evaluate the channel estimation performance. In the simulation, all paths are distributed in three non-overlapping clusters. The clusters are set with three different temporal correlation coefficients ranges over multiple OFDM blocks, which are

[0.1, 0.3]

,

[0.5, 0.7]

,

[0.8, 1.0]

. All subcarriers are divided into eight groups and each group is set with a different SNR. As shown in Figure 7b, the reference SNR ranges are from 5 to 20 dB. With QPSK mapping, the MSE and BER performance curves are shown in Figure 7a, and the results prove that the proposed NC-CSCE algorithm is superior to the TMSBL and PB-CSBL algorithms. The NC-CSCE-8 denotes the NC-CSCE algorithm with eight groups, and it performs better than the NC-CSCE algorithm with other groups. Therefore, the setting of group number is critical for the NC-CSCE algorithm. In one instance, the actual distribution of colored noise can hardly be reflected with a limited group number; yet in another, a small symbol number will result in low measurement accuracy in one group. Figure 7b also shows the statistical results of the estimated SNRs based on 500 simulations. Similarly, NC-CSCE-8 matches the reference best among all compared methods.

6.2. Lake Experimental Results

To further verify the proposed algorithms, a real lake experiment was performed at Huating Lake, Anhui Province, China, in May 2018. Huating Lake covers an area of about 70 square kilometers, with an average water depth of about 17 m. We hung the transmitter and receiver on the side of a passenger ship and a speedboat, respectively. The passenger ship has a semi-open roof and is about 10 m long, while the speedboat is about half the size of the ship. During the experiment, the transmitter depth was about 10 m and the receiver depth was about 13 m. Affected by wind and current, their relative speed was about 0.5 m/s, and the distance between them was more than 4.5 km. The parameter settings of the experiment are in Table 2. In addition, the linear frequency modulation (LFM) signal was added at the beginning and end of each frame for time synchronization and Doppler estimation. The transmitted data bits were encoded by a

1 / 2

rate convolutional code, and the encoded bits were mapped by QPSK constellation. As showed in Figure 1, using the proposed NC-CSCE algorithm, we obtained the estimated CIRs for a 14 frame history with five consecutive OFDM blocks in one frame. The CIRs are estimated after Doppler compensation, and the total delay spread is around 15 ms. From the estimated CIRs, we notice that the paths are distributed in multiple clusters, and the channel shows a clustered-sparse structure.

Figure 8 shows the BER performance after convolutional decoding. The BERs of the proposed two algorithms are still the lowest for most frames, followed by the TMSBL and the SOMP, while the OMP has the worst BER performance. Obviously, the performance gap between proposed algorithms and other methods further verify the superior performance of the proposed algorithms.

7. Conclusions

In this paper, we propose a novel Bayesian learning-based channel estimation architecture to estimate the time-varying multipath channel with colored noise for UWA-OFDM systems. Specifically, a clustered-sparse channel distribution model was constructed to characterize the multipath distribution, and a noise-resistant channel measurement model is constructed to reduce the noise disturbance. To obtain the clustered-sparse distribution, we propose the partition-based clustered-sparse Bayesian learning algorithm. To lessen the effect of colored noise, we proposed a noise-corrected clustered-sparse channel estimation algorithm to improve the estimation performance. Taking advantage of the iterative clustered-sparse distribution learning, symbol decision and noise correction, the accuracy of Bayesian channel estimation can be improved. Experiments proved the effectiveness of the proposed algorithms for channel estimation.

In future work, channel decoding can be incorporated into the Bayesian learning-based channel estimation architecture to explore the possibility of reducing symbol decision errors. In the noise-resistant channel measurement model, the channel gains are approximately constant within one OFDM block, which may be too optimistic in some rapidly time-varying UWA channels. Therefore, some channel estimation models that deal with doubly selective fading channels, such as the popular basis expansion model, can be combined with the clustered-sparse Bayesian learning to improve the estimation performance. At the same time, channel feedback and reconstruction will be analyzed to realize adaptive modulation and coding in UWA-OFDM communication.

Author Contributions

Conceptualization, S.W. and D.L.; data curation, S.W.; formal analysis, S.W. and M.L.; funding acquisition, D.L.; investigation, S.W.; methodology, S.W. and M.L.; project administration, D.L.; resources, D.L.; software, S.W.; validation, S.W. and D.L.; visualization, S.W.; writing—original draft, S.W.; writing—review and editing, M.L. and D.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China under Grant 61571334.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Climent, S.; Sanchez, A.; Capella, J.; Meratnia, N.; Serrano Martín, J. Underwater Acoustic Wireless Sensor Networks: Advances and Future Trends in Physical, MAC and Routing Layers. Sensors 2014, 14, 795–833. [Google Scholar] [CrossRef]
Stojanovic, M.; Preisig, J. Underwater acoustic communication channels: Propagation models and statistical characterization. IEEE Commun. Mag. 2009, 47, 84–89. [Google Scholar] [CrossRef]
Coatelan, S.; Glavieux, A. Design and test of a multicarrier transmission system on the shallow water acoustic channel. In Proceedings of the OCEANS’94, Brest, France, 13–16 September 1994; Volume 3, pp. III/472–III/477. [Google Scholar] [CrossRef]
Zhou, S.; Wang, Z. OFDM for Underwater Acoustic Communications, 1st ed.; Wiley Publishing: Hoboken, NJ, USA, 2014. [Google Scholar]
Wen, M.; Cheng, X.; Yang, L.; Li, Y.; Cheng, X.; Ji, F. Index modulated OFDM for underwater acoustic communications. IEEE Commun. Mag. 2016, 54, 132–137. [Google Scholar] [CrossRef]
Basar, E.; Aygolu, U.; Panayirci, E.; Poor, H.V. Orthogonal Frequency Division Multiplexing With Index Modulation. IEEE Trans. Signal Process. 2013, 61, 5536–5549. [Google Scholar] [CrossRef]
Wan, L.; Zhou, H.; Xu, X.; Huang, Y.; Zhou, S.; Shi, Z.; Cui, J. Adaptive Modulation and Coding for Underwater Acoustic OFDM. IEEE J. Ocean. Eng. 2015, 40, 327–336. [Google Scholar] [CrossRef]
Zhang, Y.; Huang, Y.; Wan, L.; Zhou, H.; Zhou, S.; Shen, X.; Wang, H. Adaptive OFDMA for downlink underwater acoustic communications. In Proceedings of the 2014 Oceans-St. John’s, St. John’s, NL, Canada, 14–19 September 2014; pp. 1–5. [Google Scholar] [CrossRef]
Badiey, M.; Yongke, M.; Simmen, J.A.; Forsythe, S.E. Signal variability in shallow-water sound channels. IEEE J. Ocean. Eng. 2000, 25, 492–500. [Google Scholar] [CrossRef]
Preisig, J.C.; Deane, G.B. Surface wave focusing and acoustic communications in the surf zone. J. Acoust. Soc. Am. 2004, 116, 2067–2080. [Google Scholar] [CrossRef]
Song, A.; Mohsen, B.; Song, H.C.; Hodgkiss, W.S.; Porter, M.B. Impact of ocean variability on coherent underwater acoustic communications during the Kauai experiment (KauaiEx). J. Acoust. Soc. Am. 2008, 123, 856–865. [Google Scholar] [CrossRef] [Green Version]
Rouseff, D.; Badiey, M.; Song, A. Effect of reflected and refracted signals on coherent underwater acoustic communication: Results from the Kauai experiment (KauaiEx 2003). J. Acoust. Soc. Am. 2009, 126, 2359–2366. [Google Scholar] [CrossRef]
Song, A.; Badiey, M.; Newhall, A.E.; Lynch, J.F.; Deferrari, H.A.; Katsnelson, B.G. Passive Time Reversal Acoustic Communications Through Shallow-Water Internal Waves. IEEE J. Ocean. Eng. 2010, 35, 756–765. [Google Scholar] [CrossRef] [Green Version]
Wang, Z.; Zhou, S.; Preisig, J.C.; Pattipati, K.R.; Willett, P. Clustered Adaptation for Estimation of Time-Varying Underwater Acoustic Channels. IEEE Trans. Signal Process. 2012, 60, 3079–3091. [Google Scholar] [CrossRef]
Qiao, G.; Song, Q.; Ma, L.; Sun, Z.; Zhang, J. Channel prediction based temporal multiple sparse bayesian learning for channel estimation in fast time-varying underwater acoustic OFDM communications. Signal Process. 2020, 175, 107668. [Google Scholar] [CrossRef]
Wang, S.; Li, D.; Liu, M.; Huang, W.; Chen, H.; Cen, Y. Clustered-Sparse Bayesian Learning for Channel Estimation in Underwater Acoustic OFDM Systems. In Proceedings of the 2020 International Conference on Wireless Communications and Signal Processing (WCSP), Nanjing, China, 21–23 October 2020; pp. 546–551. [Google Scholar] [CrossRef]
Enguix, I.F.; Egea, M.S.; González, A.G.; Arenas, D. Acoustic Characterization of Impulsive Underwater Noise Present in Port Facilities: Practical Case in the Port of Cartagena. Proceedings 2018, 4, 5755. [Google Scholar] [CrossRef] [Green Version]
Li, W.; Preisig, J.C. Estimation of Rapidly Time-Varying Sparse Channels. IEEE J. Ocean. Eng. 2007, 32, 927–939. [Google Scholar] [CrossRef]
Stojanovic, M. OFDM for underwater acoustic communications: Adaptive synchronization and sparse channel estimation. In Proceedings of the 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, Las Vegas, NV, USA, 31 March–4 April 2008; pp. 5288–5291. [Google Scholar]
Geng, X.; Zielinski, A. An eigenpath underwater acoustic communication channel model. In Proceedings of the ‘Challenges of Our Changing Global Environment’, OCEANS ’95 MTS/IEEE, San Diego, CA, USA, 9–12 October 1995. [Google Scholar]
Yang, T.C. Characteristics of underwater acoustic communication channels in shallow water. In Proceedings of the OCEANS 2011 IEEE–Spain, Santander, Spain, 6–9 June 2011. [Google Scholar]
Rani, M.; Dhok, S.B.; Deshmukh, R.B. A Systematic Review of Compressive Sensing: Concepts, Implementations and Applications. IEEE Access 2018, 6, 4875–4894. [Google Scholar] [CrossRef]
Khan, M.R.; Das, B.; Pati, B.B. Channel estimation strategies for underwater acoustic (UWA) communication: An overview. J. Frankl. Inst. 2020, 357, 7229–7265. [Google Scholar] [CrossRef]
Uwaechia, A.N.; Mahyuddin, N.M. A Review on Sparse Channel Estimation in OFDM System Using Compressed Sensing. IETE Tech. Rev. 2017, 34, 514–531. [Google Scholar] [CrossRef]
Berger, C.R.; Zhou, S.; Preisig, J.C.; Willett, P. Sparse Channel Estimation for Multicarrier Underwater Acoustic Communication: From Subspace Methods to Compressed Sensing. IEEE Trans. Signal Process. 2010, 58, 1708–1721. [Google Scholar] [CrossRef] [Green Version]
Needell, D.; Tropp, J.A. CoSaMP Iterative signal recovery from incomplete and inaccurate samples. Appl. Comput. Harmon. Anal. 2009, 26, 301–321. [Google Scholar] [CrossRef] [Green Version]
Zhou, Y.H.; Tong, F.; Zhang, G.Q. Distributed compressed sensing estimation of underwater acoustic OFDM channel. Appl. Acoust. 2017, 117, 160–166. [Google Scholar] [CrossRef]
Yin, Y.; Liu, S.; Qiao, G.; Yang, Y. OFDM Demodulation Using Virtual Time Reversal Processing in Underwater Acoustic Communications. J. Comput. Acoust. 2015, 23, 1540011. [Google Scholar] [CrossRef]
Wipf, D.P.; Rao, B.D. Sparse Bayesian learning for basis selection. IEEE Trans. Signal Process. 2004, 52, 2153–2164. [Google Scholar] [CrossRef]
Prasad, R.; Murthy, C.R. Bayesian Learning for Joint Sparse OFDM Channel Estimation and Data Detection. In Proceedings of the Global Telecommunications Conference (GLOBECOM 2010), Miami, FL, USA, 6–10 December 2010. [Google Scholar]
Prasad, R.; Murthy, C.R.; Rao, B.D. Joint Approximately Sparse Channel Estimation and Data Detection in OFDM Systems Using Sparse Bayesian Learning; IEEE Press: Piscataway, NJ, USA, 2014. [Google Scholar]
Qiao, G.; Song, Q.; Ma, L.; Liu, S.; Sun, Z.; Gan, S. Sparse Bayesian Learning For Channel Estimation In Time-varying Underwater Acoustic OFDM Communication. IEEE Access 2018, 56675–56684. [Google Scholar] [CrossRef]
Bajwa, W.U.; Haupt, J.; Sayeed, A.M.; Nowak, R. Compressed Channel Sensing: A New Approach to Estimating Sparse Multipath Channels. Proc. IEEE 2010, 98, 1058–1076. [Google Scholar] [CrossRef]
Liu, S.; Wu, H.; Huang, Y.; Yang, Y.; Jia, J. Accelerated Structure-Aware Sparse Bayesian Learning for Three-Dimensional Electrical Impedance Tomography. IEEE Trans. Ind. Inform. 2019, 15, 5033–5041. [Google Scholar] [CrossRef]
Ye, H.; Li, G.Y.; Juang, B. Power of Deep Learning for Channel Estimation and Signal Detection in OFDM Systems. IEEE Wirel. Commun. Lett. 2018, 7, 114–117. [Google Scholar] [CrossRef]
Jiang, R.; Wang, X.; Cao, S.; Zhao, J.; Li, X. Deep Neural Networks for Channel Estimation in Underwater Acoustic OFDM Systems. IEEE Access 2019, 7, 23579–23594. [Google Scholar] [CrossRef]
Zhang, Y.; Li, J.; Zakharov, Y.; Li, X.; Li, J. Deep learning based underwater acoustic OFDM communications. Appl. Acoust. 2019, 154, 53–58. [Google Scholar] [CrossRef]
Stojanovic, M. Low Complexity OFDM Detector for Underwater Acoustic Channels. In Proceedings of the OCEANS 2006, Boston, MA, USA, 18–21 September 2006; pp. 1–6. [Google Scholar]
Li, B.; Zhou, S.; Stojanovic, M.; Freitag, L.; Willett, P. Multicarrier Communication Over Underwater Acoustic Channels With Nonuniform Doppler Shifts. IEEE J. Ocean. Eng. 2008, 33, 198–209. [Google Scholar]
Kim, B.-C.; Lu, I.T. Parameter study of OFDM underwater communications system. In Proceedings of the OCEANS 2000 MTS/IEEE Conference and Exhibition, (Cat. No.00CH37158), Providence, RI, USA, 11–14 September 2000; Volume 2, pp. 1251–1255. [Google Scholar] [CrossRef]
Zhang, Z.; Rao, B.D. Sparse Signal Recovery with Temporally Correlated Source Vectors Using Sparse Bayesian Learning. IEEE J. Sel. Top. Signal Process. 2011, 5, 912–926. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Time-varying multipath channel with clustered-sparse structure estimated in a lake communication trial.

Figure 2. Architecture of Bayesian learning-based channel estimation in UWA-OFDM communication.

Figure 3. The comparison of MSE and BER performance based on QPSK constellation mapping: (a) the comparison of MSE performance; and (b) the comparison of BER performance.

Figure 4. The comparison of MSE and BER performance based on 16-QAM constellation mapping: (a) the comparison of MSE performance; and (b) the comparison of BER performance.

Figure 5. The MSE performance of the proposed algorithms and their theoretical lower bounds: (a) the temporal correlation coefficient

β = 0

; and (b) the temporal correlation coefficient

β = 1

.

Figure 5. The MSE performance of the proposed algorithms and their theoretical lower bounds: (a) the temporal correlation coefficient

β = 0

; and (b) the temporal correlation coefficient

β = 1

.

Figure 6. The MSE of the estimated hyperparameter

γ

by the proposed PB-CSBL and NC-CSCE algorithms under different SNRs and temporal correlation coefficients: (a) the PB-CSBL algorithm; and (b) the NC-CSCE algorithm.

Figure 6. The MSE of the estimated hyperparameter

γ

by the proposed PB-CSBL and NC-CSCE algorithms under different SNRs and temporal correlation coefficients: (a) the PB-CSBL algorithm; and (b) the NC-CSCE algorithm.

Figure 7. The performance comparison of different algorithms under colored noise: (a) the MSE and BER performance; and (b) the estimated SNRs over all subcarriers.

Figure 8. The comparison of BER performance in the lake experiment.

Table 1. The comparison of time complexity.

LS	OMP	SOMP	TMSBL	PB-CSBL
$O (L K_{p} M)$	$O (L^{2} K_{p} M)$	$O (L^{2} K_{p} M)$	$O (L K_{p}^{2} \bar{N})$	$O (L K_{p}^{2} M^{3} \bar{N})$

Table 2. UWA CP-OFDM settings.

Parameters	Notations	Values of the Simulations	Values of the Lake Trial
Bandwidth	B	1.6 kHz	5 kHz
Carrier frequency	$f_{c}$	2.5 kHz	20 kHz
Sampling frequency	$f_{s}$	12.5 kHz	100 kHz
Number of subcarriers	K	256	256
Number of data subcarriers	$K_{d}$	218	193
Number of pilot subcarriers	$K_{p}$	14	32
Number of null subcarriers	$K_{n}$	24	31
Symbol duration without CP	T	160 ms	51.2 ms
CP length	$T_{c p}$	10 ms	25.6 ms
Blocks in one frame	M	4	5

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, S.; Liu, M.; Li, D. Bayesian Learning-Based Clustered-Sparse Channel Estimation for Time-Varying Underwater Acoustic OFDM Communication. Sensors 2021, 21, 4889. https://doi.org/10.3390/s21144889

AMA Style

Wang S, Liu M, Li D. Bayesian Learning-Based Clustered-Sparse Channel Estimation for Time-Varying Underwater Acoustic OFDM Communication. Sensors. 2021; 21(14):4889. https://doi.org/10.3390/s21144889

Chicago/Turabian Style

Wang, Shuaijun, Mingliu Liu, and Deshi Li. 2021. "Bayesian Learning-Based Clustered-Sparse Channel Estimation for Time-Varying Underwater Acoustic OFDM Communication" Sensors 21, no. 14: 4889. https://doi.org/10.3390/s21144889

APA Style

Wang, S., Liu, M., & Li, D. (2021). Bayesian Learning-Based Clustered-Sparse Channel Estimation for Time-Varying Underwater Acoustic OFDM Communication. Sensors, 21(14), 4889. https://doi.org/10.3390/s21144889

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Bayesian Learning-Based Clustered-Sparse Channel Estimation for Time-Varying Underwater Acoustic OFDM Communication^†

Abstract

1. Introduction

2. Related Works

3. System Architecture and Channel Models

3.1. Bayesian Learning-Based Channel Estimation Architecture

3.2. Noise-Resistant Channel Measurement Model

3.3. Clustered-Sparse Channel Distribution Model

4. Bayesian Learning-Based Clustered-Sparse Channel Estimation

4.1. Bayesian Channel Estimation

4.2. Cluster Distribution Learning

4.3. Partition-Based Clustered-Sparse Bayesian Learning Algorithm

4.4. Complexity and Performance Analysis

5. Noise-Corrected Clustered-Sparse Channel Estimation

5.1. Noise-Resistant Bayesian Channel Estimation

5.2. Data Detection and Noise Measurement

5.3. Noise-Corrected Clustered-Sparse Channel Estimation Algorithm

5.4. Complexity and Performance Analysis

6. Evaluation and Result Analysis

6.1. Simulation Results

6.2. Lake Experimental Results

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Bayesian Learning-Based Clustered-Sparse Channel Estimation for Time-Varying Underwater Acoustic OFDM Communication †

Abstract

1. Introduction

2. Related Works

3. System Architecture and Channel Models

3.1. Bayesian Learning-Based Channel Estimation Architecture

3.2. Noise-Resistant Channel Measurement Model

3.3. Clustered-Sparse Channel Distribution Model

4. Bayesian Learning-Based Clustered-Sparse Channel Estimation

4.1. Bayesian Channel Estimation

4.2. Cluster Distribution Learning

4.3. Partition-Based Clustered-Sparse Bayesian Learning Algorithm

4.4. Complexity and Performance Analysis

5. Noise-Corrected Clustered-Sparse Channel Estimation

5.1. Noise-Resistant Bayesian Channel Estimation

5.2. Data Detection and Noise Measurement

5.3. Noise-Corrected Clustered-Sparse Channel Estimation Algorithm

5.4. Complexity and Performance Analysis

6. Evaluation and Result Analysis

6.1. Simulation Results

6.2. Lake Experimental Results

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Bayesian Learning-Based Clustered-Sparse Channel Estimation for Time-Varying Underwater Acoustic OFDM Communication^†