Next Article in Journal
Modelling of Synthetic Fibre Rope Mooring for Floating Offshore Wind Turbines
Next Article in Special Issue
Effects of Internal Waves on Acoustic Temporal Coherence in the South China Sea
Previous Article in Journal
Effect of Relative Stiffness of Pile and Soil on Pile Group Effect
Previous Article in Special Issue
Waveform Design Using Coprime Frequency-Modulated Pulse Trains for Reverberation Suppression of Active Sonar
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Variational Bayesian Inference Time Delay Estimation for Passive Sonars

1
The Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, China
2
Key Laboratory of Science and Technology on Advanced Underwater Acoustic Signal Processing, Chinese Academy of Sciences, Beijing 100190, China
3
University of Chinese Academy of Sciences, Beijing 100049, China
*
Author to whom correspondence should be addressed.
J. Mar. Sci. Eng. 2023, 11(1), 194; https://doi.org/10.3390/jmse11010194
Submission received: 14 November 2022 / Revised: 23 December 2022 / Accepted: 6 January 2023 / Published: 12 January 2023
(This article belongs to the Special Issue Underwater Acoustics and Digital Signal Processing)

Abstract

:
In passive sonars, distance and depth estimation of underwater targets is often limited by the accuracy of time delay estimations. The estimation accuracy of the existing methods of time delay estimation is limited by the uniform discrete grid (signal sampling rate). When a true time delay is out of the grid, the estimation accuracy deteriorates due to the mismatch between the real-time delay and the discrete grid. This paper proposes a new method for time delay estimation, which realizes the time delay estimation under the framework of variational Bayesian inference. The proposed method is grid-less, that is, continuous in the time domain. Unlike the popular grid-less compressive time delay estimation method, this method does not require parameter adjustment, and can automatically estimate the number of time delays, noise variance, and amplitude variance. The simulation results showed that the performance of the proposed method was superior to the reference state-of-the-art time delay estimation methods.

1. Introduction

Time delay estimation is a basic problem in signal parameter estimation. It is required in many applications, such as speech analysis, channel estimation, radar navigation, and sonar signal processing [1,2,3,4,5,6,7,8]. In passive sonar systems, the distance and depth estimation of underwater targets is often affected by the performance of time delay estimation. In general, the accuracy of the time delay is of more concern when estimating the distance. A major time delay exists at this point without considering the multipath problem [9]. The multipath problem is very important when estimating the depth. Therefore, not only is the accuracy of the time delay estimation needed, but also the number of time delays [10].
The cross-correlation method is commonly used to estimate the time delay [11], which determines the time delay value by locating the peak of the cross-correlation function. The cross-correlation method is classical and easy to be implemented in practical systems, but its resolution is limited by the shortcomings of the wide main lobe and high sidelobes of the cross-correlation function [12]. Moreover, the estimation accuracy of this method is limited by the sampling frequency.
To mitigate the poor resolution problem of the cross-correlation method, many high-resolution methods of time delay estimation have been proposed, including look-ahead backtracking orthogonal matching pursuit (LABOMP) [4,5], multiple signal classification (MUSIC) [13], weighted l 1 norm minimization algorithm based on the cross-correlation method (CC-WL1) [14], and the deconvolution cross-correlation method (Dec-CC) [14]. These algorithms can obtain higher time delay resolution. However, there exists a common shortcoming of these methods, in that the performance of these methods is still limited by the grid. When the real-time delay is out of the grid, a grid mismatch arises, thereby reducing the performance of time delay estimation.
A simple way to solve the problem of grid mismatch is to increase the fineness of the grid, and solve the time delay through a denser grid. Although this can improve the accuracy of time delay estimation, dense grids often necessitate enormous data storage and computational pressure. Another method is to adaptively adjust the fineness of the grid to match the true time delay, which is known as off-grid time delay estimation. As described in Reference [15], sparse Bayesian inference can be used to iteratively update the fineness of the grid from the coarse grid to obtain a more accurate time delay estimation. However, this method still requires a coarse grid to be provided in advance.
Solving the time delay estimation problem in the continuous time domain, rather than the discrete one, may be the best solution to completely overcome the mismatch problem, which is referred to as grid-less time delay estimation. In reference [16], inspired by grid-free compressed beamforming [17], the time delay estimation problem was transformed into a sparse reconstruction problem, then the atomic-norm minimization (ANM) method was used to achieve the time delay estimation in the continuous time domain. A major limitation of the ANM method is that it requires sufficient separation between two adjacent time delays to be recovered. The reweighted atomic-norm minimization (RAM) method can avoid this limitation. According to reference [18], the RAM method has a higher resolution than the ANM method. Nevertheless, both the RAM and ANM methods meet a common issue, that is, when the noise variance is not known, it is necessary to manually adjust the parameters to obtain the best estimation result, a process which is often difficult and complex.
A VAriational Time Delay Estimation (VATDE) method is proposed in this paper. The VATDE method estimates the posterior probability density functions (PDFs) of time delays using variational Bayesian inference and calculates their expectations, rather than simply keeping the point estimates of time delays. The proposed method is grid-less, and, unlike the existing work in references [16,18], the VATDE does not need parameter adjustment, and can automatically estimate the number of time delays, noise variance, and amplitude variance. A simulation demonstrated that, even compared with other state-of-the-art methods, this method still had better time delay estimation performance.

2. Signal Model

Consider two spatially separated sensors in a passive system, which receive signals x 1 ( t ) and x 2 ( t ) from the same target source s ( t ) . Assume that there are no other interference sources, and the target source signal s ( t ) is affected by Gaussian white noise during the propagation process. Then, the cross-correlation function R x 1 x 2 of the received signals x 1 ( t ) and x 2 ( t ) can be obtained as follows:
R x 1 x 2 ( τ ) = k = 1 K α k R s s ( τ τ k ) + R e e ( τ )
where R s s is the autocorrelation function of the source signal s ( t ) ; K is the number of time delays; α k is the amplitude of each time delay; τ k represents the k th time delay; and R e e is the noise-related term. Ideally, the received signal and noise are independent of each other, at which point R e e = 0 .
Assume that the source signal s ( t ) is a broadband signal. Then, the Fourier transform is performed on the cross-correlation function R x 1 x 2 , and the frequency form of Formula (1) can be obtained as follows:
G x 1 x 2 ( f m ) = G s s ( f m ) k = 1 K α k e j 2 π f m τ k + G e e ( f m ) ,   m = 1 , 2 , M
where G x 1 x 2 ( f m ) is the cross-power spectrum between the received signal x 1 ( t ) and x 2 ( t ) ; G s s ( f m ) is the auto-power spectrum of the source signal s ( t ) ; G e e ( f m ) is the cross-power spectrum of the noise; f m = m f s / N is a sampling of the frequency band of the source signal s ( t ) ; f s is the sampling rate; and N is the signal sampling length. The aim of this paper was to estimate exact time delays from the cross-power spectrum G x 1 x 2 ( f m ) of the received signal.
Before using the VATDE algorithm proposed in this paper, according to the definition of Formula (3), the model in Formula (2) was further parameterized. Let
ω k = 2 π f s τ k N
where ω k [ π , π ] . Note that ω k and τ k are in a one-to-one correspondence: when ω k is evaluated correctly, τ k can be obtained by ω k N / 2 π f s . Therefore, Formula (2) can be further expressed as follows:
G x 1 x 2 ( m ) = G s s ( m ) k = 1 K α k e j ω k m + G e e ( m ) ,   m = 1 , 2 , M
where G s s ( m ) = | G s s ( m ) | , because the phase component of the auto-power spectrum is 0.
In fact, the value of K is difficult to obtain, so this paper used dimension M of the source signal frequency band as the number of time delays in the overcomplete model. Of course, it is necessary to ensure K < < M . At this point, the final modeling result of the VATDE algorithm can be given as follows:
G x 1 x 2 = i = 1 M W i a ( ω i ) + G e e
where W i = G s s ( i ) α i and W = [ W 1 , W 2 , , W M ] T represent the weight of the time delay; and a ( ω i ) = e j ω i m ,   m { 1 , 2 , , M } represents the Fourier transform of the time delay value. Since the goal of this paper was to obtain K non-zero weights, sparse prior information was introduced for the weights. Ideally, the model shown in Formula (5) would finally obtain K real-time delays and the corresponding non-zero weights and M K zero weights. Specifically, this paper introduces binary latent variables S = [ S 1 , S 2 , , S M ] T , S i [ 0 , 1 ] to control the weight W to promote sparsity. When S i = 1 , then W i is active, that is, W i is not equal to 0; otherwise, W i is equal to 0. This can be expressed as the following formula:
p ( W i | S i ; ε ) = ( 1 S i ) δ ( W i ) + S i f CN ( W i ; 0 ; ε )
where W i | S i follows the Bernoulli Gaussian distribution [19,20]; p ( S i ) = ρ S i ( 1 ρ ) ( 1 S i ) ; ρ is the probability that the i th weight component is active; and ε is the variance of the zero-mean gaussian distribution. In this paper, f CN ( ;   μ , Σ ) denotes the complex multivariate Gaussian PDFs with mean μ and variance Σ . For the i th time delay, let its prior distribution be p ( ω i ) . This is a von Mises distribution; it is extremely important in the distribution on the unit circle [21], and in most cases let p ( ω i ) = 1 / 2 π , which represents a lack of prior knowledge [22,23].
According to the overcomplete model (Formula (5)), the likelihood function p ( G x 1 x 2 | ω , W ; ν ) can be represented as follows:
p ( G x 1 x 2 | ω , W ; ν ) = f CN ( G x 1 x 2 ; i W i a ( ω i ) , ν I )
where ν represents the variance of background noise. The maximum likelihood (ML) estimation of the core parameters in this paper can then be represented as follows:
( ρ ^ ML , ε ^ ML , ν ^ ML ) = argmax ρ , ε , ν p ( G x 1 x 2 , ω , W , S ; ρ , ε , ν )   d S d W d ω
where p ( G x 1 x 2 , ω , W , S ; ρ , ε , ν ) is the joint PDFs, which is the product of the likelihood function (Formula (7)) and the prior PDFs:
p ( G x 1 x 2 , ω , W , S ; ρ , ε , ν ) = p ( G x 1 x 2 | ω , W ; ν ) i = 1 M p ( ω i ) p ( W i | S i ) p ( S i )
However, the calculation of Formula (8) is extremely complex and intractable; thus, in the next section an iterative algorithm is proposed to solve it.

3. Variational Bayesian Inference Time Delay Estimation Algorithm

In this section, the proposed VATDE algorithm relies on the mean-field theory of variational Bayesian inference. This paper wished to determine the approximation PDFs q ( ω , W , S | G x 1 x 2 ) that minimize the Kullback–Leibler (KL) divergence KL ( q ( ω , W , S | G x 1 x 2 ) p ( ω , W , S | G x 1 x 2 ) ) , which means that the function L ( q ( ω , W , S | G x 1 x 2 ) ) is a maximization [24] (pp. 732–733),
L ( q ( ω , W , S | G x 1 x 2 ) ) = E q ( ω , W , S | G x 1 x 2 ) [ ln p ( G x 1 x 2 , ω , W , S ; ρ , ε , ν ) q ( ω , W , S | G x 1 x 2 ) ]
where E [ ] represents the expectation operator. Furthermore, all time delays and other variables are assumed to be independent of each other, and q ( S | G x 1 x 2 ) = δ ( S S ^ ) , so q ( ω , W , S | G x 1 x 2 ) can be represented as follows:
q ( ω , W , S | G x 1 x 2 ) = i = 1 M q ( ω i | G x 1 x 2 ) q ( W | G x 1 x 2 , S ) δ ( S S ^ )
According to Formula (11), the following estimation can be obtained:
ω ^ i = arg ( E q ( ω i | G x 1 x 2 ) [ e j ω i ] )
a ^ i = E q ( ω i | G x 1 x 2 ) [ a ( ω i ) ]
For the detailed derivation process, please refer to reference [21] (p. 26).
Due to q ( S | G x 1 x 2 ) = δ ( S S ^ ) , the approximate posterior PDFs of the weights W can be represented as follows:
q ( W | G x 1 x 2 ) = q ( W | G x 1 x 2 , S ) δ ( S S ^ ) d S = q ( W | G x 1 x 2 , S ^ )
According to Formula (14), the mean and variance of the weights W can be estimated as follows:
W ^ = E q ( W | G x 1 x 2 ) [ W ] ,   C ^ = E q ( W | G x 1 x 2 ) [ W W H ] W ^ W ^ H
Then, this paper defines S as the estimation of the number of time delays K ^ , which is a set of non-zero parameters in the variable S :
S = { i | 1 i M , S i = 1 }
Therefore, K ^ is equivalent to the cardinality of S , K ^ = | S ^ | . Finally, the estimation of the cross-power spectrum G x 1 x 2 is given as follows:
G ^ x 1 x 2 = i = 1 S ^ W ^ i a ^ ( ω i )
Compared with Formula (5), Formula (17) represents the cross-power spectrum without noise.
Now, let us consider maximizing the function L ( q ( ω , W , S | G x 1 x 2 ) ) . Naturally, it is difficult to maximize the function L ( q ( ω , W , S | G x 1 x 2 ) ) on all variables simultaneously. Therefore, this section adopts the alternative optimization scheme, that is, the function L ( q ( ω , W , S | G x 1 x 2 ) ) is maximized for each variable, while other variables are fixed. Let z = ( ω 1 , ω 2 , , ω M , ( W , S ) ) be the set of all variables, and z t , t = 1 , 2 , , M + 1 represents each variable in the set, then the above alternative optimization scheme can be expressed as follows [24] (p. 735), Equations (21) and (25).:
ln q ( z t | G x 1 x 2 ) = E q ( ~ z t | G x 1 x 2 ) [ ln p ( G x 1 x 2 , z ) ] + const
where E q ( ~ z t | G x 1 x 2 ) [ ln p ( G x 1 x 2 , z ) ] represents the expectation of all the variables except variable z t . In the following sections, more optimization details are presented.

3.1. Inference Parameter ω

According to Formula (18), for each variable in the parameter ω , the optimization factor ln q ( ω i | G x 1 x 2 ) can be represented as follows [25] (Chapter 10, p. 466):
ln q ( ω i | G x 1 x 2 ) = E q ( ~ ω i | G x 1 x 2 ) [ ln p ( G x 1 x 2 , z ; ρ , ε , ν ) ] + const
By substituting Formula (9), the following can be obtained:
ln q ( ω i | G x 1 x 2 ) = E q ( ~ ω i | G x 1 x 2 ) [ ln p ( G x 1 x 2 | ω , W ; ν ) p ( ω ) p ( W | S ) p ( S ) ] + const = E q ( ~ ω i | G x 1 x 2 ) [ d = 1 M ln p ( ω d ) + d = 1 M ln p ( S d ) + ln p ( W | S ) ] + E q ( ~ ω i | G x 1 x 2 ) [ ln p ( G x 1 x 2 | ω , W ; ν ) ] + const
When i S , Formula (20) is further expressed as follows:
ln q ( ω i | G x 1 x 2 ) = E q ( ~ ω i | G x 1 x 2 ) [ ( G x 1 x 2 k S a ( ω k ) W k ) H ( G x 1 x 2 k S a ( ω k ) W k ) / ν ] + ln p ( ω i ) + const
Then, the following can be obtained using Formula (15):
ln q ( ω i | G x 1 x 2 ) = ln p ( ω i ) + { η i H a ( ω i ) } + const
where
η i = 2 ν ( G x 1 x 2 k S ^ \ { i } a ^ k W ^ k ) W i * 2 ν k S ^ \ { i } C ^ k . i a ^ k
When i S ^ , then η i = 0 , which means that only the prior information plays a role in Formula (22).
It is difficult to obtain the analytical results of Formula (22) directly. Fortunately, according to reference [22] (Heuristic 2), the q ( ω i | G x 1 x 2 ) can be approximated as a von Mises distribution:
q ( ω i | G x 1 x 2 ) V M ( ω ; μ ^ i , κ ^ i )
The von Mises distribution V M ( ω ; μ , κ ) is expressed as follows:
V M ( ω ; μ , κ ) = 1 2 π I 0 ( κ ) e κ cos ( ω μ )
where μ is the mean direction, κ is the concentration parameter, and I p ( κ ) is the modified Bessel function of the first kind of order p [21] (p. 348). According to Formula (25), the following can be obtained [21] (p. 26):
arg ( E V M ( ω ; μ , κ ) [ e j ω ] ) = arg ( e j μ I 1 ( κ ) I 0 ( κ ) ) = μ = E V M ( ω ; μ , κ ) [ ω ]
E [ e j ω ] = e j m μ I m ( κ ) I 0 ( κ )
Then, parameters ω ^ i and a ^ i in Formulas (12) and (13) can be accordingly calculated.

3.2. Inference Parameters W and S

Then, fix q ( ω i | G x 1 x 2 ) , i = 1 , 2 , , M , and give the maximization of the function of L ( q ( ω , W , S | G x 1 x 2 ) ) with respect to q ( W , S | G x 1 x 2 ) . We define matrix J and H as follows:
J i j = { M , i = j a ^ i H a ^ j , i j   i , j { 1 , 2 , , M }
H = [ a ^ 1 H G x 1 x 2 , a ^ 2 H G x 1 x 2 , , a ^ M H G x 1 x 2 ] T
According to Formula (18), ln q ( W , S | G x 1 x 2 ) can be calculated with the following formula:
ln q ( W , S | G x 1 x 2 ) = E q ( ~ ( W , S ) | G x 1 x 2 ) [ ln p ( G x 1 x 2 , z ) ] + const = E q ( ~ ( W , S ) | G x 1 x 2 ) [ d = 1 M ln p ( S d ) + ln p ( W | S ) + ln p ( G x 1 x 2 | ω , W ) ] + const = S 0 ln ρ 1 ρ + S 0 ln 1 π ε 1 ε W S W S H + [ 2 ν { W S H H S } 1 ν W S H J S W S ] + const = ( W S W ^ S ) H C ^ S 1 ( W S W ^ S ) + const
where W S represents the element of parameter W indexed by S ; J S and C ^ S 1 represent the submatrix with S as the row and column index; and S 0 represents the number of non-zero elements in S . Then, it has the following formula:
W ^ S = C ^ S H S ν , C ^ S = ν ( J S + ν ε I ) 1
According to Formula (14), the approximate posterior PDFs q ( W | G x 1 x 2 ) of the weight W can be represented as the product of q ( W | G x 1 x 2 , S ) and δ ( S S ^ ) . Therefore, to calculate q ( W | G x 1 x 2 ) , the value of S ^ must be estimated. By substituting the approximate PDFs q ( ω , W , S | G x 1 x 2 ) , put Formula (11) into Formula (10), and Formula (32) can be obtained as follows:
  ln Z ( S ^ ) = L ( q ( ω , W , S | G x 1 x 2 ) ; S ^ ) = E q ( ω , W , S | G x 1 x 2 ) [ ln p ( G x 1 x 2 , ω , W , S ; S ^ ) q ( ω , W , S | G x 1 x 2 ; S ^ ) ] = E q ( ω , W , S | G x 1 x 2 ) [ ln p ( S ) + ln p ( W | S ) + ln p ( G x 1 x 2 | ω , W ) ] E q ( ω , W , S | G x 1 x 2 ) [ ln q ( W , S | G x 1 x 2 ) ] + const = S ^ 0 ln ρ ν ( 1 ρ ) ε + ln det ( J S + ν ε I ) 1 + 1 ν H S H ( J S + ν ε I ) 1 H S + const
Consequently, when we maximize ln Z ( S ^ ) , it is possible to obtain S ^ :
S ^ = arg max S ln Z ( S )
The global optimal Bernoulli sequence S of the problem (Formula (33)) can be determined using an enumeration algorithm. However, for M-dimensional sequences, the computational cost of this operation is O ( 2 M ) and its computational complexity will increase exponentially as the M increases. For a given S ^ , to reduce the complexity, a greedy iterative search strategy is proposed to determine the local optimal value. The specific update policy is as follows: For every k = 1 , 2 , , M , calculate Δ k = ln Z ( S ^ k ) ln Z ( S ^ ) , where S ^ k is the same as S ^ , except that the k th element is flipped. Let k max = arg max k Δ k , if Δ k max > 0 , update S ^ by flipping the k max th element; otherwise, if S ^ remains unchanged, then terminate the algorithm. As for the computation of Δ k , a detailed method has been given in references [22,23], and will not be repeated here. If the algorithm does not stop halfway through, and S ^ 0 is initialized to 0 M , it usually only takes O ( K ^ ) steps to determine the local maximum of ln Z ( S ^ ) , in which K ^ is the estimate of the number of time delays.

3.3. Inference Parameters ρ ^ , ε ^ , and ν ^

After updating parameters ω and W , parameters { ρ , ε , ν } are then estimated by maximizing the lower bound of function L ( q ( ω , W , S | G x 1 x 2 ) ; ρ , ε , ν ) . First, we bring the approximate PDFs q ( ω , W , S | G x 1 x 2 ) from Formula (11) into Formula (10):
L ( q ( ω , W , S | G x 1 x 2 ) ; ρ , ε , ν ) = E q ( ω , W , S | G x 1 x 2 ) [ ln p ( G x 1 x 2 , ω , W , S ; ρ , ε , ν ) q ( ω , W , S | G x 1 x 2 ) ]
When only terms that depend on parameters { ρ , ε , ν } are considered, Formula (34) can be further expressed as follows:
L ( q ( ω , W , S | G x 1 x 2 ) ; ρ , ε , ν ) = E q ( ω , W , S | G x 1 x 2 ) [ ln p ( G x 1 x 2 , ω , W , S ; ρ , ε , ν ) ] + const = E q ( ω , W , S | G x 1 x 2 ) [ ln p ( S ) + ln p ( W | S ) ] + E q ( ω , W , S | G x 1 x 2 ) [ ln p ( G x 1 x 2 | ω , W ) ] + const
Based on the likelihood function (Formula (7)) defined in Section 2 and the form of the prior PDFs, Formula (36) can be obtained as follows:
L ( q ( ω , W , S | G x 1 x 2 ) ; ρ , ε , ν ) = 1 ν [ 2 { W ^ S ^ H H S ^ } W ^ S ^ H J S ^ W ^ S ^ G x 1 x 2 H G x 1 x 2 tr ( J S ^ C ^ S ^ ) ] M ln ν 1 ε [ W ^ S ^ H W ^ S ^ + tr ( C ^ S ^ ) ] + S ^ 0 ln ρ S ^ 0 ln ε + ( M S ^ 0 ) ln ( 1 ρ ) +   const
If we set L ρ = 0 and L ε = 0 , then Formula (37) can be obtained as follows:
ρ ^ = S ^ 0 M , ε ^ = W ^ S ^ H W ^ S ^ + tr ( C ^ S ^ ) S ^ 0
Clearly, ρ ^ is given by the number of non-zero elements of S ^ , and ε ^ is the variance of the corresponding weights of these non-zero elements. Similarly, set L ν = 0 , and Formula (38) can be obtained as follows:
ν ^ = 1 M G x 1 x 2 k S ^ a ^ k W ^ k 2 2 + 1 M tr ( J S ^ C ^ S ^ ) + k S ^ | W ^ k | 2 ( 1 a ^ k 2 2 M )
In summary, ν ^ is composed of three parts: the fitting of the residual, the error of the weight estimation, and the uncertainty of the parameter ω estimation. The better q ( ω , W , S | G x 1 x 2 ) fits, the smaller the values of the above three parts will be.

3.4. The VATDE Algorithm

This section gives the details of the update process of the VATDE algorithm, as shown in Algorithm 1.
Algorithm 1. Description of the VATDE algorithm.
Input: The cross-power spectrum G x 1 x 2 of the received signal.
Output: Estimate the number of delays K ^ , time delay estimation τ ^ , weight estimate W ^ , and reconstruct the cross-power spectrum G ^ x 1 x 2 .
(1) initialize ν ^ , ρ ^ , ε ^ and q ( ω i | G x 1 x 2 ) , i = 1 , 2 , , M , calculate a ^ i
(2) repeat
(3)  update S ^ , W ^ S and C ^ S (see Section 3.2)
(4)  update ν ^ , ρ ^ , and ε ^ (as shown in Formulas (37) and (38))
(5)  When i S ^ , update η i , and a ^ i (see Section 3.1)
(6) until the stopping criterion
(7) according to τ ^ i = ω ^ i N / 2 π f s , obtain τ ^
(8) return  K ^ , τ ^ , W ^ and G ^ x 1 x 2
For Algorithm 1, each step will increase the value of function L , and eventually converge to some local maximum of function L ; thus, initialization is very important for Algorithm 1. Next, a specific initialization scheme is described.
First, initialize q ( ω 1 | G x 1 x 2 ) as q ( ω 1 | G x 1 x 2 ) exp ( G x 1 x 2 H a ( ω 1 ) 2 2 / ( ν M ) ) by defining the following:
M = { m n | m , n { 0 , 1 , , M 1 } , m > n } a : [ π , π ) M , ω a ( ω ) ( e j ω m m M ) T , a ( ω ) = [ 1 ; a ( ω ) ]
Then, q ( ω 1 | G x 1 x 2 ) can be simplified to the form of Formula (22). By defining γ = 1 M G x 1 x 2 G x 1 x 2 H , q ( ω 1 | G x 1 x 2 ) will be re-expressed as follows:
q ( ω 1 | G x 1 x 2 ) exp ( { 2 ν γ H a ( ω 1 ) } )
Second, a ^ 1 = E ( a ( ω 1 ) ) can be calculated. from Formula (28) J 1 = M , and H 1 can be calculated using Formula (29). According to Formula (31), ω ^ 1 can be calculated. Then, update q ( ω 2 | G x 1 x 2 ) exp ( G 1 x 1 x 2 H a ( ω 2 ) 2 2 / ( ν M ) ) ; at this time, G 1 x 1 x 2 = G x 1 x 2 a ^ 1 ω ^ 1 T . According to the above description, q ( ω i | G x 1 x 2 ) , a ^ i , and ω ^ i are fully initialized.
Finally, we give the initialization of the parameters ν ^ , ρ ^ and, ε ^ . Let ν ^ be initialized to the average of the final quarter of the eigenvalues of matrix E ( G x 1 x 2 G x 1 x 2 H ) ; ρ ^ = 0.5 ; and ε ^ is initialized to ε ^ = ( tr [ G x 1 x 2 H G x 1 x 2 ] / M ν ^ ) / ( M ρ ^ ) .

4. Simulation

Since the proposed algorithm is a grid-less time delay estimation algorithm, the simulation in this section is mainly compared with the proposed grid-less time delay estimation algorithm. It was assumed that the target source signal was a Gaussian white noise signal with a frequency of 1–4 kHz, and the signal length was 0.1 s. Considering a passive system, the sampling rate of the system was 10 kHz, the sampling interval was 1 × 10 4 s, two spatially separated synchronous sensors received the signal from the target source at the same time, and there was background noise in the space, which was Gaussian white noise with a mean of 0.
First, the ideal state was considered, that is, there was only one time delay between the received signals of the two sensors, the time delay value was 2.75 × 10 4 s, and the normalized amplitude was 1. The signal-to-noise ratio (SNR) was set to 10 dB, and the end condition of the VATDE algorithm was G ^ x 1 x 2 i 1 G ^ x 1 x 2 i 2 / G ^ x 1 x 2 i 1 2 < 10 6 , then i = 3000 represents the maximum number of iterations. Next, we compared the time delay estimation performance of the VATDE algorithm proposed in this paper with the ANM and RAM algorithms.
It can be seen from Figure 1a that all three algorithms, i.e., the ANM and RAM methods, and the VATDE proposed in this paper, could obtain better time delay estimation results in the case of a single time delay. Meanwhile, the enlarged window in Figure 1a further shows that the VATDE algorithm was closer to the real-time delay than the RAM and ANM algorithms. Therefore, the VATDE algorithm’s time delay estimation accuracy was higher, while the time delay estimation accuracy of the RAM algorithm was also slightly better than that of the ANM algorithm. Figure 1b shows the estimation of the number of time delays existing using the VATDE algorithm. However, since the ANM and RAM algorithms cannot provide an estimate of the number of time delays, the number of time delays can only be manually determined. It can be seen from Figure 1b that the VATDE algorithm provided an accurate estimation of the number of time delays with only a few simple iterations, which avoids the interference caused by false peaks when determining the true time delay estimation.
Figure 2a shows a comparison of the phase of the cross-power spectrum between the received signals in the case of no noise and the phase of the cross-power spectrum when the SNR was 10 dB. Figure 2b shows the performance of different algorithms for reconstructing the original cross-power spectrum phase from the noisy cross-power spectrum phase. It can be clearly seen that the reconstructed cross-power spectrum phase obtained by the proposed VATDE algorithm fitted the original cross-power spectrum phase perfectly. In addition, the reconstruction performance of the ANM algorithm was similar to that of the RAM algorithm. Although the trend of the reconstructed cross-power spectrum phase was roughly the same as that of the original cross-power spectrum phase, there was still a certain deviation. Clearly, the time delay estimation performance of the algorithm is was to the reconstruction performance of the cross-power spectrum. When the time delay estimation of the algorithm was more accurate, then the reconstructed cross-power spectrum phase becomes more suitable for the original cross-power spectrum phase.
Second, assuming the presence of a multipath influence, there were two time delays between the received signals of the two sensors, which signifies that there will be three time delays in the cross-power spectrum. In this paper, the magnitudes of the three time delays were set to [ 0.45 × 10 4   s , 4.75 × 10 4   s , 12.79 × 10 4   s ] , and the amplitudes were [ 1 , 0.9 , 0.7 ] , respectively, while the SNR was also set as 10 dB. This paper compared the time delay estimation performance of the VATDE algorithm proposed in this paper with those of the ANM and RAM algorithms.
Figure 3a shows the time delay estimation results of the VATDE, ANM, and RAM algorithms in the presence of three time delay values. It can be seen that, compared with the ANM and RAM algorithms, the VATDE algorithm not only provided a more accurate time delay estimation result, it also yielded a more accurate amplitude estimate. The time delay estimation results of the RAM algorithm were similar to those of the ANM algorithm. However, under the same parameter selection, the RAM algorithm had fewer false peaks than the ANM algorithm, which reduced the interference of false peaks to the time delay estimation results. Figure 3b once again shows the estimated performance of the VATDE algorithm for the number of time delays present. It still only took a few simple iterations to estimate the real number of time delays, which is very meaningful for time delay estimation, as it avoids the problem of distinguishing the real delay from the obtained time delay estimation results.
Figure 4a shows a comparison of the cross-power spectrum phase between the received signals in the case of no noise and the cross-power spectrum phase in the case of SNR 10 dB when there were three time delays. Compared with Figure 2a, the multiple time delays made the original cross-power spectrum phase more irregular; thus, it was more difficult to reconstruct the original cross-power spectrum phase from the noisy cross-power spectrum phase. Figure 4b shows the performance of different algorithms in reconstructing the original cross-power spectrum phase in the presence of three time delays. It is clear that the reconstructed cross-power spectrum phases of the ANM and RAM algorithms exhibited significant deviations from the original cross-power spectrum phase, due to their inaccurate time delay estimation and poor time delay amplitude estimation. Compared with the ANM algorithm, the reconstructed cross-power spectrum phase of the RAM algorithm could still reflect the original cross-power spectrum phase trend to a certain extent. This was because the reconstructed results of the ANM algorithm were greatly influenced by false peaks, and, thus, could not reflect the general trend of the original cross-power spectrum phase. Compared with the ANM and RAM algorithms, the reconstructed cross-power spectrum phase of the VATDE algorithm proposed in this paper still perfectly fitted the original cross-power spectrum phase, which once again reflects the superiority of the VATDE algorithm.
Next, the computational efficiency of the different algorithms was compared. In this study, 100 Monte Carlo simulations were carried out under the multipath assumption. The ANM and RAM algorithms in this paper were solved using the SDPT3 [26] solver in the CVX toolbox [27]. Based on reference [28], the time complexity of the CVX toolbox for solving the semi-positive definite problem was approximately O ( N 3 ) Thus, the time complexity of ANM was approximately O ( N 3 ) . Since the estimation result of the RAM algorithm was obtained using multiple iterations of the ANM algorithm, the time complexity of RAM was approximately O ( L N 3 ) , where L denotes the number of iterations. The time complexity of VATDE was approximately O ( N 2 ) . Obviously, VATDE was the most computationally efficient in theory.
The average running times of the ANM, RAM, and VATDE algorithms in MATLAB 2020a are shown in Table 1. Compared with the ANM and RAM algorithms, VATDE clearly had the shortest running time. The running times of the ANM and RAM algorithms were much longer than that of the VATDE algorithm. The running time of the RAM algorithm was a multiple of the running time of the ANM algorithm, and the specific value depended on the number of iterations. In this paper, the number of iterations was set to three and it can be seen that RAM took three times as long as ANM to complete. The simulation results were consistent with the theory and prove the high computational efficiency of the VATDE algorithm.
Of course, for the ANM, RAM, and VATDE algorithms, the running time was affected by the number of sampling points. When the number of sampling points is higher, the running time will be longer. However, as shown by the results in Figure 5, when the number of sampling point varied from 30 to 300, the running time and sampling points did not change linearly, and the multiplier of the increase in running time was greater than the multiplier of the increase in sampling points. Therefore, when the running time is long due to a large number of sampling points, the calculation efficiency can be improved by dividing the dataset into multiple small sets of sampling points. In addition, the results in Figure 5 show that the VALSE algorithm possessed computational advantages compared with the ANM and RAM algorithms at different numbers of sampling points, and the computational efficiency was more than 10 times that of the ANM algorithm.
Based on the above calculation efficiency comparison, it can be concluded that the VATDE algorithm proposed in this paper is more likely to be applied to real-time data processing.
Finally, the root-mean-squared error (RMSE) and SNR curves were plotted to show the time delay estimation accuracy of the three algorithms. Considering the different uses of time delay estimation, this paper divided the accuracy of delay estimation into two categories. One is the time delay estimation accuracy in the case of a single time delay without considering the multipath effect, which is usually applied to the geometric positioning problem in passive sonars [9]. The second category is the time delay estimation accuracy with multiple time delays in the multipath environment. The multipath effect will lead to more than three peaks in the cross-correlation function. The depth estimation problem of underwater targets can be solved by estimating multiple time delays [10].
Meanwhile, the signal simulation remained unchanged. The settings of single time delay and multiple time delays were the same as those in the above simulation. When SNR changed from 0 dB to 20 dB, then the RMSE of the ANM, RAM, and VATDE algorithms was compared, and each result was obtained using 500 Monte Carlo simulations. The simulation results are shown in Figure 6. Figure 6a shows the RMSE and SNR curves under a single time delay, while Figure 6b shows the RMSE and SNR curves under multiple time delays. It can be seen that the RMSE of the VATDE algorithm was lower than that of the ANM and RAM algorithms in the cases of both single and multiple time delays. Especially in a low SNR environment, the time delay estimation accuracy of the ANM and RAM algorithms deteriorated significantly, while VATDE still had high time delay estimation accuracy. In addition, the RMSE of the ANM and RAM algorithms was approximately equal with a single time delay, but with multiple time delays, the RMSE of the RAM algorithm was lower than that of the ANM algorithm. This may be because the RAM algorithm had fewer false peaks than the ANM algorithm. Moreover, it can be seen from the RMSE results in Figure 6a,b that the RMSE curves of the ANM, RAM, and VATDE algorithms in Figure 6b were much more than three times those in Figure 6a, which may have been due to the mutual interference between multiple time delays, which reduced the accuracy of time delay estimation. If the post-processing method of the cross-correlation function is adopted, that is, only the peak region of the cross-correlation function is searched separately, avoiding the search of the whole continuous time region, then the accuracy of time delay estimation with multiple time delays may be further improved. This is the next step of this study, and will not be discussed here.

5. Conclusions

In this paper, a time delay estimation algorithm based on variational Bayesian inference was proposed. All parameters in this algorithm were obtained by approximating its posterior probability distribution through variational Bayesian inference. Therefore, the advantages of the algorithm in this paper are that, firstly, it does not require any parameter adjustment and avoids the delay estimation error caused by improper parameter selection. Secondly, the algorithm presented in this paper is a grid-less time delay estimation method, which avoids the problem of mismatch between the grid and the real-time delay, and breaks the limitation of sampling rate, so that a time delay of arbitrary size can be estimated. Thirdly, the performance of the delay estimation was significantly improved by accounting for the uncertainty of the time delay estimation rather than calculating only the point estimate. Finally, the simulation results showed that, even compared with other current state-of-the-art grid-less time delay estimation methods, the proposed algorithm had higher time delay estimation accuracy in the case of a single time delay and multiple time delays. In particular, in a low SNR environment, the time delay results could still be accurately estimated, which makes it suitable for passive sonar signal processing.
As an outlook, the next research direction following on from this paper is to find a better variational approximation to improve the performance of the algorithm, especially in situations where the time delays are very closely spaced. Moreover, this paper hopes to further improve the computational efficiency of the algorithm for use in large-scale scenarios.

Author Contributions

Conceptualization, Y.L.; methodology, C.C.; resources, H.H.; data curation, F.D.; writing—original draft preparation, F.D.; writing—review and editing, C.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (No. 62001469).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data in this study has been included in this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Narayana Murthy, B.H.V.S.; Yegnanarayana, B.; Kadiri, S.R. Time Delay Estimation from Mixed Multispeaker Speech Signals Using Single Frequency Filtering. Circuits Syst. Signal Process. 2019, 39, 1988–2005. [Google Scholar] [CrossRef] [Green Version]
  2. Jakubisin, D.J.; Buehrer, R.M. Performance, Complexity, and Receiver Design for Code-Aided Frame Synchronization in Multipath Channels. IEEE Trans. Commun. 2015, 63, 3363–3376. [Google Scholar] [CrossRef]
  3. Quazi, A. An overview on the time delay estimate in active and passive systems for target localization. IEEE Trans. Acoust. Speech Signal Process. 1981, 29, 527–533. [Google Scholar] [CrossRef] [Green Version]
  4. Junejo, N.U.R.; Esmaiel, H.; Zhou, M.; Sun, H.; Qi, J.; Wang, J. Sparse Channel Estimation of Underwater TDS-OFDM System Using Look-Ahead Backtracking Orthogonal Matching Pursuit. IEEE Access 2018, 6, 74389–74399. [Google Scholar] [CrossRef]
  5. Junejo, N.U.R.; Esmaiel, H.; Sattar, M.; Sun, H.; Khalil, M.A.; Ullah, I. Sea Experimental for Compressive Sensing-Based Sparse Channel Estimation of Underwater Acoustic TDS-OFDM System. Wirel. Commun. Mob. Comput. 2022, 2022, 2523196. [Google Scholar] [CrossRef]
  6. Zhang, T.; Liu, S.; He, X.; Huang, H.; Hao, K. Underwater Target Tracking Using Forward-Looking Sonar for Autonomous Underwater Vehicles. Sensors 2019, 20, 102. [Google Scholar] [CrossRef] [Green Version]
  7. Wawrzyniak, N.; Stateczny, A. MSIS Image Postioning in Port Areas with the Aid of Comparative Navigation Methods. Pol. Marit. Res. 2017, 24, 32–41. [Google Scholar] [CrossRef] [Green Version]
  8. Kazimierski, W.; Zaniewicz, G. Determination of Process Noise for Underwater Target Tracking with Forward Looking Sonar. Remote Sens. 2021, 13, 1014. [Google Scholar] [CrossRef]
  9. Shao, Y.; Liu, F.; Zheng, G.; Jiang, F. A Method for Improving Three-Point Ranging Accuracy under Low SNR. In Proceedings of the IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC), Dalian, China, 1–5 September 2019. [Google Scholar] [CrossRef]
  10. Li, H.; Xu, Z.; Yang, K.; Duan, R. Use of multipath time-delay ratio for source depth estimation with a vertical line array in deep water. J. Acoust. Soc. Am. 2021, 149, 524–539. [Google Scholar] [CrossRef]
  11. Knapp, C.; Carter, G. The generalized correlation method for estimation of time delay. IEEE Trans. Acoust. Speech Signal Process. 1976, 24, 320–327. [Google Scholar] [CrossRef] [Green Version]
  12. Park, H.-R.; Li, J. A Frequency-Domain SPICE Approach to High-Resolution Time Delay Estimation. IEEE Wirel. Commun. Lett. 2017, 7, 360–363. [Google Scholar] [CrossRef]
  13. Ge, F.-X.; Shen, D.; Peng, Y.; Li, V.O.K. Super-Resolution Time Delay Estimation in Multipath Environments. IEEE Trans. Circuits Syst. I Regul. Pap. 2007, 54, 1977–1986. [Google Scholar] [CrossRef] [Green Version]
  14. Song, Q.; Ma, X. High-Resolution Time Delay Estimation Algorithms Through Cross-Correlation Post-Processing. IEEE Signal Process. Lett. 2021, 28, 479–483. [Google Scholar] [CrossRef]
  15. Wei, S.; Li, W.; Su, Y.; Liu, R. Off-Grid Sparse Bayesian Inference with Biased Total Grids for Dense Time Delay Estimation. J. Shanghai Jiaotong Univ. 2022, 27, 1–9. [Google Scholar] [CrossRef]
  16. Park, Y.; Seong, W.; Choo, Y. Compressive time delay estimation off the grid. J. Acoust. Soc. Am. 2017, 141, EL585–EL591. [Google Scholar] [CrossRef] [Green Version]
  17. Xenaki, A.; Gerstoft, P. Grid-free compressive beamforming. J. Acoust. Soc. Am. 2015, 137, 1923–1935. [Google Scholar] [CrossRef] [Green Version]
  18. Yang, Z.; Xie, L. Enhancing Sparsity and Resolution via Reweighted Atomic Norm Minimization. IEEE Trans. Signal Process. 2015, 64, 995–1006. [Google Scholar] [CrossRef]
  19. Kormylo, J.; Mendel, J. Maximum likelihood detection and estimation of Bernoulli—Gaussian processes. IEEE Trans. Inf. Theory 1982, 28, 482–488. [Google Scholar] [CrossRef]
  20. Soussen, C.; Idier, J.; Brie, D.; Duan, J. From Bernoulli–Gaussian Deconvolution to Sparse Signal Restoration. IEEE Trans. Signal Process. 2011, 59, 4572–4584. [Google Scholar] [CrossRef]
  21. Mardia, K.V.; Jupp, P.E. Directional Statistics; Wiley: New York, NY, USA, 2000. [Google Scholar]
  22. Badiu, M.-A.; Hansen, T.L.; Fleury, B.H. Variational Bayesian Inference of Line Spectra. IEEE Trans. Signal Process. 2017, 65, 2247–2261. [Google Scholar] [CrossRef] [Green Version]
  23. Zhang, Q.; Zhu, J.; Gu, Y.; Xu, Z. Gridless Variational Direction-of-Arrival Estimation in Heteroscedastic Noise Environment. IEEE J. Ocean. Eng. 2021, 46, 1313–1329. [Google Scholar] [CrossRef]
  24. Murphy, K.P. Machine Learning: A Probabilistic Perspective; MIT Press: Cambridge, MA, USA, 2012. [Google Scholar]
  25. Bishop, C.M.; Recognition, P. Machine Learning (Information Science and Statistics); Springer: Secaucus, NJ, USA, 2006. [Google Scholar]
  26. Toh, K.-C.; Michael, J.T.; Reha, H.T. SDPT3—A MATLAB software package for semidefinite programming, version 1.3. Optim. Methods Softw. 1999, 11, 545–581. [Google Scholar] [CrossRef]
  27. Grant, M.; Boyd, S. CVX: MATLAB Software for Disciplined Convex Programming, Version 2. March 2014. Available online: http://cvxr.com/cvx (accessed on 12 January 2022).
  28. Boyd, S.; Stephen, P.; Lieven, V. Convex Optimization; Cambridge University Press: Cambridge, UK, 2004. [Google Scholar]
Figure 1. Single time delay estimation performance of different algorithms in the case of SNR 10 dB. (a) Time delay estimation result, (b) time delay number estimation.
Figure 1. Single time delay estimation performance of different algorithms in the case of SNR 10 dB. (a) Time delay estimation result, (b) time delay number estimation.
Jmse 11 00194 g001
Figure 2. Phase reconstruction performance of different algorithms in the case of SNR 10 dB. (a) Original signal phase and noisy, (b) reconstruction results of signal phase by different algorithms.
Figure 2. Phase reconstruction performance of different algorithms in the case of SNR 10 dB. (a) Original signal phase and noisy, (b) reconstruction results of signal phase by different algorithms.
Jmse 11 00194 g002
Figure 3. Multiple time delay estimation performance of different algorithms with an SNR of 10 dB. (a) Time delay estimation result, (b) time delay number estimation.
Figure 3. Multiple time delay estimation performance of different algorithms with an SNR of 10 dB. (a) Time delay estimation result, (b) time delay number estimation.
Jmse 11 00194 g003
Figure 4. Signal reconstruction performance of different algorithms with an SNR of 10 dB. (a) Original signal phase and noise, (b) reconstruction results of signal phase by different algorithms.
Figure 4. Signal reconstruction performance of different algorithms with an SNR of 10 dB. (a) Original signal phase and noise, (b) reconstruction results of signal phase by different algorithms.
Jmse 11 00194 g004
Figure 5. Running time under different numbers of sampling points.
Figure 5. Running time under different numbers of sampling points.
Jmse 11 00194 g005
Figure 6. RMSE of time delay estimation. (a) Single time delay, (b) multiple time delays.
Figure 6. RMSE of time delay estimation. (a) Single time delay, (b) multiple time delays.
Jmse 11 00194 g006
Table 1. RUNNING TIME (INTELR CORETM i7-10510U CPU @ 1.80 GHz).
Table 1. RUNNING TIME (INTELR CORETM i7-10510U CPU @ 1.80 GHz).
AlgorithmANMRAMVATDE
Times (s)148.1816444.54487.8203
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ding, F.; Chi, C.; Li, Y.; Huang, H. Variational Bayesian Inference Time Delay Estimation for Passive Sonars. J. Mar. Sci. Eng. 2023, 11, 194. https://doi.org/10.3390/jmse11010194

AMA Style

Ding F, Chi C, Li Y, Huang H. Variational Bayesian Inference Time Delay Estimation for Passive Sonars. Journal of Marine Science and Engineering. 2023; 11(1):194. https://doi.org/10.3390/jmse11010194

Chicago/Turabian Style

Ding, Feilong, Cheng Chi, Yu Li, and Haining Huang. 2023. "Variational Bayesian Inference Time Delay Estimation for Passive Sonars" Journal of Marine Science and Engineering 11, no. 1: 194. https://doi.org/10.3390/jmse11010194

APA Style

Ding, F., Chi, C., Li, Y., & Huang, H. (2023). Variational Bayesian Inference Time Delay Estimation for Passive Sonars. Journal of Marine Science and Engineering, 11(1), 194. https://doi.org/10.3390/jmse11010194

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop