Next Article in Journal
Surface Motion for P-Wave Scattering by an Arbitrary-Shaped Canyon in Saturated Half-Space
Previous Article in Journal
The Impact of Salinity and Nutrient Regimes on the Agro-Morphological Traits and Water Use Efficiency of Tomato under Hydroponic Conditions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A 2D-DOA Sparse Estimation Method with Total Variation Regularization for Spatially Extended Sources

1
School of Mechanical and Automotive Engineering, Qingdao University of Technology, Qingdao 266520, China
2
Key Laboratory of Ministry of Education of China, Qingdao University of Technology, Qingdao 266520, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2023, 13(17), 9565; https://doi.org/10.3390/app13179565
Submission received: 20 July 2023 / Revised: 19 August 2023 / Accepted: 21 August 2023 / Published: 24 August 2023
(This article belongs to the Section Acoustics and Vibrations)

Abstract

:
In this paper, a novel two-dimensional direction of arrival (2D-DOA) estimation method with total variation regularization is proposed to deal with the problem of sparse DOA estimation for spatially extended sources. In a general sparse framework, the sparse 2D-DOA estimation problem is formulated with the regularization of extended source characteristics including spatial position grouping, acoustic signal block sparse, and correlation features. An extended sources acoustic model, two-dimensional array manifold and its complete representation, total variation regularization penalty term, and the regularization equation are built, and are utilized to seek the solutions where the non-zero coefficients are grouped together with optimum sparseness. A total variation sparse 2D-DOA estimation model is constructed by combining total variation regularization with LASSO. The model can be easily solved by the convex optimization algorithm, and the solving process can promote the sparsity of the solution on the spatial derivatives and the solution itself. The theoretical analysis results show that the steps of decorrelation processing and angle matching of traditional 2D-DOA estimation methods could be avoided when adopting the proposed method. The proposed method has better robustness to noise, better sparsity, and faster estimation speed with higher resolution than traditional methods. It is promising to provide a coherent sources sparse representation of a non-strictly sparse field.

1. Introduction

Estimation of the direction of arrival is an important research issue in array signal processing, which has been widely used in radar, sonar, speech processing, and wireless communication [1,2,3,4,5,6]. At present, the main azimuth estimation methods can be divided into subspace algorithm [7], maximum likelihood algorithm [8], sparse reconstruction algorithm [9], and beamforming algorithm [10,11]. The far-field point source model with sound source energy concentrated at discrete angles is often taken as the research object in the above algorithms; the algorithms also lay a theoretical foundation for DOA estimation of the spatially extended sources model, which is characterized by a small number of beats, dense distribution of multiple targets and strong correlation of multiple targets. For example, the algorithms can be used in the analysis of flat circular piston [12,13], wind turbine blade [14], and underwater objects in sonar imaging [15].
Malioutov D. et al. [16] presented a source localization method based on a sparse representation of sensor measurements with an overcomplete basis composed of samples from the array manifold. The sparse reconstruction algorithms by sound source spatial orientation distribution model of the sparse sex parameter estimation problem can be transformed into the traditional sparse spectrum estimation problem of sparse array. This method does not need complex statistical values to set the initial variable, which can overcome the low signal-to-noise ratio, under the condition of coherent signal source algorithm performance degradation or even failure fault [17]. The Orthogonal Matching Pursuit (OMP) [18], as an example of compressive sensing methods, is a type of novel high- resolution DOA estimation algorithm. Despite using the OMP algorithm for DOA estimation which has many advantages in comparison to other algorithms, it cannot distinguish between two adjacent signal sources. The FOMP algorithm [19,20] is an improved version of the OMP algorithm. It can detect two signal sources exactly when they are very close and the beam former has a single peak corresponding to a direction between right directions.
All of the compressive sensing algorithms for DOA estimation should satisfy the in- coherence condition [21]. Thus, almost all of these algorithms fail when the maximum pairwise coherence of the measurement matrix exceeds a constant value. However, the DOA estimation problem is usually underdetermined, and a stable solution can be obtained by the regularization technique. The 𝓁1-norm regularization has gradually become a hot spot because it can reduce the computational amount of sparse recovery and assume the role of using its sparse promotion characteristics to obtain high resolution [22,23,24,25,26]. The LASSO proposed by Tibshirani [27] can provide high-resolution DOA maps even in the presence of coherent sources or single snapshot data [28]. However, the LASSO cannot effectively promote the grouping of related positions considering that the extended source has the characteristics of grouping (referring to the grouping of related positions) and the amplitude distribution with the phase linearity and block sparsity [29], therefore it is not suitable for recovering multipole or spatially extended sources. In order to discuss these problems involving the sound features, the elastic net model [30] combines the sparse promotion feature using 𝓁1-norm with the component clustering feature based on 𝓁2-norm which is proposed to achieve the balance between smoothness and sparsity. However, this model can only achieve a smooth solution and cannot promote a piecewise constant outline. The weighted LASSO [31], namely WL, imposes more general geometric or structural constraints on solutions by constructing weighted structural matrices, and different matrices form total variation models such as focused total variation (FTV) [32], generalized total variation (GTV) [33], and total variation(TV) [25]. The total variation model has been widely used in image processing and one-dimensional (1D) DOA estimation in recent years [29] because it can suppress noise components at smaller scales while retaining larger feature edges in images [34].
Two-dimensional-DOA estimation has been widely used because it can better describe the spatial characteristics of the target. Lu et al. [35] proposed a 2D-DOA estimation algorithm based on the symmetric rotation invariable relation of non-circular signal characteristics, which is able to directly estimate the target position by constructing an objective function integrating the information of multiple observation stations without relying on the distribution model. Z. Zheng et al. [36] proposed a 2D-DOA estimation algorithm based on a double parallel line array combining rotation invariance and propagation operator method. This algorithm does not rely on the distribution model, which avoids spectral peak search and feature decomposition of the high-dimensional sample covariance matrix required by the traditional algorithm. However, an additional angle-matching process is required, and the array structure is complex. Since the ESPRIT algorithm has the advantage of automatic matching of two-dimensional parameters, a 2D-DOA estimation algorithm that could be accurately estimated in a multipath environment was proposed by Wei et al. [37]. However, in the case of coherent sources [38,39], the algorithm needs traditional decorrelation processing such as spatial smoothing. In contrast, sparse reconstruction algorithms with better performance are rarely used in 2D-DOA estimation.
Inspired by the above analysis, a 2D-DOA estimation method based on total variation (TV-norm)regularization is proposed for spatially extended sources. The spatially extended sources are remodeled by the grouping and correlation features. A planar array two-dimensional manifold matrix and its over-complete representation are constructed to achieve sparse representation of sound sources. The total variation regularization matrix corresponding to the structural characteristics is constructed to facilitate the formation of piecewise constant or linear solutions. Combining with the total variation regularization and LASSO, the 2D-DOA total variation sparse estimation model is constructed, and the accurate orientation estimation is realized by the convex optimization algorithm. This method does not prescribe the eigen-decomposition of the high-dimensional sample covariance matrix and avoids the extra angle matching and the de-coherence step of the conventional method. Meanwhile, it can realize the joint estimation of the pitch angle and azimuth angle of extended sources with high detection probability. The effectiveness of the method is verified by numerical simulation and experiments, and the performance of the method is evaluated and discussed.

2. Problem Formulation

Consider a planar array with dimensions M × N and M = N placed on the X Y plane, where M and N represent the number of elements in the   X   axis and Y   axis direction, respectively. The elements are evenly distributed with the interval d = 0.5 λ , where λ is the incident wavelength. Assuming the sampling time t , t = 1 , 2 , , L , L is the number of snapshots, the far-field narrowband sound sources s t = [ s t 1 , s t 2 , , s t N s ] T impinging on this array, where N s represents the number of sound sources, θ k , ϕ k represents the 2D incident wave arrival direction of the k th sound source, and k = 1 , 2 , , N s , θ k , and φ k respectively represent the incident azimuth and pitch angle, θ k 0 ° , 90 ° , φ k 0 ° , 360 ° .
The DOA estimation problem can be expressed as a linear equation. A typical array observation model for receiving signals at t time can be expressed as
y t = A s t + n t
where y t denotes the array output signal vector, A denotes the 2D array manifold matrix, n t denotes the additive noise vector, σ 2 is the mean square error, and the mean value of σ 2 is zero, which is independent from the sound source.

2.1. Spatial Extended Source Model

Given the above, and as shown in Figure 1, the spatially extended sources are remodeled by multiple monopole sources arranged in a certain shape, amplitudes, and the phase relationship between adjacent monopoles. Thus, multiple monopoles can be arranged as continuous linear sources with constant or linear amplitudes and random uniform phases. According to the significant grouping characteristics of spatially extended sources, its model can be established:
s t = s t 1 s t g s [ 1 ] , s t g + 1 s t 2 g s [ 2 ] , s t N s g s t N s s [ G ]
where g is the grouping length and G depicts the number of groups, N s = G × g . Further considering the correlation of monopoles in the groups, taking the first group s [ 1 ] as an example and the first monopole source s t 1 in the group as a reference, the k t h monopole source s t k in the group can be expressed as:
s t k = β k e j Δ φ k s t 1
where β k is the amplitude fading factor k = 1 , , g and Δ φ k represents the phase difference of s t k related to s t 1 , without losing the generic form, taking β k = 1 and Δ φ k = 0 .

2.2. 2D Array Manifold Matrix

As the key parameter of Equation (1), the array manifold matrix A directly affects the accuracy of DOA estimation. Considering the spatial geometric relationship between the extended sources and the array shown in Figure 1, the time difference τ between the transmissions of the k t h incident monopole source from the first sensor 0 , 0 to X t h sensor m , n can be defined as
τ X ( θ k , φ k ) = d c m 1 cos θ k sin φ k + n 1 sin θ k sin φ k
where c is the propagation velocity of sound waves in the air at standard atmospheric pressure. The X t h sensor’s received acoustic signal can be expressed as-
y t X = 1 M N k = 1 K a X s t k + n t
where a X = e j ω τ X , ω denotes the angular frequency of incident wave, ω = 2 π f , and f denotes the incident wave frequency.
The direction vectors of the k t h monopole source along the X axis and Y axis directions can be represented by the Vandermonde matrix
a x = [ 1 , e j 2 π d cos θ k sin φ k / λ , , e j 2 π ( M 1 ) d cos θ k sin φ k / λ ] T / M
a y = [ 1 , e j 2 π d sin θ k sin φ k / λ , , e j 2 π ( N 1 ) d sin θ k sin φ k / λ ] T / N
where a x and a y are the array steering vectors of the N s incident sound sources in the X axis and Y axis directions respectively, and the array manifold matrix of the axis directions X and Y are expressed
A x ( θ , φ ) = a x θ 1 , φ 1 , a x θ 2 , φ 2 , , a x θ K , φ K
A y ( θ , φ ) = a y θ 1 , φ 1 , a y θ 2 , φ 2 , , a y θ K , φ K
Since the 2D generalized direction vector can be approximately expressed as the Kronecker product of two 1D direction vectors along the X   axis and Y axis directions [39], and the array physical structure determines the array manifold matrix, therefore, the 2D direction vector of a planar array can be defined as:
a θ k , φ k = a x a y = a x θ 1 , φ 1 a y θ 1 , φ 1 a x θ 2 , φ 2 a y θ 2 , φ 2 a x θ K , φ K a y θ K , φ K
where a denotes the 2D directional vector and denotes the Kronecker products. The corresponding 2D array manifold matrix can be constructed as:
A = a θ 1 , φ 1 , a θ 2 , φ 2 , , a θ K , φ K
where A is the array manifold matrix combining with the axis directions X and Y.

2.3. Sparse DOA Estimation Model

In the sparse representation of DOA estimation, the overcomplete representation of the two-dimensional array manifold matrix A can be represented as:
A ¯ = a θ 1 , φ 1 , a θ 2 , φ 2 , a θ k , φ k , a θ P , φ P
where A ¯ is the overcomplete representation of A . a θ k , φ k is the steering vector, k = 1 , 2 , , P , P N s . θ 1 , φ 1 , θ 2 , φ 2 , , θ P , φ P is the set of all potential source positions. In the sparse DOA estimation, A ¯ is known and independent of the actual source positions.
The source position is represented as a P × 1 vector, in which only N s elements are nonzero, so the source is sparse in space. The number of nonzero elements in P × 1 vector corresponds to the angle position of the actual sound source. The array model of sparse representation can be expressed as:
y t = A ¯ s t + n t
In summary, the sparse DOA estimation problem is to find out the position index of all nonzero elements in the P × 1 vector under the given observation vector y t and overcomplete array manifold matrix A ¯ , which is also the goal of this paper.
From Equations (12) and (13), it can be found that the count of nonzero items is an ideal measure of sparsity, denoted by s 0 , which is called 𝓁0-norm. However, this is a difficult combinatorial optimization problem, therefore, the 𝓁1-norm is often adopted to replace the 𝓁0-norm, thus the problem can be transformed into a convex optimization problem. In this case, the global optimal solution can be solved through linear programming, namely, sparse regularization. The solution can be expressed as:
s ( μ 1 ) = arg min S y A ¯ s 2 2 + μ 1 s 1
Equation (14) is a traditional sparse DOA estimation model named LASSO. When there are only a small number of sound sources, the model can significantly improve the resolution of sparse DOA estimation. However, when the sound source is a spatially extended source with obvious grouping and block sparse characteristics, the sparsity of a single numerical solution domain cannot show the block sparse characteristics well and cannot effectively represent the sparse of sources. Therefore, the total variation is introduced and a 2D total variation sparse DOA estimation method is proposed.

3. 2D-DOA Total Variation Regularization

In this section, a novel 2D-DOA sparse estimation method with total variation regularization is present concretely. The definition of total variation matrix is given and the corresponding total variation regularization term is constructed, which is combined with the traditional LASSO to build the model of 2D-DOA sparse estimation. The proposed method (abbreviated 2DTV-CES) can utilize the group’s characteristics (block sparse) and spatial derivatives to promote the sparsity both on the spatial derivatives of the solution and on the solution itself.

3.1. Total Variation Regularization

Total variation regularization depends on the local smoothing characteristics or discrete gradient sparse of the spatially extended source, which can greatly promote the block sparsity of the sound source solution and gives a piecewise constant or linear fitting, which is used in the setting of the coordinate closely related to its adjacent coordinate (local correlation) in the real model, and inhibits the single noise peak. Firstly, in order to clarify the meaning of total variation, the variation (discrete gradient) definition is carried out for each of the two adjacent coefficients in the coefficient vector of a monopole source. Secondly, according to the variation relationship between each of the two adjacent coefficients, the total variation matrix D t v is constructed using 2D spatial derivatives. Finally, the total variation matrix D t v is used to regularize the source matrix S , and the total variation sparse solution is realized by convex optimization.

3.1.1. Total Variation Definition

Suppose that the source S matrix with dimension M × N , according to the column sorting matrix, S i , j represents any element in the matrix, i is the row vector index, j is the column vector index, i , j = 1 , , M . The variation of any row and adjacent elements in the matrix is defined as:
Δ S h i j = S i , j j = 1 S i , j S i , j 1 1 < j M
Δ S v i j = S i , j i = 1 S i , j S i 1 , j 1 < i N
where Δ S h and Δ S v represent the first-order X axis direction variation and Y axis direction variation of the adjacent elements of matrix S , respectively. The 2D total variation is defined as:
T V s = i , j Δ S h i , j + Δ S v i , j

3.1.2. Total Variation Matrix

The total variation matrix D t v is built from one-dimensional first-order total variational operator D 1 d . It is defined as a M 1 × M dimensional banded matrix. The sum of each row of the matrix is zero, and any element d i , i in row i t h is assigned the value −1, d i , i + 1 is assigned the value 1, and the rest is zero.
D 1 d = 1 1 0 0 0 0 0 1 1 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 1
The matrix D t v is generated by removing the first and last rows of the autocorrelation matrix of D 1 d . The total variation matrix D t v can be expressed as:
D tv = 1 2 1 0 0 0 1 2 0 0 0 0 1 0 0 0 0 0 1 2
Let S Z = S S T , then for the source matrix S , there is:
D tv S Z = D t v S S T

3.1.3. 2D-DOA Total Variation Regularization Model

When Equation (19) is substituted into Equation (14), the sparse DOA estimation model can be transformed into a 2D-DOA total variation regularization model:
s = arg min s y A ¯ s 2 2 + μ 1 D tv S z 1   = arg min s y A ¯ s 2 2 + μ 1 D tv S 1 + μ 1 D tv S T 1
In order to ensure the sparsity of the target solution s , the LASSO and two-dimensional total variation are combined based on Equation (20), which improves the sparsity of the coefficients (i.e., there are few active coefficients in the solution) and the sparsity of the continuous coefficient difference (i.e., the sparsity of the sparse contour flatness). As a result, the proposed method is suitable for the applications involving block sparsity or mixed peaks and flat regions. Therefore, the final 2D-DOA total variation sparse estimation model is expressed as:
s = arg min s y A ¯ s 2 2 + μ 1 D t v S Z 1 + μ 2 s 1
where μ 1 and μ 2 are regular parameter.
From the above derivation process, it can be seen that the method (2DTV-CES) does not need the traditional decoherence processing at the cost of losing the antenna aperture, and does not estimate the azimuth angle and the pitch angle respectively, therefore, the angle pairing is avoided totally and the algorithm is simplified sharply.

4. Numerical Simulations

The 2DTV-CES performance in 2D-DOA estimation is evaluated by simulations starting with spatially extended sources. We consider a planar array with M = 16 sensors and spacing d = λ / 2 . The complete array manifold matrix A ¯ in Equation (21) is defined on angular grids θ 0 ° : 1 ° : 90 ° and φ 0 ° : 1 ° : 90 ° . The spatially extended sources are divided into two categories with the same amplitudes. Case 1 is at DOAs θ 5 ° : 1 ° : 15 ° and φ 5 ° : 1 ° : 15 ° , Case 2 is at DOAs θ 18 ° : 1 ° : 28 ° and φ 25 ° : 1 ° : 35 ° , and two-point source DOAs are (3°, 5°) and (6°, 8°), respectively. The spatially extended sources are modeled as a series of monopoles with the equal constant amplitudes 1 dB with uniform distribution phase in Case 1, and the linear amplitude [1:0.1:0.6] dB with random uniformly distributed phase and two-point sources in Case 2. Monte Carlo experiments were introduced to compare the performance of 2DTV-CES, LASSO [23], and the Improved ESPRIT [30] in the presence of the same spatially extended sources under different SNR and different sparsity K .

4.1. Performance Metrics

The detection probability, as the accuracy evaluation factor of focusing directions, is built to evaluate the performance of the three different algorithms. When the absolute value of the difference between the estimated focusing direction s ^ ( θ i , φ j ) and the actual focusing direction s ( θ i , φ j ) is less than or equal to 2°, the detection is defined as correctly detecting. The greater the detection probability, the higher the accuracy.
The detection probability is defined as the relative root-mean-square error at the focusing direction g .
P g = 1 q g = 1 q s g ( θ i , φ j ) s ^ g ( θ i , φ j ) 2 1 q g = 1 q s g ( θ i , φ j ) 2
where q denotes the number of Monte Carlo experiments, s g ( θ i , φ j ) is the actual focusing direction, and s ^ g ( θ i , φ j ) is the corresponding focusing direction estimation.

4.2. Performance of TV-Norm Regularization

The results in Figure 2 and Figure 3 indicate qualitatively the performance of total variation regularization (TV-norm) of 2D-DOA estimation in the presence of two kinds of spatially extended sources in Case 1 and Case 2 under high-SNR (SNR = 20) and sparsity ( K = 10).
Figure 2 shows the 2D-DOA’s estimations of spatial spectrum distribution and DOA’ s map of 2DTV-CES and LASSO in Case 1. The results demonstrate the performance of total variation regularization (TV-norm) to promote the block sparsity of solution.
Figure 2a shows the spatial spectrum of the spatially extended sources with the equal amplitudes in Case 1. The amplitude of the solution coefficients is strict with the constant value 1 dB, and each focusing direction can be clearly distinguished. The solution coefficient is divided into two groups distinctly. It is clear that the TV-norm can promote the piecewise constant solution so that the solution coefficients are closely related to its neighbors, which reflects the block sparse characteristics of the spatially extended sources.
Figure 2b,c shows the 2D-DOA’s estimation maps of 2DTV-CES and LASSO respectively. It can be seen that the 2DTV-CES can accurately estimate all focusing directions with TV-norm regularization. There are 19 focusing directions of 21 directions that can be accurately distinguished, and only 2 focusing directions with 1° deviation. They are also within the allowable error range ±2°, and can be regarded as effective estimation. Moreover, the two groups of focusing direction can be effectively separated. Relatively speaking, the estimation accuracy of LASSO is poor. There are only 11 focusing directions out of 21 directions that can be identified and are unable to distinguish between different groups. This is because LASSO can only promote solution sparsity in the data domain but not in the gradient domain.
In contrast, 2DTV-CES not only promotes the sparsity in the data domain but in the gradient domain with TV-norm. Therefore, 2DTV-CES has a better capacity for the promotion of block sparsity.
Figure 3 depicts the 2D-DOA’s estimations of spatial spectrum distribution and DOA’s map of 2DTV-CES and LASSO in Case 2. The results show 2D-DOA’s estimates of complex extended sources mixed with point sources, to demonstrate the promotion of sparsity and clustering by the 2DTV-CES method.
Figure 3a shows the spatial spectrum of 2D-DOA estimation. It can be seen that when there are point sources and extended sources at the same time, the constant solution can be obviously separated into three groups by 𝓁1-norm and the TV-norm regularization of 2DTV-CES. The amplitude of the solution coefficients is strict with the constant value 1 dB, and each focusing direction can be clearly distinguished. The method can effectively identify the focusing directions of extended sources and two single-point sources.
Figure 3b,c demonstrates the 2D-DOA maps of the method 2DTV-CES and LASSO, respectively. The results show that 2DTV-CES can estimate all focusing directions. Especially, there are 19 directions with zero deviation, and 2 directions have 1° deviation. Only 11 of 21 directions can be estimated by LASSO with more than 1° deviation. Both methods can estimate two peaks within the allowable ±2° deviation. Compared with the DOA’s estimation, the 2DTV-CES is better than LASSO, since 2DTV-CES introduces and utilizes TV-norm regularization on the basis of LASSO, which promotes piecewise constant and peak solutions.

4.3. Performance of SNR and Sparsity K

The results in Figure 4 and Figure 5 indicate the performance of 2DTV-CES, LASSO, and Improved ESPRIT as a function of sparsity K and SNR, respectively. Figure 4 shows the detection probability of the 2DTV-CES, LASSO, and the Improved ESPRIT with the different sparsity K. The performance of three different methods in 2D-DOA’s estimation at the range of 8 + Δ k ° , 15 + Δ k ° , Δ k 1 : 1 : 9 , and the sparsity K at the range of [2:1:10] is evaluated. Each 2D-DOA is obtained by 1000 Monte Carlo experiments. It can be seen that the detection probability of 2DTV-CES gradually increases up to 98% and tends to stabilize with the increasing of the sparsity; 2DTV-CES has even better estimation performance in the case of high sparsity than that of lower ( K  < 3). When K is lower, the detection probability of LASSO fluctuates slightly between 97 and 100%, and the estimation performance is significantly better than that of the 2DTV-CES and the Improved ESPRIT. This phenomenon is caused by the weak grouping of an extended source with lower K , and the sparse promotion feature of the LASSO can achieve high resolution. However, with the increasing of K , the extended sources exhibit strongly grouped structural characteristics. Based on the sparsity promotion feature, the 2DTV-CES utilizes the TV-norm regularization to promote the formation of segmented constant distribution, resulting in a significantly better performance than the other two methods. To summarize, 2DTV-CES is not sensitive to the change of sparsity when its values are more than 3 and has good robustness.
With the same 2D-DOA estimation conditions as shown in Figure 4, the performance of 2DTV-CES, LASSO, and the Improved ESPRIT in the presence of additive noise at a range of [−5:5:20] dB SNR is evaluated. The results are shown in Figure 5.
Figure 5 describes the performance of the three methods under different SNRs. As can be seen, the average detection probability of 2DTV-CES is up to 75% and is 40% higher than that of LASSO with the increase of SNR. Obviously, the SNR robustness of 2DTV-CES is higher than that of the other two methods in the whole range of SNR [−5:1:20]. As described, the performance of SNR robustness of 2DTV-CES is optimal. In the case of exceeding the limitation of the array element, there is still a high detection probability at a smaller SNR.
The performance of the proposed method 2DTV-CES, the LASSO, and the Improved ESPRIT is compared. The spatial average detection probability is taken as the metric, and statistical analysis is carried out from three aspects: sparsity K , signal-to-noise ratio SNR, and computing speed. The results are shown in Table 1.
As shown in Table 1, the spatial average detection probability of 2DTV-CES is more than 85% with different sparsity and SNR. The average computing speed is only 29 s. Compared with the other two methods, 2DTV-CES has the uppermost spatial average detection probability under the same sparsity and SNR conditions and the fastest computing speed with the narrowest range of computing speed variation. Improved ESPRIT has the lowest detection probability and the longest detection time and it is not suitable for sparse DOA estimation of extended sound sources. The results further illustrate the advantages of the proposed method 2DTV-CES.

5. Experimental Results and Discussion

An experimental study is conducted to examine 2D-DOA sparse estimation by 2DTV-CES. The measurements were conducted in the semi-anechoic chamber (500 m3, cut-off frequency 20 Hz) at the Qingdao University of Technology, China, with a 16-channel 2D-random array of 1 m diameter. Its dimensions are 4 × 4 and the microphone spacing d was 8.5 cm. In this experiment, there were two different types of spatially extended sources. One type was a continuous source consisting of four monopole sources in Figure 6a, the other was a dipole source with two loudspeaker drivers closely driven in antiphase in Figure 6b. The two types of extended sources were driven by 2000 Hz acoustic signals and were oriented so that the sources’ plane is parallel to the array plane. The sources’ plane was located at z = 0 cm and the microphone array at z = 100 cm. According to the Nyquist sampling theorem, the sampling frequency of 4096 Hz was taken to obtain 10 s date records. Figure 6c depicts the 2D-DOA measurement metric in experimental procedure.
In order to verify the focusing and block sparsity features of TV-norm regularization of the 2DV-CES, the coefficient distributions of sound source sparse solution of the three methods were calculated by using continuous sources, and the results are shown in Figure 7. The central position of the continuous source is x = 17 cm, y = 42 cm. The azimuth coverage is [5°:1°:50°], and the pitch angle is [15°:1°:18°]. According to the order of 2D-DOA’s estimation, the DOA’s estimation was divided into three groups namely a, b, and c. The length of each group is different. The length of group a is 83, group b is 75, and group c is 46. The corresponding amplitudes of group a are linear values decreasing from 1 to 0.6, group b are constant values 1 mixed with linear values, and group c are constant values 1. The array test data were analyzed by 2DTV-CES, LASSO and Improved ESPRIT methods on the MATLAB numerical platform, and the calculated results were shown in Figure 7a, Figure 7b and Figure 7c, respectively. As shown in Figure 7a, 2DTV-CES clearly shows the division of three complete groups, the amplitude profile of each group is strictly displayed according to the set value, and each 2D-DOA can be effectively estimated. In Figure 7b, the results of LASSO can also be depicted in three groups named a, b, and c. The length of each group is short comparing with it in Figure 7a, and multiple values within the group are overlaped. The amplitude profile of each group is the constant value 1 and did not show any linear changes. Figure 7c shows the results of the Improved ESPRIT method. There are no group a and only a few single values in group b and group c. Comparing the three methods, it can be concluded that 2DTV-CES has good block focusing, sparsity, and can estimate every DOA value completely and clearly. LASSO shows good sparsity, but it cannot clearly estimate all DOAs with constant amplitude and cannot identified any DOAs with linear amplitude. Improved ESPRIT cannot achieve DOA’s estimation of extended sound sources. This result further verifies the role of TV-norm regularization and is consistent with the simulation results in Figure 2 and Figure 3.
Furthermore, to test the 2DTV-CES method regarding the robustness of SNR and sparsity K , the detection probability of the continuous sources and dipole source are analyzed. Figure 8 and Figure 9 show the results with the change of SNR [−5:1:20] and sparsity K [2:2:20]. From the DOA’s estimation of both continuous source and dipole source, Figure 9 DOA values are taken as detection objects, and the change of their detection probabilities with SNR and K are observed. This conclusion further verifies the 2DTV-CES method’s robustness and is consistent with the simulation results in Figure 4 and Figure 5.
As stated in the review, through the above two kinds of experiments, the advantages of 2DTV-CES in data sparsity promotion and SNR robustness are effectively demonstrated, and the experimental basis for sparse representation of non-sparse fields is provided.

6. Conclusions

In this paper, a novel 2D-DOA sparse estimation method with total variation regularization is proposed. It is intended to solve the sparse estimation of a non-strictly sparse sound field using the regularization of sound sources’ characteristics. The main conclusions are as follows.
The spatially extended source model is constructed using spatial positions’ grouping features of sound sources and sound signals’ correlation characteristics. The two-dimensional sparse representation of sound sources is realized by constructing a two-dimensional array manifold matrix and its overcomplete representation.
These characteristics, spatial grouping, block sparse, and correlation, are converted into regularization penalty terms by constructing the total variation regularization term. It promotes the sparsity of the block sparse solution on the spatial derivatives domain. Furthermore, combined with the 𝓁1-norm regularization term, the 2D-DOA sparse estimation model is built to be based on a joint estimate pitch angle and azimuth angle of an extended source by the convex optimization algorithm. This process promotes sparsity both on the spatial derivatives of the solution and on the solution itself, thus rapidly seeking solutions where the nonzero coefficients are grouped together.
The numerical and experimental results demonstrate that the method with regularization penalty term significantly improves the sparsity and the resolution and detection probability in the 2D-DOA sparse estimation. It also has certain robustness to noise. The solving process avoids the feature decomposition, decoherence, and angle pairing of the high-dimensional sample covariance matrix in the traditional methods. The method is promising for the capability of sparse representation, not necessarily sparse sound fields (e.g., in acoustic near-fields, reflective environments).

Author Contributions

Conceptualization, Z.L. (Zhihong Liu) and Q.L.; methodology, Z.L. (Zhihong Liu); software, Z.L. (Zunmin Liu) and C.L.; validation, Z.L. (Zhihong Liu), C.L. and Q.L.; formal analysis, Q.L.; investigation, Z.L. (Zunmin Liu); resources, C.L. and Q.X.; data curation, Q.X.; writing—original draft preparation, Z.L. (Zhihong Liu) and C.L.; writing—review and editing, Q.L. and Z.L. (Zhihong Liu); visualization, C.L.; supervision, Z.L. (Zunmin Liu); project administration, Q.L. and Z.L. (Zunmin Liu). All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Provincial Natural Science Foundation of Shandong under grant number ZR2023MF018 and the National Natural Science Foundation of China under grant number 61871447.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Fan, K.; Sun, C.; Liu, X.; Jiang, G. MIMO Sonar DOA Estimation with Joint Matched-Filtering Based on Transmission Diversity Smoothing. Northwestern Polytech. Univ. 2020, 38, 6–13. [Google Scholar] [CrossRef]
  2. Liaquat, M.U.; Munawar, H.S.; Rahman, A.; Qadir, Z.; Kouzani, A.Z.; Mahmud, M.A.P. Sound localization for ad-hoc microphone arrays. Energies 2021, 14, 3446. [Google Scholar] [CrossRef]
  3. Liu, M.M.; Dong, C.X.; Dong, Y.Y.; Zhao, G.Q. Super resolution 2D DOA Estimation for a Rectangular Array via Reweighted Decoupled Atomic Norm Minimization. Math. Probl. Eng. 2019, 2019, 6797168. [Google Scholar]
  4. Jeong, S.H.; Son, B.; Lee, J.H. Asymptotic performance analysis of the MUSIC algorithm for direction-of-arrival estimation. Appl. Sci. 2020, 10, 2063. [Google Scholar] [CrossRef]
  5. Geetha, P.; Nanda, S.J.; Yadav, R.P. A parallel chaotic sailfish optimization algorithm for estimation of DOA in wireless sensor array. Phys. Commun. 2022, 51, 101536. [Google Scholar]
  6. Zheng, G. DOA estimation in MIMO radar with non-perfectly orthogonal waveform. IEEE Commun. Lett. 2016, 21, 414–417. [Google Scholar] [CrossRef]
  7. Liu, S.; Yang, L.; Li, D.; Cao, H. Subspace extension algorithm for 2D DOA estimation with L-shaped sparse array. Multidimens. Syst. Signal Process. 2017, 28, 315–327. [Google Scholar] [CrossRef]
  8. Akhavan, S.; Bahabadi, M.A.; Norouzi, Y.; Lolaee, H. Direction of arrival estimation using array of antennas for low-altitude targets in multi-path environment. IET Radar Sonar Navig. 2016, 10, 1439–1445. [Google Scholar] [CrossRef]
  9. Qian, T.; Cui, W.; Shen, Q. Sparse reconstruction method for DOA estimation based on dynamic dictionary and negative exponent penalty. Chin. J. Electron. 2018, 27, 386–392. [Google Scholar] [CrossRef]
  10. Squali, L.; Riouch, F.; Tiyzi, H.; Squali, H. Hybrid Beamforming and DOA Algorithms for Smart Antenna System Deployed in 5G Network. Int. J. Microw. Opt. Technol. 2021, 16, 112–123. [Google Scholar]
  11. Liu, Y.; Yin, Y.; Li, R.; Fang, R. Flexible Null Broadening Robust Beamforming Based on JADE. Appl. Sci. 2022, 12, 9329. [Google Scholar] [CrossRef]
  12. Iliev, I.; Zhivomirov, H. On the spatial characteristics of a circular piston. Rom. J. Acoust. Vib. 2015, 12, 29–34. [Google Scholar]
  13. Hu, Z.; Wu, Q.; Zou, J.; Wan, Q. Fast and efficient two-dimensional DOA estimation for signals with known waveforms using uniform circular array. Appl. Sci. 2022, 12, 4007. [Google Scholar] [CrossRef]
  14. Cotté, B. Extended source models for wind turbine noise propagation. J. Acoust. Soc. Am. 2019, 145, 1363–1371. [Google Scholar] [CrossRef]
  15. Guerneve, T.; Subr, K.; Petillot, Y. Three-dimensional reconstruction of underwater objects using wide-aperture imaging SONAR. J. Field Robot. 2018, 35, 890–905. [Google Scholar] [CrossRef]
  16. Malioutov, D.; Çetin, M.; Willsky, A.S. A sparse signal reconstruction perspective for source localization with sensor arrays. IEEE Trans. Signal Process. 2005, 53, 3010–3022. [Google Scholar] [CrossRef]
  17. Huang, Q.; Fang, W. A Deep Learning Method for DOA Estimation with Covariance Matrices in Reverberant Environments. Appl. Sci. 2022, 12, 4278. [Google Scholar] [CrossRef]
  18. Aich, A.; Palanisamy, P. On-Grid DOA estimation method using orthogonal matching pursuit. In Proceedings of the 2017 International Conference on Signal Processing and Communication (ICSPC), Coimbatore, India, 28–29 July 2017. [Google Scholar]
  19. Dehghani, M.; Aghababaiyan, K. FOMP algorithm for Direction of Arrival estimation. Phys. Commun. 2018, 26, 170–174. [Google Scholar] [CrossRef]
  20. Aghababaiyan, K.; Shah-Mansouri, V.; Maham, B. High-Precision OMP-Based Direction of Arrival Estimation Scheme for Hybrid Non-Uniform Array. IEEE Commun. Lett. 2020, 24, 354–357. [Google Scholar] [CrossRef]
  21. Tropp, J.A.; Gilbert, A.C. Signal recovery from random measurements via orthogonal matching pursuit. IEEE Trans. Inf. Theory 2007, 53, 4655–4666. [Google Scholar] [CrossRef]
  22. Song, X.; Chen, B.; Yu, B. Error estimates for sparse optimal control problems by piecewise linear finite element approximation. J. Comput. Math. 2017, 1709, 09539. [Google Scholar] [CrossRef]
  23. Wu, D.; Luo, X. Robust latent factor analysis for precise representation of high-dimensional and sparse data. IEEE/CAA J. Autom. Sin. 2020, 8, 796–805. [Google Scholar] [CrossRef]
  24. Xenaki, A.; Gerstoft, P. Grid-free compressive beamforming. J. Acoust. Soc. Am. 2015, 137, 1923–1935. [Google Scholar] [CrossRef]
  25. Wang, H.Y.; Zhao, R.Z.; Cen, Y.G.; He, Q. Matric completion based on TV norm regularization and its application in image restoration. In Proceedings of the 2016 International Conference on Machine Learning and Cybernetics (ICMLC), Jeju, Republic of Korea, 10–13 July 2016. [Google Scholar]
  26. Ko, C.B.; Lee, J.H. Performance analysis of interferometer direction of arrival estimation under frequency mismatch of array manifold: DOA of frequency hopping signal. Appl. Sci. 2020, 10, 2331. [Google Scholar] [CrossRef]
  27. Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B Methodol. 1996, 58, 267–288. [Google Scholar] [CrossRef]
  28. Gerstoft, P.; Xenaki, A.; Mecklenbräuker, C.F. Multiple and single snapshot compressive beamforming. J. Acoust. Soc. Am. 2015, 138, 2003–2014. [Google Scholar] [CrossRef]
  29. Xenaki, A.; Fernandez-Grande, E.; Gerstoft, P. Block-sparse beamforming for spatially extended sources in a Bayesian formulation. J. Acoust. Soc. Am. 2016, 140, 1828–1838. [Google Scholar] [CrossRef]
  30. Zou, H.; Hastie, T. Regularization and variable selection via the elastic net. J. R. Stat. Soc. Ser. B Stat. Methodol. 2005, 67, 301–320. [Google Scholar] [CrossRef]
  31. Tibshirani, R.J. The Solution Path of the Generalized Lasso. Ph.D. Thesis, Stanford University, Stanford, CA, USA, 2011. [Google Scholar]
  32. Song, Y.M. The meanings of leisure constraints and facilitators in recreational specialization: A content analysis of specialized anglers’ interviews on F-TV. J. Tour. Sci. 2010, 34, 237–260. [Google Scholar]
  33. Di Serafino, D.; Landi, G.; Viola, M. Directional TGV-based image restoration under Poisson noise. J. Imaging 2021, 7, 99. [Google Scholar] [CrossRef]
  34. Rudin, L.I.; Osher, S.; Fatemi, E. Nonlinear total variation based noise removal algorithms. Phys. D Nonlinear Phenom. 1992, 60, 259–268. [Google Scholar] [CrossRef]
  35. Lu, Z.Y.; Wang, J.; Qin, T.; Ba, B. Direct position determination for coherently distributed noncircular source based on symmetric shift invariance. J. Electron. Inf. Technol. 2019, 41, 537–543. [Google Scholar]
  36. Yang, X.; Zheng, Z.; Ko, C.C.; Zhong, L. Low-complexity 2D parameter estimation of coherently distributed noncircular signals using modified propagator. Multidimens. Syst. Signal Process. 2017, 28, 407–426. [Google Scholar] [CrossRef]
  37. Wei, X.L.; Chen, J.; Lin, L. Two dimensional DOA estimation of coherent signals based on spatial smoothing method. J. Jilin Univ. Eng. Technol. Ed. 2008, 38, 245–247. [Google Scholar]
  38. Xu, C.; Kang, X.T.; Liu, Z.J.; Yang, Y.F.; Shi, Y.W. The new method based on the pair of two-dimensional array of parallel lines coherent signal DOA estimated. J. Chang. Univ. Sci. Technol. Nat. Sci. Ed. 2009, 32, 245–247. [Google Scholar]
  39. Zoltowski, M.D.; Haardt, M.; Mathews, C.P. Closed-form 2-D angle estimation with rectangular arrays in element space or beams pace via unitary ESPRIT. IEEE Trans. Signal Process. 1996, 44, 316–328. [Google Scholar] [CrossRef]
Figure 1. Acoustic incidence model of extended sources in planar array.
Figure 1. Acoustic incidence model of extended sources in planar array.
Applsci 13 09565 g001
Figure 2. In Case 1, the 2D-DOA’s estimation with and without TV-norm. (a) Spatial Spectrum of 2DTV-CES, (b) DOA’s map of 2DTV-CES, (c) DOA’s map of LASSO.
Figure 2. In Case 1, the 2D-DOA’s estimation with and without TV-norm. (a) Spatial Spectrum of 2DTV-CES, (b) DOA’s map of 2DTV-CES, (c) DOA’s map of LASSO.
Applsci 13 09565 g002
Figure 3. In Case 2, the 2D-DOA’s estimation with and without TV-norm. (a) Spatial Spectrum of 2DTV-CES, (b) DOA’s map of 2DTV-CES, (c) DOA’s map of LASSO.
Figure 3. In Case 2, the 2D-DOA’s estimation with and without TV-norm. (a) Spatial Spectrum of 2DTV-CES, (b) DOA’s map of 2DTV-CES, (c) DOA’s map of LASSO.
Applsci 13 09565 g003
Figure 4. The detection probability of different algorithms versus sparsity K .
Figure 4. The detection probability of different algorithms versus sparsity K .
Applsci 13 09565 g004
Figure 5. The detection probability of different algorithms versus SNR.
Figure 5. The detection probability of different algorithms versus SNR.
Applsci 13 09565 g005
Figure 6. Experimental setup: (a) Continuous sources and array, (b) Dipole source and array, (c) Angles’ measurement coordinates.
Figure 6. Experimental setup: (a) Continuous sources and array, (b) Dipole source and array, (c) Angles’ measurement coordinates.
Applsci 13 09565 g006
Figure 7. The continuous sources’ coefficient maps with three different source amplitude distributions (a) 2DTV-CES, (b) LASSO, (c) Improved ESPRIT.
Figure 7. The continuous sources’ coefficient maps with three different source amplitude distributions (a) 2DTV-CES, (b) LASSO, (c) Improved ESPRIT.
Applsci 13 09565 g007
Figure 8. The continuous sources’ detection probability with SNR = 10 and K = 8 (a) 2DTV-CES, (b) LASSO, (c) Improved ESPRIT.
Figure 8. The continuous sources’ detection probability with SNR = 10 and K = 8 (a) 2DTV-CES, (b) LASSO, (c) Improved ESPRIT.
Applsci 13 09565 g008
Figure 9. The dipole source’s detection probability with SNR = −5 and K = 4 (a) R 2DTV-CES, (b) LASSO, (c) Improved ESPRIT.
Figure 9. The dipole source’s detection probability with SNR = −5 and K = 4 (a) R 2DTV-CES, (b) LASSO, (c) Improved ESPRIT.
Applsci 13 09565 g009
Table 1. The performance comparison of the three methods.
Table 1. The performance comparison of the three methods.
Spatial Average Detection Probability (%)
2DTV-CESLASSOImproved ESPRIT
Sparsity K [2:2:20]87634
SNR [−5:1:20]86520
Computing Speed (s)28~3182~8890~95
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, Z.; Liu, Q.; Liu, Z.; Li, C.; Xu, Q. A 2D-DOA Sparse Estimation Method with Total Variation Regularization for Spatially Extended Sources. Appl. Sci. 2023, 13, 9565. https://doi.org/10.3390/app13179565

AMA Style

Liu Z, Liu Q, Liu Z, Li C, Xu Q. A 2D-DOA Sparse Estimation Method with Total Variation Regularization for Spatially Extended Sources. Applied Sciences. 2023; 13(17):9565. https://doi.org/10.3390/app13179565

Chicago/Turabian Style

Liu, Zhihong, Qingyu Liu, Zunmin Liu, Chao Li, and Qixin Xu. 2023. "A 2D-DOA Sparse Estimation Method with Total Variation Regularization for Spatially Extended Sources" Applied Sciences 13, no. 17: 9565. https://doi.org/10.3390/app13179565

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop