1. Introduction
In the field of array signal processing, the direction of the array has been extensively researched and developed over decades, and is widely used in radar, sonar, medical and other fields. After a long time of research, scalar sensor arrays have achieved an accurate estimation of signal parameters by various methods. The most classic of these is the MUSIC (multiple signal classification) algorithm. For example, in the article [
1,
2], the orthogonality of the signal subspace and noise subspace is used to construct a spatial spectral function, and then the parameters to be measured are estimated by traversing the spatial spectrum. In [
3], the ESPRIT (estimating signal parameter via rotational invariance techniques) algorithm based on rotational invariance techniques is applied, which makes full use of the rotational invariance between array elements and improves the speed of the algorithm by avoiding a tedious spectral peak search. However, in addition to the DOA angles
and
, sometimes, we need more signal parameters to ensure the accuracy and high resolution of the information. In these cases, the traditional scalar array obviously cannot meet our needs.
At this point, EM (electromagnetic) vector sensor arrays were first applied to collect electrical and magnetic information from impact signals. The reason is that the polarization components under the influence of electric and magnetic fields describe the vectorial properties of the EM wave. The estimation of this parameter facilitates the improvement of the system’s resolution and immunity to interference. The MUSIC algorithm, for example, is widely used in dipole antenna element arrays [
4,
5,
6,
7]. In [
5], the authors investigate the maximum element distance constraint (MISC) theory for sparse vector sensor arrays. The MISC theory is proposed, and the MUSIC algorithm based on the quaternion theory under the vector MISC array is proposed, which takes into account the advantages of the quaternion model, MISC array, and MUSIC, and improves the performance of the DOA and polarization information estimation. In addition to the MUSIC algorithm, the ESPRIT algorithm based on the rotational invariance technique in [
8,
9,
10,
11] has been applied to polarization-sensitive arrays. To reduce the computational complexity, a search-free algorithm centered on the ESPRIT algorithm is proposed in [
12] to estimate the four-dimensional parameters of the PSFDA-MIMO radar. This algorithm avoids the high-complexity problem of four-dimensional joint parameter estimation. Other subspace-based algorithms are often applied to vector sensor arrays, such as the propagation method (PM) and the parallel factorization (PARAFAC) algorithm [
13]. The algorithms listed above can be used to obtain the electromagnetic information of the signal using vector sensors, but the methods used rely on a spectral function search and eigenvalue decomposition; moreover, the longitudinal dimension of the signal covariance matrix is expanded to three times its original size under polarization-sensitive arrays, which brings a great computational cost and complexity.
To avoid this, scholars have applied sparse signal representation (SSR)-like algorithms to the estimation of the DOA and polarization information of polarization vector sensor arrays. For example, in [
14], an algorithm based on a sparse representation is proposed to estimate only the 1D dictionary and 2D DOA angles. A novel cross-covariance matrix without polarization components is constructed in the estimation process to decouple the DOA and polarization component information to solve the joint estimation problem. The useful information contained in the received signal is fully utilized, thus improving the estimation accuracy. Most of the above-mentioned multidimensional parameter estimation methods use a joint estimation or dimensionality reduction approach, and they fundamentally avoid the problems of spectral function search and feature decomposition, which can reduce the computational burden while ensuring the estimation accuracy and have more outstanding performance under a low signal-to-noise ratio and fast beat. However, the above algorithms still have some shortcomings. The sparse signal representation class of algorithms relies on the division of the grid, and off-grid error estimation is required when the results to be measured do not fall on the divided grid. This makes it difficult to achieve a balance between efficiency and accuracy in this class of methods. Alternatively, an iterative algorithm such as Newton’s method can be used for DOA estimation. In articles [
15,
16], the gradient descent algorithm and the Newton method are applied to DOA estimation with low complexity. In [
17], the joint angle–frequency estimation problem of a multidimensional ESPRIT algorithm is discussed, and a novel Frame–Newton method is proposed to solve the singular value decomposition of complex asymmetric matrices based on the Newton method. The Newton method is applied to [
16,
18], and the former article uses the phase-comparison monopulse method to obtain the initial angle in Newton’s method and defines a cost function to find the peak for the MUSIC spectral function. The latter paper uses the Newton method as a rooting technique for global polynomials and specifies a minimum set of initial points for the Newton method using the Newton diagram of the Root-MUSIC polynomial to ensure that at least one of the initial points will converge to the range closest to the root of the unit circle. The two algorithms proposed above avoid falling into the local optimum in the gradient descent class of algorithms, but the estimation accuracy is slightly worse because they do not use the second-order derivatives in the Taylor expansion for the optimization calculation.
In this paper, a multidimensional joint parameter estimation algorithm for multistart patterns is proposed. The algorithm is based on the classical Newton method and extends the iterative optimization search to four-dimensional scales using Hessian matrices. An iterative optimization algorithm is proposed for multistart patterns with partitioned null fields in order to minimize the trapping in local optima and improve the accuracy of the first estimation. The threshold function of Newton’s method is proposed and improved to maximize the low number of iterations and the accuracy of the parameter estimation. In addition, we use the improved Nystrom method [
19,
20] to reduce the longitudinal dimension of the signal covariance matrix, which further reduces the computational burden of the algorithm. Simulation results show that the proposed algorithm takes less computation time than the conventional algorithm under the condition of guaranteed accuracy, and the experimental results further verify its performance.
The following is the structure of this paper.
Section 2 introduces the array model used and its principles.
Section 3 introduces the multidimensional joint estimation algorithm based on the Newton iteration method.
Section 4 presents the detailed implementation process of the joint estimation algorithm for the multistart model, which includes the compression of the signal covariance matrix by the Nystrom method, the derivation of the multiorder derivatives and the iterative process under the corresponding array model, and the improvement of the threshold discriminator. The reliability and feasibility of the algorithm are verified by comparative experiments in
Section 5.
2. Signal Model
Consider a scenario where
K narrowband signals impinge upon an EM sensor array with an arbitrary manifold from far field, where the signals are fully polarized and mutually uncorrelated. To simplify the model analysis, the array are supposed to consist of
M tripole sensors, which collect the electrical field components of impinging signals. The whole model is described in
Figure 1.
The DOA and polarization parameters of the
kth incident signal can be expressed as
,
,
,
,
k = 1, 2, …,
K, where
,
∈ [0,
/2],
,
∈ [0, 2
]. Interference noise is Gaussian white noise distributed with 0 mean and variance
, which is uncorrelated with the incident signals. The array elements are irregularly distributed, and their position coordinates are
, where
. The source signals impinge the different sensors with a time delay, which leads to the phase difference in received signals. Usually, the phase difference can be expressed with the steering vector
.
where
,
are azimuth and elevation angles of the
signal, and
is the wavelength of the signal. According to [
10], each EM sensor receives the signal in a 3D vector, which is influenced by the polarization vector
where
is only composed of DOA parameters, and
includes the auxiliary polarization angle
and the polarization phase difference
The received signal
can be denoted as an algebraic sum of the scalar steering vector
, the polarization vector
, incident signal
, and the Gaussian white noise
under a certain number of snapshots
L, where
where ⊗ is the Kronecker product,
stands for the Kronecker product of
and
, and the Gaussian white noise
is a
vector. Regarding the covariance matrix of the received signal
, it can be expressed in terms of mathematical expectations
In practical engineering applications, the covariance matrix is represented by taking the average of multiple snapshots.
3. The Proposed Algorithm
The traditional MUSIC algorithm applies an on-grid search to estimate the DOA and polarization, which loses estimation accuracy and brings massive computation complexity. In this section, an off-gird algorithm is proposed based on the multidimensional MUSIC algorithm. After obtaining the multivariate spectral function, Newton’s method is applied to find the extreme values. During the process, the first- and second-order derivatives regarding , , , and are obtained by the spectral function. Then, a Hessian matrix is constructed to find out the extreme values, which locate the DOA and polarization results of the source signals. In order to avoid false results, a threshold determinant is applied at the end of the algorithm.
3.1. Processing Signals with the Nystrom Method
According to Equation (
5), we can model the signal in a polarized steering vector. However, the dimension of the obtained covariance matrix is expanded to three times its original size compared to the traditional model. This directly leads to an increase in the complexity of the algorithm and a slower operation speed. Therefore, in this paper, the Nystrom method is utilized to chunk the received signal model and reconstruct the signal covariance matrix.
The received signal
is chunked and partitioned into the first
columns
and the last
columns
according to the number of sources
and we obtain their respective subcovariance matrices
Finally, the signal covariance matrix after rank reduction using Nystrom’s method can be obtained by reconstructing
using the subcovariance matrix
.
In the matrix reconstruction using this method, the information of submatrix is not utilized, so the method saves some of the information used, and the dimension of the signal covariance matrix is reduced.
3.2. Rough Estimation
Before starting the rough estimation, we first use the singular value decomposition to obtain the noise subspace
of the signal
and use the steering vector
of the signal and the noise subspace
to obtain the spectral peak function
fThe rough estimation of the algorithm is then performed using the first-order Newton method. In this process, we need to use the original function of the spectral peak function
f and its first-order derivative
and
. We can obtain the gradient of
f for the four components’ (
) directions according to the matrix derivative rule. It is worth noting that due to the different coupling relationships between the four parameters, the steering vector of the scalar array and the DOA component participate in the derivative operation as a whole (
and
are present in DOA components
,
, and the
’s are present in polarization component
).
Similarly, we can find the first-order derivative in the direction of and by simply replacing and . Usually, first-order derivatives are represented and operated on in matrix form .
After several iterations of the first-order Newton method, a rough estimation can be described as
where
represents the initial position before the iteration, and
represents the optimization result after the iteration in a single iteration. Please see the detailed explanation of the algorithm flow about Newton’s method in the
Appendix A.
3.3. Accurate Estimation
The rough estimation is used as the initial point and the accurate estimation of the algorithm is performed using the second-order Newton method. However, unlike the rough estimation in this process, it is necessary to iterate using the first-order derivative matrix
of the spectral peak function
f and its second-order derivative. In Equations (14) and (15), we list in detail how to solve the second-order derivative equation based on the first-order derivative equation. There are four different solutions depending on the first-order derivative, which are the DOA components to polarization components for the partial derivatives, the DOA components to DOA components for the partial derivatives, the polarization components to polarization components for the partial derivatives, and the polarization components to DOA components for the partial derivatives.
Similarly, we can find the second-order derivative in all the directions by simply swapping
with
and
with
according to Equation (
16).
In the algorithm proposed in this paper, the given objective spectral function is complex and requires a large number of dimensions for the joint estimation. In order to make the four directional parameters keep the synchronization during an iteration, the Hessian matrix is introduced to simplify the problem in Equation (
17). The Hessian matrix was introduced in the 19th century by the German mathematician Ludwig Otto Hesse, and it is a diagonal array of second-order partial derivatives of multivariate functions that can be used to describe the local curvature of a function. When a multivariate real function
has second-order continuous partial derivatives in the neighborhood of the point
, then its Hessian matrix at the point
can be described as
The Hessian matrix consists of a set of second-order partial derivatives corresponding to each variable. Using the Hessian matrix, sufficient conditions for the determination of extreme-value points of multivariate functions can be summarized as
(positive definite) ⇔ the function has a minimal value at point .
(negative positive definite) ⇔ the function has a maximum value at point .
(nonpositive definite) ⇔ the function has no extreme value point at point .
After several iterations of the second-order Newton method, an accurate estimation can be described as
where second-order derivatives are represented and operated on in matrix form
,
is the result of the rough estimation, and
is the result of the accurate estimation after many iterations, where
has the form of a vector, and
has the form of a matrix
3.4. Threshold Determiner
In order to maximize the estimation accuracy while reducing the number of iterations of the algorithm, the threshold of the algorithm is improved in this paper. Unlike the traditional Newtonian thresholding discriminant function, we divide the original thresholding function into two parts. One part is used as the discriminant function of the global meritocratic algorithm, and the other part is used as the discriminant function of the basic meritocratic algorithm. Moreover, a function for controlling the algorithm cycle is added for each of them, and the search is reperformed if the current search result does not satisfy the minimum threshold conditions of the spectral function, gradient, and Hessian matrix. The following is the threshold discriminant step of the two-part optimization seeking algorithm. The threshold discriminator for the global optimization algorithm and basic optimization algorithm are shown in
Figure 2 and
Figure 3.
The main function of the global discriminator in
Figure 2 is to filter out a preliminary estimate that is close to the target to be measured. It is mainly judged by the height of the spectral peaks and whether or not the extreme points are reached. The process may be repeated three to five times until an ideal preliminary estimate is found.
When the initial set of random input points is not ideal, it may cause the global optimization result to fall into a local optimum (optimization results are considered to be trapped in a local optimum if the spectral function value at that point is nonzero). In this case, the discriminator will then repick the value of the initial set of points and optimize again until the estimated result is ideal.
When the signal-to-noise ratio is low, the convergence of the basic optimization algorithm becomes difficult because the effect of the Gaussian white noise on the signal becomes larger. When the number of iterations reaches a preset maximum (i.e., it fails to converge within the preset number), the discriminator adjusts the global optimization result in a very small neighborhood (, + ) and performs a new basic optimization again. This process is repeated until the desired result is obtained.
3.5. Algorithm Steps
The proposed algorithm in this paper can be roughly divided into four parts: the data-processing part, the global optimization part of the multistart model, the basic optimization part, and the discriminator part. The detailed steps of the algorithm are as follows.
Obtain the received signal
(k) by Equation (
5), chunk the matrix, and reconstruct the new covariance matrix
. Then, obtain the noise subspace
by Equation (
9).
Constructing the target spectral function iteratively for Newton’s method, the expression of the spectral function
f can be obtained according to Equation (
10)
Use Equations (11)–(17) to find the gradient and the Hessian matrix of f in the four directions of , , , and
Divide the neighborhood of the space equally into 9 parts, i.e., spaces 1 to 9, as shown in
Figure 4.
A starting point = [, , …, ] is randomly taken within each group of empty space. To ensure that the starting points put in the space can cover the whole empty space, try to keep the points in the adjacent empty space at a certain interval. The global optimization algorithm of the multistart pattern is performed in nine spaces. It is worth noting that in order to keep the four angle components [, , , ] highly iterative and synchronous during the global optimization, the first-order derivative matrix and the second-order derivative matrix should be involved in the form of vectors and matrices .
Update the starting point using the global optimization algorithm until convergence.
The obtained estimations are used as a starting point for the basic optimization algorithm and are further optimized to obtain higher-accuracy estimations.
A judgment is performed on
and
using the adjudicators in
Figure 2 and
Figure 3. When the judgment is finished,
is the estimated parameter. The DOA component is [
,
] = [
(1),
(2)], and the polarization component is [
,
] = [
(3),
(4)], where
refers to the
mth element of the matrix
.
4. Simulation Results
In order to show more clearly the efficiency of the proposed algorithm, we conducted experiments using Monte Carlo experiments. It is worth stating that in the experiments, in order to more conveniently and quickly reflect the advantages that the algorithm can work in any array manifold and to demonstrate the performance of the proposed algorithm in this paper, the experiments were all conducted under uniform linear arrays. In the experiments, we assumed that there were 10 tripole sensor arrays. Two uncorrelated far-field narrowband signals were impinging into the array, and
L = 300 snapshots were collected (the term “preconditioned” is used in the following to refer to “
M = 10,
K = 2,
L = 300”). All simulations were performed in MATLAB R2021a. The RMSE is defined as
where
L is the total number of experiments,
and
are the true and measured values of the
kth experiment.
In the first experiment, we conducted 300 Monte Carlo experiments on the proposed algorithm for different SNRs in the preconditioned state, observed the accuracy of the algorithm, and recorded its RMSE. The results show that the algorithm has good performance in solving the multidimensional joint parameter estimation problem and can be used in practical engineering.
Figure 5 and
Figure 6 are the pictures of error analysis of two directions. These two figures present the error between the estimated results and the true values, i.e., the RMSE analysis plots of the angles, for the two sets of target angles goal1 and goal2. There are four sets of data for each target angle, which are azimuth angle, elevation angle, auxiliary polarization angle, and the polarization phase difference.
In this paper, the proposed method can be applied to arrays of any shape, and compared to other algorithms where the array manifold must satisfy certain conditions (e.g., the trilinear decomposition in the propagator method must ensure that the array manifold can provide information in multiple dimensions), the application scenario is much broader. However, the estimation of the four parameters (, , , ) can be performed only using a uniform linear array, in the case where trilinear dipole array elements are used.
In addition, the effectiveness of the basic optimization algorithm process was verified by building noise with different structures and comparing the results. First, an ideal noise
with a diagonal array structure was added to a source
in the ideal case, and the results of the global optimization algorithm and the basic optimization algorithm were compared in the same estimation process (when the noise is ideal enough, the estimation error of the latter should be infinitely close to zero, and the estimation accuracy of the former is slightly worse).
Figure 7 shows the comparison of the global Newton method and the basic Newton method errors for each SNR in the ideal noise case, which confirms the validity of the basic Newton method.
4.1. RMSE Comparison
In the second experiment, the proposed method was used to compare the performance with various classical subspace algorithms, where the search range was [
,
], and the search step was
in the preconditions. In order to represent the simulation results more intuitively, we represented the RMSE in the form of a two-parameter number in the experiments comparing it with that of other algorithms in the preconditions (i.e., the straight-line distance between the true value and the measured value in the spatial domain). It can be seen in
Figure 8. The spectral peak search algorithm is limited by the grid density leading to its own performance limitation at the
step. In contrast, the proposed method is an iterative algorithm with a variable step size, which has a very high convergence accuracy. This experiment was conducted at 5 dB–25 dB, a SNR that is more common and prevalent in everyday life. When the SNR is higher, the performance of the four algorithms will be closer.
In the third experiment, a comparison of the two RMSEs of the proposed algorithm before and after utilizing the rank reduction method was conducted. In
Figure 9, it can be seen that the covariance matrix is less affected by the accuracy of the algorithm after the rank reduction. Almost the full accuracy of the algorithm is retained.
4.2. Time Comparison
In the fourth experiment, we performed a statistical comparison of the average time for a single run of each algorithm, as shown in
Figure 10. The spectral peak search algorithm improves the algorithm performance at the expense of time in order to ensure its own excellent estimation accuracy. Moreover, it continues to increase the computation time as the grid is refined. In the proposed method, in addition to the advantage of variable step size, the presence of the adjudicator further saves the algorithm a lot of time cost. It is worth mentioning that the average computation time of the proposed method is more irregular for different SNRs than other algorithms. The reason is that we cannot guarantee that the adjudicator termination threshold is at the same level for each SNR (i.e., it is possible that if the threshold is slightly higher, at SNR = 10, the adjudicator will increase the operation time in order to get a more accurate estimation).
In the fifth trial, The efficiency of the improved Nystrom method for the rank reduction was verified by comparing the average time of a single run of the proposed algorithm before and after the same condition of rank reduction, as shown in
Figure 11. In combination with the above analysis, it can be seen that our method not only retains the accuracy of the original algorithm to a great extent but also reduces the time used for a single computation of the algorithm significantly.