The processing steps of the proposed method based on inverse mapping with target contour constraints are as follows. First, CSAR sub-aperture images are obtained through the process of sub-aperture division and BP imaging, and then the sub-aperture image preprocessing is executed. Second, the pre-processed sub-aperture image is binarized to obtain the projection points and contour points of targets on the imaging plane. Then, the projection points of targets in multi-aspect images are inversely mapped to 3D space, under the constraints of inverse mapping range and height calculated with target contour information. Finally, the scattering probabilities of scattering candidates are calculated in the 3D grid, and the scattering points are then selected in multi-aspect angles to reconstruct the 3D point clouds of targets.
3.1. CSAR Sub-Aperture Image Preprocessing
After dividing the circular synthetic aperture into multiple short-curve apertures according to the CSAR data acquisition parameters, the 2D coherent imaging for multi-aspect echo data is conducted via the BP algorithm, resulting in CSAR sub-aperture images with nearly identical range and azimuth resolutions.
The varying motion errors associated with sub-aperture flight paths can result in geometric distortion and radiation distortion between multi-aspect sub-aperture images. While traditional image registration algorithms that rely on image intensity or gradient information can effectively correct geometric distortion, they are susceptible to nonlinear radiation distortion (NRD), which can impair the reliability of feature matching. The radiation-variation insensitive feature transform (RIFT) algorithm has the potential to enhance the reliability of feature detection by leveraging the phase consistency and to circumvent the constraints of gradient information for feature description by employing the maximum index map, thereby improving the robustness of NRD correction [
28]. Accordingly, the geometric distortion and the radiation distortion between multi-aspect sub-aperture images are corrected by using the RIFT algorithm, which facilitates the conversion of these images to a uniform coordinate system.
It is acknowledged that background pixels near target pixels in multi-aspect images may potentially impact the reconstruction quality of 3D point clouds. Low-rank sparse decomposition (LRSD) of multi-aspect CSAR sub-aperture images is performed to separate target pixels and background pixels before inverse mapping. The projection points of the height target vary with the sub-aperture azimuth, whereas the inherent background points remain constant in multi-aspect sub-aperture images. According to the distribution of target points, background points, and speckle noise in the CSAR sub-aperture image, we construct the LRSD model for the CSAR sub-aperture image:
where
is the hybrid matrix formed by the splicing of adjacent sub-aperture images.
represents the background matrix independent of the aspect angle, satisfying the low-rank constraint.
represents the projection matrix of the height target varied with the aspect angle, which has the sparse characteristic.
represents the noise matrix in the sub-aperture image.
and
are the rank of the matrix
and the sparsity of the matrix
, respectively.
and
are the low-rank and sparse parameters, respectively.
According to the LRSD model in Equation (14), the problem of separating target pixels and background pixels can be transformed into the following optimization problem:
where
represents the
-norm of the matrix. The purpose of the optimization problem in Equation (14) is to minimize the extraction error of target pixels after LRSD.
Go-decomposition (GoDec) is a robust and efficient LSRD method in noisy cases [
29]. We use the GoDec algorithm to resolve the above optimization problem, the processing step of which is as follows. Initially, the optimization problem in Equation (15) is transformed into two subproblems:
where
and
are the global solutions of the above subproblems, respectively.
Then, the low-rank and sparse approximations based on bilateral random projections (BRP) are used to solve the subproblems in Equation (16) alternately. The low-rank approximate matrix
and the sparse approximate matrix
of the hybrid matrix
are
respectively, where
is the power exponent used to adjust the estimation error of the low-rank matrix
.
and
are the QR decomposition results of the right random projection matrix
.
and
are the QR decomposition results of the left random projection matrix
. The BRP matrices and
are constructed based on the random matrices and
, respectively.
and
are the number of rows and columns for the hybrid matrix
, respectively.
represents the sampling projection of the matrix about the set
, which is the non-zero subset comprising the first
largest elements of the matrix
.
The sparse component of multi-aspect CSAR sub-aperture images can be solved by using the GoDec algorithm, which represents the projection points of targets in multi-aspect images. Subsequently, the 3D point clouds of height targets are reconstructed through the implementation of inverse mapping for the obtained projection points.
3.2. Inverse Mapping with Target Contour Constraints
For anisotropic targets with limited azimuth persistence, due to the influence of non-target background points in the CSAR sub-aperture image, false points will be generated outside the actual scattering points. This phenomenon occurs when the IMV method is used to reconstruct the 3D point clouds for targets. Therefore, it is necessary to consider the contour distribution of the anisotropic target in multi-aspect CSAR sub-aperture images, which can constrain the coordinate range of scattering candidates formed by inverse mapping, thereby improving the quality of 3D point clouds. The processing steps for inverse mapping with target contour constraints are as follows.
First, the projection points of targets in the imaging plane are extracted based on the preprocessed CSAR sub-aperture image. The pixel whose amplitude exceeds the threshold
is designated as the projection point of the target and assigned a value of 1. The pixel whose amplitude is below the threshold
is designated as the non-target background point and assigned a value of zero. For vehicle targets and building targets, an accurate projected image of the target can be obtained when the threshold
is set to twice the average pixel value of the CSAR sub-aperture image. To reduce the influence of amplitude-distribution differences in multi-aspect CSAR sub-aperture images on the selection of projection points for the target, we convert absolute amplitudes in CSAR sub-aperture images to relative amplitudes according to Equation (18), thereby obtaining the more accurate projected image for targets. The projection points of targets in the obtained projected image
are used as the input for inverse mapping with target contour constraints:
where
is the CSAR sub-aperture image with relative amplitudes, and
represents the mean function.
Next, the contour points of targets on the imaging plane are extracted from the preprocessed CSAR sub-aperture image. The pixel amplitude corresponding to the elevation scattering point is relatively weak in the CSAR sub-aperture image, while the pixel amplitude corresponding to the scattering point on the imaging plane is relatively strong. Therefore, we binarize the CSAR sub-aperture image via threshold to obtain the image , which consists of the strong scattering points for targets on the imaging plane. Here, the selection of the threshold is related to the scattering coefficient distribution for targets on the imaging plane. The desired contour image, comprising strong scattering points, can be obtained by setting the semi-automatic threshold based on the average pixel value of the CSAR sub-aperture image and the unique weighting coefficient for different types of targets.
The morphological gradient (MG) method extracts the target contour by subtracting the eroded image from the dilated image [
30]. The computational complexity of the MG method is about
, where
is the number of CSAR imaging grid points, and
is the number of iterations for the dilation operation and the erosion operation. The MG method is an efficient and effective method for extracting the contour image of the conventional vehicle. However, due to the influence of interference points caused by the strong scattering points with height, there are still some non-contour points outside the target contour extracted by the MG method for some complex targets, affecting the extraction accuracy of the target contour. The fuzzy C-means method combining spatial neighborhood information (FCM-SNI) employs the object function with neighborhood constraints to determine the membership of the strong scattering points after setting the cluster center [
31]. The computational complexity of FCM-SNI is about
, where
is the number of strong scattering points in the CSAR sub-aperture image,
is the number of cluster centers, and
is the maximum number of iterations. The computational complexity results indicate that FCM-SNI has a comparable computational efficiency with the MG method. Therefore, FCM-SNI is used to eliminate the non-contour points outside the target from the strong scattering image.
The cluster center of the strong scattering image is initially determined according to the geometric center of the target and its number. Subsequently, the membership
of the
projection point in the strong scattering image to the
cluster center can be solved by minimizing the objective function
where
is the number of strong scattering points in the neighborhood, and
is the coordinate of the
strong scattering point in the neighborhood.
is the fuzzy weighting coefficient, and
is the neighborhood constraint operator of the
strong scattering point. The expression of the neighborhood constraint operator is
Then, the membership update equation of the
strong scattering point is
where
and
are the coordinate average of projection points located in the neighborhood of the
strong scattering point. The contour points of the target corresponding to the
cluster center can be obtained by judging whether strong scattering points belong to the
cluster center according to their membership degree.
After eliminating the non-contour points in the strong scattering image
by using the acquired membership, the contour image
of the target in the imaging plane can be obtained. According to the distribution of target contour points in the contour image
, the coordinate ranges of inverse mapping in the X-axis and the Y-axis can be calculated as
and
, respectively. Moreover, we calculate the height coordinate of inverse mapping based on the distance from the projection point in the image
to the contour point in the image
. The distance
between the projection point and the contour point with the largest amplitude in the direction of the aspect angle
is
where
is the 2D coordinate of the projection point, and
is the 2D coordinate of the corresponding contour point.
By substituting the distance in Equation (22) into the geometric transformation relationship in Equation (4), the maximum height coordinate for inverse mapping of the target projection point can be calculated as
Then, we perform inverse mapping on the projection points of the target in the image
under the constraints of the inverse mapping range and height to obtain the scattering candidates within the target contour. The scattering probability at each 3D grid point after the inverse mapping with target contour constraint can be calculated as
where
is the 3D coordinate of the scattering candidate in the 3D point cloud grid,
,
, and
.
is the weighted coefficient related to the quality of the
projected image, and
represents the value of the scattering candidate corresponding to the projection point in the
projected image after inverse mapping with target contour constraints. When the coordinate of the scattering candidate satisfies the contour constraints, the value of the scattering candidate is assigned to one; otherwise, it is assigned to zero.
Figure 5 shows the inverse mapping diagram of the projection points in two aspect angles under the constraints of the target contour. In
Figure 5, the blue and green solid points represent the projection points in two aspect angles, respectively. The corresponding scattering candidates are located on the blue and green inverse mapping curves. The blue and green dotted circles represent false points located within the target contour, which will not affect the quality of 3D point clouds, while the red and yellow solid points indicate the scattering candidates located on the target surface. The 3D point cloud reconstructed by the IMV method will include black solid points, which are false points outside the target surface, while the inverse mapping method with target contour constraints can eliminate the above false points. All scattering candidates of the target can be obtained through the implementation of inverse mapping with target contour constraints on multi-aspect CSAR sub-aperture images.
Finally, we calculate the scattering probabilities for the obtained scattering candidates and then select the actual scattering points of the target by comparing the calculated scattering probabilities in multi-aspect angles. The scattering candidate with the highest scattering probability among the scattering candidates, which is generated by the projection points of targets in a given azimuth, is selected as the actual scattering point. Note that if the maximum scattering probability of projection points for a given target is less than the threshold for selecting the actual scattering point, all of the scattering candidates are deemed to be false. The actual scattering points of the target can be identified by conducting the above scattering probability selection on all scattering candidates in multi-aspect angles. The elevation coordinates of the actual scattering points are assigned to the corresponding 3D grid points to realize high-quality reconstruction of 3D point clouds for targets.
The processing steps for reconstructing the 3D point cloud of the observation scene with multiple targets are as follows. First, the pixel areas of multiple targets are divided from multi-aspect CSAR sub-aperture images. Second, the threshold coefficients for different types of targets are set to extract their accurate contour image. Then, the above method based on inverse mapping with target contour constraints is applied to reconstructing the 3D point cloud of each target. Finally, the obtained 3D point clouds of all targets are placed in the corresponding positions of the 3D point cloud grid of the entire observation scene.