1. Introduction
In recent years, polarimetric synthetic aperture radar (PolSAR) images have been widely applied in land use and cover classification [
1], target detection and recognition [
2], natural disaster emergency management [
3], and urban planning [
4]. Many countries have successfully developed and launched PolSAR payloads, such as the AIRSAR [
5], Radarsat-2 [
6], and GF-3 [
7]. With the increase in PolSAR systems, an important issue is how to analyze the acquired images accurately and automatically. Among the interpretation techniques that have been proposed, the PolSAR classification algorithm is one of the most significant research branches. The purpose of the PolSAR classification algorithm is to assign objects of interest to different classes based on the obtained polarimetric information. Over the past few decades, many methods have been developed for PolSAR classification, which can be roughly generalized as conventional algorithms, machine learning algorithms, and deep learning algorithms [
8]. The classification accuracy of conventional algorithms is usually lower than machine learning and deep learning algorithms [
9]. However, they have still attracted extensive attention due to their simple expression and clear physical meaning [
10]. For conventional classification algorithms, statistical models used to characterize the distribution of PolSAR data are important. Statistical models can be generalized into two main categories based on the data used for modeling. One is to directly model the measured PolSAR data, such as the single-look scattering matrix [
11], the multi-look covariance/coherency matrix [
12], and the other is to model PolSAR data based on the polarimetric features [
13]. In this paper, the former category is adopted to model PolSAR data.
Kong et al. [
11] proposed the complex Gaussian-based maximum likelihood ML classification algorithm for single-look PolSAR data. Then, Lee et al. [
12] proposed the classification algorithm for multi-look PolSAR data based on the complex Wishart distribution. The complex Gaussian distribution is a common approach to modeling the scattering matrix, and the complex Wishart distribution is a widely used model for the covariance/coherency matrix. However, these two distributions only apply well to homogeneous regions, which means that there is a single class with no textures. To better characterize the statistical distribution in complicated scenes, product models were developed [
14]. Product models assume that the distribution model of PolSAR data is the product of two independent variables. One variable is a positive scalar, representing the texture of the PolSAR scene, and the other variable is considered to obey the complex Gaussian/Wishart distribution for describing speckle noise. Lee et al. [
15] proved that multi-look PolSAR data obeys the K-distribution. After that, many product models, such as G distribution [
16] and KummerU distribution, ref. [
17] were developed. Moreover, mixture models have also been proposed to model PolSAR data [
18,
19]. The Wishart mixture model (WMM), each component of which obeys the complex Wishart distribution, is a widely used mixture model for PolSAR data. For WMM, there are two strategies to model PolSAR data. One is to characterize the statistical distribution of the whole PolSAR image, and the other is to characterize the statistical distribution of a single class. With the continuous improvement in resolution, the distribution of PolSAR data has become more complicated. The performance of traditional statistical models in characterizing the distribution of PolSAR data is gradually declining. More effective statistical models are needed to describe the distribution of PolSAR data.
The number of looks is an important parameter for distribution models. There is a certain inter-pixel correlation in PolSAR data, which is equivalent to reducing the number of independent looks. Therefore, the nominal number of looks needs to be substituted by the equivalent number of looks (ENL). In [
20], the ENL is considered a free parameter that varies between different classes. The ENL is regarded as a shape parameter about texture. In this way, the ability of the Wishart model to characterize the statistical distribution is improved without increasing the mathematical complexity. Based on the idea, a new mixture model named the relaxed Wishart mixture model (RWMM) was first proposed. RWMM contains multiple mixture components that satisfy the Wishart distribution and each with a unique ENL. The ENL is no longer a known constant for the whole PolSAR image but a variable that makes the theoretical model fit the distribution of the real data more closely.
ML-based classification algorithms are widely used for PolSAR data. However, these algorithms only utilize the information of the target itself and do not consider the surrounding spatial information. Therefore, classification results are often affected by noise. To further improve classification accuracy, it is necessary to combine spatial information and statistical models. Markov random field (MRF) is a widely used probability model, which mainly quantifies the spatial-contextual information around the target [
21]. Over the past few decades, many researchers have applied MRF to image classification and segmentation algorithms for noise reduction. In [
22], MRF is used to obtain spatial-contextual information and combine spectral data to classify multispectral remote sensing images. Li et al. [
23] obtained the spatial-contextual information from MRF and combined it with the Gaussian mixture models to complete the classification work. Wu et al. [
24] proposed a region-based classification algorithm by introducing MRF into the Wishart model. Compared with traditional classification algorithms based on statistical models, the Wishart–MRF algorithm has a smoother classification map and higher classification accuracy.
Usually, the size and shape of a neighborhood are fixed when MRF is applied to classification algorithms. In other words, the same neighborhood model is used to obtain spatial-contextual information for each center pixel in an image. The drawback lies in that it may cause the disappearance of some fine structures and the blurring of regional boundaries. In [
25], adaptive neighborhood (AN) systems are used to preserve small features and border areas. Among different neighborhood alternatives in the AN system, the one that is the most appropriate for the local image around the center pixel is chosen to obtain spatial-contextual information in the MRF process. However, the selection criterion used is based on span, which is a sum of the three polarimetric intensity channels. A new criterion based on the covariance/coherency matrix is proposed in this paper. The new criterion makes effective use of polarimetric information to choose the most suitable neighborhood.
Combining the aforementioned RWMM and MRF, a new supervised classification algorithm is proposed: RWMM-MRFt. In the algorithm, RWMM is utilized to characterize the statistical distribution of the multi-look covariance/coherency matrices in a single class. MRF captures the spatial-contextual information of each center pixel through the AN system and the new selection criterion. The likelihood probability is obtained by RWMM, and after the prior probability is captured by MRF, each sample is classified into the class with the maximum posterior probability (MAP) in the classification algorithm.
The rest of this paper is organized as follows.
Section 2 first gives the basis of the PolSAR data, then statistical models and the proposed RWMM are introduced.
Section 3 gives the fundamentals of MRF and introduces the proposed RWMM-MRFt classification algorithm for PolSAR data.
Section 4 first gives the performances of the proposed RWMM, then the classification results of the proposed algorithm and other algorithms are presented. Further analysis is given in
Section 5, followed by the conclusion in
Section 6.
5. Discussion
In this paper, RWMM is used for supervised classification. For supervised classification, RWMM is used to characterize the statistical distribution of data belonging to a single class. The number of mixture components represents the number of subclasses in a single class. As
K increases, that is, a single class is divided into more and more subclasses, the modeling accuracy of RWMM on the data gradually increases first and then tends to stabilize. In [
18], the overall classification accuracy generally increases when
K is gradually increased to 10 and only slightly increases when
K is further increased. This is consistent with the theoretical analysis. We also analyzed the influence of the number of mixture components based on the simulated and real PolSAR data. The results show that
is sufficient for RWMM to model the PolSAR data accurately. When
K continues to increase, the improvement of the overall classification accuracy is very limited. Take the Boao region as an example; when K = 1, 3, 5, and 10, the OA of RWMM is 76.40%, 88.78%, 89.17%, and 89.84%, respectively. However, when K = 12, 15, and 20, the OA of RWMM is 89.84%, 89.89%, and 90.03%, respectively. Therefore, K = 10 is set for the proposed algorithm.
In fact, RWMM can also be used for unsupervised classifications. For unsupervised classifications, RWMM is used to characterize the statistical distribution of the whole PolSAR image, where the number of mixture components
K represents the number of classes in the scene. When
K is set too small, data of different classes will be classified into one class, and when
K is too large, data belonging to one class will be split. Therefore, an adaptive method for determining the number of mixture components is necessary for unsupervised classification. In [
39], a mixture component is preset at initialization, and then every few iterations, the current components are split and merged. A goodness-of-fit testing strategy is employed to control for splitting and merging. In this paper,
K is set to a large value at initialization, and a merging strategy can be performed in the last step of parameter estimation. This is to avoid splitting components. Compared to the method in [
39], although this method may be difficult to precisely determine the number of classes in the scene, it avoids re-estimating the parameters at each split. It should be emphasized that for supervised classification, it is not necessary to accurately estimate the number of mixture components.
6. Conclusions
This paper first proposes a new mixture Wishart model named RWMM. Compared with WMM, RWMM does not introduce new parameters, so the mathematical complexity of the statistical model does not increase. RWMM further improves the goodness of fi,t the model to the distribution of PolSAR data by relaxing the number of looks parameters. The EM algorithm is utilized to estimate parameters through an iterative process. The ML-based classification results of simulated and real PolSAR datasets show that RWMM has the highest overall classification accuracy and Kappa coefficient. It demonstrates that RWMM has greater flexibility and effectiveness than the Wishart, K-Wishart, G, and WMM models in modeling PolSAR data. Then, MRF is used to improve the classification results by fusing the spatial information in the PolSAR images. In the MRF process, an AN system is adopted, and a new criterion is proposed to further improve the overall classification accuracy. The new criterion can make more effective use of the PolSAR scattering information than the selection criterion based on span. Regardless of whether the statistical model used is WMM or RWMM, the classification results indicate that the proposed criterion based on the complex-kind HLT statistic can find a more suitable neighborhood than the criterion based on span.
Based on RWMM and the new selection criterion for the AN system, a new supervised classification algorithm named RWMM-MRFt is proposed. The proposed algorithm classifies PolSAR images according to the likelihood probability obtained by RWMM and the prior probability obtained by MRF. The classification results of the simulated and two real PolSAR datasets, including the San Francisco and Boao datasets, have shown that RWMM-MRFt has the highest overall classification accuracy and Kappa coefficient among the mixture model-based MRF classification algorithms. Overall, the classification accuracy of the proposed algorithm is improved by about 5–7% compared with WMM, improved by about 2–4% compared with WMM-MRF, and improved by about 1–3% compared with WMM-MRFs. The results demonstrate the superiority of the RWMM-MRFt algorithm in PolSAR data classification. The proposed algorithm belongs to the conventional classification algorithm, and the classification accuracy may not be as high as that of machine learning and deep learning algorithms. However, it has simple expressions and clear physical meanings. The AN system in the proposed algorithm has only five simple neighborhoods, which may not fully represent the shape of the regional boundary. In future work, more neighborhoods could be added to the AN system to further improve the classification accuracy.