Next Article in Journal
The Challenge of Spatial Resolutions for GRACE-Based Estimates Volume Changes of Larger Man-Made Lake: The Case of China’s Three Gorges Reservoir in the Yangtze River
Previous Article in Journal
Flood Hazard Mapping and Assessment on the Angkor World Heritage Site, Cambodia
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Combining Binary and Post-Classification Change Analysis of Augmented ALOS Backscatter for Identifying Subtle Land Cover Changes

by
Dyah R. Panuju
1,2,*,
David J. Paull
1 and
Bambang H. Trisasongko
1,2
1
School of Physical, Environmental and Mathematical Sciences, UNSW Canberra, Northcott Drive, Campbell, ACT 2600, Australia
2
Department of Soil Science and Land Resource, Bogor Agricultural University, Jalan Meranti, Kampus IPB Darmaga, Bogor 16680, Indonesia
*
Author to whom correspondence should be addressed.
Remote Sens. 2019, 11(1), 100; https://doi.org/10.3390/rs11010100
Submission received: 29 November 2018 / Revised: 3 January 2019 / Accepted: 4 January 2019 / Published: 8 January 2019
(This article belongs to the Section Remote Sensing in Agriculture and Vegetation)

Abstract

:
This research aims to detect subtle changes by combining binary change analysis, the Iteratively Reweighted Multivariate Alteration Detection (IRMAD), over dual polarimetric Advanced Land Observing Satellite (ALOS) backscatter with augmented data for post-classification change analysis. The accuracy of change detection was iteratively evaluated based on thresholds composed of mean and a range constant of standard deviation. Four datasets were examined for post-classification change analysis including the dual polarimetric backscatter as the benchmark and its augmented data with indices, entropy alpha decomposition and selected texture features. Variable importance was then evaluated to build a best subset model employing seven classifiers, including Bagged Classification and Regression Tree (CAB), Extreme Learning Machine Neural Network (ENN), Bagged Multivariate Adaptive Regression Spline (MAB), Regularised Random Forest (RFG), Original Random Forest (RFO), Support Vector Machine (SVM), and Extreme Gradient Boosting Tree (XGB). The best accuracy was 98.8%, which resulted from thresholding MAD variate-2 with constants at 1.7. The highest improvement of classification accuracy was obtained by amending the grey level co-occurrence matrix (GLCM) texture. The identification of variable importance (VI) confirmed that selected GLCM textures (mean and variance of HH or HV) were equally superior, while the contribution of index and decomposition were negligible. The best model produced similar classification accuracy at about 90% for both years 2007 and 2010. Tree-based algorithms including RFO, RFG and XGB were more robust than SVM and ENN. Subtle changes indicated by binary change analysis were somewhat hidden in post-classification analysis. Reclassification by combining all important variables and adding five classes to include subtle changes assisted by Google Earth yielded an accuracy of 82%.

Graphical Abstract

1. Introduction

Monitoring of change over conserved areas may not always identify significant differences between two observations. Subtle changes might be more relevant than abrupt changes when land uses are well-managed and there are no unexpected disturbances such as natural disasters. Vogelman explained that subtle change was a gradual change related to “within state” alteration of energy response that is commonly related to vegetation dynamic other than the normal phenological cycle [1]. The subtle change in a vegetated land cover may be a sign of vegetation damage due to diseases, insects, drought, and changes of plant community [1], or indicating the change of vegetation condition as a result of harvesting, thinning and farming practices [2]. The gradual or subtle changes are less noticeable than the abrupt change, and it has often been overlooked [3].
Ground survey is a practical effort for monitoring montane conserved areas, nonetheless, the remoteness of location and limited accessibility may challenge the process. Meanwhile, clouds limit the capability of monitoring by using optical images for areas covered with clouds, which can remain unobservable for days, weeks or months at a time [4,5,6]. In these instances, successful change detection is not guaranteed through the exclusive use of optical imagery such as Landsat or other sensors [7]. Microwave remote sensing such as synthetic aperture radar (SAR) is an option for monitoring remote areas that are severely affected by clouds. The use of SAR data for monitoring tropical rain forests has delivered invaluable information by tackling the challenge of poor accessibility and persistent cloud cover [8]. Change detection (CD) of SAR data has been implemented over urban [9,10], agricultural [11] or forested [12,13] areas.
The dynamics of change is essential information provided by change detection techniques. A recent development in binary change analysis, i.e., Iteratively Reweighted Multivariate Alteration Detection (IRMAD), is an alternative technique to identify change while performing relative radiometric calibration. IRMAD has been applied to analyse optical images, including Landsat [14], Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) [15] and hyperspectral images [16]. IRMAD was reportedly able to eliminate strong changes to improve the detection of unchanged background or subtle changes [17]. The employment of IRMAD to analyse radar data appears lacking, thus applying IRMAD on SAR backscatter will provide understanding regarding its potential. Moreover, IRMAD produces binary change maps consisting of changed and unchanged areas, which may not be adequate for managing land cover/uses. Combining IRMAD with other techniques would, therefore, complete the information of change processes.
“From-to” information is typical data derived from post-classification comparison that cannot be obtained from bi-temporal change analysis [18]. Post-classification change analysis depends on the classification process that relies on involving spectral layers. The number of commonly used predictors for polarimetric SAR images is one, two or three for single, dual or fully polarized images, respectively. A limited number of predictors in single or dual polarization radar data may limit the accuracy of classification from data instances, while error detection is propagated by each classification [19]. Amending synthetic data may enrich information and in turn improve the accuracy of change detection by using radar data.
An appropriate use of a classifier may optimize the accuracy of classification. It was demonstrated that tuning parameters of potential classifiers improved the accuracy of classification [20]. Many investigations have reported that no particular classifiers have been consistently superior when applied in various study cases such as contrasting the result of maximum likelihood classifiers compared to decision tree and support vector machines, as demonstrated by Huang et al. [21] and Pal et al. [22]. Moreover, employing multitemporal datasets supported with knowledge about best possible times may improve the likelihood of producing high accuracy of classification. The strategy is essential when optical images that potentially suffer from atmospheric disturbances are employed, which is not the case of SAR [23].
This research combined the benefit of SAR data that are less sensitive to atmospheric disturbances for binary change analysis and enriched the data with synthetic layers for post-classification analysis to provide “from-to” information for identifying the dynamics of vegetative cover at Halimun Salak National Park. The popular bi-temporal technique, IRMAD followed with thresholding was employed for highlighting changed/unchanged locations. By comparing seven latest pixel-wise classifiers, this research explored the effect of data amendment to ALOS L-band dual polarimetric backscatter and evaluated potential algorithms to generate the best possible accuracy for post-classification change analysis to identify subtle changes.

2. Iteratively Reweighted Multivariate Alteration Detection and Data Amendment of Synthetic Aperture Radar for Classification

2.1. Multivariate Alteration Detection (MAD) and The Iteratively Reweighted Multivariate Alteration Detection

Multivariate Alteration Detection (MAD) is a popular binary change detection technique based on Hotteling’s canonical correlation, by transforming two sets of vector images of the same place, acquired at two time points, M = (M1, …, Mp) and N = (N1, …, Np) into new images Q = aT M and R = bT N. From p spectral bands in the original bi-temporal images, two images can be generated where each new image is composed of p MAD-variates. As suggested by Nielsen et al. [24], vectors aT and bT are selected simultaneously by maximising the variance of the difference between Q and R subject to the constraint that the variance of Q and R are both equal to 1.
Max[Var(Q-R)], subject to Var(Q) = Var(R) = 1,
Var(Q-R) = Var(Q) + Var(R) − 2Cov(Q,R)
By maximising the variance of Q and R, MAD generates variance of [Q-R] to comply with canonical correlation analysis. Hence, maximising the difference between Q and R can be achieved by minimising non-negative correlation (ρ):
ρ = Cov ( Q , R ) Var ( Q ) Var ( R )
The variance of the difference then can be written as Var(Q-R) = 2(1 − ρ), where ρ is the correlation between Q and R. The MAD transformation as the change result was defined by Reference [24] as:
[ M N ] [ a p T M b p T N a 1 T M b 1 T N ]
The MAD variates are ordered by descending variance. MAD variate-1 is the difference between the highest-order of canonical variates while MAD variate-2 is the difference between the second-highest-order of canonical variates [14]. Standardising values by computing correlation instead of covariance helps to cope with differing scales that result from different gain factors and atmospheric conditions. The improvement of MAD by iterative processing, IRMAD, is performed to improve separation among classes by adding more weight to no-change probabilities during the process [14].
Compared to other binary or post-classification change analysis, MAD does not strictly demand pre-processing to produce an accurate change map. This advantage is rooted from a process that selects pseudo-invariant features (PIF) or invariant pixels as non-change samples for relative normalization of images [25]. Pseudo-invariant features are usually selected from features such as buildings or constructions that are relatively constant in their reflectance over acquisitions, albeit with minor effects of seasonal conditions.
Iteratively Reweighted Multivariate Alteration Detection is an iterated version of MAD that excludes pixels of change detected at the preceding iteration for the next calculation [25]. The iteration is intended to reduce the adverse effect of change occurrence in the feature spaces by assigning a higher weight to unchanged pixels [25,26]. The weight is a probability of non-changed pixels, modeled by using the Chi-square distribution [17]. The iteration of MAD processing would stop when criteria, such as a lack of change in the canonical correlation, are met [25]. The IRMAD performed better than single MAD transformation in analyzing multitemporal images with a dominant change [25] and provided accurate yet more concentrated change indication by suppressing salt-and-pepper effects [25,27]. According to Marpu et al. [17], the limitation of IRMAD includes the standardization of individual MAD to define non change distribution and the large proportion of change pixels that leads to incorrect projection of MAD variates.
Multivariate Alteration Detection or its modified version is frequently used for change detection employing optical images. Examples include the application of IRMAD to characterize decadal change processes by employing Landsat images [28]; the employment of MAD to pre-process ASTER preceding classification [15]; and the use of IRMAD for reducing false detection of change with Hyperion [16]. Pre-processed or pre-transformed spectral imagery from optical sensors have been employed. A comparison of the performance of IRMAD using original surface reflectance and tasselled cap transformation of Landsat 5 images demonstrated comparable change results [29]. Nonetheless, we are not aware of change analysis that employed IRMAD using SAR data.

2.2. The Amendment of Synthetic Aperture Radar Data with Synthetic Layers for Improving Classification Accuracy

Another alternative of change detection is through comparative-based analysis that analyses change from post-classified images to provide “from-to” information on pre- and post-class labels, enriching the information of binary change detection [30]. The quality of post-classification detection relies on the accuracy of each classification process of each data pair. The main challenge is to obtain an adequate accuracy such as the standard accuracy for thematic maps implemented by the US National Park Service at 80% [31].
The use of radar data to provide “from-to” change information for investigating land cover dynamics should deal with a limited number of data layers contributing to class separation. SAR data may have single, dual or triple layers composed of horizontal, vertical and cross-polarisation modes. More layers, i.e., four are possible from bi-static images when the transmitter separates signals from the receiver and the reciprocity of cross-polarisation does not dictate the result [32]; nonetheless, the image mode is yet to be systematically produced for monitoring. Data fusion is an alternative technique that combines two or more data sources that will provide more layers for input to the classification process [33]. Fusing SAR and optical images has been used to study earthquake damage detection [34] or to assess tsunami damage [35]. Hence, data fusion necessitates additional data sources and adjustment techniques to integrate geometric, radiometric and other differing properties.
An alternative strategy to add variables in SAR data is by amending synthetic layers. Since the amendment is derived from the data itself, geometric correction is irrelevant. The synthetic data can be generated by simple algebra such as differencing, ratioing, or through advanced techniques such as decomposition and texture analyses. Adding synthetic data to original layers has been implemented in several investigations, for instance to monitor deforestation and land use at the Samarinda rain forest [36], to detect Phragmites [37], and to identify smallholder oil palm plantation [38].
An index is synthetic data commonly utilized to identify or differentiate features, either for optical or radar imagery. A popular index in optical images is the Normalized Difference Vegetation Index (NDVI) [39,40], while for SAR data, a comparable vegetation index is the Radar Vegetation Index (RVI). Radar Vegetation Index may be derived from dual polarimetric or fully polarimetric SAR images. The following equation is used to derive RVI from fully polarimetric SAR [41]:
RVI = 8 σ h v 0 σ h h 0 + σ v v 0 + 2 σ h v 0
where σ h h 0 refers to horizontal backscatter values, σ v v 0 denotes vertical backscatter values and σ h v 0 is for cross-polarization. For dual polarimetric images, RVI can be calculated by using this formula [42]:
RVI h h = 4 σ h v 0 σ h h 0 + σ h v 0
RVI h h symbolizes RVI for horizontal polarization backscatter and the RVI for vertical polarization image can be obtained using a similar equation by changing the horizontal backscatter component with the vertical one (RVIvv ≈ RVIhh). The use of RVI to amend SAR data in assisting land cover classification can be found in several studies, which have mostly employed fully polarimetric backscatter, such as Ling et al. [43] or Avtar et al. [44].
Synthetic data can also be formed through decomposition techniques, such as Entropy–Alpha decomposition introduced by [45]. This decomposition generates three layers including entropy, alpha and anisotropy. A detailed explanation about the Entropy–Alpha decomposition is available in Cloude and Pottier [45]. The use of entropy and alpha to improve classification results has been implemented by Rodriguez et al. [46] and Qi et al. [47].
Another popular synthetic layer is texture, which can be derived either from optical or SAR data. Grey Level Co-occurrence Matrix and Generalized Co-occurrence Matrix (GCM) are among the available techniques to derive texture [48,49]. Many features can be generated in the texture analysis including angular second moment (ASM), contrast, dissimilarity, energy, entropy, GLCM correlation, GLCM mean, GLCM variance, homogeneity, and maximum. Readers should refer to Haralick et al. [48] for details. The use of texture for amending SAR data in order to improve classification results have been implemented by several investigators [50,51,52]. However, it has been demonstrated that only a few texture features contributed significantly to the improvement of accuracy.

2.3. Pixel-Based Techniques for Land Cover Classification

Lu and Weng [53] highlighted the importance of employing suitable techniques as a prerequisite for successful land cover classification. The algorithms of classification can be grouped into pixel-based, sub pixel-based, per field-based, contextual-based, and combinative ones [53]. Pixel-based classifiers have been developed and implemented to classify land cover/use employing optical and radar imageries. Notable pixel-based algorithms include the maximum likelihood classifier (MLC), SVM and decision tree (DT), in which DT does not need statistical assumptions [54]. Support Vector Machine and NN are learning techniques having several advantages including that NN is a distribution free analysis that easily combines multisource data and is claimed to be free from accumulative errors and less affected by atmospheric conditions, illumination and surface moisture [55,56]. Both SVM and NN are superior when dealing with spectral mixture cases [57] while also being responsive to tuning parameters [20]. Meanwhile, SVM has been shown to outweigh MLC and NN when handling small training samples [58]. In contrast, NN has outperformed MLC and SVM for differentiating crop areas [59].
Improving the accuracy of classification is targeted through various strategies. Ensemble learning is an approach for improving the accuracy of classification through modifying the selection of training samples. Various strategies for training selection have been proposed, for instance bagging, a technique to construct multiple versions of a predictor for producing an aggregate predictor [60], and boosting, a general method to improve prediction capability of an algorithm by reducing error from a weak algorithm through reweighting the samples [61]. Implementing the strategy in the real classification of remote sensing images has improved the accuracy of classification by 3–6% [22] or reduced misclassification rates by 20–50% [62]. Various algorithms have been developed for implementing bagging or boosting strategies, including CAB, MAB and RF either in their original version (RFO) or modified one (RFG), and Extreme Gradient Boosting Tree (XGB). Exploring these newly developed algorithms is necessary for better understanding their potential.
The Bagged Classification and Regression Tree (CAB) originated from the Classification and Regression Tree (CART) introduced by Breiman, that produces monotone outcomes by calculating probability of classes [63]. The classification and regression technique can classify categorical or continuous data through recursive binary partitioning [64]. The main disadvantages of CART and CAB are sensitivity to noise and data size and they tend to overfit [65]. Extreme learning is an algorithm to regularize learning by minimizing training error and weights that can be implemented in various classifiers such as SVM or neural networks [66].
Extreme Learning Machine Neural Networks is extreme learning combined in neural networks. The classifier inherits the benefit of neural networks, i.e., the ability to tease apart convoluted connections of large datasets, suitable for parametric or non-parametric variables [67], the capability to reduce false alarms [68], and overcoming spectral mixtures in moderate spatial resolution images [69]. However, the technique also contains the drawbacks of neural networks, i.e., generating complexity (hidden nodes) that is problematic to explain, and a likelihood of overfitting [70].
The Bagged MARS roots from Multivariate Adaptive Regression Spline (MARS), a non-parametric regression combining classical regression and splines approximation for predicting unknown function simultaneously [63,71]. MARS outweighed CART in predicting gullies by generating a smoother estimation [64]. However, the prediction of MARS was highly affected by the local nature of the dataset [65].
Random Forest is one of the popular ensemble decision tree classifiers in remote sensing. Many researchers have demonstrated that the performance of RF is better than traditional single tree learning [72,73,74]. The advantages of RF include being less sensitive to overtraining and noises, the ability to generate variable importance for eliminating less important features in order to reduce dimension and computing time, as well as being unresponsive to overtraining [72,75]. Nonetheless, RF tended to be insensitive to mislabeled training [76], delicate to spatial autocorrelation [77,78], and failed to deal with imbalance training [78].
Extreme gradient boosting tree is the implementation of Friedman’s concept on a gradient boosting machine which generates constant approximations with finer granularity [79]. A gradient boosting tree has the potential to optimize processing when the access to memory is insufficient for storing a big dataset, hence being sensitive to data modification [79,80].

3. Materials and Methods

3.1. Site

Taman Nasional Gunung Halimun Salak (TNGHS) is the last remaining rain forest in Java island [81], situated approximately 65 km south of Jakarta, the capital city of Indonesia (Figure 1). The park is the last montane rain forest on Java Island, hosting several endangered species such as the Javan leopard (Panthera pardus melas), the Javan eagle (Spizaetus bartelsi) and the Javan gibbon (Hylobates moloch). This park is located in rugged terrain and has not experienced a major disturbance in the past decades, hence the claim should be assessed. Previous research reported that the dynamic of land cover at the surrounding of protected areas like parks may compromise the ecological function of the areas, including the conservation of protected species as well as downstream water provision [82]. Land cover dynamics of this critically important area and its surroundings threaten numerous ecosystem services. Monitoring of this site and its surroundings is needed to provide information on how resilient the park’s natural values are.

3.2. Datasets

In this research, Advanced Land Observing Satellite (ALOS) Fine Beam Dual (FBD) mode of Phased Array type L-band Synthetic Aperture Radar (PALSAR) was used to identify changes between paired datasets and to investigate the effect of synthetic data amendment on improving classification accuracy. ALOS FBD Level 1.1 comprised of horizontal polarisation (HH) and cross-polarisation (HV) was provided by Japan Aerospace Exploration Agency (JAXA) through the 6th Research Announcement (RA-6) for ALOS-2. The images were provided in slant range geometry (single look complex products) with ascending mode, swath width of 70 km and ground resolution about 19 m × 10 m [83]. A pair of images acquired on 20 August 2007 and 28 August 2010 were employed for change detection analysis. Other data were the Shuttle Radar Topography Mission (SRTM) Digital Elevation Model (DEM) 1 arc second and ancillary data, including forest and plantation mapping from the regional state-forest company (PT Perhutani), the map of TNGHS from the Ciliwung river basin organisation (Balai Besar Daerah Aliran Sungai Ciliwung), and historical Google Earth images to assist terrain correction and sampling selection.

3.3. Procedure

3.3.1. Preparing Synthetic Aperture Radar datasets

Figure 2 describes the workflow of this research. Prior to change detection, all images were calibrated, terrain corrected, de-speckled, and converted to dB (sigma0). Single Look Complex (SLC) of ALOS PALSAR were internally calibrated by JAXA prior distribution. SLC format was used to derive polarimetric decomposition features from the Cloude–Pottier theorem. Further calibration of SLC was performed based on Shimada et al. [84] to convert a complex number into a conventionally-used sigma nought (sigma0) in decibels from the SLC data. Terrain correction was assisted with SRTM-DEM 1 arc second and resampled with bi-linear interpolation to result in spatial resolution at 30 m [85]. Image speckles were filtered by using Gamma Map with a window size 5x5 following Reference [86]. Gamma Map filtering was known as simple yet time-efficient [11].

3.3.2. An Iteratively Reweighted Multivariate Alteration Detection and Thresholding to Determine Change and Unchanged

The bi-temporal analysis was performed onto dual polarisation of the backscatter coefficient for the data pair comprising of the years 2007 and 2010, by using IRMAD. The IRMAD processing produced three layers including MAD variate-1, MAD variate-2 and chi-square. The MAD variates are the transformed version of dual polarisation ALOS PALSAR, while chi-square layer denotes a transformed version of both MAD variates into a single layer that is generated based on chi-square distribution [87]. These layers were examined visually to identify whether changed and unchanged pixels were distinctively indicated. Visual assessment was assisted with higher resolution optical image products from freely accessible high-resolution images provided by Google Earth. For each extent, at least two images were examined, captured at times that were proximate to the first and second PALSAR acquisition dates.
The delineation between changed and unchanged was determined based on the thresholding of the selected MAD variate in the visual examination. Statistical parameters, mean (µ) and standard deviation (σ) were then derived from the selected MAD variate for thresholding [88]. Various ranges of statistical parameters have been attempted to define the optimum threshold, such as between µ ± 0.5σ and 1σ [89], µ ± 0.1σ and 2.0σ [88], or µ ± 0.5σ and 3.0σ [90,91]. In this research, a range of constants between 0.1 and 2.0 were selected and iterated following the approach of Fung and Mas [30]. The accuracy of the binary change map was assessed by taking 60 samples of positive change, 60 samples of negative change and 120 samples of unchanged for a 3x3 window size or 2060 pixels in total. The iterated constants and overall accuracy were graphed for assessing the improvement or declining trend within the range. The constants that generated the biggest overall accuracy were then used as the threshold for change–unchanged delineation to produce binary change data. They were then utilised for guiding the selection of samples for post-classification detection.

3.3.3. The Assessment of Synthetic Data Amendment of Dual Polarisation Synthetic Aperture Radar for Land cover Classification

Classification of years 2007 and 2010 was performed independently by employing several dataset combinations and supervised classification techniques. A set of combinations was tested to yield the highest possible accuracy, including horizontal co-polarisation, cross-polarisation and synthetic data. A summary of data combinations to evaluate the efficacy of each synthetic data type is presented in Table 1, showing the synthetic data layers used to improve accuracy. Synthetic data included a set of indices, entropy alpha decomposition and the texture analysis.
The set of indices comprised the differencing of HH-HV, ratioing of HH/HV, and dual polarimetric radar vegetation index (RVI). Dual polarisation data were the control treatment to evaluate the robustness of the synthetic data amendment. RVI for dual polarization data was determined following Reference [42]. The entropy-alpha decomposition was calculated based on Cloude and Pottier [92]. Considering the contribution of texture components for classification as suggested by Yayusman and Nagasawa [38], only selected textures from GLCM [48], i.e., mean and variance, were utilised for post-classification.
The imagery was classified into seven classes, i.e., forest, rubber, tea, oil palm, crop, built-up, and waterbody. Figure 3 shows pictures of six of these seven classification classes. The class ‘waterbodies’ was specifically taken from the ocean since the area has no inland water. The selection of training samples for classification was guided by Google Earth, the map of the regional state-forest company and the national park map. The training samples were taken from locations having consistent cover within two observations. The samples were divided into 75% for the developing model and 25% for accuracy assessment. Seven supervised pixel-based classifications including CAB, MAB, RFO, RFG, XGB, ENN, and SVM were employed. Selected classifiers were coded in R statistical software employing caret and raster packages [93,94], following the suggestion of Reference [20]. The classifiers were run by implementing default parameters of the R packages as summarised in Table 2.

3.3.4. Selecting the Best Model for Post Classification Change Analysis

Further investigation employed all synthetic layers in order to classify land cover types and to evaluate the importance of variables. The variables (layers) and their acronyms are summarised in Table 3. Variable importance (VI) is a percentage of the variable’s contribution calculated from error generation during the permutation of a variable using its out of bag data [68]. The measure assisted in model reduction while it provided a first impression of controlling variables [83,84]. In this research, VI was used to select variables for modelling the best subset. When adding a variable did not improve the accuracy, then the augmentation of the layer was stopped and the combination before augmentation was considered as the best subset.
A further step was assessing the accuracy of classification of the best subset and all data combinations. The highest accuracy from either all variables or the best subset was used for the post-classification change evaluation. To reduce the salt and pepper effect, a raster sieve was used. If the change that was indicated in binary analysis was unidentified, then a modified classification was performed by taking samples from areas identified as changed by IRMAD and consequently adding more classes guided by historical images of Google Earth. The reclassification employed the best method composing of important variables.

4. Results

4.1. The Result of Iteratively Reweighted Multivariate Alteration Detection and The Determination of Changed and Unchanged

The binary change identification using IRMAD is presented in Figure 4. The figure shows MAD variate-1, MAD variate-2, and Chi-square layers and an RGB layer composing those three layers. From the gray-scale layers, MAD variate-2 displayed the clearest indication of change in the site while chi-square layer could not indicate changes as MAD variate-1 and MAD variate-2 did. Two labels were placed to show areas with indication of changes. MAD variate-1 yielded a similar indication for location-1, but it failed to indicate any changes in location-2. The RGB layer shows a differing tone that may signpost different change types.
A qualitative assessment through visual checking was further performed guided by the indication of change as presented in the figure. The visual check was done to validate the indication from the analysis. The visual check was made using Google Earth in the proximate date (see example in Figure 5), which guided the interpretation of different tones indicated in the binary change maps of differencing cross-polarisation data or filtered MAD variate-2 of IRMAD processing. As indicated by selected features (see a, b, c, d on Figure 5), there was a change in the sample areas. A dark tone indicated a denser vegetation from 2007 to 2010, while a light tone signified the inverse condition. A darker tone indicated a change from fields prepared for cultivation to land fully-covered with oil palm (a) or from juvenile to maturing oil palm plantation (c). The light tone signified the inverse condition from vegetated crop areas to semi-bare ones as indicated in (b) and (d).
Considering that MAD variate-2 provided the clearest indication of changed and unchanged, the thresholding was then determined based on MAD variate-2. The mean value of MAD variate-2 at 142.8 and the standard deviation at 52.2 were multiplied with the selected range of constant for iteration. The overall accuracy, false detection and missed detection from the iteration are presented in Figure 6.
Figure 6 demonstrates the increasing accuracy of iterating the constant of MAD variate-2 from a value of 0.1 to 2.0. The accuracy increased up to 1.7 and declined after 1.9. The best accuracy for MAD variate-2 was 98.8% obtained by constants between 1.7 and 1.9. The curve of false and missed detections of MAD showed a contrasting pattern for the range of constants. Generated false detection started from 28% and missed detection was from 12%, and the rates were suppressed to zero at constants of 1.7 and 1.8. An optimum constant was obtained at 1.7 to be used for delineating changed from unchanged.

4.2. The Improvement of Classification Accuracy by Synthetic Layers’ Amendment

Figure 7 shows the response of classifiers in accuracy generation to the amendment of synthetic layers. Two observations, year 2007 and 2010, portrayed comparable descriptions of the response. It seems that amending dual polarization backscatter with indices did not yield a significant improvement to the accuracy, while some of the classifiers responded slightly to the amendment of the entropy-alpha decomposition. Some classifiers generated a slightly lower accuracy (−0.5%) by index amendment compared to the standard data, including MAB, SVM and CAB. Meanwhile, texture amendment substantially increased the accuracy by 20% in two classifiers, i.e., RFO, RFG, while two others CAB and XGB improved it by about 15% and 16%, respectively. SVM followed with MAB generated slightly lower improvement with texture amendment by around 13% and 10%, respectively. Extreme Learning Neural Network yielded the least accurate results by 7% with texture amendment. In general, texture amendment substantially improved the accuracy of classification and produced the highest increase to the accuracy of classification, up to 20%.

4.3. The Identification of Variable Importance for Classification

Figure 8 demonstrates the comparison of variable importance for five techniques. Only tree-based techniques generated high variable importance, while similar capabilities from ENN and SVM were not evident. The figure indicates a different order of the importance of variables by different classifiers. A similar result was demonstrated by random forest techniques either for the original version or regularized ones.
Several classifiers did not respond to the addition of some variables, such as MAB that identified the order of four variables similar to RFO and RFG but did not consider other variables that shared an importance weight with RFO and RFG.
Among variables, the texture-mean either derived from HV or HH was consistently considered as the highest important variable followed by texture-variance. Backscatter intensity either horizontal or cross-polarization, was valued as an important variable in four classifiers, except MAB. Seven variables that consistently shared significant contributions following four classifiers included texture-mean HV, texture-mean HH, texture-variance HH, texture-variance HV, Sigma0 HH, Sigma0 HV, and index-differencing.
Figure 9 shows the increment of accuracy when each variable was added consecutively following the descending order of VI resulting from RFO. RFO was used to determine variable importance since it generated the highest accuracy when all variables were employed. It appears that variables with contribution less than 10% brought a contra productive effect to accuracy generation by decreasing the accuracy when they were added. In this research, six appears to be an adequate number of variables for land cover classification with seven targets to develop the best subset model.
Unknown error occurred during the classification process employing a different number of variables on two classifiers, i.e., ENN and MAB. The error of ENN processing took place during the process of data composition with 5 or 7 to 11 variables, while the error of MAB occurred during the process of data composition with 9 to 11 variables.

4.4. Comparing the Accuracy from All Data Layers and the Best Subset

Table 4 describes the accuracy of classification by employing all variables and from the best subset model defined with VI assistance. It appears the best subset model generated the same or higher accuracy than the model developed from all variables except for MAB and ENN. For MAB, the best subset produced slightly lower accuracies than when employing all variables.
In general, tree-based classifiers including RFO, RFG, CAB and XGB generated a greater accuracy than SVM and ENN. The table demonstrates that RFG and RFO were superior compared to other classifiers by producing accuracy at about 90% for the years 2007 and 2010. The map of classification generated from the best subset model using RFG was then employed for the years 2007 and 2010.

4.5. From-To Information of Change

Post classification detection employing the best model could not identify the change indicated in IRMAD, and thus, the subtle change was hidden. Reclassifying images by combining important variables employing the best method for data pairs and adding more classes related to subtle change were performed for 12 classes. Five more classes were added to integrate types of subtle changes as identified in IRMAD. The additional classes included previously vegetated crop areas to semi-fallow crop areas, juvenile to maturing oil palm, old to regenerated rubber plantation, juvenile to maturing rubber, and previously rubber to newly planted oil palm. The reclassification resulted in an accuracy of about 82%.
Figure 10 provides information regarding the land cover types that were unchanged between 2007 and 2010, as well as the type of change in the hotspot location as indicated by IRMAD analysis. Water was better classified amongst the classes while stable oil palm, tea and rubber were less accurate due to varying stand ages. The change from juvenile to maturing oil palm yielded high accuracy at 95% and 90%, respectively, for the producer and user accuracies. Rubber growth stages from juvenile to maturing plants or from old to regenerated ones were also well classified at 82% and 61% for producer accuracies, and 85% and 76% for user accuracies, respectively. The change of rubber into the oil palm plantation was identified with producer accuracy at 70% and user accuracy at 69%.

5. Discussion

As suggested by previous research [88,89,90,95], a threshold that generates the highest accuracy should be selected to discriminate changed and unchanged land cover from binary change detection such as IRMAD. Previous attempts on iterating constants for binary change map generation mostly employed Landsat or other optical images [30,96]. This research demonstrated the value of dual polarimetric ALOS backscatter for IRMAD analysis and indicated that the highest accuracy was achieved within a constant range of 0.1 to 2.0. The visual check of binary change against Google Earth imagery may be directly utilized to validate the change being detected by IRMAD. Nonetheless, freely accessible optical imagery with higher spatial resolution may not always be available due to cloud cover or limited recurring observations.
Post-classification change detection may complete the information of change by providing “from-to” information. However, error detection is proliferated by each classification. The challenge to have reliable post-classification change results is to obtain an adequate accuracy rate. Following the standard acceptable accuracy for thematic mapping of the National Park Service, US Department of the Interior, the minimum expected accuracy is 80% [31]. Limited bands of dual polarization SAR images for generating the “from-to” information may result in low accuracy of classification, thus enriching the dual polarization images is required to obtain adequate accuracy. Synthetic data amendment has been an option to enrich available data and in turn achieve greater accuracy. By amending original layers with several types of synthetic data, it appears that texture has the greatest potential to result in greater accuracy of classification when employing SAR images. Simple indices constructed from differencing and ratioing layers appears ineffective in increasing the accuracy. Meanwhile, decomposition seems better than indices, but not as robust as texture to be implemented in backscatter data for investigating a tropical montane environment. Pseudo-cross-variogram, a multi-temporal texture feature derived from a geostatistical approach, was reported to yield greater accuracy compared to GLCM textures [97]. Utilizing this feature might further improve the accuracy of classification employing SAR data.
With many optional classifiers asserting their advantages for land cover classification, researchers need to selectively employ the most suitable one for their cases. The growing use of ensemble classifiers demonstrates the potential. This research demonstrates that decision tree classifiers perform well in generating accuracy of classification. Random forest, either in the original or regularized mode, or extreme gradient boosting tree classifiers generated high accuracy at about 90%. Nonetheless, the neural network seems less responsive to data amendment and synthetic data when implemented in this research. The unknown error which occurred in the classification process employing ENN and MAB requires in-depth exploration in the future. The comparison of accuracies suggests that ensemble trees such as RF and XGB outweighs other classifiers and would likely produce adequate classification accuracy.
The use of variable importance appears effective to assist in generating high accuracy of classification. Meanwhile, the order of variable importance differs by classifiers, which disproves the previous claim of Reference [98] about a likely similar order of variable importance to be generated across classifiers albeit making their different contributions. Employing all available layers would not always yield the highest accuracy. Variable selection is therefore essential to result in optimum accuracy and may be more relevant when employing hyperspectral images with hundreds of layers [99].
The strategy of taking samples from consistent land uses was practical for land cover classification; however, it was not adequate to allow for the identification of subtle change. As subtle changes were less noticeable, it is difficult to define sub classes at the initial classification with limited information of the sites. Iteratively Reweighted Multivariate Alteration Detection of dual polarization ALOS PALSAR was successful to indicate subtle changes related to farming practices like crop staging and changing vegetation type. This technique serves as an alternative for traditional change analysis employing SAR data such as differencing of log intensity or ratioing [11]. As details of the growing stages of plants could be identified as change in the IRMAD, a more detailed class definition is required. The changes of semi-bare land to denser-vegetated paddy fields, of juvenile to mature oil palm plantation as well as of rubber to juvenile oil palm plantation were unobserved by using seven general land classes, even if the accuracy was about 90%. The possible reason includes the changing surface moisture at the same land cover since the test site is situated in humid tropics and the L-band SAR has some penetration into foliar canopy. Soil background remains an important issue when interaction with low biomass is involved [86]. Reclassification by identification of sub classes at the hotspot of change as indicated in IRMAD was required to better identify subtle changes related to the stage of vegetation growth and vegetative use of change. Nonetheless, adding sub classes to the classification may reduce accuracy. Taking samples on the hotspot for change identified in IRMAD appears effective in improving classification to include the subtle change. Google Earth assisted for better interpretation of change particularly to validate subtle changes identified in IRMAD. Identifying temporal patterns of spectra or indices may complement the information of the change process.

6. Conclusions

Employing microwave satellite images is an option for monitoring areas with poor accessibility and areas that are severely affected by persistent cloud cover. The second MAD variate produced from IRMAD processing filtered with Gamma Map successfully indicated the location of change. The employment of post-classification change analysis informs “from-to” change. Dual layers of ALOS FBD images may limit the capabilities for classification of complex classes, hence injecting synthetic images enriches information to improve the accuracy. The increment of accuracy by data injection was demonstrated for instance by employing random forest. The greatest accuracy was yielded by the amendment of selected texture features comprising of GLCM layers, i.e., mean and variance. Tree-based methods including RFO, RFG and XGB appear superior in generating the accuracy of classification compared to the support vector machine and neural network. The identification of variable importance assisted in defining the best subset model for reclassification to allow the identification of change indicated in IRMAD analysis. However, subtle change related to growth stages could not be identified with general land cover classes. The different tone resulting from differencing and IRMAD indicated subtle changes due to the change of vegetation type or growth phases such as from juvenile to maturing oil palm or semi-bare to vegetated crop fields. Combining the important variables to reclassify the change process may be adequate to improve accuracy and hence, to emphasize their utility in the detection of subtle changes. Ancillary data from respectful institutions and Google Earth assists in the interpretation of binary change results. Exploring the temporal pattern of changes would likely enhance the understanding of the gradual processes.

Author Contributions

D.R.P. conceptualized and designed the experiment; D.R.P. performed analysis and validation with the assistance of B.H.T.; D.R.P. wrote the original draft with further contribution of D.J.P. and B.H.T.; D.J.P. assisted by commenting on the manuscript and providing editorial oversight.

Funding

This research was funded by Japan Aerospace Exploration Agency (JAXA), grant number RA6-3040; and the APC was funded by Australia Awards Scholarship.

Acknowledgments

The authors would like to thank JAXA for data provision through the RA6 campaign and Australia Awards scholarship for the opportunity pursuing this PhD research program. We are grateful to Amy Griffin for her support and two anonymous reviewers for their constructive comments leading to a much improved manuscript.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Vogelmann, J.E.; Xian, G.; Homer, C.; Tolk, B. Monitoring gradual ecosystem change using Landsat time series analyses: Case studies in selected forest and rangeland ecosystems. Remote Sens. Environ. 2012, 122, 92–105. [Google Scholar] [CrossRef]
  2. Kayastha, N.; Thomas, V.; Galbraith, J.; Banskota, A. Monitoring wetland change using inter-annual landsat time-series data. Wetlands 2012, 32, 1149–1162. [Google Scholar] [CrossRef]
  3. Vogelmann, J.E.; Tolk, B.; Zhu, Z. Monitoring forest changes in the southwestern United States using multitemporal Landsat data. Remote Sens. Environ. 2009, 113, 1739–1748. [Google Scholar] [CrossRef]
  4. Townshend, J.R.G.; Justice, C.O. Analysis of the dynamics of African vegetation using the normalized difference vegetation index. Int. J. Remote Sens. 1986, 7, 1435–1445. [Google Scholar] [CrossRef]
  5. Jones, M.O.; Kimball, J.S.; Jones, L.A.; McDonald, K.C. Satellite passive microwave detection of North America start of season. Remote Sens. Environ. 2012, 123, 324–333. [Google Scholar] [CrossRef]
  6. Wang, Y.; Zhao, F.; Cheng, L.; Yang, K. Framework for monitoring the conversion of cultivated land to construction land using SAR image time series. Remote Sens. Lett. 2015, 6, 794–803. [Google Scholar] [CrossRef]
  7. Coppin, P.R.; Bauer, M.E. Processing of Multitemporal Landsat TM Imagery to Optimize Extraction of Forest Cover Change Features. IEEE Trans. Geosci. Remote Sens. 1994, 32, 918–927. [Google Scholar] [CrossRef]
  8. Bijker, W. Radar for Rainforest—A Monitoring System for Land Cover Change in the Colombian Amazon; European Space Agency, (Special Publication) ESA SP: Paris, France, 1997; pp. 53–60. [Google Scholar]
  9. Jin, Y.Q.; Chen, F.; Luo, L. Automatic detection of change direction of multi-temporal ERS-2 SAR images using two-threshold EM and MRF algorithms. Imaging Sci. J. 2004, 52, 234–241. [Google Scholar] [CrossRef]
  10. Chi, H.; Sun, G.; Ling, F. Urban dynamic change detection in southeastern China based on interferometric SAR. In Proceedings of the 2009 IEEE International Geoscience and Remote Sensing Symposium, Cape Town, South Africa , 12–17 July 2009; pp. III-432–III-435. [Google Scholar]
  11. Moser, G.; Serpico, S.B. Generalized minimum-error thresholding for unsupervised change detection from SAR amplitude imagery. IEEE Trans. Geosci. Remote Sens. 2006, 44, 2972–2982. [Google Scholar] [CrossRef] [Green Version]
  12. Grover, K.; Quegan, S.; Da Costa Freitas, C. Quantitative estimation of tropical forest cover by SAR. IEEE Trans. Geosci. Remote Sens. 1999, 37, 479–490. [Google Scholar] [CrossRef]
  13. Whittle, M.; Quegan, S.; Uryu, Y.; Stuewe, M.; Yulianto, K. Detection of tropical deforestation using ALOS-PALSAR: A Sumatran case study. Remote Sens. Environ. 2012, 124, 83–98. [Google Scholar] [CrossRef]
  14. Nielsen, A.A. The regularized iteratively reweighted MAD method for change detection in multi- and hyperspectral data. IEEE Trans. Image Process. 2007, 16, 463–478. [Google Scholar] [CrossRef]
  15. Crocetto, N.; Tarantino, E. A class-oriented strategy for features extraction from multidate ASTER imagery. Remote Sens. 2009, 1, 1171–1189. [Google Scholar] [CrossRef]
  16. Wang, B.; Choi, S.K.; Han, Y.K.; Lee, S.K.; Choi, J.W. Application of IRMAD using synthetically fused images for change detection in hyperspectral data. Remote Sens. Lett. 2015, 6, 578–586. [Google Scholar] [CrossRef]
  17. Marpu, P.R.; Gamba, P.; Canty, M.J. Improving change detection results of IRMAD by eliminating strong changes. IEEE Geosci. Remote Sens. Lett. 2011, 8, 799–803. [Google Scholar] [CrossRef]
  18. Lu, D.; Batistella, M.; Moran, E. Multitemporal spectral mixture analysis for Amazonian land-cover change detection. Can. J. Remote Sens. 2004, 30, 87–100. [Google Scholar] [CrossRef]
  19. Macleod, R.D.; Congalton, R.G. A quantitative comparison of change-detection algorithms for monitoring eelgrass from remotely sensed data. Photogramm. Eng. Remote Sens. 1998, 64, 207–216. [Google Scholar]
  20. Trisasongko, B.H.; Panuju, D.R.; Paull, D.J.; Jia, X.; Griffin, A.L. Comparing six pixel-wise classifiers for tropical rural land cover mapping using four forms of fully polarimetric SAR data. Int. J. Remote Sens. 2017, 38, 3274–3293. [Google Scholar] [CrossRef]
  21. Huang, C.; Davis, L.S.; Townshend, J.R.G. An assessment of support vector machines for land cover classification. Int. J. Remote Sens. 2002, 23, 725–749. [Google Scholar] [CrossRef]
  22. Pal, M.; Mather, P.M. An assessment of the effectiveness of decision tree methods for land cover classification. Remote Sens. Environ. 2003, 86, 554–565. [Google Scholar] [CrossRef]
  23. Nitze, I.; Barrett, B.; Cawkwell, F. Temporal optimisation of image acquisition for land cover classification with random forest and MODIS time-series. Int. J. Appl. Earth Obs. Geoinf. 2015, 34, 136–146. [Google Scholar] [CrossRef]
  24. Nielsen, A.A.; Conradsen, K.; Simpson, J.J. Multivariate alteration detection (MAD) and MAF postprocessing in multispectral, bitemporal image data: New approaches to change detection studies. Remote Sens. Environ. 1998, 64, 1–19. [Google Scholar] [CrossRef]
  25. Canty, M.J.; Nielsen, A.A. Automatic radiometric normalization of multitemporal satellite imagery with the iteratively re-weighted MAD transformation. Remote Sens. Environ. 2008, 112, 1025–1036. [Google Scholar] [CrossRef] [Green Version]
  26. Luo, H.; Liu, C.; Wu, C.; Guo, X. Urban Change Detection Based on Dempster–Shafer Theory for Multitemporal Very High-Resolution Imagery. Remote Sens. 2018, 10, 980. [Google Scholar] [CrossRef]
  27. Gong, J.Q.; Liu, Z.; Yan, Q.; Xu, Q. Object-Oriented Island Land Cover Change Detection by Iteratively Reweighted Multivariate Statistical Analysis. Mar. Geod. 2017, 40, 87–103. [Google Scholar] [CrossRef]
  28. Alaibakhsh, M.; Emelyanova, I.; Barron, O.; Mohyeddin, A.; Khiadani, M. Multivariate detection and attribution of land-cover changes in the Central Pilbara, Western Australia. Int. J. Remote Sens. 2015, 36, 2599–2621. [Google Scholar] [CrossRef]
  29. Panuju, D.R.; Paull, D.J.; Griffin, A.L.; Trisasongko, B.H. The performance of iteratively reweighted multivariate alteration detection (IRMAD) applied on untransformed and transformed Landsat surface reflectance. In Proceedings of the 38th Asian Conference on Remote Sensing—Space Applications: Touching Human Lives, New Delhi, India, 23–27 October 2017; pp. 317–325. [Google Scholar]
  30. Mas, J.F. Monitoring land-cover changes: A comparison of change detection techniques. Int. J. Remote Sens. 1999, 20, 139–152. [Google Scholar] [CrossRef]
  31. Lea, C.; Curtis, A. Thematic Accuracy Assessment Procedures: National Park Service Vegetation Inventory, Version 2.0; Natural resource report NPS/2010/NRR-2010/204; National Park Service, U.S. Department of the Interior: Fort Collins, CO, USA, 2010.
  32. Van Zyl, J.J.; Zebker, H.A.; Elachi, C. Imaging radar polarization signatures: Theory and observation. Radio Sci. 1987, 22, 529–543. [Google Scholar] [CrossRef]
  33. Pohl, C.; Van Genderen, J.L. Review article multisensor image fusion in remote sensing: Concepts, methods and applications. Int. J. Remote Sens. 1998, 19, 823–854. [Google Scholar] [CrossRef]
  34. Stramondo, S.; Bignami, C.; Chini, M.; Pierdicca, N.; Tertulliani, A. Satellite radar and optical remote sensing for earthquake damage detection: Results from different case studies. Int. J. Remote Sens. 2006, 27, 4433–4447. [Google Scholar] [CrossRef]
  35. Bovolo, F.; Bruzzone, L. A split-based approach to unsupervised change detection in large-size multitemporal images: Application to tsunami-damage assessment. IEEE Trans. Geosci. Remote Sens. 2007, 45, 1658–1670. [Google Scholar] [CrossRef]
  36. Kuntz, S.; Siegert, F. Monitoring of deforestation and land use in Indonesia with multitemporal ERS data. Int. J. Remote Sens. 1999, 20, 2835–2853. [Google Scholar] [CrossRef]
  37. Arzandeh, S.; Wang, J.F. Monitoring the change of Phragmites distribution using satellite data. Can. J. Remote Sens. 2003, 29, 24–35. [Google Scholar] [CrossRef]
  38. Yayusman, L.F.; Nagasawa, R. ALOS-Sensor data integration for the detection of smallholder’s oil palm plantation in Southern Sumatra, Indonesia. J. Jpn. Agric. Syst. Soc. 2015, 31, 27–40. [Google Scholar] [CrossRef]
  39. Myneni, R.B.; Hall, F.G. The interpretation of spectral vegetation indexes. IEEE Trans. Geosci. Remote Sens. 1995, 33, 481–486. [Google Scholar] [CrossRef]
  40. Peddle, D.R.; Brunke, S.P.; Hall, F.G. A comparison of spectral mixture analysis and ten vegetation indices for estimating boreal forest biophysical information from airborne data. Can. J. Remote Sens. 2001, 27, 627–635. [Google Scholar] [CrossRef]
  41. Kim, K.; van Zyl, J.J. A Time-Series Approach to Estimate Soil Moisture Using Polarimetric Radar Data. IEEE Trans. Geosci. Remote Sens. 2009, 47, 2519–2527. [Google Scholar] [CrossRef]
  42. Trudel, M.; Charbonneau, F.J.; Leconte, R. Using RADARSAT-2 polarimetric and ENVISAT-ASAR dual-polarization data for estimating soil moisture over agricultural fields. Can. J. Remote Sens. 2012, 38, 514–527. [Google Scholar] [CrossRef]
  43. Ling, F.; Li, Z.; Chen, E.; Wang, Q. Comparison of ALOS PALSAR RVI and Landsat TM NDVI for forest area mapping. In Proceedings of the 2009 2nd Asian-Pacific Conference on Synthetic Aperture Radar, Xi’an China, 26–30 October 2009; pp. 132–135. [Google Scholar]
  44. Avtar, R.; Sawada, H.; Takeuchi, W.; Singh, G. Characterization of forests and deforestation in Cambodia using ALOS/PALSAR observation. Geocarto Int. 2012, 27, 119–137. [Google Scholar] [CrossRef]
  45. Cloude, S.R.; Pottier, E. A review of target decomposition theorems in radar polarimetry. IEEE Trans. Geosci. Remote Sens. 1996, 34, 498–518. [Google Scholar] [CrossRef]
  46. Rodriguez, K.M.; Weissel, J.K.; Kim, Y. Classification of landslide surfaces using fully polarimetric SAR: Examples from Taiwan. In Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, Toronto, ON, Canada, 24–28 June 2002; Volume 2915, pp. 2918–2920. [Google Scholar]
  47. Qi, Z.; Yeh, A.G.O.; Li, X.; Lin, Z. Integration of Polarimetric Decomposition, Object-Oriented Image Analysis, and Decision Tree Algorithms for Land-Use and Land-Cover Classification using RADARSAT-2 Polarimetric SAR Data. Photogramm. Eng. Remote Sens. 2012, 78, 169–181. [Google Scholar] [CrossRef]
  48. Haralick, R.M.; Shanmugam, K.; Dinstein, I. Textural Features for Image Classification. IEEE Trans. Syst. Man Cybern. 1973, SMC-3, 610–621. [Google Scholar] [CrossRef]
  49. Davis, L.S.; Johns, S.A.; Aggarwal, J.K. Texture Analysis Using Generalized Co-Occurrence Matrices. IEEE Trans. Pattern Anal. Mach. Intell. 1979, PAMI-1, 251–259. [Google Scholar] [CrossRef]
  50. Nyoungui, A.N.; Tonye, E.; Akono, A. Evaluation of speckle filtering and texture analysis methods for land cover classification from SAR images. Int. J. Remote Sens. 2002, 23, 1895–1925. [Google Scholar] [CrossRef]
  51. Dekker, R.J. Texture analysis and classification of ERS SAR images for map updating of urban areas in The Netherlands. IEEE Trans. Geosci. Remote Sens. 2003, 41, 1950–1958. [Google Scholar] [CrossRef]
  52. Barreto, T.L.M.; Rosa, R.A.S.; Wimmer, C.; Moreira, J.R.; Bins, L.S.; Cappabianco, F.A.M.; Almeida, J. Classification of Detected Changes From Multitemporal High-Res Xband SAR Images: Intensity and Texture Descriptors From SuperPixels. IEEE J.-STARS 2016, 9, 5436–5448. [Google Scholar] [CrossRef]
  53. Lu, D.; Weng, Q. A survey of image classification methods and techniques for improving classification performance. Int. J. Remote Sens. 2007, 28, 823–870. [Google Scholar] [CrossRef] [Green Version]
  54. Otukei, J.R.; Blaschke, T. Land cover change assessment using decision trees, support vector machines and maximum likelihood classification algorithms. Int. J. Appl. Earth Obs. Geoinf. 2010, 12 (Suppl. 1), S27–S31. [Google Scholar] [CrossRef]
  55. Erbek, F.S.; Özkan, C.; Taberner, M. Comparison of maximum likelihood classification method with supervised artificial neural network algorithms for land use activities. Int. J. Remote Sens. 2004, 25, 1733–1748. [Google Scholar] [CrossRef]
  56. Dai, X.L.; Khorram, S. Remotely sensed change detection based on artificial neural networks. Photogramm. Eng. Remote Sens. 1999, 65, 1187–1194. [Google Scholar]
  57. Paola, J.D.; Schowengerdt, R.A. A detailed comparison of backpropagation neural network and maximum-likelihood classifiers for urban land use classification. IEEE Trans. Geosci. Remote Sens. 1995, 33, 981–996. [Google Scholar] [CrossRef]
  58. Pal, M.; Mather, P.M. Support vector machines for classification in remote sensing. Int. J. Remote Sens. 2005, 26, 1007–1011. [Google Scholar] [CrossRef]
  59. Srivastava, P.K.; Han, D.; Rico-Ramirez, M.A.; Bray, M.; Islam, T. Selection of classification techniques for land use/land cover change investigation. Adv. Space Res. 2012, 50, 1250–1265. [Google Scholar] [CrossRef]
  60. Breiman, L. Bagging predictors. Mach. Learn. 1996, 24, 123–140. [Google Scholar] [CrossRef] [Green Version]
  61. Freund, Y.; Schapire, R.E. A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting. J. Comput. Syst. Sci. 1997, 55, 119–139. [Google Scholar] [CrossRef] [Green Version]
  62. Friedl, M.A.; Brodley, C.E.; Strahler, A.H. Maximizing land cover classification accuracies produced by decision trees at continental to global scales. IEEE Trans. Geosci. Remote Sens. 1999, 37, 969–977. [Google Scholar] [CrossRef]
  63. Tayyebi, A.; Pijanowski, B.C. Modeling multiple land use changes using ANN, CART and MARS: Comparing tradeoffs in goodness of fit and explanatory power of data mining tools. Int. J. Appl. Earth Obs. Geoinf. 2014, 28, 102–116. [Google Scholar] [CrossRef]
  64. Gutiérrez, Á.G.; Schnabel, S.; Lavado Contador, J.F. Using and comparing two nonparametric methods (CART and MARS) to model the potential distribution of gullies. Ecol. Model. 2009, 220, 3630–3637. [Google Scholar] [CrossRef]
  65. Prasad, A.M.; Iverson, L.R.; Liaw, A. Newer classification and regression tree techniques: Bagging and random forests for ecological prediction. Ecosystems 2006, 9, 181–199. [Google Scholar] [CrossRef]
  66. Huang, G.; Zhou, H.; Ding, X.; Zhang, R. Extreme Learning Machine for Regression and Multiclass Classification. IEEE Trans. Syst. Man Cybern. B 2012, 42, 513–529. [Google Scholar] [CrossRef] [Green Version]
  67. Razi, M.A.; Athappilly, K. A comparative predictive analysis of neural networks (NNs), nonlinear regression and classification and regression tree (CART) models. Expert Syst. Appl. 2005, 29, 65–74. [Google Scholar] [CrossRef]
  68. Nemmour, H.; Chibani, Y. Fuzzy neural network architecture for change detection in remotely sensed imagery. Int. J. Remote Sens. 2006, 27, 705–717. [Google Scholar] [CrossRef]
  69. Pacifici, F.; Del Frate, F.; Solimini, C.; Emery, W.J. An innovative neural-net method to detect temporal changes in high-resolution optical satellite imagery. IEEE Trans. Geosci. Remote Sens. 2007, 45, 2940–2952. [Google Scholar] [CrossRef]
  70. Wylie, B.K.; Pastick, N.J.; Picotte, J.J.; Deering, C.A. Geospatial data mining for digital raster mapping. GISci. Remote Sens. 2018. [Google Scholar] [CrossRef]
  71. Li, Y.F.; Ng, S.H.; Xie, M.; Goh, T.N. A systematic comparison of metamodeling techniques for simulation optimization in Decision Support Systems. Appl. Soft Comput. 2010, 10, 1257–1273. [Google Scholar] [CrossRef]
  72. Jin, Y.; Liu, X.; Chen, Y.; Liang, X. Land-cover mapping using Random Forest classification and incorporating NDVI time-series and texture: A case study of central Shandong. Int. J. Remote Sens. 2018. [Google Scholar] [CrossRef]
  73. Na, X.; Zhang, S.; Li, X.; Yu, H.; Liu, C. Improved land cover mapping using random forests combined with landsat thematic mapper imagery and ancillary geographic data. Photogramm. Eng. Remote Sens. 2010, 76, 833–840. [Google Scholar] [CrossRef]
  74. Duro, D.C.; Franklin, S.E.; Dubé, M.G. A comparison of pixel-based and object-based image analysis with selected machine learning algorithms for the classification of agricultural landscapes using SPOT-5 HRG imagery. Remote Sens. Environ. 2012, 118, 259–272. [Google Scholar] [CrossRef]
  75. Chan, J.C.-W.; Paelinckx, D. Evaluation of Random Forest and Adaboost tree-based ensemble classification and spectral band selection for ecotope mapping using airborne hyperspectral imagery. Remote Sens. Environ. 2008, 112, 2999–3011. [Google Scholar] [CrossRef]
  76. Mellor, A.; Boukir, S.; Haywood, A.; Jones, S. Exploring issues of training data imbalance and mislabelling on random forest performance for large area land cover classification using the ensemble margin. ISPRS J. Photogramm. 2015, 105, 155–168. [Google Scholar] [CrossRef]
  77. Millard, K.; Richardson, M. On the Importance of Training Data Sample Selection in Random Forest Image Classification: A Case Study in Peatland Ecosystem Mapping. Remote Sens. 2015, 7, 8489–8515. [Google Scholar] [CrossRef] [Green Version]
  78. Dalponte, M.; Ørka, H.O.; Gobakken, T.; Gianelle, D.; Næsset, E. Tree Species Classification in Boreal Forests With Hyperspectral Data. IEEE Trans. Geosci. Remote Sens. 2013, 51, 2632–2645. [Google Scholar] [CrossRef]
  79. Friedman, J.H. Greedy Function Approximation: A Gradient Boosting Machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
  80. Lombardo, L.; Cama, M.; Conoscenti, C.; Märker, M.; Rotigliano, E. Binary logistic regression versus stochastic gradient boosted decision trees in assessing landslide susceptibility for multiple-occurring landslide events: Application to the 2009 storm event in Messina (Sicily, southern Italy). Nat. Hazards 2015, 79, 1621–1648. [Google Scholar] [CrossRef]
  81. Hartono, T.; Kobayashi, H.; Widjaya, H.; Suparmo, M. Taman Nasional Gunung Halimun Salak, Edisi Revisi ed.; Japan International Cooperation Agency (JICA), Gunung Halimun Salak National Park Management Project: Bogor, Indonesia, 2007; p. 48. [Google Scholar]
  82. Jones, D.A.; Hansen, A.J.; Bly, K.; Doherty, K.; Verschuyl, J.P.; Paugh, J.I.; Carle, R.; Story, S.J. Monitoring land use and cover around parks: A conceptual approach. Remote Sens. Environ. 2009, 113, 1346–1356. [Google Scholar] [CrossRef]
  83. Rosenqvist, A.; Shimada, M.; Watanabe, M. ALOS PALSAR: Technical outline and mission concepts. In Proceedings of the 4th International Symposium on Retrieval of Bio- and Geophysical Parameters from SAR Data for Land Applications, Innsbruck, Austria, 16–19 November 2004. [Google Scholar]
  84. Shimada, M.; Isoguchi, O.; Tadono, T.; Isono, K. PALSAR radiometric and geometric calibration. IEEE Trans. Geosci. Remote Sens. 2009, 47, 3915–3932. [Google Scholar] [CrossRef]
  85. Small, D.; Schubert, A. Guide to ASAR Geocoding; ESA-ESRIN Technical Note RSL-ASAR-GC-AD; Remote Sensing Laboratory, University of Zurich: Zurich, Switzerland, 2008; p. 36. [Google Scholar]
  86. Trisasongko, B.H. Mapping stand age of rubber plantation using ALOS-2 polarimetric SAR data. Eur. J. Remote Sens. 2017, 50, 64–76. [Google Scholar] [CrossRef] [Green Version]
  87. Canty, M.J.; Nielsen, A.A. Visualization and unsupervised classification of changes in multispectral satellite imagery. Int. J. Remote Sens. 2006, 27, 3961–3975. [Google Scholar] [CrossRef] [Green Version]
  88. Fung, T.; LeDrew, E. The determination of optimal threshold levels for change detection using various accuracy indices. Photogramm. Eng. Remote Sens. 1988, 54, 1449–1454. [Google Scholar]
  89. Nelson, R.F. Detecting Forest Canopy Change Due to Insect Activity Using Landsat Mss. Photogramm. Eng. Remote Sens. 1983, 49, 1303–1314. [Google Scholar]
  90. Nordberg, M.L.; Evertson, J. Monitoring Change in Mountainous Dry-heath Vegetation at a Regional Scale Using Multitemporal Landsat TM Data. Ambio 2003, 32, 502–509. [Google Scholar] [CrossRef] [PubMed]
  91. Li, Z.; Shi, W.; Myint, S.W.; Lu, P.; Wang, Q. Semi-automated landslide inventory mapping from bitemporal aerial photographs using change detection and level set method. Remote Sens. Environ. 2016, 175, 215–230. [Google Scholar] [CrossRef]
  92. Cloude, S.R.; Pottier, E. An entropy based classification scheme for land applications of polarimetric SAR. IEEE Trans. Geosci. Remote Sens. 1997, 35, 68–78. [Google Scholar] [CrossRef]
  93. Hijmans, R.J. Package Raster: Geographic Data Analysis and Modeling; R Package Version 2.6.7.; R Foundation for Statistical Computing: Vienna, Austria, 2017. [Google Scholar]
  94. Kuhn, M. Caret: Classification and Regression Training; R Foundation for Statistical Computing: Vienna, Austria, 2018. [Google Scholar]
  95. Li, Z.; Shi, W.; Lu, P.; Yan, L.; Wang, Q.; Miao, Z. Landslide mapping from aerial photographs using change detection-based Markov random field. Remote Sens. Environ. 2016, 187, 76–90. [Google Scholar] [CrossRef]
  96. Fung, T. An Assessment of TM Imagery for Land-Cover Change Detection. IEEE Trans. Geosci. Remote Sens. 1990, 28, 681–684. [Google Scholar] [CrossRef]
  97. Rodriguez-Galiano, V.F.; Chica-Olmo, M.; Abarca-Hernandez, F.; Atkinson, P.M.; Jeganathan, C. Random Forest classification of Mediterranean land cover using multi-seasonal imagery and multi-seasonal texture. Remote Sens. Environ. 2012, 121, 93–107. [Google Scholar] [CrossRef]
  98. Liaw, A.; Wiener, M. Classification and regression by randomForest. R News 2002, 2, 18–22. [Google Scholar]
  99. Belgiu, M.; Drăguţ, L. Random forest in remote sensing: A review of applications and future directions. Isprs. J. Photogramm. 2016, 114, 24–31. [Google Scholar] [CrossRef]
Figure 1. Site for change detection using dual polarimetric Advanced Land Observing Satellite Phased-Array type L-Band Synthetic Aperture Radar.
Figure 1. Site for change detection using dual polarimetric Advanced Land Observing Satellite Phased-Array type L-Band Synthetic Aperture Radar.
Remotesensing 11 00100 g001
Figure 2. The workflow of identifying subtle changes at the Halimun Salak National Park.
Figure 2. The workflow of identifying subtle changes at the Halimun Salak National Park.
Remotesensing 11 00100 g002
Figure 3. Pictures representing six types of the seven land-cover classes, including (a) forest, (b) rubber, (c) tea, (d) oil palm, (e) crop, and (f) built-up. Waterbody is sampled from Indian ocean.
Figure 3. Pictures representing six types of the seven land-cover classes, including (a) forest, (b) rubber, (c) tea, (d) oil palm, (e) crop, and (f) built-up. Waterbody is sampled from Indian ocean.
Remotesensing 11 00100 g003
Figure 4. The result of IRMAD processing, i.e., (a) MAD variate-1, (b) MAD variate-2, (c) Chi-square layer, and (d) RGB of those three layers. Letters 1 and 2 were located at areas with different visual change indication.
Figure 4. The result of IRMAD processing, i.e., (a) MAD variate-1, (b) MAD variate-2, (c) Chi-square layer, and (d) RGB of those three layers. Letters 1 and 2 were located at areas with different visual change indication.
Remotesensing 11 00100 g004
Figure 5. The binary image resulting from the Gamma-Map Filter of MAD variate-2 and historical Google Earth images in the proximate acquisition of ALOS PALSAR. A change from fields prepared for cultivation to land fully-covered with oil palm (a) or from juvenile to maturing oil palm plantation (c). The light tone signified the inverse condition from vegetated crop areas to semi-bare ones as indicated in (b,d).
Figure 5. The binary image resulting from the Gamma-Map Filter of MAD variate-2 and historical Google Earth images in the proximate acquisition of ALOS PALSAR. A change from fields prepared for cultivation to land fully-covered with oil palm (a) or from juvenile to maturing oil palm plantation (c). The light tone signified the inverse condition from vegetated crop areas to semi-bare ones as indicated in (b,d).
Remotesensing 11 00100 g005
Figure 6. The overall accuracy, false detection and missed detection of MAD variate-2 derived from an Iteratively Reweighted Multivariate Alteration Detection for consecutive multiplier constants of standard deviation between 0.1 and 2.0 to determine threshold for change detection.
Figure 6. The overall accuracy, false detection and missed detection of MAD variate-2 derived from an Iteratively Reweighted Multivariate Alteration Detection for consecutive multiplier constants of standard deviation between 0.1 and 2.0 to determine threshold for change detection.
Remotesensing 11 00100 g006
Figure 7. Comparing the accuracy of classification from backscatter intensity and synthetic data amendment, which includes decomposition entropy alpha, radar vegetation index and two parameters of texture Grey Level Co-occurrence Matrix, i.e., mean and variance resulting from seven classifiers including Bagged Classification and Regression Tree (CAB), Extreme Learning Machine Neural Networks (ENN), Bagged Multivariate Adaptive Regression Spline (MARS), Original Random Forest (RFO), and Regularised Random Forest (RFG), Support Vector Machine (SVM) and Extreme Gradient Boosting Tree (XGB).
Figure 7. Comparing the accuracy of classification from backscatter intensity and synthetic data amendment, which includes decomposition entropy alpha, radar vegetation index and two parameters of texture Grey Level Co-occurrence Matrix, i.e., mean and variance resulting from seven classifiers including Bagged Classification and Regression Tree (CAB), Extreme Learning Machine Neural Networks (ENN), Bagged Multivariate Adaptive Regression Spline (MARS), Original Random Forest (RFO), and Regularised Random Forest (RFG), Support Vector Machine (SVM) and Extreme Gradient Boosting Tree (XGB).
Remotesensing 11 00100 g007
Figure 8. Variable importance derived from five classifiers including CAB and MAB, RFO, RFG, and XGB of 2007 and 2010 datasets.
Figure 8. Variable importance derived from five classifiers including CAB and MAB, RFO, RFG, and XGB of 2007 and 2010 datasets.
Remotesensing 11 00100 g008
Figure 9. Consecutive accuracy of a different number of variables ordered by variable importance generated by seven classifiers based on RF.
Figure 9. Consecutive accuracy of a different number of variables ordered by variable importance generated by seven classifiers based on RF.
Remotesensing 11 00100 g009
Figure 10. The distribution of 12 land cover/change classes resulting from RFG added with producer and user accuracies.
Figure 10. The distribution of 12 land cover/change classes resulting from RFG added with producer and user accuracies.
Remotesensing 11 00100 g010
Table 1. Data combinations to assess the augmentation of dual polarisation SAR with synthetic layers for classification.
Table 1. Data combinations to assess the augmentation of dual polarisation SAR with synthetic layers for classification.
NameNumber of LayersData Combination
Backscatter2Dual polarisation HH, HV
Backscatter + Indices5Backscatter + Differencing HH-HV, ratioing HH/HV, Radar Vegetation Index (RVI)
Backscatter + Decomposition5Backscatter + Entropy Alpha decomposition (Entropy, Alpha, Anisotropy)
Backscatter + Textures8Backscatter + Texture components (mean, variance and correlation of HH or HV)
Table 2. Tuning parameters for each classifier.
Table 2. Tuning parameters for each classifier.
ClassifiersParameters
Bagged Classification and Regression TreeNo tuning parameters
Extreme Learning Machine Neural NetworkNumber of hidden unit (nhid) = 1–20
Action function (actfun) = sine, radial basis, linear, tan-sigmoid
Bagged Multivariate Adaptive Regression SplineNumber of terms (nprune) = 2–200
Product degree (degree) = 1–2
Regularised Random ForestRandomly selected predictors (mtry) = 1-number of classes (7 or 12)
Regularisation value (coefReg) = 0–1
Random ForestRandomly selected predictors (mtry) = 1–number of classes (7 or 12)
Support Vector MachineSigma = 0.1–0.9
Cost (C) = 25–210
Xtreme Gradient Boosting TreeNumber of boosting iteration (nrounds) = 1–1000;
Max tree depth (maxdepth) = 1–10;
Shrinkage (eta) = 0.001–0.6;
Minimum loss reduction (gamma) = 0–10;
Subsample ratio of column (colsample-bytree) = 0.3–0.7;
Minimum sum of instance weight (min-child-weight) = 0–20;
Subsample percentage (subsample) = 0.25–1
Table 3. Variables for assessing variable importance.
Table 3. Variables for assessing variable importance.
Name of VariableAcronym
Sigma0 HHHH
Sigma0 HVHV
Differencing HH-HVI_DIF
Ratioing HH/HVI_RAT
Radar vegetation indexI_RVI
Decomposition Cloude Pottierþ—AlphaCP_α
Decomposition Cloude Pottier—EntropyCP_H
Decomposition Cloude Pottie—AnisotropyCP_A
Texture GLCM—Mean of HHTH_M
Texture GLCM—Variance of HHTH_V
Texture GLCM—Mean of HVTV_M
Texture GLCM—Mean of HVTV_V
Table 4. The accuracy of classification over seven algorithms in 2007 and 2010 generated from all variables and the best subset. The blue colour indicates accuracies above the average; the red colour signifies accuracies below the average.
Table 4. The accuracy of classification over seven algorithms in 2007 and 2010 generated from all variables and the best subset. The blue colour indicates accuracies above the average; the red colour signifies accuracies below the average.
MethodsAbbreviations20072010Average
AllBest SubsetAllBest Subset
Bagged CARTCAB88.789.887.889.789.0
Bagged MARSMAB84.682.984.783.283.9
ELM Neural NetworkENN69.273.163.973.169.8
Extreme Gradient Boosting TreeXGB89.088.889.889.089.2
Random Forest OriginalRFO89.990.489.090.089.8
Regularised Random ForestRFG89.890.689.090.690.0
Support Vector MachineSVM81.386.582.586.584.2
Average accuracy (%)84.686.083.886.085.1

Share and Cite

MDPI and ACS Style

Panuju, D.R.; Paull, D.J.; Trisasongko, B.H. Combining Binary and Post-Classification Change Analysis of Augmented ALOS Backscatter for Identifying Subtle Land Cover Changes. Remote Sens. 2019, 11, 100. https://doi.org/10.3390/rs11010100

AMA Style

Panuju DR, Paull DJ, Trisasongko BH. Combining Binary and Post-Classification Change Analysis of Augmented ALOS Backscatter for Identifying Subtle Land Cover Changes. Remote Sensing. 2019; 11(1):100. https://doi.org/10.3390/rs11010100

Chicago/Turabian Style

Panuju, Dyah R., David J. Paull, and Bambang H. Trisasongko. 2019. "Combining Binary and Post-Classification Change Analysis of Augmented ALOS Backscatter for Identifying Subtle Land Cover Changes" Remote Sensing 11, no. 1: 100. https://doi.org/10.3390/rs11010100

APA Style

Panuju, D. R., Paull, D. J., & Trisasongko, B. H. (2019). Combining Binary and Post-Classification Change Analysis of Augmented ALOS Backscatter for Identifying Subtle Land Cover Changes. Remote Sensing, 11(1), 100. https://doi.org/10.3390/rs11010100

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop