Article

Improving Land Cover Classification Using Extended Multi-Attribute Profiles (EMAP) Enhanced Color, Near Infrared, and LiDAR Data

1 Applied Research LLC, Rockville, MD 20850, USA
2 Department of Computer Architecture and Automation, Complutense University of Madrid, 28040 Madrid, Spain
3 Department of Technology of Computers and Communications, University of Extremadura, 10003 Cáceres, Spain
4 Institute of Applied Physics “Nello Carrara”, IFAC-CNR, Research Area of Florence, 50019 Sesto Fiorentino (FI), Italy
* Author to whom correspondence should be addressed.
Remote Sens. 2020, 12(9), 1392; https://doi.org/10.3390/rs12091392
Submission received: 4 April 2020 / Revised: 22 April 2020 / Accepted: 26 April 2020 / Published: 28 April 2020

Abstract

Hyperspectral (HS) data have found a wide range of applications in recent years. Researchers have observed that more spectral information improves land cover classification performance in many cases. However, in some practical applications, HS data may not be available due to cost, data storage, or bandwidth issues. Instead, users may only have RGB and near infrared (NIR) bands available for land cover classification. Sometimes, light detection and ranging (LiDAR) data may also be available to assist land cover classification. A natural research problem is to investigate how well land cover classification can be achieved under the aforementioned data constraints. In this paper, we investigate the performance of land cover classification while using only four bands (RGB+NIR) or five bands (RGB+NIR+LiDAR). A number of algorithms have been applied to a well-known dataset (2013 IEEE Geoscience and Remote Sensing Society Data Fusion Contest). One key observation is that, with the help of synthetic bands generated by Extended Multi-attribute Profiles (EMAP), some algorithms can achieve better land cover classification performance using only four bands than using all 144 bands in the original hyperspectral data. Moreover, LiDAR data improve the land cover classification performance even further.

1. Introduction

Hyperspectral (HS) images have been used in many applications [1]. Examples of HS sensors include the Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) [2] and the Adaptive Infrared Imaging Spectroradiometer (AIRIS) [3]. The AVIRIS images have 224 bands in the range of 0.4 to 2.5 μm. AIRIS is a longwave infrared (LWIR) sensor with 20 bands for the remote detection of chemical agents, such as nerve gas. In the literature, HS data have been used for small target detection [4,5], fire damage assessment [6,7], anomaly detection [8,9,10,11,12,13,14], chemical agent detection and classification [3,15], border monitoring [16], change detection [17,18,19,20], and Mars surface mineral abundance estimation [21,22]. There are also many papers on land cover classification. For instance, the fusion of HS and LiDAR data was proposed in [23] and applied to the 2013 IEEE Geoscience and Remote Sensing Society (GRSS) Data Fusion Contest dataset. All 144 bands, with help from Extended Multi-attribute Profiles (EMAP), were used, and the results achieved 90% overall accuracy. In another paper [24], a graph-based approach was proposed to fuse HS and LiDAR data for land cover classification.
However, HS sensors are expensive and they usually demand large data storage. In some real-time applications, HS data may need to be transmitted via bandwidth-constrained channels to ground stations for data processing. The above scenarios prohibit the use of HS sensors in some applications, such as precision agriculture, where farmers may have a limited budget. In [25], some challenges in practical applications are mentioned; one of them is that, in the event that no HS data are available, synthetic spectral bands generated by EMAP may be a good alternative. Some recent applications of EMAP to soil detection and change detection can be found in [26].
In this paper, we focus on addressing the above practical problem in land cover classification, where only a few bands, namely RGB and near infrared (NIR) bands, are available. Under such data-constrained situations, our first question is what performance can be achieved for land cover classification using only those four bands. Second, if synthetic bands generated by EMAP are available, what kind of performance boost can one get? Is the resulting performance close to the case of using all hyperspectral bands? Third, if LiDAR data are available, can we see even further enhancement in land cover classification?
It should be noted that RGB and NIR images are mostly used for vegetation detection, which involves the simple calculation of the normalized difference vegetation index (NDVI); the generation of NDVI can be done in real time. Several recent studies have also used color images for visual object classification (VOC) with deep learning methods (DeepLabV3+ [27], SegNet [28], Pyramid Scene Parsing network (PSP) [29], and Fully Convolutional Network (FCN) [30]). In principle, those deep learning methods for object detection using color images can be adapted to land cover classification. Recently, our team initiated an investigation along that direction in [31]. However, those deep learning methods require a lot of training data and may not yield better results when data are scarce, which is the case for the IEEE Houston dataset presented in this paper.
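For completeness, a minimal sketch of the NDVI calculation mentioned above is given below; the array names are hypothetical, and the small epsilon is an implementation detail we add to avoid division by zero.

```python
import numpy as np

def ndvi(red, nir):
    """NDVI = (NIR - Red) / (NIR + Red), computed per pixel."""
    red = red.astype(np.float64)
    nir = nir.astype(np.float64)
    return (nir - red) / (nir + red + 1e-12)  # epsilon guards against division by zero
```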
In our investigations, we applied nine algorithms to the 2013 IEEE GRSS Data Fusion Contest dataset [23] for land cover classification: three hyperspectral classification methods, namely Matched Subspace Detection (MSD), Adaptive Subspace Detection (ASD), and Reed-Xiaoli Detection (RXD); their kernel versions; and Sparse Representation (SR), Joint SR (JSR), and Support Vector Machine (SVM). Some of them cannot be directly applied and required some customization. For instance, RXD is a well-known technique for anomaly detection; we modified it for land cover classification. The customization of existing algorithms can be considered our first contribution. In our studies, we clearly saw the advantages of using EMAP: in most cases, the EMAP versions resulted in a significant performance increase. Quantifying the performance gain of using synthetic bands for land cover classification is our second contribution. Moreover, we also confirmed that land cover classification performance can be further enhanced if LiDAR is combined with EMAP. Confirming that LiDAR does help the classification accuracy is our third contribution.
It is worth mentioning that the proposed methodology has been applied to another dataset known as the Trento dataset, which contains both hyperspectral and LiDAR data. The results clearly demonstrated that our approach, using only RGB, NIR, and LiDAR with EMAP, achieved land cover classification results very close to those of state-of-the-art methods in the literature. However, we did not have permission to publish the results related to the Trento dataset; interested readers can contact us directly for them.
The paper is organized as follows. In Section 2, we review the classification algorithms, the 2013 IEEE GRSS Data Fusion Contest data, EMAP, and the evaluation metrics. In Section 3, we summarize our findings: the classification results using different methods and combinations of spectral bands are presented with tables and classification maps. Moreover, we compare our results with two representative results in the literature for the same dataset. Our results using only four available bands with help from EMAP are very close to the results in [23,24] that used 144 bands. Finally, we conclude the paper with a few remarks.

2. Methods and Data

2.1. Land Cover Classification Methods

Although land cover classification can be done using object-based detection methods, here we perform pixel-based classification. This is because the IEEE dataset only has ground truth land cover labels for individual pixels rather than land cover maps. There are 15 land cover classes in the 2013 IEEE GRSS Data Fusion Contest. For each class, a number of signatures from the training data are available. For a particular classifier, the classification process begins by detecting each class separately. The maximum detection value at each pixel location across all class maps is then taken, and the index of that maximum value becomes the class label. Aggregating all of those classification labels into a two-dimensional (2D) matrix yields the overall classification map. The details of each method are as follows.
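The one-versus-rest labeling scheme just described can be summarized with a minimal sketch; `detect` stands for any of the per-class detectors introduced below and is a hypothetical callable, not part of any specific library.

```python
import numpy as np

def classify(cube, detect, num_classes=15):
    """Run a per-class detector over the image cube and label each pixel
    with the class whose detection score is largest."""
    score_maps = np.stack([detect(cube, c) for c in range(1, num_classes + 1)])
    return np.argmax(score_maps, axis=0) + 1  # class labels 1..15 per pixel
```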

2.1.1. Matched Subspace Detection (MSD)

MSD [32] classifies a given pixel by testing it against target and background subspaces built from the training signatures. Two competing hypotheses are considered, H0 (target absent) and H1 (target present):

$H_0: \mathbf{y} = B\boldsymbol{\zeta} + \mathbf{n}$,  (1)

$H_1: \mathbf{y} = T\boldsymbol{\theta} + B\boldsymbol{\zeta} + \mathbf{n} = [T \; B] \begin{bmatrix} \boldsymbol{\theta} \\ \boldsymbol{\zeta} \end{bmatrix} + \mathbf{n}$,  (2)

where T and B are matrices whose column vectors span the target and background/non-target subspaces, respectively, θ and ζ are the unknown coefficient vectors associated with the columns of T and B, respectively, and n represents random noise. These hypotheses are combined into a generalized likelihood ratio test (GLRT) that predicts whether a specific pixel is a target or background pixel:

$L_2(\mathbf{y}) = \dfrac{\mathbf{y}^T (\mathbf{I} - P_B) \mathbf{y}}{\mathbf{y}^T (\mathbf{I} - P_{TB}) \mathbf{y}}$,  (3)

where $P_B = B (B^T B)^{-1} B^T$ and $P_{TB} = [T \; B] ([T \; B]^T [T \; B])^{-1} [T \; B]^T$.
The reason we chose MSD as one of the methods is that MSD has been proven to work well in some hyperspectral target detection applications [32]. In this paper, we use MSD as follows. To detect class i, we use pixel signatures from the training samples in class i to form the target matrix Ti and the samples from all other classes to form the matrix Bi. We then insert Ti and Bi into (1) and (2) and perform target detection for class i using (3) over the whole image cube, saving detection map i. We repeat the process for i = 1 to 15. The label that corresponds to the maximum value across the 15 maps at each pixel location is the class identity.
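A minimal sketch of the GLRT in Equation (3) follows, assuming T and B are tall matrices whose columns are the class-i and non-class-i training signatures (or bases derived from them); this is our reading of the equations above, not necessarily the authors' exact implementation.

```python
import numpy as np

def msd_glrt(y, T, B):
    """Generalized likelihood ratio of Equation (3) for one pixel vector y."""
    TB = np.hstack([T, B])
    P_B = B @ np.linalg.pinv(B.T @ B) @ B.T        # P_B = B (B^T B)^-1 B^T
    P_TB = TB @ np.linalg.pinv(TB.T @ TB) @ TB.T   # projector onto span([T B])
    I = np.eye(len(y))
    return (y @ (I - P_B) @ y) / (y @ (I - P_TB) @ y)
```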

2.1.2. Adaptive Subspace Detection (ASD)

ASD [32] follows a process very similar to MSD, with a slightly different set of hypotheses:

$H_0: \mathbf{x} = \mathbf{n}$,  (4)

$H_1: \mathbf{x} = U\boldsymbol{\theta} + \sigma\mathbf{n}$,  (5)

where U is the orthogonal matrix whose column vectors are eigenvectors of the target subspace, θ is the unknown vector whose entries are coefficients for the target subspace, σ is an unknown noise scaling, and n is random Gaussian noise. These hypotheses are then used to form a GLRT similar to that of MSD. The classification is done for each class first, and the final decision is made by picking the class label that corresponds to the maximum detection value at each pixel location.
Similar to MSD, we adopted ASD simply because it has been proven to work quite well in target detection while using hyperspectral images [32,33].

2.1.3. Reed-Xiaoli Detection (RXD)

In the hyperspectral image processing community, RXD [34] is usually used for anomaly detection; it is simple and efficient. We would like to emphasize that, to the best of our knowledge, no one has applied RXD to land cover classification before. Here, we apply RXD in a very different way. For land cover classification, RXD follows the same one-versus-rest procedure as MSD and ASD: to detect pixels in class i, the background pixels come from the training samples of the 14 other classes. That is, the RXD score can be expressed as:

$RX(\mathbf{r}) = (\mathbf{r} - \boldsymbol{\mu}_b)^T C_b^{-1} (\mathbf{r} - \boldsymbol{\mu}_b)$,  (6)

where r is the test pixel, μb is the estimated sample mean of the 14 background classes, and Cb is the background covariance of the training samples in the 14 other classes. The process is repeated for all 15 classes, one run per class. The final classification is done by choosing the class label that corresponds to the maximum detection value at each pixel location.
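A minimal sketch of the class-i RXD score in Equation (6) is shown below, assuming `background` stacks the training signatures of the other 14 classes row-wise; the pseudo-inverse is a guard we add for ill-conditioned covariance estimates.

```python
import numpy as np

def rx_score(pixels, background):
    """Mahalanobis distance of each pixel to the background statistics (Eq. (6))."""
    mu = background.mean(axis=0)
    C_inv = np.linalg.pinv(np.cov(background, rowvar=False))
    d = pixels - mu                               # (num_pixels, num_bands)
    return np.einsum('ij,jk,ik->i', d, C_inv, d)  # one RX score per pixel
```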
The kernel versions of each of the above methods—ASD, MSD, and RXD—all follow a similar fashion.

2.1.4. Kernel MSD (KMSD)

In [32], it was demonstrated that KMSD has a better performance than MSD. In light of that, we also included KMSD in our investigations. In KMSD, the input data have been implicitly mapped by a nonlinear function Φ into a high dimensional feature space F. The detection model in F is then given by:
$H_0^{\Phi}: \Phi(\mathbf{y}) = B_{\Phi}\boldsymbol{\zeta}_{\Phi} + \mathbf{n}_{\Phi}$  (target absent),  (7)

$H_1^{\Phi}: \Phi(\mathbf{y}) = T_{\Phi}\boldsymbol{\theta}_{\Phi} + B_{\Phi}\boldsymbol{\zeta}_{\Phi} + \mathbf{n}_{\Phi}$  (target present),  (8)
where the variables are defined in [32]. The kernelized GLRT for KMSD can be found in Equation (9) of [32].

2.1.5. Kernel ASD (KASD)

Similar to KMSD, KASD [32] was also adopted in our investigations because of its good performance in [32]. In KASD, similar to KMSD, the detection formulation can be written as
$H_0^{\Phi}: \Phi(\mathbf{x}) = \mathbf{n}_{\Phi}$  (target absent),  (9)

$H_1^{\Phi}: \Phi(\mathbf{x}) = U_{\Phi}\boldsymbol{\theta}_{\Phi} + \sigma_{\Phi}\mathbf{n}_{\Phi}$  (target present),  (10)
The various variables are defined in [15]. The final detector in kernelized format can be found in Equation (30) of [15].

2.1.6. Kernel RXD (KRXD)

The reason for including KRXD is that it was demonstrated to perform much better than RXD in [34]. In KRXD, every pixel is transformed to a high-dimensional space via a nonlinear transformation Φ. The kernel representation of the dot product in feature space between two arbitrary vectors xi and xj is

$k(\mathbf{x}_i, \mathbf{x}_j) = \langle \Phi(\mathbf{x}_i), \Phi(\mathbf{x}_j) \rangle = \Phi(\mathbf{x}_i) \cdot \Phi(\mathbf{x}_j)$.  (11)

A commonly used kernel is the Gaussian radial basis function (RBF) kernel

$k(\mathbf{x}, \mathbf{y}) = \exp\left(-\|\mathbf{x} - \mathbf{y}\|^2 / c\right)$,  (12)

where c is a constant and x and y are spectral signatures of two pixels in a hyperspectral image. This is the well-known kernel trick, which avoids the explicit computation of high-dimensional features and enables the implementation of KRXD. Details of KRXD can be found in [19,34].
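A minimal sketch of the kernel trick in Equations (11) and (12) is given below; computing the Gram matrix this way is all the kernel methods need from the feature space, and the vectorized distance computation is our own implementation choice.

```python
import numpy as np

def rbf_kernel_matrix(X, Y, c):
    """Gaussian RBF kernel of Eq. (12) between all rows of X and all rows of Y,
    without ever forming the high-dimensional features Phi(x)."""
    sq_dists = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-sq_dists / c)
```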

2.1.7. Sparse Representation (SR)

In [16], we applied SR to detect soil excavated by illegal tunnel activity and observed that SR was one of the best performing methods, which is why we also include it here. SR exploits the structure of having only a few nonzero coefficients by solving the convex $\ell_{1,q}$-norm minimization problem:

$\min_{S} \| Y - DS \|_F \quad \text{s.t.} \quad \| S \|_{q} \le s_0$,  (13)

where $\|S\|_q$ measures the row-sparsity of the coefficient matrix S (the number of nonzero rows), $s_0$ is a pre-defined maximum row-sparsity level, q > 1 selects the norm applied across multiple observations to encourage shared sparsity patterns, Y contains the signature values of the given pixel, and D is the dictionary of class signatures.
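As an illustration, a greedy solver such as orthogonal matching pursuit can stand in for the sparse solver in Equation (13); the residual-based labeling rule below is a common companion to SR classifiers and is our sketch, not necessarily the exact solver used in [16].

```python
import numpy as np
from sklearn.linear_model import OrthogonalMatchingPursuit

def sr_classify(y, D, class_ids, s0=5):
    """Sparse-code pixel y over dictionary D (columns = training signatures) and
    assign the class whose atoms reconstruct y with the smallest residual."""
    omp = OrthogonalMatchingPursuit(n_nonzero_coefs=s0, fit_intercept=False)
    omp.fit(D, y)                      # solves y ~ D @ coef with at most s0 nonzeros
    coef = omp.coef_
    classes = np.unique(class_ids)
    residuals = [np.linalg.norm(y - D[:, class_ids == c] @ coef[class_ids == c])
                 for c in classes]
    return classes[int(np.argmin(residuals))]
```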

2.1.8. Joint Sparse Representation (JSR)

Similar to SR, JSR was used in our earlier study on soil detection [16]. Although JSR is more computationally demanding, it exploits neighborhood pixels for joint land type classification. In JSR, a 3 × 3 or 5 × 5 patch of pixels is used to build the observation matrix. The optimization is the same as Equation (13), but the coefficient matrix S gains an added dimension that accounts for each pixel within whatever patch size is used. Details of the mathematics can be found in [16].

2.1.9. Support Vector Machine (SVM)

SVMs were first suggested in the 1960s [35] for classification and have since been an area of intense research, owing to developments in techniques and theory coupled with extensions to regression and density estimation. An SVM is a general architecture that can be applied to pattern recognition and classification [36], regression estimation, and other problems, such as speech and target recognition. An SVM can be constructed as a linear maximum-margin classifier that is trained by solving a convex quadratic programming problem with constraints.
The reason for including SVM in our experiments is simply because several past papers [23,24] also used SVM in land cover classifications.
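For reference, a minimal multi-class SVM sketch in the spirit of our experiments is shown below; the random arrays only illustrate the expected shapes, and the RBF hyperparameters are placeholders rather than the values used in our study.

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X_train = rng.random((2832, 4))       # 2832 training pixels, 4 bands (DS-4)
y_train = rng.integers(1, 16, 2832)   # class labels 1..15
X_test = rng.random((12197, 4))       # stand-in for the 12,197 test pixels

clf = SVC(kernel='rbf', C=100.0, gamma='scale')  # placeholder hyperparameters
clf.fit(X_train, y_train)
labels = clf.predict(X_test)          # one class label per test pixel
```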

2.2. EMAP

In this section, we briefly introduce EMAP, which has been shown to yield good classification performance when only a few spectral bands are available. Mathematically, given an input grayscale image $f$ and a sequence of threshold levels $\{Th_1, Th_2, \ldots, Th_n\}$, the attribute profile (AP) of $f$ is obtained by applying a sequence of thinning and thickening attribute transformations to every pixel of $f$, as follows:

$AP(f) = \{\phi_1(f), \phi_2(f), \ldots, \phi_n(f), f, \gamma_1(f), \gamma_2(f), \ldots, \gamma_n(f)\}$,  (14)

where $\phi_i$ and $\gamma_i$ ($i = 1, 2, \ldots, n$) are the thickening and thinning operators at threshold $Th_i$, respectively. The EMAP of $f$ is then acquired by stacking two or more APs, each built with a different attribute, such as purely geometric attributes (e.g., area, length of the perimeter, image moments, shape factors) or textural attributes (e.g., range, standard deviation, entropy) [37,38,39,40]; for multispectral/hyperspectral data, the APs are computed on the features obtained from a feature reduction technique:

$EMAP(f) = \{AP_1(f), AP_2(f), \ldots, AP_m(f)\}$.  (15)
In this paper, the “area (a)” and “length of the diagonal of the bounding box (d)” attributes of EMAP [26] were used. For the area attribute, the two thresholds used by the morphological attribute filters were set to 10 and 15. For the length attribute, the thresholds were set to 50, 100, and 500. These thresholds were chosen based on experience, because we observed them to yield consistent results in our experiments. With this parameter setting, EMAP creates 11 synthetic bands for a given single-band image: two operators × two area thresholds, plus two operators × three length thresholds, plus the original band (4 + 6 + 1 = 11).
EMAP has been used in hyperspectral image processing before; more technical details and applications of EMAP can be found in [37,38,39,40]. In fact, EMAP was used for land cover classification in [23,24]. One key difference between those references and our approach here is that we applied EMAP to only RGB+NIR and RGB+NIR+LiDAR, whereas the above methods all used the original hyperspectral data.
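A minimal sketch of an area attribute profile follows, using scikit-image's area opening and closing as the thinning and thickening operators of Equation (14); this illustrates the construction, though the EMAP implementation of [26] may differ in its details.

```python
import numpy as np
from skimage.morphology import area_opening, area_closing

def area_attribute_profile(band, thresholds=(10, 15)):
    """Area attribute profile of a single band: thickenings (area closings),
    the original band, and thinnings (area openings), as in Eq. (14)."""
    thick = [area_closing(band, area_threshold=t) for t in thresholds]
    thin = [area_opening(band, area_threshold=t) for t in thresholds]
    return np.stack(thick + [band] + thin)  # (2*len(thresholds)+1, H, W) bands
```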

2.3. Dataset Used

From the IEEE GRSS Data Fusion package [23], we obtained the ground truth classification pixels, the hyperspectral image of the University of Houston area, and the LiDAR data of the same area. The hyperspectral data contain 144 bands ranging in wavelength from 380 to 1050 nm, with a spatial resolution of 2.5 m and a spectral width of 4.65 nm per band. The LiDAR data contain elevation information at the same 2.5 m resolution.
Table 1 displays the number of training and testing pixels per class. Figure 1 shows the Houston area with the ground truth classifications that were used to determine the overall accuracy.
The predetermined training set includes 2832 pixels, and the testing set includes the remaining 12,197 labeled pixels from the University of Houston dataset. The results obtained with this predetermined training set were considerably worse than those of a random subsampling approach, which suggests that the predetermined training data might not be entirely representative of the testing data. In any event, we conducted our investigations with the predetermined split in order to compare our results with past studies.
There are a number of datasets used for our analysis, as shown in Table 2. The first group is the RGB bands (bands #60, #30, and #22 in the hyperspectral data) and the NIR band (band #103). It should be noted that the above selection of bands is not the same as band selection in the literature [41]: in band selection, the objective is to select the most informative bands out of the available hyperspectral bands, whereas in our case we are restricted to having only a few bands. One might have some concerns about the use of narrow bands in the HS data to emulate RGB and NIR bands. We address this issue in Section 3.1.4; it turns out that such simple selection is almost equivalent to creating “realistic” RGB and NIR bands by applying the spectral responses of actual color and NIR imagers to the hyperspectral bands. We call this group Dataset-4 (DS-4). The second group is RGB and NIR coupled with LiDAR data; we denote it as Dataset-5 (DS-5). The third group is the four-band group put through EMAP augmentation to produce 44 bands, as each band produces ten other bands in addition to the original band (denoted as Dataset-44 (DS-44)). The fourth group is the five-band group plus EMAP augmentation (denoted as Dataset-55 (DS-55)). The fifth group is the full hyperspectral image of 144 bands (denoted as Dataset-144 (DS-144)). Finally, the last group is the 144 bands + LiDAR case (denoted as Dataset-145 (DS-145)). The DS-4 and DS-5 cases require less computation time but suffer a degradation in accuracy; DS-5 results in a smaller reduction in accuracy with a minimal increase in time, because the LiDAR data help the classification of tall structures as well as the differentiation of trees, shrubs, and grass. These two cases are not expected to work well in land cover classification. Meanwhile, the DS-44 and DS-55 cases provide a middle ground in both time consumption and accuracy loss that, depending on the method, could prove useful in practical applications. The full hyperspectral image (DS-144) and the DS-145 case (144 + LiDAR) are simply used as benchmarks for comparison with the rest of the combinations.

2.4. Evaluation Metrics

We adopted overall accuracy (OA), average accuracy (AA), and the kappa (κ) coefficient as the performance metrics to be consistent with existing land cover classification methods in the literature. OA is defined as the ratio between the sum of correctly classified pixels from all classes and the total number of pixels in all classes. AA is the average of the individual class accuracies. The kappa coefficient is defined as (overall accuracy − random accuracy)/(1 − random accuracy), where random accuracy is also known as accuracy by chance. For more details, please visit L3 Harris’ website at https://www.harrisgeospatial.com/docs/CalculatingConfusionMatrices.html.
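A minimal sketch of the three metrics, computed from a confusion matrix, is given below; the row/column convention (rows = ground truth, columns = predictions) is an assumption of the sketch.

```python
import numpy as np

def oa_aa_kappa(cm):
    """OA, AA, and kappa from confusion matrix cm (rows = truth, cols = predicted)."""
    n = cm.sum()
    oa = np.trace(cm) / n                                # overall accuracy
    aa = np.mean(np.diag(cm) / cm.sum(axis=1))           # mean per-class accuracy
    pe = (cm.sum(axis=0) * cm.sum(axis=1)).sum() / n**2  # accuracy by chance
    return oa, aa, (oa - pe) / (1 - pe)                  # kappa = (OA - pe)/(1 - pe)
```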

3. Land Cover Classification Results

3.1. Results

3.1.1. Results of Using Narrow Bands

Analysis was conducted with each method, using the different combinations of bands in Table 2, to determine accuracy and computational efficiency. We used the performance metrics defined in Section 2.4, namely overall accuracy (OA), average accuracy (AA), and the kappa (κ) coefficient.
Table 3, Table 4 and Table 5 summarize the key metrics (OA, AA, and Kappa) of each method for different band combinations. Red numbers indicate the best accuracy for each method and the bold numbers indicate the best performance for each dataset.
Table 6, Table 7 and Table 8 provide the detailed class-specific accuracies. In the majority of the tables, accuracy improves as the number of bands increases, the most obvious exception being the full hyperspectral image. It should be noted that, without EMAP, the performance metrics of SVM on DS-4 and DS-5 are, in general, inferior to those of the full HS cases. This basically answers the first question raised in Section 1 about the feasibility of using only RGB and NIR for land cover classification: if one only uses the RGB and NIR bands, the land cover classification performance is not accurate enough.
The advantage of using EMAP stands out here: in most cases, the EMAP versions resulted in a significant performance increase. For instance, when we used the four bands with EMAP (DS-44), the JSR and SVM methods achieved 80.77% and 82.64% overall accuracy, respectively, whereas the corresponding results without EMAP (DS-4) are only 59.83% and 70.43%. When using EMAP with five bands (DS-55), Joint Sparse Representation (JSR) provided the highest overall accuracy with 86.86%; without EMAP (DS-5), the overall accuracy of JSR with the same five bands was only 70.81%. This is followed by the SVM method with EMAP and five bands (DS-55), which had an overall accuracy of 86.00%; without EMAP (DS-5), the overall accuracy of SVM with the same five bands was 74.62%. The above practically answers the second question raised in Section 1 about whether EMAP can help classification performance.
In several instances, accuracy decreases when moving from the DS-55 case of a method to the DS-144 case. Interestingly, for both of the two best performing methods (JSR and SVM), using all 144 original bands yielded classification accuracies of only 72.57% (JSR in DS-144) and 78.68% (SVM in DS-144), which are considerably lower. This shows that using all of the hyperspectral bands for land cover classification can sometimes lead to poor results. We discuss this point in Section 3.2.
From Table 3, Table 4 and Table 5, it is quite clear to see that LiDAR did help the performance, especially for the SVM cases. For example, DS-5 is better than DS-4 in SVM; DS-55 is better than DS-44 in SVM; and, DS-145 is better than DS-144 in SVM. This answers the third question about whether LiDAR can help the classification performance that we raised in Section 1.
The next important piece of information concerns computational complexity. As a general rule, all of the standard methods are faster than their kernel counterparts, and the kernel methods are faster than the SR and JSR methods. Table 9 shows the elapsed time (ET) values during training, in minutes. Table 9 adopts the same layout as Table 3, Table 4, and Table 5, except that the best value is the minimum value and bold values are not used. A Windows 7 PC without GPU (16 GB RAM, Intel i7 CPU) was used in our experiments. Looking at Table 9, it is clear that certain methods have major advantages in ET. The most drastic difference in time involves the SR and JSR methods: while the kernel methods take up to two hours to process the image, JSR can take over a day and a half. The final three methods are the most consistently accurate, but only SVM is really worth the increased consistency, given the vast amount of time that SR and JSR take to generate results. At its slowest, JSR classifies only about five pixels per second, whereas MSD at its slowest still processes over 1250 pixels per second and KASD about 85 pixels per second. JSR simply cannot keep up.
In the SVM row of Table 9, we noticed that the ET training times for DS-4 and DS-5 are actually higher than in the other cases. This observation might look strange, but we repeated our experiments multiple times and the ET times are all correct. The reason is most likely that the support vectors in SVM are obtained via an iterative optimization process during training; if the optimization metric reaches a pre-specified threshold, the iteration stops. We believe that, in the DS-44, DS-55, DS-144, and DS-145 cases, the convergence was faster than in the DS-4 and DS-5 cases, and hence less time was needed to reach good performance. This might be corroborated by the fact that DS-4 and DS-5 indeed have inferior performance to the other cases: more computational iterations (more time) were needed, and yet the final performance metrics were still lower than in the other cases.
The accuracy values are generated from a small subsection of the full image, but the ET value is calculated across the entire image: the ground truth covers only around twelve thousand pixels, while the full image encompasses more than 650,000 pixels. It is important to look at the classification maps of the full image in order to obtain a better sense of the accuracy of the methods. All of the classification maps (Figure A1, Figure A2, Figure A3, Figure A4, Figure A5, Figure A6, Figure A7, Figure A8 and Figure A9) are in Appendix A, but the classification maps of the DS-44 case for the nine methods are shown in Figure 2, as it is the most practical case and gives a good sense of the accuracy of each method.
A large distortion is clearly present in the right quarter of each image, as can be seen in Figure 2. It is most likely caused by a cloud, which then affects the detection performance of each method. It can also be seen that certain maps, such as that of KRXD (sub-image (f)), contain a decent amount of noise in their classifications. Even with that noise, the ground truth accuracy of KRXD is still around 64%. In contrast, the SVM map is quite consistent, with much less noise.

3.1.2. Comparison with Khodadadzadeh et al.’s Results [23]

Here, we extracted some numbers from Table V in [23] and put them in Table 10. According to [23], the case of Xh+EMAP(Xh) used all 144 bands and some additional EMAP bands; the case of Xh + AP(XL) used 144 bands and some additional bands from the LiDAR data; the last case is the combination of 144 bands, LiDAR, and EMAP bands.
We also extracted our best performing numbers from Table 3 and put them in Table 10. Our DS-44 case includes four optical bands (RGB+NIR) and 40 EMAP bands. This case is similar to Xh+EMAP(Xh), except that we only used four of the 144 bands. It can be seen that our results are only 2 to 4% lower than those of using 144 bands. The DS-55 case includes LiDAR information. Comparing our results to the two cases (Xh + AP(XL)) and (Xh + AP(XL) + EMAP(Xh)) in [23], our results are only 1 to 4% lower.
The above comparison shows that our results of using only four or five bands with EMAP can achieve results that are only a few percentage points lower than MLRsub in [23]. This means that it is feasible to use RGB+NIR with EMAP for practical land cover classification.

3.1.3. Comparison with Liao et al.’s Results

Now, we compare our results with those generated by Generalized Graph-based Fusion (GGF) [24]. We extracted some numbers from Table I of [24] and put them in Table 10. The HS case (known as Raw HS in [24]) is the result of directly applying SVM to the 144 bands. The MPSHSLi case applied SVM to features generated by morphological profiles of the HS and LiDAR data. The GGF case applied SVM to GGF features (both hyperspectral and LiDAR).
When comparing our DS-44 case with the Raw HS case in [24], one can see that our results are almost the same. Comparing our DS-55 case to the MPSHSLi case in [24], the results are also comparable. Finally, our DS-55 results are 7% lower than those of GGF. However, it is important to mention that the GGF method utilized some additional information from the test samples. In the paragraph below Figure 2 of Liao et al.’s paper [24], the authors mention that “in our experiments, 5000 samples were randomly selected to train … our proposed GGF”. As there are only 2832 pixels in the training data, the 5000 samples must have partly come from the test data, so this is not a fair comparison between GGF and our results.

3.1.4. Wide RGB and NIR Bands

When we created the RGB and NIR bands, we directly selected narrow bands from the hyperspectral data. There are many color imagers with different wavelength ranges. Some imagers may have narrow bands that indeed look like those hyperspectral bands; in other imagers, the spectral response of each individual band might be wider. See Figure 3 for the actual spectral responses of the RGB and NIR bands of a commercial imager [42]. It is therefore important to investigate the robustness of our approach with respect to different sensors: will the bandwidth affect the observations in this paper? Here, we carry out a comparative study between our choice of narrow RGB and NIR bands and wide-band images obtained by combining multiple bands from the hyperspectral image using actual spectral response functions.
Here, we first synthesize wide RGB and NIR bands from the hyperspectral bands. We generate each band as a weighted average of the hyperspectral bands based on the spectral response functions shown in Figure 3; that is, the weights come from the spectral response curves, and we multiply each weight with the corresponding band in the hyperspectral data and then add the results together. After generating the four wide bands, we apply the EMAP algorithm to generate the synthetic EMAP bands. Finally, we applied the SVM classifier to those wide bands. Table 11 shows the classification results. From Table 11, it can be seen that the land cover classification performance using narrow bands from the HS data is within 0.5% of that using wide bands. In four out of 12 cases, the narrow bands have slightly better performance than the wide bands; in the remaining cases, the wide bands are slightly better, which makes sense because the wide bands aggregate more spectral information. In short, the land cover classification performances of narrow and wide bands are comparable, and our results are consistent regardless of the bandwidths.
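A minimal sketch of the wide-band synthesis described above is given below; it assumes the spectral response curve has already been resampled at the 144 HS band centers, and the normalization step is our own convention.

```python
import numpy as np

def synthesize_wide_band(hs_cube, response):
    """Weighted average of HS bands (axis 0 of hs_cube, shape (144, H, W)) using a
    sensor spectral response sampled at the HS band centers (length-144 weights)."""
    w = np.asarray(response, dtype=np.float64)
    w = w / w.sum()                                   # normalize the response curve
    return np.tensordot(w, hs_cube, axes=([0], [0]))  # resulting (H, W) wide band
```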

3.2. Discussion

It should be noted that our emphasis in this paper is to investigate the performance of using four bands (DS-4) and, if LiDAR data are available, five bands (DS-5) for land cover classification. The four and five bands were augmented with EMAP. We will address some additional discussions in the following.

3.2.1. Full Hyperspectral Data vs. Synthetic Bands

One might expect the case of using all 144 bands of hyperspectral data to yield the best performance. This turns out not to be the case in our findings. Although we do not have a solid theory to explain this behavior, we have seen similar observations in other researchers’ results. For instance, in Table IV of [43], there are at least two approaches (PCA and MPsHSI) that utilized fewer bands and yet still achieved better overall accuracies than using all bands. In Table 4 of [41], the method known as ISSC used fewer bands and yet performed better than using all bands; in Table 5 of [41], another method known as LP also used fewer bands and achieved higher accuracy than using all bands. One potential explanation for these observations is the curse of dimensionality in the hyperspectral data; in other words, more bands may confuse the classifiers in some ways. Another potential explanation, suggested by one anonymous reviewer, might be related to the “redundancy” issue in hyperspectral data: more but redundant spectral bands may be more harmful than fewer but non-redundant bands. However, more theoretical research is needed to fully understand these observations.

3.2.2. EMAP Based Augmentation vs. Deep Learning

Another natural question is whether EMAP augmentation is outdated in the era of deep learning, because deep learning has some generalization capability. This viewpoint may or may not be valid, depending on the application. For the same dataset, we are currently investigating two deep learning methods using only four (RGB+NIR) or five (RGB+NIR+LiDAR) bands. The first one we developed ourselves for soil detection using multispectral images; it is a customized structure with six layers, and details of the architecture can be found in [44]. The second one was developed by others for hyperspectral image classification; we found the open source code on Github [45]. Both deep learning methods are based on convolutional neural networks (CNN).
In our preliminary experiments, we observed that the overall accuracies are around 80% using four or five bands. However, with the EMAP-augmented bands, the deep learning results improve to close to 88%.
In our opinion, the power of deep learning can only emerge when one has a vast amount of training data. Unfortunately, in the IEEE GRSS Data Fusion Contest dataset, the training samples are far fewer than the testing samples and, consequently, the deep learning methods did not show their power in boosting the land cover classification performance. We plan to wrap up this deep learning work in the near future.

3.3. Potential of Using Object Based Approaches

In recent years, object-based classification approaches have gained popularity in remote sensing. Object-based approaches involve segmentation and classification steps, and the salt-and-pepper classification maps generated by pixel-based approaches can be avoided. Moreover, object-based approaches can incorporate geometric shapes, sizes, and spectral information into the land cover classification process. In theory, object-based approaches should therefore have better performance in land cover classification, and some researchers have concluded that object-based methods outperform pixel-based methods [46,47].
Unfortunately, the training, testing, and ground truth label data are all in pixels for both the IEEE and the Trento datasets; that is, we do not have ground truth land cover maps for either dataset. It will be a good future direction to work with the dataset owners to define and generate ground truth land cover maps for all of the land cover types, so that object-based approaches can be evaluated on those datasets.

4. Conclusions and Future Directions

In this paper, we have investigated the performance of land cover classification using only four bands (RGB+NIR) or five bands (RGB+NIR+LiDAR). Our first key observation is that the land cover classification performance using four or five bands without EMAP is not good enough, regardless of the classification algorithm. Our second key observation is that, with help from EMAP, the four or five bands can produce very good classification performance using the SVM and JSR algorithms. We also observed that LiDAR data further enhance the classification performance. Comparing our results with representative papers in the literature shows that using four or five bands with EMAP is feasible for land cover classification, as the accuracies are only a few percentage points lower than those of some of the best performing methods in the literature that utilize all of the hyperspectral bands. When computational time is taken into account, there is a further trade-off, as the best performing methods often take significantly longer to process the data. The one exception is the SVM method, which performs above average in all scenarios, is the best in all but the DS-55 case, and maintains a computational time of less than five minutes in all but the DS-4 case.
There are several future directions for our work. The first is whether one can fuse multiple classification maps to further improve the classification accuracy. For instance, we applied nine methods and each one generates a land cover classification map; it will be interesting to investigate the fusion of those nine maps via voting or Dempster–Shafer fusion algorithms. The second direction is to explore deep learning approaches for land cover classification using only the RGB+NIR bands. A third direction is to investigate which bands in the EMAP enhanced data are more useful than others.

Author Contributions

Conceptualization, C.K.; methodology, C.K., B.A., D.G., S.B., A.P.; writing—original draft preparation, C.K.; supervision, C.K.; project administration, C.K.; Writing—review & editing—S.B., A.P., M.S.; funding acquisition, C.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by US Department of Energy under grant # DE-SC0019936. The views, opinions and/or findings expressed are those of the author and should not be interpreted as representing the official views or policies of the Department of Defense or the U.S. Government.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Figure A1. ASD classification maps for the DS-4, DS-5, DS-55, DS-144, and DS-145 cases.
Figure A2. MSD classification maps for the DS-4, DS-5, DS-55, DS-144, and DS-145 cases.
Figure A3. RXD classification maps for the DS-4, DS-5, DS-55, DS-144, and DS-145 cases.
Figure A4. KASD classification maps for the DS-4, DS-5, DS-55, DS-144, and DS-145 cases.
Figure A5. KMSD classification maps for the DS-4, DS-5, DS-55, DS-144, and DS-145 cases.
Figure A6. KRXD classification maps for the DS-4, DS-5, DS-55, DS-144, and DS-145 cases.
Figure A7. SR classification maps for the DS-4, DS-5, DS-55, DS-144, and DS-145 cases.
Figure A8. JSR classification maps for the DS-4, DS-5, DS-55, DS-144, and DS-145 cases.
Figure A9. SVM classification maps for the DS-4, DS-5, DS-55, DS-144, and DS-145 cases.

References

  1. Lee, C.M.; Cable, M.L.; Hook, S.J.; Green, R.O.; Ustin, S.L.; Mandl, D.J.; Middleton, E.M. An introduction to the NASA Hyperspectral InfraRed Imager (HyspIRI) mission and preparatory activities. Remote. Sens. Environ. 2015, 167, 6–19. [Google Scholar] [CrossRef]
  2. AVIRIS. Available online: https://aviris.jpl.nasa.gov/aviris/index.html (accessed on 18 December 2019).
  3. Ayhan, B.; Kwan, C.; Jensen, J.O. Remote vapor detection and classification using hyperspectral images. In Proceedings of the Chemical, Biological, Radiological, Nuclear, and Explosives (CBRNE) Sensing XX, Bellingham, WA, USA, 25 July 2019. [Google Scholar]
  4. Harsanyi, J.C.; Chang, C.-I. Hyperspectral image classification and dimensionality reduction: An orthogonal subspace projection approach. IEEE Trans. Geosci. Remote. Sens. 1994, 32, 779–785. [Google Scholar] [CrossRef] [Green Version]
  5. Heinz, D.; Chang, C.-I. Fully constrained least squares linear spectral mixture analysis method for material quantification in hyperspectral imagery. IEEE Trans. Geosci. Remote. Sens. 2001, 39, 529–545. [Google Scholar] [CrossRef] [Green Version]
  6. Dao, M.; Kwan, C.; Ayhan, B.; Tran, T.D. Burn scar detection using cloudy MODIS images via low-rank and sparsity-based models. In Proceedings of the 2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP), Washington, DC, USA, 7–9 December 2016; pp. 177–181. [Google Scholar] [CrossRef]
  7. Veraverbeke, S.; Dennison, P.; Gitas, I.; Hulley, G.; Kalashnikova, O.; Katagis, T.; Kuai, L.; Meng, R.; Roberts, D.; Stavros, N. Hyperspectral remote sensing of fire: State-of-the-art and future perspectives. Remote. Sens. Environ. 2018, 216, 105–121. [Google Scholar] [CrossRef]
  8. Wang, W.; Li, S.; Qi, H.; Ayhan, B.; Kwan, C.; Vance, S. Identify anomaly component by sparsity and low rank. In Proceedings of the 2015 7th Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing (WHISPERS), Tokyo, Japan, 2–5 June 2015; pp. 1–4. [Google Scholar] [CrossRef]
  9. Chang, C.-I. Hyperspectral Imaging; Springer: New York, NY, USA, 2003. [Google Scholar]
  10. Li, S.; Wang, W.; Qi, H.; Ayhan, B.; Kwan, C.; Vance, S. Low-rank tensor decomposition based anomaly detection for hyperspectral imagery. In Proceedings of the 2015 IEEE International Conference on Image Processing (ICIP), Quebec City, QC, Canada, 27–30 September 2015; pp. 4525–4529. [Google Scholar]
  11. Yang, Y.; Zhang, J.; Song, S.; Liu, D. Hyperspectral Anomaly Detection via Dictionary Construction-Based Low-Rank Representation and Adaptive Weighting. Remote. Sens. 2019, 11, 192. [Google Scholar] [CrossRef] [Green Version]
  12. Qu, Y.; Guo, R.; Wang, W.; Qi, H.; Ayhan, B.; Kwan, C.; Vance, S. Anomaly detection in hyperspectral images through spectral unmixing and low rank decomposition. In Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Beijing, China, 10–15 July 2016; pp. 1855–1858. [Google Scholar] [CrossRef]
  13. Li, F.; Zhang, L.; Zhang, X.; Chen, Y.; Jiang, D.; Zhao, G.; Zhang, Y. Structured Background Modeling for Hyperspectral Anomaly Detection. Sensors 2018, 18, 3137. [Google Scholar] [CrossRef] [Green Version]
  14. Qu, Y.; Qi, H.; Ayhan, B.; Kwan, C.; Kidd, R. DOES multispectral/hyperspectral pansharpening improve the performance of anomaly detection? In Proceedings of the 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, TX, USA, 23–28 July 2017; pp. 6130–6133. [Google Scholar] [CrossRef]
  15. Kwan, C.; Ayhan, B.; Chen, G.; Wang, J.; Ji, B.; Chang, C.-I. A novel approach for spectral unmixing, classification, and concentration estimation of chemical and biological agents. IEEE Trans. Geosci. Remote. Sens. 2006, 44, 409–419. [Google Scholar] [CrossRef]
  16. Dao, M.; Kwan, C.; Koperski, K.; Marchisio, G. A joint sparsity approach to tunnel activity monitoring using high resolution satellite images. In Proceedings of the 2017 IEEE 8th Annual Ubiquitous Computing, Electronics and Mobile Communication Conference (UEMCON), New York, NY, USA, 19–21 October 2017; pp. 322–328. [Google Scholar] [CrossRef]
  17. Radke, R.; Andra, S.; Al-Kofahi, O.; Roysam, B. Image change detection algorithms: A systematic survey. IEEE Trans. Image Process. 2005, 14, 294–307. [Google Scholar] [CrossRef]
  18. Ilsever, M.; Ünsalan, C. Two-Dimensional Change Detection Methods; Springer Science and Business Media LLC: London, UK, 2012. [Google Scholar]
  19. Zhou, J.; Kwan, C.; Ayhan, B.; Eismann, M.T. A Novel Cluster Kernel RX Algorithm for Anomaly and Change Detection Using Hyperspectral Images. IEEE Trans. Geosci. Remote. Sens. 2016, 54, 6497–6504. [Google Scholar] [CrossRef]
  20. Bovolo, F.; Bruzzone, L. The Time Variable in Data Fusion: A Change Detection Perspective. IEEE Geosci. Remote. Sens. Mag. 2015, 3, 8–26. [Google Scholar] [CrossRef]
  21. Kwan, C.; Haberle, C.; Echavarren, A.; Ayhan, B.; Chou, B.; Budavari, B.; Dickenshied, S. Mars Surface Mineral Abundance Estimation Using THEMIS and TES Images. In Proceedings of the 2018 9th IEEE Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON), New York, NY, USA, 8–9 November 2018. [Google Scholar]
  22. CRISM. Available online: http://crism.jhuapl.edu/ (accessed on 18 December 2019).
  23. Khodadadzadeh, M.; Li, J.; Prasad, S.; Plaza, J. Fusion of Hyperspectral and LiDAR Remote Sensing Data Using Multiple Feature Learning. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 2015, 8, 2971–2983. [Google Scholar] [CrossRef]
  24. Liao, W.; Pižurica, A.; Bellens, R.; Gautama, S.; Philips, W. Generalized Graph-Based Fusion of Hyperspectral and LiDAR Data Using Morphological Features. IEEE Geosci. Remote. Sens. Lett. 2014, 12, 552–556. [Google Scholar] [CrossRef]
  25. Kwan, C. Remote Sensing Performance Enhancement in Hyperspectral Images. Sensors 2018, 18, 3598. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Dao, M.; Kwan, C.; Bernabe, S.; Plaza, J.; Koperski, K. A Joint Sparsity Approach to Soil Detection Using Expanded Bands of WV-2 Images. IEEE Geosci. Remote. Sens. Lett. 2019, 16, 1869–1873. [Google Scholar] [CrossRef]
  27. Chen, L.-C.; Papandreou, G.; Kokkinos, I.; Murphy, K.; Yuille, A.L. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs. IEEE Trans. Pattern Anal. Mach. Intell. 2018, 40, 834–848. [Google Scholar] [CrossRef]
  28. Badrinarayanan, V.; Kendall, A.; Cipolla, R. SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 2481–2495. [Google Scholar] [CrossRef]
  29. Zhao, H.; Shi, J.; Qi, X.; Wang, X.; Jia, J. Pyramid Scene Parsing Network. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 6230–6239. [Google Scholar]
  30. Zhou, B.; Zhao, H.; Puig, X.; Fidler, S.; Barriuso, A.; Torralba, A. Scene Parsing through ADE20K Dataset. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 5122–5130. [Google Scholar]
  31. Ayhan, B.; Kwan, C. Tree, Shrub, and Grass Classification Using Only RGB Images. Remote. Sens. 2020, 12, 1333. [Google Scholar] [CrossRef] [Green Version]
  32. Nasrabadi, N.M. Kernel-Based Spectral Matched Signal Detectors for Hyperspectral Target Detection. In Proceedings of the Computer Vision; Springer Science and Business Media LLC: London, UK, 2007; Volume 4815, pp. 67–76. [Google Scholar]
  33. Nguyen, D.; Kwan, C.; Ayhan, B. A comparative study of several supervised target detection algorithms for hyperspectral images. In Proceedings of the 2017 IEEE 8th Annual Ubiquitous Computing, Electronics and Mobile Communication Conference (UEMCON), New York, NY, USA, 19–21 October 2017; pp. 192–196. [Google Scholar] [CrossRef]
  34. Kwon, H.; Nasrabadi, N. Kernel RX-algorithm: A nonlinear anomaly detector for hyperspectral imagery. IEEE Trans. Geosci. Remote. Sens. 2005, 43, 388–397. [Google Scholar] [CrossRef]
  35. Burges, C.J. A Tutorial on Support Vector Machines for Pattern Recognition. Data Min. Knowl. Discov. 1998, 2, 121–167. [Google Scholar] [CrossRef]
  36. Qian, T.; Li, X.; Ayhan, B.; Xu, R.; Kwan, C.; Griffin, T. Application of Support Vector Machines to Vapor Detection and Classification for Environmental Monitoring of Spacecraft; Springer Science and Business Media LLC: London, UK, 2006; Volume 3973, pp. 1216–1222. [Google Scholar]
  37. Bernabé, S.; Marpu, P.; Plaza, J.; Mura, M.D.; Benediktsson, J.A. Spectral–Spatial Classification of Multispectral Images Using Kernel Feature Space Representation. IEEE Geosci. Remote. Sens. Lett. 2014, 11, 288–292. [Google Scholar] [CrossRef]
  38. Bernabé, S.; Marpu, P.; Benediktsson, J.A. Spectral unmixing of multispectral satellite images with dimensionality expansion using morphological profiles. In Proceedings of the Satellite Data Compression, Communications, and Processing VIII, San Diego, CA, USA, 12–13 August 2012. [Google Scholar] [CrossRef]
  39. Mura, M.D.; Benediktsson, J.A.; Waske, B.; Bruzzone, L. Morphological Attribute Profiles for the Analysis of Very High Resolution Images. IEEE Trans. Geosci. Remote. Sens. 2010, 48, 3747–3762. [Google Scholar] [CrossRef]
  40. Mura, M.D.; Benediktsson, J.A.; Waske, B.; Bruzzone, L. Extended profiles with morphological attribute filters for the analysis of hyperspectral data. Int. J. Remote. Sens. 2010, 31, 5975–5991. [Google Scholar] [CrossRef]
  41. Sun, W.; Du, Q. Hyperspectral Band Selection: A Review. IEEE Geosci. Remote. Sens. Mag. 2019, 7, 118–139. [Google Scholar] [CrossRef]
  42. Spectral Responses of RGB and NIR Bands. Available online: https://www.spectraldevices.com/content/multispectral-imaging-technology (accessed on 27 April 2020).
  43. Debes, C.; Merentitis, A.; Heremans, R.; Hahn, J.; Frangiadakis, N.; Van Kasteren, T.; Liao, W.; Bellens, R.; Pižurica, A.; Gautama, S.; et al. Hyperspectral and LiDAR Data Fusion: Outcome of the 2013 GRSS Data Fusion Contest. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 2014, 7, 2405–2418. [Google Scholar] [CrossRef]
  44. Lu, Y.; Perez, D.; Dao, M.; Kwan, C.; Li, J. Deep Learning with Synthetic Hyperspectral Images for Improved Soil Detection in Multispectral Imagery. In Proceedings of the 2018 9th IEEE Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON), New York, NY, USA, 8–9 November 2018; pp. 666–672. [Google Scholar]
  45. Deep-Learning-for-HSI-Classification. Available online: https://github.com/luozm/Deep-Learning-for-HSI-classification (accessed on 27 April 2020).
  46. Ma, L.; Li, M.; Ma, X.; Cheng, L.; Du, P.; Liu, Y. A review of supervised object-based land-cover image classification. ISPRS J. Photogramm. Remote. Sens. 2017, 130, 277–293. [Google Scholar] [CrossRef]
  47. Blaschke, T. Object based image analysis for remote sensing. ISPRS J. Photogramm. Remote. Sens. 2010, 65, 2–16. [Google Scholar] [CrossRef] [Green Version]
Figure 1. RGB values in tandem with the ground truth pixel values.
Figure 2. DS-44 band classification maps for (a) ASD, (b) MSD, (c) RXD, (d) KASD, (e) KMSD, (f) KRXD, (g) SR, (h) JSR, and (i) SVM.
Figure 3. Spectral responses of the RGB and near infrared (NIR) bands [42].
Table 1. Number of pixels per class in the IEEE Geoscience and Remote Sensing Society (GRSS) data. The total number of unlabeled pixels is 649,816 and the total number of pixels is 664,845.

Class | Name | Train Samples | Test Samples
1 | Healthy grass | 198 | 1053
2 | Stressed grass | 190 | 1064
3 | Synthetic grass | 192 | 505
4 | Tree | 188 | 1056
5 | Soil | 186 | 1056
6 | Water | 182 | 143
7 | Residential | 196 | 1072
8 | Commercial | 191 | 1053
9 | Road | 193 | 1059
10 | Highway | 191 | 1036
11 | Railway | 181 | 1054
12 | Parking lot 1 | 192 | 1041
13 | Parking lot 2 | 184 | 285
14 | Tennis court | 181 | 247
15 | Running track | 181 | 473
1–15 | All classes | 2832 | 12,197
Table 2. Dataset labels and the corresponding bands.

Dataset Label | Short Label | Bands Present in the Corresponding Dataset
RGBNIR | DS-4 | RGB and NIR bands (bands #60, #30, #22, and #103 in the hyperspectral data, respectively)
RGBNIR_LiDAR | DS-5 | RGB and NIR bands; LiDAR data
EMAP_RGBNIR | DS-44 | RGB and NIR bands; 40 bands obtained by EMAP augmentation applied to the RGB and NIR bands
EMAP_RGBNIR_LiDAR | DS-55 | RGB and NIR bands; LiDAR data; 50 bands obtained by EMAP augmentation applied to RGB, NIR, and LiDAR
HYPER | DS-144 | Hyperspectral data set
HYPER_LiDAR | DS-145 | Hyperspectral data set; LiDAR data
Table 3. Overall accuracy (OA) in percentage of each method and band combination. Red numbers indicate the best accuracy for each method and bold numbers indicate the best accuracy for each dataset.

OA | DS-4 | DS-5 | DS-44 | DS-55 | DS-144 | DS-145
ASD | 4.28 | 0.07 | 27.89 | 56.92 | 37.37 | 38.38
MSD | 0.11 | 4.16 | 48.65 | 67.55 | 55.56 | 55.57
RXD | 28.93 | 38.87 | 46.09 | 33.29 | 42.69 | 42.71
KASD | 6.16 | 7.99 | 79.70 | 81.28 | 53.57 | 53.26
KMSD | 26.32 | 39.15 | 69.26 | 51.40 | 53.61 | 53.10
KRXD | 5.72 | 7.82 | 64.14 | 38.53 | 71.79 | 71.79
SR | 39.99 | 42.9 | 64.4 | 70.97 | 57.46 | 57.46
JSR | 59.83 | 70.81 | 80.77 | 86.86 | 72.57 | 59.04
SVM | 70.43 | 74.62 | 82.64 | 86.00 | 78.68 | 81.76
Table 4. Average accuracy (AA) in percentage of each method and band combination. Red numbers indicate the best accuracy for each method and bold numbers indicate the best accuracy for each dataset.

AA | DS-4 | DS-5 | DS-44 | DS-55 | DS-144 | DS-145
ASD | 0.70 | 3.80 | 65.40 | 67.39 | 33.83 | 47.68
MSD | 0.34 | 2.42 | 56.39 | 72.68 | 59.45 | 58.71
RXD | 39.53 | 43.83 | 56.30 | 32.67 | 49.04 | 47.84
KASD | 6.16 | 6.80 | 83.51 | 81.43 | 50.29 | 60.92
KMSD | 41.34 | 48.98 | 68.23 | 58.22 | 65.89 | 56.06
KRXD | 6.49 | 6.65 | 78.63 | 53.17 | 75.85 | 76.21
SR | 44.24 | 46.86 | 69.82 | 74.58 | 61.72 | 61.72
JSR | 60.19 | 71.21 | 83.27 | 88.45 | 74.80 | 60.90
SVM | 70.74 | 73.12 | 85.61 | 86.48 | 81.16 | 81.04
Table 5. Kappa coefficient (κ) of each method and band combination. Italic numbers indicate the best value for each method and bold numbers indicate the best value for each dataset.

| Kappa | DS-4 | DS-5 | DS-44 | DS-55 | DS-144 | DS-145 |
|---|---|---|---|---|---|---|
| ASD | −0.02 | 0.01 | 0.21 | *0.53* | 0.33 | 0.38 |
| MSD | −0.04 | −0.01 | 0.45 | *0.65* | 0.52 | 0.56 |
| RXD | 0.23 | 0.34 | *0.42* | 0.28 | 0.38 | 0.38 |
| KASD | −0.01 | 0.001 | 0.78 | *0.80* | 0.50 | 0.496 |
| KMSD | 0.22 | 0.35 | *0.67* | 0.48 | 0.50 | 0.4912 |
| KRXD | −0.01 | 0.01 | 0.62 | 0.33 | *0.70* | *0.70* |
| SR | 0.358 | 0.390 | 0.615 | *0.685* | 0.541 | 0.541 |
| JSR | 0.567 | 0.684 | 0.791 | *0.857* | 0.704 | 0.557 |
| SVM | **0.704** | **0.750** | **0.812** | **0.859** | **0.765** | ***0.864*** |
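For completeness, the three scores reported in Tables 3–5 follow the standard definitions computed from the K × K confusion matrix C (here K = 15), where C_ij counts test pixels of class i assigned to class j and n is the total number of test pixels:

```latex
\mathrm{OA} = \frac{1}{n}\sum_{i=1}^{K} C_{ii}, \qquad
\mathrm{AA} = \frac{1}{K}\sum_{i=1}^{K}\frac{C_{ii}}{\sum_{j=1}^{K} C_{ij}}, \qquad
\kappa = \frac{\mathrm{OA} - p_e}{1 - p_e},
\quad\text{with}\quad
p_e = \frac{1}{n^{2}}\sum_{i=1}^{K}\Bigl(\sum_{j=1}^{K} C_{ij}\Bigr)\Bigl(\sum_{j=1}^{K} C_{ji}\Bigr).
```

Because κ discounts the chance-agreement term p_e, methods whose predictions are heavily skewed toward a few classes (e.g., ASD on DS-4) can score near zero or even slightly negative despite a nonzero OA.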
Table 6. Class specific accuracies (%) of Adaptive Subspace Detection (ASD), Matched Signature Detection (MSD), and Reed-Xiaoli Detection (RXD) with various band combinations.

**ASD**

| Class | DS-4 | DS-5 | DS-44 | DS-55 | DS-144 | DS-145 |
|---|---|---|---|---|---|---|
| 1-Healthy grass | 0.00 | 0.00 | 43.97 | 66.00 | 75.78 | 75.78 |
| 2-Stressed grass | 0.00 | 0.09 | 13.91 | 4.04 | 42.29 | 44.08 |
| 3-Synthetic grass | 0.00 | 0.00 | 11.09 | 99.41 | 100.0 | 100.0 |
| 4-Trees | 0.00 | 0.00 | 43.18 | 83.62 | 45.83 | 47.16 |
| 5-Soil | 0.00 | 0.19 | 21.40 | 27.75 | 27.75 | 25.57 |
| 6-Water | 0.00 | 0.00 | 0.00 | 91.61 | 89.51 | 89.51 |
| 7-Residential | 0.00 | 0.09 | 92.63 | 67.63 | 12.50 | 12.22 |
| 8-Commercial | 0.00 | 0.00 | 57.08 | 80.06 | 59.64 | 66.29 |
| 9-Road | 0.00 | 40.98 | 0.38 | 68.84 | 11.80 | 13.88 |
| 10-Highway | 0.00 | 44.02 | 6.37 | 44.31 | 5.60 | 6.66 |
| 11-Railway | 4.65 | 0.00 | 0.28 | 33.49 | 6.93 | 6.74 |
| 12-Parking lot 1 | 0.00 | 0.00 | 10.57 | 39.48 | 13.26 | 15.66 |
| 13-Parking lot 2 | 0.00 | 0.35 | 22.11 | 57.89 | 51.23 | 51.93 |
| 14-Tennis Court | 0.00 | 0.00 | 45.75 | 97.98 | 74.90 | 74.90 |
| 15-Running Track | 100.0 | 0.00 | 21.14 | 99.15 | 87.32 | 84.78 |

**MSD**

| Class | DS-4 | DS-5 | DS-44 | DS-55 | DS-144 | DS-145 |
|---|---|---|---|---|---|---|
| 1-Healthy grass | 0.00 | 0.00 | 13.01 | 55.46 | 51.28 | 51.28 |
| 2-Stressed grass | 0.00 | 0.00 | 23.50 | 43.98 | 84.68 | 84.68 |
| 3-Synthetic grass | 0.00 | 98.81 | 100.0 | 100.0 | 100.0 | 100.0 |
| 4-Trees | 0.00 | 0.00 | 78.22 | 75.38 | 54.45 | 54.45 |
| 5-Soil | 0.00 | 0.00 | 80.40 | 92.80 | 98.30 | 98.30 |
| 6-Water | 0.00 | 0.00 | 65.73 | 79.72 | 82.52 | 82.52 |
| 7-Residential | 0.00 | 0.00 | 32.93 | 73.79 | 67.44 | 67.54 |
| 8-Commercial | 0.00 | 0.00 | 20.13 | 46.06 | 49.10 | 49.10 |
| 9-Road | 0.00 | 0.19 | 29.65 | 43.63 | 39.47 | 39.47 |
| 10-Highway | 0.00 | 0.00 | 41.60 | 68.15 | 41.70 | 41.70 |
| 11-Railway | 0.38 | 0.00 | 64.23 | 94.50 | 23.06 | 23.06 |
| 12-Parking lot 1 | 0.00 | 0.00 | 42.75 | 46.78 | 9.41 | 9.41 |
| 13-Parking lot 2 | 0.00 | 2.46 | 58.25 | 58.25 | 8.42 | 8.42 |
| 14-Tennis Court | 3.64 | 0.00 | 99.60 | 98.79 | 72.06 | 72.06 |
| 15-Running Track | 0.00 | 0.00 | 90.70 | 96.19 | 98.73 | 98.73 |

**RXD**

| Class | DS-4 | DS-5 | DS-44 | DS-55 | DS-144 | DS-145 |
|---|---|---|---|---|---|---|
| 1-Healthy grass | 83.10 | 83.00 | 42.26 | 34.09 | 60.59 | 60.68 |
| 2-Stressed grass | 0.00 | 0.00 | 44.08 | 8.27 | 80.73 | 80.73 |
| 3-Synthetic grass | 99.01 | 98.81 | 100.0 | 100.0 | 82.18 | 82.18 |
| 4-Trees | 4.73 | 91.19 | 71.02 | 45.93 | 45.45 | 45.45 |
| 5-Soil | 98.77 | 98.20 | 87.88 | 58.05 | 99.62 | 99.62 |
| 6-Water | 56.64 | 56.64 | 67.83 | 78.32 | 71.33 | 71.33 |
| 7-Residential | 10.35 | 20.24 | 14.09 | 0.28 | 46.74 | 46.74 |
| 8-Commercial | 37.51 | 51.57 | 42.92 | 53.56 | 21.84 | 21.84 |
| 9-Road | 0.00 | 0.00 | 0.00 | 0.00 | 5.57 | 5.57 |
| 10-Highway | 0.00 | 0.00 | 45.17 | 1.35 | 9.07 | 9.07 |
| 11-Railway | 0.00 | 0.00 | 31.88 | 19.54 | 5.12 | 5.22 |
| 12-Parking lot 1 | 0.00 | 0.00 | 28.34 | 37.18 | 3.55 | 3.55 |
| 13-Parking lot 2 | 0.00 | 18.95 | 40.35 | 54.04 | 15.44 | 15.44 |
| 14-Tennis Court | 0.00 | 0.00 | 55.87 | 39.27 | 72.06 | 72.06 |
| 15-Running Track | 100.0 | 100.0 | 100.0 | 100.0 | 98.10 | 98.10 |
Table 7. Class specific accuracies (%) of Kernel ASD (KASD), Kernel MSD (KMSD), and Kernel RXD (KRXD) with various band combinations.

**KASD**

| Class | DS-4 | DS-5 | DS-44 | DS-55 | DS-144 | DS-145 |
|---|---|---|---|---|---|---|
| 1-Healthy grass | 4.94 | 7.31 | 80.25 | 83.00 | 80.91 | 81.01 |
| 2-Stressed grass | 4.42 | 0.66 | 71.33 | 71.52 | 61.84 | 61.94 |
| 3-Synthetic grass | 1.39 | 3.17 | 99.60 | 100.0 | 99.41 | 99.60 |
| 4-Trees | 6.72 | 2.37 | 91.76 | 91.76 | 91.86 | 92.52 |
| 5-Soil | 4.36 | 0.95 | 98.96 | 96.69 | 72.63 | 72.73 |
| 6-Water | 15.38 | 3.50 | 95.80 | 97.20 | 100.0 | 100.0 |
| 7-Residential | 0.84 | 20.71 | 91.79 | 90.67 | 61.01 | 59.70 |
| 8-Commercial | 9.31 | 1.61 | 46.53 | 73.12 | 44.92 | 43.87 |
| 9-Road | 2.93 | 11.05 | 75.64 | 80.55 | 21.15 | 21.62 |
| 10-Highway | 13.90 | 2.03 | 66.70 | 45.46 | 11.39 | 12.07 |
| 11-Railway | 6.36 | 40.61 | 76.38 | 83.97 | 11.39 | 11.86 |
| 12-Parking lot 1 | 9.61 | 1.63 | 73.68 | 74.64 | 23.25 | 22.00 |
| 13-Parking lot 2 | 5.96 | 2.11 | 71.93 | 72.28 | 65.26 | 65.61 |
| 14-Tennis Court | 9.31 | 0.81 | 100.0 | 97.17 | 90.69 | 90.69 |
| 15-Running Track | 3.59 | 1.06 | 100.0 | 99.58 | 84.78 | 78.65 |

**KMSD**

| Class | DS-4 | DS-5 | DS-44 | DS-55 | DS-144 | DS-145 |
|---|---|---|---|---|---|---|
| 1-Healthy grass | 28.02 | 45.58 | 82.91 | 65.81 | 97.25 | 97.44 |
| 2-Stressed grass | 13.44 | 33.55 | 74.25 | 48.68 | 59.02 | 59.87 |
| 3-Synthetic grass | 55.64 | 89.31 | 100.0 | 89.70 | 78.81 | 79.01 |
| 4-Trees | 47.06 | 59.38 | 91.67 | 20.08 | 80.02 | 80.40 |
| 5-Soil | 21.31 | 38.64 | 73.39 | 65.34 | 82.95 | 82.86 |
| 6-Water | 42.66 | 68.53 | 94.41 | 93.01 | 79.02 | 81.12 |
| 7-Residential | 18.38 | 31.44 | 61.47 | 29.20 | 45.62 | 45.62 |
| 8-Commercial | 24.98 | 25.17 | 37.80 | 25.07 | 20.51 | 21.18 |
| 9-Road | 21.25 | 31.73 | 55.43 | 36.64 | 39.00 | 32.29 |
| 10-Highway | 16.22 | 21.53 | 61.10 | 66.22 | 29.05 | 29.83 |
| 11-Railway | 16.89 | 22.87 | 73.62 | 67.17 | 37.10 | 39.28 |
| 12-Parking lot 1 | 21.90 | 27.76 | 54.47 | 43.71 | 24.98 | 22.00 |
| 13-Parking lot 2 | 25.61 | 33.33 | 27.72 | 16.49 | 18.60 | 13.33 |
| 14-Tennis Court | 31.17 | 53.44 | 95.14 | 99.19 | 93.52 | 93.52 |
| 15-Running Track | 63.21 | 92.18 | 98.94 | 98.10 | 63.64 | 63.21 |

**KRXD**

| Class | DS-4 | DS-5 | DS-44 | DS-55 | DS-144 | DS-145 |
|---|---|---|---|---|---|---|
| 1-Healthy grass | 3.89 | 1.33 | 29.06 | 13.20 | 80.82 | 80.82 |
| 2-Stressed grass | 10.53 | 1.50 | 73.68 | 45.39 | 80.92 | 80.92 |
| 3-Synthetic grass | 2.18 | 0.00 | 87.13 | 0.00 | 99.80 | 99.80 |
| 4-Trees | 14.87 | 2.84 | 64.77 | 0.85 | 93.66 | 93.66 |
| 5-Soil | 3.69 | 2.75 | 87.78 | 74.34 | 97.92 | 97.92 |
| 6-Water | 6.29 | 5.59 | 53.15 | 0.00 | 95.10 | 95.10 |
| 7-Residential | 4.57 | 1.49 | 64.65 | 4.29 | 79.20 | 79.20 |
| 8-Commercial | 6.93 | 35.14 | 25.26 | 53.75 | 30.48 | 30.48 |
| 9-Road | 3.97 | 35.22 | 73.75 | 71.48 | 67.52 | 67.52 |
| 10-Highway | 7.34 | 0.39 | 55.02 | 36.20 | 43.73 | 43.73 |
| 11-Railway | 2.18 | 0.76 | 72.30 | 61.76 | 63.19 | 63.19 |
| 12-Parking lot 1 | 2.59 | 5.28 | 64.17 | 29.30 | 45.44 | 45.44 |
| 13-Parking lot 2 | 5.61 | 1.75 | 76.49 | 51.58 | 66.67 | 66.67 |
| 14-Tennis Court | 8.50 | 10.12 | 93.93 | 30.77 | 100.0 | 100.0 |
| 15-Running Track | 0.42 | 0.21 | 87.95 | 76.32 | 98.73 | 98.73 |
Table 8. Class specific accuracies (%) of Sparse Representation (SR), Joint SR (JSR), and Support Vector Machine (SVM) with various band combinations.

**SR**

| Class | DS-4 | DS-5 | DS-44 | DS-55 | DS-144 | DS-145 |
|---|---|---|---|---|---|---|
| 1-Healthy grass | 80.63 | 81.39 | 81.39 | 83.10 | 82.05 | 82.05 |
| 2-Stressed grass | 7.61 | 14.00 | 53.57 | 54.42 | 78.29 | 78.29 |
| 3-Synthetic grass | 98.42 | 98.81 | 100.0 | 100.0 | 99.60 | 99.60 |
| 4-Trees | 66.95 | 66.67 | 71.69 | 90.34 | 82.01 | 82.01 |
| 5-Soil | 94.89 | 95.27 | 91.38 | 97.35 | 99.53 | 99.53 |
| 6-Water | 99.30 | 99.30 | 99.30 | 93.71 | 97.20 | 97.20 |
| 7-Residential | 53.64 | 55.97 | 71.83 | 93.10 | 71.55 | 71.55 |
| 8-Commercial | 0.47 | 0.47 | 18.04 | 38.84 | 16.24 | 16.24 |
| 9-Road | 21.06 | 35.03 | 16.53 | 80.83 | 15.49 | 15.49 |
| 10-Highway | 14.67 | 28.96 | 58.01 | 43.73 | 49.61 | 49.61 |
| 11-Railway | 4.36 | 3.98 | 94.88 | 94.12 | 26.57 | 26.57 |
| 12-Parking lot 1 | 9.13 | 3.27 | 46.30 | 1.63 | 13.64 | 13.64 |
| 13-Parking lot 2 | 0.70 | 13.33 | 45.96 | 49.12 | 23.16 | 23.16 |
| 14-Tennis Court | 11.74 | 6.48 | 98.38 | 98.38 | 70.85 | 70.85 |
| 15-Running Track | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |

**JSR**

| Class | DS-4 | DS-5 | DS-44 | DS-55 | DS-144 | DS-145 |
|---|---|---|---|---|---|---|
| 1-Healthy grass | 86.84 | 98.87 | 94.36 | 98.98 | 87.01 | 82.43 |
| 2-Stressed grass | 99.34 | 98.76 | 89.16 | 97.71 | 99.33 | 83.18 |
| 3-Synthetic grass | 92.45 | 100.0 | 100.0 | 100.0 | 100.0 | 97.23 |
| 4-Trees | 76.94 | 84.36 | 94.51 | 99.27 | 75.69 | 79.73 |
| 5-Soil | 95.50 | 97.08 | 97.05 | 98.05 | 97.48 | 99.62 |
| 6-Water | 44.94 | 45.79 | 100.0 | 96.45 | 39.81 | 68.53 |
| 7-Residential | 59.52 | 45.67 | 56.94 | 74.31 | 81.22 | 51.49 |
| 8-Commercial | 50.46 | 50.16 | 72.69 | 69.62 | 65.14 | 11.40 |
| 9-Road | 48.21 | 69.87 | 65.54 | 94.97 | 0.00 | 36.45 |
| 10-Highway | 43.64 | 54.04 | 81.89 | 84.43 | 0.00 | 48.17 |
| 11-Railway | 44.92 | 66.94 | 83.41 | 69.62 | 70.35 | 30.55 |
| 12-Parking lot 1 | 37.91 | 63.61 | 84.71 | 85.93 | 66.32 | 38.04 |
| 13-Parking lot 2 | 5.53 | 23.20 | 49.12 | 74.75 | 12.25 | 17.19 |
| 14-Tennis Court | 50.12 | 69.74 | 79.68 | 94.27 | 65.78 | 71.26 |
| 15-Running Track | 66.49 | 100.0 | 100.0 | 100.0 | 88.52 | 98.31 |

**SVM**

| Class | DS-4 | DS-5 | DS-44 | DS-55 | DS-144 | DS-145 |
|---|---|---|---|---|---|---|
| 1-Healthy grass | 82.62 | 82.53 | 81.96 | 97.33 | 82.05 | 96.76 |
| 2-Stressed grass | 83.08 | 82.24 | 81.30 | 99.89 | 82.61 | 97.55 |
| 3-Synthetic grass | 99.60 | 99.60 | 100.00 | 100.0 | 99.80 | 39.28 |
| 4-Trees | 98.58 | 99.24 | 89.39 | 99.49 | 92.80 | 97.40 |
| 5-Soil | 96.78 | 97.25 | 97.82 | 98.01 | 98.48 | 97.48 |
| 6-Water | 93.01 | 91.61 | 95.10 | 22.70 | 94.41 | 47.06 |
| 7-Residential | 82.18 | 83.21 | 89.55 | 69.20 | 76.31 | 89.93 |
| 8-Commercial | 18.23 | 52.23 | 42.26 | 85.14 | 44.82 | 83.62 |
| 9-Road | 55.90 | 73.09 | 77.62 | 92.09 | 72.80 | 85.97 |
| 10-Highway | 53.38 | 48.46 | 68.44 | 87.75 | 56.95 | 86.55 |
| 11-Railway | 56.45 | 77.99 | 92.79 | 75.69 | 78.37 | 77.91 |
| 12-Parking lot 1 | 50.62 | 32.37 | 85.21 | 91.00 | 73.49 | 77.48 |
| 13-Parking lot 2 | 33.33 | 37.54 | 74.39 | 82.29 | 67.02 | 49.40 |
| 14-Tennis Court | 98.38 | 100.00 | 100.00 | 96.86 | 100.00 | 89.17 |
| 15-Running Track | 97.04 | 97.89 | 100.00 | 99.79 | 97.46 | 100.00 |
Table 9. Elapsed time (ET) values (minutes) for different methods and datasets in the training process. Bold numbers indicate the most efficient case for each dataset.

| ET (min) | DS-4 | DS-5 | DS-44 | DS-55 | DS-144 | DS-145 |
|---|---|---|---|---|---|---|
| ASD | 0.71 | 0.73 | 0.87 | 1.00 | 2.48 | 2.21 |
| MSD | 0.75 | 0.77 | 1.21 | 1.76 | 8.74 | 8.12 |
| RXD | **0.30** | **0.31** | **0.37** | **0.45** | **0.94** | **1.02** |
| KASD | 60.60 | 64.13 | 89.71 | 81.22 | 127.42 | 147.85 |
| KMSD | 16.33 | 16.43 | 21.56 | 23.01 | 29.44 | 29.89 |
| KRXD | 32.93 | 33.34 | 54.45 | 60.23 | 87.72 | 92.74 |
| SR | 492.83 | 694.33 | 941.06 | 921.45 | 1037.99 | 1056.98 |
| JSR | 629.23 | 891.71 | 2248.17 | 2198.56 | 2210.15 | 2310.42 |
| SVM | 5.32 | 3.76 | 0.69 | 0.47 | 1.30 | 1.41 |
Table 10. Comparison of the overall accuracies (%) of several methods.

| Reference | Dataset Adopted | Algorithm Adopted | Overall Accuracy |
|---|---|---|---|
| This paper | EMAP_RGBNIR (DS-44) | JSR | 80.77 |
| This paper | EMAP_RGBNIR (DS-44) | SVM | 82.64 |
| This paper | EMAP_RGBNIR_LiDAR (DS-55) | JSR | 86.86 |
| This paper | EMAP_RGBNIR_LiDAR (DS-55) | SVM | 86.00 |
| [23] | Hyperspectral data; EMAP augmentation applied to hyperspectral data (Xh + EMAP(Xh)) | MLRsub | 84.40 |
| [23] | Hyperspectral data; additional bands from LiDAR data (Xh + AP(XL)) | MLRsub | 87.91 |
| [23] | Hyperspectral data; EMAP augmentation applied to hyperspectral data; additional bands from LiDAR data (Xh + AP(XL) + EMAP(Xh)) | MLRsub | 90.65 |
| [24] | Hyperspectral data | SVM | 80.72 |
| [24] | Morphological profile of hyperspectral and LiDAR data (MPSHSLi) | SVM | 86.39 |
| [24] | Generalized graph-based fusion features from hyperspectral and LiDAR data (GGF) | SVM | 94 |
Table 11. Comparison of SVM classification results using narrow bands and wide bands. "Narrow" columns use narrow bands taken directly from the HS data; "Wide" columns use wide RGB and NIR bands based on the spectral responses in Figure 3. Bold numbers indicate the better value when comparing the two band types with the same number of bands.

| | Narrow DS-4 | Narrow DS-5 | Narrow DS-44 | Narrow DS-55 | Wide DS-4 | Wide DS-5 | Wide DS-44 | Wide DS-55 |
|---|---|---|---|---|---|---|---|---|
| OA (%) | 69.99 | **75.32** | 81.31 | 85.72 | **70.43** | 74.62 | **82.64** | **86.00** |
| AA (%) | **70.97** | **73.48** | 84.37 | **87.36** | 70.74 | 73.12 | **85.61** | 86.48 |
| κ | 0.677 | 0.733 | 0.799 | 0.846 | **0.704** | **0.750** | **0.812** | **0.859** |
| 1-Healthy grass | **82.72** | 82.43 | **82.43** | 83.10 | 82.62 | **82.53** | 81.96 | **97.33** |
| 2-Stressed grass | **83.65** | 82.14 | 80.55 | 80.92 | 83.08 | **82.24** | **81.30** | **99.89** |
| 3-Synthetic grass | 99.60 | 99.60 | 100.00 | 100.00 | 99.60 | 99.60 | 100.00 | 100.00 |
| 4-Trees | 95.36 | 99.15 | **92.61** | 96.78 | **98.58** | **99.24** | 89.39 | **99.49** |
| 5-Soil | **96.97** | **97.35** | **98.86** | 97.73 | 96.78 | 97.25 | 97.82 | **98.01** |
| 6-Water | 93.01 | **95.10** | 95.10 | **95.10** | 93.01 | 91.61 | 95.10 | 22.70 |
| 7-Residential | 78.45 | 81.62 | 80.69 | **84.42** | **82.18** | **83.21** | **89.55** | 69.20 |
| 8-Commercial | 16.81 | 51.28 | **44.92** | 71.60 | **18.23** | **52.23** | 42.26 | **85.14** |
| 9-Road | 51.46 | 63.46 | **81.78** | 89.99 | **55.90** | **73.09** | 77.62 | **92.09** |
| 10-Highway | **54.15** | **48.75** | 65.06 | 65.35 | 53.38 | 48.46 | **68.44** | **87.75** |
| 11-Railway | **59.30** | 77.42 | 73.24 | **88.33** | 56.45 | **77.99** | **92.79** | 75.69 |
| 12-Parking lot 1 | **55.04** | **48.41** | **90.87** | 82.61 | 50.62 | 32.37 | 85.21 | **91.00** |
| 13-Parking lot 2 | 29.82 | 36.84 | **74.74** | 78.60 | **33.33** | **37.54** | 74.39 | **82.29** |
| 14-Tennis Court | 97.98 | 100.00 | 100.00 | **100.00** | **98.38** | 100.00 | 100.00 | 96.86 |
| 15-Running Track | **97.25** | **98.73** | 100.00 | **100.00** | 97.04 | 97.89 | 100.00 | 99.79 |
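Table 11 contrasts picking four narrow HS bands directly with synthesizing wide RGB and NIR bands weighted by camera spectral responses such as those in Figure 3. The following is a minimal Python sketch of one plausible weighted-average synthesis, assuming a `response` matrix sampled at the 144 band centers; the variable names are illustrative and not taken from the paper's code.

```python
import numpy as np

def synthesize_wide_bands(hyper, response):
    """Average narrow HS bands into wide RGB/NIR bands.

    hyper    : (H, W, 144) hyperspectral cube.
    response : (4, 144) relative spectral responses of the R, G, B, and
               NIR channels sampled at the 144 band centers (assumed to
               be available, e.g., digitized from curves like Figure 3).
    """
    # Normalize each response curve so each wide band is a weighted
    # average of narrow bands rather than an unscaled sum.
    weights = response / response.sum(axis=1, keepdims=True)
    # Collapse the spectral axis: (H, W, 144) x (144, 4) -> (H, W, 4).
    return np.tensordot(hyper, weights.T, axes=([2], [0]))
```

Normalizing each response row keeps every synthesized wide band on the same scale as the narrow bands it averages, so the classifier inputs remain comparable across the two band types.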
