Fine Land-Cover Mapping in China Using Landsat Datacube and an Operational SPECLib-Based Approach

Xiao Zhang; Liangyun Liu; Xidong Chen; Shuai Xie; Yuan Gao

doi:10.3390/rs11091056

,

and

¹

Key Laboratory of Remote Sensing Science, Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences, Beijing 100094, China

²

University of Chinese Academy of Sciences, Beijing 100049, China

³

College of Geomatics, Xi’an University of Science and Technology, Xi’an 710054, China

^*

Author to whom correspondence should be addressed.

Remote Sens.2019, 11(9), 1056;https://doi.org/10.3390/rs11091056

This article belongs to the Section Remote Sensing Image Processing

Version Notes

Order Reprints

Abstract

Fine resolution land cover information is a vital foundation of Earth science. In this paper, a novel SPECLib-based operational method is presented for the classification of multi-temporal Landsat imagery using reflectance spectra from the spatial-temporal spectral library (SPECLib) for 30 m land-cover mapping for the whole of China. Firstly, using the European Space Agency (ESA) Climate Change Initiative Global Land Cover (CCI_LC) product and the MODIS Version 6 Nadir bidirectional reflectance distribution function adjusted reflectance (NBAR) product (MCD43A4), a global SPECLib with a spatial resolution of 158.85 km (equivalent to 1.43° at the equator) and a temporal resolution of eight days was developed in the sinusoidal projection. Then, the Landsat datacube covering the whole of China was developed using all available observations of Landsat OLI imagery in 2015. Thirdly, the multi-temporal random forest method based on SPECLib was presented to produce an annual land-cover map with 22 land-cover types using the Landsat datacube. Finally, the annual China land-cover map was validated by two different validation systems using approximately 11,000 interpretation points. The mapping results achieved the overall accuracy of 71.3% and 80.7% and the kappa coefficient of 0.664 and 0.757 for the level-2 validation system (19 land-cover types) and the level-1 validation system (nine land-cover types), respectively. Therefore, the case study in China indicates that the proposed SPECLib method is an operational and accurate method for regional/global fine land-cover mapping at a spatial resolution of 30 m.

Keywords:

spatial-temporal spectral library; multi-temporal; fine classification; land-cover mapping; random forest; Landsat OLI; large area

1. Introduction

Land cover products are fundamental to many applications in environmental monitoring, land management, and global change studies [,]. They are also an important input to climate change modeling, greenhouse gas inventories, and biodiversity conservation planning []. The characterization of land cover has become a discipline with a central influence on many research fields [,].

Many efforts have been paid on mapping land cover at various spatial resolutions. However, due to differences in spatial resolution, classification schemes, thematic detail, and classification accuracy, the land cover datasets are barely good enough to meet the needs of various user communities [,,,,,,]. Recently, the advent of free medium-resolution satellite data (e.g., Landsat and Sentinel-2), combined with rapidly-increasing data-storage and computation capabilities, has greatly enhanced the ability to produce regional/global fine spatial resolution land-cover products []. FROM-GLC and GlobeLand30 are two representative global 30-m land-cover products that use Landsat imagery [,]. Although the resolution of these two products meets user requirements, there are still some challenges that need to be overcome for many specific applications at the regional/global scale []. For example, the GlobeLand30 classification system is simpler, and the interpretation of training samples in FROM_GLC is time-consuming.

Various approaches have been presented for land-cover mapping based on single-date or multi-temporal satellite images [,,]. The multi-temporal classification approaches have been demonstrated to have better performance than the single-date approaches [,]. The temporal variability of land surfaces can be captured with repeated observations over time and thus time-series of remote sensing images have been successfully applied to discriminate between land-cover types [,,]. For example, Senf et al. [] used multi-temporal Landsat and MODIS imagery to separate croplands from savannah, and Belgiu and Csillik [] successfully classified various croplands from Sentinel-2 time series. Yamazaki et al. [] developed a global inland water-body map using multi-temporal Landsat images. Dennison and Roberts [] explored the potential of vegetation phenology for tree-species mapping. Therefore, temporal information is very useful in land-cover mapping [].

Another critical issue in land-cover mapping is the classification strategy used. The supervised classification methods are generally considered superior to the unsupervised ones for large-scale land-cover mapping [] but they involve the collection of sufficient and accurate training samples, which is especially challenging for global land-cover mapping [,,]. Alternative and effective options include: using the spectral signature generalization or extension methods [,,,,] and deriving training samples from existing land cover products [,]. However, Olthof et al. [] analyzed the influences of temporal difference and spatial distance on the spectral generalization methods, and found a noticeably negative correlation between the mapping accuracy and the temporal and spatial differences, which indicates that the spectral signature generalization method might be more suitable at the regional scale than for large-areas []. As for the second option, if the land-cover product and the corresponding remote sensing data are available, the training samples’ spectra can be automatically extracted. However, the accuracy of these can be directly affected by the mapping accuracy of the land-cover product and the quality of the satellite data []. Recently, Zhang et al. [] proposed a novel approach to building a global priori spatial-temporal spectral library (SPECLib) by using the GlobCover2009 and the eight-day composite MODIS surface reflectance product (MOD09A1) products, which was able to overcome the spatial and temporal limitations of the signature generalization method.

Furthermore, the specific techniques used in classifiers can be divided into parametric and non-parametric classifiers depending on whether or not parameters are needed []. The use of parametric classifiers usually involves there being several limitations on the training samples: specifically, they need to meet the probability sampling design criteria and capture all the relevant spectral heterogeneity within and between land covers [,]. In contrast, non-parametric supervised classifiers, such as the artificial neural network (ANN) [], the classification and regression tree (CART) [], the support vector machine (SVM) [], and the random forest (RF) [] classifiers, get rid of the constraints of frequency distribution, so they have been widely used in land-cover mapping [,]. Random forest classifier is an ensemble of decision-tree for classification task. Compared with the decision-tree classifiers, where each tree is created using the best split among the training variables, the RF classifier uses a resampling technique to randomly select a subset of training variables and samples for splitting each tree node, so it is less sensitive to the over-fitting problem [,]. Compared with other widely used non-parametric classifiers, the RF classifier has been demonstrated to have higher classification accuracy, better performance for high dimensional data and be more robust to noise and feature selection [,,].

The aim of this research was to develop an operational SPECLib-based multi-temporal land-cover mapping method for a fine classification system with a global resolution of 30 m. In order to achieve this goal, a global SPECLib, defined every 158.85 km (equivalent to 1.43° at the equator) with a temporal resolution of eight days was first developed using the MCD43A4 NBAR (Nadir bidirectional reflectance distribution function adjusted reflectance) product and the CCI_LC (European Space Agency Climate Change Initiative Global Land Cover) land-cover product for 2015. Then, using the reflectance spectra in SPECLib, a multi-temporal classification method based on the stacked random forest was proposed to produce an annual land-cover map with 22 land-cover types. Thirdly, the Landsat datacube covering the whole of China was developed using all available observations of Landsat OLI imagery in 2015. Finally, the SPECLib-based multi-temporal land-cover mapping method was tested using the above Landsat datacube to produce China’s 2015 annual land-cover map. The validation results indicated that the proposed method was effective and accurate for large-area land-cover mapping.

2. Datasets and Preprocessing

2.1. Landsat Imagery and Datacube

More than 7000 Landsat OLI images covering China and acquired in 2015 were downloaded from the USGS (United States Geological Survey) and RADI (Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences). These images had a spatial resolution of 30 m and a cloud coverage of less than 60%. In this study, all images were radiometrically corrected to surface reflectance (SR) using the C-correction topographical correction method [,,] and the Landsat Surface Reflectance Code (LaSRC) atmospheric correction method [,,].

2.1.1. Reprojection and Tiling

Due to the poleward convergence of Landsat orbits, the across-track scene overlap for adjacent Landsat eight orbits increases significantly from a minimum of 0.105° at the equator to 12.5° at a latitude of 80° []. Therefore, using a regular grid of points in the universal transverse mercator (UTM) or geographic projection might be inappropriate for large-area applications because the land surface recorded by the sensor is sampled with the different spatial grid density at different latitudes []. In order to increase the chance of having more frequent surface observations, the global sinusoidal equal area projection grid (GSinGrid) defined every 5.560 km (equivalent to 0.05° at the equator) was used so that the grid sampling density did not vary by geographic region [].

As there are about 305,000 GSinGrid tiles covering the whole of China (Figure 1), monitoring and analyzing all of the reprojected grid data separately would have been inefficient and complicated. In order to facilitate subsequent handling and application, each GSinGrid tile was dealt with as the minimum unit of the three-level land-tile. Specifically, the level-1 land-tile was defined as being the 10° × 10° MODIS land tile and the level-2 land-tile was defined as the global Web-enabled Landsat Data (GWELD) land tile because the size of GWELD tile (158.85km) is almost the same as the 185-km swaths in the Landsat scene []. There were 7 × 7 GWELD tiles within each MODIS land tile (Figure 1). The final level, the level-3 land-tile, was the GSinGrid tile, each tile being composed of 186×186 30-m pixels. Each GWELD tile contained 29 × 29 GSinGrid tiles (Figure 1). In order to facilitate the application and monitoring of the data like GWELD tile in Zhang and Roy [], each GSinGrid tile locations were recorded in the filenames, these were given as ‘hh(x0)vv(y0)h(x1)v(y1)_p(x2)r(y2)’, where x0 and y0 are the horizontal and vertical coordinates for MODIS land tile, x1 and y1, ranged from 0 to 6, are the horizontal and vertical coordinates of GWELD tile, and x2 and y2, ranged from 0 to 28, are the horizontal and vertical coordinates of GSinGrid tile. Moreover, it should be noted that the coordinate origin of the lower level tile, GWELD, and GSinGrid tile, corresponds to the left-upper corner of the upper-level tile, MODIS and GWELD tile, respectively.

Figure 1. The three-level GSinGrid tiles used to store the multi-temporal images of China. The study area was hierarchically tiled into 19 standard MODIS level-1 land tiles (red), approximately 550 GWELD level-2 tiles (cyan) and 305,000 GSinGrid tiles (yellow), respectively.

To store and retrieve the GSinGrid tile more efficiently and flexibly, a simplified database which was similar to the Australian Geoscience Data Cube (AGDM) [], called Landsat datacube, was designed to manage these massive GSinGrid tile data. In this study, all 7,059 Landsat SR images were processed into the GSinGrid tiles using the aforementioned method and then stored in the datacube. Figure 2 displays the locations and frequency distributions of the Landsat WRS-2 scenes and corresponding GSinGrid tiles in the datacube. The comparison illustrates that the tiling process significantly increased the data availability especially in the overlap regions between two orbits.

Figure 2. The locations and frequency distributions of Landsat WRS-2 scenes (shown in the geographical projection) and corresponding GSinGrid tiles (shown in the sinusoidal projection) (a,b).

2.1.2. Cloud and Shadow Detection and Filling

As clouds and cloud shadows obscure the spectral characteristics of ground surfaces and significantly influence the availability of useful remote sensing data, identification and filling of cloud- and shadow-contaminated pixels was necessary. First, all cloud and shadow pixels were identified using the Fmask algorithm, which has previously been demonstrated to have higher accuracy and more stability than traditional single-date detection methods [,]. Next, a modified neighborhood similar pixel interpolator approach [] was used to restore the cloud- and shadow-contaminated pixels in the GSinGrid tile data. As the accuracy of the restored images slightly decreased as the amount of cloud increased [], to ensure the quality of the restored images, the image was discarded if the percentage of cloud and shadow pixels exceeded 30%.

2.2. Validation Dataset

The validation dataset used in this study was collected by visual interpretation of high-resolution Google Earth images. Due to the easy access to high-resolution images, Google Earth has great advantages in the collection of ground truth samples [,]. In addition, several auxiliary datasets, including the existing land cover maps (GlobeLand30 [], FROM_GLC from 2015 [], ESA CCI_LC [] and MCD12Q1 []), as well as topographical data (digital elevation model and slope data) and the normalized difference vegetation index (NDVI) seasonality product [], were collected to assist the interpretation of each validation sample.

The validation dataset contained approximately 11,000 points and included nine level-1 land-cover types (cropland, forest, shrubland, grassland, wetland, water body, impervious, bare area, and snow/ice) and 19 level-2 land-cover types (Table 1), for the nominal year 2015. In this study, as some forest validation samples—closed broadleaf/needle-leaved forest and open broadleaf/needle-leaved forest—were difficult to distinguish in the visual interpretation process, the classification system containing 22 land-cover types was simplified to the two-level validation systems (Table 1). Figure 3 illustrates the spatial distribution of all validation samples. As the locations of all samples were randomly generated, the proportions of bareland and grassland samples are larger than those of the other land-cover types. Furthermore, to minimize the subjective influence of interpretation by experts, the validation samples were independently interpreted by four different scientists.

Table 1. The classification system used in the study and the corresponding two-level validation systems.

Figure 3. The spatial distributions of the validation samples.

3. Methods

3.1. The Spatial-Temporal Spectral Library

As explained in our previous work [], the spatial-temporal spectral library was developed to store the reflectance spectra of different land-covers types within each geographic grid with a temporal resolution of eight days for global land areas. It makes full use of the spectral consistency between Landsat and MODIS data [] and also the high classification accuracy and detailed classification scheme of GlobCover2009 []. However, in contrast to the previous version of SPECLib developed using the MOD09A1 reflectance product and the GlobCover2009 land-cover product, the current SPECLib has been updated using the MCD43A4 NBAR product and the ESA CCI_LC land-cover product. The use of these updated products (MCD43A4 and CCI_LC) leads to the following changes and advantages for the current SPECLib version.

First, as the Landsat images processed into the GSinGrid tiles shared the same sinusoidal projection as MCD43A4, and the size of level-2 (GWELD) land tiles (158.85km) was similar to that of the 185 km Landsat swaths, the SPECLib cells were defined to have a size of 158.85 km × 158.85 km (equivalent to 1.43° at the equator) on a sinusoidal projection. If the cells had been larger (as for the level-1 10° × 10° MODIS land product tiles), the spectral signature extension performance would have been significantly poorer []. Smaller cells (as for the level-3 0.05° × 0.05° GSinGrid tiles) would have meant that most SPECLib cells had no spectral reflectance because the spatial extent of GSinGrid covered approximately 11 × 11 500-m MCD43A4 pixels when collecting spectral uniform points. To facilitate the spatial matching between the GSinGrid SR with that in the SPECLib, the name of each SPECLib cell also included the tile information as ‘hh(x0)vv(y0)h(x1)v(y1)’. Further, the MCD43A4 has an overlapping, eight-day processing cycle [], therefore, the SPECLib was developed to have a temporal resolution of eight days. This resulted in there being 46 epochs per year for each 158.85 km × 158.85 km cell and, for each GSinGrid tile, a spectral library that matched temporally and spatially and that had a phenological difference of less than five days could be found.

Secondly, at the stage of where spectrally uniform points within each 158.85 km × 158.85 km cell were collected, the uniform points were still determined based on the variance of a 3 × 3 local window using spectral thresholds of [0.03, 0.03, 0.03, 0.06, 0.03, and 0.03] for the six MODIS bands (blue, green, red, NIR, SWIR1, and SWIR2) []. Since the view-angle effects had already been removed from the directional reflectances of MOD09A1, this resulted in a more stable and consistent product (MCD43A4), and more candidate points for spectral uniformity were selected after removing the constraint of view angle between Landsat and MODIS.

Finally, compared with the GlobeCover2009 land-cover product [], the CCI_LC has a higher classification accuracy, more detailed classification scheme and better stability of the land-cover maps from year to year [,]. Therefore, the classification system (Table 1) adopted in this study contained a fine classification scheme that included 22 land-cover types after the removal of several mosaic land-cover types (these included mosaic cropland and natural vegetation, and mosaic tree and shrub and herbaceous cover). The CCI_LC is provided with a geographical projection while the SPECLib and MCD43A4 are provided with a sinusoidal projection. The global CCI_LC map for 2015 was first reprojected to the sinusoidal projection and further tiled into the 10° × 10° MODIS land product tiles. Since the spatial resolution of the MCD43A4 (500 m) data was roughly two times that of the CCI_LC (300 m) and the uniform points were determined in 3 × 3 local window, and the CCI_LC achieved higher accuracy over homogeneous areas [], the larger window of 4 × 4 instead of 2 × 2 was used to determine the land-cover types for these uniform points, namely, the boundary of each uniform point collected in MCD43A4 would be composed of 4 × 4 CCI_LC pixels. In order to ensure the reliability of the reflectance spectra in SPECLib, only those spectrally uniform points that were further identified as homogeneous in the CCI_LC were retained. In other words, if the maximum frequency of dominant land-cover types was less than 14 in the 4 × 4 local window, the point was excluded from the SPECLib.

3.2. Normalization of the SPECLib Reflectance Spectra

Although there was a great deal of consistency between the Landsat SR and MCD43A4 NBAR products [], the minor spectral differences caused by the differences in the acquisition date, the quantitative processing, and the spectral response function, also had to be considered and minimized using relative radiometric normalization. The SPECLib was defined for each level-2 land tile using MCD43A4 and had the same projection as the level-2 land-tile Landsat SR (Section 2.1). To remove the resolution differences between the two sensors, the Landsat SR data in the level-2 land-tiles were further aggregated to the MODIS resolution by averaging the Landsat values. The homogeneous points selected by the spectral thresholds [0.03, 0.03, 0.03, 0.06, 0.03, and 0.03] for the six spectral bands (see also [] and []) in both types of imagery were used to build the linear regression defined as:

ρ_{O L I} (λ_{O L I}) = α \times ρ_{M O D I S} (λ_{M O D I S}) + β

(1)

where the coefficients α and β describe the radiometric differences between the two sensors in the six spectral bands: the closer the slope to the 1:1 line, the better the spectral consistency. Therefore, all the reflectance spectra in SPECLib were normalized using the above formula and then used to train the multi-temporal classification method.

3.3. Multi-Temporal Classification Method Based on SPECLib

The RF classifier can handle high data dimensionality and multicollinearity. It is also fast, insensitive to overfitting, less sensitive to noise and more efficient and accurate than other non-parametric classifiers []. Therefore, the RF was selected for use in our operational SPECLib-based multi-temporal land-cover mapping method. To generate the annual land-cover map using the intra-annual multi-temporal data, a stacked random forest was developed and adopted (Figure 4). The stacked random forest usually consists of two steps: the independent RF classifier (base classifier) was first generated using the training data from each time/date. Individual outputs from all the independent classifiers were then used as input features to train the second classifier.

Figure 4. The flowchart of the multi-temporal classification method.

3.3.1. Training the Base Classifier

For a given GSinGrid tile, there were multiple epochs (Nepoch) Landsat SR data (the temporal frequencies are shown in Figure 2). According to the spatial location (hh(x0)vv(y0)h(x1)v(y1)_p(x2)r(y2)) and day of year (DOY) corresponding to these SR data, the Nepoch spectral libraries that were temporally closest (phenological difference less than five days) could be found in the SPECLib which was defined for each GWELD tile (hh(x0)vv(y0)h(x1)v(y1)) and the temporal resolution of eight days. It should be noted that all GSinGrid tiles belonging to the same GWELD tile shared the same reflectance spectra because the SPECLib was defined at the GWELD scale. In addition, to ensure classification consistency across these spatial-neighbor GWELD tiles and to avoid losing reflectance spectra for broken land-cover types caused by the coarse resolution of MCD43A4 and CCI_LC, the adjacent 3 × 3 GWELD tiles from SPECLib were imported to train the random forest for classifying the central tile. After importing adjacent training spectra, the total number of training spectra for each GWELD tile was more than 7200.

Spectral indexes have been demonstrated effective to improve classification accuracy for land-cover types with similar reflectance spectra []: for example, the NDVI has been used for vegetation separation [], the normalized difference water index (NDWI) for inland water detection [] and the normalized difference built-up index (NDBI) for building detection []. In this study, including the six spectral reflectance bands, NDVI, NDWI, and NDBI, there was a total of nine spectral features for each training sample. In addition, as altitude information is helpful in land-cover classification [], especially for vegetation-related land-cover types [], the topographical elevations from the Shuttle Radar Topography Mission (SRTM) [] were also included in the training of the RF classifier. For example, Jin et al. [] found that the vegetation distribution in the Qilian mountains exhibited an obvious vertical gradient and that the vegetation types from low to high altitude progressed from desert-grassland vegetation through dry shrub-grassland vegetation, mountain forest-grassland vegetation, sub-alpine shrub-grassland vegetation, and cold-desert alpine meadow vegetation.

The RF has only two main parameters: the number of selected prediction variables (Mtry) and the number of classification trees (Ntree) []. Du et al. [] investigated the influence of Ntree on the classification accuracy of the RF classifier (from 10 to 200 trees at 10 intervals) and found no significant relationship between them. In this study, a value of 100 for Ntree was, therefore, selected for each base RF classifier. For the Mtry parameter, Gislason et al. [] found that the classification accuracy was insensitive to the parameter, so the default value of the square root of the total number of training features was used [,]. In this paper, there were ten training features used, including nine spectral features and one elevation channel, so the default value of three for Mtry was selected.

3.3.2. Stacking of Base Classifiers

After training each base classifier using single-date training data, there were Nepoch independent RF classifiers. As Healey et al. [] explained, how to fuse these single-date land-cover results derived from corresponding classifiers to produce an annual land-cover map can be divided into two main types: one type uses the simple algebraic operators, such as mean or majority voting. The other uses a secondary classification model to reweight the output of these classifiers according to their performance against similar cases in the reference data, namely, using the outputs from these single-date classifiers as input to train the secondary model []. The final decision came from outputs of the secondary model (or stacking classifier). Compared with the simple combination, the stacking (the second solution) strategy has been proven to give better prediction accuracy [,,].

The stacked random forest classifier was trained using the outputs of all the independent base classifiers (posterior probabilities and corresponding land-cover type). Moreover, whether the training data needed to participate in the training of the stacked RF classifier depended on the total number of input features of posterior probabilities and land-cover types because it was possible that the problem of high dimensionality could be amplified when the number of training samples was relatively small, thus resulting in an increase of classification error []. Our solution was to train two stacked random forest models, one of which added training data and the other that did not: the model that achieved the higher accuracy was adopted to predict land-cover types for the GSinGrid tile data. Lastly, for the two main parameters (Ntree and Mtry) default values of 500 and the square root of the total number of input features [], respectively, were used.

3.3.3. Rule-Based Verification

Although the temporal information and topographical variables were considered, spectral similarities, including that between cropland and grassland or water bodies and terrain shadow, together with errors in cloud and shadow detection and filling (described in Section 2.1.2) led to the misclassification of a small number of pixels. For each pixel, if the pixel’s land-cover type were to be changed, the new land-cover type would be determined by the corresponding probabilities estimated by the stacked random forest classifier. Specifically, pixels classified as water body and cropland were further checked using slope thresholds of 10° and 20°, respectively [], and classification as permanent snow and ice was restrained using the normalized difference snow index (NDSI) threshold of 0.4 [,].

3.4. Accuracy Assessment

Although the classification scheme included 22 land-cover types, the validation system needed to be further simplified because the confidence in the validation samples could not be guaranteed for more detailed land-cover types, for example, closed broadleaf/needle-leaved forest and open broadleaf/needle-leaved forest were combined into broadleaf/needle-leaf forest. In this study, the validation system was split into two parts (Table 1): a level-1 validation system containing nine land-cover types and a level-2 validation system containing 19 land-cover types. Specifically, the level-2 validation system was derived from the classification system after merging these similar land-cover types according to the CCI_LC validation scheme []. The level-1 validation system was inherited from GlobeLand30 [] and FROM_GLC [], and the merging strategy integrated the works of Yang et al. [] and Defourny et al. [].

There are many metrics available to assess the performance of a classifier in the literature [,]. In this study, the classification error matrixes [] were developed for the two independent validation systems (Table 1). The derived parameters in the error matrix—the producer accuracy (P.A.) (measure of omission), user accuracy (U.A.) (measure of commission), overall accuracy (O.A.) and kappa coefficient []—were used to assess the performance of the classifier.

4. Results and Validation

Figure 5 illustrates the 30-m annual land-cover map for 2015 derived using the multi-temporal random forest method for all 305,000 GSinGrid tiles covering the whole of China. To make the land-cover map more intuitive, the sinusoidal projection has been transformed into a geographical projection. It can be seen that bare areas, grassland, cropland, and forest are among the most abundant land-cover types, a result that is consistent with the true spatial patterns of land-cover types in China. Specifically, the mosaic effect widely existing in many large-area land-cover maps using a supervised classification approach was very slight in our annual land-cover map in Figure 5.

Figure 5. The annual land cover map in 2015 derived from multi-temporal Landsat imagery over all 305,000 GSinGrid tiles using Landsat datacube.

In addition, Table 2 and Table 3 include quantitative assessments of the annual land-cover map obtained using different validation systems (see Table 1) and 11,232 validation samples. Table 2 summarizes the accuracy metrics for 19 different land-cover types. Overall, the proposed method achieved a kappa coefficient of 0.664 and an overall accuracy of 71.3%. Specifically, among these land-cover types, water body had the best accuracies (94.1% for the user’s accuracy and 90.9% for the producer’s accuracy) because inland water had the most distinctive spectral curve. This was followed, in terms of accuracy, by bare areas, snow and ice, grassland, and deciduous needle-leaved forest. For the impervious, wetland and shrubland cover types, the performance was relatively poor because these land-cover types were more complex and had more variable spectral and temporal features. For example, the impervious type (accuracy approximately only 50.7%) primarily consisted of asphalt, concrete, sand and stone, brick and glass, and could be divided into high-reflectance (airports and greenhouses), medium-reflectance (urban residential areas) and low-reflectance (rural cottages) sub-types []. Mixed-leaf forest had a very low accuracy with an 8.1% user’s accuracy because of errors related to confusion between this land-cover type and other forest types—the CCI_LC user guide also gives a low user’s accuracy of 0.051 for mixed forest []. In addition, there was serious misclassification of land-cover types with similar spectral and phonological features: for example, rainfed cropland and irrigated cropland, evergreen broadleaved forest and evergreen needle-leaved forest, and grassland and bare areas. As these errors were mainly caused by the high degree of similarity between these land-cover types, it was concluded that some confusion between similar land-cover types was inevitable.

Table 2. Confusion matrix for the annual land cover map using the level-2 validation system.

Table 3. Confusion matrix for the annual land cover map using the level-1 validation system.

Finally, for a more comprehensive evaluation of the classification accuracy of the proposed method, the confusion matrix obtained using the level-1 validation system is shown in Table 3. After combining similar land-cover types, the classification accuracy clearly improved, giving a kappa coefficient of 0.757 and an overall accuracy of 80.7%. In fact, the major difference between the level-1 and level-2 validation systems was mainly concentrated in the cropland and forest types. Correspondingly, the average producer’s accuracy for cropland and forest improved from the 0.611 to 0.768 and from 0.625 to the 0.909, respectively. This sharp increase also indicates the degree of confusion between these similar sub-types.

5. Discussion

5.1. Influence of the Temporal Frequency

Due to the contamination by cloud and cloud shadow, the temporal frequency of Landsat SR varied greatly by geographical location (Figure 2). As the temporal information contributed a lot to the land-cover mapping, the relationship between the temporal frequency and the classification accuracy was analyzed using the validation samples described in Section 2.2. The analysis in Figure 6 shows that the classification accuracy first increases and then slowly decreases with increasing temporal frequency. In addition, it was found that a temporal frequency around 21 gave the highest overall accuracy.

Figure 6. The relationship between the temporal frequency of the Landsat SR and the overall accuracy, as found using the validation samples.

A similar conclusion can also be found in the work of Karakizi et al. []: a higher classification accuracy was achieved when only some of the temporal-spectral features were used to train the classifier because a greater number of temporal features can easily lead to the Hughes phenomenon when the number of training samples is fixed []. Therefore, the further research work would combine feature-space optimization algorithms, such as wrapper methods, embedded methods, and filter methods [,], with the RF classification model to achieve the best classification accuracy for areas with a high temporal frequency. As for the areas with temporally sparse data, such as the cloudy regions of the southwest of China, according to Figure 2, the temporal frequency was usually less than five because of cloud contamination. Our future work will, therefore, combine other medium-resolution satellite data, such as Landsat 7 [,] and Sentinel-2 [] imagery, to build high temporal frequency datacube for accurate land-cover mapping in cloudy regions.

5.2. Consistency between MCD43A4 and Landsat SR for Land-Cover Mapping

As the reflectance spectra in the SPECLib came from the MCD43A4 NBAR product and were further used to classify the Landsat imagery, the radiometric consistency between the two SR products needed to be analyzed. As illustrated in Figure 7, the consistency between the SR data retrieved from Landsat OLI and the MODIS NBAR product is strong in every spectral band and gives an average coefficient of determination of 0.865 and root mean square error (RMSE) of 0.017. As for the fitted coefficients (slope and bias) of the fitted line, the shorter wavelength bands are closer to the 1:1 line than the longer wavelength bands (SWIR1 and SWIR2). The main cause of the reflectance discrepancy is that the two MODIS SWIR bands are spectrally narrower than the corresponding Landsat OLI bands []. Therefore, after the linear radiometric normalization for the reflectance spectra in SPECLib, the radiometric discrepancy between the two sensors could be minimized as much as possible.

Figure 7. Density plots for Landsat images (y-axis) in land tile hh26vv05h1v2 land-tile (see Section 2.1) acquired on 13 July 2015 against the MODIS 16-day NBAR (x-axis). The black dashed line is the 1:1 line and the solid line is the fitted line.

Similarly, Liu et al. [] used the normalized reflectance spectra from the MOD09A1 SR product to classify time-series Landsat images at a different year in Maduo county, Qinghai province, China, and achieved the average overall accuracy of 90.17%, which is even slightly higher than that of traditional scene-by-scene supervised classification. Therefore, the normalized reflectance spectra in SPECLib can be considered as suitable for land-cover mapping using Landsat imagery.

5.3. Limitations of SPECLib for Fine-Resolution Land-Cover Mapping

Although the 500-m reflectance spectra in SPECLib were successfully used to produce annual 30-m land-cover map of China, the coarse resolution of the SPECLib might still result in a lack of land-cover types that cover small areas, such as roads and villages. In this study, by importing the reflectance spectra from the 3 × 3 adjacent tiles in SPECLib, we were able to minimize the resolution limitation efficiently. Similarly, Zhang and Roy [] also imported the training data from adjacent tiles to train the local classifiers. However, it is undeniable that this limitation still exists in some extreme areas, therefore, in subsequent work we will develop another candidate spectral library containing the standard reflectance spectra for all land-cover types for every MODIS land tile.

According to the quantitative statistics shown in Table 2 and Table 3, ‘impervious’ had a lower accuracy than other land-cover types. Similarly, the FROM_GLC land-cover product also suffers from having a low accuracy of 30.77% for the impervious type []. Due to the spectral variety of impervious, such as urban areas, rural cottages, and roads, it can be classified into three sub-categories including ‘high reflectance’, ‘low reflectance’ and ‘vegetated’ []. In this study, the impervious reflectance spectra contained in SPECLib were treated as a whole instead of as three different sub-types, therefore, the misclassification of this cover type may have been quite serious. Fortunately, some scientists have proposed several methods to independently identify impervious pixels. For example, Tian et al. [] proposed the construction of a perpendicular impervious surface index for impervious mapping and achieved an overall accuracy of more than 90% for the four cities that they studied. Gao et al. [] used time series of normalized spectral distances and decision trees to map the expansion of the impervious cover in the Yangtze River Delta, achieving an overall accuracy of 92.5%. Therefore, in future work, this cover type will be classified independently using multi-temporal imagery.

6. Conclusions

Large area land-cover mapping is a difficult and time-consuming task that includes sample collection, image processing, feature extraction, classification, mosaicking, and accuracy assessment []. In this study, a novel operational methodology to classify Landsat OLI datacube using reflectance spectra from SPECLib was used to automatically produce the annual land-cover map for the whole of China. First, SPECLib was developed using a time series of the MCD43A4 NBAR and CCI_LC land-cover products. This SPECLib had a spatial resolution of 158.85 km (equivalent to 1.43° at the equator) in the sinusoidal projection and had a temporal resolution of eight days. Secondly, the reflectance spectra in SPECLib were radiometrically normalized according to the linear regression relationship between MCD43A4 NBAR and the Landsat OLI datacube. Lastly, the multi-temporal random forest models were trained using the normalized reflectance spectra and then applied to the Landsat datacube for producing an annual land-cover map with 22 land-cover types. Validation of the results gave an overall accuracy of 80.7% and kappa coefficient of 0.757 for the level-1 validation system (nine land-cover types) and an overall accuracy of 71.3% and kappa coefficient of 0.664 for the level-2 validation system (19 land-cover types). Therefore, it was concluded that the proposed method provides a novel strategy for large-area land-cover mapping. Future work will adopt this method to produce a global land-cover map with a fine classification system.

Author Contributions

Conceptualization, L.L.; investigation, X.Z.; methodology, L.L. and X.Z.; software, X.Z. and X.C.; validation, X.Z., S.X., X.C. and Y.G.; Writing—Original Draft preparation, X.Z.; Writing—Review and Editing, S.X.

Funding

This research was funded by the Strategic Priority Research Program of the Chinese Academy of Sciences (XDA19080304), the National Key Research and Development Program of China (2016YFB0501501), and the National Natural Science Foundation of China (41825002).

Acknowledgments

We thank the USGS and RADI for providing the Landsat OLI data and surface reflectance products free.

Conflicts of Interest

The authors declare no conflict of interest.

References

Azzari, G.; Lobell, D. Landsat-based classification in the cloud: An opportunity for a paradigm shift in land cover monitoring. Remote. Sens. Environ. 2017, 202, 64–74. [Google Scholar] [CrossRef]
Zhao, Y.; Feng, D.; Yu, L.; Wang, X.; Chen, Y.; Bai, Y.; Hernández, H.J.; Galleguillos, M.; Estades, C.; Biging, G.S.; et al. Detailed dynamic land cover mapping of Chile: Accuracy improvement by integrating multi-temporal data. Remote. Sens. Environ. 2016, 183, 170–185. [Google Scholar]
Wessels, K.J.; Bergh, F.V.D.; Roy, D.P.; Salmon, B.P.; Steenkamp, K.C.; MacAlister, B.; Swanepoel, D.; Jewitt, D. Rapid Land Cover Map Updates Using Change Detection and Robust Random Forest Classifiers. Remote. Sens. 2016, 8, 888. [Google Scholar] [CrossRef]
Chen, J.; Chen, J.; Liao, A.; Cao, X.; Chen, L.; Chen, X.; He, C.; Han, G.; Peng, S.; Lu, M.; et al. Global land cover mapping at 30m resolution: A POK-based operational approach. ISPRS J. Photogramm. Sens. 2015, 103, 7–27. [Google Scholar] [CrossRef]
Yu, L.; Wang, J.; Li, X.; Li, C.; Zhao, Y.; Gong, P. A multi-resolution global land cover dataset through multisource data aggregation. Sci. China Earth Sci. 2014, 57, 2317–2329. [Google Scholar] [CrossRef]
Yang, Y.; Xiao, P.; Feng, X.; Li, H. Accuracy assessment of seven global land cover datasets over China. ISPRS J. Photogramm. Sens. 2017, 125, 156–173. [Google Scholar] [CrossRef]
Tsendbazar, N.-E.; De Bruin, S.; Herold, M. Assessing global land cover reference datasets for different user communities. ISPRS J. Photogramm. Sens. 2015, 103, 93–114. [Google Scholar] [CrossRef]
Tsendbazar, N.-E.; De Bruin, S.; Fritz, S.; Herold, M. Spatial Accuracy Assessment and Integration of Global Land Cover Datasets. Remote. Sens. 2015, 7, 15804–15821. [Google Scholar] [CrossRef]
Ban, Y.; Gong, P.; Giri, C. Global land cover mapping using Earth observation satellite data: Recent progresses and challenges. ISPRS J. Photogramm. Sens. 2015, 103, 1–6. [Google Scholar] [CrossRef]
Giri, C.; Pengra, B.; Long, J.; Loveland, T. Next generation of global land cover characterization, mapping, and monitoring. Int. J. Appl. Earth Obs. Geoinformation 2013, 25, 30–37. [Google Scholar]
Gómez, C.; White, J.C.; Wulder, M.A. Optical remotely sensed time series data for land cover classification: A review. ISPRS J. Photogramm. Sens. 2016, 116, 55–72. [Google Scholar] [CrossRef]
Gong, P.; Wang, J.; Yu, L.; Zhao, Y.; Zhao, Y.; Liang, L.; Niu, Z.; Huang, X.; Fu, H.; Liu, S.; et al. Finer resolution observation and monitoring of global land cover: first mapping results with Landsat TM and ETM+ data. Int. J. Remote Sens. 2013, 34, 2607–2654. [Google Scholar] [CrossRef]
Symeonakis, E.; Caccetta, P.; Koukoulas, S.; Furby, S.; Karathanasis, N. Multi-temporal land-cover classification and change analysis with conditional probability networks: the case of Lesvos Island (Greece). Int. J. Remote Sens. 2012, 33, 4075–4093. [Google Scholar] [CrossRef]
Potapov, P.; Turubanova, S.; Hansen, M.C. Regional-scale boreal forest cover and change mapping using Landsat data composites for European Russia. Remote. Sens. Environ. 2011, 115, 548–561. [Google Scholar] [CrossRef]
Giménez, M.G.; De Jong, R.; Della Peruta, R.; Keller, A.; Schaepman, M.E. Determination of grassland use intensity based on multi-temporal remote sensing data and ecological indicators. Remote. Sens. Environ. 2017, 198, 126–139. [Google Scholar] [CrossRef]
Franklin, S.E.; Ahmed, O.S.; Wulder, M.A.; White, J.C.; Hermosilla, T.; Coops, N.C. Large Area Mapping of Annual Land Cover Dynamics Using Multitemporal Change Detection and Classification of Landsat Time Series Data. Can. J. Sens. 2015, 41, 293–314. [Google Scholar] [CrossRef]
Senf, C.; Leitão, P.J.; Pflugmacher, D.; Van Der Linden, S.; Hostert, P. Mapping land cover in complex Mediterranean landscapes using Landsat: Improved classification accuracies from integrating multi-seasonal and synthetic imagery. Remote. Sens. Environ. 2015, 156, 527–536. [Google Scholar] [CrossRef]
Belgiu, M.; Csillik, O. Sentinel-2 cropland mapping using pixel-based and object-based time-weighted dynamic time warping analysis. Remote. Sens. Environ. 2018, 204, 509–523. [Google Scholar] [CrossRef]
Yamazaki, D.; Trigg, M.A.; Ikeshima, D. Development of a global ~90m water body map using multi-temporal Landsat images. Remote. Sens. Environ. 2015, 171, 337–351. [Google Scholar] [CrossRef]
Dennison, P.E.; Roberts, D.A. The effects of vegetation phenology on endmember selection and species mapping in southern California chaparral. Remote. Sens. Environ. 2003, 87, 295–309. [Google Scholar] [CrossRef]
Dudley, K.L.; Dennison, P.E.; Roth, K.L.; Roberts, D.A.; Coates, A.R. A multi-temporal spectral library approach for mapping vegetation species across spatial and temporal phenological gradients. Remote. Sens. Environ. 2015, 167, 121–134. [Google Scholar] [CrossRef]
Hansen, M.C.; Loveland, T.R. A review of large area monitoring of land cover change using Landsat data. Remote. Sens. Environ. 2012, 122, 66–74. [Google Scholar] [CrossRef]
Yu, L.; Wang, J.; Clinton, N.; Xin, Q.; Zhong, L.; Chen, Y.; Gong, P. FROM-GC: 30 m global cropland extent derived through multisource data integration. Int. J. Digit. Earth 2013, 6, 521–533. [Google Scholar] [CrossRef]
Li, C.; Gong, P.; Wang, J.; Zhu, Z.; Biging, G.S.; Yuan, C.; Hu, T.; Zhang, H.; Wang, Q.; Li, X.; et al. The first all-season sample set for mapping global land cover with Landsat-8 data. Sci. Bull. 2017, 62, 508–515. [Google Scholar] [CrossRef]
Pax-Lenney, M.; E Woodcock, C.; A Macomber, S.; Gopal, S.; Song, C. Forest mapping with a generalized classifier and Landsat TM data. Remote. Sens. Environ. 2001, 77, 241–250. [Google Scholar] [CrossRef]
E Woodcock, C.; A Macomber, S.; Pax-Lenney, M.; Cohen, W.B. Monitoring large areas for forest change using Landsat: Generalization across space, time and Landsat sensors. Remote. Sens. Environ. 2001, 78, 194–203. [Google Scholar] [CrossRef]
Liu, L.; Xiao, Z.; Yong, H.; Wang, Y. Automatic land cover mapping for Landsat data based on the time-series spectral image database. In Proceedings of the Geoscience & Remote Sensing Symposium, 2017, Fort Worth, TX, USA, 23–28 July 2017. [Google Scholar]
Dannenberg, M.P.; Hakkenberg, C.R.; Song, C. Consistent Classification of Landsat Time Series with an Improved Automatic Adaptive Signature Generalization Algorithm. Remote. Sens. 2016, 8, 691. [Google Scholar] [CrossRef]
Hu, Y.; Liu, L. Landsat time-series land cover mapping with spectral signature extension method. Remote Sens. 2015, 19, 639–656. [Google Scholar]
Zhang, H.K.; Roy, D.P. Using the 500 m MODIS land cover product to derive a consistent continental scale 30 m Landsat land cover classification. Remote. Sens. Environ. 2017, 197, 15–34. [Google Scholar] [CrossRef]
Radoux, J.; Lamarche, C.; Van Bogaert, E.; Bontemps, S.; Brockmann, C.; Defourny, P. Automated Training Sample Extraction for Global Land Cover Mapping. Remote. Sens. 2014, 6, 3965–3987. [Google Scholar] [CrossRef]
Olthof, I.; Butson, C.; Fraser, R. Signature extension through space for northern land cover classification: A comparison of radiometric correction methods. Remote Sens. Environ. 2005, 95, 290–302. [Google Scholar] [CrossRef]
Zhang, X.; Liu, L.; Wang, Y.; Hu, Y.; Zhang, B. A SPECLib-based operational classification approach: A preliminary test on China land cover mapping at 30 m. Int. J. Appl. Earth Obs. Geoinformation 2018, 71, 83–94. [Google Scholar] [CrossRef]
Chen, Z. Hasituya Mapping Plastic-Mulched Farmland with Multi-Temporal Landsat-8 Data. Remote. Sens. 2017, 9, 557. [Google Scholar]
Egorov, A.; Hansen, M.; Roy, D.; Kommareddy, A.; Potapov, P. Image interpretation-guided supervised classification using nested segmentation. Remote. Sens. Environ. 2015, 165, 135–147. [Google Scholar] [CrossRef]
Paola, J.; Schowengerdt, R. A detailed comparison of backpropagation neural network and maximum-likelihood classifiers for urban land use classification. IEEE Trans. Geosci. Sens. 1995, 33, 981–996. [Google Scholar] [CrossRef]
Breiman, L.; Friedman, J.H.; Olshen, R.A.; Stone, C.J. Classification and Regression Trees; Routledge: New York, NY, USA, 1984; Available online: https://doi.org/10.1201/9781315139470 (accessed on 4 May 2019).
Vapnik, V.; Cortes, C. Support Vector Networks. Available online: https://doi.org/10.1007/BF00994018 (accessed on 4 May 2019).
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. Available online: https://doi.org/10.1023/A:1010933404324 (accessed on 4 May 2019). [CrossRef]
Belgiu, M.; Drăguț, L. Random forest in remote sensing: A review of applications and future directions. ISPRS J. Photogramm. Sens. 2016, 114, 24–31. [Google Scholar] [CrossRef]
Shao, Y.; Lunetta, R.S. Comparison of support vector machine, neural network, and CART algorithms for the land-cover classification using limited training data points. ISPRS J. Photogramm. Sens. 2012, 70, 78–87. [Google Scholar] [CrossRef]
Pelletier, C.; Valero, S.; Inglada, J.; Champion, N.; Dedieu, G. Assessing the robustness of Random Forests to map land cover with high resolution satellite image time series over large areas. Remote. Sens. Environ. 2016, 187, 156–168. [Google Scholar] [CrossRef]
Du, P.; Samat, A.; Waske, B.; Liu, S.; Li, Z. Random Forest and Rotation Forest for fully polarized SAR image classification using polarimetric and spatial features. ISPRS J. Photogramm. Sens. 2015, 105, 38–53. [Google Scholar] [CrossRef]
Teillet, P.; Guindon, B.; Goodenough, D. On the Slope-Aspect Correction of Multispectral Scanner Data. Can. J. Sens. 1982, 8, 84–106. [Google Scholar] [CrossRef]
Tan, B.; Masek, J.G.; Wolfe, R.; Gao, F.; Huang, C.; Vermote, E.F.; Sexton, J.O.; Ederer, G. Improved forest change detection with terrain illumination corrected Landsat images. Remote. Sens. Environ. 2013, 136, 469–483. [Google Scholar] [CrossRef]
Hu, Y.; Liu, L.; Liu, L.; Peng, D.; Jiao, Q.; Zhang, H. A Landsat-5 Atmospheric Correction Based on MODIS Atmosphere Products and 6S Model. IEEE J. Sel. Top. Appl. Earth Obs. Sens. 2014, 7, 1609–1615. [Google Scholar] [CrossRef]
Roy, D.P.; Qin, Y.; Kovalskyy, V.; Vermote, E.F.; Ju, J.; Egorov, A.; Hansen, M.C.; Kommareddy, I.; Yan, L. Conterminous United States demonstration and characterization of MODIS-based Landsat ETM+ atmospheric correction. Remote Sens. Environ. 2014, 140, 433–449. [Google Scholar] [CrossRef]
Vermote, E.; Justice, C.; Claverie, M.; Franch, B. Preliminary analysis of the performance of the Landsat 8/OLI land surface reflectance product. Remote. Sens. Environ. 2016, 185, 46–56. [Google Scholar] [CrossRef]
Kovalskyy, V.; Roy, D.P. The global availability of Landsat 5 TM and Landsat 7 ETM+ land surface observations and implications for global 30m Landsat data product generation. Remote Sens. Environ. 2013, 130, 280–293. [Google Scholar] [CrossRef]
Li, J.; Roy, D.P. A Global Analysis of Sentinel-2A, Sentinel-2B and Landsat-8 Data Revisit Intervals and Implications for Terrestrial Monitoring. Remote. Sens. 2017, 9, 902. [Google Scholar]
Lewis, A.; Oliver, S.; Lymburner, L.; Evans, B.; Wyborn, L.; Mueller, N.; Raevksi, G.; Hooke, J.; Woodcock, R.; Sixsmith, J.; et al. The Australian Geoscience Data Cube — Foundations and lessons learned. Remote. Sens. Environ. 2017, 202, 276–292. [Google Scholar] [CrossRef]
Zhu, Z.; Wang, S.; Woodcock, C.E. Improvement and expansion of the Fmask algorithm: cloud, cloud shadow, and snow detection for Landsats 4–7, 8, and Sentinel 2 images. Remote. Sens. Environ. 2015, 159, 269–277. [Google Scholar] [CrossRef]
Zhu, Z.; Woodcock, C.E. Object-based cloud and cloud shadow detection in Landsat imagery. Remote. Sens. Environ. 2012, 118, 83–94. [Google Scholar] [CrossRef]
Zhu, X.; Gao, F.; Liu, D.; Chen, J. A Modified Neighborhood Similar Pixel Interpolator Approach for Removing Thick Clouds in Landsat Images. IEEE Geosci. Sens. Lett. 2012, 9, 521–525. [Google Scholar] [CrossRef]
Defourny, P.; Kirches, G.; Brockmann, C.; Boettcher, M.; Peters, M.; Bontemps, S.; Lamarche, C.; Schlerf, M.; Santoro, M. Land Cover CCI: Product User Guide Version 2. 2018. Available online: https://www.esa-landcover-cci.org/?q=webfm_send/84 (accessed on 4 May 2019).
Friedl, M.A.; Sulla-Menashe, D.; Tan, B.; Schneider, A.; Ramankutty, N.; Sibley, A.; Huang, X. MODIS Collection 5 global land cover: Algorithm refinements and characterization of new datasets. Remote. Sens. Environ. 2010, 114, 168–182. [Google Scholar] [CrossRef]
Feng, M.; Huang, C.; Channan, S.; Vermote, E.F.; Masek, J.G.; Townshend, J.R. Quality assessment of Landsat surface reflectance products using MODIS data. Comput. Geosci. 2012, 38, 9–22. [Google Scholar] [CrossRef]
Bontemps, S.; Defourny, P.; Bogaert, E.V.; Arino, O.; Kalogirou, V.; Perez, J.R. GLOBCOVER 2009 Products Description and Validation Report. 2010. Available online: http://due.esrin.esa.int/files/GLOBCOVER2009_Validation_Report_2.2.pdf (accessed on 4 May 2019).
Wang, Z.; Schaaf, C.B.; Strahler, A.H.; Chopping, M.J.; Román, M.O.; Shuai, Y.; Woodcock, C.E.; Hollinger, D.Y.; Fitzjarrald, D.R. Evaluation of MODIS albedo product (MCD43A) over grassland, agriculture and forest surface types during dormant and snow-covered periods. Remote. Sens. Environ. 2014, 140, 60–77. [Google Scholar] [CrossRef]
Feng, M.; Sexton, J.O.; Huang, C.; Masek, J.G.; Vermote, E.F.; Gao, F.; Narasimhan, R.; Channan, S.; Wolfe, R.E.; Townshend, J.R. Global surface reflectance products from Landsat: Assessment using coincident MODIS observations. Remote. Sens. Environ. 2013, 134, 276–293. [Google Scholar] [CrossRef]
Tucker, C.J. Red and photographic infrared linear combinations for monitoring vegetation. Remote. Sens. Environ. 1979, 8, 127–150. [Google Scholar] [CrossRef]
Xu, H. Modification of normalised difference water index (NDWI) to enhance open water features in remotely sensed imagery. Int. J. Sens. 2006, 27, 3025–3033. [Google Scholar] [CrossRef]
Zha, Y.; Gao, J.; Ni, S. Use of normalized difference built-up index in automatically mapping urban areas from TM imagery. Int. J. Sens. 2003, 24, 583–594. [Google Scholar] [CrossRef]
Gomariz-Castillo, F.; Alonso-Sarría, F.; Cánovas-García, F. Improving Classification Accuracy of Multi-Temporal Landsat Images by Assessing the Use of Different Algorithms, Textural and Ancillary Information for a Mediterranean Semiarid Area from 2000 to 2015. Remote. Sens. 2017, 9, 1058. [Google Scholar] [CrossRef]
Rumpf, S.B.; Hülber, K.; Klonner, G.; Moser, D.; Schütz, M.; Wessely, J.; Willner, W.; Zimmermann, N.E.; Dullinger, S. Range dynamics of mountain plants decrease with elevation. Proc. Natl. Acad. Sci. USA 2018, 115, 1848–1853. Available online: https://www.pnas.org/content/115/8/1848 (accessed on 4 May 2019). [CrossRef]
Yang, L.; Meng, X.; Zhang, X. SRTM DEM and its application advances. Int. J. Sens. 2011, 32, 3875–3896. [Google Scholar] [CrossRef]
Jin, X.M.; Zhang, Y.K.; Schaepman, M.E.; Clevers, J.G.P.W.; Su, Z. Impact of Elevation and Aspect on the Spatial Distribution of Vegetation in the Qilian Mountain Area with Remote Sensing Data. Available online: https://bit.ly/2Lmlkkj (accessed on 4 May 2019).
Gislason, P.O.; Benediktsson, J.A.; Sveinsson, J.R. Random Forests for land cover classification. Pattern Recognit. Lett. 2006, 27, 294–300. [Google Scholar] [CrossRef]
Ghosh, A.; Fassnacht, F.E.; Joshi, P.; Koch, B. A framework for mapping tree species combining hyperspectral and LiDAR data: Role of selected classifiers and sensor across three spatial scales. Int. J. Appl. Earth Obs. Geoinformation 2014, 26, 49–63. [Google Scholar] [CrossRef]
Healey, S.P.; Cohen, W.B.; Yang, Z.; Brewer, C.K.; Brooks, E.B.; Gorelick, N.; Hernandez, A.J.; Huang, C.; Hughes, M.J.; Kennedy, R.E.; et al. Mapping forest change using stacked generalization: An ensemble approach. Remote. Sens. Environ. 2018, 204, 717–728. [Google Scholar] [CrossRef]
Yang, X.; Lo, D.; Xia, X.; Sun, J. TLEL: A two-layer ensemble learning approach for just-in-time defect prediction. Inf. Softw. Technol. 2017, 87, 206–220. [Google Scholar] [CrossRef]
Löw, F.; Conrad, C.; Michel, U. Decision fusion and non-parametric classifiers for land use mapping using multi-temporal RapidEye data. ISPRS J. Photogramm. Sens. 2015, 108, 191–204. [Google Scholar] [CrossRef]
Yin, D.; Cao, X.; Chen, X.; Shao, Y.; Chen, J. Comparison of automatic thresholding methods for snow-cover mapping using Landsat TM imagery. Int. J. Sens. 2013, 34, 6529–6538. [Google Scholar] [CrossRef]
Liu, C.; Frazier, P.; Kumar, L. Comparative assessment of the measures of thematic classification accuracy. Remote. Sens. Environ. 2007, 107, 606–616. [Google Scholar] [CrossRef]
Olofsson, P.; Foody, G.M.; Herold, M.; Stehman, S.V.; Woodcock, C.E.; Wulder, M.A. Good practices for estimating area and assessing accuracy of land change. Remote. Sens. Environ. 2014, 148, 42–57. [Google Scholar] [CrossRef]
Karakizi, C.; Karantzalos, K.; Vakalopoulou, M.; Antoniou, G. Detailed Land Cover Mapping from Multitemporal Landsat-8 Data of Different Cloud Cover. Remote. Sens. 2018, 10, 1214. [Google Scholar] [CrossRef]
Yvan, S.; Iñaki, I.; Pedro, L. A review of feature selection techniques in bioinformatics. Bioinformatics 2007, 23, 2507–2517. [Google Scholar]
Roy, D.; Kovalskyy, V.; Zhang, H.; Vermote, E.; Yan, L.; Kumar, S.S.; Egorov, A. Characterization of Landsat-7 to Landsat-8 reflective wavelength and normalized difference vegetation index continuity. Remote. Sens. Environ. 2016, 185, 57–70. [Google Scholar] [CrossRef]
Irons, J.R.; Dwyer, J.L.; Barsi, J.A. The next Landsat satellite: The Landsat Data Continuity Mission. Remote. Sens. Environ. 2012, 122, 11–21. [Google Scholar] [CrossRef]
Drusch, M.; Del Bello, U.; Carlier, S.; Colin, O.; Fernandez, V.; Gascon, F.; Hoersch, B.; Isola, C.; Laberinti, P.; Martimort, P.; et al. Sentinel-2: ESA’s Optical High-Resolution Mission for GMES Operational Services. Remote Sens. Environ. 2012, 120, 25–36. [Google Scholar] [CrossRef]
Tian, Y.; Chen, H.; Song, Q.; Zheng, K. A Novel Index for Impervious Surface Area Mapping: Development and Validation. Remote. Sens. 2018, 10, 1521. [Google Scholar] [CrossRef]
Gao, F.; De Colstoun, E.B.; Ma, R.; Weng, Q.; Masek, J.G.; Chen, J.; Pan, Y.; Song, C. Mapping impervious surface expansion using medium-resolution satellite image time series: a case study in the Yangtze River Delta, China. Int. J. Sens. 2012, 33, 7609–7628. [Google Scholar] [CrossRef]

Figure 1. The three-level GSinGrid tiles used to store the multi-temporal images of China. The study area was hierarchically tiled into 19 standard MODIS level-1 land tiles (red), approximately 550 GWELD level-2 tiles (cyan) and 305,000 GSinGrid tiles (yellow), respectively.

Figure 2. The locations and frequency distributions of Landsat WRS-2 scenes (shown in the geographical projection) and corresponding GSinGrid tiles (shown in the sinusoidal projection) (a,b).

Figure 3. The spatial distributions of the validation samples.

Figure 4. The flowchart of the multi-temporal classification method.

Figure 5. The annual land cover map in 2015 derived from multi-temporal Landsat imagery over all 305,000 GSinGrid tiles using Landsat datacube.

Figure 6. The relationship between the temporal frequency of the Landsat SR and the overall accuracy, as found using the validation samples.

Figure 7. Density plots for Landsat images (y-axis) in land tile hh26vv05h1v2 land-tile (see Section 2.1) acquired on 13 July 2015 against the MODIS 16-day NBAR (x-axis). The black dashed line is the 1:1 line and the solid line is the fitted line.

Table 1. The classification system used in the study and the corresponding two-level validation systems.

Level-1 Vali-System	Level-2 Vali-System	Classification System	LC Id
Cropland	Herbaceous rainfed cropland	Herbaceous cover	11
	Tree rainfed cropland	Tree or shrub cover (Orchard)	12
	Irrigated cropland	Irrigated cropland	20
Forest	Evergreen broadleaved forest	Evergreen broadleaved forest	50
	Deciduous broadleaved forest	Open deciduous broadleaved forest (0.15 < fc < 0.4)	61
	Deciduous broadleaved forest	Closed deciduous broadleaved forest (fc > 0.4)	62
	Evergreen needle-leaved forest	Open evergreen needle-leaved forest (0.15 < fc < 0.4)	71
	Evergreen needle-leaved forest	Closed evergreen needle-leaved forest (fc > 40%)	72
	Deciduous needle-leaved forest	Open deciduous needle-leaved forest (0.15 < fc < 0.4)	81
	Deciduous needle-leaved forest	Closed deciduous needle-leaved forest (fc > 0.4)	82
	Mixed leaf forest	Mixed leaf forest (broadleaved and needle-leaved)	90
Shrubland	Evergreen shrubland	Evergreen shrubland	121
Shrubland	Deciduous shrubland	Deciduous shrubland	122
Grassland	Grassland	Grassland	130
Wetlands	Lichens and mosses	Lichens and mosses	140
Wetlands	Wetlands	Wetlands	180
Impervious	Impervious	Impervious	190
Bare areas	Sparse vegetation	Sparse vegetation (tree, herbaceous cover) (fc < 15%)	150
	Consolidated bare areas	Consolidated bare areas	201
	Unconsolidated bare areas	Unconsolidated bare areas	202
Water body	Water body	Water body	210
Ice and snow	Permanent ice and snow	Permanent ice and snow	220

Table 2. Confusion matrix for the annual land cover map using the level-2 validation system.

	HRC	TRC	ICL	EBF	DBF	ENF	DNF	MLF	ESH	DSH	GRL	LIM	SPV	WEL	Imp	CBA	UBA	Water	SNI	Total	P.A.
HRC	759	0	90	55	13	17	0	0	6	1	167	0	5	1	31	2	0	2	0	1149	0.661
TRC	20	50	0	3	4	7	20	0	0	2	0	0	3	0	0	0	0	0	0	109	0.459
ICL	127	8	479	33	1	6	0	0	4	1	23	0	3	1	43	4	0	4	0	737	0.650
EBF	25	4	7	346	37	136	0	0	3	5	0	0	0	0	0	0	0	0	0	563	0.615
DBF	20	4	7	57	434	21	26	32	13	0	18	0	0	0	1	1	0	0	0	634	0.685
ENF	35	8	8	190	92	373	16	25	4	0	23	0	1	0	0	0	0	1	1	777	0.480
DNF	1	0	0	1	38	3	187	15	2	0	11	0	0	0	1	0	0	0	1	260	0.719
MLF	0	0	0	0	5	0	3	7	0	0	0	0	0	0	0	0	0	0	0	15	0.467
ESH	4	2	1	17	1	16	0	0	54	0	0	0	0	0	0	0	0	0	0	95	0.568
DSH	0	0	1	6	11	2	6	0	0	23	0	0	0	0	0	0	0	0	0	49	0.469
GRL	156	0	57	8	22	22	3	4	13	0	2372	0	114	6	21	194	1	6	17	3016	0.786
LIM	0	0	0	0	0	0	0	0	0	0	6	9	1	0	0	8	0	0	0	24	0.375
SVE	4	0	4	1	0	0	0	0	2	0	98	0	125	3	9	28	0	0	1	275	0.455
WEL	3	2	7	3	2	1	0	1	0	0	19	0	4	51	8	15	1	3	2	122	0.418
Imp	46	1	27	3	8	6	1	0	1	0	35	0	7	4	152	4	0	5	0	300	0.507
CBA	1	0	16	0	0	1	0	1	7	0	275	13	67	5	7	1932	6	2	7	2340	0.826
UBA	0	0	0	0	0	0	0	0	0	0	6	0	1	0	0	11	38	0	0	56	0.679
Water	9	0	9	1	5	1	1	1	3	0	0	0	1	4	3	0	0	427	5	470	0.909
SNI	0	0	0	1	2	5	0	0	0	0	23	1	2	0	0	13	0	4	190	241	0.788
Total	1210	79	713	725	675	617	263	86	112	32	3076	23	334	75	276	2212	46	454	224	11,232
U.A.	0.627	0.633	0.672	0.477	0.643	0.605	0.711	0.081	0.482	0.719	0.771	0.391	0.374	0.680	0.551	0.873	0.826	0.941	0.848
O.A.	0.713
Kappa	0.664

HRC: Herbaceous rainfed cropland, TRC: Tree rainfed cropland, ICL: Irrigated cropland, EBF: Evergreen broadleaved forest, DBF: Deciduous broadleaved forest, ENF: Evergreen needle-leaved forest, DNF: Deciduous needle-leaved forest, MLF: Mixed-leaf forest, ESH: Evergreen shrubland, DSH: Deciduous shrubland, GRL: Grassland, LIM: Lichens and mosses, SPV: Sparse vegetation, WEL: Wetlands, Imp: Impervious, CBA: Consolidated bare areas, UBA: Unconsolidated bare areas, Water: Water body, SNI: Snow and ice.

Table 3. Confusion matrix for the annual land cover map using the level-1 validation system.

	CRL	FST	Shru	GRL	WEL	Imp	BareA	Water	SNI	Total	P.A.
CRL	1533	159	14	190	2	74	17	6	0	1995	0.768
FST	119	2044	27	52	0	2	2	1	2	2249	0.909
Shru	8	59	77	0	0	0	0	0	0	144	0.535
GRL	213	59	13	2372	6	21	309	6	17	3016	0.786
WEL	12	7	0	25	60	8	29	3	2	146	0.411
Imp	74	18	1	35	4	152	11	5	0	300	0.507
BareA	25	3	9	379	21	16	2208	2	8	2671	0.827
Water	18	9	3	0	4	3	1	427	5	470	0.909
SNI	0	8	0	23	1	0	15	4	190	241	0.788
Total	2002	2366	144	3076	98	276	2592	454	224	11,232
U.A.	0.766	0.864	0.535	0.771	0.612	0.551	0.852	0.941	0.848
O.A.	0.807
Kappa	0.757

CRL: Cropland, FST: Forest, Shru: Shrubland, GRL: Grassland, WEL: Wetlands, Imp: Impervious, Water: Water body, BareA: Bare areas, SNI: Permanent snow and ice.

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Fine Land-Cover Mapping in China Using Landsat Datacube and an Operational SPECLib-Based Approach

Abstract

1. Introduction

2. Datasets and Preprocessing

2.1. Landsat Imagery and Datacube

2.1.1. Reprojection and Tiling

2.1.2. Cloud and Shadow Detection and Filling

2.2. Validation Dataset

3. Methods

3.1. The Spatial-Temporal Spectral Library

3.2. Normalization of the SPECLib Reflectance Spectra

3.3. Multi-Temporal Classification Method Based on SPECLib

3.3.1. Training the Base Classifier

3.3.2. Stacking of Base Classifiers

3.3.3. Rule-Based Verification

3.4. Accuracy Assessment

4. Results and Validation

5. Discussion

5.1. Influence of the Temporal Frequency

5.2. Consistency between MCD43A4 and Landsat SR for Land-Cover Mapping

5.3. Limitations of SPECLib for Fine-Resolution Land-Cover Mapping

6. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics