1. Introduction
Fuzziness is an inherent uncertainty prevalent in the geographic world. Almost all natural geographical objects have a vague boundary, such as meadows, animal habits, wetlands, and city influence zones [
1,
2,
3]. The extensive ambiguity between fuzzy geospatial objects pose many challenges when processing multispectral remote sensing data. Since the introduction of the fuzzy set theory by Zadeh [
4], geographers have widely applied it in the study of geographical issues [
5,
6,
7,
8]. Fuzzy methods use the concept of membership function (MF) to describe the vagueness of geographic objects. MFs can be generated from remote sensing images by fuzzy clustering or classification methods [
7,
9,
10,
11], and the membership values in the MF of each object (such as pixels, image objects, and super-pixels) express the possibility that each object partially belonging to different classes. Specifically, membership values (denoted as
) are assigned to an object
for all considered
classes after fuzzy clustering or classification tasks. A value of
indicates that the object does not completely belong to a class, while a value
indicates that it completely belongs to the class. The closer the value is to 1, the higher membership degree, and vice versa [
7].
Medium-resolution remote sensing images, such as Sentinel-2 and Landsat remote sensing images, can be used to support global land services, including vegetation monitoring and water coverage [
12,
13,
14]. Because of their availability and low cost, these data have received more and more attention in remote sensing research and applications. However, due to the limitation of the coarse spatial resolution of these images, many mixed pixels exist, leading to a limited ability to express local detail information, such as small objects. Therefore, medium-resolution images and their products (such as classification results and membership degrees) often need to be downscaled, such as data fusion [
15,
16,
17,
18,
19,
20] and sub-pixel mapping [
19,
21,
22,
23,
24,
25,
26,
27], to improve the resolution of remote sensing images. Downscaling aims to improve the spatial resolution of the image and reproduce more information that is not available in the original data [
28,
29,
30]. Generally, two classes of downscaling can be distinguished based on predicting a continuous or categorical variable [
30]. For downscaling continuous variables, the methods can be divided into downscaling without and with fine-scale auxiliary information [
28,
31,
32,
33]. The downscaling method involved in this study is a downscaling method without fine-scale auxiliary information.
The downscaling method without auxiliary information mainly contains two forms of resampling and spatial statistical analysis [
28]. The univariate downscaling technique considers only a single spatial variance function and uses interpolation to achieve the downscaling of geospatial variables. However, interpolation methods, such as Bilinear interpolation [
34], inverse distance-weighted methods [
35], and function-based interpolation [
36], do not consider the shape of the data scale and the spatial correlation information. When downscaling continuous variables, geostatistical methods consider the spatial correlation between geological data. However, simple kriging and ordinary kriging do not solve the problem of modifiable regional units when downscaling [
37].
To address this challenge, Kyriakidis [
35] developed the area-to-point kriging method (ATPK). The principle of ATPK is that the unknown point value is a linearly weighted sum of the data on the area where it is located and the surrounding area. ATPK is distinguished from conventional centroid-based kriging, which ignores spatial support. ATPK explicitly considers the size of the support and predicts variables from regional supports to points by semivariogram deconvolution, with parameterized random function models and kriging [
36]. Additionally, ATPK can precisely honor the observed areal data and provide predictions on finer support (point or area) than those of the original data, ensuring the coherence of prediction [
36]. For example, the sum of downscaled predictions within any area is equal to the original aggregated count. Therefore, ATPK was originally proposed for downscaling data with irregular geographic units, where the size and shape supported by observations vary by geographic location, such as population and disease data across counties. After elaborating on the technique in the general case of census data, Kyriakidis and Yoo [
37] further developed the technique for direct application to remote sensing imagery and demonstrated that the method reproduces point histograms, point semivariograms, and coarse resolution data. Although the basic model is easily applicable to remote sensing data, in the initial stage of the development of the ATPK method, the method was mostly used for downscaling data such as regional population [
37] and soil types [
36]. Therefore, the number of applications of ATPK in remote sensing was quite limited.
Based on the characteristics of the ATPK method, many researchers have extended and developed the ATPK method in recent years and proposed many methods for downscaling remote sensing data. Liu et al. [
38] proposed an area-to-point regression kriging (ATPRK) method to decompose population density from irregular census units into land use areas within them, where remote sensing images (i.e., IKONOS images) were used to obtain auxiliary information. ATPRK shows its potential to classify irregular geographic unit data. Combining the advantages of regression kriging and ATPK, Wang et al. [
16] proposed an ATPRK method that can be used for remote sensing image downscaling and proved that the method has perfect coherence properties and can produce high-quality sharpened images. Zhang et al. [
20] combined the fuzzy c-means segmentation algorithm with the ATRPK algorithm to propose an image pan-sharpening method for the OATPRK that can always achieve the sharpening of multi-spectral images with the highest precision.
Geostatistics uses variance functions as a tool to describe spatial variability, and accurate predictions can only be made using variance functions that truly reflect the structure of spatial variability [
39]. Although the ATPK method has its unique advantages in remote sensing image downscaling, it is known that the ATPK method requires an accurate semivariogram to support it [
40]. Nevertheless, the existing research on spatial downscaling based on the ATPK method has focused on using the overall semivariogram of spatial variables to estimate the spatial variation and relevant information of geographical objects [
16,
20,
31]. However, this approach may not be effective as the global image is not stationary, which can limit the performance of extraction algorithms. Some researchers have used geostatistical methods for downscaling studies in the form of moving windows to consider the variation across different spatial geographic objects, but this method requires setting the size of the window to be sufficient to estimate the variation, and cannot consider individual geographic objects independently [
15]. Universal kriging is one of the spatial interpolators from geostatistics, a tool that deals with spatial autocorrelation using a probabilistic approach of regionalized variables [
39,
41]. As we know, in the real world, remote sensing images cover different spatial patterns of geographic objects or different terrain type areas, and the spatial boundary changes in these geographic objects are different, so the variogram for each feature object is also different [
42,
43,
44,
45,
46]. However, the existing ATPK method and its optimization method do not fully consider this spatial heterogeneity feature, i.e., they do not consider the unique shape, size, and spatial correlation of objects.
Therefore, in response to the above problems, this study takes into account the heterogeneity of fuzzy geospatial objects and proposes a fuzzy geospatial object-based ATPK membership function downscaling. A fuzzy geospatial object is a geospatial object with fuzzy boundaries, distinguishing it from a typical fuzzy image object. This unique characteristic has been identified in previous research [
42,
43,
47]. In the study, first of all, the MFs with the coarse spatial resolution are produced by the fuzzy random forest method using medium spatial remote sensing images, i.e., Sentinel-2. Secondly, the extraction of fuzzy geospatial objects is obtained combining slope analysis and connected component labelling algorithm. Thirdly, two different semivariogram calculation methods are proposed for the different fuzzy geospatial objects. Then, the ATPK method is used to downscale each fuzzy geospatial object based on its semivariogram. Finally, the downscaling results are compared and analyzed based on other downscaling methods with this method.
Compared to classical ATPKs, the method proposed in this study takes into account the heterogeneity of different fuzzy geospatial objects and enables ATPK downscaling for semivariograms of different fuzzy geospatial objects. More importantly, the MFs used in this study can identify the boundaries of fuzzy geospatial objects and extract them more easily. The contributions of this study are as follows:
Obtaining a low-error MF image for each land use type based on the fuzzy random forest method.
Using the threshold segmentation method and gradient analysis method to study fuzzy objects and topology (internal, boundary, and external); a connected component labelling algorithm is used to label each fuzzy geospatial object.
Studying different methods of calculating semivariograms for geographic objects at different scales: (1) based on fuzzy geospatial object boundaries and (2) based on small-scale fuzzy geospatial objects.
Developing an ATPK downscaling method for fuzzy spatial variables with composite multiple semivariograms.
The organization of this paper is as follows.
Section 2 introduces the principles of the new downscaling approach in detail, which includes three parts: the extraction of fuzzy geospatial objects, the calculation of semivariogram for each geographic object, and ATPK downscaling methods.
Section 3 presents the study area, study data, and experiment set.
Section 4 presents the discussion. The conclusion is summarized in
Section 5.
2. Methodology
Four main steps were adopted for the fuzzy geospatial object level ATPK, as shown in
Figure 1. Firstly, the MFs of medium-resolution remote sensing imagery were generated with a coarse spatial resolution using the fuzzy random forest method. Secondly, fuzzy geospatial objects were extracted by combining slope analysis and a connected component labelling algorithm. Thirdly, different semivariogram calculation methods were used for each fuzzy geospatial object with different scale sizes. Lastly, each MF image was downscaled using the ATPK method according to the semivariogram of each fuzzy geospatial object.
2.1. Generating Membership Function by the Fuzzy Random Forest
It has been proved that random forest class probabilities are consistent estimates of the true probabilities [
48]. According to Marcio et al. [
49], land use and land cover area can be estimated from class membership probability for the random forest classification. Therefore, in this study, this conclusion was used to generate a membership map of each land use type for downscaling. The method mainly consists of two steps, mainly random forest classification algorithm and class proportion estimates.
In the fuzzy random forest algorithm, the land cover type is first predicted using the random forest classification, and then class proportions in the prediction dataset are estimated by Equation (1) counting the number of pixels in each classification and dividing by the number of pixels in the prediction dataset.
where
is the number of pixels in the population,
is the class index, and
is the number of classes
for pixel
. Equation (2) averages the class membership probabilities estimates of each class over all pixels.
where
is the class membership probability of class
for pixel
.
Through the above steps, the membership function of each land cover can be obtained.
2.2. Fuzzy Geospatial Object Extraction
In the extraction of fuzzy geospatial objects, threshold segmentation and slope analysis were used to study the fuzzy objects and their topological components (internal, external, and boundary); later, a connected component labeling algorithm was used to label each fuzzy geospatial object.
2.2.1. Slope Analysis
Slope analysis determines the steepness at each cell of the raster surface, with the smaller slope values indicating a flatter terrain, and vice versa. Slope analysis is most commonly used in elevation dataset processing. In this paper, slope analysis was used to identify geographic boundaries in membership maps. The slope calculation formula is as follows:
where
is the slope (°), between 0° and 90°;
is the gradient change rate of the membership value in the
direction; and
represents the gradient change rate of the membership value in the
direction [
50].
is the conversion factor for the radian rotation angle.
As far as the characteristics of the membership map are concerned, the membership value of each type of membership map is 1 within the general geographic object, and the membership value at the boundary gradually transitions from 1 to 0. Therefore, within the geographic object, there is no membership gradient change, and the slope value is 0, while in the edge area of the geographic object, there is a gradient change in which the membership value gradually transitions from 1 to 0, and the slope value is greater than 0. The values of the center cell and its eight neighbors determine the horizontal and vertical gradients. It is thus possible to extract edge regions of geographic objects, as shown in
Figure 2.
For general fuzzy geospatial objects, gradient variation occurs only at the edge region. Therefore, slope analysis can be used to identify the edge regions of fuzzy geospatial objects. The edge regions can be extracted by combining the connected component-labeling algorithm and the slope analysis method. Since there is no gradient change within the general fuzzy geospatial objects, it may affect the changing trend of the semivariogram. Therefore, only the boundary regions with gradient changes are extracted for the subsequent semivariogram calculation. On the other hand, for small-scale fuzzy geospatial objects, there is a gradient variation in MF values due to their small number of pixels and the fact that a pixel may represent more than one type of feature. Hence, slope analysis can extract the entire fuzzy geospatial object regions.
2.2.2. Connected Component Labeling
Since the range of MF value is from 0 to 1, the greater the MF value, the greater the possibility that the object belongs to this class, and vice versa. Since very small MF values of less than 0.05 are generated when generating MF images, the thresholds [0.05, 0.95] were used in this study for each type of geographical object MF for normalization. The pixel whose MF value is less than 0.05 was assigned a MF value of 0, and the pixel whose MF value is greater than 0.95 was assigned a MF value of 1. The purpose is to reduce the interference of very small MF value pixels among fuzzy geospatial objects so that the interconnected regions form one independent fuzzy geospatial object, which can be labeled by the connected component-labeling algorithm.
The connected component labeling (CCL) technique is used in image processing to identify pixels in an image [
51,
52] and mark the object pixels in the binary image. It enables each individual connected domain to form a marked block, and relevant image feature information, such as the area, bounding box, and centroid of these regions, can be obtained. Once the connected components in an image have been labeled, they can be further analyzed individually. In this study, the MF was first binarized, and then the 8-connectivity algorithm was used to obtain the connectivity region of each fuzzy geospatial object.
2.3. Determing Semivariograms of Fuzzy Geospatial Objects
The semivariogram is an important tool for analyzing the spatial heterogeneity of regionalized variables. Previous studies have shown that the variogram provides a concise description of the scale and pattern of spatial variability, which is essential for geostatistics [
53]. In this paper, the semivariogram was used to consider the spatial heterogeneity of different geographical objects.
Unlike previous studies that calculated only the global semivariogram of images, this study calculated the semivariogram for each extracted fuzzy geospatial object. To consider the spatial characteristics of fuzzy geospatial objects at different scales, two methods were used to construct semivariograms: (1) the semivariogram construction method based on the boundary of geographic objects and (2) the semivariogram construction method for small-scale geographic objects.
2.3.1. The Semivariogram Construction Method Based on the Boundary of Fuzzy Geospatial Objects
For larger-scale fuzzy geospatial objects, there is no gradient change in the value of the MF within the geospatial object. As a result, the changing trend of the semivariogram may be affected. To address this issue, only the boundary parts with gradient changes were considered in the calculation of the semivariogram of larger-scale fuzzy geospatial objects.
To calculate the semivariogram of larger-scale fuzzy geospatial objects, we first used the slope analysis algorithm to identify the edge regions of each fuzzy geospatial object where the gradient of fuzzy MF changes. We then used the CCL to obtain a label for each individual fuzzy geospatial object. Finally, we combined these two results to obtain the MF value of each fuzzy geospatial object’s edge region, which was used to calculate its semivariogram.
Suppose
denotes the
fuzzy geospatial object in the coarse resolution MF
. The semivariogram of the geographic object of the MF can be defined as half of the variance of the value difference between the value
of the pixel
in the fuzzy geospatial object at point
and
, where
represents the lag distance between pixels. For remote sensing images, the separation distance is extracted at regular intervals, as shown in
Figure 3. In this study, a lag distance
was used when calculating the semivariogram for the boundary of each fuzzy geospatial object. The semivariogram
for each coarse resolution fuzzy geospatial object was defined as the variance of the difference in values between all pairs of membership pixels at distance
:
where
represents the coarse resolution membership map and
represents the
fuzzy geospatial object in a certain membership function.
For each fuzzy geospatial object, there are
pairs of pixel membership values in a fuzzy geospatial object, separated by the same pixel interval
. The semivariogram estimator of each geographic object at lag
is given by
This algorithm was used to calculate the semivariogram of each extracted fuzzy geospatial object, which was then downscaled using ATPK method. The quantity
is an estimate of the semivariance
[
54], which is a useful measure of the variance between spatially separated pixels [
55]. The larger the
, the less similar the pixels are.
2.3.2. The Semivariogram Construction Method for Small-Scale Fuzzy Geographical Spatial Objects
Due to the small number of pixels contained in the small-scale fuzzy geospatial objects, the semivariogram has a large uncertainty. Therefore, we studied the method of constructing the semivariogram for small-scale fuzzy geospatial objects. For small-scale fuzzy geospatial objects, there is a gradient variation in MF values due to their small number of pixels and the fact that a pixel may represent more than one type of feature. In the slope analysis of coarse resolution MF, small-scale fuzzy geospatial objects do not have regions where the gradient changes to zero, and these regions are regarded as small-scale geographic objects. The semivariogram for small-scale fuzzy geospatial objects were calculated as in Equations (4) and (5).
Since the number of pixels in small-scale geographic objects is small, we set when calculating their semivariogram.
2.4. Geospatial Object-Based Membership Function ATPK Downscaling
In this paper, we downscaled the membership maps to the target fine spatial resolution using ATPK.
ATPK refers to prediction on a support that is smaller than that of the original data [
30]. It differs from traditional centroid-based kriging, which ignores spatial support and always treats it as equivalent to observational support. ATPK specifies the supported size, spatial correlation, and PSF of the sensor [
56]. ATPK explicitly considers the size of the support and predicts variables from regional supports to points by semivariogram deconvolution, with parameterized random function models and kriging.
The principle of ATPK is as follows:
Suppose
is the membership value of the coarse resolution pixel
centered at
(
, where
is the number of pixels) for the
geographic object in membership map Z. Based on ATPK, the membership value
of all fine pixel
F of the
fine resolution geographic object is predicted by the downscaling process. For a fine pixel centered at
, the pixel value predicted by ATPK is a linear combination of the measurements of neighboring coarse pixels.
In Equation (6), is the number of coarse pixels in the neighboring system, such as the windows of pixels. is the weight for the neighbor centered at . As is already known, the estimation of requires the estimation of , and this can be solved by the following kriging matrix. Thus, the spatial correlation between coarse pixels is accounted for in ATPK.
The task becomes the estimation of weights
in Equation (7). They are calculated by minimizing the prediction error variance, and the corresponding ordinary kriging system is [
57,
58]:
where
is the coarse-to-coarse semivariogram calculated by the point support covariance between coarse pixels located at
and
for coarse resolution geographic object
,
is the fine-to-coarse semivariogram between fine and coarse pixels centered at
and
, and
is the Lagrange multiplier [
59].
The PSF is used to transform fine spatial resolution data into coarse spatial resolution. Let
denote the Euclidean distance between the centroids of any two pixels, and
represents the fine-to-fine semivariogram between two fine pixels. Suppose
is the point spread function (PSF) of the satellite sensor. The fine-to-coarse semivariogram
and the coarse-to-coarse semivariogram
in Equation (7) are calculated by convolving
with the PFS, i.e.,
where
is the convolution operator and
represents the distance between point A within a pixel and point B within another pixel, which is denoted as
to indicate that it is the opposite direction of the distance from point B to point A (which is denoted as
s).
Assuming that the pixel value is the average of the fine pixel values within it, the PSF is defined as follows:
where
is the spatial support of pixel
C centered at
and
is the surface/size of pixel
. Using the PSF defined in Equation (10), the calculation in Equations (8) and (9) can be further simplified:
In Equations (11) and (12), is the pixel size ratio (zoom factor) between the coarse and fine pixels and is the distance between the centroid of fine pixel and the centroid of any fine pixel within the coarse pixel centered at .
The fine-to-fine semivariogram
in Equations (11) and (12) is derived by the deconvolution (also called deregularization in geostatistics) of each coarse resolution geospatial object semivariogram, denoted by
in Equation (5). In semivariogram modeling, the fitting function is typically characterized by three parameters: nugget, sill, and range. However, the computational cost increases linearly with the number of nugget candidates. To ease the computational burden, we adopted the assumption of Atkinson et al. [
60] and Pardo-Igúzquiza et al. [
15,
61] in this paper: there is zero nugget effect (absence of local noise) in the fine-to-fine semivariogram, which refers to the spatial autocorrelation between observations that are very close together. To select the punctual sill and range intervals, we set the interval for punctual sill selection between 1 and 3 times that of the areal sill, and the interval for punctual range selection was set between 0.5 and 2.5 times that of the areal range. The step was set to 0.1.
The above methodology differs from the previous downscaling method that calculates a semivariogram for the entire remote sensing image. In this study, the membership map was used to extract each geospatial object unit and downscale each unit using its unique semivariogram. Compared to traditional interpolation methods, geostatistical methods can take into account the shape and intrinsic spatial correlation of data. Additionally, ATPK explicitly considers the size of the support and predicts variables from regional supports to points by semivariogram deconvolution, with parameterized random function models and kriging. Therefore, when downscaling using the ATPK method, the semivariogram of each geographic object can be used to obtain more accurate downscaling results. By using the ATPK method to downscale remote sensing images and combining it with the semivariogram of each fuzzy geospatial object, it is possible to obtain more accurate downscaling results.
2.5. Proof of Methodological Validity
The semivariogram is an important tool for analyzing the spatial heterogeneity of regionalized variables. In this paper, we used standardized image with 512 × 512 pixels to verify the effect of the semivariogram on the ATPK downscaling method and the effectiveness of the proposed method. The original 512 × 512 resolution image was used as the reference image for accuracy verification, and it was upscaled by a zoom factor of S = 2 to produce a coarse image with 256 × 256 resolution. Unlike geospatial object extraction, for grayscale images, we segmented the Lena image into 19 objects based on grayscale values, as shown in
Figure 4b.
To obtain the downscaling result, the semivariogram of each image object was computed using the method presented in this paper. Then, the semivariogram of each object was calculated using the small-scale object method. Finally, the ATPK method was used to downscale each object on its semivariogram, resulting in the downscaling of each image object. The effectiveness of the proposed method was verified using six classical downscaling methods, namely Bilinear, Cubic, ICBI (Iteractive Curvature Based Interpolation) [
62], INEDI (Improved New Edge-Directed Interpolation) [
63], RBF (Radial Basis Function Interpolation) [
23], and the classical ATPK downscaling methods based on global semivariogram. Four evaluation indicators were used, namely root-mean-square error (RMSE), correlation coefficient (CC), peak signal-to-noise ratio (PSNR), and universal image quality index (UIQI) [
64]. In addition to these evaluation indices, the proposed method was also evaluated using the reduction in remaining error (RRE) index, which measures the improvement over other methods. The RRE is calculated as follows:
where
and
are the mean squared error (or remaining errors) of the compared downscaling method and the method proposed, respectively. The RRE index indicates the percentage of reduction in error achieved by the proposed method compared to other methods.
Figure 4 presents the outcomes of the different downscaling algorithms that were employed to process the standardized image. During the experiments, we partitioned the Lena image into 19 objects by considering their gray values, in order to draw a comparison with the fuzzy geospatial objects. For each segmented object, a semivariogram was computed and utilized in the ATPK downscaling approach.
From an overall perspective, the method proposed in this paper can better downscale coarse resolution images.
Table 1 shows the accuracy evaluation results of various downscaling methods. Compared to Bilinear, Cubic, ICBI, INEDI, RBF, and the classical ATPK, the RMSE value increased by 22.46%, 22.46%, 40.22%, 39.20%, 11.93%, and 4.46%, respectively. It is clear that the method proposed in this paper is also effective for downscaling ordinary grayscale images.
3. Experiments
To verify the validity of this study, a total of two groups of experiments were set up. For the simulation experiment, the Worldview-2 MF images were used, while Sentinel-2 membership images were used for the real data experiment. In both groups of experiments, the results of downscaling the semivariogram for the entire region were compared and analyzed with the results of downscaling the semivariogram for geographic objects. The accuracy of both sets of experiments was evaluated using Worldview-2 2 m resolution membership images.
3.1. Study Area
This research was performed in the Beidagang Wetland core area, located in southern Tianjin city, China, where the basic elements are mainly vegetation and water bodies, as shown in
Figure 5. Due to sea level regression and historical river impacts, the soil salinity is relatively high, the predominant plant species are Phragmites australis, Typha angustifolia, and Suaeda salsa. The water bodies include tidal flats, swamps, and rivers, and Suaeda salsa is classified separately as it appears as a red hue in the RGB composite color images, unlike other green vegetation. The land cover classification is divided into five types: water body, bare land, vegetation, Suaeda salsa, and withered vegetation. Wetlands have intertwined and fuzzy features with no clear boundary between vegetation and water bodies. Therefore, accurate boundary distinction of wetland land cover types is a meaningful method to study.
3.2. Study Data
3.2.1. Worldview-2 Dataset
The Worldview-2 image used in this study contains eight multispectral bands (coastal blue, blue, green, yellow, red, red-edge, near-infrared 1 (NIR1), and near-infrared 2 (NIR2)) with a resolution of 1.8 m. It was resampled to 2 m resolution for analysis. The Worldview-2 dataset covers the Beidagang Wetland core area of Tianjin province, China, and was acquired on 16 September 2020. Radiometric and atmospheric corrections were performed using the ENVI 5.3 software.
3.2.2. Sentinel-2 Dataset
Sentinel-2 image comprises 13 spectral channels in the visible, near-infrared, and short-wave infrared parts of the spectrum. Among these 13 bands, the spatial resolution of the visible bands 2–4 and NIR band 8 (spectral region ranging from 0.705 to 0.885 µm) is 10 m. For this study, we used the 10 m resolution bands. The high-resolution multispectral image from the Sentinel-2 satellite was acquired on 16 September 2020 and was free of clouds.
3.3. Experiment Setup
In the simulation experiments, the Worldview-2 dataset was upscaled by a zoom factor of S = 2 to produce a coarse image with 4 m resolution. The original 2 m MFs were used as the reference image. These MFs also served as a reference image for the Sentinel-2 image downscaled to 2 m. Four indices were used to quantitatively evaluate the performance of the downscaling methods in both the simulation and real experiments. These indices included RMSE, CC, PSNR, and UIQI.
Six classical downscaling methods were used for comparison against the method proposed in this study in both the simulation and real experiments. These methods included Bilinear, Cubic, ICBI, INEDI, RBF, and the classical ATPK downscaling methods based on global semivariogram.
3.4. Simulation Experiment
The study area is covered mainly by water bodies, bare land, vegetation, Suaeda salsa, and withered vegetation. Therefore, in the experiment, the membership maps of these five types of ground cover (i.e., water bodies, bare land, vegetation, Suaeda salsa, and withered vegetation) were downscaled.
Figure 6 shows the MFs for the five land cover types, which were zoomed to a resolution of 4 m by the Worldview-2 2 m resolution MFs. As can be seen from the membership maps, the boundaries between the different land cover classes are not well-defined and are therefore difficult to distinguish, which necessitates the use of downscaling techniques. For the downscaling of the five coarse images in this experiment, the zoom factor was set to S = 2 to restore the fine spatial resolution images. The scaling factor S = 2 was chosen so that the original 2 m resolution membership maps could be used for accuracy verification after downscaling. The six downscaling methods (i.e., Bilinear, Cubic, ICBI, INEDI, RBF, and the classical ATPK) were used to evaluate the performance of the method proposed in this study.
To better illustrate the effectiveness of the method proposed in this study, we selected five sub-areas within the study area to display the membership images of different types of ground objects, as shown in
Figure 7.
Figure 8 displays the results of the various downscaling algorithms applied to the Worldview-2 dataset. The visual comparison of the seven downscaling results in each membership map in
Figure 8 indicates that the proposed method is suitable for effectively downscaling coarse images.
Moreover, the proposed method provides the most satisfactory downscaling results among the seven methods. Specifically, in the case of the water body membership image shown in
Figure 8, the proposed method and the downscaling method based on semivariogram computation for different fuzzy geospatial objects presented in this paper resulted in a smaller MF value for the water body boundary, indicating that the gradient value between the fuzzy geospatial objects is larger. This result demonstrates that the proposed method can identify the boundary area of ground objects better.
Figure 8 shows that, when compared to other methods such as Bilinear, Cubic, ICBI, INDEI, and RBF, the ATPK downscaling method based on the semivariogram of the entire image yields a smaller boundary membership value. However, the boundary membership value is still slightly larger than the downscaling method proposed in this study, which is based on the semivariogram of fuzzy geospatial objects. Furthermore, the downscaling result based on the semivariogram of the whole image may produce a MF value of less than 1 in the inner edge area of the water body, which is not observed in the downscaling result based on the semivariogram of the fuzzy geospatial object. Additionally, the other methods cause the fuzzy geographical object boundary to have a small change gradient after downscaling and its boundary expands outward to varying degrees. In contrast, the method proposed in this study, accurately extracts the boundary range of small water bodies by compressing their internal range and preventing outward expansion of their boundaries.
Table 2 presents the quantitative assessment of seven downscaling methods based on various evaluation indices, including RMSE, CC, PSNR, and UIQI indices. The optimal results for each land type are shown in bold. In addition to these evaluation indices, the proposed method was also evaluated using the reduction in remaining error (RRE) index, which measures the improvement over other methods. The RRE is calculated as follows:
where
and
are the mean squared error (or remaining errors) of the compared downscaling method and the method proposed, respectively. The RRE index indicates the percentage of reduction in error achieved by the proposed method compared to other methods.
According to the visual comparison, both geostatistical approaches, the ATPK downscaling method based on global semivariogram and ATPK downscaling method based on fuzzy geospatial object semivariogram, outperform the other five methods in terms of RMSE, CC, PSNR, and UIQI for all MF images. Only the CC index of vegetation in the ATPK downscaling method is slightly better than the method proposed in this paper, while all other accuracy evaluation indexes are the most effective in the method proposed in this paper. Comparing the method proposed in this study to Bilinear, Cubic, ICBI, INEDI, RBF, and ATPK, the RMSE value increased by 23.66%, 22.30%, 27.36%, 26.82%, 12.76%, and 3.43%, respectively. Additionally, the UIQI value increased by 25.59%, 22.42%, 25.61%, 25.45%, 21.13%, and 15.73%, respectively, demonstrating the superiority of the method proposed in this study for downscaling the Worldview-2 dataset.
3.5. Real Experiment
To verify the effectiveness of the method described in this paper on real data, an experiment was conducted to downscale the 10 m membership images of Sentinel-2 to 2 m in the same experimental area. The accuracy of the downscaled images was then verified using Worldview-2 membership images.
Figure 9 shows a 10 m resolution image of Sentinel-2 without downscaling. Five sub-areas were selected from the study area to display MF images of different types of ground objects, as shown in
Figure 10. For this experiment, the Sentinel-2 MF images were downscaled using a zoom factor of S = 5 to reduce the fine spatial resolution images. By setting the scaling factor S = 5, the Worldview-2 2 m resolution membership maps could be used to verify accuracy after downscaling.
Figure 11 shows the downscaling results using Bilinear, Cubic, ICBI, INEDI, RBF, the classical ATPK, and the proposed method.
Table 3 reports the accuracy assessment in terms of RMSE, CC, PSNR, and UIQI indices for the seven downscaling methods of coarse membership images for Sentinel-2.
It is important to note that, before downscaling, the pixels of the MF images with a of 10 m resolution could not accurately fit the boundary of the fuzzy geospatial object, resulting in a rough outline. However, after downscaling, the edge of the membership images with 2 m resolution more closely fits the fuzzy geospatial object boundary, resulting in a smoother outline of the object.
Although the Bilinear, Cubic, INEDI, and ICBI methods can also achieve better downscaling visual effects compared to the proposed methods, the boundary range of the ground object will spread outward by these methods, making it difficult to obtain accurate fuzzy geospatial object boundaries. The RBF downscaling method cannot produce a smoother boundary when downscaling the MF images with a large zoom factor (i.e., S = 5), resulting in a poor visual effect. The classic ATPK downscaling method can lead to the phenomenon that the MF value of the internal pixel whose MF value should be 1 of the fuzzy geospatial object is less than 1 in the downscaling result. Moreover, for small-scale fuzzy geospatial objects, this method caused the coverage area of fuzzy geospatial objects to shrink. Compared with other methods, the method proposed in this paper can more accurately downscale the coarse-resolution membership images and obtain a downscaling result that is closer to the boundary of the actual fuzzy geospatial object.
Table 3 presents a quantitative assessment of the seven downscaling methods in terms of RMSE, CC, PSNR, and UIQI indices for the Sentinel-2 membership images downscaled with a zoom factor of S = 5 in comparison with the Worldview-2 experiment. The accuracy results of each downscaling method exhibit varying degrees of decline. However, the proposed method for downscaling in this paper achieved the best results in all accuracy evaluation indicators, especially UIQI. Compared to the UIQI value of the proposed method, the UIQI values of the Bilinear, Cubic, ICBI, INEDI, RBF, and the classic ATPK methods increased by 45.82%, 52.12%, 45.78%, 46.58%, 52.24%, and 49.14%, respectively. This demonstrates the superiority of the proposed method for downscaling larger-scale resolution images.