Next Article in Journal
An Optimization Method for Multi-Functional Radar Network Deployment in Complex Regions
Previous Article in Journal
Physical and AI-Based Algorithms for Retrieving Cloud Liquid Water and Total Precipitable Water from Microwave Observation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Spatial Estimation of Soil Organic Matter and Total Nitrogen by Fusing Field Vis–NIR Spectroscopy and Multispectral Remote Sensing Data

by
Dongyun Xu
1,2,
Songchao Chen
3,
Yin Zhou
4,
Wenjun Ji
5 and
Zhou Shi
2,6,*
1
College of Resources and Environment, Shandong Agricultural University, Taian 271000, China
2
Institute of Agricultural Remote Sensing and Information Technology Application, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou 310058, China
3
ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou 311200, China
4
School of Public Management, Zhejiang University of Finance and Economics, Hangzhou 310018, China
5
College of Land Science and Technology, China Agricultural University, Beijing 100193, China
6
Key Laboratory of Spectroscopy Sensing, Ministry of Agriculture, Hangzhou 310058, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2025, 17(4), 729; https://doi.org/10.3390/rs17040729
Submission received: 27 November 2024 / Revised: 14 February 2025 / Accepted: 17 February 2025 / Published: 19 February 2025
(This article belongs to the Section Remote Sensing in Agriculture and Vegetation)

Abstract

:
Accurate and timely acquisition of soil information is crucial for precision agriculture, food security, and environmental protection. Proximal visible near-infrared reflectance (vis–NIR) spectroscopy has been widely employed for rapid and accurate soil measurement, but its point measurement nature limits its direct applicability for large-scale soil surveys. On the other hand, remote sensing techniques can provide soil information at a larger scale, but their resolution is relatively coarse. While both techniques have been used independently for soil analyses, integrating vis–NIR spectroscopy with remote sensing remains a challenge and is underexplored, especially at the field scale. This study addresses this gap by combining field vis–NIR spectra with Gaofen-1 remote sensing data to spatially analyze soil organic matter and total nitrogen at the field scale. Unlike previous work, we first applied Gaofen-1 data and 10 derived spectral indices to estimate soil organic matter and total nitrogen using partial least squares regression and random forest, identifying the optimal combination of spectral indices. Then, we integrated the proximal vis–NIR spectra with this optimal spectral index combination for improved soil property estimation. This integration advanced existing methodologies by leveraging the high spatial resolution of Gaofen-1 data and the detailed spectral information from vis–NIR spectroscopy. The results showed the following: (1) the coefficient of variation across different crop growth stages of Gaofen-1 data was more crucial for modeling these two properties compared to bare soil Gaofen-1 data; (2) integrating proximal vis–NIR spectra with Gaofen-1 data improved model performance, yielding Lin’s concordance correlation coefficient ( ρ c ) values of 0.63 and 0.72 and ratios of performance to interquartile distance (RPIQ) of 1.99 and 1.59 for soil organic matter and total nitrogen, respectively; and (3) the combined use of vis–NIR spectra and Gaofen-1 data provided higher spatial estimation accuracy (R2 of 0.68 and 0.57 for soil organic matter and total nitrogen) compared to ordinary kriging (R2 of 0.63 and 0.31 for soil organic matter and total nitrogen). These results demonstrate that the synergistic use of remote sensing and proximal soil sensing is a practical approach for spatially estimating soil organic matter and total nitrogen at the field scale.

1. Introduction

Soil organic matter (SOM) and total nitrogen (TN) are critical soil fertility indicators. SOM improves soil structure, the availability of soil phosphorus and potassium, and soil water retention. Thus, it is of great significance to soil fertility, crop growth, and sustainable agricultural systems [1]. TN is the crucial nutrient element for crop growth and is necessary for maintaining a sufficient soil nitrogen supply to ensure the healthy growth of crops [2]. However, the excessive application of nitrogen fertilizer causes soil degradation (i.e., soil acidification) and a series of environmental problems, such as N leaching in groundwater and emissions of nitrous oxide [3]. Therefore, it is vital to acquire the spatial distribution of SOM and TN in a timely manner and as accurately as possible for precision agriculture and environmental protection.
Visible near-infrared reflectance (vis–NIR) spectroscopy, as a rapid, non-destructive, cost-effective proximal soil sensing technique, has been widely used in fast soil measurement [4,5,6,7,8]. Currently, vis–NIR has been successfully used to estimate soil physical, chemical, and biological properties, such as SOM, soil organic carbon (SOC), particle size fractions (i.e., clay, silt, and sand), iron and aluminum oxides, and TN [9,10,11,12]. SOM/SOC, clay, and TN are the most successfully estimated properties due to corresponding spectral absorption characteristics in the vis–NIR spectral region [13]. For example, the absorption characteristics of SOM in the near-infrared region are affected by stretching vibration and angular vibration of groups of C–H, N–H, O–H, C–O, C–N, N–O, C–C, and C=O [14]. The band characteristics near 1100 nm, 1600 nm, 700~1800 nm, 2000 nm, and 2200~2400 nm are mainly related to the SOC and TN contents [4,15,16]. Although vis–NIR can achieve rapid and accurate estimation of SOM and TN, its measurement is still based on point samples. Therefore, we cannot acquire spatial information on soil properties without spatial predictive models.
Remote sensing technology has the advantages of wide coverage, strong timeliness, and rapid information acquisition. Thus, it is widely used in large-area ground information acquisition, soil monitoring, and digital soil mapping [17,18]. Alabbas et al. (1972) conducted aerial experiments to study the relationship between soil organic matter and clay content with visible and near-infrared reflectance [19]. Since then, researchers have carried out a series of studies using remote sensing for spatial estimation of soil attributes, especially for measuring SOM/SOC and soil texture. For example, Chen et al. estimated SOC concentrations in a 115 ha field using remote sensing imagery, establishing a logarithmic linear relationship (R2 = 0.93) between SOC and RGB band intensities [20]. Fox et al. developed a soil line Euclidean distance technique using red and near-infrared band intensities to map SOM, demonstrating its effectiveness in two Midwest fields [21]. Early studies usually used the bands of remote sensing images and their mathematical transformations (i.e., logarithm, reciprocal) to build predictive models.
With the development of advanced sensors, the use of hyperspectral and multispectral data has significantly enhanced soil property analysis. Hyperspectral data, with their numerous narrow spectral bands, enable detailed soil property analysis, while multispectral data, with fewer but broader bands, are more suitable for efficient large-scale mapping. Both types of data play complementary roles in remote sensing, with hyperspectral data being particularly useful for detailed analysis and multispectral data for broader spatial coverage [22,23]. Especially since the launch of Sentinel-2 in 2015, its broad spectral range, high spatial resolution (10–60 m), and short revisit time (5 days) have greatly improved the spatial estimation of soil properties by enabling frequent and detailed observations of soil reflectance characteristics [22,24,25,26,27]. However, soil property prediction based on remote sensing data with moderate spatial resolution, such as Sentinel-2, has primarily focused on large-scale areas or has used remote sensing data as auxiliary variables in soil mapping studies. For example, Peng et al. used Sentinel-1, Sentinel-2, and Landsat-8 along with DEM data and machine learning methods for SOM mapping in hilly and mountainous regions, and their results showed that Sentinel-2 data have a significant impact on cropland SOM prediction in hilly and mountainous areas [28]. Ji et al. utilized Sentinel-2 remote sensing data as predictors for predicting SOC content at varying spatial resolutions in Germany, demonstrating that hybrid models, particularly at 100 m resolution, provided more accurate predictions for national-scale SOC monitoring [29]. These studies often leverage the broad spatial coverage of medium-resolution sensors to address large-area soil monitoring challenges. However, their application in small-scale or high-resolution studies remains limited due to spatial resolution constraints and the influence of vegetation cover. The Chinese Gaofen-1 satellite (GF-1), with its high-resolution multispectral sensors (spatial resolution of 2 m for panchromatic and 8 m for multispectral) and frequent revisit cycle (4 days), offers a valuable data source for soil property estimation and provides high-spatial-resolution soil data for precision agriculture and farmland management.
Although remote sensing has the advantages of rapid data acquisition, extensive area monitoring, and data simplification [25,30], its application in modeling the spatial distribution of soil properties is limited by spatial resolution and plant disturbance. The proximal soil sensing through vis–NIR provides accurate soil property estimation but lacks the capability for large-area monitoring. Previous research has primarily focused on individual approaches for soil measuring and mapping, such as solely relying on vis–NIR spectra or remote sensing data. However, there is a lack of comprehensive research exploring the synergistic utilization of these two data sources for spatial estimation of soil properties.
To the best of our knowledge, few studies have investigated the combined use of field vis–NIR spectroscopy and remote sensing data, particularly applying this approach to spatially estimate soil attributes at field scale. By integrating the high-resolution information derived from field vis–NIR spectroscopy and the wide coverage and timeliness offered by remote sensing data, we aim to introduce a novel approach that overcomes the limitations of individual techniques and provides accurate and extensive area monitoring. Therefore, in this study, we directly integrated in situ proximal vis–NIR spectroscopy with remote sensing data (GF-1) for spatial estimation of SOM and TN. Unlike traditional approaches that use remote sensing data as auxiliary variables in predictive models, our method establishes a direct fusion framework, leveraging the high spectral resolution of vis–NIR for precise local measurements while utilizing GF-1’s broad spatial coverage for large-area mapping. By synergistically combining these complementary datasets and employing machine learning algorithms, we aim to enhance both the spatial continuity and estimation accuracy of SOM and TN, addressing the limitations of single-source data.
The main objectives are as follows: (i) to compare the model performance based on bare soil GF-1 data and multi-temporal GF-1data in different crop growth stages using linear and nonlinear models, (ii) to compare the model performance using GF-1 data and the combination of GF-1 data and field vis–NIR spectra data, and (iii) to optimize the best spatial predictive model for digital soil mapping.
This paper is structured as follows: firstly, we introduce the materials and methods of this study, including the study area, sampling methods, physicochemical and spectral testing, and modeling approaches. Then, we present the main research results, focusing on the prediction results based on GF-1 remote sensing data, the prediction results from the fusion of GF-1 and field vis–NIR data, and a comparison between the two. We also cover the spatial estimation results based on the fusion model. Then, we discuss and analyze the main findings, comparing them with those of previous studies. The final section summarizes the key conclusions of this research. The study process is shown in Figure 1.

2. Materials and Methods

2.1. Study Area and Soil Sampling

The study area (114°0′50″–114°1′15″E, 34°6′30″–34°6′40″N) is located in Xuchang, Henan Province, a critical food production region (i.e., wheat, corn) in China (Figure 2). This region has a warm temperate continental monsoon climate with an annual rainfall of between 671.1 mm and 736.0 mm and a mean annual temperature of 14.6 °C. The predominant soil type in the study area is fluvo-aquic soil, classified as Calcaric Cambisols under the World Reference Base for Soil Resources (WRB). The soil is characterized by a loam texture, which contributes to its favorable water retention capacity and permeability properties. In this study, 240 topsoil samples (0–20 cm) were collected in October, 2019, before wheat planting based on grid sampling with a spacing of 20 m using a cuboid sampling tool (10 × 10 × 20 cm). After removing stones and roots, all the soil samples were air-dried, ground, and sieved to less than 2 mm in the laboratory for further analysis [13].

2.2. Data Acquisition and Treatment

2.2.1. Chemical Analysis and Spectra Measurement

SOM and TN were analyzed at Zhejiang Academy of Agricultural Sciences (located in Hangzhou, China), a nationally certified laboratory, using the H2SO4–K2Cr2O7 oxidation method for SOM and the semi-micro Macro Kjeldahl method for TN, following standard analytical protocols. The vis–NIR spectra were measured in field and laboratory conditions using a Fieldspec® ProFR spectrometer and a high-intensity contact probe with its own light source (Malvern Panalytical Ltd., Malvern, UK). In field conditions, three vertical locations (i.e., bottom, middle, and top) were selected for spectral measurement from each soil sample (10 × 10 × 20 cm). Ten spectra were scanned at each selected location, and a total of 30 spectra were averaged to represent the final spectra of each soil sample. To maintain the quality of the data, we calibrated the spectrometer using a Spectralon panel with 99% reflectance every 10 soil samples. In laboratory conditions, the spectra were measured on the air-dried, ground, and sieved soil samples in a petri dish. Three points were selected for measurement, with the final results being the average of the results of the three points.

2.2.2. Spectra Preprocessing

The vis–NIR spectra were first reduced to 400–2450 nm to eliminate the influence of noise at the edge and then down-sampled to a resolution of 10 nm to improve the calculation efficiency and reduce collinearity. Then, the reflectance spectra (R) were transformed into absorbance with log(1/R) and smoothed using the Savitzky–Golay algorithm (SG, a window size of 15 and polynomial of order 2) to further reduce the noise and enhance the signal [31]. Direct standardization (DS) [32] was used to transform field spectra into laboratory spectra to reduce the influence of water and other environmental factors. After spectral preprocessing, principal component analysis (PCA) was applied for dimensionality reduction [33]. The first four principal components were then interpolated onto the study area using ordinary kriging in ArcGIS. All these processes were conducted in R 4.2.1 [34] and Matlab (The MathWorks Inc., Natick, MA, USA).

2.2.3. Remote Sensing Data Acquisition

The remote sensing data used in this study were GF-1 satellite data (Table 1). The GF-1 satellite is the first satellite of China’s high-resolution earth observation system with high spatial and temporal resolution. It has been widely used in land monitoring, disaster management, environmental protection, and various other areas. We collected four periods of GF-1 data on different growth stages of wheat (December 27, February 21, and March 14) and the bare soil (October 17).

2.2.4. Remote Sensing Data Preprocessing and Spectra Indices Calculation

The preprocessing of GF-1 data included radiometric calibration, atmospheric correction, geometric correction, and fusion panchromatic and multispectral images. The resulting multispectral data with a spatial resolution of 2 m were used in this study. The preprocessing was conducted in ENVI 5.3.
We calculated the spectral indices (Table 2), which mainly included vegetation indices closely related to soil organic matter and some indices related to soil brightness [25,30]. Moreover, we calculated the change rate (SIr) and coefficient of variation (SIcv) of spectral indices in different periods as the variables for soil estimation. The spectra indices were calculated using the ‘rgdal’ package [35] in R 4.2.1.
S I r = S I t 2 S I t 1 S I t 1
S I c v = s d ( S I ) m e a n ( S I )
where SIr represents the change rate of spectra indices in different periods,   S I t 1 and S I t 2 represent the spectra indices in different periods, and SIcv represents the coefficient of variation in spectra indices for different periods.

2.3. Estimation Model

In this study, we used partial least squares regression (PLSR) and random forest (RF) for spatial estimation of SOM and TN. Partial least squares regression (PLSR) is one of the most commonly used multiple linear calibration algorithms for spectral calibration and prediction [45]. It has the advantage of eliminating the problem of multiple collinearities of the independent variables [46].
Random forest (RF) is a combined tree prediction method [47] that contains a series of random classification regression tree models. The final regression or classification was achieved by assembling these tree models. RF does not need to screen variables and can deal with classified and continuous variables simultaneously. Compared with other models, RF can produce lower estimation deviation and variance. In the PLSR model, the main parameter is the number of components (ncomp). In RF, the main parameters include the number of trees (ntree), the minimum number of terminal nodes (node size), and the number of covariates that are randomly selected at each tree (mtry) [47,48]. All of these parameters were optimized through 10-fold cross-validation using “caret” package in R 4.2.1.
We also used ordinary kriging interpolation (OK) for spatial analysis of SOM and TN and compared the results with the best model based on the combination of GF-1 data and field vis–NIR spectra data.

2.4. Model Development and Performance Evaluation

Before model construction, we randomly split the data into calibration (two-thirds) and validation (one-third) sets using a random sampling method. To improve the model’s robustness and quantify the prediction uncertainty, we used 50 bootstraps on the calibration data [49].
Lin’s concordance correlation coefficient ( ρ c ) [50], the coefficient of determination (R2), root mean square error (RMSE), the ratio of performance to interquartile distance (RPIQ), the standard deviation of the error (SDE), and the mean error (ME) were used to evaluate and compare the performance of the models. Additionally, the estimation uncertainty was calculated using their 90% confidence intervals.
S D E = I = 1 n ( y ^ i y ¯ ) 2 ( n 1 )
M E = i = 1 n ( y ^ i y ¯ ) n
U j = S u p p e r , j S l o w e r , j S ¯ j
where y ^ is the predicted value of soil properties;   y ¯ is the mean value of the measured value of soil properties, n represents the number of soil samples; U j represents the uncertainty of the j-th grid of the spatial estimation; S u p p e r , j and S l o w e r , j are the upper confidence interval and lower confidence interval of the j-th grid, respectively; and S ¯ j is the mean value of 50 bootstraps of the j-th grid.

3. Results

3.1. Descriptive Statistics of SOM and TN

The statistics of SOM and TN are shown in Figure 3 and Table 3. The SOM ranged from 9.10 to 24.10 g/kg, and its mean value was approximate to the median value. The variance of SOM was small, with a coefficient of variance (CV) value of 14.48%. As for TN, the range was from 0.82 to 1.74 g/kg, and its variance was also slight, with a CV of 10.54%. From the violin plots of SOM and TN, we found that the calibration and validation datasets were similar to that of the whole dataset, indicating that the data division was representative.

3.2. Estimation of SOM and TN Using Remote Sensing Data

In this study, we selected the GF-1 data of bare soil, which closely matched the sampling data, along with three periods of GF-1 data at the different growth stages of wheat to estimate SOM and TN. We compared the model performance of SOM and TN based on the bands and spectra indices of four periods and the change rate (SIr) and coefficient of variation (SIcv) of the bands and spectral indices in different periods using a PLSR and RF model. The model performance based on bare soil GF-1 data was not good, which may result from the non-tillage system in the study area (Figure 4). Although the crop harvest was completed in October, the soil surface was still covered with corn straw, which affected the acquisition of bare soil image data. Compared to the estimation results of SIr and SIcv of different periods, the best model performance was achieved in the model based on the SIcv of data from four periods. This result indicates that the growth difference between different growth stages of crops plays an important role in estimating soil properties.
According to ρ c , RMSE, and RPIQ, the RF models had better model performance than the PLSR models in estimating SOM and TN (Figure 4). When considering SDE, the RF models were better than the PLSR models, while the overfitting problem of RF was greater than that of the PLSR model (Table 4). The estimation result of SOM was acceptable, with a ρ c of 0.53 and RPIQ of 1.79, which showed that GF-1 data could achieve quantitative analysis for SOM. As for TN, the estimation using GF-1data was poor, with a ρ c of 0.31 and RPIQ of 1.20. Fertilization management is necessary during the growing process of wheat, and the content of nitrogen is greatly affected by fertilization management. Therefore, it is difficult to estimate TN using remote sensing data.

3.3. Field Spectra Transformation and PCA Analysis

The field spectra fluctuate considerably compared with the laboratory spectra, especially around 1400 nm and 1900 nm (Figure 5a). We used DS for the transformation from field to laboratory spectra, which has been proven to be an effective method for reducing the effect of environmental factors [12]. Before DS transformation, we used KS [51] to select the representative transformation samples from field spectra, and the result showed that the best number of transformation samples was 55 in this study (Figure 6). The DS-transferred spectra were similar to laboratory spectra, and their absolute differences were generally lower than 0.1, which indicates good transformation (Figure 5b).
The DS-transferred spectra were then analyzed using PCA. The PCA analysis showed that the cumulative contribution of the first four principal components reached 99%, of which the contributions of the first two principal components were larger, which were 64.55% and 30.93%, respectively (Table 5). We reserved the first four principal components for subsequent analysis to maintain more spectral information and improve the estimation accuracy.

3.4. Estimation of SOM and TN Through the Synergistic Use of Field Spectra and GF-1 Data

We combined GF-1 data and the first four PCs of field spectra for the spatial estimation of SOM and TN using the PLSR and RF model (Table 6). The estimation accuracies of SOM and TN were generally improved following the synergistic use of the above two techniques (Table 4 and Table 6). The ρ c of SOM improved from 0.53 to 0.72, RMSE decreased from 1.98 g/kg to 1.78 g/kg, RPIQ increased from 1.79 to 1.99, and SDE was also reduced. As for TN, the ρ c improved from 0.31 to 0.72; RMSE and SDE decreased from 0.14 g/kg and 0.12 to 0.11 g/kg and 0.11, respectively; and RPIQ increased from 1.20 to 1.59. Compared with the PLSR and RF models, we found that the RF model was better than the PLSR model for SOM considering all the five model evaluation indexes as well as the model uncertainty. Additionally, the PLSR model was better than the RF model for TN, while the model uncertainty was larger than that of the RF model. Generally, the SOM and TN estimations were improved and could be successfully estimated through the synergistic use of proximal soil sensing and remote sensing.
We analyzed the significance of each variable for SOM and TN (Figure 7). In general, the PCs of field spectra played an essential role in estimating SOM and TN. This explains why the estimation accuracy was vastly improved after adding field spectra in the model (Table 3 and Table 4). PC1 and PC3 of field spectra, B4, NDVI, EVI, SAVI, GNDVI, and V of GF-1 were found to be significant, all exceeding 40% for SOM. As for TN, the significance of GF-1 data was relatively low, around 20%. The PCs of field spectra were found to be highly significant, among which PC1, PC3, and PC4 were the main driving factors, especially PC3 (above 90%).

3.5. Spatial Analysis of SOM and TN

We used the best predictive models of SOM and TN for spatial analysis in the study area and compared the results with the ordinary kriging interpolation (OK) of SOM and TN (Figure 8). The spatial estimation based on the synergistic use of remote sensing and proximal soil sensing had a similar distribution trend with the ordinary kriging interpolation, and it had higher estimation accuracy than the latter (Figure 9). Moreover, it expressed more detailed information than ordinary kriging interpolation (Figure 8). The distribution of SOM was generally low in the west and high in the east, which is roughly a block distribution in the north–south direction. The area with the highest SOM content (SOM > 20.00 g/kg) was located in the southeast of the field, and the area with the lowest SOM content (SOM < 14.27 g/kg) was located in the west and a banding area in the west-central area of the field. The TN distribution was similar to that of SOM and was also low in the west and high in the east on the whole. Less detailed information on TN was available compared to that of SOM because the GF-1 data played a less important role in the estimation model of TN (Figure 7).
The spatial estimation uncertainties of SOM and TN were expressed by calculating the ratio of the 90% confidence intervals and the corresponding average value of the point (the 90% confidence intervals were calculated using 50 bootstraps) (Figure 8). The uncertainty of SOM was relatively low, with most areas less than 0.06 and only a few small areas larger than 0.10. The estimation uncertainty of soil TN was slightly higher than SOM, with most areas of the field between 0.03 and 0.19. The region with uncertainty greater than 0.19 accounted for only a small part of the area, and the area greater than 0.37 was mainly located at the edge of the field.

4. Discussion

Remote sensing data can be analyzed using physically based or empirical methods to derive soil properties. For soil attributes, remote sensing data are mainly used for deriving soil moisture, texture, or soil salinity with acceptable accuracy [17]. For SOM and TN, the feasibility was generally low, which was consistent with the findings of our study (Figure 4 and Table 4) [17]. Moreover, remote sensing data were mainly used as secondary sources in digital soil mapping for soil attributes in large areas [52,53,54,55]. With the development of the spatial and spectral resolution of the recent multispectral satellite sensors, the application of multispectral satellite data, particularly Sentinel data, for soil property estimation has increased significantly [24,56]. While Sentinel data are not suitable for small areas like in this study, Gholizadeh et al. utilized 18 spectral indices for soil property estimation, demonstrating good performance in predicting soil organic carbon (SOC) across different locations, with a prediction RMSE of less than 0.24 and an RPD exceeding 1.60 [25]. Based on the spectral indices used by Gholizadeh et al. [25], we selected 10 indices that were compatible with our dataset. Unlike Sentinel-2, which offers 10 bands, including shortwave infrared (SWIR), the GF-1 satellite provides only 4 bands (red, green, blue, and near-infrared). As a result, we referred to these indices and used our data to determine 10 spectral indices in this study.
It has been proved by many researchers that SOM and TN can be estimated using vis–NIR spectra [4,13]. However, these estimations have mostly been made in lab conditions. In field conditions, spectra are largely affected by factors such as the water content, particle size, etc.; therefore, algorithms are needed to eliminate these effects [12,57,58]. In this study, we used DS algorithms. After transmission, the field spectra closely resembled laboratory spectra, aligning with previous studies [12]. To further analyze the mechanism of vis–NIR spectra, we analyzed the eigenvectors of the four PCs (Figure 10); the loadings of PC1 had prominent peaks around 600 nm and 1923 nm and small peaks around 1420 nm and 2214 nm, and the loadings increased in the range of 2300–2450 nm. These characteristics are mainly related to iron oxides, clay minerals, and organic matter [14,59,60,61]. Loadings around 600 nm are affected by hematite, and loadings around 1420 nm and 2214 nm are characteristic of clay minerals such as kaolinite and montmorillonite [62]. Loadings around 1420 nm and 1923 nm are also due to the OH- groups of water, and loadings in the range of 2300–2450 nm are dominated by the effect of CH3 groups in organic matter [63,64]. The characteristics of PC2 loadings were mainly observed around 1100 nm, 1400 nm, and 1914 nm. These loadings around 1100 nm are due to the aromatic cyclic hydrocarbon in organic matter. The characteristics of PC3 loadings were mainly around 480 nm, 1124 nm, 1420 nm, 1916 nm, and 2208 nm. Loadings around 480 nm were primarily affected by goethite, loadings around 2208 nm were related to kaolinite, and other characteristics were similar to the first two components. The characteristics of PC4 loadings were mainly around 640 nm, 1100 nm, 1390 nm, 1955 nm, 2212 nm, and 2287 nm. Loadings around 640 nm may be due to the effect of organic matter [33]. Loadings around 1955 nm and 2287 nm are mainly related to phenols and aliphatic compounds in organic matter.
The combined use of different sensors or data has been proven to be an effective approach for soil assessment [45,65]. Previous studies have combined vis–NIR spectra with remote sensing data through digital soil mapping, using vis–NIR spectra as a covariate [66,67]. For example, Zhou et al. combined vis–NIR spectra with remote sensing data and digital elevation model data for digital mapping of SOM [67]. Their results showed that vis–NIR spectra and vegetation factors were strongly correlated with SOM, which was similar to our study (Figure 8). In this study, we only combined vis–NIR spectra with remote sensing data directly and attained satisfactory results for soil mapping (Table 6 and Figure 10). In future work, we will consider exploring different fusing methods, temporal information, spatial information, and spectral information of proximal and remote sensing data for improving soil mapping accuracy.

5. Conclusions

In this study, we investigated the synergistic use of proximal soil sensing and remote sensing for the spatial estimation of SOM and TN. We compared the single period of GF-1 data with the change rate and coefficient of variation in different periods of GF-1 data. We found that the best estimation for SOM and TN came from the coefficient of variation in four periods of GF-1 data. Moreover, the RF model had better estimation accuracy than that of the PLSR model. We compared the combination of proximal soil sensing and remote sensing with remote sensing alone. We found that the estimations for SOM and TN based on the combination were more accurate than those using remote sensing data alone. Additionally, the best estimates for SOM and TN came from the RF model based on the combination of variation in four periods of GF-1 data and the first four PCs of field spectra. We used these models for the spatial analysis of SOM and TN and compared them with ordinary kriging interpolation results, finding that they had a similar distribution trend. Moreover, the RF models contained more detailed information and had higher estimation accuracy. Therefore, the synergistic use of proximal soil sensing and remote sensing was an effective approach for rapid spatial analysis of soil properties.

Author Contributions

Conceptualization, D.X. and Z.S.; formal analysis, D.X.; funding acquisition, D.X. and Z.S.; investigation, D.X., S.C. and Y.Z.; methodology, D.X., S.C. and W.J.; project administration, Z.S.; resources, Z.S.; software, D.X.; supervision, Z.S.; validation, Y.Z. and W.J.; visualization, D.X.; writing—original draft, D.X. and Y.Z.; writing—review and editing, S.C., W.J. and Z.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (42301065) and the Natural Science Foundation of Shandong Province (ZR2022QD122).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Johnston, A.E.; Poulton, P.R.; Coleman, K. Soil Organic Matter: Its Importance in Sustainable Agriculture and Carbon Dioxide Fluxes. Adv. Agron. 2009, 101, 1–57. [Google Scholar]
  2. Reeves, D.W. The role of soil organic matter in maintaining soil quality in continuous cropping systems. Soil Tillage Res. 1997, 43, 131–167. [Google Scholar] [CrossRef]
  3. von Braun, J.; Afsana, K.; Fresco, L.O.; Hassan, M. Food systems: Seven priorities to end hunger and protect the planet. Nature 2021, 597, 28–30. [Google Scholar] [CrossRef] [PubMed]
  4. Stenberg, B.; Viscarra Rossel, R.A.; Mouazen, A.M.; Wetterlind, J. Visible and near infrared spectroscopy in soil science. Adv. Agron. 2010, 107, 163–215. [Google Scholar]
  5. Mouazen, A.M.; Kuang, B. On-line visible and near infrared spectroscopy for in-field phosphorous management. Soil Tillage Res. 2016, 155, 471–477. [Google Scholar] [CrossRef]
  6. Xu, D.; Chen, S.; Viscarra Rossel, R.A.; Biswas, A.; Li, S.; Zhou, Y.; Shi, Z. X-ray fluorescence and visible near infrared sensor fusion for predicting soil chromium content. Geoderma 2019, 352, 61–69. [Google Scholar] [CrossRef]
  7. Xu, D.; Zhao, R.; Li, S.; Chen, S.; Jiang, Q.; Zhou, L.; Shi, Z. Multi-sensor fusion for the determination of several soil properties in the Yangtze River Delta, China. Eur. J. Soil Sci. 2019, 70, 162–173. [Google Scholar] [CrossRef]
  8. Gozukara, G.; Acar, M.; Ozlu, E.; Dengiz, O.; Hartemink, A.E.; Zhang, Y. A soil quality index using Vis-NIR and pXRF spectra of a soil profile. Catena 2022, 211, 105954. [Google Scholar] [CrossRef]
  9. Chang, C.W.; Laird, D.A.; Mausbach, M.J.; Hurburgh, C.R. Near infrared reflectance spectroscopy principal components regression analyses of soil properties. Soil Sci. Soc. Am. J. 2001, 65, 480–490. [Google Scholar] [CrossRef]
  10. Sorensen, L.K.; Dalsgaard, S. Determination of clay and other soil properties by near infrared spectroscopy. Soil Sci. Soc. Am. J. 2005, 69, 159–167. [Google Scholar] [CrossRef]
  11. Nocita, M.; Stevens, A.; Noon, C.; van Wesemael, B. Prediction of soil organic carbon for different levels of soil moisture using Vis-NIR spectroscopy. Geoderma 2013, 199, 37–42. [Google Scholar] [CrossRef]
  12. Ji, W.J.; Li, S.; Chen, S.C.; Shi, Z.; Rossel, R.A.V.; Mouazen, A.M. Prediction of soil attributes using the Chinese soil spectral library and standardized spectra recorded at field conditions. Soil Tillage Res. 2016, 155, 492–500. [Google Scholar] [CrossRef]
  13. Viscarra Rossel, R.A.; Walvoort, D.J.J.; McBratney, A.B.; Janik, L.J.; Skjemstad, J.O. Visible, near infrared, mid infrared or combined diffuse reflectance spectroscopy for simultaneous assessment of various soil properties. Geoderma 2006, 131, 59–75. [Google Scholar] [CrossRef]
  14. Shi, Z. Principle and Method of Soil Ground Hyperspectral Remote Sensing; Science Press: Beijing, China, 2014. (In Chinese) [Google Scholar]
  15. Bendor, E.; Banin, A. Near-infrared analysis as a rapid method to simultaneously evaluate several soil properties. Soil Sci. Soc. Am. J. 1995, 59, 364–372. [Google Scholar] [CrossRef]
  16. Martin, P.D.; Malley, D.F.; Manning, G.; Fuller, L. Determination of soil organic carbon and nitrogen at the field level using near-infrared spectroscopy. Can. J. Soil Sci. 2002, 82, 413–422. [Google Scholar] [CrossRef]
  17. Mulder, V.L.; de Bruin, S.; Schaepman, M.E.; Mayr, T.R. The use of remote sensing in soil and terrain mapping—A review. Geoderma 2011, 162, 1–19. [Google Scholar] [CrossRef]
  18. Grunwald, S.; Vasques, G.M.; Rivero, R.G. Fusion of soil and remote sensing data to model soil properties. Adv. Agron. 2015, 131, 1–109. [Google Scholar]
  19. Alabbas, A.H.; Swain, P.H.; Baumgardner, M.F. Relating organic-matter and clay content to multispectral radiance of soils. Soil Sci. 1972, 114, 477–485. [Google Scholar] [CrossRef]
  20. Chen, F.; Kissel, D.E.; West, L.T.; Adkins, W. Field-Scale Mapping of Surface Soil Organic Carbon Using Remotely Sensed Imagery. Soil Sci. Soc. Am. J. 2000, 64, 746–753. [Google Scholar] [CrossRef]
  21. Fox, G.A.; Sabbagh, G.J. Estimation of Soil Organic Matter from Red and Near-Infrared Remotely Sensed Data Using a Soil Line Euclidean Distance Technique. Soil Sci. Soc. Am. J. 2002, 66, 1922–1929. [Google Scholar] [CrossRef]
  22. Castaldi, F.; Palombo, A.; Santini, F.; Pascucci, S.; Pignatti, S.; Casa, R. Evaluation of the potential of the current and forthcoming multispectral and hyperspectral imagers to estimate soil texture and organic carbon. Remote Sens. Environ. 2016, 179, 54–65. [Google Scholar] [CrossRef]
  23. Angelopoulou, T.; Chabrillat, S.; Pignatti, S.; Milewski, R.; Karyotis, K.; Brell, M.; Ruhtz, T.; Bochtis, D.; Zalidis, G. Evaluation of airborne hySpex and spaceborne PRISMA hyperspectral remote sensing data for soil organic matter and carbonates estimation. Remote Sens. 2023, 15, 1106. [Google Scholar] [CrossRef]
  24. Castaldi, F.; Hueni, A.; Chabrillat, S.; Ward, K.; Buttafuoco, G.; Bomans, B.; Vreys, K.; Brell, M.; van Wesemael, B. Evaluating the capability of the Sentinel 2 data for soil organic carbon prediction in croplands. ISPRS J. Photogramm. Remote Sens. 2019, 147, 267–282. [Google Scholar] [CrossRef]
  25. Gholizadeh, A.; Žižala, D.; Saberioon, M.; Borůvka, L. Soil organic carbon and texture retrieving and mapping using proximal, airborne and Sentinel-2 spectral imaging. Remote Sens. Environ. 2018, 218, 89–103. [Google Scholar] [CrossRef]
  26. van Wesemael, B.; Chabrillat, S.; Sanz Dias, A.; Berger, M.; Szantoi, Z. Remote sensing for soil organic carbon mapping and monitoring. Remote Sensing. 2023, 15, 3464. [Google Scholar] [CrossRef]
  27. Chen, S.; Richer-de-Forges, A.C.; Mulder, V.L.; Martelet, G.; Loiseau, T.; Lehmann, S.; Arrouays, D. Digital mapping of the soil thickness of loess deposits over a calcareous bedrock in central France. Catena 2021, 198, 105062. [Google Scholar] [CrossRef]
  28. Li, P.; Wu, X.; Feng, C.; Gao, L.; Li, Q.; Xu, J.; Li, B. Assessing the potential of multi-source remote sensing data for cropland soil organic matter mapping in hilly and mountainous areas. Catena 2024, 245, 108312. [Google Scholar]
  29. Ji, X.; Purushothaman, B.; Venkatesha Prasad, R.; Aravind, P.V. Developing a digital mapping of soil organic carbon on a national scale using Sentinel-2 and hybrid models at varying spatial resolutions. Ecol. Indic. 2024, 167, 112654. [Google Scholar] [CrossRef]
  30. BenDor, E. Quantitative remote sensing of soil properties. Adv. Agron. 2002, 75, 173–243. [Google Scholar]
  31. Savitzky, A.; Golay, M.J.E. Smoothing + differentiation of data by simplified least squares procedures. Anal. Chem. 1964, 36, 1627. [Google Scholar] [CrossRef]
  32. Wang, Y.D.; Veltkamp, D.J.; Kowalski, B.R. Multivariate Instrument Standardization. Anal. Chem. 1991, 63, 2750–2756. [Google Scholar] [CrossRef]
  33. Viscarra Rossel, R.A.; Chen, C. Digitally mapping the information content of visible–near infrared spectra of surficial Australian soils. Remote Sens. Environ. 2011, 115, 1443–1455. [Google Scholar] [CrossRef]
  34. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2022; Available online: https://www.r-project.org/ (accessed on 23 June 2022).
  35. Bivand, R.; Keitt, T.; Rowlingson, B. rgdal: Bindings for the ’Geospatial’ Data Abstraction Library. R Package Version 1.5-12. 2020. Available online: https://CRAN.R-project.org/package=rgdal (accessed on 26 June 2020).
  36. Rouse, J.W.; Haas, J.R.H.; Schell, J.A.; Deering, D.W. Monitoring vegetation systems in the Great Plains witherts. In Proceedings of the 3rd ERTS Symposium, Washington, DC, USA, 10–14 December 1973. [Google Scholar]
  37. Nellis, M.D.; Briggs, J.M. Transformed vegetation index for measuring spatial variation in drought impactedbiomass on Konza Prairie, Kansas. Trans. Kans. Acad. Sci. 1992, 95, 93–99. [Google Scholar] [CrossRef]
  38. Huete, A.; Didan, K.; Miura, T.; Rodriguez, E.P.; Gao, X.; Ferreira, L.G. Overview of the radiometric and biophysical performance of the MODIS vegetation indices. Remote Sens. Environ. 2002, 83, 195–213. [Google Scholar] [CrossRef]
  39. Huete, A.R. A soil-adjusted vegetation index (SAVI). Remote Sens. Environ. 1988, 25, 295–309. [Google Scholar] [CrossRef]
  40. Gitelson, A.A.; Kaufman, Y.J.; Merzlyak, M.N. Use of a green channel in remote sensing of global vegetation from EOS-MODIS. Remote Sens. Environ. 1996, 58, 289–298. [Google Scholar] [CrossRef]
  41. Tucker, C.J. Red and photographic infrared linear combinations for monitoring vegetation. Remote Sens. Environ. 1979, 8, 127–150. [Google Scholar] [CrossRef]
  42. Escadafal, R. Remote sensing of arid soil surface color with Landsat Thematic Mapper. Adv. Space Res. 1989, 9, 159–163. [Google Scholar] [CrossRef]
  43. Pouget, M.; Madeira, J.; Le Floch, E.; Kamal, S. Caracteristiques spectrales des surfaces sableuses de la region cot&e Nord-Ouest de I’Egypte: Application aux don&es satellitaires SPOT. In 2eme JoumCes de T&detection: Caracterisation et Suivi des Milieux Terrestres en Regions Arides et Tropicales; 4–6/12/1990; ORSTOM, Collection Colloques et Seminaires: Paris, France, 1990. [Google Scholar]
  44. Jordan, C.F. Derivation of leaf-area index from quality of light on forest floor. Ecology 1969, 50, 663–666. [Google Scholar] [CrossRef]
  45. Wold, S.; Martens, H.; Wold, H. The multivariate calibration-problem in chemistry solved by the PLS method. Lect. Notes Math. 1983, 973, 286–293. [Google Scholar]
  46. Næsset, E.; Bollandsas, O.M.; Gobakken, T. Comparing regression methods in estimation of biophysical properties of forest stands from two different inventories using laser scanner data. Remote Sens. Environ. 2005, 94, 541–553. [Google Scholar] [CrossRef]
  47. Breiman, L. RandomForests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  48. Hu, B.; Xie, M.; Zhou, Y.; Chen, S.; Zhou, Y.; Ni, H.; Peng, J.; Ji, W.; Hong, Y.; Li, H.; et al. A high-resolution map of soil organic carbon in cropland of Southern China. Catena 2024, 237, 107813. [Google Scholar] [CrossRef]
  49. Xu, D.Y.; Chen, S.C.; Xu, H.Y.; Wang, N.; Zhou, Y.; Shi, Z. Data fusion for the measurement of potentially toxic elements in soil using portable spectrometers. Environ. Pollut. 2020, 263, 114649. [Google Scholar] [CrossRef] [PubMed]
  50. Lin, L.I. A Concordance Correlation Coefficient to Evaluate Reproducibility. Biometrics 1989, 45, 255–268. [Google Scholar] [CrossRef]
  51. Kennard, R.W.; Stone, L.A. Computer aided design of experiments. Technometrics 1969, 11, 137–148. [Google Scholar] [CrossRef]
  52. Liang, Z.; Chen, S.; Yang, Y.; Zhao, R.; Shi, Z.; Rossel, R.A.V. National digital soil map of organic matter in topsoil and its associated uncertainty in 1980’s China. Geoderma 2019, 335, 47–56. [Google Scholar] [CrossRef]
  53. Lamichhane, S.; Adhikari, K.; Kumar, L. Use of Multi-Seasonal Satellite Images to Predict SOC from Cultivated Lands in a Montane Ecosystem. Remote Sens. 2021, 13, 4772. [Google Scholar] [CrossRef]
  54. Chen, S.; Arrouays, D.; Mulder, V.L.; Poggio, L.; Minasny, B.; Roudier, P.; Libohova, Z.; Lagacherie, P.; Shi, Z.; Hannam, J.; et al. Digital mapping of GlobalSoilMap soil properties at a broad scale: A review. Geoderma 2022, 409, 115567. [Google Scholar] [CrossRef]
  55. Chen, S.; Chen, Z.; Zhang, X.; Luo, Z.; Schillaci, C.; Arrouays, D.; Richer-de-Forges, A.C.; Shi, Z. European topsoil bulk density and organic carbon stock database (0–20 cm) using machine-learning-based pedotransfer functions. Earth Syst. Sci. Data 2024, 16, 2367–2383. [Google Scholar] [CrossRef]
  56. Gomez, C.; Adeline, K.; Bacha, S.; Driessen, B.; Gorretta, N.; Lagacherie, P.; Roger, J.M.; Briottet, X. Sensitivity of clay content prediction to spectral configuration of VNIR/SWIR imaging data, from multispectral to hyperspectral scenarios. Remote Sens. Environ. 2018, 204, 18–30. [Google Scholar] [CrossRef]
  57. Minasny, B.; McBratney, A.B.; Bellon-Maurel, V.; Roger, J.M.; Gobrecht, A.; Ferrand, L.; Joalland, S. Removing the effect of soil moisture from NIR diffuse reflectance spectra for the prediction of soil organic carbon. Geoderma 2011, 167–168, 118–124. [Google Scholar] [CrossRef]
  58. Guerrero, C.; Stenberg, B.; Wetterlind, J.; Rossel RA, V.; Maestre, F.T.; Mouazen, A.M.; Zornoza, R.; Ruiz-Sinoga, J.D.; Kuang, B. Assessment of soil organic carbon at local scale with spiked NIR calibrations: Effects of selection and extra-weighting on the spiking subset. Eur. J. Soil Sci. 2014, 65, 248–263. [Google Scholar] [CrossRef]
  59. BenDor, E.; Inbar, Y.; Chen, Y. The reflectance spectra of organic matter in the visible near-infrared and short wave infrared region (400–2500 nm) during a controlled decomposition process. Remote Sens. Environ. 1997, 61, 1–15. [Google Scholar] [CrossRef]
  60. Viscarra Rossel, R.A.; Behrens, T. Using data mining to model and interpret soil diffuse reflectance spectra. Geoderma 2010, 158, 46–54. [Google Scholar] [CrossRef]
  61. Xu, D.; Ma, W.; Chen, S.; Jiang, Q.; He, K.; Shi, Z. Assessment of important soil properties related to Chinese Soil Taxonomy based on vis–NIR reflectance spectroscopy. Comput. Electron. Agric. 2018, 144, 1–8. [Google Scholar] [CrossRef]
  62. Clark, R.N. Spectroscopy of rocks and minerals, and principles of spectroscopy. In Remote Sensing for the Earth Sciences: Manual of Remote Sensing; Rencz, N., Ed.; John Wiley & Sons: New York, NY, USA, 1999. [Google Scholar]
  63. Demattê, J.A.M.; Horak-Terra, I.; Beirigo, R.M.; Terra, F.d.S.; Prazeres Marques, K.P.; Fongaro, C.T.; Silva, A.C.; Vidal-Torrado, P. Genesis and properties of wetland soils by VIS-NIR-SWIR as a technique for environmental monitoring. J. Environ. Manag. 2017, 197, 50–62. [Google Scholar] [CrossRef] [PubMed]
  64. Dotto, A.C.; Dalmolin, R.S.D.; ten Caten, A.; Grunwald, S. A systematic study on the application of scatter-corrective and spectral-derivative preprocessing for multivariate prediction of soil organic carbon by Vis-NIR spectra. Geoderma 2018, 314, 262–274. [Google Scholar] [CrossRef]
  65. Meng, X.; Bao, Y.; Wang, Y.; Zhang, X.; Liu, H. An advanced soil organic carbon content prediction model via fused temporal-spatial-spectral (TSS) information based on machine learning and deep learning algorithms. Remote Sens. Environ. 2022, 280, 113166. [Google Scholar] [CrossRef]
  66. Yang, Y.; Viscarra Rossel, R.A.; Li, S.; Bissett, A.; Lee, J.; Shi, Z.; Behrens, T.; Court, L. Soil bacterial abundance and diversity better explained and predicted with spectro-transfer functions. Soil Biol. Biochem. 2019, 129, 29–38. [Google Scholar] [CrossRef]
  67. Zhou, Y.; Biswas, A.; Ma, Z.; Lu, Y.; Chen, Q.; Shi, Z. Revealing the scale-specific controls of soil organic matter at large scale in Northeast and North China Plain. Geoderma 2016, 271, 71–79. [Google Scholar] [CrossRef]
Figure 1. Flow chart of the study.
Figure 1. Flow chart of the study.
Remotesensing 17 00729 g001
Figure 2. Study area and soil sampling sites.
Figure 2. Study area and soil sampling sites.
Remotesensing 17 00729 g002
Figure 3. Raincloud plots of the calibration, validation, and overall data for SOM and TN.
Figure 3. Raincloud plots of the calibration, validation, and overall data for SOM and TN.
Remotesensing 17 00729 g003
Figure 4. Estimation results of SOM and TN based on temporal GF-1 data using PLSR and RF models. Note: red and blue symbols represent the validation and OOB results, respectively; the abscissa is the PLSR and RF model based on GF data from four periods, the rate of change between pairs of data from four periods, and the coefficient of variation between three periods and four periods. For clarity, we used three y axes; the left y axis represents ρ c , the middle y axis represents RPIQ, and the right y axis represents RMSE.
Figure 4. Estimation results of SOM and TN based on temporal GF-1 data using PLSR and RF models. Note: red and blue symbols represent the validation and OOB results, respectively; the abscissa is the PLSR and RF model based on GF data from four periods, the rate of change between pairs of data from four periods, and the coefficient of variation between three periods and four periods. For clarity, we used three y axes; the left y axis represents ρ c , the middle y axis represents RPIQ, and the right y axis represents RMSE.
Remotesensing 17 00729 g004
Figure 5. Comparison of (a) field spectra (blue line) and laboratory spectra (red line); (b) DS-transferred spectra (light blue line) and laboratory spectra (red line) and their difference (green line).
Figure 5. Comparison of (a) field spectra (blue line) and laboratory spectra (red line); (b) DS-transferred spectra (light blue line) and laboratory spectra (red line) and their difference (green line).
Remotesensing 17 00729 g005
Figure 6. The cross-validation of PLSR models using different transformed samples.
Figure 6. The cross-validation of PLSR models using different transformed samples.
Remotesensing 17 00729 g006
Figure 7. Variable significance in SOM and TN modeling. Note: B1, B2, B3, and B4 are the blue, green, red, and near-infrared bands of GF-1 data, respectively.
Figure 7. Variable significance in SOM and TN modeling. Note: B1, B2, B3, and B4 are the blue, green, red, and near-infrared bands of GF-1 data, respectively.
Remotesensing 17 00729 g007
Figure 8. Comparison of spatial estimation based on the fusing of proximal sensing and remote sensing with ordinary kriging interpolation (OK) results for SOM and TN and their uncertainties.
Figure 8. Comparison of spatial estimation based on the fusing of proximal sensing and remote sensing with ordinary kriging interpolation (OK) results for SOM and TN and their uncertainties.
Remotesensing 17 00729 g008
Figure 9. Comparison of accuracy based on the fusing of proximal sensing and remote sensing with ordinary kriging interpolation (OK) results for SOM and TN. Note: the circle represents the results of OK, while the diamond represents the results of fusing prediction of proximal sensing and remote sensing; here, we use F to represent it. The blue line represents the fitted line of the fused prediction, while the red line represents the fitted line of the Kriging interpolation. R O K 2 represents the coefficients of determination of the OK results; R F 2 represents the coefficients of determination of the fusing prediction results.
Figure 9. Comparison of accuracy based on the fusing of proximal sensing and remote sensing with ordinary kriging interpolation (OK) results for SOM and TN. Note: the circle represents the results of OK, while the diamond represents the results of fusing prediction of proximal sensing and remote sensing; here, we use F to represent it. The blue line represents the fitted line of the fused prediction, while the red line represents the fitted line of the Kriging interpolation. R O K 2 represents the coefficients of determination of the OK results; R F 2 represents the coefficients of determination of the fusing prediction results.
Remotesensing 17 00729 g009
Figure 10. Eigenvectors of the first four principal components (the left y axis is for PC1, while the right y axis is for PC2, PC3, and PC4).
Figure 10. Eigenvectors of the first four principal components (the left y axis is for PC1, while the right y axis is for PC2, PC3, and PC4).
Remotesensing 17 00729 g010
Table 1. Parameters of GF-1 satellite data.
Table 1. Parameters of GF-1 satellite data.
SatelliteBandBand Range (nm)Resolution (m)
GF-1Panchromatic image450~9002
Multispectral image1 (blue)450~5208
2 (green)530~5908
3 (red)630~6908
4 (near infrared)770~8908
Table 2. Definitions of spectral indices and their calculations.
Table 2. Definitions of spectral indices and their calculations.
IndexDefinitionReference
NDVI r N I R r R e d r N I R + r R e d [36]
TVI ( r N I R r R e d r N I R + r R e d + 0.5 ) 1 2 × 100 [37]
EVI 2.5 × ( r N I R r R e d ) r N I R + 6 × r R e d 7.5 × r R e d + 1 [38]
SAVI ( r N I R r R e d ) × 1.5 r N I R + r R e d + 0.5 [39]
GNDVI r N I R r G r e e n r N I R + r G r e e n [40]
GRVI r G r e e n r R e d r G r e e n + r R e d [41]
BI ( r R e d × r R e d ) + ( r G r e e n × r G r e e n ) 2 [42]
BI2 ( r R e d × r R e d ) + ( r G r e e n × r G r e e n ) + ( r N I R × r N I R ) 3 [42]
RI ( r R e d × r R e d ) r G r e e n × r G r e e n × r G r e e n [43]
V r N I R r R e d [44]
Table 3. Statistics of SOM and TN for different datasets.
Table 3. Statistics of SOM and TN for different datasets.
PropertiesDatasetNMax.MedianMin.MeanStandard Deviation
SOMAll24024.118.19.118.152.62
calibration16024.118.0512.718.162.6
validation802418.159.118.152.67
TNAll2401.741.330.821.340.14
calibration1601.731.330.821.340.14
validation801.741.341.031.350.15
Table 4. Model performance based on GF-1 data for SOM and TN (in g/kg).
Table 4. Model performance based on GF-1 data for SOM and TN (in g/kg).
Properties SOMTN
Methods PLSRRFPLSRRF
Calibration ρ c 0.570.960.310.95
RMSE2.050.690.140.04
RPIQ1.905.651.434.96
ME0−0.0100
SDE2.060.690.150.04
Cross-validation ρ c 0.330.430.110.10
RMSE3.022.460.170.18
RPIQ1.421.661.161.04
ME−0.24−0.190.00−0.01
SDE2.992.450.170.18
Validation ρ c 0.440.530.160.31
RMSE2.291.980.160.14
RPIQ1.561.791.061.20
ME0.020.000.050.04
SDE2.291.980.150.13
Table 5. Principal component analysis of DS transferred vis-NIR spectra.
Table 5. Principal component analysis of DS transferred vis-NIR spectra.
Standard DeviationVariance Contribution (%)Cumulative Contribution (%)
PC12.3964.5564.55
PC21.6530.9395.48
PC30.533.2098.68
PC40.240.6599.32
Table 6. Estimation of SOM and TN and their uncertainties based on a combination of field spectra and GF-1 using PLSR and RF (g/kg).
Table 6. Estimation of SOM and TN and their uncertainties based on a combination of field spectra and GF-1 using PLSR and RF (g/kg).
PropertiesMethods ρ c RMSERPIQMESDE
SOMPLSR0.72 (0.66~0.78)1.93 (1.60~2.27)1.84 (1.54~2.14)0.32 (0.06~0.59)1.91 (1.60~2.22)
RF0.63 (0.57~0.70)1.78 (1.63~1.93)1.99 (1.83~2.15)−0.04 (−0.30~0.25)1.78 (1.64~1.93)
TNPLSR0.72 (0.57~0.88)0.11 (0.07~0.15)1.59 (1.08~2.09)0.02 (0.00~0.05)0.11 (0.06~0.15)
RF0.56 (0.44~0.68)0.11 (0.10~0.13)1.51 (1.32~1.69)0.03 (0.02~0.05)0.11 (0.09~0.12)
Note: The data in brackets are 90% confidence intervals.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xu, D.; Chen, S.; Zhou, Y.; Ji, W.; Shi, Z. Spatial Estimation of Soil Organic Matter and Total Nitrogen by Fusing Field Vis–NIR Spectroscopy and Multispectral Remote Sensing Data. Remote Sens. 2025, 17, 729. https://doi.org/10.3390/rs17040729

AMA Style

Xu D, Chen S, Zhou Y, Ji W, Shi Z. Spatial Estimation of Soil Organic Matter and Total Nitrogen by Fusing Field Vis–NIR Spectroscopy and Multispectral Remote Sensing Data. Remote Sensing. 2025; 17(4):729. https://doi.org/10.3390/rs17040729

Chicago/Turabian Style

Xu, Dongyun, Songchao Chen, Yin Zhou, Wenjun Ji, and Zhou Shi. 2025. "Spatial Estimation of Soil Organic Matter and Total Nitrogen by Fusing Field Vis–NIR Spectroscopy and Multispectral Remote Sensing Data" Remote Sensing 17, no. 4: 729. https://doi.org/10.3390/rs17040729

APA Style

Xu, D., Chen, S., Zhou, Y., Ji, W., & Shi, Z. (2025). Spatial Estimation of Soil Organic Matter and Total Nitrogen by Fusing Field Vis–NIR Spectroscopy and Multispectral Remote Sensing Data. Remote Sensing, 17(4), 729. https://doi.org/10.3390/rs17040729

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop