4.1.1. Correlation Analysis Between Soil Nitrogen and Multispectral Vegetation Indices
Based on the constructed spectral indices, Pearson correlation analysis was used to calculate the linear correlation coefficients between seven Sentinel-2 vegetation indices and soil nitrogen content. The indices used and their calculation formulas are detailed in
Table 4.
First, the vegetation indices in
Table 4 are calculated using the reflectance of different bands, aiming to assess vegetation growth status and environmental impacts. In the statistical analysis shown in
Figure 3, EVI (enhanced vegetation index) performs the best. EVI combines data from the red, blue, and near-infrared bands, effectively reducing atmospheric interference and soil background influences. As a result, it shows strong correlation and significance in soil nitrogen estimation. The Pearson correlation coefficient for EVI exceeds 0.4, indicating a strong positive correlation with soil nitrogen compared to other indices. Moreover, as shown in
Figure 3, its statistical significance (log10(
p) value) is also high, reaching 0.01, which indicates that this index is statistically significant and suitable for remote sensing monitoring in the current ecological environment.
Following closely, GNDVI (Green–Red Vegetation Index) is another prominent index. GNDVI uses the ratio of the green band to the near-infrared band to reflect the photosynthetic capacity and moisture status of vegetation. Its Pearson correlation coefficient is also close to 0.4, indicating a positive correlation with soil nitrogen, and it demonstrates high statistical significance. This suggests that GNDVI is effective in reflecting the spatial variation of soil nitrogen content and is a powerful tool for soil nitrogen monitoring.
SAVI (Soil Adjusted Vegetation Index) performs relatively weakly in this study. Although SAVI helps reduce soil background interference, its correlation with soil nitrogen is lower, with a Pearson correlation coefficient of 0.32. The log10(p) value exceeds 1.3, and its p-value reaches 0.05 but does not meet the 0.01 threshold, indicating limited statistical significance. As a result, its application in soil nitrogen estimation is less effective.
Next, OSAVI (Optimized Soil Adjusted Vegetation Index) and NDVI (Normalized Difference Vegetation Index) show relatively poor performance. OSAVI adjusts for soil background to improve estimation accuracy but has a lower Pearson correlation coefficient of 0.31 in this study, with low statistical significance, and its p-value does not even reach 0.05. NDVI, a classic vegetation index, despite being widely used, performs fairly poorly in this study. It has a Pearson correlation coefficient of 0.27, similar to DVI (Difference Vegetation Index), and its log10(p) value does not exceed 1.1, indicating weak correlation with soil nitrogen, thus limiting its effectiveness in soil nitrogen estimation.
Finally, RVI (Ratio Vegetation Index) performs the worst. Its correlation with soil nitrogen is very low, with a Pearson correlation coefficient of 0.25 and extremely low statistical significance, making it nearly ineffective in providing reliable soil nitrogen predictions.
Overall, none of the vegetation indices had a correlation coefficient exceeding 0.5 with nitrogen content, indicating that their explanatory power for nitrogen variation remains limited. In this study, EVI was selected as the optimal vegetation index for nitrogen inversion using the Sentinel-2 platform.
4.1.2. Correlation Analysis Between Soil Nitrogen and Hyperspectral Vegetation Indices
Based on the Pearson correlation coefficients between nitrogen content and Zhuhai-1 reflectance shown in
Figure 4, this study found varying degrees of correlation between soil nitrogen content and multiple hyperspectral bands. In particular, bands around the 730 nm region exhibited stronger correlations with soil nitrogen content. Moreover, correlation peaks were observed in the 500–600 nm and 700–800 nm ranges, which is consistent with findings from previous studies [
36]. These bands exhibit strong responses in capturing soil nitrogen characteristics and provide robust support for hyperspectral remote sensing inversion of soil nitrogen content, indicating that hyperspectral data enable more precise identification of nitrogen-related spectral features. Based on the correlation analysis between 32 bands and nitrogen content, and in consideration of spectroscopic mechanisms, this study optimized the band combinations for vegetation indices. Specifically, bands with higher correlations were selected and aligned with the spectroscopic characteristics of nitrogen to ensure a more accurate reflection of soil nitrogen variations. The optimized band selections are presented in
Table 5, and Pearson correlation analyses between the indices and nitrogen content are illustrated in
Figure 5.
The optimization of the band combinations for vegetation indices in
Table 5 is based on spectral mechanisms, the correlation analysis of soil nitrogen content, and the rationality of the formula structure. Through the correlation analysis of different bands with soil nitrogen content, we optimized the selection of vegetation index bands by combining the spectral characteristics of the bands and their responses to nitrogen content variations.
For ratio-based indices (such as GNDVI, NDVI, and RVI), we selected bands with higher correlations to nitrogen content changes. GNDVI uses the 730 nm and 550 nm bands, where the 550 nm band is in the green light region and shows a higher correlation with soil nitrogen content, while the 730 nm band is in the near-infrared region and has the highest correlation with nitrogen. This band effectively reflects the plant growth state. By combining these two bands, GNDVI can better capture soil nitrogen content changes, particularly in areas with active vegetation growth.
NDVI uses the 730 nm and 670 nm bands. The 730 nm band, located in the near-infrared region, responds most strongly to vegetation growth and nitrogen content changes, while the 670 nm band, located in the red light region, has the lowest correlation with soil nitrogen content. Although the reflectance of the 670 nm band has a weaker correlation with nitrogen content, it still enhances sensitivity to nitrogen content changes through its difference in reflectance with the 730 nm band, especially in reflecting the relationship between vegetation health and soil nitrogen content.
RVI selects the 730 nm and 670 nm bands, where the reflectance ratio between these two bands effectively reflects soil nitrogen content, especially in areas with exposed soil or sparse vegetation. The combination of the 730 nm and 670 nm bands, through the ratio calculation, directly reflects changes in soil and vegetation, thereby improving nitrogen content estimation accuracy.
For difference-based indices, DVI uses the reflectance difference between the 730 nm and 670 nm bands. By calculating this stark difference between these two bands, we can directly reflect soil nitrogen content changes, especially in low vegetation cover or arid regions. The difference-based index is particularly effective at capturing nitrogen content changes in such areas.
Regulation-based indices (such as SAVI and OSAVI) introduce constants to adjust for soil background or atmospheric interference, enhancing the response to vegetation and reducing the impact of external factors. SAVI uses the 730 nm and 670 nm bands and adjusts with a constant of 0.5. The reflectance difference between these two bands is highly sensitive to changes in vegetation growth and nitrogen content, effectively reducing soil background interference. OSAVI also uses the 730 nm and 670 nm bands and introduces a constant of 0.16 to further adjust for soil background effects, particularly in regions with complex soil backgrounds, allowing for more accurate reflection of nitrogen content changes.
Finally, the enhanced vegetation index (EVI) combines the 730 nm, 670 nm, and 490 nm bands. The 490 nm band, located in the blue light region, has the highest correlation with soil nitrogen content in the blue light range, so it was chosen to enhance sensitivity to nitrogen content changes. The selection of the 730 nm and 670 nm bands is also to enhance the index’s response to nitrogen. Although the 670 nm band has a lower correlation with nitrogen content, the 730 nm band is the one that responds most strongly to changes in soil nitrogen content. The inclusion of the 490 nm band helps reduce atmospheric interference, and through the weighted combination of multiple bands, further improves the accuracy of soil nitrogen content estimation.
Through this optimization of band combinations, this study ensures that each vegetation index can maximize the accuracy of soil nitrogen content estimation. The selection of bands is not only based on the high correlation of the bands but also considers the formula structure of each index to make them more suitable for soil nitrogen content detection. By optimizing different types of vegetation indices, we ensure that we can more accurately reflect changes in soil nitrogen content and reduce the influence of external interference factors.
Figure 5 shows the Pearson correlation coefficients between different vegetation indices and soil nitrogen content. GNDVI exhibits the highest correlation with nitrogen content (r = 0.69), indicating that this index is the most effective in capturing nitrogen variations. SAVI (r = 0.65) and EVI (r = 0.63) follow closely, suggesting that they also have strong responsiveness to nitrogen variation. OSAVI (r = 0.58), NDVI (r = 0.57), and DVI (r = 0.57) show lower correlations; although their performance in nitrogen inversion is weaker than that of GNDVI and NDVI, they still retain a certain explanatory capacity. RVI (r = 0.50) demonstrates the weakest correlation, indicating a relatively limited sensitivity to changes in soil nitrogen content.
Overall, GNDVI was identified as the optimal vegetation index for nitrogen inversion using the Zhuhai-1 remote sensing platform.
4.1.3. Correlation Analysis Between Soil Nitrogen and Texture Indices
Eight texture features were extracted from four Sentinel-2 multispectral bands using the gray-level co-occurrence matrix, and their correlations with nitrogen content were then established, as shown in
Figure 6. For convenience, the notation XY is used to represent the texture feature Y extracted from parameter X, such as B2mea indicating the mean (mea) texture feature derived from band B2.
As shown in
Figure 6, the sensitivity of the eight texture features extracted from different bands to nitrogen content varies. The correlation between B8mea and nitrogen content passed the significance test at
p < 0.05, with a correlation coefficient of 0.39, which is the highest among the features. The remaining texture features show no significant correlations with nitrogen content, as their correlation coefficients did not pass the significance test.
As shown in
Figure 4, the 730 nm, 806 nm, 746 nm, and 760 nm bands exhibit the highest correlations with nitrogen; therefore, eight texture features were extracted from these bands, and Pearson correlation analyses with nitrogen content were performed. The results are presented in
Figure 7.
Figure 7 illustrates the correlations between texture features extracted from four hyperspectral nitrogen-sensitive bands of Zhuhai-1 and nitrogen content. The results indicate that 730dis, 806hom, 746hom, and 746sec passed the significance test at
p < 0.05, while 730mea, 730hom, 730sec, 806mea, 746mea, and 760mea passed at
p < 0.01. Among them, 730mea shows the highest correlation with nitrogen, with a coefficient of 0.49.
Based on
Figure 6 and
Figure 7, individual texture features show varying degrees of correlation with soil nitrogen, but their overall sensitivity is low. To address the weak correlations of single texture features with soil nitrogen, this study combined them in pairs to construct texture indices. The resulting correlations are presented in
Table 5.
As shown in
Table 6, the three optimal combined texture indices selected from the Sentinel-2 platform for nitrogen all passed the significance test at
p < 0.01, indicating highly significant correlations. Among them, DTI (B4var, B8var) shows the strongest correlation with nitrogen content, with a coefficient of 0.65. Overall, DTI (B4var, B8var) was identified as the optimal texture index for nitrogen inversion using the Sentinel-2 platform.
As shown in
Table 6, three optimal composite texture indices for nitrogen estimation using the Zhuhai-1 platform all passed the
p < 0.01 significance test. These indices exhibited highly significant correlations. Among them, RTI (730Hom, 760Sec) showed the highest correlation with nitrogen content, reaching 0.78. This study selected RTI (730Hom, 760Sec) as the optimal texture index for nitrogen inversion on the Zhuhai-1 platform.
The results in
Table 6 and
Table 7 demonstrate that texture indices derived from the combination of multiple texture features exhibit stronger correlations with soil nitrogen. This indicates that incorporating texture indices can improve the inversion accuracy of soil nitrogen.