3.1. Descriptive Statistics for Soil Properties
The mean of EC was >4 dS m
−1, indicating that salinity is a major challenge in the area. EC ranged from 0.14 to 46.46 dS m
−1. In fact, EC values ranged from non-saline to extremely saline (
Table 1). This supports past results that indicate there is excessive soil salinity in central Iran [
63,
64]. The high standard deviation (SD) (6.30 dS m
−1) and coefficient of variation (CV) of 84% emphasize the vast variability of EC in the current study. CVs were categorized as low (<15%), medium (15–35%), high (35–75%), and extremely high (75–150%) variation [
65]. The lowest and highest CVs related to pH and P
av, respectively, while the other properties were either high or very high, and sand had moderate CVs (
Table 1).
Sand, Silt, and CCE showed moderate variability (12 ≤ CV < 60%) (
Table 1). The high CVs (CV ≥ 60%), for EC, clay, SOC, TN, P
av, Ca
++, Mg
++, Na
+ and SAR indicate high variability of nutrients. Additionally, the results of several studies and according to our findings, soil phosphorus had the highest variability [
9,
24,
66] which can be caused by using different phosphorus fertilizers and different management approaches by farmers in areas with diverse cultivation patterns. There was positive skewness (>2) and kurtosis (>3) for EC values which indicates a non-normal distribution of salinity. The SAR ranged from 0.46 to 41 with a mean of 11.57 which shows that the soils range from extremely sodic to non-sodic levels and had high SD and CV values.
The most significant positive correlations were observed between six variables: TN and SOC (r = 0.89), SAR and Na
+ (r = 0.85), Na
+ and EC (r = 0.78), SAR and EC (r = 0.66), Mg
++ and Ca
++ (r = 0.60), and P
av and SOC (r = 0.56). Na
+ and EC (r = 0.78), SAR and EC (r = 0.66), and SAR and Na
+ (r = 0.85) illustrated positive correlations (
Figure 3). The highest significant negative correlation was observed between silt and sand (r = −0.82) (
Figure 3) but this is not a true correlation as silt and sand are part of a composition. Other negative correlations were all weak.
3.2. Random Forest Prediction Accuracy
The validation results of soil property maps showed the highest R
2 = 0.69 for TN (
Table 2). In addition, the validation of map predictions of different soil properties indicated R
2 > 0.50 and CCCs = 0.65–0.79 in the study area. Taghizadeh-Mehrjardi et al. [
63], Mulder et al. [
67], Gomes et al. [
68] and Reddy et al. [
57] all reported that an R
2 ≤ 0.5 is more common and >0.7 is less common in prediction of soil properties. Therefore, the RF model mapped soil spatial variability well in this study. Additionally, RMSE and MAE results for all soil properties except for K
av (RMSE = 67.58 and MAE = 46.0) and Na
+ (RMSE = 28.57 and MAE = 18.71) were excellent. There was a very small positive bias for predicted values for all parameters (
Table 2) which RMSE and MAE exhibited a positive value with regard to the overestimated prediction of the DSM of soil property. This bias seemed closely related to the average distance of soil sampling, differences of type of cultivated lands and laboratory methods.
The high ranges of Na
+ (Min = 0.46 meq L
−1, Max = 272.0 meq L
−1) and K
av (Min = 17.0 mg kg
−1, Max = 695.0 mg kg
−1) may be the result of different management by farmers in using chemical fertilizer and irrigation water of different qualities. High variations in the distribution of soil particle size influence the capability to retain soluble salts in the soil, and these variations are connected to geological conditions [
28]. It may also be that, due to the extent of the study region and limited soil data, using more soil samples in the validation data would improve the R
2. These results are consistent with other studies that showed good performance by RFs for predicting soil properties with reasonable RMSE and bias [
64,
69,
70,
71]. In this study, the sampling distance was large; therefore, as Jiang et al. [
60], de Oliveira et al. [
72] and Zeraatpisheh et al. [
10] reported, denser sampling may be needed in future studies for more effective soil property mapping.
3.3. Spatial Prediction of Soil Properties
The prediction maps of the studied soil properties showed that the north-western part of the study area had the lowest soil pH value (
Figure 4a). Low amounts of EC were found in the soils through the center and south of the study area (
Figure 4b). The predicted SOC map (
Figure 4c) showed that the highest amount is present in the southern parts and in most of the region, especially in the central, north-eastern, and north-western parts of the area, minimal levels of organic carbon were observed. The reason for naturally low amounts of SOC is the arid climate of this region. However, management of the soils by farmers with the use of animal manure or organic fertilizers have caused changes in the amount of SOC in some areas of the study region (CV = 98.21%). The highest and lowest amounts of CCE were found in the north and north-western part and western parts, respectively, which is related to the geology of area (
Figure 4d).
Differences in the soil particle size distribution are varied in the region (
Figure 4) due to the sediments and soils of this region being affected by different stratigraphic deposits and elevations. The spatial pattern of TN, as shown in
Figure 4h, showed a similar pattern to SOC, as exhibited in
Figure 4c (r = 0.89,
p < 0.01). The P
av and K
av exhibited a low value from the northwest to the northeast and a high value to the south, with a nearly homogeneous spatial pattern over the rest of the study region. Based on the maps of TN, P
av and K
av, higher values of these three properties were observed in the southern regions (
Figure 4i,j). These areas have better conditions for the growth and cultivation of more products such as pomegranate and the highest amount of SOC, which provides the ability to grow a greater variety of crops. Zeraatpisheh et al. [
9] expressed the advantages of the prediction map of properties related to soil fertility, including TN, P
av and K
av to make fertilizer recommendations and showed that areas with the highest amounts of these three properties have better conditions for growing and producing more citrus fruits in the north of Iran and Mazandaran province.
The maps shown in
Figure 4, illustrate that the distribution of Ca
++ was very similar to the Mg
++ distribution except in the southern regions, which had higher values of Ca
++. It seems that low levels of Ca
++ and Mg
++ in the central part of the region frequently occurred where there were low amounts of clay. In addition, due to the relationship between Na
+ and SAR (r = 0.85,
p < 0.01), as well as the initial composition of salt, groundwater with alkaline conditions, and the use of alkaline fertilizers in the region, the spatial change maps of Na
+ and SAR represented a similar distribution in the southern part of the region (
Figure 4m,n).
The SAR and EC were high and very high in the northern portion of the region where pistachio is grown (
Figure 4). Groundwater quality maps (based on SAR, Na
+, TDS, Cl
−, EC, Ca
++, and Mg
++) reveal that the associated groundwater parameters are also high in the same area [
28]. Cultivated lands of the north are in close proximity of Bajestan playa (Kavir-e Namak), which is another important factor that contributes to high salinity and SAR in this area. This is particularly true in the northwestern part of the region. This is the location of the saltiest part of the playa. Maleki et al. [
28] and Taghizadeh-Mehrjardi et al. [
63] reported similar findings in studies of the variation of soil properties in the Bajestan region and soil salinity prediction in the Ardakan region of Yazd Province. In some sections of the northern part of the region, heavy soil texture (clay) also increases salt retention. Improper water management on irrigated cultivated lands in northern Bajestan that have low slopes and poor natural drainage has enhanced the salinity of groundwater, changed the depth of the water table, and salinized the soils above it. Raiesi [
73] and Nabiollahi et al. [
71] highlighted the role of increasing soil salinity in decreasing SQ in arid and semiarid regions. Groundwater and gradual salinization of soil have been observed in the study area. Bhutta and Alam [
74] showed that saline soils increased by approximately 17 million ha (from 56% to 73%) between 1953–1954 and 2001–2003 due to the dropping water table and increased salinity from over-pumping of groundwater.
3.4. Principal Component Analysis (PCA)
As shown in
Table 3, PCA is a good method for summarizing parameter effects because of the high correlation between some soil properties. PCs with eigenvalues >1.0 represented 73.76% of the dataset variability in the study area (
Table 3). PC1 accounted for most of the total data variance (25.68%), with the strongest positive loading factors for EC, Na
+, and SAR. The map of PC1 showed the high values in north and northeastern parts of the study area (
Figure 5a) similar to SAR, Na
+, and EC distribution (
Figure 4). PC2 explained 17.27% of the total variance and had the strongest loadings for sand (large negative loading factor) and silt (large positive loading factor). The higher values of PC2 (
Figure 5b) coincide with the sand map (
Figure 4). PC3 had the strongest loadings for SOC, TN, P
av, and K
av and had positive loading factors that explained another 12.43% of the total variance of the dataset. PC3 has high values in part of the southern area (
Figure 5c) where TN and SOC maps showed the higher values. PC4 explained 9.40% of the data variation with strong positive loading factors for pH and clay and negative loading factors for CCE (
Table 3) and the distribution map of PC4 (
Figure 5d) showed similar changes to the mentioned soil properties maps (
Figure 4). The last PC (PC5) described 8.96% of the data variability with a positive loading factor for Ca
++ and Mg
++ (
Table 3 and
Figure 5e). Thus, five principal components (PC1, PC2, PC3, PC4, and PC5) were used to delineate the MZs for the study region. Davatgar et al. [
19] used PCA and “fuzzy k-means” to distinguish MZs for a paddy field in the north of Iran and reported that 72.95% of soil property changes were described by three PCs. Zeraatpisheh et al. [
9], in a study in Iran, used PCA and fuzzy c-means to define soil MZs in citrus orchards and stated that four PCs could explain 78.66% of the variation in soil properties. Accordingly, the PCA method incorporated 14 input variables into five new PCs which explain most of the spatial variation of these properties.
3.5. Clustering Analysis
Five PCs were used with cluster analysis to define MZs. The FuzMe software was used to run the “fuzzy k-means” clustering on the five PCs. This method allows differentiation between different areas with similar properties. Hence, when the FPI and NCE values were negligible, the optimal number of MZs could be distinguished, as shown in
Figure 6. Four MZs were chosen as optimal as this is where the FPI is at a minimum and NCE is at its second-lowest value. A one-way analysis of variance (ANOVA) was performed to assess the effectiveness of the combination of the PCA clustering algorithm and “fuzzy k-means” to delineate MZs as well as their spatial variability. The four different MZs that were distinguished as the optimal are shown in
Figure 7. Several studies reported good results using the analysis of variance to delineate areas [
18,
61,
75]. Zeraatpisheh et al. [
9] performed a recent study where two MZs were detected by the fuzzy k-means method in an area in Darab city, Iran.
Based on the results obtained from
Table 4, there were significant differences (
p < 0.05) between some soil properties except for sand, SOC, TN, P
av, and Ca
++ for the four MZs. In comparison to other MZs, pH showed a significant difference in MZ4; clay and K
av showed significant differences in MZ3; and Mg
++, Na
+, and SAR showed a significant difference in MZ2. MZ3 had lower clay and SOC values and could have lower potential soil fertility than the other zones. This suggests that in this area, more chemical fertilizers and organic fertilizers are needed to increase SOC in MZ3. Generally, the ordering of MZs was MZ4 > MZ1 > MZ3 > MZ2 in relation to soil fertility for all investigated soil properties, based on the results in
Table 4. These results can be useful for deciding how to distribute fertilizer in each zone with an emphasis on regionally specific decision making for agriculture lands to allow more sustainable production and, in turn, to reduce environmental risks from uneven application of chemical fertilizers.
3.6. Management Zones (MZs) and Fertilizer Recommendation
Figure 7 shows the MZ map with four MZs. ANOVA was used to evaluate differences in soil nutrients among MZs. There was a significant difference (
p < 0.05) between the MZs for each of the soil properties (
Table 4). Soils in MZ4 with higher values of clay and SOC and lower values of EC and SAR showed higher soil fertility potential than other zones. There was greater variability in EC and SAR between MZs (
Table 4). Mean EC values in MZ2 and MZ3 were 10.45 and 7.78 dS m
−1, respectively. Similar to EC, the highest values of SAR were found under MZ2 and MZ3 with values greater than 10. The greatest TN, P
av, and K
av deficiencies occurred in MZ2. The most common reason for these deficiencies in this unit could be the result of reduced capacity of mineral fertilizers and insufficient clay content in this zone. It appears that native N input (low OC) and higher N reduction by leaching (low clay content) limited MZs.
MZ3 contained P
av and K
av around 12.23 and 291.90 mg kg
−1 with these areas appearing to have sufficient soil nutrients. MZ4 has the best intrinsic soil fertility due to higher clay, SOC, and nutrient reserves. However, the area of this zone was 16,211.32 ha and MZ2 had the highest area at about 24,580.50 ha (
Figure 7).
Maleki et al. [
43] predicted SQ maps in earlier study using two datasets (total data set–TDS and minimum data set–MDS) by linear (L) and nonlinear (NL) scoring methods from 223 surface soil samples (0–30 cm depth). The results of the soil quality indices (SQIs) map indicated the maximum values of the SQIs in the southern and central parts of the study area, while the minimum values were found in the north of the study area due to higher EC and SAR. The DSM-predicted soil property maps generated here were transformed to SQI maps according to Maleki et al. [
43] and were classified into five classes of SQ including very high (I), high (II), moderate (III), low (IV), and very low (V). The area of MZs within each SQ grade is presented in
Table 5. The findings indicate that there are different SQ grades within each of the four delineated zones.
MZ4 had an area of 16211.32 ha, the largest area, categorized as moderate (III) SQ; although part of the area was in the high class (II) SQ (area of 828.21 ha), while the lowest area of very low (V) SQ was located in MZ4 (
Table 5). In contrast, MZ1 contained five SQ grades (
Table 5) with 0.47 ha located in very high (I) SQ and 66.16 in very low (V).
The results for SQ maps [
43] showed that the lower grades of SQ with the highest EC and SAR levels were in the northern portion of the study area in proximity to the playa where elevations are the lowest and topographic wetness index (TWI) is highest [
76]. Soil salinity and alkalinity are subject to two primary and secondary factors in the study area. The primary agent includes the conditions of the physical environment (climate, geology, and topography). The secondary factor includes the human uses of the land and management practices (particularly the use of poor-quality irrigation water and inadequate agricultural operation that increase the likelihood of salinity, alkalinity, and sodicity).
The results of
Table 6 show the main restricted indicators in each MZ regarding SQI. According to the results of SQ and MZs (
Table 6), prescription of fertilizer should be applied based on the uniform fertilization management within each SQ or MZ area. It should be noted that determining the amount of fertilizer required in each region is based on available and common fertilizers used by farmers in the region. Reducing production costs, increasing the quantity and quality of the product, and keeping the environment and soil safe from pollution are potential advantages of fertilizing based on specialized prescriptions and in line with sustainable agricultural practices. Numerous researchers have also reported the distribution of different fertilizers based on MZ delineation [
9,
10,
11,
18,
19,
72].
The fertilization method is very important in the nutritional management of plants. Fertilization should be performed in such a way that the elements required by the plant are provided to the plant in a suitable form and at the desired time. Various factors, such as the type of fertilizer and plant species, affect the fertilization method. In addition, the amount and time for the applied fertilizers will be different in different regions. To increase the efficiency of fertilizers, the management approach and method of consumption have been taken into consideration. The amount of phosphorus in the region is relatively low to medium; in general, it indicates the relative poverty of this element in the soils of the region. However, the deficiency is not as severe as nitrogen (TN < 0.05). Therefore, nitrogen fertilizer consumption should be given serious attention. In order to achieve the highest absorption efficiency of chemical fertilizers for phosphorus, it should be used at the right time and according to the needs of the plant.
The results in
Table 4 show that the amount of potassium is relatively moderate amount with a significant difference in terms of this element in MZ3 (
Table 6) and other cases (probably due to the difference in soil texture and cultivation pattern). Potassium plays an important role in plant resistance to disease. Among its other important roles, it is possible to regulate the work of stomata openings and water relations and accelerates the growth of generative tissues. Most importantly, the special conditions of the region, which is under the stress of drought, wind, and sometimes cold, should be considered. Therefore, it is necessary to use potassium in the form of potassium sulfate fertilizer in the soils of the region.
The findings showed that farmlands in the study area are generally lacking in SOC (<0.70%), especially in MZ2 and MZ3 with the low amount <0.40 (
Table 4 and
Table 6). The lack of SOC is one of the most notable problems of the study region. Considering the importance of soil organic matter, increasing it as the first priority for improving soil fertility in plant nutrition management should be given serious attention (
Table 6).
Annual applications of gypsum or liquid organic sulfur and organic fertilizers in appropriate ratios to promote effective leaching of salt may be a highly effective approach to remediation of saline and sodic soils. Determining the appropriate cultivation pattern based on MZ maps and changing cultivation towards greenhouse crops can lead to sustainable agriculture and prevent soil degradation. Recognition of the importance of irrigation, water quality, and the patterns of SQ predictions on the MZs map provides scientific guidance for enhanced management of the soils of the Bajestan area, for developing more sustainable agricultural practices. This approach can also be applied in similar regions through DSM.