Next Article in Journal
Cities and Governance for Net-Zero: Assessing Procedures and Tools for Innovative Design of Urban Climate Governance in Europe
Previous Article in Journal
Collaboration and Resource Sharing for the Multi-Depot Electric Vehicle Routing Problem with Time Windows and Dynamic Customer Demands
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Prediction Model of Farmland Water Conservancy Project Cost Index Based on PCA–DBO–SVR

1
College of Water Conservancy, Shenyang Agricultural University, Shenyang 110866, China
2
Center of Engineering Construction Service, Ministry of Agriculture and Rural Affairs, Beijing 100081, China
3
Liaoning Agricultural Development Service Center, Shenyang 110030, China
*
Author to whom correspondence should be addressed.
Sustainability 2025, 17(6), 2702; https://doi.org/10.3390/su17062702
Submission received: 22 January 2025 / Revised: 6 March 2025 / Accepted: 14 March 2025 / Published: 18 March 2025

Abstract

:
With the gradual cessation of budget quota standards and the emphasis on market-based pricing, accurately predicting project investments has become a critical issue in construction management. This study focuses on cost indicator prediction for irrigation and drainage projects to address the absence of cost standards for farmland water conservancy projects and achieve accurate and efficient investment prediction. Engineering characteristics affecting cost indicators were comprehensively analyzed, and principal component analysis (PCA) was employed to identify key influencing factors. A prediction model was proposed based on support vector regression (SVR) optimized using the dung beetle optimizer (DBO) algorithm. The DBO algorithm optimized SVR hyperparameters, resolving issues of poor generalization and long prediction times. Validation using 2024 farmland water conservancy project data from Liaoning Province showed that the PCA–DBO–SVR model achieved superior performance. For electromechanical well projects, the root mean square error (RMSE) was 1.116 million CNY, mean absolute error (MAE) was 0.910 million CNY, mean absolute percentage error (MAPE) was 3.261%, and R2 reached 0.962. For drainage ditch projects, RMSE was 0.500 million CNY, MAE was 0.281 million CNY, MAPE was 3.732%, and R2 reached 0.923. The PCA–DBO–SVR model outperformed BP, SVR, and PCA–SVR models in all evaluations, demonstrating higher prediction accuracy and better generalization capability. This study provides theoretical support for developing cost indicators for farmland water conservancy projects and offers valuable insights for dynamically adjusting national investment standards and improving construction fund management.

1. Introduction

Grain security is a critical element of national strategy. To implement the “storing grain in land and technology” policy effectively, alleviate water scarcity, and promote sustainable development [1,2], farmland development has become a priority for national investment. In this context, the significance and role of farmland water conservancy projects have grown increasingly prominent. However, the absence of a systematic standard for investment in such projects remains an issue. Cost indicators are vital in investment estimation during the decision-making phase and are integral to cost management throughout the project lifecycle. Accurately and efficiently predicting cost indicators is essential for scientifically controlling project investment. By forecasting these indicators, project costs and their trends can be identified, enabling the formulation of cost policies and controlling expenditures at various project stages.
Current research on cost indicator prediction methods primarily includes traditional and machine learning techniques. Traditional methods predominantly rely on mathematical statistics, such as linear regression analysis [3] and grey system prediction [4]. Linear regression analysis is limited to fitting linear relationships in original data, while grey system prediction exhibits low fault tolerance and requires large sample sizes. These methods are insufficient for nonlinear data fitting and are unsuitable for predicting farmland water conservancy project cost indicators. With advancements in artificial intelligence, researchers and experts have explored various cost prediction simulation models in the engineering field. Models such as BP neural networks, support vector machine (SVM) [5], and convolutional neural networks [6,7] are increasingly applied to construction cost forecasting. For example, Wahyu Hayati et al. [8] and Feng et al. [9] developed cost prediction models based on SVM and neural networks tailored to the characteristics of transmission and wind power projects, demonstrating high practical value in determining cost rationality for transmission projects. Wu et al. [10] proposed a two-stage integrated model combining SVM and least-squares support vector machines (LS-SVM) to predict the costs of green building projects, achieving high prediction accuracy and strong generalization capabilities. Compared to traditional neural networks and other learning methods, SVM effectively addresses challenges such as overfitting, local minima, and the curse of dimensionality [11], offering distinct advantages for predicting cost indicators of farmland water conservancy projects. The quality of input data significantly impacts the accuracy of prediction models, necessitating feature selection methods to reduce the dimensionality of raw data. Wyke et al. [12] utilized principal component analysis (PCA) to reduce dimensionality on high-dimensional data, minimizing redundancy and ensuring data independence. PCA is particularly suitable for integrating with other algorithms as an unsupervised learning method [13]. Studies have shown that parameter selection plays a critical role in determining the accuracy and generalization capability of models. In recent years, experts and researchers in cost prediction have frequently introduced optimization algorithms to improve prediction accuracy by optimizing model parameters [14]. At the same time, optimization algorithms can also optimize the design of engineering structural parameters to achieve the advantages of cost reduction, performance enhancement, cycle time reduction, etc. [15,16]. Lin et al. [17] identified key factors influencing substation project costs during the decision-making phase and applied particle swarm optimization (PSO) to determine optimal parameters for support vector machines, ultimately developing a PSO–SVM-based cost prediction model for substations. Cheng et al. [18] highlighted the challenge of predicting construction costs due to price fluctuations and proposed a hybrid model combining LS-SVM with differential evolution (DE) to forecast construction cost indices. The model achieved a prediction error of less than 10%, demonstrating its feasibility. Ai et al. [19] constructed a PSO–SVM-based model to predict environmental management costs, achieving higher prediction accuracy than other methods. Among these optimization algorithms, the dung beetle optimizer (DBO) stands out for its ability to rapidly approach optimal solutions and effectively optimize model parameters [20,21]. Its robust exploration capability has made it a valuable tool for optimizing hyperparameters in support vector machines and neural network models.
A review of research literature on the development and prediction of cost indicators shows that existing prediction models are primarily applied to building construction, highway engineering, and power transmission projects [22,23,24], with limited studies focusing on farmland water conservancy project cost indicators. Consequently, systematic and in-depth research in this area is necessary. This study applies PCA to preprocess model data, identifying key factors influencing the cost indicators of farmland water conservancy projects. Qualitative factors are further analyzed to construct a support vector regression (SVR) model optimized with the DBO. The proposed PCA–DBO–SVR model predicts cost indicators under varying regional, project-type, and environmental conditions. This approach aims to provide methodological and theoretical support for cost indicator prediction in farmland water conservancy projects, improving cost management throughout the project lifecycle in China.

2. Materials and Methods

2.1. Overview of the Study Area

Liaoning Province is located in the southern part of northeast China, with geographic coordinates ranging from 118°53′ E to 125°46′ E and 38°43′ N to 43°26′ N. The province covers a land area of 148,700 km2 and a sea area of 150,000 km2, with a total coastline length of 2290 km. Liaoning comprises 14 cities (including the sub-provincial cities of Shenyang and Dalian) and the Shenyang–Fushun Demonstration Zone. There are 100 counties (cities and districts), including 16 county-level cities, 25 counties, and 59 municipal districts. The administrative divisions include 513 subdistricts, 640 towns, 201 townships, 4617 communities, and 11,561 administrative villages, with a total permanent population of 41.97 million. In 2024, Liaoning added 22,670 hectares of newly constructed farmland. The main construction activities in this region involve canal lining and dredging, irrigation and drainage works, the construction of new bridges and culverts, rural road development, land leveling and soil improvement, and farmland protection projects. These initiatives aim to enhance the overall ecological environment of farmland, mitigate the impact of floods on crops, and improve agricultural productivity in the region. A geographic map of Liaoning Province is shown in Figure 1.

2.2. Data Sources and Processing

This study collected cost indicator data for farmland water conservancy projects across cities in Liaoning Province in 2024. The sample data were obtained through a combination of field surveys and engineering acceptance reports. The surveyed areas included Wanghua District in Fushun, Hunnan District in Shenyang, Fuxin County in Fuxin, Gongchangling District in Liaoyang, Shuangta District in Chaoyang, and Linghai City in Jinzhou. Data preprocessing includes missing value processing, outlier removal and data normalization. Missing value processing is conducted in accordance with the mean value of the same type of project and the proportion of the method of interpolation, according to the constraints on the composition of the cost and the reasonableness of the amount of work to eliminate outliers, and with the use of normalization methods to eliminate the impact of the data outline and scaling the data to the [0, 1] range. The normalization formula is shown in Equation (1). The dataset consisted of 40 electromechanical well projects and 100 drainage ditch projects, split into training and testing sets at a 7:3 ratio.
x i * = x i x m i n x m a x x m i n
where x m a x and x m i n represent the maximum and minimum values in the sample data, respectively; x i and x i * represent the original and normalized data, respectively.

2.3. Analysis of Factors Influencing Cost Indicators in Farmland Water Conservancy Projects

The cost indicators of farmland water conservancy projects are influenced by numerous factors with complex interrelationships [25,26]. Selecting comprehensive and reasonable influencing factors is foundational in predicting these indicators. In this study, the investment characteristics of farmland water conservancy projects were analyzed to identify key factors affecting cost indicators. Relevant literature was consulted to ensure a robust selection of influencing factors, while practical considerations specific to the projects were also incorporated. To ensure the feasibility of the selected factors for model construction, PCA was employed to conduct an in-depth analysis of the identified influencing factors.

2.3.1. Preliminary Selection of Influencing Factors

To ensure the accuracy of the prediction model, the preliminary selection process involved gathering a comprehensive range of factors influencing the cost indicators of farmland water conservancy projects. Given the limited literature on the topic, efforts were made to ensure thoroughness and scientific rigor in factor selection. Relevant materials were collected and analyzed for comparison, including preliminary design documents, feasibility study reports [27,28], and data from actual engineering projects. The influencing factors were organized and summarized accordingly. This study focused on electromechanical well and drainage ditch projects within farmland water conservancy projects. The influencing factors considered included geographical environment, engineering, consumption, price, and management factors. The statistical results of the selected factors are presented in Table 1.
Some of the qualitative factors need to be quantified. The quantification principle should be clearly defined. The quantification principles for geographic location are: 1 for the east, 2 for the south, 3 for the west, 4 for the north, and 5 for the central part; the quantification principles for topography and geomorphology are: 1 for the hills and 2 for the plains; the quantification principles for the geologic structure, the temporary traffic condition inside and outside the site, the management level of the contractors, the management level of the supervisory authority, and the construction conditions in the site are: 1 for the lack of influence on the construction, 2 for the general influence, and 3 for the significant influence; the quantification principles for the intensity of earthquake: for example, the intensity of 6 is quantified as 6.

2.3.2. Screening Key Influencing Factors Using Principal Component Analysis

Common dimensionality reduction methods include grey relational analysis (GRA) and PCA. Considering the non-temporal characteristics of the sample data and the importance of influencing factors as the foundation for constructing prediction models [29], PCA was employed in this study to identify key influencing factors. PCA is a statistical method that extracts and transforms multiple correlated variables into fewer independent composite variables through dimensionality reduction. This approach aims to maximize the information retained in the sample data with a reduced set of variables. The detailed method is as follows:
  • Suppose the dataset consists of n samples, each containing m variables, forming an n × m sample matrix X, as shown in Equation (2):
    X n × m = X 11 X 1 m X n 1 X n m ,
  • Calculate the eigenvalues λ 1 λ 2 ≥ ⋯ λ m ≥ 0, and corresponding eigenvectors u 1 , u 2 , ⋯, u m , of the correlation coefficient matrix R, where u j = ( u 1 j , u 2 j , ⋯, u m j )T. Find the composition of m new indicator variables from feature vectors.
  • Determination of principal components: After standardizing the data, the variance contribution rate b j and cumulative contribution rate a j of each factor are calculated using Equations (3) and (4). Finally, only components with eigenvalue ≥ 1 and cumulative contribution greater than 80% are retained as principal components [30].
    b j = λ j j = 1 m   λ j , ( j = 1,2 , , m ) ,
    a j = j = 1 m   b j , ( j = 1,2 , , m ) ,
  • Calculation of comprehensive scores: The factor loading matrix is used to obtain the scoring coefficients of each influencing factor on the principal components. Comprehensive scores are calculated and ranked after normalization. This process ensures the identification of key influencing factors crucial for the prediction model.

2.4. DBO–SVR Prediction Model

2.4.1. Support Vector Regression (SVR)

Support vector machines (SVMs) are predictive methods based on structural risk minimization, offering significant advantages in handling high-dimensional, nonlinear, and small-sample problems. These methods demonstrate high predictive accuracy and robust nonlinear classification capabilities [31,32]. Support vector regression (SVR), an extension of SVM, addresses regression prediction problems by mapping data into a high-dimensional feature space through nonlinear transformations, thereby converting nonlinear regression problems into linear regression problems in higher dimensions. As this study involves regression prediction, the final decision function is:
f x = w , Φ x + b ,
where Φ ( x ) represents the mapping function, w is the weight coefficient, and b is the bias term.
The SVR problem can be described as:
min 1 2 w 2 + C i = 1 n   ξ i + ξ i * ,
where ξ i ,   ξ i *   represent constants, and C is the penalty factor. C influences the stability of the SVR model; larger values of C result in higher model accuracy.
Common kernel functions include linear, polynomial, and Gaussian kernels. This study adopts the Gaussian radial basis function (RBF) as the kernel function due to its superior noise resistance compared to other kernel functions. The formula for the RBF kernel is as follows:
k x i , x j = exp g ( x i x j ) 2 g > 0 ,
where x j represents the center of the Gaussian kernel function, and g is the variance of the kernel function. The variance influences the complexity of sample distribution in the feature space; larger g values lead to lower model precision.
In conclusion, the accuracy and stability of the SVR model are determined by the choice of kernel function type, the kernel parameter g, and the penalty factor C.

2.4.2. Dung Beetle Optimizer (DBO)

DBO is a novel population-based optimization algorithm introduced by Xue et al. in 2022 [33]. This algorithm simulates the social behaviors of dung beetles, such as rolling balls, dancing, reproducing, foraging, and stealing, to achieve optimization. DBO can efficiently identify optimal parameters, including the support vector weights and thresholds for the SVR model, within a short time frame [33]. Compared to traditional optimization algorithms like the genetic algorithm (GA), DBO offers advantages such as faster convergence and higher accuracy. It effectively addresses issues of insufficient prediction precision and weak generalization capability caused by the subjective selection of SVR hyperparameters [34]. Based on these advantages, a DBO–SVR model is constructed for predicting farmland water conservancy project cost indicators. The algorithm’s steps are detailed as follows:
When dung beetles roll balls forward, their positions are updated as shown in Equation (8). Upon encountering obstacles, the positions are re-updated using Equation (9):
x i t + 1 = x i t + a k x i t 1 + b x i t x w o r s t t ,
x i t + 1 = x i t + tan θ x i t x i t 1 ,
where a takes a value of 1 or −1; k ( 0,0.2 ] ; b ( 0,1 ) ; x i t represents the position of the i-th dung beetle at the t-th iteration; and x w o r s t t indicates the worst position in the population.
The updated formula for simulating the selection of a safe area for reproduction by female dung beetles is given as follows:
x i + 1 = x g b e s t + b 1 ( x i L b * ) + b 2 | x i U b * | ,
where L b * and U b * denote the lower and upper boundaries of the breeding zone within the safe area, respectively, while x g b e s t represents the optimal position within the population.
When the small dung beetle forages in the optimal feeding area, the foraging position is calculated as follows:
x i + 1 = x i + C 1 ( x i L b ) + C 2 | x i U b | ,
where C 1   and   C 2 are random numbers within the range (0, 1), and L b and U b denote the lower and upper boundaries of the optimal feeding area for the small dung beetle, respectively.
When the dung beetle steals dung balls from others, the position update is calculated as follows:
x i + 1 = x l b e s t + Q g ( | x i x g b e s t | + | x i x l b e s t | ) ,
where x l b e s t represents the optimal position within the region, and Q is a constant.

2.5. PCA–DBO–SVR Model Development Steps

The model is developed using the influencing factors obtained through PCA as input data and the unit cost of each sample project as output data. According to the sample data size of this study, in order to balance the model training adequacy and evaluation reliability, the sample data are split into training and testing sets at a ratio of 7:3, respectively, to train and validate input–output relationships. The DBO is used to optimize the hyperparameters C and g of the SVR model. These optimized parameters are then incorporated into the SVR model. Training continues using the same sample data until the training error approaches zero; at this point, the final SVR prediction model is established. The trained model is saved if the model demonstrates high accuracy and satisfactory performance. The input variables are standardized, and principal component scores are calculated and input into the trained SVR model to predict unit costs, representing the cost indicators obtained through the proposed method [35].
The detailed steps are as shown in Figure 2.

3. Results

3.1. Selection of Influencing Factors Using PCA

A total of 140 sample datasets from electromechanical well and drainage ditch projects were analyzed and processed. Correlation analysis was first applied to evaluate the relationship between influencing factors and cost indicators, and factors with a correlation coefficient of less than 0.5 were excluded.
The influencing factors to obtain the cost index of electrical and mechanical well engineering are the diameter of the borehole (E1), length of the irrigation pipeline (E2), design flow rate (E3), depth of the well (E4), head (E5), on-site construction conditions (E6), consumption of main materials (E7), consumption of machinery (E8), consumption of labor (E9), area of the irrigated area (E10), seismic intensity (E11), meteorological conditions (E12), labor cost (E13), machinery cost (E14), material price (E15), and duration (E16).
Drainage ditch engineering cost indicators include the length of the drainage ditch (D1), the flow of the drainage ditch (D2), the bottom width of the drainage ditch (D3), the height of the drainage ditch (D4), the specific drop (D5), machinery consumption (D6), the consumption of major materials (D7), the construction conditions on the site (D8), seismic intensity (D9), meteorological conditions (D10), the site of the temporary traffic conditions on the site and off (D11), labor cost (D12), labor consumption (D13), material price (D14), machinery cost (D15), and construction duration (D16).
PCA was then used to identify the key influencing factors for cost indicators. Principal component eigenvalues, contribution rates, and eigenvectors were calculated based on Equations (3) and (4). The results are presented in Table 2. The Kaiser–Meyer–Olkin (KMO) value was greater than 0.5, indicating that the data are suitable for PCA [36,37].
Principal components with a cumulative variance contribution rate exceeding 80% and an initial eigenvalue greater than 1 were selected for the next calculation step. As shown in Table 2, the first five components for electromechanical well projects achieved a cumulative contribution rate of 83.422%, with eigenvalues exceeding 1.212, indicating that these five components capture most of the information in the original data, making these five components sufficient to represent the data. The first four components reached a cumulative contribution rate of 95.424% for drainage ditch projects, with eigenvalues exceeding 1.076, which satisfied the principal component determination requirements and made these four components sufficiently representative of the data. Using the factor loadings and score coefficients of the principal components, the comprehensive scores for influencing factors in farmland water conservancy project cost indicators were calculated. As shown in Table 3, the influencing factors with normalized comprehensive score results greater than 0.5 were identified as key factors. Accordingly, the key influencing factors for electromechanical well projects included 15 factors, such as borehole diameter and irrigation pipeline length, while drainage ditch projects identified 13 key factors, including length and flow rate.
Finally, the key influencing factors identified through PCA were used as input variables for the SVR model, with cost indicators as output variables, to predict the cost indicators of farmland water conservancy projects.

3.2. Comparison of Prediction Models

This study compared four models using the same sample database to validate the effectiveness of the PCA–DBO–SVR model in predicting cost indicators for farmland water conservancy projects. The BP neural network, SVR, PCA–SVR, and PCA–DBO–SVR were used.
The BP neural network was configured with 1000 training iterations, a learning rate of 1 × 10−2, a minimum error target of 1 × 10−5, and five hidden layer neurons. The DBO optimization algorithm was applied to tune the hyperparameters of the SVR model. DBO settings included a population size of 10, a maximum of 100 iterations, and boundaries of [−8, 8]. The dataset was divided into 70% training and 30% testing samples.
Four metrics were used to evaluate the prediction accuracy of the models: root mean square error (RMSE), mean absolute error (MAE), mean absolute percentage error (MAPE), and coefficient of determination ( R 2 ). Smaller RMSE, MAE, and MAPE values and R 2 values closer to 1 indicate better model performance, lower error, and more reliable predictions. The formulas for these evaluation metrics are as follows:
R M S E = 1 n i = 1 n y i y ^ i 2 ,
M A E = 1 n i = 1 n   Y i Y i ^ ,
M A P E = 1 n i = 1 n   Y i Y i ^ Y i × 100 % ,
R 2 = 1 i = 1 n   Y i Y i ^ 2 i = 1 n   Y i Y i ¯ 2 ,
where Y is the actual value of the i-th sample, Y   ^ represents the predicted value of the i-th sample, and Y ¯ is the mean of the actual values across n samples.
The actual values in Figure 3 are the test set divided by the sample data collected in Section 2.2 according to the 7:3 ratio. A comparison between the BP neural network model and the SVR model highlights notable differences. The BP neural network forms a complex structure by simulating human brain neurons, allowing for enhanced learning and adaptability through adjustments to the number of neurons and layers. However, when comparing the prediction results for electromechanical well projects and drainage ditch projects, the BP model prediction lines show significantly greater deviation from the actual value lines than those of the SVR model. These results demonstrate that the SVR regression prediction model is well-suited to the sample data and engineering characteristics in this study, greatly improving the prediction accuracy of cost indicators for farmland water conservancy projects. To evaluate the effectiveness of PCA in dimensionality reduction, a PCA–SVR model was constructed for comparison. As shown in Figure 3, the prediction results indicate that the PCA-reduced SVR model achieves smaller distances between the prediction lines and the actual value lines across sample points compared to the SVR model without dimensionality reduction. To further examine the impact of the DBO on parameter optimization, a PCA–DBO–SVR model was developed. The results demonstrate that the PCA–DBO–SVR model achieves the closest alignment between the prediction and actual value lines, with the smallest distances between sample points. In addition, the scatter plot in Figure 4 and the residual plot in Figure 5 show that the PCA–DBO–SVR model is closest to the 1:1 fitting line in terms of scatter and closest to the zero level line in terms of residuals compared to the other models, with no obvious non-stochastic tendency, which suggests that the predicted values of the model are the closest to the actual values and have the highest prediction accuracy. Therefore, the PCA–DBO–SVR model significantly outperforms the standalone BP neural network, SVR, and PCA–SVR models in prediction accuracy and simulation performance.
The evaluation metrics for the BP, SVR, PCA–SVR, and PCA–DBO–SVR models are shown in Table 4. A comparison of the single models demonstrates that the standalone SVR model outperforms the BP neural network in both electromechanical well and drainage ditch projects. Specifically, the SVR model reduces RMSE by 48.39% and 37.99%, MAE by 37.87% and 39.27%, and MAPE by 33.87% and 25.83%, while improving R2 by 2.07% and 2.34%, respectively. These improvements can be attributed to the BP neural network’s tendency to encounter issues such as local optima and longer training times, which the SVR model effectively avoids. Therefore, the SVR model provides superior prediction results to the BP neural network for cost indicator predictions.
To enhance model efficiency and prediction accuracy, PCA was incorporated to construct the PCA–SVR model. Compared to the standalone SVR model, the PCA–SVR model achieves significant improvements. For electromechanical well and drainage ditch projects, RMSE decreases by 11.79% and 28.19%, MAE decreases by 17.86% and 7.98%, MAPE decreases by 22.64% and 34.29%, and R2 increases by 6.16% and 7.09%, respectively. These results indicate that applying PCA to analyze and reduce the dimensionality of SVR input variables eliminates redundancy in the input data, improving the validity of training samples and resulting in higher model accuracy.
With its excellent optimization capabilities, the DBO algorithm addresses the challenge of determining hyperparameters in the SVR model. Compared to the PCA–SVR model, the PCA–DBO–SVR model achieves significant improvements in predictive accuracy. For electromechanical well and drainage ditch projects, the PCA–DBO–SVR model reduces RMSE by 44.78% and 49.26%, MAE by 50.89% and 56.50%, and MAPE by 49.11% and 48.98%, and increases R2 by 3.74% and 5.31%, respectively. These results indicate that the model without an optimization algorithm will have instability in prediction accuracy due to the stochastic nature of hyperparameters, while the DBO algorithm compares with traditional PSO, GA, and other algorithms, as the mechanism of simulating dung beetles’ social behavior of DBO improves the hyperparameter search space coverage and reduces the number of iteration convergences, which indicates that it has stronger robustness and optimality searching performance in high-dimensional parameter space, and can further improve the model’s prediction accuracy.
In summary, the PCA–DBO–SVR model outperforms standalone SVR and BP neural network models in all evaluation metrics, demonstrating enhanced efficiency and effectiveness by integrating PCA and DBO. The inclusion of DBO eliminates the subjectivity in parameter selection for SVR, resulting in significantly better predictive accuracy and generalization capability than single SVR or PCA–SVR models. Moreover, the R2 values for the PCA–DBO–SVR model exceed 0.9 in all cases, further highlighting its stability and superior performance in predicting farmland water conservancy project cost indicators. Given the lack of comprehensive cost indicator data for such projects, the PCA–DBO–SVR model provides robust data support and a theoretical foundation for prediction tasks.
This study employs a combination of PCA, DBO, and SVR to predict cost indicators. The results demonstrate the high applicability and accuracy of this model, offering critical support for the dynamic adjustment of investment standards, enhanced planning of construction funding, and improved cost management in farmland water conservancy projects.

4. Discussion

Using prediction models as a novel approach for compiling cost indicators provides inspiration to explore predictive methods in other fields. Identifying methods aligned with the characteristics of farmland water conservancy project cost indicators can offer valuable insights for adaptation and application. Considering the numerous factors influencing project cost indicators, incorporating all variables into machine learning prediction models often leads to excessive computation time and reduced accuracy. Factor selection is essential. Zhang et al. [38] utilized LASSO regression to compile highway project cost indicators. Their method involved quantitatively identifying key cost drivers, selecting highly correlated indicators as input variables, and inputting these variables into the LASSO regression model to calculate comprehensive cost indicators. The quantification and selection of influencing factors significantly improved the model’s efficiency and accuracy. Similarly, Elmousalami et al. [27] analyzed the dynamics and patterns of cost information in engineering projects, identifying major qualitative factors influencing project costs. These factors included project characteristics, construction location, and project start/completion times. Drawing on these insights, the current study established a system of influencing factors for farmland water conservancy project cost indicators. The system encompasses engineering characteristics, geographic environment, and management factors. These factors were quantified, and PCA was used to identify key influencing factors with normalized comprehensive score results above 0.5. Experimental results demonstrate that prediction models processed using PCA achieve significantly higher accuracy.
In the selection of prediction models, Zhou et al. [39] constructed artificial neural network (ANN), K-nearest neighbor (KNN), random forest (RF), and SVR models to predict operating expenses for U.S. light rail transit (LRT) projects. Using data from 22 LRT systems in the United States from 2008 to 2018, the performance of these models was validated and analyzed. The results showed that the SVR model outperformed the other three models in prediction accuracy. Based on these findings, the SVR model was selected as the cost indicator prediction model for this study.
Mahmoodzadeh et al. [40] applied the grey wolf optimizer (GWO) algorithm to optimize the parameters of an SVR model for predicting tunnel construction costs. The optimized model significantly improved prediction accuracy, with a notable reduction in mean squared error compared to linear and standalone SVR models. These findings demonstrated the enhanced predictive performance and feasibility of combining optimization algorithms with SVR. However, the GWO algorithm and other traditional intelligent optimization algorithms often face issues such as susceptibility to local optima, low convergence accuracy, and weak theoretical foundations. Wu et al. [34] employed the DBO algorithm to optimize SVR parameters, resulting in a model with high prediction accuracy and strong fitting ability, with prediction errors consistently below 1%. Based on these findings, the present study adopts the DBO algorithm for hyperparameter optimization in the SVR model, achieving a precise and effective improvement in prediction accuracy.
The PCA–DBO–SVR model constructed in this study can predict the differentiated cost indices of farmland water conservancy projects by inputting factors such as different regions and different engineering characteristics, which provides basic data and theoretical support for the accurate prediction of project investment in the absence of budget quotas for farmland water conservancy projects at this stage. At the same time, in the feasibility study stage, the key influencing factors in the project can be input into the prediction model to output the predicted value of cost indices, which can assist in the optimization of the design scheme, and accordingly, project managers can reasonably formulate the capital investment and arrangement during the construction period. Currently, this study focuses on the cost prediction of farmland water conservancy projects in Liaoning Province and verifies the applicability to electromechanical wells and drainage ditches, but it lacks research on other provinces and does not yet cover water conservancy projects such as sluices, bridges, and culverts. In the future, we plan to expand the dataset to cover the sample range of different geographic regions and increase the diversity of project types to further improve the generalization and reasonableness of the model and provide theoretical support for the dynamic adjustment of the national investment standard.

5. Conclusions

To address the lack of unified standards and low prediction accuracy for farmland water conservancy project cost indicators, this study focused on these projects and proposed a prediction method based on the PCA–DBO–SVR model. PCA was used to identify key influencing factors, and the resulting model demonstrated high prediction accuracy and strong generalization capability. The main conclusions are as follows:
(1)
Based on the characteristics of farmland water conservancy projects, influencing factors were selected from engineering features, environmental conditions, and management aspects. Principal component analysis was used to identify key factors and establish an influencing factor system for cost indicators, providing important support for constructing prediction models.
(2)
The proposed PCA–DBO–SVR prediction model was compared with other models using case studies. Results showed that the maximum absolute relative error was only 3.732%, demonstrating the highest prediction accuracy and best-fitting performance. Compared to BP, SVR, and PCA–SVR models, the PCA–DBO–SVR model achieved the lowest error metrics. For electromechanical well and drainage ditch projects, RMSE values were 1.116 million CNY and 0.500 million CNY, MAE values were 0.910 million CNY and 0.281 million CNY, and R2 values were 0.962 and 0.923, respectively. The PCA–DBO–SVR model combines the advantage of factor selection from PCA with the high prediction accuracy of DBO–SVR, delivering optimal predictive performance. It provides a robust scientific foundation and theoretical support for cost indicator prediction in farmland water conservancy projects.

Author Contributions

Conceptualization, X.L.; methodology, X.L.; software, X.L.; validation, K.H., X.X., and Y.Y.; formal analysis, X.L.; investigation, X.X. and Y.Y.; resources, W.L., T.W., C.L., and C.H.; data curation, X.L., T.W., and C.L.; writing—original draft preparation, X.L.; writing—review and editing, W.L., B.Y., X.X., and Y.Y.; visualization, X.L.; supervision, K.H., C.L., and C.H.; project administration, C.L.; funding acquisition, C.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Liaoning Provincial Local Standards Development Plan Project (2024223, 2024224, 2024225).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Some or all of the data, models, or codes that support the findings of this study are available from the first author, Xuenan Li (email: 2023240188@stu.syau.edu.cn), upon reasonable request.

Acknowledgments

This work has received support from Shenyang Agricultural University and the Liaoning Provincial Local Standard Project. We sincerely appreciate your support. The authors express their gratitude to the editor and reviewers for their suggestions for improvement.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
BPBack-propagation
SVMSupport vector machine
LSSVMLeast-squares support vector machine
PCAPrincipal component analysis
PSOParticle swarm optimization
DEDifferential evolution
DBODung beetle optimizer
GRAGrey relational analysis
SVRSupport vector regression
GAGenetic algorithm
LASSOLeast absolute shrinkage and selection operator
GWOGrey wolf optimization

References

  1. Zaimes, G.N.; Kiosses, C. Experts views on water scarcity and flooding from six countries around the Black Sea region. Desalination Water Treat. 2021, 216, 118–128. [Google Scholar] [CrossRef]
  2. Zaimes, G.N.; Emmanouloudis, D. Sustainable Management of the Freshwater Resourceso f Greece. J. Eng. Sci. Technol. Rev. 2012, 5, 77–82. [Google Scholar] [CrossRef]
  3. Ottaviani, F.M.; Marco, A.D. Multiple Linear Regression Model for Improved Project Cost Forecasting. Procedia Comput. Sci. 2022, 196, 808–815. [Google Scholar] [CrossRef]
  4. Zeng, B.; Ma, X.; Zhou, M. A new-structure grey Verhulst model for China’s tight gas production forecasting. Appl. Soft Comput. 2020, 96, 106600. [Google Scholar] [CrossRef]
  5. Zhao, X.-h.; Miao, X.-j.; Zhang, Z.-g.; Hao, Z. Research on Prediction Method of Reasonable Cost Level of Transmission Line Project Based on PCA-LSSVM-KDE. Math. Probl. Eng. 2019, 2019, 1649086. [Google Scholar] [CrossRef]
  6. Zeng, Y.; Chen, H. An Improved Convolution Neural Network-Based Fast Estimation Method for Construction Project Cost. J. Circuits Syst. Comput. 2023, 33, 2450136. [Google Scholar] [CrossRef]
  7. Han, K.; Wang, W.; Huang, X.; Li, P.; Li, C.; Zheng, J. Predicting the construction cost of high standard farmland irrigation projects using NGO-CNN-SVM. Trans. Chin. Soc. Agric. Eng. (Trans. CSAE) 2024, 40, 62–72. [Google Scholar] [CrossRef]
  8. Hayati, D.W.; Chen, J.-H.; Chen, Y.-C.; Li, S.; Machsus, M.; Khoiri, M.; Wang, Q.-C.; Wei, H.-H. Smart cost estimation: Empirical case for extra-high voltage transmission towers. Heliyon 2024, 10, e31466. [Google Scholar] [CrossRef]
  9. Feng, R.; Wencheng, L. LSSA-BP-based cost forecasting for onshore wind power. Energy Rep. 2023, 9, 362–370. [Google Scholar] [CrossRef]
  10. Wu, Z.; Liu, M.; Ma, G. A machine learning-based two-stage integrated framework for cost reasonableness prediction of green building projects. J. Build. Eng. 2025, 100, 111733. [Google Scholar] [CrossRef]
  11. Pham, T.Q.D.; Le-Hong, T.; Tran, X.V. Efficient estimation and optimization of building costs using machine learning. Int. J. Constr. Manag. 2023, 23, 909–921. [Google Scholar] [CrossRef]
  12. Wyke, S.; Lindhard, S.M.; Larsen, J.K. Using principal component analysis to identify latent factors affecting cost and time overrun in public construction projects. Eng. Constr. Archit. Manag. 2024, 31, 2415–2436. [Google Scholar] [CrossRef]
  13. Yata, K.; Aoshima, M. Effective PCA for high-dimension, low-sample-size data with noise reduction via geometric representations. J. Multivar. Anal. 2012, 105, 193–215. [Google Scholar] [CrossRef]
  14. Yang, G.; Xu, Y.; Huo, L.; Guo, D.; Wang, J.; Xia, S.; Liu, Y.; Liu, Q. Genetic algorithm optimized back propagation artificial neural network for a study on a wastewater treatment facility cost model. Desalination Water Treat. 2023, 282, 96–106. [Google Scholar] [CrossRef]
  15. Yan, D.; Lin, J.; Chen, Y.; Yang, X.; Lu, Y.; Song, J. High-Efficiency Water Collection of Superhydrophobic Condensation Absorber. Adv. Sci. 2025, 2417024. [Google Scholar] [CrossRef]
  16. Yan, D.; Lu, Y.; Lin, J.; Li, W.; Song, J. Enhancing water transportation capacity by asymmetrical patterned surface with super-wettability. Appl. Phys. Lett. 2024, 125, 071601. [Google Scholar] [CrossRef]
  17. Lin, T.; Yi, T.; Zhang, C.; Liu, J. Intelligent Prediction of the Construction Cost of Substation Projects Using Support Vector Machine Optimized by Particle Swarm Optimization. Math. Probl. Eng. 2019, 2019, 7631362. [Google Scholar] [CrossRef]
  18. Cheng, M.-Y.; Hoang, N.-D.; Wu, Y.-W. Hybrid intelligence approach based on LS-SVM and Differential Evolution for construction cost index estimation: A Taiwan case study. Autom. Constr. 2013, 35, 306–313. [Google Scholar] [CrossRef]
  19. Ai, D.; Yang, J. A machine learning approach for cost prediction analysis in environmental governance engineering. Neural Comput. Appl. 2019, 31, 8195–8203. [Google Scholar] [CrossRef]
  20. Zhang, R.; Zhu, Y.; Liu, Z.; Feng, G.; Diao, P.; Wang, H.; Fu, S.; Lv, S.; Zhang, C. A Back Propagation Neural Network Model for Postharvest Blueberry Shelf-Life Prediction Based on Feature Selection and Dung Beetle Optimizer. Agriculture 2023, 13, 1784. [Google Scholar] [CrossRef]
  21. Duan, J.; Gong, Y.; Luo, J.; Zhao, Z. Air-quality prediction based on the ARIMA-CNN-LSTM combination model optimized by dung beetle optimizer. Sci. Rep. 2023, 13, 12127. [Google Scholar] [CrossRef] [PubMed]
  22. Cheng, M.-Y.; Vu, Q.-T.; Gosal, F.E. Hybrid deep learning model for accurate cost and schedule estimation in construction projects using sequential and non-sequential data. Autom. Constr. 2025, 170, 105904. [Google Scholar] [CrossRef]
  23. Mahmoodzadeh, A.; Nejati, H.R.; Mohammadi, M.; Hashim Ibrahim, H.; Khishe, M.; Rashidi, S.; Hussein Mohammed, A. Developing six hybrid machine learning models based on gaussian process regression and meta-heuristic optimization algorithms for prediction of duration and cost of road tunnels construction. Tunn. Undergr. Space Technol. 2022, 130, 104759. [Google Scholar] [CrossRef]
  24. Butchers, J.; Williamson, S.; Booker, J.; Maitland, T.; Karki, P.B.; Pradhan, B.R.; Pradhan, S.R.; Gautam, B. Cost estimation of micro-hydropower equipment in Nepal. Dev. Eng. 2022, 7, 100097. [Google Scholar] [CrossRef]
  25. Du, L. Analysis on Safeguard Measures of High-standard Farmland Water Conservancy Project Construction in the Era of Big Data. Comput. Informatiz. Mech. Syst. 2023, 6, 10–15. [Google Scholar]
  26. Pu, L.; Zhang, S.; Yang, J.; Yan, F.; Chang, L. Assessment of High-standard Farmland Construction Effectiveness in Liaoning Province During 2011–2015. Chin. Geogr. Sci. 2019, 29, 667–678. [Google Scholar] [CrossRef]
  27. Elmousalami, H.H.; Elshaboury, N.; Ibrahim, A.H.; Elyamany, A.H. Bayesian Optimized Ensemble Learning System for Predicting Conceptual Cost and Construction Duration of Irrigation Improvement Systems. KSCE J. Civ. Eng. 2024, 29, 100014. [Google Scholar] [CrossRef]
  28. Kannazarova, Z.; Juliev, M.; Abuduwaili, J.; Muratov, A.; Bekchanov, F. Drainage in irrigated agriculture: Bibliometric analysis for the period of 2017–2021. Agric. Water Manag. 2024, 305, 109118. [Google Scholar] [CrossRef]
  29. Fariz, T.K.N.; Basha, S.S. Enhancing solar radiation predictions through COA optimized neural networks and PCA dimensionality reduction. Energy Rep. 2024, 12, 341–359. [Google Scholar] [CrossRef]
  30. Li, H.; Tan, Y.; Zeng, D.; Su, D.; Qiao, S. Attitude-Predictive Control of Large-Diameter Shield Tunneling: PCA-SVR Machine Learning Algorithm Application in a Case Study of the Zhuhai Xingye Express Tunnel. Appl. Sci. 2025, 15, 1880. [Google Scholar] [CrossRef]
  31. Zhou, J.; Lu, Y.; Tian, Q.; Liu, H.; Hasanipanah, M.; Huang, J. Advanced Machine Learning Methods for Prediction of Blast-Induced Flyrock Using Hybrid SVR Methods. CMES—Comput. Model. Eng. Sci. 2024, 140, 1595–1617. [Google Scholar] [CrossRef]
  32. Fan, C.; Zheng, Y.; Wang, S.; Ma, J. Prediction of bond strength of reinforced concrete structures based on feature selection and GWO-SVR model. Constr. Build. Mater. 2023, 400, 132602. [Google Scholar] [CrossRef]
  33. Xue, J.; Shen, B. Dung beetle optimizer: A new meta-heuristic algorithm for global optimization. J. Supercomput. 2023, 79, 7305–7336. [Google Scholar] [CrossRef]
  34. Wu, C.; Fu, J.; Huang, X.; Xu, X.; Meng, J. Lithium-Ion Battery Health State Prediction Based on VMD and DBO-SVR. Energies 2023, 16, 3993. [Google Scholar] [CrossRef]
  35. Zhong, K.; Li, Y.; Huan, W.; Weng, X.; Wu, B.; Chen, Z.; Liang, H.; Feng, H. A novel near infrared spectroscopy analytical strategy for soil nutrients detection based on the DBO-SVR method. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2024, 315, 124259. [Google Scholar] [CrossRef]
  36. Qiu, T.; Cheng, S.; Zhang, D.; Du, X. PCA-TANN with model-based transfer learning for predicting blast load time series on structures. Process Saf. Environ. Prot. 2024, 187, 1161–1177. [Google Scholar] [CrossRef]
  37. Zhang, Y.; Ma, N.; Gu, X.; Shi, Q. Geometric space construction method combined of a spline-skinning based geometric variation method and PCA dimensionality reduction for ship hull form optimization. Ocean Eng. 2024, 302, 117604. [Google Scholar] [CrossRef]
  38. Zhang, Y.; Minchin, R.E.; Agdas, D. Forecasting Completed Cost of Highway Construction Projects Using LASSO Regularized Regression. J. Constr. Eng. Manag. 2017, 143, 04017071. [Google Scholar] [CrossRef]
  39. Zhou, G.; Etemadi, A.; Mardon, A. Machine learning-based cost predictive model for better operating expenditure estimations of U.S. light rail transit projects. J. Public Transp. 2022, 24, 100031. [Google Scholar] [CrossRef]
  40. Mahmoodzadeh, A.; Nejati, H.R.; Mohammadi, M. Optimized machine learning modelling for predicting the construction cost and duration of tunnelling projects. Autom. Constr. 2022, 139, 104305. [Google Scholar] [CrossRef]
Figure 1. Geographic map of Liaoning Province.
Figure 1. Geographic map of Liaoning Province.
Sustainability 17 02702 g001
Figure 2. PCA–DBO–SVR model prediction flow chart.
Figure 2. PCA–DBO–SVR model prediction flow chart.
Sustainability 17 02702 g002
Figure 3. Comparison of predicted results. (a) Electrical and mechanical shaft engineering. (b) Drainage ditch engineering.
Figure 3. Comparison of predicted results. (a) Electrical and mechanical shaft engineering. (b) Drainage ditch engineering.
Sustainability 17 02702 g003
Figure 4. Scatterplot of predicted versus actual values in the test set. (a) Electrical and mechanical shaft engineering. (b) Drainage ditch engineering.
Figure 4. Scatterplot of predicted versus actual values in the test set. (a) Electrical and mechanical shaft engineering. (b) Drainage ditch engineering.
Sustainability 17 02702 g004
Figure 5. Plot of residuals between predicted and actual values in the test set. (a) Electrical and mechanical shaft engineering. (b) Drainage ditch engineering.
Figure 5. Plot of residuals between predicted and actual values in the test set. (a) Electrical and mechanical shaft engineering. (b) Drainage ditch engineering.
Sustainability 17 02702 g005
Table 1. Summary of influencing factors for farmland water conservancy project cost indicators.
Table 1. Summary of influencing factors for farmland water conservancy project cost indicators.
CategoryInfluencing FactorsImpact Factor Subcomponent
Geographic environmental factorsGeographic locationDifferences in project locations lead to variations in economic development levels, address conditions, and design or construction complexity, all of which significantly impact project costs.
Topography and geomorphology
Geological structure
Seismic intensity
On-site and off-site temporary traffic
On-site construction conditions
Consumption factorsLabor consumptionDifferences in materials used and construction complexity influence project costs. Subcomponents of farmland water conservancy projects involve various quantity indicators, reflecting the scale of installation and construction, which subsequently affect cost indicators.
Machinery consumption
Key material consumption
Price factorsLabor costsPrice factors reflect the impact of market economic fluctuations on project costs. Changes in material and machinery prices and labor and other costs contribute to project cost variations.
Material prices
Machinery costs
Other costs
Management factorsContractor management levelManagement factors and construction periods influence on-site construction, affecting project costs.
Supervision management level
Construction duration (days)
Electrical and mechanical well engineeringBorehole diameter (m)Engineering factors related to structural design significantly affect resource estimation, playing a critical role in the overall construction process.
Irrigation pipeline length (m)
Designed flow rate/(m3·s−1)
Well depth/m
Pumping head/m
Irrigation area/m2
Drainage ditch engineeringDrainage ditch length/m
Drainage flow rate/(m3·s−1)
Drainage ditch bottom width/m
Gradient
Table 2. Principal component eigenvalues, contribution rates, and eigenvectors.
Table 2. Principal component eigenvalues, contribution rates, and eigenvectors.
Engineering
Category
No.EigenvaluePercentage/%Cumulative Percentage/%Factor 1Factor 2Factor 3Factor 4Factor 5
Electrical and mechanical well engineering16.31339.45839.458−0.1250.3950.1520.791−0.134
22.70316.89556.3530.8510.017−0.2350.110−0.311
31.75810.98767.3400.7140.010−0.1270.179−0.247
41.3658.52875.8690.6050.6950.031−0.014−0.235
51.2127.57483.4420.5700.631−0.0390.250−0.143
60.7424.64188.083−0.103−0.157−0.629−0.293−0.127
70.6383.98892.0710.763−0.0870.346−0.161−0.266
80.5153.21995.2900.890−0.2640.219−0.174−0.052
90.3292.05997.3490.560−0.1960.580−0.202−0.141
100.1540.96498.313−0.1650.6920.289−0.4150.295
110.0960.60098.9130.042−0.4490.3530.4980.501
120.0900.56499.4770.1170.8250.206−0.1510.361
130.0510.32099.7960.870−0.022−0.3210.0410.336
140.0270.17099.9660.840−0.011−0.3320.0350.392
150.0050.02999.9950.881−0.046−0.2920.0110.337
160.0010.005100.0000.627−0.4010.461−0.0680.048
Drainage ditch engineering17.68848.05248.052−0.0160.9440.266−0.051
23.97624.85072.903−0.1670.5070.579−0.410
32.52715.79588.6980.4890.2500.809−0.132
41.0766.72695.424−0.002−0.1260.931−0.006
50.4332.70398.127−0.792−0.065−0.276−0.277
60.2071.29599.422−0.0240.898−0.2010.368
70.0640.40299.824−0.1390.933−0.2720.119
80.0260.16199.9850.989−0.0420.0170.129
90.0020.015100.0000.942−0.1240.072−0.295
100.0000.000100.000−0.9420.124−0.0720.295
110.0000.000100.0000.1690.179−0.1220.949
120.0000.000100.0000.995−0.0720.037−0.018
130.0000.000100.0000.989−0.0420.0170.129
140.0000.000100.0000.971−0.0200.0020.231
150.0000.000100.0000.951−0.005−0.0080.300
160.0000.000100.000−0.0160.9440.265−0.051
Table 3. Scores of each principal component and composite scores.
Table 3. Scores of each principal component and composite scores.
Engineering CategoryInfluencing FactorsScore Coefficients on Each Principal ComponentFactor 4Factor 5
12345
Electrical and mechanical well engineeringE1−0.1250.3950.1520.791−0.1340.0980.717
E20.8510.017−0.2350.110−0.3110.1230.792
E30.7140.010−0.1270.179−0.2470.1180.778
E40.6050.6950.031−0.014−0.2350.1820.972
E50.5700.631−0.0390.250−0.1430.1911.000
E6−0.103−0.157−0.629−0.293−0.127−0.1370.000
E70.763−0.0870.346−0.161−0.2660.1310.817
E80.890−0.2640.219−0.174−0.0520.1370.836
E90.560−0.1960.580−0.202−0.1410.1100.751
E10−0.1650.6920.289−0.4150.2950.0710.634
E110.042−0.4490.3530.4980.5010.0730.639
E120.1170.8250.206−0.1510.3610.1610.907
E130.870−0.022−0.3210.0410.3360.1600.906
E140.840−0.011−0.3320.0350.3920.1590.902
E150.881−0.046−0.2920.0110.3370.1600.905
E160.627−0.4010.461−0.0680.0480.1120.760
Drainage ditch engineeringD1−0.0860.9660.136−0.085 0.1190.721
D2−0.1790.5650.656−0.080 0.1040.686
D30.5410.4410.6610.225 0.2401.000
D40.1180.0860.7530.543 0.1480.788
D5−0.825−0.2070.003−0.244 −0.1930.000
D6−0.1120.861−0.479−0.005 0.0420.543
D7−0.2560.844−0.382−0.231 0.0080.465
D80.9900.043−0.119−0.025 0.1710.842
D90.928−0.0730.192−0.305 0.1580.811
D10−0.9280.073−0.1920.305 −0.1580.080
D110.2040.259−0.6960.619 0.0400.539
D120.9910.003−0.012−0.124 0.1710.840
D130.9900.043−0.119−0.025 0.1710.842
D140.9750.070−0.1930.046 0.1690.837
D150.9590.089−0.2430.095 0.1670.831
D16−0.0860.9660.135−0.085 0.1190.720
Table 4. Comparison of assessment metrics across models.
Table 4. Comparison of assessment metrics across models.
Engineering CategoryModelRMSE (10,000 CNY)MAE (10,000 CNY)MAPE (%)R2
Electrical and mechanical well engineeringBP4.4393.63112.526%0.851
SVR2.2912.2568.283%0.869
PCA–SVR2.0211.8536.408%0.926
PCA–DBO–SVR1.1160.9103.261%0.962
Drainage ditch engineeringBP2.2141.15615.010%0.793
SVR1.3730.70211.133%0.812
PCA–SVR0.9860.6467.315%0.874
PCA–DBO–SVR0.5000.2813.732%0.923
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, X.; Han, K.; Liu, W.; Wang, T.; Li, C.; Yan, B.; Hao, C.; Xian, X.; Yang, Y. Prediction Model of Farmland Water Conservancy Project Cost Index Based on PCA–DBO–SVR. Sustainability 2025, 17, 2702. https://doi.org/10.3390/su17062702

AMA Style

Li X, Han K, Liu W, Wang T, Li C, Yan B, Hao C, Xian X, Yang Y. Prediction Model of Farmland Water Conservancy Project Cost Index Based on PCA–DBO–SVR. Sustainability. 2025; 17(6):2702. https://doi.org/10.3390/su17062702

Chicago/Turabian Style

Li, Xuenan, Kun Han, Wenhe Liu, Tieliang Wang, Chunsheng Li, Bin Yan, Congming Hao, Xiaochen Xian, and Yingying Yang. 2025. "Prediction Model of Farmland Water Conservancy Project Cost Index Based on PCA–DBO–SVR" Sustainability 17, no. 6: 2702. https://doi.org/10.3390/su17062702

APA Style

Li, X., Han, K., Liu, W., Wang, T., Li, C., Yan, B., Hao, C., Xian, X., & Yang, Y. (2025). Prediction Model of Farmland Water Conservancy Project Cost Index Based on PCA–DBO–SVR. Sustainability, 17(6), 2702. https://doi.org/10.3390/su17062702

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop