Next Article in Journal
Roasting Extract of Handroanthus impetiginosus Enhances Its Anticancer Activity in A549 Lung Cancer Cells and Improves Its Antioxidant and Anti-Inflammatory Effects in Normal Cells
Next Article in Special Issue
Experimental Study on the Relationship between Time-Varying Uplift Displacement and Grout Diffusion in Sand
Previous Article in Journal
Deep Learning-Based Consistent Object Detection in Distance on Highly Perspective Region
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Examination of Determinants and Predictive Modeling of Artificially Frozen Soil Strength Utilizing the XGBoost Algorithm

1
Department of Civil Engineering and Smart Cities, Shantou University, Shantou 515063, China
2
China Railway 14th Bureau Group Second Engineering Corporation Limited, Taian 271000, China
3
School of Civil Engineering, Southeast University, Nanjing 211189, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2023, 13(24), 13170; https://doi.org/10.3390/app132413170
Submission received: 10 November 2023 / Revised: 30 November 2023 / Accepted: 7 December 2023 / Published: 12 December 2023
(This article belongs to the Special Issue Recent Advances in Soft Soil Engineering)

Abstract

:
A freezing method is usually employed in the construction of metro links. Unconfined compressive strength (UCS) is a pivotal mechanical parameter in freezing design. Due to the limitations of indoor experiments and the complexity of influencing factors, the applicability of empirical strength formulas is poor. This study predicts the strength of frozen soil with different particle size distributions based on the highly integrated XGBoost algorithm. Compared with other empirical formula methods, the accuracy is high. Through the analysis of Pearson’s correlation coefficient results, further analysis is needed on the nonlinear correlation between the temperature, the strain rate, and the unconfined compressive strength of frozen soil. The results indicated a strong negative correlation between temperature and unconfined compressive strength; the strength initially increased at a faster rate, slowed down during the intermediate phase, and again increased at a faster rate toward the end. There was a positive correlation between the strain rate and the unconfined compressive strength, with the strength exhibiting varying sensitivities to different sizes of strain rates. When the strain rate was relatively small, the strength increased slightly; as the strain rate increased, the strength increased more significantly. Different soils showed similar trends, but differences in the particle size distribution resulted in variations in the final strength. This study can provide a scientific basis for predicting the strength of soil bodies in the freeze–thaw construction of subway connection tunnels.

1. Introduction

Saturated permafrost is primarily composed of a solid soil matrix, unfrozen water, and ice, resulting in significant differences in physical properties compared with conventional soils. The artificial freezing method utilizes refrigeration technology to transform regular soil into permafrost, effectively isolating groundwater from underground structures. This technique has found wide application in the construction of underground passages and tunnels [1,2,3]. Simultaneously, numerous infrastructure projects are underway in permanently frozen soil regions [4,5,6]. Accidents resulting from insufficient freezing strength have occurred frequently during engineering construction, drawing considerable attention from scholars [7]. Unconfined compressive strength (UCS), as a vital indicator reflecting the physical-mechanical properties of soil in engineering, finds extensive application across various fields [8,9,10]. It commonly serves as a basis for studying the reinforcement level of soil after freezing.
Numerous scholars have conducted indoor experiments on the unconfined compressive strength of frozen soil. Jiang Zihua [11] and others studied the variation in the uniaxial compressive strength of remodeled frozen soil under different sand contents and moisture levels, but they overlooked the influence of temperature on strength. Wang et al. [12] conducted experiments on the unconfined compressive strength of different types of in situ frozen soil, considering various soils. Yang et al. [13] investigated the impact of high moisture content on the mechanical properties of frozen silty soil. Bai et al. [14] conducted a series of repetitive unilateral freezing tests on an unsaturated silty clay soil under uniform initial and boundary conditions to reveal changes in temperature, moisture content, and freezing expansion. More researchers have focused on studying the influence of external factors on the unconfined compressive strength of frozen soil. Among these factors, temperature is the most crucial determinant of the unfrozen water content, significantly affecting the degree of soil freezing [15,16]. Ma Qinyong [17], based on the experimental results of uniaxial compressive strength of remodeled frozen soil, concluded that the unconfined compressive strength of frozen soil increases with decreasing temperature. However, limited by experimental conditions, more results under different temperatures were not provided. Li Haipeng et al. [18] studied the trend of strength changes with different temperatures for saturated frozen clay at different dry densities and constant strain rates, deriving empirical formulas. However, the complexity of the formulas arises from the consideration of different parameters for different soil categories. Du et al. [19] investigated the strength characteristics of frozen sandy silt under different strain loading rates. Chen Xin et al. [20] further explored the influence of loading rate on the strength of frozen improved soil, taking into account variations in particle size. The strain loading rate also has a significant impact on strength [21,22], with varying loading rates used in experimental studies by different researchers [23,24], resulting in noticeable differences in test results. Therefore, it is essential to involve strain rates and other factors collectively in the data analysis process across different studies.
In previous studies, numerous empirical formulas for permafrost strength have been derived; however, these formulas are limited to specific experimental soils, and the extrapolation capability of empirical formulas has not been thoroughly investigated. There is a lack of universality in the study of permafrost strength characteristics. The extreme gradient boosting (XGB) algorithm [25], based on gradient boosting trees, is renowned for its high predictive accuracy and strong generalization capabilities. It has been widely applied in various domains, including land subsidence [26,27], soil strength [28,29], and slope stability [30,31]. This study, by collecting a substantial amount of existing research data, employed the XGB algorithm to establish a predictive model for the unconfined compressive strength of frozen soil. The application of this model in data analysis explored the relationship between the unconfined compressive strength of frozen soil under different particle size distributions and temperature and strain rates, yielding insights into strength properties applicable to common permafrost.

2. Data Preprocessing

2.1. Data Preprocessing

In this investigation, a comprehensive database was constructed by compiling soil physical properties and unconfined compressive strength test data from the relevant literature sources [12,17,23,32,33,34,35]. These data are not randomly generated but rather obtained from rigorous experimental processes, ensuring the accuracy and reliability of the information. During the data collection process, with the unconfined compressive strength of frozen soil as the core physical indicator, the literature related to temperature variations and strain rates was gathered, and the raw data along with environmental information were documented. Utilizing these data for further analysis can lead to more reliable results. The collected data were utilized to develop a strength prediction model using the XGB algorithm.
If the extraction of data from images using Origin software (2018C SR1 Version) is required, it is crucial to select clearly discernible data points to minimize potential errors during the data extraction process. The key factors taken into account for the strength prediction model development based on the XGB algorithm included the grain size distribution, dry density, initial moisture content, temperature, and strain loading rate.
To ensure a high prediction accuracy and generalization ability of the XGB prediction model, preprocessing of the parameters was necessary. A total of 487 datasets were compiled from the results of unconfined compressive strength tests reported in the literature [12,17,18,23,32,33,34,35,36,37,38,39]. Since the XGB algorithm utilizes gradient descent to search for the optimal solution, it can face difficulties in convergence when there is a significant difference in magnitude among different input variables. To address this issue, the data were standardized. Normalization helps eliminate scale differences between features, ensuring that the model’s impact on each feature is relatively balanced and facilitating faster convergence of the algorithm [40]. Standardization transformed the data into a normal distribution with a mean of 0 and a standard deviation of 1, as represented by Equation (1).
The purpose of dividing the dataset into training and testing sets was to assess the model’s performance on unseen data. The training set was utilized for training the model parameters, while the testing set was employed to evaluate the model’s ability to generalize to new data, i.e., whether the model could effectively adapt to unseen samples. The choice of the partition ratio involved a trade-off. If the training set was too small, the model may not learn sufficiently complex patterns, leading to underfitting. Conversely, if the testing set was too small, the evaluation results may lack reliability. In common practice, a widely accepted and used partition ratio for machine learning is 70% for training and 30% for testing. Therefore, this widely agreed-upon ratio was adopted in this paper as well. A random selection of 70% of the samples (341 groups) was used for training, while 30% (146 groups) were used for testing. The overall distribution of the data is illustrated in Table 1.
x = x μ σ
where x is the sample value, µ is the sample mean, and σ is the sample standard deviation.

2.2. The XGB Algorithm

2.2.1. Principle of XGB

The gradient boosted decision tree (GBDT) algorithm employs gradient descent to iteratively create new trees, aiming to minimize the objective function [41]. The XGB algorithm is derived from GBDT and is widely applied to classification and regression tasks. It is a potent machine learning algorithm, characterized by notable advantages such as high predictive performance, robust modeling capabilities for complex relationships, and excellent performance on large-scale datasets [42,43,44].
XGB possesses an inherent advantage over models like neural networks in terms of mitigating overfitting. This advantage stems from the algorithm’s built-in regularization terms and pruning mechanisms, allowing for effective control of model complexity. This characteristic makes XGB particularly adept at handling relatively small datasets or situations with high data noise, contributing to its widespread adoption in various practical applications. By incorporating regression trees, the model continuously learns feature partitions for the data at each internal node. It generates a new classification and regression tree (CART) based on the residuals of the previous model, keeps track of the total number of trees in the training model, and assigns weights to the predictions from each tree to obtain the final prediction value. In this study, Python 3.9 was utilized to construct the XGB model for data training.
As an integrated algorithm, XGB combines multiple weak learners to produce the final learning outcome. During the processing of the objective function, a regularization term is introduced to decrease the model’s variance and prevent overfitting. Additionally, XGB employs a second-order Taylor expansion to enhance the convergence speed and improve the prediction accuracy. The predicted value for the i-th sample at the t-th iteration is given by the following equation:
y ^ i ( t ) = y ^ i ( t 1 ) + f t ( x i )
where y ^ i ( t ) is the predicted value of sample i after the t-th iteration, y ^ i ( t 1 ) is the predicted value of the t−1st tree, f t ( x i ) is the model of the t-th tree, and xi is the input value.
The XGB objective function form is shown in Equation (3):
O b j ( t ) = i = 1 n l ( y i , y ^ i ( t ) ) + Ω ( f t ) + c
where l ( y i , y ^ i ( t ) ) is the loss function of the model, yi is the true value of the i-th sample, n is the total number of samples, Ω ( f t ) is the regularization term, and c is a constant term. The specific form of the regularization term is
Ω ( f t ) = γ T + 1 2 λ | | ω | | 2
where γ and λ are penalty coefficients, T is the number of leaves, and ω is the weight of the leaf node.
The second-order Taylor approximation is expanded for the objective function, omitting higher-order infinitesimal and constant terms:
O b j ( t ) = i = 1 n [ l ( y i , y ^ i ( t ) ) + g i f t ( x i ) + 1 2 h i f t 2 ( x i ) ] + Ω ( f t ) + c
where gi and hi are Taylor expansion constant terms.
The simplified objective function is obtained by removing the constant term for the t-th round of iterations:
O b j ( t ) = i = 1 n [ g i f t ( x i ) + 1 2 h i f t 2 ( x i ) ] + γ T + 1 2 λ j = 1 T ω j 2
Simplify the objective function by making H j = i I j h i and G j = i I j g i :
O b j ( t ) = i = 1 n [ G i f t ( x i ) + 1 2 ( H i + λ ) ω j 2 ] + γ T
The optimal objective function is obtained as shown in Equation (8):
O b j = 1 2 j = 1 T G j 2 H j + λ + γ T

2.2.2. Ten-Fold Cross-Validation Parameter Tuning

The XGB algorithm does not rely on predefined relationships between input and output variables The specificity of hyperparameters lies in the fact that they are not learned from data but require experience and experimentation to determine the optimal values. Their importance is evident in the fact that selecting appropriate hyperparameters can significantly enhance model performance, influencing both the model’s generalization ability and training effectiveness. Hyperparameter tuning is a crucial task in machine learning and is essential for constructing effective models. Therefore, it is essential to tune these hyperparameters [45,46,47]. The hyperparameters associated with the XGB algorithm are presented in Table 2.
For hyperparameter tuning, 10-fold cross-validation is commonly employed as a statistical method to assess the model’s generalization ability [48,49]. According to the theory of 10-fold cross-validation, the training set is divided into ten parts, with nine parts used as the test data and one part as the validation data in each iteration. The results of 10-fold cross-validation are more robust compared with the randomness of data partitioning. Since each data point appears in different rounds of model selection, the final performance evaluation is more statistically meaningful. By adjusting hyperparameters in each iteration of cross-validation and selecting the best-performing ones, it is more efficient to find the parameter configuration that optimizes the model’s performance. To evaluate the XGB algorithm’s performance in predicting the unconfined compressive strength of the training set under different parameter combinations, various hyperparameters were introduced, and the 10-fold cross-validation method was applied. The optimal parameter was selected based on the performance observed during cross-validation. Figure 1 illustrates the 10-fold cross-validation method, and Table 3 presents the results of hyperparameter tuning.

3. Results of Model Predictions

3.1. Evaluation Indicators

Using the optimal hyperparameters identified through the 10-fold cross-validation, the model was applied to the test set to assess its accuracy and generalization ability. The process of predicting the unconfined compressive strength using the XGB algorithm is illustrated in Figure 2. The evaluation of the model’s prediction accuracy was based on the mean square error (MSE) and coefficient of determination (R2). MSE measures the average squared error between the model’s predicted values and the actual observed values, serving as an indicator of the model’s prediction accuracy. On the other hand, R2 quantifies the extent to which the model explains the variance of the target variable, representing the model’s ability to fit the data’s variability.

3.2. Projected Results

Figure 3a displays the results of predicting the unconfined compressive strength of permafrost. The test set exhibited a mean square error (MSE) of 0.39 and a coefficient of determination (R2) of 0.962. These results indicated that the model demonstrated high prediction accuracy and could be effectively utilized for predicting the unconfined compressive strength of permafrost. The scatter plot in Figure 3b illustrates the relationship between the experimental and predicted values. It can be observed that a significant number of experimental results fell within the range of 0~8 MPa, and the model exhibited high prediction accuracy within this range. However, the number of samples exceeding 8 MPa was noticeably limited, resulting in reduced prediction accuracy for values beyond this range. The empirical formulas used for strength prediction under various working conditions often involve significant parameter variations and high complexity. In contrast, the strength model based on the XGB algorithm offered high prediction accuracy. By inputting the relevant parameters, it enabled the estimation of the unconfined compressive strength of permafrost. Moreover, this model exhibited strong generalization capabilities and offered a simplified approach.

4. Discussion

The Pearson correlation coefficient is employed to measure the linear relationship between two variables. Numerically, a correlation coefficient closer to 1 or −1 indicates a stronger linear correlation, while proximity to 0 suggests a weaker correlation. This holds significant implications for under-standing the interdependence, trends, and mutual influences among variables.
The Pearson correlation coefficient was applied to measure the variable correlation. The principle of calculation is shown in Equation (9), and the results are shown in Figure 4. The Pearson correlation coefficient between the temperature and unconfined compressive strength of permafrost was −0.590, while the Pearson correlation coefficient between the strain rate and unconfined compressive strength was 0.370. According to the Pearson correlation coefficient correlation standards, the temperature and strength exhibited a moderately linear correlation, potentially indicating a negative linear relationship due to the negative coefficient. The strain rate and strength showed a weak correlation, suggesting a relatively weak positive correlation between the strain rate and strength, necessitating further assessment of their relationship based on predictions from the XGB model.
P e a r s o n = i = 1 n ( x i x ¯ ) ( y i y ¯ ) i = 1 n ( x i x ¯ ) 2 i = 1 n ( y i y ¯ ) 2
where xi and yi are the input and output values of the i-th sample, x ¯ and y ¯ are the input and output averages of the overall sample, respectively.
To explore the impact of temperature and strain rate on the variation in unconfined compressive strength in clay, pulverized soil, and sandy soil, a dataset was created due to the limited availability of experimental data. This dataset was based on common soil physical properties and experimental conditions that resembled actual working conditions. Additionally, the study aimed to analyze the strength differences among clay, silt, and sand after freezing. The soil particle size distribution was represented as the relative content percentage of particles, as outlined in Table 4.
A new dataset (2160 sets) was finally obtained. The XGB model was used to predict the magnitude of the unconfined compressive strength. The trends of strength with temperature and strain rate were analyzed for different permafrosts; the results for strength with temperature are shown in Figure 5, and the results for strength with strain rate are shown in Figure 6.
From Figure 5, it can be observed that the unconfined compressive strength of the three types of frozen soil exhibited a similar trend, increasing with decreasing temperature and increasing strain rate. The most rapid increase in unconfined compressive strength occurred within the S1 and S3 ranges, while the strength increase was relatively gradual during the S2 stage. In the temperature range of −2 to −4 °C (S1), the soil reached the freezing point of free water, leading to rapid freezing of the free water and consequently a significant increase in strength. In the temperature range of −4 to −16 °C (S2), the linear relationship between strength and temperature weakened. However, in the range of −16 to −20 °C (S3), the strength experienced a substantial increase due to the complete freezing of weakly bound water within the pores. When considering strain rates between 10−5 s−1 and 10−4 s−1, as well as between 10−3 s−1 and 10−2 s−1, the strength values exhibited similar levels with only minor increments. During this stage, the dominant factor influencing the strength was the strain rate, where higher strain rates led to increased soil density and higher unconfined compressive strength. When the temperature dropped below −20 °C, it further affected the freezing process of strongly bound water within the soil body. However, due to the limited range of experimental data, the influence of strongly bound water on the strength of frozen soil was not taken into consideration. Consequently, the unconfined compressive strength exhibited a substantial growth rate in the initial period, a moderate growth rate in the intermediate period, and a significant growth rate in the later period as the temperature decreased. This finding aligned with the research conducted by Du et al. [19] and Zhang et al. [50], confirming the generalizability of the model. Comparing the three different permafrost soils, pulverized soils exhibited the highest overall strength, followed by clays and sands. Fine-grained soils, such as clay and silt, contained a higher proportion of weakly bound water. As the temperature decreased, more weakly bound water froze, leading to increased soil cohesion and higher strength. However, the strongly bound water remained unfrozen but enveloped the soil particles, maintaining its ice cementation capacity. On the other hand, coarse-grained soils, represented by sandy soils, possessed a monogranular structure characterized by low densities and large interparticle pores. This weakened the ice cementation ability after the freezing of unfrozen water. Additionally, a significant amount of pore water remained unfrozen in these soils, resulting in an overall lower strength. The bonding pattern is illustrated in Figure 7.
As depicted in Figure 6, the unconfined compressive strength of three permafrost types increased with an increase in the strain rate. The strength exhibited varying degrees of sensitivity to different strain rates, demonstrating stronger sensitivity within the S2 strain rate range, while exhibiting relatively minor variations within the S1 and S3 rate intervals. At a small strain rate (S1), although there was an increase in compressive strength, the overall increment was modest, indicating low sensitivity. Conversely, when the strain rate increased from 10−4 s−1 to 10−3 s−1 (S2), the rise in the compressive strength became significant. Moreover, the lower the temperature was, the more pronounced the increase and the higher the sensitivity. Consequently, the unconfined compressive strength of permafrost was higher at elevated strain rates and lower temperatures. It is important to note that at a temperature of −20 °C, there was a tendency for the rate of strength increase to diminish when the strain rate increased from 10−3 s−1 to 10−2 s−1 (S3). This change could likely be attributed to the freezing of all free and weakly bonded water within the soil, resulting in a decrease in unfrozen water within the pore space of the densely compacted soil. As a result, the magnitude of the strength increase diminished.
Earlier studies mostly focused on the physical properties of some single permafrost, this study was conducted by collecting a wide range of different data and analyzing them using XGB. A broader understanding of the strength properties of permafrost can be obtained through big data analysis. However, due to the prevalent lack of interpretability in machine learning, it fails to intuitively illustrate underlying physical mechanisms. Future research endeavors could overcome the limitations of purely data-driven methods by integrating physical information into computational models, such as employing physical information as constraints in data processing. This integration would aid in bridging the gap between data-driven methodologies and physical insights, enhancing the interpretability of the models used in permafrost analysis.

5. Conclusions

With the increasing prevalence of frozen soil engineering, there is a growing need for a more profound understanding of the key factor of unfrozen compressive strength in frozen soils. Previous research on frozen soil strength has largely been confined to individual soil types, characterized by limited sample sizes and a lack of generalized studies on the strength properties of frozen soil. This study focused on remodeled permafrost as the research subject and employed a machine learning approach to develop a prediction model for unconfined compressive strength using the XGB algorithm. By incorporating the Pearson correlation coefficient, the most influential factors affecting the strength were identified. Additionally, a new dataset was created to investigate the relationship between strength, temperature, and strain rate.
In comparison with previous unconfined compressive strength characteristic tests on individual soil samples, this study, leveraging a large dataset, obtained results that were more representative and comprehensive. The following conclusions can be drawn from this study:
  • The XGB-based prediction model for unconfined compressive strength demonstrated a high level of accuracy in the test set, with a mean squared error (MSE) of 0.39 and a coefficient of determination (R2) of 0.962. This predictive model contributes to assessing the strength characteristics crucial for engineering design, infrastructure development, and mitigating potential hazards in permafrost regions, thereby aiding in informed decision-making and ensuring the safety and sustainability of various constructions and activities in these environments.
  • The Pearson correlation coefficient was employed to quantify the strength and direction of the relationship between variables. The Pearson correlation coefficient between the temperature and unconfined compressive strength of permafrost was −0.590. For the strain rate and unconfined compressive strength, the Pearson correlation coefficient was 0.370. The analysis indicated a moderate negative relationship between the temperature and permafrost strength and a weak positive relationship between the strain rate and permafrost strength. Utilizing the XGB model provides a more precise representation of such correlations.
  • The unconfined compressive strength of frozen silt is the highest, followed by frozen clay and frozen sand. The reason may be that the content of weak bound water in the fine-grained soil represented by clay and silt was higher, and the weak bound water froze, increasing the cohesion of the soil and obtaining higher strength. Coarse-grained soil, represented by sandy soil, presented a single-grain structure, which weakened the ice cementation ability after freezing by unfrozen water, resulting in a low overall strength.
  • Temperature exerted a substantial influence on the unconfined compressive strength of permafrost. As the temperature decreased, the strength of permafrost exhibited a noticeable upward trend. In the early stages of freezing (−2 to −4 °C), the strength of the soil experienced a rapid increase due to the freezing of free water within the soil. Subsequently, the growth tendency became relatively stable. In engineering, the adequate strength of frozen soil can be ensured by controlling temperature, thereby mitigating the risk of accidents caused by insufficient strength.
  • The loading rate applied during testing significantly impacted the unconfined compressive strength of frozen soil. A clear increasing trend was observed as the strain rate rose. The sensitivity of the compressive strength to the strain rate was relatively low within the range of 10−5~10−4 s−1, while it became more pronounced within the range of 10−4~10−2 s−1. Therefore, in the design and implementation of engineering projects, the consideration of strain rate has become one of the crucial factors in ensuring the stability of frozen soil structures. When formulating engineering plans, it is essential to control the strain rate to meet the requirements of diverse environmental and loading conditions.
Utilizing a large volume of data for permafrost property prediction and analysis allows for the identification of more generalized features. However, the data collected in this experiment remain limited. While the results of this study demonstrate good performance in common soils, they are still subject to limitations, particularly in the case of diverse soil types where the effectiveness may be suboptimal. Furthermore, this entirely data-dependent analytical approach has room for optimization. XGB, as a black-box model, exhibits relatively weak interpretability, a common challenge in machine learning. Despite the outstanding predictive performance of the XGB algorithm, enhancing model interpretability is crucial for understanding the underlying mechanisms of predicted outcomes in practical applications. Therefore, future research could focus on improving the interpretability of the XGB model, considering the incorporation of physical information into the computational model to overcome the limitations of purely data-driven methods.

Author Contributions

Conceptualization, Y.W. and H.Q.; methodology, C.W.; software, H.Q.; validation, C.Y. and C.W.; formal analysis, C.W.; investigation, C.Y.; resources, C.W.; data curation, H.Q.; writing—original draft preparation, C.W., C.Y. and H.Q.; writing—review and editing, C.Y. and H.Q.; visualization, C.Y. and H.Q.; supervision, Y.W.; project administration, C.W.; funding acquisition, Y.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Natural Science foundation of Guangdong Province of China (2022A1515011200) and the Science and Technology Planning Project of Guangdong Province of China (STKJ2021129).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data will be made available on request. The data are not publicly available due to further studies will be conducted based on the data.

Acknowledgments

The author would like to extend gratitude to the China Railway 14th Bureau Group Second Engineering Corporation Limited, the Department of Civil and Environmental Engineering at Shantou University, and the School of Civil Engineering at Southeast University for their assistance in the experimentation and data analysis involved in this research. Thanks are also extended to the anonymous reviewers and sponsors.

Conflicts of Interest

Author Chenguang Wang was employed by the company China Railway 14th Bureau Group Second Engineering Corporation Limited. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Dong, X.; Yu, S.; Zang, Y.; Wang, Y. The Causes and Prevention Measures of Water Inflow at Cross Passage Located in Sandy Gravel Layers and Constructed by Artificial Ground Freezing. Chin. J. Undergr. Space Eng. 2022, 18, 322–329. [Google Scholar]
  2. Liu, X.; Nowamooz, H.; Shen, Y.; Liu, Y.; Han, Y.; An, Y. Heat transfer analysis in artificial ground freezing for subway cross passage under seepage flow. Tunn. Undergr. Space Technol. 2023, 133, 104943. [Google Scholar] [CrossRef]
  3. Schindler, U.; Chrisopoulos, S.; Cudmani, R. Artificial ground freezing applications using an advanced elastic-viscoplastic model for frozen granular soils. Cold Reg. Sci. Technol. 2023, 215, 103964. [Google Scholar] [CrossRef]
  4. Dong, J.; Wu, X.; Shi, L.; Yu, X.Y.; He, P.F. Calculation Method and Analysis of Horizontal Frost Heave Effect of L-shaped Retaining Wall in Permafrost Regions. Rock Soil Mech. 2022, 43, 879–890+900. [Google Scholar]
  5. Deng, H.; Zhao, B.; Xiao, Y.; Tian, G. Experimental Study on Macroscopic Mechanical Characteristics and Microscopic Pore Structure Evolution of Soil–Rock Mixture under Repeated Freeze–Thaw Cycles. Appl. Sci. 2023, 13, 11504. [Google Scholar] [CrossRef]
  6. Zhang, S.; Niu, J.; Wang, Z.; Wang, J.; Dong, T. Mechanical Stability Assessment of Reinforced Embankments in Permafrost Regions Using Evolution Analysis. Chin. J. Rock Mech. Eng. 2022, 41, 1285–1295. [Google Scholar]
  7. Tan, Y.; Lu, Y.; Wang, D. Catastrophic Failure of Shanghai Metro Line 4 in July 2003: Occurrence, Emergency Response, and Disaster Relief. J. Perform. Constr. Facil. 2021, 35, 04020125. [Google Scholar] [CrossRef]
  8. Qin, H.; Wang, Y.; Wang, Y. Mechanical properties and microscopic mechanism of active MgO-fly ash solidified saline soil in seasonal freezing areas. Constr. Build. Mater. 2023, 393, 132093. [Google Scholar] [CrossRef]
  9. Wang, Y.; Wang, C.; Hu, Z.; Sun, R. The Influence of Different Curing Environments on the Mechanical Properties and Reinforcement Mechanism of Dredger Fill Stabilized with Cement and Polypropylene Fibers. Materials 2023, 16, 6827. [Google Scholar] [CrossRef]
  10. Mbengue, M.T.M.; Lawane Gana, A.; Messan, A.; Mone, O.; Pantet, A. Effect of the Type of Lateritic Soil on the Effectiveness of Geomechanical Improvement Using a Low Quantity of Cement for Sustainable Road Construction. Materials 2023, 16, 6891. [Google Scholar] [CrossRef]
  11. Jiang, Z.; Yao, Z.; Chen, J. Influence of Sand Ratio and Water Content on Uniaxial Compressive Strength of Artificial Frozen Soil. Min. Res. Dev. 2016, 36, 47–50. [Google Scholar]
  12. Wang, H.; Li, G.; Ning, F.; Gao, W.; Su, B. Unconfined Compression Test on In Situ Frozen Clay Sampled from Frozen Wellbore. Geofluids 2022, 2022, 6564345. [Google Scholar] [CrossRef]
  13. Yang, Z.; Still, B.; Ge, X. Mechanical Properties of Seasonally Frozen and Permafrost Soils at High Strain Rate. Cold Reg. Sci. Technol. 2015, 113, 12–19. [Google Scholar] [CrossRef]
  14. Bai, R.; Lai, Y.; Pei, W.; Zhang, M. Study on the frost heave behavior of the freezing unsaturated silty clay. Cold Reg. Sci. Technol. 2022, 197, 103525. [Google Scholar] [CrossRef]
  15. Wan, X.; Lai, Y.; Liao, M. Relationship between Temperature and Water Content of Sodium Saline Soils without Phase Transformation. Chin. J. Geotech. Eng. 2015, 37, 2175–2181. [Google Scholar]
  16. Hu, K.; Lai, Y. Experimental Study on the Strength Parameters and Strength Criteria of Saline Frozen Silty Sand. J. Glaciol. Geocryol. 2014, 36, 1199–1204. [Google Scholar]
  17. Ma, Q. Tensile Strength, Uniaxial Compressive Strength Test on Artificially Frozen Soils. Rock Soil Mech. 1996, 3, 76–81. (In Chinese) [Google Scholar]
  18. Li, H.; Lin, C.; Zhang, J.; Zhu, Y. Uniaxial Compressive Strength of Saturated Frozen Clay at Constant Strain Rate. Chin. J. Geotech. Eng. 2004, 01, 105–109. [Google Scholar]
  19. Du, H.; Ma, W.; Zhang, S.; Zhou, Z.; Liu, E. Strength Properties of Ice-Rich Frozen Silty Sands under Uniaxial Compression for a Wide Range of Strain Rates and Moisture Contents. Cold Reg. Sci. Technol. 2016, 123, 107–113. [Google Scholar] [CrossRef]
  20. Chen, X.; Zhang, Z.; Li, D. Effect of Size and Loading Rate on the Uniaxial Compression Characteristics of Frozen Cement Soil. Hydrogeol. Eng. Geol. 2019, 46, 74–82. [Google Scholar]
  21. Zhou, S.; Guo, Z.; Huang, G.; Liu, K. Improving the Understanding of Landslide Development in Alpine Forest Regions Using the InSAR Technique: A Case Study in Xiaojin County China. Appl. Sci. 2023, 13, 11851. [Google Scholar] [CrossRef]
  22. Bai, R.; Xu, X.; Hua, S.; Wang, J. Significant Analysis of the Influence Factors on Strength of Frozen Soil Base on Multivariable Linear Regression Model. J. Glaciol. Geocryol. 2019, 41, 416–423. [Google Scholar]
  23. Xu, X.; Wang, Y.; Yin, Z.; Zhang, H. Effect of Temperature and Strain Rate on Mechanical Characteristics and Constitutive Model of Frozen Helin Loess. Cold Reg. Sci. Technol. 2017, 136, 44–51. [Google Scholar] [CrossRef]
  24. Jiao, G.; Zhao, S.; Ma, W. Experimental Study of the Dynamic Characteristics of Warm-Frozen Silt after Freeze–Thaw Cycles Under Cyclic Loading. China Civ. Eng. J. 2010, 43, 107–113. [Google Scholar]
  25. Qiu, Y.; Zhou, J.; Khandelwal, M.; Yang, H.; Yang, P.; Li, C. Performance Evaluation of Hybrid WOA-XGBoost, GWO-XGBoost and BO-XGBoost Models to Predict Blast-Induced Ground Vibration. Eng. Comput. 2021, 38, 4145–4162. [Google Scholar] [CrossRef]
  26. Zhang, R.; Li, Y.; Goh, A.T.; Zhang, W.; Chen, Z. Analysis of Ground Surface Settlement in Anisotropic Clays Using Extreme Gradient Boosting and Random Forest Regression Models. J. Rock Mech. Geotech. Eng. 2021, 13, 1478–1484. [Google Scholar] [CrossRef]
  27. Zhang, W.; Li, H.; Wu, C.; Li, Y.; Liu, Z.; Liu, H. Soft Computing Approach for Prediction of Surface Settlement Induced by Earth Pressure Balance Shield Tunneling. Undergr. Space 2021, 6, 353–363. [Google Scholar] [CrossRef]
  28. Chala, A.T.; Ray, R.P. Machine Learning Techniques for Soil Characterization Using Cone Penetration Test Data. Appl. Sci. 2023, 13, 8286. [Google Scholar] [CrossRef]
  29. Feng, D.-C.; Wang, W.-J.; Mangalathu, S.; Taciroglu, E. Interpretable XGBoost-SHAP Machine-Learning Model for Shear Strength Prediction of Squat RC Walls. J. Struct. Eng. 2021, 147, 04021173. [Google Scholar] [CrossRef]
  30. Wang, M.-X.; Huang, D.; Wang, G.; Li, D.-Q. SS-XGBoost: A Machine Learning Framework for Predicting Newmark Sliding Displacements of Slopes. J. Geotech. Geoenviron. Eng. 2020, 146, 04020074. [Google Scholar] [CrossRef]
  31. Wang, L.; Wu, C.; Tang, L.; Zhang, W.; Lacasse, S.; Liu, H.; Gao, L. Efficient Reliability Analysis of Earth Dam Slope Stability Using Extreme Gradient Boosting Method. Acta Geotech. 2020, 15, 3135–3150. [Google Scholar] [CrossRef]
  32. Fei, W.; Yang, Z.J. Modeling Unconfined Compression Behavior of Frozen Fairbanks Silt Considering Effects of Temperature, Strain Rate and Dry Density. Cold Reg. Sci. Technol. 2019, 158, 252–263. [Google Scholar] [CrossRef]
  33. Chen, Y.; Azzam, R.; Wang, M.; Xu, S.; Chang, L. The Uniaxial Compressive and Tensile Tests of Frozen Saturated Clay in Shanghai Area. Environ. Earth Sci. 2011, 64, 29–36. [Google Scholar] [CrossRef]
  34. Zhu, Y.; Carbee, D.L. Uniaxial Compressive Strength of Frozen Silt under Constant Deformation Rates. Cold Reg. Sci. Technol. 1984, 9, 3–15. [Google Scholar] [CrossRef]
  35. Ma, D.; Ma, Y.; Huang, K.; Yuan, P.; Yao, Z. Strength of Artificially Frozen Silty Clay with Different Negative Temperatures Under Dynamic and Static Combined Loading and Deformation Characteristic Analysis. J. Vib. Shock 2022, 41, 154–160. [Google Scholar]
  36. Zang, J.; Li, H.; Lin, C.; Zhu, Y. Compressive Strength of Saturated Frozen Silt under Constant Strain Rate. Chin. J. Rock Mech. Eng. 2003, S2, 2865–2870. [Google Scholar]
  37. Zhang, Y.; Yang, P.; Jiang, W.; Zhu, Y. Effect of Water Content and Strain Rate on the Strength Characteristics of Frozen Silty Clay. J. Zhengzhou Univ. (Eng. Ed.) 2020, 41, 79–84. [Google Scholar]
  38. Chen, Y.; Wang, M.; Xu, S.; Chang, L.; Yin, Z. Tensile and Compressive Strength Tests on Artificial Frozen Soft Clay in Shanghai. Chin. J. Geotech. Eng. 2009, 31, 1046–1051. [Google Scholar]
  39. Chen, Y.; Chang, L.; Xu, S.; Luo, R. Experimental Study on Uniaxial Compressive Strength of Shanghai’s Frozen Soils. J. Shanghai Univ. (Nat. Sci.) 2009, 15, 310–315. [Google Scholar]
  40. Zhao, L.; Chen, Y.; Wang, Y. Modeling diameter of jet grouting columns using Bayesian framework. Int. J. Numer. Anal. Methods Geomech. 2023, 47, 3158–3176. [Google Scholar] [CrossRef]
  41. Friedman, J.H. Greedy Function Approximation: A Gradient Boosting Machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
  42. Zhang, W.; Li, H.; Li, Y.; Liu, H.; Chen, Y.; Ding, X. Application of deep learning algorithms in geotechnical engineering: A short critical review. Artif. Intell. Rev. 2021, 54, 5633–5673. [Google Scholar] [CrossRef]
  43. Zhou, J.; Qiu, Y.; Armaghani, D.J.; Zhang, W.; Li, C.; Zhu, S.; Tarinejad, R. Predicting TBM penetration rate in hard rock condition: A comparative study among six XGB-based metaheuristic techniques. Geosci. Front. 2021, 12, 101091. [Google Scholar] [CrossRef]
  44. Esmaeili-falak, M.; Benemaran, R. Ensemble deep learning-based models to predict the resilient modulus of modified base materials subjected to wet-dry cycles. Geomech. Eng. 2023, 32, 583–600. [Google Scholar]
  45. Hoque, K.E.; Aljamaan, H. Impact of Hyperparameter Tuning on Machine Learning Models in Stock Price Forecasting. IEEE Access 2021, 9, 163815–163830. [Google Scholar] [CrossRef]
  46. Probst, P.; Boulesteix, A.L.; Bischl, B. Tunability: Importance of Hyperparameters of Machine Learning Algorithms. J. Mach. Learn. Res. 2019, 20, 1934–1965. [Google Scholar]
  47. Maldonado-Cruz, E.; Pyrcz, M.J. Tuning Machine Learning Dropout for Subsurface Uncertainty Model Accuracy. J. Petrol. Sci. Eng. 2021, 205, 108975. [Google Scholar] [CrossRef]
  48. Vu, H.L.; Ng, K.T.W.; Richter, A.; An, C. Analysis of Input Set Characteristics and Variances on K-Fold Cross Validation for a Recurrent Neural Network Model on Waste Disposal Rate Estimation. J. Environ. Manag. 2022, 311, 114869. [Google Scholar] [CrossRef]
  49. Kokkinos, Y.; Margaritis, K.G. Managing the Computational Cost of Model Selection and Cross-Validation in Extreme Learning Machines Via Cholesky, SVD, QR and Eigen Decompositions. Neurocomputing 2018, 295, 29–45. [Google Scholar] [CrossRef]
  50. Zhang, M.; Zhang, X.; Lu, J.; Pei, W.; Wang, C. Analysis of Volumetric Unfrozen Water Contents in Freezing Soils. Exp. Heat Transf. 2019, 32, 426–438. [Google Scholar] [CrossRef]
Figure 1. Ten-fold cross-validation.
Figure 1. Ten-fold cross-validation.
Applsci 13 13170 g001
Figure 2. XGB model workflow diagram.
Figure 2. XGB model workflow diagram.
Applsci 13 13170 g002
Figure 3. Model prediction results: (a) Comparison of experimental and predicted values for the test set. (b) Scatter plot of experimental and predicted values.
Figure 3. Model prediction results: (a) Comparison of experimental and predicted values for the test set. (b) Scatter plot of experimental and predicted values.
Applsci 13 13170 g003
Figure 4. Heat map of correlation.
Figure 4. Heat map of correlation.
Applsci 13 13170 g004
Figure 5. Trend of UCS with temperature: (a) Clay. (b) Silty soil. (c) Sandy soil.
Figure 5. Trend of UCS with temperature: (a) Clay. (b) Silty soil. (c) Sandy soil.
Applsci 13 13170 g005
Figure 6. Trend of UCS with strain rate: (a) Clay. (b) Silty soil. (c) Sandy soil.
Figure 6. Trend of UCS with strain rate: (a) Clay. (b) Silty soil. (c) Sandy soil.
Applsci 13 13170 g006
Figure 7. Distribution of unfrozen water in different soils: (a) Fine-grained soils. (b) Coarse-grained soil.
Figure 7. Distribution of unfrozen water in different soils: (a) Fine-grained soils. (b) Coarse-grained soil.
Applsci 13 13170 g007
Table 1. Data distribution for the XGB model.
Table 1. Data distribution for the XGB model.
Input VariableMinimum ValueMaximum ValueAverage Value
Moisture content (%)10.4851.229.98
Dry density (g·cm−3)1.022.081.49
Temperature (°C)−0.5−25−7.15
Strain rate (s−1)1.05 × 10−70.07-
Sand content (%)090-
Powder content (%)594.66-
Sticky grain content (%)3.4389.5-
Table 2. Hyperparameters of XGB algorithm.
Table 2. Hyperparameters of XGB algorithm.
HyperparameterizationMeaning of ParameterParameter Tuning Range
learning_rateLearning rate0.01, 0.05, 0.07, 0.1, 0.2
n_estimatorsOptimal number of iterations[100,1000]
max_depthMaximum depth of the tree3, 4, 5, 6, 7, 8, 9, 10
min_child_weightThe minimum sum of weights of the child nodes generated from the sample1, 2, 3, 4, 5, 6
subsampleSample proportion for each tree0.5, 0.6, 0.7, 0.8, 0.9, 1
colsample_bytreeThe ratio of features randomly selected for each tree0.5, 0.6, 0.7, 0.8, 0.9, 1
gammaThe reduction ratio of the loss function0.1, 0.2, 0.3, 0.4, 0.5, 0.6
reg_alphaWeights for L1 regularization0.05, 0.1, 1, 2, 3
reg_lambdaWeights for L2 regularization0.05, 0.1, 1, 2, 3
Table 3. Hyperparameter tuning results.
Table 3. Hyperparameter tuning results.
HyperparameterizationOptimal Results
learning_rate0.1
n_estimators225
max_depth10
min_child_weight4
Subsample0.5
colsample_bytree0.9
Gamma0.3
reg_alpha0.02
reg_lambda0.9
Table 4. Values of different variables in the new dataset.
Table 4. Values of different variables in the new dataset.
Input VariableMinimum ValueMaximum ValueValue Range
Moisture content (%)10.4851.210, 20, 30, 40, 50
Dry density (g·cm−3)1.022.081.1, 1.3, 1.5, 1.7, 1.9
Temperature (°C)−0.5−25−2, −4, −8, 12, −16, −20
Strain rate (s−1)1.05 × 10−70.0710−5, 10−4, 10−3, 10−2
Sand (%)--Sand particles: 90%, silt particles: 5%, clay particles: 5%
Silt (%)--Sand particles: 5%, silt particles: 87%, clay particles: 8%
Clay (%)--Sand particles: 5%, silt particles: 5%, clay particles: 90%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, C.; Yang, C.; Qin, H.; Wang, Y. Examination of Determinants and Predictive Modeling of Artificially Frozen Soil Strength Utilizing the XGBoost Algorithm. Appl. Sci. 2023, 13, 13170. https://doi.org/10.3390/app132413170

AMA Style

Wang C, Yang C, Qin H, Wang Y. Examination of Determinants and Predictive Modeling of Artificially Frozen Soil Strength Utilizing the XGBoost Algorithm. Applied Sciences. 2023; 13(24):13170. https://doi.org/10.3390/app132413170

Chicago/Turabian Style

Wang, Chenguang, Chaoyue Yang, Haoran Qin, and Yanning Wang. 2023. "Examination of Determinants and Predictive Modeling of Artificially Frozen Soil Strength Utilizing the XGBoost Algorithm" Applied Sciences 13, no. 24: 13170. https://doi.org/10.3390/app132413170

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop