Novel Insights in Soil Mechanics: Integrating Experimental Investigation with Machine Learning for Unconfined Compression Parameter Prediction of Expansive Soil

Alnmr, Ammar; Hosamo, Haidar Hosamo; Lyu, Chuangxin; Ray, Richard Paul; Alzawi, Mounzer Omran

doi:10.3390/app14114819

Open AccessArticle

Novel Insights in Soil Mechanics: Integrating Experimental Investigation with Machine Learning for Unconfined Compression Parameter Prediction of Expansive Soil

by

Ammar Alnmr

^1,*

,

Haidar Hosamo Hosamo

²

,

Chuangxin Lyu

²,

Richard Paul Ray

¹

and

Mounzer Omran Alzawi

³

¹

Department of Structural and Geotechnical Engineering, Széchenyi István University, 9026 Győr, Hungary

²

Department of Built Environment, OsloMet—Oslo Metroplitan University, 0176 Oslo, Norway

³

Department of Geotechnical Engineering, Tishreen University, Latakia P.O. Box 2230, Syria

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2024, 14(11), 4819; https://doi.org/10.3390/app14114819

Submission received: 30 April 2024 / Revised: 30 May 2024 / Accepted: 31 May 2024 / Published: 2 June 2024

(This article belongs to the Special Issue The Application of Machine Learning in Geotechnical Engineering, 2nd Edition)

Download

Browse Figures

Versions Notes

Abstract

:

This paper presents a novel application of machine learning models to clarify the intricate behaviors of expansive soils, focusing on the impact of sand content, saturation level, and dry density. Departing from conventional methods, this research utilizes a data-centric approach, employing a suite of sophisticated machine learning models to predict soil properties with remarkable precision. The inclusion of a 30% sand mixture is identified as a critical threshold for optimizing soil strength and stiffness, a finding that underscores the transformative potential of sand amendment in soil engineering. In a significant advancement, the study benchmarks the predictive power of several models including extreme gradient boosting (XGBoost), gradient boosting regression (GBR), random forest regression (RFR), decision tree regression (DTR), support vector regression (SVR), symbolic regression (SR), and artificial neural networks (ANNs and proposed ANN-GMDH). Symbolic regression equations have been developed to predict the elasticity modulus and unconfined compressive strength of the investigated expansive soil. Despite the complex behaviors of expansive soil, the trained models allow for optimally predicting the values of unconfined compressive parameters. As a result, this paper provides for the first time a reliable and simply applicable approach for estimating the unconfined compressive parameters of expansive soils. The proposed ANN-GMDH model emerges as the pre-eminent model, demonstrating exceptional accuracy with the best metrics. These results not only highlight the ANN’s superior performance but also mark this study as a groundbreaking endeavor in the application of machine learning to soil behavior prediction, setting a new benchmark in the field.

Keywords:

expansive soils; sand content; machine learning innovation; predictive modeling; soil property optimization; artificial neural networks

1. Introduction

Understanding unconfined compression parameters (

q_{u}

and

E_{t}

) is critical in geotechnical engineering. These parameters aid in determining soil cohesion (c), which is required for calculating cohesive soil bearing capacity. Furthermore, the elasticity modulus (

E_{t}

) is important in predicting settlements. To perform the unconfined compression test, a soil sample must be collected from the field, which requires advanced skills and extensive experience. However, performing these tests in a laboratory is known to be both tedious and time-consuming [1]. To overcome these challenges, researchers investigated the relationships between unconfined compression parameters and physical soil properties. This correlation is useful for reducing the number of tests and the overall investigation costs. Several studies have been conducted over the last decade to establish relationships between soil’s mechanical and physical properties. This approach, aimed at saving time and money during investigations, is based on previous research in soil mechanics and foundation engineering [1,2]. Preliminary investigations and studies for any project can be conducted using predictive models and correlations in conjunction with relevant information, making the process of obtaining design parameters simpler. Machine learning beats traditional lab tests and basic regression models in unconfined compression testing because of its automation, speed, and cost-effectiveness. Machine learning reduces the need for tedious manual testing by rapidly analyzing soil data and predicting compression parameters, leading to faster results and reduced expenses.

Expansive soils, which cover large areas of land worldwide, present a significant challenge in the construction of structures and roads on them due to the high cost associated with repairing damage caused by expansive soil shrink–swell cycles, including building and road cracks and collapse, costing billions of dollars annually [3,4,5,6]. Nelson and Miller [7] assert that expansive soils may result in greater financial losses than earthquakes or floods. In addressing this challenge, the behavior of expansive soils has been studied in numerous studies by adding different materials to improve their characteristics [8,9,10,11,12,13,14,15]. Although these studies have shown that soil characteristics can be improved via the addition of materials, many unresolved questions remain. Therefore, the main purpose of this study is to investigate the effects of sand content on the unconfined compression test characteristics of expansive soil. This study evaluates the impact of varied percentages of sand, initial saturation levels, and dry unit weights on expansive soil behavior. Additionally, it aims to build a dataset for predicting unconfined compression test parameters.

1.1. Sand as an Additive Material and Its Impact on Soil Parameters

Sand is a granular material of natural origin with high strength and is often utilized as a filler material in cohesive soil mixtures to alter their plasticity, compaction, and strength properties [9,13,16,17,18]. The sand fraction significantly influences the cracking behavior of expansive soil. As the sand content increases, it can moderate the volume fluctuations caused by soil swelling and shrinking, minimizing the amount of cracking. However, an excessive sand component may modify the soil’s mechanical properties, thereby influencing its overall behavior. Balancing the sand fraction is critical in engineering applications because it improves soil performance and reduces the negative impacts of cracking in expansive soil [19]. The addition of sand to expansive soils has been extensively investigated by many researchers to determine its effect on the soil parameters [10,20,21,22,23,24,25,26]. Previous studies have confirmed that increasing the percentage of sand increases the unconfined compressive strength (UCS) of expansive soils. Khan et al. [27] performed unconfined compressive experiments on expansive soils. They found that the compressive strength increases with increasing dry density. Increased water content, on the other hand, shows the opposite trend. It is worth noting, however, that their study only looked at two levels of added sand (20% and 40%). Nagaraj [10] performed unconfined compression tests to investigate the impact of sand on the unconfined compressive strength (UCS) of expansive soil, using a static compactor to prepare samples at moisture levels determined by the proctor test only. Their findings showed that, up to a certain point, the unconfined compression strength increased with the percentage of sand. But as the sand percentage increased further, the strength decreased. Mallikarjuna Rao and Subba Rao [28] also investigated the effect of sand on the unconfined compressive strength (UCS) of expansive soil. Nonetheless, their research was limited to samples prepared with the maximum dry density and optimum moisture content with only three percentages of added sand (25%, 40%, and 70%). Sometimes, disparities in results emerged across studies, as can be seen in Nagaraj [10] versus Mallikarjuna Rao and Subba Rao [28]. The primary cause of this variation was significant differences in the percentages of added sand used in each study. Nagaraj [10], for example, used percentages ranging from 10% to 80%, whereas Mallikarjuna Rao and Subba Rao [28] used 25%, 40%, and 70%. Furthermore, the existing sand content in the expansive soil was not taken into consideration in some studies. It is best to remove all pre-existing sand content from the expansive soil to ensure consistency before conducting experiments and adding different percentages of sand.

It is worth noting that saturated soil has all its voids entirely filled with water. The soil is referred to as partially saturated or unsaturated soil when the voids are partially filled with water. The Soil–Water Characteristic Curve (SWCC) is a fundamental relationship between the content of water and soil suction [29] that effectively describes the behavior of unsaturated soils. The retention behavior of soil water can be influenced by various factors, including dry unit weight, degree of saturation, stress conditions, and soil amendments [26,30,31]. The degree of saturation, moisture content, and volumetric water content are terms commonly used to describe the amount of water in the soil. As moisture content in the soil changes from full saturation to drought, the wet area of contact between grains or agglomerates decreases as suction stresses in the soil increase. Put differently, there is a correlation between the SWCC and the shear strength of unsaturated soils [29,32,33]. However, obtaining this curve requires special equipment, effort, and time, which can lead to inconsistent results. As a simpler and more precise alternative, this study focuses on the degree of saturation to predict the elasticity modulus and strength of expansive soil, without considering the characteristic curve.

Prior experimental investigations into the impact of sand on unconfined compression tests were constrained by a lack of comprehensive assessments that considered all factors at the same time. Sand percentage, initial dry density, and initial degree of saturation were frequently studied separately. Some studies varied sand proportions, whereas others focused solely on the Proctor test’s maximum dry density and optimal moisture content. This research seeks to overcome these constraints to gain a more comprehensive understanding of unconfined compression experiments and the potential for accurate predictive modeling.

1.2. Artificial Intelligence as a Predictive Tool in Geotechnical Engineering

The flexible and precise data-driven approach of artificial intelligence (AI) has gained increasing attention as computer technology advances. Many scholars have tried to integrate machine learning (ML) and geotechnical reliability analysis because of the complexity of expansive soils to improve computational precision and efficacy. This endeavor has resulted in a number of successful applications [34,35,36,37,38]. Within geotechnical reliability research, the major goal of machine learning (ML) is to recreate complicated, high-dimensional implicit performance functions by exploiting insights from carefully selected data. These datasets are primarily made up of input factors such as plastic index, liquid limit, degree of saturation, sand content, and unit weight. Similarly, relevant parameters of interest, such as unconfined compressive strength and elasticity modulus, are included in the datasets. Using this methodology to implement the ML analysis model provides a powerful method for accurately predicting outcomes in the geotechnical sector. Via training and validation processes, the model’s accuracy and computational efficiency are validated, confirming its accuracy and computational efficiency in meeting particular performance requirements.

Effective applications of various machine learning (ML) models have been found in geotechnical analysis. These models cover a broad range, including support vector machines (SVMs) [39], multivariate adaptive regression splines (MARS) [40,41], relevant vector machines (RVMs) [42], decision tree regression (DTR) [43], gradient boosting regression (GBR) [43], K nearest neighbor regression (KNR) [43], particle swarm optimization (PSO) [44], random forest regression (RFR) [43], extreme gradient boosting (XGBoost) [38,45,46,47], extreme learning machines (ELMs) [40], symbolic regression (SR) [48,49], and artificial neural networks (ANNs) [50,51,52,53,54]. Table 1 summarizes several studies that employed machine learning in geotechnical engineering.

The symbolic regression (SR) algorithm is used effectively in many areas of civil engineering. It adeptly models the Colebrook equation governing hydraulic flow friction in hydraulics, significantly improving accuracy in turbulent scenarios [58]. Symbolic regression is used in structural engineering to estimate seismic peak drift ratio, penetration depth into concrete blocks, shear capacity of concrete beams reinforced with steel fibers, fire response of concrete structures, seismic fragility analysis, remaining fatigue life, and shear resistance in bolted connections [59,60,61,62,63,64]. Notably, in some cases [59,63,64], symbolic regression equations outperformed traditional formulas.

Geotechnical engineers are becoming interested in ANNs because of their exceptional capacity to represent nonlinear problems with many different parameters. ANNs can create non-linear relationships between variables and make relatively accurate predictions [65]. Several geotechnical investigations have demonstrated that ANN models outperform multiple linear regression (MLR) models [51,65,66,67].

Table 2 summarizes several studies that used machine learning to predict the behavior of expansive soils, including some of the most used regression algorithms such as GBR, SVR, and DT. According to Table 1 and Table 2, investigating the best behavior prediction remains a difficult task because there is no general agreement on the best algorithm for predicting the behavior of expansive soil. The following machine learning models that have demonstrated the best performance according to Table 2 will be used in this paper: artificial neural networks (ANNs and the proposed ANN-GMDH), extreme gradient boosting (XGBoost), gradient boosting regression (GBR), random forest regression (RFR), decision tree regression (DTR), and support vector regression (SVR). The outcomes of those models will be compared, and the most accurate one will be used for prediction. Additionally, symbolic regression (SR) will be used to formulate the prediction equations for

E_{t}

and

q_{u}

.

1.3. Research Scope

A series of tests on expansive soils, including the Atterberg limits test, unconfined compressive test, and Proctor test, were carried out in this study, resulting in a dataset of 225 records with 5 input features and 2 labels. To develop individual predictive models for the unconfined compressive parameters, eight ML approaches were considered, including three classical ones (the proposed ANN-GMDH, ANN, and SVR), four ensemble ones (DTR, RFR, GBR, and XGBoost), and symbolic modeling (SR). The proposed ANN-GMDH is intended to improve the ANN’s prediction accuracy by incorporating new relationships between input features. The suggested ML models’ performances were carefully tested and compared.

This pioneering research marks a significant advance in the field of soil mechanics. To date, no research effort has systematically applied a comprehensive suite of advanced machine learning models to analyze both

q_{u}

and

E_{t}

properties of expansive soil while accounting for the influence of sand. Our work fills this crucial gap by integrating advanced machine learning techniques with experimental data on expansive soil properties. The study’s innovative approach provides a unique combination of classical and ensemble algorithms to analyze and predict the unconfined compression parameters of expansive soils with varying sand content, saturation levels, and dry densities.

By tapping into the vast potential of these diverse models, the research provides a nuanced understanding of how different variables influence soil behavior, a task that traditional methods have found challenging.

The study’s novelty stems from the direct application of sophisticated machine learning techniques to experimental data, resulting in remarkably precise predictions of previously unattainable soil parameters. Unlike traditional methods, which rely on theoretical assumptions or simplified models, this novel methodology uses advanced methods to analyze raw experimental data. Through this, the study achieves remarkably precise predictions of soil parameters, outperforming previous methods. One important feature of this novel approach is its consideration of the percentage of sand in the soil, which has frequently been overlooked in previous research efforts. By incorporating this critical factor into the analysis, the predictive models developed in this study provide a more complete picture of soil behavior. The inclusion of sand percentage allows for more accurate predictions, filling a significant gap in the literature and increasing the findings’ applicability to real-world scenarios. Furthermore, this study presents predictive models and equations designed to estimate the elasticity modulus (

E_{t}

) of expansive soil. Unlike previous studies, which may have focused solely on parameters such as unconfined compressive strength (

q_{u}

) or investigated different soil types, our study fills a gap by developing models specifically for expansive soils. Considering the unique properties and behaviors of expansive soils, which require tailored predictive models for precise assessment of elastic modulus and unconfined compressive strength, this study aims to contribute to the field of geotechnical engineering by addressing identified gaps in the literature and proposing innovative methodologies. The use of advanced machine learning techniques, combined with the consideration of previously overlooked factors such as sand percentage and the creation of specialized models for expansive soils, improves prediction accuracy and reliability. Finally, these advancements improve our understanding of soil behavior and have practical engineering applications, laying the groundwork for future field study.

2. Materials and Methods

2.1. Methodology Overview

Figure 1 depicts the overall workflow, which includes laboratory and machine learning predictions for unconfined compressive parameters (

q_{u}

and

E_{t}

). The process followed standard machine learning procedures, beginning with the collection of experimental data from 225 unconfined compression test samples in the laboratory. Two cross-validation techniques were used: the Train/Validation/Test Split and (k-fold) methods. For the (k-fold) technique, data were randomly divided into 80% for model training and 20% for testing, with k = 5. Meanwhile, the Train/Validation/Test Split approach randomly divided the data into three sets: 70% for model training, 10% for validation, and 20% for testing. Preprocessing entailed dimensionless normalization of input to speed up model convergence. Following model training, the mean absolute error (MAE), root-mean-square error (RMSE), and coefficient of determination (R²) were used to assess machine learning model performance.

2.2. Laboratory Tests

The materials utilized in this study included expansive clayey soil sourced from Demsarkhu, Lattakia, Syria and fine sea sand. Figure 2 displays the case study location. For further details on the characteristics of these materials, refer to Alnmr and Ray [13]. The sand and clay were mixed in varying proportions, ranging from 10% to 50% relative to the dry weight. All the laboratory tests were conducted according to ASTM. Granular gradient curves were plotted and presented in Figure 3. To ensure that the materials were free of impurities, the clay and sand samples were cleaned using sieve N200. The chemical composition of the expansive clay was determined and is presented in Table 3.

According to the Unified Soil Classification System [77] and the AASHTO classification system [78], the expansive clay employed in this investigation was classified as CH and A-7-5, respectively.

Prior research [79,80] established a relationship between the Atterberg limits plot position on a plasticity chart and the soil’s clay mineralogy. Plotting the liquid limit (LL) and plastic index (PI) for each percentage of sand (Fs) resulted in a plasticity chart (Figure 4) with ranges for kaolinite, montmorillonite, illite, and chlorite. The results indicated that the samples in this study with varying amounts of added sand are composed of illite.

The standard proctor test was used to determine the optimal moisture content and maximum dry density of the sand–expansive clay mixtures (Figure 5). Refer to Alnmr and Ray [13] for detailed information on Proctor, Atterberg, and consolidation tests.

In the current study, unconfined compression tests (UCTs) were performed on cylindrical samples that were twice their diameter in length. The tests were conducted at various densities and saturations for each sand percentage added, with a deformation rate of 2 mm/min.

Figure 6 depicts the step-by-step procedure used in the modified sample preparation at the laboratory. Initially, sand and clay soils were thoroughly mixed to create a homogeneous mixture (A). Following that, the blend was sealed in a plastic bag and left isolated for 24 h to ensure consistent moisture distribution (B). After that, the mixture was put into an unconfined compression test mold. The specimen was compressed to the required unit weight using a hydraulic jack and static pressure (C). After being prepared, the specimen was placed in the unconfined compression apparatus (D). The apparatus applied a vertical load until the specimen failed (E). Figure 7 shows the relationship between strain and vertical stress to clarify how

E_{t}

and

q_{u}

are defined.

A dataset comprising 225 unconfined compression tests was compiled from experimental outcomes for utilization in machine learning models. The input data consist of specific gravity (G), plastic index (PI), liquid limit (LL), dry unit weight (

γ_{d}

), sand content (Fs), degree of saturation (SR), optimum degree of saturation (SR_opt), maximum dry unit weight (

γ_{d m a x}

), and void ratio (

e_{0}

). The dataset is used to predict the elasticity modulus (

E_{t}

) and unconfined compressive strength (

q_{u}

). Density plots and distribution histograms of the dataset are presented in Figure 8.

2.3. Selection of Variables

Correlation analysis plays a vital role in exploring the connections among different inputs in datasets. However, given the diverse nature of datasets, the assumptions associated with each correlation method often vary, influencing their suitability and reliability.

The primary criterion for selection involves eliminating variables that are challenging to obtain and opting for readily available data. In this study, Spearman, Kendall, and Pearson correlation coefficients are calculated among different parameters to examine the correlation between input and output parameters, as illustrated in Figure 9. The strength of the correlation between factors is depicted via numerical values ranging from 0 to 1, where a value of 1 indicates a robust positive correlation, while a value of −1 signifies a strong negative correlation. When comparing the three methods (Spearman, Kendall, and Pearson) across different datasets with different inputs, the variables SR,

γ_{d}

, and

e_{0}

consistently show the highest correlation coefficients. Regardless of the dataset’s composition or the underlying presumptions of each correlation technique, this steady trend points to a solid and robust relationship between SR,

γ_{d}

, and

e_{0}

and

E_{t}

and

q_{u}

. Consequently, LL, Fs, and these three parameters (SR,

γ_{d}

, and

e_{0}

) were selected as inputs. The high correlation coefficients observed for degree of saturation (SR), dry unit weight, and void ratio (

e_{0}

), with unconfined compressive strength and the elasticity modulus, can be attributed to their fundamental roles in soil mechanics. The degree of saturation (SR) reflects the soil’s moisture content relative to its maximum water-holding capacity, directly impacting soil strength and stiffness via its influence on pore water pressure. Dry unit weight and void ratio directly influence soil density, porosity, compressibility, and shear strength, explaining their strong correlations with unconfined compressive strength and elasticity modulus. Conversely, the lower correlation coefficients observed for sand content (Fs) and liquid limit (LL) suggest their relatively weaker direct influence on soil mechanics. Fs primarily reflects the composition of soil particles, which may affect soil strength and stiffness to a lesser extent than density and porosity represented by dry unit weight and void ratio. LL, indicating soil plasticity, may indirectly affect mechanical properties via its influence on soil structure and behavior under different moisture conditions. However, despite achieving low correlation coefficients, LL and Fs were selected as inputs due to their importance in soil classification and the ease of determination via simple tests. This decision aids machine learning models in comprehending the complex relationships within multidimensional data, ensuring a more comprehensive analysis of soil behavior. The statistical characteristics of the chosen variables are presented in Table 4.

2.4. Machine Learning Algorithms Used in the Study

In this section, a comprehensive comparison is conducted among the algorithms employed in this study, including decision tree regression (DTR), random forest regression (RFR), gradient boosting regression (GBR), support vector regression (SVR), and artificial neural networks (ANNs). These algorithms were chosen for their common use in previous studies.

The dataset’s complexity influenced the selection of machine learning (ML) models. ML models that are adept at handling such complexities may be necessary because traditional models may not be able to capture these subtleties. Aiming primarily for higher predictive accuracy for variables such as

q_{u}

and

E_{t}

in expansive soils, machine learning models were chosen due to their capacity to efficiently navigate intricate relationships.

This study employs a variety of machine learning models, including extreme gradient boosting (XGBoost), gradient boosting regression (GBR), artificial neural networks (ANN-GMDH, and ANNs), support vector regression (SVR), decision tree regression (DTR), random forest regression (RFR), and symbolic regression (SR) in the Python software 3.11 environment. These models were chosen based on their widespread use in previous studies. For more information on the machine learning models used in this study, including their most important hyperparameters, see [81].

When evaluating efficacy of the ML models, the first step is to analyze their performance on the training data. Following this, to thoroughly evaluate the models’ ability to generalize, two cross-validation techniques are utilized. This entails testing the models on datasets distinct from those utilized during training. In this study, each predictive model was developed using a total of 225 datasets. Among these, 80% were assigned to the training phase, while the remaining 20% were set aside for critical testing purposes.

The quality of the predictive models were evaluated using statistical indices, including MAE (mean absolute error), RMSE (root-mean-square error), and R² (coefficient of determination), which are defined in Equations (1)–(3), respectively [82,83,84].

R M S E = \sqrt{\frac{\sum_{i = 1}^{n} {(y_{i}^{'} - y_{i})}^{2}}{n}}

(1)

M A E = \frac{1}{n} \sum_{i = 1}^{n} |(y_{i}^{'} - y_{i})|

(2)

R^{2} = {(\frac{\sum_{i = 1}^{n} (y_{i} - \bar{y}) (y_{i}^{'} - \bar{y^{'}})}{(\sqrt{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}) (\sqrt{\sum_{i = 1}^{n} {(y_{i}^{'} - \bar{y^{'}})}^{2}})})}^{2}

(3)

where “

y

” denotes the actual values, “

y^{'}

” stands for the predicted values, “

\bar{y}

” represents the mean of the set “y”, and “n” signifies the total number of data points.

2.4.1. Decision Tree Regression

Decision tree regression (DTR) is a structured hierarchical data representation characterized by variable branches and nodes. In this framework, certain nodes extend outward with connecting lines, while others, termed ‘leaves’, do not. The data points intended for regression or categorization are systematically divided into distinct categories by utilizing specific internal nodes. In the training phase, the algorithm compares the values of input variables to designated functions. Strategically, the algorithm seeks to construct the most effective decision trees by iteratively minimizing the fitness function. The dataset is partitioned at multiple points within each set of independent variables. At these points, the algorithm calculates prediction errors, reflecting differences between expected and actual values based on the fitness function. Identifying the optimal split point involves selecting the variable that yields the smallest fitness function value, considering split point errors across all variables [85].

2.4.2. Random Forest Regression

The random forest regression (RFR) algorithm, which relies on ensemble learning, is extensively applied in both academic and industrial settings for tasks involving classification and regression due to its efficacy. By employing bootstrap aggregation, it coordinates an ensemble of varied, randomly formed, and unpruned decision trees. An RFR is built by assembling a collection of decision trees that have been systematically altered. Random feature selection emerges as a critical component of the training process. The careful selection of different attributes is required to ensure a rich diversity of decision trees. As a result, each decision tree contributes a unique evaluative perspective to the collective forest, culminating in the outcome being determined by the aggregated votes of these trees. The random forest approach is explained in detail in earlier research [73,86].

2.4.3. Gradient Boosting Regression

Gradient boosting regression (GBR) refers to an ensemble-trained supervised machine learning model. Boosting is a technique that involves combining many simple models into a unified composite model. Boosting is frequently described as an additive model because basic models are sequentially added while the trees of the model remain unchanged. Combining more fundamental models results in more accurate predictions. Gradient boosting aims to reduce losses via gradient descent, a first-order iterative optimization procedure. Employing a squared error loss function, decision trees act as the weak learners within the framework of gradient boosting. GBR proceeds by training a weak model to map features to the predicted residuals of the weak model. These residuals are incorporated into the input of the current model, guiding it toward the intended outcome. When this process is repeated on a regular basis, the model’s overall predicted accuracy steadily improves [87,88].

2.4.4. Extreme Gradient Boosting

Extreme gradient boosting (XGBoost), a powerful implementation of gradient boosting machines (GBM), is regarded as one of the best performers in the field of supervised learning. Its capability spans across both regression and classification problems, making it a versatile tool in data analysis. Data scientists frequently favor XGBoost because of its faster execution speed, particularly when dealing with out-of-core computations. The performance of the algorithm is driven by a combination of advanced processes such as gradient boosting, tree pruning, and regularization. These mechanisms collaborate to improve its prediction powers. Notably, XGBoost is adaptable in dealing with missing data instances and provides regularization procedures, validating its appropriateness for predicting applications. To completely unlock the algorithm’s latent potential, hyperparameters must be carefully fine-tuned. This tuning method entails tweaking crucial parameters such as learning rate and maximum depth to achieve maximum efficiency [89].

2.4.5. Support Vector Regression

Support vector machine (SVM) finds extensive application across diverse prediction scenarios, encompassing tasks related to classification and regression. In the domain of regression, it is commonly referred to as support vector regression (SVR) [90]. SVR operates by transforming the model’s inputs into a higher-dimensional space, a fundamental step in its training process based on the principles of structural risk minimization (SRM) [91]. Facilitating this transformation is the kernel function, tasked with mapping inputs into this expanded space to enable complex processing.

2.4.6. Artificial Neural Networks

Artificial neural networks (ANNs) represent a computational architecture comprising artificial neurons, engineered to mimic the information processing mechanisms observed in the human brain, thereby enabling knowledge acquisition. These networks construct intricate input–output models capable of identifying intricate correlations within multidimensional datasets. Their ubiquitous use is found in a variety of engineering specialties [92,93]. ANNs support a wide range of network architectures, including single and multiple layer networks. One instance is the feedforward back propagation neural network (FFBP), consisting of an input layer for receiving data, an intermediate hidden layer, and an output layer that provides outcome-specific insights in reaction to input data.

This paper proposes a comprehensive integration between the GMDH features and the ANN network model. The group method of data handling (GMDH) algorithm finds applications in diverse research domains. For instance, the authors of [94,95] employed GMDH for pore pressure analysis and permeability modeling, resulting in accurate permeability predictions. Additionally, GMDH has been utilized in rock deformation prediction, as demonstrated by [96]. In GMDH, the Kolmogorov–Gabor polynomial, referenced in [97], serves as a mathematical framework for understanding the relationship between input and output variables. For instance, when predicting

E_{t}

using input features like LL, Fs, and SR, the mathematical Equation (4) is formulated as follows:

E_{t} = a_{0} L L + a_{1} F s + a_{2} S R + a_{3} L L^{2} + a_{4} F s^{2} + a_{5} S R^{2} + a_{6} (L L \times F S) + a_{7} (L L \times S R) + a_{8} (F s \times S R)

(4)

where LL, Fs, and SR denote the input features representing liquid limit, sand percentage, and degree of saturation, respectively;

a_{0}

,

a_{1}

,

a_{2}

,…,

a_{8}

are the coefficients; and

E_{t}

symbolizes the output, specifically the elasticity modulus.

In the conventional ANN, five features (SR, Fs,

e_{0}

, LL, and

γ_{d}

) are utilized. In contrast, the proposed GMDH incorporates 20 features derived from the original five features to capture interaction effects. These features include both individual and interaction terms: SR, Fs,

e_{0}

, LL,

γ_{d}

, SR², Fs²,

e_{0}^{2}

, LL²,

γ_{d}^{2}

, SR × Fs, SR ×

e_{0}

, SR × LL, SR ×

γ_{d}

, Fs ×

e_{0}

, Fs × LL, Fs ×

γ_{d}

,

e_{0}

× LL,

e_{0} \times γ_{d}

, and LL ×

γ_{d}

. The architecture of the suggested ANN-GMDH model, depicting the input characteristics, is illustrated in Figure 10B, while the conventional ANN model is represented in Figure 10A.

2.4.7. Symbolic Regression

Symbolic regression (SR) is a machine learning algorithm that searches for mathematical equations or formulas that best describe and fit data relationships. It allows for the modeling of complex relationships and the creation of predictive equations without making any assumptions about the underlying mathematical form. In capturing complex nonlinear relationships, symbolic regression outperforms multiple linear regression (MLR), which is limited to linear relationships between variables [98]. The symbolic regression algorithm is used in this study to determine the equation of

q_{u}

and

E_{t}

. In this study, EUREQA 1.24.0 software is used for this purpose. EUREQA begins the symbolic regression process by configuring algorithm parameters and establishing the equation space, as shown in Figure 11. It begins with a small set of equations. These equations are subjected to a fitness evaluation, which determines how well they correspond to the given dataset. The algorithm grows and refines this set by including and excluding equations dynamically based on their fitness and relevance. Equations with high fitness are subject to adaptive evolutionary mechanisms, allowing the algorithm to iteratively explore and refine equations. EUREQA adjusts its equation population, concentrating computational resources on promising equations. Iterations of this dynamic evolution continue until convergence or the fulfillment of predetermined termination criteria. In the end, EUREQA chooses the best-fit equation or equations to analyze and validate their correctness in capturing the underlying patterns in the dataset.

2.5. Hyperparameters Optimization

In this study, a systematic strategy is employed for hyperparameter tuning, which focuses on refining machine learning models via the GridSearchCV technique. Specifically, GridSearchCV is utilized across GBR, XGBoost, SVR, ANN, ANN-GMDH RFR and DTR, models. This approach involves systematically exploring a predefined grid of hyperparameter values, coupled with cross-validation for each combination, to identify the most effective set that enhances model performance. Considering the more compact and manageable hyperparameter spaces of these models, a comprehensive exploration proves both feasible and advantageous [99,100,101]. While acknowledging the larger and intricate hyperparameter spaces of the GBR and ANN models, requiring additional optimization time, the study opts for GridSearchCV for consistency. This strategic choice guarantees a systematic and exhaustive examination of the hyperparameter space, aligning with the study’s methodology and obviating the necessity for Randomized Search CV [101].

Artificial neural networks (ANNs) powered by TensorFlow are instrumental in unraveling complex patterns within datasets. Specifically designed for regression tasks, ANNs utilize a structure of interconnected nodes organized into layers to establish correlations between input features and continuous output values. In TensorFlow-based ANNs, hyperparameter optimization involves thorough exploration of the parameter space, encompassing architecture parameters like activation functions, neuron count per layer, and layer count, as well as training process parameters such as number of training epochs, batch size, and learning rate. This systematic approach aims to identify the optimal combination of hyperparameters that enhances predictive accuracy, exploring various configurations and leveraging TensorFlow’s capabilities for efficient evaluation and training.

This process entails a thorough exploration of these models’ parameter spaces to identify optimal configurations. The optimal settings for each model regarding

q_{u}

and

E_{t}

, respectively, are displayed in Table 5 and Table 6.

3. Results and Discussion

3.1. Effect of Initial Dry Unit Weight

The study investigated the effect of initial dry unit weight on the elasticity modulus (

E_{t}

) and unconfined compressive strength (

q_{u}

) of sand–clay mixtures with varying percentages of added sand. The initial saturation level was set at 75% for all samples, while the densities were varied for each sample. Figure 12 shows the change in

q_{u}

and

E_{t}

with the initial dry unit weight for each percentage of sand.

The results showed an increase in both

q_{u}

and

E_{t}

with an increase in the dry unit weight for each sand-added percentage. On the other hand, both

q_{u}

and

E_{t}

decreased with an increase in the percentage of sand for all added sand percentages. The reason for this trend is that as the percentage of sand increases for the same dry density, the size of the voids in the soil increases, while the amount of clay minerals that have high water affinity decreases, leading to a reduction in suction forces within the pores of the soil.

Figure 12A: This graph is a scatter plot with ascending lines reflecting the trend for a certain percentage of added sand (Fs) inside the soil mixture. The x-axis shows the dry unit weight (

γ_{d}

kN/

m^{3}

), which ranges from 13 kN/

m^{3}

to 18% kN/

m^{3}

, and the y-axis indicates the unconfined compressive strength (

q_{u}

), which ranges from 0 to approximately 1100 kPa. The general trend indicates that as dry unit weight increases, so does unconfined compressive strength, especially after exceeding the maximum dry unit weight of the standard proctor test for each sand content.

Figure 12B: This graph, like Figure 12A, is a scatter plot with ascending lines, but it illustrates the change in elasticity modulus (

E_{t}

). The y-axis has a significantly broader scale, extending from 0 to 90,000 kPa, representing the higher values typical of elasticity modulus measurements. The lines use the same color and style codes as in Figure 12A and demonstrate the similar pattern of increasing elasticity modulus with increasing dry unit weight. The sharp rise in modulus indicates that the soil becomes much stiffer and less deformable as it becomes denser.

The trend in Figure 12A shows a clear direct relationship between

γ_{d}

and

q_{u}

. As

γ_{d}

increases,

q_{u}

increases across all sand content percentages (Fs), indicating that density compromises soil strength. The results show a significant increase in strength after exceeding the maximum dry unit weight of a standard proctor. The various lines for different Fs percentages indicate that mixtures with more added sand tend to retain lower strength at the same dry unit weight than those with less sand. This behavior implies that soils with lower sand content are more susceptible to strength gain with increased dry unit weight.

In Figure 12B, the elasticity modulus (

E_{t}

) shows a similar upward trend with increasing dry unit weight, indicating an increase in soil stiffness. The lines plotted for different Fs percentages all slope upward, indicating that as the soil becomes denser, its ability to elastically deform and return to its original shape improves.

Overall, the results of this study suggest that the addition of sand to clayey soils can enhance their strength and stiffness properties, but the percentage of sand should be carefully considered to avoid reducing these properties due to an increase in voids size and a decrease in water affinity of clay minerals.

3.2. Effect of Initial Degree of Saturation

The impact of the initial degree of saturation on the mechanical properties of sand-clay mixtures with varying sand content was investigated. The maximum dry density was calculated for each percentage of sand using the standard Proctor test, and it was found that the maximum dry density increased with an increasing percentage of sand, as depicted in Figure 5. The relationship between

q_{u}

and

E_{t}

with the degree of saturation (SR) was studied separately for each sand percentage, and the results are presented in Figure 13. A sharp decrease in both the unconfined compressive strength and elasticity modulus was observed as SR increased, and the decrease became less severe as SR increased due to the filling of soil pores with water and the corresponding decrease in suction forces.

Figure 13A: This graph is a scatter plot with a series of descending lines, each line representing the trend for a specific percentage of sand (Fs) within the soil mixture. The x-axis represents SR% ranging from about 30% to 100%, and the y-axis represents

q_{u}

measured in kilopascals (kPa), which ranges from approximately 0 to 1800 kPa. The general trend shows that as saturation increases, the unconfined compressive strength decreases sharply initially and then levels off, indicating a reduction in strength gain as saturation approaches 100%.

Figure 13B: This graph is also a scatter plot with descending lines, similar to Figure 13A, but it depicts the change in elasticity modulus (

E_{t}

). The y-axis has a much larger scale, ranging from 0 to 160,000 kPa, reflecting the higher values typical of elasticity modulus measurements. The lines follow the same color and style codes as Figure 13A and show the same trend of reduction in the elasticity modulus with an increase in soil saturation. The sharp decline in modulus suggests that the material becomes significantly less stiff and more deformable as it becomes wetter, which is a critical consideration for soil mechanics and engineering.

The trend in Figure 13A demonstrates a clear inverse relationship between SR and

q_{u}

of the soil. As the saturation increases,

q_{u}

decreases across all sand content percentages (Fs), indicating that soil strength is compromised by the presence of water. The results show a steep decline in strength at lower saturation levels, which then tapers as saturation approaches full capacity. The various lines for different Fs percentages suggest that the mixtures with more added sand tend to retain higher strength at the same saturation level compared to those with less sand. This behavior implies that soils with lower sand content are more susceptible to strength loss with increased water content.

In Figure 13B, the elasticity modulus (

E_{t}

) exhibits a similar downward trend with increasing soil saturation, highlighting a decrease in soil stiffness. The lines plotted for different Fs percentages all slope downward, indicating that as the soil becomes more saturated, its ability to elastically deform and return to its original shape is diminished. The sharpness of the slope varies with the percentage of added sand, with higher sand content mixtures showing a slightly less severe reduction in

E_{t}

. This reduction in elasticity with increased saturation is crucial for civil engineering applications, as it affects how soil will behave under structural loads, with potential implications for the design and stability of foundations and other soil–structure interactions.

3.3. Effect of Sand Content

The effect of varying percentages of added sand (FS) on the elasticity modulus and unconfined compressive strength is investigated by setting the maximum dry density for each percentage of added sand. Figure 14A,B are scatter plots with a series of lines, with each line representing the trend for a specific degree of saturation (SR) within the soil mixture. Figure 14 displays the change in unconfined compressive strength and elasticity modulus as a function of the percentage of sand for each degree of saturation. The results indicate that an increase in the percentage of sand leads to an increase in both elasticity modulus and unconfined compressive strength up to a certain percentage related to the degree of saturation. However, this tendency of increase becomes less significant with the increase in the degree of saturation. Moreover, after reaching a specific percentage of added sand, the strength decreases. This percentage is about 30% for high saturation degrees, where the percentage of increase in

E_{t}

and

q_{u}

is about 75% and 65%, respectively. Notably, this critical percentage increases with a decrease in the degree of saturation.

This behavior can be explained by the suction forces within the soil. The increase in density increases suction forces, while an increase in saturation and sand percentage decreases suction forces. For instance, in Figure 14A,B, for 90% or greater saturation, both strength and stiffness increase with the percentage of sand due to the increase in maximum dry density. However, after about 30% of the added sand, there is a greater decrease in suction forces due to the increase in the percentage of sand, which overcomes the increase in suction forces caused by the increase in dry density. The study found that there is a substantial rise in suction forces when the saturation degree decreases, resulting in a continuous increase in unconfined compressive strength for larger percentages of the added sand.

Unconfined compression tests on mixtures corresponding to the optimum moisture and maximum dry density according to standard Proctor’s tests revealed that the strength and elasticity modulus increase with the percentage of sand. The reason for the continued increase in strength and elasticity modulus is that the saturation degree corresponding to the optimum moisture is less than 90% for added sand percentages greater than 30%. Additionally, it is worth noting that there is an inflection point between 20% and 30% of the added sand, after which the increase becomes less severe, as shown in Figure 15. As a result, a sand percentage of 30% can be considered the ideal percentage in clay soil. The percentage of increase in the unconfined compressive strength and elasticity modulus when the sand percentage increases from 0 to 30% is approximately 68% and 81%, respectively.

3.4. Machine Learning to Predict $q_{u}$ and $E_{t}$

Initially, several established explicit models such as linear regression, Lasso, Ridge, and GMDH were utilized. The corresponding equations and associated metrics are summarized in Table 7. Notably, the GMDH model yielded the most favorable metrics, indicating its potential for generating reliable results. However, due to the lengthy equations involved, we opted to integrate the GMDH features with the most effective implicit model, namely ANN, as elaborated in the subsequent section. This combined model is denoted as ANN-GMDH. Lastly, a user-friendly program will be developed to facilitate straightforward predictive tasks.

3.4.1. Comparison of Machine Learning Models

This section presents a comparison of the models used in this study. The metrics derived from k-fold and TVT cross-validation techniques are shown in Table 8 and Table 9, respectively. Table 8 and Table 9 show that ANN-GMDH, ANN and SVR have superior performance metrics, with SVR closely matching ANN’s performance. As a result, ANN-GMDH, ANN and SVR outperform XGBoost, GBR, RFR, and DTR in predicting

E_{t}

and

q_{u}

for the expansive soil under consideration.

A comprehensive assessment of machine learning models was undertaken to evaluate their effectiveness in predicting

E_{t}

and

q_{u}

, with results presented across mean absolute error (MAE), root-mean=square error (RMSE), and R-squared (R²) metrics in Table 8 and Table 9. Decision tree regression (DTR) demonstrates strong performance on the training set but struggles with overfitting on validation and testing sets, limiting its extrapolative capability. Random forest regression (RFR) consistently produces robust results but shows slightly higher error metrics, indicating some constraints in its predictive capacity. Gradient boosting regression (GBR) maintains remarkable performance with acceptable generalization ability, while XGBoost outshines others with exceptional performance and robust generalization. Artificial neural networks (ANNs) and support vector regression (SVR) exhibit commendable performance, with ANN-GMDH emerging as a top performer, boasting high R² values and low error metrics.

It should be noted that an ML model’s capacity for generalization is assessed by its performance on unseen (testing) data. Metric scores during testing typically indicate a slight decrease in performance across all models compared to the training stage due to the unfamiliar data. To quantify this decline in performance, the degradation rate, or

r_{d}

, is defined in Equation (5) [76]:

r_{d} = |\frac{m_{t r a i n} - m_{t e s t}}{m_{t r a i n}}| \times 100 %

(5)

where

m_{t e s t}

and

m_{t r a i n}

stand for a particular measure’s values during the testing and training stages, respectively.

Notably, the K-fold technique produces the least amount of performance decay and outperforms the TVT technique, except for ANN, where the TVT technique outperforms the ANN technique in terms of consistency for

E_{t}

prediction.

All performance metrics show that the ANN-GMDH and ANN produce the least amount of performance decay among all models, with only the

r_{d}

of RMSE exceeding 10% for

E_{t}

predictions. The SVR follows. The performance of the other models significantly declined, particularly when it came to the MAE and RMSE for DTR using TVT technique for

q_{u}

predictions, where the rd values were 1544% and 4969%, respectively. Throughout both training and testing phases, the ANN-GMDH and ANN models demonstrated remarkable consistency in their performance, with ANN-GMDH, ANN and SVR delivering the most accurate predictions. However, because of the widespread use of ANN in geotechnical engineering [50,51,52,54,66,68,69,70,71,72] and consistently superior performance in achieving better metrics. ANNs build complex input–output models that can learn intricate relationships within multidimensional data. Their widespread application spans a wide range of engineering disciplines [92,93]. This paper will primarily focus on ANN-GMDH and ANN predictions using the TVT technique because it provides more consistent results than the K-fold for ANN predictions.

ANN Models to Predict Elasticity Modulus ( $E_{t}$ )

Figure 16 casts light on the regression cross plots of

E_{t}

model, which harnesses the power of artificial neural networks (ANNs) for elasticity modulus prediction. The figure delineates the congruence and disparities between true and predicted values across the training, validation, and test sets. Figure 16A–H highlights the precise predictions of ANN

E_{t}

models, displaying a favorable alignment with the x = y reference line and best fit trend line (ideal line) across a range of elasticity moduli [54]. In the training set, the

E_{t}

models achieve an R-squared of 0.9951 for ANN and 0.9963 for ANN-GMDH. the mean absolute error is 109.116 for ANN and 95.66 for ANN-GMDH, while the root-mean-square error (RMSE) is 145.4 for ANN and 125.67 for ANN-GMDH. This evidences the models’ capability to apprehend intricate patterns within the training data and produce predictions that exhibit a commendable level of accuracy. In the validation set, the models maintain their predictive strength, achieving an R-squared of 0.9955 for ANN and 0.9956 for ANN-GMDH. The mean absolute error (MAE) is 108.792 for ANN and 96.658 for ANN-GMDH, while the root-mean-square error (RMSE) is 129.15 for ANN and 128.24 for ANN-GMDH. This highlights the models’ prowess in extrapolating predictions to new data, with minimal deviations from the actual values. In the test set, consistent performance is observed, with an R-squared of 0.9924 for ANN and 0.9935 for ANN-GMDH. The mean absolute error (MAE) is 115.408 for ANN and 120.86 for ANN-GMDH, while the root-mean-square error (RMSE) is 170.03 for ANN and 157.31 for ANN-GMDH. This performance reiterates the models’ proficiency in generating dependable predictions on independent data instances. When juxtaposed with SR model, it becomes apparent that ANN models occupy a favorable position. The ANN-GMDH model consistently outperforms the ANN model across all evaluation metrics, including R², MAE, and RMSE. However, it is important to note that the

r_{d}

value of the ANN-GMDH model is slightly worse than that of the ANN model. Overall, the ANN-GMDH model demonstrates better predictive accuracy and error minimization compared to the ANN model. The models aptly capture underlying relationships within the data and translates them into reliable predictions. This performance is poised to deliver valuable insights for elasticity prediction applications.

ANN Model to Predict Unconfined Compressive Strength ( $q_{u}$ )

Figure 17 casts light on the regression cross plots of

q_{u}

model, which harnesses the power of artificial neural networks (ANNs) for unconfined compression strength prediction. The figure delineates the congruence and disparities between true and predicted values across the training, validation, and test sets. In Figure 17A–H, the ANN

q_{u}

models display precise predictions, aligning favorably with the x = y reference line and best-fit trend line (ideal line) across a range of unconfined compression strengths [54]. In the training set, the ANN model achieves an R² of 0.995 with an MAE of 19.471 and an RMSE of 28.54, while the ANN-GMDH model achieves an R² of 0.996 with an MAE of 17.73 and an RMSE of 25.54. This evidences the models’ capability to apprehend intricate patterns within the training data and produce predictions that exhibit a commendable level of accuracy. In the test set, both the ANN and ANN-GMDH models exhibit consistent performance, with the ANN model achieving an R-squared of 0.9944, an MAE of 18.9, and an RMSE of 27.63, while the ANN-GMDH model achieves an R-squared of 0.9949, an MAE of 18.45, and an RMSE of 26.37. This performance reiterates the models’ proficiency in generating dependable predictions on independent data instances. When juxtaposed with SR model, it becomes apparent that ANN-GMDH and ANN models occupy a favorable position. The models aptly capture underlying relationships within the data and translates them into reliable predictions. This performance is poised to deliver valuable insights for compressive strength prediction applications. While both the ANN-GMDH and ANN models achieve almost the same metrics, the ANN-GMDH model achieves better accuracy overall. However, it is important to note that its

r_{d}

value is slightly worse than that of the ANN model.

Moreover, establishing an acceptable error margin is pivotal in assessing the sufficiency of the training dataset. Figure 16 and Figure 17 depict a 20% error margin on each cross plot to illustrate this concept. The performance of both the

E_{t}

and

q_{u}

models was commendable, as the majority of predictions fell within the error margin lines across the training, validation, and testing phases. This underscores the adequacy of the training dataset in supporting the predictive models for

E_{t}

and

q_{u}

using the proposed ANN model in this study.

The precision observed in predicted accuracy is the result of a strategy based on statistical rigor and practical relevance. The models were trained to detect subtle patterns in the data by carefully selecting variables and exhaustively assessing their statistical features. Additionally, a diverse array of machine learning methods including ANN, GBR, XGBoost, SVR, DTR, and RFR were employed to ensure comprehensive exploration of predictive patterns. Furthermore, the intensive evaluation approach, which included cross-validation techniques on 225 datasets, enabled a thorough assessment of the models’ generalization capabilities. Hyperparameter tuning fine-tuned the models’ performance, ensuring they were well-suited to the dataset’s subtleties. Recognizing the inherent randomness and possibility for human error, the methodology promoted openness and accountability by employing statistical indicators such as RMSE, MAE, and R². This comprehensive approach emphasizes the methodology’s rigor and the predictive models’ reliability.

3.4.2. Streamlined Interface for ANN Model Predictions

The incorporation of a user-friendly interface enhances this approach by providing a streamlined, one-click prediction process. This interface simplifies the utilization of the ANN model, ensuring accessibility for users of varying technical expertise. For instance, to utilize the ANN

q_{u}

model, users can easily open the “ANN

q_{u}

model.ipynb” (Supplementary Materials) Python file using the Jupyter Notebook app. By simply pressing shift + enter, the interface promptly appears, facilitating a user-friendly experience, as depicted in Figure 18. This straightforward process remains consistent for the ANN

E_{t}

model, ensuring uniform and accessible user interaction.

3.4.3. Symbolic Regression Equations

The EUREQA software is used in this study to uncover latent patterns and relationships in the dataset and generate equations for predicting the unconfined compressive strength (

q_{u}

) and elasticity modulus (

E_{t}

) [98].

Proposed Equations of Elasticity Modulus ( $E_{t}$ )

Equations (6)–(8) are formulated via the utilization of the EUREQA software. It is crucial to acknowledge that as the metric is enhanced, the resulting equations tend to become more intricate, necessitating additional time to achieve optimal performance.

E_{t} = \frac{γ_{d} * L L * S R^{2}}{e^{\frac{e_{0} + S R}{γ_{d}}}} (R^{2} = 0.90816; RMSE = 615.6; MAE = 495.03)

(6)

E_{t} = γ_{d} * \sqrt{\frac{e^{e_{0} + γ_{d} + \frac{L L}{γ_{d}}}}{e^{\frac{S R}{γ_{d}}}}} (R^{2} = 0.93394; RMSE = 522.13; MAE = 412.17)

(7)

E_{t} = γ_{d} * \frac{L L * S R^{2} + F S * γ_{d}^{2}}{S R^{\frac{1}{γ_{d}} + \frac{{S R * l n (e}_{0})}{γ_{d}^{2}}} e^{\frac{S R}{γ_{d}} + \frac{{e_{0} * γ}_{d} * F S * S R^{2}}{e_{0} + F S * L L^{3}}}} (R^{2} = 0.9676; RMSE = 365.54; MAE = 283.45)

(8)

where the following are defined:

$E_{t}$ —Elasticity modulus;
e₀—Initial void ratio: $e_{0} = \frac{γ_{s}}{γ_{d}} - 1$ ;
$γ_{d}$ —Dry unit weight kN/ $m^{3}$ ;
$γ_{s}$ —Unit weight of solids kN/ $m^{3}$ ;
SR—Initial degree of saturation %;
Fs—Percentage of added sand %;
LL–Liquid limit %.

Figure 19 illustrates the distinction between true and predicted

(\frac{E_{t}}{γ_{d}})

data points. The data points in the comparative visual representation align more closely with the best-fit line (ideal line) for Equations (6)–(8), respectively, as the metrics R², RMSE, and MAE improve. It should be noted that most data points are located between the 20% error margin lines. However, caution must be exercised when

(\frac{E_{t}}{γ_{d}} \leq 1000 m)

where there are notable bounds out of the +20% error margine line, so it is preferable to reduce it by 20% to be on the safe side in this case.

Proposed Equations of Unconfined Compression Strength $q_{u}$

For the prediction of unconfined compression strength (

q_{u}

), Equations (9)–(11) are formulated using the EUREQA software.

q_{u} = \frac{L L^{2} * {γ_{d}}^{2}}{{e_{0}}^{3} * S R^{2}} + \frac{γ_{d} - F s}{e_{0}} (R^{2} = 0.96; RMSE = 77; MAE = 51.6)

(9)

q_{u} = \frac{e_{0} * γ_{d} * {γ_{d}}^{l n (L L)}}{{e_{0}}^{3} * S R^{2} - S R} + L L + γ_{d} - S R (R^{2} = 0.971; RMSE = 65.66; MAE = 35.7)

(10)

q_{u} = \frac{e_{0}^{2} * γ_{d} * {γ_{d}}^{l n (L L)}}{{e_{0}}^{4} * S R^{2} + L L - γ_{d} - S R} + L L + γ_{d} - S R (R^{2} = 0.983; RMSE = 50.24; MAE = 29.9)

(11)

where the following are defined:

$q_{u}$ —unconfined compressive strength.

Figure 20 shows improved alignment of true and predicted data points with the reference line y = x and best-fit trend line (ideal line) for Equations (9)–(11) as metrics (R², RMSE, and MAE) improve. Equation 11 has the highest metrics, while Equation (9) has the lowest. It should be noted that most data points are located between the 20% error margin lines. This demonstrates that

q_{u}

equations are capable of accurately predicting the

q_{u}

.

It is crucial to highlight that as metrics are improved, the generated equations become more complex, necessitating more time to achieve optimal performance. EUREQA can generate many equations. Equations (6)–(11) are chosen for their practicality and ability to produce favorable metrics.

To ensure the validity of the Equations (6)–(11), the following conditions must be met: sand content below 50%, The Atterberg limits ought to be situated slightly above and in proximity to line A on the Casagrande chart, initial degree of saturation exceeding 50%, and a liquid limit within the range of 40–80%.

When the metrics of ANN and SR are compared (Figure 21), as well as the visual examination of true and predicted targets in Figure 22, artificial neural networks outperform symbolic regression. This observation highlights the superior performance of artificial neural networks in comprehending complex patterns within data, outperforming symbolic regression.

Prior research using single or multiple regression techniques focused on only one or two inputs, including (w_n, LL, and/or

γ_{d}

) [1,2]. The highest achieved R-squared values were 0.89 in work by Khalid et al. (2015) and 0.606 in work by Senoon and Hussein, (2018), confirming symbolic regression equations’ superior performance over single and multiple regression models. Upon applying the equations from Senoon and Hussein (2018) and Khalid et al. (2015)’s works to our dataset, a large number of data points were predicted to have negative values. This happened as a result of the models in Senoon and Hussein (2018) and Khalid et al. (2015)’s works depending only on one or two inputs. Furthermore, their datasets were not sufficiently explored in terms of expansive soil properties. Senoon and Hussein (2018) concentrated on MH classified soils and specifically avoided expansive soil behavior, whereas Khalid et al. (2015) only addressed a small portion of this soil classification. The power of the equations in our work comes from the ability to accurately predict

q_{u}

and

E_{t}

in expansive soils that are partially or fully saturated.

4. Feature Importance

Two methods were used to evaluate the importance of the selected variables: the SHapley Additive ExPlanations (SHAP) values [102] and the algorithm’s inherent computation of traditional feature importance. In the latter approach, following algorithm training, relative relevance scores for each input characteristic can be derived [103]. A higher score indicates a greater impact on prediction. The analysis results are shown in Table 10, indicating that across all models, saturation degree holds the highest influence on

q_{u}

and

E_{t}

, followed by void ratio and dry unit weight, with liquid limit and sand percentage having the least influence. Notably, XGBoost places saturation degree on top, followed by void ratio and liquid limit, with dry unit weight and sand percentage at the bottom.

Additionally, the SHAP method was employed, providing a consistent approach to explaining the output of any tree-based model and offering distinct advantages over other methods. Figure 23 and Figure 24 illustrates the combination of feature importance with feature effects. Each point in the graph represents a SHAP value for a feature and prediction, with the y-axis displaying features in descending order of importance. SHAP values are plotted along the x-axis, where a value of zero indicates no contribution to the prediction, with increasing contributions as the SHAP value moves away from zero. Furthermore, color gradation from low (blue) to high (red) feature values provides insight into the directional impact of features. The distribution of blue and red dots offers an overall understanding of the directionality impact of features [55].

According to the random forest regression (RFR) model, the degree of saturation emerges as the most influential variable impacting both

q_{u}

and

E_{t}

, showing a negative correlation, consistent with findings from [104,105]. Following closely is the void ratio, exhibiting a smaller yet still significant negative correlation, as observed in the study by [106].

Additionally, the dry unit weight emerges as another influential factor affecting

q_{u}

and

E_{t}

, demonstrating a positive correlation, as indicated in previous studies such as those by [106,107]. The sand content parameter shows a negative correlation, aligning with the findings of [108]. Conversely, the liquid limit variable demonstrates the least impact on both

q_{u}

and

E_{t}

, with a positive correlation, consistent with studies by [106,107].

These observations suggest that the models have effectively captured the relationships between each independent variable and

q_{u}

and

E_{t}

, in line with existing technical literature. A concise summary of the feature importance analysis conducted using the two approaches is presented in Table 10.

5. Conclusions

This study investigates the effect of sand in expansive soil on unconfined test parameters and presents a novel application of machine learning models to clarify the intricate behaviors of expansive soils. Focusing on the impact of sand content, saturation level, and dry density, the following conclusions were made:

The modulus of elasticity ( $E_{t}$ ) and unconfined compressive strength ( $q_{u}$ ) are positively correlated with the soil’s dry unit weight.
Increasing sand content, while keeping dry unit weight constant, reduces modulus of elasticity and unconfined compressive strength due to expanded voids decreasing suction forces.
A rise in the degree of saturation results in a notable but nonlinear decrease in both modulus of elasticity and unconfined compressive strength, becoming less pronounced at higher saturation levels.
For samples at maximum dry unit weight, adding sand initially increases strength and elasticity up to a threshold of 30% sand content, beyond which the effect reverses.
Optimal sand content for maximum strength and elasticity at full saturation is 30%.

Further into the study, equations derived from symbolic regression are introduced to predict the modulus of elasticity and unconfined compressive strength. These equations consider a range of factors including void ratio, degree of saturation, liquid limit, and sand percentage, thereby providing a robust predictive framework for soil properties.

Feature importance analysis reveals the following:

Degree of saturation is the most influential variable, showing a negative correlation with $q_{u}$ and $E_{t}$ .
Void ratio shows a significant negative correlation.
Dry unit weight demonstrates a positive correlation.
Sand content exhibits a negative correlation.
Liquid limit has the least impact, with a positive correlation.

The ANN-GMDH model, integrating GMDH features, outperforms other models in predicting soil behavior. Both ANN and SVR models are effective, with ANNs excelling in capturing complex patterns.

This study’s limitations arise from depending on a specific dataset tailored to a particular range for each parameter, as indicated in Table 4. Additionally, Atterberg limits should be positioned slightly above and near line A on the Casagrande chart. This commitment to achieving accurate predictions may, however, pose constraints on the generalizability of the findings.

Although the evaluated models produced reasonable results, there is still room for improvement. Future work could explore new methods such as deep learning or reinforcement learning. Additionally, enhancing the models’ prediction capabilities might require incorporating other characteristics, such as the plasticity index, clay percentage, maximum dry density, and optimum saturation degree. Expanding the dataset and refining risk mitigation strategies related to expansive soils should be a focus of future research, contributing to the continuous advancement of geotechnical engineering practices. These enhancements aim to further solidify the applicability and effectiveness of the proposed approach in real-world scenarios. In conclusion, while this study provides valuable insights, addressing these limitations via broader datasets and refined strategies will be crucial for improving the reliability and robustness of predictive models in geotechnical engineering.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/app14114819/s1.

Author Contributions

Conceptualization, A.A., M.O.A. and R.P.R.; investigation, A.A. and M.O.A.; experimental work, A.A.; writing—original draft preparation, A.A., H.H.H. and C.L.; modeling, A.A., R.P.R., H.H.H. and C.L; writing—review and editing, R.P.R., M.O.A. and A.A.; supervision, R.P.R. and M.O.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding. Funding for open access granted by Szechenyi István University (SZE).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Senoon, A.-A.; Hussein, M. Correlation between Unconfined Compression Strength (UCS) and Index Properties of Soil in Assiut Governorate, Egypt. In Proceedings of the Fifteenth International Conference on Structural and Geotechnical Engineering, Cairo , Egypt, December 2018; Advances In Construction Techniques. Ain Shams University, Faculty of Engineering Structural Engineering Department: Cairo , Egypt, 2018. Available online: https://staffsites.sohag-univ.edu.eg/uploads/470/1546201444%20-%20Correlation%20between%20Unconfined%20Compression%20Strength%20(UCS)%20and%20Index%20Properties%20of%20Soil%20in%20Assiut%20Governorate,%20Egypt.pdf (accessed on 29 April 2024).
Khalid, U.; Ur-Rehman, Z.; Farooq, K.; Mujtaba, H. Prediction of Unconfined Compressive Strength from Index Properties of Soils. Sci. Int. 2015, 27, 4071–4075. [Google Scholar]
Steinberg, M.L. Controlling Expansive Soil Destructiveness by Deep Vertical Geomembranes on Four Highways; Transportation Research Board: Washington, DC, USA, 1985; ISBN 0309039231. Available online: https://trid.trb.org/View/272080 (accessed on 29 April 2024).
Goodarzi, A.R.; Akbari, H.R.; Salimi, M. Enhanced Stabilization of Highly Expansive Clays by Mixing Cement and Silica Fume. Appl. Clay Sci. 2016, 132, 675–684. [Google Scholar] [CrossRef]
Kolay, P.K.; Ramesh, K.C. Reduction of Expansive Index, Swelling and Compression Behavior of Kaolinite and Bentonite Clay with Sand and Class C Fly Ash. Geotech. Geol. Eng. 2016, 34, 87–101. [Google Scholar] [CrossRef]
Salimi, M.; Ilkhani, M.; Vakili, A.H. Stabilization Treatment of Na-Montmorillonite with Binary Mixtures of Lime and Steelmaking Slag. Int. J. Geotech. Eng. 2018, 14, 295–301. [Google Scholar] [CrossRef]
Nelson, J.; Miller, D.J. Expansive Soils: Problems and Practice in Foundation and Pavement Engineering; John Wiley & Sons: New York, NY, USA, 1997; ISBN 0471181145. [Google Scholar]
Kaoua, F.; Derriche, Z.; Laradi, N. Contribution à l’étude de La Stabilisation Des Sols Gonflants Par Ajouts de Sable. Algérie Equip. Rev. De L’ecole Natl. Des Trav. Publics 1994, 15, 12–15. [Google Scholar]
Bahia, L.; Ramdane, B. SAND: An Additive for Stabilzation of Swelling Clay Soils. Int. J. Geosci. 2012, 3, 719–725. [Google Scholar] [CrossRef]
Nagaraj, H.B. Influence of Gradation and Proportion of Sand on Stress–Strain Behavior of Clay–Sand Mixtures. Int. J. Geo-Eng. 2016, 7, 19. [Google Scholar] [CrossRef]
Deng, Y.; Wu, Z.; Cui, Y.; Liu, S.; Wang, Q. Sand Fraction Effect on Hydro-Mechanical Behavior of Sand-Clay Mixture. Appl. Clay Sci. 2017, 135, 355–361. [Google Scholar] [CrossRef]
Phanikumar, B.R.; Dembla, S.; Yatindra, A. Swelling Behaviour of an Expansive Clay Blended With Fine Sand and Fly Ash. Geotech. Geol. Eng. 2021, 39, 583–591. [Google Scholar] [CrossRef]
Alnmr, A.; Ray, R. Investigating the Impact of Varying Sand Content on the Physical Characteristics of Expansive Clay Soils from Syria. Geotech. Geol. Eng. 2023, 42, 2675–2691. [Google Scholar] [CrossRef]
Alnmr, A.; Ray, R.P.; Alsirawan, R. A State-of-the-Art Review and Numerical Study of Reinforced Expansive Soil with Granular Anchor Piles and Helical Piles. Sustainability 2023, 15, 2802. [Google Scholar] [CrossRef]
Alnmr, A.; Ray, R.P.; Alsirawan, R. Comparative Analysis of Helical Piles and Granular Anchor Piles for Foundation Stabilization in Expansive Soil: A 3D Numerical Study. Sustainability 2023, 15, 11975. [Google Scholar] [CrossRef]
Khemissa, M.; Mahamedi, A.; Mekki, L. Problematic Soil Mechanics in the Algerian Arid and Semi-Arid Regions: Case of M’sila Expansive Clays. J. Appl. Eng. Sci. Technol. 2015, 1, 37–41. [Google Scholar]
Roy, T.K. Influence of Sand on Strength Characteristics of Cohesive Soil for Using as Subgrade of Road. Procedia Soc. Behav. Sci. 2013, 104, 218–224. [Google Scholar] [CrossRef]
Yao, Y.; Li, J.; Ni, J.; Liang, C.; Zhang, A. Effects of Gravel Content and Shape on Shear Behaviour of Soil-Rock Mixture: Experiment and DEM Modelling. Comput. Geotech. 2022, 141, 104476. [Google Scholar] [CrossRef]
Alnmr, A.; Alzawi, M.; Abdullah, S.; Ibraheem, J. Prediction by Improving the Characteristics of Compaction and Strength in the Expansive Soils According to Its Content of Sand. Tishreen Univ. J. -Eng. Sci. Ser. 2020, 42, 203–219. [Google Scholar]
Prasad, C.R.V.; Sharma, R.K. Influence of Sand and Fly Ash on Clayey Soil Stabilization. IOSR J. Mech. Civ. Eng. 2014, 334, 36–40. [Google Scholar]
Al Rawi, O.S.; Assaf, M.N.; Hussein, N.M. Effect of Sand Additives on the Engineering Properties of Fine Grained Soils. ARPN J. Eng. Appl. Sci. 2018, 13, 3197–3206. [Google Scholar]
Alnmr, A. Material Models to Study the Effect of Fines in Sandy Soils Based on Experimental and Numerical Results. Acta Tech. Jaurinensis 2021, 14, 651–680. [Google Scholar] [CrossRef]
Alnmr, A.; Ray, R.P. Review of the Effect of Sand on the Behavior of Expansive Clayey Soils. Acta Tech. Jaurinensis 2021, 14, 521–552. [Google Scholar] [CrossRef]
Alnmr, A.N.; Omran Alzawi, M.; Abdallah, S. Study the Effect of the Percentage of Fines on the Compressibility Behavior of Sandy Soils. Tishreen Univ. J. Res. Sci. Stud. 2017, 4, 2079–3001. Available online: http://www.journal.tishreen.edu.sy/index.php/engscnc/article/view/3677/0 (accessed on 29 April 2024).
Alnmr, A.; Ray, R. Numerical Simulation of Replacement Method to Improve Unsaturated Expansive Soil. Pollack Period. 2023, 18, 41–47. [Google Scholar] [CrossRef]
Alnmr, A.; Alzawi, M.O.; Ray, R.; Abdullah, S.; Ibraheem, J. Experimental Investigation of the Soil-Water Characteristic Curves (SWCC) of Expansive Soil: Effects of Sand Content, Initial Saturation, and Initial Dry Unit Weight. Water 2024, 16, 627. [Google Scholar] [CrossRef]
Khan, F.S.; Azam, S.; Raghunandan, M.E.; Clark, R. Compressive Strength of Compacted Clay-Sand Mixes. Adv. Mater. Sci. Eng. 2014, 2014, 1–6. [Google Scholar] [CrossRef]
Mallikarjuna Rao, K.; Subba Rao, G.V.R. Influence of Coarse Fraction on Characteristics of Expansive Soil–Sand Mixtures. Int. J. Geosynth. Ground Eng. 2018, 4, 19. [Google Scholar] [CrossRef]
Fredlund, D.G.; Rahardjo, H. Soil Mechanics for Unsaturated Soils; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 1993; ISBN 9780470172759. [Google Scholar]
Yao, Y.; Ni, J.; Li, J. Stress-Dependent Water Retention of Granite Residual Soil and Its Implications for Ground Settlement. Comput. Geotech. 2021, 129, 103835. [Google Scholar] [CrossRef]
Wong, J.T.F.; Chow, K.L.; Chen, X.W.; Ng, C.W.W.; Wong, M.H. Effects of Biochar on Soil Water Retention Curves of Compacted Clay during Wetting and Drying. Biochar 2022, 4, 4. [Google Scholar] [CrossRef]
Vanapalli, S.K.; Fredlund, D.G.; Pufahl, D.E.; Clifton, A.W. Model for the Prediction of Shear Strength with Respect to Soil Suction. Can. Geotech. J. 2011, 33, 379–392. [Google Scholar] [CrossRef]
Vanapalli, S.K.; Fredlund, D.G.; Pufahl, D.E. Comparison of Saturated-Unsaturated Shear Strength and Hydraulic Conductivity Behavior of a Compacted Sandy-Clay Till. In Proceedings of the Proceedings of the 50th Canadian Geotechnical Conference; Ottawa, ON, USA, 20 October 1997, pp. 20–22.
Ling, Q.; Zhang, Q.; Wei, Y.; Kong, L.; Zhu, L. Slope Reliability Evaluation Based on Multi-Objective Grey Wolf Optimization-Multi-Kernel-Based Extreme Learning Machine Agent Model. Bull. Eng. Geol. Environ. 2021, 80, 2011–2024. [Google Scholar] [CrossRef]
Liu, L.; Zhang, S.; Cheng, Y.M.; Liang, L. Advanced Reliability Analysis of Slopes in Spatially Variable Soils Using Multivariate Adaptive Regression Splines. Geosci. Front. 2019, 10, 671–682. [Google Scholar] [CrossRef]
Wang, H.; Zhang, L.; Yin, K.; Luo, H.; Li, J. Landslide Identification Using Machine Learning. Geosci. Front. 2021, 12, 351–364. [Google Scholar] [CrossRef]
Ray, R.; Kumar, D.; Samui, P.; Roy, L.B.; Goh, A.T.C.; Zhang, W. Application of Soft Computing Techniques for Shallow Foundation Reliability in Geotechnical Engineering. Geosci. Front. 2021, 12, 375–383. [Google Scholar] [CrossRef]
Wang, L.; Wu, C.; Tang, L.; Zhang, W.; Lacasse, S.; Liu, H.; Gao, L. Efficient Reliability Analysis of Earth Dam Slope Stability Using Extreme Gradient Boosting Method. Acta Geotech. 2020, 15, 3135–3150. [Google Scholar] [CrossRef] [PubMed]
Li, S.; Zhao, H.B.; Ru, Z. Slope Reliability Analysis by Updated Support Vector Machine and Monte Carlo Simulation. Nat. Hazards 2013, 65, 707–722. [Google Scholar] [CrossRef]
Kumar, M.; Samui, P. Reliability Analysis of Pile Foundation Using ELM and MARS. Geotech. Geol. Eng. 2019, 37, 3447–3457. [Google Scholar] [CrossRef]
Shen, H.; Li, J.; Wang, S.; Xie, Z. Prediction of Load-Displacement Performance of Grouted Anchors in Weathered Granites Using FastICA-MARS as a Novel Model. Geosci. Front. 2021, 12, 415–423. [Google Scholar] [CrossRef]
Li, T.Z.; Pan, Q.; Dias, D. Active Learning Relevant Vector Machine for Reliability Analysis. Appl. Math. Model. 2021, 89, 381–399. [Google Scholar] [CrossRef]
Kardani, N.; Aminpour, M.; Nouman Amjad Raja, M.; Kumar, G.; Bardhan, A.; Nazem, M. Prediction of the Resilient Modulus of Compacted Subgrade Soils Using Ensemble Machine Learning Methods. Transp. Geotech. 2022, 36, 100827. [Google Scholar] [CrossRef]
Yi, P.; Wei, K.; Kong, X.; Zhu, Z. Cumulative PSO-Kriging Model for Slope Reliability Analysis. Probabilistic Eng. Mech. 2015, 39, 39–45. [Google Scholar] [CrossRef]
Zhang, W.; Wu, C.; Tang, L.; Gu, X.; Wang, L. Efficient Time-Variant Reliability Analysis of Bazimen Landslide in the Three Gorges Reservoir Area Using XGBoost and LightGBM Algorithms. Gondwana Res. 2022, 123, 41–53. [Google Scholar] [CrossRef]
Shi, C.; Wang, Y. Development of Subsurface Geological Cross-Section from Limited Site-Specific Boreholes and Prior Geological Knowledge Using Iterative Convolution XGBoost. J. Geotech. Geoenviron. Eng. 2021, 147, 04021082. [Google Scholar] [CrossRef]
Wang, M.-X.; Huang, D.; Wang, G.; Li, D.-Q. SS-XGBoost: A Machine Learning Framework for Predicting Newmark Sliding Displacements of Slopes. J. Geotech. Geoenviron. Eng. 2020, 146, 04020074. [Google Scholar] [CrossRef]
Alsirawan, R.; Sheble, A.; Alnmr, A. Two-Dimensional Numerical Analysis for TBM Tunneling-Induced Structure Settlement: A Proposed Modeling Method and Parametric Study. Infrastructures 2023, 8, 88. [Google Scholar] [CrossRef]
Alnmr, A. Impact of Soil Composition on Maximum Depth of Wetting in Expansive Soils. Pollack Period. 2023, 19, 85–92. [Google Scholar] [CrossRef]
Merouane, F.Z.; Mamoune, S.M.A. Prediction of Swelling Parameters of Two Clayey Soils from Algeria Using Artificial Neural Networks. Math. Model. Civ. Eng. 2018, 14, 11–26. [Google Scholar] [CrossRef]
Dutta, R.K.; Singh, A.; Gnananandarao, T. Prediction of Free Swell Index for the Expansive Soil Using Artificial Neural Networks. J. Soft Comput. Civ. Eng. 2019, 3, 47–62. [Google Scholar] [CrossRef]
Cho, S.E. Probabilistic Stability Analyses of Slopes Using the ANN-Based Response Surface. Comput. Geotech. 2009, 36, 787–797. [Google Scholar] [CrossRef]
Wang, Z.Z.; Xiao, C.; Goh, S.H.; Deng, M.-X. Metamodel-Based Reliability Analysis in Spatially Variable Soils Using Convolutional Neural Networks. J. Geotech. Geoenviron. Eng. 2021, 147, 04021003. [Google Scholar] [CrossRef]
Jalal, F.E.; Xu, Y.; Iqbal, M.; Javed, M.F.; Jamhiri, B. Predictive Modeling of Swell-Strength of Expansive Soils Using Artificial Intelligence Approaches: ANN, ANFIS and GEP. J. Environ. Manag. 2021, 289, 112420. [Google Scholar] [CrossRef]
Díaz, E.; Salamanca-Medina, E.L.; Tomás, R. Assessment of Compressive Strength of Jet Grouting by Machine Learning. J. Rock Mech. Geotech. Eng. 2023, 16, 102–111. [Google Scholar] [CrossRef]
Wu, K.; Meng, Q.; Li, R.; Luo, L.; Ke, Q.; Wang, C.; Ma, C. A Machine Learning-Based Strategy for Predicting the Mechanical Strength of Coral Reef Limestone Using X-Ray Computed Tomography. J. Rock Mech. Geotech. Eng. 2023. [Google Scholar] [CrossRef]
Chang, Z.; Catani, F.; Huang, F.; Liu, G.; Meena, S.R.; Huang, J.; Zhou, C. Landslide Susceptibility Prediction Using Slope Unit-Based Machine Learning Models Considering the Heterogeneity of Conditioning Factors. J. Rock Mech. Geotech. Eng. 2023, 15, 1127–1143. [Google Scholar] [CrossRef]
Praks, P.; Brkić, D. Symbolic Regression-Based Genetic Approximations of the Colebrook Equation for Flow Friction. Water 2018, 10, 1175. [Google Scholar] [CrossRef]
Latif, Q.B.A.I.; Memon, Z.A.; Mahmood, Z.; Qureshi, M.U.; Milad, A. A Machine Learning Model for the Prediction of Concrete Penetration by the Ogive Nose Rigid Projectile. Appl. Sci. 2022, 12, 2040. [Google Scholar] [CrossRef]
Hamidia, M.; Ganjizadeh, A. Post-Earthquake Damage Evaluation of Non-Ductile RC Moment Frames Using Surface Crack Patterns. Struct. Control Health Monit. 2022, 29, e3024. [Google Scholar] [CrossRef]
Naser, M.Z. Heuristic Machine Cognition to Predict Fire-Induced Spalling and Fire Resistance of Concrete Structures. Autom. Constr. 2019, 106, 102916. [Google Scholar] [CrossRef]
Rezaei, H.; Zarfam, P.; Golafshani, E.M.; Amiri, G.G. Seismic Fragility Analysis of RC Box-Girder Bridges Based on Symbolic Regression Method. Structures 2022, 38, 306–322. [Google Scholar] [CrossRef]
Gan, L.; Wu, H.; Zhong, Z. Integration of Symbolic Regression and Domain Knowledge for Interpretable Modeling of Remaining Fatigue Life under Multistep Loading. Int. J. Fatigue 2022, 161, 106889. [Google Scholar] [CrossRef]
Ben Chaabene, W.; Nehdi, M.L. Genetic Programming Based Symbolic Regression for Shear Capacity Prediction of SFRC Beams. Constr. Build. Mater. 2021, 280, 122523. [Google Scholar] [CrossRef]
Amanabadi, S.; Vazirinia, M.; Vereecken, H.; Vakilian, K.A.; Mohammadi, M.H. Comparative Study of Statistical, Numerical and Machine Learning-Based Pedotransfer Functions of Water Retention Curve with Particle Size Distribution Data. Eurasian Soil Sci. 2019, 52, 1555–1571. [Google Scholar] [CrossRef]
Ikizler, S.B.; Aytekin, M.; Vekli, M.; Kocabaş, F. Prediction of Swelling Pressures of Expansive Soils Using Artificial Neural Networks. Adv. Eng. Softw. 2010, 41, 647–655. [Google Scholar] [CrossRef]
Bachir, R.; Mohammed, A.M.S.; Habib, T. Using Artificial Neural Networks Approach to Estimate Compressive Strength for Rubberized Concrete. Period. Polytech. Civ. Eng. 2018, 62, 858–865. [Google Scholar] [CrossRef]
Najjar, Y.M.; Basheer, I.A.; Mcreynolds, R. Neural Modeling of Kansas Soil Swelling. Transp. Res. Rec. J. Transp. Res. Board 1996, 1526, 14–19. [Google Scholar] [CrossRef]
Najjar, Y.M.; Basheer, I.A. Modeling of Soil Swelling via Regression and Neural Network Approaches; Kansas Department of Transportation: Topeka, KS, USA, 1998. Available online: https://trid.trb.org/View/473791 (accessed on 29 April 2024).
Doris, J.J.; Rizzo, D.M.; Dewoolkar, M.M. Forecasting Vertical Ground Surface Movement from Shrinking/Swelling Soils with Artificial Neural Networks. Int. J. Numer. Anal. Methods Geomech. 2008, 32, 1229–1245. [Google Scholar] [CrossRef]
Ashayeri, I.; Yasrebi, S. Free-Swell and Swelling Pressure of Unsaturated Compacted Clays; Experiments and Neural Networks Modeling. Geotech. Geol. Eng. 2009, 27, 137–153. [Google Scholar] [CrossRef]
Erzin, Y.; Güneş, N. The Prediction of Swell Percent and Swell Pressure by Using Neural Networks. Math. Comput. Appl. 2011, 16, 425–436. [Google Scholar] [CrossRef]
Ikeagwuani, C.C. Estimation of Modified Expansive Soil CBR with Multivariate Adaptive Regression Splines, Random Forest and Gradient Boosting Machine. Innov. Infrastruct. Solut. 2021, 6, 199. [Google Scholar] [CrossRef]
Eyo, E.U.; Abbey, S.J.; Lawrence, T.T.; Tetteh, F.K. Improved Prediction of Clay Soil Expansion Using Machine Learning Algorithms and Meta-Heuristic Dichotomous Ensemble Classifiers. Geosci. Front. 2022, 13, 101296. [Google Scholar] [CrossRef]
Benbouras, M.A.; Petrisor, A.I. Prediction of Swelling Index Using Advanced Machine Learning Techniques for Cohesive Soils. Appl. Sci. 2021, 11, 536. [Google Scholar] [CrossRef]
Chen, Y.; Xu, Y.; Jamhiri, B.; Wang, L.; Li, T. Predicting Uniaxial Tensile Strength of Expansive Soil with Ensemble Learning Methods. Comput. Geotech. 2022, 150, 104904. [Google Scholar] [CrossRef]
ASTM D2487-17e1; Standard Practice for Classification of Soils for Engineering Purposes (Unified Soil Classification System). ASTM International: West Conshohocken, PA, USA, 2017. [CrossRef]
AASHTO Standard Method of Test for The Classification of Soils and SoilAggregate Mixtures for Highway Construction Purposes, Test Designation M145-91, Standard Specifications for Transportation Materials and Methods of Sampling and Testing. 2002. Available online: https://store.accuristech.com/standards/aashto-m-145-91-2021?product_id=1320718 (accessed on 29 April 2024).
Casagrande, A. Plasticity Chart for the Classification of Cohesive Soils. Transacfions Am. Soc. Civ. Eng. 1948, 113, 901–930. [Google Scholar] [CrossRef]
Holtz, R.D.; Kovacs, W.D. Kansas Geotechnical Survey. Curr. Res. Earth Science. Bull. 1981, 244. [Google Scholar]
Alnmr, A.; Ray, R.; Omran Alzawi, M. A Novel Approach to Swell Mitigation: Machine-Learning-Powered Optimal Unit Weight and Stress Prediction in Expansive Soils. Appl. Sci. 2024, 14, 1411. [Google Scholar] [CrossRef]
Hasanipanah, M.; Faradonbeh, R.S.; Amnieh, H.B.; Armaghani, D.J.; Monjezi, M. Forecasting Blast-Induced Ground Vibration Developing a CART Model. Eng. Comput. 2017, 33, 307–316. [Google Scholar] [CrossRef]
Monjezi, M.; Baghestani, M.; Shirani Faradonbeh, R.; Pourghasemi Saghand, M.; Jahed Armaghani, D. Modification and Prediction of Blast-Induced Ground Vibrations Based on Both Empirical and Computational Techniques. Eng. Comput. 2016, 32, 717–728. [Google Scholar] [CrossRef]
Singh, R.; Umrao, R.K.; Ahmad, M.; Ansari, M.K.; Sharma, L.K.; Singh, T.N. Prediction of Geomechanical Parameters Using Soft Computing and Multiple Regression Approach. Measurement 2017, 99, 108–119. [Google Scholar] [CrossRef]
Tso, G.K.F.; Yau, K.K.W. Predicting Electricity Energy Consumption: A Comparison of Regression Analysis, Decision Tree and Neural Networks. Energy 2007, 32, 1761–1768. [Google Scholar] [CrossRef]
Rodriguez-Galiano, V.; Sanchez-Castillo, M.; Chica-Olmo, M.; Chica-Rivas, M. Machine Learning Predictive Models for Mineral Prospectivity: An Evaluation of Neural Networks, Random Forest, Regression Trees and Support Vector Machines. Ore Geol. Rev. 2015, 71, 804–818. [Google Scholar] [CrossRef]
Friedman, J.H. Greedy Function Approximation: A Gradient Boosting Machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
Bardhan, A.; Kardani, N.; Alzo’ubi, A.K.; Roy, B.; Samui, P.; Gandomi, A.H. Novel Integration of Extreme Learning Machine and Improved Harris Hawks Optimization with Particle Swarm Optimization-Based Mutation for Predicting Soil Consolidation Parameter. J. Rock Mech. Geotech. Eng. 2022, 14, 1588–1608. [Google Scholar] [CrossRef]
Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; ACM: New York, NY, USA, 2016. [Google Scholar]
Lai, V.; Ahmed, A.N.; Malek, M.A.; Afan, H.A.; Ibrahim, R.K.; El-Shafie, A.; El-Shafie, A. Modeling the Nonlinearity of Sea Level Oscillations in the Malaysian Coastal Areas Using Machine Learning Algorithms. Sustainability 2019, 11, 4643. [Google Scholar] [CrossRef]
Wu, J.; Liu, H.; Wei, G.; Song, T.; Zhang, C.; Zhou, H. Flash Flood Forecasting Using Support Vector Regression Model in a Small Mountainous Catchment. Water 2019, 11, 1327. [Google Scholar] [CrossRef]
Ibrahem Ahmed Osman, A.; Najah Ahmed, A.; Chow, M.F.; Feng Huang, Y.; El-Shafie, A. Extreme Gradient Boosting (Xgboost) Model to Predict the Groundwater Levels in Selangor Malaysia. Ain Shams Eng. J. 2021, 12, 1545–1556. [Google Scholar] [CrossRef]
Ali, I.; Alharbi, O.M.L.; Alothman, Z.A.; Badjah, A.Y.; Alwarthan, A.; Basheer, A.A. Artificial Neural Network Modelling of Amido Black Dye Sorption on Iron Composite Nano Material: Kinetics and Thermodynamics Studies. J. Mol. Liq. 2018, 250, 1–8. [Google Scholar] [CrossRef]
Zhao, L.; Guo, Y.; Mohammadian, E.; Hadavimoghaddam, F.; Jafari, M.; Kheirollahi, M.; Rozhenko, A.; Liu, B. Modeling Permeability Using Advanced White-Box Machine Learning Technique: Application to a Heterogeneous Carbonate Reservoir. ACS Omega 2023, 8, 22922–22933. [Google Scholar] [CrossRef]
Bograchev, D.A.; Volfkovich, Y.M.; Sosenkin, V.E.; Podgornova, O.A.; Kosova, N.V. Prediction of Permeability Using Group Method of Data Handling (GMDH) Neural Network from Well Log Data. Energies 2020, 13, 551. [Google Scholar] [CrossRef]
Li, D.; Armaghani, D.J.; Zhou, J.; Lai, S.H.; Hasanipanah, M. A GMDH Predictive Model to Predict Rock Material Strength Using Three Non-Destructive Tests. J. Nondestruct. Eval. 2020, 39, 81. [Google Scholar] [CrossRef]
Ivakhnenko, A.G. Polynomial Theory of Complex Systems. IEEE Trans. Syst. Man Cybern. 1971, 1, 364–378. [Google Scholar] [CrossRef]
Dubčáková, R. Eureqa: Software Review. Genet. Program. Evolvable Mach. 2011, 12, 173–178. [Google Scholar] [CrossRef]
Lizama, E.; Morales, B.; Somos-Valenzuela, M.; Chen, N.; Liu, M. Understanding Landslide Susceptibility in Northern Chilean Patagonia: A Basin-Scale Study Using Machine Learning and Field Data. Remote Sensing 2022, 14, 907. [Google Scholar] [CrossRef]
Zeini, H.A.; Al-Jeznawi, D.; Imran, H.; Bernardo, L.F.A.; Al-Khafaji, Z.; Ostrowski, K.A.; Kazmi, S.; Zeini, H.A.; Al-Jeznawi, D.; Imran, H.; et al. Random Forest Algorithm for the Strength Prediction of Geopolymer Stabilized Clayey Soil. Sustainability 2023, 15, 1408. [Google Scholar] [CrossRef]
Scikit-Learn Developers Scikit-Learn. Machine Learning in Python. Available online: https://scikit-learn.org/stable/modules/grid_search.html#randomized-parameter-search (accessed on 21 January 2024).
Lundberg, S.M.; Lee, S.-I. A Unified Approach to Interpreting Model Predictions. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; pp. 4768–4777. [Google Scholar]
Zhang, W.; Wu, C.; Zhong, H.; Li, Y.; Wang, L. Prediction of Undrained Shear Strength Using Extreme Gradient Boosting and Random Forest Based on Bayesian Optimization. Geosci. Front. 2021, 12, 469–477. [Google Scholar] [CrossRef]
Rosone, M.; Airò Farulla, C.; Ferrari, A. Shear Strength of a Compacted Scaly Clay in Variable Saturation Conditions. Acta Geotech. 2016, 11, 37–50. [Google Scholar] [CrossRef]
Han, Z.; Vanapalli, S.K. Stiffness and Shear Strength of Unsaturated Soils in Relation to Soil-Water Characteristic Curve. Géotechnique 2016, 66, 627–647. [Google Scholar] [CrossRef]
Yan, X.Q.; Zhou, C.Y.; Rashed Hama, A.; Tahir, T.A.; Jamal Ali, B.; Professor Nihad Bahaaldeen Salih, A. Geotechnical Characteristics Correlations for Fine-Grained Soils. IOP Conf. Ser. Mater. Sci. Eng. 2020, 737, 012099. [Google Scholar] [CrossRef]
Mohammed, A.S. Property Correlations and Statistical Variations in the Geotechnical Properties of (CH) Clay Soils. Geotech. Geol. Eng. 2018, 36, 267–281. [Google Scholar] [CrossRef]
Cabalar, A.F.; Mustafa, W.S. Behaviour of Sand–Clay Mixtures for Road Pavement Subgrade. Int. J. Pavement Eng. 2017, 18, 714–726. [Google Scholar] [CrossRef]

Figure 1. Machine learning model implementation workflow.

Figure 2. Case study location: Coordinates: 35°33′12.2″ N, 35°46′35.9″ E.

Figure 3. Variation in granular gradients across tested mixtures.

Figure 4. Effect of sand content on the Atterberg limits.

Figure 5. The maximum dry density and optimal moisture values as a function of the sand content; (A) the change in

γ_{d m a x}

as a function of Fs, (B) the change in

w_{o p t}

as a function of Fs.

Figure 5. The maximum dry density and optimal moisture values as a function of the sand content; (A) the change in

γ_{d m a x}

as a function of Fs, (B) the change in

w_{o p t}

as a function of Fs.

Figure 6. Visualization of sample preparation process: (A) sand and expansive clay mixing, (B) sample encasement in plastic bags for moisture retention, (C) mold placement and application of static pressure via hydraulic piston, (D) final prepared specimen in unconfined compression apparatus, and (E) post-failure specimen.

Figure 7. Relationship between the strain and vertical stress (G = 2.69;

e_{0}

= 0.758; Fs = 20%;

γ_{d} = γ_{d m a x} = 15.3 k N / m^{3}

; SR = SR_opt = 88.7%; LL = 63.3%; PI = 35.6%).

Figure 7. Relationship between the strain and vertical stress (G = 2.69;

e_{0}

= 0.758; Fs = 20%;

γ_{d} = γ_{d m a x} = 15.3 k N / m^{3}

; SR = SR_opt = 88.7%; LL = 63.3%; PI = 35.6%).

Figure 8. Density plots and distribution histograms of the dataset used in this study: (A)

γ_{d}

, (B) SR, (C)

e_{0}

, (D) Fs, (E) LL, (F)

γ_{d m a x}

, (G) G, (H) PI, (I) SR_opt, (J) q_u, (K) E_t.

Figure 8. Density plots and distribution histograms of the dataset used in this study: (A)

γ_{d}

, (B) SR, (C)

e_{0}

, (D) Fs, (E) LL, (F)

γ_{d m a x}

, (G) G, (H) PI, (I) SR_opt, (J) q_u, (K) E_t.

Figure 9. Assessing the impact of different input parameters for

q_{u}

and

E_{t}

via correlation coefficients: (A) Pearson for

q_{u}

, (B) Pearson for

E_{t}

, (C) Kendall for

q_{u}

, (D) Kendall for

E_{t}

, (E) Spearman for

q_{u}

, and (F) Spearman for

E_{t}

.

Figure 9. Assessing the impact of different input parameters for

q_{u}

and

E_{t}

via correlation coefficients: (A) Pearson for

q_{u}

, (B) Pearson for

E_{t}

, (C) Kendall for

q_{u}

, (D) Kendall for

E_{t}

, (E) Spearman for

q_{u}

, and (F) Spearman for

E_{t}

.

Figure 10. Illustration of a graphical representation of the neural network models: (A) conventional ANN and (B) proposed ANN-GMDH.

Figure 11. Flowchart of symbolic regression process.

Figure 12. The change in

q_{u}

and

E_{t}

with

γ_{d}

according to Fs; (A) the change in

q_{u}

with

γ_{d}

according to Fs; (B) the change in

E_{t}

with

γ_{d}

according to Fs.

Figure 12. The change in

q_{u}

and

E_{t}

with

γ_{d}

according to Fs; (A) the change in

q_{u}

with

γ_{d}

according to Fs; (B) the change in

E_{t}

with

γ_{d}

according to Fs.

Figure 13. The change in

q_{u}

and

E_{t}

with SR according to Fs; (A) the changes in

q_{u}

with SR according to Fs; (B) the changes in

E_{t}

with SR according to Fs.

Figure 13. The change in

q_{u}

and

E_{t}

with SR according to Fs; (A) the changes in

q_{u}

with SR according to Fs; (B) the changes in

E_{t}

with SR according to Fs.

Figure 14. The change in

q_{u}

and

E_{t}

with Fs according to SR: (A) change in

q_{u}

with Fs according to SR’ (B) change in

E_{t}

with Fs according to SR.

Figure 14. The change in

q_{u}

and

E_{t}

with Fs according to SR: (A) change in

q_{u}

with Fs according to SR’ (B) change in

E_{t}

with Fs according to SR.

Figure 15. Unconfined compression tests on mixtures corresponding to the optimal moisture and maximum dry density according to standard Proctor’s tests: (A) the change in

q_{u}

as a function of Fs, (B) the change in

E_{t}

as a function of Fs.

Figure 15. Unconfined compression tests on mixtures corresponding to the optimal moisture and maximum dry density according to standard Proctor’s tests: (A) the change in

q_{u}

as a function of Fs, (B) the change in

E_{t}

as a function of Fs.

Figure 16. The cross plots for actual and predicted

\frac{E t}{γ_{d}}

targets of training, validation, test, and combined data for ANN (

E_{t}

) model and ANN-GMDH (

E_{t}

) model: (A) ANN (

E_{t}

) model for training data, (B) ANN (

E_{t}

) model for validation data, (C) ANN (

E_{t}

) model for test data, (D) ANN (

E_{t}

) model for combined data, (E) ANN-GMDH (

E_{t}

) model for training data, (F) ANN-GMDH (

E_{t}

) model for validation data, (G) ANN-GMDH (

E_{t}

) model for test data, and (H) ANN-GMDH (

E_{t}

) model for combined data.

Figure 16. The cross plots for actual and predicted

\frac{E t}{γ_{d}}

targets of training, validation, test, and combined data for ANN (

E_{t}

) model and ANN-GMDH (

E_{t}

) model: (A) ANN (

E_{t}

) model for training data, (B) ANN (

E_{t}

) model for validation data, (C) ANN (

E_{t}

) model for test data, (D) ANN (

E_{t}

) model for combined data, (E) ANN-GMDH (

E_{t}

) model for training data, (F) ANN-GMDH (

E_{t}

) model for validation data, (G) ANN-GMDH (

E_{t}

) model for test data, and (H) ANN-GMDH (

E_{t}

) model for combined data.

Figure 17. Comparison of true and predicted

q_{u}

targets in training, validation, test, and combined data for ANN (

q_{u}

) model and ANN-GMDH (

q_{u}

) model: (A) ANN (

q_{u}

) model for training data, (B) ANN (

q_{u}

) model for validation data, (C) ANN (

q_{u}

) model for test data, (D) ANN (

q_{u}

) model for combined data, (E) ANN-GMDH (

q_{u}

) model for training data, (F) ANN-GMDH (

q_{u}

) model for validation data, (G) ANN-GMDH (

q_{u}

) model for test data, and (H) ANN-GMDH (

q_{u}

) model for combined data.

Figure 17. Comparison of true and predicted

q_{u}

targets in training, validation, test, and combined data for ANN (

q_{u}

) model and ANN-GMDH (

q_{u}

) model: (A) ANN (

q_{u}

) model for training data, (B) ANN (

q_{u}

) model for validation data, (C) ANN (

q_{u}

) model for test data, (D) ANN (

q_{u}

) model for combined data, (E) ANN-GMDH (

q_{u}

) model for training data, (F) ANN-GMDH (

q_{u}

) model for validation data, (G) ANN-GMDH (

q_{u}

) model for test data, and (H) ANN-GMDH (

q_{u}

) model for combined data.

Figure 18. Illustration of user-friendly interface: (A) ANN

q_{u}

model, (B) ANN

E_{t}

model.

Figure 18. Illustration of user-friendly interface: (A) ANN

q_{u}

model, (B) ANN

E_{t}

model.

Figure 19. The cross plots of actual and predicted targets for (A) Equation (6), (B) Equation (7), and (C) Equation (8).

Figure 20. The cross plots of actual and predicted targets for (A) Equation (9), (B) Equation (10), and (C) Equation (11).

Figure 21. RMSE and MAE metrics ANN (

E_{t}

) vs. symbolic regression

E_{t}

equations.

Figure 21. RMSE and MAE metrics ANN (

E_{t}

) vs. symbolic regression

E_{t}

equations.

Figure 22. Comparison of true and predicted targets for Symbolic regression and ANN models: (A)

E_{t}

/

γ_{d}

models, (B)

q_{u}

models.

Figure 22. Comparison of true and predicted targets for Symbolic regression and ANN models: (A)

E_{t}

/

γ_{d}

models, (B)

q_{u}

models.

Figure 23. Relative importance of features with SHAP values in XGBoost model for (A)

E_{t}

and (B)

q_{u}

.

Figure 23. Relative importance of features with SHAP values in XGBoost model for (A)

E_{t}

and (B)

q_{u}

.

Figure 24. Relative importance of features with SHAP values in RFR model for (A)

E_{t}

and (B)

q_{u}

.

Figure 24. Relative importance of features with SHAP values in RFR model for (A)

E_{t}

and (B)

q_{u}

.

Table 1. Overview of machine learning models used in the geotechnical sector.

Reference	Model	Description
Díaz 2023 [55]	GBR, RFR, DTR, ET, Linear R, LASSO, EN, and others	The current study predicts the UCS of jet grouting columns using supervised machine learning algorithms. The study concludes that DTR, GBR, RFR, and extremely randomized trees (ET) are the top algorithms in terms of performance.
Kumar 2019 [40]	MARS and ELM	The study focused on predicting pile-bearing capacity in sandy soil using machine learning algorithms (ELM and MARS). In terms of performance measures, the MARS model outperformed the ELM model significantly.
Kardani 2022 [43]	GBR, DTR, KNR, RFR, ST-ENSM, BG-ENSM, and others	The study evaluated machine learning models for estimating the resilient modulus (M_R) of compacted subgrade soils. Bagging ensemble (BG-ENSM) ranked first, while RFR ranked last. Stacking ensemble (ST-ENSM) and DTR also ranked fairly well.
Wu 2023 [56]	RFR, SVR, and ANN	In this study, SVR, RFR, and ANN models were trained on coral reef limestone (CRL) samples using X-ray CT and lab data. On test data, the ANN model performed admirably, predicting UCS with 92% accuracy.
Zhang 2022 [45]	XGBoost and GBR	The study presented an effective use of XGBoost and GBR in time-varying reliability analysis. The probabilities of a landslide failing at particular times were computed by these models effectively and precisely. While both models accurately predicted outcomes, XGBoost exhibited marginally superior performance.
Chang 2023 [57]	RFR and ANN	The multi-scale segmentation method was used in this study to identify slope units in Chongyi County, China. The various characteristics of these units were used to construct various slope-machine learning models. In terms of predictive accuracy, RF models outperform ANN models.
Wang 2021 [36]	LR, SVM, RFR, GBR, and ANN	This study provided a new machine learning method for identifying natural-terrain landslides. It revealed that ANN excelled because of its superior feature extraction and data processing capabilities. Boosting algorithms rank second in accuracy after ANN, followed by RFR, LR, and SVM.

Table 2. Overview of machine learning models used in the literature to predict expansive soil behavior.

Reference	Model	Description
Najjar 1996; Najjar 1998 [68,69]	ANN	These papers focus on the development of statistical models and neural network-based models to predict swelling potential. It has been demonstrated that neural models outperform statistical models in terms of prediction accuracy.
Doris 2008 [70]	ANN	This study employs artificial neural networks (ANNs) to forecast soil surface movements caused by changes in moisture caused by climate variations. Based on climatic changes, ANN predictions match field measurements of soil movements caused by moisture changes as well as a physics-based method based on soil water content measurements.
Merouane 2018; Ikizler 2010; Ashayeri 2009; Erzin 2011 [50,66,71,72]	ANN	ANNs were used to predict the swelling pressure and amplitude of unsaturated clay
Dutta 2019 [51]	ANN	ANNs were used to predict the free swell index (FSI) of the expansive soil. The ANN model predicted the FSI more accurately than the multiple linear regression model.
Ikeagwuani 2021 [73]	MARS, RFR, and GBR	Three distinct machine learning models were employed in this study to predict the California Bearing Ratio (CBR) of an expansive soil subgrade treated with regular Portland cement, quarry dust, and sawdust ash. The findings of the study revealed that the random forest model surpassed both the gradient boosting machine and MARS models in terms of predictive accuracy.
Eyo 2022 [74]	ANN, SVR, LR, RFR, and others	This work used a broad range of machine learning techniques to evaluate and estimate the swelling of soils with different plastic characteristics under expanding behavior during inundation. According to the findings, the support vector machine (SVR) outperformed other techniques, including random decision forests (RFR), logistic regressors (LRs), and artificial neural networks (ANNs).
Benbouras 2021 [75]	SVR, RFR, ANN, DNN, LASSO, ELM, and others	This study used many advanced algorithms to predict the swelling index of expansive soil. The results show that the random forest method yielded more effective and accurate results than the other methods in modeling geotechnical phenomena.
Chen 2022 [76]	RFR, XGBoost, SG, ANN, SVR, and MARS	This work developed a prediction model of the uniaxial tensile strength using three ensemble machine learning approaches and three classical methods. Based on all metrics, the results indicate that the stacked generalization (SG) and XGBoost perform best, with the ANN, RF, MARS, and SVM following closely after.

Table 3. Chemical composition of the expansive clay.

Chemical Composition	Alumina (Al₂O₃)	Silica (SiO₂)	Magnesium (MgO)	Ferric (Fe₂O₃)	Potassium (K₂O)	Calcium (CaO)	(Sodium (Na₂O)	Loss of Ignition (LoI)
%	11.51	49.79	2.41	5.49	0.37	12	1.2	17.23

Table 4. The statistical attributes of the selected variables.

	LL	Fs	$e_{0}$	SR	$γ_{d}$	$E_{t}$	$q_{u}$
count	225	225	225	225	225	225	225
mean	59.7	25.3	0.73	76.2	15.66	42,267	517.1
std	13	17.06	0.14	15.68	1.228	33,966	393.7
min	41.1	0	0.45	50	13.3	950	17.79
25%	48.6	10	0.63	60	14.66	14,717	238.3
50%	55.8	30	0.74	74.25	15.3	34,418	416.97
75%	72.1	40	0.858	90	16.6	61,706	665.11
max	78.8	50	1.032	100	18.451	159,238	2277.9

Table 5. Optimal hyperparameters for models predicting ‘

q_{u}

’.

Table 5. Optimal hyperparameters for models predicting ‘

q_{u}

’.

Model	Hyperparameter	Value
DTR	max_depth	10
	min_samples_leaf	1
	min_samples_split	1
RFR	max_depth	10
	min_samples_leaf	1
	min_samples_split	1
	n_estimators	130
GBR	subsample	0.8
	n_estimators	100
	min_samples_split	12
	min_samples_leaf	2
	max_depth	3
	learning_rate	0.6
XGBoost	colsample_bytree	0.8
	learning_rate	0.4
	max_depth	3
	n_estimators	200
	subsample	0.6
SVR	C	100,000
	epsilon	10
	kernel	rbf
ANN	number of neurons in layer1	10
	number of neurons in layer2	5
	learning_rate	0.05
	batch size	10
	hidden layers function	ReLU
	linkage between the hidden layer and the ultimate output layer function	ReLU
ANN-GMDH	number of neurons in layer1	20
	number of neurons in layer2	30
	learning_rate	0.09
	batch size	14
	hidden layers function	ReLU
	linkage between the hidden layer and the ultimate output layer function	ReLU

Table 6. Optimal hyperparameters for models predicting ‘

E_{t}

’.

Table 6. Optimal hyperparameters for models predicting ‘

E_{t}

’.

Model	Hyperparameter	Value
DTR	max_depth	10
	min_samples_leaf	1
	min_samples_split	1
RFR	max_depth	12
	min_samples_leaf	1
	min_samples_split	1
	n_estimators	70
GBR	subsample	0.8
	n_estimators	100
	min_samples_split	12
	min_samples_leaf	2
	max_depth	3
	learning_rate	0.6
XGBoost	colsample_bytree	0.6
	learning_rate	0.2
	max_depth	3
	n_estimators	200
	subsample	0.6
SVR	C	100,000
	epsilon	0.1
	kernel	rbf
ANN	number of neurons in layer1	18
	number of neurons in layer2	18
	learning_rate	0.07
	batch size	8
	hidden layers function	ReLU
	linkage between the hidden layer and the ultimate output layer function	ReLU
ANN-GMDH	number of neurons in layer1	20
	number of neurons in layer2	16
	learning_rate	0.085
	batch size	41
	hidden layers function	ReLU
	linkage between the hidden layer and the ultimate output layer function	ReLU

Table 7. Equation and metrics of the explicit models.

Model	Equation	Metrics	Training	Testing
Linear regression	$q_{u} = 20.43 L L - 0.19 F s + 2282.73 e_{0} - 14.69 S R + 581.26 γ_{d} - 10337.54$	RMSE	156.58	152.37
		MAE	113.37	111.70
		R²	0.8502	0.8302
	$E_{t} = 102277.05 e_{0} - 3650.79 L L + 33535.90 γ_{d} - 3766.92 F s - 1467.53 S R - 129466.22$	RMSE	9750.54	10,376.38
		MAE	7673.28	8035.28
		R²	0.9196	0.8960
Lasso	$q_{u} = 21.89 L L - 0.08 F s + 152.78 e_{0} - 14.67 S R + 352.01 γ_{d} - 5290.28$	RMSE	159.24	151.95
		MAE	115.22	116.70
		R²	0.8450	0.8312
	$E_{t} = 59441.79 e_{0} - 2414.77 L L + 28635.07 γ_{d} - 2835.58 F s - 1466.10 S R - 119044.50$	RMSE	9777.38	10,362.64
		MAE	7667.83	7998.31
		R²	0.9192	0.8963
Ridge	$q_{u} = 20.69 * L L - 0.61 F s + 973.61 e_{0} - 14.68 S R + 440.47 γ_{d} - 7186.86$	RMSE	157.59	151.61
		MAE	114.34	114.57
		R²	0.8482	0.8319
	$E_{t} = - 881.03 e_{0} - 3028.96 L L + 21526.18 γ_{d} - 3307.39 F s - 1463.22 S R + 84560.49$	RMSE	9850.17	10,303.14
		MAE	7680.33	8002.29
		R²	0.9180	0.8974
GMDH	$q_{u} = 883490.10 - 7004.26 L L - 5180.73 F s - 346890.29 e_{0} + 412.66 S R - 48116.59 γ_{d} + 31.89 {L L}^{2} + 47.51 L L F s + 701.47 L L e_{0} - 1.10 L L S R + 99.11 L L γ_{d} + 17.92 {F s}^{2} + 538.04 F s e_{0} - 0.51 F s S R + 65.26 F s γ_{d} + 61461.40 e_{0}^{2} - 101.58 e_{0} S R + 6983.98 e_{0} γ_{d} + 0.20 {S R}^{2} - 19.49 S R γ_{d} + 836.99 γ_{d}^{2}$	RMSE	49.52	56.56
		MAE	34.68	38.45
		R²	0.9850	0.9766
	$E_{t} = 46991712.22 - 11093891.97 e_{0} - 1106276.20 L L - 1064926.86 γ_{d} - 839473.26 F s - 3565.98 S R + 2067379.01 e_{0}^{2} - 10309.94 e_{0} L L + 1069107.43 e_{0} γ_{d} - 16039.98 e_{0} F s - 1160.06 e_{0} S R + 7192.25 {L L}^{2} - 2621.05 L L γ_{d} + 11280.37 L L F s + 94.18 L L S R + 45439.18 γ_{d}^{2} - 3445.81 γ_{d} F s - 461.22 γ_{d} S R + 4426.94 {F s}^{2} + 79.94 F s S R + 17.17 {S R}^{2}$	RMSE	2544.52	2889.17
		MAE	1932.47	2248.35
		R²	0.9945	0.9919

Table 8. The study’s metrics for the various models using the Train/Validation/Test Split cross-validation technique.

Model	Metrics	$E_{t}$				$q_{u}$
Model	Metrics	Training	Validation	Testing	r_d (%)	Training	Validation	Testing	r_d (%)
DTR	R²	0.9975	0.8243	0.8372	16.07	0.99971	0.8132	0.9065	9.32
	RMSE	102.41	808.88	787.88	669.34	6.88	140.64	113.09	1543.75
	MAE	22.73	634.1	622.84	2640.17	1.4963	91.589	75.847	4968.97
RFR	R²	0.9857	0.9088	0.898	8.90	0.982	0.83	0.883	10.08
	RMSE	244.33	582.83	623.55	155.21	54.21	134.144	126.724	133.76
	MAE	184.3	483.757	484.45	162.86	36.84	89.92	77.85	111.32
GBR	R²	0.9991	0.985	0.9764	2.27	0.99886	0.9133	0.9495	4.94
	RMSE	59.39	236.44	300	405.14	13.637	95.806	83.11	509.44
	MAE	48	164.36	224.48	367.67	10.462	55.105	49.745	375.48
XGBoost	R²	0.99824	0.988	0.9853	1.30	0.99902	0.9568	0.9557	4.34
	RMSE	85.81	211.14	237.2	176.42	12.664	67.644	77.818	514.48
	MAE	66.1	148.44	169.82	156.91	9.155	44.433	51.099	458.15
SVR	R²	0.996	0.9947	0.9909	0.51	0.9972	0.99655	0.98873	0.85
	RMSE	131.02	140.61	186.27	42.17	21.5785	19.124	39.265	81.96
	MAE	62.83	110.55	128.37	104.31	10.99	13.27	22.374	103.59
ANN	R²	0.9951	0.9955	0.9924	0.27	0.995	0.9963	0.9944	0.06
	RMSE	145.4	129.15	170.03	16.94	28.54	19.8	27.63	3.19
	MAE	109.116	108.792	115.408	5.77	19.471	15.83	18.9	2.93
ANN-GMDH	R²	0.9963	0.9956	0.9935	0.28	0.996	0.9968	0.9949	0.11
	RMSE	125.67	128.24	157.31	25.18	25.54	18.43	26.37	3.25
	MAE	95.66	96.658	120.86	26.34	17.73	14.24	18.45	4.06

Table 9. The study’s metrics for the various models using (k-fold) cross-validation technique.

Model	Metrics	$E_{t}$			$q_{u}$
Model	Metrics	Training	Testing	r_d (%)	Training	Testing	r_d (%)
DTR	R²	0.99492	0.8766	11.89	0.9835	0.9042	8.06
	RMSE	145.52	681.11	368.05	51.083	114.465	124.08
	MAE	66.272	546.266	724.28	27.73	80.778	191.30
RFR	R²	0.9854	0.9388	4.73	0.9828	0.9007	8.35
	RMSE	241.62	521.71	115.92	52.17	116.51	123.33
	MAE	189.75	408.33	115.19	33.3	76.91	130.96
GBR	R²	0.999694	0.9861	1.36	0.99755	0.9591	3.85
	RMSE	35.72	228.93	540.90	19.694	74.827	279.95
	MAE	29.35	167.56	470.90	14.83	48.52	227.17
XGBoost	R²	0.99803	0.9906	0.74	0.99995	0.9652	3.48
	RMSE	90.51	187.53	107.19	2.7669	69.011	2394.16
	MAE	69.5	148.97	114.35	2.08	40.92	1867.31
SVR	R²	0.99394	0.99323	0.07	0.9958	0.98926	0.66
	RMSE	158.865	159.557	0.44	26.203	35.051	33.77
	MAE	97.77	121.494	24.27	13.409	23.053	71.92
ANN	R²	0.99487	0.9901	0.48	0.9914	0.9903	0.11
	RMSE	147.19	187.71	27.53	36.971	36.0973	2.36
	MAE	107.69	145.744	35.34	23.58	24.35	3.27

Table 10. Analysis of the importance of variables.

Model	DTR		RFR		GBR		XGBoost		Correlation According to SHAP	Correlation According to Previous Research
Model	$q_{u}$	$E_{t}$	$q_{u}$	$E_{t}$	$q_{u}$	$E_{t}$	$q_{u}$	$E_{t}$	Correlation According to SHAP	Correlation According to Previous Research
SR	0.406	0.677	0.429	0.67	0.394	0.662	0.291	0.577	Negative	Negative [104,105]
$e_{0}$	0.163	0.127	0.263	0.147	0.325	0.162	0.332	0.122	Negative	Negative [106]
$γ_{d}$	0.294	0.125	0.204	0.11	0.154	0.101	0.097	0.135	Positive	Positive [106,107]
LL	0.089	0.052	0.052	0.033	0.039	0.03	0.213	0.118	Positive	Positive [106,107]
Fs	0.048	0.019	0.052	0.04	0.088	0.045	0.067	0.048	Negative	Negative [108]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Alnmr, A.; Hosamo, H.H.; Lyu, C.; Ray, R.P.; Alzawi, M.O. Novel Insights in Soil Mechanics: Integrating Experimental Investigation with Machine Learning for Unconfined Compression Parameter Prediction of Expansive Soil. Appl. Sci. 2024, 14, 4819. https://doi.org/10.3390/app14114819

AMA Style

Alnmr A, Hosamo HH, Lyu C, Ray RP, Alzawi MO. Novel Insights in Soil Mechanics: Integrating Experimental Investigation with Machine Learning for Unconfined Compression Parameter Prediction of Expansive Soil. Applied Sciences. 2024; 14(11):4819. https://doi.org/10.3390/app14114819

Chicago/Turabian Style

Alnmr, Ammar, Haidar Hosamo Hosamo, Chuangxin Lyu, Richard Paul Ray, and Mounzer Omran Alzawi. 2024. "Novel Insights in Soil Mechanics: Integrating Experimental Investigation with Machine Learning for Unconfined Compression Parameter Prediction of Expansive Soil" Applied Sciences 14, no. 11: 4819. https://doi.org/10.3390/app14114819

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Novel Insights in Soil Mechanics: Integrating Experimental Investigation with Machine Learning for Unconfined Compression Parameter Prediction of Expansive Soil

Abstract

1. Introduction

1.1. Sand as an Additive Material and Its Impact on Soil Parameters

1.2. Artificial Intelligence as a Predictive Tool in Geotechnical Engineering

1.3. Research Scope

2. Materials and Methods

2.1. Methodology Overview

2.2. Laboratory Tests

2.3. Selection of Variables

2.4. Machine Learning Algorithms Used in the Study

2.4.1. Decision Tree Regression

2.4.2. Random Forest Regression

2.4.3. Gradient Boosting Regression

2.4.4. Extreme Gradient Boosting

2.4.5. Support Vector Regression

2.4.6. Artificial Neural Networks

2.4.7. Symbolic Regression

2.5. Hyperparameters Optimization

3. Results and Discussion

3.1. Effect of Initial Dry Unit Weight

3.2. Effect of Initial Degree of Saturation

3.3. Effect of Sand Content

3.4. Machine Learning to Predict q u and E t

3.4.1. Comparison of Machine Learning Models

ANN Models to Predict Elasticity Modulus ( E t )

ANN Model to Predict Unconfined Compressive Strength ( q u )

3.4.2. Streamlined Interface for ANN Model Predictions

3.4.3. Symbolic Regression Equations

Proposed Equations of Elasticity Modulus ( E t )

Proposed Equations of Unconfined Compression Strength q u

4. Feature Importance

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3.4. Machine Learning to Predict $q_{u}$ and $E_{t}$

ANN Models to Predict Elasticity Modulus ( $E_{t}$ )

ANN Model to Predict Unconfined Compressive Strength ( $q_{u}$ )

Proposed Equations of Elasticity Modulus ( $E_{t}$ )

Proposed Equations of Unconfined Compression Strength $q_{u}$