1. Introduction
The release of greenhouse gases into the environment has led to the melting of glacier reservoirs, posing significant threats globally [
1]. The concrete sector stands out as a major contributor to greenhouse gas emissions, accounting for up to 50% of global emissions [
2,
3]. Notably, Portland cement (PC), a key component of concrete, plays a substantial role in GHG emissions [
4]. PC production contributes approximately 7% of atmospheric emissions, with the calcination of calcium oxide (CaO) during the manufacturing process accounting for 50% of CO
2 emissions [
5]. Current production levels of PC stand at 4000 million tons annually, projected to reach around 6000 million tons by 2060 [
6]. These figures underscore the need for alternative measures to satisfy the growing demand for concrete while minimizing resource use and CO
2 emissions [
7]. On the other hand, with the development of cementing technology, concrete is no longer only used in the construction of infrastructure. Because of its greater rigidity and stability, it has become an important carrier of artistic creation. Due to the physical characteristics of concrete, art sculptures (as shown in
Figure 1) created not only have a stable structure but can also effectively reduce the environmental damage to the sculpture, such as rain erosion and wind damage. However, due to the different shapes of sculptures, their stress structure is changeable and special compared with the fixed structure of a house. The lack of flexibility of traditional concrete may lead to the material being unable to meet the needs of sculptures [
8]. Therefore, adding geopolymer to concrete can not only effectively improve the flexibility of concrete, but also effectively reduce the large amount of carbon dioxide generated by cement production, which is in line with the national strategy of green development [
9].
The incorporation of recycled and waste materials into concrete emerges as a scientific and pragmatic solution to address its high demand [
10]. This approach not only meets the increasing need for concrete but also mitigates environmental risks [
11]. In this context, fly ash (FA) and ground granulated blast furnace slag (GGBS), recognized natural pozzolanic materials, offer effective supplementary cementitious options in the construction sector [
12,
13,
14]. Utilizing these materials, along with alkaline solvents such as sodium silicate (Na
2SiO
3) and sodium hydroxide (NaOH), results in the production of environmentally friendly concrete, such as geopolymer concrete [
15,
16,
17,
18]. Geopolymer concretes, in their amorphous gel form, exhibit exceptional characteristics, including resistance to sulfate attack, acid resistance, enhanced durability, fire resistance, and significantly greater compressive strength than conventional concrete [
19,
20,
21,
22,
23]. Their use in construction can substantially reduce CO
2 emissions into the atmosphere [
24,
25]. A visual representation of the distinction between ordinary Portland cement and geopolymer concretes is provided in
Figure 2.
Studies indicate that the chemical and physical properties of the matrix play a crucial role in determining the strength of geopolymer concretes [
27,
28,
29]. Parameters such as fly-ash-to-NaOH ratio, Na
2SiO
3-to-NaOH ratio, workability, fly-ash-to-sand ratio, molarity, and alkaline ratio have notable effects on concrete strength. Observations by Ukritnukun et al. [
30] highlight the positive impact of blast furnace slag concentration, curing temperature, and silicate modulus. Gholizadeh-Vayghan et al. [
31] identified optimal molar ratios of Ca/Si and (Na + K)/Si, as well as an ideal volume ratio (H
2O/solid)vol, for enhancing the strength properties of geopolymer concretes. Songpiriyakij et al. [
32] determined that a Si-to-Al ratio of 15.9 resulted in geopolymer concretes with a high compressive strength of 73 MPa. Puertas et al. [
33] examined the strength and growth characteristics of fly-ash/slag-paste-hydration products, reporting mechanical properties exceeding 50 MPa for a mix with a fly-ash/slag ratio of 1.0, cured at 25 °C and stimulated with a 10 M NaOH solution after 28 days.
In the production of geopolymer concretes, pozzolanic materials with binding properties undergo polymerization at elevated temperatures in an alkaline medium, resulting in the formation of a crystalline and amorphous compound that imparts the desired mechanical properties [
34]. However, the substantial heat curing required for geopolymerization compound production poses challenges for in-field applications. The elevated heat demand associated with curing may hinder the widespread use of fly ash (FA)-based geopolymer concretes in construction [
35,
36,
37]. Mitigating this heat demand can be achieved by incorporating a slag blend rich in calcium, silica, and alumina. Combining ground granulated blast furnace slag (GGBS) with FA yields a dense microstructure with enhanced hydration and polymerization products, significantly boosting the early-age strength of geopolymer concretes. A study by Yazdi et al. [
38] investigated the impact of varying FA dosage with GGBS, demonstrating a substantial increase in compressive and flexural strength by 100 MPa and 10 MPa, respectively, when FA was replaced with GGBS. Additionally, Fang et al. [
35] explored the effect of varying slag content on the flexural and split tensile strength of FA-based geopolymer concretes, revealing increased strength due to the formation of C-A-S-H gel and N-A-S-H, expediting the reaction process of geopolymer concretes.
Noteworthy, thermodynamic modeling enables a precise prediction of the potential reaction products in geopolymeric systems, providing valuable insights into the material’s composition and properties. By understanding the thermodynamic landscape, researchers can guide the design of geopolymers by selecting appropriate precursors and controlling the synthesis conditions. Notably, thermodynamic modeling facilitates the optimization of precursor mixtures to achieve the desired properties in the final geopolymeric material.
The assessment of compressive strength in concrete traditionally involves physical tests on cubical and cylindrical specimens, produced with precise mixture ratios and cured in water for approximately 28 days to obtain hydrated products [
25,
39,
40]. The compressive strength is subsequently determined using a compression-testing machine, a method employed both in fieldwork and laboratory settings, albeit known for its inefficiency and time consumption [
41,
42]. Empirical regression methodologies are emerging as preferable alternatives for estimating concrete strength, deviating from standard experimental procedures. The literature underscores the significant influence of chemical composition and physical proportions on geopolymer concretes [
43,
44], creating heterogeneity in their production due to various involved parameters [
45,
46,
47,
48]. While statistical approaches offer methods for evaluating the compressive nature of geopolymer concretes, the intricate relationship between factors and mechanical strength remains not fully understood [
49,
50,
51,
52]. As a response, machine learning (ML) approaches are gaining prominence in predicting compressive strength, leveraging recent advancements in artificial intelligence algorithms [
53,
54,
55]. Advanced prediction algorithms developed through ML techniques serve various purposes, involving regressions, classifications, and data clustering [
56,
57,
58,
59,
60]. The application of ML regression functions extends to estimating the compressive loading capacity of concretes, demonstrating higher precision compared to traditional regression methods [
61,
62,
63,
64]. The advent of artificial intelligence algorithms has empowered researchers to tackle challenging problems [
65,
66,
67,
68,
69]. Some ML algorithms used to predict various characteristics of concretes are tabulated in
Table 1 [
70,
71,
72,
73,
74].
The application of machine learning algorithms, particularly the novel Ensemble RF–GWO–XGBoost Algorithm proposed in this research, signifies a pioneering effort to improve the performance and productiveness of estimating geopolymer concretes’ compressive strength [
3,
41,
42]. RF was chosen for its ability to handle complex, non-linear relationships in the data and provide a robust performance. Its ensemble nature helps mitigate overfitting and enhances generalization on diverse datasets. Notably, GWO was selected for optimization tasks due to its inspiration from the social behavior of gray wolves, offering a balance between exploration and exploitation. Its adaptability makes it suitable for various optimization problems, including those encountered in our research. Furthermore, XGBoost was included for its strong performance in handling structured/tabular data and its efficiency in boosting weak learners. Its capability to model complex relationships and feature interactions makes it well suited for predictive modeling tasks. Hence, the combination of these algorithms was intended to leverage their individual strengths and address different aspects of the problem at hand. RF provides robustness, GWO aids in optimizing parameters, and XGBoost enhances predictive accuracy. Traditional methods often fail to capture the intricate relationships between various input parameters and the resulting mechanical properties. The Ensemble approach, combining random forest (RF), gray wolf optimizer (GWO), and XGBoost algorithms, offers a sophisticated solution by harnessing the strengths of each constituent algorithm, thereby presenting a robust and innovative predictive model [
75]. The predictive accuracy achieved through the Ensemble RF–GWO–XGBoost Algorithm not only contributes to the optimization of geopolymer concrete production but also holds the potential to streamline and advance construction practices. Accurate predictions of compressive strength are crucial for ensuring the structural stability and longevity of geopolymer concrete structures, thereby facilitating their widespread adoption in sustainable construction projects [
62,
76,
77]. Furthermore, this research contributes to the broader scientific community by showcasing the applicability of advanced machine learning techniques in the realm of construction materials. The methodology presented in this study opens avenues for further exploration and adaptation in predicting other material properties, fostering a paradigm shift toward data-driven and precision-based approaches in the construction industry. In conclusion, the significance of this research lies in its pioneering approach to anticipate the CSGePoCo through the innovative Ensemble RF–GWO–XGBoost Algorithm.
Table 1.
The used ML techniques in predicting different characteristics of concretes.
Table 1.
The used ML techniques in predicting different characteristics of concretes.
Authors | Years | Techniques | Number of Datasets |
---|
Huang et al. | 2021 | Support Vector Machine | 114 |
Sarir et al. [78] | 2019 | Gene Expression Programming | 303 |
Balf et al. [79] | 2021 | Data Envelopment Analysis | 114 |
Ahmad et al. [80] | 2021 | Gene Expression Programming, Artificial Neural Network, Decision Tree | 642 |
Azimi-Pour et al. [81] | 2020 | Support Vector Machine | - |
Saha et al. [82] | 2020 | Support Vector Machine | 115 |
Hahmansouri et al. [83] | 2020 | Gene Expression Programming | 351 |
Hahmansouri et al. [84] | 2019 | Gene Expression Programming | 54 |
Aslam et al. [85] | 2020 | Gene Expression Programming | 357 |
Farooq et al. [86] | 2020 | Random Forest and Gene Expression Programming | 357 |
Asteris and Kolovos [87] | 2019 | Artificial Neural Network | 205 |
Selvaraj and Sivaraman [88] | 2019 | Support Vector Machine–Random Forest, Response Surface Method | 114 |
Zhang et al. [89] | 2019 | Random Forest | 131 |
Kaveh et al. [90] | 2018 | M5, Multivariate Adaptive Regression Splines | 114 |
Sathyan et al. [91] | 2018 | Random Kitchen Sink Algorithm | 40 |
Vakhshouri and Nejadi [92] | 2018 | Adaptive Neuro Fuzzy Inference System | 55 |
Belalia Douma et al. [93] | 2017 | Artificial Neural Network | 114 |
Abu Yaman et al. [94] | 2017 | Artificial Neural Network | 69 |
Ahmad et al. [95] | 2021 | Gene Expression Programming, Decision Tree and Bagging | 270 |
Farooq et al. [96] | 2021 | Artificial Neural Network, bagging and boosting | 1030 |
Javad et al. [51] | 2020 | Gene Expression Programming | 277 |
Nematzadeh et al. [97] | 2020 | Response Surface Method, Gene Expression Programming | 108 |
Zhou et al. [60] | 2023 | Decision Tree, Random Forest, Extreme Gradient Boosting | 259 |
3. Data Preparation
The data for this study were collected from the available literature and encompass 15 parameters [
114,
115,
116], as shown in
Table 2. These parameters are FA (kg/m
3), SiO
2-Fly (% weight), Al
2O
3-Fly (% weight), CaO-Fly (% weight), GGBS (kg/m
3), SiO
2-GGBS (% weight), Al
2O
3-GGBS (% weight), CaO-GGBS (% weight), Fine (kg/m
3), Coarse (kg/m
3), NaOH (kg/m
3), Na
2SiO
3 (kg/m
3), SP dosage (kg/m
3), NaOH, and CT (°C), and the output parameter of CSGePoCo (MPa), as depicted in
Figure 7. Each selected parameter was chosen based on its considerable effect on the strength properties of fly ash-slag-based concretes [
117,
118,
119,
120]. Furthermore, the selection of input parameters is conducted based on the suggestions of the literature [
121,
122,
123,
124]. Fly ash is a byproduct of coal combustion and is commonly used as a supplementary cementitious material in concrete [
63,
64]. Its composition, including SiO
2 (silica), Al
2O
3 (alumina), and CaO (calcium oxide), can influence concrete properties such as strength, durability, and workability. Furthermore, GGBS is a byproduct of the iron-making process and is used as a supplementary cementitious material. Its composition, particularly SiO
2, Al
2O
3, and CaO, is crucial in determining its pozzolanic reactivity and impact on concrete performance.
Aggregates contribute to the mechanical properties of concrete. Fine aggregates are typically sand, while coarse aggregates are larger particles like gravel or crushed stone. The size, shape, and gradation of aggregates affect the workability, strength, and durability of concrete [
125,
126].
Alkaline activators are used in alkali-activated materials to enhance the reactivity of supplementary cementitious materials. NaOH (sodium hydroxide) and Na
2SiO
3 (sodium silicate) are commonly used activators. Their concentrations and ratios can significantly impact the performance of alkali-activated materials [
127,
128,
129]. Superplasticizers are additives that improve the workability of concrete without sacrificing strength. The dosage of superplasticizer affects the fluidity and slump of the concrete mix. Sodium hydroxide (NaOH) molarity refers to the concentration of the alkaline activator. The molarity can influence the rate of chemical reactions and the strength development of alkali-activated materials. Curing temperature is the temperature at which concrete is maintained during the curing period. Curing at different temperatures can impact the development of strength and durability of concrete. The consideration and manipulation, rationale, effect on properties, and collective effects of these parameters are presented in
Table 3.
Table 2.
The choice of parameters for predicting fly-ash-slag-based concrete by thoroughly examining the literature and utilizing published data.
Table 2.
The choice of parameters for predicting fly-ash-slag-based concrete by thoroughly examining the literature and utilizing published data.
FA | SiO2-Fly | Al2O3-Fly | CaO-Fly | GGBS | SiO2-GGBS | Al2O3-GGBS | CaO-GGBS | Fine | Coarse | NaOH | Na2SiO3 | SP Dosage | NaOH Molarity | CT | CSGePoCo | References |
---|
252 | 60.11 | 26.53 | 4 | 108 | 34.06 | 20 | 32.6 | 774 | 1090.8 | 46.3 | 115.7 | 144 | 8 | 30 | 33.83 | [124] |
252 | 60.11 | 26.53 | 4 | 108 | 34.06 | 20 | 32.6 | 774 | 1090.8 | 61.7 | 154.3 | 144 | 8 | 30 | 25.71 |
… | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … |
225 | 60.11 | 26.53 | 4 | 225 | 34.06 | 20 | 32.6 | 760.5 | 972 | 64.3 | 160.7 | 180 | 8 | 60 | 61.9 |
225 | 60.11 | 26.53 | 4 | 225 | 34.06 | 20 | 32.6 | 760.5 | 972 | 77.1 | 192.9 | 180 | 8 | 60 | 53.79 |
293 | 46 | 33 | 2.6 | 88 | 21 | 17 | 56.1 | 760 | 1005 | 143.3 | 71.7 | 0 | 6 | 20 | 37.4 | [123] |
… | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … |
293 | 46 | 33 | 2.6 | 88 | 21 | 17 | 56.1 | 760 | 1005 | 107.5 | 107.5 | 0 | 6 | 20 | 26.6 |
253 | 46 | 33 | 2.60 | 126 | 21 | 17 | 56.1 | 760 | 1005 | 143.3 | 71.7 | 0 | 4 | 20 | 28 |
237 | 63.53 | 27.4 | 1.26 | 158 | 34.26 | 11.32 | 38.34 | 547 | 1277 | 52 | 129 | 7.9 | 8 | 32 | 28.36 | [122] |
… | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … |
360 | 53.71 | 27.2 | 1.90 | 40 | 29.96 | 12.25 | 45.45 | 655.9 | 1218.1 | 40 | 100 | 6 | 14 | 23 | 33.3 |
200 | 52.6 | 34.1 | 1.32 | 200 | 33.8 | 14.8 | 38.8 | 716 | 1074 | 9 | 56 | 0 | 0 | 0 | 31.7 | [121] |
… | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … |
200 | 52.6 | 34.1 | 1.32 | 200 | 33.8 | 14.8 | 38.8 | 700 | 1050 | 15 | 93 | 0 | 0 | 0 | 42.3 |
360 | 53.71 | 27.2 | 1.9 | 40 | 29.96 | 12.25 | 45.45 | 651 | 1209 | 45.7 | 114.3 | 0 | 14 | 22 | 40 | [120] |
… | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … |
340 | 53.24 | 26.42 | 3.65 | 60 | 36.77 | 13.56 | 37.6 | 646 | 1200 | 53 | 107 | 4 | 12 | 22 | 35 |
320 | 53.24 | 26.42 | 3.65 | 80 | 36.77 | 13.56 | 37.6 | 648 | 1203 | 53 | 107 | 4 | 12 | 22 | 45 |
300 | 53.24 | 26.42 | 3.65 | 100 | 36.77 | 13.56 | 37.6 | 658 | 1222 | 53 | 107 | 4 | 12 | 22 | 57 | [119] |
… | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … |
300 | 53.24 | 26.42 | 3.65 | 100 | 36.77 | 13.56 | 37.6 | 659 | 1223 | 46 | 114 | 4 | 10 | 22 | 45 |
303.75 | 45.8 | 21.4 | 13.7 | 101.25 | 34.52 | 20.66 | 32.43 | 683 | 1269 | 81 | 81 | 4.05 | 8 | 0 | 10.5 | [118] |
… | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … |
303.75 | 45.8 | 21.4 | 13.7 | 101.25 | 34.52 | 20.66 | 32.43 | 683 | 1269 | 81 | 81 | 4.05 | 10 | 0 | 13 |
303.75 | 45.8 | 21.4 | 13.7 | 101.25 | 34.52 | 20.66 | 32.43 | 683 | 1269 | 40.5 | 121.5 | 4.05 | 14 | 0 | 31.7 |
0 | 0 | 0 | 0 | 400 | 37.73 | 14.42 | 37.34 | 810 | 990 | 57.1 | 143 | 8 | 12 | 25 | 89.6 | [130] |
… | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … |
0 | 0 | 0 | 0 | 400 | 37.73 | 14.42 | 37.34 | 810 | 990 | 57.1 | 143 | 12 | 12 | 25 | 89.2 |
0 | 0 | 0 | 0 | 400 | 37.73 | 14.42 | 37.34 | 810 | 990 | 57.1 | 143 | 24 | 12 | 25 | 84.1 |
204.5 | 65.6 | 28 | 1 | 204.5 | 30.61 | 16.24 | 34.48 | 554 | 1293 | 41 | 102 | 0 | 10 | 0 | 53.5 | [115] |
… | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … |
102 | 65.6 | 28 | 1 | 307 | 30.61 | 16.24 | 34.48 | 554 | 1293 | 41 | 102 | 0 | 10 | 0 | 55.5 |
0 | 65.6 | 28 | 1 | 409 | 30.61 | 16.24 | 34.48 | 554 | 1293 | 41 | 102 | 0 | 10 | 0 | 58.6 |
0 | 0 | 0 | 0 | 400 | 31.63 | 13.42 | 36.35 | 740 | 1110 | 12.9 | 82.5 | 0 | 0 | 22 | 34.6 | [114] |
… | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … |
0 | 0 | 0 | 0 | 400 | 31.63 | 13.42 | 36.35 | 785 | 1085 | 12.9 | 82.5 | 0 | 0 | 22 | 53.6 |
0 | 0 | 0 | 0 | 400 | 31.63 | 13.42 | 36.35 | 790 | 1065 | 12.9 | 82.5 | 0 | 0 | 22 | 66.7 |
The Pearson correlation coefficient was employed to discern the relationships among effective parameters and CSGePoCo, as shown in
Figure 8. In addition, the pivotal variables influencing the prediction of mechanical strength were assessed using permutation features. Furthermore,
Table 4 summarizes the statistical properties of the data used, indicating their maximum and minimum based on 156 data samples. Meanwhile,
Table 4 presents the summary of the descriptive statistics, including the average, standard deviation, minimum, and maximum. It is noteworthy that the variables incorporated in developing the models exert a significant effect on the robustness of the models. It is acknowledged that the reactivity of raw materials, such as fly ash, can exhibit significant variations depending on their geographical origin and processing conditions. This inherent variability is attributed to differences in mineralogical composition, particle size distribution, and the presence of impurities.
A box plot, also known as a box-and-whisker plot, is a graphical representation of the distribution of a dataset. It provides a visual summary of key statistics, including the minimum, first quartile (Q1), median, third quartile (Q3), and maximum. Box plots are particularly useful for identifying outliers in a dataset.
The box in the plot represents the interquartile range (IQR), which is the range between the first quartile (Q1) and the third quartile (Q3). The line inside the box represents the median (Q2). Furthermore, whiskers extend from the box to the minimum and maximum values within a specified range. They can be determined using different methods, such as 1.5 times the IQR or a fixed percentage of the data. Any data point beyond the whiskers is considered a potential outlier.
Notably, outliers are individual data points that fall significantly outside the overall pattern of the dataset. Outliers can be identified as points beyond the whiskers of the box plot. Outliers may be classified into mild outliers (1.5 to 3 times IQR) and extreme outliers (beyond 3 times IQR). Outliers are often represented as individual points or asterisks outside the whiskers. The shape of the box plot can also provide insights into the symmetry or skewness of the data. A skewed distribution might have one whisker longer than the other. If data points lie beyond the whiskers, they are potential outliers.
The box plot of parameters is depicted in
Figure 9. It can be seen that the parameters of CT, SiO
2-GGBS, SiO
2-Fly, Al
2O
3-Fly, and CaO-Fly suffer from outlier data.
5. Results and Discussion
Analyzing the results presented in
Table 9 revealed that all four artificial intelligence (AI) models, namely, RF, XGBoost, GWO–XGBoost, and RF–GWO–XGBoost, exhibit relatively low errors, and their accuracy levels are deemed acceptable. Notably, RF’s performance slightly trails that of XGBoost in both the training and testing phases. Interestingly, the optimization features of the GWO algorithm enhance the accuracy of the XGBoost model, leading to superior performance of the GWO–XGBoost model compared to both XGBoost and RF approaches.
This research introduces a hybrid strategy, resulting in the novel hybrid RF–GWO–XGBoost system for anticipating CSGePoCo. The outcomes in
Table 9 demonstrate that the developed RF–GWO–XGBoost system achieves acceptable efficiency with a high level of accuracy, surpassing even the GWO–XGBoost system across all datasets. This suggests that the integration plan of stacking and optimization techniques applied to RF and XGBoost approaches provides an alternative training set with improved normalization and regressions, thereby enhancing the predictive capability of the GWO–XGBoost model. Visual representations of these results can be observed in
Figure 12,
Figure 13,
Figure 14 and
Figure 15.
Figure 12,
Figure 13,
Figure 14 and
Figure 15 illustrate the correlation of the CSGePoCo value (actual versus estimated) for the RF–GWO–XGBoost model, demonstrating its superiority over the RF, XGBoost, and GWO–XGBoost models, particularly on the training dataset. Notably, the optimization models (GWO–XGBoost and RF–GWO–XGBoost) exhibit significantly improved correlation, with the proposed hybrid RF–GWO–XGBoost model performing exceptionally well. In the testing phase, the regression levels of the optimization models surpass those of other models, with the proposed RF–GWO–XGBoost model standing out.
A further assessment of the developed models is conducted by comparing their performance against measured values of CSGePoCo in
Figure 16. To provide a comprehensive evaluation, violin plots and Taylor diagrams are employed on both the training and testing datasets, as illustrated in
Figure 17 and
Figure 18, respectively.
Figure 17 reveals that the violin plots exhibit a close mean value of predicted CSGePoCo by the proposed RF–GWO–XGBoost model to measured CSGePoCo. Moreover, the Taylor diagram demonstrates that the proposed RF–GWO–XGBoost model excels in predicting CSGePoCo, closely aligning with the actual values.
In order to gain deeper insights into the operational mechanism of the proposed RF–GWO–XGBoost model and to elucidate the contributions of input variables in the modeling process, an investigation into the significance levels of these effective parameters was conducted using an analysis of the sensitivity. Sensitivity analysis is an integral technique applied across diverse fields, such as engineering, economics, and environmental sciences, aiming to assess the impact of input parameters or assumptions on the output of a model or system. Within sensitivity analysis, the cosine amplitude method (CAM), a specific approach, was employed to gauge how changes in input parameters influence the variability in the model’s output. This method systematically varies individual input parameters while keeping other factors constant, measuring resultant changes in the model’s output. The nomenclature of this method is derived from the cosine function, which defines the amplitude of the input parameter variations. Through this methodology, researchers can quantitatively measure the sensitivity of the model to specific input variations and pinpoint parameters exerting the most substantial influence on the model’s behavior. This analysis enables researchers to identify critical parameters contributing significantly to output variability, facilitating the prioritization of resources and efforts toward addressing and optimizing these influential factors. Moreover, the CAM offers a systematic and structured approach for exploring the impact of parameter changes, allowing researchers to assess how sensitive the model is to both small and large fluctuations in input parameters [
140,
141]. By conducting sensitivity analysis utilizing the CAM, researchers can make informed decisions regarding the model’s design, input parameter selection, and overall robustness. This enhances the model’s predictive capabilities and ensures its applicability in real-world scenarios [
142]. The effectiveness and relative significance of the influential parameters were assessed through sensitivity analysis using the CAM, employing the following two renowned techniques [
113,
143,
144,
145]:
where x
ik and x
jk are the input and output parameters, and m stands for the number of datasets.
The importance of the parameters in the proposed RF–GWO–XGBoost model can be presented as CaO-Fly < SP dosage < NaOH < FA < CT < NaOH < Al
2O
3-Fly < SiO
2-Fly < CaO-GGBS < Al
2O
3-GGBS < Coarse < Na
2SiO
3 < GGBS < SiO
2-GGBS < Fine with sensitivity values of 0.609, 0.71, 0.813, 0.814, 0.822, 0.832, 0.884, 0.892, 0.916, 0.921, 0.922, 0.933, 0.938, 0.947, and 0.948. The findings presented in
Figure 19 reveal that the variables Fine, SiO
2-GGBS, GGBS, Na
2SiO
3, Coarse, Al
2O
3-GGBS, and CaO-GGBS exert considerable influence over CSGePoCo, with Al
2O
3-Fly and SiO
2-Fly playing particularly significant roles. Notably, the variable CaO-Fly and SP dosage exhibit the least effect on CSGePoCo, and its exact application in predicting CSGePoCo remains indeterminate. Furthermore, the variables CaO-GGBS, Al
2O
3-GGBS, Coarse, Na
2SiO
3, GGBS, SiO
2-GGBS, and Fine demonstrate high sensitivity to CSGePoCo, especially CaO-GGBS and Al
2O
3-GGBS. These variables should be diligently considered and incorporated into the predictive modeling of CSGePoCo. Additionally, the variable CaO-Fly, while showing minimal impact, lacks precision in its role in CSGePoCo modeling.
In this step, for validation of the prediction results and to evaluate the performance of the developed models, a set of data points comprising ten unseen data points was employed as shown in
Table 10.
6. Conclusions
This research addresses the pressing need for sustainable alternatives to traditional Portland cement production for landscape concrete by introducing a novel hybrid RF–GWO–XGBoost algorithm for anticipating the CSGePoCo. GePoCo presents challenges as an eco-friendly alternative due to its complex cementitious matrix and intricate mix design, necessitating accurate predictive models for optimizing its performance. Through the utilization of machine learning approaches, specifically the integration of random forest (RF), the gray wolf optimization (GWO) algorithm, and extreme gradient boosting (XGBoost), this study successfully developed the RF–GWO–XGBoost model. The stacking strategy, involving multiple RF models with diverse hyperparameters and subsequent optimization through the GWO algorithm, resulted in the superior RF–GWO–XGBoost model. Comparative analysis with stand-alone RF and XGBoost models, and a proposed GWO–XGBoost system, revealed the remarkable performance enhancement achieved by the proposed methodology. The RF–GWO–XGBoost model exhibited the highest accuracy and reliability, as indicated by the RMSE, MAE, R2, and a20 index values. This comprehensive evaluation demonstrated the model’s effectiveness in predicting GePoCo compressive strength, emphasizing the significant impact of certain variables, such as Fine, SiO2-GGBS, GGBS, Na2SiO3, Coarse, Al2O3-GGBS, and CaO-GGBS. Importantly, this study highlighted the limited influence of the CaO-Fly variable on GePoCo, underlining its imprecise role in the modeling process. Overall, the findings of this research contribute to the advancement of predictive modeling for GePoCo and underscore the RF–GWO–XGBoost algorithm’s potential in addressing the intricate challenges associated with alternative and sustainable concrete production. This novel approach has practical implications for the construction industry, providing a reliable tool for optimizing geopolymer concrete formulations, thereby promoting environmentally conscious practices in the field of civil engineering.