Next Article in Journal
Thin Layers of Cerium Oxynitride Deposited via RF Sputtering
Previous Article in Journal
Effects of Curing Temperature on Expansion of Concrete Due to ASR
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Efficient Compressive Strength Prediction of Alkali-Activated Waste Materials Using Machine Learning

Material and Chemical Research Laboratories, Industrial Technology Research Institute, Hsinchu 31040, Taiwan
*
Author to whom correspondence should be addressed.
Materials 2024, 17(13), 3141; https://doi.org/10.3390/ma17133141
Submission received: 16 May 2024 / Revised: 19 June 2024 / Accepted: 25 June 2024 / Published: 27 June 2024
(This article belongs to the Section Construction and Building Materials)

Abstract

:
This study explores the integration of machine learning (ML) techniques to predict and optimize the compressive strength of alkali-activated materials (AAMs) sourced from four industrial waste streams: blast furnace slag, fly ash, reducing slag, and waste glass. Aimed at mitigating the labor-intensive trial-and-error method in AAM formulation, ML models can predict the compressive strength and then streamline the mixture compositions. By leveraging a dataset of only 42 samples, the Random Forest (RF) model underwent fivefold cross-validation to ensure reliability. Despite challenges posed by the limited datasets, meticulous data processing steps facilitated the identification of pivotal features that influence compressive strength. Substantial enhancement in predicting compressive strength was achieved with the RF model, improving the model accuracy from 0.05 to 0.62. Experimental validation further confirmed the ML model’s efficacy, as the formulations ultimately achieved the desired strength threshold, with a significant 59.65% improvement over the initial experiments. Additionally, the fact that the recommended formulations using ML methods only required about 5 min underscores the transformative potential of ML in reshaping AAM design paradigms and expediting the development process.

1. Introduction

Alkali-activated materials (AAMs) are increasingly acknowledged for their environmentally friendly characteristics and reduced carbon footprint [1]. A key factor in fostering the principles of the circular economy is the utilization of alkali activation technology. Alkali-activated materials (AAMs), encompassing geopolymers, are of interest due to their ability to utilize industrial by-products and waste materials, thereby further enhancing their environmental benefits. Thapa et al. [2] propose that the reaction mechanisms of AAM and geopolymer differ. AAMs, containing higher calcium content in their raw materials, form a binder gel after dissolution in alkaline solutions of moderate to high alkalinity, resulting in phases of calcium silicate hydrate/aluminate hydrate and an alumino-silicate network structure. In contrast, geopolymers, with lower calcium content in their raw materials, form a polymeric structure after dissolution in alkaline solutions of lower alkalinity, ultimately resulting in an alumino-silicate network structure [3,4,5]. AAMs exhibit wide-ranging applications across various industrial sectors, bolstering their commercial viability [6]. Manufacturing AAMs using industrial waste materials, such as blast furnace slag [7,8], fly ashes [9,10], metallurgical slags [11,12], or waste glass [13,14,15], presents two benefits: they not only involve the utilization of waste products to create value-added products, but also significantly decrease the environmental impact across many industries by conserving vast amounts of raw materials [16,17].
However, the application of AAMs is currently confronting persistent engineering difficulties, such as loss of workability [18], rheological issues [19], cost concerns [20], the need for special types of curing [21], and the limited reactivity of precursors, especially for those derived from waste streams [22]. This necessitates the incorporation of alkaline activators, resulting in intricate chemical interactions. Additionally, the diverse range of raw materials poses challenges in formulating standardized mix designs. Despite these issues, optimism about the use of AAMs persists due to several compelling reasons.
Studies have demonstrated the excellent performance of AAMs and their remarkable effectiveness in various specialized engineering applications, such as their use as repair materials [23], in marine constructions [24], as pavement base materials [25], as 3D printing raw materials [26], and in the containment of hazardous waste materials [27]. The integration of environmentally conscious substitutes like AAMs in these specialized domains offers the potential for enhanced performance and longevity, contributing to the sustainability and resource efficiency of infrastructure. In 2023, research conducted by Liu et al. [28] using borax overcame the problem of AAMs being unable to solidify at room temperature, thereby avoiding the complexity and cost of thermal solidification. By solving these drawbacks, the practical applicability of AAMs in construction can be significantly enhanced, making them a more viable and sustainable alternative. Therefore, the belief is that future solutions to these engineering problems will not impede the feasibility of applying AAMs.
Conventional approaches to AAM design heavily rely on empirical experiments conducted by experienced operators [29,30]. These methods involve trial-and-error processes, demanding substantial time and resource investments [31,32]. The recent advancements in AI have significantly transformed materials science and computational design. By harnessing machine learning (ML) algorithms, researchers can now predict material properties, optimize mixtures, and expedite the discovery of novel materials. This AI-driven approach allows scientists to simulate and analyze various material compositions, dramatically reducing the need for extensive physical experiments and addressing the traditionally time-consuming validation cycles.
The assessment of compressive strength (CS), a crucial parameter in AAM design, typically involves 28 days of destructive compressive strength testing, even by experienced operators. While non-destructive methods exist for estimating compressive strength, direct and precise testing of compressive strength still requires materials to undergo destruction, making destructive compressive strength tests the mainstream analysis method. Additionally, the prolonged durations of trial-and-error experiments and curing periods consume substantial manpower and time [33,34,35,36]. Therefore, there is a critical need to develop rapid and effective mixture design methods to achieve optimal performance of AAMs.
ML has recently emerged as a powerful technique to predict the compressive strength of concrete using various algorithms [37,38,39,40,41,42,43,44]. Table 1 summarizes the findings from prior research relating to ML algorithms employed in predicting the diverse properties of concrete. Based on this literature, it is understood that at least 100 sets of concrete material data are required to establish a model robust enough for application. This is due to the significant variability in the composition of waste materials, necessitating a substantial amount of data for the potential development of applicable ML models. Thus, the size of a dataset plays a pivotal role in achieving high efficacy in terms of ML models, particularly in the field of materials science. Datasets in this field can vary widely, ranging from small-scale to extensive (e.g., small-scale (<100) to extensive (>1000)), and significantly impact the training processes of ML models [45]. In cases where the available data are insufficient, the resulting model might exhibit diminished predictive accuracy or struggle with generalization, leading to overfitting [46,47,48]. Therefore, it is highly desired to devise effective strategies to alleviate the adverse effects that arise from working with small datasets.
In this study, the model’s performance was validated using various evaluation metrics commonly used in the literature [49,50,51] to ensure its reliability. To our knowledge, this study is the first to apply ML techniques to investigate over 20 factors related to blast furnace slag (BFS), fly ash (FA), reducing slag (RS), and waste glass (WG) in the context of AAMs. The insights gained can guide the design of future BFS–FA–RS–WG-based AAMs and support the use of industrial waste materials in construction.
The main objective of this research is to develop a robust ML model for predicting the compressive strength of AAMs, addressing the challenges posed by the variability in raw material compositions and the limitations of small datasets. This study is original in its application of ML to AAMs, particularly in dealing with diverse recycled materials and small dataset constraints, which have not been extensively explored in previous research.
To address these challenges, this research achieved the following aims:
  • Establish ML models using experimental data from AAMs to accelerate the development of AAMs;
  • Overcome the challenges in building ML models with small datasets using data-processing methods;
  • Utilize the established ML models in conjunction with experimental validation to identify AAM formulations with high compressive strength.

2. Materials and Methods

2.1. Experimental Design and Dataset

(1)
Raw materials
The dataset, provided by the Material and Chemical Research Laboratories (MCL) and Industrial Technology Research Laboratories (ITRI), is referred to as the “ITRI AAM database [52]”. It includes 42 samples, each characterized by various raw materials of Blast furnace slag (BFS), Fly ash (FA), Reducing slag (RS), Waste glass (WG).
(2)
Physical and chemical characteristics
-
The recipes for the AAMs consist of precursor activators (PAs), alkali activators (AAs), and process parameters such as mixing time (t) and alkali solution temperature (T).
-
The precursor activators include BFS obtained from a steel plant (Taiwan), with a D50 of 9.5 µm, and fly ashes (FAs) obtained from three different power plants (Taiwan), with D50 values ranging from 14 μm to 15 μm.
-
The alkali activators include RS obtained from three different steel plants (Taiwan), where the activator purities range from 0.071% to 0.050%, and the molar ratios of Na/Si range from 0.068% to 0.043%. The D50 of the RS powder ranges from 14 μm to 15 μm. Additionally, the alkali activators include WG obtained from four different glass plants (Taiwan), where the activator purities range from 2.180% to 6.959%, and the molar ratios of Na/Si range from 2.435% to 9.540%. The D50 of the WG powder ranges from 6 μm to 7 μm.
-
For the alkali solutions, there are two types of alkali solutions (NaOH and Na2SO4), with 99% purity.
-
The composition transformation dataset was obtained via an X-ray fluorescence analysis of the materials used. An automated spectrometer (RIX 200, manufactured by Thermo Fisher Scientific, Wilmington, MA, USA) was used to determine the chemical compositions. The key constituents include Fe2O3, Al2O3, SiO2, K2O, Na2O, CaO, and MgO. The minor constituents include ZrO2, B2O3, TiO2, Bi2O3, and SrO. The chemical composition analysis of the powder is shown in Table 2.
(3)
Sample preparation and testing
The dataset includes experimental results relating to high-strength AAMs designed for construction applications. The samples were prepared by the ITRI–MCL laboratory as follows:
-
Step 1: The alkali activators were mixed for 3 min and then cooled to room temperature;
-
Step 2: The precursor activators were then stirred and mixed with the alkali activators for 10 min;
-
Step 3: The paste was cast into molds and cured at room temperature;
-
Step 4: The compressive strength was assessed using a 50 mm cube following ASTM C109 procedures [53], with three independent tests.
The process parameters concern the alkali solution mixing time (t): t1 for the alkali activators, and t2 for both the precursor activators (PAs) and alkali activators (AAs).
(4)
Validation and application of ML models
The validation method involved several steps:
-
Step 1: Experimental Validation
The recommended recipes were tested experimentally. The results were compared to the predicted values, with the goal of achieving a target compressive strength of 30 MPa.
-
Step 2: Model Improvement
Additional experimental data were collected, expanding the dataset to 45 data points. The model was retrained, and new recipes were tested to assess prediction accuracy.
-
Step 3: Final Validation
The dataset was further expanded to 48 data points through continuous experimentation. New recipes were tested to confirm whether a target compressive strength of 30 MPa was consistently achieved.

2.2. Data Processing

Data processing encompassed several stages: initial data processing, composition transformation, composition feature classification, feature selection, and feature construction. Figure 1 illustrates the overall process. The steps are defined as follows:
Initial data processing involved data cleaning and the preparation of the initial dataset. This process included identifying and handling missing values, addressing outliers, and performing data scaling and normalization to ensure data quality and consistency.
Composition transformation involves converting the composition data in the dataset into a format that is understandable by ML models.
At this stage, the composition data were classified based on their features for further analysis and modeling.
Feature selection chooses the most informative features from the dataset to improve the performance and effectiveness of the ML models.
Feature construction involved combining, transforming, and/or generating new features to enhance the performance and generalization of models.
This comprehensive approach aims to ensure data quality, enhance model performance, and facilitate subsequent analysis and modeling tasks.

2.2.1. Composition Transformation

The composition transformation dataset was obtained via an X-ray fluorescence analysis of the materials used. An automated spectrometer (RIX 200, manufactured by Thermo Fisher Scientific, Wilmington, MA, USA) was used to determine the chemical compositions. Then, the individual components were combined in a dataset. Table S1 presents the statistical characteristics and distribution of the variables under investigation, providing the featured information of the composition transformation dataset containing 42 pieces of data. During the measurement, the process parameters and the target features remain unchanged, as indicated in Table 3.

2.2.2. Composition Feature Classification

The composition feature classification dataset was derived from the composition transformation dataset. This process categorizes the precursor activator and alkali activator into separate columns based on their reaction mechanisms, as shown in Table 4. Given that the reaction mechanisms of precursor activator and alkali activator differ in actual chemical reactions, separating the compositions of these two distinct AAM mechanisms enables the ML model to effectively distinguish the impact of these materials on the compressive strength of AAMs. This separation aids in improving the model’s accuracy in predicting compressive strength. Table 4 presents the statistical characteristics and distribution of the investigated variables, providing the feature information of the composition feature classification dataset for these 42 pieces of data. During the measurement, the process parameters and target features remain unchanged, as indicated in Table 3.

2.2.3. Feature Selection

The feature selection dataset removes composition fields with lower content, preserving only the fields of primary composition. The increased number of columns in the dataset after classification (adding an additional 14 columns) may lead to poor accuracy in ML models. Therefore, essential constituents affecting compressive strength were selected based on alkali activation principles [48]. These key constituents included Fe2O3, Al2O3, SiO2, K2O, Na2O, CaO, and MgO, along with a category denoted as “Others”. The “Others” category includes the remaining components, such as ZrO2, B2O3, TiO2, Bi2O3, and SrO, which are those other than the above-mentioned seven major constituents. Table S2 presents the statistical characteristics and distribution of the variables under investigation, providing the feature information of the feature selection dataset containing 42 pieces of data. During the measurement, the process parameters and target features remain unchanged, as indicated in Table 3.

2.2.4. Feature Construction

The feature construction dataset includes the creation of new feature columns, such as volume molar concentration, based on alkali activation principles. In this study, a total of 6 features were established, all of which were calculated from the dataset filtered through key constituents of compressive strength. Table S3 presents the statistical characteristics and distribution of the variables under investigation in our study, providing the featured information of the feature construction dataset containing 42 pieces of data. During the measurement, the composition, process parameters, and target features remained unchanged, as indicated in Table 3 and Table S2.

2.3. Model Assessment

In this study, all of the data in the initial dataset were trained. To mitigate potential issues with poor model generalization, the model was evaluated using 5-fold cross-validation and assessed for performance using metrics such as Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and Coefficient of Determination (R2) values.

2.3.1. Fivefold Cross-Validation

Fivefold cross-validation was employed to assess the performance of the model, as shown in Figure 2. This method involves dividing the dataset into 5 subsets, using 4 subsets for training the model each time, and reserving the remaining subset for testing. This process was repeated 5 times, and each time used a different subset for testing. The final performance evaluation is the average of the results from the 5 tests.

2.3.2. Error Matrices

Mean Absolute Error (MAE)

Mean Absolute Error (MAE) is a measure of the average difference between the predicted and the actual values.
The mathematical formula for MAE is as follows:
M A E = i n y f n
The vector y = ( y 1 ,   y 2 , , y n ) represents the actual values of the target variable. The vector f = ( f 1 ,   f 2 , , f n ) represents the predicted values of the target variable.

Root Mean Squared Error (RMSE)

Root Mean Squared Error (RMSE) is a measure of the square root of the average of the squared differences between the predicted and the actual values. It provides a measure of the typical deviation of the predictions from the actual values.
The mathematical formula for RMSE is as follows:
R M S E = i n y f 2 n
The vectors y and f are as above.

Coefficient of Determination (R2)

The mathematical formula for R2 is as follows:
R 2 = 1 i n ( y y ¯ ) 2 i n ( y f ) 2
The vectors y and f are as above.

3. Results and Discussion

3.1. Input and Output Variables

Figure 3 illustrates a pair plot depicting the distribution of precursor activator, alkali activator, process parameters, and compressive strength (target feature). The pair plot is created using the Seaborn package [54]. The shaded areas in the plot result from setting the parameter “kind” to “reg” in the “pairplot” function. This means that a regression line is added to the scatter plots between each pair of variables, with the shaded area representing the confidence interval of the regression line, indicating the uncertainty of the regression model. The wider the shaded area, the lower the confidence in the model; conversely, the narrower the shaded area, the higher the confidence in the model. In certain instances, the distribution of data points appears sparse, such as the correlation of t1 with other features/targets, t2 with other features/targets, and curing temperature with other features/targets. This data imbalance can pose challenges in model development. Figure 4 displays the Pearson correlation matrix of the precursor activator, alkali activator, process parameters, and compressive strength (target feature). Further to the moderately positive/negative correlation observed between BFS and WG/NaOH/t2, FA and NaOH, WG and NaOH/t2, NaOH and t2, and t1 and t2, other variables do not show significant correlations. These results agree with results reported in the literature [55].

3.2. ML Model Performance

Figure S1 and Table S4 present correlation scatter plots for each dataset used in this study that were obtained from the employed ML models, specifically SGD [49], DT [50], RF [56], and K-NN [57]. All these ML algorithms were implemented using the Python library Scikit-learn [58]. These results were obtained through fivefold cross-validation, with model performance evaluated using metrics i.e., MAE, RMSE, and R2, which are concisely summarized in Tables S4–S8. The correlation scatter plots of the initial dataset are depicted in Figure S1, accompanied by statistical error metrics provided in Table S4. The best model performance (i.e., MAE (41.83 MPa), RMSE (54.40 MPa), R2 (0.05)) for the initial dataset was achieved by K-NN model. The poor performance of the SGD model is attributed to its suitability for modeling larger datasets, whereas the initial dataset only consisted of 42 entries, resulting in suboptimal performance for SGD. Similarly, Figure 5 shows the correlation scatter plots of the composition transformation dataset, with corresponding statistical error metrics shown in Table S5. Here, the RF model demonstrates the best performance (i.e., MAE (36.65 MPa), RMSE (44.29 MPa), and R2 (0.50)). Figure S2 illustrates the correlation scatter plots of the composition feature classification dataset, and Table S6 presents the statistical error metrics. Once again, the RF model exhibits the highest performance (i.e., MAE (33.65 MPa), RMSE (41.55 MPa), and R2 (0.53)). The correlation scatter plots of the feature selection dataset are displayed in Figure S3, along with statistical error metrics in Table S7. In this case, the RF model also yields the best performance (i.e., MAE (32.83 MPa), RMSE (38.85 MPa), and R2 (0.59)). Additionally, Figure S4 presents the correlation scatter plots of the feature construction dataset, with statistical error metrics provided in Table S8. The RF model performs the best in this scenario as well. Figure 6 shows the performance of the established models in terms of the values of MAE, RMSE, and R2, respectively, using datasets processed through different data processing methods. Figure 6a–c illustrates the model evaluation results using the MAE, RMSE, and R2 metrics, respectively. Regardless of the metric used for model evaluation, the results consistently indicate a significant improvement in accuracy after the initial dataset undergoes composition transformation. This suggests that the accuracy of the models is indeed influenced by the different precursor activators and alkali activators (composition differences), aligning with the theoretical understanding that CS of AAMs is affected by the composition of precursor activators and alkali activators. Furthermore, the results show a notable improvement in accuracy after the composition transformation dataset is subjected to composition feature classification. Because the composition fields of precursor activators and alkali activators have different chemical reactions that affect AAM compressive strength [59], separating the composition fields of precursor activators and alkali activators facilitates the identification of correlations between the AAM compressive strength of the composition of precursor activators and alkali activators. Moreover, after performing feature selection on the composition feature classification dataset, there is a significant increase in model accuracy. Feature selection involved filtering out major composition components and treating minor components as impurities (other columns), and the results show that minor composition components have negligible effects on CS of AAMs. Additionally, adding six chemical indicator feature factors effectively enhances model accuracy, with these six features strongly correlated with CS (e.g., liquid binder ratio, molarity). The modeling results obtained after the aforementioned data processing demonstrate that ML models, combined with knowledge from the chemical domain (e.g., composition transformation, composition feature classification), can effectively improve model accuracy.

3.3. Prediction and Experimental Results

To verify this result, the established model from the feature construction dataset was utilized to recommend recipes. The model utilized a dataset consisting of 42 data points. The recipes selected for experimental validation were the top three with the highest predicted compressive strength recommended by the ML model. Figure 7 shows the comparison between the predicted results and the actual experimental values for these three recipes (samples 1 to 3). The predicted compressive strengths for these recipes were 14.58 MPa, 14.04 MPa, and 13.41 MPa, respectively, while the actual compressive strengths were 26.99 MPa (SD = 0.752), 19.9 MPa (SD = 0.684), and 28.91 MPa (SD = 1.953), respectively. Although there is a significant difference between the predicted and experimentally verified compressive strength, the experimental results show that the AAMs formulation recommended by the model achieved a compressive strength of 28.91 MPa. This indicates that the model has indeed learned the formulation features associated with high compressive strength AAMs. However, because the initial data had an average compressive strength of 11.85 MPa (refer to Table 3), predicting high compressive strength recipes constitutes extrapolation for the model, leading to a large discrepancy between the predicted and actual values. In the future, continuously adding high compressive strength data can improve the model’s accuracy. Notably, the actual compressive strengths of two of the recipes were higher than the highest compressive strength of 23.84 MPa (SD = 1.222) from the initial 42 data points. Although the recipes recommended by this model did not reach the target compressive strength of 30 MPa, the literature suggests that adding new data can improve the model’s predictive ability [60]. Additionally, this study also employed the “active learning“ method from the literature, which is a cyclic optimization approach where the model recommends recipes, experiments produce new data, and this new data is then added to the model training to enhance its predictive ability. Therefore, we attempted to add three experimental recipes (Figure 7, samples 1 to 3) that obtained values close to 30 MPa, with the expectation of enhancing the model’s capability to recommend recipes. These data (45 data points) were incorporated into the training of the ML model, aiming to find recipes that achieved the target compressive strength. Samples 4 to 6 in Figure 7 show the experimental validation results when repeating the above-mentioned process. The predicted compressive strengths for these recipes were 23.64 MPa, 23.64 MPa, and 22.32 MPa, respectively, while the actual compressive strengths predicted by the ML for these recipes were 27.76 MPa (SD = 1.708), 17.58 MPa (SD = 1.964), and 19.96 MPa (SD = 1.791), respectively, meaning that we were unable to achieve the target strength. Therefore, recipes continued to be recommended in the same manner as shown in samples 7 to 9 in Figure 7. The predicted compressive strengths for these recipes were 18.63 MPa, 17.52 MPa, and 17.25 MPa, respectively, while the actual compressive strengths of these recipes were 33.42 MPa (SD = 2.283), 21.79 MPa (SD = 0.545), and 38.06 MPa (SD = 1.722), respectively. With this model established using 48 data points, the recommended recipes ultimately achieved the target compressive strength of 30 MPa. From Figure 8, it can be seen that the initial data obtained from the initial experiments (42 samples) showed a maximum compressive strength of 23.84 MPa. After two iterations of modeling (48 samples), the recommended recipe achieved a compressive strength of 38.06 MPa. Therefore, it is evident that the formula recommended by this method can effectively increase the compressive strength by 14.22 MPa (a 59.65% improvement in compressive strength).

4. Conclusions

This study successfully demonstrates the potential of ML techniques in predicting the compressive strength of AAMs derived from industrial waste, specifically blast furnace slag (BFS), fly ash (FA), reducing slag (RS), and waste glass (WG). By addressing the inherent challenges in traditional AAM design, which relies heavily on empirical experiments, our research provides a more efficient and cost-effective approach.
Key conclusions from this study include the following:
  • Effective data processing: Rigorous data processing, including composition transformation, feature classification, feature selection, and feature construction, was crucial in enhancing model accuracy. Each stage contributed to refining the dataset, allowing the ML models to better capture the relationships between material compositions and compressive strength. This approach significantly improved model accuracy (R2), increasing the R2 value by 0.57 (from 0.05 to 0.62), to accelerate the identification of optimal AAM recipes.
  • Significant experimental findings: The ML model, initially built with 42 data points, predicted compressive strengths for the top recipes that were subsequently validated experimentally. Although the initial predictions were below the target compressive strength of 30 MPa, iterative improvements, including the addition of new data, led to substantial gains. The final model, based on 48 data points, achieved a maximum compressive strength of 38.06 MPa, representing a 59.65% improvement. This iterative process demonstrated the ML model’s ability to significantly enhance the predictive accuracy and effectiveness of AAM recipes.
  • Overcoming traditional challenges: By addressing the time-consuming nature of traditional AAM design methods reliant on empirical experiments, our ML-driven approach offers an efficient and cost-effective alternative. It not only predicts material properties accurately but also optimizes mixtures, expediting the discovery of novel materials and mitigating the resource-intensive validation cycles. This ML method recommends recipes in just 5 min, revolutionizing AAM design in construction.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/ma17133141/s1, Table S1: Variable statistical characteristics of composition transformation dataset; Table S2. Variable statistical characteristics of feature selection dataset; Table S3 Variable statistical characteristics of additional features in Feature construction dataset; Table S4. Statistical error metrics of initial dataset obtained for employed ML models; Table S5. Statistical error metrics of composition transformation dataset obtained for employed ML models; Table S6. Statistical error metrics of composition feature classification dataset obtained for employed ML models; Table S7. Statistical error metrics of composition feature selection dataset obtained for employed ML models; Table S8. Statistical error metrics of feature construction dataset obtained for employed ML models; Figure S1. Correlation scatter plots of initial dataset obtained for employed ML models; Figure S2. Correlation scatter plots of composition feature classification dataset obtained for employed ML models; Figure S3. Correlation scatter plots of feature selection dataset obtained for employed ML models; Figure S4. Correlation scatter plots of feature construction dataset obtained for employed ML models.

Author Contributions

Methodology, C.-H.H., M.-H.C. and H.-Y.C.; Software, H.-Y.C. and C.-H.H.; Validation, C.-H.H. and H.-Y.C.; Formal Analysis, C.-H.H. and H.-Y.C.; Investigation. C.-H.H. and H.-Y.C.; Writing—Original Draft Preparation, C.-H.H. and H.-Y.C.; Writing—Review and Editing, C.-H.H., M.-H.C., C.-F.L., T.-Y.L. and K.-C.C.; Project Administration, C.-H.H. and H.-Y.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Department of Industrial Technology, Ministry of Economic Affairs.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The ITRI-AAM database is not publicly available and, therefore, these data are not included in any of the shared files. Requests for the ITRI-AAM database should be directly sent to Hao-Yu Chan at [email protected].

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Nodehi, M.; Taghvaee, V.M. Alkali-Activated Materials and Geopolymer: A Review of Common Precursors and Activators Addressing Circular Economy. Circ. Econ. Sustain. 2022, 2, 165–196. [Google Scholar] [CrossRef]
  2. Thapa, V.B.; Waldmann, D. A short review on alkali-activated binders and geopolymer binders. In Vielfalt im Massivbau—Festschrift zum 65. Geburtstag von Prof. Dr. Ing. Jürgen Schnell; Ernst & Sohn: Berlin, Germany, 2018. [Google Scholar]
  3. Davidovits, J. Geopolymers—Inorganic polymeric new materials. J. Therm. Anal. Calorim. 1991, 37, 1633–1656. [Google Scholar] [CrossRef]
  4. Lamaa, G.; Duarte, A.P.C.; Silva, R.V.; Brito, J.D. Carbonation of Alkali-Activated Materials: A Review. Materials 2023, 16, 3086. [Google Scholar] [CrossRef]
  5. Peys, A.; Douvalis, A.P.; Siakati, C.; Rahier, H.; Blanpain, B.; Pontikes, Y. The influence of air and temperature on the reaction mechanism and molecular structure of Fe-silicate inorganic polymers. J. Non-Cryst. Solids 2019, 526, 119675. [Google Scholar] [CrossRef]
  6. Shirkouh, A.H.; Meftahi, F.; Soliman, A.; Godbout, S.; Palacios, J. Performance of Eco-Friendly Zero-Cement Particle Board under Harsh Environment. Appl. Sci. 2024, 14, 3118. [Google Scholar] [CrossRef]
  7. Font, A.; Soriano, L.; Pinheiro, S.M.D.M.; Tashima, M.M.; Borrachero, M.V.B.; Paya, J. Design and properties of 100% waste-based ternary alkali-activated mortars: Blast furnace slag, olive-stone biomass ash and rice husk ash. J. Clean. Prod. 2020, 243, 118568. [Google Scholar] [CrossRef]
  8. Hu, Y.; Tang, Z.; Li, W. Physical-mechanical properties of fly ash/GGBFS geopolymer composites with recycled aggregates. Constr. Build. Mater. 2019, 226, 139–151. [Google Scholar] [CrossRef]
  9. Guo, X.; Yang, J. Intrinsic properties and micro-crack characteristics of ultra-high toughness fly ash/steel slag based geopolymer. Constr. Build. Mater. 2020, 230, 116965. [Google Scholar] [CrossRef]
  10. Rutkowska, G.; Wichowski, P.; Fronczyk, J. Use of fly ashes from municipal sewage sludge combustion in production of ash concretes. Constr. Build. Mater. 2018, 188, 874–883. [Google Scholar] [CrossRef]
  11. Thi, K.-D.T.; Liao, M.-C.; Vo, D.-H. The characteristics of alkali-activated slag-fly ash incorporating the high volume wood bottom ash: Mechanical properties and microstructures. Constr. Build. Mater. 2023, 394, 132240. [Google Scholar]
  12. Athira, V.S.; Charitha, V.; Athira, G.; Bahurudeen, A. Agro-waste ash based alkali-activated binder: Cleaner production of zero cement concrete for construction. J. Clean. Prod. 2021, 286, 125429. [Google Scholar] [CrossRef]
  13. Liu, Y.; Shi, C.; Zhang, Z. An overview on the reuse of waste glasses in alkali-activated materials. Resour. Conserv. Recycl. 2019, 144, 297–309. [Google Scholar] [CrossRef]
  14. Tchakoute, H.K.; Ruscher, C.H.; Kong, S. Thermal behavior of metakaolin-based geopolymer cements using sodium waterglass from rice husk ash and waste glass as alternative activators. Waste Biomass Valoriz. 2017, 3, 573–584. [Google Scholar] [CrossRef]
  15. Vafaei, M.; Allahverdi, A.; Dong, P. Acid attack on geopolymer cement mortar based on waste-glass powder and calcium aluminate cement at mild concentration. Constr. Build. Mater. 2018, 193, 363–372. [Google Scholar] [CrossRef]
  16. Almusaed, A.; Yitmen, I.; Myhren, J.A.; Almssad, A. Assessing the Impact of Recycled Building Materials on Environmental Sustainability and Energy Efficiency: A Comprehensive Framework for Reducing Greenhouse Gas Emissions. Buildings 2024, 14, 1566. [Google Scholar] [CrossRef]
  17. Islam, N.; Sandanayake, M.; Muthukumaran, S.; Navaratna, D. Review on Sustainable Construction and Demolition Waste Management-Challenges and Research Prospects. Sustainability 2024, 16, 3289. [Google Scholar] [CrossRef]
  18. Marvila, M.T.; Azevedo, A.R.G.D.; Matos, P.R.D.; Monteiro, S.N.; Vieira, C.M.F. Rheological and the Fresh State Properties of Alkali-Activated Mortars by Blast Furnace Slag. Materials 2021, 14, 2069. [Google Scholar] [CrossRef] [PubMed]
  19. Liu, J.; Yu, C.; Shu, X.; Ran, Q.; Yang, Y. Recent Advance of Chemical Admixtures in Concrete. Cem. Concr. Res. 2019, 124, 105834. [Google Scholar] [CrossRef]
  20. Scrivener, K.L.; John, V.M.; Gartner, E.M. Eco-efficient cements: Potential economically viable solutions for a low-CO2 cement-based materials industry. Cem. Concr. Res. 2018, 114, 2–26. [Google Scholar] [CrossRef]
  21. Marvila, M.T.; Azevedo, A.R.G.D. Activated alkali cement based on blast furnace slag: Effect of curing type and concentration of Na2O. J. Mater. Res. Technol. 2023, 23, 4551–4656. [Google Scholar] [CrossRef]
  22. Provis, J.L. Activating solution chemistry for geopolymers. In Geopolymers; Elsevier: Amsterdam, The Netherlands, 2009; pp. 50–71. [Google Scholar]
  23. Robayo-Salazar, R.; Jesus, C.; de Gutierrez, R.M. Alkali-activated binary mortar based on natural volcanic pozzolan for repair applications. J. Build. Eng. 2019, 25, 100785. [Google Scholar] [CrossRef]
  24. Nuaklong, P.; Sata, V.; Chindaprasirt, P. Properties of metakaolin-high calcium fly ash geopolymer concrete containing recycled aggregate from crushed concrete specimens. Constr. Build. Mater. 2018, 161, 365–373. [Google Scholar] [CrossRef]
  25. Provis, J.L.; Deventer, J.S.J.V. Alkali Activated Materials: State-of-the-Art Report; RILEM TC 224-AAM; Springer: Berlin/Heidelberg, Germany, 2014. [Google Scholar]
  26. Panda, B.; Ruan, S.Q.; Unluer, C. Investigation of the properties of alkali-activated slag mixes involving the use of nanoclay and nucleation seeds for 3D printing. Compos. Part B Eng. 2020, 186, 107826. [Google Scholar] [CrossRef]
  27. Shi, W.; Ren, H.; Huang, X. Low cost red mud modified graphitic carbon nitride for the removal of organic pollutants in wastewater by the synergistic effect of adsorption and photocatalysis. Sep. Purif. Technol. 2020, 237, 116477. [Google Scholar] [CrossRef]
  28. Li, P.; Chen, D.; Jia, Z.; Li, Y.; Li, S.; Yu, B. Effects of Borax, Sucrose, and Citric Acid on the Setting Time and Mechanical Properties of Alkali-Activated Slag. Materials 2023, 16, 3010. [Google Scholar] [CrossRef] [PubMed]
  29. DeRousseau, M.A.; Kasprzyk, J.R.; Srubar, W.V. Computational design optimization of concrete mixtures: A review. Cem. Concr. Res. 2018, 109, 42–53. [Google Scholar] [CrossRef]
  30. Soudki, K.A.; El-Salakawy, E.F.; Elkum, N.B. Full Factorial Optimization of Concrete Mix Design for Hot Climates. J. Mater. Civ. Eng. 2001, 13, 427–433. [Google Scholar] [CrossRef]
  31. Dolado, J.S.; Van Breugel, K. Recent advances in modeling for cementitious materials. Cem. Concr. Res. 2011, 41, 711–726. [Google Scholar] [CrossRef]
  32. Ford, E.; Kailas, S.; Maneparambil, K.; Neithalath, N. Machine learning approaches to predict the micromechanical properties of cementitious hydration phases from microstructural chemical maps. Cem. Concr. Res. 2020, 265, 120647. [Google Scholar] [CrossRef]
  33. Kong, D.L.Y.; Sanjayan, J.G.; Sagoe-Crentsil, K. Comparative performance of geopolymers made with metakaolin and fly ash after exposure to elevated temperatures. Cem. Concr. Res. 2007, 37, 1583–1589. [Google Scholar] [CrossRef]
  34. Kong, D.L.Y.; Sanjayan, J.G. Effect of elevated temperatures on geopolymer paste, mortar and concrete. Cem. Concr. Res. 2010, 40, 334–339. [Google Scholar] [CrossRef]
  35. Vargas, A.S.D.; Dal Molin, D.C.C.; Vilela, A.C.F.; Silva, F.J.D.; Pavão, B.; Veit, H. The effects of Na2O/SiO2 molar ratio, curing temperature and age on compressive strength, morphology and microstructure of alkali-activated FA-based geopolymers. Cem. Concr. Compos. 2011, 33, 653–660. [Google Scholar] [CrossRef]
  36. Nguyen, K.T.; Nguyen, Q.D.; Le, T.A.; Shin, J.; Lee, K. Analyzing the compressive strength of green fly ash based geopolymer concrete using experiment and machine learning approaches. Constr. Build. Mater. 2020, 247, 118581. [Google Scholar] [CrossRef]
  37. Hadi, M.N.S.; Al-Azzawi, M.; Yu, T. Effects of fly ash characteristics and alkaline activator components on compressive strength of fly ash-based geopolymer mortar. Constr. Build. Mater. 2018, 175, 41–54. [Google Scholar] [CrossRef]
  38. Thomas, R.J.; Peethamparan, S. Stepwise regression modeling for compressive strength of alkali-activated concrete. Constr. Build. Mater. 2017, 141, 315–324. [Google Scholar] [CrossRef]
  39. Lokuge, W.; Wilson, A.; Gunasekara, C.; Law, D.W.; Setunge, S. Design of fly ash geopolymer concrete mix proportions using Multivariate Adaptive Regression Spline model. Constr. Build. Mater. 2018, 166, 472–481. [Google Scholar] [CrossRef]
  40. Zhang, L.V.; Zhang, C.; Zhang, J.; Wang, L.; Wang, F. Effect of composition and curing on alkali activated fly ash-slag binders: Machine learning prediction with a random forest-genetic algorithm hybrid model. Constr. Build. Mater. 2023, 366, 129940. [Google Scholar] [CrossRef]
  41. Zhang, L.V.; Marani, A.; Nehdi, M.L. Chemistry-informed machine learning prediction of compressive strength for alkali-activated materials. Constr. Build. Mater. 2022, 316, 126103. [Google Scholar] [CrossRef]
  42. Xu, J.; Zhao, X.; Yu, Y.; Xie, T.; Yang, G.; Xue, J. Parametric sensitivity analysis and modelling of mechanical properties of normal- and high-strength recycled aggregate concrete using grey theory, multiple nonlinear regression and artificial neural networks. Constr. Build. Mater. 2019, 211, 479–491. [Google Scholar] [CrossRef]
  43. Nguyen-Sy, T.; Wakim, J.; To, Q.D.; Vu, M.N.; Nguyen, T.D. Predicting the compressive strength of concrete from its compositions and age using the extreme gradient boosting method. Constr. Build. Mater. 2020, 260, 119757. [Google Scholar] [CrossRef]
  44. Zhang, M.; Li, M.; Shen, Y.; Ren, Q.; Zhang, J. Multiple mechanical properties prediction of hydraulic concrete in the form of combined damming by experimental data mining. Constr. Build. Mater. 2019, 207, 661–671. [Google Scholar] [CrossRef]
  45. ASTM C595-08a; Specification for Blended Hydraulic Cements. ASTM International: West Conshohocken, PA, USA, 2003.
  46. Aliabdo, A.A.; Elmoaty, A.E.M.A.; Emam, M.A. Factors affecting the mechanical properties of alkali activated ground granulated blast furnace slag concrete. Constr. Build. Mater. 2019, 197, 339–355. [Google Scholar] [CrossRef]
  47. Manjunath, G.S.; Giridhar, C.; Jadhav, M. Compressive strength development in ambient cured geo-polymer mortar. Int. J. Earth Sci. Eng. 2011, 6, 830–834. [Google Scholar]
  48. Castillo, H.; Collado, H.; Droguett, T.; Vesely, M.; Garrido, P.; Palma, S. State of the art of geopolymer: A review. e-Polymers 2022, 22, 108–124. [Google Scholar] [CrossRef]
  49. Ruder, S. An overview of gradient descent optimization algorithms. arXiv 2016, arXiv:1609.04747. [Google Scholar]
  50. Quinlan, J.R. Induction of Decision Trees. Mach. Learn. 1986, 1, 81–106. [Google Scholar] [CrossRef]
  51. Breiman, L. Classification and Regression Trees; Wiley: Hoboken, NJ, USA, 1984. [Google Scholar]
  52. Material and Chemical Research Laboratories, Industrial Technology Research Institute. MACSiMUM. Available online: https://www.macsimum.org/ (accessed on 19 February 2020).
  53. ASTM C109/C109M-20; Standard Test Method for Compressive Strength of Hydraulic Cement Mortars (Using 2-in. or [50-mm] Cube Specimens). ASTM International: West Conshohocken, PA, USA, 2020.
  54. Waskom, M.L. seaborn: Statistical data visualization. J. Open Source Softw. 2021, 6, 3021. [Google Scholar] [CrossRef]
  55. Ratnasari, D.; Toresano, L.O.H.Z.; Pawiro, S.A.; Soejoko, D.S. The correlation between effective renal plasma flow (ERPF) and glomerular filtration rate (GRF) with renal scintigraphy 99mTc-DTPA study. J. Phys. Conf. Ser. 2016, 694, 012062. [Google Scholar] [CrossRef]
  56. Breiman, L. Random Forest. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  57. Levy, K.Y. k-Nearest Neighbors: From Global to Local. In Proceedings of the 30th Conference on Neural Information Processing Systems, Barcelona, Spain, 5–10 December 2016. [Google Scholar]
  58. Pedregosa, F.; Varoquaux, G.; Gramfort, A. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  59. Cong, P.; Cheng, Y. Advances in geopolymer materials: A comprehensive review. J. Traffic Transp. Eng. 2021, 8, 283–314. [Google Scholar] [CrossRef]
  60. Pruksawana, S.; Lambardc, G.; Samitsua, S.; Sodeyamac, K.; Naitoa, K. Prediction and optimization of epoxy adhesive strength from a small dataset through active learning. Sci. Technol. Adv. Mater. 2019, 20, 1010–1021. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Flowchart of the proposed approach for data processing.
Figure 1. Flowchart of the proposed approach for data processing.
Materials 17 03141 g001
Figure 2. Schematic structure for the 5-fold CV.
Figure 2. Schematic structure for the 5-fold CV.
Materials 17 03141 g002
Figure 3. Pair plot of the initial dataset.
Figure 3. Pair plot of the initial dataset.
Materials 17 03141 g003
Figure 4. Pearson correlation matrix of the initial dataset.
Figure 4. Pearson correlation matrix of the initial dataset.
Materials 17 03141 g004
Figure 5. Correlation scatter plots of composition transformation dataset obtained for the ML models employed. Blue color: predicted strength (5-fold cross-validation); (a) SGD, (b) DT, (c) RF, (d) K-NN. Red color: actual strength.
Figure 5. Correlation scatter plots of composition transformation dataset obtained for the ML models employed. Blue color: predicted strength (5-fold cross-validation); (a) SGD, (b) DT, (c) RF, (d) K-NN. Red color: actual strength.
Materials 17 03141 g005
Figure 6. Bar chart of the model performance on each dataset. (a) MAE, (b) RMSE, and (c) R2.
Figure 6. Bar chart of the model performance on each dataset. (a) MAE, (b) RMSE, and (c) R2.
Materials 17 03141 g006
Figure 7. Distribution of the compressive strength of the recommended recipes.
Figure 7. Distribution of the compressive strength of the recommended recipes.
Materials 17 03141 g007
Figure 8. Distribution of the initial data (black), proposal 1 (red), proposal 2 (blue), and proposal 3 (gray).
Figure 8. Distribution of the initial data (black), proposal 1 (red), proposal 2 (blue), and proposal 3 (gray).
Materials 17 03141 g008
Table 1. Application of ML approaches for property predictions.
Table 1. Application of ML approaches for property predictions.
Material UsedFeaturesAlgorithm UsedData PointsPrediction PropertiesRef.
FA3ANN180CS[37]
FA, slag, cement3CNN>5000CS[38]
FA4MARS154CS[39]
FA, cement, agg.9BR, GD, DT, GB645CS, STS, EM, UTS, DSR[40]
Slag, FA5SVR, RF, ET, GB676CS[41]
RC, agg.13RF, SVM526EM[42]
Cement, BFS, FA, agg.8ANN, XGB, SVM100CS[43]
BFS, FA2GA894FST, UCS[44]
BFS, FA, RS, WGs 27SGD, DT, RF, K-NN 42CSThis work
Notes: Aggregate: Agg.; Recycled concrete: RC; Splitting tensile: STS; Elastic modulus: EM; Ultimate tensile strain: UTS; Dry shrinkage rate: DSR; Final setting time: FST; Uniaxial compressive strength: UCS; ANN: Artificial neural network; Convolutional Neural Network: CNN; Multivariate Adaptive Regression Splines: MARS; Bidirectional Recognition: BR; Gradient descent: GD; Gradient Boosting: GB; Support Vector Machine: SVM; eXtreme Gradient Boosting: XGB; Genetic Algorithm: GA; DT: Decision tree; RF: Random Forest; Extra trees: ET; Stochastic Gradient Descent: SGD; K-nearest neighbors: K-NN.
Table 2. Chemical composition (wt. %) of the BFS, FA, RS, WG as determined by XRF-analysis.
Table 2. Chemical composition (wt. %) of the BFS, FA, RS, WG as determined by XRF-analysis.
Composition (%)Raw Materials
BFSFA aFA bFA cRS dRS eRS fWG gWG hWG iWG j
Fe2O30.309.903.875.490.880.311.29ND0.100.060.43
Al2O315.0020.6028.4621.363.302.197.685.8018.7018.502.48
SiO233.6054.7058.3538.6125.225.17.5853.8057.9058.0869.03
K2OND1.501.191.24NDND0.052.80ND0.120.76
Na2OND1.200.670.94NDNDND5.80NDND12.87
CaOND6.802.9521.9456.8061.1841.751.906.905.9711.88
MgO6.101.600.971.089.804.5939.781.501.802.011.50
Notes: FA a–c: These fly ashes refer to the particulate matter collected during the combustion process at different three power plants; RS d–f: These reduction slags refer to the waste slag collected during the production processes at different three steel plants; WG g–j: These waste glasses refer to the waste generated during the manufacturing processes of different four glass plants; ND: Not detected.
Table 3. Statistical characteristics of the variables in the initial dataset.
Table 3. Statistical characteristics of the variables in the initial dataset.
VariableUnitMin.Max.AverageStandard DeviationCategory
BFSg71.54436.60201.4898.55PAs
FAg0.00300.00133.3170.87PAs
WGg0.00654.74290.87163.69AAs
RSg0.00321.43130.4077.18AAs
NaOHmL129.54500.00266.96103.37AAs
t1min10.0030.0010.7128.33process
t2min53028.335.31process
Curing temperature°C259229.9514.33process
CSMPa0.3623.8411.856.02target feature
Table 4. Variable statistical characteristics of the composition feature classification dataset.
Table 4. Variable statistical characteristics of the composition feature classification dataset.
VariableMin.Max.AverageStandard Deviation
Fe2O3-PAs0.00090.04010.01480.0091
Al2O3-PAs0.04290.10810.07520.0136
SiO2-PAs0.09600.27640.17840.0365
K2O-PAsND0.00600.00220.0013
Na2O-PAsND0.00480.00170.0011
CaO-PAs0.05410.17260.12770.0251
MgO-PAs0.00940.02620.01980.0034
ZrO2-PAsND0.0001NDND
B2O3-PAsNDNDNDND
TiO2-PAsND0.00240.00030.0007
Bi2O3-PAsNDNDNDND
BaO-PAsND0.0002NDND
SrO-PAsND0.0003ND0.0001
Other-PAs0.01200.02230.01760.0028
Fe2O3-AAsND0.00380.00150.0010
Al2O3-AAs0.01240.09310.04340.0275
SiO2-AAs0.09690.35240.24060.0611
K2O-AAsND0.01330.00630.0053
Na2O-AAsND0.06420.01460.0140
CaO-AAs0.00500.24890.09450.0618
MgO-AAs0.00390.05620.01960.0118
ZrO2-AAs0.00010.01560.00820.0063
B2O3-AAsND0.07330.04960.0218
TiO2-AAsND0.00890.00400.0033
Bi2O3-AAsND0.04510.01680.0138
BaO-AAsND0.00490.00120.0019
SrO-AAsND0.03830.00410.0076
Other-AAs0.00050.01600.00630.0037
Note: ND: Not detected.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hsu, C.-H.; Chan, H.-Y.; Chang, M.-H.; Liu, C.-F.; Liu, T.-Y.; Chiu, K.-C. Efficient Compressive Strength Prediction of Alkali-Activated Waste Materials Using Machine Learning. Materials 2024, 17, 3141. https://doi.org/10.3390/ma17133141

AMA Style

Hsu C-H, Chan H-Y, Chang M-H, Liu C-F, Liu T-Y, Chiu K-C. Efficient Compressive Strength Prediction of Alkali-Activated Waste Materials Using Machine Learning. Materials. 2024; 17(13):3141. https://doi.org/10.3390/ma17133141

Chicago/Turabian Style

Hsu, Chien-Hua, Hao-Yu Chan, Ming-Hui Chang, Chiung-Fang Liu, Tzu-Yu Liu, and Kuo-Chuang Chiu. 2024. "Efficient Compressive Strength Prediction of Alkali-Activated Waste Materials Using Machine Learning" Materials 17, no. 13: 3141. https://doi.org/10.3390/ma17133141

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop