1. Introduction
It is estimated that the construction industry accounts for 39% of global carbon dioxide (CO
2) emissions [
1]. Cement alone makes up approximately 5% of global anthropogenic CO
2 emissions [
2] due to its high embodied carbon and concrete’s great consumption rates—concrete is the second most consumed material on the planet, second only to water. With the introduction of the United Nation’s (UN) Sustainable Development Goals (SDGs) and increasing national and global sustainability targets, the construction industry must adopt greener construction practices. One approach to reducing the embodied carbon of cement and concrete-based materials is through the partial substitution of cement with materials featuring reduced environmental impacts and cementitious properties, referred to as Supplementary Cementitious Materials (SCMs). Fly ash and slag, bi-products of coal burning and steel production, respectively, have been studied extensively and are commonly adopted as SCMs in the industry. Replacing high volumes of ordinary Portland cement (OPC) with SCMs is becoming increasingly common in the construction industry due to the high CO
2 emission savings that can be achieved.
Blending fly ash or slag with OPC leads to the creation of additional hydration products, particularly calcium silicate hydrate (C-S-H), which is the primary component of strength and durability in concrete. However, negative changes to the microstructure of the cement matrix may be observed, including larger pore volumes with a finer pore distribution. This can reduce durability performance due to the greater potential for chloride ion permeability and water absorption [
3]. Creep and shrinkage are important durability properties of concrete that significantly impact long-term performance and are influenced by the incorporation of mineral admixtures. Creep in concrete is a time-dependent, macroscopic deformation that occurs due to sustained loading [
4]. Total creep strain is characterised by two components: drying creep and basic creep. Basic creep is the component of total creep strain that grows with time under sustained loading and is not influenced by drying effects. The drying creep element is the portion of creep strain that occurs when loaded concrete loses moisture and dries. Substitution of OPC with SCMs has been found to have various effects on creep strain. Shariq et al. [
5] experimentally investigated creep and shrinkage strains in concrete containing Ground Granulated Blast-Furnace Slag (GGBFS) at replacement ratios of 20%, 40% and 60%. Creep and shrinkage strains were found to increase with GGBFS content. Average creep coefficients for 20%, 40% and 60% GGBFS concrete were 16.3%, 33.3% and 55.2% higher than concrete with no GGBFS concrete, respectively, after 150 days of loading. Conversely, many researchers have found that SCMs may lead to reduced creep strains. Qin et al. [
6] analysed creep and shrinkage in self-compacting concrete (SCC) containing recycled aggregates and various blends of fly ash, slag and silica fume and observed that incorporating combinations of SCMs led to reductions in creep strains. Similar behaviour has been reported for the inclusion of ultra-fine GGBFS and silica fume in high-performance concrete (HPC) by Jianyong and Yan [
7]. Creep strain more than halved when substituting 30% of OPC with ultra-fine GBFS and reduced by approximately 65% when a blend of silica fume and GBFS were used at 10% and 30% substitution ratios, respectively. Gedam et al. [
8] experimentally investigated four different HPC mixes that included either fly ash, silica fume or GGBFS as an SCM. After 500 days of loading, total creep in mixes containing GGBFS, fly ash and silica fume were 40.12%, 22.49%, and 57.11% lower, respectively, than that of the reference mix containing OPC only. Reduced creep due to SCM incorporation has been attributed to additional C-S-H gel formation, which effectively refines early-age micro-cracks and improves the physical performance of HPC [
8]. Chern and Chan [
9] examined basic and drying creep in blast furnace slag concrete. The experimental test results suggested that basic creep is lower in concrete containing GGBFS when compared to OPC concrete. However, drying creep and total deformation were found to increase when incorporating the SCM. It is clear from the above that the effect of SCMs on the time-dependent behaviour of concrete is complex and depends on a multitude of factors, including the material properties of cement and SCMs, mix design, and curing and mixing conditions.
The application of artificial intelligence (AI) in many fields is growing rapidly, including in engineering and material sciences [
10,
11]. Complex systems and behaviour such as concrete creep can be accurately predicted using machine learning (ML) algorithms trained on comprehensive datasets [
12,
13,
14]. Additionally, ML can be employed to elucidate underlying mechanisms governing behaviour. Molecular dynamics simulation is another emerging technique to predict material properties and behaviour [
15]. Artificial Neural Networks (ANNs) and regression analysis are commonly adopted ML techniques for civil engineering systems. The Infrastructure Technology Institute of Northwestern University’s (NU-ITI) concrete creep and shrinkage database [
16] is a valuable tool that can be harnessed for deep analysis of time-dependent behaviour in concrete. A number of researchers have utilised the NU-ITI dataset to develop accurate prediction models for concrete creep using ML techniques [
17,
18,
19,
20]. Developing creep models for sustainable concrete mixes using ML has also received some research attention. Back-propagation neural network (BPNN) and support vector machine (SVM) models were utilised by Rong et al. [
21] to investigate the drying creep of recycled aggregate concrete (RAC). Various ML models were compared for their suitability in predicting total creep in RAC by Feng et al. [
22,
23]. Few research works have attempted to analyse creep in concrete containing SCMs using ML. Sadowski et al. [
24] employed a firefly-algorithm (FA)-based ANN technique for modelling creep in concrete containing GGBFS. The developed model was highly accurate based on an assessment of the following statistical indicators: Correlation coefficient
, mean absolute error (MAE), mean squared error (MSE) and root mean square error (RMSE). However, the dataset utilised for ML model training was limited to 12 different mix designs and only 132 individual data points. All specimens were loaded at 28 days at 50% of first-crack strength, with loading sustained for 150 days. OPC replacement ratios ranged from 0–60% at increments of 20%. Gedam et al. [
25] applied ANNs to develop drying shrinkage and specific creep prediction models for HPC. A set of 106 experimental results, consisting of 2176 data points, was used to develop the dataset for model training for both drying shrinkage and creep prediction. It is unclear how many data points were specifically for creep measurements. The academic references used to source the data are listed, though many of them are not available in the open literature. Compressive strengths ranged from 20–83 MPa, and loading ages from 7–73 days. The developed models were able to accurately reflect creep behaviour and outperformed existing creep equations. A modified ACI model that considers the effect of SCMs on creep strain was developed by Chen et al. [
26] using particle-swarm optimisation. Liu et al. [
27] analysed creep in concrete containing SCMs using linear regression, support vector regression (SVR), random forest (RF) and extreme gradient boosting (XGB) techniques. A large database was constructed using experimental results reported in the literature. The influence of each adopted input parameter on creep prediction was investigated by conducting a sensitivity analysis based on the Shapley additive explanation (SHAP). The strength-to-stress ratio was determined to be the most influential input parameter, followed by time. This is likely due to compressive strength not being adopted as an input parameter directly and is instead reflected through applied stress and strength-to-stress ratio. Furthermore, the XBG model was found to accurately predict the creep compliance of SCM concrete and performed better than the other ML models considered. This study, however, did not consider curing conditions on creep behaviour aside from loading age.
In this paper, ML techniques are employed to predict and analyse creep in concretes containing SCMs, including fly ash, slag and silica fume. ANN, RF, and decision tree regression (DTR) are considered in this study as they are commonly adopted for concrete property prediction [
28]. Gaussian process regression (GPR) model is also considered, as the ML techniques mentioned above may lead to overfitting. This paper builds on the existing literature by considering high volumes of cement substitution with fly ash and slag and through investigation of various input variables not considered in previous ML studies. Additionally, a sizable database is developed, consisting of creep data that are available either publicly or in the open literature. This study is necessitated by the increasing use of concrete with high-volume fly ash and/or slag as cement substitutes and the complexities associated with accurately modelling creep. The paper first presents a discussion on the development of the database and the selection of input variables. Subsequently, the background theory for each adopted ML technique is outlined. The performances of the ANN, RF, DTR and GPR models are then analysed and compared. Sensitivity and Shapley value additive (SHAP) analyses are also conducted, which assess the influence of input variables on model performance and creep behaviour, respectively.
2. Concrete Creep Database and Input Variables
A database was compiled by supplementing the Infrastructure Technology Institute of Northwestern University’s (NU-ITI) database for concrete creep and shrinkage [
16] with experimental results from the literature. Data relating to concrete containing fly ash and/or slag were first extracted from the NU-ITI dataset, including any related control tests. This included nine creep curves for concrete containing slag (185 data points) and six control creep curves (97 data points) that did not feature any slag content. For fly ash-based concrete, 64 curves (1111 data points) and 18 control curves (159 data points) were extracted. The following changes were made to filter and clean the extracted data;
‘Suspicious data’, as labelled in the NU-ITI database, was removed
Negative time and creep data points were removed
Tests conducted at temperatures exceeding 50 °C were removed
Data points at times days were removed
Corrections were made to data for tests conducted by Li and Yao [
7] (CT_id 1173 and 1174). Slag and superplasticiser quantities changed to 180 kg/m
3 and 9.6 kg/m
3, respectively.
Results by Vincent et al. [
29] (CT_id 1374 and 1375) removed. This was due to exceptionally large creep strains when compared to other experiments, which also appeared to differ from the source article.
Results by Collins et al. [
30] (CT_id 1140–1154) removed. This was due to exceptionally large creep strains when compared to other experiments.
Due to the small size of available data for slag concrete, the experimental results of Shariq et al. [
5] and Qin et al. [
6] were incorporated into the dataset. Shariq et al. [
5] experimentally investigated creep and shrinkage in GBFS concrete with cement substitution ranging from 0–60%. In total, 12 creep curves were reported, for which 3 were controls (0% GBFS). Manual extraction of these data resulted in the generation of 111 data points. Qin et al. [
6] analysed drying shrinkage and creep deformations in concrete containing combinations of fly ash, slag and silica fume. SCM replacement ratios of 50% and 75% were considered. Three control curves (35 data points), six curves for concrete containing fly ash and slag (69 data points) and two curves for concrete with fly ash only (28 data points) were added to the dataset. After making the aforementioned modifications, 465 data points were available for creep in concrete with slag or slag + fly ash as SCMs (including related control tests), and 991 data points for fly ash concrete. Thus, the database consisted of 1456 data points in total.
Across key properties relating to concrete creep modelling, there were missing data in the complied dataset for relative humidity during creep testing (
), relative humidity during preconditioning (
) and 28-day elastic modulus
. Missing data for
and
were estimated using the mode imputation method. The mode for
was 40% for the slag-based concrete data and 60% for fly ash concrete. The data were separated for mode determination to capture potential differences in local conditions between tests on slag and fly ash concrete. Note that only 31 and 129
data points were missing for slag and fly ash concrete, respectively. In total, 45 data points were missing for
data relating to slag concrete, and no data were missing for fly ash concrete. The mode for
was 98%. Median imputation was not applicable due to the partial categorical nature of the
and
data: 99% means sealed specimen, 100% means storage in water, 101% represents steam and 85% moist conditions. The 28-day elastic modulus was estimated using the following ACI-209 equation;
The ACI-209 equation was selected due to its common adoption and dependence on compressive strength only, which, for the case of this study, was convenient due to data availability. The creep compliance reported in the NU-ITI database was split into two groups: measurement of elastic strain at the beginning of the creep test and measurement of creep strain only. The majority of the slag concrete creep data did not record elastic strain. Conversely, most of the data for fly ash concrete did measure creep strain. During preliminary analyses, it was observed that model development using ML was more successful when elastic strain was considered. Hence, was utilised to estimate elastic strain for relevant data points, ensuring uniformity across the dataset. This approach introduces two potential sources of error: inaccurate estimation of and use of 28-day modulus for elastic strain calculation despite loading age. These errors are, however, assumed to be small. In total, only 142 data points (9.39% of the dataset) did not measure elastic strain, report 28-day elastic modulus and were not loaded at 28 days.
The adopted input variables and their ranges, mean and standard deviation are listed in
Table 1. Input variables selection considered the significance of creep, limitations of the available data and parameters required by codified creep equations. Histograms for each input parameter are shown in
Figure 1 and
Figure 2. It can be seen that the number of data points for time decreases exponentially as time increases. Most of the data were measured at times
days, with limited data available at
days. The majority of the data featured a loading age of 28 days (899 data points) or 7 days (405 data points). Only a few experiments adopted different loading ages.
ratios fell mostly into three ranges: 12.0, 20.0–22.22 and 30.0–33.5 mm. Ratios of 50.8 and 60 mm are also present in the data, albeit to a lesser extent. The most common relative humidity ranges were 40–45%, followed by 60–65% and 50–55%. A smaller number of tests were conducted on sealed specimens (99%) and on specimens stored in water (100%). The 28-day compressive strengths ranged from 15.5–188.9 MPa, with the 80–90 MPa band being the most frequently occurring in the dataset by a large margin. Consequently, water-to-binder ratios were commonly between 20–30% and elastic moduli between 40,000–45,000 MPa. There is a relatively uniform distribution of other strength grades. Slag substitution ratios varied from 0–60% of total binder content, see
Figure 2c. Note that 0% is omitted for clarity due to the high number of data points without slag content (1149). Higher replacement ratios (40–42% and 50–60%) are better represented in the data than lower ratios (20% and 30–35%). The fly ash-to-total binder ratios are shown in
Figure 2d and ranged from 0–50%. Unlike slag, lower substitution rations were more common for fly ash, with 10–20% appearing the most frequently followed by 30–32%. Only a small section of the dataset features ratios above 32%. In total, 464 data points featured silica fume content (SF), of which more than half (261 points) equated to 29.37 kg/m
3, with the remaining points evenly distributed from 34–89 kg/m
3. The impact of adopting relative humidity
H0 during preconditioning as an input variable on model accuracy is explored in the Sensitivity Analysis presented in
Section 5.
Table 2 compares the input variables adopted herein with those considered in studies in the literature on training ML models to predict the creep in concrete containing SCMs [
24,
25,
27] and the ACI and B3 creep equations. It can be seen that mix proportions, compressive strength,
,
, and
are commonly considered. Elastic modulus of concrete, at 28-days or at
, has not been adopted as an input variable by other studies, though it is considered by design equations. Additionally, the relative humidity of preconditioning is only considered herein. Various factors considered by design equations, including the curing method, cement type, air content and unit weight of concrete, could not be adopted here, and likely in other studies, due to lack of available information.
5. Sensitivity Analysis
The random selection of validation can influence the performance of the ML models. To examine this, the RQ and Matern 5/2 GPR models were trained an additional five times, and the changes in the accuracy indicators were examined. The results of this analysis are presented in
Table 4, revealing that the selected GPR models demonstrated consistent performance with small reductions in accuracy shown across multiple training repetitions. The sensitivity of the GPR models to input variables and validation methodology is also analysed. Training was conducted with consideration of relative humidity during preconditioning. The performance of the GPR models when considering this variable is provided in
Table 5. The results provided in
Table 3,
Table 4 and
Table 5 show that the accuracy indicators do not noticeably change when adopting
as an input variable and that these small changes are within variations observed due to training repetitions. Changing the validation methodology from holdout (30%) to cross-validation (5-fold) caused a small reduction in model performance based on the accuracy indicators. However, the accuracy of the trained models is still high. The additional input variable had a negligible impact on the accuracy of the GPR models when adopting cross-validation.
The effect of dataset size is an important parameter that requires further validation. This was examined herein by varying the size of the dataset used for model training and assessing the effect on prediction accuracy. The
R2 indicator is plotted against training data size, which ranges from 436 (30% of the total dataset) to 1019 data points (70% of the total dataset) for the RQ-GPR, ANN, DTR and RFR models, see
Figure 10a. Data points were removed at random to reduce dataset size. It can be seen that
R2 converges for the DTR, RFR and GPR models when the training data size reaches approximately 800 data points. The ANN model converges at approximately 860 data points. In this paper, the number of training data is 1019. Therefore, the training of ANN and DTR can achieve a high accuracy for concrete creep prediction. The
R2 for RFR is always above 0.98 for the data size from 436 to 1019. This indicates that RFR performs better when there is a limited sample size.
The random selection of data points may, however, not cause a significant reduction in accuracies of model prediction due to the input variable of time, which causes every creep curve to feature multiple data points. In order to investigate this, a second analysis was conducted where entire creep curves were removed at random as opposed to removing individual data points. The results of this analysis are presented in
Figure 10b for the RQ-GPR model. It can be seen that the effect of dataset size now has a much larger impact on model accuracy (
R2), though the model accuracy still plateaus at 80% of the dataset size, which validates the size of the dataset used for model development.
6. Shapley Additive Explanation (SHAP) Analysis
The significance of each input parameter on creep compliance prediction is assessed by conducting a SHAP analysis [
40]. This involves computing and analysing Shapley values for a set of query points across a dataset. Shapley values, a concept originally from cooperative game theory, assist with ML model interpretability and explain predictions by quantifying the contribution of each feature (input variable) to the prediction [
41]. Shapley values consider all possible combinations of inputs/features as well as interactions and dependencies between them. For a given prediction, the Shapley values (contributions) of each feature sum to the value of the prediction, and the Shapley value for a specific feature is the difference between the actual prediction and the mean prediction.
In order to ensure a high representation of the data, the query points were selected herein using 3–4 data points from every creep curve available in the dataset that included the initial time and final time of the creep test. The Shapley values were computed using the developed RQ-GPR model and the “explainer” function in MATLAB.
Figure 11a,b depicts the averaged Shapley values (absolute) and the Shapley value distribution, respectively, for slag concretes. In order of decreasing significance, the input variables are found to be
and
. Aside from time,
was the most influential variable and was mostly negatively correlated with creep compliance. This indicates that an increasing
results in decreasing creep strain. The averaged Shapley values for
and
were similar with both parameters featuring a strong influence on creep compliance prediction. The remaining input variables were noticeably less influential, particularly
and
. The lower importance of
and
may be attributed to their low variability in the dataset. Most tests on the slag concrete commenced at 28 days, and the
varied only from 45–65%.
Figure 12a depicts the averaged Shapley values (absolute), and
Figure 12b displays the Shapley value distribution for fly ash-based concretes. For fly ash concretes, the input variables by order of reducing influence are
and
. As with slag concrete,
and
were highly influential. However, the absolute Shapley values for
and
are substantially higher in fly ash concrete than in slag concrete. This is to be expected and is now reflected in the Shapley values due to the higher variability of these parameters in the dataset. Across both slag and fly ash datasets, the Shapley values for
and
are small. This suggests that these parameters are not as important in creep compliance prediction as the other parameters. Conversely, water content
shows high significance in both datasets. Slag content
was important for predicting creep in slag concrete, whereas fly ash content showed a low impact on creep for fly ash concrete. Liu et al. [
27] also observed the low importance of SCM content when compared to other variables on creep prediction in concrete through Shapley value additive analysis. However, the significance of slag content appears to be higher in the analysis presented herein. Interestingly, there are differences in the significance rankings of parameters
and
. Aggregate content was reported to have a major influence on creep compliance prediction, with the volume-to-surface ratio showing low importance in [
27]. The opposite effect of these two parameters is observed herein. Furthermore, time since loading was only the 6th most influential parameter in [
27], whereas in this analysis, it is determined to be the most significant. These differences may be due to the adoption of different input variables and datasets adopted for model training. For example, there is a higher maximum quantity of slag content in the dataset developed in this paper.
Figure 13 and
Figure 14 depict the dependence analysis of the Shapley values. This involves plotting the input parameters against their individual Shapley values, which enables an assessment to be conducted on the influence of parameter value on creep prediction. An obvious and expected trend is observed in
Figure 13a for time since loading, where the Shapley values rise with time in a fashion resembling a creep curve.
Figure 13d–f shows that increasing loading age, compressive strength and elastic modulus lead to a reduction in Shapley values, indicating that increasing these parameters reduces creep strain development. No clear trend can be seen for relative humidity and volume-to-surface ratio, aside from the negative Shapley values occurring at the maximum input values (
and
mm). These query points are the outliers (dots) shown in
Figure 12b, meaning the input values are greater than 1.5 times the interquartile range below the first quartile. Similar behaviour is shown for slag and fly ash, as the largest input values give negative Shapley scores without an evident trend occurring at lower input values. This suggests that high SCM ratios lead to a reduction in creep strain. However, this effect is likely eclipsed by the changes in compressive strength and elastic modulus that occur when increasing SCM content based on the significance analysis discussed earlier. There is no noticeable relationship between cement, total aggregate or silica fume content on Shapley values. Increasing water content is, however, shown to lead to an increase in Shapley values and, therefore, creep strain.
8. Conclusions
A creep compliance prediction model was developed for sustainable concretes containing slag and fly ash using ANN-, DTR-, RFR- and GPR-supervised ML algorithms. A dataset of concrete creep was constructed using data extracted from the NU-ITI creep database and from experimental results reported in the literature. The selected input variables were time, loading age, 28-day compressive strength, 28-day elastic modulus, relative humidity during creep testing, volume-to-surface ratio, cement content, water content, total aggregate mass, slag content, fly ash content and silica fume content. The effects of considering humidity during specimen preconditioning as an input variable were also explored. Mode imputation was conducted for missing data on relative humidity and humidity during preconditioning, whereas a 28-day elastic modulus was estimated using the AC1 equation when required. Model training utilised 70% of the dataset, while the remaining 30% was reserved for model verification. A sensitivity analysis was conducted on the GPR model to analyse changes in performance due to training repetitions, input variable selection and validation methodology. A SHAP analysis was performed to investigate the contribution, and hence significance, of each input variable on creep prediction. Additionally, the predictions of the trained ML models were compared to experimental data and the effect of varying input parameters was examined. The following conclusions have been drawn from this work;
RFR and GPR models were the best-performing ML techniques, achieving R-squared values of 0.99 for validation data. ANN and DTR also achieved accurate results and obtained R-squared values for validation data of 0.91 and 0.90, respectively.
The selection of kernel function for the GPR model was not found to greatly impact accuracy indicators. Exponential, squared exponential, Matern 5/2 and RQ kernel functions of the GPR model all demonstrated consistent and accurate performance.
The performance of the GPR model was proven to not be sensitive to training repetitions and random splitting of training and testing data. Only small reductions in accuracy indicators were observed across five training repetitions.
Consideration of humidity during preconditioning as an input variable for the GPR models did not noticeably improve or reduce accuracy.
Time since loading was the most significant parameter for creep prediction, followed by compressive strength, elastic modulus, volume-to-surface ratio, loading age, relative humidity and water content.
Cement content, aggregate content and humidity during preconditioning did not show great importance in creep prediction. Similarly, fly ash and silica fume content were found to have a low influence.
Slag content did, however, show a moderate importance level for creep modelling.
The RQ-GPR, RFR and ANN models accurately reflect creep behaviour. However, the DTR model cannot accurately produce concrete creep curves.