
Elastic Modulus Prediction of Ultra-High-Performance Concrete with Different Machine Learning Models

Chaohui Zhang, Peng Liu, Tiantian Song, Bin He, Wei Li and Yuansheng Peng

1 Shenzhen Metro Group Co., Ltd., Shenzhen 518026, China
2 College of Civil and Transportation Engineering, Shenzhen University, Shenzhen 518060, China
3 Department of Mechanical Engineering, Hunan University of Science and Technology, Xiangtan 425100, China
* Authors to whom correspondence should be addressed.
Buildings 2024, 14(10), 3184; https://doi.org/10.3390/buildings14103184
Submission received: 31 August 2024 / Revised: 2 October 2024 / Accepted: 4 October 2024 / Published: 6 October 2024
(This article belongs to the Section Building Energy, Physics, Environment, and Systems)

Abstract

Elastic modulus, a crucial parameter for assessing material stiffness and structural deformation, has recently become a popular target for data-driven prediction. However, research that systematically compares different machine learning models under the same conditions, especially for ultra-high-performance concrete (UHPC), remains limited. In this study, 10 different machine learning models were evaluated for their capacity to predict the elastic modulus of UHPC. The results showed that XGBoost achieved the highest accuracy with large training datasets, followed by KNNs. For smaller training datasets, Decision Tree exhibited the greatest accuracy, with XGBoost the second-best performing model. Linear regression displayed the lowest accuracy. Overall, XGBoost demonstrated the most potential for accurately predicting the elastic modulus of UHPC, particularly when a comprehensive dataset is available for model training. The optimized XGBoost also exhibited better predictive performance than fitting equations for different UHPC formulations. The findings of this study provide valuable insights for researchers and engineers working on the data-driven design and characterization of UHPC.

1. Introduction

Ultra-high-performance concrete (UHPC), as a new construction material, exhibits exceptional mechanical properties, including compressive strength, durability, and impact resistance [1,2]. The elastic modulus, a critical parameter that characterizes the stiffness of materials, is essential for evaluating the stiffness and creep of concrete structures. Measuring the mechanical strength of UHPC is relatively straightforward using standard concrete testing procedures; these tests are well established and are not overly sensitive to minor variations in UHPC mix design or testing conditions. However, determining the elastic modulus of UHPC is a more complex task. The stress–strain response of specimens must be measured precisely, typically using advanced instrumentation [3,4]. The challenge arises from the fact that the elastic modulus is heavily influenced by the microstructural composition and packing of UHPC constituents: subtle changes in mix proportions or curing conditions can significantly affect the elastic behavior. Additionally, the stress–strain response of UHPC is nonlinear even at low stress levels, due to its heterogeneous nature [5]. Accurately determining the tangent modulus, which represents the true elastic stiffness, therefore requires careful analysis and interpretation of the stress–strain data.
To address the challenges in predicting the elastic modulus of UHPC, researchers have developed various codes and standards that relate compressive strength to elastic modulus [6,7,8,9], and many have proposed empirical models tailored to specific conditions. Ma et al. [10] predicted the elastic modulus of UHPC with and without coarse aggregates and suggested that the elastic modulus is proportional to the 1/3 power of the compressive strength for strengths in the range of 150–180 MPa; the Association Française de Génie Civil (AFGC) obtained a similar relationship for heat-cured UHPC with compressive strengths above 140 MPa. Sritharan et al. [11] and Graybeal et al. [12,13] found that the elastic modulus of UHPC is proportional to the 1/2 power of the compressive strength. However, it remains difficult to predict the elastic modulus of UHPC across different mix designs. Zhang et al. [14] proposed a new computational mesoscale model generation method to predict the elastic modulus of concrete and achieved good prediction accuracy, but the computational cost of this model is significant, especially when multiple phases must be modeled. Ouyang et al. [15] developed an analytical model to analyze the effective elastic modulus of UHPC at different scales. Nonetheless, accurately modeling this relationship remains challenging, as variations in mix designs, raw materials, and curing methods across different regions can lead to different results.
On the other hand, machine learning has been successfully applied to various aspects of concrete, including mixture design [16,17,18], fiber pullout performance [19,20], mechanical performance [21,22,23,24], and durability [25,26]. For predicting the compressive strength of concrete, Naderpour et al. [27] used an artificial neural network (ANN) to assess recycled aggregate concrete (RAC), reporting regression values of 0.903 for training, 0.89 for validation, and 0.829 for testing, with optimal performance at epoch 5. Fan et al. [28] combined the Modified Andreasen and Andersen (MAA) model with a Genetic Algorithm-based ANN (GA-ANN) to study the compressive strength of UHPC. They developed an efficient graphical user interface (GUI), and their results indicated a correlation coefficient (R²) of over 0.95, with a mean squared error (MSE) of 0.004447. Khodaparasti et al. [29] employed an improved random forest method to investigate the compressive strength of concrete, achieving R², mean absolute error (MAE), and root mean square error (RMSE) values of 0.931, 3.21, and 4.71, respectively. Several models have also been developed to predict the elastic modulus of concrete. For example, Demir et al. [30] used artificial neural networks (ANNs) to predict the elastic modulus of normal-strength concrete (NSC) and high-performance concrete (HPC) and demonstrated the validity of the ANN compared with empirical models. Yan et al. [31] predicted the elastic modulus of NSC and HPC with a support vector machine (SVM) and reported a similar level of prediction error to the ANN model but with better generalization capability. Golafshani et al. [32] used four types of soft computing models, including ANN, fuzzy Takagi–Sugeno-Kang (TSK), support vector regression (SVR), and radial basis function neural network (RBFNN), to predict the elastic modulus of recycled aggregate concrete (RAC); the results suggested that ANN and SVR outperform fuzzy TSK and RBFNN. Golafshani et al. [32] also compared three automatic regression models, namely genetic programming (GP), artificial bee colony programming (ABCP), and biogeography-based programming (BBP), for predicting the elastic modulus of RAC and showed that BBP has the highest accuracy. Because the model tree (MT) is more accurate than regression-based models, Behnood et al. [33] developed the M5' tree algorithm to analyze elastic modulus prediction of RAC, demonstrating an 80% improvement in accuracy compared to other empirical models. Additionally, a new prediction model has been proposed to estimate the post-fire elastic modulus of circular recycled aggregate concrete-filled steel tubular stub columns [34]. It uses gene expression programming and experimental data from both heated and non-heated samples. The model improves accuracy in predicting structural behavior after fire exposure and performs better than existing models, considering key factors such as steel ratio, temperature, and material strength.
Despite significant advances in the use of machine learning models to predict the elastic modulus of various concrete types, a critical gap remains in understanding the comparative performance of these models specifically for UHPC. Most existing studies have focused on individual machine learning approaches, with limited research systematically comparing different models under the same conditions, particularly for UHPC. It is important to note, however, that the superiority of machine learning models over traditional forecasting or prediction methods is not a foregone conclusion: the performance of any given method can vary significantly depending on the specific dataset and problem at hand. In this manuscript, we aim to optimize the ML model to achieve good predictive accuracy for various datasets. This research develops ten distinct machine learning models for predicting the elastic modulus of UHPC and validates their effectiveness through a comprehensive analysis and comparison of their performance. Section 2 details the machine learning models employed, Section 3 describes the dataset preparation and the strategies used to address overfitting, and Section 4 offers an in-depth comparison of the prediction results, demonstrating the strengths and weaknesses of each model. The optimized XGBoost model is applied to predict the elastic modulus of different types of UHPC in Section 5. The overall conclusions are drawn in Section 6.

2. Methods

2.1. Linear Regression

Linear regression is generally used to predict a single dependent variable. With suitable basis functions, it can represent relationships between variables that are linear or nonlinear, such as concave, convex, or other polynomial forms. The equation of linear regression can be expressed as follows [35]:

$$Y = wx + b + \varepsilon$$

where Y is the sum of a "known" deterministic component, w is the coefficient, b represents the intercept, and $\varepsilon$ represents the combined effect of unobserved factors on Y. As shown in Figure 1, the least squares method [36] is used to minimize the error:

$$\min_{w,b} \sum_i \left( y_i - \hat{y}_i \right)^2 = \sum_i \left( y_i - (w x_i + b) \right)^2$$

To avoid over-fitting, a regularization term is introduced into Equation (2); for example, the LASSO minimization can be expressed as

$$\min_{w,b} \sum_i \left( y_i - (w x_i + b) \right)^2 + c \sum_j \left| w_j \right|$$

The root mean squared error (RMSE) is used to evaluate the models:

$$\mathrm{RMSE} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} \left( y_i - \hat{y}_i \right)^2}$$
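As a minimal illustration (not code from this study), the least-squares fit, the LASSO penalty, and the RMSE above can be reproduced with scikit-learn; the strength–modulus values below are hypothetical placeholders:

```python
import numpy as np
from sklearn.linear_model import LinearRegression, Lasso
from sklearn.metrics import mean_squared_error

# Hypothetical compressive strengths (MPa) and elastic moduli (GPa)
fc = np.array([[120.0], [140.0], [160.0], [180.0], [200.0]])
E = np.array([42.0, 45.5, 48.2, 50.1, 51.5])

# Ordinary least squares: minimizes sum_i (y_i - (w*x_i + b))^2
ols = LinearRegression().fit(fc, E)
rmse = np.sqrt(mean_squared_error(E, ols.predict(fc)))
print(f"w = {ols.coef_[0]:.4f}, b = {ols.intercept_:.2f}, RMSE = {rmse:.3f}")

# LASSO adds an L1 penalty on the coefficients to curb over-fitting
lasso = Lasso(alpha=0.1).fit(fc, E)
```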

2.2. Bayesian Ridge Regression

In the Bayesian Ridge Regression model [37], the minimization can be expressed as follows:

$$\arg\min_{\beta} \; \lVert y - X\beta \rVert_2^2 + \lambda \lVert \beta \rVert_2^2$$

In the Bayesian setting, the posterior distribution is estimated using Bayes' theorem [38]. For Ridge regression, a normal likelihood and a normal prior are assumed. After dropping the normalizing constant, the log-density of the normal distribution is

$$\log p(x \mid \mu, \sigma) = \log\!\left[ \frac{1}{\sigma\sqrt{2\pi}} \, e^{-\frac{1}{2}\left(\frac{x-\mu}{\sigma}\right)^2} \right] = \log\!\left[ \frac{1}{\sigma\sqrt{2\pi}} \right] - \frac{1}{2}\left(\frac{x-\mu}{\sigma}\right)^2 \propto -\frac{1}{\sigma^2} \lVert x - \mu \rVert_2^2$$

Hence, maximizing the normal log-likelihood with a normal prior is equivalent to minimizing the squared loss with a ridge penalty:

$$\arg\max_{\beta} \left[ \log N(y \mid X\beta, \sigma) + \log N(\beta \mid 0, \tau) \right] = \arg\min_{\beta} \left[ \frac{1}{\sigma^2} \lVert y - X\beta \rVert_2^2 + \frac{1}{\tau^2} \lVert \beta \rVert_2^2 \right]$$
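A short sketch of this correspondence using scikit-learn's BayesianRidge, which estimates the noise and weight precisions (the 1/σ² and 1/τ² above) from the data rather than fixing λ; the data are hypothetical:

```python
import numpy as np
from sklearn.linear_model import BayesianRidge

rng = np.random.default_rng(0)
X = rng.uniform(100, 220, size=(40, 1))          # hypothetical strengths (MPa)
y = 0.2 * X[:, 0] + 10 + rng.normal(0, 1.5, 40)  # hypothetical moduli (GPa)

model = BayesianRidge().fit(X, y)
# alpha_ is the estimated noise precision (1/sigma^2),
# lambda_ the estimated weight precision (1/tau^2)
print(model.alpha_, model.lambda_)

# The posterior also yields a predictive standard deviation
mean, std = model.predict(np.array([[160.0]]), return_std=True)
print(mean[0], std[0])
```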

2.3. ARD Regression

Automatic Relevance Determination (ARD) Regression [39] assumes that each weight has an individual variance:

$$p(w \mid \lambda) \sim N(0, \Lambda^{-1})$$

where $\Lambda = \mathrm{diag}(\lambda_1, \ldots, \lambda_H)$ and $\lambda_h \in \mathbb{R}^{+}$ are the precision parameters of the features. The minimization problem can be expressed as follows:

$$\min_{w} \; \alpha \lVert \Phi w - t \rVert^2 + w^{\top} \Lambda w$$

ARD Regression is a sparse Bayesian learning method. In contrast to Bayesian Ridge Regression, each coordinate $w_i$ has its own individual $\lambda_i$, so ARD Regression adapts well when only a few features are relevant.
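For illustration, scikit-learn's ARDRegression exposes the per-weight precisions λᵢ directly; in this hypothetical example only the first of five features is relevant, and its coefficient survives while the rest are pruned toward zero:

```python
import numpy as np
from sklearn.linear_model import ARDRegression

rng = np.random.default_rng(1)
X = rng.normal(size=(60, 5))                 # five candidate features (hypothetical)
y = 3.0 * X[:, 0] + rng.normal(0, 0.1, 60)   # only feature 0 drives the target

ard = ARDRegression().fit(X, y)
# Irrelevant features receive large precisions lambda_i,
# which shrinks their coefficients toward zero
print(ard.coef_.round(3))
print(ard.lambda_.round(1))
```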

2.4. RANSAC Regression

Random Sample Consensus (RANSAC) Regression, proposed by Fischler et al. [40], is implemented as follows [41]:
(1)
A minimum number of random points are selected to determine the model parameters.
(2)
Solve for the parameters of the model.
(3)
Determine how many points from the full set fit the model within a predefined tolerance.
(4)
If the fraction of inliers is sufficient, re-estimate the model using all identified inliers; otherwise, repeat steps 1 through 4 (a maximum of N times). RANSAC estimates the parameters of a mathematical model through iterations over a dataset containing outliers; it can therefore also be viewed as an approach to outlier detection. The steps above map directly onto the sketch that follows.
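A hedged sketch with scikit-learn's RANSACRegressor on hypothetical data containing one planted outlier (the `estimator` keyword assumes scikit-learn ≥ 1.1; older versions call it `base_estimator`):

```python
import numpy as np
from sklearn.linear_model import LinearRegression, RANSACRegressor

rng = np.random.default_rng(2)
X = rng.uniform(100, 220, size=(30, 1))
y = 0.2 * X[:, 0] + 10 + rng.normal(0, 0.5, 30)
y[0] = 5.0                                  # plant one gross outlier

ransac = RANSACRegressor(
    estimator=LinearRegression(),
    min_samples=2,                          # step (1): minimal random subset
    residual_threshold=2.0,                 # step (3): inlier tolerance
    max_trials=100,                         # step (4): at most N iterations
).fit(X, y)
print(ransac.inlier_mask_.sum(), "inliers found")
```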

2.5. ANN Regression

The linear regression feature can be expressed by

$$y_i = h(x_i, w) = w^{\top} x_i$$

The least squares error (loss) over the data is given by

$$L(w) = \sum_i \left( h(x_i, w) - y_i \right)^2$$

In ANN regression [42] (as shown in Figure 2), $x_{i1}, x_{i2}$ represent the features and $w_1, w_2$ denote the weights; the prediction is computed from their weighted sums. Gradient descent is used to minimize the error on the training data, that is,

$$\nabla_w L(w) = \left( \frac{\partial L(w)}{\partial w_1}, \frac{\partial L(w)}{\partial w_2} \right) = 2 \sum_i \left( h(x_i, w) - y_i \right) x_i$$

The update of w is given by

$$w \leftarrow w - \eta \, \nabla_w L(w)$$

where η represents the step size. After a set number of epochs, the weights w converge and yield the best fit. Different activation functions, including the unit step (threshold), sigmoid, piecewise linear, and Gaussian functions, are used to transform input data into output data.
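As a sketch of these update rules in practice, scikit-learn's MLPRegressor performs this gradient-descent loop internally when `solver="sgd"`; the architecture, step size, and data below are illustrative assumptions, not the settings of this study:

```python
import numpy as np
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(3)
X = rng.uniform(-1, 1, size=(200, 2))     # two features x_i1, x_i2
y = X[:, 0] ** 2 + 0.5 * X[:, 1]          # hypothetical nonlinear target

mlp = MLPRegressor(
    hidden_layer_sizes=(16, 16),
    activation="logistic",      # sigmoid, one of the activations listed above
    solver="sgd",               # gradient descent: w <- w - eta * grad L(w)
    learning_rate_init=0.05,    # step size eta
    max_iter=5000,
    random_state=0,
).fit(X, y)
print(mlp.loss_)                # final training loss after the epochs
```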

2.6. Support Vector Regression

In SVR, Vapnik [43] introduced slack variables $\xi_i, \xi_i^*$ into the constraints of the optimization problem, as shown in Figure 3. The minimization problem of SVR is formulated as follows:

$$\min \; \frac{1}{2} \lVert \omega \rVert^2 + C \sum_{i=1}^{n} \left( \xi_i + \xi_i^* \right)$$

s.t.

$$\begin{cases} y_i - f(x_i, \omega) \le \varepsilon + \xi_i \\ f(x_i, \omega) - y_i \le \varepsilon + \xi_i^* \\ \xi_i, \xi_i^* \ge 0, \quad i = 1, \ldots, n \end{cases}$$

In addition, the ε-insensitive loss function proposed by Vapnik [43] is used:

$$L_{\varepsilon}\left( y, f(x, \omega) \right) = \begin{cases} 0 & \text{if } \lvert y - f(x, \omega) \rvert \le \varepsilon \\ \lvert y - f(x, \omega) \rvert - \varepsilon & \text{otherwise} \end{cases}$$

The empirical risk is

$$R_{emp}(\omega) = \frac{1}{n} \sum_{i=1}^{n} L_{\varepsilon}\left( y_i, f(x_i, \omega) \right)$$

where ε is the insensitive margin; the loss coincides with the least-modulus loss when ε = 0 [43], which is used to assess the prediction performance of SVR under various noise densities. Kernel types such as "linear", "poly", and "Gaussian RBF" are specified to transform the data into forms that can be fitted linearly.
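A brief sketch of fitting SVR with the three kernel types named above (hypothetical data; the C and ε values are illustrative, not tuned):

```python
import numpy as np
from sklearn.svm import SVR

rng = np.random.default_rng(4)
X = rng.uniform(100, 220, size=(50, 1))
y = 0.2 * X[:, 0] + 10 + rng.normal(0, 1.0, 50)

# epsilon defines the insensitive tube: errors inside it cost nothing
for kernel in ("linear", "poly", "rbf"):
    svr = SVR(kernel=kernel, C=100.0, epsilon=0.5).fit(X, y)
    print(kernel, round(svr.score(X, y), 3))
```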

2.7. Decision Tree Regression

Decision Tree regression [44] fits a curve to data with additive noise and behaves similarly to a local linear regression approximation. To implement Decision Tree regression, the data are split into decision nodes based on the data features, with splits chosen by an impurity criterion such as entropy. For example, as shown in Figure 4, based on the selected splitting criterion, the output regions (R1 to R5) are obtained from the Decision Tree nodes.
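A minimal sketch of a regression tree on a noisy curve, where each leaf corresponds to one output region (R1 to R5 in Figure 4); the data and depth limit are illustrative assumptions:

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(5)
X = np.sort(rng.uniform(0, 5, size=(80, 1)), axis=0)
y = np.sin(X[:, 0]) + rng.normal(0, 0.1, 80)    # curve plus noise

# max_depth and min_samples_leaf prune the tree (see Section 3.2)
tree = DecisionTreeRegressor(max_depth=3, min_samples_leaf=5).fit(X, y)
# Each leaf holds a constant prediction for its region of the input space
print(tree.get_n_leaves(), "leaf regions")
```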

2.8. Random Forest Regression

To overcome the weak generalization ability of Decision Trees, Random Forest regression was developed [45]; the process is shown in Figure 5. In step 1, k random data points are selected from the training set x. In step 2, a Decision Tree is built on each of these k data points. In step 3, the number N of trees to build is chosen, and steps 1 and 2 are repeated. In step 4, after training, the final prediction is made by averaging the predictions from all individual regression trees.
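The four steps can be traced in scikit-learn's RandomForestRegressor; the sketch below (hypothetical data) verifies that the forest prediction equals the average over its individual trees:

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(6)
X = rng.uniform(100, 220, size=(60, 1))
y = 0.2 * X[:, 0] + 10 + rng.normal(0, 1.0, 60)

rf = RandomForestRegressor(
    n_estimators=100,      # step 3: number of trees N
    bootstrap=True,        # steps 1-2: each tree sees a random sample
    random_state=0,
).fit(X, y)

# Step 4: the forest prediction is the average of all tree predictions
avg = np.mean([t.predict(X[:1]) for t in rf.estimators_])
print(avg, rf.predict(X[:1])[0])
```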

2.9. XGBoost

XGBoost [46], a fast Gradient Boosted Decision Trees algorithm [47], is commonly used in Kaggle competitions. To learn the model as a set of functions, a regularized objective is minimized, as follows:

$$\mathrm{Obj}^{(t)} = \sum_{i=1}^{n} l\left( y_i, \hat{y}_i^{(t-1)} + f_t(x_i) \right) + \Omega(f_t)$$

where l represents a differentiable convex loss function, and $\hat{y}_i, y_i$ are the prediction and target values, respectively. Ω measures the complexity of the model, and $f_t$ is the regression tree added at step t. According to the Taylor expansion, Equation (11) can be rewritten as follows:

$$\mathrm{Obj}^{(t)} \simeq \sum_{i=1}^{n} \left[ l\left( y_i, \hat{y}_i^{(t-1)} \right) + g_i f_t(x_i) + \frac{1}{2} h_i f_t^2(x_i) \right] + \Omega(f_t)$$

where $g_i = \partial_{\hat{y}^{(t-1)}} \, l\left( y_i, \hat{y}^{(t-1)} \right)$ and $h_i = \partial_{\hat{y}^{(t-1)}}^2 \, l\left( y_i, \hat{y}^{(t-1)} \right)$. By removing the constant term, Equation (12) can be simplified as follows:

$$\mathrm{Obj}^{(t)} = \sum_{i=1}^{n} \left[ g_i f_t(x_i) + \frac{1}{2} h_i f_t^2(x_i) \right] + \Omega(f_t)$$

The complexity of the tree is defined as follows:

$$\Omega(f_t) = \gamma T + \frac{1}{2} \lambda \sum_{j=1}^{T} w_j^2$$

The final objective function is expressed by

$$\mathrm{Obj}^{(t)} = -\frac{1}{2} \sum_{j=1}^{T} \frac{G_j^2}{H_j + \lambda} + \gamma T$$

where $G_j = \sum_{i \in I_j} g_i$, $H_j = \sum_{i \in I_j} h_i$, and $I_j = \{ i \mid q(x_i) = j \}$, in which q is the tree structure. A greedy algorithm is used to add branches to the tree.
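In practice, the λ and γ of the objective above are exposed as XGBRegressor parameters; a hedged sketch with hypothetical data and illustrative settings:

```python
import numpy as np
import xgboost as xgb

rng = np.random.default_rng(7)
X = rng.uniform(100, 220, size=(80, 1))
y = 0.2 * X[:, 0] + 10 + rng.normal(0, 1.0, 80)

model = xgb.XGBRegressor(
    n_estimators=300,        # number of boosting rounds (trees f_t)
    learning_rate=0.05,
    max_depth=4,
    reg_lambda=1.0,          # lambda: L2 penalty on the leaf weights w_j
    gamma=0.1,               # gamma: cost of each additional leaf (gamma * T)
    objective="reg:squarederror",
).fit(X, y)
print(model.predict(X[:3]))
```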

2.10. KNNs

K-nearest neighbors (KNNs) is a non-parametric regression model [48] in which the prediction is the average of the values of the K nearest neighbors. The KNN algorithm uses feature similarity to predict new data points: first, the distance between the new point and all training points is calculated; then, the closest k points are selected based on that distance; finally, the prediction is computed as the average of their target values. Three distance functions, namely the Euclidean, Manhattan, and Hamming distances, can be used to measure the distance between the new point and the training points:

$$\text{Euclidean:} \quad \sqrt{\sum_{i=1}^{k} \left( x_i - y_i \right)^2}$$

$$\text{Manhattan:} \quad \sum_{i=1}^{k} \lvert x_i - y_i \rvert$$

$$\text{Hamming:} \quad D_H = \sum_{i=1}^{k} \lvert x_i - y_i \rvert, \quad \text{with } D = 0 \text{ if } x = y \text{ and } D = 1 \text{ if } x \ne y$$

To obtain the best prediction, it is important to choose an optimal value of k. In general, a larger k yields smoother predictions because it averages out noise; the compromise is that distinct boundaries within the feature space become blurred. Cross-validation provides a way to determine a good k by validating candidate values on an independent dataset, as sketched below.
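A short sketch of selecting k by cross-validation (hypothetical data; the candidate k values and fold count are illustrative):

```python
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsRegressor

rng = np.random.default_rng(8)
X = rng.uniform(100, 220, size=(60, 1))
y = 0.2 * X[:, 0] + 10 + rng.normal(0, 1.0, 60)

# Score each candidate k on held-out folds and keep the best
for k in (1, 3, 5, 7, 9):
    knn = KNeighborsRegressor(n_neighbors=k, metric="euclidean")
    rmse = -cross_val_score(
        knn, X, y, cv=5, scoring="neg_root_mean_squared_error"
    ).mean()
    print(k, round(rmse, 3))
```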

3. Dataset Preparation and Strategies to Address Overfitting

3.1. Dataset Preparation

All models were implemented as regression models, with the dataset consisting of discrete values for both the features and the target variable. Prediction error was quantified using relative errors, and the R² score was employed to evaluate the performance of the optimized XGBoost model. The dataset used to train the machine learning models in this study was sourced from previous works [49,50,51]. The data were collected from multiple experimental studies focused on the compressive strength of concrete; each data point represents a specific concrete sample tested under varying conditions, including different mix proportions, curing times, and testing environments. The compressive strength values in the dataset span a wide range, typically from 100 MPa to 220 MPa. The dataset was normalized by Z-score normalization, a technique that rescales the data to a mean of 0 and a standard deviation of 1:

$$Z = \frac{x - \mu}{\sigma}$$

where Z, x, μ, and σ represent the Z-score, the original value, and the mean and standard deviation of the dataset, respectively. The R² score, also known as the coefficient of determination, was used to evaluate the performance of the optimized XGBoost in Section 5.
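A minimal sketch of both steps: Z-score normalization via StandardScaler (which implements Z = (x − μ)/σ) and the R² score; all numbers are hypothetical placeholders:

```python
import numpy as np
from sklearn.metrics import r2_score
from sklearn.preprocessing import StandardScaler

X_train = np.array([[120.0], [150.0], [180.0], [210.0]])  # hypothetical MPa
scaler = StandardScaler().fit(X_train)      # learns mu and sigma
Z = scaler.transform(X_train)               # Z = (x - mu) / sigma
print(Z.mean(), Z.std())                    # ~0 and ~1 by construction

# R2 (coefficient of determination) compares predictions to true targets
y_true = np.array([45.0, 48.0, 51.0, 53.0])
y_pred = np.array([44.5, 48.6, 50.2, 53.4])
print(r2_score(y_true, y_pred))
```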

3.2. Strategies to Address Overfitting

The mitigation of overfitting in machine learning models is achieved through the application of various techniques, including regularization in linear models, Bayesian Ridge, ARD, SVR, and XGBoost; data sampling in RANSAC and Random Forest; pruning in Decision Trees; and the selection of optimal neighbors in KNNs. Additionally, methodologies such as dropout and early stopping are employed in ANN to enhance model robustness and generalization across different datasets.

4. Results and Discussion

Machine learning models were applied to predict the elastic modulus of UHPC using the dataset given in [49], which provides a sufficient amount of data; the number of training data points was set to 50. Figure 6 and Figure 7 show the elastic modulus predictions and the corresponding relative errors for each model. XGBoost yields the best predictions, while linear regression shows the worst predictions when the compressive strength is less than 200 MPa, and the Decision Tree has the lowest accuracy of all models when the compressive strength is greater than 200 MPa. SVR-linear gives predictions similar to linear regression and lower accuracy than the other models. RANSAC regression exhibits better accuracy than the other linear models (Linear, Bayesian Ridge, and ARD regression). Random Forest provides more stable predictions than Decision Tree. The other models, including ANN, KNNs, SVR-poly, and SVR-rbf, exhibited a similar level of prediction accuracy. The machine learning results indicate that the increase in the elastic modulus of UHPC tends to stabilize once the compressive strength reaches 197 MPa. Furthermore, in the experimental data, the elastic modulus is 62.1 GPa at a compressive strength of 206 MPa, markedly greater than the 55.9 GPa measured at 205 MPa, and the prediction at 206 MPa consequently shows a large error, as shown in Table S1 (SI).
Another case, using the small data sample given in [50] to predict the elastic modulus of UHPC with machine learning, is shown in Figure 8 and Figure 9. In this case, all data were used as training data, and the results indicate that the Decision Tree shows the lowest relative error because every data point is a tree node. XGBoost is the next best, while the linear models (linear, Bayesian Ridge, ARD, and RANSAC regression) achieve lower prediction accuracy than the other models. Hence, linear models are unsuitable for predicting the elastic modulus of UHPC. ANN, SVR-linear, and Random Forest exhibited unstable accuracy. The relative errors of SVR-poly, SVR-rbf, and KNNs were below 2%, as listed in Table S2 (SI).
Random Forest regression, ANN regression, and the K-nearest neighbors algorithm have unstable prediction errors and large deviations in their predictions. Among the machine learning models, linear regression has the worst prediction accuracy, followed by SVR-linear. This also indicates that the elastic modulus of UHPC has a nonlinear relationship with compressive strength, and that it is difficult to accurately predict the elastic modulus of UHPC from small sample data using linear regression-like models.
Based on the sufficient data, the fitted equations for the elastic modulus of UHPC were as follows:

$$\text{Data fitting:} \quad E_{\text{experi}} = -1860 \, f_c^{-1.6} + 59.23$$

$$\text{XGBoost predicted fitting:} \quad E_{\text{predi}} = -1974 \, f_c^{-1.6} + 59.62$$

Based on the small sample data, the fitted equations for the elastic modulus of UHPC are as follows:

$$\text{Data fitting:} \quad E_{\text{experi}} = 3.326 \sqrt{f_c} + 7.106$$

$$\text{XGBoost predicted fitting:} \quad E_{\text{predi}} = 3.432 \sqrt{f_c} + 5.686$$

The experimental data, XGBoost predictions, and fitted curves are shown in Figure 10. XGBoost predicts the elastic modulus of the different types of UHPC very well. For the prediction with large sample data, the difference between predicted and tested values in the compressive strength range of 205–208 MPa is large, which stems from the scatter of the test data in this range, where the same compressive strength corresponds to several different elastic modulus values and thus affects the prediction. The difference in the relationship between elastic modulus and compressive strength across the UHPCs is mainly due to their different compositions.

5. Optimization of XGBoost Prediction

To validate XGBoost, data were collected from previous works [49,51] (see Table S3 (SI)). The different specimen names were converted to numbers ranging from 0 to 7, as shown in Table S3 (SI). Hyperopt was implemented to optimize XGBoost; the hyperparameter search ranges and the best parameters are listed in Table 1.
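A condensed sketch of how Hyperopt can drive this search (assumed setup: the random X and y below merely stand in for the Table S3 data, and only three of the Table 1 hyperparameters are shown):

```python
import numpy as np
import xgboost as xgb
from hyperopt import Trials, fmin, hp, tpe
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(9)
X = rng.uniform(100, 220, size=(80, 2))     # placeholder for Table S3 data
y = 0.2 * X[:, 0] + rng.normal(0, 1.0, 80)

space = {   # search ranges mirroring Table 1
    "learning_rate": hp.loguniform("learning_rate", np.log(1e-3), np.log(0.1)),
    "max_depth": hp.quniform("max_depth", 3, 10, 1),
    "subsample": hp.uniform("subsample", 0.5, 1.0),
}

def objective(params):
    model = xgb.XGBRegressor(
        n_estimators=500,
        learning_rate=params["learning_rate"],
        max_depth=int(params["max_depth"]),
        subsample=params["subsample"],
    )
    # Minimize the cross-validated RMSE
    return -cross_val_score(
        model, X, y, cv=5, scoring="neg_root_mean_squared_error"
    ).mean()

best = fmin(fn=objective, space=space, algo=tpe.suggest,
            max_evals=50, trials=Trials())
print(best)
```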
Figure 11 illustrates the performance of the XGBoost model in predicting the elastic modulus (in GPa), showing both prediction results and errors. Figure 11a compares the predicted and true elastic modulus values, with green points for the training set (R² = 0.9858) and blue points for the test set (R² = 0.8925). The red dashed line represents the ideal fit; most points cluster around it, demonstrating strong prediction accuracy, although the test points exhibit more variability. Figure 11b displays the prediction errors (true minus predicted) as a percentage, with the zero-error line in red. Both training and test errors are generally small, but the test-set errors increase slightly at higher modulus values, reflecting some overfitting on the training data. Overall, the model demonstrates high accuracy, particularly on the training data.
Figure 12 presents the feature importance and residuals distribution for the XGBoost model used to predict the elastic modulus. Figure 12a displays a bar chart of feature importance, where "compressive strength" has a substantially higher F-score (664.0) than "specimen number" (220.0), indicating that compressive strength has the greater influence on the XGBoost predictions. Figure 12b shows a histogram of residuals (true minus predicted) for the training and testing sets: green bars represent training errors, blue bars testing errors, and the red dashed line marks zero error (perfect prediction). The majority of errors are concentrated around zero for both datasets, with a slightly broader distribution for the test set, demonstrating that most predictions are accurate, although a few test residuals deviate beyond ±4.
In addition, Figure 13 compares the relative errors for different types of UHPC, including Traditional UHPC, CNF-enhanced UHPC, and economic UHPC, obtained using the XGBoost model and Ref. Equations (3) and (4), as referenced in [51]. The relative error for the three UHPC types with XGBoost is evidently lower than that of Ref. Equations (3) and (4). Similarly, for Traditional and CNF-enhanced UHPC, XGBoost demonstrates higher accuracy than the fitting equation in Ref. [51], while for economic UHPC it reaches a comparable level of accuracy. Moreover, the fitting equation in Ref. [51] requires different fitting parameters for the various UHPC types, whereas XGBoost uses a single set of parameters, optimized by hyperparameter tuning with Hyperopt, to predict the elastic modulus of all UHPC types. As a result, XGBoost demonstrates superior prediction accuracy for different UHPC types compared to the fitting equation. Further research could investigate automated hyperparameter optimization and meta-learning techniques to streamline the selection and tuning of machine learning models for predicting UHPC properties, improving efficiency in practical applications.

6. Conclusions

In this work, 10 different machine learning models, namely linear regression, Bayesian Ridge, ARD, RANSAC, ANN, SVR (linear, poly, and rbf), Decision Tree, Random Forest, XGBoost, and KNN regression, were compared for predicting the elastic modulus of UHPC. The conclusions are summarized as follows:
(1)
XGBoost demonstrated superior predictive performance for the elastic modulus of UHPC, especially when trained on large datasets. XGBoost outperformed other models, including linear regression, ANN, SVR (linear, poly, and rbf), Decision Tree, Random Forest, and KNNs, in terms of accuracy and stability.
(2)
Decision Tree provided the most accurate predictions when dealing with smaller datasets, while linear regression models consistently showed the lowest accuracy across various scenarios, highlighting limitations in capturing the nonlinear relationships present in UHPC data.
(3)
XGBoost can effectively predict the elastic modulus of UHPC regardless of the dataset size, and machine learning models can be used to generate fitting curves for target predictions in complex datasets. The fitting equations demonstrate that the XGBoost model is highly effective at predicting the relationship between compressive strength and elastic modulus in UHPC.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/buildings14103184/s1, Table S1: Comparison of the errors of the estimated elastic modulus by different models with a large training dataset; Table S2: Comparison of the errors of the estimated elastic modulus by different models with a small training dataset; Table S3: Dataset and prediction of different types of UHPC.

Author Contributions

Conceptualization, C.Z.; methodology, C.Z. and P.L.; software, C.Z. and P.L.; validation, C.Z. and P.L.; formal analysis, C.Z.; investigation, T.S. and Y.P.; resources, B.H.; data curation, W.L.; writing—original draft preparation, C.Z.; writing—review and editing, P.L. and B.H.; visualization, T.S.; supervision, B.H.; project administration, P.L.; funding acquisition, Y.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (Grant No. 52208353), Guangxi Science and Technology Program (Grant No. AD21220073), Natural Science Foundation of Hunan Province (Grant No. 2023JJ60176), and Research Project on Shenzhen Metro Group Co., Ltd. (Grant No. SZTD-JSZX-ZC-2020-0022).

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.

Conflicts of Interest

Authors Chaohui Zhang, Tiantian Song, Bin He and Wei Li were employed by the company Shenzhen Metro Group Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Zhang, C.; Wu, Z.; Luo, C.; Hu, X.; Li, K.; Shi, C.; Banthia, N. Size effect of ultra-high-performance concrete under compression: Effects of steel fiber characteristics and water-to-binder ratio. Constr. Build. Mater. 2022, 330, 127170. [Google Scholar] [CrossRef]
  2. Pouraminian, M.; Akbari Baghal, A.E.; Andalibi, K.; Khosravi, F.; Arab Maleki, V. Enhancing the pull-out behavior of ribbed steel bars in CNT-modified UHPFRC using recycled steel fibers from waste tires: A multiscale finite element study. Sci. Rep. 2024, 14, 19939. [Google Scholar] [CrossRef] [PubMed]
  3. Qin, Y.; Guo, Z.W.; Ke, L.; Zhao, K.X. Finite element analysis and hysteretic model of self-centering steel connection with dual energy dissipation mechanism. Eng. Struct. 2024, 300, 117238. [Google Scholar] [CrossRef]
  4. Rahgozar, N.; Pouraminian, M.; Rahgozar, N. Reliability-based seismic assessment of controlled rocking steel cores. J. Build. Eng. 2021, 44, 102623. [Google Scholar] [CrossRef]
  5. Thomas, R.J.; Sorensen, A.D. Review of strain rate effects for UHPC in tension. Constr. Build. Mater. 2017, 153, 846–856. [Google Scholar] [CrossRef]
  6. NF P 18-710; Design of Concrete Structures: Specific Rules for Ultra-High-Performance Fiber Reinforced Concrete (UHPFRC). French Association for Civil Engineering (AFNOR): Paris, France, 2016.
  7. Haber, Z.B.; Munoz, J.F.; Graybeal, B.A. Field Testing of an Ultra-High Performance Concrete Overlay (No. FHWA-HRT-17-096); Federal Highway Administration, Office of Infrastructure Research and Development: Washington, DC, USA, 2017. [Google Scholar]
  8. Yokota, H.; Keitetsu, R.; Noboru, S. JSCE recommendations for design and construction of high performance fiber reinforced cement composite with multiple fine cracks. In High Performance Fiber Reinforced Cement Composites; Springer: Tokyo, Japan, 2008. [Google Scholar]
  9. Korea Concrete Institute. Design Recommendations for Ultra-High Performance Concrete K-UHPC; KCI-M-12-003; Korea Concrete Institute: Seoul, Republic of Korea, 2012; Volume 2. [Google Scholar]
  10. Ma, J.; Orgass, M.; Dehn, F.; Schmidt, D.; Tue, N.V. Comparative investigations on ultra-high performance concrete with and without coarse aggregates. In Proceedings of the International Symposium on Ultra High Performance Concrete, Kassel, Germany, 13–15 September 2004; pp. 205–212. [Google Scholar]
  11. Sritharan, S.; Bristow, B.; Perry, V. Characterizing an ultra-high performance material for bridge applications under extreme loads. In Proceedings of the 3rd International Symposium on High Performance Concrete, Orlando, FL, USA, 19–22 October 2003. [Google Scholar]
  12. Graybeal, B.A. Compressive behavior of ultra-high-performance fiber-reinforced concrete. ACI Mater. J. 2007, 104, 146–152. [Google Scholar]
  13. Graybeal, B.A.; Brenton, S. Compression Response of a Rapid-Strengthening Ultra-High Performance Concrete Formulation; No. FHWA-HRT-12-065; Federal Highway Administration, Office of Infrastructure Research and Development: Washington, DC, USA, 2012. [Google Scholar]
  14. Zhang, C.; Liu, P.; Li, K.; Shi, C. Generation and property analyses of 3D mesoscale models for plain and fiber reinforced concretes. Cem. Concr Compos. 2020, 114, 103714. [Google Scholar] [CrossRef]
  15. Ouyang, X.; Shi, C.; Wu, Z.; Li, K.; Shan, B.; Shi, J. Experimental investigation and prediction of elastic modulus of ultra-high performance concrete (UHPC) based on its composition. Cem. Concr. Res. 2020, 138, 106241. [Google Scholar] [CrossRef]
  16. Golafshani, E.M.; Kim, T.; Behnood, A.; Ngo, T.; Kashani, A. Sustainable mix design of recycled aggregate concrete using artificial intelligence. J. Clean. Prod. 2024, 442, 140994. [Google Scholar] [CrossRef]
  17. Liu, Q.; Andersen, L.V.; Wu, M. Prediction of concrete abrasion depth and computational design optimization of concrete mixtures. Cem. Concr. Compos. 2024, 148, 105431. [Google Scholar] [CrossRef]
  18. Que, Z.; Tang, J.; Wei, H.; Zhou, A.; Wu, K.; Zou, D.; Yang, J.; Liu, T.; De Schutter, G. Predicting the tensile strength of ultra-high performance concrete: New insights into the synergistic effects of steel fiber geometry and distribution. Constr. Build. Mater. 2024, 444, 137822. [Google Scholar] [CrossRef]
  19. Ziolkowski, P.; Niedostatkiewicz, M.; Kang, S.B. Model-based adaptive machine learning approach in concrete mix design. Materials 2021, 14, 1661. [Google Scholar] [CrossRef]
  20. Kazemi, F.; Shafighfard, T.; Yoo, D.Y. Data-driven modeling of mechanical properties of fiber-reinforced concrete: A critical review. Arch. Comput. Methods Eng. 2024, 31, 2049–2078. [Google Scholar] [CrossRef]
  21. Lee, S.; Nguyen, N.H.; Karamanli, A.; Lee, J.; Vo, T.P. Super learner machine-learning algorithms for compressive strength prediction of high performance concrete. Struct. Concr. 2023, 24, 2208–2228. [Google Scholar] [CrossRef]
  22. Kang, M.C.; Yoo, D.Y.; Gupta, R. Machine learning-based prediction for compressive and flexural strengths of steel fiber-reinforced concrete. Constr. Build. Mater. 2021, 266, 121117. [Google Scholar] [CrossRef]
  23. Mehta, V. Machine learning approach for predicting concrete compressive, splitting tensile, and flexural strength with waste foundry sand. J. Build. Eng. 2023, 70, 106363. [Google Scholar] [CrossRef]
  24. Vargas, J.F.; Oviedo, A.I.; Ortega, N.A.; Orozco, E.; Gómez, A.; Londoño, J.M. Machine-Learning-Based Predictive Models for Compressive Strength, Flexural Strength, and Slump of Concrete. Appl. Sci. 2024, 14, 4426. [Google Scholar] [CrossRef]
  25. Zhang, H.; Li, X.; Amin, M.N.; Al-Naghi AA, A.; Arifeen, S.U.; Althoey, F.; Ahmad, A. Analyzing chloride diffusion for durability predictions of concrete using contemporary machine learning strategies. Mater. Today Commun. 2024, 38, 108543. [Google Scholar] [CrossRef]
  26. Chen, H.; Cao, Y.; Liu, Y.; Qin, Y.; Xia, L. Enhancing the durability of concrete in severely cold regions: Mix proportion optimization based on machine learning. Constr. Build. Mater. 2023, 371, 130644. [Google Scholar] [CrossRef]
  27. Naderpour, H.; Rafiean, A.H.; Fakharian, P. Compressive strength prediction of environmentally friendly concrete using artificial neural networks. J. Build. Eng. 2018, 16, 213–219. [Google Scholar] [CrossRef]
  28. Fan, D.; Yu, R.; Fu, S.; Yue, L.; Wu, C.; Shui, Z.; Liu, K.; Song, Q.; Sun, M.; Jiang, C. Precise design and characteristics prediction of Ultra-High Performance Concrete (UHPC) based on artificial intelligence techniques. Cem. Concr. Compos. 2021, 122, 104171. [Google Scholar] [CrossRef]
  29. Khodaparasti, M.; Alijamaat, A.; Pouraminian, M. Prediction of the concrete compressive strength using improved random forest algorithm. J. Build. Pathol. Rehabil. 2023, 8, 92. [Google Scholar] [CrossRef]
  30. Demir, F. Prediction of elastic modulus of normal and high strength concrete by artificial neural networks. Constr. Build. Mater. 2008, 22, 1428–1435. [Google Scholar] [CrossRef]
  31. Yan, K.; Shi, C. Prediction of elastic modulus of normal and high strength concrete by support vector machine. Constr. Build. Mater. 2010, 24, 1479–1485. [Google Scholar] [CrossRef]
  32. Golafshani, E.M.; Behnood, A. Application of soft computing methods for predicting the elastic modulus of recycled aggregate concrete. J. Clean. Prod. 2018, 176, 1163–1176. [Google Scholar] [CrossRef]
  33. Behnood, A.; Olek, J.; Glinicki, M.A. Predicting modulus elasticity of recycled aggregate concrete using M5′ model tree algorithm. Constr. Build. Mater. 2015, 94, 137–147. [Google Scholar] [CrossRef]
  34. Memarzadeh, A.; Shahmansouri, A.A.; Poologanathan, K. A novel prediction model for post-fire elastic modulus of circular recycled aggregate concrete-filled steel tubular stub columns. Steel Compos. Struct. Int. J. 2022, 44, 309–324. [Google Scholar] [CrossRef]
  35. Montgomery, D.C.; Peck, E.A.; Vining, G.G. Introduction to Linear Regression Analysis; John Wiley & Sons: Hoboken, NJ, USA, 2021. [Google Scholar]
  36. Dempster, A.P.; Schatzoff, M.; Wermuth, N. A simulation study of alternatives to ordinary least squares. J. Am. Stat. Assoc. 1977, 72, 77–91. [Google Scholar] [CrossRef]
  37. Wang, X.; Yue, Y.R.; Faraway, J.J. Bayesian Regression Modeling with INLA; Chapman and Hall/CRC: Boca Raton, FL, USA, 2018. [Google Scholar]
  38. Berrar, D. Bayes’ theorem and naive Bayes classifier. In Encyclopedia of Bioinformatics and Computational Biology: ABC of Bioinformatics; Elsevier Inc.: Amsterdam, The Netherlands, 2018; p. 403. [Google Scholar]
  39. Wipf, D.; Nagarajan, S. A new view of automatic relevance determination. In Advances in Neural Information Processing Systems; 2008; pp. 1625–1632. [Google Scholar]
  40. Fischler, M.A.; Bolles, R.C. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 1981, 24, 381–395. [Google Scholar] [CrossRef]
  41. Derpanis, K.G. Overview of the RANSAC Algorithm. Image 2010, 4, 2–3. [Google Scholar]
  42. Specht, D.F. A general regression neural network. IEEE Trans. Neural Netw. 1991, 2, 568–576. [Google Scholar] [CrossRef] [PubMed]
  43. Vapnik, V. The Nature of Statistical Learning Theory; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
  44. Breiman, L. Classification and Regression Trees; Routledge: Abington, UK, 2017. [Google Scholar]
  45. Liaw, A.; Wiener, M. Classification and regression by randomForest. R News 2002, 2, 18–22. [Google Scholar]
  46. Chen, T.; Carlos, G. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM Sigkdd International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar] [CrossRef]
  47. Natekin, A.; Knoll, A. Gradient boosting machines, a tutorial. Front. Neurorobot. 2013, 7, 21. [Google Scholar] [CrossRef]
  48. Altman, N.S. An introduction to kernel and nearest-neighbor nonparametric regression. Am. Stat. 1992, 46, 175–185. [Google Scholar] [CrossRef]
  49. Ahlborn, T.M.; Peuse, E.J.; Misson, D.L. Ultra-High Performance Concrete for Michigan Bridges, Material Performance: Phase I (No. RC-1525); Department of Transportation: Detroit, MI, USA, 2008. [Google Scholar]
  50. Graybeal, B.A. Material Property Characterization of Ultra-High Performance Concrete (No. FHWA-HRT-06-103); Federal Highway Administration, Office of Infrastructure Research and Development: Washington, DC, USA, 2006. [Google Scholar]
  51. Cimesa, M.; Moustafa, M.A. UHPC modulus of elasticity: Assessment and new developments using companion materials and structural data. Eng. Struct. 2024, 310, 118146. [Google Scholar] [CrossRef]
Figure 1. Linear regression model.
Figure 2. The diagram of ANN regression.
Figure 3. The diagram of SVR.
Figure 4. Decision Tree regression: (a) data; (b) Decision Tree.
Figure 5. Random Forest regression.
Figure 6. Prediction of the estimated elastic modulus using a large number of training data with different machine learning models.
Figure 7. Errors of the estimated elastic modulus for a large number of training data with different machine learning models.
Figure 8. Prediction of the estimated elastic modulus using a small number of training data with different machine learning models.
Figure 9. Errors of the estimated elastic modulus for a small number of training data with different machine learning models.
Figure 10. Prediction of the estimated elastic modulus for (a) a large number of training data and (b) a small number of training data with different models.
Figure 11. Prediction of (a) elastic modulus and (b) error distribution.
Figure 12. Prediction of (a) feature importance and (b) residuals distribution (SN: specimen number; CS: compressive strength).
Figure 13. Relative error of (a) three different types of UHPC, (b) Traditional UHPC, (c) CNF-enhanced UHPC, and (d) economic UHPC [51].
Table 1. Hyperparameter selection.

Hyperparameter   | Range     | Best Parameters
n_estimators     | 1000~3000 | 4300
learning_rate    | 0.001~0.1 | 0.16
max_depth        | 3~10      | 6
min_child_weight | 1~10      | 3.86
subsample        | 0.5~1.0   | 0.89
colsample_bytree | 0.5~1.0   | 0.96
gamma            | 0~5       | 0.01
reg_alpha        | 0~10      | 1.35
reg_lambda       | 0~10      | 5.19