The Application of Machine Learning Algorithms to Bond Strength between Steel Rebars and Concrete Using Bayesian Optimization

Yan, Huajun; Xie, Nan; Shen, Dandan

doi:10.3390/ma17184641

by Huajun Yan ¹,*, Nan Xie ¹ and Dandan Shen ²

¹ School of Civil Engineering, Beijing Jiaotong University, Beijing 100044, China

² SANY Heavy Industry Co., Ltd., Beijing 100044, China

* Author to whom correspondence should be addressed.

Materials 2024, 17(18), 4641; https://doi.org/10.3390/ma17184641

Submission received: 3 August 2024 / Revised: 28 August 2024 / Accepted: 19 September 2024 / Published: 21 September 2024

(This article belongs to the Section Construction and Building Materials)

Abstract: The purpose of this study is to estimate the bond strength between steel rebars and concrete using machine learning (ML) algorithms with Bayesian optimization (BO). Because bond strength is affected by the stress field, it is important to determine it from beam tests. A machine learning approach for bond strength based on 401 beam tests with six influencing factors is presented in this paper. The model combines three standard algorithms, namely random forest (RF), support vector regression (SVR), and extreme gradient boosting (XGBoost), with the BO technique. Compared to empirical models, BO-XGBoost was found to be the most accurate method, with R², MAE, and RMSE values of 0.87, 0.897 MPa, and 1.516 MPa, respectively, for the test set. A simplified model with three input variables (diameter of the rebar, yield strength of reinforcement, and concrete compressive strength) is also proposed to make the approach more convenient to apply. The Shapley additive explanation (SHAP) is used to help explain why the ML-based model predicts the particular outcome it does. Using machine learning algorithms to predict complex interfacial mechanical behavior makes it possible to improve the accuracy of the model.

Keywords: machine learning; Bayesian optimization; bond strength; Shapley additive explanation

1. Introduction

Composite action between steel rebars and concrete significantly affects the mechanics of structures. The effectiveness of an integrated reinforced concrete (RC) structure is determined by the strength of the bond between the steel rebars and concrete. Bond strength has been tested in experimental studies using several methods, including pull-out tests, beam anchorage tests, beam-end tests, and lap splice tests [1]. A key reason for the popularity of the pull-out test is the simplicity of specimen manufacture. In pull-out test specimens, however, the steel rebars are under tension, whereas the concrete is under compression. Given the tensile stress state of RC members, pull-out tests are therefore not appropriate for determining bond behavior, as also discouraged in ACI 408R-03 [1]. In general, a beam test is considered a reliable method for assessing bond strength because it is able to describe the bonding state in real-life situations [2,3].

Figure 1 illustrates the stress state of the concrete surrounding the steel rebar. In the pull-out test, $σ_{P}$ refers to the compressive stress that results from the pressure applied at the end of the specimen. The compression stress and the shear stress are represented by $σ_{Pr}$ and $τ_{P}$, respectively. The $σ_{Pθ}$ represents the circumferential tensile stress that results from the compressive force exerted by a steel rebar. Only splitting cracks are observed by $σ_{P}$ in the pull-out test specimens, and the bond stress distribution along the rebar is continuous.

$σ_{B}$ refers to the tensile stress produced during the flexure of a beam specimen during the beam test. The ring compressive stress and shear stress are represented by $σ_{Br}$ and $τ_{B}$, respectively. A steel rebar generates circumferential tensile stress, which is described by $σ_{Bθ}$. Rebar stress is redistributed when a crack occurs due to flexure. During crack progression, bond pressure cannot exert enough force on concrete to cause it to split or separate. Assuming the same local bond strength and the same development length, the average bond strength of the beam test should be lower than that of the pull-out test.

The concrete and steel rebar interfacial bonds enable the transfer of forces and the development of composite responses [4]. A combination of three factors contributes to the force transfer between steel rebars and concrete: chemical adhesion, friction, and mechanical interlocking [5]. The latter action is typically dominant when relative slip occurs on ribbed bars. Due to this, the strength of the bond is primarily determined by the steel rebars and the concrete, as well as their stress levels. There are several empirical models available for calculating the bond strength ($τ_{max}$) [6,7,8,9]. The development length of the ACI 318 code is determined according to a model developed by Orangun et al. [9]. Recent studies by Torre-Casanova et al. [10] have proposed formulas for separating failures caused by splitting from those caused by pull-out failure. Furthermore, mechanical models were proposed as a complement to experimental testing for the evaluation of the interfacial properties [6,7]. In spite of their past success, mechanical models have shown some shortcomings, including assumptions that are not valid under certain conditions [8]. A mechanical model generally considers a small number of influencing factors, since many factors complicate the mechanical model. The high dependence on the database used in the analysis may result in significant errors when applying these equations to different scenarios [11]. For instance, the square root of concrete’s compressive strength is typically used to estimate the bond’s strength [12], but this calculation was found to be inaccurate due to coupling effects [13]. Although empirical formulas are reliable, fracture mechanics methods only take into account a limited number of variables (such as the compressive strength of concrete) [14].

Because the relationship between steel rebars and concrete is nonlinear, the aforementioned empirical models are insufficient to explain the mechanism of bond strength. Data-driven models have therefore been developed using various machine learning (ML) approaches, which have proven to be effective [15,16,17,18,19]. An artificial neural network (ANN) approach was developed by Dahou et al. [20] on the basis of 112 pull-out tests. Furthermore, 117 pull-out tests were used to develop a model that predicts bond strength [21]. Wang et al. collected relevant data on the bond strength between concrete and corroded steel rebars, proposed a bond strength model, and clarified the relationship between various parameters and bond strength [22]. Using ANN models, Mousavi et al. [23] then predicted the ultimate and relative bond strength between corroded rebars and the surrounding concrete with and without transverse rebars. These studies found the proposed models to be acceptable. However, the bond-slip behavior of steel-concrete in the pull-out test is not indicative of the behavior of real reinforced concrete beams [24]. In light of this, Jeong et al. [25] collected 75 beam specimens and 255 pull-out data and used ML algorithms (random forest and K-means clustering) to analyze the influence of each feature within two groups. Compared to previous equations, the proposed model could fit test results more accurately and with less dispersion.

ML approaches are capable of predicting steel rebar-concrete behavior; however, certain challenges still need to be overcome, including the lack of credible beam test data, which makes it difficult to develop a reliable bond strength model using hyperparameter-optimized ML algorithms. In structural mechanics, it has always been difficult to combine ML approaches for nonlinear material properties with theoretical knowledge.

To overcome the limitations of conventional empirical formulas and data-driven models, this study proposes a method for calculating the bond strength between steel rebars and the surrounding concrete using ML approaches. It is possible to enhance the accuracy of the model by using ML algorithms with Bayesian optimization technology.

Further, complex ML models are difficult to explain, and only a few studies have explored this question [26]. The Shapley additive explanation (SHAP) method simplifies the interpretation of ML models by utilizing game theory [27]. SHAP has been applied in a wide range of fields, including structural engineering [28], infrastructure systems [29], and material properties of concrete [30]. The use of ML-based models coupled with the SHAP method for bond strength prediction has not been widely adopted. By combining SHAP with ML algorithms, this paper offers insights into the complicated nonlinear behavior of bond strength and illustrates how various features contribute to bond strength.

2. Methodology

2.1. Existing Bond Strength Equations

As shown in Table 1, empirical equations are presented for estimating the ultimate bond strength ($τ_{u}$). A model proposed by Orangun et al. [9] indicates that bond stress is linearly impacted by the minimum spacing and the thickness of the cover. In contrast, Esfahani’s models are based on beam-end samples, while other equations were derived from beam splice test results [7,31]. Several models are also proposed in the design codes, such as ACI 408R-03 [1] and AS 3600 [32].

2.2. The Considered Machine Learning (ML) Algorithms

The ML models were generated using Python 3.7. ML algorithms can be categorized as linear or nonlinear models, depending on whether they capture a linear correlation or a complex nonlinear correlation between input variables and outcomes. Three models are selected to train estimators of bond strength: a linear model, support vector machine regression (SVR), and two nonlinear models, random forest (RF) and extreme gradient boosting (XGBoost). In SVR models, parameters are optimized by maximizing the minimum distance between the support vectors and the hyperplane. In the RF model the individual learners are independent of one another, whereas in the XGBoost model each learner depends on those built before it. These three algorithms were selected because they have different mechanisms and are representative of linear and nonlinear models.

2.2.1. Support Vector Regression (SVR)

SVR applies the structural risk minimization principle of support vector machines to regression [34]. A regression function in hyperspace can be obtained by using the SVR model $f(x) = ω^{T} ϕ(x) + b$. By using the loss function, the SVR model is obtained as shown in the following equations:

$$\min_{ω, b, ξ_{1}, ξ_{2}} F(X) = \frac{1}{2} {‖ω‖}^{2} + C (e^{T} ξ_{1} + e^{T} ξ_{2}) \qquad (1)$$

where $ω$, $x$, and $b$ are the weight vector, input vector, and bias, respectively. In Equation (1), $C$ is the fixed parameter, $ξ_{1}$ and $ξ_{2}$ are slack variables, and $e$ represents the unit vector.

$$\max_{α, α^{*}} G(X) = \frac{1}{2} {(α^{*} - α)}^{T} K(A, A^{T}) (α^{*} - α) + e^{T} (α^{*} + α) \qquad (2)$$

$$f(x) = \sum_{i=1}^{n} (α_{i}^{*} - α_{i}) K(x_{i}, x) + b \qquad (3)$$

where $α$ and $α^{*}$ are the Lagrange multipliers, $K(x_{i}, x)$ is the kernel function, and $A$ is the input for the training set.
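As a minimal sketch of how Equation (3) is evaluated once the multipliers are known, the following Python snippet computes an SVR prediction. The Gaussian (RBF) kernel, the `gamma` value, the function names, and all numerical values are assumptions made for illustration; the text does not state which kernel was used.

```python
import math


def rbf_kernel(x_i, x, gamma=0.5):
    """Assumed Gaussian (RBF) kernel: K(x_i, x) = exp(-gamma * ||x_i - x||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x_i, x))
    return math.exp(-gamma * sq_dist)


def svr_predict(x, support_vectors, dual_coefs, b, gamma=0.5):
    """Evaluate f(x) = sum_i (alpha_i* - alpha_i) K(x_i, x) + b.

    dual_coefs[i] holds the difference (alpha_i* - alpha_i) for support vector i.
    """
    return b + sum(
        coef * rbf_kernel(sv, x, gamma)
        for sv, coef in zip(support_vectors, dual_coefs)
    )


# Hypothetical, hand-picked values (not fitted to the beam test database).
support_vectors = [(0.0, 0.0), (1.0, 1.0)]
dual_coefs = [0.8, -0.3]
prediction = svr_predict((0.0, 0.0), support_vectors, dual_coefs, b=2.0)
```

Only training points with a non-zero coefficient contribute to the sum, which is why a fitted SVR model stores the support vectors rather than the full training set.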

2.2.2. Random Forest (RF)

A random forest (RF) is an ensemble learning algorithm that combines a large number of decision trees [35]. The concept of randomly chosen decision trees was introduced by Tin Kam Ho [36], and Breiman [37] elaborated on it with additional algorithmic parameters. To construct different decision trees and to average their forecasts, the RF utilizes the bootstrap resampling technique. Bootstrap resampling minimizes the correlation between decision trees, improving the algorithm’s generalization accuracy. A different bootstrap sample is used to build each regression tree. The hyperparameters determine the maximum depth and branching of each regression tree.

Combining several decision tree learners using least square boost allows for improved results, as well as reduced variance and overfitting. A major advantage of using trees for bagging is that they are capable of capturing complex interactions in data and have a relatively low level of bias when they are grown to a sufficient depth.

2.2.3. Extreme Gradient Boosting (XGBoost)

XGBoost is an advanced ensemble machine learning algorithm based on gradient boosting [38]. Within this framework, XGBoost introduces a number of enhancements: (1) a structural risk term is incorporated into the objective function in order to achieve greater accuracy and avoid overfitting; (2) rows and columns are sampled to reduce overfitting; and (3) the branching metric is reset, leading to a more direct growth of the decision tree.

The XGBoost algorithm is an ensemble algorithm that calculates the predicted value by combining the values of the decision trees:

$$\hat{Y}_{i} = \sum_{k=1}^{K} α_{k} f_{k}(X_{i}) \qquad (4)$$

where $K$ represents the number of decision trees, $\hat{Y}_{i}$ is the predicted value, $f_{k}(X_{i})$ represents the prediction function of a single decision tree, and $α_{k}$ is a learning rate used to prevent overfitting in the training model.

The objective function for the XGBoost algorithm can be expressed as follows:

$$F_{obj}^{t} = \sum_{i=1}^{n} L(Y_{i}, \hat{Y}_{i}^{t-1} + f_{t}(X_{i})) + Ω(f_{t}) \qquad (5)$$

where $F_{obj}^{t}$ is the objective function at iteration $t$, $L$ represents the loss function, $Y_{i}$ represents the actual value of the $i$-th sample, and $Ω(f_{t})$ is the regularization term.
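The additive structure of Equation (4) can be illustrated with a stripped-down gradient boosting loop. The sketch below is not XGBoost itself: it assumes a squared-error loss, one-split regression stumps on a single input, a constant learning rate, and no regularization term, and the data are invented for illustration.

```python
def fit_stump(xs, residuals):
    """Fit the best single-split regression stump (least squares) on 1-D inputs."""
    best = None
    # Exclude the largest value so the right-hand side is never empty.
    for threshold in sorted(set(xs))[:-1]:
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        sse = sum((r - left_mean) ** 2 for r in left)
        sse += sum((r - right_mean) ** 2 for r in right)
        if best is None or sse < best[0]:
            best = (sse, threshold, left_mean, right_mean)
    _, threshold, left_mean, right_mean = best
    return lambda x: left_mean if x <= threshold else right_mean


def boost(xs, ys, n_trees=20, learning_rate=0.3):
    """Build an additive model: each new stump is fitted to the current residuals."""
    base = sum(ys) / len(ys)
    trees = []
    preds = [base] * len(xs)
    for _ in range(n_trees):
        residuals = [y - p for y, p in zip(ys, preds)]
        tree = fit_stump(xs, residuals)
        trees.append(tree)
        preds = [p + learning_rate * tree(x) for p, x in zip(preds, xs)]

    def predict(x):
        # Sum of shrunken tree outputs, as in Equation (4), plus a constant start value.
        return base + learning_rate * sum(tree(x) for tree in trees)

    return predict


# Invented toy data, not taken from the beam test database.
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [1.0, 1.2, 2.9, 3.1, 5.0, 5.2]
model = boost(xs, ys)


def mse(predict):
    return sum((predict(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


baseline_mse = mse(lambda x: sum(ys) / len(ys))
boosted_mse = mse(model)
```

Each stump depends on the residuals left by the stumps before it, which is the sequential dependence that distinguishes boosting from the independent trees of a random forest.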

2.2.4. Bayesian Optimization (BO)

In order to optimize model performance, the most appropriate set of hyperparameters within a particular search space must be chosen. This study selects the most appropriate set of hyperparameters using Bayesian optimization, which is an efficient global optimization technique for complex black-box functions [39]. The workflow of BO is illustrated in Figure 2.

This is accomplished by assuming that the objective function follows a multivariate Gaussian process (GP). To implement the GP, the covariance matrix is built from the kernel as follows:

$$K = \begin{bmatrix} K(x_{1}, x_{1}) & \dots & K(x_{1}, x_{n}) \\ \vdots & \ddots & \vdots \\ K(x_{n}, x_{1}) & \dots & K(x_{n}, x_{n}) \end{bmatrix} + σ_{noise}^{2} I \qquad (6)$$

where the GP is calculated using $n$ sample points; $K$ represents the covariance kernel, which is calculated as an exponential function of the second norm of the difference between two samples; $I$ is the identity matrix; and $σ_{noise}$ represents the standard deviation of the normally distributed noise [40].
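The covariance matrix of Equation (6) can be assembled in a few lines. This is a sketch under stated assumptions: the kernel is taken to be a squared-exponential function with a hypothetical `length_scale`, the noise variance is added on the diagonal only, and the sample points are invented hyperparameter settings.

```python
import math


def sq_exp_kernel(x1, x2, length_scale=1.0):
    """Assumed kernel: exp(-||x1 - x2||^2 / (2 * length_scale^2))."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x1, x2))
    return math.exp(-sq_dist / (2.0 * length_scale ** 2))


def covariance_matrix(points, noise_std=0.1, length_scale=1.0):
    """Kernel matrix over all pairs of points, with noise variance on the diagonal."""
    n = len(points)
    return [
        [
            sq_exp_kernel(points[i], points[j], length_scale)
            + (noise_std ** 2 if i == j else 0.0)
            for j in range(n)
        ]
        for i in range(n)
    ]


# Hypothetical sampled settings, e.g. (learning rate, tree depth).
points = [(0.1, 3.0), (0.3, 5.0), (0.05, 4.0)]
K = covariance_matrix(points)
```

Nearby settings get covariance close to 1 and distant ones close to 0, which is what lets the GP interpolate the objective between the points already evaluated.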

In order to estimate the improvement potential of every possible sample point, the acquisition function is calculated by expected improvement (EI) as follows:

$$I(x_{n}) = (y_{best} - μ(x_{n})) Φ\left(\frac{y_{best} - μ(x_{n})}{σ(x_{n})}\right) + σ(x_{n}) ϕ\left(\frac{y_{best} - μ(x_{n})}{σ(x_{n})}\right) \qquad (7)$$

where $Φ$ and $ϕ$ represent the standard normal cumulative distribution and probability density functions, respectively; $μ(x_{n})$ and $σ(x_{n})$ are the GP predictive mean and standard deviation at $x_{n}$; and $y_{best}$ represents a tentative optimal value.
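Equation (7) is simple to evaluate with the standard library. The sketch below follows the equation as written, which rewards candidates whose predicted mean lies below $y_{best}$; using it to maximize a score such as R² would mean applying it to the negated score, which is an assumption about usage rather than something the text states. The function names and the zero-variance fallback are illustrative choices.

```python
import math


def normal_pdf(z):
    """Standard normal probability density function."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def expected_improvement(mu, sigma, y_best):
    """Expected improvement for predictive mean mu and standard deviation sigma."""
    if sigma <= 0.0:
        # No uncertainty left: the improvement is deterministic.
        return max(y_best - mu, 0.0)
    z = (y_best - mu) / sigma
    return (y_best - mu) * normal_cdf(z) + sigma * normal_pdf(z)


ei_at_best = expected_improvement(mu=1.0, sigma=0.5, y_best=1.0)
ei_promising = expected_improvement(mu=0.5, sigma=0.5, y_best=1.0)
ei_certain_worse = expected_improvement(mu=2.0, sigma=0.0, y_best=1.0)
```

The first term favors points predicted to beat the current best (exploitation), while the second grows with the predictive uncertainty (exploration).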

2.3. Evaluation Metrics

In order to evaluate ML-based models effectively, it is imperative to select metrics that accurately reflect their performance. Three evaluation metrics were selected in this study, namely the coefficient of determination (R²), the root mean square error (RMSE), and the mean absolute error (MAE). A model is generally considered to perform better if R² is close to 1. The MAE and RMSE values are all within $[0, +\infty)$, and smaller values indicate that the predicted value is closer to the true value, and thus that the model is more accurate. The formulas are as follows:

$$R^{2} = 1 - \frac{\sum_{i=1}^{n} {(y_{pred,i} - y_{exp,i})}^{2}}{\sum_{i=1}^{n} {(y_{exp,i} - \frac{1}{n} \sum_{j=1}^{n} y_{exp,j})}^{2}} \qquad (8)$$

$$RMSE = \sqrt{\frac{1}{n} \sum_{i=1}^{n} {(y_{pred,i} - y_{exp,i})}^{2}} \qquad (9)$$

$$MAE = \frac{1}{n} \sum_{i=1}^{n} |y_{pred,i} - y_{exp,i}| \qquad (10)$$

where $y_{pred,i}$ and $y_{exp,i}$ are the predicted value and actual value, respectively.
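The three metrics can be computed directly from paired lists of measured and predicted values. The sketch below uses the standard definition of R², in which the total sum of squares is taken about the mean of the measured values; the function names and sample numbers are illustrative only.

```python
import math


def r2_score(y_exp, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_exp = sum(y_exp) / len(y_exp)
    ss_res = sum((p - e) ** 2 for p, e in zip(y_pred, y_exp))
    ss_tot = sum((e - mean_exp) ** 2 for e in y_exp)
    return 1.0 - ss_res / ss_tot


def rmse(y_exp, y_pred):
    """Root mean square error."""
    return math.sqrt(sum((p - e) ** 2 for p, e in zip(y_pred, y_exp)) / len(y_exp))


def mae(y_exp, y_pred):
    """Mean absolute error."""
    return sum(abs(p - e) for p, e in zip(y_pred, y_exp)) / len(y_exp)


# Invented bond strengths in MPa, for illustration only.
measured = [2.0, 4.0, 6.0, 8.0]
predicted = [2.5, 3.5, 6.5, 7.5]
scores = (r2_score(measured, predicted), rmse(measured, predicted), mae(measured, predicted))
```

Because RMSE squares the errors before averaging, it is never smaller than MAE and penalizes large individual errors more heavily.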

3. Model Development

3.1. Database for Beam Tests

The bond strength was calculated using ML algorithms based on a regression analysis of existing beam test specimen results. Previous studies have shown that lap splice specimens are similar in bond properties to beam anchorage specimens [31,41,42,43]. As a result, the beam test database does not distinguish between lap splice specimens and anchorage specimens. For the beam tests (Supplementary data) gathered from previous studies [44,45,46,47,48,49,50,51,52,53,54,55,56], the following database requirements were applied: (1) There was sufficient concrete cover on test specimens with transverse bars; (2) There was a bond failure prior to rebar yielding; and (3) Concrete cast below rebars did not cause problems with its casting position.

The beam test database contains the test parameters for 401 beam specimens. As seen in Supplementary data, ${f_{c}}^{'}$ represents the compressive strength of concrete cylinders (150 mm × 300 mm); $f_{y}$ refers to the yield strength of the reinforcing bars; $c_{b}$ stands for concrete bottom cover for reinforcing bars; $d_{b}$ refers to the diameter of the rebar; $l$ represents the development length; $h$ represents the specimen’s height; and $τ_{u}$ is the bond stress of the reinforcing bars. It was found that transverse bars rarely yielded, and that the yield strength of transverse bars had little effect on bond strength [57].

3.2. Definitions for Input and Output Variables

It is necessary to identify input and output variables in order to conduct further analysis. The bond strength is predicted based on six input variables, and the bond stress is the output variable (Table 2). As noted in the references [44,45,46,47,48,49,50,51,52,53,54,55,56], these input variables have a significant impact on the bond strength of reinforcing bars. Moreover, prior knowledge should be reflected in the input variables. For instance, prior studies have demonstrated a linear relationship between ultimate bond strength and the square root of the compressive strength of concrete [9]. The bond strength of high-performance concrete might be related to a high tensile strength [58]. The contribution of concrete to bond strength has also been described by existing formulas using the same parameters [32,33,44]. Thus, reflecting this prior knowledge in the input variables can effectively increase the accuracy and interpretability of the model.
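As a hedged illustration of how prior knowledge can be encoded in the inputs, the sketch below derives six features from the raw database parameters. The feature set is assumed from the variables named in the model interpretation section ($l / d_{b}$, $f_{y}$, $\sqrt{{f_{c}}^{'}}$, $d_{b}$, $h / d_{b}$, $c / d_{b}$); Table 2 remains the authoritative definition. The function name, dictionary keys, and specimen values are hypothetical.

```python
import math


def build_features(fc, fy, cb, db, l, h):
    """Derive the assumed six model inputs from raw specimen parameters.

    fc: concrete compressive strength (MPa), fy: rebar yield strength (MPa),
    cb: bottom cover (mm), db: rebar diameter (mm),
    l: development length (mm), h: specimen height (mm).
    """
    return {
        "l_over_db": l / db,
        "fy": fy,
        "sqrt_fc": math.sqrt(fc),  # square root reflects the known trend with concrete strength
        "db": db,
        "h_over_db": h / db,
        "c_over_db": cb / db,
    }


# Invented specimen, not taken from the beam test database.
features = build_features(fc=36.0, fy=420.0, cb=40.0, db=20.0, l=300.0, h=400.0)
```

Dividing lengths by the rebar diameter makes those inputs dimensionless, so specimens of different scale can be compared on the same footing.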

3.3. Implementation Process

There are generally four stages involved in the implementation of ML algorithms with Bayesian optimization (BO). First, high-fidelity data are collected and randomly divided into two sets (80% for training and 20% for testing). Second, in order to reduce the error in prediction, a five-fold cross-validation method is used during training. Third, the model parameters are adjusted using Bayesian optimization. Lastly, the results of the model are discussed using feature importance analysis through the SHAP explanation. Figure 3 illustrates how machine learning algorithms can be used to predict the performance of the target.

For predicting bond strength ($τ_{u}$), ML-6, an ML-based model containing six parameters, was proposed in the previous section. However, some of its parameters cannot be obtained directly from existing building structures (such as concrete cover and beam specimen height), which means that this model cannot be directly applied in practice. The study presents a simplified model (ML-3), whose input variables and output variables are shown in Table 2.

3.4. Data Normalization

It is recommended that the input variables be normalized prior to training, as the extracted data have different units and ranges [59]. A similar scale is applied to all features during the normalization process. It is possible to reduce the statistical bias and improve the reliability of ML-based models by using the following normalization procedure:

$$X_{i,n} = \frac{X_{i} - X_{min}}{X_{max} - X_{min}} \qquad (11)$$

where $X_{min}$ represents the minimum value of the input variable, while $X_{max}$ represents the maximum value of the input variable.
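A minimal sketch of this min-max scaling for a single input variable is given below. The function name and sample diameters are illustrative, and returning zeros for a constant column is an assumption made to avoid division by zero.

```python
def min_max_normalize(values):
    """Scale values to [0, 1] using (X - X_min) / (X_max - X_min)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # Constant column: the scaling is undefined, so map everything to 0.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Invented rebar diameters in mm.
diameters_mm = [12.0, 16.0, 20.0, 25.0, 32.0]
normalized = min_max_normalize(diameters_mm)
```

In a train/test workflow it is common to take the minimum and maximum from the training set only and reuse them for the test set, so that no information from the test data leaks into training.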

3.5. K-Fold Validation

When the dataset is not large enough, cross-validation can be used to mitigate the overfitting problem. Cross-validation divides the original data into training and testing subsets and reuses the data across the subsets. As shown in Figure 4, K = 5 is used, which is referred to as 5-fold cross-validation. The detailed steps are as follows: (1) the available data are randomly divided into five equal subsets; (2) four subsets are used for training, and the remaining subset is used for testing the model; (3) this process is repeated five times, with a different fold used for testing each time; (4) the average R² is used in this study as the criterion for evaluating the results of the model.
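The fold construction in steps (1) to (3) can be sketched with the standard library alone. The function name, the fixed seed, and the sample count are assumptions for illustration; when the number of samples is not divisible by K, the folds differ in size by at most one.

```python
import random


def k_fold_indices(n_samples, k=5, seed=0):
    """Return k (train_indices, test_indices) pairs covering every sample once."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # step (1): random assignment to folds
    folds = [indices[i::k] for i in range(k)]
    splits = []
    for i in range(k):
        test = folds[i]  # step (2): one fold held out for testing
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        splits.append((train, test))  # step (3): repeated for each fold
    return splits


def mean_score(fold_scores):
    """Step (4): average the per-fold scores (for example, R² values)."""
    return sum(fold_scores) / len(fold_scores)


splits = k_fold_indices(23, k=5)
```

Each sample is used for testing exactly once and for training K - 1 times, which is what makes the averaged score less sensitive to any single split.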

4. Results and Discussion

4.1. The Impact of Bayesian Optimization

As shown in Figure 5, the model was trained for 50 iterations before the optimal set of hyperparameters was determined. Table 3 summarizes the parameters of the ML algorithms. The objective function of Bayesian optimization is calculated based on the average R² of iterations, and the following result can be obtained:

$$X_{best} = \arg\max_{x} f(x) \qquad (12)$$

Table 4 presents the final prediction results of ML-6 models in the training set and test set. Table 4 shows that BO-XGBoost is the most effective prediction model in both the training and test sets, with satisfactory prediction results indicating adequate training of the proposed model. With Bayesian optimization, the accuracy of the model is significantly improved over that of the standard algorithms. Thus, tuning the hyperparameters using Bayesian optimization can enhance the performance of the models. In addition, BO-XGBoost captures the relationship between the influential factors and bond strength well.

4.2. Comparison with Empirical Models for Bond Strength

To illustrate the advantages of ML-based models, five empirical models, including two design codes, have been introduced and their performance has been compared. Table 5 summarizes the predictions obtained from empirical models and optimized ML models for all datasets. The mean value and coefficient of variation (CoV) of the ratio between the actual value and predicted value are also reported. Among the empirical models, ACI 408R provides the highest $R^{2}$ value (0.84) and the lowest CoV (0.402), MAE (1.654 MPa), and RMSE (2.538 MPa), indicating that it is the most appropriate empirical model. This may be due to the fact that the contribution of different variables in the ACI 408R formula is more reasonable. Other formulas, for example, overestimate the contribution of the concrete cover to bond strength or underestimate the contribution of the steel rebar diameters and the compression strength of the concrete.
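The mean and CoV of the actual-to-predicted ratio can be computed as sketched below. The use of the sample standard deviation (`statistics.stdev`) is an assumption, since the text does not say whether the sample or population form was used; the function name and values are illustrative.

```python
import statistics


def ratio_statistics(actual, predicted):
    """Mean and coefficient of variation of the ratio actual / predicted."""
    ratios = [a / p for a, p in zip(actual, predicted)]
    mean_ratio = statistics.mean(ratios)
    cov = statistics.stdev(ratios) / mean_ratio  # assumed: sample standard deviation
    return mean_ratio, cov


# Invented bond strengths in MPa.
actual_mpa = [10.0, 12.0, 8.0]
predicted_mpa = [10.0, 10.0, 10.0]
mean_ratio, cov = ratio_statistics(actual_mpa, predicted_mpa)
```

A mean ratio near 1 indicates little systematic bias, while a small CoV indicates low scatter; a model can have one without the other, which is why both are reported.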

The prediction performance of all ML-based models is higher than that of empirical models. The imperfect prediction performance of the empirical equations indicates that they may have been oversimplified and that several important factors need to be taken into account. In contrast, the BO-XGBoost model outperforms other models, with the highest R² (0.95), the lowest MAE (0.470 MPa) and RMSE (0.743 MPa), a mean value of 0.99, and the lowest CoV (0.109).

Figure 6 compares the optimized ML-based models with the five existing empirical models. The graph shows the correlation between predicted values and actual values. The scatter points of the BO-XGBoost model are closer to the diagonal than those of the other models. Accordingly, the predicted values of the BO-XGBoost model are in better agreement with the observed values.

4.3. Model Interpretations

4.3.1. The Shapley Additive Explanation (SHAP) Theory

The Shapley additive explanation (SHAP) theory was employed in this study in order to explore the potential complexity of the nonlinear relationship between bond strength and input variables [60]. As a methodology for interpreting individual predictions, SHAP is based on the optimal Shapley value in game theory [61]. Based on this algorithm, the marginal contribution of each feature to the model output is calculated and interpreted both globally and locally by constructing an additive explanation model. In terms of the original model $f(x)$, this explanation model $g(x_{S})$ is expressed by the following equations:

$$f(x) = g(x_{S}) = φ_{0} + \sum_{i=1}^{N} φ_{i} x_{S}^{i} \qquad (13)$$

where $N$ refers to the number of features, $x$ represents the original matrix of input variables, $x_{S}$ represents the simplified matrix of input variables, $φ_{0}$ is defined as a constant value when there are no input variables, and $φ_{i}$ corresponds to the Shapley value of the $i$-th feature.

$$φ_{i}(f, x) = \sum_{S \subseteq \{x^{1}, x^{2}, \dots, x^{N}\} \setminus \{x^{i}\}} \frac{|S|! (N - |S| - 1)!}{N!} (f_{x}(S \cup \{x^{i}\}) - f_{x}(S)) \qquad (14)$$

where $S$ represents a set of features included in the model, and $f_{x}(S)$ represents the output value of the model for that particular combination of features. A summary of the SHAP theory can be found in Refs. [60,61].
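For a handful of features, Shapley values can be computed exactly by enumerating every subset, which makes the weighting in Equation (14) concrete. The sketch below is not the SHAP library: practical implementations rely on faster model-specific or approximate algorithms. The toy value function, which treats absent features as zero, and all names are assumptions for illustration.

```python
from itertools import combinations
from math import factorial


def shapley_values(value_fn, n_features):
    """Exact Shapley values for a set function value_fn(frozenset) -> float."""
    players = range(n_features)
    phi = [0.0] * n_features
    for k in players:
        others = [p for p in players if p != k]
        for size in range(len(others) + 1):
            # Weight |S|! (N - |S| - 1)! / N! for subsets of this size.
            weight = (
                factorial(size) * factorial(n_features - size - 1) / factorial(n_features)
            )
            for subset in combinations(others, size):
                s = frozenset(subset)
                # Marginal contribution of feature k when added to subset S.
                phi[k] += weight * (value_fn(s | {k}) - value_fn(s))
    return phi


def toy_value(subset):
    """Hypothetical model 2*x0 + x1 + x0*x2 at x = (1, 1, 1); absent features are 0."""
    x = [1.0 if i in subset else 0.0 for i in range(3)]
    return 2.0 * x[0] + 1.0 * x[1] + x[0] * x[2]


phi = shapley_values(toy_value, 3)
```

In this toy case the interaction term `x0*x2` is split equally between features 0 and 2, and the values sum to the difference between the full prediction and the empty-set baseline, which is the additivity expressed by Equation (13).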

4.3.2. Model Interpretations for Bond Strength

While the BO-XGBoost model has excellent bond strength prediction accuracy, in general, it tends to behave like a black box, lacking adequate interpretation of factors and strength relationships. The purpose of this paper is to demonstrate the utility of the SHAP method as a tool for analyzing a complex BO-XGBoost model that has multiple input variables.

Based on the BO-XGBoost model, Figure 7 presents the mean SHAP value of the input variables used to predict bond strength and ranks them from high to low impact. According to Figure 7, $l / d_{b}$ is the input variable with the highest SHAP value and is the most important component that can be used to predict bond strength. In terms of importance, $f_{y}$ ranks second, followed by $\sqrt{{f_{c}}^{'}}$, $d_{b}$, $h / d_{b}$, and $c / d_{b}$. It was also noted in Refs. [20,21,24,25] that these three variables ($l$, $f_{y}$, $\sqrt{{f_{c}}^{'}}$) have a significant impact on the bond strength in physical tests.

SHAP summary plots are shown in Figure 8 as a means of illustrating how input variables influence bond strength predictions. In this figure, the SHAP value represents the contribution of each feature to the output metric. Each feature value is colored from red to blue, indicating its magnitude from high to low. A visual inspection of Figure 7 reveals that the bond strength increases with the increase of each of these input variables ($l / d_{b}$, $f_{y}$, $\sqrt{{f_{c}}^{'}}$). In contrast, $d_{b}$ has a negative impact on the output bond strength, which suggests that the bond strength decreases as $d_{b}$ increases.

Several studies [2,62] have investigated the effects of concrete compressive strength and rebar yield strength on bond strength. It is evident from the results that the yield strength of the rebar had a greater influence on the bond strength than the compressive strength of the concrete. The results of this study are consistent with this conclusion, and the direct contribution of these two parameters can also be seen in Figure 7 and Figure 8. In parallel, it is interesting to note that the results obtained by different researchers appear to be contradictory. It has been found by Teresa et al. [63] that an increase in rebar diameter increases bond strength, whereas Siempu et al. [64] have found that an increase in rebar diameter decreases bond strength. In spite of this, it is evident from Figure 7 and Figure 8 that the diameter of the rebar has a negative impact on bond strength.

The influence of these two variables ($h / d_{b}$ and $c / d_{b}$) on bond strength is less than that of other input variables, and as shown in Figure 8, it is unclear whether these two parameters have a positive or negative influence. This may be due to the fact that existing research conclusions have been drawn from univariate variation results. Several factors are interconnected, and this effect of multivariate dependence can be explored further.

4.4. Performance of Simplified Models (ML-3)

A simple ML model (ML-3) involving only three variables is presented in this paper as a means of obtaining bond stress in existing structures and making it easier for designers to apply it to the design process. The three input variables are $\sqrt{{f_{c}}^{'}}$, $f_{y}$, and $d_{b}$ (Table 2), which are obtained based on the model interpretations. The selected parameters were those that have a greater impact on bond strength (Figure 7) and are relatively easy to obtain.

The hyperparameter values of the ML-3 models determined by the Bayesian optimization method (BO) are listed in Table 6. A summary of the final prediction results for the ML-3 models in the training set and test set is presented in Table 7. On the test set, the BO-XGBoost model performed better than other models, with the highest $R^{2}$ (0.74), the lowest MAE (1.412 MPa), and the lowest RMSE (1.516 MPa). As compared to Table 4, the models with six parameters (ML-6) exhibit a better correlation with experimental data than simplified models (ML-3). As the number of input variables is increased, some influential factors can be taken into account that are difficult to evaluate when only three parameters ($\sqrt{{f_{c}}^{'}}$, $f_{y}$, $d_{b}$) are used. There is a possibility that the simplified models (ML-3) may be too global and do not completely represent the local phenomena at the interface between rebars and concrete. While both models have their advantages, they should be used in accordance with their intended purposes. An ML-6 model is more precise and conservative, but it requires six variables as inputs. Since ML-3 requires only three input variables, it is more convenient for designers to use it in practical situations. ML-3 models tend to provide better results than empirical models, except for the BO-SVR model.

4.5. Limitations and Future Study

In comparison to ML algorithms that require six input variables, the simplified model (ML-3) is more convenient to use. However, the results of this study indicate that the prediction accuracy of the simplified model needs to be improved, and future research will focus on this. Using more test data or more effective optimization parameters will enable the model to be trained more effectively. The purpose of developing a simplified machine learning model is to facilitate designers’ use of the model. Additionally, deploying the developed machine learning models as graphical user interface tools and creating related design software will be beneficial to designers.

5. Conclusions

In this study, a data-driven approach is proposed for estimating the bond strength between steel rebars and concrete. A total of 401 beam tests were collected and used for the validation of the proposed ML models. Existing bond strength equations showed significant dispersion and a lack of precision when applied to the collected data. To predict bond strength, three ML algorithms were used: SVR, RF, and XGBoost. A Bayesian optimization approach was used to enhance the performance of the models. To make the approach more convenient to apply, a simplified model (ML-3) was proposed. The following conclusions were drawn:

(1): Empirical models have a low prediction accuracy for the experimental data collected, and the scatter between the prediction and measurement shows the inherent difficulty of conventional explicit approaches to bond strength estimation.
(2): As a result of adequate training, BO-XGBoost proved to be the most effective prediction model in both training and test sets.
(3): The bond strength increases as each of three input variables ( $l / d_{b}$ , $f_{y}$ , $\sqrt{{f_{c}}^{'}}$ ) increases, whereas $d_{b}$ has a negative impact on it.
(4): It is unclear how ( $h / d_{b}$ ) and ( $c / d_{b}$ ) affect bond strength. It is possible that several factors are interconnected in the prediction models, which should be explored further.
(5): Both models have advantages and should be used according to their intended purposes. An ML-6 model is more precise and conservative, but it requires six input variables. ML-3 requires only three input variables and is therefore more convenient for designers in practical situations.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ma17184641/s1, supplementary data and code for the manuscript.

Author Contributions

H.Y.: Investigation, methodology, writing-original draft and project administration. N.X.: Supervision. D.S.: Software. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author due to privacy.

Conflicts of Interest

Author Dandan Shen was employed by SANY Heavy Industry Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

ACI 408R-03: 2003; Bond and Development of Straight Reinforcing Bars in Tension. ACI Committee: Montreal, QC, Canada, 2003.
Lin, X.S.; Zhang, Y.X. Bond-slip behavior of FRP-reinforced concrete beams. Constr. Build. Mater. 2013, 44, 110–117.
Dancygier, A.N.; Katz, A.; Wexler, U. Bond between deformed reinforcement and normal and high-strength concrete with and without fibers. Mater. Struct. 2010, 43, 839–856.
Tirassa, M.; Ruiz, M.F.; Muttoni, A. An interlocking approach for the rebar-to-concrete contact in bond. Mag. Concr. Res. 2021, 73, 379–393.
Zhou, B.B.; Wu, R.Y.; Gu, L.M.; Liu, Y.Q.; Sheng, J.; Li, Y. Analytical model for evaluating bond strength of steel rebar in cracked concrete considering confinement effect. Eng. Fract. Mech. 2024, 306, 110243.
Wu, Y.F.; Zhao, X.M. Unified Bond Stress-Slip Model for Reinforced Concrete. J. Struct. Eng. 2013, 139, 1951–1962.
Esfahani, M.R.; Kianoush, M.R. Development/splice length of reinforcing bars. ACI Struct. J. 2005, 102, 22–30.
Harajli, M.H.; Hout, M.; Jalkh, W. Local bond stress-slip behavior of reinforcing bars embedded in plain and fiber concrete. ACI Mater. J. 1995, 94, 343–354.
Orangun, C.O.; Jirsa, J.O.; Breen, J.E. A Reevaluation of Test Data on Development Length and Splices. ACI J. Proc. 1977, 74, 114–122.
Torre-Casanova, A.; Jason, L.; Davenne, L.; Pinelli, X. Confinement effects on the steel-concrete bond strength and pull-out failure. Eng. Fract. Mech. 2013, 97, 92–104.
Irshidat, M.R. Bond strength evaluation between steel rebars and carbon nanotubes modified concrete. Case Stud. Constr. Mat. 2021, 14, e00477.
ACI 318-08: 2008; Building Code Requirements for Structural Concrete and Commentary. ACI Committee: Montreal, QC, Canada, 2008.
Eurocode 2: 2004; Design of Concrete Structures: Part 1-1: General Rules and Rules for Buildings. British Standard Institution: London, UK, 2004.
Mahjoubi, S.; Meng, W.N.; Bao, Y. Logic-guided neural network for predicting steel-concrete interfacial behaviors. Expert Syst. Appl. 2022, 198, 116820.
Zheng, S.; Hu, T.Y.; Yu, Y. Interpretable Machine Learning-Based Prediction Model for Concrete Cover Separation of FRP-Strengthened RC Beams. Materials 2024, 17, 1957.
Wang, R.Q.; Huo, Y.P.; Wang, T.; Hou, P.; Gong, Z.; Li, G.D.; Li, C.Y. Machine Learning Method to Explore the Correlation between Fly Ash Content and Chloride Resistance. Materials 2024, 17, 1192.
Haruna, S.I.; Ibrahim, Y.E.; Hassan, I.H.; Al-shawafi, A.; Zhu, H. Bond Strength Assessment of Normal Strength Concrete-Ultra-High-Performance Fiber Reinforced Concrete Using Repeated Drop-Weight Impact Test: Experimental and Machine Learning Technique. Materials 2024, 17, 3032.
Yang, P.X.; Li, C.Q.; Qiu, Y.G.; Huang, S.; Zhou, J. Metaheuristic Optimization of Random Forest for Predicting Punch Shear Strength of FRP-Reinforced Concrete Beams. Materials 2023, 16, 4034.
Abbas, Y.M.; Khan, M.I. Strength of SFRC: Database Compilation, Predictive Analysis, and Empirical Verification. Materials 2023, 16, 7178.
Dahou, Z.; Sbartaï, Z.M.; Castel, A.; Ghomari, F. Artificial neural network model for steel-concrete bond prediction. Eng. Struct. 2009, 31, 1724–1733.
Makni, M.; Daoud, A.; Karray, M.A.; Lorrain, M. Artificial neural network for the prediction of the steel-concrete bond behaviour. Eur. J. Environ. Civ. Eng. 2014, 18, 862–881.
Wang, Y.; Geem, Z.W.; Nagai, K. Bond Strength Assessment of Concrete-Corroded Rebar Interface Using Artificial Neural Network. Appl. Sci. 2020, 10, 4724.
Mousavi, S.M.; Peyma, A.B.; Mousavi, S.R.; Moodi, Y. Predicting the Ultimate and Relative Bond Strength of Corroded Bars and Surrounding Concrete by Considering the Effect of Transverse Rebar Using Machine Learning. Iran. J. Sci. Technol. Trans. Civ. Eng. 2023, 47, 193–219.
Krishnaveni, S.; Rajendran, S. A state of the art on characterization and application of artificial neural networks on bond strength between steel rebar and concrete. Constr. Build. Mater. 2022, 354, 129124.
Jeong, H.; Ji, S.; Kim, J.H.; Choi, S.H.; Heo, I.; Kim, K.S. Development of Mapping Function to Estimate Bond-Slip and Bond Strength of RC Beams Using Genetic Programming. Int. J. Concr. Struct. 2022, 16, 49.
Mangalathu, S.; Hwang, S.H.; Jeon, J.S. Failure mode and effects analysis of RC members based on machine-learning-based SHapley Additive exPlanations (SHAP) approach. Eng. Struct. 2020, 219, 110927.
Xiong, M.L.; Wang, H.W.; Che, C.C.; Lin, R.G. Toward safer aviation: Application of GA-XGBoost-SHAP for incident cognition and model explainability. Proc. Inst. Mech. Eng. Part O J. Risk Reliab. 2023.
Ma, C.L.; Wang, S.X.; Zhao, J.P.; Xiao, X.F.; Xie, C.X.; Feng, X.L. Prediction of shear strength of RC deep beams based on interpretable machine learning. Constr. Build. Mater. 2023, 387, 131640.
Mangalathu, S.; Karthikeyan, K.; Feng, D.C.; Jeon, J.S. Machine-learning interpretability techniques for seismic performance assessment of infrastructure systems. Eng. Struct. 2021, 250, 112883.
Lyngdoh, G.A.; Zaki, M.; Krishnan, N.M.A.; Das, S. Prediction of concrete strengths enabled by missing data imputation and interpretable machine learning. Cem. Concr. Compos. 2022, 128, 104414.
Darwin, D.; Zuo, J.; Tholen, M.L.; Idun, E.K. Development length criteria for conventional and high relative rib area reinforcing bars. ACI Struct. J. 1996, 93, 347–359.
AS 3600: 2009; Concrete Structures. Standards Australia Committee BD-002: Sydney, Australia, 2009.
Hadi, M.N.S. Bond of High Strength Concrete with High Strength Reinforcing Steel. Open Civ. Eng. J. 2008, 26, 143–147.
Naderpour, H.; Rafiean, A.H.; Fakharian, P. Compressive strength prediction of environmentally friendly concrete using artificial neural networks. J. Build. Eng. 2018, 16, 213–219.
Ghanizadeh, A.R.; Amlashi, A.T.; Dessouky, S. A novel hybrid adaptive boosting approach for evaluating properties of sustainable materials: A case of concrete containing waste foundry sand. J. Build. Eng. 2023, 72, 106595.
Ho, T.K. Random decision forests. In Proceedings of the 3rd International Conference on Document Analysis and Recognition, Montreal, QC, Canada, 14–16 August 1995; pp. 278–282.
Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32.
Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016.
Mockus, J.; Tiesis, V.; Zilinskas, A. The application of Bayesian methods for seeking the extremum. Towards Global Optim. 1978, 2, 117–129.
Jones, D.R. A taxonomy of global optimization methods based on response surfaces. J. Global Optim. 2001, 21, 345–383.
Liang, R.; Huang, Y. Development length and bond behavior of lap-spliced reinforcement in Ultra-high performance concrete beams. Eng. Struct. 2023, 291, 116354.
Cairns, J. Bond and anchorage of embedded steel reinforcement in fib Model Code 2010. Struct. Concr. 2015, 16, 45–55.
Tastani, S.P.; Brokalaki, E.; Pantazopoulou, S.J. State of Bond along Lap Splices. J. Struct. Eng. 2015, 141, 04015007.
Hou, L.J.; Xu, P.; Zang, Y.P.; Ouyang, F.; Chen, D.; Zhong, L. Bond behavior between reinforcement and ultra-high toughness cementitious composite in flexural members. Eng. Struct. 2020, 210, 110357.
Lin, H.W.; Zhao, Y.X. Effects of confinements on the bond strength between concrete and corroded steel bars. Constr. Build. Mater. 2016, 118, 127–138.
Seis, M.; Beycioglu, A. Bond performance of basalt fiber-reinforced polymer bars in conventional Portland cement concrete: A relative comparison with steel rebar using the hinged beam approach. Sci. Eng. Compos. Mater. 2017, 24, 909–918.
Bandelt, M.J.; Billington, S.L. Bond behavior of steel reinforcement in high-performance fiber-reinforced cementitious composite flexural members. Mater. Struct. 2016, 49, 71–86.
Petean, A.I.; Sabau, M.; Onet, T. Bond-Slip behavior of self-compacting concrete. Bul. Institutului Politeh. Din Lasi Sect. Constr. Arhit. 2013, 59, 139–146.
Desnerck, P.; De Schutter, G.; Taerwe, L. Bond behaviour of reinforcing bars in self-compacting concrete: Experimental determination by using beam tests. Mater. Struct. 2010, 43, 53–62.
De Almeida, F.M.A.; El Debs, M.K.; El Debs, A.L.H.C. Bond-slip behavior of self-compacting concrete and vibrated concrete using pull-out and beam tests. Mater. Struct. 2008, 41, 1073–1089.
Hamad, B.S.; Machaka, M.F. Effect of transverse reinforcement on bond strength of reinforcing bars in silica fume concrete. Mater. Struct. 1999, 32, 468–476.
Azizinamini, A.; Pavel, R.; Hatfield, E.; Ghosh, S.K. Behavior of lap-spliced reinforcing bars embedded in high-strength concrete. ACI Struct. J. 1999, 96, 826–835.
Zuo, J. Bond Strength of High Relative Rib Area Reinforcing Bars. Doctoral Dissertation, University of Kansas Center for Research, Lawrence, KS, USA, 1998.
Darwin, D.; Tholen, M.L.; Idun, E.K.; Zuo, J. Splice strength of high relative rib area reinforcing bars. ACI Struct. J. 1996, 93, 95–107.
Hester, C.J.; Salamizavaregh, S.; Darwin, D.; Mc Cabe, S.L. Bond of epoxy-coated reinforcement: Splices. ACI Struct. J. 1993, 90, 89–102.
Choi, O.C.; Hadjeghaffari, H.; Darwin, D.; Mccabe, S.L. Bond of epoxy-coated reinforcement-bar parameters. ACI Mater. J. 1991, 88, 207–217.
Azizinamini, A.; Chisala, M.; Ghosh, S.K. Tension development length of reinforcing bars embedded in high-strength concrete. Eng. Struct. 1995, 17, 512–522.
Kucharska, M.; Jaskowska-Lemanska, J. Properties of a bond between the steel reinforcement and the new generation concretes-a review. Conf. Ser. Mater. Sci. Eng. 2019, 603, 42057.
Taffese, W.Z.; Espinosa-Leal, L. Prediction of chloride resistance level of concrete using machine learning for durability and service life assessment of building structures. J. Build. Eng. 2022, 60, 105146.
Lundberg, S.M.; Erion, G.; Chen, H.; DeGrave, A.; Prutkin, J.M.; Nair, B.; Katz, R.; Himmelfarb, J.; Bansal, N.; Lee, S.I. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2020, 2, 56–67.
Wu, Y.; Zhou, Y. Hybrid machine learning model and Shapley additive explanations for compressive strength of sustainable concrete. Constr. Build. Mater. 2022, 330, 127298.
Liu, X.; Liu, Y.; Wu, T.; Wei, H. Bond-slip properties between lightweight aggregate concrete and rebar. Constr. Build. Mater. 2020, 255, 119355.
Teresa, M.; Barbosa, G.; Sanchez Filho, S. Investigation of bond stress in pull out specimens with high strength concrete. Glob. J. Res. Eng. Struct. Eng. 2013, 13, 55–64.
Siempu, R.; Pancharathi, R.K. A study on the parameters influencing flexural bond stress in reinforced concrete. Structures 2018, 16, 198–207.

Figure 1. The distribution of stress in the pull-out test and beam test.

Figure 2. Workflow of Bayesian optimization.

Figure 3. Flowchart for predicting target performance using ML algorithms.

Figure 4. Flowchart of 5-fold cross-validation.

Figure 5. Optimizing history in model training: (a) BO-SVR; (b) BO-RF; and (c) BO-XGBoost.

Figure 6. Comparison of proposed bond strength models with empirical models: (a) Orangun’s model; (b) Darwin’s model; (c) Hadi’s model; (d) ACI 408R; (e) AS 3600; (f) BO-SVR; (g) BO-RF; (h) BO-XGBoost.

Figure 7. Input parameters influencing bond strength.

Figure 8. SHAP summary plot for bond strength model.

Table 1. Existing bond strength equations.

Authors	Equations
Orangun et al. [9]	$τ_{u} = 0.083045 \sqrt{{f_{c}}^{'}} [1.2 + 3 (c / d_{b}) + 50 (d_{b} / L)]$
Darwin et al. [31]	$τ_{u} = 0.083045 \sqrt{{f_{c}}^{'}} [(1.06 + 2.12 (c / d_{b})) (0.92 + 0.08 (c_{max} / c_{min})) + 75 (d_{b} / c_{min})]$
Hadi [33]	$τ_{u} = 0.083045 \sqrt{{f_{c}}^{'}} [22.8 - 0.208 (c / d_{b}) - 38.212 (d_{b} / L)]$
ACI 408R-03 [1]	$\frac{τ_{u}}{\sqrt{{f_{c}}^{'}}} = 0.33 + 0.025 (c / d_{b}) + 8.3 (d_{b} / L)$
AS 3600 [32]	$τ_{u} = 0.265 \sqrt{{f_{c}}^{'}} [0.5 + (c / d_{b})]$

Note: ${f_{c}}^{'}$ represents the cylinder compressive strength of concrete in MPa, c represents the minimum concrete cover in mm, $d_{b}$ refers to the diameter of the reinforcing bar, and L indicates the embedded length.
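The closed-form equations in Table 1 can be evaluated directly. The sketch below transcribes four of them term by term as printed (Orangun, the equation attributed to reference [33], ACI 408R-03, and AS 3600). Darwin's equation is omitted only because it needs the additional inputs $c_{max}$ and $c_{min}$. The function names and the sample specimen are hypothetical, and the bar diameter and embedded length are assumed to be in mm like the cover.

```python
import math

# fc: cylinder compressive strength (MPa); c: minimum cover (mm);
# db: bar diameter (mm, assumed); L: embedded length (mm, assumed).

def orangun(fc, c, db, L):
    """Orangun et al. [9], as printed in Table 1."""
    return 0.083045 * math.sqrt(fc) * (1.2 + 3.0 * (c / db) + 50.0 * (db / L))

def ref33(fc, c, db, L):
    """Equation attributed to reference [33], as printed in Table 1."""
    return 0.083045 * math.sqrt(fc) * (22.8 - 0.208 * (c / db) - 38.212 * (db / L))

def aci_408r(fc, c, db, L):
    """ACI 408R-03 [1], rearranged to give the bond strength directly."""
    return math.sqrt(fc) * (0.33 + 0.025 * (c / db) + 8.3 * (db / L))

def as_3600(fc, c, db):
    """AS 3600 [32]; the printed equation has no embedded-length term."""
    return 0.265 * math.sqrt(fc) * (0.5 + (c / db))

# Hypothetical specimen: fc' = 36 MPa, c = 30 mm, db = 20 mm, L = 200 mm.
fc, c, db, L = 36.0, 30.0, 20.0, 200.0
for name, value in [
    ("Orangun", orangun(fc, c, db, L)),
    ("Ref. [33]", ref33(fc, c, db, L)),
    ("ACI 408R-03", aci_408r(fc, c, db, L)),
    ("AS 3600", as_3600(fc, c, db)),
]:
    print(f"{name}: {value:.3f} MPa")
```

Even for a single specimen the four equations return noticeably different values, which is consistent with the scatter among empirical models reported in Table 5.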

Table 2. Summary information on the variables.

Notation	Unit	Variables	Type (ML-6)	Type (ML-3)
$\sqrt{{f_{c}}^{'}}$	-	Square root of the compressive strength of concrete	Input	Input
$f_{y}$	MPa	Yield strength of the reinforcing bars	Input	Input
$d_{b}$	mm	Diameter of the rebars	Input	Input
$c_{b} / d_{b}$	-	Concrete cover to rebar diameter ratio	Input	-
$l / d_{b}$	-	Development length to rebar diameter ratio	Input	-
$h / d_{b}$	-	Height of specimen to rebar diameter ratio	Input	-
$τ_{u}$	MPa	Bond stress	Output	Output
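As a minimal sketch of how the inputs in Table 2 could be assembled from raw specimen properties, the helper below builds both the six-variable (ML-6) and three-variable (ML-3) feature sets. The function name, argument names, and dictionary keys are hypothetical, and the sample values are illustrative only.

```python
import math

def build_features(fc, fy, db, cb, l, h):
    """Build ML-6 and ML-3 inputs from raw specimen properties.

    fc: concrete compressive strength (MPa); fy: rebar yield strength (MPa);
    db: rebar diameter (mm); cb: concrete cover (mm);
    l: development length (mm); h: specimen height (mm).
    """
    ml6 = {
        "sqrt_fc": math.sqrt(fc),   # square root of compressive strength
        "fy": fy,
        "db": db,
        "cb_over_db": cb / db,      # cover to diameter ratio
        "l_over_db": l / db,        # development length to diameter ratio
        "h_over_db": h / db,        # specimen height to diameter ratio
    }
    # ML-3 keeps only the three inputs marked for it in Table 2.
    ml3 = {key: ml6[key] for key in ("sqrt_fc", "fy", "db")}
    return ml6, ml3

ml6, ml3 = build_features(fc=36.0, fy=500.0, db=20.0, cb=30.0, l=200.0, h=400.0)
print(ml6)
print(ml3)
```

Normalizing the geometric quantities by the bar diameter keeps three of the six inputs dimensionless, so ML-3 is simply the subset that a designer can read off without specimen geometry.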

Table 3. The main parameters of comparison models (ML-6).

Algorithm	Initial Basic Parameters	After Bayesian Optimization
SVR	C = 1; gamma = 1; kernel = ’linear’; degree = 3; coef0 = 0; tolerance = 1 × 10⁻³; C = 1; epsilon = 0.1; shrinking = True.	C = 11; gamma = 2; kernel = ’linear’; degree = 3; coef0 = 0; tolerance = 1 × 10⁻³; C = 1; epsilon = 0.1; shrinking = True.
RF	n of estimators = 30; max depth = 3; criterion = ’squared error’; min samples split = 2; min samples leaf = 1; random state = 1	n of estimators = 52; max depth = 10; criterion = ’squared error’; min samples split = 2; min samples leaf = 1; random state = 1.
XGBoost	n estimators = 30; learning rate = 0.1; max depth = 3; objective = ’linear’; booster = ’gbtree’; min child weight = 1; subsample = 1; colsample bytree = 1; alpha = 0; lambda = 1.	n estimators = 99; learning rate = 0.1121; max depth = 4; objective = ’linear’; booster = ’gbtree’; min child weight = 1; subsample = 1; colsample bytree = 1; alpha = 0; lambda = 1.

Table 4. Summary of prediction results in training set and test set.

Proposed Models	Training R²	Training MAE (MPa)	Training RMSE (MPa)	Test R²	Test MAE (MPa)	Test RMSE (MPa)
SVR	0.54	1.203	2.073	0.44	1.755	3.189
RF	0.85	0.882	1.191	0.79	1.231	1.949
XGBoost	0.90	0.710	0.977	0.81	1.160	1.865
BO-SVR	0.61	1.132	1.896	0.52	1.715	2.960
BO-RF	0.96	0.367	0.595	0.85	0.947	1.621
BO-XGBoost	0.97	0.364	0.550	0.87	0.897	1.516

Table 5. Summary of bond strength prediction in all datasets.

Model	Mean ( $V_{n,exp} / V_{n,pred}$ )	CoV ( $V_{n,exp} / V_{n,pred}$ )	$R^{2}$	MAE (MPa)	RMSE (MPa)
Orangun [9]	1.128	0.502	0.62	1.742	2.896
Darwin [31]	1.125	0.403	0.72	1.506	2.573
Hadi [33]	0.517	0.633	0.59	7.068	7.858
ACI 408R-03 [1]	0.971	0.402	0.84	1.654	2.538
AS 3600 [32]	1.694	0.902	0.60	2.400	4.158
BO-SVR	1.161	1.268	0.60	1.249	2.109
BO-RF	0.993	0.109	0.94	0.483	0.800
BO-XGBoost	0.994	0.109	0.95	0.470	0.743
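The first two columns of Table 5 summarize the ratio of experimental to predicted values. A minimal sketch of that calculation follows, assuming the coefficient of variation is the sample standard deviation of the ratios divided by their mean. The source does not state whether a sample or population standard deviation was used, and the function name and data are illustrative.

```python
import statistics

def ratio_stats(experimental, predicted):
    """Return (mean, CoV) of the experimental-to-predicted ratios."""
    if len(experimental) != len(predicted) or len(experimental) < 2:
        raise ValueError("need at least two paired values")
    ratios = [e / p for e, p in zip(experimental, predicted)]
    mean = statistics.fmean(ratios)
    # Assumption: sample standard deviation (n - 1 in the denominator).
    cov = statistics.stdev(ratios) / mean
    return mean, cov

# Made-up bond strengths in MPa, for illustration only.
experimental = [5.0, 6.0, 9.0]
predicted = [5.0, 5.0, 10.0]
mean_ratio, cov_ratio = ratio_stats(experimental, predicted)
print(f"mean = {mean_ratio:.3f}, CoV = {cov_ratio:.3f}")
```

A mean ratio above 1 indicates that the model underpredicts on average, while a small CoV indicates that the ratio is stable across specimens.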

Table 6. Values of hyperparameters of ML-3 models.

Proposed Models	Value of Hyperparameters
BO-SVR	C = 15; gamma = 13; kernel = ’linear’; degree = 3; coef0 = 0; tolerance = 1 × 10⁻³; C = 1; epsilon = 0.1; shrinking = True.
BO-RF	n of estimators = 105; max depth = 6; criterion = ’squared error’; min samples split = 2; min samples leaf = 1; random state = 1.
BO-XGBoost	n estimators = 32; learning rate = 0.2034; max depth = 3; objective = ’linear’; booster = ’gbtree’; min child weight = 1; subsample = 1; colsample bytree = 1; alpha = 0; lambda = 1.
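For readers who wish to reuse the tuned values, the following sketch writes the Table 6 hyperparameters of the two tree-based ML-3 models as Python dictionaries. It is an assumption that the table entries map onto the keyword arguments of scikit-learn's `RandomForestRegressor` and XGBoost's `XGBRegressor`. The dictionary names are hypothetical, and the objective listed as ’linear’ in Table 6 is assumed here to mean squared-error regression.

```python
# Assumed mapping of Table 6 onto scikit-learn RandomForestRegressor keywords.
BO_RF_ML3_PARAMS = {
    "n_estimators": 105,
    "max_depth": 6,
    "criterion": "squared_error",
    "min_samples_split": 2,
    "min_samples_leaf": 1,
    "random_state": 1,
}

# Assumed mapping of Table 6 onto XGBoost XGBRegressor keywords.
BO_XGB_ML3_PARAMS = {
    "n_estimators": 32,
    "learning_rate": 0.2034,
    "max_depth": 3,
    "objective": "reg:squarederror",  # assumption: Table 6 lists 'linear'
    "booster": "gbtree",
    "min_child_weight": 1,
    "subsample": 1.0,
    "colsample_bytree": 1.0,
    "reg_alpha": 0.0,   # listed as alpha in Table 6
    "reg_lambda": 1.0,  # listed as lambda in Table 6
}

for name, params in [("BO-RF", BO_RF_ML3_PARAMS), ("BO-XGBoost", BO_XGB_ML3_PARAMS)]:
    print(name, params)
```

If the libraries are installed, the dictionaries could be unpacked as `RandomForestRegressor(**BO_RF_ML3_PARAMS)` and `XGBRegressor(**BO_XGB_ML3_PARAMS)`. Only the values tuned by Bayesian optimization differ from the initial settings shown for ML-6 in Table 3.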

Table 7. Summary of ML-3 prediction results.

Proposed Models	Training R²	Training MAE (MPa)	Training RMSE (MPa)	Test R²	Test MAE (MPa)	Test RMSE (MPa)
BO-SVR	0.25	1.695	2.644	0.15	2.296	3.941
BO-RF	0.86	0.852	1.113	0.70	1.450	2.347
BO-XGBoost	0.85	0.910	1.196	0.74	1.412	1.516

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

