Next Article in Journal
Correction: Liu et al. The Impact of Commercial Credit on Firm Innovation: Evidence from Chinese A-Share Listed Companies. Sustainability 2022, 14, 1481
Previous Article in Journal
Heterogeneity Analysis of Industrial Structure Upgrading on Eco-Environmental Quality from a Spatial Perspective: Evidence from 11 Coastal Provinces in China
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Prediction of Service Life of Thermoplastic Road Markings on Expressways

1
College of Transportation, Shandong University of Science and Technology, Qingdao 266590, China
2
Shandong High Speed Maintenance Group Co., Ltd., Jinan 250032, China
3
Institute of High Performance Computing (IHPC), Agency for Science Technology and Research (A*STAR), 3 Fusionopolis Way, #16-16 Connexis, Singapore 138632, Singapore
*
Authors to whom correspondence should be addressed.
Sustainability 2023, 15(21), 15237; https://doi.org/10.3390/su152115237
Submission received: 13 September 2023 / Revised: 13 October 2023 / Accepted: 19 October 2023 / Published: 25 October 2023

Abstract

:
Currently, historical data and on-site surveys—particularly in the context of China—are heavily relied upon to determine the best time to maintain expressway road markings. This study aims to determine what influences the service life of thermoplastic road markings on expressways in Shandong Province, China, while considering both those motorways’ unique characteristics and the local environment. Additionally, a scientific evaluation of the road markings’ retroreflective coefficient’s decay pattern will be undertaken. We collected the retroreflective data for twelve consecutive months regarding the thermoplastic road markings on five expressways and potential influencing factors such as age of marking and annual average daily traffic. The service life of the markings was forecast using a multiple linear regression. Dominance analysis was used to quantitatively analyze each explanatory factor’s impact on the service life of the markings, and statistically significant variables were also found. Using LightGBM, a machine learning technique, a nonparametric prediction model was also created based on examining the relevance of influencing elements. The modeling results show that LightGBM generates an R2 of 0.942, implying that it offers better interpretability and higher accuracy than the regression-based approach. Additionally, LightGBM outperforms MLR according to final validation accuracies, with a score of 95.02% or more than 8% that of MLR. The results are useful for expressway marking upkeep and for driving safety.

Graphical Abstract

1. Introduction

Road markings are one of the most important components of expressways. They convey traffic information such as guidance, restrictions, and warnings to traffic participants. Due to their exceptional abrasion and weather resistance, high construction efficiency, cost-effectiveness, and prolonged service life, thermoplastic road markings have emerged as one of the most extensively utilized road marking solutions on expressways at present [1].
Road markings with higher visibility improve drivers’ attention to the road, while drivers tend to take their gaze off the road and look for other references when road markings are less visible [2]. Road marking visibility is even more important at night, as drivers can only rely on the area illuminated by their vehicle’s headlights to keep their vehicle moving safely. The visibility of road markings at night is determined by the core index of the retroreflectivity coefficient. Where this coefficient is larger, the higher is the visibility [3] and the more drivers are able to see the road and lane contours clearly, thus reducing the probability of traffic accidents [3].
Under the influence of various factors, the retroreflectivity of road markings may diminish, affecting a driver’s visual recognition of road markings and creating traffic safety hazards [4]. Traffic accidents and congestion are often related to drivers’ difficulty in the visual recognition of road markings, especially at night, in bad weather or in low light conditions. Therefore, by scientifically studying the law of reflectivity attenuation, we can develop more accurate and efficient maintenance plans, ensure good visibility of road marking, and reduce the risk of accidents. In addition, scientific marking maintenance can also reduce the waste of resources. At present, the maintenance time of expressway marking mostly relies on historical experience and field surveys, which leads to maintenance that is either early or too late, thus wasting human, material, and financial resources. By understanding the attenuation law of the reverse reflection coefficient and making accurate life cycle prediction, the maintenance work can be rationally arranged to extend the service life of road markings to their maximum extent and improve maintenance efficiency.
Life prediction is of great significance in the promotion of the sustainable development of various industries. Nagoor Basha Shaik et al. (2020) have evaluated the life of crude oil pipelines and natural gas pipelines and have proposed a feedforward inverse propagation network (FFBPN) method by which to build a model for the prediction of the remaining service life of crude oil pipelines. The accuracy and robustness of the model showed very satisfactory results. Nagoor Basha Shaik et al. (2022) proposed a recurrent neural network (RNN)-based method for predicting the life of equipment with corrosion dimension classes exposed to critical condition monitoring has been proposed, and the life prediction of dry gas pipeline has been realized. The above studies help to assess the life condition of existing pipelines, can reduce annual maintenance costs, and help to take necessary measures to better protect pipelines and safety [5,6]. Liu Mingjun et al. (2023) have established a new nonlinear Wiener degradation model to characterize the degradation process of slewing supports. The developed model can be effectively applied to the degradation process of fan slewing supports and contribute to the sustainable development of wind power generation [7]. Similarly, it is of great significance to scientifically predict the service life of expressway marking for the sustainable development of expressways and driving safety.
The service life of a marking is the time required to reduce the value of the retroreflectivity of a newly laid marking to a minimum standard. The minimum reverse reflection coefficient standard of China’s thermoplastic road marking is 80 mcd·m−2·lx−1. If the coefficient of a segment of markings is lower than the level mentioned above, they must be re-laid [8,9]. The service life of markings is mainly based on their retroreflectivity [9]. Numerous scholars have primarily focused on retroreflectivity and have conducted corresponding research on road markings [10,11,12]. In terms of predicting the service life of markings, the main research approach is to establish parameterized predictive models [13,14,15,16,17,18,19].
Previous studies have shown the importance of reflectance in studying the life cycle of road markings. Zhang et al. (2006) have stated that road markings must exhibit adequate visibility throughout their life cycle. When analyzing the performance and cost-effectiveness of road marking materials, a critical factor is the retroreflectivity [10]. Dario Babić et al. (2017) compared dynamic and static methods for measuring the retroreflectivity of markings. They found that the inverse reflectance coefficients obtained would vary with different measurement methods and suggested using dynamic or dynamic–static combination measurement methods [11]. Bahar et al. (2006) have described the National Transportation Product Evaluation Program (NTPEP) dataset that is used to study road markings in the U.S. The NTPEP dataset contains the following influencing factors: age of marking, marking color, marking material, traffic volume, roadway surface material, climate, and frequency of snow removal. They have noted that the presence and visibility of markings significantly impacts driving safety [12].
For the studies regarding predictive models of service life, Sitzabee et al. (2009) developed a linear attenuation model for the retroreflectivity of thermoplastic road markings in North Carolina. The influencing factors of the model were the age of marking, initial retroreflective value, annual average daily traffic (AADT), marking color, and marking location. The model has an R2 value of 0.6 [13]. Luana Ozelim et al. (2014) established a linear prediction model for white and yellow road markings based on regional conditions in Alabama. The factors considered were the age of marking and AADT and the model R2 was 0.448 and 0.488 [14]. Chieh Wang et al. (2016) investigated the service life prediction method of markings under different weather conditions in winter. Segmented multivariate linear modeling was proposed to explain the impact of winter weather events explicitly [15]. Dario Babić, Andelko Scukanec et al. (2019) developed linear models for predicting the service life of three road markings based on the age of the markings, initial retroreflectivity, marking location and AADT for ambient solvent-based coatings, thermoplastic coatings, and cold-plasticized coatings [16].
In addition to developed linear models, Adam M. Pike et al. (2015) proposed a markings service life index model that considers factors such as age of marking, marking material, and pavement type [17]. Deo Chimba et al. (2018) compared the Markov Chain (MC) model with a linear model and found that the attenuation of road marking retroreflectivity showed an exponential curvilinear trend, with decreased rates of attenuation as time increased [18]. Maged Mohamed et al. (2020) used a three-wheel polishing device (TWPD) and a weather monitor to study the performance indicators (retroreflectivity, color change and durability) of waterborne and thermoplastic road marking. Their results show that the durability of thermoplastic road markings obeyed a linear degradation function, and the rest of the performance indicators were logarithmically degraded under different loads [19]. Momen R. Mousa et al. (2021) used the NTPEP dataset to develop prediction models for high dimensionality and high multicollinearity between the variables and used the CatBoost algorithm for transverse and longitudinal markings separately, with the model accuracy being higher than that of previous studies [20].
Dong Kai (2009) took the reflective brightness of different functional markings on roads in Beijing as an example from which to analyze the remaining effective life of the markings. It is proposed that the loss of marking luminance has a very important relationship with the function and location of the marking [21]. Hao Xiaoyan et al. (2019) analyzed the visibility decay law of expressway markings in Shanxi Province and established an exponential model for the decay of road markings’ retroreflectivity over time, and the model R2 values in the range of 0.177–0.475 [22]. Huang Kai and Wang Wenhui (2020) explored the correspondence between road marking life and traffic volume and preliminarily explored the estimation curves and corresponding equations of marking life under Class 1–3 service levels [23]. Yang Furong and Gao Ziqiang (2022) compared the performance decay characteristics of two-component, thermoplastic and water-based pavement markings under actual working conditions. They established the respective life prediction models, but the consideration factor was only the age of marking, and was combined with the use of the cost and the service life of the markings to give a proposal for the paving of markings in different situations [24].
Table 1 summarizes relevant studies found in the literature and presents the influencing factors included in the model.
To summarize, the research on the prediction of road marking service life mainly focuses on modeling road marking retroreflectivity attenuation under the consideration of different influencing factors. However, there are no fixed criteria for determining influencing factors, and more studies have used the establishment of linear, exponential, and logarithmic models. There has been little research on this aspect in China [21,22,23,24]. Current research fails to consider other, more complex, factors, such as climatic conditions, partly due to limited data availability. An inability to grasp important influencing factors and the use of relatively simple prediction methods result in a smaller-than-expected R2, poor interpretation, and low prediction accuracy. This paper attempts to fill these gaps by analyzing the important factors affecting the service life of the markings, establishing a prediction model, and striving to improve the prediction accuracy.

2. Methodology

In assessing the magnitude of influence exerted by diverse factors on the service life of road markings, conventional single-factor models, such as simple linear regression and exponential models, are deemed inadequate because of the large number of influencing factors. The logarithmic model limits the types of independent variables and can only analyze the relationship between categorical variables. Consequently, the above similar methods are not applicable in this study. Multiple linear regression serves as a valuable tool for scrutinizing the causal connection between dependent variables and multiple independent variables, unhampered by any constraint on the nature of the independent variables under consideration. Notably, this approach facilitates the preliminary assessment of whether independent variables exert a significant influence on dependent variables, an observation that finds validation in prior research endeavors [13,14,16]. However, it is not rigorous enough to analyze the relative importance of each influencing factor by relying only on multiple linear regression because the relative importance of the influencing factors may change with the change of the sub-models derived from the full model. The method of dominance analysis effectively addresses this issue. One of this method’s most notable merits is its comprehensive examination of all conceivable sub-model scenarios, allowing for the assessment of the relative significance of each influencing factor.
Because most previous studies used only multiple linear regression parameterized models for their predictions, the predictive accuracy and efficacy of the models have been suboptimal. A possible reason for this is that the high dimensional problem between variables is not solved, and the analysis of the nonlinear effects of the independent variables is also missing. This conundrum can be deftly resolved through the implementation of non-parametric methodologies. The LightGBM (a machine learning method) algorithm uses an efficient feature-splitting strategy and parallel computation, which greatly improves the training speed of the model, especially for large-scale data sets and high-dimensional feature spaces. It can also continuously improve the model’s predictive ability during training, and optimize the model through gradient lifting techniques, thus obtaining high accuracy in classification and regression tasks. The non-parametric method based on LightGBM can solve the high-dimensional problem between variables and can reflect the nonlinear influence of independent variables more accurately. It has been applied to passenger flow and navigation speed predictions in transportation [25,26]. The model achieves not only a high prediction accuracy, but also ensures strong generalization ability and fast calculation speed.
Therefore, to determine the important influencing factors affecting the service life of the markings and further improve the prediction accuracy, this study used multiple linear regression to preliminarily analyze the influence of various factors on the service life of markings and established a parameter prediction model. Based on the derived parameterized prediction model, the important influencing factors determining the service life of the markings were identified through the method of dominance analysis. Based on the important influencing factors, LightGBM was used to train the non-parametric prediction model.

2.1. Multiple Linear Regression

The multiple linear regression method is an analytical method by which to study the influence of multiple independent variables on dependent variables. Through the significance test (p-value), we can determine whether independent variables have a significant influence on dependent variables. The Statistical Package for the Social Sciences (SPSS R26.0.0.0) software is used to process and analyze the collected data, analyze the relationship between the influencing factors and the attenuation of the retroreflective coefficient of the markings, and establish a parametric markings service life prediction model. The model solely encompasses variables that exhibit statistical significance exceeding the 95% threshold.

2.2. Dominance Analysis

The primary approach of the dominance analysis is to establish a regression model, known as the full model, based on empirical research. All derived sub-models are generated from the full model, amounting to 2^p−1 (where p denotes the number of independent variables in the full model). After conducting a comprehensive comparative analysis of all of the sub-models, the relative importance of each influencing factor is determined. The criterion for assessing the magnitude of influence is the percentage of each independent variable’s overall average contribution to the model variance among all sub-models. By employing this criterion, no exaggerated or overlooked importance of any independent variable in relation to the dependent variable occurs. Consequently, this method resolves the issues associated with traditional approaches that rely on slope-based indicators and variance-reduction indicators for determination [27].
This study focuses on the retroreflectivity of the markings as the dependent variable. It calculates the change in R-squared when each influencing factor is incorporated into the sub-model equations that do not include that specific factor. The cumulative average of these changes in R-squared represents the value-added contribution of the influencing factor. The calculation formula is as follows:
C x i ( k ) = ( R y x h x i 2 ) / ( k P 1 ) x i x h k
where x i is one of the influences in the full model; C x i ( k ) is the average contribution of the explained influence factors when x i is added to the sub-model containing each influence factor of k but excluding x itself; and x h is the other k influences included in the sub-model in addition to the influences x i .
Therefore, the total average contribution of the influencing factors to the dependent variable is calculated as follows:
C x i = 1 P k = 0 p 1 c x i ( k )
The overall average contribution of each influencing factor reflects its degree of importance.

2.3. LightGBM Algorithm

2.3.1. The Basic Principle of Gradient Boosting Decision Trees (GBDT)

The main idea of GBDT as an algorithm is that it should iteratively train weak classifiers with the objective function of reducing residuals, and ultimately should linearly combine the weak classifiers through an additive model to classify or regress data. When GBDT performs regression analysis, the residuals of the predecessor prediction are fitted to the predecessor prediction residuals by continuously adding new learners, which makes the residuals of the subsequent learners decrease continuously, thus obtaining a model with higher accuracy.

2.3.2. LightGBM Algorithm

The LightGBM algorithm is a framework that implements the idea of GBDT algorithm and belongs to one of the boosting integration algorithms. Boosting is a branch of machine learning that integrates learning algorithms and is the classical machine learning algorithm currently used for prediction [26]. The LightGBM algorithm, on the other hand, uses gradient-based one-side sampling (GOSS), exclusive feature bundling (EFB) and a histogram-based algorithm to further improve on XGBoost [28]. Figure 1 depicts the predictive analytics process based on the LightGBM algorithm.

3. Data Sources and Description

3.1. Selection of Influencing Factors

This study referred to the previous relevant literature to select influencing factors [16,17,18]. We examined whether widely validated influencing factors, such as the age of the markings, the road marking pattern and the volume of traffic, were suitable for the characteristic conditions of the expressways under study.
Furthermore, we hold great esteem for the insights provided by industry practitioners, be they the dedicated personnel engaged in expressway marking maintenance or the adept technicians tasked with measuring and formulating marking maintenance strategies.
The influencing factors selected in this study refer to previous research and the feedback of industry practitioners, and thus have a certain reliability. Compared with previous studies, this study proposes two factors that influence the maximum road speed limit and the glass bead content. The average monthly temperature and average monthly precipitation are used to represent the influence of climate conditions. The discussion of the selection of influencing factors is given below.
  • Maximum road speed limit
There are differences in the maximum speed limits of different expressways and the driving conditions of vehicles under different speed limits are different, as are the degrees of wear and tear on the road markings.
  • Annual average daily traffic
The higher the traffic volume, the more the wheels run over the road markings, increasing the wear and tear on the markings and affecting their service life [29].
  • Initial retroreflectivity
Under the same conditions, the higher the initial reverse reflection coefficient of the road markings, the longer the attenuation time to the minimum required reverse reflection coefficient.
  • Age of marking
The age of marking is the time between the completion of laying and each measurement. The longer the same marking is applied, the shorter the remaining service life.
  • Glass bead content
The thermoplastic road marking relies mostly on the glass beads contained within the reflective light source. Within a certain range, the higher the content of glass beads, the higher the retroreflectivity of the markings.
  • Road marking pattern
The most common types of markings on expressways are dashed lines and solid lines, and different types of lines have different functions and different levels of wear and tear.
  • Average monthly temperature and average monthly precipitation
The long-term exposure of road markings to the outdoors—to wind, sun, rain and snow corrosion—will have a certain impact on the marking, affecting its service life.
Through the above discussion, eight influencing factors that affected the service life of expressway thermoplastic road markings were determined, including the maximum speed limit of the road, the average daily traffic volume, the road marking pattern, the initial retroreflectivity, the age of marking, the glass bead content, the average monthly temperature, the average monthly precipitation and the average monthly precipitation. As shown in Table 2.

3.2. Data Collection

3.2.1. Data Sources

Five expressways in Shandong Province were selected to collect data on the retroreflectivity of thermoplastic road markings, road marking pattern, initial retroreflectivity, age of marking, glass bead content and AADT flow for 12 consecutive months in the field. Maximum road speed limits are derived from publicly available data on government expressway speed limits [30]. Average monthly temperature and average monthly precipitation were obtained from the 1 km-resolution monthly mean air temperature and monthly mean precipitation datasets of China from 1901 to 2022 from the National Earth System Science Data Center (NESDC) [31].

3.2.2. Equipment for Collecting Retroreflectivity Data

To improve the accuracy of the data, a combination of dynamic and static measurements was used to collect the relevant retroreflective data. The applied equipment included a Zehntner vehicle-mounted retroreflectometer and a Jingqu RP-R12, RP-R18 marking retro-reflectometer respectively. All of the equipment had an angle of incidence of 88.76° and an observation angle of 1.05°. Dynamic and static methods were used to measure the left and right solid lines and dashed lanes of the selected road section, and the average value of the two measurements was the inverse reflection coefficient of the real and dashed lines of the road section. The dynamic vehicle-mounted device is shown in Figure 2 and the static handheld devices are shown in Figure 3.

3.2.3. Road Sections for Data Collection

The expressways names and section information are shown in Table 3. Thirty miles of each segment of the expressway were randomly selected and data on eight variables were collected using the method described above. The collected data correspond to each variable and were used to obtain a preliminary statistical database. Some of the collected data are shown in Table 4 and Table 5.

3.2.4. Data Pre-Processing

The collected data necessitates preprocessing prior to direct application. Data preprocessing includes data cleaning and data normalization. Noisy samples in the raw data can be identified and processed through data cleansing. Data cleansing through Python mainly uses the isnull and fillna functions to detect and fill lost values, respectively, deleting outliers that are larger than a specific standard deviation value, and using the duplicated and drop_duplicates functions to detect and delete duplicate data, respectively. Normalization of the data by Equation (3) removes the effect of magnitude.
X j = X X min X max X min
where X j is the result of normalizing the parameter X and X min and X max are the minimum and maximum values of X , respectively.

4. Results and Discussion

4.1. Predictive Modeling of Markings Service Life Based on Multiple Linear Regression

4.1.1. Diagnosis of Independent Variable Multicollinearity

The multicollinearity diagnosis results are shown in Table 6. A general VIF value of more than 5 indicates a possible multicollinearity problem.
Because the model only includes variables with more than 95% statistical significance, the test of significance shows that the maximum speed limit is not statistically significant at p = 0.261 > 0.05, so this variable is removed. After removing this variable, the VIF values for each independent variable remained below 5, indicating that multicollinearity is not a concern and is within an acceptable range.

4.1.2. Model Summary and Analysis of Variance (ANOVA)

The results of the goodness-of-fit test and residual independence test are shown in Table 7, the model’s goodness-of-fit R2 is 0.599, and the model table is relatively good; the Durbin–Watson value is 1.035, and the result is greater than 1 and between 0 and 4, which is consistent with independence.
The results of the ANOVA are shown in Table 8, F = 126.527 in this model with a significance of p < 0.001, which leads to a reduction in the residual variance and indicates that the modeling is successful.

4.1.3. Modeling

The regression analysis results are shown in Table 9, and the resulting prediction model for markings service life is as follows.
R L = 148.261 11.241 × t 10.771 × T A a d + 59.848 × S L + 0.743 × R L , i n i t i a l         + 1.88 × T m a v g 0.059 × R f m a v g 28.392 × C G l a s s B
where R L is the retroreflectivity of existing road markings, mcd·m−2·lx−1; t is the age of marking, in months; T A a d is the annual average daily traffic, in cars; S L is the road marking pattern, with categorical variables that are represented by a dashed line (noted as 0) and a solid line (noted as 1); R L , i n i t i a l is the initial retroreflectivity, mcd·m−2·lx−1; T m a v g is the average monthly temperature, in °C; R f m a v g is the average monthly precipitation, in mm; and C G l a s s B is the percentage of glass bead content.
The histograms of regression normalized residuals and the P–P plots are shown in Figure 4. Figure 4a,b shows the distribution of residual errors of the model. The dispersion of residual errors or their deviation from the normal distribution can be ignored, thus showing a normal distribution and indicating that the requirements of linear regression are met.
Figure 5 shows the relationship between the model regression residuals and the regression standardized values. The expectation with a scatter plot is that the results should be a set of uniformly distributed data points with a mean of zero. In this study, the data points have a good distribution around the mean value, indicating a symmetrical distribution up and down. The distribution characteristics do not change with the increase in value, and the conditions of homogeneity and independence of data variance are thus met.

4.2. Judgment of Important Influencing Factors of Service Life of Markings Based on Dominance Analysis

As can be seen from Table 9, “Age of marking”, “AADT”, “Road marking pattern”, “Initial retroreflectivity”, “Glass bead content” and “Average monthly precipitation” have a significant effect on the service life of the markings (p < 0.01), “Average monthly temperature” has an effect on the service life of the markings (0.01 < p < 0.05). The above influencing factors are expressed as X 1 , X 2 , X 3 , X 4 , X 5 , X 6 , X 7 in that order, and their respective value-added contributions and total average contributions are calculated.
The value-added contribution and total average contribution of the above seven influencing factors are calculated from Equations (1) and (2), and the results are shown in Table 10.
From the previous section, we can see that as many as 127 (27−1) sub-models are derived from the seven influences. Due to space constraints, it is not possible to list all of them, only the value-added contribution of each influencing factor after consolidation. Table 10 shows that when k = 0, the value-added contribution of each influencing factor is ranked as follows: X 1 > X 3 > X 5 > X 2 > X 6 > X 4 > X 7 ; when k = 1–4, the rankings are: X 1 > X 3 > X 5 > X 6 > X 2 > X 4 > X 7 ; when k = 5, the order is: X 1 > X 3 > X 6 > X 2 > X 4 > X 5 > X 7 ; when k = 6, the order is: X 1 > X 3 > X 4 > X 2 > X 7 > X 6 > X 5 . Thus, X 1 is completely dominant over X 3 and X 3 is completely dominant over X 2 , X 4 , X 5 , X 6 , X 7 [23]. The remaining variables are compared with the total average contribution because the value-added contribution varies with the inclusion of different influencing factors. Finally, the influence degrees of each influencing factor on the service life of the thermoplastic road markings are obtained in descending order: X 1 age of marking (0.258), X 3 road marking pattern (0.128), X 5 glass bead content (0.063), X 6 average monthly precipitation (0.062), X 2 annual average daily traffic (0.047), X 4 initial retroreflectivity (0.035), and X 7 average monthly temperature (0.018). The total average contribution of each variable explaining or influencing the dependent variable as a percentage of the known variance is, in order, X 1 (42.3%), X 3 (21%), X 5 (10.4%), X 6 (10.1%), X 2 (7.7%), X 4 (5.7%), X 7 (2.9%). The results show that the most important factors affecting the service life of expressway thermoplastic road markings are the age of marking and the road marking pattern, followed by the glass bead content, the average monthly precipitation and the AADT, and finally the initial retroreflectivity and the average monthly temperature.

4.3. Judgment of Important Influencing Factors on the Service Life of Markings Based on Dominance Analysis

The machine learning prediction model is trained using the LightGBM algorithm, encompassing the following steps in the training process.

4.3.1. Identification of Variables and Data Collection

Based on the results of multiple linear regression and dominance analysis, the significant influences on the service life of markings are used as variables for model training. The data are collected in the same manner as above.

4.3.2. Data Pre-Processing

By means of data cleansing, it is possible to identify and address noisy samples present in the raw data. Furthermore, the normalization process, as denoted by Equation (3), facilitates the elimination of dimensional influence on the data. Table 11 shows the pre-processed data.

4.3.3. The Partitioning of the Dataset

The dataset is partitioned into a training set and a testing set, with the training set used to train the model and the testing set used to validate and evaluate the model’s accuracy. The data set is split using a random partitioning approach, where 70% of the samples are randomly selected from the preprocessed dataset to compose the training set, while the remaining 30% of the samples form the testing set.

4.3.4. Model Training

The computer configuration for model training and validation is a 64-bit Windows 11 operating system with 16 GB of operating memory and an AMD Ryzen5 5600 4.50 GHz processor. The LightGBM algorithm is invoked on the training set data using the sklearn interface of the Python 4.10 programming language. The main packages and machine learning libraries used for the modeling process are Pandas, Numpy, Scikit-learn, etc. The model inputs are age of marking, AADT, road marking pattern, initial retroreflectivity, average monthly temperature, average monthly precipitation, and glass bead content. The model output is set to the scalar retroreflectivity.
When the LightGBM algorithm is used to build the prediction model, the parameters have a great influence on the model prediction. Therefore, random search parameters are used to optimize LightGBM parameters including n_estimators, learning_rate, bagging_fraction, feature_fraction, max_depth, and min_child_samples. The MAE values are used as the scoring function and the values of each parameter are shown in Table 12.

4.3.5. Model Validation

Validation of a model is one of the most critical steps in assessing the quality of a developed AI-based model. It is common practice to assess a model’s performance through various statistical metrics. However, researchers have argued that the accuracy of a model cannot be assessed or judged solely on statistical criteria [32]. A third criterion for evaluating the strength of a developed data-driven model is its ability to predict responses when presented with a completely independent test data set [33]. Therefore, these criteria were used in this study to evaluate the prediction accuracy of the developed LightGBM-based model.
  • Model performance evaluation
In this study, the following statistical parameters were used to evaluate the performance of the established model: the coefficient of determination, R2; the mean square error, MSE; the mean absolute error, MAE; and the mean absolute percentage error, MAPE. R2 reflects the goodness of fit between the predicted value and the actual value. The closer R2 is to 1, the better the model fits. MSE is determined by calculating the average of the square of the difference between the predicted value and the actual observed value, and the closer the MSE is to 0, the better the model fits. MAE is a commonly used index to measure the difference between the predicted value of a model and the actual observed value. It is used to evaluate the fitting degree of a model on given data. The closer MAE is to 0, the higher the accuracy of the model. MAPE reflects the degree of deviation between the predicted value and the real value. Generally, we believe that the MAPE value of the model is less than 10%, indicating that the prediction model has high accuracy [24].
M S E = 1 n i = 1 n ( y a , i y m , i ) 2
M A E = 1 n i = 1 n | y a , i y m , i |
M A P E = 1 n i = 1 n | y a , i y m , i y a , i | × 100 %
where n is the number of samples; y a , i is the actual value of the i observation sample; and y m , i is the predicted value of the i sample. The results of the model evaluation are shown in Table 13.
The evaluation of the model reveals that the training and testing sets’ R2 values for the service life prediction model of markings, based on LightGBM, are 0.916 and 0.942, respectively, indicating a high level of model interpretability. Additionally, although there are many types of variables in this study (continuous variables, categorical variables, fixed values), which may affect the accuracy of the model, the results obtained through model evaluation show a relatively ideal result. The MSE values of the model training set and the test set were 49.7909 and 41.5114, MAE values were 11.944 and 16.793, and MAPE values were 4.167% and 5.949%, respectively, showing reasonable model accuracy and precision.
2.
Variable importance analysis
The LightGBM model uses a gradient lifting algorithm based on decision trees, which evaluates the importance of variables in the training process. The importance of variables is calculated based on two factors: split gain and feature frequency. LightGBM modeling can use a built-in model function—plot importance—to extract the importance of features to the model. Figure 6 shows the percentage of each variable’s importance to the model.
As can be seen from Figure 6, the age of marking is the most important variable, followed by the road marking pattern, and the average monthly precipitation is the input variable with the least contribution to the model performance.

4.3.6. The Prediction Results of the Testing Set

The predicted outcomes of the testing set are presented in Table 14, and the corresponding prediction graph is depicted in Figure 7. The method employed in this study yields predicted values for the retroreflective coefficient of road markings. The service life of road markings is determined by reaching the minimum retroreflective coefficient value.
Figure 7 illustrates the comparison between predicted values and actual values. The minimum difference between the actual value and the predicted value is 0.36102 mcd·m−2·lx−1, the maximum difference is 49.4863 mcd·m−2·lx−1, and the average difference is 15.2486 mcd·m−2·lx−1. The difference of less than 20 mcd·m−2·lx−1 accounts for 86.9% of the test set, and the difference of less than 10 mcd·m−2·lx−1 accounts for 50.3% of the test set. The prediction results of the test set show that the model has good prediction performance, but there are still some predicted values that are significantly different from the actual values. A possible reason for this is that the life decays of the inner solid line and the outer solid line (emergency lane line) are different, as the inner solid line is run over by vehicles less frequently, and the outer solid line is often run over by vehicles.

5. Example Analysis

5.1. Example Data

The prediction model obtained in the previous section is applied to the example and the prediction results are evaluated. Example data are derived from the five previous expressways, and the data are not applied to model training. The data are processed and imported into the model, and the prediction results are evaluated by the prediction validity equation [34].
P = ( 1 Y 1 Y Y ) × 100 %
where P is the predictive validity, Y 1 is the predicted value, and Y is the actual value. Some of the data are shown in Table 15.

5.2. Evaluation of Prediction Results

Table 16 and Table 17 show the prediction validity of the two models.
The average prediction validity of the multiple linear regression parameterized prediction model is 87.36%, the model has good prediction effect. The combination of the dominance analyses provides a better response to the magnitude of the variable impact relationship.
The LightGBM-based prediction model has an average prediction validity of 95.02% and a high prediction accuracy. Because the model explains some of the nonlinearity and high dimensionality of the independent variables, the predictive validity is improved. It can also better predict the attenuation of the retroreflectivity of expressways thermoplastic road markings and determine the service life of the markings.

6. Discussion on Results

  • By synthesizing relevant research and conducting on-site investigations of expressways, we have identified the factors that influence the service life of road markings. This determination was made by considering the characteristics of regional expressways and the environmental features surrounding them.
  • Multivariate linear regression and dominance analysis were used to establish a parametric prediction model. A parametric prediction model was established using multiple linear regression and dominance analysis, and the ranking of important factors affecting the service life of the thermoplastic road marking was obtained. The degree of influence in descending order was: “Age of marking” (0.258), “Road marking pattern” (0.128), “Glass bead content” (0.063), “Average monthly precipitation” (0.062), “AADT” (0.047), “Initial retroreflectivity” (0.035), and “Average monthly temperature” (0.018).
  • Based on linear regression and dominance analysis, the LightGBM machine learning method was used to build a non-parametric prediction model. The R2 of the model training set and testing set were 0.916 and 0.942, and the MAPE were 4.167% and 5.949%, with high model accuracy and strong generalization ability.
  • In order to judge the prediction effect of the model, the average prediction effectiveness of the multiple linear regression prediction model was 87.36%, and the average prediction effectiveness of the prediction model based on LightGBM algorithm was 95.02%, as obtained by example analysis. Both models showed better predictions. The former better reflected the relationship between the influencing factors and the dependent variable. The latter could achieve the prediction of the service life of expressway thermoplastic road marking under the premise of ensuring higher prediction accuracy and stronger generalization ability.

7. Conclusions

This study employs multiple linear regression, dominance analysis, and the LightGBM machine learning method, in conjunction with the characteristics of regional expressways, to conduct a comprehensive analysis of the significant influencing factors of thermoplastic road markings. Consequently, parametric and non-parametric prediction models are established. Through rigorous validation, the models demonstrate a notable degree of accuracy, providing robust theoretical support for the prediction of the service life of thermoplastic road markings on expressways. Moreover, these predictive models offer valuable guidance for the maintenance of road markings, which can effectively avoid the replacement of markings at inappropriate times. Every time thermoplastic marking is replaced, it can cause irreversible damage to the expressway, so avoiding premature road marking replacement can reduce the overall cost associated with maintenance and promote sustainable development. By avoiding the late replacement of road markings, better visibility of the road at night is ensured, improving the safety associated with driving on the expressway.
In the future, data on the relevant variables of more sections of expressways should be collected to train the model further and to render the model more reliable and accurate. In addition, the universality of the study in this paper needs further discussion, especially for thermoplastic marking in other countries and regions of the world.

Author Contributions

L.Z. conceived the entire framework of the study and participated in its design and coordination. H.D. proposed the initial model and solution method, performed numerical experiments and analyzed the results. J.S. (Jie Song) optimized the overall details of the manuscript and provided support for language translation. J.S. (Junjing Sun), G.W. and H.X. were involved in the design of the study and analysis of the results. W.W. provided the experimental site, measurement equipment, design of the experiment and data preparation. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Zhang, X.D.; Zhao, X.L.; Liu, Z.Z.; Pang, S.D. Research on Relevant Issues of Thermoplastic Road Traffic Markings in Northern China. J. Highw. Transp. Sci. Technol. (Appl. Technol. Ed.) 2019, 6, 193–196+217. [Google Scholar]
  2. Mazzoni, L.N.; Ho, L.L.; Vasconcelos, K.L.; Bernucci, L.L.B. Probabilistic Service Life Model of Pavement Marking by Degradation Data. Transp. Res. Rec. 2022, 2676, 328–340. [Google Scholar] [CrossRef]
  3. Smadi, O.; Souleyrette, R.R.; Ormand, D.J.; Hawkins, N. Pavement Marking Retroreflectivity: Analysis of Safety Effectiveness. Transp. Res. Rec. 2008, 2056, 17–24. [Google Scholar] [CrossRef]
  4. Ré, J.M.; Miles, J.D.; Carlson, P.J. Analysis of In-Service Traffic Sign Retroreflectivity and Deterioration Rates in Texas. Transp. Res. Rec. 2011, 2258, 88–94. [Google Scholar] [CrossRef]
  5. Shaik, N.B.; Pedapati, S.R.; Taqvi, S.A.A.; Othman, A.R.; Dzubir, F.A.A. A Feed-Forward Back Propagation Neural Network Approach to Predict the Life Condition of Crude Oil Pipeline. Processes 2020, 8, 661. [Google Scholar] [CrossRef]
  6. Shaik, N.B.; Benjapolakul, W.; Pedapati, S.R.; Bingi, K.; Thien Le, N.; Asdornwised, W.; Chaitusaney, S. Recurrent neural network-based model for estimating the life condition of a dry gas pipeline. Process Saf. Environ. Prot. 2022, 164, 639–650. [Google Scholar] [CrossRef]
  7. Liu, M.; Dong, Z.; Shi, H. Multi-Sensor Information Fusion and Multi-Model Fusion-Based Remaining Useful Life Prediction of Fan Slewing Bearings with the Nonlinear Wiener Process. Sustainability 2023, 15, 12010. [Google Scholar] [CrossRef]
  8. GB/T16311-2009; Specification and Test Method for Road Traffic Markings. Standardization Administration of the People’s Republic of China: Beijing, China, 2009.
  9. Owusu, V.; Tuffour, Y.; Obeng, D.; Salifu, M. Degradation of Retro-Reflectivity of Thermoplastic Pavement Markings: A Review. Open J. Civ. Eng. 2018, 8, 301–311. [Google Scholar] [CrossRef]
  10. Zhang, Y.; Wu, D. Methodologies to Predict Service Lives of Pavement Marking Materials. J. Transp. Res. Forum 2006, 45, 5–18. [Google Scholar] [CrossRef]
  11. Babić, D.; Fiolić, M.; Žilioniene, D. Evaluation of static and dynamic method for measuring retroreflection of road markings. Građevinar 2017, 69, 907–914. [Google Scholar] [CrossRef]
  12. Bahar, G.; Masliah, M.; Erwin, T.; Tan, E.; Hauer, E. Pavement marking materials and markers real-world relationship between retroreflectivity and safety over time. In Nchrp Web Document; National Academies Press: Washington, DC, USA, 2006. [Google Scholar] [CrossRef]
  13. Sitzabee William, E.; Hummer Joseph, E.; Rasdorf, W. Pavement Marking Degradation Modeling and Analysis. J. Infrastruct. Syst. 2009, 15, 190–199. [Google Scholar] [CrossRef]
  14. Ozelim, L.; Turochy, R.E. Modeling Retroreflectivity Performance of Thermoplastic Pavement Markings in Alabama. J. Transp. Eng. 2014, 140, 05014001. [Google Scholar] [CrossRef]
  15. Wang, C.; Wang, Z.; Tsai, Y.-C. Piecewise Multiple Linear Models for Pavement Marking Retroreflectivity Prediction Under Effect of Winter Weather Events. Transp. Res. Rec. 2016, 2551, 52–61. [Google Scholar] [CrossRef]
  16. Babic, D.; Scukanec, A.; Babic, D.; Fiolic, M. Model for predicting road markings service life. Balt. J. Road Bridge Eng. 2019, 14, 341–359. [Google Scholar] [CrossRef]
  17. Pike, A.M.; Songchitruksa, P. Predicting Pavement Marking Service Life with Transverse Test Deck Data. Transp. Res. Rec. 2015, 2482, 16–22. [Google Scholar] [CrossRef]
  18. Chimba, D.; Kidando, E.; Onyango, M. Evaluating the Service Life of Thermoplastic Pavement Markings: Stochastic Approach. J. Transp. Eng. Part B Pavements 2018, 144, 04018029. [Google Scholar] [CrossRef]
  19. Mohamed, M.; Abdel-Rahim, A.; Kassem, E.; Chang, K.; McDonald, A.G. Laboratory-Based Evaluation of Pavement Marking Characteristics. J. Transp. Eng. Part B Pavements 2020, 146, 04020016. [Google Scholar] [CrossRef]
  20. Mousa, M.R.; Mousa, S.R.; Hassan, M.; Carlson, P.; Elnaml, I.A. Predicting the Retroreflectivity Degradation of Waterborne Paint Pavement Markings using Advanced Machine Learning Techniques. Transp. Res. Rec. 2021, 2675, 483–494. [Google Scholar] [CrossRef]
  21. Dong, K. Residual Serve Life of Pavement Mark in Urban Area. Transp. Stand. 2009, 8, 98–102. [Google Scholar] [CrossRef]
  22. Hao, X.Y.; Guo, Z.Y.; Song, C.C.; Zhu, B.C. Safety Maintenance of Highway Traffic Sign and Pavement Marking Based on Visibility Attenuation. Highway 2019, 64, 7. [Google Scholar]
  23. Huang, K.; Wang, W.H. Evaluation Method for Quality Decay of Road Markings: A Discussion. J. Highw. Transp. Technol. Appl. Technol. Ed. 2020, 10, 114–115. [Google Scholar]
  24. Yang, F.R.; Gao, Z.Q. Study on Performance Decay of Retroreflective Road Traffic Markings on Asphalt Pavement. Highway 2022, 67, 7. [Google Scholar]
  25. Zhang, Z.; Wang, C.; Gao, Y.; Chen, J.; Zhang, Y. Short-Term Passenger Flow Forecast of Rail Transit Station Based on MIC Feature Selection and ST-LightGBM considering Transfer Passenger Flow. Sci. Program. 2020, 2020, 3180628. [Google Scholar] [CrossRef]
  26. Peng, J.; Xiang, W.; Xie, Y.; Huang, Y.H.; Han, L.Q.; Wu, Y.C. Prediction of Waiting Time for Multi-Stage Medical Services Based on LightGBM. Comput. Appl. Softw. 2022, 39, 119–124. [Google Scholar]
  27. Sun, Y.L.; Wang, L.R.; Zhang, Q.; Li, T.; Wei, T.X. Threshold Research of Rainstorm Disaster on the Houses in Hebei Province Based on Dominance Analysis Method. Clim. Environ. Res. 2021, 26, 10. [Google Scholar]
  28. Zhu, X.C.; Yin, Q.Z.; Zhao, F.Q.; Qian, W.W.; Zhao, K.K. Ship speed prediction model based on LightGBM. J. Dalian Marit. Univ. 2023, 49, 10. [Google Scholar]
  29. Zhang, B. Investigating the Service Life of Thermoplastic Road Markings on Highways. Archit. Eng. Technol. Des. 2014, 28, 358–358. [Google Scholar] [CrossRef]
  30. “Notice and Announcement”. Shandong Department of Transportation. Available online: http://jtt.shandong.gov.cn/col/col15843/index.html (accessed on 28 October 2022).
  31. National Earth System Science Data Center, National Science & Technology Infrastructure of China. Available online: http://www.geodata.cn (accessed on 10 November 2022).
  32. Raja, M.N.A.; Abdoun, T.; El-Sekelly, W. Smart prediction of liquefaction-induced lateral spreading. J. Rock Mech. Geotech. Eng. 2023. [Google Scholar] [CrossRef]
  33. Raja, M.N.A.; Shukla, S.K. Predicting the settlement of geosynthetic-reinforced soil foundations using evolutionary artificial intelligence technique. Geotext. Geomembr. 2021, 49, 1280–1293. [Google Scholar] [CrossRef]
  34. Li, Z.X. The Prediction of House Price Based on Multiple Regression Analysis and Support Vector Machine. Master’s Thesis, Tsinghua University, Beijing, China, 2021. [Google Scholar]
Figure 1. Service life prediction process of expressway markings based on the LightGBM algorithm.
Figure 1. Service life prediction process of expressway markings based on the LightGBM algorithm.
Sustainability 15 15237 g001
Figure 2. (a) Installation of the Zehntner vehicle-mounted retroreflectometer and (b) calibration of the Zehntner vehicle-mounted retroreflectometer.
Figure 2. (a) Installation of the Zehntner vehicle-mounted retroreflectometer and (b) calibration of the Zehntner vehicle-mounted retroreflectometer.
Sustainability 15 15237 g002
Figure 3. (a) Calibration of the static handheld device and (b) application of the static handheld device.
Figure 3. (a) Calibration of the static handheld device and (b) application of the static handheld device.
Sustainability 15 15237 g003
Figure 4. (a) Histogram of regression standardized residuals and (b) normal P–P plot of regression standardized residuals.
Figure 4. (a) Histogram of regression standardized residuals and (b) normal P–P plot of regression standardized residuals.
Sustainability 15 15237 g004
Figure 5. Plot of regression residuals versus regression standardized values.
Figure 5. Plot of regression residuals versus regression standardized values.
Sustainability 15 15237 g005
Figure 6. The results of variable importance analysis.
Figure 6. The results of variable importance analysis.
Sustainability 15 15237 g006
Figure 7. The predictive graph of the testing set.
Figure 7. The predictive graph of the testing set.
Sustainability 15 15237 g007
Table 1. Summary of retroreflectivity prediction models in the literature.
Table 1. Summary of retroreflectivity prediction models in the literature.
YearAuthorsInfluencing Factors
(Variables Included in the Model)
Model
1997AndradyAge (age of marking), initial retroreflectivityLogarithmic
1999Lee, Maleck and TaylorAgeSimple linear regression
2002Abboud and BowmanAnnual average daily traffic (AADT)Logarithmic
2004KopfAge, marking color, AADT Simple linear regression
2007FitchAge, AADTLogarithmic
2009Sitzabee et al. [13]Age, AADT, initial retroreflectivity, marking color, marking locationMultiple linear regression
2011Hummer et al.Age, initial retroreflectivityLinear mixed-effects model
2012Mull and SitzabeeAge, AADT, initial retroreflectivity, plow eventsMultiple linear regression
2012Robertson et al.Age, initial retroreflectivity, lane width, AADTMultiple linear regression
2019Dario Babic [16]Age, initial retroreflectivity, marking location, winter maintenanceMultiple linear regression
2015Pike and Songchitruksa [17]Age, initial retroreflectivityExponential models
2018Deo Chimba et al. [18]Age, marking color, marking locationMarkov chain model
2019Hao Xiaoyan et al. [19]AgeExponential models
2020Huang Kai and Wang Wenhui [23]Age, traffic volumeSimple linear regression
2020Chia-Pei Chou et al.Retroreflectivity, marking imageVisual recognition methods
2022Laura, Nascimento,
Mazzoni et al.
Age, initial retroreflectivity, traffic volume, weather condition, marking materialsGeneralized linear mixed models
2022Yang Furong et al.AgeSimple linear regression
Table 2. Selection of influencing factors.
Table 2. Selection of influencing factors.
Classification of
Influencing Factors
Name of Influencing FactorsLiteratures
Road factorsMaximum road speed limitNovel proposition
Annual average daily traffic[13,14,16]
Marking factorsRoad marking pattern[17]
Initial retroreflectivity[13,16,17]
Age of marking[13,14,16,17,18,20]
Glass bead contentNovel proposition
Environmental factorsAverage monthly temperature[20]
Average monthly precipitation[20]
Table 3. Road sections for data collection.
Table 3. Road sections for data collection.
Expressway NameSections
G20 Qingdao–Yinchuan ExpresswayJi’nan section
G3 Beijing–Taipei ExpresswayJi’nan section
G3 Beijing–Taipei ExpresswayTai’an section
S38 Lanshan–Heze ExpresswayLinzao section
G2001 Ring Expressway of Ji’nanNorthern surround section
Table 4. Marking retroreflectivity data for G20, Qingdao–Yinchuan Expressway, Ji’nan section (mcd·m−2·lx−1).
Table 4. Marking retroreflectivity data for G20, Qingdao–Yinchuan Expressway, Ji’nan section (mcd·m−2·lx−1).
Age of MarkingMonths
123456789101112
Dotted line 1451436381297232232315221155252184140
Solid line 1379381291259359359321284315291285287
Dotted line 23563211702151351351201209413013078
Solid line 2381351305346324324308308220265265221
Table 5. Data for each influencing factor of G20, Qingdao–Yinchuan Expressway, Ji’nan section.
Table 5. Data for each influencing factor of G20, Qingdao–Yinchuan Expressway, Ji’nan section.
Influencing FactorsData
Age of Marking (M)123456789101112
AADT (cars)Converted to 115,000 by vehicle conversion factor
Road marking patternDotted lineSolid line
Initial retroreflectivity (mcd·m−2·lx−1)Dotted line 1: 488
Dotted line 2: 418
Solid line 1: 425
Solid line 2: 430
Average monthly temperature (°C)15.51921.623.324.323.923.79.410.813.413.915.1
Average monthly precipitation (mm)26832310251085927809748234250468540520
Maximum road speed limit (km/h)120
Glass bead content (%)35%
Table 6. Diagnostic table for multicollinearity of independent variables.
Table 6. Diagnostic table for multicollinearity of independent variables.
VariablesSignificance (Sig)ToleranceVIF
Age of marking0.0000.5541.838
AADT0.0000.1875.336
Road marking pattern0.0000.8551.169
Initial retroreflectivity0.0000.8531.172
Average monthly temperature0.0350.2164.631
Average monthly precipitation0.0010.2744.650
Maximum road speed limit0.2610.1596.289
Glass bead content0.0000.3612.767
Table 7. Model summary.
Table 7. Model summary.
ModelRR2Adjusted R2Std. Error of the EstimateDurbin–Watson
10.7730.5990.59350.4671.035
Table 8. ANOVA.
Table 8. ANOVA.
ModelSum of SquaresDegree of
Freedom (df)
Mean SquareFSig.
Regression2,255,778.1267322,254.018126.5270.000
Residual1,517,968.7405962546.927
Total3,773,746.866603
Table 9. Results of multiple linear regression analysis.
Table 9. Results of multiple linear regression analysis.
ModelUnstandardized
Coefficients
Standardized Coefficients
Beta
tSig.
BStd. Error
(Constant)148.26144.578 4.4020.001
Age of marking−11.2410.681−0.577−16.5160.000
AADT−10.7711.326−0.242−8.1260.000
Road marking pattern59.8484.4150.37814.5570.000
Initial retroreflectivity0.7430.0850.2448.6990.000
Average monthly temperature1.8800.9260.1132.0310.043
Average monthly precipitation−0.0590.012−0.241−4.8560.001
Glass bead content−28.3924.186−0.180−6.7820.000
Table 10. Value-added contribution and total average contribution of each influencing factor.
Table 10. Value-added contribution and total average contribution of each influencing factor.
Number of Influencing
Factors in the Model (k)
Value   Added   Contribution   ( C )
X 1 X 2 X 3 X 4 X 5 X 6 X 7
k = 00.3070.0380.1160.0120.0490.0170.001
k = 10.312 0.044 0.123 0.019 0.075 0.054 0.003
k = 20.304 0.047 0.126 0.026 0.088 0.076 0.008
k = 30.285 0.050 0.127 0.031 0.088 0.084 0.013
k = 40.255 0.049 0.128 0.037 0.081 0.077 0.020
k = 50.217 0.047 0.129 0.043 0.043 0.056 0.028
k = 60.175 0.044 0.135 0.053 0.005 0.022 0.035
Total average contribution0.2580.047 0.128 0.035 0.063 0.062 0.018
Percentage/%42.37.721.05.710.410.12.9
Table 11. Data after pre-processing.
Table 11. Data after pre-processing.
Age of MarkingAADTRoad Marking PatternInitial RetroreflectivityAverage Monthly TemperatureAverage Monthly PrecipitationGlass Bead ContentRetroreflectivity of Road Marking
111.5 × 104041815.526830356
211.5 × 104041819322.530321
311.5 × 104041821.61024.730170
411.5 × 104041824.3108530215
511.5 × 104041824.3926.830135
611.5 × 104041824.9808.830135
711.5 × 104041822.7747.430120
…………………………………………
Table 12. Setting of each parameter value.
Table 12. Setting of each parameter value.
Parameter Names and Optimization BoundariesParameter Value
1 ≤ n_estimators ≤ 300200
0.1 ≤ learning_rate ≤ 10.1
0.1 < bagging_fraction ≤ 11
0.1 < feature_fraction ≤ 10.8
5 ≤ max_depth ≤ 1010
5 ≤ min_child_samples ≤ 2010
Table 13. Results of model evaluation.
Table 13. Results of model evaluation.
MSERMSEMAEMAPE (%)
Training set49.79097.056211.9444.1670.916
Testing set41.51146.442916.7935.9490.942
Table 14. The prediction results of the testing set.
Table 14. The prediction results of the testing set.
Testing Set Prediction ResultsRetroreflectivityAge of MarkingAADTRoad Marking PatternInitial RetroreflectivityAverage Monthly TemperatureAverage Monthly PrecipitationGlass Bead Content
135.375135511.5 × 104041824.3926.830
120.157126711.5 × 104041822.7747.430
117.894120811.5 × 10404189.4234.630
96.07599911.5 × 104041810.8249.630
324.329326312 × 104140621.51150.330
232.8112341112 × 104140614.2662.830
355.325359610.5 × 104143924.9808.830
372.20838928.5 × 104047119.717435
250.599247314 × 104043721.81120.735
359.599359611.5 × 104142524.9808.835
………………………………………………
Table 15. Selected example data.
Table 15. Selected example data.
Age of MarkingAADTRoad Marking PatternInitial RetroreflectivityAverage Monthly TemperatureAverage Monthly PrecipitationGlass Bead ContentRetroreflectivity
110.5 × 104046715.526835404
210.5 × 104046719322.535436
310.5 × 104046721.61024.735276
410.5 × 104046723.3108535256
510.5 × 104046725.3926.835235
610.5 × 104046723.9808.835232
18.5 × 104141016.213330374
28.5 × 104141019.717430348
312 × 104140621.51150.330326
412 × 104140623.21300.530298
…………………………………………
Table 16. Prediction validity of multiple linear regression prediction model.
Table 16. Prediction validity of multiple linear regression prediction model.
Real ValuesProjected ValuesValidityAverage Validity
404395.8497.98%87.36%
436407.9793.35%
276300.1891.24%
256288.5887.27%
235288.5577.21%
232283.5277.79%
315273.6586.87%
221267.7278.86%
298302.4498.51%
………………
Table 17. Prediction validity of LightGBM-based prediction model.
Table 17. Prediction validity of LightGBM-based prediction model.
Real ValuesProjected ValuesValidityAverage Validity
404404.9999.76%95.02%
436435.0199.77%
276275.9199.97%
256255.9699.98%
235235.4699.77%
232232.2599.89%
315315.2999.78%
221222.9399.13%
298270.0890.63%
………………
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhao, L.; Ding, H.; Sun, J.; Wu, G.; Xing, H.; Wang, W.; Song, J. Prediction of Service Life of Thermoplastic Road Markings on Expressways. Sustainability 2023, 15, 15237. https://doi.org/10.3390/su152115237

AMA Style

Zhao L, Ding H, Sun J, Wu G, Xing H, Wang W, Song J. Prediction of Service Life of Thermoplastic Road Markings on Expressways. Sustainability. 2023; 15(21):15237. https://doi.org/10.3390/su152115237

Chicago/Turabian Style

Zhao, Luhua, Haonan Ding, Junjing Sun, Guangna Wu, Huiyao Xing, Wei Wang, and Jie Song. 2023. "Prediction of Service Life of Thermoplastic Road Markings on Expressways" Sustainability 15, no. 21: 15237. https://doi.org/10.3390/su152115237

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop