1. Introduction
In order to strengthen inter-regional economic and social links and shorten travel time, extra-long highway tunnels have gained high importance in Chinese transportation planning [
1,
2]. By the end of 2020, there were 1394 very long road tunnels (L > 3000 m) with a total length of 6235.5 km, an increase of 19.5% compared to the same period last year (Ministry of Transport of the People’s Republic of China, 2021). Extra-long tunnels have a semi-enclosed structure, which can cause severe air pollution if pollutants are not discharged in time [
3,
4]. Combined with the intensive traffic volume, the air pollution inside can induce traffic accidents and endanger the health of drivers and passengers [
5,
6,
7]. Traffic pollutants, including metal ions and aerosols, can cause harm to human health [
8,
9,
10,
11]. Yang et al. noted through field tests that NH
3 levels in road tunnels are several times higher than those in ordinary roads [
12]. At the same time, the traffic of heavy trucks in mountainous areas and their proportion of the total traffic in road tunnels increased significantly, exacerbating the levels of CO, NO
2, SO
2, and particulate matter in the air of such buildings [
13,
14,
15,
16]. Song et al. conducted a two-week field test and found a linear relationship between the proportion of heavy vehicle traffic and pollutant levels, with the PM
2.5, NO, NO
2, NO
x, and CO being 75, 81, 24, 65, and 33 times more intense than those from light-duty vehicles, respectively [
17]. In addition, such buildings hurt the local environment, especially in residential and environmental areas [
18,
19]. These findings highlight the importance of understanding the dynamics of pollutants in extra-long road tunnels to help derive an appropriate response to control traffic pollution.
The spatial distribution of tunnel pollutants is well studied, and the pollutant concentration is linearly related to the length of the ventilation interval [
20,
21]. There are two main approaches to studying the temporal evolution patterns of tunnel pollutants: computational fluid dynamics (CFD) models and field measurement methods. CFD models, mainly numerical simulations, simulate air pollutants’ diffusion and decay processes based on hydrodynamics under pre-defined conditions [
22,
23,
24,
25]. Previous studies have focused on identifying the key influencing factors of air pollutants inside tunnels and their mechanisms of action, including tunnel slope [
26], building section size [
27], natural wind [
28], traffic wind [
29], ventilation system [
30], meteorological variables [
31], nearby vegetation [
32,
33], particulate properties [
34], etc. Song et al. investigated utilizing mutual validation between field tests and model experiments. The dynamic changes of pollutants inside the tunnel were investigated, and the effect of vehicle travel speed on the diffusion of pollutants was evaluated [
35]. They suggested that the accumulation of pollutants along the traffic direction is significant for road tunnels and concluded that the level of pollutants after stabilization over time is higher than that in megacities. The core advantages of CFD models are high computational efficiency and good operability. However, the simulation conditions are ideal, leading to unanticipated differences in the temporal distribution of pollutants in the simulation results, such as neglecting non-tailpipe emissions.
Field measurements in real road tunnels are rare because of coordination with tunnel managers, cooperation with traffic management, and high experimental costs. Thus, tunnel field measurements are ideal for directly estimating pollutant levels (both tailpipe and non-tailpipe emissions) and erasing the external environment’s effect [
17]. Li et al. conducted field measurements on four typical bifurcated tunnels and proposed that tunnel pollutant dispersion varies with tunnel structure [
36]. Monitoring Taiwan’s traffic emissions from the Hsuehshan tunnel showed that the pollutant emission factor was twice as high uphill as downhill [
37,
38,
39]. Xu et al. obtained a more complex pattern of organic emission evolution inside the tunnel considering new energy vehicles through field tests [
40]. Zhang et al. conducted field measurements of tunnels in four Chinese cities using tracers. They put forward that non-tailpipe emissions account for 60–74% of PM
10 components of tunnels, and road slope and road roughness are the fundamental factors determining non-tail gas emissions of roads [
15]. Hou et al. performed in situ sampling of aerosol particles in a tunnel in a coastal city in southern China and analyzed the particles using transmission electron microscopy and energy dispersive X-ray spectrometry. They suggested that the aging of particles is weaker than the atmospheric environment due to the absence of photochemical reactions inside the tunnel [
41]. Based on the monitoring results for the Caldecott Tunnel in San Francisco, Dallmann et al. quantified the emission factors of motor vehicles, including medium- and heavy-duty trucks [
42]. Tong et al. performed continuous and single-point measurements in five urban road tunnels and showed that the wind speed inside the tunnel depends strongly on the vehicle speed [
43]. Xu et al. used a multiple fractal detrended fluctuation analysis (MF-DFA) model to analyze the collected aerosol data. The data were decomposed and analyzed. The aerosol level was found to have multifractal properties and long-term persistence [
44]. On-site monitoring allows for quick and direct access to first-hand data. However, the single field monitoring scenario and short monitoring period limit the field measurement results.
In summary, although these two methods reveal the evolutionary patterns of air pollutants inside extra-long road tunnels from different perspectives, the bidirectional effects of traffic on air pollutants have not been fully understood. Moreover, most studies tend to be descriptive in their analysis and rarely predict air pollutants inside tunnels, in particular, prediction models that consider the long-term persistence characteristics of pollutants. In addition, the international spread of COVID-19 in 2020 has severely limited human activity. China is one of the earliest countries to enter the post-COVID-19 era. China has adopted resilient policies to control the epidemic spread. As a result, industrial activities such as energy production have largely resumed, but traffic emissions have been significantly affected ([
45]).
To fill these research gaps, we selected the YEL Tunnel, a vital traffic link in the Beijing-Tianjin-Hebei region of China, as the subject of our study, aiming to (1) analyze the variation in traffic flow and pollutant concentrations under different control levels, based on the field measurements; (2) determine the traffic pollutant temporal evolution mechanism in an extra-long expressway tunnel; (3) confirm the contribution of each factor to traffic emissions, and to forecast pollutant emissions and concentrations. The findings will inform environmental policymakers and researchers and contribute to understanding the bidirectional role of traffic winds on pollutant concentrations in a tunnel environment.
5. Results and Discussion
5.1. Traffic Characteristic
On 23 January 2021, to stop the spread of COVID-19, Hebei Province, China, activated a Level-I control for a major public health emergency. The government adopted a series of strict control measures, and unprecedented traffic activity reduction. According to the difference in control levels, 1 January 2020–31 July 2021 is divided into 10 periods, as shown in
Table 3.
The daily traffic volume of different vehicle types at the YEL Tunnel showed significant differences under different epidemic control levels, as shown in
Figure 4. The average daily traffic flow at the YEL Tunnel was 11,708 veh/d when no epidemic control measures were in place (1–23 January 2020). The traffic flow of LDV decreased by approximately 66%, HDV decreased by 32%, LDT decreased by 31%, MDT decreased by 28%, HDT decreased by 34% on the day when level-I control was in place for COVID-19 (24 January 2020). The decrease in LDV daily traffic is the most obvious. The epidemic control policy affected the traffic flow significantly, except for HDT. The HDT only shows a solid response to the level-I control. The HDV traffic flow was low throughout the observation period, which means that public transport was significantly restricted. The LDV traffic flow rose rapidly after the initial level-I control. However, after the second level-II control, it still remained low.
5.2. Monitoring Results
5.2.1. Pollutant Concentrations
The pollutant (CO, VOCs, NO
2, PM
2.5, PM
10) concentration difference between entrance and exit are shown in
Figure 5. The concentration data were processed to the interval [0, 1] using the normalization method to compare the pollutant concentration data characterized by different magnitudes. Non-aerosol pollutants (CO, VOCs, NO
2) correspond to the level of epidemic control. The control level is stricter, the pollutant concentration is lower. Interestingly, CO concentration peaks at the end of the relatively stringent level-II control (29 April 2020). Traffic flow, particularly LDVs, increased significantly in the short term, resulting in higher CO concentration. The CO concentration then levels off as the desire to travel decreases. Aerosol pollutants show the opposite characteristics, where the more stringent the epidemic control, the higher the concentrations instead, except for level-I control. It should be noted that the pollutant concentration differences were not significant, and the mechanical ventilation system was inactive during the observed period. The maximum difference ratio between the average pollutant concentration in different periods (
) is 35% for CO, 28% for VOCs,25% for NO
2, 52% for PM
2.5, 52% for PM
10.
5.2.2. Tunnel Environment Parameters
The data collection results of temperature, air pressure, and relative humidity are shown in
Figure 6. The temperature and air pressure inside the tunnel have seasonal characteristics and opposite patterns. The temperature range was −12.2~32.2 °C, and the air pressure was 993.2~1038.8 hPa. The characteristic of relative humidity variation was not noticeable.
Traffic winds significantly influenced the wind speed inside the tunnel because the mechanical ventilation system was mothballed for energy saving. The semi-enclosed structure of the tunnel determines the intensity of the traffic wind is positively correlated with the traffic flow. The multivariate nonlinear function was used to fit the collected wind speed data and traffic flow, as shown in Equation (8), showing the HDT type has the most stimulating effect on the tunnel wind speed.
where
v (m/s) is the wind speed.
NLDV,
NHDV,
NLDT,
NMDT,
NHDT, are daily traffic of LDV, HDV, LDT, MDT, HDT, respectively.
5.3. Relationship between Pollutant Concentrations and Environmental Parameters
The various pollutant concentrations were grouped into a troop. The environmental parameters were another troop. Considering that there is a certain amount of error in the data collection process, the Spearman method was used to conduct the Canonical Correlation Analysis between the pollutant troop and the environmental parameter troop in the Spss software. The result is shown in
Figure 7. As the wind speed increased, the concentrations of CO, COVs, NO
2 increased, while PM
2.5 and PM
10 decreased. The correlation between relative humidity and pollutant concentrations is low, with a positive correlation with CO, and a negative correlation with other pollutants. Due to the linear relationship between temperature and air pressure, the correlation between temperature and pollutant concentrations is not analyzed here to avoid multicollinearity and air pressure.
The wind speed has a weighty effect on pollutant concentrations for expressway tunnels where the mechanical ventilation system is mothballed. On the one hand, traffic wind decreases pollutant concentrations through dilution, but greater traffic wind means higher daily traffic and larger pollutant emissions. The variation feature in CO, VOCs, NO2 concentrations evidenced the perspective. Therefore, despite considerable fluctuations in daily traffic, the variation of pollutant concentrations is slight. Remarkably, aerosol pollutants decrease with increasing daily traffic. Compared to the pollutant concentrations on the day when the maximal average daily wind speed was 8.49 m/s, the CO concentration decreased by 30.9%, 28.5% for VOCs, 21.4% for NO2, 4.2% for PM2.5, 4.1% for PM10 on the day with the lowest average daily wind speed of 2.27 m/s. In addition, to avoid multicollinearity, it is necessary to further analyze pollutant emission factors using RF models based on the analysis of pollutant concentration evolution mechanisms.
5.4. Pollutants’ Nonlinear Evolution
The MFDFA was used to explore and analyze various pollutant concentrations.
Figure 8 depicts the relationship between
Fq(s) and
s for each pollutant. They all have significant power-law scaling relationships within the monitoring period. The DFA scaling exponent
a was estimated linearly using the Least Squares Method. All DFA scaling exponents are greater than 1.4 (as shown in
Figure 8f).
The scaling exponent a for all pollutants is higher than 0.5, indicating that pollutants have solid long-term persistence. Long-term persistence reflects the variation pattern of correlation between pollutant concentration and time in the YEL Tunnel. In other words, the correlation between pollutant concentration and time does not strictly follow the Markov process but follows the power-law decay. The larger the scaling exponent a, the stronger the long-term persistence. The long-term persistence of CO concentration is the most significant, while that of PM2.5 is relatively weak.
The MFDFA was used to explore the multifractal characteristics caused by the long-term persistence of pollutant concentration. The scaling exponent function τ(q), the generalized Hurst exponent h(q), the multifractal spectrum f(α), and ∆α were calculated.
Figure 9a shows the relationships between the Generalized Hurst exponent
h(q) and the parameter
q show multifractal solid characteristics. All Generalized Hurst exponent
h(q) > 0.5, leading to the same conclusion as
Figure 8. There is a slight difference between the
h(2) and
a, which is caused by the Finite Size Effect (FSE) ([
49]). The
h(q) shows a pattern of decreasing with increasing
q values, which demonstrates that the long-term persistence of all pollutants has the structure of multifractal scale-invariance.
The scaling exponent function
τ(q) is a nonlinear function of the parameter
q, as shown in
Figure 9b. The
τ(q) has a similar evolutionary trend, and
τ(q) of each pollutant intersects at
q = 0. This reveals that all pollutant concentration groups have self-similarity in structure and dynamics. They have similar nonlinear dynamical evolutionary mechanisms.
Figure 9c shows the variation curves of the multifractal spectrum
f(α) corresponding to pollutant groups. The multifractal strength can be characterized by the width of singular exponent ∆
α (∆
α = ∆
αmax − ∆
αmin). The larger the value of ∆
α, the stronger the multifractal, which implies the strength of the long-term persistence in actual pollutant concentration changes. The
∆f reflects the frequency change of the maximum and minimum fluctuations in the long-term persistence of pollutants.
Figure 9d presents the degree of multifractality: VOCs > CO > NO
2 > PM
10 > PM
2.5. This is partly because the YEL Tunnel is located in a mountainous area with high forest cover and the long-term nature of VOCs released by vegetation. Another reason is the photochemical reaction of VOCs. CO is slightly higher than VOCs in data dispersion. The long-term fluctuation of CO is the most significant. The distribution homogeneity of dynamic change in PM
2.5 and PM
10 is the most uneven.
α0 is the abscissa of the extreme point in the multifractal spectrum.
The results of MFDFA for pollutant evolution in different periods are shown in
Figure 10. Compared to the non-epidemic period, the
∆α of pollutant evolution was slower during the epidemic period. The trend of multifractal strength (
∆α) is consistent across pollutants. This suggests that the reduction in daily traffic resulted in a weaker long-term persistence of pollutants during COVID-19. The combination of seasonality and control level results in a fluctuating alteration of ∆
α. Overall, the tighter epidemic control level leads to long-term persistence and weaker fluctuating intensity of pollutants. In particular, the long-term persistence of pollutants in the relaxed control time of 2021 has still not returned to the status of pre-epidemic time.
From the above studies, the response mechanism of pollutant concentration to traffic pollution emission reduction is governed by the dynamics of nonlinear interactions. Firstly, the tunnel ventilation was mainly dependent on traffic winds during the measurement period. Secondly, the pollutant concentration evolution has a steady power-law distribution structure. Therefore, the temporal evolution of each pollutant concentration may have a Self-organizing Critical (SOC) state [
50]. It is challenging to predict pollutant concentrations directly, as they have long-term persistence.
5.5. Prediction of Air Pollutants in YEL Tunnel
The pollutant concentration needs to be converted into pollutant emission for further analysis to clarify the traffic emission characteristics. Equation (9) calculates the pollutant emission mass for the vehicle fleet through the tunnel ([
51]).
where:
EFtotal (mg/km) is the traffic emissions.
∆C (mg/m
3) is the difference in pollutant concentration between the outlet and inlet.
A (m
2) is the cross-sectional area of the tunnel (99.47 m
2).
v (m/s) is the air velocity parallel to the tunnel alignment.
T is for one day.
L is the tunnel length (5.6773 km).
Using MSE, RMSE, MAE, MAPE error functions and the coefficient of determination R
2 calculated for each RF model clarified the model’s fit quality. The established RF models were validated using the partitioned validation dataset (1 January 2020 to 31 January 2021). The RF model generalization ability was certified by the 10-Fold Cross-validation (as shown in
Figure 11).
The RF model has an excellent fitting effect, as shown in
Table 4. The 10-Fold cross-validation results show a slight variability in MAE and MSE values. The PM
2.5 has the most significant difference with a 12.04% increase in MSE and a 13.58% increase in MAE, while the MSE value for VOCs decreased by 11.26% and the MAE value decreased by 6.19%. Overall, the 10-Fold Cross-validation results show very little difference from the original test results, indicating the RF model generalizes well. The model can explain most of the fluctuations in pollutant emissions.
The importance of various independent variables in the RF model is shown in
Figure 12, which indicates the main impact of the traffic flow. The total traffic flow contributes 84.55%, 96.24%, 93.66%, 76.47%, 77.60% to the output results of CO, VOCs, NO
2, PM
2.5, PM
10, respectively. The YEL Tunnel is the main transportation corridor for heavy-industrial raw materials in North China. PM sources include coal dust, secondary dust, clutch, and tire wear. However, all these factors are closely related to traffic flow. From the perspective of traffic flow of different vehicle types, the HDT has the largest influence on the generation of various pollutants, contributing 61.54%, 58.76%, 82.38%, 39.26%, 40.61% to the output results of CO, VOCs, NO
2, PM
2.5, PM
10, respectively. Analyzing the reason, firstly, the traffic flow of HDT in the ERL Tunnel is least affected by the control policy; secondly, the emission coefficient of HDT is high. Its influence weight on NO
2 is the largest, so HDT is the main source of NO
2 emissions. The contributions of all environmental parameters to the output results of CO, VOCs, NO
2, PM
2.5, PM
10 are 15.45%, 3.76%, 6.34%, 23.53%, 22.40%, respectively. Among these, the effects of air pressure and precipitation are negligible. However, CO and aerosol pollutant emissions are sensitive to temperature and humidity, as vehicle engines have different working efficiency under different temperatures and relative humidity. CO and aerosol pollutant emissions will increase as the low-temperature condition causes poor fuel atomization. The high-temperature condition causes premature combustion, which leads to larger CO and aerosol pollutants emissions. Similarly, an increase in relative humidity favors the production of particulate matter. Temperature contributes 10.35% to CO generation, 19.98% to PM
2.5 and 16.90% to PM
10. Relative humidity is 5.04% for CO, 3.13% for PM
2.5 and 5.43% for PM
10. To sum up, the HDT flow is the controlling factor for emissions of each pollutant.
To reflect the prediction accuracy of the RF model, the Multiple Linear Regression model (MLR), Polynomial Regression model (PR), and Classification And Regression Tree model (CART), eXtreme Gradient Boosting model (XGB) were established based on the divided training data set. The prediction accuracy in pollutant emissions, calculated by different models, is obtained as shown in
Table 5.
The machine learning algorithms have higher prediction accuracy than traditional regression models for analyzing traffic emissions. The RF model is 88–221% better than the MLR model and 90–334% better than the PR model in prediction accuracy. The prediction capability of the CART and XGB model for non-aerosol pollutants is similar to that of the RF model. However, the RF model is better at predicting aerosol pollutants emissions, with an 8–13% improvement over the CART model and 5–7% over the XGB model. The RF model still has a considerable advantage over similar machine learning algorithms in this study.
The prediction results for traffic emissions from 1 February–31 July 2021 are shown in
Figure 13. As can be seen from the values of R
2 and MAPE, the established RF model shows good accuracy. The maximum R
2 is 0.9948, and the minimum is 0.9514, as shown in
Table 6. Combined with the calculation results on the validation data set, the RF model has better prediction accuracy for CO, COVs, NO
2. In particular, the calculation accuracy for aerosol pollutants slightly improves the prediction period, which proves that the RF model has a good generalization ability (compared to 10 cross-validation results).
The predicted concentrations at different control levels are shown in
Figure 13. The coefficient of determination (R
2) for CO is 0.9825, R
2 = 0.9825 for VOCs, R
2 = 0.9903 for NO
2, R
2 = 0.9758 for PM
2.5, and R
2 = 0.9845 for PM
10. Therefore, the method of back-calculating pollutant concentrations by predicting traffic emissions is implementable, and the results are highly accurate. Compared to the results for pollutant emissions, the concentrations of aerosol pollutants have a little improvement in the prediction accuracy. In combination with the MFDFA results, the long-term persistence of aerosol pollutant concentrations is minimized. Therefore the prediction accuracy is improved by converting emissions to concentrations due to the effect of traffic wind. Collectively, the prediction results for level-3 control are better than the others, mainly due to the relatively stable traffic flow in the situation. Overall, the RF model’s good performance indicates the prediction results’ reliability.