3.1. Time Series Analysis of O3 and Its Related Factors
By monitoring the hourly variations in O
3 and its influencing factors at the Chongming Dongtan supersite in Shanghai from 1 January to 31 December 2021 (as shown in
Figure 2), this study identified patterns in O
3 concentrations based on long-term observations of atmospheric pollutants and meteorological parameters in the Chongming region. It was found that O
3 concentrations were higher in both spring and autumn. Specifically,
Figure 2a illustrates the seasonal variation of O
3 alongside NOx, PM
2.5, and PAN. O
3 concentrations gradually increased during spring (March–May), slightly declined in summer, rose again in autumn, and dropped to their lowest levels in winter. This pattern is similar to findings from numerous coastal regions, where O
3 concentrations in spring and autumn are higher than in summer [
47,
48]. In contrast to northern regions, where O
3 peaks typically occur in summer, the highest concentrations in coastal areas are observed in spring and autumn [
49]. This trend indicates that O
3 formation is primarily driven by photochemical reactions, with significant increases in O
3 production. The decline in O
3 concentrations during summer can be attributed to the reduction in pollutant levels. NOx concentrations were higher during the winter months (December and January) and lowest in summer, likely due to increased fossil fuel combustion in winter and the rapid photochemical consumption of NOx in summer. The seasonal variation in PAN is closely correlated with that of PM
2.5, suggesting a possible shared source. NOx in the atmosphere is involved in converting into nitrates to promote PM
2.5 formation and also in PAN formation. The similar temporal characteristics of PAN and PM
2.5 imply that PAN may share a similar origin with PM
2.5. We will explore this further in the backward trajectory analysis later.
Figure 2b illustrates the seasonal variation in different VOCs components, including alkanes, alkenes, aromatics, alkynes, and total VOC concentrations. The total concentration of VOCs peaks in winter and gradually declines through spring and summer. This seasonal variation reflects increased combustion activities in winter, coupled with poor atmospheric dispersion conditions. Alkanes constitute the largest proportion of total VOCs, particularly during winter, indicating that their primary source may be combustion emissions. Although alkenes, aromatics, and alkynes represent a smaller proportion, they make significant contributions to the formation of O
3 and PAN. In summer, O
3 levels decreased primarily due to relatively lower precursor concentrations of VOCs and NOx, as shown in
Figure 2, while higher wind speeds, as indicated in
Figure S2, facilitated the diffusion of these precursors, further contributing to the reduced O
3 concentrations. The seasonal variation in VOCs is closely related to atmospheric photochemical reactions, particularly as they serve as precursors for O
3 and PAN. The seasonal fluctuations of VOCs directly influence the formation processes of atmospheric pollutants.
Figure 2c presents the seasonal variations in temperature, relative humidity [
33], and wind speed [
34] throughout the year. The temperature peaks in summer and reaches its lowest levels in winter, displaying typical seasonal fluctuations. High temperatures promote photochemical reactions, thereby facilitating the formation of O
3 and PAN. Relative humidity is higher in spring (around 80%) and lower in winter (approximately 60%), with humidity variations influencing the formation of particulate matter and the dispersion of pollutants. Wind speeds are lower in winter and slightly higher in summer. Elevated wind speeds assist in the dispersion and dilution of pollutants, whereas lower wind speeds can lead to pollutant accumulation, particularly in winter. In combination with higher humidity, lower wind speeds in winter may contribute to the accumulation of particulates such as PM
2.5. The seasonal variations in temperature, relative humidity, and wind speed have a significant impact on pollutant concentrations, particularly in summer, when higher temperatures and wind speeds contribute to O
3 formation and the dispersion of particulate matter.
The seasonal variations of O3, NOx, PM2.5, PAN, and VOCs are influenced by a combination of atmospheric photochemical reactions and meteorological conditions. The summer peaks in O3 and PAN are primarily driven by active photochemical reactions, with VOCs and NOx serving as precursors interacting under conditions of sunlight and high temperatures. The concentrations of NOx and PM2.5 are higher in winter, largely due to anthropogenic emissions and meteorological factors such as combustion emissions and poor atmospheric dispersion. VOC compositions are dominated by alkanes in winter, whereas in summer, they participate in atmospheric chemical reactions and are rapidly consumed, exacerbating O3 formation. The seasonal fluctuations in meteorological conditions (temperature, relative humidity, and wind speed) play a critical role in pollutant formation, dispersion, and deposition. In particular, the higher temperatures and wind speeds in summer facilitate the generation or dispersion of pollutants.
The seasonal analysis of PAN, O3, and NO2 concentrations reveals distinct temporal patterns across spring, summer, autumn, and winter. In spring, summer, and autumn, PAN concentrations peak between 10:00 and 12:00, reaching approximately 0.81 μg/m3, 0.44 μg/m3, and 0.53 μg/m3, respectively. In contrast, O3 concentrations exhibit a delayed peak between 14:00 and 16:00, with values of approximately 120 μg/m3, 130 μg/m3, and 120 μg/m3 for the same seasons. A similar pattern emerges in winter, where PAN peaks at around 0.75 μg/m3 between 10:00 and 12:00, followed by an O3 peak of approximately 90 μg/m3 between 14:00 and 16:00. This consistent 2–4 h lag between the PAN and O3 peaks suggests a potential mechanistic link between PAN decomposition and subsequent O3 formation.
As shown at
Figure 3, the observed temporal relationship is consistent with known photochemical processes. PAN serves as a temporary reservoir for nitrogen oxides (NOx) and organic radicals, which are released upon its decomposition under conditions of elevated temperature and sunlight. This decomposition produces NO
2 and RO
2, both of which are critical precursors in O
3 formation. The peak of PAN concentrations in the late morning, followed by elevated O
3 levels in the early to mid-afternoon, supports the hypothesis that PAN decomposition contributes significantly to O
3 production. Notably, this pattern persists across all seasons, indicating that the underlying photochemical mechanism is robust and not confined to specific seasonal conditions.
While the data provide compelling evidence for the role of PAN in driving O3 formation, other factors warrant consideration. For example, the availability of VOCs plays a key role in radical cycling within photochemical systems and may enhance O3 production. Additionally, meteorological variables—such as temperature, relative humidity, and solar radiation—likely influence the rates of PAN decomposition and O3 formation. Nevertheless, the consistent lag between PAN and O3 peaks across multiple seasons suggests that PAN decomposition is a primary contributor to the observed O3 trends, even if modulated by these external factors.
These findings highlight the necessity of incorporating PAN into models of O3 formation, particularly in suburban environments where PAN concentrations are often elevated. Given its toxicity to human health and its role as a precursor to O3, a deeper understanding of PAN’s behavior and its interactions with O3 is essential for evaluating air quality and associated public health risks.
3.2. Model Performance and Feature Important
The dataset used in this study was obtained through the use of multiple instruments that measured the hourly concentrations of various atmospheric parameters. These parameters include PAN, NOx, O
3, PM
2.5, and VOCs. In addition, meteorological parameters such as temperature, relative humidity [
33], wind speed [
34], and wind direction [
47] were also recorded.
Too many features may lead to overfitting of the model, so this study calculated the O
3 formation potential (OFP) of all VOCs using the maximum incremental reactivity [
27] method. Based on the OFP, 10 volatile organic compounds were selected as VOC features in descending order according to their OFP. These selected VOCs were then used as input features in the RF model.
In total, 6924 valid samples were obtained during this period. These samples were divided into two subsets: 5539 samples were used for training the model, while the remaining 1385 samples were reserved for testing. The training dataset was used to develop and fine-tune the RF model, enabling it to capture the relationships between the input variables (PAN, NOx, O
3, PM
2.5, selected VOCs, meteorological factors) and the target variable (O
3 concentration). The test dataset was then employed to evaluate the model’s performance and generalizability. For more details on the observation instruments, please refer to the
Supplementary Materials.
Based on the results shown in
Figure S1, both random forest (RF) and XGBoost demonstrated high predictive accuracy for O
3 concentration prediction. However, a closer examination revealed that RF exhibited a lower R
2 variance (0.03335 for RF versus 0.04082 for XGBoost) and a slightly higher average R
2 score, indicating more consistent performance across different cross-validation folds. In contrast, other models such as ANN, RNN, SVR, and KNN produced noticeably lower R
2 values. Given the ecological sensitivity of the nearshore region under investigation, where robust and reliable predictions are critical, we ultimately selected random forest as the primary model for further analysis and interpretation.
Figure 4a shows the comparison between the O
3 concentration predicted by the RF model and the actual observations, and the results show that the model has good performance, with an R
2 value of 0.90 and a root mean square error (RMSE) of 11.498. This indicates that the model has high accuracy in predicting O
3 concentration. For a detailed model performance comparison, see the
Supplementary Materials. The scatter plot of actual O
3 concentration and model prediction values shows that most data points are closely distributed near the 1:1 line, especially in the medium and low concentration range (about 50–150 μg/m
3), showing a strong linear correlation. This indicates that the model has high prediction accuracy. The research by Watson et al. (2019) [
33] yielded analogous results when forecasting wildfire-related O
3 exposure in California, with random forest demonstrating superior predictive capability among ten evaluated machine learning models. Zhan et al. (2018) [
1] successfully applied random forest (RF) to predict O
3 concentrations in China with high accuracy. RF is widely recognized for its strong predictive performance, particularly in capturing nonlinear relationships between input variables and the target output—a capability that distinguishes it from alternative methods such as support vector machines (SVMs) and neural networks [
50,
51]. However, there is a slight underestimation trend in the high concentration area (>150 μg/m
3), which may be caused by the complexity and scarcity of extreme pollution events.
Feature importance analysis (
Figure 4b) reveals the key factors affecting O
3 concentration. PAN and NOx (nitrogen oxides) are identified as the most important predictors, followed by temperature, wind direction, and relative humidity [
33]. This ranking emphasizes the combined role of precursors, meteorological conditions, and particulate matter in O
3 generation. Yao et al.’s [
52] RF-based O
3 prediction study identified temperature and NOx as the most influential features through importance analysis, aligning broadly with our results. The dominance of NOx highlights the importance of controlling traffic and industrial emissions, while the higher importance of temperature illustrates the significant impact of climate factors on O
3 pollution levels.
Figure S3, which presents SHAP values, further supports this analysis. The SHAP plot is similar to the feature importance chart, but it reveals that NOx shows a negative effect on O
3 when its concentration is high, and a positive effect when its concentration is low. This dynamic behavior of NOx emphasizes the complex relationship between precursor levels and O
3 formation.
3.3. Partial Dependence Plot
Based on the machine learning analysis using partial dependence plots (PDPs), the figure reflects the role of four key factors—PAN, temperature, NOx, and relative humidity [
33]—in O
3 generation. These figures, respectively, show the impact of each factor on O
3 concentration changes within a specific range, from which we can identify how these factors affect O
3 concentration and thus deduce their positive and negative effects in predicting O
3 concentration.
Figure 5a shows that PAN has a strong positive effect on the production of O
3. As PAN increases from 0 to about 3.5 ppb, O
3 concentration rises from approximately 60 µg/m
3 to around 110 µg/m
3, suggesting that PAN contributes to O
3 formation. While PAN and O
3 are both products of VOC oxidation in the presence of NOx and may share similar variation characteristics, the observed trend from the PDP highlights that PAN, as a precursor, can indeed elevate O
3 levels under certain conditions. Consistent with Liu’s findings, our analysis reveals that PAN contributes to O
3 formation under NOx-rich conditions. As demonstrated in
Figure 3 and
Figure 5, the nocturnal increase in NOx concentrations leads to subsequent PAN-driven O
3 enhancement, mirroring the mechanisms reported in Liu’s study [
53].
Figure 5b shows the effect of temperature on O
3 concentration. It can be seen from the figure that as the temperature increases, the O
3 concentration gradually increases, showing an obvious positive effect. This is consistent with previous studies [
54,
55]. When the temperature increases from 10 °C to 30 °C, the O
3 concentration increases from approximately 80 µg/m
3 to over 100 µg/m
3. This indicates that higher temperatures help promote the generation of O
3, mainly because higher temperatures are usually accompanied by stronger solar radiation and more active photochemical reactions. This phenomenon is especially obvious in summer, because high temperature provides favorable conditions and accelerates the photochemical reaction rate of VOCs and NOx, resulting in a significant increase in O
3 concentration. Therefore, temperature shows a positive contribution in O
3 concentration prediction, especially in high-temperature seasons, when the impact is particularly significant.
Figure 5c shows the impact of NOx on O
3 concentration, showing a typical negative correlation. As the NOx concentration increases from approximately 10 µg/m
3 to 50 µg/m
3, the O
3 concentration gradually decreases from 100 µg/m
3 to approximately 60 µg/m
3. This negative correlation can be explained by the “NOx suppression effect”. When the NOx concentration is high, NO will react with O
3 to generate NO
2 and consume O
3, thus inhibiting the generation of O
3. When the NOx concentration is low, O
3 generation is more active; however, when the NOx concentration is too high, O
3 is consumed instead. This negative effect is especially obvious in urban areas with high NOx emissions, indicating that NOx is one of the keys limiting factors for O
3 generation [
45]. In the prediction model of O
3, NOx usually shows a negative contribution, and its inhibitory effect on O
3 increases as the concentration increases.
Figure 5d shows the effect of relative humidity [
33] on O
3 concentration, which also shows a negative effect. This is similar to previous studies [
56]. As the relative humidity increases from 20% to 100%, the O
3 concentration decreases from 110 µg/m
3 to 80 µg/m
3. High humidity conditions usually mean more cloud cover and weaker solar radiation, inhibiting photochemical reactions from occurring. In addition, high humidity helps the sedimentation of pollutants, further reducing O
3 production. Therefore, increased humidity has a suppressive effect on O
3 concentrations, which is particularly significant in humid climate conditions, especially during cloudy or rainy seasons. This shows that the influence of relative humidity contributes negatively to the prediction of O
3, and higher humidity usually means a lower O
3 production rate.
Similarly to Wang et al.’s [
57] study using random forest for O
3 prediction, our partial dependence plot (PDP) analysis of feature impacts on O
3 concentrations revealed comparable patterns: temperature showed limited influence below 0 °C, and NOx exhibited increasing effects at higher concentrations. However, our study demonstrated a stronger impact of relative humidity [
33] on O
3 levels compared to Wang’s findings, which is potentially attributable to our study’s unique ecoregion characteristics.
Overall, these four variables show different directions of influence in the prediction of O3 concentration. PAN and temperature have significant positive contributions to O3 generation, while NOx and relative humidity have significant negative effects on O3 concentration. In model predictions, the combined effect of these factors reveals the complex change mechanism of O3 concentration, emphasizing the different impact paths of different meteorological conditions and precursors in the O3 generation process.
3.4. Feature Importance in Four Seasons
Since the response of O3 to different factors may vary by season, this study divides the months into different seasons, trains RF models for each season, and estimates the feature importance for each factor influencing O3 concentrations. Feature importance plots are then drawn to examine the response of O3 concentrations to different factors across the seasons.
In
Figure 6, we observe the contributions of various chemical components and meteorological factors to the prediction of O
3 concentrations in spring. Notably, temperature, relative humidity, NOx, and PAN all play significant roles in O
3 formation. Among these, NOx is the largest contributing chemical factor in spring. Temperature and relative humidity also have key roles, which is closely related to the accelerated photochemical reaction rates and the influence of humidity on O
3 generation. Changes in relative humidity notably impact O
3 concentration prediction, while higher temperatures promote O
3 formation [
54,
55]. It is worth noting that VOCs, such as ethylbenzene, n-pentane, and isoprene, contribute less in spring, likely due to the lower reactivity of VOCs under cooler conditions.
In summer, the most significant contributor to O3 formation is PAN, followed by NOx and relative humidity. This result indicates that O3 concentrations in summer are primarily driven by high temperatures and photochemical reactions. The rise in temperature accelerates the decomposition of PAN, further intensifying O3 formation. Additionally, the longer and more intense sunlight during the summer provides abundant energy for O3 generation. Although relative humidity continues to contribute, its influence is weaker compared to spring.
As we transition to autumn, the influence of PAN on O
3 concentrations becomes more pronounced. At the same time, m/p-Xylene also contributes to O
3 formation. According to research by Xiao et al. [
58], the main contribution of m/p-Xylene in Shanghai comes from fuel evaporation. Moreover, PAN still accounts for a significant portion of the contribution, indicating that precursor substances remain crucial for O
3 formation. The photochemical reaction rate in autumn is slower than in spring and summer, so the dominant drivers of O
3 formation shift toward chemical components rather than meteorological conditions.
In winter, the impact of drivers on O3 formation differs significantly from the previous seasons. The contribution of NOx sharply increases and becomes the most important factor, likely due to the lower temperature and reduced VOC reactivity in winter. Furthermore, the contribution of PM2.5 increases in winter, potentially due to the higher particle emissions from heating activities, which indirectly affect O3 formation. It is important to note that meteorological factors have a minimal effect on O3 in winter, suggesting that under low-temperature and weak-light conditions, O3 formation is primarily controlled by chemical precursors (especially NOx) and particulate matter. The contribution of VOCs, such as ethylbenzene and isoprene, is negligible in winter, further emphasizing their limited chemical reactivity under low-temperature conditions.
Overall, seasonal variations have a clear impact on the main drivers of O
3 concentrations. In spring, summer, and autumn, PAN is consistently the dominant driver of O
3 formation, followed by meteorological factors and NOx [
54,
55]. This result is in line with the typical mechanism of O
3 generation, which is driven by photochemical reactions. In the hot and bright summer months, meteorological conditions dominate O
3 formation, while in the colder seasons, O
3 generation is more dependent on the supply of precursor substances. For pollution control, O
3 emission reduction measures should focus on different aspects in different seasons. For example, in summer, emphasis should be placed on controlling O
3 formation driven by meteorological factors, while in autumn and winter, reducing emissions of NOx, VOCs, and other precursor substances should be prioritized.
3.5. Two-DimensionalPartial Dependence Plot of Four Seasons
Figure 7 shows the influence of different meteorological factors and pollutants on O
3 concentration by 2D-PDP analysis and compares them in spring, summer, autumn, and winter. Overall, the generation of O
3 is closely related to season, precursor substances (such as PAN and NOx) and meteorological factors (such as temperature and relative humidity), and their influences show significant differences in different seasons.
Firstly,
Figure 7a shows the influence of PAN concentration on O
3 generation. In all seasons, the increase in PAN concentration leads to an increase in O
3 concentration, but this upward trend is most significant in summer. This can be attributed to the fact that under the conditions of abundant sunshine and high temperature in summer, PAN, as a product of photochemical reaction, greatly promotes the generation of O
3. Especially in summer, the O
3 concentration rises rapidly with the increase in PAN, reflecting the strong driving effect of summer photochemical reaction. In contrast, the O
3 concentration in winter is lower and less affected by the change in PAN, which reflects the characteristics of insufficient light and inactive photochemical reaction in winter.
Secondly, temperature, as another important meteorological factor, its influence on O
3 generation is reflected in
Figure 7b. The O
3 concentration increased significantly with the increase in temperature in spring and autumn, especially in spring, where the increase in temperature was most closely related to the increase of O
3 concentration. This indicates that the photochemical reaction in spring is more active, and the increase in temperature promotes the transformation of precursor substances and the generation of O
3. However, the O
3 concentration changes more slowly in summer and winter, especially in summer. Although high temperature is a favorable condition for the generation of O
3, at a certain temperature, the O
3 concentration no longer increases significantly due to other inhibitory factors (such as high humidity or the saturation effect of NOx). Similarly, the O
3 concentration in winter does not increase significantly with the change in temperature, indicating that the photochemical reaction is limited in winter.
Figure 7c shows the effect of NOx concentration on O
3. In all seasons, the increase in NOx leads to a decrease in O
3 concentration, especially in winter. This phenomenon can be attributed to the so-called “NOx saturation” effect, that is, when the NOx concentration is too high, it will not only not promote the generation of O
3 but will reduce its concentration by reacting with O
3. In winter, due to the weak photochemical reaction, the increase in NOx will significantly inhibit the generation of O
3. In contrast, although O
3 concentrations also decreased in spring and autumn, the negative effects of NOx were not as significant as in winter due to moderate sunlight and temperature.
Figure 7d shows the effect of relative humidity on O
3 generation. In all seasons, an increase in relative humidity will lead to a decrease in O
3 concentration, especially in spring and autumn, when the humidity exceeds 60%, the O
3 concentration decreases significantly. This may be because O
3 in the atmosphere is more easily removed under high humidity conditions, thereby reducing its concentration. In addition, increased humidity may also inhibit the occurrence of photochemical reactions, further reducing the generation of O
3. The O
3 concentration in winter was originally low and did not change much with humidity, indicating that humidity had a limited effect on O
3 in winter.
In summary, the figure reveals the sensitivity of O3 concentration to multiple driving factors in different seasons. In summer, the increase in PAN and temperature significantly promoted the generation of O3, while NOx and relative humidity had an inhibitory effect on it. In winter, the effect of NOx was the most significant, and high concentrations of NOx significantly inhibited the generation of O3, while humidity had little effect on O3. Spring and autumn are relatively balanced seasons, with O3 concentrations rising when temperature and PAN increase, but high humidity or NOx concentrations also inhibit their production. This seasonal difference reveals the complexity of the O3 generation mechanism and emphasizes the need to develop O3 pollution control strategies for different seasons.
3.6. Three-DimensionalPartial Dependence Plot
Figure 8 shows the predicted impact of different combinations of factors on O
3 concentration through 3D-PDP analysis. The four sub-graphs in the figure, respectively, combine different meteorological and pollutant factors in binary form to explore their synergistic effects on O
3 generation. The color represents the O
3 concentration, and the color scale changes from light blue (low concentration) to orange (high concentration), which illustrates the trend of O
3 concentration under specific conditions. From these charts, it can be seen that O
3 concentration is affected by multiple factors, and the change pattern of O
3 concentration under different combinations of factors shows significant differences.
First, the interaction between PAN and NOx on O
3 concentration. Overall, as the concentrations of PAN and NOx increase, the concentration of O
3 also increases. This is similar to previous studies [
53]. In particular, when the concentration of PAN gradually increases from 0.01 ppb, the concentration of O
3 increases rapidly, indicating that PAN significantly promotes the generation of O
3 under high NOx concentration conditions. This is consistent with the characteristics of PAN as a photochemical reaction product, and PAN can accelerate the generation of O
3 in a high NOx environment. This phenomenon shows that PAN and NOx have a synergistic effect in the generation of photochemical smog, especially in areas with more serious pollution. When NOx and PAN increase at the same time, the O
3 concentration may rise sharply, leading to more serious photochemical pollution events.
From the synergistic effect of temperature and PAN concentration on O
3. It can be seen that under higher PAN concentrations (1–2 ppb) and higher temperatures (20–30 °C), the O
3 concentration increases significantly. This shows that high temperature and high PAN concentration jointly promote the generation of O
3, especially when the temperature exceeds 20 °C, the O
3 concentration rises rapidly, indicating that the photochemical reaction under high temperature conditions is very active. Elevated temperatures accelerate the thermal degradation of PAN, thereby enhancing photochemical reaction rates and subsequently increasing O
3 concentrations [
53]. However, at low PAN concentrations (less than 1 ppb), the O
3 concentration does not change much with increasing temperature. This may be due to insufficient reactants at low PAN concentrations, which cannot fully promote the generation of O
3, so the change in temperature has limited effect on the O
3 concentration. This shows that PAN concentration is a prerequisite for temperature-driven O
3 generation.
The interaction between NOx and temperature found that at lower NOx concentrations, increased temperature significantly promoted O
3 generation [
59], especially when NOx concentration was less than 10 µg/m
3, and increased temperature significantly increased O
3 concentration. However, as NOx concentration increased to nearly 30 µg/m
3, O
3 concentration showed a downward trend regardless of temperature changes and could not be reversed even under high temperature conditions. This suggests that under high NOx concentrations, a “NOx suppression effect” may occur, that is, excessive NOx will react with O
3, resulting in a decrease in O
3 concentration. This result highlights the complexity of the effect of temperature on O
3 generation under high NOx conditions, indicating that a single control of temperature is not sufficient to effectively reduce O
3 pollution in a highly polluted environment.
Finally, the lower right figure shows the effect of relative humidity and temperature on O3 concentration. It can be seen that under higher temperature conditions (20–30 °C), the increase in humidity has a significant inhibitory effect on O3 concentration, and when humidity exceeds 60%, the O3 concentration decreases significantly. This may be because the presence of water vapor under high humidity conditions accelerates the scavenging reaction of O3 or inhibits the photochemical reaction of precursor substances, thereby reducing the generation of O3. On the other hand, under low temperature conditions (less than 10 °C), the effect of humidity on O3 concentration is relatively gentle, indicating that temperature is the key factor determining the generation of O3, while the effect of humidity is mainly reflected in the inhibitory effect on O3 concentration under high temperature conditions.
In summary, this figure reveals the effect of the interaction of different meteorological factors and pollutants on O3 concentration through the 3D-PDP method. High temperature, low humidity, higher PAN, and moderate NOx concentration help promote the generation of O3, while high humidity and high NOx concentration have the effect of inhibiting the generation of O3. These findings provide important insights into the formation mechanism of O3 pollution and emphasize the need to consider the interaction of multiple factors when controlling O3 pollution, especially to formulate targeted response strategies under different meteorological and pollution conditions.
3.7. Min Depth
The above RF-based analysis of O
3 drivers briefly illustrates their respective impacts. Since O
3 is affected by multiple complex drivers, the interaction between multiple drivers also plays a vital role in the formation of O
3. Therefore, it is necessary to study the different factors. Although RF cannot be directly used to study the interaction between paired drivers, exploring the structure of RF decision trees can help extract information from different factors. The main method of this structural mining analysis is based on the concept of minimum depth, that is, the minimum distance from the depth of a factor to the root of the tree. The minimum depth is within 3, which represents a strong contribution to the prediction of O
3. For a single factor, if its MD is shallow, it is considered to be more important for the prediction of O
3. As shown in
Figure 9, in the minimum depth analysis of the RF model, NOx, and PAN are the most important features for predicting O
3 concentration. The same as the previous feature importance and SHAP analysis. Although PAN itself is a secondary pollutant, it has a far-reaching impact in atmospheric chemistry, especially through its thermal decomposition to release NOx, which continues to affect O
3 production. In addition, as one of the VOCs, propane also has a relatively low minimum depth, showing its greater contribution in the model. This is attributed to the reaction of propane with OH· in the atmosphere to form PAN precursors. Although PM
2.5 (fine particulate matter) has a complex relationship with O
3 generation, it can indirectly affect O
3 concentration by adsorption, scattering, or changing the chemical environment in the atmosphere. The effects of meteorological factors such as relative humidity [
33] and wind direction on O
3 generation are more reflected in the transmission and diffusion of pollutants. For example, changes in wind direction may affect the accumulation and dilution of pollutants between regions, while humidity can inhibit the occurrence of certain photochemical reactions and indirectly reduce O
3 generation.
For the effect of temperature, although temperature is one of the variables with a more significant impact on O3 concentration in the previous feature importance and SHAP value analysis, its importance is lower in this minimum depth analysis. This phenomenon may be caused by a variety of factors. First, NOx and PAN show a stronger dominant role in the model, especially in the atmospheric photochemical reaction chain, where NOx is directly related to O3 production, while temperature indirectly affects O3 production mainly by regulating the reaction rate. Therefore, although temperature has an impact on atmospheric reactions, its effect may be more indirect than that of NOx and PAN, resulting in a relatively large minimum depth. Secondly, the RF model is able to capture the complex nonlinear interactions between features, and temperature may have a strong interactive effect with other meteorological variables (such as humidity or wind speed), resulting in a small contribution of temperature alone, but it is still important in the overall impact. The feature importance and SHAP value focus more on the direct impact of quantitative variables on the prediction results and may ignore the complexity of this interaction to a certain extent. This shows that although temperature still has an impact that cannot be ignored in O3 prediction.
3.8. Backward Trajectory Concentration Clustering
The trajectory clustering is shown in
Figure 10, and the pollutant concentration clustering table is shown in
Table 1. Starting from spring, the air mass clustering shows that the airflow from the inland southwest (passing through the Shanghai urban area) accounts for 40.46%, the airflow from the eastern sea direction accounts for 23.14%, and the northward airflow passing through the Shandong Peninsula along the Chinese coastline to Chongming accounts for 36.4%. The clustering analysis of pollutant concentrations reveals that the main sources of PAN and O
3 are located in the inland and marine directions. This suggests that the primary pollution in Chongming Dongtan originates from Shanghai city and from the marine direction. Unlike local sources in Chongming, nitrogen oxides (NOx) emitted by ships have a positive effect on O
3 generation in the marine area farther from the land. Furthermore, time series analysis shows that the variation trends of PM
2.5 and PAN are similar, indicating that PAN and PM
2.5 may share common sources, and PAN is likely to originate from long-range transport.
In summer, the prevailing wind direction (Class I) is from the south of Chongming, accounting for 57.24%, passing through the Jinshan Chemical Park in Shanghai before reaching Chongming. Class II and III wind directions account for 10.01% and 32.75%, respectively, and both come from the sea. Notably, the main contribution to PAN is a result of Class I winds, especially those passing through the Jinshan Chemical Park in Shanghai. In addition, the O3 concentration contribution is higher under Class II winds, further confirming that ship emissions from the sea lead to an increase in O3 levels, which are then transported to Chongming Dongtan.
In autumn, the dominant wind direction remains the sea breeze, with Class I winds from the sea accounting for 57.12%. For this wind direction, PAN and O3 concentrations are generally higher, which further demonstrates that PAN contributes positively to the O3 levels in Chongming Dongtan after being transported from a distance. In winter, inland winds account for only 21.74%. Although PAN concentrations are higher under Class I winds, its effect on O3 concentration increase is limited due to low temperatures. However, Class II and III winds account for nearly 80%, and the O3 clustering concentrations are 87.5 μg/m3 and 88.83 μg/m3, respectively. This suggests that O3 generation in winter primarily originates from the sea and is transported long-distance, significantly influencing O3 levels in Chongming Dongtan. PAN’s promoting effect on O3 generation may exist in certain seasons (such as spring and autumn), but in other seasons (such as summer and winter), its effect may be regulated by other atmospheric factors. Therefore, it can be inferred that the role of PAN in O3 generation is seasonal and condition-dependent, rather than a simple co-transport relationship. According to the results of machine learning, PAN contributes the most to O3 in summer, indicating that under high-temperature conditions in summer, the life of PAN is reduced and it decomposes into NOx and VOCs, resulting in a higher contribution to O3 generation than in other seasons. Additionally, considering the backward trajectory analysis, it is believed that the transport of O3 from the ocean to Chongming Dongtan affects the local O3 concentration, but the high concentration of NOx in the local area has a negative impact on O3 generation.
In summary, although ship emissions play a leading role in the generation of O3 in spring and summer, especially in high-temperature and strong-light conditions, NOx emitted by ships generates a large amount of O3 through photochemical reactions; urban emissions are also not to be ignored, especially in autumn and winter, when urban emissions contribute significantly to pollutants such as PAN and PM2.5. This seasonal difference reveals the combined effect of ship and urban emission sources on the concentration of atmospheric pollutants.