Predicting Urban Traffic Under Extreme Weather by Deep Learning Method with Disaster Knowledge

Tang, Jiting; Zhu, Yuyao; Yang, Saini; Jaeger, Carlo

doi:10.3390/app15179848

Open AccessArticle

Predicting Urban Traffic Under Extreme Weather by Deep Learning Method with Disaster Knowledge

by

Jiting Tang

^1,2

,

Yuyao Zhu

^3,4,

Saini Yang

^5,6,7,*

and

Carlo Jaeger

⁸

¹

School of Artificial Intelligence, China University of Mining and Technology-Beijing, Beijing 100083, China

²

Key Laboratory of Intelligent Mining and Robotics, Ministry of Emergency Management, Beijing 100083, China

³

College of Environmental Sciences and Engineering, Peking University, Beijing 100871, China

⁴

International Institute for Applied Systems Analysis, A-2361 Laxenburg, Austria

⁵

Joint International Research Laboratory of Catastrophe Simulation and Systemic Risk Governance, Beijing Normal University at Zhuhai, Zhuhai 519087, China

⁶

School of National Safety and Emergency Management, Beijing Normal University, Beijing 100875, China

⁷

Integrated Research on Disaster Risk, Beijing 100094, China

⁸

Global Climate Forum, 10178 Berlin, Germany

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2025, 15(17), 9848; https://doi.org/10.3390/app15179848

Submission received: 18 June 2025 / Revised: 29 July 2025 / Accepted: 5 September 2025 / Published: 8 September 2025

Download

Browse Figures

Versions Notes

Abstract

Meteorological and climatological trends are surely changing the way urban infrastructure systems need to be operated and maintained. Urban road traffic fluctuates more significantly under the interference of strong wind–rain weather, especially during tropical cyclones. Deep learning-based methods have significantly improved the accuracy of traffic prediction under extreme weather, but their robustness still has much room for improvement. As the frequency of extreme weather events increases due to climate change, accurately predicting spatiotemporal patterns of urban road traffic is crucial for a resilient transportation system. The compounding effects of the hazards, environments, and urban road network determine the spatiotemporal distribution of urban road traffic during an extreme weather event. In this paper, a novel Knowledge-driven Attribute-Augmented Attention Spatiotemporal Graph Convolutional Network (KA3STGCN) framework is proposed to predict urban road traffic under compound hazards. We design a disaster-knowledge attribute-augmented unit to enhance the model’s ability to perceive real-time hazard intensity and road vulnerability. The attribute-augmented unit includes the dynamic hazard attributes and static environment attributes besides the road traffic information. In addition, we improve feature extraction by combining Graph Convolutional Network, Gated Recurrent Unit, and the attention mechanism. A real-world dataset in Shenzhen City, China, was employed to validate the proposed framework. The findings show that the prediction accuracy of traffic speed can be significantly increased by 12.16%~31.67% with disaster information supplemented, and the framework performs robustly on different road vulnerabilities and hazard intensities. The framework can be migrated to other regions and disaster scenarios in order to strengthen city resilience.

Keywords:

traffic prediction; extreme weather; deep learning; disaster knowledge

1. Introduction

The impact of extreme weather results in greater variability in urban road traffic and poses a significant challenge for traffic prediction and management. In coastal areas, for example, tropical cyclones often bring extreme weather events, including strong winds and heavy rain, resulting in flooding, massive traffic delays, and accidents [1]. It was reported that the commuting time during rush hours in a typhoon season increased by 30%~60% than the usual workdays in Shenzhen City, China (http://www.sz.gov.cn/en_szgov/news/notices/content/post_8000824.html, accessed on 13 October 2023). During Hurricane Sandy in 2012, it took 132 h for traffic in New York City to return to normal [2]. The frequency and intensity of extreme weather events such as tropical cyclones seem to be increasing due to climate change [3]. Sustainable Development Goal 11 (SDG11) aims to make cities inclusive, safe, resilient, and sustainable [4]. Predicting spatiotemporal patterns of urban road traffic accurately under extreme weather is critical to strengthening city safety and resilience [5,6].

While prior research has extensively examined infrastructure damage [7] and economic impacts [8,9] from extreme weather, understanding how urban traffic systems dynamically respond remains a critical research gap. Meteorological factors create multifaceted transportation challenges: precipitation affects road surfaces by reducing friction coefficients and increasing braking distances, while intense rainfall significantly impairs visibility and driving conditions. Strong crosswinds present additional hazards to vehicle stability and control [10]. Beyond direct effects, extreme weather can cause secondary disruptions through fallen trees, flooding, and infrastructure damage [11], triggering complex ripple effects across transportation networks [11]. These impacts are further compounded when navigation systems redirect traffic, potentially overloading alternative routes during peak periods. The intricate interplay between environmental conditions, infrastructure vulnerability, and traffic flow dynamics represents a significant and underexplored area in transportation research.

The data-driven methods are a trend to improve the performance of traffic prediction. Classical traffic forecasting models typically focus on extracting the temporal correlation of traffic flow. Recent cutting-edge studies have demonstrated the feasibility and superiority of deep learning in traffic prediction. Various algorithms based on Recurrent Neural Network (RNN) such as Long Short-Term Memory (LSTM) [12], bidirectional LSTM [13], sequence-to-sequence learning [14,15], Gated Recurrent Unit (GRU) [16], and an attention mechanism [17] are well-suited for capturing time dependence and widely used in traffic prediction tasks. Among them, GRU is particularly effective in terms of solution quality and inference speed [16]. Researchers have also realized the importance of the strong spatial interaction of transportation networks. As a result, algorithms based on Convolutional Neural Network (CNN) such as CNN-LSTM [18], Conv-LSTM [19], and 3D CNN [20] have been developed to extract spatiotemporal features from traffic information. These methods use Euclidean distance to measure the spatial correlation of raster data. However, real-world traffic data has a non-Euclidean structure of directional topology. Thus, Graph Convolution Network (GCN) has been introduced to extract from graph data. GCN-based deep learning algorithms are a direction for technological improvement.

While deep learning has demonstrated significant capabilities in processing spatiotemporal data patterns from large datasets [21,22], current architectures still present notable limitations requiring further development. First, model architecture remains a critical factor—optimal structural design directly influences predictive performance when working with adequate training data [23,24]. This architectural optimization represents a persistent research challenge across various deep learning applications. Additionally, while traffic data sequences capture fundamental state characteristics, they often fail to incorporate all relevant influencing factors [25]. These limitations highlight the need for improvement in neural network design and feature representation for traffic prediction tasks.

Studies have found that future traffic states are not only dependent on historical traffic information but also impacted by external factors, such as the natural environment [26], surrounding infrastructure [27], and especially weather conditions [28] during extreme weather. The knowledge-driven information fusion provides us with new ideas to predict urban traffic under extreme weather. According to the United Nations Office for Disaster Risk Reduction (UNDRR), the disaster risk results from the complex interaction between development processes that generate conditions of exposure, vulnerability, and hazard (https://www.preventionweb.net/understanding-disaster-risk/component-risk/disaster-risk, accessed on 7 October 2023). Urban roads are critical infrastructures exposed to the natural environment. Extensive research has been carried out on modeling the spatial–temporal correlation of traffic flow itself or considering insufficient external factors [29,30,31]. While previous research has advanced traffic prediction, the specific compound effects of multi-hazards and varying environmental conditions have not been thoroughly investigated, especially concerning extreme weather scenarios.

Addressing SDG11, there is a need for a traffic prediction model that combines disaster knowledge with spatiotemporal correlation to predict traffic status under extreme weather, which is more practical for building resilient cities.

Therefore, we developed a data-driven and knowledge-driven traffic prediction framework. Our work improves the structures of traffic prediction models, imitates the cognitive process of experts with respect to real-time changes in traffic, and optimizes the network to extract spatiotemporal correlation from high-dimensional massive data. Moreover, a new data fusion module is designed by integrating hazards and environment knowledge. Here, the two hazards considered are compound precipitation and wind, as they are most likely to occur in China, particularly in the southeast coastal areas during the summer, which is related to the frequent occurrence of tropical cyclones [32]. The environment information includes social environment and natural environment. By identifying these potential changes in traffic flow early on, particularly for urban road systems that are often heavily impacted by extreme weather events, the framework can serve as an invaluable tool for both early warning and traffic management. The ability to anticipate traffic disruptions can help adapt traffic management strategies quickly and minimize the adverse effects of such events on urban transportation, such as delays, accidents, and increased travel times. Overall, our work has the potential to greatly enhance the resilience and responsiveness of urban road systems to extreme weather events.

The remainder of this paper is organized as follows. Section 2 outlines the methodology employed in the study. Section 3 provides detailed information of the experiment. Section 4 presents the numerical results and discusses the model’s performance. Section 5 concludes the paper.

2. Methodology

2.1. Framework

This study proposes a novel Knowledge-driven Attribute-Augmented Attention Spatio-Temporal Graph Convolutional Network (KA3STGCN) framework (Figure 1) for urban traffic prediction under extreme weather.

We develop a physics-informed attribute-augmented unit that fundamentally advances beyond traditional feature concatenation approaches through its dynamic coupling mechanism. This unit uniquely integrates the following: (1) dynamic hazard attributes including wind speed and precipitation with adaptive weighting based on real-time intensity, (2) static environment attributes, including Points of Interest (POI) and Digital Elevation Model (DEM), representing social and natural infrastructure vulnerability, and (3) historical traffic states—all processed through a parallel architecture that preserves feature distinctiveness while enabling nonlinear interactions.

The attribute-augmented unit is fed into the deep learning model to capture and predict the spatiotemporal pattern of urban road traffic. The model comprises three main components: GCN, GRU, and an attention mechanism. GCN is used to extract spatial dependence while accounting for hazard-modulated road vulnerabilities. GRU is well-suited for modeling the temporal dependence of road traffic. Compared with LSTM, GRU has fewer parameters with faster training speed and convergence, while maintaining comparable performance. Furthermore, the attention mechanism can be employed to adjust the relative importance of different horizons. While the components process spatial and temporal dependencies sequentially, the attribute-augmented unit employs a parallel architecture. The hazard attributes and traffic data are processed independently before fusion, thereby preserving their distinct characteristics.

The definitions of variables in Figure 1 are as follows:

Definition 1.

Urban road network

G

. The urban roads are modeled as an unweighted network

G = (V, E)

.

V = v_{1}, v_{2}, \dots, v_{n}

is the set of

n

roads;

E = e_{1}, e_{2}, \dots, e_{m}

is the set of

m

edges connecting different roads. The adjacency matrix

A

is used to illustrate the connectivity of

G

and composed of 0 and 1, where 1 means the corresponding roads are connected, and 0 otherwise.

Definition 2.

Traffic state matrix

X

.

x_{i}^{t}

denotes the traffic state on the i-th road at time t. The traffic states are usually described as the road speed, density, or traffic flow. Without loss of generality, traffic speed is used as an example of traffic information in experiments.

Definition 3.

Hazard attribute matrix

H A

.

H A = \{{H A}_{1}, {H A}_{2}, \dots, {H A}_{w}\}

is a collection of

w

different dynamic hazard factors. For j-th hazard attribute

{H A}_{j} = \{j_{1}, j_{2}, \dots, j_{i}\}, j \in [1,2, \dots, w]

,

j_{i} = \{{j_{i}}^{1}, {j_{i}}^{2}, \dots, {j_{i}}^{t}\}

is the time series of the

j

-th hazard attribute of the

i

-th road. Here, the hazards include two meteorological factors—wind speed and precipitation.

Definition 4.

Environment attribute matrix

E A

.

E A = \{{E A}_{1}, {E A}_{2}, \dots, {E A}_{p}\}

is a collection of

p

different static environment factors. For the j-th environment attribute

{E A}_{j} = \{j_{1}, j_{2}, \dots, j_{i}\}, j \in [1,2, \dots, p]

,

j_{i}

is the

j

-th environment attribute of the

i

-th road. Here, the environment factors include POI and terrain.

The traffic predicting problem aims to learn a function

f

that is able to predict

T

future traffic states given the urban road network G, historical traffic matrix

X

, the hazard attribute matrix

H A

, and the environment attribute matrix

E A

, as shown in Equation (1).

[X_{t + 1}, X_{t + 2}, \dots, X_{t + T}] = f (G, X| (H A, E A)) .

(1)

2.2. Attribute-Augmented Unit

The attribute-augmented unit serves as the core innovation of our framework, designed to effectively integrate disaster knowledge with traffic prediction through three novel technical contributions.

At time t, if disaster information is not considered, the input

A_{0}^{t}

can be expressed as the following:

A_{0}^{t} = [X^{t}], A_{0}^{t} \in R^{n} .

(2)

The attribute-augmented unit joins traffic speed matrix

X

and the hazard attributes

H A

and environment attributes

E A

. Notably, we introduce a dynamic cumulative hazard window mechanism that captures both immediate and delayed disaster impacts. Specifically, for each hazard attribute

{H A}_{i}, i \in [1, w]

, we construct an extended time window

k

to model the temporal propagation of disaster effects, where

k

represents the historical horizon. This design explicitly accounts for the lagged consequences of extreme weather events, such as gradual flooding after heavy rainfall., i.e., picking the hazard attributes

{H A}_{i}^{t - k, t} = [{H A}_{i}^{t - k}, {H A}_{i}^{t - k - 1}, \dots, {H A}_{i}^{t}]

for each submatrix

{H A}_{i}

when generating

A^{t}

.

E A \in R^{n \times p}

is only calculated once and used repeatedly without introducing additional uncertainty. Finally, the complete attribute-augmented matrix

A^{t}

including both time-variant hazard and time-invariant environment attributes as well as traffic speed at time t is formed as the following:

A^{t} = [X^{t}, {H A}_{1}^{t - k, t}, {H A}_{2}^{t - k, t}, \dots, {H A}_{w}^{t - k, t}, {E A}_{1}, {E A}_{2}, \dots, {E A}_{p}] .

(3)

Here,

A^{t} \in R^{n \times (1 + p + w * (k + 1))}

.

Thus, Equation (1) can be transformed as the following:

[X_{t + 1}, X_{t + 2}, \dots, X_{t + T}] = f (G, A^{t}) .

(4)

Our attribute-augmented unit

A^{t}

is not a static feature concatenation, but rather a physics-informed dynamic coupling mechanism. This parallel design preserves the distinct characteristics of each feature type while allowing for learned interactions through subsequent network layers. The transformation in Equation (4) demonstrates how these augmented features are incorporated into the prediction framework.

2.3. Models

A deep learning model is designed to capture spatiotemporal dependencies by combing the GCN, GRU, and attention mechanism.

2.3.1. Spatial Dependence Modeling

In this paper, spatial dependency is modeled by GCN. The learning process of graph convolution is similar to the convolution and coding for node information. It mainly aggregates neighbor nodes through the adjacency matrix. Parameters are shared during the aggregation process [33]. Given an adjacency matrix A and the augmented matrix

A^{t}

of the road network G, the GCN model constructs a filter in the Fourier domain. The hidden layers in GCN can be represented as

H^{l + 1} = f (H^{l}, A)

. The hazard and environment attributes are scaled, where

H^{l}

includes hazard-modulated road vulnerabilities.

The propagation rule

f

in spatial domain-based GCN model is defined as follows [34]:

H^{l + 1} = σ ({\tilde{D}}^{- \frac{1}{2}} \tilde{A} {\tilde{D}}^{- \frac{1}{2}} H^{l} W^{l})

(5)

where

H^{l + 1}

is the output and

H^{0} = A^{t}

.

σ (\cdot)

is the nonlinear activation function.

\tilde{A} = A + I_{N}

represents the adjacency matrix with added self-loops, and

I_{N}

is the identity matrix.

\tilde{D} = \sum_{j} {\tilde{A}}_{i j} i s

the corresponding degree matrix.

W^{l}

is the trainable weight matrix of the l-th layer. Equation (5) demonstrates how to normalize the graph into a regular network, obtain parameters and weights, and then return the output to the following layer.

2.3.2. Temporal Dependence Modeling

The temporal dependency is modeled by GRU after the GCN cell. GRU was regarded as a simplification and improvement of LSTM [35,36]. GRU solves the problems of gradient disappearance or gradient explosion in back propagation of long-term memory by using update gate

z_{t}

and reset gate

r_{t}

[37]. The “gate” here refers to the matrix multiplication that selectively controls the flow of information.

At time

t

, the internal processes of a GRU cell are shown below.

Firstly, the update gate

z_{t}

is calculated as follows:

z_{t} = σ (W_{z} \cdot [h_{t - 1}, g c ({A, A}^{t})] + b_{z})

(6)

where

g c (\cdot)

represents the graph convolution process and is defined in (5), and

g c ({A, A}^{t})

is the current input.

h_{t - 1}

is the hidden state from the previous node. The activation function σ converts data into values in the range of 0~1 as the gating signal.

W_{z}

and

b_{z}

are the weight and bias.

z_{t}

is used to limit how much old information is incorporated into the current data.

Secondly, the reset gate

r_{t}

is calculated as follows:

r_{t} = σ (W_{r} \cdot [h_{t - 1}, g c ({A, A}^{t})] + b_{r}),

(7)

\tilde{h_{t}} = \tanh (W_{\tilde{h}} \cdot [{r_{t} * h}_{t - 1}, g c ({A, A}^{t})] + b_{\tilde{h}}) .

(8)

Here,

W_{r}, W_{\tilde{h}}

and

b_{r}, b_{\tilde{h}}

are the weight and bias. The activation function tanh converts data into values in the range of −1~1 to avoid data disappearance or explosion. The old state would be added into the new state

\tilde{h_{t}}

by the control of

r_{t}

.

Finally, the current output state

h_{t}

is calculated based on the new state

\tilde{h_{t}}

and last output

h_{t - 1}

, as follows:

h_{t} = (1 - z_{t}) * h_{t - 1} + z_{t} * \tilde{h_{t}} .

(9)

2.3.3. Attention Mechanism

In the previous calculation process, a large number of feature maps of different channels was generated. However, the importance of the information transmitted via channels varies. So, the proposed model employs the attention mechanism to dynamically adjust the contribution of hazard features based on their predictive utility. Here, a multi-layer perception is added to the model after GCN and GRU [38].

Given a query

q

and the hidden layer vector

H = [h_{1}, h_{2}, \dots, h_{N}]

,

N

is the length of the time series. For each

h_{i}, i \in [1, N]

, the probability

α_{i}

of selecting

h_{i}

is as follows:

α_{i} = p (z = i| H, q) = s o f t m a x (s (h_{i}, q)) = \frac{\exp (s (h_{i}, q))}{\sum_{j = 1}^{N} s (h_{j}, q)},

(10)

s (h_{i}, q) = w_{(2)} (w_{(1)} H + b_{(1)}) + b_{(2)} .

(11)

Here,

s (h_{i}, q)

is calculated based on the additive model of two hidden layers using linear transformation [39].

w_{(1)}

and

b_{(1)}

are the weight and bias of the first hidden layer, and

w_{(2)}

and

b_{(2)}

are the weight and bias of the second hidden layer, respectively. The higher the information relevance with

q

, the higher the weight of

h_{i}

. The attention score was then determined using a weighted average, as follows:

A t t (H, q) = \sum_{i = 1}^{N} α_{i} h_{i} .

(12)

Finally, the full connection layer is used to output the prediction results.

2.3.4. Loss Function

In the training process, the goal is to minimize the error between the real traffic speed on the roads and the predicted value. Thus, the loss function’s goal is to minimize the prediction error, as follows:

L o s s = ‖y_{t} - \hat{y_{t}}‖ + λ L_{r e g} .

(13)

Here,

y_{t}

and

\hat{y_{t}}

are real traffic speed and predicted speed, respectively.

L_{r e g}

is the L2 regularization term to avoid overfitting, and

λ

is a hyperparameter.

3. Data and Experiments

3.1. Data Description

In our study, the KA3STGCN model is applied to a real-world dataset in Shenzhen City. Shenzhen is a densely populated and economically developed coastal megacity in China with a massive and continuously growing transportation infrastructure (Figure 2a). The city has an area of 1997.47

{k m}^{2}

, a permanent population of 13.44 million as of 2019 (https://www.sz.gov.cn/en_szgov/aboutsz/profile/content/post_11666623.html, accessed on 2 January 2025), a total road mileage of 8066.1

k m,

and a civilian car ownership of 3.53 million as of 2020 (http://tjnj.gdstats.gov.cn:8080/tjnj/2021/directory/15/html/15-11-0.htm, accessed on 2 January 2025). During summer, Shenzhen is vulnerable to frequent tropical cyclones with wind and rain in the western Pacific Ocean. In 2018, the city experienced the most severe damage from tropical cyclones in last decade, including Typhoon Mangkhut. The time period is from 1 April 2018 to 30 September 2018, which is the high incidence period of tropical cyclones. Six categories of data were used in this study, as shown in Table 1, and the multi-source data distributions are presented in Figure 2b–d.

3.2. Data Preprocessing

To address the heterogeneity of multi-source data with varying formats and standards, we established a rigorous data consistency framework through the following steps: First, the urban road network was converted into an adjacency matrix to represent topological relationships. Second, 10 min traffic speed data were aggregated into hourly averages to align with the temporal resolution of meteorological data. For environmental feature extraction, we calculated elevation values averaged across each road segment from 30 m resolution DEM data and determined the dominant POI category per road through kernel density analysis of 14 POI classifications.

Spatial alignment was achieved through a multi-step process: we sampled points along each road polyline at 10 m intervals, computed the average Euclidean distance to neighboring meteorological grid centroids, and assigned grid-recorded wind speed and precipitation values using nearest-neighbor interpolation weighted by these distances. Temporal alignment incorporated an n-hour cumulative window for hazard attributes to capture delayed weather impacts, while maintaining static environment features for each road.

All features underwent Min-Max normalization to [0,1] ranges. The unit can weight hazard impacts based on real-time intensity and road vulnerability. Missing values were excluded to ensure data quality. The final attribute-augmented matrix

A^{t}

combined these normalized features through Equation (3), where sliding hazard windows preserved temporal dependencies and distance-based weighting maintained spatial relationships—a significant advancement over simple concatenation approaches like ASTGCN. All normalized values were denormalized post-prediction for interpretation.

After pre-processing, 1054 road segments were available (two-way roads are regarded as two road segments). The size of the adjacency matrix was 105 × 1054. The size of each environment factor was 1054 × 1. The size of each hazard factor was 1054 × 4392 (183 days, 4392 h).

Furthermore, we verified the impact of disaster-related factors on urban road traffic (Figure 3). Figure 3a shows the changes in traffic speed of one road segment during Typhoon Mangkhut in 2018 compared with a normal sunny day. It is evident that extreme weather drastically reduced traffic speed. Figure 3b depicts the traffic speeds of two road segments dominated by two classes of DEM (Class 1 is lower than Class 2 in altitude). Similar time-varying traffic condition features were observed in both sample groups; however, total traffic speeds were lower in DEM Class 1 than in DEM Class 2. Figure 3c shows the traffic speeds of two roads dominated by two types of urban POI. The traffic speed on roads around living services decreased around 7 a.m. and 6 p.m. The traffic speed on roads neighboring enterprises reached its valley around 8 a.m. and was lower than that of the former roads most of the time.

3.3. Evaluation Metric

Four metrics were used to evaluate the performance of the KA3STGCN model: Root mean square error (RMSE), Mean Absolute Percentage Error (MAPE),

A c c u r a c y

, and Coefficient of Determination (

R^{2}

), which are defined as follows:

R M S E = \sqrt{\frac{1}{M N} \sum_{j = 1}^{M} \sum_{i = 1}^{N} {(y_{i}^{j} - \hat{y_{i}^{j}})}^{2}},

(14)

M A P E = \frac{100}{M N} \sum_{j = 1}^{M} \sum_{i = 1}^{N} \frac{|y_{i}^{j} - \hat{y_{i}^{j}}|}{y_{i}^{j}},

(15)

A c c u r a c y = 1 - \frac{{‖Y - \hat{Y}‖}_{F}}{{‖Y‖}_{F}},

(16)

R^{2} = 1 - \frac{\sum_{j = 1}^{M} \sum_{i = 1}^{N} {(y_{i}^{j} - \hat{y_{i}^{j}})}^{2}}{\sum_{j = 1}^{M} \sum_{i = 1}^{N} {(y_{i}^{j} - \bar{Y})}^{2}},

(17)

where

M

is the number of time samples;

N

is the number of roads.

y_{i}^{j}

and

\hat{y_{i}^{j}}

are the observation and prediction of the i-th road in j-th time.

Y

and

\hat{Y}

represent the set of

y_{i}^{j}

and

\hat{y_{i}^{j}}

, respectively, and

\bar{Y}

is the average of

Y

.

{‖\cdot‖}_{F}

is the Frobenius norm.

R M S E

measures the average magnitude of prediction errors, penalizing larger deviations more heavily. In the context of extreme weather, where traffic speed fluctuations can be abrupt and severe such as sudden drops due to rain or wind,

R M S E

is critical for quantifying the model’s ability to handle such anomalies.

M A P E

expresses errors as a percentage of actual values, making it intuitive for assessing relative accuracy. Under extreme weather, traffic speeds may drop to near-zero (e.g., road closures).

M A P E

helps evaluate whether the model’s relative errors remain acceptable even in such scenarios.

A c c u r a c y

reflects the overall proportion of correctly predicted traffic states across the entire road network. For urban resilience planning, holistic accuracy is key.

R^{2}

quantifies the proportion of variance in traffic speeds explained by the model. A high

R^{2}

indicates that the model accounts for most variability induced by extreme weather. The chosen metrics collectively address the unique challenges of traffic forecasting under extreme weather:

R M S E

and

M A P E

quantify error magnitudes,

A c c u r a c y

evaluates network-wide reliability, and

R^{2}

validates the model’s explanatory power.

3.4. Parameter Settings

In the model training, some parameters were determined based on the experience of existing studies [41]: the optimization was Adaptive Moment Estimation (Adam) [42]; the learning rate was 0.001; the batch size was 64; λ in loss function was 0.0015; and the proportion of the training set was 0.8. Other parameters were searched by experiment, as shown in Figure 4.

(1): Learning horizons. Considering the cumulative effects of hazards on traffic, we expanded the time window size when constructing the attribute-augmented unit. Figure 4a shows the model performance for the learning horizons of {1, 2, 3, 4, 5}. The model performed best considering the last 3 h. The model performance was still robust when the learning horizon was 4 or 5, while the time cost increased with more hidden parameters.
(2): Predicting horizons. Figure 4b shows the model performance when the predicting horizon was {1, 2, 3}. The short-term prediction is better than the long-term prediction, which was consistent with the expectation—the longer predicting horizon has a greater uncertainty.
(3): Training epochs. Figure 4c shows the model performance when the number of training epochs was {500, 1000, 1500, 2000, 3000, 3500, 4000}. As the training epochs increase, the change in evaluation metrics tended to be stable, with a turning point of 3000.
(4): Hidden units. Figure 4d shows the model performance when the number of units in the hidden layer was {8, 16, 32, 64, 100}. With a turning point of 64, the evaluation metrics’ change tends to be stable as hidden units rise. When there are 128 hidden units, the memory overflows due to too many parameters.

In conclusion, we identified the optimal configurations for model training: 3 learning horizons, 1 predicting horizon, 3000 training epochs, and 64 hidden layer units.

4. Results and Discussion

4.1. The Performance of KA3STGCN Model

The experimental results demonstrate that our KA3STGCN framework achieves robust performance in predicting urban traffic under extreme weather. Implemented using TensorFlow, the model converges after 3000 training epochs with the following evaluation metrics:

R M S E = 6.92

,

M A P E = 19.67 %

, and

A c c u r a c y = 0.79

,

R^{2} = 0.78

. These quantitative measures indicate the model’s capability to effectively capture the complex spatiotemporal patterns of urban traffic during disaster events.

We also conducted a time-aware 5-fold cross-validation considering the spatiotemporal dependencies. The dataset is sequentially partitioned into five temporal blocks, ensuring each fold maintains continuous time segments. During validation, we preserved temporal order by using earlier folds for training and subsequent folds for testing. Each fold retained the complete urban road network topology in all splits and included all traffic patterns (peak/off-peak, weekdays/weekends). We reported both temporal metrics (time-wise RMSE) and spatial metrics (node-level RMSE) across folds, with final performance calculated as the average of all out-of-fold predictions, demonstrating consistent Accuracy of 0.79 ± 0.02 on Shenzhen data.

Figure 5 presents a comparative analysis between observed and predicted traffic speeds for two representative roads in Shenzhen during both the whole test set (Figure 5a,b) and Typhoon Mangkhut (Figure 5c,d). First, the model effectively captures fundamental traffic periodicity and trend patterns across different temporal scales. However, performance variations emerge between Road 1 (RMSE = 8.15, MAPE = 22.3%) and Road 2 (RMSE = 6.42, MAPE = 18.1%) during peak typhoon conditions. This divergence primarily stems from the following: (1) differential implementation of emergency traffic controls that affected Road 1 more severely; (2) inherent variations in infrastructure vulnerability between the two roads; and (3) current limitations in modeling compound disaster effects beyond core meteorological factors.

The observed performance differences between test cases highlight important considerations for practical deployment. Most notably, they emphasize the need to incorporate additional data sources—particularly real-time information about traffic control policies and infrastructure conditions—to further enhance prediction accuracy during complex disaster scenarios. We discuss these implementation challenges and potential solutions in greater depth in Section 4.6.

From an operational perspective, our model demonstrates measurable improvements for urban traffic management systems. These technical improvements translate to several concrete benefits: (1) enhanced decision-making for traffic managers through fewer false alarms when issuing congestion warnings; (2) more precise rerouting recommendations during floods and strong winds, particularly for high-risk areas like DEM Class 1 roads; and (3) optimized resource allocation for post-disaster cleanup operations based on improved traffic resumption predictions.

The framework’s superior performance yields broader societal and economic impacts. By reducing prediction errors, the model helps mitigate indirect costs including fuel waste and productivity loss through dynamic signal timing and preemptive lane closures. These capabilities are particularly valuable for coastal cities like Shenzhen that frequently experience tropical cyclones. Furthermore, the system supports SDG11 by minimizing disruptions to critical infrastructure access routes, including roads serving hospitals and other essential services. The combination of improved accuracy and operational applicability positions our approach as a valuable tool for enhancing urban resilience against extreme weather events.

4.2. Model Performance Comparison Results

We contrasted the proposed KA3STGCN model with the following baselines: Historical Average method (HA), Autoregressive Integrated Moving Average model (ARIMA), Support Vector Regression model (SVR), eXtreme Gradient Boosting (XGBoost) [43], Temporal Graph Convolution Network model (TGCN) [44], Attention Spatial-Temporal Graph Convolutional Network (ASTGCN) [39], Attribute-Augmented Spatial-Temporal Graph Convolutional Network (A2STGCN) [41], physics-informed neural networks [45], and Bayesian GCN [46]. The hyperparameters in the above baselines were kept consistent with KA3STGCN. Table 2 shows that our KA3STGCN model performed best among all the models tested.

Comparative analysis demonstrates significant performance differences across model categories. The traditional time series models (HA, ARIMA) exhibit limited predictive capability (Accuracy = 0.60~0.63) owing to their static linear assumptions, which prove inadequate for modeling the non-stationary traffic patterns characteristic of extreme weather events. While shallow learning-based algorithms (SVR, XGBoost) show improved performance (Accuracy = 0.62~0.74) through engineered temporal features, their failure to account for spatial dependencies results in suboptimal performance during network-wide disruptions.

The evaluation reveals that deep learning architectures consistently outperform other approaches. Baseline models including TGCN, ASTGCN, and A2STGCN achieve accuracy levels exceeding 0.7. These models were the degraded versions of the KA3STGCN model in terms of attention mechanism and external disaster information fusion, and their performance was slightly inferior. Through systematic evaluation of four graph-based architectures, we observe progressive performance improvements that highlight the importance of different architectural components for extreme weather traffic prediction. The baseline TGCN (GCN + GRU) achieves an RMSE of 7.90 and MAPE of 23.66%, demonstrating the fundamental capability of spatiotemporal modeling but showing limitations in handling sudden weather-induced traffic variations. The ASTGCN (GCN + GRU + attention) model reduces these metrics to 7.39 RMSE and 24.46% MAPE, with the attention mechanism proving particularly effective for prioritizing critical temporal segments during weather events (6.79% RMSE improvement over TGCN). However, its performance degrades during prolonged extreme conditions due to insufficient incorporation of environmental context. A2STGCN (GCN + GRU+ attribute-augmented unit) shows different strengths, achieving 8.92 RMSE but superior attribute-specific performance (12.96% better MAPE than ASTGCN for DEM Class 1 roads). This suggests that while attribute augmentation improves physical interpretability, the lack of attention mechanisms limits its ability to dynamically adjust to rapidly changing conditions.

Our KA3STGCN (GCN + GRU+ attention + attribute-augmented unit) combines the strengths of both approaches and performs the best in four evaluation metrics. The synergistic combination yields three key advantages: (1) the attention mechanism dynamically weights important temporal segments during extreme events; (2) the attribute-augmented unit provides physics-informed feature representation; and (3) their joint operation enables adaptive focus on both temporal criticality and spatial vulnerability. This is particularly evident during Typhoon Mangkhut (Figure 5c,d), where KA3STGCN maintains stable performance while other models show significant error spikes during peak wind/rain periods.

In addition, we tested physics-informed neural networks and Bayesian GCN because they share key characteristics with our approach, particularly their ability to dynamically weight features based on real-time conditions. The physics-informed neural networks outperformed most methods; it lagged behind KA3STGCN. The incorporation of physical laws likely improved its robustness, but it lacked the spatiotemporal attention and attribute-augmented features of KA3STGCN, which are critical for capturing complex dependencies in the data. The Bayesian GCN implementation demonstrates robust performance (RMSE = 7.15, MAPE = 21.3%) by effectively quantifying prediction uncertainty during extreme weather events. However, our KA3STGCN requires 18% less computational resources. This advantage stems from our physics-informed attribute processing, which provides more direct modeling of disaster dynamics compared to the purely data-driven uncertainty estimation in Bayesian approaches.

The comparative results demonstrate that while individual components provide partial improvements, their integrated implementation in KA3STGCN yields nonlinear performance gains for extreme weather prediction. This suggests that effective disaster-aware traffic modeling requires both dynamic temporal weighting and physics-informed feature representation working in concert.

4.3. Significant Variables and Interpretations

The ablation experiment aimed to assess the impact of different disaster information and their combinations. Table 3 presents the model performances of 16 cases, including no external information (none), hazards (wind, rain), environments (POI, DEM), and their combinations. The average results of five repeated experiments demonstrated that traffic prediction under extreme weather could benefit from the combination of hazard and environment information.

As shown in Table 3, when adding one disaster-related variable, most of the evaluation metrics worsened by incorporating urban POI, DEM, wind, or rain. The explanation could be that a single feature is valid only on a small portion of the data, but the complex model and the insufficient training data result in the feature being ineffective on the whole dataset, while reducing the generalization effect on the test set. The single disaster-related variable has minimal effect on urban road traffic and may degrade performance due to sparse feature utility. This finding highlights the need to explore the compounding effects of various complex factors on urban road traffic changes.

Considering the impact of two variables, the combination of static environment attributes (POI + DEM) improved the model prediction precision (Accuracy and

R^{2}

), while the combination of dynamic hazard attributes (wind + rain) reduced the model error (RMSE and MAPE). For the wind, the combinations of rain–wind, DEM–wind, and POI–wind were better than the wind alone. For the rain, the wind–rain combination had a favorable effect, followed by POI–rain. However, the DEM–rain combination was the worst, indicating a synergistic inhibitory effect. The wind–rain coupled hazards can promote the accuracy of urban road traffic prediction, especially for China, as tropical cyclones that land in coastal areas of China are mainly wind and rain coexisting [47]. In addition, we found that supplementing environment information improved prediction accuracy based on the coupling of wind–rain. When using the POI–DEM–wind–rain combination, the model outperformed the previous combinations of variables. The model’s ability to capture the fluctuations in urban road traffic speed was enhanced after considering all the disaster information. The mechanisms of compounding external factors influencing road traffic during disasters deserve to be further studied.

4.4. Robustness Analyses

The proposed KA3STGCN model aims to predict road traffic response to natural hazards during disasters. According to the key components of disaster risk, strong winds and heavy rains induced by tropical cyclones are two main hazards that affect urban roads. Urban roads are critical infrastructures exposed to the natural environment, and their technical grade is closely related to their hazard vulnerability. Here, the model sensitivity is analyzed on different hazard intensities and road vulnerabilities.

Figure 6 shows that the average RMSE on different road grades, precipitation, and wind intensities was primarily concentrated in 3~8, indicating a robust overall performance. For different hazard intensities, we matched the average RMSE of all roads in the test set with the meteorological data including hourly precipitations and wind speeds. Then, we calculated the average RMSE of four wind speeds: 0–3.4 m/s, 3.4–8 m/s, 8–13.9 m/s, and 13.9–17.2 m/s, and of four hourly precipitations: 0–2.5 mm/h (light rain), 2.5–8 mm/h (moderate rain), 8–16 mm/h (heavy rain), and 16–50 mm/h (rainstorm). Figure 6a,b reveals that the model’s bias increased slightly as the precipitation or wind speed (less than 13.9 m/s) increased. The RMSE with the wind speed of 13.9–17.2 m/s is lower than that of 3.4–13.9 m/s. One possible reason is that the bias is caused by the sparse data and other factors.

For different road vulnerabilities, we classified DiDi roads into five grades by matching with the open-source OpenStreetMap (OSM), as shown in Table 4. We calculated the RMSE of each road in all hours of the test set and then determined the average RMSE of different road grades. Figure 6c showed that the difference in RMSE among five urban road grades was not significant, demonstrating that the proposed KA3STGCN model had strong robustness in different urban road vulnerabilities.

4.5. Spatiotemporal Differences

Figure 7 shows the KA3STGCN model performances at different hours of the day with different wind or rain intensities. The red projection showed the outliers of the RMSE were mainly observed during rush hours (5–9 a.m. and 2–6 p.m.). The extreme values of the RMSE appeared in the morning rush hour, which could be explained by the concentrated extreme weather, as shown in the yellow projection. In summer, the temperatures in coastal cities such as Shenzhen can be very high during the day. When the air near the ground receives enough heat from the earth’s surface, the temperature increases, the density decreases, and finally, it rises. When the warm and humid air with a large amount of water vapor rises to a certain height, the air temperature drops, especially at night, then the water vapor condenses into ice crystals or water drops, which are prone to thunderstorms and strong winds from midnight to morning. Extreme weather and early peaks bring more uncertainty to traffic changes.

Figure 8 shows the prediction RMSE of KA3STGCN (ours) and ASTGCN (without disaster knowledge) model on different roads. The results indicate that KA3STGCN performed better than ASTGCN, with most roads having RMSE values below 10. The roads with RMSE over 10 in the ASTGCN model are long-distance rounding various external environments, which are mitigated in the KA3STGCN model. This comparison validates the necessity and illustrates the importance of integrating disaster information into traffic prediction under extreme weather, especially for accurate urban disaster management.

4.6. Generalizability and Limitations

While the KA3STGCN framework demonstrates strong performance in Shenzhen and is designed with generalizability in mind—its architecture does not rely on region-specific assumptions—several limitations warrant discussion regarding its broader applicability. First, the current validation is limited to Shenzhen, China, due to data availability constraints. The model requires high-resolution meteorological data, road segment-level speed measurements, and detailed natural and social environment attributes (e.g., DEM, POI), which are rarely available in consistent formats across different regions. These requirements pose significant challenges for implementation in areas with less comprehensive monitoring infrastructure, particularly during extreme weather events when traditional monitoring systems may fail.

The integration of multi-source data (traffic, weather, and urban infrastructure) introduces additional challenges of data sparsity and heterogeneity, mirroring common problems in smart city applications where data gaps during emergencies remain persistent [48,49]. Future implementations could benefit from advanced data imputation techniques and multi-sensor fusion methods to enhance robustness under incomplete data conditions. Furthermore, practical deployment faces computational constraints that may limit real-time applications, especially for large-scale urban networks. The model’s reliance on high-resolution spatiotemporal data leads to significant computational demands, potentially causing latency issues in emergency response scenarios.

Despite these limitations, our framework provides a replicable blueprint for disaster-aware traffic prediction. The modular design allows adjustments for local data conditions—for instance, substituting missing hazard variables with proxy indicators or leveraging coarser-resolution inputs when necessary. While we employed optimization strategies like model quantization in our experiments, further work is needed to develop lightweight versions suitable for edge computing implementations. These technical limitations, common to many data-intensive urban analytics systems [50,51], highlight the need for continued research into efficient computation methods without sacrificing prediction accuracy.

Our future work will prioritize multi-city validation as compatible datasets emerge, with a focus on standardizing data requirements for global applicability. Moreover, we will incorporate some more recent models such as hybrid knowledge-infused frameworks, Granger causality graph [52], or uncertainty-aware Bayesian frameworks [53] to better capture uncertainties during extreme events. These directions align with recent efforts to bridge data gaps in smart city research, ensuring the model’s potential for broader adoption while maintaining robustness under extreme weather scenarios. Addressing these data and computational challenges will also be crucial for the framework’s adoption across diverse urban contexts with varying technological capabilities.

5. Conclusions

This paper presents KA3STGCN, a novel framework that advances urban traffic prediction under extreme weather through three key methodological innovations. First, we developed a disaster-knowledge attribute augmentation method. This combines dynamic hazard data with static infrastructure vulnerability data. Such integration helps the model capture the complex interplay between weather extremes and road network resilience. Second, our hybrid architecture represents a departure from conventional spatiotemporal models by simultaneously processing spatial, temporal, and hazard dimensions through dedicated GCN, GRU, and attention mechanisms. Third, the framework demonstrates the overall robustness in prediction accuracy across varying hazard intensities, addressing a critical limitation in existing approaches that often fail during extreme events.

Future research could further integrate traffic flow theory and more accurate information into the prediction model. We will commit to multi-city validation as our future work when additional datasets become available. We expect more accurate weather forecasts, more samples during early peak hours, or extreme weather in the future. The framework could be incorporated into the traffic management system to offer system-level real-time routing services. Based on weather forecasts and the proposed model, our method can predict the traffic speed of urban roads in advance, especially under extreme weather, to provide better decision support for individual drivers’ travel planning and government agencies’ disaster preparedness. The proposed model has a strong pioneering potential for coping with extreme weather events and improving transportation resilience at the urban scale under climate change.

Author Contributions

Conceptualization, S.Y.; Methodology, J.T.; Investigation, Y.Z.; Writing—original draft, J.T.; Writing—review & editing, Y.Z. and C.J.; Supervision, S.Y. and C.J.; Funding acquisition, J.T. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the Young Scientists Fund of the National Natural Science Foundation of China (No. 52404182), the Fundamental Research Funds for the Central Universities (No. 2024XJZN01), Henan Provincial Institute of Natural Resources Monitoring and Land Consolidation (No. 2025-672-8) and the innovation group of Ministry of Education, PRC, at Beijing Normal University.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Restrictions apply to the availability of these data.

Conflicts of Interest

The authors declare no conflict of interest.

References

Zhang, C.; Zhao, P.; Huang, Z.; He, Z.; Niu, Y.; Huang, G.; Chen, Y. Assessing vessel transportation delays affected by tropical cyclones using AIS data and a bayesian network: A case study of veronica in northwestern Australia. Ocean Eng. 2024, 308, 14. [Google Scholar] [CrossRef]
Donovan, B.; Work, D.B. Empirically quantifying city-scale transportation system resilience to extreme events. Transp. Res. Part C Emerg. Technol. 2017, 79, 333–346. [Google Scholar] [CrossRef]
Masson-Delmotte, V.; Zhai, P.; Pirani, A.; Connors, S.L.; Péan, C.; Berger, S.; Caud, N.; Chen, Y.; Goldfarb, L.; Gomis, M.I.; et al. Climate Change 2021: The Physical Science Basis; IPCC: Geneva, Switzerland, 2021. [Google Scholar]
Dall’Agnolo, M.M.Y.; Jungling, S.; Smuts, H. Towards a Smart City Sustainability Tracker for Achieving SDG 11 in Cities. In Society 5.0. 2024; Communications in Computer and Information Science; Springer: Cham, Switzerland, 2025; Volume 2173, pp. 84–97. [Google Scholar]
Liu, T.; Zhou, J.; Kwan, H.K. GraphSAGE-Based Dynamic Spatial–Temporal Graph Convolutional Network for Traffic Prediction. IEEE Trans. Intell. Transp. Syst. 2023, 24, 11210–11224. [Google Scholar] [CrossRef]
Sun, H.; Xue, R.; Hu, T.; Pan, T.; He, L.; Rao, Y.; Wang, Z.; Wang, Y.; Chen, Y.; He, H. Predicting Citywide Crowd Flows in Critical Areas Based on Dynamic Spatio-Temporal Network. IEEE Trans. Emerg. Top. Comput. Intell. 2024, 8, 3703–3715. [Google Scholar] [CrossRef]
Hu, F.; Yang, S.; Thompson, R.G. Resilience-Driven Road Network Retrofit Optimization Subject to Tropical Cyclones Induced Roadside Tree Blowdown. Int. J. Disaster Risk Sci. 2021, 12, 72–89. [Google Scholar] [CrossRef]
Wan, Z.; Lang, Q.; Zhang, Y.; Zhang, J.; Chen, Y.; Liu, G.; Liu, H. Improving the resilience of urban transportation to natural disasters: The case of Changchun, China. Sci. Rep. 2025, 15, 1116. [Google Scholar] [CrossRef] [PubMed]
Zhang, J.; Zhang, Z.; Song, D.; Huang, Z.; Lu, L. A Study on a Framework for Identifying Critical Roads in Urban Road Traffic Networks Based on the Resilience Perspective Against the Background of Sustainable Development. Appl. Sci. 2025, 15, 3581. [Google Scholar] [CrossRef]
Gao, W.; Hu, X.; Wang, N. Resilience analysis in road traffic systems to rainfall events: Road environment perspective. Transp. Res. Part D Transp. Environ. 2024, 126, 104000. [Google Scholar] [CrossRef]
Pregnolato, M.; Ford, A.; Wilkinson, S.M.; Dawson, R.J. The impact of flooding on road transport: A depth-disruption function. Trans. Res. Part D Transp. Environ. 2017, 55, 67–81. [Google Scholar] [CrossRef]
Khan, A.; Fouda, M.M.; Do, D.-T.; Almaleh, A.; Rahman, A.U. Short-Term Traffic Prediction Using Deep Learning Long Short-Term Memory: Taxonomy, Applications, Challenges, and Future Trends. IEEE Access 2023, 11, 94371–94391. [Google Scholar] [CrossRef]
Aljebreen, M.; Alamro, H.; Al-Mutiri, F.; Othman, K.M.; Alsumayt, A.; Alazwari, S.; Hamza, M.A.; Mohammed, G.P. Enhancing Traffic Flow Prediction in Intelligent Cyber-Physical Systems: A Novel Bi-LSTM-Based Approach With Kalman Filter Integration. IEEE Trans. Consum. Electron. 2023, 70, 1889–1902. [Google Scholar] [CrossRef]
Abdelraouf, A.; Abdel-Aty, M.; Mahmoud, N. Sequence-to-Sequence Recurrent Graph Convolutional Networks for Traffic Estimation and Prediction Using Connected Probe Vehicle Data. IEEE Trans. Intell. Transp. Syst. 2023, 24, 1395–1405. [Google Scholar] [CrossRef]
Saha, S.; Haque, A.; Sidebottom, G. Analyzing the Impact of Outlier Data Points on Multi-Step Internet Traffic Prediction Using Deep Sequence Models. IEEE Trans. Netw. Serv. Manag. 2023, 20, 1345–1362. [Google Scholar] [CrossRef]
Mirzaei, S.; Kang, J.-L.; Chu, K.-Y. A comparative study on long short-term memory and gated recurrent unit neural networks in fault diagnosis for chemical processes using visualization. J. Taiwan Inst. Chem. Eng. 2022, 130, 104028. [Google Scholar] [CrossRef]
Zheng, Y.; Wang, S.; Dong, C.; Li, W.; Zheng, W.; Yu, J. Urban road traffic flow prediction: A graph convolutional network embedded with wavelet decomposition and attention mechanism. Physical A 2022, 608, 128274. [Google Scholar] [CrossRef]
Bogaerts, T.; Masegosa, A.D.; Angarita-Zapata, J.S.; Onieva, E.; Hellinckx, P. A graph CNN-LSTM neural network for short and long-term traffic forecasting based on trajectory data. Transp. Res. Part C Emerg. Technol. 2020, 112, 62–77. [Google Scholar] [CrossRef]
Zheng, H.; Lin, F.; Feng, X.; Chen, Y. A Hybrid Deep Learning Model With Attention-Based Conv-LSTM Networks for Short-Term Traffic Flow Prediction. IEEE Trans. Intell. Transp. Syst. 2021, 22, 6910–6920. [Google Scholar] [CrossRef]
Zhang, Y.; Fan, K.; Yu, Y. Research on passengers behavior recognition method in public transport vehicles based on efficient 3D CNN. Multimed. Syst. 2025, 31, 15. [Google Scholar] [CrossRef]
Zhang, J.; Yang, Y.; Wu, X.; Li, S. Spatio-temporal transformer and graph convolutional networks based traffic flow prediction. Sci. Rep. 2025, 15, 24299. [Google Scholar] [CrossRef] [PubMed]
Zeghina, A.; Leborgne, A.; Le Ber, F.; Vacavant, A. Deep learning on spatiotemporal graphs: A systematic review, methodological landscape, and research opportunities. Neurocomputing 2024, 594, 127861. [Google Scholar] [CrossRef]
Ahmad, R.; Alsmadi, I.; Al-Ramahi, M. Optimization of deep learning models: Benchmark and analysis. Adv. Comput. Intell. 2023, 3, 7. [Google Scholar] [CrossRef]
Bhatt, N.; Bhatt, N.; Prajapati, P.; Sorathiya, V.; Alshathri, S.; El-Shafai, W. A Data-Centric Approach to improve performance of deep learning models. Sci. Rep. 2024, 14, 22311–22329. [Google Scholar] [CrossRef] [PubMed]
Cheng, Q.; Song, Q.; Wang, Z.; Lin, Y.; Liu, Z. Capturing traffic state variation process: An analytical modeling approach. Transp. Res. Part E Logist. Transp. Rev. 2025, 198, 104119. [Google Scholar] [CrossRef]
Kyle, S.B.; Jonathan, M.C.; James, A.H.; Joerg, K. Recreational walking decisions in urban away-from-home environments: The relevance of air quality, noise, traffic, and the natural environment. Transp. Res. Part F Psychol. Behav. 2019, 65, 363–375. [Google Scholar]
Hasnine, M.S.; Hawkins, J.; Habib, K.N. Effects of built environment and weather on demands for transportation network company trips. Transp. Res. Part A Policy Pract. 2021, 150, 171–185. [Google Scholar] [CrossRef]
Bi, H.; Ye, Z.; Zhu, H. Data-driven analysis of weather impacts on urban traffic conditions at the city level. Urban Clim. 2022, 41, 101065. [Google Scholar] [CrossRef]
Nigam, A.; Srivastava, S. Hybrid deep learning models for traffic stream variables prediction during rainfall. Multimodal Transp. 2023, 2, 100052. [Google Scholar] [CrossRef]
Avila, A.M.; Mezic, I. Data-driven analysis and forecasting of highway traffic dynamics. Nat. Commun. 2020, 11, 2090. [Google Scholar] [CrossRef] [PubMed]
Hou, Y.; Deng, Z.; Cui, H. Short-Term Traffic Flow Prediction with Weather Conditions: Based on Deep Learning Algorithms and Data Fusion. Complexity 2021, 2021, 6662959. [Google Scholar] [CrossRef]
Zhang, Y.; Sun, X.; Chen, C. Characteristics of concurrent precipitation and wind speed extremes in China. Weather Clim. Extrem. 2021, 32, 100322. [Google Scholar] [CrossRef]
Li, Q.; Han, Z.; Wu, X.-M. Deeper Insights into Graph Convolutional Networks for Semi-Supervised Learning. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence/Thirtieth Innovative Applications of Artificial Intel-ligence Conference/Eighth AAAI Symposium on Educational Advances in Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018. [Google Scholar]
Defferrard, M.; Bresson, X.; Vandergheynst, P. Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering. Adv. Neural Inf. Process. Syst. 2016, 29, 3844–3852. [Google Scholar]
Chen, X.; Tao, Y.; Xu, W.; Yau, S.S. Recurrent Neural Networks Are Universal Approximators With Stochastic Inputs. IEEE Trans. Neural Netw. Learn. Syst. 2022, 34, 7992–8006. [Google Scholar] [CrossRef]
Ma, Q.; Li, S.; Cottrell, G.W. Adversarial Joint-Learning Recurrent Neural Network for Incomplete Time Series Classification. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 1765–1776. [Google Scholar] [CrossRef]
Bahdanau, D.; Cho, K.; Bengio, Y. Neural Machine Translation by Jointly Learning to Align and Translate. arXiv 2014, arXiv:1409.0473. [Google Scholar]
Kumar, R.; Panwar, R.; Chaurasiya, V.K. Urban traffic forecasting using attention based model with GCN and GRU. Multimed. Tools Appl. 2023, 83, 47751–47774. [Google Scholar] [CrossRef]
Bai, J.; Zhu, J.; Song, Y.; Zhao, L.; Hou, Z.; Du, R.; Li, H. A3T-GCN: Attention Temporal Graph Convolutional Network for Traffic Forecasting. ISPRS Int. J. Geo-Inf. 2021, 10, 485. [Google Scholar] [CrossRef]
Wang, H.-W.; Peng, Z.-R.; Wang, D.; Meng, Y.; Wu, T.; Sun, W.; Lu, Q.-C. Evaluation and prediction of transportation resilience under extreme weather events: A diffusion graph convolutional approach. Transp. Res. Part C Emerg. Technol. 2020, 115, 102619. [Google Scholar] [CrossRef]
Zhu, J.; Wang, Q.; Tao, C.; Deng, H.; Zhao, L.; Li, H. AST-GCN: Attribute-Augmented Spatiotemporal Graph Convolutional Network for Traffic Forecasting. IEEE Access 2021, 9, 35973–35983. [Google Scholar] [CrossRef]
Shao, Z.; Zhou, H.; Lin, T. A new adaptive gradient method with gradient decomposition. Mach. Learn. 2025, 114, 24. [Google Scholar] [CrossRef]
Ghaemi, S.; Gholmohamadi, H.; Anvari-Moghaddam, A.; Bak-Jensen, B. Optimizing Power Consumption in Aquaculture Cooling Systems: A Bayesian Optimization and XGBoost Approach Under Limited Data. Appl. Sci. 2025, 15, 6273. [Google Scholar] [CrossRef]
Zhao, L.; Song, Y.; Zhang, C.; Liu, Y.; Wang, P.; Lin, T.; Deng, M.; Li, H. T-GCN: A Temporal Graph Convolutional Network for Traffic Prediction. IEEE Trans. Intell. Transp. Syst. 2020, 21, 3848–3858. [Google Scholar] [CrossRef]
Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019, 378, 686–707. [Google Scholar] [CrossRef]
Luo, S.; Liu, P.; Ye, X. Bayesian graph convolutional network with partial observations. PLoS ONE 2024, 19, e0307146. [Google Scholar] [CrossRef] [PubMed]
Tang, J.; Hu, F.; Liu, Y.; Wang, W.; Yang, S. High-Resolution Hazard Assessment for Tropical Cyclone-Induced Wind and Precipitation: An Analytical Framework and Application. Sustainability 2022, 14, 13969. [Google Scholar] [CrossRef]
Huang, Q.; Jia, H.; Xu, Y.; Yang, Y.; Xiao, G. Limi-TFP: Citywide Traffic Flow Prediction With Limited Road Status Information. IEEE Trans. Veh. Technol. 2023, 72, 2947–2959. [Google Scholar] [CrossRef]
Wang, P.; Zhang, Y.; Hu, T.; Zhang, T. Urban traffic flow prediction: A dynamic temporal graph network considering missing values. Int. J. Geogr. Inf. Sci. 2023, 37, 885–912. [Google Scholar] [CrossRef]
Al Sibahee, M.A.; Abduljabbar, Z.A.; Ngueilbaye, A.; Luo, C.; Li, J.; Huang, Y.; Zhang, J.; Khan, N.; Nyangaresi, V.O.; Ali, A.H. Blockchain-Based Authentication Schemes in Smart Environments: A Systematic Literature Review. IEEE Internet Things J. 2024, 11, 34774–34796. [Google Scholar] [CrossRef]
Schechtner, K. Bridging the Adoption Gap for Smart City Technologies: An Interview with Rob Kitchin. IEEE Pervasive Comput. 2017, 16, 72–75. [Google Scholar] [CrossRef]
He, S.; Luo, Q.; Du, R.; Zhao, L.; He, G.; Fu, H.; Li, H. STGC-GNNs: A GNN-based traffic prediction framework with a spatial-temporal Granger causality graph. Physical A 2023, 623, 21. [Google Scholar] [CrossRef]
Sengupta, A.; Mondal, S.; Das, A.; Guler, S.I. A Bayesian approach to quantifying uncertainties and improving generalizability in traffic prediction models. Transp. Res. Part C Emerg. Technol. 2024, 162, 18. [Google Scholar] [CrossRef]

Figure 1. The proposed KA3STGCN framework.

Figure 2. Case study area: Shenzhen City in China. (a) The geographical location; (b) DiDi urban roads and the meteorological grids; (c) DEM; (d) POI.

Figure 3. The impact of disaster-related factors on urban road traffic. (a) Disaster event; (b) DEM; (c) POI.

Figure 4. The model performance of different parameters. (a) Learning horizons; (b) predicting horizons; (c) training epochs; (d) hidden units.

Figure 5. Traffic speed prediction results of the KA3STGCN model on two roads. (a) The results of Road 1 on the whole test set; (b) the results of Road 1 during Typhoon Mangkhut; (c) the results of Road 2 on the whole test set; (d) the results of Road 2 during Typhoon Mangkhut.

Figure 6. RMSE of the KA3STGCN model under different hazard intensities and road grades. (a) The wind intensities; (b) the precipitation intensities; (c) the road grades.

Figure 7. RMSE at different hours with hazards. (a) Wind speed; (b) precipitation. In each subplot, the red projection is the scatter distribution of RMSE at different hours, the blue projection is the scatter distribution of RMSE with different hazard intensities, and the yellow projection is the scatter distribution of hazard intensities at different hours.

Figure 8. RMSE on road segments. (a) RMSE of KA3STGCN; (b) RMSE of ASTGCN.

Table 1. The multi-source data description.

	Data	Data Description
$G$	The network of urban roads	The polyline data (Figure 2b) is provided by DiDi Chuxing (Beijing, China) (https://outreach.didichuxing.com/research/opendata/, accessed on 14 March 2020) which is the biggest online car-hailing service platform in China.
$X$	The traffic speeds	The 10 min data is provided by DiDi Chuxing. The road speeds are calculated from DiDi vehicle trajectories (such as DiDi Express, Premier, and Tax, roughly accounting for 3~10% of the total traffic) for each urban road [40].
$H A$	Wind speed	The NetCDF data is provided by the National Climate Center (http://data.cma.cn/data/cdcdetail/dataCode/NAFP_CLDAS2.0_NRT.html, accessed on 22 August 2020) in 39 grids with resolutions of 0.0625° × 0.0625° and 1 h (Figure 2b). The meteorological girded data of various hours are combined to produce the dynamic time-changing hazard information.
$H A$	Precipitation	Same as wind speed.
$E A$	DEM	The adf data with spatial resolution of 30 m is provided by Geospatial Data Cloud site (https://www.gscloud.cn/, accessed on 23 August 2020) (Figure 2c).
$E A$	POI	The point data is obtained by Amap API (https://lbs.amap.com/api/webservice/guide/api/newpoisearch, accessed on 25 August 2020). The 435,113 POIs in Shenzhen are divided into 14 categories as the following [1,2,…,14]: living services, sports recreation services, food and beverages, shopping services, transportation facilities, public facilities, financial and insurance services, medical services, scenic spots, education services, government services, enterprises, hotels, and accommodations. (Figure 2d shows the kernel density of POI.)

Table 2. Performance comparison for different models.

Models	RMSE	MAPE (%)	$R^{2}$	Accuracy	RMSE vs. KA3STGCN	Accuracy vs. KA3STGCN	p-Value
HA	9.06	40.55	0.63	0.60	2.14	−0.19	<0.001
ARIMA	10.59	29.38	0.01	0.63	3.67	−0.16	<0.001
SVR	8.48	37.79	0.65	0.62	1.56	−0.17	<0.001
XGBoost	7.78	28.45	0.73	0.74	0.86	−0.05	<0.01
TGCN (GCN + GRU)	7.90	23.66	0.67	0.74	0.98	−0.05	<0.01
ASTGCN (GCN+GRU+attention)	7.39	24.46	0.71	0.76	0.47	−0.03	<0.01
A2STGCN (GCN + GRU + attribute-augmented unit)	−8.92	24.22	0.71	0.59	2.00	−0.20	<0.001
Physics-informed neural networks	7.25	23.33	0.70	0.72	0.33	−0.07	<0.01
Bayesian GCN	7.15	21.30	0.77	0.78	0.23	−0.01	<0.01
KA3STGCN (GCN + GRU + attention + attribute-augmented unit)	6.92	19.67	0.78	0.79	/	/	/

Table 3. The results of ablation experiment.

Variables	RMSE	MAPE (%)	Accuracy	$R^{2}$	MAPE vs. No Attributes
none	7.39	24.46	0.76	0.71	/
DEM	8.23	25.28	0.73	0.64	3.35%
POI	7.61	22.6	0.75	0.69	−7.60%
rain	9.37	22.79	0.70	0.53	−6.83%
wind	8.32	28.21	0.73	0.63	15.33%
DEM + POI	10.59	21.29	0.78	0.76	−12.96%
DEM + rain	7.13	24.67	0.66	0.40	0.86%
DEM + wind	7.83	21.2	0.75	0.68	−13.33%
POI + rain	8.59	21.72	0.72	0.61	−11.20%
POI + wind	6.66	20.91	0.78	0.76	−14.51%
wind + rain	7.37	20.73	0.76	0.71	−15.25%
DEM + wind + rain	6.83	21.22	0.78	0.75	−13.25%
POI + DEM + rain	7.92	22.61	0.74	0.67	−7.56%
POI + DEM + wind	7.39	21.74	0.76	0.71	−11.12%
POI + wind + rain	6.76	20.76	0.78	0.76	−15.13%
POI + DEM + wind + rain	6.92	19.67	0.79	0.78	−19.58%

Table 4. Road classification results.

Grades	Numbers	OSM Tags	Corresponding OSM Comments (https://wiki.openstreetmap.org/wiki/Map_features, Accessed on 22 August 2020)
1	74	trunk, motorway	Trunk: The most important roads in a country’s system; Motorway: A restricted access major divided highway, normally with two or more running lanes plus emergency hard shoulder.
2	175	primary	The next most important roads in a country’s system. (Often link larger towns.)
3	332	secondary	The next most important roads in a country’s system. (Often link towns.)
4	452	tertiary	The next most important roads in a country’s system. (Often link smaller towns and villages.)
5	54	residential	Roads which serve as access to housing, without function of connecting settlements. (Often lined with housing.)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Tang, J.; Zhu, Y.; Yang, S.; Jaeger, C. Predicting Urban Traffic Under Extreme Weather by Deep Learning Method with Disaster Knowledge. Appl. Sci. 2025, 15, 9848. https://doi.org/10.3390/app15179848

AMA Style

Tang J, Zhu Y, Yang S, Jaeger C. Predicting Urban Traffic Under Extreme Weather by Deep Learning Method with Disaster Knowledge. Applied Sciences. 2025; 15(17):9848. https://doi.org/10.3390/app15179848

Chicago/Turabian Style

Tang, Jiting, Yuyao Zhu, Saini Yang, and Carlo Jaeger. 2025. "Predicting Urban Traffic Under Extreme Weather by Deep Learning Method with Disaster Knowledge" Applied Sciences 15, no. 17: 9848. https://doi.org/10.3390/app15179848

APA Style

Tang, J., Zhu, Y., Yang, S., & Jaeger, C. (2025). Predicting Urban Traffic Under Extreme Weather by Deep Learning Method with Disaster Knowledge. Applied Sciences, 15(17), 9848. https://doi.org/10.3390/app15179848

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Predicting Urban Traffic Under Extreme Weather by Deep Learning Method with Disaster Knowledge

Abstract

1. Introduction

2. Methodology

2.1. Framework

2.2. Attribute-Augmented Unit

2.3. Models

2.3.1. Spatial Dependence Modeling

2.3.2. Temporal Dependence Modeling

2.3.3. Attention Mechanism

2.3.4. Loss Function

3. Data and Experiments

3.1. Data Description

3.2. Data Preprocessing

3.3. Evaluation Metric

3.4. Parameter Settings

4. Results and Discussion

4.1. The Performance of KA3STGCN Model

4.2. Model Performance Comparison Results

4.3. Significant Variables and Interpretations

4.4. Robustness Analyses

4.5. Spatiotemporal Differences

4.6. Generalizability and Limitations

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI