Analysis of Passenger Flow Characteristics and Origin–Destination Passenger Flow Prediction in Urban Rail Transit Based on Deep Learning
Abstract
:1. Introduction
- (1)
- A prediction model for interval OD passenger flow is proposed based on a core combination of TCN and LSTM, incorporating an attention mechanism to adaptively handle the data. The core components of this model can capture both the spatiotemporal features and long-term dependencies of the OD passenger flow data. The attention mechanism assigns weights to different time steps, helping the model focus on critical information, thus enabling the capture of more data features and improving the short-term prediction accuracy of urban-rail-transit interval OD passenger flows.
- (2)
- External factors related to interval OD passenger flow, such as date, time attributes, weather conditions, and air quality, are analyzed. The factors most strongly correlated with passenger flow are identified and incorporated into the model’s predictions, enhancing the model’s adaptability and generalization ability.
- (3)
- Using the urban rail transit system in the densely populated city of Chongqing as a case study, we validate the proposed model by conducting experiments on the city’s interval OD passenger flow. The model’s performance is tested under high-, medium-, and low-passenger-flow scenarios. The results demonstrate that the proposed model fits the actual variation trends of passenger flow more effectively, with the smallest prediction error under high-passenger-flow conditions. The findings are of significant relevance for future urban rail transit construction and daily operations.
2. Characteristics and Influencing Factors of Urban Rail Transit Passenger Flow
2.1. Dataset Source
2.2. OD Passenger Flow Characteristics
2.3. Analysis of Factors Affecting Passenger Flows
3. Construction of OD Passenger Flow Prediction Deep Learning Model
3.1. Temporal Convolutional Network Model
- (1)
- TCN uses causal convolution. Since each prediction can only rely on previous data predictions, there will be no data leakage. The principle of causal convolution is shown in Figure 4. The value of the previous layer at time t only depends on the value before the next layer at time t; thus, causal convolution cannot predict future values. Causal convolution has a unidirectional structure and strict constraints on timing, and too many convolutional layers will lead to gradient disappearance and a poor fitting effect.
- (2)
- TCN combines deep neural networks with dilated convolutions to form a model that can save long-term valid historical data. It can effectively improve the performance of the model and solve the problems of causal convolution. Expanding convolution mainly allows the filter to be applied to filer regions exceeding its own length range by ignoring some inputs, which is equivalent to adding zero to obtain a larger filter from the initial filter. The principle of the expanding structure is shown in Figure 5.
- (3)
- The TCN model introduces a residual network to build long-term dependencies, thereby effectively improving the model’s performance. The structure principle of the residual network is shown in Figure 6. To some extent, the residual connection overcomes the problems caused by the disappearance of local gradients and edge gradient explosions in convolutional neural networks to achieve cross-layer information input.
3.2. Long Short-Term Memory Network Model
3.3. Attention Mechanism
3.4. TCN–Attention–LSTM Model
4. Passenger Flow Prediction Results and Discussion
4.1. Experimental Design
4.2. Prediction Results and Analysis of OD Passenger Flow
4.3. Model Comparison and Evaluation
5. Conclusions
- (1)
- A spatiotemporal distribution analysis was conducted on the OD passenger flow of Chongqing Rail Transit Line 3, and it was found that there was a bimodal pattern of morning and evening OD flow, with the passenger flow on weekdays being greater than that on rest days. The analysis of factors affecting OD passenger flow shows that date attributes, time attributes, and air quality have a significant impact on OD passenger flow data. This can provide a good dataset for accurately predicting OD passenger flow in urban rail transit in the future.
- (2)
- The TCN-LSTM model, which combines the attention mechanism, has better prediction accuracy than the original TCN-LSTM model, and the prediction results are more in line with the actual changes in OD passenger flow.
- (3)
- The attention mechanism can compensate for the shortcomings of the TCN-LSTM model in actual OD passenger flow prediction. The RMSE and MAE values of its prediction results are significantly reduced. The TCN-LSTM model mixed with the attention mechanism performs better than the TCN-LSTM model in predicting OD passenger flow and can more accurately predict OD passenger flow in rail transit.
- (4)
- The OD passenger flow prediction work of urban rail transit carried out in this study can provide some technical support for rail transit line planning and vehicle scheduling in large cities. At the same time, the proposed prediction method is not only applicable to new line expansion scenarios based on existing infrastructure, but also applicable to the future development of intelligent rail transit systems, such as urban rail transit systems based on virtual coupling technology.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Yi, P.Y.; Huang, F.H.; Wang, J.C.; Peng, J. Topology augmented dynamic spatial-temporal network for passenger flow forecasting in urban rail transit. Appl. Intell. 2023, 53, 24655–24670. [Google Scholar] [CrossRef]
- Willumsen, L.G. Estimating Time-Dependent Trip Matrices from Traffic Counts. In Proceedings of the Ninth International Symposium on Transportation and Traffic Theory, Utrecht, The Netherlands, 11–13 July 1984; VNU Science Press: Utrecht, The Netherlands, 1984; pp. 397–411. [Google Scholar]
- Lam, W.H.K.; Lo, H.P. Estimation of Origin-Destination Matrix from Traffic Counts: A Comparison of Entropy Maximizing and Information Minimizing Models. Transp. Plan. Technol. 1991, 16, 85–104. [Google Scholar] [CrossRef]
- Wu, J. A Real-Time Origin-Destination Matrix Updating Algorithm for On-Line Applications. Transp. Res. Part B Methodol. 1997, 31, 381–396. [Google Scholar] [CrossRef]
- Cascetta, E.; Nguyen, S. A Unified Framework for Estimating or Updating Origin/Destination Matrices from Traffic Counts. Transp. Res. Part B Methodol. 1988, 22, 437–455. [Google Scholar] [CrossRef]
- Cascetta, E. Estimation of Trip Matrices from Traffic Counts and Survey Data: A Generalized Least Squares Estimator. Transp. Res. Part B Methodol. 1984, 18, 289–299. [Google Scholar] [CrossRef]
- Tebaldi, C.; West, M. Bayesian Inference on Network Traffic Using Link Count Data. J. Am. Stat. Assoc. 1998, 93, 557–573. [Google Scholar] [CrossRef]
- Bhattacharjee, D.; Sinha, K.C.; Krogmeier, J.V. Modeling the Effects of Traveler Information on Freeway Origin Destination Demand Prediction. Transp. Res. Part C Emerg. Technol. 2001, 9, 381–398. [Google Scholar] [CrossRef]
- Silva, R.; Kang, S.M.; Airoldi, E.M. Predicting Traffic Volumes and Estimating the Effects of Shocks in Massive Transportation Systems. Proc. Natl. Acad. Sci. USA 2015, 112, 5643–5648. [Google Scholar] [CrossRef]
- Leng, B.; Zeng, J.; Xiong, Z. Probability Tree Based Passenger Flow Prediction and Its Application to the Beijing Subway System. Front. Comput. Sci. China 2013, 7, 195–203. [Google Scholar] [CrossRef]
- Zou, L.M.; Wang, Z.J.; Guo, R. Real-time prediction of transit origin–destination flows during underground incidents. Transp. Res. Part C Emerg. Technol. 2024, 163, 104622. [Google Scholar] [CrossRef]
- Fu, R.; Zhang, Z.; Li, L. Using LSTM and GRU Neural Network Methods for Traffic Flow Prediction. In Proceedings of the Youth Academic Conference of Chinese Association of Automation, Wuhan, China, 11–13 November 2016; pp. 324–328. [Google Scholar]
- Ma, C.X.; Liu, T. Survey of short-term traffic flow prediction based on LSTM. Int. J. Mod. Phys. C 2025, 36, 2450177. [Google Scholar] [CrossRef]
- Zhang, J.; Li, H.; Zhang, S.; Yang, L.; Jin, G.; Qi, J. A spatiotemporal graph generative adversarial networks for short-term passenger flow prediction in urban rail transit systems. Int. J. Gen. Syst. 2023, 52, 694–721. [Google Scholar] [CrossRef]
- Ye, J.X.; Zhao, J.J.; Zheng, F.R.; Xu, C.Z. A Heterogeneous Graph Convolution Based Method for Short-Term OD Flow Completion and Prediction in a Metro System. IEEE Trans. Intell. Transp. Syst. 2024, 25, 15614–15627. [Google Scholar] [CrossRef]
- Wu, J.; He, D.; Jin, Z.; Li, X.; Li, Q.; Xiang, W. Learning spatial–temporal pairwise and high-order relationships for short-term passenger flow prediction in urban rail transit. Expert Syst. Appl. 2024, 245, 123091. [Google Scholar] [CrossRef]
- Miao, H.; Fei, Y.; Wang, S.; Wang, F.; Wen, D. Deep learning based origin-destination prediction via contextual information fusion. Multimed. Tools Appl. 2022, 81, 12029–12045. [Google Scholar] [CrossRef]
- Zou, X.; Zhang, S.; Zhang, C.; James, J.Q.; Chung, E. Long-Term Origin-Destination Demand Prediction with Graph Deep Learning. IEEE Trans. Big Data 2022, 8, 1481–1495. [Google Scholar] [CrossRef]
- Wang, L.G.; Dong, Y.F.; Wang, Y.Z.; Wang, P. Non-Symmetric Spatial-Temporal Network for Bus Origin-Destination Demand Prediction. Transp. Res. Rec. 2022, 2676, 279–289. [Google Scholar] [CrossRef]
- Lv, S.R.; Wang, K.P.; Yang, H.; Wang, P. An origin–destination passenger flow prediction system based on convolutional neural network and passenger source-based attention mechanism. Expert Syst. Appl. 2024, 238, 121989. [Google Scholar] [CrossRef]
- Zhang, W.; Wang, G.; Liu, X.; Zhu, T. AttnOD: An Attention-Based OD Prediction Model with Adaptive Graph Convolution. In International Conference on Neural Information Processing; Springer: Singapore, 2023. [Google Scholar]
- Liu, J.W.; Pan, L.; Ren, Q.Q. Spatial-temporal memory enhanced multi-level attention network for origin-destination demand prediction. Complex Intell. Syst. 2024, 10, 6435–6448. [Google Scholar] [CrossRef]
- Zeng, J.; Tang, J. Combining knowledge graph into metro passenger flow prediction: A split-attention relational graph convolutional network. Expert Syst. Appl. 2023, 213, 118790. [Google Scholar] [CrossRef]
- Kanungo, T.; Mount, D.M.; Netanyahu, N.S.; Piatko, C.D.; Silverman, R.; Wu, A.Y. An Efficient k-means Clustering Algorithm: Analysis and Implementation. IEEE Trans. Pattern Anal. Mach. Intell. 2002, 24, 881–892. [Google Scholar] [CrossRef]
- Guo, Y.; Zhu, Z.; Jiang, X.; Chen, T.; Li, Q. Analyzing the Impacts of Land Use and Network Features on Passenger Flow Distribution at Urban Rail Stations from a Classification Perspective. Sustainability 2024, 16, 3568. [Google Scholar] [CrossRef]
- Liu, P.F.; Wang, S.C.; Zhao, P. Robust estimation and test for Pearson’s correlation coefficient. Random Matrices Theory Appl. 2024, 13, 2450023. [Google Scholar] [CrossRef]
- Ejegwa, P.A.; Wanzenke, T.D.; Ogwuche, I.O.; Anum, M.T.; Isife, K.I. A robust correlation coefficient for fermatean fuzzy sets based on spearman’s correlation measure with application to clustering and selection process. J. Appl. Math. Comput. 2024, 70, 1747–1770. [Google Scholar] [CrossRef]
- Rossello, X. The P-value spectrum: From ‘absence of evidence’ to ‘evidence of difference’. Eur. Heart J. Acute Cardiovasc. Care 2024, 13, 181–182. [Google Scholar] [CrossRef] [PubMed]
- Hou, Z.; Du, Z.; Yang, G.; Yang, Z. Short-term passenger flow prediction of urban rail transit based on a combined deep learning model. Appl. Sci. 2022, 12, 7597. [Google Scholar] [CrossRef]
- Wei, L.; Guo, D.; Chen, Z.; Yang, J.; Feng, T. Forecasting Short-Term Passenger Flow of Subway Stations Based on the Temporal Pattern Attention Mechanism and the Long Short-Term Memory Network. ISPRS Int. J. Geo-Inf. 2023, 12, 25. [Google Scholar] [CrossRef]
- Zhao, Q.L.; Feng, X.B.; Zhang, L.W.; Wang, Y.D. Research on Short-Term Passenger Flow Prediction of LSTM Rail Transit Based on Wavelet Denoising. Mathematics 2023, 11, 4204. [Google Scholar] [CrossRef]
- Ma, C.X.; Zhang, B.W.; Li, S.K.; Lu, Y.P. Urban rail transit passenger flow prediction with ResCNN-GRU based on self-attention mechanism. Phys. A Stat. Mech. Its Appl. 2024, 638, 129619. [Google Scholar] [CrossRef]
Research Method | Reference | Specific Form |
---|---|---|
classical statistics | Willumsen L.G. 1984 [2] | Maximum entropy |
Lam, W. H. K.; Lo, H. P. 1991 [3] | Maximum likelihood | |
Wu, J. 1997 [4] | Maximum likelihood | |
Cascetta, E.; Nguyen, S. 1988 [5] | Maximum likelihood | |
Cascetta, E. 1984 [6] | Least squares | |
Tebaldi, C.; West, M. 1998 [7] | Bayesian inference | |
Bhattacharjee, D.; Sinha, K.C.; Krogmeier, J.V. 2001 [8] | Kalman filter | |
Silva, R.; Kang, S.M.; Airoldi, E.M. 2015 [9] | Autoregressive model | |
Leng, B.; Zeng, J.; Xiong, Z. 2013 [10] | Probability tree model | |
deep learning (single model) | Fu, R.; Zhang, Z.; Li, L. 2016 [12] | GRU |
Ma C.X; Liu T. 2025 [13] | LSTM | |
Zhang J.L.; Li H.; et al. 2023 [14] | GAN | |
Ye J.X.; Zhao J.J; Zheng F.R.; Xu C.Z. 2024 [15] | GCN | |
deep learning (combined model) | Wu J.X.; He D.Q.; Jin Z.Z. et al. 2024 [16] | HGARN |
Miao H.; Fei Y.; Wang S.Z.; et al. 2022 [17] | CNN\GCN+LSTM | |
Zou X.X.; Zhang S.Y.; et al. 2022 [18] | ST−GDL | |
Wang L.G.; Dong Y.F.; Wang Y.Z.; Wang P. 2022 [19] | CNN+Conv−LSTM | |
deep learning (with attention) | Lv S.R.; Wang K.P.; Yang H.; Wang P. 2024 [20] | PSAM−CNN+Attention |
Zhang W.C.; Wang G.; Liu X.; Zhu T.Y. 2023 [21] | GNN+Encoder–Decoder+Attention | |
Liu J.W.; Pan L.; Ren Q.Q. 2024 [22] | ST−MEN+Attention | |
Zeng J.; Tang J.J. 2023 [23] | GCN+LSTM+Attention |
A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | |
A | 0 | 15,089 | 8282 | 3033 | 8990 | 2416 | 2957 | 17,695 | 6860 | 6181 | 1233 | 1035 | 1528 | 1431 | 741 |
B | 13,193 | 0 | 18,043 | 13,745 | 29,872 | 7806 | 12,244 | 78,040 | 20,602 | 21,201 | 6133 | 2953 | 6814 | 4283 | 2220 |
C | 8162 | 21,137 | 0 | 9198 | 26,538 | 7188 | 12,528 | 67,434 | 20,835 | 22,702 | 5955 | 3355 | 6619 | 3389 | 2004 |
D | 3071 | 13,382 | 11,036 | 0 | 13,614 | 3753 | 3280 | 22,482 | 5804 | 7256 | 1922 | 1113 | 1838 | 932 | 522 |
E | 8057 | 30,375 | 25,228 | 11,594 | 0 | 2857 | 8607 | 53,751 | 16,379 | 20,801 | 5969 | 3010 | 6763 | 4761 | 2480 |
F | 2162 | 8460 | 6733 | 3177 | 3783 | 0 | 4869 | 24,872 | 6747 | 8201 | 3862 | 1505 | 2776 | 1683 | 964 |
G | 3009 | 13,652 | 14,383 | 3506 | 11,480 | 5016 | 0 | 41,168 | 8794 | 10,156 | 3509 | 2020 | 3559 | 1984 | 1293 |
H | 17,423 | 81,706 | 66,065 | 21,015 | 59,066 | 22,510 | 32,659 | 0 | 67,930 | 78,664 | 35,627 | 19,930 | 49,586 | 17,378 | 15,974 |
I | 6285 | 21,841 | 20,320 | 4992 | 20,248 | 7070 | 7751 | 75,219 | 0 | 13,311 | 8728 | 6337 | 14,762 | 6347 | 3950 |
J | 6136 | 24,343 | 25,350 | 6922 | 27,489 | 9812 | 10,365 | 93,635 | 14,121 | 0 | 14,364 | 10,845 | 18,976 | 4328 | 5392 |
K | 1415 | 6787 | 6698 | 1684 | 7043 | 4530 | 3136 | 36,341 | 8446 | 12,375 | 0 | 3682 | 9356 | 2022 | 2305 |
L | 917 | 3496 | 3457 | 1094 | 3781 | 1455 | 1896 | 21,235 | 6600 | 10,019 | 3468 | 0 | 6479 | 1498 | 1918 |
M | 1447 | 6771 | 6630 | 1622 | 7661 | 2515 | 3560 | 47,835 | 15,158 | 16,823 | 9149 | 6733 | 0 | 3292 | 13,896 |
N | 1286 | 3451 | 2927 | 758 | 4299 | 1460 | 1511 | 16,855 | 6112 | 4341 | 2398 | 1400 | 3697 | 0 | 3135 |
O | 795 | 2491 | 2688 | 578 | 3900 | 1337 | 1618 | 16,962 | 4690 | 5018 | 2951 | 2540 | 14,067 | 3562 | 0 |
Items | Average Daily Passenger Flow/Person-Times |
---|---|
High OD passenger flow | ≥45,000 |
Medium OD passenger flow | 12,000–45,000 |
Low OD passenger flow | <12,000 |
Related Influencing Factors | Pearson Correlation Coefficient | p-Values |
---|---|---|
The minimum temperature | 0.142 | 0.628 |
The maximum temperature | 0.326 | 0.255 |
Weather conditions | −0.224 | 0.442 |
Relative humidity | −0.275 | 0.341 |
Air quality | −0.475 | 0.046 |
Date attribute | −0.800 | 0.010 |
Time granularity | 0.294 | 0.003 |
Related Influencing Factors | Spearman Correlation Coefficient | p-Value |
---|---|---|
The minimum temperature | 0.155 | 0.598 |
The maximum temperature | −0.060 | 0.839 |
Weather conditions | −0.104 | 0.522 |
Relative humidity | −0.255 | 0.362 |
Air quality | −0.266 | 0.359 |
Date attribute | −0.611 | 0.020 |
Time granularity | 0.103 | 0.304 |
Variables | Illustrations | Variables | Illustrations |
---|---|---|---|
Y1 | Date attribute (1–7 represents Monday to Sunday, respectively) | Y3 | Time granularity/hour |
Y2 | Air quality/non-dimensional relative numerical values | Y4 | Hourly passenger flow/person |
Passenger Flow Intervals | Model | RMSE | MAE |
---|---|---|---|
From Jiazhoulu to Guanyinqiao | TCN–Attention–LSTM | 3.249 | 3.106 |
TCN-LSTM | 5.908 | 6.934 | |
From Guanyinqiao to Huaxinjie | TCN–Attention–LSTM | 0.348 | 0.203 |
TCN-LSTM | 0.491 | 0.296 | |
From Sigongli to Nanping | TCN–Attention–LSTM | 0.348 | 0.212 |
TCN-LSTM | 0.381 | 0.228 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Hou, Z.; Han, J.; Yang, G. Analysis of Passenger Flow Characteristics and Origin–Destination Passenger Flow Prediction in Urban Rail Transit Based on Deep Learning. Appl. Sci. 2025, 15, 2853. https://doi.org/10.3390/app15052853
Hou Z, Han J, Yang G. Analysis of Passenger Flow Characteristics and Origin–Destination Passenger Flow Prediction in Urban Rail Transit Based on Deep Learning. Applied Sciences. 2025; 15(5):2853. https://doi.org/10.3390/app15052853
Chicago/Turabian StyleHou, Zhongwei, Jin Han, and Guang Yang. 2025. "Analysis of Passenger Flow Characteristics and Origin–Destination Passenger Flow Prediction in Urban Rail Transit Based on Deep Learning" Applied Sciences 15, no. 5: 2853. https://doi.org/10.3390/app15052853
APA StyleHou, Z., Han, J., & Yang, G. (2025). Analysis of Passenger Flow Characteristics and Origin–Destination Passenger Flow Prediction in Urban Rail Transit Based on Deep Learning. Applied Sciences, 15(5), 2853. https://doi.org/10.3390/app15052853