Enhancing Accuracy in Hourly Passenger Flow Forecasting for Urban Transit Using TBATS Boosting
Abstract
:1. Introduction
- This study identifies the gap between existing research and the operational challenges of urban public transport systems in the Indian subcontinent. The identified gap has been addressed by developing a hybrid model for hourly passenger flow forecasting.
- The study encompasses the results of Route 12 operating from Thane Station (West) to Pawar Nagar of the Thane Municipal Transport System. This route is one of the highest commute routes and serves as a benchmark, having typical commuting behavior for this region. The addressed challenges have been tested on the entire network, including this route that provides insight to the civic authorities for further planning.
- Hourly passenger flow forecasting has been carried out at the level of individual station pairs using a range of models, including SARIMA, TBATS, XGBoost, LightGBM, and the proposed TBATS Boosting model. TBATS, which integrates trigonometric seasonality, Box–Cox transformation, ARMA errors, and trend components, has been applied to capture the underlying statistical patterns in the data. The hybrid TBATS Boosting model enhances this framework by incorporating LightGBM for residual learning, enabling the capture of nonlinear dependencies and external influences. To the best of our knowledge, this is the first study from the Indian subcontinent that focuses on forecasting hourly passenger flow between city transit station pairs, explicitly incorporating regional holidays, a large historical dataset of hourly records, and the effects of the COVID-19 pandemic.
- The forecasting results from all models have been compared with actual passenger flow data for the same period. Performance has been rigorously evaluated under diverse operational conditions, including weekdays, different time slots, and holidays, across station pairs with high, moderate, and low passenger volumes. The evaluation has been conducted using standard metrics such as MAE, RMSE, MAPE, WMAPE, and MREP, offering a comprehensive assessment of forecasting accuracy in realistic transit scenarios.
- The proposed hybrid model demonstrates superior ability to capture dynamic passenger flow patterns compared to standalone frameworks. It offers actionable insights for transport authorities to optimize scheduling, improve resource allocation, and reduce passenger waiting times. The model is highly applicable in real-world operational settings, particularly in contexts affected by regional holidays, fluctuating demand, special events, and unexpected disruptions such as pandemics.
2. Literature Review
Reported Studies | Study Area | Data | Weather | COVID-19 | Holiday | Method |
---|---|---|---|---|---|---|
Shivaraj Halyal et al. [5] | Intercity (Hubballi-Dharwad BRTS) | December 2019–February 2020 (91 days) | × | × | × | LSTM, SARIMA |
Shanthappa et al. [14] | Intercity (Within Udupi City) | January 2022–December 2022 (365 days) | ✓ | × | × | RPTW-LSTM |
Nagaraj et al. [56] | Karnataka State Road Transport Corporation (KSRTC)—Various Regions | KSRTC dataset | × | ✓ | × | LSTM, RNN, and greedy layer-wise algorithm |
Thandassery et al. [57] | Kochi Metro Rail System (KMRL), Kerala, India | AFC data from 2017 to 2019 | × | × | × | Station Memorizing LSTM |
Cyril et al. [58] | Trivandrum City, Kerala, India | ETM data KSRTC, 2011–2013 | × | × | × | Holt–Winters’ additive and multiplicative models |
Cyril et al. [59] | Inter-district travel in Kerala (Trivandrum to five districts) | ETM data KSRTC, 2010–2013 | × | × | × | ARIMA |
Gummadi et al. [60] | Macherla route, Andhra Pradesh, India | Transit data from Macherla route for April 2016–December 2016 | × | × | × | ARIMA, artificial neural network (ANN) |
Study | Geographical Focus | Models | External Features | Pandemic Impact | Forecasting Granularity | Key Limitations |
---|---|---|---|---|---|---|
Shivaraj Halyal et al. [5] | Intercity (Hubballi-Dharwad BRTS) | LSTM, SARIMA | Weather, weekday/weekend | × | Daily passenger flow | No pandemic data; Limited temporal features; inter city transport |
Zhang et al. [35] | Intercity, China | RNN, GCN, LSTM | Holidays, weather | ✓ (partial) | Hourly (Metro systems) | Limited to metro; no multi-seasonality handling |
Perone et al. [53] | Italy | ARIMA, TBATS | COVID-19 cases (basic integration) | ✓ | Monthly forecasting | Focus on disease data; lacks external variability |
Liu et al. [31] | Multi-city, China | CNN, decision trees | Passenger history, Time of day, lunar cycles | × | Hourly passenger flow | No pandemic context; overfitting in deep learning models |
Proposed Study | Thane City, India | TBATS + LightGBM (hybrid) | Holidays (27 types), COVID-19 cases, cyclic time features (sin_time, cos_time), weekday patterns | ✓ (full integration) | Hourly forecasting for diverse station pairs (high, moderate, low) | - |
3. Data Description and Analysis
3.1. Study Area and Data Source
3.2. Data Description
3.3. Data Pre-Processing and Exploratory Data Analysis (EDA)
- Mapping and Cleaning:
- Prepared reference dictionary to convert station names from Marathi to English.
- Assign a unique number to each station and ensure unique station names and numbers.
- Eliminated extraneous features such as Sourceid, Destinationid, RouteNumber, and others.
- Outlier Removal:
- Handling outliers occurs due to unforeseen events after analyses of station-wise day-wise ticket counts.
- Feature Engineering:
- Generation of a Station_Pair feature by concatenating source and destination station names.
- Identification of temporal features for time series analysis, viz. month, weekday, and date and time of travel.
- Insertion of a binary feature Is_Holiday attribute and a categorical feature Holiday_Type (27 distinguished categories) to capture holiday effects.
- Introduced cyclic time features in the form of sin_time and cos_time for hourly and Weekday_Timeslot features to encapsulate behavioral variations.
- Handling Missing Data:
- Resolved temporal consistency through incorporating missing time slots with zero passenger counts.
- Integration of Pandemic Data:
- Incorporated COVID-19 indicators, including daily confirmed, deceased, and recovered cases, to capture pandemic-related impacts on passenger flow.
3.4. Exploratory Data Analysis (EDA)
- The Thane_Station_West to Pawar_Nagar, classified as a high-passenger-flow pair, usually possesses over 2000 daily passengers in pre-pandemic periods, with the highest peak of 4000 passengers in the studied range; whereas the hourly peaks reach approximately 400 passengers during peak hours, indicating its essentiality in the TMT network.
- Classified as a moderate-passenger-flow pair, the Thane_Station_West to Voltas_Gate pair accommodates a daily passenger flow ranging from 1000 to 3000 in a normal situation. The observed hourly demand is approximately 250 passengers during peak hours. This pair is striking a balance between high demand and localized transport.
- The Pawar_Nagar to Civil_Court pair, classified as a low-demand station pair, normally has a daily passenger demand of less than 1000 passengers, with the highest reported count being 800, and hourly peaks around 80 passengers during rush hours, indicating localized demand.
4. Methodology
4.1. TBATS
4.1.1. Box–Cox Transformation
4.1.2. Trend Component
4.1.3. Seasonal Component
4.1.4. ARMA Errors
4.1.5. TBATS Model Configuration and Training
- Trend component: auto-selects linear, damped, or no trend based on model fit criteria.
- Box–Cox transformation: applied for variance stabilization if it improves model performance.
- Damped trend: introduced when necessary to control long-term forecast growth.
4.2. LightGBM
Hyper Parameter Selection and Tuning
4.3. TBATS Boosting Hybrid Model
- TBATS model: produces the initial forecast based on trend and seasonality.
- LightGBM model: refines the forecast using residual learning informed by external features and lagged TBATS predictions.
Algorithm 1. TBATS Boosting Hybrid Approach for Passenger Flow Forecasting | |
Step 1: | Load input CSV files containing File1: origin-destination (OD) passenger count with Date_Time and external features. File2: forecast dataset with corresponding external features. |
Step 2: | Form Train and Test dataset for TBATS using input File1 |
Step 3: | Train the TBATS model using Date_Time as the index and Passenger_Count as the target variable: . Save the trained TBATS model for subsequent steps. |
Step 4: | Load the saved TBATS model and predict passenger count for the entire Train-Test dataset: . |
Step 5: | Calculate the residuals for the Train-Test dataset: |
Step 6: | Merge the TBATS predictions with external features and residuals to form a comprehensive feature set for the next modeling phase. |
Step 7: | Train LightGBM model using the dataset generated in step 6: . Save the trained LightGBM model for subsequent steps. |
Step 9: | Utilize the trained TBATS model to forecast initial passenger counts for the forecast horizon: . |
Step 10: | Combine TBATS forecast outputs with corresponding external features from the forecast dataset. |
Step 11: | Load the saved LightGBM model and predict the residuals for the forecast horizon: |
4.4. Evaluation Parameters
5. Results and Discussion
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Balcombe, R.; Mackett, R.; Paulley, N.; Preston, J.; Shires, J.; Titheridge, H.; Wardman, M.; White, P. The Demand for Public Transport: A Practical Guide; Technical Report; Transportation Research Laborator: Crowthorne, UK, 2004. [Google Scholar]
- Talusan, J.P.; Mukhopadhyay, A.; Freudberg, D.; Dubey, A. On Designing Day Ahead and Same Day Ridership Level Prediction Models for City-Scale Transit Networks Using Noisy APC Data. arXiv 2022, arXiv:2210.04989. [Google Scholar] [CrossRef]
- Tirachini, A.; Hensher, D.A.; Rose, J.M. Crowding in public transport systems: Effects on users, operation and implications for the estimation of demand. Transp. Res. Part A Policy Pract. 2013, 53, 36–52. [Google Scholar] [CrossRef]
- Subah, A.I.; Rhythy, T.R.; Quadir, S.T.; Raihan, M.A. A systematic review on forecasting passenger flows of multimodal transportation system integrating metro. In Proceedings of the 7th International Conference on Civil Engineering for Sustainable Development, Khulna, Bangladesh, 23 July 2024. [Google Scholar]
- Halyal, S.; Mulangi, R.H.; Harsha, M.M. Forecasting public transit passenger demand: With neural networks using APC data. Case Stud. Transp. Policy 2022, 10, 965–975. [Google Scholar] [CrossRef]
- Agrawal, A. Sustainability of airlines in India with COVID-19: Challenges ahead and possible way-outs. J. Revenue Pricing Manag. 2021, 20, 457–472. [Google Scholar] [CrossRef]
- Elassy, M.; Al-Hattab, M.; Takruri, M.; Badawi, S. Intelligent transportation systems for sustainable smart cities. Transp. Eng. 2024, 16, 100252. [Google Scholar] [CrossRef]
- Ziel, F. Modeling public holidays in load forecasting: A German case study. J. Mod. Power Syst. Clean Energy 2018, 6, 191–207. [Google Scholar] [CrossRef]
- Jain, D. Impact of COVID-19 led transition of work culture and travel to work patterns on society and environment in Delhi. Urban Gov. 2024, 4, 388–400. [Google Scholar] [CrossRef]
- Padmakumar, A.; Patil, G.R. COVID-19 effects on urban driving, walking, and transit usage trends: Evidence from Indian metropolitan cities. Cities 2022, 126, 103697. [Google Scholar] [CrossRef]
- TERI. 2020. Impact of COVID-19 on Urban Mobility in India: Evidence from a Perception Study. New Delhi: The Energy and Resources Institute. 2020. Available online: https://www.teriin.org (accessed on 1 April 2025).
- Abu-Rayash, A.; Dincer, I. Analysis of mobility trends during the COVID-19 coronavirus pandemic: Exploring the impacts on global aviation and travel in selected cities. Energy Res. Soc. Sci. 2020, 68, 101693. [Google Scholar] [CrossRef]
- Thomas, N.; Jana, A.; Bandyopadhyay, S. Physical distancing on public transport in Mumbai, India: Policy and planning implications for unlock and post-pandemic period. Transp. Policy 2022, 116, 217–236. [Google Scholar] [CrossRef]
- Shanthappa, N.K.; Mulangi, R.H.; Manjunath, H.M. Deep learning-based public transit passenger flow prediction model: Integration of weather and temporal attributes. Public Transp. 2024. [Google Scholar] [CrossRef]
- Luo, Q.; Zhou, Y. Spatial-temporal Structures of Deep Learning Models for Traffic Flow Forecasting: A Survey. In Proceedings of the 2021 4th International Conference on Intelligent Autonomous Systems (ICoIAS), Wuhan, China, 14–16 May 2021; pp. 187–193s. [Google Scholar]
- Fontes, T.; Correia, R.; Ribeiro, J.; Borges, J.L. A Deep Learning Approach for Predicting Bus Passenger Demand Based on Weather Conditions. Transp. Telecommun. J. 2020, 21, 255–264. [Google Scholar] [CrossRef]
- Jain, D.; Singh, S. Adaptation of trips by metro rail users at two stations in extreme weather conditions: Delhi. Urban Clim. 2021, 36, 100766. [Google Scholar] [CrossRef]
- Jeffery, D.J.; Russam, K.; Robertson, D.I. Electronic route guidance by AUTOGUIDE: The research background. Traffic Eng. Control 1987, 28, 525–529. [Google Scholar]
- El Esawey, M. Daily bicycle traffic volume estimation: Comparison of historical average and count models. J. Urban Plan. Dev. 2018, 144, 04018011. [Google Scholar] [CrossRef]
- Yang, H.; Yang, J.; Han, L.D.; Liu, X.; Pu, L.; Chin, S.-M.; Hwang, H.-L. A Kriging based spatiotemporal approach for traffic volume data imputation. PLoS ONE 2018, 13, e0195957. [Google Scholar] [CrossRef]
- Kumar, S.V. Traffic Flow Prediction using Kalman Filtering Technique. Procedia Eng. 2017, 187, 582–587. [Google Scholar] [CrossRef]
- Jiao, P.; Li, R.; Sun, T.; Hou, Z.; Ibrahim, A. Three Revised Kalman Filtering Models for Short-Term Rail Transit Passenger Flow Prediction. Math. Probl. Eng. 2016, 2016, 9717582. [Google Scholar] [CrossRef]
- Emami, A.; Sarvi, M.; Asadi Bagloee, S. Using Kalman filter algorithm for short-term traffic flow prediction in a connected vehicle environment. J. Mod. Transport. 2019, 27, 222–232. [Google Scholar] [CrossRef]
- Chen, H.; Rakha, H.A. Real-time travel time prediction using particle filtering with a non-explicit state-transition model. Transp. Res. Part C Emerg. Technol. 2014, 43, 112–126. [Google Scholar] [CrossRef]
- Okutani, I.; Stephanedes, Y.J. Dynamic prediction of traffic volume through Kalman filtering theory. Transp. Res. Part B Methodol. 1984, 18, 1–11. [Google Scholar] [CrossRef]
- Whittaker, J.; Garside, S.; Lindveld, K. Tracking and predicting a network traffic process. Int. J. Forecast. 1997, 13, 51–61. [Google Scholar] [CrossRef]
- Williams, B.M.; Durvasula, P.K.; Brown, D.E. Urban Freeway Traffic Flow Prediction: Application of Seasonal Autoregressive Integrated Moving Average and Exponential Smoothing Models. Transp. Res. Rec. 1998, 1644, 132–141. [Google Scholar] [CrossRef]
- Milenković, M.; Švadlenka, L.; Melichar, V.; Bojović, N.; Avramović, Z. SARIMA modelling approach for railway passenger flow forecasting. Transport 2016, 33, 1113–1120. [Google Scholar] [CrossRef]
- Li, W.; Sui, L.; Zhou, M.; Dong, H. Short-term passenger flow forecast for urban rail transit based on multi-source data. J. Wirel. Commun. Netw. 2021, 2021, 9. [Google Scholar] [CrossRef]
- Devianto, D.; Permana, D.; Arif, E.; Afrimayani, A.; Yanuar, F.; Maiyastri, M.; Yollanda, M. An innovative model for capturing seasonal patterns of train passenger movement using exogenous variables and fuzzy time series hybridization. J. Open Innov. Technol. Mark. Complex. 2024, 10, 100232. [Google Scholar] [CrossRef]
- Liu, J.; Yang, X. Research on Passenger Flow Forecast of Urban Rail Transit Based on SARIMA-RBF Combination Model. In Proceedings of the 2021 International Conference on Intelligent Transportation, Big Data & Smart City (ICITBS), Xi’an, China, 27–28 March 2021; pp. 26–30. [Google Scholar] [CrossRef]
- Chuwang, D.D.; Chen, W. Forecasting Daily and Weekly Passenger Demand for Urban Rail Transit Stations Based on a Time Series Model Approach. Forecasting 2022, 4, 904–924. [Google Scholar] [CrossRef]
- Liu, Y.; Lyu, C.; Liu, X.; Liu, Z. Automatic Feature Engineering for Bus Passenger Flow Prediction Based on Modular Convolutional Neural Network. IEEE Trans. Intell. Transp. Syst. 2021, 22, 2349–2358. [Google Scholar] [CrossRef]
- Li, D.; Zhang, C.; Cao, J. Short-Term Passenger Flow Prediction of a Passageway in a Subway Station Using Time Space Correlations Between Multi Sites. IEEE Access 2020, 8, 72471–72484. [Google Scholar] [CrossRef]
- Zhang, J.; Chen, F.; Cui, Z.; Guo, Y.; Zhu, Y. Deep Learning Architecture for Short-Term Passenger Flow Forecasting in Urban Rail Transit. IEEE Trans. Intell. Transp. Syst. 2021, 22, 7004–7014. [Google Scholar] [CrossRef]
- Xu, Z.; Zhu, R.; Yang, Q.; Wang, L.; Wang, R.; Li, T. Short-Term Bus Passenger Flow Forecast Based on the Multi-feature Gradient Boosting Decision Tree. In Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery. ICNC-FSKD 2019; Liu, Y., Wang, L., Zhao, L., Yu, Z., Eds.; Advances in Intelligent Systems and Computing; Springer: Cham, Switzerland, 2019; Volume 1074, pp. 660–673. [Google Scholar]
- Liu, L.; Chen, R.-C.; Zhao, Q.; Zhu, S. Applying a multistage of input feature combination to random forest for improving MRT passenger flow prediction. J. Ambient. Intell. Hum. Comput. 2019, 10, 4515–4532. [Google Scholar] [CrossRef]
- Zhou, W.; Wang, W.; Zhao, D. Passenger Flow Forecasting in Metro Transfer Station Based on the Combination of Singular Spectrum Analysis and AdaBoost-Weighted Extreme Learning Machine. Sensors 2020, 20, 3555. [Google Scholar] [CrossRef] [PubMed]
- Lee, E.H. Traffic Speed Prediction of Urban Road Network Based on High Importance Links Using XGB and SHAP. IEEE Access 2023, 11, 113217–113226. [Google Scholar] [CrossRef]
- Min, J.H.; Ham, S.W.; Kim, D.-K.; Lee, E.H. Deep Multimodal Learning for Traffic Speed Estimation Combining Dedicated Short-Range Communication and Vehicle Detection System Data. Transp. Res. Rec. 2023, 2677, 247–259. [Google Scholar] [CrossRef]
- Guo, J.; Xie, Z.; Qin, Y.; Jia, L.; Wang, Y. Short-Term Abnormal Passenger Flow Prediction Based on the Fusion of SVR and LSTM. IEEE Access 2019, 7, 42946–42955. [Google Scholar] [CrossRef]
- Xia, D.; Yang, N.; Jian, S.; Hu, Y.; Li, H. SW-BiLSTM: A Spark-based weighted BiLSTM model for traffic flow forecasting. Multimed. Tools Appl. 2022, 81, 23589–23614. [Google Scholar] [CrossRef]
- Li, W.-T.; Zhao, M.; Wu, Y.-H.; Yu, J.-J.; Bao, L.-Y.; Yang, H.; Liu, D. Collaborative offloading for UAV-enabled time-sensitive MEC networks. J. Wirel. Commun. Netw. 2021, 1, 2021. [Google Scholar] [CrossRef]
- Gong, M.; Fei, X.; Wang, Z.H.; Qiu, Y.J. Sequential Framework for Short-Term Passenger Flow Prediction at Bus Stop. Transp. Res. Rec. 2014, 2417, 58–66. [Google Scholar] [CrossRef]
- Mirzahossein, H.; Gholampour, I.; Sajadi, S.R.; Zamani, A.H. A hybrid deep and machine learning model for short-term traffic volume forecasting of adjacent intersections. IET Intell. Transp. Syst. 2022, 16, 1648–1663. [Google Scholar]
- Pan, Y.A.; Guo, J.; Chen, Y.; Cheng, Q.; Li, W.; Liu, Y. A fundamental diagram based hybrid framework for traffic flow estimation and prediction by combining a Markovian model with deep learning. Expert Syst. Appl. Part E 2024, 238, 122219. [Google Scholar] [CrossRef]
- Burdzik, R.; Chema, W.; Celiński, I. A study on passenger flow model and simulation in aspect of COVID-19 spreading on public transport bus stops. J. Public Transp. 2023, 25, 100063. [Google Scholar] [CrossRef]
- Shi, G.; Luo, L. Prediction and Impact Analysis of Passenger Flow in Urban Rail Transit in the Postpandemic Era. J. Adv. Transp. 2023, 2023, 3448864. [Google Scholar] [CrossRef]
- Zhang, L.; Liu, K. Unsupervised origin-destination flow estimation for analyzing COVID-19 impact on public transport mobility. Cities 2024, 151, 105086. [Google Scholar] [CrossRef]
- Li, X.; de Groot, M.; Bäck, T. Using forecasting to evaluate the impact of COVID-19 on passenger air transport demand. Decis. Sci. 2021, 54, 394–409. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Jing, Y.; Hu, H.; Guo, S.; Wang, X.; Chen, F. Short-Term Prediction of Urban Rail Transit Passenger Flow in External Passenger Transport Hub Based on LSTM-LGB-DRS. IEEE Trans. Intell. Transp. Syst. 2021, 22, 4611–4621. [Google Scholar] [CrossRef]
- Zhang, Y.; Zhu, C.; Wang, Q. LightGBM-based model for metro passenger volume forecasting. IET Intell. Transp. Syst. 2020, 14, 1815–1823. [Google Scholar] [CrossRef]
- Perone, G. Comparison of ARIMA, ETS, NNAR, TBATS and hybrid models to forecast the second wave of COVID-19 hospitalizations in Italy. Eur. J. Health Econ. 2022, 23, 917–940. [Google Scholar] [CrossRef]
- Yu, C.; Xu, C.; Li, Y.; Yao, S.; Bai, Y.; Li, J.; Wang, L.; Wu, W.; Wang, Y. Time Series Analysis and Forecasting of the Hand-Foot-Mouth Disease Morbidity in China Using An Advanced Exponential Smoothing State Space TBATS Model. Infect. Drug Resist. 2021, 14, 2809–2821. [Google Scholar] [CrossRef]
- Thayyib, P.V.; Thorakkattle, M.N.; Usmani, F.; Yahya, A.T.; Farhan, N.H.S. Forecasting Indian Goods and Services Tax revenue using TBATS, ETS, Neural Networks, and hybrid time series models. Cogent Econ. Financ. 2023, 11, 2285649. [Google Scholar] [CrossRef]
- Nagaraj, N.; Gururaj, H.L.; Swathi, B.H.; Hu, Y.-C. Passenger flow prediction in bus transportation system using deep learning. Multimed. Tools Appl. 2022, 81, 12519–12542. [Google Scholar] [CrossRef]
- Sajanraj, T.D.; Mulerikkal, J.; Raghavendra, S.; Vinith, R.; Fábera, V. Passenger flow prediction from AFC data using station memorizing LSTM for metro rail systems. Neural Netw. World 2021, 31, 173–189. [Google Scholar] [CrossRef]
- Cyril, A.; Mulangi, R.H.; George, V. Bus Passenger Demand Modelling Using Time-Series Techniques- Big Data Analytics. Open Transp. J. 2019, 13, 41–47. [Google Scholar] [CrossRef]
- Cyril, A.; Mulangi, R.H.; George, V. Modelling and Forecasting Bus Passenger Demand using Time Series Method. In Proceedings of the 2018 7th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), Noida, India, 29–31 August 2018; pp. 460–466. [Google Scholar] [CrossRef]
- Gummadi, R.; Edara, S.R. Prediction of Passenger Flow of Transit Buses Over a Period of Time Using Artificial Neural Network. In Third International Congress on Information and Communication Technology; Yang, X.S., Sherratt, S., Dey, N., Joshi, A., Eds.; Advances in Intelligent Systems and Computing; Springer: Singapore, 2018; Volume 797, pp. 963–971. [Google Scholar]
- Uber Technologies, Inc. (n.d.). Kepler.gl [Data Visualization Tool]. Available online: https://kepler.gl (accessed on 27 March 2024).
- OpenAI. ChatGPT (Mar 14 Version) [Large Language Model]. 2023. Available online: https://chat.openai.com/chat (accessed on 15 October 2024).
Feature Name | Description | Type | Example Values |
---|---|---|---|
Date_Time | Timestamp in YYYY-MM-DD HH:MM:SS format. | Timestamp | 01-01-2023 00:00:00 |
Station_Pair | Combined source and destination station details. | Categorical | Thane_Station_West_Pawar_Nagar |
Passenger_Count | Number of passengers recorded for the given timestamp. | Numeric | 3, 22, 37 |
Is_Holiday | Binary indicator for holidays (1 = Holiday, 0 = Non-Holiday). | Binary | 0 |
Holiday_Type | Categorical representation of holiday types. | Categorical | 0, 1, 27 |
Month | Numeric representation of the month. | Categorical | 1, 2 |
sin_time | Sine-transformed hourly cyclic pattern. | Numeric | 0.866025404 |
cos_time | Cosine-transformed hourly cyclic pattern. | Numeric | 0.707106781 |
Weekday_Timeslot | Weekday and hourly time slot (e.g., 6_0). | Categorical | 6_0, 6_12 |
Confirmed | Confirmed COVID-19 cases for the corresponding day. | Numeric | 1234, 5678 |
Deceased | Deceased COVID-19 cases for the corresponding day. | Numeric | 5, 10 |
Recovered | Recovered COVID-19 cases for the corresponding day. | Numeric | 1200, 5500 |
TBATS_prediction | Initial passenger count forecasted by TBATS model | Numeric | 25, 22, 425 |
TBATS_prediction lag_24 | TBATS prediction value observed 24 h prior | Numeric | 22, 27, 45 |
TBATS_prediction lag_148 | TBATS prediction value observed 148 h prior | Numeric | 22, 27, 45 |
Hyperparameter | Final Value | Role in Model |
---|---|---|
Number of leaves | 51 | Controls the complexity and depth of each tree. |
Learning rate | 0.01 | Determines the step size for updating weights during training. |
Feature fraction | 0.8 | Fraction of features used per iteration; helps in regularization. |
Bagging fraction | 0.8 | Fraction of data used per iteration; reduces overfitting. |
Bagging frequency | 5 | Specifies frequency of bagging, promoting generalization. |
Minimum data in leaf | 30 | Minimum number of samples required in a leaf; prevents overfitting. |
Station_Pair | MAE | RMSE | MAPE | WMAPE | Forecasting_Model |
---|---|---|---|---|---|
Thane_Pawar | 27.15 | 43.12 | 32.72 | 25.59 | SARIMA |
24.65 | 37.12 | 30.09 | 21.76 | TBATS | |
20.47 | 29.23 | 27.95 | 17.12 | XGBoost | |
20.32 | 29.17 | 27.75 | 16.99 | LightGBM | |
16.11 | 24.7 | 22.03 | 14.52 | TBATS Boosting | |
Thane _Voltas | 29.01 | 43.78 | 34.62 | 21.03 | SARIMA |
26.01 | 38.04 | 32.07 | 20.25 | TBATS | |
20.08 | 28.23 | 31.23 | 18.41 | XGBoost | |
19.06 | 28.81 | 29.93 | 18.38 | LightGBM | |
16.19 | 26.37 | 25.07 | 16.62 | TBATS Boosting | |
Pawar_ Civil | 10.05 | 14.96 | 36.14 | 26.64 | SARIMA |
8.67 | 11.31 | 33.08 | 23.63 | TBATS | |
6.2 | 8.9 | 32.43 | 21.63 | XGBoost | |
6.27 | 8.98 | 31.02 | 21.87 | LightGBM | |
4.92 | 7.07 | 27.37 | 17.33 | TBATS Boosting |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Patel, M.; Patel, S.B.; Swain, D.; Mallagundla, R. Enhancing Accuracy in Hourly Passenger Flow Forecasting for Urban Transit Using TBATS Boosting. Modelling 2025, 6, 32. https://doi.org/10.3390/modelling6020032
Patel M, Patel SB, Swain D, Mallagundla R. Enhancing Accuracy in Hourly Passenger Flow Forecasting for Urban Transit Using TBATS Boosting. Modelling. 2025; 6(2):32. https://doi.org/10.3390/modelling6020032
Chicago/Turabian StylePatel, Madhuri, Samir B. Patel, Debabrata Swain, and Rishikesh Mallagundla. 2025. "Enhancing Accuracy in Hourly Passenger Flow Forecasting for Urban Transit Using TBATS Boosting" Modelling 6, no. 2: 32. https://doi.org/10.3390/modelling6020032
APA StylePatel, M., Patel, S. B., Swain, D., & Mallagundla, R. (2025). Enhancing Accuracy in Hourly Passenger Flow Forecasting for Urban Transit Using TBATS Boosting. Modelling, 6(2), 32. https://doi.org/10.3390/modelling6020032