Short-Term Mobile Network Traffic Forecasting Using Seasonal ARIMA and Holt-Winters Models
Abstract
:1. Introduction
1.1. Related Work
1.2. Contributions
- We analyze the dataset of real network traffic from a mobile operator in Portugal using fast processing statistical models, namely SARIMA, and Holt-Winters, which have not been applied to this data before.
- We demonstrate that the SARIMA model is more appropriate for forecasting download traffic, while the Holt-Winters model is better suited for forecasting upload traffic, showing appropriate errors in the considered dataset.
- Since statistical models are suitable for fast and precise forecasting of mobile network traffic, they can be implemented in cellular operators’ solutions without a significant increase in cost.
2. Descriptive Statistics
2.1. Dataset Description
2.2. Total Traffic Behavior
2.3. Traffic by Applications
3. Statistical Models for Forecasting Traffic
3.1. Seasonal ARIMA Model
3.2. Holt-Winters Model
- additive (linear) trend and additive (linear) seasonality
- multiplicative (exponential) trend and additive (linear) seasonality
- additive (linear) trend and multiplicative (exponential) seasonality
- multiplicative (exponential) trend and multiplicative (exponential) seasonality
3.3. Preliminary Checks
4. Forecasting Download and Upload Traffic
4.1. Evaluation Metrics
4.2. Download Traffic
4.3. Upload Traffic
4.4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
3GPP | 3rd Generation Partnership Project |
5G | 5th generation |
ADF | Augmented Dickey–Fuller test |
AI | Artificial Intelligence |
AIC | Akaike information criterion |
AR | Auto regressive |
ARIMA | Auto regressive integrated moving average |
ITU-T | International Telecommunications Union – Telecommunications sector |
KPSS | Kwiatkowski–Phillips–Schmidt–Shin test |
LSTM | Long short-term memory |
LTE | Long-term evolution |
MA | Moving average |
MAE | Mean absolute error |
MAPE | Mean absolute percentage error |
ML | Machine learning |
MSE | Mean squared error |
MSISDN | Mobile station international subscriber directory number |
MSLE | Mean squared logarithmic error |
P2P | Peer-to-peer |
RMSE | Root mean squared error |
SARIMA | Seasonal ARIMA |
VoIP | Voice over internet protocol |
References
- Giordani, M.; Polese, M.; Mezzavilla, M.; Rangan, S.; Zorzi, M. Toward 6G Networks: Use Cases and Technologies. IEEE Commun. Mag. 2020, 58, 55–61. [Google Scholar] [CrossRef]
- Saad, W.; Bennis, M.; Chen, M. A Vision of 6G Wireless Systems: Applications, Trends, Technologies, and Open Research Problems. IEEE Netw. 2020, 34, 134–142. [Google Scholar] [CrossRef]
- Campos, R.; Ricardo, M.; Pouttu, A.; Correia, L. Wireless Technologies Towards 6G. Eurasip J. Wirel. Commun. Netw. 2023, 2023. [Google Scholar] [CrossRef]
- Kochetkov, D.; Vuković, D.; Sadekov, N.; Levkiv, H. Smart Cities and 5G Networks: An Emerging Technological Area? J. Geogr. Inst. Jovan Cvijic SASA 2019, 69, 289–295. [Google Scholar] [CrossRef]
- Kochetkov, D.; Almaganbetov, M. Using Patent Landscapes for Technology Benchmarking: A Case of 5G Networks. Adv. Syst. Sci. Appl. 2021, 21, 20–28. [Google Scholar] [CrossRef]
- Ruiz, S.; Ahmadi, H.; Gardašević, G.; Haddad, Y.; Katzis, K.; Grazioso, P.; Petrini, V.; Reichman, A.; Ozdemir, M.; Velez, F.; et al. 5G and Beyond Networks; Elsevier: Amsterdam, The Netherlands, 2021; pp. 141–186. [Google Scholar] [CrossRef]
- Moltchanov, D.; Sopin, E.; Begishev, V.; Samuylov, A.; Koucheryavy, Y.; Samouylov, K. A Tutorial on Mathematical Modeling of 5G/6G Millimeter Wave and Terahertz Cellular Systems. IEEE Commun. Surv. Tutorials 2022, 24, 1072–1116. [Google Scholar] [CrossRef]
- Kondratyeva, A.; Ivanova, D.; Begishev, V.; Markova, E.; Mokrov, E.; Gaidamaka, Y.; Samouylov, K. Characterization of Dynamic Blockage Probability in Industrial Millimeter Wave 5G Deployments. Future Internet 2022, 14, 193. [Google Scholar] [CrossRef]
- Mokrov, E.; Samouylov, K. Performance Assessment and Comparison of Deployment Options for 5G Millimeter Wave Systems. Future Internet 2023, 15, 60. [Google Scholar] [CrossRef]
- ITU-T. SERIES Y: Global Information Infrastructure, Internet Protocol Aspects, Next-Generation Networks, Internet of Things and Smart Cities; Technical Recommendation (TR) Y.3651; ITU Telecommunication Standardization Sector (ITU-T): Geneva, Switzerland, 2018. [Google Scholar]
- 3GPP. 5G System (5GS); Study on Traffic Characteristics and Performance Requirements for AI/ML Model Transfer; Technical Report (TR) 22.874; Release 18, V18.2.0; 3rd Generation Partnership Project (3GPP): Valbonne, France, 2017. [Google Scholar]
- ITU-T. SERIES Y: Global Information Infrastructure, Internet Protocol Aspects, Next-Generation Networks, Internet of Things and Smart Cities; Technical Recommendation (TR) Y.3602; ITU Telecommunication Standardization Sector (ITU-T): Geneva, Switzerland, 2022. [Google Scholar]
- Cisco. Spend Less Time Managing Your Network. 2022. Available online: https://www.cisco.com/site/us/en/products/networking/dna-center-platform/index.html (accessed on 1 June 2022).
- Chen, A.; Law, J.; Aibin, M. A Survey on Traffic Prediction Techniques Using Artificial Intelligence for Communication Networks. Telecom 2021, 2, 518–535. [Google Scholar] [CrossRef]
- Efron, B.; Hastie, T. Computer Age Statistical Inference: Algorithms, Evidence, and Data Science; Cambridge University Press: Cambridge, UK, 2016; pp. 1–475. [Google Scholar] [CrossRef]
- Jiang, W. Cellular Traffic Prediction with Machine Learning: A Survey. Expert Syst. Appl. 2022, 201, 117163. [Google Scholar] [CrossRef]
- Gorshenin, A.; Kuzmin, V. Statistical Feature Construction for Forecasting Accuracy Increase and Its Applications in Neural Network Based Analysis. Mathematics 2022, 10, 589. [Google Scholar] [CrossRef]
- Downey, A.; Loukides, M.; Blanchette, M.; Demarest, R. Think Stats: Exploratory Data Analysis; O’Reilly Media: Sebastopol, CA, USA, 2014; pp. 1–223. [Google Scholar]
- Gorshenin, A.; Shcherbinina, A. Efficiency of the Method for Detecting Normal Mixture Signals with Pre-Estimated Gaussian Mixture Noise. Pattern Recognit. Image Anal. 2020, 30, 470–479. [Google Scholar] [CrossRef]
- Gorshenin, A.; Kazakov, I.; Korolev, V. On the Convergence of Median Versions of the Expectation-Maximization Algorithm for the Separation of Finite Normal Mixtures. J. Math. Sci. 2022, 267, 92–98. [Google Scholar] [CrossRef]
- Xu, F.; Lin, Y.; Huang, J.; Wu, D.; Shi, H.; Song, J.; Li, Y. Big Data Driven Mobile Traffic Understanding and Forecasting: A Time Series Approach. IEEE Trans. Serv. Comput. 2016, 9, 796–805. [Google Scholar] [CrossRef]
- Stepanov, N.; Alekseeva, D.; Ometov, A.; Lohan, E. Applying Machine Learning to LTE Traffic Prediction: Comparison of Bagging, Random Forest, and SVM. In Proceedings of the 12th International Congress on Ultra Modern Telecommunications and Control Systems and Workshops, ICUMT 2020, Brno, Czech Republic, 5–7 October 2020; pp. 119–123. [Google Scholar] [CrossRef]
- Ma, T.; Antoniou, C.; Toledo, T. Hybrid Machine Learning Algorithm and Statistical Time Series Model for Network-Wide Traffic Forecast. Transp. Res. Part Emerg. Technol. 2020, 111, 352–372. [Google Scholar] [CrossRef]
- Lens Shiang, E.; Chien, W.C.; Lai, C.F.; Chao, H.C. Gated Recurrent Unit Network-based Cellular Traffic Prediction. In Proceedings of the 34th International Conference on Information Networking, ICOIN 2020, Barcelona, Spain, 7–10 January 2020; pp. 471–476. [Google Scholar] [CrossRef]
- Zhaowei, Q.; Haitao, L.; Zhihui, L.; Tao, Z. Short-Term Traffic Flow Forecasting Method with M-B-LSTM Hybrid Network. IEEE Trans. Intell. Transp. Syst. 2022, 23, 225–235. [Google Scholar] [CrossRef]
- Shan, M.; Yan, Q.; Huang, S.; Wang, Y. Prediction and Analysis of Telemetry Data Based on LSTM Network. In Proceedings of the 2nd International Conference on Computer Network, Electronic and Automation, ICCNEA 2019, Xi’an; China, 27–29 September 2019; pp. 155–159. [Google Scholar] [CrossRef]
- Syam, R.F.; Girsang, A.S. Bandwidth Provisioning for 4G Mobile Network Using Hybrid ARIMA-LSTM Based Traffic Forecasting. Int. J. Eng. Trends Technol. 2021, 69, 235–241. [Google Scholar] [CrossRef]
- Azari, A.; Salehi, F.; Papapetrou, P.; Cavdar, C. Energy and Resource Efficiency by User Traffic Prediction and Classification in Cellular Networks. IEEE Trans. Green Commun. Netw. 2022, 6, 1082–1095. [Google Scholar] [CrossRef]
- Tran, Q.T.; Hao, L.; Trinh, Q.K. Cellular Network Traffic Prediction Using Exponential Smoothing Methods. J. Inf. Commun. Technol. 2019, 18, 1–18. [Google Scholar] [CrossRef]
- Peng, Y.; Lei, M.; Li, J.B.; Peng, X.Y. A Novel Hybridization of Echo State Networks and Multiplicative Seasonal ARIMA Model for Mobile Communication Traffic Series Forecasting. Neural Comput. Appl. 2014, 24, 883–890. [Google Scholar] [CrossRef]
- Kurri, V.; Raja, V.; Prakasam, P. Cellular Traffic Prediction on Blockchain-Based Mobile Networks Using LSTM Model in 4G LTE Network. Peer-to-Peer Netw. Appl. 2021, 14, 1088–1105. [Google Scholar] [CrossRef]
- Oduro-Gyimah, F.K.; Boateng, K.O. Using Autoregressive Integrated Moving Average Models in the Analysis and Forecasting of Mobile Network Traffic Data. J. Eng. Res. 2019, 7, 1–9. [Google Scholar] [CrossRef]
- Céspedes, J.E.S.; Rodríguez, Y.G.; Sarmiento, D.A.L. Development of An Univariate Method for Predicting Traffic Behaviour in Wireless Networks through Statistical Models. Int. J. Eng. Technol. 2015, 7, 27–36. [Google Scholar]
- Bastos, J.A. Forecasting the Capacity of Mobile Networks. Telecommun. Syst. 2019, 72, 231–242. [Google Scholar] [CrossRef]
- Ak, E.; Canberk, B. Forecasting Quality of Service for Next-Generation Data-Driven WiFi6 Campus Networks. IEEE Trans. Netw. Serv. Manag. 2021, 18, 4744–4755. [Google Scholar] [CrossRef]
- Sone, S.P.; Lehtomäki, J.J.; Khan, Z. Wireless Traffic Usage Forecasting Using Real Enterprise Network Data: Analysis and Methods. IEEE Open J. Commun. Soc. 2020, 1, 777–797. [Google Scholar] [CrossRef]
- Shayea, I.; Alhammadi, A.; El-Saleh, A.A.; Hassan, W.H.; Mohamad, H.; Ergen, M. Time Series Forecasting Model of Future Spectrum Demands for Mobile Broadband Networks in Malaysia, Turkey, and Oman. Alex. Eng. J. 2022, 61, 8051–8067. [Google Scholar] [CrossRef]
- Gijón, C.; Toril, M.; Luna-Ramírez, S.; Marí-Altozano, M.L.; Ruiz-Avilés, J.M. Long-Term Data Traffic Forecasting for Network Dimensioning in LTE with Short Time Series. Electronics 2021, 10, 1151. [Google Scholar] [CrossRef]
- Li, Y.; Wang, Y. Mobile Virtual Reality Rail Traffic Congestion Prediction Algorithm Based on Convolutional Neural Network. Mob. Inf. Syst. 2022, 2022, 2174208. [Google Scholar] [CrossRef]
- Biernacki, A. Traffic Prediction Methods for Quality Improvement of Adaptive Video. Multimed. Syst. 2018, 24, 531–547. [Google Scholar] [CrossRef]
- Yu, Q.; Jibin, L.; Jiang, L. An Improved ARIMA-Based Traffic Anomaly Detection Algorithm for Wireless Sensor Networks. Int. J. Distrib. Sens. Netw. 2016, 2016, 9653230. [Google Scholar] [CrossRef]
- Feng, H.; Shu, Y.; Ma, M. WLAN Traffic Prediction Using Support Vector Machine. IEICE Trans. Commun. 2009, E92-B, 2915–2921. [Google Scholar] [CrossRef]
- Yadav, R.K.; Balakrishnan, M. Comparative Evaluation of ARIMA and ANFIS for Modeling of Wireless Network Traffic Time Series. Eurasip J. Wirel. Commun. Netw. 2014, 2014, 15. [Google Scholar] [CrossRef]
- Arifin, A.S.; Habibie, M.I. The Prediction of Mobile Data Traffic based on the ARIMA Model and Disruptive Formula in Industry 4.0: A Case Study in Jakarta, Indonesia. Telkomnika (Telecommun. Comput. Electron. Control) 2020, 18, 907–918. [Google Scholar] [CrossRef]
- Box, G.; Jenkins, G.; Reinsel, G.; Ljung, G. Time Series Analysis: Forecasting and Control; Wiley: Hoboken, NJ, USA, 2015; pp. 1–712. [Google Scholar]
- Cryer, J.; Chan, K. Time Series Analysis: With Applications in R; Springer: Berlin/Heidelberg, Germany, 2008; pp. 1–491. [Google Scholar]
- Faverjon, C.; Berezowski, J. Choosing the Best Algorithm for Event Detection Based on the Intend Application: A Conceptual Framework for Syndromic Surveillance. J. Biomed. Inform. 2018, 85, 126–135. [Google Scholar] [CrossRef] [PubMed]
- Hyndman, R.; Athanasopoulos, G. Forecasting: Principles and Practice; OTexts: Melbourne, Australia, 2021; pp. 1–442. [Google Scholar]
- Miao, D.; Qin, X.; Wang, W. The Periodic Data Traffic Modeling based on Multiplicative Seasonal ARIMA Model. In Proceedings of the 6th International Conference on Wireless Communications and Signal Processing, WCSP 2014, Hefei, China, 23–25 October 2014. [Google Scholar] [CrossRef]
- Cleveland, R.B.; Cleveland, W.S.; McRae, J.E.; Terpenning, I. STL: A Seasonal-Trend Decomposition Procedure Based on Loess. J. Off. Stat. 1990, 6, 3–73. [Google Scholar]
- Kwiatkowski, D.; Phillips, P.; Schmidt, P.; Shin, Y. Testing the Null Hypothesis of Stationarity Against the Alternative of a Unit Root. How Sure are We that Economic Time Series Have a Unit Root? J. Econom. 1992, 54, 159–178. [Google Scholar] [CrossRef]
- Efrosinin, D.; Kochetkova, I.; Stepanova, N.; Yarovslavtsev, A.; Samouylov, K.; Valentini, R. The Fourier Series Model for Predicting Sapflow Density Flux based on TreeTalker Monitoring System. Lect. Notes Comput. Sci. 2020, 12526, 198–209. [Google Scholar] [CrossRef]
- Efrosinin, D.; Kochetkova, I.; Stepanova, N.; Yarovslavtsev, A.; Samouylov, K.; Valentini, R. Trees Classification based on Fourier Coefficients of the Sapflow Density Flux. Ann. Math. Informaticae 2021, 53, 109–123. [Google Scholar] [CrossRef]
Ref. | Task | Data Source/Application | Model | Evaluation Metric |
---|---|---|---|---|
[30] | Traffic forecasting | Traffic from 2 cells | ARIMA | MAE, NMSE |
[31] | Traffic forecasting | Traffic from 3 LTE cells | ARIMA, LSTM | MSE, MAE, score |
[34] | Network capacity forecasting | Circuit switch and packet switch 3G traffic | Random walk, Linear trend, Exponential smoothing, ARIMA | RMSE, MAPE |
[38] | Traffic forecasting | Traffic from 7160 LTE cells | SARIMA, Holt-Winters, Random Forest, SVM, ANN | MAPE, MAE |
[28] | Recourse optimization | Traffic from the network with discontinuous reception (DRX) scheme | ARIMA, LSTM | RMSE |
[39] | Traffic congestion forecasting | Traffic from 8 detectors of virtual reality railway | Random walk, Historical mean, ARIMA, LSTM, GRU, DCFCN | RMSE, MAE, MAPE |
[36] | Traffic forecasting | Traffic from 470 access points of an enterprise network | Holt-Winters, SARIMA, LSTM, GRU, CNN | MAE, RMSE, NRMSE, score |
[40] | Throughput forecasting | Download traffic from HSPA network | ARIMA, FARIMA, ANN | Efficiency, switches per minute and buffering per minute |
[35] | Traffic forecasting | WiFi and cellular download traffic from a server | ARIMA, FARIMA, SVR, RNN | MAE, RMSE, MAPE, MASE |
[33] | Traffic forecasting | WiFi traffic from 15 protocols | ARIMA | Error rate, average absolute deviation, mean and variance of the error |
[41] | Traffic forecasting and anomaly detection | Traffic from wireless sensor network | ARIMA | Complexity, Accuracy, Intelligence, Independence |
[32] | Traffic forecasting | Traffic from 191 eNodeB | ARIMA | AIC, AICc, BIC |
[42] | Traffic forecasting | WLAN traffic | ARIMA, FARIMA, SVM, Welevet, ANN | MSE, NMSE |
[29] | Traffic forecasting | Voice and data traffic from 600 cells | Exponential Smoothing, Holt-Winters | AIC, SSR, RMSE, AMSE |
[37] | Spectrum efficiency forecasting | Traffic from 3 countries | AR, MA, ARMA, ARIMA | MAE, MSE, RMSE, NRMSE, NMAE |
[43] | Traffic forecasting | Traffic from an institutional wireless network | ARIMA, ANFIS | RMSE |
[44] | Traffic forecasting | LTE and 3G traffic | ARIMA | Percentage error between models |
Variable | Description |
---|---|
START_HOUR | Start time of the one-hour period for measuring traffic |
MASKED_MSISDN | Masked mobile station international subscriber directory number |
APP_CLASS | Application class |
UPLOAD | Incoming traffic in the uplink during one hour [MB] |
DOWNLOAD | Outgoing traffic in the downlink during one hour [MB] |
START_HOUR | MASKED_MSISDN | APP_CLASS | UPLOAD | DOWNLOAD |
---|---|---|---|---|
2018-02-10 01:00:00 | F6C1745A0A9DF638DE2C14683E0F250D | Streaming Applications | 0.002527 | 0.000616 |
2018-02-10 01:00:00 | 6474B3E3E20B5887A7593C61439250A9 | Others | 0.000828 | 0.000334 |
2018-02-10 01:00:00 | B05DEBB3D0E2ACD68FE47611CA3FDDCB | Web Applications | 0.000967 | 0.001813 |
2018-02-10 01:00:00 | B2D5516431ECC5B6851FD9FBAE0387A7 | Games | 0.000039 | 0.000052 |
2018-02-10 01:00:00 | 5B507EECA75149121F9C86E8690109D8 | 0.012802 | 0.006137 |
Application | No. Records | Application | No. Records |
---|---|---|---|
Web Applications | 9,641,283 | Others | 6,464,018 |
Instant Messaging Applications | 5,540,289 | Games | 5,199,684 |
File Transfer | 4,259,626 | 2,622,758 | |
Streaming Applications | 2,420,552 | VoIP | 1,825,777 |
Security | 1,667,752 | Music Streaming | 790,107 |
Network Operation | 635,634 | P2P Applications | 274,245 |
Terminals | 121,807 | File Systems | 10,118 |
DB Transactions | 5816 | Legacy Protocols | 22 |
Upload | Download | |
---|---|---|
Mean | ||
Standard deviation | ||
Minimum | 0 | 0 |
Mode | ||
Maximum |
Group | Applications |
---|---|
1. Time series is similar to the total traffic profile and seasonal | Others, Streaming Applications, Web Applications |
2. Time series is not similar to the total traffic profile and non-seasonal with outliers | DB Transactions, File Systems, File Transfer, Games, Mail, Music Streaming, P2P Applications, Security, Terminals |
3. Time series is not similar to the total traffic profile and seasonal | Instant Messaging Applications, Legacy Protocols, VoIP, Network Operation |
Parameter | Description |
---|---|
Time series parameters | |
Time series of data | |
Forecast value of | |
SARIMA model parameters | |
p | Order of the non-seasonal AR part |
d | Degree of differencing for the non-seasonal part |
q | Order of the non-seasonal MA part |
P | Order of the seasonal AR part |
D | Degree of differencing for the seasonal part |
Q | Order of the seasonal MA part |
m | Number of periods in each season |
Lag operator | |
Error terms | |
Parameters of the non-seasonal AR part | |
Parameters of the non-seasonal MA part | |
Parameters of the seasonal AR part | |
Parameters of the seasonal MA part | |
Holt-Winters model parameters | |
Smoothed value of | |
Estimate of the trend | |
Seasonal change factor | |
Data smoothing factor | |
Trend smoothing factor | |
Seasonal change smoothing factor |
Metric | Formula |
---|---|
Mean squared error (MSE) | |
Root mean square error (RMSE) | |
Mean absolute error (MAE) | |
Mean absolute percentage error (MAPE) | |
Mean squared logarithmic error (MSLE) |
Metric | SARIMA Model | Holt-Winters Model |
---|---|---|
MSE | 0.0181 | 0.00021 |
RMSE | 0.0181 | 0.0145 |
MAE | 0.01513 | 0.01217 |
MAPE | 15% | 11.2% |
MSLE | 0.000258 | 0.000163 |
Metric | SARIMA Model | Holt-Winters Model |
---|---|---|
MSE | 0.00004 | 0.00015 |
RMSE | 0.006 | 0.0123 |
MAE | 0.0046 | 0.0099 |
MAPE | 4.17% | 9.9% |
MSLE | 0.00003 | 0.000118 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Kochetkova, I.; Kushchazli, A.; Burtseva, S.; Gorshenin, A. Short-Term Mobile Network Traffic Forecasting Using Seasonal ARIMA and Holt-Winters Models. Future Internet 2023, 15, 290. https://doi.org/10.3390/fi15090290
Kochetkova I, Kushchazli A, Burtseva S, Gorshenin A. Short-Term Mobile Network Traffic Forecasting Using Seasonal ARIMA and Holt-Winters Models. Future Internet. 2023; 15(9):290. https://doi.org/10.3390/fi15090290
Chicago/Turabian StyleKochetkova, Irina, Anna Kushchazli, Sofia Burtseva, and Andrey Gorshenin. 2023. "Short-Term Mobile Network Traffic Forecasting Using Seasonal ARIMA and Holt-Winters Models" Future Internet 15, no. 9: 290. https://doi.org/10.3390/fi15090290
APA StyleKochetkova, I., Kushchazli, A., Burtseva, S., & Gorshenin, A. (2023). Short-Term Mobile Network Traffic Forecasting Using Seasonal ARIMA and Holt-Winters Models. Future Internet, 15(9), 290. https://doi.org/10.3390/fi15090290