Optimizing Short-Term Water Demand Forecasting: A Comparative Approach to the Battle of Water Demand Forecasting †
Abstract
1. Introduction
2. Materials and Methods
2.1. General Framework
- Pre-processing: The data provided contain missing values, which limit the range of techniques that can be applied directly. The first step therefore fills these gaps using seasonally decomposed missing value imputation, as implemented in the R package imputeTS [5].
- Model generation: The second step comprises the generation of distinct models for forecasting. This study uses five distinct forecasting techniques (presented in Section 2.2). Each technique has its own parameters, so a range of parameter combinations is devised for each one. These parameter variations result in a set of candidate models for each forecasting technique.
- Prediction of the preceding period (hindcasting): Each of the candidate models generated in Step 2 is used to forecast a preceding period (e.g., a day or a week) leading up to the desired forecasting horizon. This step involves training the models on historical data preceding the target period, thereby assessing their predictive performance in a near-term context. Notably, this process may demand considerable computational resources, particularly when dealing with numerous parameter combinations or computationally intensive forecasting techniques.
- Model evaluation and selection: The values predicted in Step 3 are compared with the observed values using the mean absolute error, which allows the models to be ranked by their predictive performance in a near-term context. The model with the smallest error is selected for forecasting.
- Forecasting: The model selected in Step 4 is finally used to estimate the values for the forecasting horizon. This step involves training the model on historical data preceding the forecasting horizon.
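Steps 2 to 5 can be illustrated with a minimal Python sketch. It assumes an hourly, already imputed demand series held in a plain list, a one-day hindcast window, and a small family of Naïve-style candidates. The helper names (`make_naive_model`, `select_model`), the parameter grid, and the synthetic series are illustrative assumptions and are not taken from the study's implementation.

```python
from itertools import product
from statistics import mean, median

HOURS_PER_DAY = 24


def mean_absolute_error(actual, predicted):
    """Mean absolute error between two equally long sequences."""
    return mean(abs(a - p) for a, p in zip(actual, predicted))


def make_naive_model(history_days, reducer):
    """Build a candidate that predicts each hour from the same hour of the
    most recent `history_days` days, combined with `reducer`."""
    def forecast(series, horizon):
        n = len(series)
        predictions = []
        for step in range(horizon):
            target = n + step
            # Same hour of day on earlier days, most recent first.
            same_hour = [series[i]
                         for i in range(target - HOURS_PER_DAY, -1, -HOURS_PER_DAY)
                         if i < n]
            predictions.append(reducer(same_hour[:history_days]))
        return predictions
    return forecast


def select_model(series, candidates, horizon):
    """Hindcast the last `horizon` values with every candidate and return
    the name of the candidate with the smallest MAE, plus all scores."""
    train, observed = series[:-horizon], series[-horizon:]
    scores = {name: mean_absolute_error(observed, model(train, horizon))
              for name, model in candidates.items()}
    return min(scores, key=scores.get), scores


# Step 2: candidate models from a (hypothetical) parameter grid.
reducers = {"mean": mean, "median": median}
candidates = {f"{name}_{days}d": make_naive_model(days, fn)
              for (name, fn), days in product(reducers.items(), (2, 7))}

# Synthetic demand: a daily pattern whose level rises by 5 on the last 4 of 14 days.
series = [10 + hour + (5 if day >= 10 else 0)
          for day in range(14) for hour in range(HOURS_PER_DAY)]

# Steps 3 and 4: hindcast the last day and rank the candidates by MAE.
best, scores = select_model(series, candidates, horizon=HOURS_PER_DAY)

# Step 5: apply the selected model to all available data to forecast the next day.
forecast = candidates[best](series, HOURS_PER_DAY)
```

In this synthetic case the two-day candidates only see data recorded after the level shift, so they reproduce the held-out day exactly and one of them is selected. With real data the ranking depends on the series and on the candidate grid.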
2.2. Forecasting Techniques
- The Naïve technique considers patterns either on a day-by-day basis or by differentiating weekdays from weekends. For each timestamp in the forecast period, it identifies the relevant historical data blocks based on the chosen pattern. It then calculates the predicted value as the average, median, or exponentially weighted moving average of the historical data block. The configurable parameters are the duration of historical data (e.g., one or two months), the pattern type (day-by-day basis or differentiating weekdays from weekends), and the forecast type (average, median, or exponentially weighted moving average).
- The support vector regression (SVR) technique uses multiple regression models to perform the forecast. Initially, it prepares the historical data by creating lagged values and by categorizing the data into weekdays, Saturdays, and Sundays/holidays. For each timestamp in the forecast period, an SVR model is trained using the relevant blocks in the prepared historical data. The trained SVR model is then used to forecast that specific timestamp. The configurable parameters are the duration of historical data and the number of lagged values (e.g., the previous 5 or 10 measurements at the same time of day for the same type of weekday).
- The Quevedo technique first estimates the total daily volume for the day to be forecast, using an ARIMA model. This total daily volume is then distributed into hourly values based on the average pattern for that weekday. The configurable parameters are the duration of historical data and the pattern type (day-by-day basis or differentiating weekdays from weekends/holidays).
- Unlike the previous techniques, the XGBoost technique considers meteorological data (precipitation, air temperature, air relative humidity, wind speed). It first prepares the historical data with the necessary features (e.g., hour of the day, day of the week, weather variables, or holiday indicators). Then, a regression model is trained using 80% of the historical data, with the remaining 20% being used to monitor training progress and prevent overfitting. The trained regression model is then used to forecast each specific timestamp (note that expected weather data are required in this phase). The configurable parameters are the duration of historical data and multiple XGBoost-specific parameters (e.g., learning rate, number of estimators, max depth, and number of early stopping rounds).
- The long short-term memory (LSTM) technique is a recurrent neural network that memorizes long-term dependencies of time series [6]. The configurable parameters are the duration of the historical data, some parameters related to the general neural network algorithm (e.g., dropout, batch size, number of units), and LSTM-specific parameters (e.g., window length). The LSTM with weather data uses two LSTM modules: the first module deals with the historical water demand data, using past observations over a time horizon equal to the window length; the second module considers meteorological data (precipitation, temperature, relative humidity, wind speed) with a shorter window length [7]. The configurable parameters are the same as for the LSTM, but in this case they apply to both the water demand and the meteorological data.
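Two of the data-preparation steps described above lend themselves to a short illustration: the lagged same-hour inputs used by the SVR technique and the distribution of a daily total into hourly values in the Quevedo technique. The Python sketch below is a simplified illustration under stated assumptions: demand is stored as a chronological list of 24-value days that all belong to one day type, the daily total is passed in directly instead of being produced by an ARIMA model, and the function names are hypothetical.

```python
from statistics import mean

HOURS_PER_DAY = 24


def lagged_same_hour(days, hour, n_lags):
    """Build (inputs, targets) for one hour of the day.

    Each input row holds the `n_lags` previous measurements at `hour`
    on earlier days of the same type; the target is the next measurement.
    """
    values = [day[hour] for day in days]
    inputs = [values[i - n_lags:i] for i in range(n_lags, len(values))]
    targets = values[n_lags:]
    return inputs, targets


def average_hourly_profile(days):
    """Average share of the daily total that falls in each hour.

    Assumes the summed hourly means are non-zero.
    """
    hourly_mean = [mean(day[h] for day in days) for h in range(HOURS_PER_DAY)]
    total = sum(hourly_mean)
    return [value / total for value in hourly_mean]


def distribute_daily_volume(daily_total, profile):
    """Split a forecast daily total into hourly values using `profile`."""
    return [daily_total * share for share in profile]


# Synthetic history: six days of one day type, demand rising by 1 per day.
history = [[day + hour for hour in range(HOURS_PER_DAY)] for day in range(6)]

# SVR-style inputs for 03:00 using the two previous same-hour measurements.
inputs, targets = lagged_same_hour(history, hour=3, n_lags=2)

# Quevedo-style distribution of an assumed daily total of 480 units.
profile = average_hourly_profile(history)
hourly_forecast = distribute_daily_volume(480, profile)
```

The rows in `inputs` would be passed to a regression model such as an SVR, and `hourly_forecast` sums back to the daily total because the profile shares sum to one.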
3. Results
4. Discussion and Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Pacchin, E.; Gagliardi, F.; Alvisi, S.; Franchini, M. A Comparison of Short-Term Water Demand Forecasting Models. Water Resour. Manag. 2019, 33, 1481–1497.
- Mu, L.; Zheng, F.; Tao, R.; Zhang, Q.; Kapelan, Z. Hourly and Daily Urban Water Demand Predictions Using a Long Short-Term Memory Based Model. J. Water Resour. Plan. Manag. 2020, 146, 05020017.
- Ghalehkhondabi, I.; Ardjmand, E.; Young, W.A.; Weckman, G.R. Water Demand Forecasting: Review of Soft Computing Methods. Environ. Monit. Assess. 2017, 189, 313.
- Niknam, A.; Zare, H.K.; Hosseininasab, H.; Mostafaeipour, A.; Herrera, M. A Critical Review of Short-Term Water Demand Forecasting Tools—What Method Should I Use? Sustainability 2022, 14, 5412.
- Moritz, S.; Bartz-Beielstein, T. ImputeTS: Time Series Missing Value Imputation in R. R J. 2017, 9, 207.
- Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780.
- Zanfei, A.; Brentan, B.M.; Menapace, A.; Righetti, M. A Short-Term Water Demand Forecasting Model Using Multivariate Long Short-Term Memory with Meteorological Data. J. Hydroinformatics 2022, 24, 1053–1065.
Week | DMA A | DMA B | DMA C | DMA D | DMA E | DMA F | DMA G | DMA H | DMA I | DMA J |
---|---|---|---|---|---|---|---|---|---|---|
W1 | Naïve | Quevedo | Naïve | SVR | XGBoost | Naïve | Quevedo | SVR | Quevedo | XGBoost |
W2 | SVR | SVR | SVR | XGBoost | Quevedo | XGBoost | SVR | XGBoost | SVR | SVR |
W3 | XGBoost | SVR | Quevedo | Naïve | SVR | XGBoost | SVR | SVR | SVR | Naïve |
W4 | Naïve | Naïve | Naïve | LSTM-W | Naïve | LSTM-W | Naïve | SVR | LSTM-W | XGBoost |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ferreira, B.; Barreira, R.; Caetano, J.; Quarta, M.G.; Carriço, N. Optimizing Short-Term Water Demand Forecasting: A Comparative Approach to the Battle of Water Demand Forecasting. Eng. Proc. 2024, 69, 48. https://doi.org/10.3390/engproc2024069048