A Short-Term Water Demand Forecasting Model Using a Moving Window on Previously Observed Data

Pacchin, Elena; Alvisi, Stefano; Franchini, Marco

doi:10.3390/w9030172

Open AccessArticle

A Short-Term Water Demand Forecasting Model Using a Moving Window on Previously Observed Data

by

Elena Pacchin

^*,

Stefano Alvisi

and

Marco Franchini

Engineering Department, University of Ferrara, 44121 Ferrara, Italy

^*

Author to whom correspondence should be addressed.

Water 2017, 9(3), 172; https://doi.org/10.3390/w9030172

Submission received: 13 January 2017 / Revised: 14 February 2017 / Accepted: 24 February 2017 / Published: 28 February 2017

(This article belongs to the Special Issue Selected Papers from the 1st International Electronic Conference on Water Science)

Download

Browse Figures

Versions Notes

Abstract

:

In this article, a model for forecasting water demands over a 24-h time window using solely a pair of coefficients whose value is updated at every forecasting step is presented. The first coefficient expresses the ratio between the average water demand over the 24 h that follow the time the forecast is made and the average water demand over the 24 h that precede it. The second coefficient expresses the relationship between the average water demand in a generic hour falling within the 24-h forecasting period and the average water demand over that period. These coefficients are estimated using the information available in the weeks prior to the time of forecasting and, therefore, the model does not require any actual calibration process. The length of the time series necessary to implement the model is thus just a few weeks (3–4 weeks) and the model can be parameterized and used without there being any need to collect hourly water demand data for long periods. The application of the model to a real-life case and a comparison with results provided by another model already proposed in the scientific literature show it to be effective, robust, and easy to use.

Keywords:

water demand; forecast; moving window

1. Introduction

Water demands represent the driver of water distribution systems, and reliable demand forecasting represents a valid aid in simulating and managing such systems. Depending on the time horizon considered, water demand forecasts can be classified, as suggested by [1], into “strategic”, “tactical”, and “operational”.

Forecasts of a “strategic” type provide an estimate of water demands over long time horizons, spanning decades [2], and are intended to support long-term planning of water resource use or the design of water networks [3,4,5,6].

Forecasts of a “tactical” type provide an estimate of water demands over medium-term horizons of a few years [2] and are intended to support the management in interventions to upgrade and improve networks and installations [7,8].

Finally, forecasts of an “operational” type provide an estimate of water demands over short time horizons, ranging from one day to a few weeks [2], and are aimed at enabling optimal short-term (or real-time) management of the network devices and installations (pressure regulating valves, pumping stations, energy recovery systems, etc.) [9,10,11,12,13,14,15,16].

Focusing attention on the latter type of forecast, which the present study is specifically concerned with, a variety of models have recently been presented in the scientific literature (see, for example, [1,2,17] for a review of these models). The models differ in several respects—principally, in the time step considered, the type of input used, and the structure and approach on which the model is based.

The time step used typically ranges from a day [11,18] to an hour [12,19], or to as little as a quarter of an hour in the case of the model of [13]. There are also models with multiperiodicity, in which water demand is forecast at different time steps, for example on a daily and hourly basis, as in [10].

With regard to inputs, the majority of short-term forecasting models are based mainly on the historical series of demands observed in the previous week or month or year.

However, in some cases, climate factors such as precipitation, temperature, and humidity are also considered, as they can have a significant impact on water demand [19,20,21].

Finally, as far as approach is concerned, many of the models recently proposed in the literature are based on data-driven techniques such as artificial neural networks (ANNs) (e.g., [12,17,22,23,24,25], support vector machines (SVMs) (e.g., [26,27], fuzzy logic [28], projection pursuit regression (PPR), random forests (RMs) and multivariate adaptive regression splines (MARS) [12].

Other models, by contrast, are based primarily on the representation of the periodic patterns that typically characterize demand [13], possibly coupled with techniques of time series analysis, as in [10].

An important aspect that should be highlighted is the model’s parameterization, which is closely connected both to the approach and to the structure of the model itself.

The above-mentioned models are based on structures that can be more or less complex, but they are generally characterized by parameters whose values must be defined prior to the model’s application. For example, neural network models, by their very nature, require a calibration of weights and bias; a sufficiently long time series of historically observed data has to be used for this purpose in order for the network to be effectively trained, taking into account the variability in the water demand patterns that can be observed, for example, during the year. It is well known, in fact, that a neural network, when implemented in practice, is able to reliably reproduce/forecast data that fall within the range previously used for calibration, whereas it may produce poor results if applied to a set of data differing completely from those considered in the calibration phase [29,30]. Clearly, a neural network forecasting model calibrated on the basis of data relating to low demands—typical, for example, of a winter period—might perform poorly if then applied for forecasting in periods of high demand, such as in summertime. An analogous consideration applies for pattern-based models. For example, the model proposed by [10] requires at least one year of observations in order to define long-term periodic patterns, or seasonal fluctuations, as well as to characterize daily demand patterns, which vary from season to season. From an operational viewpoint, the need to have a time series of historically observed data for the network to which the forecasting model is to be applied can be a limiting factor to be taken into account when choosing the model’s structure.

In this paper, a short-term water demand forecasting model based on the use of a short series of data observed prior to the time, i.e., the hour, at which the forecast is made, is presented. The model is structured in such a way as to use, as input data, the hourly water demands observed over a few weeks prior to the time of the forecast and to deliver, as output, an hourly water demand forecast for the next 24 h. As illustrated in the sections that follow, some of the advantages of this model lie in the simplicity of the structure it is based on and the substantial absence of a calibration period. In fact, the parameters of this model simply change as time passes and are updated on the basis of the last, or most recent, observed data. Indeed, it is worth noting that this type of approach based on a moving window of previously observed data has been already used in other frameworks such as power demand forecasting [31], water meter under-registration [32], and meter tampering [33].

Below, the structure of the proposed forecasting model is presented (Section 2); the model is then applied to a real case study (Section 3), and the results obtained are analyzed and compared with those provided by another short-term forecasting model already proposed in the scientific literature (Section 4). Finally, in Section 5, some concluding considerations are presented.

2. The Proposed Model

With reference to the diagram in Figure 1, let us assume an hourly time step and let i (with i = 1, 2,…, 365) be a generic day of the year and h (with h = 1, 2,…, 24) be a generic hour of the day. Let the time t denote a generic hour of the year (counted from the beginning of the year) in which the forecast is made. Assuming that the forecast is made at the h-th hour of the i-th day of the year, the time t will be equal to:

t = 24 (i - 1) + h

(1)

Finally, let k (with

k = 1, 2, ..., K = 24

) be the forecast lead time.

The model comprises two steps. In the first step, the average water demand over 24 h which represents the forecasting time window is estimated. In the second step, relying on this forecast, the average hourly water demand in each hour included in the window is estimated.

More precisely, the average water demand

{\bar{D}}_{t}^{f o r}

for the K = 24 h following the time t is estimated by means of the following relation:

{\bar{D}}_{t}^{f o r} = α_{t} {\bar{D}}_{t - 24}^{o b s}

(2)

where

{\bar{D}}_{t - 24}^{o b s}

is the observed average water demand in the 24 h prior to the forecasting time t (or from t − 24 to t) and

α_{t}

is a coefficient having a specific value for the 24-h horizon that starts at the time t. In particular, it represents the expected ratio between the average water demand over the 24 h included in the forecasting interval and the average water demand over the 24 h that precede that interval.

Once the average water demand

{\bar{D}}_{t}^{f o r}

is known, we can go on to estimate the average hourly water demand

Q_{t + k}^{f o r}

for the time t + k (with k = 1, 2,…, K = 24) by means of the following relation:

Q_{t + k}^{f o r} = β_{t, k} {\bar{D}}_{t}^{f o r}

(3)

where the coefficient

β_{t, k}

refers to the specific lead time k of the time horizon of K = 24 h which begins at the time t. In other words, the coefficient

β_{t, k}

represents the expected ratio between the average hourly water demand in the hour corresponding to t + k and the average water demand over the 24 h that include the forecasted hour, i.e., the average water demand over the interval from t to t + 24.

The model is, thus, very simple and entails quantifying the two coefficients

α_{t}

and

β_{t, k}

, which are naturally updated at every time t.

2.1. Estimation of the Coefficients $α_{t}$ and $β_{t, k}$

Let us assume weeks divided into 7 days (Monday, Tuesday,…, Sunday). For each day, there is a particular pattern of hourly consumption [9]. In this section, it is assumed that all public holidays can be treated like Sundays. Greater details concerning the possibility of taking into account a different pattern for certain holidays will be provided in Section 2.3.

As is well known, hourly demands (i.e., average hourly water demands) are characterized by different patterns in the 7 days of the week and, with the exception of holidays, the days

{i, i - 7, i - 14, i - 21, ..}

are of the same “type” (for example, they are all Tuesdays/weekdays). Therefore, once the forecasting time

t = 24 (i - 1) + h

(counted from the beginning of the year) has been established, it will be possible to identify the n hours that correspond to the same hour h of the day on which the forecast is made and belongs to the same “type” of day in the previous n weeks. The set S of these n hours can thus be defined as:

S = {s_{1}; s_{2}; ...; s_{n}} = {t - 7 \times 24; t - 7 \times 24 \times 2; ...; t - 7 \times 24 \times n} = {t - 168; t - 336; ...; t - 168 \times n}

(4)

Within this framework, n can be interpreted as the length/extension of a moving window, that “moves” with t. It is important to clarify that the definition of the set S of n hours that corresponds to the same hour h of the day on which the forecast is made and belongs to the same “type” of day as provided in Equation (4) applies under “ordinary” conditions, that is, when the day considered is a Monday or a Tuesday or a Sunday and not a public holiday falling on a weekday. In the latter case, supposing for example that a holiday falls on a Tuesday, s₁, s₂,…, s_n would not be obtained by going back 7 × 24 h, 7 × 24 × 2 h,… 7 × 24 × n h, that is, by considering the Tuesdays of the previous weeks, but rather by selecting the corresponding hours of the previous Sundays (or other holidays, considered equivalent to Sundays). In the example considered, where the forecasting time t corresponds to an hour of a holiday falling on a Tuesday, s₁ would be equal to t − 2 × 24, therefore, going back 2 days, from Tuesday to Sunday, s₂ would be equal to t − (2 + 7) × 24, whilst considering the same hour occurring two Sundays ago, s₃ would be equal to t − (2 + 7 × 2) × 24, …, and s_n would be equal to t − (2 + 7 × (n − 1)) × 24. Incidentally, in this case, the series of observed data would be slightly shorter than n weeks (since s_n = t − (2 + 7 × (n − 1)) × 24, and not s_n = t − (7 × n) × 24 as in the case of an “ordinary” condition (see Equation (4)).

For the sake of simplicity, when referring to the set of n hours corresponding to the same hour h of the day on which the forecast is made and belonging to the same “type” of day, from this point on we will mean “ordinary” conditions and, thus, a moving window of n weeks preceding the forecasting time t. Clearly, if the forecasting time t corresponds to an hour of a holiday falling on a weekday, the moving window will be shorter than n weeks; conversely, if the forecasting time t corresponds to an hour of a weekday, but a holiday has fallen on the same type of weekday in the n previous weeks, that day cannot be included in the set S, making it necessary to lengthen the series by going back an additional week.

Having clarified these points, the coefficient

α_{t}

can be estimated in the following manner:

α_{t} = \frac{1}{n} \sum_{s_{j} = s_{1}}^{s_{n}} \frac{{\bar{D}}_{s_{j}}^{o b s}}{{\bar{D}}_{s_{j} - 24}^{o b s}}

(5)

where

{\bar{D}}_{s_{j}}^{o b s}

is the observed average water demand in the 24 h following the hour s_j (with j = 1, 2, …, n) while, with reference to Equation (4) and under “ordinary” conditions, s₁ = t − 7 × 24, s₂ = t − 7 × 24 × 2,…, s_n = t − 7 × 24 × n, and

{\bar{D}}_{s_{j} - 24}^{o b s}

is the observed average water demand in the 24 h preceding the hour s_j (i.e., from s_j − 24 to s_j) (see Figure 1).

The coefficient

β_{t, k}

, related to the specific lead time k of the horizon of K = 24 h that begins at the time t, is estimated as follows:

β_{t, k} = \frac{1}{n} \sum_{s_{j} = s_{1}}^{s_{n}} \frac{Q_{s_{j} + k}^{o b s}}{{\bar{D}}_{s_{j}}^{o b s}}

(6)

where

Q_{s_{j} + k}^{o b s}

is the hourly water demand in the k-th hour following the hour s_j (with j = 1, 2, …, n). It should be noted that at every forecasting time step t, 24 values of the coefficient

β_{t, k}

are calculated, one for each forecast lead time k (with

k = 1, 2, ..., K = 24

). The earlier considerations regarding the definition of the set S where the forecasted hour corresponds to an hour of a holiday falling on a weekday clearly apply for the estimation of

β_{t, k}

as well.

A numerical example of estimation of the coefficients

α_{t}

and

β_{t, k}

is provided in Appendix A.

2.2. Considerations on the Length n of the Moving Window

What has been illustrated above shows that the coefficients

α_{t}

and

β_{t, k}

implicitly contain information characterizing the water demand of a certain “type” of day and a given hour of the day. However, these coefficients are continuously updated, being computed over a moving window of n weeks (under “ordinary” conditions) which ends at the forecasting time step t and moves forward with it. Clearly, the length of the set S used to estimate the coefficients must be sufficiently long, for example n > 2, in order to stabilize the estimate of the coefficients themselves and avoid fluctuations caused by a particular/anomalous demand observed at a certain time step. Yet at the same time, the set S, and hence the moving window, should not be too long, for example n < 10, i.e., not more than two/three months, so that seasonal fluctuations in demand can be taken into account in the coefficients. In the case of a very long window, for example 6 months or one year, an offsetting effect would occur, so that if the forecasting step t were to fall, for example, in the summer, the corresponding coefficients would be estimated also on the basis of the observed demand data for the winter period, thus precluding a correct characterization of seasonal fluctuations.

There is a further advantage of considering a fairly short moving window: since the model is based on the data of the moving window alone, from an operational standpoint, it is only necessary to have n weeks of observed water demand data in order to implement it. As seen in the introduction, unlike other models it does not require a long calibration dataset.

2.3. Model with Holidays and Special Occasions

The previously illustrated model, hereinafter referred to as the αβ_WDF (αβ Water Demand Forecasting) model, is structured assuming that public holidays and other special occasions can be considered like Sundays, although in the literature (e.g., Bakker et al., 2013) it has been observed that the water demand pattern associated with specific holidays is slightly different from the one typical of Sundays.

We can take account of this different pattern by slightly modifying the method of estimating the coefficients

β_{t, k}

, though this implies the need to have m (with m ≥ 2) years of observed water demand data at our disposal.

Essentially, if the forecasted hour falls within a public holiday—being the average water demand

{\bar{D}}_{t}^{f o r}

over the K = 24 h following the time t still estimated as provided in Equation (2), with

α_{t}

estimated by means of Equation (5)—the hourly water demand

Q_{t + k}^{f o r}

for the hour corresponding to t+ k (with

k = 1, 2, ..., K = 24

) can be estimated using Equation (3), but in this case the coefficients

β_{t, k}

will be estimated on the basis of the data recorded for the forecasted hour corresponding to t + k on the same holiday in the m previous years.

Practically speaking, therefore, in this case

β_{t, k}

is given by:

β_{t, k} = \frac{1}{m} \sum_{s_{y} = s_{1}}^{s_{m}} \frac{Q_{s_{y} + k}^{o b s}}{{\bar{D}}_{s_{y}}^{o b s}}

(7)

where s_y denotes the hour of the same holiday corresponding to the current time t but occurring y years earlier and hence

Q_{s_{y} + k}^{o b s}

is the hourly water demand corresponding to the observed forecasted hour (which falls within a holiday) y = 1, 2, …, m years earlier and

{\bar{D}}_{s_{y}}^{o b s}

is the corresponding average water demand observed in the 24 h following the hour s_y in the y = 1, 2,…, m previous years.

For the sake of brevity, this variant of the model αβ_WDF will hereinafter be referred to as αβ_h_WDF (αβ holiday Water Demand Forecasting) model.

3. Case Study

The proposed model was applied to the real-life case of Castelfranco Emilia (Italy). The water distribution system of Castelfranco Emilia serves about 23,000 inhabitants and is supplied by a tank equipped with a flow meter which provides a measurement of the water consumption of the entire town, including leakage. Hourly demand data are available for the years 1997, 1998, 1999, and 2000. Since an analysis revealed that data for various periods of the years 1997 and 1999 were either missing or unreliable, it was decided to use the years 1998 and 2000 for the purpose of applying and evaluating the effectiveness of the model.

The same data were used in a case study by [10], who undertook to apply and test a model they proposed called Patt_WDF (Pattern based Water Demand Forecasting). The results of the Patt_WDF model were thus used by way of comparison to analyze the effectiveness of the model here proposed. More specifically, the performance of the two models was assessed considering the 24 forecasting time horizons (i.e., the forecasts for 1, 2, and up to 24 h ahead) and using the Root Mean Square Error (RMSE) and Mean Absolute Error (MAE%) defined as:

RMSE = \sqrt{\frac{1}{z} \sum_{i = 1}^{z} e_{i}^{2}}

(8)

MAE % = \frac{1}{z} \sum_{i = 1}^{z} | \frac{e_{i}}{μ_{o b s}} | \times 100

(9)

where z is the number of hours in the period considered (for example, a year),

e = Q^{o b s} - Q^{f o r}

is the error,

Q^{o b s}

is the value of the observed hourly water demand,

Q^{f o r}

is the value of the forecasted hourly water demand, and

μ_{o b s}

is the mean of the observed values.

It should be noted that, as stated in the introduction, the Patt_WDF is a short-term water demand forecasting model based on a hybrid technique; like the αβ_WDF model, the Patt_WDF model is based on a 24-h time horizon and an hourly time step, but unlike the αβ_WDF model, the Patt_WDF model requires at least a year of observed water demand data for calibration purposes. In particular, as complete data were available for the years 1998 and 2000, the 1998 data and 2000 data were used for the calibration and validation, respectively, of the Patt_WDF model. Accordingly, though the αβ_WDF model does not require calibration periods in the strict sense, it was decided to present the results for the years 1998 and 2000 separately in order to enable a straightforward and meaningful comparison with the results of the Patt_WDF model.

More specifically, in the next section we start off by presenting an analysis of the results obtained with the αβ_WDF model, applied assuming n = 4, i.e., considering a moving window of observed data of 4 weeks under “ordinary” conditions for the estimation of

α_{t}

and

β_{t, k}

. These results are then compared with those of the Patt_WDF model. Subsequently, with specific reference to public holidays alone, a comparison between the results provided by the αβ_WDF model and those provided by the αβ_h_WDF model is presented; the coefficients

β_{t, k}

assumed when applying the latter were estimated on the basis of observed data for the years 1997 and 1999 as well (since data related to the holidays considered were also available for these years).

4. Analysis and Discussion of the Results

Figure 2 and Figure 3 show the RMSE and MAE% values for different forecasting time horizons, ranging between 1 and 24 h, for the years 1998 and 2000 obtained with the αβ_WDF model and the Patt_WDF model.

Based on an analysis of the results provided by the αβ_WDF model, it may be observed that the forecasting accuracy is in general very good, with RMSE values in the range of 4–6 L/s and corresponding MAE% values between 5% and 7%. More specifically, it can be observed that the accuracy is greater for short time horizons and decreases slightly with increases in the lead time, although the variations of the RMSE and MAE% remain, in any case, very modest. With reference to the model’s application to the year 1998, the RMSE increases from about 4.5 L/s for the 1-h-ahead forecast to about 5.5 L/s for the 24-h-ahead forecast and the MAE% rises accordingly from about 6% for the 1-h-ahead forecast to about 7% for the 24-h-ahead forecast. Similarly, when the model is applied to data for the year 2000, the RMSE increases from about 5.0 L/s for the 1-h-ahead forecast to about 6.0 L/s for the 24-h-ahead forecast and the MAE% increases from about 5.5% to about 6.5%. Incidentally, the fact that MAE% is generally lower in 2000 than that observed in 1998, while for RMSE the reverse applies, is due to the fact that in 2000 the average hourly water demand had a slight increment and, at the same time, the magnitude of the errors is almost the same (see Equation (9)).

A comparison of the results provided, respectively, by the αβ_WDF model and Patt_WDF model shows that when the models were applied to data for the year 1998 (calibration period for the Patt_WDF model), the Patt_WDF model provided better forecasting accuracy than the αβ_WDF model for all time horizons, with RMSE values ranging from about 3 L/s for the 1-h-ahead forecast to about 4 L/s for the 24-h-ahead forecast and MAE% values ranging from about 4.5% for the 1-h-ahead forecast to about 5.5% for the 24-h-ahead forecast. Conversely, when the models were applied to data for the year 2000 (validation period for Patt_WDF), the αβ_WDF model generally provided better accuracy. The two models showed a similar RMSE for the 1-h-ahead forecast (4.5 L/s in the case of the Patt_WDF model and 5 L/s in the case of the αβ_WDF model), but the performance of the αβ_WDF model was better for other time horizons; specifically, the RMSE for the 24-h-ahead forecast was greater than 7 L/s in the case of the Patt_WDF model versus 6 L/s for the αβ_WDF model (corresponding to an MAE% of about 9% in the case of the Patt_WDF model and 6.5% for the αβ_WDF model). In general, comparing the trends in the indexes resulting from the two models when applied to two different years of data, it can be observed that the performance of the αβ_WDF model did not vary significantly from year to year, whereas that of the Patt_WDF model was significantly worse when applied to the 2000 data as opposed to the 1998 data. This is understandable, considering that the Patt_WDF model, unlike the αβ_WDF model, requires a set of calibration data regarding a long period (1 year). The Patt_WDF model is thus able to deliver high forecasting accuracy in relation, precisely, to the 1998 calibration data, i.e., the dataset used to characterize the periodic patterns on which the forecasting model is based. However, switching to another year (validation), and hence a different dataset, the model’s performance tends to decline. In contrast, the performance of the αβ_WDF model remains stable from one year to another, as it does not require a long calibration dataset and its forecasts are made using coefficients that are continuously updated on the basis of a moving window.

This different behavior of the two models is also confirmed by an analysis of the error frequency histograms shown in Figure 4 and Figure 5 for the years 1998 and 2000, respectively. In particular, in each figure, the histograms of the errors given by the two models in the 1-h-ahead (Figure 4a for 1998 and Figure 5a for 2000), 6-h-ahead (Figure 4b and Figure 5b), 12-h-ahead (Figure 4c and Figure 5c), and 24-h-ahead (Figure 4d and Figure 5d) forecasts are overlaid. Based on an analysis of Figure 4, relating to the year 1998, it may be noted, in general, that the histograms of the two models show wholly similar widths for the different time horizons considered and are centered on zero. More specifically, a slightly larger spread can be observed in the errors of the αβ_WDF model, confirming the better performance of the Patt_WDF model with respect to the (its) calibration period. However, analyzing the errors relating to the year 2000 (Figure 5), it can be observed that whilst the αβ_WDF model provides a symmetrical distribution of errors centered on zero, similar to one seen for 1998, the error distribution produced by Patt_WDF is not symmetric relative to zero, but is, rather, shifted onto the positive axis. This particular error distribution of the Patt_WDF model in the year 2000 indicates a general underestimation in the water demand forecast related to the validation period (the error being defined as

e = Q^{o b s} - Q^{f o r}

). This is understandable, considering that the observed data for 2000 show a slight increase in water demands compared to 1998. When used to forecast the water demands of 2000, the Patt_WDF model, calibrated on the basis of the 1998 data, tends to underestimate them. This problem does not occur with the αβ_WDF model, since it does not rely on parameters calibrated over a specific year, but rather uses information deduced from the demands observed in the last n = 4 weeks and is thus capable of adapting to long-term variations in the demands themselves.

To conclude the analysis of the performance of the αβ_WDF model, in Figure 6 and Figure 7, scatter plots of the 1-, 6-, 12-, and 24-h ahead demand forecasts for the years 1998 and 2000, respectively are shown. Furthermore, in Figure 8, by way of example, a comparison between the observed water demand pattern for a generic day of the year 2000 (23 January) and the 1- and 24-h-ahead forecasts provided by the αβ_WDF model is shown. The scatter plots in Figure 6 and Figure 7 confirm the accuracy of the forecast provided by the αβ_WDF model, as all of the points are distributed around the 45° line, which represents a perfect correspondence between observed and forecasted demands, and the forecasting accuracy remains practically constant as the time horizon changes (from 1 to 24 h): no significant variation can be noted in the width of the cloud of points, either for the year 1998 or the year 2000. The comparison in Figure 8 likewise shows that the trend in water demand predicted in the 1- and 24-h-ahead forecasts of the αβ_WDF model very closely approximate the historically observed demands.

Finally, we shall now focus our attention on the days corresponding to public holidays or special occasions.

Figure 9 shows, by way of example, the pattern of observed water demands on one holiday in the year 2000, the 1st of May (international workers’ day) and the corresponding 1-h-ahead forecast provided by the αβ_WDF model. It is evident that the forecast provided by the αβ_WDF model for this holiday is less accurate than the one shown in Figure 8, which relates to a generic weekday. More generally speaking, considering all and only the holidays falling in a year (for a total of about ten), the greater difficulty in correctly predicting demands is confirmed by the RMSE and MAE% values, which tend to be higher. In fact, considering only the holidays in the year 2000, the RMSE takes on values in the range of 8.5 to 10 L/s, respectively, for the 1- and 24-h-ahead forecasts, corresponding to MAE% values of between 10% and 13%, versus RMSE values of about 5 and 6 L/s, corresponding to MAE% values of 5.5% and 6.5%, when all the days of the year 2000 are considered.

With respect to these particular types of days, the application of the αβ_h_WDF model can provide a benefit. Indeed, as can be observed in Figure 9, which also shows the 1-h-ahead forecast provided by the αβ_h_WDF model, the estimate of the coefficients

β_{t, k}

made for the same holiday in the m = 3 previous years (1997, 1998, and 1999) (see Equation (7)) enables a more accurate forecast of the trend in water demand. This is further confirmed by the RMSE and MAE% values computed considering all and solely the public holidays in the year 2000, as they fall slightly from 8.5 and 10 L/s, respectively, for the 1- and 24-h-ahead forecasts of the αβ_WDF model—corresponding to MAE% values in the range of 10% to 13%—to 7.5 L/s and 8 L/s, respectively, for the 1- and 24-h-ahead forecasts of the αβ_h_WDF model, corresponding to MAE% values in the range of 9.5% to 10%.

5. Conclusions

In this article a new model for forecasting short-term demands in a water distribution system has been proposed. The model provides a forecast for the following 24 h using a pair of coefficients whose value is updated at every forecasting step on the basis of (a) the observed demand data pertaining to the 24 h prior to the time the forecast is made and (b) the observed data pertaining to several previous weeks for the same type of day and the same hour of the day falling within the forecasting interval. Application of the model to a real-life case and a comparison with the results provided by another short-term forecasting model based on the representation of periodic water demand patterns have revealed that the proposed αβ_WDF model has a good predictive capability over the entire forecasting time horizon considered (24 h); furthermore, it does not require calibration periods in the strict sense, being based solely on the water demands observed in the few weeks prior to the time at which the forecast is made. From an operational standpoint, this means that in cases in which long historical series of observed data are not available, the model can be used 1 or 2 months after data collection begins. Finally, again by virtue of the fact that the model does not require a static calibration of parameters, since the parameters are updated as time passes, the forecasting accuracy remains constant and high when applied to different years and/or periods. The values of the performance indicators (RMSE and MAE%) were distinctly better for the αβ_WDF model than for the pattern-based model it was compared to, which requires calibration on the basis of at least one year of data, when the comparison was made with reference to a period other than the one used for the latter model’s calibration.

Finally, it was observed that the forecasting accuracy provided by the proposed αβ_WDF model tends not to be as good when evaluated in relation to certain public holidays falling in the year; in these cases, if information is available about water demands observed during the same holidays in several previous years, the application of a variant of the model, referred to as αβ_h_WDF, enables a certain improvement in forecasting accuracy. In conclusion, its good forecasting capability over different time horizons and different periods/years, combined with the need for a small set of observed data for its implementation, make the proposed model a robust and effective water demand forecasting tool to be used in the context of real-time network management, potentially for any network size. This is due to its nature which is based on the use of data observed just prior to the time of the forecast. However, this latter aspect is currently under investigation considering case studies different from the one here examined, which regards a middle size pipe network, in order to characterize its efficacy as the number of served users changes. Results will be published in due time.

Acknowledgments

This study was carried out as part of the PRIN 2012 project “Tools and procedures for an advanced and sustainable management of water distribution systems”, No. 20127PKJ4X, funded by MIUR, of the FIR2016 project “Metodologie gestionali innovative per le reti urbane di distribuzione idrica” funded by University of Ferrara, and under the framework of the Terra&Acqua Tech Laboratory.

Author Contributions

Each of the authors contributed to the design, analysis, and writing of the study.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Example of Numerical Estimation of the Coefficients $α_{t}$ and $β_{t, k}$

A numerical example of estimation of the coefficients

α_{t}

and

β_{t, k}

(see Equations (5) and (6)) is here presented.

With reference to Figure 1, let us assume that the forecasting time t is at hour h = 2 of the day i. The forecasting lead time is assumed to be equal to k = 3 h, and thus the value of the water consumption to be forecasted is

Q_{t + 3}^{f o r}

. Let us assume a moving window made up of n = 3 weeks. With reference to Equation (4), the hour s_j (with j = 1,2,3) is thus

{s_{1}; s_{2}; s_{3}} = {t - 7 \times 24; t - 7 \times 24 \times 2; t - 7 \times 24 \times 3}

(see Figure A1).

With reference to Equation (5), each

{\bar{D}}_{s_{j}}^{o b s}

is the observed average water demand in the 24 h following the hour s_j (with j = 1, 2, 3) and

{\bar{D}}_{s_{j} - 24}^{o b s}

is the observed average water demand in the 24 h preceding the hour s_j (i.e., from s_j − 24 to s_j) (see also Figure 1). Thus, considering the water consumptions provided in Figure A1, the values of

{\bar{D}}_{s_{j}}^{o b s}

and

{\bar{D}}_{s_{j} - 24}^{o b s}

for j = 1, 2, 3 are

{49.05; 48.45; 48.51}

and

{46.09; 46.08; 46.15}

respectively.

Consequently, according to Equation (5),

α_{t}

is:

α_{t} = \frac{1}{3} \sum_{s_{j} = s_{1}}^{s_{3}} \frac{{\bar{D}}_{s_{j}}^{o b s}}{{\bar{D}}_{s_{j} - 24}^{o b s}} = 1.056

.

With reference to Equation (6), each

Q_{s_{j} + k}^{o b s}

is the hourly water demand in the k-th hour following the hour s_j (with j = 1, 2, 3), and thus, considering the water consumptions provided in Figure A1, the values of

Q_{s_{j} + k}^{o b s}

for k = 3 are

{24.72; 24.71; 24.77}

.

Consequently, according to Equation (6),

β_{t, k}

is:

β_{t, k} = \frac{1}{3} \sum_{s_{j} = s_{1}}^{s_{3}} \frac{Q_{s_{j} + k}^{o b s}}{{\bar{D}}_{s_{j}}^{o b s}} = 0.508

.

Figure A1. Hourly water consumption of the days {i, i − 1,…, i − 6, i − 7, i − 8,…, i − 13, i − 14, i − 15,…, i − 20, i − 21, i − 22}. The values used to compute

α_{t}

and

β_{t, k}

at the forecasting time t considering a lead time k = 3 are highlighted using the halftone screens provided in the legend.

Figure A1. Hourly water consumption of the days {i, i − 1,…, i − 6, i − 7, i − 8,…, i − 13, i − 14, i − 15,…, i − 20, i − 21, i − 22}. The values used to compute

α_{t}

and

β_{t, k}

at the forecasting time t considering a lead time k = 3 are highlighted using the halftone screens provided in the legend.

References

Donkor, E.; Mazzuchi, T.; Soyer, R.; Alan Roberson, J. Urban Water Demand Forecasting: Review of Methods and Models. J. Water Resour. Plan. Manag. 2014, 140, 146–159. [Google Scholar] [CrossRef]
Billings, B.; Jones, C. Forecasting Urban Water Demand, 2nd ed.; American Waterworks Association: Denver, CO, USA, 2008. [Google Scholar]
Alhumoud, J.M. Freshwater consumption in Kuwait: Analysis and forecasting. J. Water Supply Res. Technol. AQUA 2008, 57, 279–288. [Google Scholar] [CrossRef]
Mohamed, M.; Al-Mualla, A. Water demand forecasting in Umm Al-Quwain using the constant rate model. Desalination 2010, 259, 161–168. [Google Scholar] [CrossRef]
Feng, S.; Li, L.; Duan, Z.; Zhang, J. Assessing the impacts of South-to-North Water Transfer Project with decision support systems. Decis. Support Syst. 2007, 42, 1989–2003. [Google Scholar] [CrossRef]
Lee, S.; Wentz, E.; Gober, P. Space-time forecasting using soft geostatistics: A case study in forecasting municipal water demand for Phoenix, Arizona. Stoch. Environ. Res. Risk Assess. 2010, 24, 283–295. [Google Scholar] [CrossRef]
Polebitski, A.; Palmer, R.; Waddell, P. Evaluating water demands under climate change and transitions in the urban environment. J. Water Resour. Plan. Manag. ASCE 2011, 137, 249–257. [Google Scholar] [CrossRef]
Wu, L.; Zhou, H. Urban water demand forecasting based on HP filter and fuzzy neural network. J. Hydroinf. 2010, 12, 172–184. [Google Scholar]
Zhou, S.; McMahon, T.; Walton, A.; Lewis, J. Forecasting operational demand for an urban water supply zone. J. Hydrol. 2002, 259, 189–202. [Google Scholar] [CrossRef]
Alvisi, S.; Franchini, M.; Marinelli, A. A short-term, pattern-based model for water-demand forecasting. J. Hydroinf. 2007, 9, 39–50. [Google Scholar] [CrossRef]
Gato, S.; Jayasuriya, N.; Roberts, P. Forecasting residential water demand: Case study. J. Water Resour. Plan. Manag. ASCE 2007, 133, 309–319. [Google Scholar] [CrossRef]
Herrera, M.; Torgo, L.; Izquierdo, J.; Perez-Garcia, R. Predictive models for forecasting hourly urban water demand. J. Hydrol. 2010, 387, 141–150. [Google Scholar] [CrossRef]
Bakker, M.; Vreeburg, J.H.G.; Schagen, K.M.V.; Rietveld, L.C. A fully adaptive forecasting model for short-term drinking water demand. Environ. Model. Softw. 2013, 48, 141–151. [Google Scholar] [CrossRef]
Alvisi, S.; Franchini, M. Assessment of predictive uncertainty within the framework of water demand forecasting using the Model Conditional Processor (MCP). Urban Water J. 2017, 14, 1–10. [Google Scholar] [CrossRef]
Adamowski, J.; Chan, H.F.; Prasher, S.O.; Ozga-Zielinski, B.; Sliusarieva, A. Comparison of multiple linear and nonlinear regression, autoregressive integrated moving average, artificial neural network, and wavelet artificial neural network methods for urban water demand forecasting in Montreal, Canada. Water Resour. Res. 2012, 48. [Google Scholar] [CrossRef]
Puleo, V.; Fontanazza, C.M.; Notaro, V.; de Marchis, M.; Freni, G.; La Loggia, G. Pumps as turbines (PATs) in water distribution networks affected by intermittent service. J. Hydroinf. 2014, 16, 259–271. [Google Scholar] [CrossRef]
De Marchis, M.; Freni, G. Pump as turbine implementation in a dynamic numerical model: Cost analysis for energy recovery in water distribution network. J. Hidroinf. 2015, 17, 347–360. [Google Scholar] [CrossRef]
Al-Zahrani, M.A.; Abo-Monasar, A. Urban Residential Water Demand Prediction Based on Artificial Neural Networks and Time Series Models. Water Resour. Manag. 2015, 29, 3651–3662. [Google Scholar] [CrossRef]
Dos Santos, C.C.; Pereira Filho, A.J. Water Demand Forecasting Model for the Metropolitan Area of Sao Paulo, Brazil. Water Resour. Manag. 2014, 28, 4401–4414. [Google Scholar] [CrossRef]
Aly, A.; Wanakule, N. Short-term forecasting for urban water consumption. J. Water Resour. Plan. Manag. ASCE 2004, 130, 405–410. [Google Scholar] [CrossRef]
Tiwari, M.K.; Adamowski, J.F. Medium-term urban water demand forecasting with limited data using an ensemble wavelet-bootstrap machine-learning approach. J. Water Resour. Plan. Manag. 2015, 141. [Google Scholar] [CrossRef]
Liu, J.; Savenije, H.; Xu, J. Forecast of water demand in Weinan City in China using WDF-ANN model. Phys. Chem. Earth 2003, 28, 219–224. [Google Scholar] [CrossRef]
Ghiassi, M.; Zimbra, D.; Saidane, H. Urban water demand forecasting with a dynamic artificial neural network model. J. Water Resour. Plan. Manag. 2008, 134, 138–146. [Google Scholar] [CrossRef]
Babel, M.S.; Shinde, V.R. Identifying Prominent Explanatory Variables for Water Demand Prediction Using Artificial Neural Networks: A Case Study of Bangkok. Water Resour. Manag. 2011, 25, 1653–1676. [Google Scholar] [CrossRef]
Romano, M.; Kapelan, Z. Adaptive water demand forecasting for near real-time management of smart water distribution systems. Environ. Model. Softw. 2014, 60, 265–276. [Google Scholar] [CrossRef] [Green Version]
Msiza, I.S.; Nelwamondo, F.V.; Marwala, T. Water demand prediction using artificial neural networks and support vector regression. J. Comput. 2008, 3, 1–8. [Google Scholar] [CrossRef]
Shabani, S.; Yousefi, P.; Adamowski, J. Intelligent Soft Computing Models in Water Demand Forecasting. In Water Stress in Plants; Rahman, I.M.M., Begum, Z.A., Hasegawa, H., Eds.; InTech: Rijeka, Croatia, 2016; pp. 99–117. [Google Scholar]
Lertpalangsunti, N.; Chan, C.W.; Mason, R.; Tontiwachwuthikul, P. Toolset for construction of hybrid intelligent forecasting systems: Application for water demand prediction. Artif. Intell. Eng. 1999, 13, 21–42. [Google Scholar] [CrossRef]
Rafiq, M.; Bugmann, G.; Easterbrook, D. Neural Network Design for Engineering Applications. Comput. Struct. 2001, 79, 1541–1552. [Google Scholar] [CrossRef]
Alvisi, S.; Franchini, M. A Procedure for the Design of District Metered Areas in Water Distribution Systems. Procedia Eng. 2014, 70, 41–50. [Google Scholar] [CrossRef]
Alfares, H.K.; Nazeeruddin, M. Electric load forecasting: Literature survey and classification of methods. Int. J. Syst. Sci. 2002, 33, 23–34. [Google Scholar] [CrossRef]
Monedero, I.; Biscarri, F.; Guerrero, J.I.; Peña, M.; Roldán, M.; Leòn, C. Detection of Water Meter Under-Registration Using Statistical Algorithms. J. Water Resour. Plan. Manag. 2016, 142, 1–10. [Google Scholar] [CrossRef]
Monedero, I.; Biscarri, F.; Guerrero, J.I.; Roldán, M.; Leòn, C. An Approach to Detection of Tampering in Water Meters. Procedia Comput. Sci. 2015, 60, 413–421. [Google Scholar] [CrossRef]

Figure 1. Time structure of the proposed model named αβ Water Demand Forecasting (αβ_WDF).

Figure 2. Root Mean Square Error (RMSE) of the αβ_WDF and Patt_WDF models for the years 1998 and 2000 for different forecasting time horizons.

Figure 3. Mean Absolute Error (MAE%) of the αβ_WDF and Patt_WDF models for the years 1998 and 2000 for different forecasting time horizons.

Figure 4. Comparison of the forecasting error frequency histograms of the αβ_WDF and Patt_WDF models for the year 1998 and the time horizons of (a) 1 h; (b) 6 h; (c) 12 h; and (d) 24 h.

Figure 5. Comparison of the forecasting error frequency histograms of the αβ_WDF and Patt_WDF models for the year 2000 and the time horizons of (a) 1 h; (b) 6 h; (c) 12 h; and (d) 24 h.

Figure 6. Scatter plot of the (a) 1-h-ahead; (b) 6-h-ahead; (c) 12-h-ahead; and (d) 24-h-ahead forecasts provided by the αβ_WDF model for the year 1998.

Figure 7. Scatter plot of the (a) 1-h-ahead; (b) 6-h-ahead; (c) 12-h-ahead; and (d) 24-h-ahead forecasts provided by the αβ_WDF model for the year 2000.

Figure 8. Comparison between the observed water demand pattern for a generic day in the year 2000 (23 January) and the 1- and 24-h-ahead forecasts provided by the αβ_WDF model. Q is the discharge flowing in the main pipe that connects the tank to the water distribution system, thus representing the total water demand of the network.

Figure 9. Comparison between the observed water demand pattern for a holiday in the year 2000 (1 May, international workers’ day) and the 1-h-ahead forecasts provided by the αβ_WDF and αβ holiday Water Demand Forecasting (αβ_h_WDF) models. Q is the discharge flowing in the main pipe that connects the tank to the water distribution system, thus representing the total water demand of the network.

© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Pacchin, E.; Alvisi, S.; Franchini, M. A Short-Term Water Demand Forecasting Model Using a Moving Window on Previously Observed Data. Water 2017, 9, 172. https://doi.org/10.3390/w9030172

AMA Style

Pacchin E, Alvisi S, Franchini M. A Short-Term Water Demand Forecasting Model Using a Moving Window on Previously Observed Data. Water. 2017; 9(3):172. https://doi.org/10.3390/w9030172

Chicago/Turabian Style

Pacchin, Elena, Stefano Alvisi, and Marco Franchini. 2017. "A Short-Term Water Demand Forecasting Model Using a Moving Window on Previously Observed Data" Water 9, no. 3: 172. https://doi.org/10.3390/w9030172

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Short-Term Water Demand Forecasting Model Using a Moving Window on Previously Observed Data

Abstract

1. Introduction

2. The Proposed Model

2.1. Estimation of the Coefficients $α_{t}$ and $β_{t, k}$

2.2. Considerations on the Length n of the Moving Window

2.3. Model with Holidays and Special Occasions

3. Case Study

4. Analysis and Discussion of the Results

5. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

Appendix A. Example of Numerical Estimation of the Coefficients $α_{t}$ and $β_{t, k}$

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

A Short-Term Water Demand Forecasting Model Using a Moving Window on Previously Observed Data

Abstract

1. Introduction

2. The Proposed Model

2.1. Estimation of the Coefficients α t and β t , k

2.2. Considerations on the Length n of the Moving Window

2.3. Model with Holidays and Special Occasions

3. Case Study

4. Analysis and Discussion of the Results

5. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

Appendix A. Example of Numerical Estimation of the Coefficients α t and β t , k

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2.1. Estimation of the Coefficients $α_{t}$ and $β_{t, k}$

Appendix A. Example of Numerical Estimation of the Coefficients $α_{t}$ and $β_{t, k}$