1. Introduction
In the last decade, a series of unprecedented extreme hydrological events have occurred, some of which have been attributed to climate change [
1]. In fact, global projections indicate a positive correlation between global warming and the risk of extreme rainfall and floods [
2]. This intensification of the hydrological cycle raises growing concerns about future floods and their impact on large cities where exposure has also increased [
3]. Adequate adaptation solutions are required to increase resilience and reduce vulnerability of large urban centers. In this sense, the development of early warning technologies is crucial because it allows the localization of resources, the warning of communities at risk, and the initiation of disaster response operations [
4].
The great difficulty in developing forecasting tools and early warning systems in the field of hydrology is that the physical mechanisms involved in the runoff processes are non-linear and extremely difficult to model, so that runoff forecasting models have a high uncertainty [
5]. Furthermore, an early warning system should be able to give accurate warning for extreme events, but also should be able to give accurate forecasts for small flows to avoid false positive warnings that eventually reduce the reliability of the system [
6]. Given that the early warning systems give more importance to the simplicity and robustness of the forecasting model rather than an accurate description of the various internal sub-processes, it is certainly worth considering hybrid models that combine physical-based models with data-driven approaches for improving real-time runoff forecasts [
7,
8,
9,
10]. Another difficulty in predicting hydrological extremes is that to get ahead of an extreme event requires the implementation of weather forecasting models coupled to hydrological models. The development of this type of coupled systems has not had much development, mainly because the global climate models (GCMs) have had very coarse spatial resolution; so that, they did not have the ability to forecast extreme events with the degree of precision that is required in the scale of hydrological systems [
11]. However, today there are high resolution global weather forecast models, which in conjunction with novel downscaling techniques, allows the development of model-generated weather input data for hydrological forecasting models [
12,
13].
The development of Machine Learning (ML) and Deep Learning (DL) techniques, on the other hand, have recently boosted the use of data-driven approaches as a complement to traditional methods for the prediction and forecasting of hydrological variables, in which Artificial Neural Networks (ANN) are the most commonly used technique for this task [
8,
14,
15,
16,
17]. Furthermore, an improvement of this approach has been achieved with neuro-fuzzy systems, which combines the human-like reasoning of a fuzzy system with the learning capability of neural networks [
18]. This hybrid scheme has shown good results for rainfall-runoff modeling when comparing with the ANN approach alone [
19,
20]. Nevertheless, approaches such as ANN are not exactly adequate for the analysis of sequential data. To address this challenge, various methods have been developed to keep a certain memory of the previous state of the system, thus allowing the prediction to use not only the present information but also the previous state. One of the most successful techniques is the Long-Short Term Memory (LSTM) cells, which are based on Recurrent Neural Networks (RNN). This type of model has been widely used to achieve state-of-the-art results on sequence modeling tasks such as handwriting recognition [
21,
22], speech recognition [
23], time series prediction [
24,
25,
26] and robot control [
27], among others. In this sense, the new learning algorithms and architectures that are currently being developed for neural networks allow the acceleration of the development of hydrological forecasting and early warning tools [
28].
There are not many applications of DL in the field of hydrology [
29], among which DL is included for daily flows in a seasonal time-frame only based on flow observations [
30], and rainfall-runoff models that predict hourly flow only based on observed precipitation [
25]. Furthermore, a comparison between a physical-based model with four recurrent neural networks is presented in [
31], showing that data-driven models perform better. For the case of hydrological extremes, LSTM networks have been used in rainfall-runoff models to predict daily flow in several basins in the USA [
32], showing an improvement in predictions with LSTM networks compared to a traditional hydrological model; however, other studies argued that these results could have been improved by using the specific hydro-meteorological and geomorphological variables of each basin [
33]. Furthermore, hydrological applications of DL have been usually developed based on real time field observations of rainfall and flow (see [
33] and references therein), thus limiting the runoff forecast to the timescale defined by the catchment concentration time (e.g., [
6]). One alternative to increase the runoff forecast timescale is to link the runoff forecasts with near-future global meteorological forecasts that are capable of predicting rainfall, air temperature, and other input variables in a longer timescale. To our understanding, this link has not been previously explored.
In this article, we used deep learning (DL) for weather-runoff forecasting in región Metropolitana of Chile, a large urban area in a valley at the foot of the Andes Mountains, with more than seven million inhabitants. The final goal of this research is to develop an effective forecasting system to provide timely information and support in real-time decision making, as an adaptation to hydrological extremes in a warming climate. For this purpose, we implemented a coupled model of near-future global meteorological forecast with short-range runoff forecasting systems based on DL. Starting from a traditional hydrological model, we defined the conceptual model and the hydro-meteorological variables that were used in the training of the data-driven models. The meteorological variables were obtained through statistical scaling of the Global Forecast System (GFS). Two data-driven models were implemented for runoff forecasts, a simple Artificial Neural Networks (ANN) and a Deep Learning (DL) approach based on Long-Short Term Memory (LSTM) cells. With these two approaches, we predicted at once the entire hourly flow time-series for 1 day in the past and 3 days in the future with no need for the entire record of past flow measurements, just the flow at t = 0. This was a constraint required to be considered because of it is not possible to rely on always having a continuum time series of real-time observations, so we decided to minimize the real-time information needed for forecasting the flow. Finally, the same data was used in both data-driven techniques, in which the data were separated into training, validating and testing data sets, and the model skills were compared through different metrics.
As far as we know, the novelties of this contribution that have not been previously published are: (i) We present an approach based on a DL model that is coupled a near-future global meteorological forecast with a short-range runoff forecasting system. (ii) We use the complete set of hydro-meteorological and geomorphological variables of a complex hydrological model for training the data-driven models. (iii) Our approach is capable of predicting an output time-series with a finer temporal resolution than the input time-series. This temporal downscaling allows us to precisely locate the time at which the peak flow will occur. (iv) Finally, our approach predicts at once the entire hourly flow time-series for 1 day in the past and 3 days in the future with no need of the entire record of past flow measurements, just the flow at t = 0.
This paper is organized as follows: In the next section we describe the two algorithms used in this article (ANN and DL), present the study site composed of nine flow stations, and detail the methodology that couples meteorological forecast with data-driven weather-runoff forecast models. Two alternatives for the data-driven weather-forecast model are described that aim to predict the detailed hourly flow time-series for the following three days—ANN and DL approaches. In the results section we present the results in terms of the model’s performance in predicting the flow conditions for the following 3 days, in which we compare the performance of the two data-driven approaches. Finally, in the discussion and conclusion sections, we highlight and summarize the key features and limitations of the proposed methodology.
4. Discussion
In this article, we detailed a methodology that couples a process-based meteorological model that forecasts atmospheric conditions in the near future, with data-driven weather-runoff forecast models, which use these meteorological inputs for predicting hourly flow time-series in the near future. We implemented this methodology in región Metropolitana of Chile, for which two data-driven techniques were used for the weather-runoff forecast models—a simple ANN approach and a DL approach based on LSTM cells.
The data-driven weather-runoff models were designed based on the following three central ideas: (i) The near future flow (3 days) in the studied flow stations responds to both the precipitation rate of the storm, but also to changes in the watershed area or rate of snow melt (see
Figure 11f). Consequently, a rainfall-runoff scheme (e.g., [
25]) is not enough for predicting near-future flow, which justifies the weather-runoff concept that also uses air temperature and relatively humidity and the 0 °C isotherm for predicting the near-future flow. Both, air temperature and relatively humidity were important variables that improved the performance of the weather-runoff model in the preliminary exercises. Particularly, air temperature at 3500 m a.s.l. can be associated with snow melt rate, whereas air humidity at the 0 °C isotherm controls the limit between liquid and solid precipitation [
42]. (ii) Real-time flow observations are in general available, but it is not possible to rely on the availability of a continuous measured time-series for forecasting the near-future flow, especially during large storms. As a consequence, the weather-runoff forecast model is predominantly based on the (very reliable) GFS-NCEP model, although the flow at
is (always) needed as input. In case of not having flow observation for
, this flow can be estimated using simple cross correlations with the other flow stations. (iii) Early warning systems should be able to give accurate warning for extreme events, but should also be able to give accurate forecasts for small flows to avoid false positive warnings that eventually reduce the reliability of the system [
6,
43]. In this context, both data-driven approaches used the entire set of flow observations for training, validating and testing the forecast models, without paying specific attention to high flow events which are the important ones in early warning systems. For example, the range of predicted flows in the Maipo en el Manzano station varies between 25 to 1100 m
3/s.
With respect to the performance of data-driven weather-runoff forecast models, it is possible to argue that both approaches are accurate for predicting
and
; however, flow prediction based on the DL approach is far more accurate than the flow prediction based on ANN approach, as shown in
Table 5. This was pointed out by [
25] with a rainfall-runoff model, and it is verified in this new approach with a weather-runoff model. The DL approach has an excellent performance, with values of RMSE 5%, compared to RMSE 15.5% for ANN approach for the prediction of the peak flow in station 1 (
Table 5). These results are explained in the fact that, approaches such as ANN are not exactly adequate for the analysis of sequential data, such as the flow time series. To address the forecast of sequential data, is required to keep a certain memory of the previous state of the system, thus allowing the prediction using not only the present information but also the previous state. Since ANN do not have a temporal memory, they have difficulties in recognising temporal changes. This is reflected in greater error when predicting flow floods and therefore tend to underestimate the flow, as shown in
Figure 11. In this context, one of the most successful techniques based on Recurrent Neural Networks (RNN) is the DL approach based on LSTM cells [
24,
25,
26]. Although we use only one value of the flow as initial condition, the previous state for DL approach is obtaining through the seven sequences of the meteorological and geomorphological inputs of the previous days, which is used in the DL approach for generating the output sequential data.
Furthermore, another important feature of the proposed architecture of the DL weather-runoff forecast (
Figure 5b) is that it is capable of predicting an output time-series with a finer temporal resolution (1 h) than for the input time-series (6 h), thus enabling the use of DL as a temporal downscaling technique. This allows to precisely locating the time at which the peak flow will occur, which gives the system an early warning advantage, as shown in
Figure 12 and
Table 6. The DL weather-runoff model is capable of capturing the peak flow, the time at which it will occur and the flow duration. All of this information would be available from three days in advance, which is very useful for allocating resources and warning the communities at risk.
Finally, it is important to notice that the methodology that was implemented for the nine flow stations in Maipo and Mapocho rivers, can, in principle, be scaled to the entire set of flow stations with real-time measurement in Chile (approximately 450 flow stations). The coupling between the GFS-NCEP model forecast and the DL weather-runoff forecast model may not vary; however, input variables to DL weather-runoff forecast model should be different in flow stations located in the desert of northern Chile (latitude: −22° S) to the flow stations located in the austral part of southern Chile (latitude: −45° S). For example, presumably, the air temperature associated to glacier melt should not be a relevant input data in northern Chile where there are no glaciers.
5. Conclusions
The intensification of the hydrological cycle because of the global warming raises growing concerns about future floods and their impact on large cities where exposure has also increased, such as the región Metropolitana of central Chile. Adequate water adaptation solutions as early warning systems are crucial. Given that the early warning systems give more importance to the simplicity and robustness of the forecasting model rather than an accurate description of the various internal sub-processes, it is certainly worth considering data-driven approaches for improving real-time runoff forecasts.
In this article, we implemented a coupled model of a near-future global meteorological forecast with short-range runoff forecasting systems based on DL, showing that DL is a valuable technique that allows the acceleration of the development of hydrological forecasting and early warning tools. The coupling between meteorological forecasts and the DL weather-runoff forecast model, on the other hand, are able to satisfy two basic requirements that any early warning system should have: The forecast should be given in advance in a time-frame larger than catchment concentration time, and should be accurate and reliable. In this context, meteorological forecasts are accurate and reliable in predicting near-future meteorological conditions, which feed the DL weather-runoff forecast, thus enabling a reliable flow forecast in advance.
Furthermore, DL significantly improves runoff forecasts when compared with a simple ANN approach, being accurate in predicting the time-evolution of output variables, with an error for predicting the peak flow of RMSE 5% compared to RMSE 15.5% for the ANN approach, which is adequate to warn communities at risk and initiate disaster response operations. Another interesting aspect of this approach is that it is capable of predicting an output time-series with a finer temporal resolution than the input time-series. This temporal downscaling allows us to precisely locate the time at which the peak flow will occur. Finally, the real-time implementation of these DL models can be found in the open access webpage
www.AlertaHidrica.com.