Water Level Forecasting Combining Machine Learning and Ensemble Kalman Filtering in the Danshui River System, Taiwan
Abstract
:1. Introduction
2. Materials and Methods
2.1. Description of Study Area
2.2. Data Collection
2.2.1. Geographical Data
2.2.2. Rainfall Data
2.2.3. Water Level Data
2.3. Flash Flood Forecasting Model in River
2.4. Multiple Additive Regression Trees (MART)
- 1st step: Input the training factors z, the fitted values y, and the loss function into the equation [29].
- 2nd step: Before training gradient boosting trees, it is essential to determine an initial prediction value. The optimal initial prediction can be obtained by solving the model to find the value that minimizes the loss function [30,31]. Utilizing this value as the initial prediction enhances the model’s predictive performance.
- 3rd step: By performing M computations across M trees, the residuals between each prediction and the actual values can be calculated. The training data is then grouped based on the characteristics of the tree type, and the residuals are updated by minimizing the loss function [32]. To update each data point with the product of the updated residual and the learning rate, the prediction for the mth iteration is represented as follows:
- Compute the residual between each prediction value and the actual value.
- 2.
- Fit a regression tree model on the target variable γim and split it into multiple regions Rjm, j = 1, 2, ….., Jm.
- 3.
- For each region Rjm, calculate the residual rim,i = 1, 2, …., Jm, where the residual is obtained by minimizing the loss function.
- 4.
- To obtain the prediction for the mth iteration (fm(x)), add the product of the updated residual and the learning rate to the previous prediction value [21].
2.5. Ensemble Kalman Filter (EnKF)
2.5.1. Error Covariance and Analysis Equations
2.5.2. Ensemble Square Root Filtering
2.5.3. Covariance Localization
2.6. Comparison Criteria
3. Results
3.1. Determination of Hyperparameter in MART
3.2. Water Level Forecasting
3.2.1. Model Calibration
3.2.2. Model Validation
4. Discussion
4.1. Water Level Forecast with Module 1
4.2. Water Level Forecast with Module 2
4.3. Water Level Forecast with Module 3
4.4. Advantages, Limitations, and Future Work
5. Conclusions
- (1)
- Hyperparameters in MART were tuned using the grid search method, and the performance of adjusted hyperparameters was compared against default values using RMSE and MAE metrics. Results demonstrated that water level forecasts with adjusted parameters outperformed those with default parameters.
- (2)
- Model calibration and validation indicated that simulated water levels at present time generally aligned with forecasted levels. However, as the forecast lead time extended, the accuracy of the corrections diminished, leading to increasing divergence between forecasted and observed levels.
- (3)
- A 95% confidence interval generated through probabilistic forecasting was utilized to explore the potential range of forecasted water levels. Findings indicated that with smaller uncertainty in boundary conditions, the confidence interval was more accurate and often encompassed the actual water level. However, as the forecast lead time increased, uncertainties in boundary conditions and other factors grew, resulting in an expanded confidence interval.
- (4)
- Comparison of RMSE values for water level forecasts from present time to 3 h lead time at Tu-Ti-Kung-Pi, Taipei Bridge, and Bailing Bridge under modules one, two, and three revealed that the RMSE value at Tu-Ti-Kung-Pi was consistently more reasonable than those at Taipei Bridge and Bailing Bridge. This is attributed to the significant tidal influence and reduced impact from internal boundaries at Tu-Ti-Kung-Pi. After accounting for uncertainties in upstream and downstream boundaries, the primary source of uncertainty affecting Taipei Bridge and Bailing Bridge was the forecasted water level at the internal boundary.
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Hsu, M.H.; Fu, J.C.; Liu, W.C. Flood routing with real-time stage correction method for flash flood forecasting in the Tanshui River, Taiwan. J. Hydrol. 2003, 283, 267–280. [Google Scholar] [CrossRef]
- Hsu, M.H.; Lin, S.H.; Fu, J.C.; Chen, A.S. Longitudinal stage profiles forecasting in rivers for flash floods. J. Hydrol. 2010, 388, 426–437. [Google Scholar] [CrossRef]
- Monro, J.C.; Anderson, E.A. National weather service river forecasting system. J. Hydraul. Div. 1974, HY5, 621–630. [Google Scholar] [CrossRef]
- Corradini, C.; Melone, F.; Ubertin, L. A semi-distributed adaptive model for real-time flood forecasting. Water Resour. Bull. 1986, 22, 1031–1038. [Google Scholar] [CrossRef]
- Solomon, S.I.; Bosso, E.; Dsorio, C.; Melo de Moraes, H.; Serrano, A. Flood forecasting for Tucurui Hydroelectrical plant, Brazil. Water Resour. Bull. 1986, 22, 209–217. [Google Scholar] [CrossRef]
- Goppert, H.; Ihringer, J.; Plate, E.J. Flood forecast model for improved reservoir management in the Lenne River catchment, Germany. Hydrol. Sci. 1988, 43, 215–241. [Google Scholar] [CrossRef]
- Förster, S.; Kneis, D.; Gocht, M.; Bronstert, A. Flood risk reduction by the use of retention areas at the Elbe River. Int. J. River Basin Manag. 2005, 3, 21–29. [Google Scholar] [CrossRef]
- Hsu, M.H.; Fu, J.C.; Liu, W.C. Dynamic routing model with real-time roughness updating for flood forecasting. J. Hydraul. Eng. 2006, 132, 605–619. [Google Scholar] [CrossRef]
- Kimura, N.; Hsu, M.H.; Tsai, M.Y.; Tsao, M.C.; Yu, S.L.; Tai, A. A river flash flood forecasting coupled with ensemble Kalman filter. J. Flood Risk Manag. 2016, 9, 178–192. [Google Scholar] [CrossRef]
- Patel, S.B.; Mehta, D.J.; Yadav, S.M. One dimensional hydrodynamic flood modeling for Ambica River, South Gujarat. J. Emerg. Technol. Innov. Res. 2018, 5, 595–601. [Google Scholar]
- Belyakova, P.A.; Moreido, V.M.; Tsyplenkov, A.S.; Amerbaev, A.N.; Grechishnikova, D.A.; Kurochkina, L.S.; Filippov, V.A.; Makeev, M.S. Forecasting water levels in Krasnodar Krai Rivers with the use of machine learning. Water Resour. 2022, 49, 10–22. [Google Scholar] [CrossRef]
- Kim, D.; Park, J.; Han, H.; Lee, H.; Kim, H.S.; Kim, S. Application of AI-based models for flood water level forecasting and flood risk classification. KSCE J. Civ. Eng. 2023, 27, 3163–3174. [Google Scholar] [CrossRef]
- Li, S.; Yang, J. Improved river water-stage forecasts by ensemble learning. Eng. Comput. 2023, 39, 3293–3311. [Google Scholar] [CrossRef]
- Mihel, A.M.; Lerga, J.; Krvavica, N. Estimating water levels and discharges in tidal rivers and estuaries: Review of machine learning approaches. Environ. Model. Softw. 2024, 176, 106033. [Google Scholar] [CrossRef]
- Wu, X.L.; Xiang, X.H.; Wang, C.H.; Chen, X.; Xu, C.Y.; Yu, Z. Coupled hydraulic and Kalman filter model for real-time correction of flood forecast in the Three Gorges interzone of Yangtze River, China. J. Hydrol. Eng. 2013, 18, 1416–1425. [Google Scholar] [CrossRef]
- Chen, J.C.; Chang, C.H.; Wu, S.J.; Hsu, C.T.; Lien, H.C. Real-time correction of water stage forecast using combination of forecasted errors by time series models and Kalman filter method. Stoch. Environ. Res. Risk Assess. 2015, 29, 1903–1920. [Google Scholar]
- Wang, X.; Babovic, V. Application of hybrid Kalman filter for improving water level forecast. J. Hydroinform. 2016, 18, 773–790. [Google Scholar] [CrossRef]
- Barthélémy, S.; Ricci, S.; Rochoux, M.C.; Le Pape, E.; Thual, O. Ensemble-based data assimilation for operational flood forecasting-On the merits of state estimation for 1D hydrodynamic forecasting through the example of the “Adour Maritime” river. J. Hydrol. 2017, 552, 210–224. [Google Scholar] [CrossRef]
- Yu, L.; Tan, S.K.; Chua, L.H. Online ensemble modeling for real time water level forecasts. Water Resour. Manag. 2017, 31, 1105–1119. [Google Scholar] [CrossRef]
- Lee, H.; Shen, H.; Noh, S.J.; Kim, S.; Seo, D.J.; Zhang, Y. Improving flood forecasting using conditional bias-penalized ensemble Kalman filter. J. Hydrol. 2019, 575, 596–611. [Google Scholar] [CrossRef]
- Fu, J.C.; Huang, H.Y.; Jang, J.H.; Huang, P.H. River stage forecasting using multiple additive regression trees. Water Resour. Manag. 2019, 33, 4491–4507. [Google Scholar] [CrossRef]
- Jang, J.H.; Lee, K.F.; Fu, J.C. Improving river-stage forecasting using hybrid models based on the combination of multiple additive regression trees and Runge-Kutta schemes. Water Resour. Manag. 2022, 36, 1123–1140. [Google Scholar] [CrossRef]
- Liu, W.C.; Hsu, M.H.; Wu, C.R.; Wang, C.F.; Kuo, A.Y. Modeling salt water intrusion in Tanshui River estuarine system-Case-study contrasting now and then. J. Hydraul. Eng. 2004, 130, 849–859. [Google Scholar] [CrossRef]
- Young, C.C.; Liu, W.C.; Liu, H.M. Uncertainty assessment for three-dimensional and fecal coliform modeling in Danshuei River estuarine system: The influence of first-order parametric decay reaction. Mar. Pollut. Bull. 2023, 193, 115220. [Google Scholar] [CrossRef]
- Liu, W.C.; Liu, H.M.; Young, C.C.; Huang, W.C. The influence of freshwater discharge and wind forcing on dispersal of river plumes using a three-dimensional circulation model. Water 2022, 14, 429. [Google Scholar] [CrossRef]
- Tsao, M.C. A River Flood Forecast Model with Data Assimilation Based on Ensemble Kalman Filter. Master’s Thesis, Department of Bioenvironmental Systems Engineering, National Taiwan University, Taiwan, 2011. [Google Scholar]
- Yu, S.L. River Flood Ensemble Forecast Model. Master’s Thesis, Department of Bioenvironmental Systems Engineering, National Taiwan University, Taiwan, 2012. [Google Scholar]
- Amein, M.; Fang, C.S. Implicit flood routing in natural channel. J. Hydraul. Div. ASCE 1970, 96, 2481–2500. [Google Scholar] [CrossRef]
- Huber, P.J. Robust estimation of a location parameter. Ann. Math. Stat. 1964, 53, 73–101. [Google Scholar] [CrossRef]
- Friedman, J.H.; Hastie, T.; Tibshirani, R. Additive logistic regression: A statistical view of boosting (with discussion). Ann. Stat. 2000, 28, 337–407. [Google Scholar] [CrossRef]
- Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
- Friedman, J.H.; Meulman, J.J. Multiple additive regression trees with application in epidemiology. Stat. Med. 2003, 22, 1365–1381. [Google Scholar] [CrossRef]
- Burgers, G.; van Leeuwen, P.J.; Evensen, G. Analysis scheme in the ensemble Kalman filter. Mon. Weather Rev. 1998, 126, 1719–1724. [Google Scholar] [CrossRef]
- Whitaker, J.S.; Hamill, T.M. Ensemble data assimilation without perturbed observations. Mon. Weather Rev. 2002, 130, 1913–1924. [Google Scholar] [CrossRef]
- Hamill, T.M.; Whitaker, J.S.; Snyder, C. Distance-dependent filtering pf background error covariance in an ensemble Kalman filter. Mon. Weather Rev. 2001, 129, 2776–2790. [Google Scholar] [CrossRef]
- Gaspari, G.; Cohn, S.E. Construction of correlation functions in two and three dimensions. Q. J. R. Meteorol. Soc. 1999, 125, 723–757. [Google Scholar] [CrossRef]
- Nash, J.E.; Sutcliffe, J.V. River Flow Forecasting through Conceptual Model. Part 1-A Discussion of Principles. J. Hydrol. 1970, 10, 282–290. [Google Scholar] [CrossRef]
- Steel, R.G.D.; Torrie, J.H. Principles and Procedures of Statistics with Special Reference to the Biological Sciences; McGraw Hill: New York, NY, USA, 1960; pp. 187–287. [Google Scholar]
- Lee, K.F. Using Numerical Methods to Improve Machine Learning Models in River Stage Forecast. Master’s Thesis, National Cheng Kung University, Taiwan, 2021. [Google Scholar]
- Sthle, L.; Wold, S. Analysis of variance (ANOVA). Chemom. Intell. Lab. Syst. 1989, 6, 259–272. [Google Scholar] [CrossRef]
- Chung, S.F. Application of Artificial Neural Networks on Flood Routing and Forecasting. Master’s Thesis, Department of Bioenvironmental Systems Engineering, National Taiwan University, Taiwan, 2007. [Google Scholar]
Category | Typhoon Event | Time Period (Day Month Year) | Duration (hours) |
---|---|---|---|
Training set | Typhoon Saola | 30 July 2012~3 August 2012 | 120 |
Typhoon Haikui | 6 August 2012~7 August 2012 | 48 | |
Typhoon Soulik | 12 July 2013~13 July 2013 | 42 | |
Typhoon Trami | 20 August 2013~22 August 2013 | 72 | |
Typhoon Usagi | 19 September 2013~21 September 2013 | 51 | |
Typhoon Fitow | 4 October 2013~7 October 2013 | 96 | |
Typhoon Chan-How | 9 July 2015~11 July 2015 | 72 | |
Validating set | Typhoon Soudelor | 6 August 2015~9 August 2015 | 96 |
Test set | Typhoon Dujuan | 27 September 2015~29 September 2015 | 72 |
Typhoon Megi | 25 September 2016~28 September 2016 | 96 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Fu, J.-C.; Su, M.-P.; Liu, W.-C.; Huang, W.-C.; Liu, H.-M. Water Level Forecasting Combining Machine Learning and Ensemble Kalman Filtering in the Danshui River System, Taiwan. Water 2024, 16, 3530. https://doi.org/10.3390/w16233530
Fu J-C, Su M-P, Liu W-C, Huang W-C, Liu H-M. Water Level Forecasting Combining Machine Learning and Ensemble Kalman Filtering in the Danshui River System, Taiwan. Water. 2024; 16(23):3530. https://doi.org/10.3390/w16233530
Chicago/Turabian StyleFu, Jin-Cheng, Mu-Ping Su, Wen-Cheng Liu, Wei-Che Huang, and Hong-Ming Liu. 2024. "Water Level Forecasting Combining Machine Learning and Ensemble Kalman Filtering in the Danshui River System, Taiwan" Water 16, no. 23: 3530. https://doi.org/10.3390/w16233530
APA StyleFu, J.-C., Su, M.-P., Liu, W.-C., Huang, W.-C., & Liu, H.-M. (2024). Water Level Forecasting Combining Machine Learning and Ensemble Kalman Filtering in the Danshui River System, Taiwan. Water, 16(23), 3530. https://doi.org/10.3390/w16233530