ENSEMBLE MACHINE LEARNING FOR GLOBAL HYDROLOGICAL PREDICTION
DOI:
https://doi.org/10.37943/24DKYV6003Keywords:
hydrological modeling, machine learning, ensemble learning, discharge prediction, water resources monitoringAbstract
Accurate global hydrological prediction is vital for sustainable water management but is often hindered by data complexity and fragmentation. This study introduces an advanced machine learning framework to predict long-term average discharge using widely available global hydrological station metadata, aiming to develop a highly accurate and generalizable model for large-scale water resource assessment. The methodology utilized the Global Runoff Data Centre (GRDC) dataset, applying extensive feature engineering to station characteristics and a logarithmic transformation to the discharge variable. A diverse set of algorithms was trained, including a custom deep neural network with specialized architecture and several gradient boosting machines. These individual models were then integrated into a final Meta Ensemble model through an optimized weighting strategy to maximize predictive performance. The framework was rigorously validated on an independent test set. The Meta Ensemble model demonstrated superior predictive power, achieving a Coefficient of Determination (R²) of 0.954. This performance significantly surpassed that of both baseline methods and the individual advanced models. Analysis of the results confirmed that the model learned hydrologically meaningful relationships, identifying catchment area and geographical location as the most influential predictors. The findings confirm that a data-driven ensemble framework can accurately predict key hydrological characteristics using only station metadata. This approach offers a powerful and scalable alternative to traditional modeling, holding significant potential for water resource assessment in data-scarce regions and serving as a robust foundation for future intelligent monitoring systems.
References
Ahmed, M. A., & Li, S. S. (2024). Machine Learning Model for River Discharge Forecast: A Case Study of the Ottawa River in Canada. Hydrology, 11(9), 151. https://doi.org/10.3390/hydrology11090151.
Asadollahi, A., Magar, B. A., Poudel, B., Sohrabifar, A., & Kalra, A. (2024). Application of Machine Learning Models for Improving Discharge Prediction in Ungauged Watershed: A Case Study in East DuPage, Illinois. Geographies, 4(2), 363–377. https://doi.org/10.3390/geographies4020021.
Neftissov, A., Biloshchytskyi, A., Kazambayev, I., Dolhopolov, S., & Honcharenko, T. (2025). An Advanced Ensemble Machine Learning Framework for Estimating Long-Term Average Discharge at Hydrological Stations Using Global Metadata. Water, 17(14), 2097. https://doi.org/10.3390/w17142097.
Lu, M., Hou, Q., Qin, S., Zhou, L., Hua, D., Wang, X., & Cheng, L. (2023). A Stacking Ensemble Model of Various Machine Learning Models for Daily Runoff Forecasting. Water, 15(7), 1265. https://doi.org/10.3390/w15071265.
Peng, L., Fu, J., Yuan, Y., Wang, X., Zhao, Y., & Tong, J. (2025). A Bayesian Ensemble Learning-Based Scheme for Real-Time Error Correction of Flood Forecasting. Water, 17(14), 2048. https://doi.org/10.3390/w17142048.
Fu, J.-C., Su, M.-P., Liu, W.-C., Huang, W.-C., & Liu, H.-M. (2024). Water Level Forecasting Combining Machine Learning and Ensemble Kalman Filtering in the Danshui River System, Taiwan. Water, 16(23), 3530. https://doi.org/10.3390/w16233530.
Zhou, Y., Pan, J., & Shao, G. (2025). A Comparative Study of a Two-Dimensional Slope Hydrodynamic Model (TDSHM), Long Short-Term Memory (LSTM), and Convolutional Neural Network (CNN) Models for Runoff Prediction. Water, 17(9), 1380. https://doi.org/10.3390/w17091380.
Kang, X., Yu, H., Yang, C., Tian, Q., & Wang, Y. (2025). Analysis of Evolutionary Characteristics and Prediction of Annual Runoff in Qianping Reservoir. Water, 17(13), 1902. https://doi.org/10.3390/w17131902.
Wei, H., Wang, Y., Liu, J., & Cao, Y. (2023). Monthly Runoff Prediction by Combined Models Based on Secondary Decomposition at the Wulong Hydrological Station in the Yangtze River Basin. Water, 15(21), 3717. https://doi.org/10.3390/w15213717.
Yong, K., Li, M., Xiao, P., Gao, B., & Zheng, C. (2025). Monthly Streamflow Forecasting for the Irtysh River Based on a Deep Learning Model Combined with Runoff Decomposition. Water, 17(9), 1375. https://doi.org/10.3390/w17091375.
Martin, N., & White, J. (2024). Water Resources’ AI–ML Data Uncertainty Risk and Mitigation Using Data Assimilation. Water, 16(19), 2758. https://doi.org/10.3390/w16192758.
Chen, S., Yang, H., & Zheng, H. (2025). Intercomparison of Runoff and River Discharge Reanalysis Datasets at the Upper Jinsha River, an Alpine River on the Eastern Edge of the Tibetan Plateau. Water, 17(6), 871. https://doi.org/10.3390/w17060871.
Bahrami Chegeni, I., Riyahi, M. M., Bakhshipour, A. E., Azizipour, M., & Haghighi, A. (2025). Developing Machine Learning Models for Optimal Design of Water Distribution Networks Using Graph Theory-Based Features. Water, 17(11), 1654. https://doi.org/10.3390/w17111654.
Yuan, Y., Shen, D., Cao, Y., Wang, X., Zhang, B., & Dong, H. (2025). An Ensemble Machine Learning Approach for High-Resolution Estimation of Groundwater Storage Anomalies. Water, 17(10), 1445. https://doi.org/10.3390/w17101445.
Ziadi, S., Chokmani, K., Chaabani, C., & El Alem, A. (2024). Deep Learning-Based Automatic River Flow Estimation Using RADARSAT Imagery. Remote Sensing, 16(10), 1808. https://doi.org/10.3390/rs16101808.
He, S., Niu, G., Sang, X., Sun, X., Yin, J., & Chen, H. (2023). Machine Learning Framework with Feature Importance Interpretation for Discharge Estimation: A Case Study in Huitanggou Sluice Hydrological Station, China. Water, 15(10), 1923. https://doi.org/10.3390/w15101923.
Bărbulescu, A., & Zhen, L. (2024). Forecasting the River Water Discharge by Artificial Intelligence Methods. Water, 16(9), 1248. https://doi.org/10.3390/w16091248.
Workneh, H. A., & Jha, M. K. (2025). Utilizing Deep Learning Models to Predict Streamflow. Water, 17(5), 756. https://doi.org/10.3390/w17050756.
Huang, J., Chen, J., Huang, H., & Cai, X. (2025). Deep Learning-Based Daily Streamflow Prediction Model for the Hanjiang River Basin. Hydrology, 12(7), 168. https://doi.org/10.3390/hydrology12070168.
Liu, W., Zou, P., Jiang, D., Quan, X., & Dai, H. (2023). Computing River Discharge Using Water Surface Elevation Based on Deep Learning Networks. Water, 15(21), 3759. https://doi.org/10.3390/w15213759.
Francisco, R., & Matos, J. P. (2024). Deep Learning Prediction of Streamflow in Portugal. Hydrology, 11(12), 217. https://doi.org/10.3390/hydrology11120217.
Dolhopolov, S., Honcharenko, T., Terentyev, O., Savenko, V., Rosynskyi, A., Bodnar, N., & Alzidi, E. (2024). Multi-Stage Classification of Construction Site Modeling Objects Using Artificial Intelligence Based on BIM Technology. 2024 35th Conference of Open Innovations Association (FRUCT), 179–185. https://doi.org/10.23919/fruct61870.2024.10516383.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Articles are open access under the Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Authors who publish a manuscript in this journal agree to the following terms:
- The authors reserve the right to authorship of their work and transfer to the journal the right of first publication under the terms of the Creative Commons Attribution License, which allows others to freely distribute the published work with a mandatory link to the the original work and the first publication of the work in this journal.
- Authors have the right to conclude independent additional agreements that relate to the non-exclusive distribution of the work in the form in which it was published by this journal (for example, to post the work in the electronic repository of the institution or publish as part of a monograph), providing the link to the first publication of the work in this journal.
- Other terms stated in the Copyright Agreement.