ENSEMBLE MACHINE LEARNING FOR GLOBAL HYDROLOGICAL PREDICTION

Authors

DOI:

https://doi.org/10.37943/24DKYV6003

Keywords:

hydrological modeling, machine learning, ensemble learning, discharge prediction, water resources monitoring

Abstract

Accurate global hydrological prediction is vital for sustainable water management but is often hindered by data complexity and fragmentation. This study introduces an advanced machine learning framework to predict long-term average discharge using widely available global hydrological station metadata, aiming to develop a highly accurate and generalizable model for large-scale water resource assessment. The methodology utilized the Global Runoff Data Centre (GRDC) dataset, applying extensive feature engineering to station characteristics and a logarithmic transformation to the discharge variable. A diverse set of algorithms was trained, including a custom deep neural network with specialized architecture and several gradient boosting machines. These individual models were then integrated into a final Meta Ensemble model through an optimized weighting strategy to maximize predictive performance. The framework was rigorously validated on an independent test set. The Meta Ensemble model demonstrated superior predictive power, achieving a Coefficient of Determination (R²) of 0.954. This performance significantly surpassed that of both baseline methods and the individual advanced models. Analysis of the results confirmed that the model learned hydrologically meaningful relationships, identifying catchment area and geographical location as the most influential predictors. The findings confirm that a data-driven ensemble framework can accurately predict key hydrological characteristics using only station metadata. This approach offers a powerful and scalable alternative to traditional modeling, holding significant potential for water resource assessment in data-scarce regions and serving as a robust foundation for future intelligent monitoring systems.

Author Biographies

Alexandr Neftissov, Academy of Physical Education and Mass Sports, Kazakhstan

PhD, Associate Professor, Rectorate for Science and Innovation
PhD, Associate Professor, Researcher, Scientific-Innovation Center Industry 4.0
Astana IT University, Kazakhstan

Tetyana Honcharenko, Kyiv National University of Construction and Architecture, Ukraine

Doctor of Technical Sciences, Professor, Head of Department of Information Technologies

Andrii Biloshchytskyi, Astana IT University, Kazakhstan

Doctor of Technical Sciences, Professor, Vice-Rector for Science and Innovations
Professor Department of Information Technologies,
Kyiv National University of Construction and Architecture, Ukraine

Ilyas Kazambayev, Master`s degree, Acting Director of Scientific-Innovation Center Industry 4.0, Astana IT University, Kazakhstan

Master`s degree, Acting Director of Scientific-Innovation Center Industry 4.0

Serhii Dolhopolov, Kyiv National University of Construction and Architecture, Ukraine

PhD Student, Junior Researcher, Assistant Lecturer at the Department of Information Technologies

References

Ahmed, M. A., & Li, S. S. (2024). Machine Learning Model for River Discharge Forecast: A Case Study of the Ottawa River in Canada. Hydrology, 11(9), 151. https://doi.org/10.3390/hydrology11090151.

Asadollahi, A., Magar, B. A., Poudel, B., Sohrabifar, A., & Kalra, A. (2024). Application of Machine Learning Models for Improving Discharge Prediction in Ungauged Watershed: A Case Study in East DuPage, Illinois. Geographies, 4(2), 363–377. https://doi.org/10.3390/geographies4020021.

Neftissov, A., Biloshchytskyi, A., Kazambayev, I., Dolhopolov, S., & Honcharenko, T. (2025). An Advanced Ensemble Machine Learning Framework for Estimating Long-Term Average Discharge at Hydrological Stations Using Global Metadata. Water, 17(14), 2097. https://doi.org/10.3390/w17142097.

Lu, M., Hou, Q., Qin, S., Zhou, L., Hua, D., Wang, X., & Cheng, L. (2023). A Stacking Ensemble Model of Various Machine Learning Models for Daily Runoff Forecasting. Water, 15(7), 1265. https://doi.org/10.3390/w15071265.

Peng, L., Fu, J., Yuan, Y., Wang, X., Zhao, Y., & Tong, J. (2025). A Bayesian Ensemble Learning-Based Scheme for Real-Time Error Correction of Flood Forecasting. Water, 17(14), 2048. https://doi.org/10.3390/w17142048.

Fu, J.-C., Su, M.-P., Liu, W.-C., Huang, W.-C., & Liu, H.-M. (2024). Water Level Forecasting Combining Machine Learning and Ensemble Kalman Filtering in the Danshui River System, Taiwan. Water, 16(23), 3530. https://doi.org/10.3390/w16233530.

Zhou, Y., Pan, J., & Shao, G. (2025). A Comparative Study of a Two-Dimensional Slope Hydrodynamic Model (TDSHM), Long Short-Term Memory (LSTM), and Convolutional Neural Network (CNN) Models for Runoff Prediction. Water, 17(9), 1380. https://doi.org/10.3390/w17091380.

Kang, X., Yu, H., Yang, C., Tian, Q., & Wang, Y. (2025). Analysis of Evolutionary Characteristics and Prediction of Annual Runoff in Qianping Reservoir. Water, 17(13), 1902. https://doi.org/10.3390/w17131902.

Wei, H., Wang, Y., Liu, J., & Cao, Y. (2023). Monthly Runoff Prediction by Combined Models Based on Secondary Decomposition at the Wulong Hydrological Station in the Yangtze River Basin. Water, 15(21), 3717. https://doi.org/10.3390/w15213717.

Yong, K., Li, M., Xiao, P., Gao, B., & Zheng, C. (2025). Monthly Streamflow Forecasting for the Irtysh River Based on a Deep Learning Model Combined with Runoff Decomposition. Water, 17(9), 1375. https://doi.org/10.3390/w17091375.

Martin, N., & White, J. (2024). Water Resources’ AI–ML Data Uncertainty Risk and Mitigation Using Data Assimilation. Water, 16(19), 2758. https://doi.org/10.3390/w16192758.

Chen, S., Yang, H., & Zheng, H. (2025). Intercomparison of Runoff and River Discharge Reanalysis Datasets at the Upper Jinsha River, an Alpine River on the Eastern Edge of the Tibetan Plateau. Water, 17(6), 871. https://doi.org/10.3390/w17060871.

Bahrami Chegeni, I., Riyahi, M. M., Bakhshipour, A. E., Azizipour, M., & Haghighi, A. (2025). Developing Machine Learning Models for Optimal Design of Water Distribution Networks Using Graph Theory-Based Features. Water, 17(11), 1654. https://doi.org/10.3390/w17111654.

Yuan, Y., Shen, D., Cao, Y., Wang, X., Zhang, B., & Dong, H. (2025). An Ensemble Machine Learning Approach for High-Resolution Estimation of Groundwater Storage Anomalies. Water, 17(10), 1445. https://doi.org/10.3390/w17101445.

Ziadi, S., Chokmani, K., Chaabani, C., & El Alem, A. (2024). Deep Learning-Based Automatic River Flow Estimation Using RADARSAT Imagery. Remote Sensing, 16(10), 1808. https://doi.org/10.3390/rs16101808.

He, S., Niu, G., Sang, X., Sun, X., Yin, J., & Chen, H. (2023). Machine Learning Framework with Feature Importance Interpretation for Discharge Estimation: A Case Study in Huitanggou Sluice Hydrological Station, China. Water, 15(10), 1923. https://doi.org/10.3390/w15101923.

Bărbulescu, A., & Zhen, L. (2024). Forecasting the River Water Discharge by Artificial Intelligence Methods. Water, 16(9), 1248. https://doi.org/10.3390/w16091248.

Workneh, H. A., & Jha, M. K. (2025). Utilizing Deep Learning Models to Predict Streamflow. Water, 17(5), 756. https://doi.org/10.3390/w17050756.

Huang, J., Chen, J., Huang, H., & Cai, X. (2025). Deep Learning-Based Daily Streamflow Prediction Model for the Hanjiang River Basin. Hydrology, 12(7), 168. https://doi.org/10.3390/hydrology12070168.

Liu, W., Zou, P., Jiang, D., Quan, X., & Dai, H. (2023). Computing River Discharge Using Water Surface Elevation Based on Deep Learning Networks. Water, 15(21), 3759. https://doi.org/10.3390/w15213759.

Francisco, R., & Matos, J. P. (2024). Deep Learning Prediction of Streamflow in Portugal. Hydrology, 11(12), 217. https://doi.org/10.3390/hydrology11120217.

Dolhopolov, S., Honcharenko, T., Terentyev, O., Savenko, V., Rosynskyi, A., Bodnar, N., & Alzidi, E. (2024). Multi-Stage Classification of Construction Site Modeling Objects Using Artificial Intelligence Based on BIM Technology. 2024 35th Conference of Open Innovations Association (FRUCT), 179–185. https://doi.org/10.23919/fruct61870.2024.10516383.

Downloads

Published

2025-10-30

How to Cite

Neftissov, A., Honcharenko, T., Biloshchytskyi, A., Kazambayev, I., & Dolhopolov, S. (2025). ENSEMBLE MACHINE LEARNING FOR GLOBAL HYDROLOGICAL PREDICTION . Scientific Journal of Astana IT University, 24. https://doi.org/10.37943/24DKYV6003

Issue

Section

Information Technologies