DEVELOPMENT OF TIME SERIES FORECASTING MODELS FOR AIR POLLUTION BASED ON DEEP SPARSE TRANSFORMER NETWORKS

Authors

DOI:

https://doi.org/10.37943/23VUWJ5711

Keywords:

deep sparse transformer network, air pollution forecasting, fractal analysis, long-term memory, environmental monitoring

Abstract

This study investigates the application of fractal analysis and deep learning methods for forecasting pollutant emissions from the Ekibastuz coal-fired power plant. The research is based on time series of NO, NO₂, and PM₁₀ concentrations collected by industrial sensors during 2023–2024. To assess long-term dependencies, an R/S analysis was performed, and the results demonstrated stable persistence with average Hurst exponent values exceeding 0.67. This confirmed the appropriateness of employing models capable of capturing long-range memory in the data. In the second stage, a Deep Sparse Transformer Network (DSTN) architecture was implemented and adapted to the task of emission forecasting under different boiler operating modes. DSTN combines the advantages of transformer-based models with a sparse attention mechanism, which reduces computational complexity and enables efficient handling of long sequences. The model was trained using the PyTorch framework on a dataset of more than 67,000 records, with forecasting performed at horizons of 1, 6, 12, and 24 steps. The highest accuracy was achieved for short-term forecasts: the coefficient of determination for NO₂ reached 0.95 at a one-step horizon and decreased to 0.38 at 24 steps. For NO and PM₁₀, R² values ranged from 0.93 to 0.26. These findings indicate that DSTN is a highly effective tool for short-term forecasting but less accurate at longer horizons due to error accumulation. The results confirm the practical value of integrating fractal analysis with transformer architectures for emission monitoring and coal power plant operation management. The proposed approach can be embedded into industrial control systems to enable timely responses to peak emissions, optimize combustion modes, and mitigate environmental risks.

Author Biographies

Andrii Biloshchytskyi, Astana IT University, Kazakhstan

Doctor of Technical Sciences, Vice-Rector for Science and Innovation

Oleksandr Kuchanskyi, Astana IT University, Kazakhstan

Doctor of Technical Sciences, Professor, School of Artificial Intelligence and Data Science

Yurii Andrashko, Uzhhorod National University, Ukraine

Candidate of Technical Sciences, Associate Professor of Department of System Analysis and Optimization Theory

Alexandr Neftissov, Academy of Physical Education and Mass Sports, Kazakhstan

PhD, Vice-Rector for Science and Innovation

References

Wang, Y., Yuan, Q., Li, T., & Zhu, L. (2022). Global spatiotemporal estimation of daily high-resolution surface carbon monoxide concentrations using Deep Forest. Journal of Cleaner Production, 350, 131500. https://doi.org/10.1016/j.jclepro.2022.131500

Li, T., & Cheng, X. (2021). Estimating daily full-coverage surface ozone concentration using satellite observations and a spatiotemporally embedded deep learning approach. International Journal of Applied Earth Observation and Geoinformation, 101, 102356. https://doi.org/10.1016/j.jag.2021.102356

Chang, Q., Zhang, H., & Zhao, Y. (2020). Ambient air pollution and daily hospital admissions for respiratory system–related diseases in a heavily polluted city in Northeast China. Environmental Science and Pollution Research, 27, 10055–10064. https://doi.org/10.1007/s11356-020-07678-8

Mahlangeni, N., Kapwata, T., Webster, C., Howlett-Downing, C., & Wright, C. Y. (2025). Exposure to air pollution from coal-fired power plants and impacts on human health: A scoping review. Reviews on Environmental Health. https://doi.org/10.1515/reveh-2024-0173

European Commission. (2021). Carbon Border Adjustment Mechanism: Questions and answers. https://ec.europa.eu/commission/presscorner/detail/en/qanda_21_3661

Nugmanova, D., Feshchenko, Y., Iashyna, L., Gyrina, O., Malynovska, K., Mammadbayov, E., Akhundova, I., Nurkina, N., Tariq, L., Makarova, J., & Vasylyev, A. (2018). The prevalence, burden and risk factors associated with chronic obstructive pulmonary disease in Commonwealth of Independent States (Ukraine, Kazakhstan and Azerbaijan): Results of the CORE study. BMC Pulmonary Medicine, 18, 26. https://doi.org/10.1186/s12890-018-0589-5

Nugmanova, D., Sokolova, L., Feshchenko, Y., Iashyna, L., Gyrina, O., Malynovska, K., Mustafayev, I., Aliyeva, G., Makarova, J., & Vasylyev, A. (2018). The prevalence, burden and risk factors associated with bronchial asthma in Commonwealth of Independent States countries (Ukraine, Kazakhstan and Azerbaijan): Results of the CORE study. BMC Pulmonary Medicine, 18, 110. https://doi.org/10.1186/s12890-018-0660-2

Semenova, Y., Zhunussov, Y., Pivina, L., Abisheva, A., Tinkov, A., Belikhina, T., Skalny, A., Zhanaspayev, M., Bulegenov, T., & Glushkova, N. (2019). Trace element biomonitoring in hair and blood of occupationally unexposed population residing in polluted areas of East Kazakhstan and Pavlodar regions. Journal of Trace Elements in Medicine and Biology, 56, 31–37. https://doi.org/10.1016/j.jtemb.2019.07.019

Kerimray, A., Assanov, D., Kenessov, B., & Karaca, F. (2020). Trends and health impacts of major urban air pollutants in Kazakhstan. Journal of the Air & Waste Management Association, 70, 1148–1164. https://doi.org/10.1080/10962247.2020.1793568

IQAir. (2024). World’s most polluted countries & regions. https://www.iqair.com/world-most-polluted-countries (accessed September 02, 2025).

Xue, Y., Pan, W., Lu, W. Z., & He, H. D. (2025). Multifractal nature of particulate matters (PMs) in Hong Kong urban air. Science of the Total Environment, 532, 744–751. https://doi.org/10.1016/j.scitotenv.2025.744

Kantelhardt, J. W., Zschiegner, S. A., Koscielny-Bunde, E., Havlin, S., Bunde, A., & Stanley, H. E. (2002). Multifractal detrended fluctuation analysis of nonstationary time series. Physica A: Statistical Mechanics and Its Applications, 316, 87–114. https://doi.org/10.1016/S0378-4371(02)01383-3

Thompson, J. R., & Wilson, J. R. (2016). Multifractal detrended fluctuation analysis: Practical applications to financial time series. Computer Simulation, 126, 63–88. https://doi.org/10.1016/j.cose.2016.05.012

Liu, X., Hadiatullah, H., Tai, P., Xu, Y., Zhang, X., Schnelle-Kreis, J., Schloter-Hai, B., & Zimmermann, R. (2021). Air pollution in Germany: Spatio-temporal variations and their driving factors based on continuous data from 2008 to 2018. Environmental Pollution, 276, 116732. https://doi.org/10.1016/j.envpol.2021.116732

Kolesnikova, K., Naizabayeva, L., Myrzabayeva, A., & Lisnevskyi, R. (2024). Use the neural networks in prediction of environmental processes. 2024 IEEE 4th International Conference on Smart Information Systems and Technologies (SIST), 625–630. https://doi.org/10.1109/SIST60832.2024.10710756

Kuchanskyi, O., Biloshchytskyi, A., Andrashko, Y., Neftissov, A., Biloshchytska, S., & Bronin, S. (2025). Predictability of air pollutants based on detrended fluctuation analysis: Ekibastuz coal-mining center in Northeastern Kazakhstan. Urban Science, 9, 273. https://doi.org/10.3390/urbansci9070273

Andrashko, Y., Kuchanskyi, O., Biloshchytskyi, A., Neftissov, A., & Biloshchytska, S. (2025). Forecasting air pollutant emissions using deep sparse transformer networks: A case study of the Ekibastuz coal-fired power plant. Sustainability, 17(11), 5115. https://doi.org/10.3390/su17115115

Zhang, Z., & Zhang, S. (2023). Modeling air quality PM2.5 forecasting using deep sparse attention-based transformer networks. International Journal of Environmental Science and Technology, 20, 13535–13550. https://doi.org/10.1007/s13762-023-05094-y

Cui, B., Liu, M., Li, S., Jin, Z., Zeng, Y., & Lin, Z. (2023). Deep learning methods for atmospheric PM2.5 prediction: A comparative study of transformer and CNN-LSTM-attention. Atmospheric Pollution Research, 14, 101833. https://doi.org/10.1016/j.apr.2023.101833

Jian, L., Zhao, Y., Zhu, Y. P., Zhang, M. B., & Bertolatti, D. (2012). An application of ARIMA model to predict submicron particle concentrations from meteorological factors at a busy roadside in Hangzhou, China. Science of the Total Environment, 426, 336–345. https://doi.org/10.1016/j.scitotenv.2012.03.071

Cekim, H. O. (2020). Forecasting PM10 concentrations using time series models: A case of the most polluted cities in Turkey. Environmental Science and Pollution Research, 27, 25612–25624. https://doi.org/10.1007/s11356-020-08691-9

Kumar, U., & Jain, V. K. (2010). ARIMA forecasting of ambient air pollutants (O3, NO, NO2 and CO). Stochastic Environmental Research and Risk Assessment, 24, 751–760. https://doi.org/10.1007/s00477-009-0357-8

Chu, J., Dong, Y., Han, X., Xie, J., Xu, X., & Xie, G. (2021). Short-term prediction of urban PM2.5 based on a hybrid modified variational mode decomposition and support vector regression model. Environmental Science and Pollution Research, 28, 56–72. https://doi.org/10.1007/s11356-020-11247-y

Agarwal, S., Sharma, S. R. S., Rahman, M. H., Vranckx, S., Maiheu, B., Blyth, L., Janssen, S., Gargava, P., Shukla, V. K., & Batra, S. (2020). Air quality forecasting using artificial neural networks with real time dynamic error correction in highly polluted regions. Science of the Total Environment, 735, 139454. https://doi.org/10.1016/j.scitotenv.2020.139454

Zaini, N., Ean, L. W., Ahmed, A. N., et al. (2022). A systematic literature review of deep learning neural networks for time series air quality forecasting. Environmental Science and Pollution Research, 29, 4958–4990. https://doi.org/10.1007/s11356-021-17442-1

Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794. https://doi.org/10.1145/2939672.2939785

Biloshchytskyi, A., Neftissov, A., Kuchanskyi, O., Andrashko, Y., Biloshchytska, S., Mukhatayev, A., & Kazambayev, I. (2024). Fractal analysis of air pollution time series in urban areas in Astana, Republic of Kazakhstan. Urban Science, 8, 131. https://doi.org/10.3390/urbansci8030131

Biloshchytskyi, A., Kuchanskyi, O., Neftissov, A., Andrashko, Y., Biloshchytska, S., & Kazambayev, I. (2024). Fractal analysis of mining wastewater time series parameters: Balkhash urban region and Sayak ore district. Urban Science, 8, 200. https://doi.org/10.3390/urbansci8040200

Peters, E. E. (1994). Fractal market analysis: Applying chaos theory to investment and economics. John Wiley & Sons.

Anis, A., & Lloyd, E. (1976). The expected value of the adjusted rescaled Hurst range of independent normal summands. Biometrika, 63, 111–116. https://doi.org/10.1093/biomet/63.1.111

Sohrabi, Z., & Maleki, J. (2025). Fusing satellite imagery and ground-based observations for PM2.5 air pollution modeling in Iran using a deep learning approach. Scientific Reports, 15, 1, Art. no. 21449. https://doi.org/10.1038/s41598-025-05332-2

Zhang, Z., Zhang, S., Zhao, X., Chen, L., & Yao, J. (2022). Temporal difference-based graph transformer networks for air quality PM2.5 prediction: A case study in China. Frontiers in Environmental Science, 10, 924986. https://doi.org/10.3389/fenvs.2022.924986

The 17 Goals, Department of Economic and Social Affairs. (2025). Sustainable development. https://sdgs.un.org/goals (accessed May 23, 2025).

United Nations. (2025). Transforming our world: The 2030 agenda for sustainable development. https://sdgs.un.org/2030agenda (accessed May 23, 2025).

Downloads

Published

2025-09-30

How to Cite

Biloshchytskyi, A., Kuchanskyi, O., Andrashko, Y., & Neftissov, A. (2025). DEVELOPMENT OF TIME SERIES FORECASTING MODELS FOR AIR POLLUTION BASED ON DEEP SPARSE TRANSFORMER NETWORKS. Scientific Journal of Astana IT University, 23, 185–198. https://doi.org/10.37943/23VUWJ5711

Issue

Section

Information Technologies