DEVELOPMENT OF TIME SERIES FORECASTING MODELS FOR AIR POLLUTION BASED ON DEEP SPARSE TRANSFORMER NETWORKS
DOI:
https://doi.org/10.37943/23VUWJ5711Keywords:
deep sparse transformer network, air pollution forecasting, fractal analysis, long-term memory, environmental monitoringAbstract
This study investigates the application of fractal analysis and deep learning methods for forecasting pollutant emissions from the Ekibastuz coal-fired power plant. The research is based on time series of NO, NO₂, and PM₁₀ concentrations collected by industrial sensors during 2023–2024. To assess long-term dependencies, an R/S analysis was performed, and the results demonstrated stable persistence with average Hurst exponent values exceeding 0.67. This confirmed the appropriateness of employing models capable of capturing long-range memory in the data. In the second stage, a Deep Sparse Transformer Network (DSTN) architecture was implemented and adapted to the task of emission forecasting under different boiler operating modes. DSTN combines the advantages of transformer-based models with a sparse attention mechanism, which reduces computational complexity and enables efficient handling of long sequences. The model was trained using the PyTorch framework on a dataset of more than 67,000 records, with forecasting performed at horizons of 1, 6, 12, and 24 steps. The highest accuracy was achieved for short-term forecasts: the coefficient of determination for NO₂ reached 0.95 at a one-step horizon and decreased to 0.38 at 24 steps. For NO and PM₁₀, R² values ranged from 0.93 to 0.26. These findings indicate that DSTN is a highly effective tool for short-term forecasting but less accurate at longer horizons due to error accumulation. The results confirm the practical value of integrating fractal analysis with transformer architectures for emission monitoring and coal power plant operation management. The proposed approach can be embedded into industrial control systems to enable timely responses to peak emissions, optimize combustion modes, and mitigate environmental risks.
References
Wang, Y., Yuan, Q., Li, T., & Zhu, L. (2022). Global spatiotemporal estimation of daily high-resolution surface carbon monoxide concentrations using Deep Forest. Journal of Cleaner Production, 350, 131500. https://doi.org/10.1016/j.jclepro.2022.131500
Li, T., & Cheng, X. (2021). Estimating daily full-coverage surface ozone concentration using satellite observations and a spatiotemporally embedded deep learning approach. International Journal of Applied Earth Observation and Geoinformation, 101, 102356. https://doi.org/10.1016/j.jag.2021.102356
Chang, Q., Zhang, H., & Zhao, Y. (2020). Ambient air pollution and daily hospital admissions for respiratory system–related diseases in a heavily polluted city in Northeast China. Environmental Science and Pollution Research, 27, 10055–10064. https://doi.org/10.1007/s11356-020-07678-8
Mahlangeni, N., Kapwata, T., Webster, C., Howlett-Downing, C., & Wright, C. Y. (2025). Exposure to air pollution from coal-fired power plants and impacts on human health: A scoping review. Reviews on Environmental Health. https://doi.org/10.1515/reveh-2024-0173
European Commission. (2021). Carbon Border Adjustment Mechanism: Questions and answers. https://ec.europa.eu/commission/presscorner/detail/en/qanda_21_3661
Nugmanova, D., Feshchenko, Y., Iashyna, L., Gyrina, O., Malynovska, K., Mammadbayov, E., Akhundova, I., Nurkina, N., Tariq, L., Makarova, J., & Vasylyev, A. (2018). The prevalence, burden and risk factors associated with chronic obstructive pulmonary disease in Commonwealth of Independent States (Ukraine, Kazakhstan and Azerbaijan): Results of the CORE study. BMC Pulmonary Medicine, 18, 26. https://doi.org/10.1186/s12890-018-0589-5
Nugmanova, D., Sokolova, L., Feshchenko, Y., Iashyna, L., Gyrina, O., Malynovska, K., Mustafayev, I., Aliyeva, G., Makarova, J., & Vasylyev, A. (2018). The prevalence, burden and risk factors associated with bronchial asthma in Commonwealth of Independent States countries (Ukraine, Kazakhstan and Azerbaijan): Results of the CORE study. BMC Pulmonary Medicine, 18, 110. https://doi.org/10.1186/s12890-018-0660-2
Semenova, Y., Zhunussov, Y., Pivina, L., Abisheva, A., Tinkov, A., Belikhina, T., Skalny, A., Zhanaspayev, M., Bulegenov, T., & Glushkova, N. (2019). Trace element biomonitoring in hair and blood of occupationally unexposed population residing in polluted areas of East Kazakhstan and Pavlodar regions. Journal of Trace Elements in Medicine and Biology, 56, 31–37. https://doi.org/10.1016/j.jtemb.2019.07.019
Kerimray, A., Assanov, D., Kenessov, B., & Karaca, F. (2020). Trends and health impacts of major urban air pollutants in Kazakhstan. Journal of the Air & Waste Management Association, 70, 1148–1164. https://doi.org/10.1080/10962247.2020.1793568
IQAir. (2024). World’s most polluted countries & regions. https://www.iqair.com/world-most-polluted-countries (accessed September 02, 2025).
Xue, Y., Pan, W., Lu, W. Z., & He, H. D. (2025). Multifractal nature of particulate matters (PMs) in Hong Kong urban air. Science of the Total Environment, 532, 744–751. https://doi.org/10.1016/j.scitotenv.2025.744
Kantelhardt, J. W., Zschiegner, S. A., Koscielny-Bunde, E., Havlin, S., Bunde, A., & Stanley, H. E. (2002). Multifractal detrended fluctuation analysis of nonstationary time series. Physica A: Statistical Mechanics and Its Applications, 316, 87–114. https://doi.org/10.1016/S0378-4371(02)01383-3
Thompson, J. R., & Wilson, J. R. (2016). Multifractal detrended fluctuation analysis: Practical applications to financial time series. Computer Simulation, 126, 63–88. https://doi.org/10.1016/j.cose.2016.05.012
Liu, X., Hadiatullah, H., Tai, P., Xu, Y., Zhang, X., Schnelle-Kreis, J., Schloter-Hai, B., & Zimmermann, R. (2021). Air pollution in Germany: Spatio-temporal variations and their driving factors based on continuous data from 2008 to 2018. Environmental Pollution, 276, 116732. https://doi.org/10.1016/j.envpol.2021.116732
Kolesnikova, K., Naizabayeva, L., Myrzabayeva, A., & Lisnevskyi, R. (2024). Use the neural networks in prediction of environmental processes. 2024 IEEE 4th International Conference on Smart Information Systems and Technologies (SIST), 625–630. https://doi.org/10.1109/SIST60832.2024.10710756
Kuchanskyi, O., Biloshchytskyi, A., Andrashko, Y., Neftissov, A., Biloshchytska, S., & Bronin, S. (2025). Predictability of air pollutants based on detrended fluctuation analysis: Ekibastuz coal-mining center in Northeastern Kazakhstan. Urban Science, 9, 273. https://doi.org/10.3390/urbansci9070273
Andrashko, Y., Kuchanskyi, O., Biloshchytskyi, A., Neftissov, A., & Biloshchytska, S. (2025). Forecasting air pollutant emissions using deep sparse transformer networks: A case study of the Ekibastuz coal-fired power plant. Sustainability, 17(11), 5115. https://doi.org/10.3390/su17115115
Zhang, Z., & Zhang, S. (2023). Modeling air quality PM2.5 forecasting using deep sparse attention-based transformer networks. International Journal of Environmental Science and Technology, 20, 13535–13550. https://doi.org/10.1007/s13762-023-05094-y
Cui, B., Liu, M., Li, S., Jin, Z., Zeng, Y., & Lin, Z. (2023). Deep learning methods for atmospheric PM2.5 prediction: A comparative study of transformer and CNN-LSTM-attention. Atmospheric Pollution Research, 14, 101833. https://doi.org/10.1016/j.apr.2023.101833
Jian, L., Zhao, Y., Zhu, Y. P., Zhang, M. B., & Bertolatti, D. (2012). An application of ARIMA model to predict submicron particle concentrations from meteorological factors at a busy roadside in Hangzhou, China. Science of the Total Environment, 426, 336–345. https://doi.org/10.1016/j.scitotenv.2012.03.071
Cekim, H. O. (2020). Forecasting PM10 concentrations using time series models: A case of the most polluted cities in Turkey. Environmental Science and Pollution Research, 27, 25612–25624. https://doi.org/10.1007/s11356-020-08691-9
Kumar, U., & Jain, V. K. (2010). ARIMA forecasting of ambient air pollutants (O3, NO, NO2 and CO). Stochastic Environmental Research and Risk Assessment, 24, 751–760. https://doi.org/10.1007/s00477-009-0357-8
Chu, J., Dong, Y., Han, X., Xie, J., Xu, X., & Xie, G. (2021). Short-term prediction of urban PM2.5 based on a hybrid modified variational mode decomposition and support vector regression model. Environmental Science and Pollution Research, 28, 56–72. https://doi.org/10.1007/s11356-020-11247-y
Agarwal, S., Sharma, S. R. S., Rahman, M. H., Vranckx, S., Maiheu, B., Blyth, L., Janssen, S., Gargava, P., Shukla, V. K., & Batra, S. (2020). Air quality forecasting using artificial neural networks with real time dynamic error correction in highly polluted regions. Science of the Total Environment, 735, 139454. https://doi.org/10.1016/j.scitotenv.2020.139454
Zaini, N., Ean, L. W., Ahmed, A. N., et al. (2022). A systematic literature review of deep learning neural networks for time series air quality forecasting. Environmental Science and Pollution Research, 29, 4958–4990. https://doi.org/10.1007/s11356-021-17442-1
Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794. https://doi.org/10.1145/2939672.2939785
Biloshchytskyi, A., Neftissov, A., Kuchanskyi, O., Andrashko, Y., Biloshchytska, S., Mukhatayev, A., & Kazambayev, I. (2024). Fractal analysis of air pollution time series in urban areas in Astana, Republic of Kazakhstan. Urban Science, 8, 131. https://doi.org/10.3390/urbansci8030131
Biloshchytskyi, A., Kuchanskyi, O., Neftissov, A., Andrashko, Y., Biloshchytska, S., & Kazambayev, I. (2024). Fractal analysis of mining wastewater time series parameters: Balkhash urban region and Sayak ore district. Urban Science, 8, 200. https://doi.org/10.3390/urbansci8040200
Peters, E. E. (1994). Fractal market analysis: Applying chaos theory to investment and economics. John Wiley & Sons.
Anis, A., & Lloyd, E. (1976). The expected value of the adjusted rescaled Hurst range of independent normal summands. Biometrika, 63, 111–116. https://doi.org/10.1093/biomet/63.1.111
Sohrabi, Z., & Maleki, J. (2025). Fusing satellite imagery and ground-based observations for PM2.5 air pollution modeling in Iran using a deep learning approach. Scientific Reports, 15, 1, Art. no. 21449. https://doi.org/10.1038/s41598-025-05332-2
Zhang, Z., Zhang, S., Zhao, X., Chen, L., & Yao, J. (2022). Temporal difference-based graph transformer networks for air quality PM2.5 prediction: A case study in China. Frontiers in Environmental Science, 10, 924986. https://doi.org/10.3389/fenvs.2022.924986
The 17 Goals, Department of Economic and Social Affairs. (2025). Sustainable development. https://sdgs.un.org/goals (accessed May 23, 2025).
United Nations. (2025). Transforming our world: The 2030 agenda for sustainable development. https://sdgs.un.org/2030agenda (accessed May 23, 2025).
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Articles are open access under the Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Authors who publish a manuscript in this journal agree to the following terms:
- The authors reserve the right to authorship of their work and transfer to the journal the right of first publication under the terms of the Creative Commons Attribution License, which allows others to freely distribute the published work with a mandatory link to the the original work and the first publication of the work in this journal.
- Authors have the right to conclude independent additional agreements that relate to the non-exclusive distribution of the work in the form in which it was published by this journal (for example, to post the work in the electronic repository of the institution or publish as part of a monograph), providing the link to the first publication of the work in this journal.
- Other terms stated in the Copyright Agreement.