Scientific Journal of Astana IT University https://journal.astanait.edu.kz/index.php/ojs <div> <p><strong>Registered and issued a <a href="https://sj.astanait.edu.kz/wp-content/uploads/2021/12/%D0%B6%D1%83%D1%80%D0%BD%D0%B0%D0%BB-%D1%81%D0%B2%D0%B8%D0%B4%D0%B5%D1%82%D0%B5%D0%BB%D1%8C%D1%81%D1%82%D0%B2%D0%BE_page-0001.jpg" target="_blank" rel="noreferrer noopener">certificate</a></strong> Ministry of Information and Social Development Of the Republic of Kazakhstan<br />No. KZ200074316 dated January 20, 2020</p> <p><strong>Published</strong>: since 2020<br /><strong>Frequency</strong>: quarterly (March, June, September, December)<br /><strong>Specialization:</strong> Information Technology<br /><strong>Owner:</strong> <a href="https://astanait.edu.kz/en/main-page/" target="_blank" rel="noreferrer noopener"><strong>LLP "Astana IT University"</strong></a> is the leading educational institution located in Astana, Kazakhstan, specializing in innovative, ongoing IT education and scientific research, underpinned by strong academic traditions and a robust commitment to social responsibility.</p> <p><strong>Publication language:</strong> English</p> <p><strong>The main goal </strong>of a scientific publication is to provide the opportunity for the exchange of information in the scientific community, including international level.</p> <p><strong>Area of expertise:</strong> technical and pedagogical sciences</p> <p><a href="maindirections:Section1. Information Technologies1 Information Security2 Information and communication technology (ICT)3 IT in management, administration, finance andeconomics4 Project managementSection2. Pedagogy5.Digitalization in education: technologies, models, methods"><strong>Main directions:</strong></a></p> <p><strong>I Information Technologies</strong><a href="maindirections:Section1. Information Technologies1 Information Security2 Information and communication technology (ICT)3 IT in management, administration, finance andeconomics4 Project managementSection2. Pedagogy5.Digitalization in education: technologies, models, methods"><br /></a>1 Information Security<br />2 Information and communication technology (ICT)<br />3 IT in management, administration, finance and economics<br />4 Project management</p> <p><strong>II Pedagogy</strong></p> <p>5. IT in learing and teaching</p> <p class="has-text-align-center"><strong>Chief-editor</strong><br /><strong><a href="https://sj.astanait.edu.kz/2086-2/" target="_blank" rel="noreferrer noopener">Andrii Biloshchytskyi</a></strong> – Doctor of Technical Sciences, Professor, Vice-Rector for Science and Innovation</p> <p class="has-text-align-center"><strong>Executive editors</strong><br /><strong><a href="https://sj.astanait.edu.kz/2093-2/" target="_blank" rel="noreferrer noopener">Beibut Amirgaliyev</a></strong> – Candidate of Technical Sciences, Associate Professor</p> <p class="has-text-align-center"><a href="https://sj.astanait.edu.kz/3552-2/" target="_blank" rel="noreferrer noopener"><strong>Nurkhat Zhakiyev</strong></a> – PhD in Physics, Director of the Department of Science and Innovation, Astana IT University</p> <p class="has-text-align-center" align="center">The journal is included in the list of publications recommended by the Committee for Quality Assurance in the Sphere of Science and Higher Education of the Ministry of Science and Higher Education of the Republic of Kazakhstan for the publication of the main results of scientific activities in the scientific areas “Information and Communication Technologies” (18.03.2022, No. 104) and “Pedagogy” (25.01.2024 No. 101).<br />Scientific works are <strong>accepted year-round</strong> to the journal, for more detailed information and to familiarize yourself with the publication requirements of your scientific papers, please do not hesitate to contact us.</p> <table border="1" cellspacing="0" cellpadding="0"> <tbody> <tr> <td valign="top" width="208"> <p> <img src="http://ojs.astanait.edu.kz/public/site/images/babyshark/gerb_sm.aaf449a0_2.png" alt="" /></p> <p><a href="https://www.gov.kz/memleket/entities/control/documents/details/287211?directionId=3826&amp;lang=en" target="_blank" rel="noreferrer noopener">CQASES of the MHES RK</a></p> </td> <td valign="top" width="208"> <p><a href="https://portal.issn.org/resource/issn/2707-9031" target="_blank" rel="noopener"><img src="http://ojs.astanait.edu.kz/public/site/images/babyshark/issn1.png" alt="" /><br /><br />(P): 2707-9031</a><br /><a href="https://portal.issn.org/resource/issn/2707-904X" target="_blank" rel="noopener">(E): 2707-904X</a></p> </td> <td valign="top" width="208"> <p> <img src="http://ojs.astanait.edu.kz/public/site/images/babyshark/doi1.png" alt="" /></p> <p><a href="https://apps.crossref.org/myCrossref/?report=missingmetadata&amp;datatype=j&amp;prefix=10.37943">10.37943/AITU.2020</a></p> </td> </tr> </tbody> </table> </div> <p> </p> <h2 class="has-text-align-left">OPEN ACCESS POLICY</h2> <p>Scientific Journal of Astana IT University is an open access journal. All articles are free for users to access, read, download, and print. The journal uses the <a href="https://creativecommons.org/licenses/by-nc-nd/3.0/deed.ru" target="_blank" rel="noreferrer noopener">CREATIVE COMMONS (CC BY-NC-ND)</a> copyright statement for open access journals.</p> en-US <p>Authors who publish a manuscript in this journal agree to the following terms:</p> <ul> <li>The authors reserve the right to authorship of their work and transfer to the journal the right of first publication under the terms of the <a href="https://creativecommons.org/licenses/by-nc-nd/3.0/deed.ru" target="_blank" rel="noreferrer noopener">Creative Commons Attribution License</a>, which allows others to freely distribute the published work with a mandatory link to the the original work and the first publication of the work in this journal.</li> <li>Authors have the right to conclude independent additional agreements that relate to the non-exclusive distribution of the work in the form in which it was published by this journal (for example, to post the work in the electronic repository of the institution or publish as part of a monograph), providing the link to the first publication of the work in this journal.</li> <li>Other terms stated in the <a href="https://sj.astanait.edu.kz/wp-content/uploads/2022/03/%D0%90%D0%92%D0%A2%D0%9E%D0%A0%D0%A1%D0%9A%D0%98%D0%99-%D0%94%D0%9E%D0%93%D0%9E%D0%92%D0%9E%D0%A0-%D0%B1%D0%B5%D0%B7-%D0%B7%D0%B0%D1%8F%D0%B2%D0%BA%D0%B8-ENG.pdf" target="_blank" rel="noopener">Copyright Agreement.</a></li> </ul> ojs@astanait.edu.kz (Andrii Biloshchitskyi) ojs@astanait.edu.kz (Zhansaya Makhambetova) Sun, 30 Mar 2025 00:00:00 +0500 OJS 3.3.0.8 http://blogs.law.harvard.edu/tech/rss 60 THE IMPACT OF AI AND PEER FEEDBACK ON RESEARCH WRITING SKILLS: A STUDY USING THE CGSCHOLAR PLATFORM AMONG KAZAKHSTANI SCHOLARS https://journal.astanait.edu.kz/index.php/ojs/article/view/695 <h3>This research studies the impact of AI and Peer feedback on the academic writing development of Kazakhstani scholars using the CGScholar platform − the product of cutting-edge research and development into collaborative learning, big data, and artificial intelligence developed by educators and computer scientists at the University of Illinois Urbana-Champaign (UIUC). The study aimed to find out how familiarity with AI tools and peer feedback processes affects participants’ openness to incorporating feedback into their academic writing. The study involved 36 Bolashak scholars enrolled in a scientific internship focused on education at the University of UIUC. A survey with 15 questions with multiple-choice, Likert scale, and open-ended questions was employed to collect a data. The survey was conducted via Google Forms in both English and Russian to ensure linguistic accessibility. Demographic information such as age, gender, and first language were collected to provide a nuanced understanding of the data. The analysis revealed a moderate positive correlation between familiarity with AI tools and openness to making changes based on feedback, and a strong positive correlation between research writing experience and expectations of peer feedback, especially in the area of research methodology. These results show that participants are open minded to AI-assisted feedback, however they still highly appreciate peer input, especially regarding methodological guidance. This study demonstrates the potential benefits of integrating AI tools with traditional feedback mechanisms to improve research writing quality in academic settings. Further research is recommended to evaluate the long-term impact of AI and peer feedback on academic writing skills, particularly through longitudinal studies that assess skill retention over multiple feedback cycles. Additionally, expanding the study to include a more diverse academic audience will provide deeper insights into how feedback mechanisms function across different research cultures and disciplines.</h3> Raigul Zheldibayeva Copyright (c) 2025 Articles are open access under the Creative Commons License https://creativecommons.org/licenses/by-nc-nd/4.0 https://journal.astanait.edu.kz/index.php/ojs/article/view/695 Sun, 30 Mar 2025 00:00:00 +0500 CONSTRACTION OF DISTRIBUTION MODELS OF THE UNIVERSITY EDUCATIONAL WORK VOLUME https://journal.astanait.edu.kz/index.php/ojs/article/view/593 <p>With the advent of new time requirements for the quality of educational services, which is influenced by management in functioning business processes, existing research in the field of resource allocation in the management of complex processes, namely the calculation of the teaching load of university teaching staff, was studied. The purpose of the research in this article is to develop functional and mathematical distribution models of the university educational work volume, as well as an algorithm for optimizing the generation of educational flows and initialization of academic groups, taking into account the specifics of disciplines and classroom fund. The algorithm is based on the construction of all business processes implemented during the formation of educational streams and groups. The functional model described for the process of distributing the volume of educational work includes the definition of the main functions, their relationships, input and output data, as well as the criteria and restrictions that govern this process. The mathematical model is based on the representation of all types of educational work of departments of educational programs as a discrete set of resources that must be distributed between educational departments in accordance with the assumptions and restrictions accepted at the university. Data mining and operations research techniques were used to write the functional model. Empirical and quantitative methods were used to write a mathematical model. Thus, a new methodology has been developed for solving complex optimization problems that arise when modeling and optimizing the distribution of the volume of educational work of a university. It should be noted that comparative experiments under labor-intensive and time-limited conditions confirm the effectiveness of this technique in solving problems of distributing the amount of educational work among departments of educational programs, which in turn contributes to the implementation of high-quality software.</p> Diana Chigambayeva, Gulzhan Soltan Copyright (c) 2025 Articles are open access under the Creative Commons License https://creativecommons.org/licenses/by-nc-nd/4.0 https://journal.astanait.edu.kz/index.php/ojs/article/view/593 Sun, 30 Mar 2025 00:00:00 +0500 COMPARATIVE ANALYSIS OF VARIOUS FORECAST MODELS OF ELECTRICITY CONSUMPTION IN SMART BUILDINGS https://journal.astanait.edu.kz/index.php/ojs/article/view/641 <p>The rapidly growing field of smart building technology depends heavily on accurate electricity consumption forecasting. By anticipating energy demands, building managers can optimize resource allocation, minimize waste, and enhance overall efficiency. This study provides a comprehensive comparative analysis of various models used to forecast electricity consumption in smart buildings, highlighting their strengths, limitations, and suitability for different use cases. The investigation focuses on three major categories of forecasting models: statistical methods, machine learning techniques, and hybrid approaches. Statistical models, such as the Moving Average Method, leverage historical data patterns to predict future trends. These models enable analysts to utilize predictive analytics, simulating real-world environments and helping them make more informed decisions. The study offers a detailed comparison of several predictive models applied to Internet of Things (IoT) data, with a particular emphasis on energy consumption in smart buildings. Among the short-term forecasting models examined are gradient-enhanced regressors (XGBoost), random forest (RF), and long short-term memory networks (LSTM). The performance of these models was evaluated based on prediction errors to identify the most accurate one. Time series, machine learning, and hybrid models used to predict energy consumption are considered and analyzed. The focus is on the accuracy of forecasts and their applicability in real-world conditions, considering factors such as climate change and data obtained from Internet of Things (IoT) sensors. The analysis shows that hybrid models combining machine learning and time series provide the best prediction accuracy over different time horizons. It also highlights the importance of integrating user behavior data and using IoT technologies to improve model accuracy. The results can be applied to create energy-efficient control systems in smart buildings and optimize energy consumption.</p> Akylbek Tokhmetov, Kenzhegali Nurgaliyev, Liliya Tanchenko Copyright (c) 2025 Articles are open access under the Creative Commons License https://creativecommons.org/licenses/by-nc-nd/4.0 https://journal.astanait.edu.kz/index.php/ojs/article/view/641 Sun, 30 Mar 2025 00:00:00 +0500 DEVELOPMENT OF MACHINE LEARNING METHODS FOR MARKET TRENDS https://journal.astanait.edu.kz/index.php/ojs/article/view/663 <p>In the rapidly evolving real estate market, the application of machine learning (ML) is crucial for understanding and predicting price trends. This study evaluates and compares seven ML models, including multiple linear regression, random forest regression, support vector regression (SVR), decision tree regression, and XGBoost, to determine the most effective predictor of real estate prices in Astana, Kazakhstan. The study focuses on the Yesil district, a key area in the city, utilizing a dataset of over 9,000 records extracted from a broader collection of more than 30,000 real estate transactions across Kazakhstan. Through rigorous experimentation, model performance was assessed using statistical metrics such as mean absolute error (MAE), root mean square error (RMSE), and the coefficient of determination (R-squared). The results indicate that the Random Forest Regressor and XGBRegressor models outperformed others, achieving the highest R-squared values (99.55% and 99.18%, respectively) and the lowest MAE and RMSE values. These findings highlight their robustness in predicting housing prices with high accuracy. The primary objective of this study was to develop a precise ML model capable of accurately forecasting real estate prices in Astana based on key market attributes. The superior predictive performance of the Random Forest and XGBRegressor models justifies their selection for deployment in real-world applications. Their high predictive accuracy suggests their potential utility for real estate professionals, policymakers, and investors seeking data-driven insights into market dynamics. This research expands knowledge on the applications of ML in the real estate sector, reinforcing the importance of evidence-based decision-making within the industry.</p> Saya Sapakova, Zhansaya Bekaulova, Almas Nurlanuly , Duriya Daniyarova, Galiya Ybytayeva, Kaldybayeva Aizhan Copyright (c) 2025 Articles are open access under the Creative Commons License https://creativecommons.org/licenses/by-nc-nd/4.0 https://journal.astanait.edu.kz/index.php/ojs/article/view/663 Sun, 30 Mar 2025 00:00:00 +0500 FEATURE SELECTION METHODS FOR LSTM-BASED RIVER WATER LEVEL AND DISCHARGE FORECASTING https://journal.astanait.edu.kz/index.php/ojs/article/view/746 <p>Accurate forecasting of river discharge and water levels is essential for effective water resource management, flood mitigation, and public safety. This study compares correlation-based and PCA-based feature selection methods for LSTM forecasting models in the study area at Uba River basin, within Shemonaiha city in the East Kazakhstan region. The dataset spans from 1995 to 2021, with 1995 to 2019 used for training and validation and 2020 to 2021 for testing. Both feature selection methods reduced the original predictor set to 13 features while generally maintaining predictive accuracy. An ensemble of 10 LSTM models was trained using 60-day input sequences to forecast discharge and water levels over a 10-day horizon, reducing variance from random initialization and stabilizing predictions. Performance was evaluated using the Nash-Sutcliffe Efficiency. Results showed that correlation-based selection performed comparably to the full-feature baseline in 2020 test set, suggesting that removing highly correlated predictors did not decrease short-term forecasts capacity of the model. The model with PCA-based selected features, while slightly lagging at longer lead times in 2020, exhibited advantages in most lead times with 2021 forecasts. However, overall predictive performance declined in 2021 compared to 2020, indicating that the hydrological conditions deviate more from the historical training record, and suggesting the need for model updates with relevant historical training data. Both feature selection methods successfully reduced dimensionality, while preserving performance capacity, though neither was universally superior across all forecast lead times. These results emphasize the value of systematic feature selection in hydrological modeling and highlight the importance of model adaptability to evolving environmental conditions.</p> Almas Alzhanov, Aliya Nugumanova Copyright (c) 2025 Articles are open access under the Creative Commons License https://creativecommons.org/licenses/by-nc-nd/4.0 https://journal.astanait.edu.kz/index.php/ojs/article/view/746 Sun, 30 Mar 2025 00:00:00 +0500 EFFECTIVENESS OF MACHINE LEARNING METHODS IN DETERMINING EARTHQUAKE PROBABLE AREAS: EXAMPLE OF KAZAKHSTAN https://journal.astanait.edu.kz/index.php/ojs/article/view/648 <p>This study investigates the effectiveness of machine learning methods in identifying earthquake-prone areas in Kazakhstan and its neighboring regions. By leveraging a comprehensive dataset encompassing significant earthquake data from 1900 to 2023, various machine learning algorithms were employed, including RandomForest, GradientBoosting, Logistic Regression, Support Vector Classification (SVC), K-Nearest Neighbors (KNeighbors), Decision Tree, XGBoost, LightGBM, AdaBoost, and MLPClassifier. The primary objective was to analyze and compare the performance of these models in predicting earthquake magnitudes and frequencies. The results reveal that certain algorithms significantly outperformed others in terms of accuracy, underscoring the potential of machine learning techniques to enhance earthquake prediction capabilities. Notably, XGBoost and RandomForest demonstrated the highest predictive accuracy, suggesting their suitability for application in seismic risk assessment. These findings offer valuable insights for governmental agencies engaged in disaster management and prevention planning, highlighting the practical implications of integrating advanced analytical techniques in their strategies. In addition to model performance analysis, a visual heatmap was generated to illustrate the geographical distribution of earthquake occurrences across the studied regions. This visual representation effectively identifies high-risk areas, serving as a crucial tool for local authorities and researchers in making informed decisions regarding safety measures and emergency preparedness. This research contributes to the expanding body of knowledge on earthquake prediction utilizing machine learning, emphasizing the necessity for continuous improvement in predictive models by incorporating additional environmental and geological factors. The implications of these findings extend beyond academic discourse, holding significant potential for enhancing public safety in regions vulnerable to seismic activity. As such, this study advocates for the integration of machine learning methodologies in disaster management frameworks to mitigate risks and enhance preparedness in earthquake-prone regions.</p> Gulnur Kazbekova, Arypzhan Aben, Anuarbek Amanov , Nurseit Zhunissov , Aiman Abibullayeva Copyright (c) 2025 Articles are open access under the Creative Commons License https://creativecommons.org/licenses/by-nc-nd/4.0 https://journal.astanait.edu.kz/index.php/ojs/article/view/648 Sun, 30 Mar 2025 00:00:00 +0500 NEW APPROACH TO ADDRESSING CLASS IMBALANCE IN MEDICAL DATASETS CONSIDERING SPECIFICS https://journal.astanait.edu.kz/index.php/ojs/article/view/678 <p>Currently, the popularization of the integration of machine learning into the field of medicine for data processing and analysis is being traced, but at the same time difficulties such as class imbalance and noisy datasets arise. Due to the prevalence of the problem, there are already existing solutions, but in all of them there is an abstraction from the field of medicine, namely, gender, racial and other differences are not taken into account. It is this side of the problem that is solved in our resampling algorithm. A feature of our algorithm is the use of splitting the dataset by an important feature through the p-value of Spearman correlation, which helps to consider subgroups of observations without losing their unique characteristics and removing noise data using LOF and Z-score separately for minority and majority classes, respectively. Synthetic data is generated in a flexible way, adapting to the data set using algorithm parameters. Work is provided with both quantitative and nominative features. The algorithm was tested on datasets for heart attack, chronic kidney disease, and liver disease, and the Random Forest ensemble method was used to train the model. After applying this class balancing method, improvements were recorded on average in Accuracy by 36%, in AUC by 15-25%, in Precision by 39-42%, and in Recall by 21-37% compared with SMOTE, ADASYN algorithms and the data set before balancing. Applying the algorithm on medical data can improve the accuracy of the algorithm and reduce the loss of reliability compared to other resampling methods.</p> Zholdas Buribayev, Ainur Yerkos, Zhibek Zhetpisbay Copyright (c) 2025 Articles are open access under the Creative Commons License https://creativecommons.org/licenses/by-nc-nd/4.0 https://journal.astanait.edu.kz/index.php/ojs/article/view/678 Sun, 30 Mar 2025 00:00:00 +0500 ARTIFICIAL INTELLIGENCE-ENHANCED MOBILE DIAGNOSTICS USING DECISION TREES FOR EARLY DETECTION OF RESPIRATORY DISEASES https://journal.astanait.edu.kz/index.php/ojs/article/view/690 <p>This article is devoted to the identification of early diagnosis of respiratory lung diseases, such as chronic obstructive pulmonary disease and pneumonia, to reduce mortality and prevent complications. One of the most effective methods of structuring data is the Decision Tree model. The research focuses on the development and evaluation of a decision tree model, which is used to obtain data in the form of questionnaires, text files from patients, where they describe in detail the entire process of the disease, describing their symptoms and general condition at different time periods. There are a few criteria that patients must answer for a more accurate diagnosis. The developed methodology will allow processing relevant data with various symptoms to obtain reliable identification of the signs of the disease, as well as the stages of its progression; this can be done without the use of complex and high-tech devices that make diagnosis very accessible and feasible in the shortest possible time, if resources and time are limited. The article describes the model, carefully collected, and processed the necessary data, and then the results will be described in detail, covering many indicators such as accuracy, responsiveness, F1 score and ROC-AUC. The results of this analysis strongly suggest that this model is effective enough to provide a high level of accuracy combined with extensive capabilities, which determines its practical importance for use in real conditions. It is noted that the decision tree model can significantly improve the quality of diagnostics, since it is possible to structure a large amount of data and thus collect many years of human experience.</p> Aigerim Aitim, Zhamilya Abdildanova, Symbat Tynystykbayeva, Aidana Muratbekova, Nurbike Nalhozha Copyright (c) 2025 Articles are open access under the Creative Commons License https://creativecommons.org/licenses/by-nc-nd/4.0 https://journal.astanait.edu.kz/index.php/ojs/article/view/690 Sun, 30 Mar 2025 00:00:00 +0500