INVESTIGATION OF DEEP LEARNING MODELS BASED ON SINGLE-LAYER SimpleRNN, LSTM AND GRU NETWORKS FOR RECOGNIZING SOUNDS OF UAV DISTANCES

Authors

DOI:

https://doi.org/10.37943/19XNOV6347

Keywords:

UAVs, UAV states, UAV sound recognition, UAV sound distance recognition, suspicious drone, SimpleRNN network, LSTM network, GRU network

Abstract

In recent years, the potential risks posed by easily moving objects have highlighted the need for intelligent surveillance systems in protected areas, primarily to ensure the safety of human lives. Among the most common of these objects are unmanned aerial vehicles (UAVs). Recent advances in deep learning techniques for recognizing audio signals have made these techniques effective in identifying moving or aerial objects, especially those powered by engines. And the growing deployment of UAVs has made their rapid recognition in various suspicious or unauthorized circumstances critical. Detecting suspicious drone flights, especially in restricted areas, remains a significant research challenge. It is vital to perform the task of determining their distance in order to quickly detect drones approaching people in such protected areas. Therefore, this paper aims to study the research question of recognizing UAV audio data from different distances. That is, recognizing drone audio at different distances was experimentally studied using Simple RNN, LSTM and GRU based deep learning models. The main objective of this study is based on finding one of the capable types of recurrent network for the task of recognizing UAV audio data at different distances. During the experimental study, the recognition abilities of Single-layer Simple RNN, LSTM and GRU recurrent network types were studied from two basic directions: with recognition accuracy curves and classification reports. As a result, LSTM and GRU based models showed high recognition ability for these types of audio signals. It was noted that UAVs can reliably predict distances greater than 10 meters based on the proposed deep learning architecture.

Author Biographies

Dana Utebayeva, Satbayev University, Kazakhstan

PhD, Researcher, Department of Electronics, Telecommunications and ST

Lyazzat Ilipbayeva , International Information Technology University, Kazakhstan

Candidate of Technical Sciences, Acting associate professor, Department of Radio-engineering, Electronics, Telecommunications

References

Taha B. and Shoufan A. (2019). Machine Learning-Based Drone Detection and Classification: State-of-the-Art in Research. IEEE Access, vol. 7, pp. 138669-138682, doi: https://doi.org/10.1109/ACCESS.2019.2942944.

First drone crash with a commercial aircraft in Canada triggers safety review and possible new rules. Available at: https://www.ediweekly.com/first-drone-crash-commercial-aircraft-canada-triggers-safety-review-possible-new-rules/

Patrick H. Hundreds of drones crash after glitching during show in China. (2023). Available at: https://www.independent.co.uk/tv/lifestyle/china-drone-crash-zoo-show-b2394312.html, Wednesday 16 August.

Kosenov A. Kazakhstan podtverdil proniknoveniye uzbekskogo bespilotnika na svoyu territoriyu. (2012). Available at: https://tengrinews.kz/events/kazahstan-podtverdil-proniknovenie-uzbekskogo-bespilotnika-208687/.

Seidaliyeva, U.; Ilipbayeva, L.; Taissariyeva, K.; Smailov, N.; Matson, E.T. (2024). Advances and Challenges in Drone Detection and Classification Techniques: A State-of-the-Art Review. Sensors, 24, 125. https://doi.org/10.3390/s24010125

Ilipbayeva L.B., Seydaliyeva U.O., Smaylov N.K., Matson E.T. (2024). Research of UAV detection using modified yoloalgorithm. Vestnik Almatinskogo universiteta energetiki i svyazi No 2(65) https://doi.org/10.51775/2790-0886_2024_65_2_179

Zhanbirova A. (2024). UAV crashes near airport in Kyrgyzstan. Available at: https://kz.kursiv.media/en/2024-08-15/uav-crashes-near-airport-in-kyrgyzstan/ (accessed on August 15, 2024 21:41)

Utebayeva D. and Yembergenova A. (2024). Study a deep learning-based audio classification for detecting the distance of UAV. IEEE International Conference on Evolving and Adaptive Intelligent Systems (EAIS), Madrid, Spain, 2024, pp. 1-7, https://doi.org/10.1109/EAIS58494.2024.10569107.

Mkrtchian G. and Furletov Y. (2022). Classification of Environmental Sounds Using Neural Networks. Systems of Signal Synchronization, Generating and Processing in Telecommunications (SYNCHROINFO), Arkhangelsk, Russian Federation, pp. 1-4, http://dx.doi.org/10.1109/SYNCHROINFO55067.2022.9840922.

Momynkulov Z., Omarov N. and Altayeva A. (2024) CNN-RNN Hybrid Model For Dangerous Sound Detection in Urban Area. IEEE 4th International Conference on Smart Information Systems and Technologies (SIST), Astana, Kazakhstan, pp. 284-289, http://dx.doi.org/10.1109/SIST61555.2024.10629358.

Babu K. A. and Ramkumar B. (2020). Automatic Recognition of Fundamental Heart Sound Segments From PCG Corrupted With Lung Sounds and Speech," in IEEE Access, vol. 8, pp. 179983-179994, https://doi.org/10.1109/ACCESS.2020.3023044.

Naveen Sundar G., Subramanian S., Narmadha D., Malin Bruntha P., I. Thanakumar Joseph S and S. S. (2024). Improved Heart Sound Classification Using LSTM Based Deep Learning Technique. 5th International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV), Tirunelveli, India, pp. 557-561, http://dx.doi.org/10.1109/ICICV62344.2024.00094.

Bubashait M. and Hewahi N. (2021). Urban Sound Classification Using DNN, CNN & LSTM a Comparative Approach. International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT), Zallaq, Bahrain, 2021, pp. 46-50, https://doi.org/10.1109/3ICT53449.2021.9581339.

Hayashi T., Watanabe S., Toda T., Hori T., Le Roux J. and Takeda K. (2017). Duration-Controlled LSTM for Polyphonic Sound Event Detection. IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 11, pp. 2059-2070, Nov., https://doi.org/10.1109/TASLP.2017.2740002.

Liu J. et al. (2018). Bowel Sound Detection Based on MFCC Feature and LSTM Neural Network. IEEE Biomedical Circuits and Systems Conference (BioCAS), Cleveland, OH, USA, pp. 1-4, doi: https://doi.org/10.1109/BIOCAS.2018.8584723.

Huang Z., Tang J., Xue S. and Dai L. (2016). Speaker adaptation OF RNN-BLSTM for speech recognition based on speaker code. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, pp. 5305-5309, https://doi.org/10.1109/ICASSP.2016.7472690.

Hwang K. and Sung W. (2016). Character-level incremental speech recognition with recurrent neural networks. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, 2016, pp. 5335-5339, doi: https://doi.org/10.1109/ICASSP.2016.7472696.

Lotfidereshgi R. and Gournay P. (2018). Speech Prediction Using an Adaptive Recurrent Neural Network with Application to Packet Loss Concealment. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, AB, Canada, pp. 5394-5398, https://doi.org/10.1109/ICASSP.2018.8462185.

Momynkulov Z., Omarov N. and Uxikbayev Y. (2024). Detection of Dangerous Situations by Sounds in Real-Time Using Deep Learning. IEEE 4th International Conference on Smart Information Systems and Technologies (SIST), Astana, Kazakhstan, pp. 278-283, http://dx.doi.org/10.1109/SIST61555.2024.10629572.

Jose T. and Mayan J. A. (2023). Real-Time Sound Detection of Rose-Ringed Parakeet Using LSTM Network with MFCC and Mel Spectrogram. Annual International Conference on Emerging Research Areas: International Conference on Intelligent Systems (AICERA/ICIS), Kanjirapally, India, pp. 1-6, https://doi.org/10.1109/AICERA/ICIS59538.2023.10420143.

Elghamrawy S. M. and Edin Ibrahim S. (2021). Audio Signal Processing and Musical Instrument Detection using Deep Learning Techniques. 9th International Japan-Africa Conference on Electronics, Communications, and Computations (JAC-ECC), Alexandria, Egypt, pp. 146-149, https://doi.org/10.1109/JAC-ECC54461.2021.9691427.

Kamepalli S., Rao B. S. and Venkata Krishna Kishore K. (2022). Multi-Class Classification and Prediction of Heart Sounds Using Stacked LSTM to Detect Heart Sound Abnormalities. 3rd International Conference for Emerging Technology (INCET), Belgaum, India, pp. 1-6, https://doi.org/10.1109/INCET54531.2022.9825189.

Dosbayev, Z. et al. (2021). Audio Surveillance: Detection of Audio-Based Emergency Situations. In: Wojtkiewicz, K., Treur, J., Pimenidis, E., Maleszka, M. (eds) Advances in Computational Collective Intelligence. ICCCI. Communications in Computer and Information Science, vol 1463. Springer, Cham. https://doi.org/10.1007/978-3-030-88113-9_33

Sajad S., Dharshika S. and Meleet S. (2021). Music Generation for Novices Using Recurrent Neural Network (RNN). International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES), Chennai, India, pp. 1-6, https://doi.org/10.1109/ICSES52305.2021.9633906.

Yang B., Matson E. T., Smith A. H., Dietz J. E. and Gallagher J. C. (2019). UAV Detection System with Multiple Acoustic Nodes Using Machine Learning Models. Third IEEE International Conference on Robotic Computing (IRC), Naples, Italy, pp. 493-498, https://doi.org/10.1109/IRC.2019.00103.

Dumitrescu, C.; Minea, M.; Costea, I.M.; Cosmin Chiva, I.; Semenescu, A. (2020). Development of an Acoustic System for UAV Detection. Sensors, 20, 4870. https://doi.org/10.3390/s20174870

Wang Y., Fagian Y., Ho K. E. and Matson E. T. (2021). A Feature Engineering Focused System for Acoustic UAV Detection. Fifth IEEE International Conference on Robotic Computing (IRC), Taichung, Taiwan, pp. 125-130, https://doi.org/10.1109/IRC52146.2021.00031.

Didkovskyi V., Kozeruk S. and Korzhik O. (2019). Simple Acoustic Array for Small UAV Detection. IEEE 39th International Conference on Electronics and Nanotechnology (ELNANO), Kyiv, Ukraine, pp. 656-659, https://doi.org/10.1109/ELNANO.2019.8783262.

Jeon S., Shin J. -W., Lee Y. -J., Kim W. -H., Kwon Y. and Yang Y. (2017). Empirical study of drone sound detection in real-life environment with deep neural networks. 25th European Signal Processing Conference (EUSIPCO), Kos, Greece, pp. 1858-1862, https://doi.org/10.23919/EUSIPCO.2017.8081531.

Ku I., Roh S., Kim G., Taylor C., Wang C. and Matson E. T. (2022). UAV Payload Detection Using Deep Learning and Data Augmentation. Sixth IEEE International Conference on Robotic Computing (IRC), Italy, pp. 18-25, https://doi.org/10.1109/IRC55401.2022.00009.

Katta S. S., Nandyala S., Viegas S. and AlMahmoud A. (2022). Benchmarking Audio-based Deep Learning Models for Detection and Identification of Unmanned Aerial Vehicles. Workshop on Benchmarking Cyber-Physical Systems and Internet of Things (CPS-IoTBench), Milan, Italy, pp. 7-11, https://ieeexplore.ieee.org/document/9805345.

Information from the Internet [mavic.kz] - Available at: https://mavic.kz/product/dron-dji-mini-2-fly-more-combo/

Downloads

Published

2024-09-30

How to Cite

Utebayeva, D., & Ilipbayeva , L. . (2024). INVESTIGATION OF DEEP LEARNING MODELS BASED ON SINGLE-LAYER SimpleRNN, LSTM AND GRU NETWORKS FOR RECOGNIZING SOUNDS OF UAV DISTANCES . Scientific Journal of Astana IT University, 19, 60–75. https://doi.org/10.37943/19XNOV6347

Issue

Section

Information Technologies