DEEP LEARNING-BASED FACE MASK DETECTION USING YOLOV5 MODEL

Authors

DOI:

https://doi.org/10.37943/12TXQS9259

Keywords:

YOLOv5, CNN, covid-19, target detection, deep learning

Abstract

Based on the background of rapid transmission of novel coronavirus and various pneumonia, wearing masks becomes the best solution to effectively reduce the probability of transmission. For a series of problems arising from crowded public places and collective units, where face recognition is difficult to increase target density, a deep convolutional neural network is used for real-time mask detection and recognition.
This paper presents the method based on YOLOv5 model for deep learning and mask detection in image recognition as well as a live camera to label the pedestrians without masks in time. This experiment will use LabelImg software to preprocess 5003 images and make lightweight improvements based on the original YOLOv5 model to generate the final face mask recognition model. The Mosaic method is added to merge the images effectively and process the images in batch, and secondly, the GIoU loss function is selected to calculate the bounding box regression loss by comparison, which improves the localization accuracy even more. According to the experimental detection results, analogized with the original model YOLOv5, the recall and accuracy are effectively improved. In this paper, YOLOv3, SSD, Fast-R-CNN detection algorithms are used for comparison, the detection results of this model have a high mAP value which is equal to 92.9, which are higher than the detection results of other models.
Real-time target recognition based on this model combined with practical applications can be applied in hospitals and crowd-gathering places to achieve effective reduction of epidemic transmission probability in a short period of time.

Author Biographies

Saya Sapakova, International University of Information Technology

Cand. of ph. and math. sc., Associate Professor of the Department
of Computer Engineering

Yelidana Yilibule, Al-Farabi Kazakh National University

Master student of the Department Information Systems

References

Zhao, W. M., Song, S. H., Chen, M. L., Zou, D., Ma, L. N., Ma, Y. K., ... & Bao, Y. M. (2020). The 2019 novel coronavirus resource. Hereditas, 42(2), 212-221.

Pramita, M. D., Kurniawen, B., & Utame, N P. (2020). Mast wearing classification using CNN, 7th International Conference on Advance Informatics: Concepts, Theory and Applications (ICAICTA). IEEE, 1-4.

CAO, C.S., & YUAN, J. (2021). YOLO-Mask algorithm based mask wear detection method. Advances in Excitation and Optoelectronics, 58(8), 211-218

Review: YOLOv1 – You Only Look Once (Object Detection). (2022). Retrieved from https://towardsdatascience.com/yolov1-you-only-look-once-object-detection-e1f3ffec8a89

Sharma, A. (2022). A Better, Faster, and Stronger Object Detector (YOLOv2) – PyImageSearch. Retrieved from https://pyimagesearch.com/2022/04/18/a-better-faster-and-stronger-objectdetector-yolov2/

Papers with Code – YOLOv3 Explained. (2022). Retrieved from https://paperswithcode.com/method/yolov3

Papers with code – yolov4 explained. Explained | Papers With Code. (n.d.). Retrieved from https://paperswithcode.com/method/yolov4

Jiang, W. (2022). Study on resnet and EfficientNet Remote Sensing Image Scene Classification. Computer Science and Application, 12(5), 1301–1313. https://doi.org/10.12677/CSA.2022.125130

Shi, L., Zhou, Z., & Guo, Z. (2021). Face anti-spoofing using Spatial Pyramid pooling. 2020 25th International Conference on Pattern Recognition (ICPR). https://doi.org/10.1109/ICPR48806.2021.9412407

Brailsford, S. C., Potts, C. N., & Smith, B. M. (1999). Constraint satisfaction problems: Algorithms and applications. European journal of operational research, 119(3), 557-581.

Wang, X., & Song, J. (2021). ICIOU: Improved loss based on complete intersection over union for bounding box regression. IEEE Access, 9, 105686–105695. https://doi.org/10.1109/ACCESS.2021.3100414

Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). https://doi.org/10.1109/CVPR.2016.91

Alexey Bochkovskiy, Chien-Yao Wang, & Hong-Yuan Mark Liao. (2020). YOLOv4: Optimal speed and accuracy of object detection. ArXiv: Computer vision and pattern recognition.

Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., & Ren, D. (2020). Distance-IoU loss: Faster and better learning for bounding box regression. Proceedings of the AAAI Conference on Artificial Intelligence, 34(07), 12993–13000. https://doi.org/10.1609/aaai.v34i07.6999

Zhaohui Zheng, Ping Wang, Wei Liu, Jinze Li, Rongguang Ye, & Dongwei Ren. (2019). Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression. ArXiv: Computer Vision and Pattern

Recognition. Retrieved from http://export.arxiv.org/pdf/1911.08287

Khan, S. A., & Ali Rana, Z. (2019). Evaluating Performance of Software Defect Prediction Models Using Area Under Precision-Recall Curve (AUC-PR). 2019 2nd International Conference on Advancements in Computational Sciences (ICACS). https://doi.org/10.23919/ICACS.2019.8689135

Pereira, N. (2022). PereiraASLNet: ASL letter recognition with Yolox taking mean average precision and inference time considerations. 2022 2nd International Conference on Artificial Intelligence and Signal Processing (AISP). https://doi.org/10.1109/AISP53593.2022.9760665

Aagten-Murphy, D., & Bays, P. M. (2019). Independent working memory resources for egocentric and allocentric spatial information. PLOS Computational Biology, 15(2), e1006563. https://doi.org/10.1371/journal.pcbi.1006563

Downloads

Published

2022-12-30

How to Cite

Sapakova, S., & Yilibule, Y. (2022). DEEP LEARNING-BASED FACE MASK DETECTION USING YOLOV5 MODEL. Scientific Journal of Astana IT University, 12(12), 5–13. https://doi.org/10.37943/12TXQS9259

Issue

Section

Articles
betpas