DEEP LEARNING-BASED FACE MASK DETECTION USING YOLOV5 MODEL
DOI:
https://doi.org/10.37943/12TXQS9259Keywords:
YOLOv5, CNN, covid-19, target detection, deep learningAbstract
Based on the background of rapid transmission of novel coronavirus and various pneumonia, wearing masks becomes the best solution to effectively reduce the probability of transmission. For a series of problems arising from crowded public places and collective units, where face recognition is difficult to increase target density, a deep convolutional neural network is used for real-time mask detection and recognition.
This paper presents the method based on YOLOv5 model for deep learning and mask detection in image recognition as well as a live camera to label the pedestrians without masks in time. This experiment will use LabelImg software to preprocess 5003 images and make lightweight improvements based on the original YOLOv5 model to generate the final face mask recognition model. The Mosaic method is added to merge the images effectively and process the images in batch, and secondly, the GIoU loss function is selected to calculate the bounding box regression loss by comparison, which improves the localization accuracy even more. According to the experimental detection results, analogized with the original model YOLOv5, the recall and accuracy are effectively improved. In this paper, YOLOv3, SSD, Fast-R-CNN detection algorithms are used for comparison, the detection results of this model have a high mAP value which is equal to 92.9, which are higher than the detection results of other models.
Real-time target recognition based on this model combined with practical applications can be applied in hospitals and crowd-gathering places to achieve effective reduction of epidemic transmission probability in a short period of time.
References
Zhao, W. M., Song, S. H., Chen, M. L., Zou, D., Ma, L. N., Ma, Y. K., ... & Bao, Y. M. (2020). The 2019 novel coronavirus resource. Hereditas, 42(2), 212-221.
Pramita, M. D., Kurniawen, B., & Utame, N P. (2020). Mast wearing classification using CNN, 7th International Conference on Advance Informatics: Concepts, Theory and Applications (ICAICTA). IEEE, 1-4.
CAO, C.S., & YUAN, J. (2021). YOLO-Mask algorithm based mask wear detection method. Advances in Excitation and Optoelectronics, 58(8), 211-218
Review: YOLOv1 – You Only Look Once (Object Detection). (2022). Retrieved from https://towardsdatascience.com/yolov1-you-only-look-once-object-detection-e1f3ffec8a89
Sharma, A. (2022). A Better, Faster, and Stronger Object Detector (YOLOv2) – PyImageSearch. Retrieved from https://pyimagesearch.com/2022/04/18/a-better-faster-and-stronger-objectdetector-yolov2/
Papers with Code – YOLOv3 Explained. (2022). Retrieved from https://paperswithcode.com/method/yolov3
Papers with code – yolov4 explained. Explained | Papers With Code. (n.d.). Retrieved from https://paperswithcode.com/method/yolov4
Jiang, W. (2022). Study on resnet and EfficientNet Remote Sensing Image Scene Classification. Computer Science and Application, 12(5), 1301–1313. https://doi.org/10.12677/CSA.2022.125130
Shi, L., Zhou, Z., & Guo, Z. (2021). Face anti-spoofing using Spatial Pyramid pooling. 2020 25th International Conference on Pattern Recognition (ICPR). https://doi.org/10.1109/ICPR48806.2021.9412407
Brailsford, S. C., Potts, C. N., & Smith, B. M. (1999). Constraint satisfaction problems: Algorithms and applications. European journal of operational research, 119(3), 557-581.
Wang, X., & Song, J. (2021). ICIOU: Improved loss based on complete intersection over union for bounding box regression. IEEE Access, 9, 105686–105695. https://doi.org/10.1109/ACCESS.2021.3100414
Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). https://doi.org/10.1109/CVPR.2016.91
Alexey Bochkovskiy, Chien-Yao Wang, & Hong-Yuan Mark Liao. (2020). YOLOv4: Optimal speed and accuracy of object detection. ArXiv: Computer vision and pattern recognition.
Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., & Ren, D. (2020). Distance-IoU loss: Faster and better learning for bounding box regression. Proceedings of the AAAI Conference on Artificial Intelligence, 34(07), 12993–13000. https://doi.org/10.1609/aaai.v34i07.6999
Zhaohui Zheng, Ping Wang, Wei Liu, Jinze Li, Rongguang Ye, & Dongwei Ren. (2019). Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression. ArXiv: Computer Vision and Pattern
Recognition. Retrieved from http://export.arxiv.org/pdf/1911.08287
Khan, S. A., & Ali Rana, Z. (2019). Evaluating Performance of Software Defect Prediction Models Using Area Under Precision-Recall Curve (AUC-PR). 2019 2nd International Conference on Advancements in Computational Sciences (ICACS). https://doi.org/10.23919/ICACS.2019.8689135
Pereira, N. (2022). PereiraASLNet: ASL letter recognition with Yolox taking mean average precision and inference time considerations. 2022 2nd International Conference on Artificial Intelligence and Signal Processing (AISP). https://doi.org/10.1109/AISP53593.2022.9760665
Aagten-Murphy, D., & Bays, P. M. (2019). Independent working memory resources for egocentric and allocentric spatial information. PLOS Computational Biology, 15(2), e1006563. https://doi.org/10.1371/journal.pcbi.1006563
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Articles are open access under the Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Authors who publish a manuscript in this journal agree to the following terms:
- The authors reserve the right to authorship of their work and transfer to the journal the right of first publication under the terms of the Creative Commons Attribution License, which allows others to freely distribute the published work with a mandatory link to the the original work and the first publication of the work in this journal.
- Authors have the right to conclude independent additional agreements that relate to the non-exclusive distribution of the work in the form in which it was published by this journal (for example, to post the work in the electronic repository of the institution or publish as part of a monograph), providing the link to the first publication of the work in this journal.
- Other terms stated in the Copyright Agreement.