Dynamic Attention and Bi-directional Fusion for Safety Helmet Wearing Detection
Junwei Feng, Xueyan Fan, Yuyang Chen, Yi Li
TL;DR
The paper tackles real-time safety helmet wearing detection in cluttered construction environments where helmets are small and frequently occluded. It introduces DABFNet, combining a Dynamic Attention Detection Head (DAHead), a Bi-directional Weighted Feature Pyramid Network (BWFPN), and Wise-IoU loss (WIoU) to enhance multi-scale feature fusion and object localization. Experimental results on the SHWD dataset show improved accuracy (notably mAP@0.5:0.95) and reduced computational load, with ablations confirming the effectiveness of each component. This approach offers a practical, edge-friendly solution for construction-site safety monitoring with potential impact on real-world helmet compliance enforcement.
Abstract
Ensuring construction site safety requires accurate and real-time detection of workers' safety helmet use, despite challenges posed by cluttered environments, densely populated work areas, and hard-to-detect small or overlapping objects caused by building obstructions. This paper proposes a novel algorithm for safety helmet wearing detection, incorporating a dynamic attention within the detection head to enhance multi-scale perception. The mechanism combines feature-level attention for scale adaptation, spatial attention for spatial localization, and channel attention for task-specific insights, improving small object detection without additional computational overhead. Furthermore, a two-way fusion strategy enables bidirectional information flow, refining feature fusion through adaptive multi-scale weighting, and enhancing recognition of occluded targets. Experimental results demonstrate a 1.7% improvement in mAP@[.5:.95] compared to the best baseline while reducing GFLOPs by 11.9% on larger sizes. The proposed method surpasses existing models, providing an efficient and practical solution for real-world construction safety monitoring.
