RepVGG-GELAN: Enhanced GELAN with VGG-STYLE ConvNets for Brain Tumour Detection
Thennarasi Balakrishnan, Sandeep Singh Sengar
TL;DR
This work tackles brain-tumor detection in medical imagery by leveraging a YOLO-based detector enhanced with RepVGG reparameterized convolutions and GELAN backbones. The authors introduce RepVGG-GELAN, which combines RepVGG blocks, RepNCSPELAN4, and GELAN-inspired multi-scale feature fusion to improve precision and localization while maintaining efficiency, evidenced by higher precision ($$0.982$$) and AP$_{50}$ ($$0.970$$) on Br35H with 25.4M parameters. An ablation study confirms the contribution of RepVGG to performance gains, underscoring the value of reparameterized convolution and GELAN fusion for robust brain-tumor detection. The approach demonstrates strong potential for real-time, accurate medical image analysis, with publicly available code supporting reproducibility and deployment.
Abstract
Object detection algorithms particularly those based on YOLO have demonstrated remarkable efficiency in balancing speed and accuracy. However, their application in brain tumour detection remains underexplored. This study proposes RepVGG-GELAN, a novel YOLO architecture enhanced with RepVGG, a reparameterized convolutional approach for object detection tasks particularly focusing on brain tumour detection within medical images. RepVGG-GELAN leverages the RepVGG architecture to improve both speed and accuracy in detecting brain tumours. Integrating RepVGG into the YOLO framework aims to achieve a balance between computational efficiency and detection performance. This study includes a spatial pyramid pooling-based Generalized Efficient Layer Aggregation Network (GELAN) architecture which further enhances the capability of RepVGG. Experimental evaluation conducted on a brain tumour dataset demonstrates the effectiveness of RepVGG-GELAN surpassing existing RCS-YOLO in terms of precision and speed. Specifically, RepVGG-GELAN achieves an increased precision of 4.91% and an increased AP50 of 2.54% over the latest existing approach while operating at 240.7 GFLOPs. The proposed RepVGG-GELAN with GELAN architecture presents promising results establishing itself as a state-of-the-art solution for accurate and efficient brain tumour detection in medical images. The implementation code is publicly available at https://github.com/ThensiB/RepVGG-GELAN.
