R2Det: Exploring Relaxed Rotation Equivariance in 2D object detection
Zhiqiang Wu, Yingjie Liu, Hanlin Dong, Xuan Tang, Jian Yang, Bo Jin, Mingsong Chen, Xian Wei
TL;DR
This work tackles Rotational Symmetry-Breaking in 2D object detection by introducing a Relaxed Rotation-Equivariant framework. It defines a Relaxed Rotation-Equivariant GConv (R2GConv) built on a learnable perturbation $ \boldsymbol{\triangle} $ that perturbs the rotation group from $ \mathbf{C}_n $ to $ \mathbf{R}_n $, enabling end-to-end learning of relaxed equivariance. The authors design a lightweight backbone (R2Net) and a redesigned detector (R2Det) that leverage R2GConv with an efficient mix of lifting, point-wise, and depth-wise operations, achieving faster convergence and superior AP on VOC and COCO with fewer parameters and lower FLOPs. Extensive ablations, visualizations, and plug-and-play experiments (e.g., with YOLOv8) validate the effectiveness and generality of R2GConv and RRE modeling for real-world symmetry-breaking scenarios. The approach offers practical impact by enabling robust, efficient rotation-aware detection under imperfect symmetry while preserving compatibility with existing architectures and tasks.
Abstract
Group Equivariant Convolution (GConv) empowers models to explore underlying symmetry in data, improving performance. However, real-world scenarios often deviate from ideal symmetric systems caused by physical permutation, characterized by non-trivial actions of a symmetry group, resulting in asymmetries that affect the outputs, a phenomenon known as Symmetry Breaking. Traditional GConv-based methods are constrained by rigid operational rules within group space, assuming data remains strictly symmetry after limited group transformations. This limitation makes it difficult to adapt to Symmetry-Breaking and non-rigid transformations. Motivated by this, we mainly focus on a common scenario: Rotational Symmetry-Breaking. By relaxing strict group transformations within Strict Rotation-Equivariant group $\mathbf{C}_n$, we redefine a Relaxed Rotation-Equivariant group $\mathbf{R}_n$ and introduce a novel Relaxed Rotation-Equivariant GConv (R2GConv) with only a minimal increase of $4n$ parameters compared to GConv. Based on R2GConv, we propose a Relaxed Rotation-Equivariant Network (R2Net) as the backbone and develop a Relaxed Rotation-Equivariant Object Detector (R2Det) for 2D object detection. Experimental results demonstrate the effectiveness of the proposed R2GConv in natural image classification, and R2Det achieves excellent performance in 2D object detection with improved generalization capabilities and robustness. The code is available in \texttt{https://github.com/wuer5/r2det}.
