Group Orthogonalization Regularization For Vision Models Adaptation and Robustness
Yoav Kurtz, Noga Bar, Raja Giryes
TL;DR
Group Orthogonalization Regularization (GOR) targets redundant parameter coupling by enforcing orthonormality within groups of filters in each layer, dramatically reducing computational burden compared with full-layer orthogonalization. By aligning filter-group partitions with normalization groups, GOR enhances expressivity and diversity, improving adaptation performance for Vision Transformers and diffusion models, and boosting robustness under adversarial training. The approach integrates smoothly with LoRA/AdaptFormer-style adapters, yielding consistent gains across CIFAR-10, downstream ViT tasks, and text-to-image generation, while maintaining efficiency as layer dimensionality grows. Overall, GOR offers a practical, scalable regularization that reduces redundancy, strengthens transfer and robustness, and complements existing normalization strategies.
Abstract
As neural networks become deeper, the redundancy within their parameters increases. This phenomenon has led to several methods that attempt to reduce the correlation between convolutional filters. We propose a computationally efficient regularization technique that encourages orthonormality between groups of filters within the same layer. Our experiments show that when incorporated into recent adaptation methods for diffusion models and vision transformers (ViTs), this regularization improves performance on downstream tasks. We further show improved robustness when group orthogonality is enforced during adversarial training. Our code is available at https://github.com/YoavKurtz/GOR.
