Adaptive aggregation of Monte Carlo augmented decomposed filters for efficient group-equivariant convolutional neural network
Wenzhao Zhao, Barbara D. Wichtmann, Steffen Albert, Angelika Maurer, Frank G. Zöllner, Jürgen Hesser
TL;DR
This work targets the computational bottleneck of group-equivariant CNNs that rely on parameter sharing. It introduces a non-parameter-sharing framework built on adaptive aggregation of Monte Carlo augmented decomposed filters (MCG-CNN), together with a weight-augmented variant (WMCG-CNN), that handles diverse transformations for both continuous and discrete groups, including shear transforms. The method combines Monte Carlo integration for group convolutions with filter decomposition into basis functions (e.g., Fourier-Bessel, Mexican hat) and, for discrete groups, bootstrap resampling, so that it can be deployed in standard CNN backbones while preserving equivariance guarantees. Empirically, WMCG-CNN achieves superior or competitive performance on ImageNet and various denoising tasks with comparable parameter counts and computational cost, demonstrating improved sample efficiency and robustness to affine transformations. The work suggests a practical pathway to broader, more efficient group-equivariant architectures and points to future directions in basis design and application to other vision tasks.
Abstract
Group-equivariant convolutional neural networks (G-CNNs) rely heavily on parameter sharing to increase the data efficiency and performance of CNNs. However, the parameter-sharing strategy greatly increases the computational burden for each added parameter, which hampers its application to deep neural network models. In this paper, we address this problem by proposing a non-parameter-sharing approach for group-equivariant neural networks. The proposed method adaptively aggregates a diverse range of filters through a weighted sum of stochastically augmented decomposed filters. We give a theoretical proof of how group equivariance can be achieved by our method. The method applies to both continuous and discrete groups, where the augmentation is implemented using Monte Carlo sampling and bootstrap resampling, respectively. It also serves as an efficient extension of standard CNNs. Experiments show that, by using suitable filter bases to build efficient lightweight networks, our method outperforms parameter-sharing group-equivariant networks and enhances the performance of standard CNNs in image classification and denoising tasks. The code will be available at https://github.com/ZhaoWenzhao/MCG_CNN.
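
To make the phrase "weighted sum of stochastically augmented decomposed filters" concrete, the following is a minimal NumPy sketch and not the authors' implementation. It assumes a single Mexican hat basis function, a random affine map composed of rotation, shear, and isotropic scaling, and one set of sampled filters shared across all channel pairs. The function names (`mexican_hat`, `sample_affine`, `augmented_basis`, `aggregate`) and the sampling ranges are hypothetical choices made only for illustration.

```python
import numpy as np

def mexican_hat(x, y, sigma=1.0):
    """Unnormalised 2-D Mexican hat (Ricker) function."""
    r2 = (x ** 2 + y ** 2) / sigma ** 2
    return (1.0 - 0.5 * r2) * np.exp(-0.5 * r2)

def sample_affine(rng, max_scale=1.5, max_shear=0.25):
    """Draw one random 2x2 map: scale * rotation @ shear (ranges are assumptions)."""
    theta = rng.uniform(0.0, 2.0 * np.pi)
    scale = rng.uniform(1.0, max_scale)
    shear = rng.uniform(-max_shear, max_shear)
    rot = np.array([[np.cos(theta), -np.sin(theta)],
                    [np.sin(theta),  np.cos(theta)]])
    shr = np.array([[1.0, shear],
                    [0.0, 1.0]])
    return scale * (rot @ shr)

def augmented_basis(kernel_size, n_samples, rng):
    """Evaluate the basis function on n_samples randomly transformed grids."""
    half = (kernel_size - 1) / 2.0
    coords = np.linspace(-half, half, kernel_size)
    yy, xx = np.meshgrid(coords, coords, indexing="ij")
    grid = np.stack([xx.ravel(), yy.ravel()])       # shape (2, k*k)
    filters = []
    for _ in range(n_samples):
        a = sample_affine(rng)
        u, v = np.linalg.inv(a) @ grid              # pull the grid back through the map
        filters.append(mexican_hat(u, v).reshape(kernel_size, kernel_size))
    return np.stack(filters)                        # shape (n_samples, k, k)

def aggregate(basis, weights):
    """Weighted sum over sampled filters.

    basis:   (n_samples, k, k), fixed after sampling
    weights: (out_channels, in_channels, n_samples), the trainable part
    returns: (out_channels, in_channels, k, k) convolution kernels
    """
    return np.einsum("oin,nkl->oikl", weights, basis)

rng = np.random.default_rng(0)
basis = augmented_basis(kernel_size=5, n_samples=8, rng=rng)
weights = rng.normal(size=(4, 3, 8))
kernels = aggregate(basis, weights)
print(kernels.shape)  # (4, 3, 5, 5)
```

The resulting `kernels` array has the layout of an ordinary convolution weight tensor, which illustrates why such an aggregation can slot into a standard CNN backbone. Sharing one set of sampled filters across all channel pairs is a simplification of this sketch.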
