On Symmetries in Convolutional Weights
Bilal Alsallakh, Timothy Wroge, Vivek Miglani, Narine Kokhlikyan
TL;DR
The paper investigates mean-kernel symmetry in convolutional networks using a Dihedral group $D_4$-based metric to quantify per-layer symmetry: $S(K)=1 - \frac{1}{2 \cdot |\mathscr{T}|} \sum_{T \in \mathscr{T}} ||T(\hat{K}) - \hat{K}||_F$ with $\hat{K}$ as the normalized kernel. Across architectures like VGG-16, Inception-V3, and ResNet variants, it shows that internal mean kernels tend to become more symmetric deeper in the network, though artificial asymmetry arises at strided downsampling layers due to padding biases, which can be mitigated by techniques such as PartialConvolution, reflection padding, anti-aliasing, and SBPool. The study demonstrates that symmetry correlates with shift and flip consistency and can improve segmentation robustness when padding-induced skew is addressed, reinforcing symmetry as a potentially valuable inductive bias in CNNs. The work highlights directions for future research on the emergence of symmetry, its dependence on data augmentation and architecture, and its broader applicability to diverse tasks beyond image classification.
Abstract
We explore the symmetry of the mean k x k weight kernel in each layer of various convolutional neural networks. Unlike individual neurons, the mean kernels in internal layers tend to be symmetric about their centers instead of favoring specific directions. We investigate why this symmetry emerges in various datasets and models, and how it is impacted by certain architectural choices. We show how symmetry correlates with desirable properties such as shift and flip consistency, and might constitute an inherent inductive bias in convolutional neural networks.
