Alias-Free Convnets: Fractional Shift Invariance via Polynomial Activations
Hagay Michaeli, Tomer Michaeli, Daniel Soudry
TL;DR
This work addresses the overlooked problem that standard CNNs are not truly shift-invariant due to aliasing from downsampling and nonlinearities. It introduces Alias-Free ConvNets (AFC) that couple polynomial activations with an upsample–low-pass–downsample pipeline and alias-free normalization to guarantee shift-invariance for fractional translations and shift-equivariance of internal representations. Empirically, AFC achieves 100% shift consistency for integer and fractional shifts, shows certified robustness to translation-based adversarial attacks, and maintains competitive ImageNet performance, outperforming prior methods like APS and BlurPool under translation perturbations. The approach has practical implications for robust vision systems and can be extended to other domains and tasks such as segmentation, with opportunities to optimize computational efficiency.
Abstract
Although CNNs are believed to be invariant to translations, recent works have shown this is not the case, due to aliasing effects that stem from downsampling layers. The existing architectural solutions to prevent aliasing are partial since they do not solve these effects, that originate in non-linearities. We propose an extended anti-aliasing method that tackles both downsampling and non-linear layers, thus creating truly alias-free, shift-invariant CNNs. We show that the presented model is invariant to integer as well as fractional (i.e., sub-pixel) translations, thus outperforming other shift-invariant methods in terms of robustness to adversarial translations.
