Moment kernels: a simple and scalable approach for equivariance to rotations and reflections in deep convolutional networks

Zachary Schlamowitz; Andrew Bennecke; Daniel J. Tward

Moment kernels: a simple and scalable approach for equivariance to rotations and reflections in deep convolutional networks

Zachary Schlamowitz, Andrew Bennecke, Daniel J. Tward

TL;DR

The paper introduces moment kernels, a simple yet powerful form for achieving rotation and reflection equivariance in deep convolutional networks by treating feature maps as scalar, vector, or tensor fields. Moment kernels are radial functions of $|x|$ multiplied by powers of $x$ or the identity, and the authors prove that all equivariant kernels must take this form, enabling seamless use with standard convolution modules. They provide a complete derivation of equivariant transformation laws, classify kernel types (scalar-to-scalar, scalar-to-vector, vector-to-scalar, vector-to-vector, and higher-order tensors), and show how to construct moment kernels for general tensors. The approach is demonstrated on three biomedical tasks—image classification (DermaMNIST), 3D image registration (MRI), and an elliptical YOLO-based cell detector—where the moment-kernel networks deliver improved worst-case performance, faster convergence, and orientation-consistent results, while remaining interpretable and implementable within conventional CNN frameworks. The work offers a scalable, mathematically grounded alternative to representation-theory based equivariant methods, with meaningful implications for trust and robustness in biomedical imaging applications.

Abstract

The principle of translation equivariance (if an input image is translated an output image should be translated by the same amount), led to the development of convolutional neural networks that revolutionized machine vision. Other symmetries, like rotations and reflections, play a similarly critical role, especially in biomedical image analysis, but exploiting these symmetries has not seen wide adoption. We hypothesize that this is partially due to the mathematical complexity of methods used to exploit these symmetries, which often rely on representation theory, a bespoke concept in differential geometry and group theory. In this work, we show that the same equivariance can be achieved using a simple form of convolution kernels that we call ``moment kernels,'' and prove that all equivariant kernels must take this form. These are a set of radially symmetric functions of a spatial position $x$, multiplied by powers of the components of $x$ or the identity matrix. We implement equivariant neural networks using standard convolution modules, and provide architectures to execute several biomedical image analysis tasks that depend on equivariance principles: classification (outputs are invariant under orthogonal transforms), 3D image registration (outputs transform like a vector), and cell segmentation (quadratic forms defining ellipses transform like a matrix).

Moment kernels: a simple and scalable approach for equivariance to rotations and reflections in deep convolutional networks

TL;DR

multiplied by powers of

or the identity, and the authors prove that all equivariant kernels must take this form, enabling seamless use with standard convolution modules. They provide a complete derivation of equivariant transformation laws, classify kernel types (scalar-to-scalar, scalar-to-vector, vector-to-scalar, vector-to-vector, and higher-order tensors), and show how to construct moment kernels for general tensors. The approach is demonstrated on three biomedical tasks—image classification (DermaMNIST), 3D image registration (MRI), and an elliptical YOLO-based cell detector—where the moment-kernel networks deliver improved worst-case performance, faster convergence, and orientation-consistent results, while remaining interpretable and implementable within conventional CNN frameworks. The work offers a scalable, mathematically grounded alternative to representation-theory based equivariant methods, with meaningful implications for trust and robustness in biomedical imaging applications.

Abstract

, multiplied by powers of the components of

or the identity matrix. We implement equivariant neural networks using standard convolution modules, and provide architectures to execute several biomedical image analysis tasks that depend on equivariance principles: classification (outputs are invariant under orthogonal transforms), 3D image registration (outputs transform like a vector), and cell segmentation (quadratic forms defining ellipses transform like a matrix).

Moment kernels: a simple and scalable approach for equivariance to rotations and reflections in deep convolutional networks

TL;DR

Abstract

Moment kernels: a simple and scalable approach for equivariance to rotations and reflections in deep convolutional networks

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)