3D Steerable CNNs: Learning Rotationally Equivariant Features in Volumetric Data
Maurice Weiler, Mario Geiger, Max Welling, Wouter Boomsma, Taco Cohen
TL;DR
This work introduces 3D Steerable CNNs that achieve SE($\,3$)-equivariance by representing inputs as fields over $\mathbb{R}^3$ and learning kernels that are expressed as linear combinations of analytically derived steerable basises using irreducible $\operatorname{SO}(3)$ representations. By enforcing an equivariance constraint on kernels, decomposing tensor products into irreps, and employing spherical harmonics with radial basis functions, the model attains universal equivariant linear maps between field types. The approach yields markedly data-efficient models that outperform conventional 3D CNNs on tasks with inherent $\mathrm{SE}(3)$ symmetry, such as amino acid environment prediction and protein-structure classification, while drastically reducing parameter count (e.g., from tens of millions to a few hundred thousand). The framework integrates discretization strategies, equivariant nonlinearities, and precomputation tricks to enable practical training and inference, with demonstrated success on Tetris-like rotation tasks, SHREC17 model classification, and CATH protein-architecture classification. Overall, the work provides both a solid theoretical foundation and a scalable, plug-inable architectural paradigm for rotationally equivariant learning in volumetric data, with significant implications for structural biology and other 3D domains.
Abstract
We present a convolutional network that is equivariant to rigid body motions. The model uses scalar-, vector-, and tensor fields over 3D Euclidean space to represent data, and equivariant convolutions to map between such representations. These SE(3)-equivariant convolutions utilize kernels which are parameterized as a linear combination of a complete steerable kernel basis, which is derived analytically in this paper. We prove that equivariant convolutions are the most general equivariant linear maps between fields over R^3. Our experimental results confirm the effectiveness of 3D Steerable CNNs for the problem of amino acid propensity prediction and protein structure classification, both of which have inherent SE(3) symmetry.
