ILPO-NET: Network for the invariant recognition of arbitrary volumetric patterns in 3D
Dmitrii Zhemchuzhnikov, Sergei Grudinin
TL;DR
ILPO-Net addresses rotational variance in 3D volumetric pattern recognition by introducing a convolution that is invariant to local pattern orientation through a Wigner-matrix-based formulation and orientation pooling. The method expands learnable filters in spherical-harmonic components, performs 3D convolution in rotated frames, reconstructs the response in rotation space, and pools over orientations to achieve invariance without sacrificing expressiveness. Empirically, ILPO-Net delivers state-of-the-art performance on CATH and MedMNIST 3D datasets while dramatically reducing parameter counts, and filter visualizations confirm the learning of diverse, arbitrary-shaped patterns. The approach offers a principled, efficient alternative to data augmentation and broadens the applicability of robust 3D pattern recognition across disciplines.
Abstract
Effective recognition of spatial patterns and learning their hierarchy is crucial in modern spatial data analysis. Volumetric data applications seek techniques ensuring invariance not only to shifts but also to pattern rotations. While traditional methods can readily achieve translational invariance, rotational invariance possesses multiple challenges and remains an active area of research. Here, we present ILPO-Net (Invariant to Local Patterns Orientation Network), a novel approach that handles arbitrarily shaped patterns with the convolutional operation inherently invariant to local spatial pattern orientations using the Wigner matrix expansions. Our architecture seamlessly integrates the new convolution operator and, when benchmarked on diverse volumetric datasets such as MedMNIST and CATH, demonstrates superior performance over the baselines with significantly reduced parameter counts - up to 1000 times fewer in the case of MedMNIST. Beyond these demonstrations, ILPO-Net's rotational invariance paves the way for other applications across multiple disciplines. Our code is publicly available at https://gricad-gitlab.univ-grenoble-alpes.fr/GruLab/ILPO/-/tree/main/ILPONet.
