Unitary convolutions for learning on graphs and groups
Bobak T. Kiani, Lukas Fesser, Melanie Weber
TL;DR
This work introduces unitary group convolutions to address instability and over-smoothing in deep group-convolutional neural networks, with a focus on graphs. It proposes two concrete operators, UniConv and Lie UniConv, built via the exponential map to ensure unitary, norm-preserving transformations that are invertible and equivariant, and extends the approach to generalized group convolutions. The authors provide theoretical guarantees—invertibility, isometry, equivariance, Rayleigh-quotient invariance, and dynamical isometry—and demonstrate empirically that unitary GCNs achieve competitive or superior performance on long-range and heterophilous graph tasks while enabling deeper architectures. The results suggest that unitary, norm-preserving convolutions improve stability and learning of long-range dependencies, with potential applicability to broader group symmetries and robustness considerations in geometric ML.
Abstract
Data with geometric structure is ubiquitous in machine learning often arising from fundamental symmetries in a domain, such as permutation-invariance in graphs and translation-invariance in images. Group-convolutional architectures, which encode symmetries as inductive bias, have shown great success in applications, but can suffer from instabilities as their depth increases and often struggle to learn long range dependencies in data. For instance, graph neural networks experience instability due to the convergence of node representations (over-smoothing), which can occur after only a few iterations of message-passing, reducing their effectiveness in downstream tasks. Here, we propose and study unitary group convolutions, which allow for deeper networks that are more stable during training. The main focus of the paper are graph neural networks, where we show that unitary graph convolutions provably avoid over-smoothing. Our experimental results confirm that unitary graph convolutional networks achieve competitive performance on benchmark datasets compared to state-of-the-art graph neural networks. We complement our analysis of the graph domain with the study of general unitary convolutions and analyze their role in enhancing stability in general group convolutional architectures.
