Equivariant Matrix Function Neural Networks
Ilyes Batatia, Lars L. Schaaf, Huajie Chen, Gábor Csányi, Christoph Ortner, Felix A. Faber
TL;DR
The paper tackles the challenge of modeling non-local, many-body interactions in graphs, which standard MPNNs struggle with due to limited receptive fields and oversmoothing. It introduces Matrix Function Networks (MFNs), an equivariant GNN framework that parameterizes non-local interactions through analytic matrix functions defined on graph operators, with a resolvent-based parameterization enabling scalable evaluation. The architecture combines a local equivariant graph layer, construction of self-adjoint, group-equivariant matrices, and a matrix-function update, yielding potential linear scaling under sparse structure via selected inversion. Empirically, MFNs achieve state-of-the-art performance on ZINC and TU datasets and demonstrate strong capability to capture complex non-local quantum interactions (e.g., cumulenes), underscoring MFN’s potential to advance molecular modeling and force-field development.
Abstract
Graph Neural Networks (GNNs), especially message-passing neural networks (MPNNs), have emerged as powerful architectures for learning on graphs in diverse applications. However, MPNNs face challenges when modeling non-local interactions in graphs such as large conjugated molecules, and social networks due to oversmoothing and oversquashing. Although Spectral GNNs and traditional neural networks such as recurrent neural networks and transformers mitigate these challenges, they often lack generalizability, or fail to capture detailed structural relationships or symmetries in the data. To address these concerns, we introduce Matrix Function Neural Networks (MFNs), a novel architecture that parameterizes non-local interactions through analytic matrix equivariant functions. Employing resolvent expansions offers a straightforward implementation and the potential for linear scaling with system size. The MFN architecture achieves stateof-the-art performance in standard graph benchmarks, such as the ZINC and TU datasets, and is able to capture intricate non-local interactions in quantum systems, paving the way to new state-of-the-art force fields.
