Multi-Scale Protein Structure Modelling with Geometric Graph U-Nets
Chang Liu, Vivian Li, Linus Leong, Vladimir Radenkovic, Pietro Liò, Chaitanya K. Joshi
TL;DR
Geometric Graph U-Nets address the challenge of representing multi-scale protein structure by progressively coarsening and refining geometric graphs. The paper develops a formal GWL/IGWL expressivity framework showing pooling can preserve or boost discriminative power while enabling global context integration. Empirically, Geometric U-Nets outperform invariant and equivariant baselines on protein fold classification, demonstrating the value of multi-scale representations. The authors also provide open-source code and outline future directions for biomolecular complexes and multi-state design.
Abstract
Geometric Graph Neural Networks (GNNs) and Transformers have become state-of-the-art for learning from 3D protein structures. However, their reliance on message passing prevents them from capturing the hierarchical interactions that govern protein function, such as global domains and long-range allosteric regulation. In this work, we argue that the network architecture itself should mirror this biological hierarchy. We introduce Geometric Graph U-Nets, a new class of models that learn multi-scale representations by recursively coarsening and refining the protein graph. We prove that this hierarchical design can theoretically more expressive than standard Geometric GNNs. Empirically, on the task of protein fold classification, Geometric U-Nets substantially outperform invariant and equivariant baselines, demonstrating their ability to learn the global structural patterns that define protein folds. Our work provides a principled foundation for designing geometric deep learning architectures that can learn the multi-scale structure of biomolecules.
