A Hitchhiker's Guide to Geometric GNNs for 3D Atomic Systems
Alexandre Duval, Simon V. Mathis, Chaitanya K. Joshi, Victor Schmidt, Santiago Miret, Fragkiskos D. Malliaros, Taco Cohen, Pietro Liò, Yoshua Bengio, Michael Bronstein
TL;DR
The paper surveys Geometric GNNs for 3D atomic systems, placing emphasis on symmetry-aware design to model molecules, proteins, and materials. It introduces a taxonomy into invariant, Cartesian-equivariant, spherical-equivariant, and unconstrained families, and details the pipeline from input graph construction to embeddings, interaction layers, and task-specific outputs. Key contributions include a rigorous treatment of Cartesian and spherical tensor formalisms, explicit discussion of tensor products and irreps, and a survey of datasets and applications ranging from property prediction to molecular dynamics and structure generation. The discussion also highlights open questions on when to bake in physics, how to construct graphs, and how to scale these models, pointing toward both theory-driven and data-driven pathways for future progress.
Abstract
Recent advances in computational modelling of atomic systems, spanning molecules, proteins, and materials, represent them as geometric graphs with atoms embedded as nodes in 3D Euclidean space. In these graphs, the geometric attributes transform according to the inherent physical symmetries of 3D atomic systems, including rotations and translations in Euclidean space, as well as node permutations. In recent years, Geometric Graph Neural Networks have emerged as the preferred machine learning architecture powering applications ranging from protein structure prediction to molecular simulations and material generation. Their specificity lies in the inductive biases they leverage - such as physical symmetries and chemical properties - to learn informative representations of these geometric graphs. In this opinionated paper, we provide a comprehensive and self-contained overview of the field of Geometric GNNs for 3D atomic systems. We cover fundamental background material and introduce a pedagogical taxonomy of Geometric GNN architectures: (1) invariant networks, (2) equivariant networks in Cartesian basis, (3) equivariant networks in spherical basis, and (4) unconstrained networks. Additionally, we outline key datasets and application areas and suggest future research directions. The objective of this work is to present a structured perspective on the field, making it accessible to newcomers and aiding practitioners in gaining an intuition for its mathematical abstractions.
