NEAT: Neighborhood-Guided, Efficient, Autoregressive Set Transformer for 3D Molecular Generation
Daniel Rose, Roxane Axel Jacob, Johannes Kirchmair, Thierry Langer
TL;DR
<3-5 sentence high-level summary> This paper introduces NEAT, a permutation-invariant autoregressive generator for 3D molecules that treats molecules as atom sets and uses neighborhood-guided supervision with a set-transformer trunk and flow-based coordinate modeling. By avoiding canonical atom orderings and employing neighborhood continuations, NEAT achieves order-agnostic next-token predictions with efficient, batched inference. On QM9, NEAT shows competitive performance across stability, validity, and uniqueness metrics while delivering faster sampling than diffusion-based methods. This approach provides a scalable foundation for conditional de novo molecular design and autoregressive 3D generation without vector quantization.
Abstract
Autoregressive models are a promising alternative to diffusion-based models for 3D molecular structure generation. However, a key limitation is the assumption of a token order: while text has a natural sequential order, the next token prediction given a molecular graph prefix should be invariant to atom permutations. Previous works sidestepped this mismatch by using canonical orders or focus atoms. We argue that this is unnecessary. We introduce NEAT, a Neighborhood-guided, Efficient, Autoregressive, Set Transformer that treats molecular graphs as sets of atoms and learns the order-agnostic distribution over admissible tokens at the graph boundary with an autoregressive flow model. NEAT approaches state-of-the-art performance in 3D molecular generation with high computational efficiency and atom-level permutation invariance, establishing a practical foundation for scalable molecular design.
