Conformation Generation using Transformer Flows
Sohil Atul Shah, Vladlen Koltun
TL;DR
This work tackles fast and accurate generation of valid 3D conformations for molecular graphs. It introduces ConfFlow, a graph-conditioned continuous normalizing flow that directly samples 3D coordinates from a simple base distribution, without enforcing explicit geometric constraints. The method combines graph-conditioned CNFs with a Transformer-based Graph Continuous Flow (GCF) architecture, stabilized by kinetic and Jacobian regularization, and trained to maximize exact likelihoods. Empirical results on GEOM-QM9 and GEOM-Drugs show ConfFlow achieving up to 40% improvements over state-of-the-art baselines in conformation generation and competitive performance on ensemble property predictions, highlighting its scalability to large drug-like molecules. The work provides a practical, non-equivariant coordinate-space generative approach with code availability for community use.</n>
Abstract
Estimating three-dimensional conformations of a molecular graph allows insight into the molecule's biological and chemical functions. Fast generation of valid conformations is thus central to molecular modeling. Recent advances in graph-based deep networks have accelerated conformation generation from hours to seconds. However, current network architectures do not scale well to large molecules. Here we present ConfFlow, a flow-based model for conformation generation based on transformer networks. In contrast with existing approaches, ConfFlow directly samples in the coordinate space without enforcing any explicit physical constraints. The generative procedure is highly interpretable and is akin to force field updates in molecular dynamics simulation. When applied to the generation of large molecule conformations, ConfFlow improve accuracy by up to $40\%$ relative to state-of-the-art learning-based methods. The source code is made available at https://github.com/IntelLabs/ConfFlow.
