WGFormer: An SE(3)-Transformer Driven by Wasserstein Gradient Flows for Molecular Ground-State Conformation Prediction
Fanmeng Wang, Minjie Cheng, Hongteng Xu
TL;DR
This work tackles the challenge of predicting molecular ground-state conformations with high accuracy and efficiency. It introduces WGFormer, a Wasserstein gradient flow-driven SE(3)-Transformer that operates in an auto-encoding framework, encoding low-quality conformations and decoding ground-state coordinates via an MLP, while enforcing latent energy minimization on atom mixtures. The approach connects transformer-based attention to Wasserstein gradient flows, leveraging energy functionals E^0 and E^∞ and entropic OT to guide conformation optimization. Empirically, WGFormer achieves state-of-the-art results on Molecule3D and QM9 with notable speedups, demonstrating the value of a physically grounded, interpretable framework for conformation prediction and downstream applications like docking and property prediction. The work suggests a promising direction for energy-aware, geometry-preserving molecular modeling and invites further theoretical development and broader task extensions.
Abstract
Predicting molecular ground-state conformation (i.e., energy-minimized conformation) is crucial for many chemical applications such as molecular docking and property prediction. Classic energy-based simulation is time-consuming when solving this problem, while existing learning-based methods have advantages in computational efficiency but sacrifice accuracy and interpretability. In this work, we propose a novel and effective method to bridge the energy-based simulation and the learning-based strategy, which designs and learns a Wasserstein gradient flow-driven SE(3)-Transformer, called WGFormer, for ground-state conformation prediction. Specifically, our method tackles this task within an auto-encoding framework, which encodes low-quality conformations by the proposed WGFormer and decodes corresponding ground-state conformations by an MLP. The architecture of WGFormer corresponds to Wasserstein gradient flows -- it optimizes conformations by minimizing an energy function defined on the latent mixture models of atoms, thereby significantly improving performance and interpretability. Extensive experiments demonstrate that our method consistently outperforms state-of-the-art competitors, providing a new and insightful paradigm to predict ground-state conformation.
