EquiBoost: An Equivariant Boosting Approach to Molecular Conformation Generation
Yixuan Yang, Xingyu Fang, Zhaowen Cheng, Pengju Yan, Xiaolin Li
TL;DR
EquiBoost introduces a novel SE(3)-equivariant boosting framework that stacks multiple equivariant graph transformers to iteratively refine molecular conformations, offering a compelling alternative to diffusion-based generation. By sharing parameters across weak learners, injecting the step index as input, and using constrained random initialization, it achieves high accuracy with only five inference steps while maintaining diversity. The method combines permutation-invariant RMSD (piRMSD) with internal-coordinate losses to enforce both global and local geometric plausibility, and employs optimal conformation mapping to align training targets with reference ensembles. On GEOM-QM9 and GEOM-DRUGS, EquiBoost surpasses prior methods in AMR and precision, demonstrating notable efficiency gains and restoring boosting as a viable approach for fast, accurate 3D molecular conformation generation with practical impact in drug design and docking workflows.
Abstract
Molecular conformation generation plays key roles in computational drug design. Recently developed deep learning methods, particularly diffusion models have reached competitive performance over traditional cheminformatical approaches. However, these methods are often time-consuming or require extra support from traditional methods. We propose EquiBoost, a boosting model that stacks several equivariant graph transformers as weak learners, to iteratively refine 3D conformations of molecules. Without relying on diffusion techniques, EquiBoost balances accuracy and efficiency more effectively than diffusion-based methods. Notably, compared to the previous state-of-the-art diffusion method, EquiBoost improves generation quality and preserves diversity, achieving considerably better precision of Average Minimum RMSD (AMR) on the GEOM datasets. This work rejuvenates boosting and sheds light on its potential to be a robust alternative to diffusion models in certain scenarios.
