Torsional Diffusion for Molecular Conformer Generation
Bowen Jing, Gabriele Corso, Jeffrey Chang, Regina Barzilay, Tommi Jaakkola
TL;DR
The paper introduces torsional diffusion, a diffusion model restricted to torsion angles on a hypertorus, paired with an extrinsic-to-intrinsic SE(3)-equivariant score network to generate molecular conformers. By operating on the torsional degrees of freedom and preserving fixed non-torsional structure, it achieves state-of-the-art ensemble quality with far fewer denoising steps than Euclidean diffusion methods, and provides exact likelihoods enabling Boltzmann sampling across unseen molecules. The approach yields a generalizable torsional Boltzmann generator and demonstrates strong performance on GEOM-DRUGS and related datasets, highlighting practical impact for fast, accurate conformer generation and energy-aware sampling. Limitations include reliance on local-structure priors for rings and challenges around E/Z isomerism, with future work aimed at relaxing rigid local structures and extending to larger systems such as macrocycles and proteins.
Abstract
Molecular conformer generation is a fundamental task in computational chemistry. Several machine learning approaches have been developed, but none have outperformed state-of-the-art cheminformatics methods. We propose torsional diffusion, a novel diffusion framework that operates on the space of torsion angles via a diffusion process on the hypertorus and an extrinsic-to-intrinsic score model. On a standard benchmark of drug-like molecules, torsional diffusion generates superior conformer ensembles compared to machine learning and cheminformatics methods in terms of both RMSD and chemical properties, and is orders of magnitude faster than previous diffusion-based models. Moreover, our model provides exact likelihoods, which we employ to build the first generalizable Boltzmann generator. Code is available at https://github.com/gcorso/torsional-diffusion.
