Torsion-Space Diffusion for Protein Backbone Generation with Geometric Refinement
Lakshaditya Singh, Adwait Shelke, Divyansh Agrawal
TL;DR
This work tackles the challenge of generating physically valid protein backbones by shifting diffusion from Cartesian space to torsion angles, ensuring local geometry by construction. It introduces a Transformer-based denoiser operating on torsion angles $(\phi,\psi,\omega)$, a differentiable forward-kinematics module that fixes backbone bond lengths at $b=3.8$Å, and an iterative Radius of Gyration refinement to improve global compactness. On standard PDB backbones, the method achieves 100% bond-length accuracy and substantially improves global structure, reducing Rg error from 70% to 18.6% relative to Cartesian baselines. The result is a scalable, efficient framework for geometrically valid backbone generation that provides a promising path toward full-atom protein design.
Abstract
Designing new protein structures is fundamental to computational biology, enabling advances in therapeutic molecule discovery and enzyme engineering. Existing diffusion-based generative models typically operate in Cartesian coordinate space, where adding noise disrupts strict geometric constraints such as fixed bond lengths and angles, often producing physically invalid structures. To address this limitation, we propose a Torsion-Space Diffusion Model that generates protein backbones by denoising torsion angles, ensuring perfect local geometry by construction. A differentiable forward-kinematics module reconstructs 3D coordinates with fixed 3.8 Angstrom backbone bond lengths while a constrained post-processing refinement optimizes global compactness via Radius of Gyration (Rg) correction, without violating bond constraints. Experiments on standard PDB proteins demonstrate 100% bond-length accuracy and significantly improved structural compactness, reducing Rg error from 70% to 18.6% compared to Cartesian diffusion baselines. Overall, this hybrid torsion-diffusion plus geometric-refinement framework generates physically valid and compact protein backbones, providing a promising path toward full-atom protein generation.
