General Binding Affinity Guidance for Diffusion Models in Structure-Based Drug Design
Yue Jian, Curtis Wu, Danny Reidenbach, Aditi S. Krishnapriyan
TL;DR
BADGER addresses the challenge of guiding binding affinity in diffusion-based structure-based drug design (SBDD). It introduces two complementary strategies—Classifier Guidance (CG) and Classifier-Free Guidance (CFG)—to steer samples toward target affinity values, with a Gaussian-based continuous-conditioning energy and a multi-constraint extension for QED and SA. Evaluated on CrossDocked2020 and PDBBind, BADGER yields up to 60% improvements in ligand--protein binding energy over prior diffusion methods while preserving chemical validity, and it extends naturally to multi-property diffusion guidance. The framework is modular and plug-and-play, enabling affinity-aware diffusion across backbones and pockets, with public code for reproducibility and broad potential impact on accelerating drug discovery.
Abstract
Structure-based drug design (SBDD) aims to generate ligands that bind strongly and specifically to target protein pockets. Recent diffusion models have advanced SBDD by capturing the distributions of atomic positions and types, yet they often underemphasize binding affinity control during generation. To address this limitation, we introduce \textbf{\textnormal{\textbf{BADGER}}}, a general \textbf{binding-affinity guidance framework for diffusion models in SBDD}. \textnormal{\textbf{BADGER} }incorporates binding affinity awareness through two complementary strategies: (1) \textit{classifier guidance}, which applies gradient-based affinity signals during sampling in a plug-and-play fashion, and (2) \textit{classifier-free guidance}, which integrates affinity conditioning directly into diffusion model training. Together, these approaches enable controllable ligand generation guided by binding affinity. \textnormal{\textbf{BADGER} } can be added to any diffusion model and achieves up to a \textbf{60\% improvement in ligand--protein binding affinity} of sampled molecules over prior methods. Furthermore, we extend the framework to \textbf{multi-constraint diffusion guidance}, jointly optimizing for binding affinity, drug-likeness (QED), and synthetic accessibility (SA) to design realistic and synthesizable drug candidates.
