Guided Multi-objective Generative AI to Enhance Structure-based Drug Design
Amit Kadan, Kevin Ryczko, Erika Lloyd, Adrian Roitberg, Takeshi Yamazaki
TL;DR
IDOLpro addresses the inverse-design challenge in structure-based drug design by marrying diffusion-based molecule generation with differentiable, multi-objective guidance to co-optimize binding affinity and synthetic accessibility. It optimizes latent representations at a defined diffusion horizon $t_{hz}$ using gradients from differentiable scoring functions, followed by structural refinement with gradient-based methods and ANI2x, enabling rapid, in-pocket ligand design. On CrossDocked and Binding MOAD benchmarks, IDOLpro delivers state-of-the-art Vina improvements (e.g., $\approx 0.7$–$1.4$ kcal/mol) and higher QED, while outperforming exhaustive virtual screening in time and cost by orders of magnitude, and enabling lead optimization from scaffolded references. The framework is modular and extensible, capable of incorporating additional scores (e.g., ADME-Tox) to accelerate hit-finding and lead optimization in drug discovery pipelines. The approach holds potential for faster, more reliable generation of drug-like ligands directly in protein pockets, reducing search space and enabling multi-property optimization in silico.
Abstract
Generative AI has the potential to revolutionize drug discovery. Yet, despite recent advances in deep learning, existing models cannot generate molecules that satisfy all desired physicochemical properties. Herein, we describe IDOLpro, a generative chemistry AI combining diffusion with multi-objective optimization for structure-based drug design. Differentiable scoring functions guide the latent variables of the diffusion model to explore uncharted chemical space and generate novel ligands in silico, optimizing a plurality of target physicochemical properties. We demonstrate our platform's effectiveness by generating ligands with optimized binding affinity and synthetic accessibility on two benchmark sets. IDOLpro produces ligands with binding affinities over 10%-20% better than the next best state-of-the-art method on each test set, producing more drug-like molecules with generally better synthetic accessibility scores than other methods. We do a head-to-head comparison of IDOLpro against a classic virtual screen of a large database of drug-like molecules. We show that IDOLpro can generate molecules for a range of important disease-related targets with better binding affinity and synthetic accessibility than any molecule found in the virtual screen while being over 100x faster and less expensive to run. On a test set of experimental complexes, IDOLpro is the first to produce molecules with better binding affinities than experimentally observed ligands. IDOLpro can accommodate other scoring functions (e.g. ADME-Tox) to accelerate hit-finding, hit-to-lead, and lead optimization for drug discovery.
