Apo2Mol: 3D Molecule Generation via Dynamic Pocket-Aware Diffusion Models
Xinzhe Zheng, Shiyu Jiang, Gustavo Seabra, Chenglong Li, Yanjun Li
TL;DR
Apo2Mol presents a diffusion-based framework that jointly generates 3D ligands and holo-pocket conformations from apo protein states, directly addressing pocket flexibility in structure-based drug design. It leverages a large experimentally resolved apo–holo dataset and a SE(3)-equivariant hierarchical graph to model detailed ligand–pocket interactions and residue-level pocket dynamics without relying on MD simulations. The method demonstrates state-of-the-art binding affinity and drug-likeness metrics in apo-to-holo generation and maintains competitive performance when baselines are evaluated on holo pockets, while ablations confirm the importance of the complex graph and quaternion-based transformations. These results advance practical SBDD by enabling dynamic, data-driven generation of ligand–pocket complexes from apo structures, with implications for targets lacking bound-state templates. However, a remaining gap in reproducing some fine-grained pocket conformations suggests future work on broader pretraining and refinement strategies to further close the holo-pocket distribution gap.
Abstract
Deep generative models are rapidly advancing structure-based drug design, offering substantial promise for generating small molecule ligands that bind to specific protein targets. However, most current approaches assume a rigid protein binding pocket, neglecting the intrinsic flexibility of proteins and the conformational rearrangements induced by ligand binding, limiting their applicability in practical drug discovery. Here, we propose Apo2Mol, a diffusion-based generative framework for 3D molecule design that explicitly accounts for conformational flexibility in protein binding pockets. To support this, we curate a dataset of over 24,000 experimentally resolved apo-holo structure pairs from the Protein Data Bank, enabling the characterization of protein structure changes associated with ligand binding. Apo2Mol employs a full-atom hierarchical graph-based diffusion model that simultaneously generates 3D ligand molecules and their corresponding holo pocket conformations from input apo states. Empirical studies demonstrate that Apo2Mol can achieve state-of-the-art performance in generating high-affinity ligands and accurately capture realistic protein pocket conformational changes.
