HybridLinker: Topology-Guided Posterior Sampling for Enhanced Diversity and Validity in 3D Molecular Linker Generation
Minyeong Hwang, Ziseok Lee, Kwang-Soo Kim, Kyungsu Kim, Eunho Yang
TL;DR
HybridLinker tackles the persistent diversity- validity trade-off in 3D linker generation by integrating pretrained point cloud-free and point cloud-aware models in a zero-shot two-stage pipeline. The core innovation, LinkerDPS, performs diffusion posterior sampling across topology and point cloud spaces using an energy-based cross-domain likelihood and an inpainting-based conditional score estimator, enabling topology-guided refinement of surrogates. Experiments on ZINC-derived fragment pairs demonstrate that HybridLinker achieves superior diversity and validity, and it also enhances drug-likeness optimization compared to strong baselines, suggesting strong utility as a foundational model for fragment-based drug design. By bridging topology and geometry without additional training, LinkerDPS broadens the applicability of diffusion-based molecular design to challenging cross-domain tasks and large-molecule generation.
Abstract
Linker generation is critical in drug discovery applications such as lead optimization and PROTAC design, where molecular fragments are assembled into diverse drug candidates via molecular linker. Existing methods fall into point cloud-free and point cloud-aware categories based on their use of fragments' 3D poses alongside their topologies in sampling the linker's topology. Point cloud-free models prioritize sample diversity but suffer from lower validity due to overlooking fragments' spatial constraints, while point cloud-aware models ensure higher validity but restrict diversity by enforcing strict spatial constraints. To overcome these trade-offs without additional training, we propose HybridLinker, a framework that enhances point cloud-aware inference by providing diverse bonding topologies from a pretrained point cloud-free model as guidance. At its core, we propose LinkerDPS, the first diffusion posterior sampling (DPS) method operating across point cloud-free and point cloud-aware spaces, bridging molecular topology with 3D point clouds via an energy-inspired function. By transferring the diverse sampling distribution of point cloud-free models into the point cloud-aware distribution, HybridLinker significantly surpasses baselines, improving both validity and diversity in foundational molecular design and applied drug optimization tasks, establishing a new DPS framework in the molecular domains beyond imaging.
