Peptide2Mol: A Diffusion Model for Generating Small Molecules as Peptide Mimics for Targeted Protein Binding
Xinheng He, Yijia Zhang, Haowei Lin, Xingang Peng, Xiangzhe Kong, Mingyu Li, Jianzhu Ma
TL;DR
Peptide2Mol addresses the gap between peptide binders and drug-like small molecules by learning a trajectory from peptide–binding interfaces to pocket-fitting small-molecule mimics using an $E(3)$-equivariant diffusion model. It trains on diverse data—small molecules, protein–ligand complexes, and peptide–protein interfaces—and generates non-autoregressive, pocket-aware molecules that preserve peptide-like interactions while achieving drug-like properties. The method supports partial diffusion for peptidomimetic optimization and can refine results with Pocket2Mol to improve docking plausibility, with results showing competitive property metrics and residue-level mimicry validated by PMI analyses. This work advances structure-based design by enabling peptide-to-small-molecule translation within 3D pockets, offering a new avenue for targeted protein binding and peptidomimetic design.
Abstract
Structure-based drug design has seen significant advancements with the integration of artificial intelligence (AI), particularly in the generation of hit and lead compounds. However, most AI-driven approaches neglect the importance of endogenous protein interactions with peptides, which may result in suboptimal molecule designs. In this work, we present Peptide2Mol, an E(3)-equivariant graph neural network diffusion model that generates small molecules by referencing both the original peptide binders and their surrounding protein pocket environments. Trained on large datasets and leveraging sophisticated modeling techniques, Peptide2Mol not only achieves state-of-the-art performance in non-autoregressive generative tasks, but also produces molecules with similarity to the original peptide binder. Additionally, the model allows for molecule optimization and peptidomimetic design through a partial diffusion process. Our results highlight Peptide2Mol as an effective deep generative model for generating and optimizing bioactive small molecules from protein binding pockets.
