FABind: Fast and Accurate Protein-Ligand Binding
Qizhi Pei, Kaiyuan Gao, Lijun Wu, Jinhua Zhu, Yingce Xia, Shufang Xie, Tao Qin, Kun He, Tie-Yan Liu, Rui Yan
TL;DR
FABind addresses the challenge of fast and accurate protein–ligand docking by unifying pocket prediction and docking into a single end-to-end framework. It introduces ligand-informed pocket prediction, a three-step FABind layer with E(3)-equivariant updates, and a training pipeline that couples pocket and docking tasks through scheduled sampling and distance-map refinement. The method demonstrates strong generalization to unseen proteins and achieves substantial efficiency gains, including around 170× speedups over DiffDock while maintaining competitive docking accuracy. These contributions offer a practical, scalable approach for drug discovery that reduces reliance on external pocket-detection modules and improves inference speed without sacrificing pose quality.
Abstract
Modeling the interaction between proteins and ligands and accurately predicting their binding structures is a critical yet challenging task in drug discovery. Recent advancements in deep learning have shown promise in addressing this challenge, with sampling-based and regression-based methods emerging as two prominent approaches. However, these methods have notable limitations. Sampling-based methods often suffer from low efficiency due to the need for generating multiple candidate structures for selection. On the other hand, regression-based methods offer fast predictions but may experience decreased accuracy. Additionally, the variation in protein sizes often requires external modules for selecting suitable binding pockets, further impacting efficiency. In this work, we propose $\mathbf{FABind}$, an end-to-end model that combines pocket prediction and docking to achieve accurate and fast protein-ligand binding. $\mathbf{FABind}$ incorporates a unique ligand-informed pocket prediction module, which is also leveraged for docking pose estimation. The model further enhances the docking process by incrementally integrating the predicted pocket to optimize protein-ligand binding, reducing discrepancies between training and inference. Through extensive experiments on benchmark datasets, our proposed $\mathbf{FABind}$ demonstrates strong advantages in terms of effectiveness and efficiency compared to existing methods. Our code is available at https://github.com/QizhiPei/FABind
