Exploiting Pre-trained Models for Drug Target Affinity Prediction with Nearest Neighbors
Qizhi Pei, Lijun Wu, Zhenyu He, Jinhua Zhu, Yingce Xia, Shufang Xie, Rui Yan
TL;DR
This work addresses the challenge of improving drug-target binding affinity (DTA) prediction with suboptimal accuracy and high training costs by introducing a non-parametric, embedding-based retrieval framework, $k$NN-DTA, applied on a pre-trained DTA model. It combines two neighbor-aggregation schemes—label aggregation via pairwise retrieval and representation aggregation via pointwise retrieval—into a unified inference-time pipeline, with an optional adaptive extension, Ada-$k$NN-DTA, that learns aggregation weights with lightweight training. Across four benchmark datasets (BindingDB IC$_{50}$, BindingDB $K_i$, DAVIS, KIBA), $k$NN-DTA achieves new state-of-the-art RMSE scores (e.g., 0.684 for IC$_{50}$ and 0.750 for $K_i$), and Ada-$k$NN-DTA further improves these results, while also showing promising zero-shot transfer performance. The approach demonstrates that smart retrieval from a pre-trained model can substantially boost predictive power without retraining, offering practical benefits for virtual screening and drug repurposing in AI-for-science contexts.
Abstract
Drug-Target binding Affinity (DTA) prediction is essential for drug discovery. Despite the application of deep learning methods to DTA prediction, the achieved accuracy remain suboptimal. In this work, inspired by the recent success of retrieval methods, we propose $k$NN-DTA, a non-parametric embedding-based retrieval method adopted on a pre-trained DTA prediction model, which can extend the power of the DTA model with no or negligible cost. Different from existing methods, we introduce two neighbor aggregation ways from both embedding space and label space that are integrated into a unified framework. Specifically, we propose a \emph{label aggregation} with \emph{pair-wise retrieval} and a \emph{representation aggregation} with \emph{point-wise retrieval} of the nearest neighbors. This method executes in the inference phase and can efficiently boost the DTA prediction performance with no training cost. In addition, we propose an extension, Ada-$k$NN-DTA, an instance-wise and adaptive aggregation with lightweight learning. Results on four benchmark datasets show that $k$NN-DTA brings significant improvements, outperforming previous state-of-the-art (SOTA) results, e.g, on BindingDB IC$_{50}$ and $K_i$ testbeds, $k$NN-DTA obtains new records of RMSE $\bf{0.684}$ and $\bf{0.750}$. The extended Ada-$k$NN-DTA further improves the performance to be $\bf{0.675}$ and $\bf{0.735}$ RMSE. These results strongly prove the effectiveness of our method. Results in other settings and comprehensive studies/analyses also show the great potential of our $k$NN-DTA approach.
