GraphPrint: Extracting Features from 3D Protein Structure for Drug Target Affinity Prediction
Amritpal Singh
TL;DR
GraphPrint addresses the limitation of sequence-based DTA models by incorporating 3D protein structure into a multimodal graph framework. It uses AlphaFold-based protein graphs, RDKit drug graphs, and handcrafted fingerprints in a four-branch network, with ablation confirming the 3D features provide complementary information. The method achieves a mean squared error of $0.1378$ and a concordance index of $0.8929$ on the KIBA dataset, competitive with state-of-the-art approaches. This work suggests that integrating 3D structure can accelerate drug discovery and offers a public 3D-aware KIBA resource for future research.
Abstract
Accurate drug target affinity prediction can improve drug candidate selection, accelerate the drug discovery process, and reduce drug production costs. Previous work focused on traditional fingerprints or used features extracted based on the amino acid sequence in the protein, ignoring its 3D structure which affects its binding affinity. In this work, we propose GraphPrint: a framework for incorporating 3D protein structure features for drug target affinity prediction. We generate graph representations for protein 3D structures using amino acid residue location coordinates and combine them with drug graph representation and traditional features to jointly learn drug target affinity. Our model achieves a mean square error of 0.1378 and a concordance index of 0.8929 on the KIBA dataset and improves over using traditional protein features alone. Our ablation study shows that the 3D protein structure-based features provide information complementary to traditional features.
