GraphPINE: Graph Importance Propagation for Interpretable Drug Response Prediction
Yoshitaka Inoue, Tianfan Fu, Augustin Luna
TL;DR
GraphPINE introduces an interpretable graph neural network for drug response prediction by initializing node importance with domain-specific prior knowledge from drug–gene interactions and propagating this importance through a novel IP Layer. The method unifies feature updates and graph-based importance propagation, enhancing both predictive performance and interpretability. On a large NCI-60–derived DRP dataset, GraphPINE (GT variant) achieves a PR-AUC of 0.894 and a ROC-AUC of 0.796, with ablations showing the IP layer substantially improves performance across architectures, and interpretability analyses revealing biologically meaningful gene–drug associations. The approach offers a principled framework for incorporating priors into GNNs, improving explainability without sacrificing accuracy, and holds potential for wider applications in network biology and precision medicine.
Abstract
Explainability is necessary for many tasks in biomedical research. Recent explainability methods have focused on attention, gradient, and Shapley value. These do not handle data with strong associated prior knowledge and fail to constrain explainability results based on known relationships between predictive features. We propose GraphPINE, a graph neural network (GNN) architecture leveraging domain-specific prior knowledge to initialize node importance optimized during training for drug response prediction. Typically, a manual post-prediction step examines literature (i.e., prior knowledge) to understand returned predictive features. While node importance can be obtained for gradient and attention after prediction, node importance from these methods lacks complementary prior knowledge; GraphPINE seeks to overcome this limitation. GraphPINE differs from other GNN gating methods by utilizing an LSTM-like sequential format. We introduce an importance propagation layer that unifies 1) updates for feature matrix and node importance and 2) uses GNN-based graph propagation of feature values. This initialization and updating mechanism allows for informed feature learning and improved graph representation. We apply GraphPINE to cancer drug response prediction using drug screening and gene data collected for over 5,000 gene nodes included in a gene-gene graph with a drug-target interaction (DTI) graph for initial importance. The gene-gene graph and DTIs were obtained from curated sources and weighted by article count discussing relationships between drugs and genes. GraphPINE achieves a PR-AUC of 0.894 and ROC-AUC of 0.796 across 952 drugs. Code is available at https://anonymous.4open.science/r/GraphPINE-40DE.
