Graph Diffusion Network for Drug-Gene Prediction
Jiayang Wu, Wensheng Gan, Philip S. Yu
TL;DR
GDNDGP tackles drug-gene prediction under data sparsity by fusing meta-path-based homogeneous graph learning with a diffusion-driven hard negative sampler for contrastive learning. It introduces meta-paths D-G-D and G-D-G to strengthen intra-type relationships and derives a graph diffusion network to progressively generate challenging negative samples, reducing the need for exhaustive negatives. The method delivers state-of-the-art results on DGIdb 4.0 and strong generalization on drug-go-disease tripartite networks, with ablations showing the diffusion component as a key contributor. The approach offers a scalable, robust framework for heterogeneous biological networks and has potential for extensions to miRNA-drug interactions and other multi-omics tasks.
Abstract
Predicting drug-gene associations is crucial for drug development and disease treatment. While graph neural networks (GNN) have shown effectiveness in this task, they face challenges with data sparsity and efficient contrastive learning implementation. We introduce a graph diffusion network for drug-gene prediction (GDNDGP), a framework that addresses these limitations through two key innovations. First, it employs meta-path-based homogeneous graph learning to capture drug-drug and gene-gene relationships, ensuring similar entities share embedding spaces. Second, it incorporates a parallel diffusion network that generates hard negative samples during training, eliminating the need for exhaustive negative sample retrieval. Our model achieves superior performance on the DGIdb 4.0 dataset and demonstrates strong generalization capability on tripartite drug-gene-disease networks. Results show significant improvements over existing methods in drug-gene prediction tasks, particularly in handling complex heterogeneous relationships. The source code is publicly available at https://github.com/csjywu1/GDNDGP.
