Layer-diverse Negative Sampling for Graph Neural Networks
Wei Duan, Jie Lu, Yu Guang Wang, Junyu Xuan
TL;DR
This paper tackles the shortcomings of traditional GNNs that rely solely on positive samples, which can lead to over-smoothing and limited expressivity, and addresses over-squashing by introducing Layer-diverse Negative Sampling (LDNS). LDNS leverages a DPP-based sampling matrix and a space-squeezing operation to generate diverse negative samples that span across layers, with a shortest-path candidate set to keep computation tractable; a layer-aware, k-DPP sampling step ensures reduced redundancy between layers. Empirically, LDNS improves node classification accuracy across seven homophilous and three heterophilous datasets, reduces cross-layer overlap in negative samples, and demonstrates robustness across multiple GNN architectures, while offering a favorable trade-off between performance and time complexity. The work shows that dynamically adding carefully chosen negative samples effectively rewires information flow, enhancing GNN expressivity and reducing over-squashing with practical computational considerations.
Abstract
Graph neural networks (GNNs) are a powerful solution for various structure learning applications due to their strong representation capabilities for graph data. However, traditional GNNs, relying on message-passing mechanisms that gather information exclusively from first-order neighbours (known as positive samples), can lead to issues such as over-smoothing and over-squashing. To mitigate these issues, we propose a layer-diverse negative sampling method for message-passing propagation. This method employs a sampling matrix within a determinantal point process, which transforms the candidate set into a space and selectively samples from this space to generate negative samples. To further enhance the diversity of the negative samples during each forward pass, we develop a space-squeezing method to achieve layer-wise diversity in multi-layer GNNs. Experiments on various real-world graph datasets demonstrate the effectiveness of our approach in improving the diversity of negative samples and overall learning performance. Moreover, adding negative samples dynamically changes the graph's topology, thus with the strong potential to improve the expressiveness of GNNs and reduce the risk of over-squashing.
