Graph Partial Label Learning with Potential Cause Discovering
Hang Gao, Jiaguo Yuan, Jiangmeng Li, Peng Qiao, Fengge Wu, Changwen Zheng, Huaping Liu
TL;DR
This work tackles graph representation learning under Partial Label Learning (PLL), where each graph is associated with multiple candidate labels and only one is correct. It introduces Graph Partial Label Learning with Potential Cause Discovering (GPCD), a three-phase framework that pre-trains a GNN, estimates a graph causal subset via potential causes, and performs auxiliary training guided by this subset to mitigate label noise. The approach is supported by theoretical results linking potential causes to the graph causal subset and by extensive experiments on seven graph PLL datasets, showing improved accuracy and robustness to noise and distribution shifts. The method offers a principled way to isolate task-relevant graph structure under PLL, with practical implications for scalable, accurate graph learning in weakly supervised settings.
Abstract
Graph Neural Networks (GNNs) have garnered widespread attention for their potential to address the challenges posed by graph representation learning, which face complex graph-structured data across various domains. However, due to the inherent complexity and interconnectedness of graphs, accurately annotating graph data for training GNNs is extremely challenging. To address this issue, we have introduced Partial Label Learning (PLL) into graph representation learning. PLL is a critical weakly supervised learning problem where each training instance is associated with a set of candidate labels, including the ground-truth label and the additional interfering labels. PLL allows annotators to make errors, which reduces the difficulty of data labeling. Subsequently, we propose a novel graph representation learning method that enables GNN models to effectively learn discriminative information within the context of PLL. Our approach utilizes potential cause extraction to obtain graph data that holds causal relationships with the labels. By conducting auxiliary training based on the extracted graph data, our model can effectively eliminate the interfering information in the PLL scenario. We support the rationale behind our method with a series of theoretical analyses. Moreover, we conduct extensive evaluations and ablation studies on multiple datasets, demonstrating the superiority of our proposed method.
