Fast Inference of Removal-Based Node Influence
Weikai Li, Zhiping Xiao, Xiao Luo, Yizhou Sun
TL;DR
The paper tackles the problem of measuring task-specific node influence under node removal in graphs by using a trained GNN as a surrogate and defining influence via the total variation distance $F_{g_\theta}(v_r) = \sum_{i \neq r} || g_\theta(G)_i - g_\theta(G_{-v_r})_i ||_1$. It introduces NORA, a gradient-based approximation that decomposes the influence into three components (embedding disappearance, aggregation change, and multi-hop spread) and computes a unified estimate with one forward and one backward pass, achieving $O(LNh^2+LMh)$ time. Extensive experiments on six datasets and six GNN models show that NORA consistently outperforms adapted baselines (node-mask and prediction-based) in correlating with real influence while drastically reducing runtime, with case studies on large graphs like ogbn-arxiv validating its practical relevance. The work provides a scalable, model-agnostic approach to influence estimation, enabling applications in marketing, information diffusion, and network robustness, and opens avenues for improved approximations and broader perturbation scenarios. $F_{g_\theta}(v_r)$, complexity comparisons, and the three-term decomposition are central to the method's effectiveness and efficiency.
Abstract
Graph neural networks (GNNs) are widely utilized to capture the information spreading patterns in graphs. While remarkable performance has been achieved, there is a new trending topic of evaluating node influence. We propose a new method of evaluating node influence, which measures the prediction change of a trained GNN model caused by removing a node. A real-world application is, "In the task of predicting Twitter accounts' polarity, had a particular account been removed, how would others' polarity change?". We use the GNN as a surrogate model whose prediction could simulate the change of nodes or edges caused by node removal. Our target is to obtain the influence score for every node, and a straightforward way is to alternately remove every node and apply the trained GNN on the modified graph to generate new predictions. It is reliable but time-consuming, so we need an efficient method. The related lines of work, such as graph adversarial attack and counterfactual explanation, cannot directly satisfy our needs, since their problem settings are different. We propose an efficient, intuitive, and effective method, NOde-Removal-based fAst GNN inference (NORA), which uses the gradient information to approximate the node-removal influence. It only costs one forward propagation and one backpropagation to approximate the influence score for all nodes. Extensive experiments on six datasets and six GNN models verify the effectiveness of NORA. Our code is available at https://github.com/weikai-li/NORA.git.
