Table of Contents
Fetching ...

On the Probability of Necessity and Sufficiency of Explaining Graph Neural Networks: A Lower Bound Optimization Approach

Ruichu Cai, Yuxuan Zhu, Xuexin Chen, Yuan Fang, Min Wu, Jie Qiao, Zhifeng Hao

TL;DR

This paper introduces NSEG, a framework for Graph Neural Network explanations that optimizes a lower bound on the Probability of Necessity and Sufficiency (PNS) to produce explanations that are both necessary and sufficient. By modeling GNNs as Structural Causal Models and performing counterfactual estimation for edges and node features, NSEG derives a differentiable lower bound $\text{PNS}_{lb}^{e,f}$ that can be optimized with continuous masks and sampling. The method demonstrates state-of-the-art performance on synthetic and real-world graph datasets in terms of fidelity to both necessary and sufficient criteria, while providing interpretable joint explanations that combine graph structure and node features. The authors also provide variants focusing on PN or PS and perform extensive ablations, qualitative analyses, and sensitivity studies, highlighting the practical impact for trustworthy GNN explanations and potential future work on priors-free approaches.

Abstract

The explainability of Graph Neural Networks (GNNs) is critical to various GNN applications, yet it remains a significant challenge. A convincing explanation should be both necessary and sufficient simultaneously. However, existing GNN explaining approaches focus on only one of the two aspects, necessity or sufficiency, or a heuristic trade-off between the two. Theoretically, the Probability of Necessity and Sufficiency (PNS) holds the potential to identify the most necessary and sufficient explanation since it can mathematically quantify the necessity and sufficiency of an explanation. Nevertheless, the difficulty of obtaining PNS due to non-monotonicity and the challenge of counterfactual estimation limit its wide use. To address the non-identifiability of PNS, we resort to a lower bound of PNS that can be optimized via counterfactual estimation, and propose a framework of Necessary and Sufficient Explanation for GNN (NSEG) via optimizing that lower bound. Specifically, we depict the GNN as a structural causal model (SCM), and estimate the probability of counterfactual via the intervention under the SCM. Additionally, we leverage continuous masks with a sampling strategy to optimize the lower bound to enhance the scalability. Empirical results demonstrate that NSEG outperforms state-of-the-art methods, consistently generating the most necessary and sufficient explanations.

On the Probability of Necessity and Sufficiency of Explaining Graph Neural Networks: A Lower Bound Optimization Approach

TL;DR

This paper introduces NSEG, a framework for Graph Neural Network explanations that optimizes a lower bound on the Probability of Necessity and Sufficiency (PNS) to produce explanations that are both necessary and sufficient. By modeling GNNs as Structural Causal Models and performing counterfactual estimation for edges and node features, NSEG derives a differentiable lower bound that can be optimized with continuous masks and sampling. The method demonstrates state-of-the-art performance on synthetic and real-world graph datasets in terms of fidelity to both necessary and sufficient criteria, while providing interpretable joint explanations that combine graph structure and node features. The authors also provide variants focusing on PN or PS and perform extensive ablations, qualitative analyses, and sensitivity studies, highlighting the practical impact for trustworthy GNN explanations and potential future work on priors-free approaches.

Abstract

The explainability of Graph Neural Networks (GNNs) is critical to various GNN applications, yet it remains a significant challenge. A convincing explanation should be both necessary and sufficient simultaneously. However, existing GNN explaining approaches focus on only one of the two aspects, necessity or sufficiency, or a heuristic trade-off between the two. Theoretically, the Probability of Necessity and Sufficiency (PNS) holds the potential to identify the most necessary and sufficient explanation since it can mathematically quantify the necessity and sufficiency of an explanation. Nevertheless, the difficulty of obtaining PNS due to non-monotonicity and the challenge of counterfactual estimation limit its wide use. To address the non-identifiability of PNS, we resort to a lower bound of PNS that can be optimized via counterfactual estimation, and propose a framework of Necessary and Sufficient Explanation for GNN (NSEG) via optimizing that lower bound. Specifically, we depict the GNN as a structural causal model (SCM), and estimate the probability of counterfactual via the intervention under the SCM. Additionally, we leverage continuous masks with a sampling strategy to optimize the lower bound to enhance the scalability. Empirical results demonstrate that NSEG outperforms state-of-the-art methods, consistently generating the most necessary and sufficient explanations.
Paper Structure (64 sections, 5 theorems, 55 equations, 12 figures, 7 tables, 1 algorithm)

This paper contains 64 sections, 5 theorems, 55 equations, 12 figures, 7 tables, 1 algorithm.

Key Result

Lemma 1

The lower bound of $\text{PNS}(\xi)$ is:

Figures (12)

  • Figure 1: Explanations for the prediction "Sport Lover Group", which are highlighted in blue. Each node is a member in the group whose node features are their hobby, denoted by the icons. PN, PS, PNS refer to the probability of necessity, the probability of sufficiency, and the probability of necessity and sufficiency, respectively.
  • Figure 2: The causal diagram of corresponding SCM of GNN. $\mathbf{h}_{v}^{k}$ denotes the hidden representation of node $v$ in $k$-th layer, $\mathbf{e}_{v,u}$ denotes the entry between $v$ and $u$, $\mathbf{x}_{v}$ denotes the node feature of node $v$, $\mathbf{H}^{(k)}=\{\mathbf{h}_{v}^{(k)}|v\in V\}$ denotes a set of node representations in $k$-th layer, $\mathbf{X}=\{\mathbf{x}_{v}|v\in V\}$ denotes a set of node features, $\mathbf{E}=\{\mathbf{e}_{v,u}|v,u\in V\}$ denotes a set of graph entries.
  • Figure 3: Illustration of the overall framework of NSEG. (a) is the process to obtain Eq. \ref{['eq:do_ef_11_final']}; (b), (c), and (d) are the processes to obtain Eqs. \ref{['eq:do_ef_00_final']}, \ref{['eq:do_ef_01_final']}, and \ref{['eq:do_ef_10_final']}, respectively.
  • Figure 4: Fid+$^d$, Fid-$^d$, and charact$^d$ w.r.t. threshold on BA-Shapes, Tree-Cycles, Tree-Grid, Mutagenicity, and MSRC_21 datasets.
  • Figure 5: Explanations of GNNExplainer, PGExplainer,CF-GNNExplainer, CF$^2$, NSEG(PNS$^{e}$), and NSEG(PNS$^{e,f}$) in Mutagenicity and MSRC_21, where the explanations are highlighted in blue.
  • ...and 7 more figures

Theorems & Definitions (15)

  • Definition 1
  • Definition 2
  • Definition 3
  • Lemma 1
  • Lemma 2
  • Proposition 1
  • Proposition 2
  • Definition 4
  • Definition 5
  • Proposition 3
  • ...and 5 more