Table of Contents
Fetching ...

Heterophilic Graph Neural Networks Optimization with Causal Message-passing

Botao Wang, Jia Li, Heng Chang, Keli Zhang, Fugee Tsung

TL;DR

This work proposes CausalMP, a causal message-passing discovery network for heterophilic graph learning, that iteratively learns the explicit causal structure of input graphs and achieves superior link prediction performance.

Abstract

In this work, we discover that causal inference provides a promising approach to capture heterophilic message-passing in Graph Neural Network (GNN). By leveraging cause-effect analysis, we can discern heterophilic edges based on asymmetric node dependency. The learned causal structure offers more accurate relationships among nodes. To reduce the computational complexity, we introduce intervention-based causal inference in graph learning. We first simplify causal analysis on graphs by formulating it as a structural learning model and define the optimization problem within the Bayesian scheme. We then present an analysis of decomposing the optimization target into a consistency penalty and a structure modification based on cause-effect relations. We then estimate this target by conditional entropy and present insights into how conditional entropy quantifies the heterophily. Accordingly, we propose CausalMP, a causal message-passing discovery network for heterophilic graph learning, that iteratively learns the explicit causal structure of input graphs. We conduct extensive experiments in both heterophilic and homophilic graph settings. The result demonstrates that the our model achieves superior link prediction performance. Training on causal structure can also enhance node representation in classification task across different base models.

Heterophilic Graph Neural Networks Optimization with Causal Message-passing

TL;DR

This work proposes CausalMP, a causal message-passing discovery network for heterophilic graph learning, that iteratively learns the explicit causal structure of input graphs and achieves superior link prediction performance.

Abstract

In this work, we discover that causal inference provides a promising approach to capture heterophilic message-passing in Graph Neural Network (GNN). By leveraging cause-effect analysis, we can discern heterophilic edges based on asymmetric node dependency. The learned causal structure offers more accurate relationships among nodes. To reduce the computational complexity, we introduce intervention-based causal inference in graph learning. We first simplify causal analysis on graphs by formulating it as a structural learning model and define the optimization problem within the Bayesian scheme. We then present an analysis of decomposing the optimization target into a consistency penalty and a structure modification based on cause-effect relations. We then estimate this target by conditional entropy and present insights into how conditional entropy quantifies the heterophily. Accordingly, we propose CausalMP, a causal message-passing discovery network for heterophilic graph learning, that iteratively learns the explicit causal structure of input graphs. We conduct extensive experiments in both heterophilic and homophilic graph settings. The result demonstrates that the our model achieves superior link prediction performance. Training on causal structure can also enhance node representation in classification task across different base models.

Paper Structure

This paper contains 18 sections, 2 theorems, 13 equations, 6 figures, 5 tables, 1 algorithm.

Key Result

Proposition 3.2

Given the intervention strategy $I\in\mathcal{I}$, if we have the condition distribution $P(X_{-I},A|X), P(A|X)$, the causal structure $A_c$ and corresponding optimal intervention target $x_I$ can be obtained by optimizing: where $X_I$ are the intervened node features, $X_{-I}$ are the non-intervened features, $\mathbf{H}(\cdot)$ is the entropy.

Figures (6)

  • Figure 1: Detect heterophily by causal-effect estimation from asymmetric information flow that results from the mimic behaviors of fraudsters.
  • Figure 2: Venn diagram of center node and its neighbors.
  • Figure 3: Main scheme of CausalMP. In each iteration, we modify the detected dependencies into directed edge (red) and add edges (yellow) through mutual information. Both graphs are encoded and decoded by GNN with shared parameter that optimized by the weighed summation of three losses.
  • Figure 4: Influence of shot number in node classification.
  • Figure 5: Influence of consistency term $\alpha$.
  • ...and 1 more figures

Theorems & Definitions (3)

  • Proposition 3.2
  • Proposition 3.3
  • Definition 3.4