Revealing Decurve Flows for Generalized Graph Propagation

Chen Lin; Liheng Ma; Yiyang Chen; Wanli Ouyang; Michael M. Bronstein; Philip H. S. Torr

Revealing Decurve Flows for Generalized Graph Propagation

Chen Lin, Liheng Ma, Yiyang Chen, Wanli Ouyang, Michael M. Bronstein, Philip H. S. Torr

TL;DR

This study addresses the limitations of the traditional analysis of message-passing, central to graph learning, by defining Generalized Propagation Neural Networks (GPNNs), a framework that unifies most propagation-based graph neural networks.

Abstract

This study addresses the limitations of the traditional analysis of message-passing, central to graph learning, by defining {\em \textbf{generalized propagation}} with directed and weighted graphs. The significance manifest in two ways. \textbf{Firstly}, we propose {\em Generalized Propagation Neural Networks} (\textbf{GPNNs}), a framework that unifies most propagation-based graph neural networks. By generating directed-weighted propagation graphs with adjacency function and connectivity function, GPNNs offer enhanced insights into attention mechanisms across various graph models. We delve into the trade-offs within the design space with empirical experiments and emphasize the crucial role of the adjacency function for model expressivity via theoretical analysis. \textbf{Secondly}, we propose the {\em Continuous Unified Ricci Curvature} (\textbf{CURC}), an extension of celebrated {\em Ollivier-Ricci Curvature} for directed and weighted graphs. Theoretically, we demonstrate that CURC possesses continuity, scale invariance, and a lower bound connection with the Dirichlet isoperimetric constant validating bottleneck analysis for GPNNs. We include a preliminary exploration of learned propagation patterns in datasets, a first in the field. We observe an intriguing ``{\em \textbf{decurve flow}}'' - a curvature reduction during training for models with learnable propagation, revealing the evolution of propagation over time and a deeper connection to over-smoothing and bottleneck trade-off.

Revealing Decurve Flows for Generalized Graph Propagation

TL;DR

Abstract

Paper Structure (66 sections, 26 theorems, 107 equations, 6 figures, 9 tables)

This paper contains 66 sections, 26 theorems, 107 equations, 6 figures, 9 tables.

Introduction
Directed and Weighted Propagation
Preliminary
Generalized Propagation
GPNNs
GPNN Design Space
Multi-head v.s. Single-head
Local v.s. Non-Local v.s. Global Propagation
Static v.s. Dynamic Propagation
Feature-independent v.s. Feature-dependent Propagation
Expressiveness of GPNNs
Continuous Unified Ricci Curvature
Preliminaries for CURC
Construction of CURC
Optimal Transport of Information
...and 51 more sections

Key Result

Proposition 2.1

(Static GPNNs) For a static GPNN model with a fixed adjacency feature $[f_{uv}]_{u,v\in \mathcal{V}}$ and sufficient heads and layers, the expressiveness is upper-bounded by the color refinement iteration

Figures (6)

Figure 1: Geometric Flow Depicting Curvature Decrease. This visual serves as an analogy for the curvature decrease observed in Generalized Propagation Neural Networks (GPNNs) with learnable propagation. The proposed Continuous Uniform Ricci Curvature (CURC) are applied to characterize this "decurve flow" observed across various models. Dark-blue and red represent regions of negative and positive curvature, respectively
Figure 2: The demonstration of GPNN framework
Figure 3: Visualization of the curvature distribution characterized on ZINC for 4 models with dynamic propagation learned from data. The CURC distribution of 0 to 200 epochs are placed vertically for each model. A tendency of curvature shifting to their left side is observed across all models. This ubiquitous decurve flow phenomenon suggests a change of bottlenecks by the directed-unweighted propagation graphs analysis central to this work.
Figure 4: Visualization of Generalized Propagation and its Curvature for 6 graphs in ZINC at . 1st row: the visualization of propagation graph connection $\omega_{uv}$; 2nd row: the visualization of CURC $\kappa_{CURC}(u,v)$; 3rd row: the distributions of CURC; In the left most column, we visualize the input graph. Ep-0, Ep-50, Ep-100, Ep-1000 are listed afterwords.
Figure 5: The trend of Minimum CURC for the first 32 test graphs in ZINC. Shade is the Confidence Interval at the $95\%$ confidence level.
...and 1 more figures

Theorems & Definitions (66)

Proposition 2.1
Proposition 2.2
Definition 3.1
Definition 3.2
Definition 3.3
Theorem 3.4
Definition 3.5
Definition 3.6
Definition 3.7
Proposition 3.8
...and 56 more

Revealing Decurve Flows for Generalized Graph Propagation

TL;DR

Abstract

Revealing Decurve Flows for Generalized Graph Propagation

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (66)