Simple GNNs with Low Rank Non-parametric Aggregators

Luciano Vinas; Arash A. Amini

Simple GNNs with Low Rank Non-parametric Aggregators

Luciano Vinas, Arash A. Amini

TL;DR

This work revisits recent spectral GNN approaches to semi-supervised node classification and suggests conventional methods such as non-parametric regression are well suited for semi-supervised learning on sparse, directed networks and a variety of other graph types commonly found in SSNC benchmarks.

Abstract

We revisit recent spectral GNN approaches to semi-supervised node classification (SSNC). We posit that state-of-the-art (SOTA) GNN architectures may be over-engineered for common SSNC benchmark datasets (citation networks, page-page networks, etc.). By replacing feature aggregation with a non-parametric learner we are able to streamline the GNN design process and avoid many of the engineering complexities associated with SOTA hyperparameter selection (GNN depth, non-linearity choice, feature dropout probability, etc.). Our empirical experiments suggest conventional methods such as non-parametric regression are well suited for semi-supervised learning on sparse, directed networks and a variety of other graph types commonly found in SSNC benchmarks. Additionally, we bring attention to recent changes in evaluation conventions for SSNC benchmarking and how this may have partially contributed to rising performances over time.

Simple GNNs with Low Rank Non-parametric Aggregators

TL;DR

Abstract

Paper Structure (16 sections, 6 equations, 6 figures, 3 tables)

This paper contains 16 sections, 6 equations, 6 figures, 3 tables.

Introduction
GNN and SSNC Formalism
Nonparametric Spectral Reshaping
Practical considerations.
Directed/asymmetric case.
Multiple layers.
Motivating General Spectral Learners
Complexity of Low Rank Spectral Learners
Experiments
SSNC Benchmarks
CSBM Experiment
Aggregation Ablation
Matrix Representation ($\bm M$):
Changes in Evaluation Conventions
Comparing Split Performances
...and 1 more sections

Figures (6)

Figure 1: Example of learned spectral filter $\hat{f}(\lambda)$ for the CSBM experiment with an unbounded Sobolev kernel ($\gamma = 0.1$).
Figure 2: Accuracy comparison of the Kernel model for different graph representations $\bm A$ and $\bm D-\bm A$. Shown above is the signed accuracy difference between the adjacency and Laplacian representations. Best performing kernel was selected per dataset.
Figure 3: LR Kernel performance relative to the full-rank Kernel for different truncation factors $r$. Performance is seen to gradually decline on most datasets as the truncation factor $r$ decreases (that is truncation percentage increases). LR Kernel performance can also be seen to periodically increase above full-rank Kernel performance for the datasets Chameleon (red) and Squirrel (purple).
Figure 4: Performance homogenization achieved by LR Kernel model on directed networks.
Figure 5: Accuracy results and uncertainties on the citation datasets using different splits with linear models $\bm{XW}$ and $\bm{AXW}$. "Public" refers to the split introduced by Kipf17. Both "Sparse" and "Public" are single splits, so one cannot associate uncertainty to them.
...and 1 more figures

Simple GNNs with Low Rank Non-parametric Aggregators

TL;DR

Abstract

Simple GNNs with Low Rank Non-parametric Aggregators

Authors

TL;DR

Abstract

Table of Contents

Figures (6)