Table of Contents
Fetching ...

Superposition in Graph Neural Networks

Lukas Pertl, Han Xuanyuan, Pietro Liò

TL;DR

This work provides a representation-centric framework to study superposition in graph neural networks, quantifying how many independent axes are used and how tightly directions are packed in node and graph latent spaces. By extracting linear-probe directions and class-conditional centroids on held-out data, and applying metrics such as EffRank, SI, and Welch-normalized overlap, it reveals geometry-driven effects of width and pooling across GCN, GIN, and GAT. The findings show a three-phase width-dependent behavior, topology-induced node-level entanglement that pooling can re-mix into task axes, and metastable low-rank embeddings in shallow models. Practically, the results offer design guidance for more interpretable GNNs, linking architectural choices to concrete representational geometry outcomes.

Abstract

Interpreting graph neural networks (GNNs) is difficult because message passing mixes signals and internal channels rarely align with human concepts. We study superposition, the sharing of directions by multiple features, directly in the latent space of GNNs. Using controlled experiments with unambiguous graph concepts, we extract features as (i) class-conditional centroids at the graph level and (ii) linear-probe directions at the node level, and then analyze their geometry with simple basis-invariant diagnostics. Across GCN/GIN/GAT we find: increasing width produces a phase pattern in overlap; topology imprints overlap onto node-level features that pooling partially remixes into task-aligned graph axes; sharper pooling increases axis alignment and reduces channel sharing; and shallow models can settle into metastable low-rank embeddings. These results connect representational geometry with concrete design choices (width, pooling, and final-layer activations) and suggest practical approaches for more interpretable GNNs.

Superposition in Graph Neural Networks

TL;DR

This work provides a representation-centric framework to study superposition in graph neural networks, quantifying how many independent axes are used and how tightly directions are packed in node and graph latent spaces. By extracting linear-probe directions and class-conditional centroids on held-out data, and applying metrics such as EffRank, SI, and Welch-normalized overlap, it reveals geometry-driven effects of width and pooling across GCN, GIN, and GAT. The findings show a three-phase width-dependent behavior, topology-induced node-level entanglement that pooling can re-mix into task axes, and metastable low-rank embeddings in shallow models. Practically, the results offer design guidance for more interpretable GNNs, linking architectural choices to concrete representational geometry outcomes.

Abstract

Interpreting graph neural networks (GNNs) is difficult because message passing mixes signals and internal channels rarely align with human concepts. We study superposition, the sharing of directions by multiple features, directly in the latent space of GNNs. Using controlled experiments with unambiguous graph concepts, we extract features as (i) class-conditional centroids at the graph level and (ii) linear-probe directions at the node level, and then analyze their geometry with simple basis-invariant diagnostics. Across GCN/GIN/GAT we find: increasing width produces a phase pattern in overlap; topology imprints overlap onto node-level features that pooling partially remixes into task-aligned graph axes; sharper pooling increases axis alignment and reduces channel sharing; and shallow models can settle into metastable low-rank embeddings. These results connect representational geometry with concrete design choices (width, pooling, and final-layer activations) and suggest practical approaches for more interpretable GNNs.

Paper Structure

This paper contains 47 sections, 42 equations, 7 figures, 2 tables.

Figures (7)

  • Figure 1: Superposition effects across bottleneck dimensions $d$. A dashed line corresponds to $k=d$. Shaded regions show uncertainty as mean $\pm 1.96\sigma/ \sqrt{R}$ where $R=16$ is the number of seeds.
  • Figure 2: SI and WNO measured on GNNs trained on CONJUNCTION. Red highlighted data-points indicate models with perfect test accuracy.
  • Figure 3: Mean cosine similarities for the node features (top) and graph features (bottom) on Conjunction.
  • Figure 4: Alignment versus pooling parameter $p$ across architectures on Pairwise ($k=16$). Shaded regions show uncertainty as mean $\pm 1.96\sigma/ \sqrt{R}$ where $R=16$ is the number of seeds.
  • Figure 5: Axes‑aligned embedding vectors lose less information than arbitrary‑angled vectors under max pooling when both are corrupted by equal‑energy noise.
  • ...and 2 more figures