Graph Hopfield Networks: Energy-Based Node Classification with Associative Memory

Abinav Rao; Alex Wa; Rishi Athavale

Graph Hopfield Networks: Energy-Based Node Classification with Associative Memory

Abinav Rao, Alex Wa, Rishi Athavale

TL;DR

Graph Hopfield Networks is introduced, whose energy function couples associative memory retrieval with graph Laplacian smoothing for node classification, and Tuning enables graph sharpening for heterophilous benchmarks without architectural changes.

Abstract

We introduce Graph Hopfield Networks, whose energy function couples associative memory retrieval with graph Laplacian smoothing for node classification. Gradient descent on this joint energy yields an iterative update interleaving Hopfield retrieval with Laplacian propagation. Memory retrieval provides regime-dependent benefits: up to 2.0~pp on sparse citation networks and up to 5 pp additional robustness under feature masking; the iterative energy-descent architecture itself is a strong inductive bias, with all variants (including the memory-disabled NoMem ablation) outperforming standard baselines on Amazon co-purchase graphs. Tuning enables graph sharpening for heterophilous benchmarks without architectural changes.

Graph Hopfield Networks: Energy-Based Node Classification with Associative Memory

TL;DR

Abstract

Paper Structure (27 sections, 15 theorems, 57 equations, 4 figures, 11 tables)

This paper contains 27 sections, 15 theorems, 57 equations, 4 figures, 11 tables.

Introduction
Graph Hopfield Networks
Energy Function and Update Rule
Gated Memory Retrieval
Hierarchical Memory
Experiments
Setup
Node Classification
Robustness to Graph Corruption
Heterophilous Graphs
Discussion and Future Directions
Experimental Details
Related Work
Additional Results
Edge Deletion on Planetoid
...and 12 more sections

Key Result

Proposition D.1

If $\mathbf{A}$ is symmetric and $\mathbf{D}_{vv}=d_v>0$, then Hence $\mathbf{L}\succeq 0$ and the Laplacian term is convex.

Figures (4)

Figure 1: One iteration of the GHN update: node features $\mathbf{x}_v$ are blended with (i) memory retrieval from pattern bank $\mathbf{M}$, and (ii) graph Laplacian smoothing over neighbors. Damping $\alpha$ controls the step size.
Figure 2: Accuracy under progressive edge deletion on Amazon and Planetoid datasets. GHN variants degrade more gradually on Amazon; Planetoid details in Appendix \ref{['app:edge_robustness']}.
Figure 3: Accuracy under feature masking across five datasets. Tuning selects $K{=}64$ on Planetoid and $K{=}256$ on Amazon. All models degrade at comparable rates on Planetoid; on Amazon, memory-enabled variants show substantially greater robustness (Table \ref{['tab:amazon_corruption']}).
Figure 4: Phase diagram of LSE vs. LSR energy on Amazon Photo. Each cell shows mean test accuracy over 10 seeds. High variance in some cells reflects bimodal behavior (seeds either converge to ${\sim}94\%$ or collapse). At $K{=}64$, LSR is stable for moderate $\beta$ while LSE is bimodal for all tested $\beta$.

Theorems & Definitions (32)

Proposition D.1: Normalized Laplacian quadratic form
proof
Proposition D.2: Gradient formula
proof
Lemma D.3: Retrieval Jacobian
proof
Lemma D.4: Softmax covariance bound
proof
Proposition D.5: Convexity and strong convexity regime
proof
...and 22 more

Graph Hopfield Networks: Energy-Based Node Classification with Associative Memory

TL;DR

Abstract

Graph Hopfield Networks: Energy-Based Node Classification with Associative Memory

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (32)