Random Wavelet Features for Graph Kernel Machines

Valentin de Bassompierre; Jean-Charles Delvenne; Laurent Jacques

Random Wavelet Features for Graph Kernel Machines

Valentin de Bassompierre, Jean-Charles Delvenne, Laurent Jacques

TL;DR

This work tackles scalable graph kernel computation by introducing randomized spectral node embeddings that approximate a Laplacian-based kernel $\boldsymbol\Gamma = h(\boldsymbol L)$ with a rank-$K$ matrix $\tilde{\boldsymbol\Gamma} = \boldsymbol\Phi^\top \boldsymbol\Phi$. The method proceeds in two stages: range finding via polynomial-filtered random signals to approximate the top-$K$ spectral subspace, and kernel embedding yielding $\boldsymbol\Phi = (h^{1/2}(\boldsymbol L)\boldsymbol Q)^\top$ so that $\tilde{\boldsymbol\Gamma}$ captures the dominant kernel components. The paper provides theoretical error bounds, connects to Randomized SVD, analyzes computational complexity, and demonstrates empirically that the approach outperforms existing random-feature methods for spectrally localized kernels, enabling scalable, principled graph representation learning. Overall, this method offers a practical framework for accurate low-rank kernel approximations on large graphs, with particular advantage for narrow-band kernels that reflect global geometric structure.

Abstract

Node embeddings map graph vertices into low-dimensional Euclidean spaces while preserving structural information. They are central to tasks such as node classification, link prediction, and signal reconstruction. A key goal is to design node embeddings whose dot products capture meaningful notions of node similarity induced by the graph. Graph kernels offer a principled way to define such similarities, but their direct computation is often prohibitive for large networks. Inspired by random feature methods for kernel approximation in Euclidean spaces, we introduce randomized spectral node embeddings whose dot products estimate a low-rank approximation of any specific graph kernel. We provide theoretical and empirical results showing that our embeddings achieve more accurate kernel approximations than existing methods, particularly for spectrally localized kernels. These results demonstrate the effectiveness of randomized spectral constructions for scalable and principled graph representation learning.

Random Wavelet Features for Graph Kernel Machines

TL;DR

This work tackles scalable graph kernel computation by introducing randomized spectral node embeddings that approximate a Laplacian-based kernel

with a rank-

matrix

. The method proceeds in two stages: range finding via polynomial-filtered random signals to approximate the top-

spectral subspace, and kernel embedding yielding

so that

captures the dominant kernel components. The paper provides theoretical error bounds, connects to Randomized SVD, analyzes computational complexity, and demonstrates empirically that the approach outperforms existing random-feature methods for spectrally localized kernels, enabling scalable, principled graph representation learning. Overall, this method offers a practical framework for accurate low-rank kernel approximations on large graphs, with particular advantage for narrow-band kernels that reflect global geometric structure.

Abstract

Paper Structure (18 sections, 6 theorems, 45 equations, 2 figures, 1 algorithm)

This paper contains 18 sections, 6 theorems, 45 equations, 2 figures, 1 algorithm.

Introduction
Related work
Mathematical tools
Spectral graph theory:
Graph kernels:
Proposed Method
[Part 1] Range finding:
[Part 2] Kernel approximation:
Connection with Randomized SVD and oversampling:
Complexity analysis:
Polynomial approximation error characterization:
Kernel approximation error:
Experimental results
Studying kernels with different bandwidths:
Effect of target rank K:
...and 3 more sections

Key Result

Proposition 1

Under Assumption ass:pchi, and with all previous notations holding, we have and

Figures (2)

Figure 1: Average relative spectral norm of the error w.r.t. the ground truth kernel, $\| \boldsymbol\Gamma - \tilde{\boldsymbol\Gamma} \| / \| \boldsymbol\Gamma \|$. Shaded areas represent min. and max. values, over 5 trials. Left (a): As a function of $\sigma$, and compared to g-GRFs. Center and right (b-c): As a function of the target rank $K$, for $\sigma=5$, and compared to the best rank-$K$ and rank-$(K+r)$ approximations. Center (b): Swiss-Roll graph. Right (c): Community graph.
Figure 2: Swiss-Roll graph. Top left: average computation time as a function of $K$, for $N=5\,000$. Top right, bottom: average computation time as a function of $N$, for $K=800$. Shaded areas represent minimum and maximum values, over 5 trials.

Theorems & Definitions (6)

Proposition 1
Lemma 1: halko2011finding, Proposition 8.1
Lemma 2: halko2011finding, Proposition 8.2
Lemma 3: halko2011finding, Proposition 8.3
Lemma 4: halko2011finding, Proposition 10.1
Lemma 5: halko2011finding, Proposition 10.2

Random Wavelet Features for Graph Kernel Machines

TL;DR

Abstract

Random Wavelet Features for Graph Kernel Machines

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (6)