Table of Contents
Fetching ...

Random Wavelet Features for Graph Kernel Machines

Valentin de Bassompierre, Jean-Charles Delvenne, Laurent Jacques

TL;DR

This work tackles scalable graph kernel computation by introducing randomized spectral node embeddings that approximate a Laplacian-based kernel $\boldsymbol\Gamma = h(\boldsymbol L)$ with a rank-$K$ matrix $\tilde{\boldsymbol\Gamma} = \boldsymbol\Phi^\top \boldsymbol\Phi$. The method proceeds in two stages: range finding via polynomial-filtered random signals to approximate the top-$K$ spectral subspace, and kernel embedding yielding $\boldsymbol\Phi = (h^{1/2}(\boldsymbol L)\boldsymbol Q)^\top$ so that $\tilde{\boldsymbol\Gamma}$ captures the dominant kernel components. The paper provides theoretical error bounds, connects to Randomized SVD, analyzes computational complexity, and demonstrates empirically that the approach outperforms existing random-feature methods for spectrally localized kernels, enabling scalable, principled graph representation learning. Overall, this method offers a practical framework for accurate low-rank kernel approximations on large graphs, with particular advantage for narrow-band kernels that reflect global geometric structure.

Abstract

Node embeddings map graph vertices into low-dimensional Euclidean spaces while preserving structural information. They are central to tasks such as node classification, link prediction, and signal reconstruction. A key goal is to design node embeddings whose dot products capture meaningful notions of node similarity induced by the graph. Graph kernels offer a principled way to define such similarities, but their direct computation is often prohibitive for large networks. Inspired by random feature methods for kernel approximation in Euclidean spaces, we introduce randomized spectral node embeddings whose dot products estimate a low-rank approximation of any specific graph kernel. We provide theoretical and empirical results showing that our embeddings achieve more accurate kernel approximations than existing methods, particularly for spectrally localized kernels. These results demonstrate the effectiveness of randomized spectral constructions for scalable and principled graph representation learning.

Random Wavelet Features for Graph Kernel Machines

TL;DR

This work tackles scalable graph kernel computation by introducing randomized spectral node embeddings that approximate a Laplacian-based kernel with a rank- matrix . The method proceeds in two stages: range finding via polynomial-filtered random signals to approximate the top- spectral subspace, and kernel embedding yielding so that captures the dominant kernel components. The paper provides theoretical error bounds, connects to Randomized SVD, analyzes computational complexity, and demonstrates empirically that the approach outperforms existing random-feature methods for spectrally localized kernels, enabling scalable, principled graph representation learning. Overall, this method offers a practical framework for accurate low-rank kernel approximations on large graphs, with particular advantage for narrow-band kernels that reflect global geometric structure.

Abstract

Node embeddings map graph vertices into low-dimensional Euclidean spaces while preserving structural information. They are central to tasks such as node classification, link prediction, and signal reconstruction. A key goal is to design node embeddings whose dot products capture meaningful notions of node similarity induced by the graph. Graph kernels offer a principled way to define such similarities, but their direct computation is often prohibitive for large networks. Inspired by random feature methods for kernel approximation in Euclidean spaces, we introduce randomized spectral node embeddings whose dot products estimate a low-rank approximation of any specific graph kernel. We provide theoretical and empirical results showing that our embeddings achieve more accurate kernel approximations than existing methods, particularly for spectrally localized kernels. These results demonstrate the effectiveness of randomized spectral constructions for scalable and principled graph representation learning.
Paper Structure (18 sections, 6 theorems, 45 equations, 2 figures, 1 algorithm)

This paper contains 18 sections, 6 theorems, 45 equations, 2 figures, 1 algorithm.

Key Result

Proposition 1

Under Assumption ass:pchi, and with all previous notations holding, we have and

Figures (2)

  • Figure 1: Average relative spectral norm of the error w.r.t. the ground truth kernel, $\| \boldsymbol\Gamma - \tilde{\boldsymbol\Gamma} \| / \| \boldsymbol\Gamma \|$. Shaded areas represent min. and max. values, over 5 trials. Left (a): As a function of $\sigma$, and compared to g-GRFs. Center and right (b-c): As a function of the target rank $K$, for $\sigma=5$, and compared to the best rank-$K$ and rank-$(K+r)$ approximations. Center (b): Swiss-Roll graph. Right (c): Community graph.
  • Figure 2: Swiss-Roll graph. Top left: average computation time as a function of $K$, for $N=5\,000$. Top right, bottom: average computation time as a function of $N$, for $K=800$. Shaded areas represent minimum and maximum values, over 5 trials.

Theorems & Definitions (6)

  • Proposition 1
  • Lemma 1: halko2011finding, Proposition 8.1
  • Lemma 2: halko2011finding, Proposition 8.2
  • Lemma 3: halko2011finding, Proposition 8.3
  • Lemma 4: halko2011finding, Proposition 10.1
  • Lemma 5: halko2011finding, Proposition 10.2