Optimal Time Complexity Algorithms for Computing General Random Walk Graph Kernels on Sparse Graphs

Krzysztof Choromanski; Isaac Reid; Arijit Sehanobish; Avinava Dubey

TL;DR

This work presents the first linear time complexity randomized algorithms for unbiased approximation of the celebrated family of general random walk kernels (RWKs) for sparse graphs, and shows that the ability to approximate general RWKs (rather than just special cases) unlocks efficient implicit graph kernel learning.

Abstract

We present the first linear time complexity randomized algorithms for unbiased approximation of the celebrated family of general random walk kernels (RWKs) for sparse graphs. This includes both labelled and unlabelled instances. The previous fastest methods for general RWKs were of cubic time complexity and not applicable to labelled graphs. Our method samples dependent random walks to compute novel graph embeddings in $\mathbb{R}^d$ whose dot product is equal to the true RWK in expectation. It does so without instantiating the direct product graph in memory, meaning we can scale to massive datasets that cannot be stored on a single machine. We derive exponential concentration bounds to prove that our estimator is sharp, and show that the ability to approximate general RWKs (rather than just special cases) unlocks efficient implicit graph kernel learning. Our method is up to $\mathbf{27\times}$ faster than its counterparts for efficient computation on large graphs and scales to graphs $\mathbf{128\times}$ larger than the largest examples amenable to brute-force computation.
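For orientation, the brute-force baseline that the abstract contrasts against can be sketched as follows. This is a minimal illustration, not the authors' algorithm: it assumes unlabelled graphs, all-ones start and stop weights, and a series truncated to the supplied coefficients, and it explicitly builds the direct product graph as a Kronecker product of adjacency matrices. The function name `brute_force_rwk` and the example graphs are hypothetical.

```python
import numpy as np


def brute_force_rwk(adj_1, adj_2, coeffs):
    """Truncated random walk kernel: sum_k coeffs[k] * 1^T (A_x)^k 1.

    A_x = kron(adj_1, adj_2) is the adjacency matrix of the direct
    product graph. Assumption: uniform (all-ones) start/stop weights.
    """
    # Explicit product graph: (n1 * n2) x (n1 * n2) entries in memory.
    adj_prod = np.kron(adj_1, adj_2)
    vec = np.ones(adj_prod.shape[0])  # holds (A_x)^k 1 at step k
    total = 0.0
    for mu_k in coeffs:
        total += mu_k * vec.sum()  # adds mu_k * 1^T (A_x)^k 1
        vec = adj_prod @ vec
    return total


# Two small unlabelled graphs: a 3-node path and a triangle.
path_3 = np.array([[0, 1, 0], [1, 0, 1], [0, 1, 0]], dtype=float)
triangle = np.array([[0, 1, 1], [1, 0, 1], [1, 1, 0]], dtype=float)

# Geometric-style coefficients mu_k = lam**k, truncated at k = 5.
lam = 0.1
value = brute_force_rwk(path_3, triangle, [lam**k for k in range(6)])
```

The dense Kronecker product stores $(n_1 n_2)^2$ entries, which is the memory cost the abstract says the sampled-walk embeddings avoid.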
