Random Field Representations of Kernel Distances
Ian Langmore
TL;DR
This work reframes kernel-based distances between probability measures as expectations over random fields, showing ${\mathcal{D}^2}(μ-ν) = \mathbb{E}_U{\langle U, μ-ν \rangle^2}$ and linking this view to the conventional kernel/MMD form and to Fourier/Wiener representations. It develops a family of distance notions by replacing Brownian motion with continuous fractional fields ${B^H}$ and Gaussian free fields (GFF), yielding generalized energy distances such as the Dirichlet energy distance, with explicit spectral representations: ${\mathcal{D}^2}(μ-ν) \propto \int \frac{|\hat{μ}(ω) - \hat{ν}(ω)|^2}{\|ω\|^{d+2H}} dω$ and, for GFF, distances tied to Dirichlet energy. The paper provides conditions under which these distances are characteristic, analyzes sample-path assumptions (stationary increments, fractal scaling, Banach-space embedding), and gives support theorems ensuring the inducing fields are dense enough to distinguish measures. It also demonstrates practical implications through signal-to-noise analyses, discrete-space examples, and additive Brownian motion, guiding finite-sample estimation and informing when and how to use fractional and Gaussian fields to capture different moment- and tail-structure in distributions.
Abstract
Positive semi-definite kernels are used to induce pseudo-metrics, or ``distances'', between measures. We write these as an expected quadratic variation of, or expected inner product between, a random field and the difference of measures. This alternate viewpoint offers important intuition and interesting connections to existing forms. Metric distances leading to convenient finite sample estimates are shown to be induced by fields with dense support, stationary increments, and scale invariance. The main example of this is energy distance. We show that the common generalization preserving continuity is induced by fractional Brownian motion. We induce an alternate generalization with the Gaussian free field, formally extending the Cramér-von Mises distance. Pathwise properties give intuition about practical aspects of each. This is demonstrated through signal to noise ratio studies.
