Kernel Quantile Embeddings and Associated Probability Metrics
Masha Naslidnyk, Siu Lun Chau, François-Xavier Briol, Krikamol Muandet
TL;DR
This work introduces kernel quantile embeddings (KQEs) and kernel quantile discrepancies (KQDs) to represent and compare probability distributions in RKHSs beyond the traditional kernel mean embedding. By using directional quantiles in the RKHS, the authors establish quantile-characteristic kernels under milder conditions than mean-characteristic ones, and define e-KQD and sup-KQD as distance measures with near-linear estimators. They show connections to sliced Wasserstein and Sinkhorn divergences, and provide a Gaussian-measure-based estimator for e-KQD with favorable computational costs. Empirical results on two-sample testing demonstrate competitive power with MMD-based methods while offering improved scalability, especially in high-dimensional settings. Overall, KQEs offer a flexible, efficient framework for distributional comparisons with potential across hypothesis testing and related distributional learning tasks.
Abstract
Embedding probability distributions into reproducing kernel Hilbert spaces (RKHS) has enabled powerful nonparametric methods such as the maximum mean discrepancy (MMD), a statistical distance with strong theoretical and computational properties. At its core, the MMD relies on kernel mean embeddings to represent distributions as mean functions in RKHS. However, it remains unclear if the mean function is the only meaningful RKHS representation. Inspired by generalised quantiles, we introduce the notion of kernel quantile embeddings (KQEs). We then use KQEs to construct a family of distances that: (i) are probability metrics under weaker kernel conditions than MMD; (ii) recover a kernelised form of the sliced Wasserstein distance; and (iii) can be efficiently estimated with near-linear cost. Through hypothesis testing, we show that these distances offer a competitive alternative to MMD and its fast approximations.
