Quantization Of Probability Measures In Maximum Mean Discrepancy Distance
Zahra Mehraban, Alois Pichler
TL;DR
The paper studies the quantization of probability measures by minimizing the maximum mean discrepancy (MMD), a distance defined through a reproducing kernel Hilbert space (RKHS). It proposes a two-stage approach: first determine the optimal weights for a fixed set of support points, then optimize the locations of those points. Reformulating the objective as an expectation over a product space makes stochastic gradient methods applicable, and for the Gaussian kernel closed-form expressions yield a deterministic alternative. Key contributions include explicit optimal weights for general measures and for probability measures, a stochastic location-optimization framework whose cost function is defined over pairs drawn from the data distribution, handling of non-negativity constraints, and numerical validation across several distributions with kernel choices such as Gaussian and Matérn. The result is a scalable quantization of continuous distributions, relevant to machine learning and signal processing tasks that need tractable finite representations of measures.
Abstract
Accurate approximation of probability measures is essential in numerical applications. This paper explores the quantization of probability measures using the maximum mean discrepancy (MMD) distance as a guiding metric. We first investigate optimal approximations by determining the best weights, followed by addressing the problem of optimal facility locations. To facilitate efficient computation, we reformulate the nonlinear objective as expectations over a product space, enabling the use of stochastic approximation methods. For the Gaussian kernel, we derive closed-form expressions to develop a deterministic optimization approach. By integrating stochastic approximation with deterministic techniques, our framework achieves precise and efficient quantization of continuous distributions, with significant implications for machine learning and signal processing applications.
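The weight-optimization step can be illustrated with a minimal sketch; it is not the paper's implementation. It assumes a one-dimensional target measure N(0, s^2), a Gaussian kernel with bandwidth `ell`, and fixed support points `x`. Under these assumptions the squared MMD is the quadratic w^T K w - 2 w^T m + c, with K_ij = k(x_i, x_j), m_i = E[k(x_i, Y)], and c = E[k(Y, Y')], so the optimal weights solve a linear system. The function names and the choice of target are illustrative assumptions, and the sketch does not enforce non-negativity of the weights.

```python
import numpy as np

def gaussian_kernel(x, y, ell):
    """Kernel matrix k(x_i, y_j) = exp(-(x_i - y_j)^2 / (2 ell^2)) for 1-D points."""
    d = x[:, None] - y[None, :]
    return np.exp(-d**2 / (2.0 * ell**2))

def kernel_mean_gaussian(x, ell, s):
    """Closed form of m_i = E[k(x_i, Y)] for Y ~ N(0, s^2) (assumed target)."""
    v = ell**2 + s**2
    return ell / np.sqrt(v) * np.exp(-x**2 / (2.0 * v))

def optimal_weights(x, ell, s, normalize=True):
    """Minimize w^T K w - 2 w^T m, optionally subject to sum(w) = 1."""
    K = gaussian_kernel(x, x, ell)
    m = kernel_mean_gaussian(x, ell, s)
    w_free = np.linalg.solve(K, m)            # unconstrained minimizer K^{-1} m
    if not normalize:
        return w_free
    k_inv_one = np.linalg.solve(K, np.ones_like(x))
    lam = (1.0 - w_free.sum()) / k_inv_one.sum()  # Lagrange multiplier for sum(w) = 1
    return w_free + lam * k_inv_one

def mmd_squared(x, w, ell, s):
    """Squared MMD between N(0, s^2) and the discrete measure sum_i w_i delta_{x_i}."""
    K = gaussian_kernel(x, x, ell)
    m = kernel_mean_gaussian(x, ell, s)
    c = ell / np.sqrt(ell**2 + 2.0 * s**2)    # E[k(Y, Y')] for independent Y, Y'
    return w @ K @ w - 2.0 * w @ m + c

# Hypothetical example: five equally spaced support points.
x = np.linspace(-2.0, 2.0, 5)
w = optimal_weights(x, ell=1.0, s=1.0)
uniform = np.full_like(x, 1.0 / x.size)
print("weights:", w)
print("MMD^2 optimal:", mmd_squared(x, w, 1.0, 1.0))
print("MMD^2 uniform:", mmd_squared(x, uniform, 1.0, 1.0))
```

With the locations held fixed, the normalized weights can be no worse in MMD than uniform weights on the same points, since uniform weights are feasible for the same constrained problem. Optimizing the locations themselves is a separate, nonlinear step that this sketch does not cover.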
