BayesSum: Bayesian Quadrature in Discrete Spaces
Sophia Seulkee Kang, François-Xavier Briol, Toni Karvonen, Zonghao Chen
TL;DR
BayesSum extends Bayesian quadrature to discrete domains by placing a Gaussian process prior on the integrand f and deriving a Gaussian posterior for the intractable sum I = E[f(X)]. It achieves superior sample efficiency over traditional Monte Carlo baselines and provides finite-sample uncertainty, with practical variants for mixed discrete-continuous domains and active query strategies. The approach is validated on synthetic benchmarks and realistic unnormalized models (CMP, Potts), showing improved normalization-constant estimation and parameter learning with fewer function evaluations. The work also develops closed-form kernel mean embeddings for discrete distributions and introduces Stein BayesSum to handle cases lacking such embeddings, highlighting strong potential for discrete probabilistic numerics and complex Bayesian inference tasks.
Abstract
This paper addresses the challenging computational problem of estimating intractable expectations over discrete domains. Existing approaches, including Monte Carlo and Russian Roulette estimators, are consistent but often require a large number of samples to achieve accurate results. We propose a novel estimator, \emph{BayesSum}, which is an extension of Bayesian quadrature to discrete domains. It is more sample efficient than alternatives due to its ability to make use of prior information about the integrand through a Gaussian process. We show this through theory, deriving a convergence rate significantly faster than Monte Carlo in a broad range of settings. We also demonstrate empirically that our proposed method does indeed require fewer samples on several synthetic settings as well as for parameter estimation for Conway-Maxwell-Poisson and Potts models.
