Learned harmonic mean estimation of the Bayesian evidence with normalizing flows
Alicja Polanska, Matthew A. Price, Davide Piras, Alessio Spurio Mancini, Jason D. McEwen
TL;DR
The paper addresses Bayesian model comparison by improving evidence estimation through a learned harmonic mean estimator that uses normalizing flows to learn an internal importance target contained within the posterior. This yields a robust, sampler-agnostic method that remains accurate across low to high dimensions, demonstrated on benchmark problems and a DES Y1-like cosmology analysis, and complemented by an error-propagation framework in log space. The approach combines flow-based density learning (Real NVP and rational quadratic splines) with a concentration step via a flow temperature and standardized training to achieve stable evidence estimates, outperforming naïve flow methods and matching nested sampling benchmarks. The work provides open-source code, enabling scalable evidence estimation for complex cosmological and SBI settings, and highlights practical efficiency gains when leveraging saved MCMC chains or variational inferences for posterior samples.
Abstract
We present the learned harmonic mean estimator with normalizing flows - a robust, scalable and flexible estimator of the Bayesian evidence for model comparison. Since the estimator is agnostic to sampling strategy and simply requires posterior samples, it can be applied to compute the evidence using any Markov chain Monte Carlo (MCMC) sampling technique, including saved down MCMC chains, or any variational inference approach. The learned harmonic mean estimator was recently introduced, where machine learning techniques were developed to learn a suitable internal importance sampling target distribution to solve the issue of exploding variance of the original harmonic mean estimator. In this article we present the use of normalizing flows as the internal machine learning technique within the learned harmonic mean estimator. Normalizing flows can be elegantly coupled with the learned harmonic mean to provide an approach that is more robust, flexible and scalable than the machine learning models considered previously. We perform a series of numerical experiments, applying our method to benchmark problems and to a cosmological example in up to 21 dimensions. We find the learned harmonic mean estimator is in agreement with ground truth values and nested sampling estimates. The open-source harmonic Python package implementing the learned harmonic mean, now with normalizing flows included, is publicly available.
