Evidence Networks: simple losses for fast, amortized, neural Bayesian model comparison
Niall Jeffrey, Benjamin D. Wandelt
TL;DR
Evidence Networks tackle Bayesian model comparison when traditional methods falter due to intractable integrals, unknown likelihoods, or missing parameterizations. By designing bespoke losses, the authors train neural estimators that directly yield useful functions of the Bayes factor, with the l-POP-Exponential loss as the default for robust $\log K$ estimates. They validate the approach on analytic time-series problems and Dark Energy Survey data, showing accuracy across high-dimensional spaces and resilience in likelihood-free settings, while outperforming density-estimation baselines and remaining competitive with nested sampling when likelihoods are known. The method enables fast, amortized, parameter-free model comparison and broadens the scope of simulation-based inference, posterior predictive checks, and potential frequentist calibration via Bayes factors.
Abstract
Evidence Networks can enable Bayesian model comparison when state-of-the-art methods (e.g. nested sampling) fail and even when likelihoods or priors are intractable or unknown. Bayesian model comparison, i.e. the computation of Bayes factors or evidence ratios, can be cast as an optimization problem. Though the Bayesian interpretation of optimal classification is well-known, here we change perspective and present classes of loss functions that result in fast, amortized neural estimators that directly estimate convenient functions of the Bayes factor. This mitigates numerical inaccuracies associated with estimating individual model probabilities. We introduce the leaky parity-odd power (l-POP) transform, leading to the novel ``l-POP-Exponential'' loss function. We explore neural density estimation for data probability in different models, showing it to be less accurate and scalable than Evidence Networks. Multiple real-world and synthetic examples illustrate that Evidence Networks are explicitly independent of dimensionality of the parameter space and scale mildly with the complexity of the posterior probability density function. This simple yet powerful approach has broad implications for model inference tasks. As an application of Evidence Networks to real-world data we compute the Bayes factor for two models with gravitational lensing data of the Dark Energy Survey. We briefly discuss applications of our methods to other, related problems of model comparison and evaluation in implicit inference settings.
