Riemannian Laplace Approximation with the Fisher Metric
Hanlin Yu, Marcelo Hartmann, Bernardo Williams, Mark Girolami, Arto Klami
TL;DR
This work develops a principled Riemannian extension of Laplace approximation for Bayesian inference by leveraging the Fisher information metric as intrinsic geometry. It identifies biases in the prior RLA variant and introduces two improvements: a log-map correction (RLA-BLog) and a Fisher-metric based approach (RLA-F) that achieves asymptotic exactness for Gaussian-like targets and enhances finite-data performance. Theoretical results show conditions under which RLA-F is exact and Hausdorff MAP provides reparameterization-invariant optimization for Gaussian-like targets. Empirically, RLA-F consistently outperforms Euclidean LA and RLA-B across Banana, logistic regression, and neural-network tasks, while maintaining computational feasibility; RLA-B, though scalable, exhibits bias, especially in high dimensions. Overall, the Fisher-midelity RLA variant offers a robust, scalable, and theoretically grounded framework for accurate, efficient posterior approximations in complex models.
Abstract
Laplace's method approximates a target density with a Gaussian distribution at its mode. It is computationally efficient and asymptotically exact for Bayesian inference due to the Bernstein-von Mises theorem, but for complex targets and finite-data posteriors it is often too crude an approximation. A recent generalization of the Laplace Approximation transforms the Gaussian approximation according to a chosen Riemannian geometry providing a richer approximation family, while still retaining computational efficiency. However, as shown here, its properties depend heavily on the chosen metric, indeed the metric adopted in previous work results in approximations that are overly narrow as well as being biased even at the limit of infinite data. We correct this shortcoming by developing the approximation family further, deriving two alternative variants that are exact at the limit of infinite data, extending the theoretical analysis of the method, and demonstrating practical improvements in a range of experiments.
