Concentration of discrepancy-based approximate Bayesian computation via Rademacher complexity
Sirio Legramanti, Daniele Durante, Pierre Alquier
TL;DR
This work studies the concentration behavior of discrepancy-based ABC posteriors by linking their asymptotics to the Rademacher complexity of the function class that defines the discrepancy within the family of integral probability semimetrics (IPS). It offers a unified theory that yields uniform concentration bounds through constructive arguments; these bounds apply to misspecified and non-i.i.d. data and do not require strong regularity conditions on the data-generating process. The authors derive general results for fixed and shrinking tolerance regimes, and specialize them to MMD and Wasserstein-1 distances, providing explicit rates in bounded and unbounded settings. Illustrative simulations corroborate the theory, showing that IPS discrepancies with uniformly vanishing Rademacher complexity perform robustly under model misspecification and contamination. The framework guides a principled choice of discrepancy and opens avenues for extending discrepancy-based ABC to broader pseudo-posterior settings and $f$-divergences.
Abstract
There has been increasing interest in summary-free solutions for approximate Bayesian computation (ABC) that replace distances among summaries with discrepancies between the empirical distributions of the observed data and the synthetic samples generated under the proposed parameter values. The success of these strategies has motivated theoretical studies on the limiting properties of the induced posteriors. However, there is still no theoretical framework for summary-free ABC that (i) is unified, instead of discrepancy-specific, (ii) does not restrict the analysis to data-generating processes and statistical models meeting specific regularity conditions, but rather facilitates the derivation of limiting properties that hold uniformly, and (iii) relies on verifiable assumptions that provide explicit concentration bounds clarifying which factors govern the limiting behavior of the ABC posterior. We address this gap via a novel theoretical framework that introduces the concept of Rademacher complexity in the analysis of the limiting properties of discrepancy-based ABC posteriors, including in non-i.i.d. and misspecified settings. This yields a unified theory that relies on constructive arguments and provides more informative asymptotic results and uniform concentration bounds, even in settings not covered by current studies. These advancements are obtained by relating the asymptotic properties of summary-free ABC posteriors to the behavior of the Rademacher complexity associated with the chosen discrepancy in the family of integral probability semimetrics (IPS). The IPS class extends summary-based distances and includes, among others, the Wasserstein distance and the maximum mean discrepancy. As clarified in specialized theoretical analyses of popular IPS discrepancies and via illustrative simulations, this perspective improves the understanding of summary-free ABC.
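To make the summary-free idea concrete, the following is a minimal sketch of rejection ABC in which a proposed parameter is kept when the discrepancy between the observed sample and a synthetic sample falls below a tolerance. It is an illustration under stated assumptions, not the authors' implementation: the Gaussian location model, the prior, the tolerance `eps`, the kernel bandwidth, and the names `mmd2` and `rejection_abc` are all hypothetical choices made for this example. The discrepancy used here is a biased (V-statistic) estimate of the squared maximum mean discrepancy with a Gaussian kernel, one member of the IPS family.

```python
import numpy as np


def mmd2(x, y, bandwidth=1.0):
    """Biased (V-statistic) estimate of squared MMD between two 1-D samples.

    Uses a Gaussian kernel; the bandwidth is an illustrative assumption.
    """
    def gram(a, b):
        # Pairwise Gaussian kernel matrix via broadcasting.
        return np.exp(-((a[:, None] - b[None, :]) ** 2) / (2.0 * bandwidth**2))

    return gram(x, x).mean() + gram(y, y).mean() - 2.0 * gram(x, y).mean()


def rejection_abc(y_obs, prior_sampler, simulator, discrepancy, eps, n_draws, rng):
    """Keep prior draws whose synthetic data lie within eps of the observed data."""
    accepted = []
    for _ in range(n_draws):
        theta = prior_sampler(rng)               # propose a parameter from the prior
        z = simulator(theta, len(y_obs), rng)    # synthetic sample under theta
        if discrepancy(y_obs, z) <= eps:         # compare empirical distributions
            accepted.append(theta)
    return np.array(accepted)


# Hypothetical toy setting: Gaussian location model with true mean 1.
rng = np.random.default_rng(0)
y_obs = rng.normal(1.0, 1.0, size=100)

samples = rejection_abc(
    y_obs,
    prior_sampler=lambda r: r.normal(0.0, 3.0),
    simulator=lambda theta, m, r: r.normal(theta, 1.0, size=m),
    discrepancy=mmd2,
    eps=0.05,      # tolerance on the squared MMD (assumed value)
    n_draws=2000,
    rng=rng,
)
print(len(samples), samples.mean())
```

Because `discrepancy` is passed as an argument, another IPS discrepancy (for example a Wasserstein-1 estimate) could be swapped in without changing the sampler, which mirrors the unified treatment of discrepancies described in the abstract.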
