Generalization Certificates for Adversarially Robust Bayesian Linear Regression
Mahalakshmi Sabanayagam, Russell Tsuchida, Cheng Soon Ong, Debarghya Ghoshdastidar
TL;DR
The paper tackles adversarial perturbations in probabilistic inference by defining adversarially robust posteriors via an adversarially perturbed negative log-likelihood within a generalized Bayesian framework. It shows how, for exponential-family models, the adversarial NLL admits closed-form expressions, enabling a tractable robust posterior $q_\delta(\theta)$. Leveraging the PAC-Bayesian framework, the authors derive the first rigorous generalization certificates for both standard and adversarial posteriors in Bayesian linear regression, with extensions to robust settings and explicit bound terms. Experiments on real and synthetic data corroborate the theoretical claims, demonstrating enhanced adversarial robustness of $q_\delta$ over the standard Bayes posterior and validating the proposed certificates as meaningful predictors of generalization performance in adversarial regimes.
Abstract
Adversarial robustness of machine learning models is critical to ensuring reliable performance under data perturbations. Recent progress has been on point estimators, and this paper considers distributional predictors. First, using the link between exponential families and Bregman divergences, we formulate an adversarial Bregman divergence loss as an adversarial negative log-likelihood. Using the geometric properties of Bregman divergences, we compute the adversarial perturbation for such models in closed-form. Second, under such losses, we introduce \emph{adversarially robust posteriors}, by exploiting the optimization-centric view of generalized Bayesian inference. Third, we derive the \emph{first} rigorous generalization certificates in the context of an adversarial extension of Bayesian linear regression by leveraging the PAC-Bayesian framework. Finally, experiments on real and synthetic datasets demonstrate the superior robustness of the derived adversarially robust posterior over Bayes posterior, and also validate our theoretical guarantees.
