Noise-Aware Differentially Private Variational Inference
Talal Alrawajfeh, Joonas Jälkö, Antti Honkela
TL;DR
This work addresses uncertainty quantification for Bayesian inference under differential privacy by introducing Noise-Aware DP Variational Inference (NA-DPVI). NA-DPVI integrates DP-induced gradient noise into a post-processing gradient-trace model, inferring latent quantities such as the Hessian and optimal parameters to yield a noise-aware posterior $\widetilde{p}(\boldsymbol{\theta}|\mathcal{T})$. The authors provide theoretical justification (Theorem 1 and related analysis) and validate the method across high-dimensional and non-conjugate settings, including 10D Bayesian linear regression and UCI Adult Bayesian logistic regression, demonstrating improved coverage and calibrated predictive distributions compared to baselines. The approach broadens the applicability of noise-aware inference beyond simple models and offers a principled framework for uncertainty quantification in privacy-preserving Bayesian analyses, though it remains tied to VI quality and practical privacy-utility considerations such as hyperparameter privacy leakage.
Abstract
Differential privacy (DP) provides robust privacy guarantees for statistical inference, but this can lead to unreliable results and biases in downstream applications. While several noise-aware approaches have been proposed which integrate DP perturbation into the inference, they are limited to specific types of simple probabilistic models. In this work, we propose a novel method for noise-aware approximate Bayesian inference based on stochastic gradient variational inference which can also be applied to high-dimensional and non-conjugate models. We also propose a more accurate evaluation method for noise-aware posteriors. Empirically, our inference method has similar performance to existing methods in the domain where they are applicable. Outside this domain, we obtain accurate coverages on high-dimensional Bayesian linear regression and well-calibrated predictive probabilities on Bayesian logistic regression with the UCI Adult dataset.
