Table of Contents
Fetching ...

Computation-Utility-Privacy Tradeoffs in Bayesian Estimation

Sitan Chen, Jingqiu Ding, Mahbod Majid, Walter McKelvie

Abstract

Bayesian methods lie at the heart of modern data science and provide a powerful scaffolding for estimation in data-constrained settings and principled quantification and propagation of uncertainty. Yet in many real-world use cases where these methods are deployed, there is a natural need to preserve the privacy of the individuals whose data is being scrutinized. While a number of works have attempted to approach the problem of differentially private Bayesian estimation through either reasoning about the inherent privacy of the posterior distribution or privatizing off-the-shelf Bayesian methods, these works generally do not come with rigorous utility guarantees beyond low-dimensional settings. In fact, even for the prototypical tasks of Gaussian mean estimation and linear regression, it was unknown how close one could get to the Bayes-optimal error with a private algorithm, even in the simplest case where the unknown parameter comes from a Gaussian prior. In this work, we give the first efficient algorithms for both of these problems that achieve mean-squared error $(1+o(1))\mathrm{OPT}$ and additionally show that both tasks exhibit an intriguing computational-statistical gap. For Bayesian mean estimation, we prove that the excess risk achieved by our method is optimal among all efficient algorithms within the low-degree framework, yet is provably worse than what is achievable by an exponential-time algorithm. For linear regression, we prove a qualitatively similar lower bound. Our algorithms draw upon the privacy-to-robustness framework of arXiv:2212.05015, but with the curious twist that to achieve private Bayes-optimal estimation, we need to design sum-of-squares-based robust estimators for inherently non-robust objects like the empirical mean and OLS estimator. Along the way we also add to the sum-of-squares toolkit a new kind of constraint based on short-flat decompositions.

Computation-Utility-Privacy Tradeoffs in Bayesian Estimation

Abstract

Bayesian methods lie at the heart of modern data science and provide a powerful scaffolding for estimation in data-constrained settings and principled quantification and propagation of uncertainty. Yet in many real-world use cases where these methods are deployed, there is a natural need to preserve the privacy of the individuals whose data is being scrutinized. While a number of works have attempted to approach the problem of differentially private Bayesian estimation through either reasoning about the inherent privacy of the posterior distribution or privatizing off-the-shelf Bayesian methods, these works generally do not come with rigorous utility guarantees beyond low-dimensional settings. In fact, even for the prototypical tasks of Gaussian mean estimation and linear regression, it was unknown how close one could get to the Bayes-optimal error with a private algorithm, even in the simplest case where the unknown parameter comes from a Gaussian prior. In this work, we give the first efficient algorithms for both of these problems that achieve mean-squared error and additionally show that both tasks exhibit an intriguing computational-statistical gap. For Bayesian mean estimation, we prove that the excess risk achieved by our method is optimal among all efficient algorithms within the low-degree framework, yet is provably worse than what is achievable by an exponential-time algorithm. For linear regression, we prove a qualitatively similar lower bound. Our algorithms draw upon the privacy-to-robustness framework of arXiv:2212.05015, but with the curious twist that to achieve private Bayes-optimal estimation, we need to design sum-of-squares-based robust estimators for inherently non-robust objects like the empirical mean and OLS estimator. Along the way we also add to the sum-of-squares toolkit a new kind of constraint based on short-flat decompositions.
Paper Structure (1 section, 5 theorems, 6 equations)

This paper contains 1 section, 5 theorems, 6 equations.

Table of Contents

  1. Introduction

Key Result

Theorem 1.1

Given $\mu$ drawn from prior $\pi = \mathcal{N}(0,\Sigma)$, if the data $x$ consists of $n \geqslant \tilde{\Omega}(\frac{d}{\varepsilon}\log(\sqrt{\mathop{\mathrm{Tr}}\nolimits(\Sigma)}/\alpha))$ i.i.d. samples from $\mathcal{N}(\mu,\textup{Id})$, there is where $\Lambda \coloneqq (\textup{Id} + \frac{1}{n}\Sigma^{-1})^{-1}$ and $\lVert\Lambda\rVert_{4/3}$ denotes that Schatten $4/3$-norm of $\L

Theorems & Definitions (5)

  • Theorem 1.1: Upper bounds for private Bayesian mean estimation
  • Theorem 1.2: Frequentist mean estimation with optimal constants
  • Theorem 1.3: Informal, see \ref{['thm:comp-lower-bound_mean']}
  • Theorem 1.4: Informal, see \ref{['lem:exponential-bayesian-error-regression']} and \ref{['thm:bayesian-error-regression']}
  • Theorem 1.5: Frequentist linear regression with optimal constants