Mean estimation in the add-remove model of differential privacy
Alex Kulesza, Ananda Theertha Suresh, Yuyan Wang
TL;DR
The paper addresses the problem of estimating the mean of a bounded real-valued dataset under the add-remove model of differential privacy, and shows that the add-remove and swap models yield essentially the same minimum achievable mean-squared error up to o(1). It introduces two technical innovations—a transformed noise approach that optimizes high-privacy performance and an hourglass mechanism that provides optimal low-privacy behavior—establishing an exact, min-max optimal algorithm for all privacy budgets. The hourglass mechanism achieves the univariate staircase marginals in two dimensions, enabling efficient sampling and rigorously matching the swap-model constant via an information-theoretic lower bound. Overall, the work closes the gap between the add-remove and swap models for mean estimation, with practical performance gains demonstrated by experiments across privacy regimes.
Abstract
Differential privacy is often studied under two different models of neighboring datasets: the add-remove model and the swap model. While the swap model is frequently used in the academic literature to simplify analysis, many practical applications rely on the more conservative add-remove model, where obtaining tight results can be difficult. Here, we study the problem of one-dimensional mean estimation under the add-remove model. We propose a new algorithm and show that it is min-max optimal, achieving the best possible constant in the leading term of the mean squared error for all $ε$, and that this constant is the same as the optimal algorithm under the swap model. These results show that the add-remove and swap models give nearly identical errors for mean estimation, even though the add-remove model cannot treat the size of the dataset as public information. We also demonstrate empirically that our proposed algorithm yields at least a factor of two improvement in mean squared error over algorithms frequently used in practice. One of our main technical contributions is a new hour-glass mechanism, which might be of independent interest in other scenarios.
