Tukey Depth Mechanisms for Practical Private Mean Estimation
Gavin Brown, Lydia Zakynthinou
TL;DR
This work addresses private mean estimation for multivariate Gaussian data by bringing theoretically optimal Tukey-depth-based methods into practice. It develops practical implementations of the Restricted Tukey Depth Mechanism (REM) and BoxEM, including exact and approximate Tukey depths and random-direction variants, to achieve robust, affine-invariant accuracy in small-sample, low-dimensional regimes. The authors provide detailed algorithms for sampling from the exponential mechanism, PTR-based privacy checks, and both exact and approximate volume computations, along with empirical results showing superior performance to Gaussian and CoinPress baselines under privacy constraints. They also outline a roadmap for extending to higher dimensions via PAC-volume estimation and approximate sampling, highlighting the trade-offs between computation and accuracy and the need for provably practical volume-estimation techniques. Overall, the paper presents a practical, robust toolkit for private multivariate mean estimation and charts a path toward scalable private analytics in moderate dimensions.
Abstract
Mean estimation is a fundamental task in statistics and a focus within differentially private statistical estimation. While univariate methods based on the Gaussian mechanism are widely used in practice, more advanced techniques such as the exponential mechanism over quantiles offer robustness and improved performance, especially for small sample sizes. Tukey depth mechanisms carry these advantages to multivariate data, providing similar strong theoretical guarantees. However, practical implementations fall behind these theoretical developments. In this work, we take the first step to bridge this gap by implementing the (Restricted) Tukey Depth Mechanism, a theoretically optimal mean estimator for multivariate Gaussian distributions, yielding improved practical methods for private mean estimation. Our implementations enable the use of these mechanisms for small sample sizes or low-dimensional data. Additionally, we implement variants of these mechanisms that use approximate versions of Tukey depth, trading off accuracy for faster computation. We demonstrate their efficiency in practice, showing that they are viable options for modest dimensions. Given their strong accuracy and robustness guarantees, we contend that they are competitive approaches for mean estimation in this regime. We explore future directions for improving the computational efficiency of these algorithms by leveraging fast polytope volume approximation techniques, paving the way for more accurate private mean estimation in higher dimensions.
