Privacy-Aware Randomized Quantization via Linear Programming
Zhongteng Cai, Xueru Zhang, Mohammad Mahdi Khalili
TL;DR
This work addresses the challenge of achieving differential privacy with discrete outputs by introducing a family of unbiased discrete quantization mechanisms and formulating an optimization framework. The core idea is to quantize a scalar input into a finite set of symmetric bins via two-bin sampling around the input, with output probabilities tuned to minimize mean absolute error under a DP budget. The Exponential Randomized Mechanism (ERM) is a special case, and the optimal randomized quantization mechanism (OPTM) is found by solving a linear program that upper-bounds MAE and imposes linearized DP constraints. Empirical results on scalar and vector inputs, as well as DP-SGD tasks, show that OPTM generally achieves a superior privacy-accuracy trade-off compared to baselines like MVU and RQM, with ERM offering competitive performance. The framework also enables extensions to high-dimensional, biased, and dynamic quantization, making discrete DP outputs more practical for bandwidth-limited and secure-aggregation settings.
Abstract
Differential privacy mechanisms such as the Gaussian or Laplace mechanism have been widely used in data analytics for preserving individual privacy. However, they are mostly designed for continuous outputs and are unsuitable for scenarios where discrete values are necessary. Although various quantization mechanisms were proposed recently to generate discrete outputs under differential privacy, the outcomes are either biased or have an inferior accuracy-privacy trade-off. In this paper, we propose a family of quantization mechanisms that is unbiased and differentially private. It has a high degree of freedom and we show that some existing mechanisms can be considered as special cases of ours. To find the optimal mechanism, we formulate a linear optimization that can be solved efficiently using linear programming tools. Experiments show that our proposed mechanism can attain a better privacy-accuracy trade-off compared to baselines.
