Majority Vote for Distributed Differentially Private Sign Selection
Weidong Liu, Jiyuan Tu, Xiaojun Mao, Xi Chen
TL;DR
This work addresses sign recovery in a distributed data setting under distributed group differential privacy ($(\epsilon,\delta)$-DGDP) by introducing DPVote, a general Majority Vote framework that uses a peeling step and the exponential mechanism to privately aggregate coordinate-wise sign estimates across machines. The method is applied to sparse mean estimation and sparse linear regression, achieving sign-consistency with minimax-optimal signal strength on the order of $O(\sqrt{\log p / N})$ while preserving DGDP and maintaining low communication costs. Theoretical guarantees are complemented by simulations showing competitive performance relative to non-private baselines and superiority over existing private approaches, with robust behavior across varying numbers of machines and privacy budgets. Overall, DPVote provides a principled, scalable approach to private distributed sign selection and has potential extensions to other sparse estimation problems and secure multi-party implementations.
Abstract
Privacy-preserving data analysis has become more prevalent in recent years. In this study, we propose a distributed group differentially private Majority Vote mechanism, for the sign selection problem in a distributed setup. To achieve this, we apply the iterative peeling to the stability function and use the exponential mechanism to recover the signs. For enhanced applicability, we study the private sign selection for mean estimation and linear regression problems, in distributed systems. Our method recovers the support and signs with the optimal signal-to-noise ratio as in the non-private scenario, which is better than contemporary works of private variable selections. Moreover, the sign selection consistency is justified by theoretical guarantees. Simulation studies are conducted to demonstrate the effectiveness of the proposed method.
