Privacy Preserving Semi-Decentralized Mean Estimation over Intermittently-Connected Networks
Rajarshi Saha, Mohamed Seif, Michal Yemini, Andrea J. Goldsmith, H. Vincent Poor
TL;DR
This work addresses privately estimating the mean of distributed vectors over intermittently connected networks. It introduces PriCER, a two-stage Private Collaborative Estimation via Relaying scheme that uses neighbor relaying and Gaussian perturbations to achieve differential privacy while mitigating link outages. The paper develops a detailed error analysis separating topology-induced variance and privacy-induced variance, and it optimizes relaying weights and noise variances under peer-to-peer privacy constraints, including closed-form solutions for Erdős-Rényi topologies. It also provides comprehensive privacy analyses at local, relay, and server levels, and demonstrates the approach via numerical experiments on distributed K-means and mean estimation scenarios. The results show PriCER can achieve favorable utility-privacy Tradeoffs in intermittently connected networks and is compatible with over-the-air aggregation techniques, offering practical improvements for privacy-preserving distributed estimation.
Abstract
We consider the problem of privately estimating the mean of vectors distributed across different nodes of an unreliable wireless network, where communications between nodes can fail intermittently. We adopt a semi-decentralized setup, wherein to mitigate the impact of intermittently connected links, nodes can collaborate with their neighbors to compute a local consensus, which they relay to a central server. In such a setting, the communications between any pair of nodes must ensure that the privacy of the nodes is rigorously maintained to prevent unauthorized information leakage. We study the tradeoff between collaborative relaying and privacy leakage due to the data sharing among nodes and, subsequently, propose PriCER: Private Collaborative Estimation via Relaying -- a differentially private collaborative algorithm for mean estimation to optimize this tradeoff. The privacy guarantees of PriCER arise (i) implicitly, by exploiting the inherent stochasticity of the flaky network connections, and (ii) explicitly, by adding Gaussian perturbations to the estimates exchanged by the nodes. Local and central privacy guarantees are provided against eavesdroppers who can observe different signals, such as the communications amongst nodes during local consensus and (possibly multiple) transmissions from the relays to the central server. We substantiate our theoretical findings with numerical simulations. Our implementation is available at https://github.com/rajarshisaha95/private-collaborative-relaying.
