Differentially Private Distributed Inference
Marios Papachristou, M. Amin Rahimian
TL;DR
The paper tackles privacy-preserving distributed inference in networks of agents by marrying differential privacy with non-Bayesian log-linear belief updates over discrete state spaces. It develops two main problem settings—distributed MLE with a finite signal set and online learning from intermittent streams—and proposes three DP algorithms: AM, GM, and a two-threshold method, all supported by nonasymptotic guarantees and rigorous privacy analysis via the Laplace mechanism. Theoretical results quantify the privacy-utility-cost tradeoffs, including error bounds and communication complexity that scale with the DP budget $\varepsilon$, network properties, and signal statistics. Empirical validation on real multicenter clinical trial data (ACTG and cancer datasets) shows privacy-preserving distributed inference with substantially faster runtimes than homomorphic-encryption approaches and competitive accuracy compared to first-order DP methods, highlighting practical applicability for privacy-aware, multicenter survival analysis. Overall, the work provides a principled framework for privacy-conscious collaboration in healthcare and related domains, enabling scalable, provably private distributed decision-making with concrete guidance on design choices and tradeoffs.
Abstract
How can agents exchange information to learn while protecting privacy? Healthcare centers collaborating on clinical trials must balance knowledge sharing with safeguarding sensitive patient data. We address this challenge by using differential privacy (DP) to control information leakage. Agents update belief statistics via log-linear rules, and DP noise provides plausible deniability and rigorous performance guarantees. We study two settings: distributed maximum likelihood estimation (MLE) with a finite set of private signals and online learning from an intermittent signal stream. Noisy aggregation introduces trade-offs between rejecting low-quality states and accepting high-quality ones. The MLE setting naturally applies to hypothesis testing with formal statistical guarantees. Through simulations, we demonstrate differentially private, distributed survival analysis on real-world clinical trial data, evaluating treatment efficacy and the impact of biomedical indices on patient survival. Our methods enable privacy-preserving inference with greater efficiency and lower error rates than homomorphic encryption and first-order DP optimization approaches.
