Communication-Efficient and Differentially Private Vertical Federated Learning with Zeroth-Order Optimization
Jianing Zhang, Evan Chen, Dong-Jun Han, Chaoyue Liu, Christopher G. Brinton
TL;DR
This paper tackles privacy and communication bottlenecks in vertical federated learning (VFL) by introducing DPZV, a differential privacy mechanism for zeroth-order VFL that injects calibrated scalar noise on the backward path. By using a two-point zeroth-order gradient estimator and MeZO for memory efficiency, DPZV achieves tunable $(ε,δ)$-DP while maintaining fast convergence, comparable to first-order DP-SGD, under asynchronous updates. Theoretical results establish convergence and DP guarantees, and extensive experiments on four datasets demonstrate superior privacy-utility tradeoffs and reduced communication rounds, highlighting practical impact for privacy-critical, bandwidth-constrained deployments. Overall, DPZV enables efficient, privacy-preserving VFL with adjustable privacy levels and strong empirical performance, paving the way for scalable, secure distributed learning in feature-partitioned environments.
Abstract
Vertical Federated Learning (VFL) enables collaborative model training across feature-partitioned devices, yet its reliance on device-server information exchange introduces significant communication overhead and privacy risks. Downlink communication from the server to devices in VFL exposes gradient-related signals of the global loss that can be leveraged in inference attacks. Existing privacy-preserving VFL approaches that inject differential privacy (DP) noise on the downlink have the natural repercussion of degraded gradient quality, slowed convergence, and excessive communication rounds. In this work, we propose DPZV, a communication-efficient and differentially private ZO-VFL framework with tunable privacy guarantees. Based on zeroth-order (ZO) optimization, DPZV injects calibrated scalar-valued DP noise on the downlink, significantly reducing variance amplification while providing equivalent protection against targeted inference attacks. Through rigorous theoretical analysis, we establish convergence guarantees comparable to first-order DP-SGD, despite relying solely on ZO estimators, and prove that DPZV satisfies $(ε, δ)$-DP. Extensive experiments demonstrate that DPZV consistently achieves a superior privacy-utility tradeoff and requires fewer communication rounds than existing DP-VFL baselines under strict privacy constraints ($ε\leq 10$).
