Information-Theoretic Decentralized Secure Aggregation with Collusion Resilience
Xiang Zhang, Zhou Li, Shuangyang Li, Kai Wan, Derrick Wing Kwan Ng, Giuseppe Caire
TL;DR
This work studies decentralized secure aggregation (DSA) in a fully connected $K$-user network from an information-theoretic viewpoint, where each user holds a private input $W_k$ and a secret key $Z_k$ derived from a source key $Z_{\Sigma}$. The goal is to compute the sum $\sum_k W_k$ securely under collusion up to $T$ users, with per-user communication rate $R_X$, individual key rate $R_Z$, and source key rate $R_{Z_{\Sigma}}$. The authors establish a complete rate region: it is empty for $T\ge K-2$, and for feasible cases ($T\le K-3$) the optimal region is $R_X\ge1$, $R_Z\ge1$, and $R_{Z_{\Sigma}}\ge K-1$, achievable by a linear scheme using a zero-sum, linearly dependent key structure. The results reveal fundamental limits and show that dual roles of the keys (encryption and decryption aid) enable minimal communication and key usage, with potential extensions to non-fully-connected topologies and groupwise-key designs. The analysis provides a foundation for designing provably secure and communication-efficient distributed learning protocols in decentralized settings.
Abstract
In decentralized federated learning (FL), multiple clients collaboratively learn a shared machine learning (ML) model by leveraging their privately held datasets distributed across the network, through interactive exchange of the intermediate model updates. To ensure data security, cryptographic techniques are commonly employed to protect model updates during aggregation. Despite growing interest in secure aggregation, existing works predominantly focus on protocol design and computational guarantees, with limited understanding of the fundamental information-theoretic limits of such systems. Moreover, optimal bounds on communication and key usage remain unknown in decentralized settings, where no central aggregator is available. Motivated by these gaps, we study the problem of decentralized secure aggregation (DSA) from an information-theoretic perspective. Specifically, we consider a network of $K$ fully-connected users, each holding a private input -- an abstraction of local training data -- who aim to securely compute the sum of all inputs. The security constraint requires that no user learns anything beyond the input sum, even when colluding with up to $T$ other users. We characterize the optimal rate region, which specifies the minimum achievable communication and secret key rates for DSA. In particular, we show that to securely compute one symbol of the desired input sum, each user must (i) transmit at least one symbol to others, (ii) hold at least one symbol of secret key, and (iii) all users must collectively hold no fewer than $K - 1$ independent key symbols. Our results establish the fundamental performance limits of DSA, providing insights for the design of provably secure and communication-efficient protocols in distributed learning systems.
