Free-Rider and Conflict Aware Collaboration Formation for Cross-Silo Federated Learning
Mengmeng Chen, Xiaohu Wu, Xiaoli Tang, Tiantian He, Yew-Soon Ong, Qiqi Liu, Qicheng Lao, Han Yu
TL;DR
This work tackles cross-silo Federated Learning under dual forces of self-interest and inter-organization competition. It introduces a graph-theoretic framework with a benefit graph, a competition graph, and a data-usage graph to encode data complementarity, rivalry, and data sharing constraints, and formalizes two principles to avoid free-riding and conflicts of interest. The FedEgoists algorithm blends Bron–Kerbosch clique detection, Tarjan SCC decomposition, and a cycle/path/node-merging procedure to produce a coalition structure that is both stable and coalition-proof, with complexity dominated by the Bron–Kerbosch step. The approach demonstrates substantial performance gains over nine baselines on CIFAR-10/100 and a real-world eICU dataset, while preserving feasible collaboration topologies in cross-silo FL. Overall, the method offers a scalable, provably optimal way to form cross-silo FL coalitions that align with participants’ incentives and competitive dynamics, enabling more effective and trustworthy federated collaborations.
Abstract
Federated learning (FL) is a machine learning paradigm that allows multiple FL participants (FL-PTs) to collaborate on training models without sharing private data. Due to data heterogeneity, negative transfer may occur in the FL training process. This necessitates FL-PT selection based on their data complementarity. In cross-silo FL, organizations that engage in business activities are key sources of FL-PTs. The resulting FL ecosystem has two features: (i) self-interest, and (ii) competition among FL-PTs. This requires the desirable FL-PT selection strategy to simultaneously mitigate the problems of free riders and conflicts of interest among competitors. To this end, we propose an optimal FL collaboration formation strategy -- FedEgoists -- which ensures that: (1) a FL-PT can benefit from FL if and only if it benefits the FL ecosystem, and (2) a FL-PT will not contribute to its competitors or their supporters. It provides an efficient clustering solution to group FL-PTs into coalitions, ensuring that within each coalition, FL-PTs share the same interest. We theoretically prove that the FL-PT coalitions formed are optimal since no coalitions can collaborate together to improve the utility of any of their members. Extensive experiments on widely adopted benchmark datasets demonstrate the effectiveness of FedEgoists compared to nine state-of-the-art baseline methods, and its ability to establish efficient collaborative networks in cross-silos FL with FL-PTs that engage in business activities.
