Incentivizing Truthful Collaboration in Heterogeneous Federated Learning
Dimitar Chakarov, Nikita Tsoy, Kristian Minchev, Nikola Konstantinov
TL;DR
This work addresses incentive issues in heterogeneous federated learning by modeling client behavior as a game and introducing a budget-balanced payment mechanism. It proves that the FedSGD protocol with the proposed payments is $\varepsilon$-Bayesian Incentive Compatible and yields $\varepsilon$-approximately truthful reporting, while maintaining convergence under $m$-strongly convex and $H$-smooth objectives. Theoretical results provide explicit bounds on payments and convergence rates as functions of heterogeneity parameters and learning dynamics, elucidating the trade-offs between heterogeneity, payments, and efficiency. Empirically, the mechanism consistently deters gradient manipulation across FedSGD, median-based FedSGD, and FedAvg on FeMNIST, Shakespeare, and Twitter datasets, demonstrating both robustness and adaptability to different FL paradigms and non-convex tasks. Overall, the paper contributes a principled mechanism-design approach to securing FL against manipulation arising from heterogeneity, with practical implications for reliable, scalable collaborative learning in distributed settings.
Abstract
Federated learning (FL) is a distributed collaborative learning method, where multiple clients learn together by sharing gradient updates instead of raw data. However, it is well-known that FL is vulnerable to manipulated updates from clients. In this work we study the impact of data heterogeneity on clients' incentives to manipulate their updates. First, we present heterogeneous collaborative learning scenarios where a client can modify their updates to be better off, and show that these manipulations can lead to diminishing model performance. To prevent such modifications, we formulate a game in which clients may misreport their gradient updates in order to "steer" the server model to their advantage. We develop a payment rule that provably disincentivizes sending modified updates under the FedSGD protocol. We derive explicit bounds on the clients' payments and the convergence rate of the global model, which allows us to study the trade-off between heterogeneity, payments and convergence. Finally, we provide an experimental evaluation of the effectiveness of our payment rule in the FedSGD, median-based aggregation FedSGD and FedAvg protocols on three tasks in computer vision and natural language processing. In all cases we find that our scheme successfully disincentivizes modifications.
