Towards Realistic Mechanisms That Incentivize Federated Participation and Contribution
Marco Bornstein, Amrit Singh Bedi, Anit Kumar Sahu, Furqan Khan, Furong Huang
TL;DR
This work tackles the lack of participation incentives and the free-rider problem in cross-device federated learning by designing RealFM, a mechanism that models device utility with a non-linear payoff $\phi_i(a)$ and data costs, while providing model-accuracy rewards $a^r$ and monetary rewards $R$. It extends the FL formulation to heterogeneous, non-sharing data and introduces an accuracy-shaping function $\gamma_i(m)$ to incentivize data contributions beyond local optima, funded by a dynamic marginal reward $r(\bm{m})$ and a server-retained share $p_m$. The authors prove feasibility, IR, and the existence of a unique Nash equilibrium under their mechanism, effectively eliminating free-riding and increasing data contributions. Empirical results on CIFAR-10 and MNIST show real-world performance gains, with server and device utilities improving by orders of magnitude compared to baselines across varied data distributions and costs, underscoring the practicality of realistic incentive designs in FL.
Abstract
Edge device participation in federating learning (FL) is typically studied through the lens of device-server communication (e.g., device dropout) and assumes an undying desire from edge devices to participate in FL. As a result, current FL frameworks are flawed when implemented in realistic settings, with many encountering the free-rider dilemma. In a step to push FL towards realistic settings, we propose RealFM: the first federated mechanism that (1) realistically models device utility, (2) incentivizes data contribution and device participation, (3) provably removes the free-rider dilemma, and (4) relaxes assumptions on data homogeneity and data sharing. Compared to previous FL mechanisms, RealFM allows for a non-linear relationship between model accuracy and utility, which improves the utility gained by the server and participating devices. On real-world data, RealFM improves device and server utility, as well as data contribution, by over 3 and 4 magnitudes respectively compared to baselines.
