ProFL: Performative Robust Optimal Federated Learning
Xue Zheng, Tian Xie, Xuwei Tan, Aylin Yener, Xueru Zhang
TL;DR
This work tackles performative distribution shifts in federated learning by targeting the performative optimal point (PO) under data contamination. It introduces ProFL, an algorithm that estimates the full performative gradient, combining robust local gradient estimation with a (possibly server-side) Jacobian estimate of how data shifts respond to model changes. The authors prove convergence under the Polyak-Lojasiewicz condition and demonstrate robustness to contamination, outperforming prior work that only reaches the performative stable point (PS) or relies on convex/noisy assumptions. The approach is validated on synthetic and real-features case studies, showing improved PO convergence, resilience to outliers, and reduced communication through adaptive sampling. Overall, ProFL advances practical, robust, distributed optimization under endogenous distribution shifts in FL settings.
Abstract
Performative prediction is a framework that captures distribution shifts that occur during the training of machine learning models due to their deployment. As the trained model is used, data generation causes the model to evolve, leading to deviations from the original data distribution. The impact of such model-induced distribution shifts in federated learning is increasingly likely to transpire in real-life use cases. A recently proposed approach extends performative prediction to federated learning with the resulting model converging to a performative stable point, which may be far from the performative optimal point. Earlier research in centralized settings has shown that the performative optimal point can be achieved under model-induced distribution shifts, but these approaches require the performative risk to be convex and the training data to be noiseless, assumptions often violated in realistic federated learning systems. This paper overcomes all of these shortcomings and proposes Performative Robust Optimal Federated Learning, an algorithm that finds performative optimal points in federated learning from noisy and contaminated data. We present the convergence analysis under the Polyak-Lojasiewicz condition, which applies to non-convex objectives. Extensive experiments on multiple datasets demonstrate the advantage of Robust Optimal Federated Learning over the state-of-the-art.
