Federated Communication-Efficient Multi-Objective Optimization
Baris Askin, Pranay Sharma, Gauri Joshi, Carlee Joe-Wong
TL;DR
The paper tackles the challenge of federated multi-objective optimization (FMOO) where data are distributed across clients and multiple objective functions must be optimized jointly. It introduces FedCMOO, a communication-efficient FMOO algorithm that uses a single aggregated gradient per client, and a Gram-matrix-based weight computation that decouples communication from the number of objectives $M$. A convergence analysis shows that FedCMOO achieves Pareto-stationary convergence under mild smoothness and stochastic gradient assumptions, with improved dependence on $M$ compared to prior works; a variant, FedCMOO-Pref, enforces user-specified trade-offs via KL-divergence-based weighting. Empirically, FedCMOO and FedCMOO-Pref outperform baselines like FSMGDA and scalarized approaches across diverse datasets, while offering significantly reduced communication costs. These results demonstrate scalable, preference-aware FMOO with practical impact for federated settings such as healthcare and recommendation systems.
Abstract
We study a federated version of multi-objective optimization (MOO), where a single model is trained to optimize multiple objective functions. MOO has been extensively studied in the centralized setting but is less explored in federated or distributed settings. We propose FedCMOO, a novel communication-efficient federated multi-objective optimization (FMOO) algorithm that improves the error convergence performance of the model compared to existing approaches. Unlike prior works, the communication cost of FedCMOO does not scale with the number of objectives, as each client sends a single aggregated gradient to the central server. We provide a convergence analysis of the proposed method for smooth and non-convex objective functions under milder assumptions than in prior work. In addition, we introduce a variant of FedCMOO that allows users to specify a preference over the objectives in terms of a desired ratio of the final objective values. Through extensive experiments, we demonstrate the superiority of our proposed method over baseline approaches.
