Decomposable Submodular Maximization in Federated Setting

Akbar Rafiey

Decomposable Submodular Maximization in Federated Setting

Akbar Rafiey

TL;DR

This work tackles federated maximization of decomposable submodular functions under privacy and communication constraints. It extends the continuous greedy paradigm to a federated setting (FedCG), proving $(1-1/e)$-approximation guarantees with small additive errors under both full and partial participation, and introduces scalable client-sampling techniques to handle massive client counts. It further develops a discrete federated approach (FedDiscrete Greedy) with provable approximation guarantees for general matroids and uniform matroids, and demonstrates applicability to fundamental problems like Facility Location and Maximum Coverage. The results provide a framework for privacy-preserving, scalable optimization of submodular objectives across distributed data sources, with concrete guidance on communication efficiency and gradient estimation.

Abstract

Submodular functions, as well as the sub-class of decomposable submodular functions, and their optimization appear in a wide range of applications in machine learning, recommendation systems, and welfare maximization. However, optimization of decomposable submodular functions with millions of component functions is computationally prohibitive. Furthermore, the component functions may be private (they might represent user preference function, for example) and cannot be widely shared. To address these issues, we propose a {\em federated optimization} setting for decomposable submodular optimization. In this setting, clients have their own preference functions, and a weighted sum of these preferences needs to be maximized. We implement the popular {\em continuous greedy} algorithm in this setting where clients take parallel small local steps towards the local solution and then the local changes are aggregated at a central server. To address the large number of clients, the aggregation is performed only on a subsampled set. Further, the aggregation is performed only intermittently between stretches of parallel local steps, which reduces communication cost significantly. We show that our federated algorithm is guaranteed to provide a good approximate solution, even in the presence of above cost-cutting measures. Finally, we show how the federated setting can be incorporated in solving fundamental discrete submodular optimization problems such as Maximum Coverage and Facility Location.

Decomposable Submodular Maximization in Federated Setting

TL;DR

-approximation guarantees with small additive errors under both full and partial participation, and introduces scalable client-sampling techniques to handle massive client counts. It further develops a discrete federated approach (FedDiscrete Greedy) with provable approximation guarantees for general matroids and uniform matroids, and demonstrates applicability to fundamental problems like Facility Location and Maximum Coverage. The results provide a framework for privacy-preserving, scalable optimization of submodular objectives across distributed data sources, with concrete guidance on communication efficiency and gradient estimation.

Abstract

Paper Structure (32 sections, 17 theorems, 93 equations, 5 algorithms)

This paper contains 32 sections, 17 theorems, 93 equations, 5 algorithms.

Introduction
Related Work
FedAvg, its convergence, and assumptions.
Preliminaries
Federated Continuous Greedy
Partial Participation and Client Selection
Practical Federated Continuous Greedy
Discrete Algorithm in Federated Setting
Conclusions and Future Work
Preliminary Results and Proof of Convergence for Full Participation
Quick overview.
Putting Everything Together: Proof of \ref{['thm:full-participation']}
Upper Bound for $\gamma_t$ in \ref{['ass:bounded-gradients']}
Proof of Convergence for FedCG (\ref{['thm:convergence-Scheme2-Alg1']})
Bounding the Variance
...and 17 more sections

Key Result

Theorem 3.2

Let $\mathcal{M}$ be a matroid of rank $r$ and $\mathcal{P}$ be its matroid polytope. Under the full participation assumption and Assumption ass:bounded-gradients, for every $\eta > 0$, Algorithm alg:FCG returns a $\mathbf{x}^{(T)}\in\mathcal{P}$ such that In particular, for large enough $T$, setting $\eta = 1/T$, Algorithm alg:FCG requires at most $\Tilde{O}(r)$ bits of communication per user pe

Theorems & Definitions (37)

Example 1.1: Welfare Maximization
Example 1.2: Feature Selection
Theorem 3.2: Full participation
Lemma 3.3: Unbiased sampling scheme
Lemma 3.4: Bounded variance
Theorem 3.5
Lemma 4.1: Unbiased sampling scheme
Lemma 4.2: Bounded variance
Theorem 4.3
Remark 4.4
...and 27 more

Decomposable Submodular Maximization in Federated Setting

TL;DR

Abstract

Decomposable Submodular Maximization in Federated Setting

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (37)