Table of Contents
Fetching ...

Decomposable Submodular Maximization in Federated Setting

Akbar Rafiey

TL;DR

This work tackles federated maximization of decomposable submodular functions under privacy and communication constraints. It extends the continuous greedy paradigm to a federated setting (FedCG), proving $(1-1/e)$-approximation guarantees with small additive errors under both full and partial participation, and introduces scalable client-sampling techniques to handle massive client counts. It further develops a discrete federated approach (FedDiscrete Greedy) with provable approximation guarantees for general matroids and uniform matroids, and demonstrates applicability to fundamental problems like Facility Location and Maximum Coverage. The results provide a framework for privacy-preserving, scalable optimization of submodular objectives across distributed data sources, with concrete guidance on communication efficiency and gradient estimation.

Abstract

Submodular functions, as well as the sub-class of decomposable submodular functions, and their optimization appear in a wide range of applications in machine learning, recommendation systems, and welfare maximization. However, optimization of decomposable submodular functions with millions of component functions is computationally prohibitive. Furthermore, the component functions may be private (they might represent user preference function, for example) and cannot be widely shared. To address these issues, we propose a {\em federated optimization} setting for decomposable submodular optimization. In this setting, clients have their own preference functions, and a weighted sum of these preferences needs to be maximized. We implement the popular {\em continuous greedy} algorithm in this setting where clients take parallel small local steps towards the local solution and then the local changes are aggregated at a central server. To address the large number of clients, the aggregation is performed only on a subsampled set. Further, the aggregation is performed only intermittently between stretches of parallel local steps, which reduces communication cost significantly. We show that our federated algorithm is guaranteed to provide a good approximate solution, even in the presence of above cost-cutting measures. Finally, we show how the federated setting can be incorporated in solving fundamental discrete submodular optimization problems such as Maximum Coverage and Facility Location.

Decomposable Submodular Maximization in Federated Setting

TL;DR

This work tackles federated maximization of decomposable submodular functions under privacy and communication constraints. It extends the continuous greedy paradigm to a federated setting (FedCG), proving -approximation guarantees with small additive errors under both full and partial participation, and introduces scalable client-sampling techniques to handle massive client counts. It further develops a discrete federated approach (FedDiscrete Greedy) with provable approximation guarantees for general matroids and uniform matroids, and demonstrates applicability to fundamental problems like Facility Location and Maximum Coverage. The results provide a framework for privacy-preserving, scalable optimization of submodular objectives across distributed data sources, with concrete guidance on communication efficiency and gradient estimation.

Abstract

Submodular functions, as well as the sub-class of decomposable submodular functions, and their optimization appear in a wide range of applications in machine learning, recommendation systems, and welfare maximization. However, optimization of decomposable submodular functions with millions of component functions is computationally prohibitive. Furthermore, the component functions may be private (they might represent user preference function, for example) and cannot be widely shared. To address these issues, we propose a {\em federated optimization} setting for decomposable submodular optimization. In this setting, clients have their own preference functions, and a weighted sum of these preferences needs to be maximized. We implement the popular {\em continuous greedy} algorithm in this setting where clients take parallel small local steps towards the local solution and then the local changes are aggregated at a central server. To address the large number of clients, the aggregation is performed only on a subsampled set. Further, the aggregation is performed only intermittently between stretches of parallel local steps, which reduces communication cost significantly. We show that our federated algorithm is guaranteed to provide a good approximate solution, even in the presence of above cost-cutting measures. Finally, we show how the federated setting can be incorporated in solving fundamental discrete submodular optimization problems such as Maximum Coverage and Facility Location.
Paper Structure (32 sections, 17 theorems, 93 equations, 5 algorithms)

This paper contains 32 sections, 17 theorems, 93 equations, 5 algorithms.

Key Result

Theorem 3.2

Let $\mathcal{M}$ be a matroid of rank $r$ and $\mathcal{P}$ be its matroid polytope. Under the full participation assumption and Assumption ass:bounded-gradients, for every $\eta > 0$, Algorithm alg:FCG returns a $\mathbf{x}^{(T)}\in\mathcal{P}$ such that In particular, for large enough $T$, setting $\eta = 1/T$, Algorithm alg:FCG requires at most $\Tilde{O}(r)$ bits of communication per user pe

Theorems & Definitions (37)

  • Example 1.1: Welfare Maximization
  • Example 1.2: Feature Selection
  • Theorem 3.2: Full participation
  • Lemma 3.3: Unbiased sampling scheme
  • Lemma 3.4: Bounded variance
  • Theorem 3.5
  • Lemma 4.1: Unbiased sampling scheme
  • Lemma 4.2: Bounded variance
  • Theorem 4.3
  • Remark 4.4
  • ...and 27 more