On Sampling Strategies for Spectral Model Sharding

Denis Korzhenkov; Christos Louizos

On Sampling Strategies for Spectral Model Sharding

Denis Korzhenkov, Christos Louizos

TL;DR

This work presents two sampling strategies for Spectral model sharding, obtained as solutions to specific optimization problems, and demonstrates that both of these methods can lead to improved performance on various commonly used datasets.

Abstract

The problem of heterogeneous clients in federated learning has recently drawn a lot of attention. Spectral model sharding, i.e., partitioning the model parameters into low-rank matrices based on the singular value decomposition, has been one of the proposed solutions for more efficient on-device training in such settings. In this work, we present two sampling strategies for such sharding, obtained as solutions to specific optimization problems. The first produces unbiased estimators of the original weights, while the second aims to minimize the squared approximation error. We discuss how both of these estimators can be incorporated in the federated learning loop and practical considerations that arise during local training. Empirically, we demonstrate that both of these methods can lead to improved performance on various commonly used datasets.

On Sampling Strategies for Spectral Model Sharding

TL;DR

Abstract

On Sampling Strategies for Spectral Model Sharding

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (7)