On Decentralized Linearly Separable Computation With the Minimum Computation Cost

Haoning Chen; Minquan Cheng; Zhenhao Huang; Youlong Wu

On Decentralized Linearly Separable Computation With the Minimum Computation Cost

Haoning Chen, Minquan Cheng, Zhenhao Huang, Youlong Wu

TL;DR

This work studies decentralized linearly separable computation with $\mathsf{N}$ workers, $\mathsf{N}_{\mathrm{r}}$ responsive, and $\mathsf{K}_{\mathrm{c}}$ linear combinations of $\mathsf{K}$ datasets. It introduces a cyclic data assignment and a side-information–aware scheme that leverages locally computed messages to minimize communication, achieving $\mathsf{R}_{\mathrm{dec}} = \mathsf{N}_{\mathrm{r}}\mathsf{K}_{\mathrm{c}}$ for small $\mathsf{K}_{\mathrm{c}}$ and $\mathsf{R}_{\mathrm{dec}} = \tfrac{\mathsf{K}}{\mathsf{N}}\mathsf{N}_{\mathrm{r}}$ for larger $\mathsf{K}_{\mathrm{c}}$, with optimality under cyclic assignment ($\mathsf{R}_{\mathrm{cyc}}^{*}=\mathsf{R}_{\mathrm{dec}}$). Importantly, side information provides gains only when $\mathsf{K}_{\mathrm{c}} > \tfrac{\mathsf{K}}{\mathsf{N}}\mathsf{N}_{\mathrm{r}}$, via a null-space–based encoding and random linear combinations that decouple unknown local data from global outputs. The approach yields provably decodable schemes with high probability and reduces communication compared to the master-based benchmark in relevant regimes, enabling efficient decentralized computation for linearly separable tasks in distributed networks.

Abstract

The distributed linearly separable computation problem finds extensive applications across domains such as distributed gradient coding, distributed linear transform, real-time rendering, etc. In this paper, we investigate this problem in a fully decentralized scenario, where $\mathsf{N}$ workers collaboratively perform the computation task without a central master. Each worker aims to compute a linearly separable computation that can be manifested as $\mathsf{K}_{\mathrm{c}}$ linear combinations of $\mathsf{K}$ messages, where each message is a function of a distinct dataset. We require that each worker successfully fulfill the task based on the transmissions from any $\mathsf{N}_{\mathrm{r}}$ workers, such that the system can tolerate any $\mathsf{N}-\mathsf{N}_{\mathrm{r}}$ stragglers. We focus on the scenario where the computation cost (the number of uncoded datasets assigned to each worker) is minimum, and aim to minimize the communication cost (the number of symbols the fastest $\mathsf{N}_{\mathrm{r}}$ workers transmit). We propose a novel distributed computing scheme that is optimal under the widely used cyclic data assignment. Interestingly, we demonstrate that the side information at each worker is ineffective in reducing the communication cost when $\mathsf{K}_{\mathrm{c}}\leq {\mathsf{K}}\mathsf{N}_{\mathrm{r}}/{\mathsf{N}}$, while it helps reduce the communication cost as $\mathsf{K}_{\mathrm{c}}$ increases.

On Decentralized Linearly Separable Computation With the Minimum Computation Cost

TL;DR

This work studies decentralized linearly separable computation with

workers,

responsive, and

linear combinations of

datasets. It introduces a cyclic data assignment and a side-information–aware scheme that leverages locally computed messages to minimize communication, achieving

for small

and

for larger

, with optimality under cyclic assignment (

). Importantly, side information provides gains only when

, via a null-space–based encoding and random linear combinations that decouple unknown local data from global outputs. The approach yields provably decodable schemes with high probability and reduces communication compared to the master-based benchmark in relevant regimes, enabling efficient decentralized computation for linearly separable tasks in distributed networks.

Abstract

workers collaboratively perform the computation task without a central master. Each worker aims to compute a linearly separable computation that can be manifested as

linear combinations of

messages, where each message is a function of a distinct dataset. We require that each worker successfully fulfill the task based on the transmissions from any

workers, such that the system can tolerate any

stragglers. We focus on the scenario where the computation cost (the number of uncoded datasets assigned to each worker) is minimum, and aim to minimize the communication cost (the number of symbols the fastest

workers transmit). We propose a novel distributed computing scheme that is optimal under the widely used cyclic data assignment. Interestingly, we demonstrate that the side information at each worker is ineffective in reducing the communication cost when

, while it helps reduce the communication cost as

increases.

Paper Structure (11 sections, 7 theorems, 48 equations, 3 figures, 1 table)

This paper contains 11 sections, 7 theorems, 48 equations, 3 figures, 1 table.

Introduction
System Model and Problem Formulation
Data Assignment Phase
Computing Phase
Decoding Phase
Main Results
Achievable Distributed Computing Scheme
Conclusion
Proof of Lemma \ref{['fullrank']}
Proof of Lemma \ref{['random']}
Proof of Lemma \ref{['dimlowerbound']}

Key Result

Theorem 1

For the $\left(\mathsf{K}, \mathsf{N}, \mathsf{N}_{\mathrm{r}}, \mathsf{K}_{\mathrm{c}}, \mathsf{M}\right)$ decentralized linearly separable computation problem with $\mathsf{M}=\frac{\mathsf{K}}{\mathsf{N}}\left(\mathsf{N}-\mathsf{N}_{\mathrm{r}}+1\right)$, the achieved communication cost $\mathsf{

Figures (3)

Figure 1: The considered decentralized computing system.
Figure 2: Communication costs for $\mathsf{K}=12$, $\mathsf{N}=6$, $\mathsf{N}_{\mathrm{r}}=3$.
Figure 3: Communication costs for $\mathsf{K}=12$, $\mathsf{N}=12$, $\mathsf{K}_{\mathrm{c}}=8$.

Theorems & Definitions (13)

Theorem 1
proof
Remark 1
Remark 2
Theorem 2: Optimality
proof
Remark 3
Example 1
Lemma 1
Lemma 2
...and 3 more

On Decentralized Linearly Separable Computation With the Minimum Computation Cost

TL;DR

Abstract

On Decentralized Linearly Separable Computation With the Minimum Computation Cost

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (13)