Multi-Server Multi-Function Distributed Computation
Derya Malak, Mohammad Reza Deylam Salehi, Berksan Serbetci, Petros Elia
TL;DR
This work addresses the communication bottleneck in multi-server multi-function distributed computation where a user requests multiple non-linear functions over distributed datasets. It adopts Körner's characteristic-graph entropy to jointly capture data statistics, correlations, and function structure, deriving a general upper bound on the sum-rate that can be achieved when any $N_r$ servers cooperate in decoding. The main theoretical contributions include a general bound (Theorem 1) and specialized results for correlated Boolean functions and multi-linear functions under cyclic data placement, demonstrating significant gains over prior linear-coding baselines, especially with skewed data in a binary field. The framework offers principled compression limits for distributed functional computation and shows practical impact in reducing communication for complex tasks in cloud and edge computing settings, paving the way for efficient, scalable coded computing under general function classes.
Abstract
The work here studies the communication cost for a multi-server multi-task distributed computation framework, and does so for a broad class of functions and data statistics. Considering the framework where a user seeks the computation of multiple complex (conceivably non-linear) tasks from a set of distributed servers, we establish communication cost upper bounds for a variety of data statistics, function classes and data placements across the servers. To do so, we proceed to apply, for the first time here, Körner's characteristic graph approach -- which is known to capture the structural properties of data and functions -- to the promising framework of multi-server multi-task distributed computing. Going beyond the general expressions, and in order to offer clearer insight, we also consider the well-known scenario of cyclic dataset placement and linearly separable functions over the binary field, in which case our approach exhibits considerable gains over the state of art. Similar gains are identified for the case of multi-linear functions.
