Tessellated Distributed Computing
Ali Khalesi, Petros Elia
TL;DR
This work models multi-user distributed computing of real-valued linearly-decomposable functions as a fixed-support matrix factorization problem DE ≈ F, where D encodes server-function assignments and E encodes server contributions and user connections. It introduces tessellation-based and SVD-based fixed-support designs that decompose F into tiles, yielding lossless schemes with exact capacity in key regimes and, in lossy scenarios, asymptotic error bounds derived from the Marchenko-Pastur distribution. The main contributions include (i) a capacity formula for lossless reconstruction under disjoint supports, (ii) a tessellated construction linking fixed-support factorization with tiling and SVD, and (iii) a stochastic lossy analysis providing tight bounds on average reconstruction error via MP statistics. Collectively, the results quantify the rate-distortion-computation-communication tradeoffs in distributed computing and reveal a principled, tile-based approach to scalable, reliable computation across heterogeneous servers.
Abstract
The work considers the $N$-server distributed computing scenario with $K$ users requesting functions that are linearly-decomposable over an arbitrary basis of $L$ real (potentially non-linear) subfunctions. In our problem, the aim is for each user to receive their function outputs, allowing for reduced reconstruction error (distortion) $ε$, reduced computing cost ($γ$; the fraction of subfunctions each server must compute), and reduced communication cost ($δ$; the fraction of users each server must connect to). For any given set of $K$ requested functions -- which is here represented by a coefficient matrix $\mathbf {F} \in \mathbb{R}^{K \times L}$ -- our problem is made equivalent to the open problem of sparse matrix factorization that seeks -- for a given parameter $T$, representing the number of shots for each server -- to minimize the reconstruction distortion $\frac{1}{KL}\|\mathbf {F} - \mathbf{D}\mathbf{E}\|^2_{F}$ overall $δ$-sparse and $γ$-sparse matrices $\mathbf{D}\in \mathbb{R}^{K \times NT}$ and $\mathbf{E} \in \mathbb{R}^{NT \times L}$. With these matrices respectively defining which servers compute each subfunction, and which users connect to each server, we here design our $\mathbf{D},\mathbf{E}$ by designing tessellated-based and SVD-based fixed support matrix factorization methods that first split $\mathbf{F}$ into properly sized and carefully positioned submatrices, which we then approximate and then decompose into properly designed submatrices of $\mathbf{D}$ and $\mathbf{E}$.
