Graph-based Algorithms for Linear Computation Coding
Hans Rosenberger, Ali Bereyhi, Ralf R. Müller
TL;DR
This work addresses efficient computation of $\mathbf{y}=\mathbf{T}\mathbf{x}$ with constant $\mathbf{T}$ by reframing linear computation coding (LCC) as a DAG-based decomposition problem that jointly optimizes operation count and parallelism. It introduces a mixed algorithm (ua) that controls the dag depth via $\Delta\mu_{\max}$ and a depth-based penalty, bridging fully sequential (fs) and fully parallel (fp) approaches. Through hardware-driven cost modeling and extensive simulations, ua—with a focus on maintaining a parallel structure—consistently outperforms fs, fp, and existing baselines in total cost, especially under pipelined implementations. The results demonstrate that DAG-aware LCC is practical for large-scale, real-time linear mappings and has direct relevance to hardware-constrained neural network inference and signal processing tasks.
Abstract
We revisit existing linear computation coding (LCC) algorithms, and introduce a new framework that measures the computational cost of computing multidimensional linear functions, not only in terms of the number of additions, but also with respect to their suitability for parallel processing. Utilizing directed acyclic graphs, which correspond to signal flow graphs in hardware, we propose a novel LCC algorithm that controls the trade-off between the total number of operations and their parallel executability. Numerical evaluations show that the proposed algorithm, constrained to a fully parallel structure, outperforms existing schemes.
