Distributed Least Squares in Small Space via Sketching and Bias Reduction
Sachin Garg, Kevin Tan, Michał Dereziński
TL;DR
This work tackles the challenge of performing least-squares in space-limited distributed settings by focusing on reducing estimator bias rather than minimizing error alone. It introduces a sparse leverage-score sparsified embedding (LESS) that, combined with a leave-one-out analysis and higher-moment bounds, yields a nearly unbiased LS estimator from a compact sketch. A two-pass distributed scheme with a preconditioner achieves near-optimal time and space, enabling current-matrix-multiplication-time performance with only O(d^2 log(nd)) bits of space. The results extend beyond LS to bias-variance analyses for other sketching-based estimators and demonstrate a practical free-lunch phenomenon in distributed averaging, with significant implications for scalable, robust RandNLA in streaming and distributed contexts.
Abstract
Matrix sketching is a powerful tool for reducing the size of large data matrices. Yet there are fundamental limitations to this size reduction when we want to recover an accurate estimator for a task such as least square regression. We show that these limitations can be circumvented in the distributed setting by designing sketching methods that minimize the bias of the estimator, rather than its error. In particular, we give a sparse sketching method running in optimal space and current matrix multiplication time, which recovers a nearly-unbiased least squares estimator using two passes over the data. This leads to new communication-efficient distributed averaging algorithms for least squares and related tasks, which directly improve on several prior approaches. Our key novelty is a new bias analysis for sketched least squares, giving a sharp characterization of its dependence on the sketch sparsity. The techniques include new higher-moment restricted Bai-Silverstein inequalities, which are of independent interest to the non-asymptotic analysis of deterministic equivalents for random matrices that arise from sketching.
