Reweighted Solutions for Weighted Low Rank Approximation
David P. Woodruff, Taisuke Yasuda
TL;DR
This paper tackles weighted low rank approximation (WLRA), an NP-hard problem, by introducing a simple reweighting relaxation that uses the weight matrix $\mathbf{W}$ to reweight a low-rank solution. Under the natural assumption that $\mathrm{rank}(\mathbf{W})\le r$, a rank-$rk$ approximation of $\mathbf{W}\circ\mathbf{A}$ combined with an entrywise division by $\mathbf{W}$ yields provable approximation guarantees, with near-optimal storage and efficiency via randomized SVD. The authors establish a first relative-error guarantee for weighted feature selection via column subset selection and show nearly tight communication complexity bounds that depend on $r$, the rank of $\mathbf{W}$. They also demonstrate practical performance in model compression tasks, where weight matrices exhibit low-rank structure, and provide experiments on synthetic data that corroborate the theoretical findings. Overall, the work offers a simple yet powerful framework that unifies theory and practice for WLRA, with implications for distributed computing and parameter-efficient weight representations in large models.
Abstract
Weighted low rank approximation (WLRA) is an important yet computationally challenging primitive with applications ranging from statistical analysis, model compression, and signal processing. To cope with the NP-hardness of this problem, prior work considers heuristics, bicriteria, or fixed parameter tractable algorithms to solve this problem. In this work, we introduce a new relaxed solution to WLRA which outputs a matrix that is not necessarily low rank, but can be stored using very few parameters and gives provable approximation guarantees when the weight matrix has low rank. Our central idea is to use the weight matrix itself to reweight a low rank solution, which gives an extremely simple algorithm with remarkable empirical performance in applications to model compression and on synthetic datasets. Our algorithm also gives nearly optimal communication complexity bounds for a natural distributed problem associated with this problem, for which we show matching communication lower bounds. Together, our communication complexity bounds show that the rank of the weight matrix provably parameterizes the communication complexity of WLRA. We also obtain the first relative error guarantees for feature selection with a weighted objective.
