Relative Error Tensor Low Rank Approximation
Zhao Song, David P. Woodruff, Peilin Zhong
TL;DR
The paper tackles the problem of relative-error low-rank tensor approximation under the Frobenius norm, addressing the fundamental issue that an exact rank-k solution may not exist and that tensor rank is NP-hard. It introduces bicriteria and fixed-parameter approaches that yield near-optimal relative-error guarantees, using an iterative framework based on flattenings, regression subproblems, and randomized sketches (Gaussian, CountSketch, TensorSketch) to efficiently approximate CP decompositions and CURT-type decompositions. The results extend to a broad family of tensor error measures (ℓ1, ℓp, weighted norms) and to matrix CUR decompositions with input-sparsity time, while also establishing ETH-based hardness and hard instances that justify the need for bicriteria or parameterized strategies. The work thus provides the first scalable, relative-error low-rank approximation tools for tensors across many norms and structural settings, with significant implications for CURT, tensor regression, and large-scale data applications.
Abstract
We consider relative error low rank approximation of $tensors$ with respect to the Frobenius norm: given an order-$q$ tensor $A \in \mathbb{R}^{\prod_{i=1}^q n_i}$, output a rank-$k$ tensor $B$ for which $\|A-B\|_F^2 \leq (1+ε)$OPT, where OPT $= \inf_{\textrm{rank-}k~A'} \|A-A'\|_F^2$. Despite the success on obtaining relative error low rank approximations for matrices, no such results were known for tensors. One structural issue is that there may be no rank-$k$ tensor $A_k$ achieving the above infinum. Another, computational issue, is that an efficient relative error low rank approximation algorithm for tensors would allow one to compute the rank of a tensor, which is NP-hard. We bypass these issues via (1) bicriteria and (2) parameterized complexity solutions: (1) We give an algorithm which outputs a rank $k' = O((k/ε)^{q-1})$ tensor $B$ for which $\|A-B\|_F^2 \leq (1+ε)$OPT in $nnz(A) + n \cdot \textrm{poly}(k/ε)$ time in the real RAM model. Here $nnz(A)$ is the number of non-zero entries in $A$. (2) We give an algorithm for any $δ>0$ which outputs a rank $k$ tensor $B$ for which $\|A-B\|_F^2 \leq (1+ε)$OPT and runs in $ ( nnz(A) + n \cdot \textrm{poly}(k/ε) + \exp(k^2/ε) ) \cdot n^δ$ time in the unit cost RAM model. For outputting a rank-$k$ tensor, or even a bicriteria solution with rank-$Ck$ for a certain constant $C > 1$, we show a $2^{Ω(k^{1-o(1)})}$ time lower bound under the Exponential Time Hypothesis. Our results give the first relative error low rank approximations for tensors for a large number of robust error measures for which nothing was known, as well as column row and tube subset selection. We also obtain new results for matrices, such as $nnz(A)$-time CUR decompositions, improving previous $nnz(A)\log n$-time algorithms, which may be of independent interest.
