Faster Algorithms for Schatten-p Low Rank Approximation
Praneeth Kacham, David P. Woodruff
TL;DR
This work advances Schatten-$p$ low rank approximation by deriving faster iterative algorithms that exploit fast rectangular matrix multiplication and block-size tradeoffs, and by combining these with Li and Woodruff's sketching-based LW20 approach to broaden the regime where near-linear-time, high-precision LRA is attainable. It also provides the first stability analysis for a FP-implementable variant of Block Krylov/LazySVD techniques, showing that a modified LazySVD can deliver $(1+\\varepsilon)$-approximate Schatten-$p$ LRA on machines with polylogarithmic precision when using $O((k/\\sqrt{\\varepsilon})\\operatorname{poly}(\\log n))$ matvecs. The results identify concrete time scales, e.g., $\\tilde{O}(\\sqrt{p} n^{2+\\eta})$ under favorable rank conditions and a combined bound of $\\tilde{O}((n^{1-2/p})^{2+\\eta+(1-\\alpha)\\beta/(1+2\\beta)}\\operatorname{poly}(1/\\varepsilon) + n^2)$, signaling practical pathways for fast, stable Schatten-$p$ LRA in dense regimes. Collectively, the paper argues for a pragmatic hybrid pipeline that blends sketching, fast Krylov-based subspace methods, and FP-stable least-squares steps to balance speed and numerical reliability in large-scale low-rank approximation tasks.
Abstract
We study algorithms for the Schatten-$p$ Low Rank Approximation (LRA) problem. First, we show that by using fast rectangular matrix multiplication algorithms and different block sizes, we can improve the running time of the algorithms in the recent work of Bakshi, Clarkson and Woodruff (STOC 2022). We then show that by carefully combining our new algorithm with the algorithm of Li and Woodruff (ICML 2020), we can obtain even faster algorithms for Schatten-$p$ LRA. While the block-based algorithms are fast in the real number model, we do not have a stability analysis which shows that the algorithms work when implemented on a machine with polylogarithmic bits of precision. We show that the LazySVD algorithm of Allen-Zhu and Li (NeurIPS 2016) can be implemented on a floating point machine with only logarithmic, in the input parameters, bits of precision. As far as we are aware, this is the first stability analysis of any algorithm using $O((k/\sqrt{\varepsilon})\text{poly}(\log n))$ matrix-vector products with the matrix $A$ to output a $1+\varepsilon$ approximate solution for the rank-$k$ Schatten-$p$ LRA problem.
