Robust Tensor CUR Decompositions: Rapid Low-Tucker-Rank Tensor Recovery with Sparse Corruption
HanQin Cai, Zehan Chao, Longxiu Huang, Deanna Needell
TL;DR
This work tackles robust tensor recovery under Tucker-rank constraints by formulating TRPCA as decomposing a corrupted tensor $\mathcal{X}$ into a low-Tucker-rank part $\mathcal{L}$ and a sparse part $\mathcal{S}$, solved via a fast non-convex algorithm RTCUR. RTCUR uses alternating projections between sparsity enforcement (hard-thresholding) and inexact low-Tucker-rank approximation implemented through mode-wise tensor CUR decompositions (Fiber and Chidori), yielding significant computational savings over HOSVD-based TRPCA methods. The authors provide theoretical support for the tensor sparsity model, present four sampling variants, and demonstrate through extensive synthetic and real-data experiments that RTCUR is faster and often more robust than state-of-the-art TRPCA methods in tasks such as robust face modeling, video background subtraction, and network clustering. This approach enables scalable, reliable tensor recovery in large-scale applications by exploiting tensor CUR structure and adaptive sparsity handling.
Abstract
We study the tensor robust principal component analysis (TRPCA) problem, a tensorial extension of matrix robust principal component analysis (RPCA), that aims to split the given tensor into an underlying low-rank component and a sparse outlier component. This work proposes a fast algorithm, called Robust Tensor CUR Decompositions (RTCUR), for large-scale non-convex TRPCA problems under the Tucker rank setting. RTCUR is developed within a framework of alternating projections that projects between the set of low-rank tensors and the set of sparse tensors. We utilize the recently developed tensor CUR decomposition to substantially reduce the computational complexity in each projection. In addition, we develop four variants of RTCUR for different application settings. We demonstrate the effectiveness and computational advantages of RTCUR against state-of-the-art methods on both synthetic and real-world datasets.
