Learnable Scaled Gradient Descent for Guaranteed Robust Tensor PCA
Lanlan Feng, Ce Zhu, Yipeng Liu, Saiprasad Ravishankar, Longxiu Huang
TL;DR
This work introduces RTPCA-SGD, a scalable, tensor-SVD-based robust PCA method that factors the low-rank component as X = L * R^T and alternates with a sparsity-promoting S, avoiding costly full t-SVDs per iteration. The authors prove exact recovery and linear convergence with a rate independent of the condition number κ under mild μ-incoherence and α-sparsity assumptions, with thresholding controlled by a decaying parameter sequence and a fixed step size. A learnable, self-supervised deep unfolding model (RTPCA-LSGD) is proposed to adapt the four critical parameters (ζ_0, ζ_1, τ, η), enhancing practical performance without ground-truth data. Experiments on synthetic and real data (video denoising and background initialization) show RTPCA-SGD outperforms TNN-based RTPCA while offering competitive runtimes, and RTPCA-LSGD yields further gains, validating both the theory and the practical utility of the approach. Overall, the paper delivers a scalable, provably reliable tensor PCA framework with a principled pathway to learnable parameter optimization.
Abstract
Robust tensor principal component analysis (RTPCA) aims to separate the low-rank and sparse components from multi-dimensional data, making it an essential technique in the signal processing and computer vision fields. Recently emerging tensor singular value decomposition (t-SVD) has gained considerable attention for its ability to better capture the low-rank structure of tensors compared to traditional matrix SVD. However, existing methods often rely on the computationally expensive tensor nuclear norm (TNN), which limits their scalability for real-world tensors. To address this issue, we explore an efficient scaled gradient descent (SGD) approach within the t-SVD framework for the first time, and propose the RTPCA-SGD method. Theoretically, we rigorously establish the recovery guarantees of RTPCA-SGD under mild assumptions, demonstrating that with appropriate parameter selection, it achieves linear convergence to the true low-rank tensor at a constant rate, independent of the condition number. To enhance its practical applicability, we further propose a learnable self-supervised deep unfolding model, which enables effective parameter learning. Numerical experiments on both synthetic and real-world datasets demonstrate the superior performance of the proposed methods while maintaining competitive computational efficiency, especially consuming less time than RTPCA-TNN.
