Outlier-aware Tensor Robust Principal Component Analysis with Self-guided Data Augmentation
Yangyang Xu, Kexin Li, Li Yang, You-Wei Wen
TL;DR
This work tackles tensor robust PCA (TRPCA) under structured, non-sparse outliers by introducing a self-guided data augmentation framework. It learns a weight tensor $\mathcal{W}$ and an augmented tensor $\mathcal{Y}$ to reformulate TRPCA as standard TPCA, enabling efficient closed-form updates via proximal block coordinate descent and a convergence guarantee to stationary points. A generalized weight scheme $\mathcal{W}_{ijk}=\exp(-\frac{(\mathcal{Y}_{ijk}-\mathcal{X}_{ijk})^2}{2\gamma})$ extends outlier detection beyond binary masks, with $\gamma \to 0$ recovering oracle behavior. Empirical results on synthetic data and real tasks (face denoising, background subtraction, hyperspectral denoising) show improved accuracy and substantial speedups over state-of-the-art methods, validating the framework's practical impact on high-dimensional tensor processing.
Abstract
Tensor Robust Principal Component Analysis (TRPCA) is a fundamental technique for decomposing multi-dimensional data into a low-rank tensor and an outlier tensor, yet existing methods relying on sparse outlier assumptions often fail under structured corruptions. In this paper, we propose a self-guided data augmentation approach that employs adaptive weighting to suppress outlier influence, reformulating the original TRPCA problem into a standard Tensor Principal Component Analysis (TPCA) problem. The proposed model involves an optimization-driven weighting scheme that dynamically identifies and downweights outlier contributions during tensor augmentation. We develop an efficient proximal block coordinate descent algorithm with closed-form updates to solve the resulting optimization problem, ensuring computational efficiency. Theoretical convergence is guaranteed through a framework combining block coordinate descent with majorization-minimization principles. Numerical experiments on synthetic and real-world datasets, including face recovery, background subtraction, and hyperspectral denoising, demonstrate that our method effectively handles various corruption patterns. The results show the improvements in both accuracy and computational efficiency compared to state-of-the-art methods.
