Deeply Learned Robust Matrix Completion for Large-scale Low-rank Data Recovery
HanQin Cai, Chandra Kundu, Jialin Liu, Wotao Yin
TL;DR
This work tackles large-scale robust matrix completion by formulating LRMC, a scalable non-convex method that writes the target data as a low-rank factor $\bm{L}\bm{R}^\top$ plus a sparse outlier $\bm{S}$ and optimizes with a differentiable objective under partial observations. A key innovation is replacing expensive sparsity steps with learnable soft-thresholding via deep unfolding, together with scaled gradient updates for the low-rank factors; a novel FRMNN model further extends to effectively infinite iterations. The authors prove a recovery guarantee in a special case and show LRMC achieves linear convergence with a per-iteration cost of $\mathcal{O}(p n^2 r + n r^2)$, independent of outlier sparsity, while enabling fast inference on real tasks such as video background subtraction, ultrasound imaging, face modeling, and cloud removal. Empirically, LRMC outperforms state-of-the-art methods in speed and robustness across synthetic and diverse real datasets, highlighting its scalability and practical impact for large-scale robust data recovery.
Abstract
Robust matrix completion (RMC) is a widely used machine learning tool that simultaneously tackles two critical issues in low-rank data analysis: missing data entries and extreme outliers. This paper proposes a novel scalable and learnable non-convex approach, coined Learned Robust Matrix Completion (LRMC), for large-scale RMC problems. LRMC enjoys low computational complexity with linear convergence. Motivated by the proposed theorem, the free parameters of LRMC can be effectively learned via deep unfolding to achieve optimum performance. Furthermore, this paper proposes a flexible feedforward-recurrent-mixed neural network framework that extends deep unfolding from fix-number iterations to infinite iterations. The superior empirical performance of LRMC is verified with extensive experiments against state-of-the-art on synthetic datasets and real applications, including video background subtraction, ultrasound imaging, face modeling, and cloud removal from satellite imagery.
