Matrix Completion with Cross-Concentrated Sampling: Bridging Uniform Sampling and CUR Sampling
HanQin Cai, Longxiu Huang, Pengyu Li, Deanna Needell
TL;DR
This work introduces Cross-Concentrated Sampling (CCS), a flexible matrix-completion framework that blends uniform sampling and CUR sampling by concentrating observations on a cross formed by selected rows and columns. It establishes a sufficient condition for exact recovery under CCS with complexity $ ext{O}(r^2 n \, ext{log}^2(n))$ samples and develops a scalable non-convex solver, Iterative CUR Completion (ICURC), whose per-iteration cost is $ ext{O}(n r (|I|+|J|))$. Theoretical results are complemented by extensive experiments on synthetic data, image inpainting, collaborative filtering, and link prediction, showing CCS's practical benefits and ICURC's efficiency. The findings suggest CCS can reduce sampling costs and adapt to real-world constraints while maintaining recovery guarantees, with potential extensions to tensors and more rigorous convergence analysis in future work.
Abstract
While uniform sampling has been widely studied in the matrix completion literature, CUR sampling approximates a low-rank matrix via row and column samples. Unfortunately, both sampling models lack flexibility for various circumstances in real-world applications. In this work, we propose a novel and easy-to-implement sampling strategy, coined Cross-Concentrated Sampling (CCS). By bridging uniform sampling and CUR sampling, CCS provides extra flexibility that can potentially save sampling costs in applications. In addition, we also provide a sufficient condition for CCS-based matrix completion. Moreover, we propose a highly efficient non-convex algorithm, termed Iterative CUR Completion (ICURC), for the proposed CCS model. Numerical experiments verify the empirical advantages of CCS and ICURC against uniform sampling and its baseline algorithms, on both synthetic and real-world datasets.
