Structured Matrix Learning under Arbitrary Entrywise Dependence and Estimation of Markov Transition Kernel
Jinhang Chai, Jianqing Fan
TL;DR
This work tackles structured matrix learning where the signal is a sum of a low-rank and a sparse part under arbitrary entrywise noise. It develops an incoherent-constrained least-squares estimator and proves deterministic and minimax-optimal guarantees by leveraging a novel separation lemma that forces energy to spread across entries in incoherent matrices. The framework is instantiated to Markov transition kernel estimation, achieving minimax rates, and extended to structured RL, multitask regression, and robust covariance estimation. A practical alternating-minimization algorithm is proposed and empirically validated, with convergence in a few iterations and strong performance in both simulated and real data settings. The results offer a broadly applicable blueprint for precise recovery under heavy dependence and have implications for scalable RL and high-dimensional statistical learning.
Abstract
The problem of structured matrix estimation has been studied mostly under strong noise dependence assumptions. This paper considers a general framework of noisy low-rank-plus-sparse matrix recovery, where the noise matrix may come from any joint distribution with arbitrary dependence across entries. We propose an incoherent-constrained least-square estimator and prove its tightness both in the sense of deterministic lower bound and matching minimax risks under various noise distributions. To attain this, we establish a novel result asserting that the difference between two arbitrary low-rank incoherent matrices must spread energy out across its entries; in other words, it cannot be too sparse, which sheds light on the structure of incoherent low-rank matrices and may be of independent interest. We then showcase the applications of our framework to several important statistical machine learning problems. In the problem of estimating a structured Markov transition kernel, the proposed method achieves the minimax optimality and the result can be extended to estimating the conditional mean operator, a crucial component in reinforcement learning. The applications to multitask regression and structured covariance estimation are also presented. We propose an alternating minimization algorithm to approximately solve the potentially hard optimization problem. Numerical results corroborate the effectiveness of our method which typically converges in a few steps.
