Near-optimal Rank Adaptive Inference of High Dimensional Matrices
Frédéric Zheng, Yassir Jedra, Alexandre Proutiere
TL;DR
This work addresses high-dimensional matrix estimation from linear measurements by developing rank-adaptive inference that learns the effective rank from data. It introduces instance-specific lower bounds that depend on the target matrix spectrum and covariate design, and proposes a Thresholded Least Squares Estimator (T-LSE) that combines a Least Squares estimator with universal singular value thresholding to achieve near-optimal performance. Theoretical results provide finite-sample, rate-optimal bounds in multivariate regression and linear dynamical system identification, along with a refined analysis of matrix denoising via singular value thresholding. Empirical results corroborate the theory, showing that adaptivity yields substantial gains when the singular spectrum decays or alignment between design and target is imperfect, with potential impact on multivariate regression and dynamic system identification tasks.
Abstract
We address the problem of estimating a high-dimensional matrix from linear measurements, with a focus on designing optimal rank-adaptive algorithms. These algorithms infer the matrix by estimating its singular values and the corresponding singular vectors up to an effective rank, adaptively determined based on the data. We establish instance-specific lower bounds for the sample complexity of such algorithms, uncovering fundamental trade-offs in selecting the effective rank: balancing the precision of estimating a subset of singular values against the approximation cost incurred for the remaining ones. Our analysis identifies how the optimal effective rank depends on the matrix being estimated, the sample size, and the noise level. We propose an algorithm that combines a Least-Squares estimator with a universal singular value thresholding procedure. We provide finite-sample error bounds for this algorithm and demonstrate that its performance nearly matches the derived fundamental limits. Our results rely on an enhanced analysis of matrix denoising methods based on singular value thresholding. We validate our findings with applications to multivariate regression and linear dynamical system identification.
