Statistical Inference for Low-Rank Tensor Models
Ke Xu, Elynn Chen, Yuefeng Han
TL;DR
The paper develops a unified debiased-tangent-space approach to statistical inference for general and low-Tucker-rank linear functionals of tensors in tensor regression and tensor PCA. By projecting debiased estimators onto the tangent space of the low-Tucker-rank manifold and (when desirable) employing sample splitting, it achieves asymptotic normality and minimax-optimal confidence-interval lengths under weakened incoherence conditions and practical sub-Gaussian assumptions. The work provides explicit sample-size and SNR thresholds for both general and low-rank linear functionals, deriving data-driven procedures for variance estimation and CI construction. Numerical experiments confirm the theoretical guarantees and demonstrate applicability across diverse tensor models and loading structures, offering a pathway to scalable, uncertainty-aware tensor inference in high-dimensional data settings.
Abstract
Statistical inference for tensors has emerged as a critical challenge in analyzing high-dimensional data in modern data science. This paper introduces a unified framework for inferring general and low-Tucker-rank linear functionals of low-Tucker-rank signal tensors for several low-rank tensor models. Our methodology tackles two primary goals: achieving asymptotic normality and constructing minimax-optimal confidence intervals. By leveraging a debiasing strategy and projecting onto the tangent space of the low-Tucker-rank manifold, we enable inference for general and structured linear functionals, extending far beyond the scope of traditional entrywise inference. Specifically, in the low-Tucker-rank tensor regression or PCA model, we establish the computational and statistical efficiency of our approach, achieving near-optimal sample size requirements (in regression model) and signal-to-noise ratio (SNR) conditions (in PCA model) for general linear functionals without requiring sparsity in the loading tensor. Our framework also attains both computationally and statistically optimal sample size and SNR thresholds for low-Tucker-rank linear functionals. Numerical experiments validate our theoretical results, showcasing the framework's utility in diverse applications. This work addresses significant methodological gaps in statistical inference, advancing tensor analysis for complex and high-dimensional data environments.
