Optimal Estimation in Orthogonally Invariant Generalized Linear Models: Spectral Initialization and Approximate Message Passing
Yihan Zhang, Hong Chang Ji, Ramji Venkataramanan, Marco Mondelli
TL;DR
This work studies parameter estimation in generalized linear models with orthogonally invariant random designs, proposing a spectrally informed initialization and a Generalized Vector AMP (GVAMP) algorithmic framework. It proves that optimal spectral estimators reach a universal weak recovery threshold and that GVAMP dynamics, when initialized spectrally, admit a tractable state evolution describing their asymptotic behavior. A spectrally initialized Bayes-GVAMP variant is introduced whose fixed points align with the Bayes risk conjectured via replica analysis, suggesting statistical optimality within a broad efficiently computable class. Empirical results on synthetic data and real datasets such as GTEx and coded diffraction patterns demonstrate robust universality and superiority over i.i.d.-based methods, underscoring the practical impact of spectral initialization tied to GVAMP in complex correlated designs.
Abstract
We consider the problem of parameter estimation from a generalized linear model with a random design matrix that is orthogonally invariant in law. Such a model allows the design have an arbitrary distribution of singular values and only assumes that its singular vectors are generic. It is a vast generalization of the i.i.d. Gaussian design typically considered in the theoretical literature, and is motivated by the fact that real data often have a complex correlation structure so that methods relying on i.i.d. assumptions can be highly suboptimal. Building on the paradigm of spectrally-initialized iterative optimization, this paper proposes optimal spectral estimators and combines them with an approximate message passing (AMP) algorithm, establishing rigorous performance guarantees for these two algorithmic steps. Both the spectral initialization and the subsequent AMP meet existing conjectures on the fundamental limits to estimation -- the former on the optimal sample complexity for efficient weak recovery, and the latter on the optimal errors. Numerical experiments suggest the effectiveness of our methods and accuracy of our theory beyond orthogonally invariant data.
