Phase transitions for the existence of unregularized M-estimators in single index models
Takuya Koriyama, Pierre C. Bellec
TL;DR
The paper addresses the existence of unregularized M-estimators for high-dimensional single-index models under proportional asymptotics, deriving an explicit critical threshold $\delta_\infty$ that governs a sharp phase transition. It extends Candès & Sur's phase-transition results beyond binary logistic regression to Poisson and other link models, using a convex-analytic construction to connect estimator existence to a nonlinear system describing asymptotic behavior. A key contribution is proving an equivalence between the nonlinear system and a constrained infinite-dimensional convex optimization in a Hilbert space, ensuring a unique solution exists if and only if $\delta > \delta_\infty$ with a positive Lagrange multiplier; this provides a rigorous foundation for proportional asymptotics analyses. Numerical simulations in Poisson and generalized logistic settings corroborate the theory, showing empirical thresholds that match the theoretical $1/\delta_\infty$ across model variants and confirming the phase transition in estimator existence.
Abstract
This paper studies phase transitions for the existence of unregularized M-estimators under proportional asymptotics where the sample size $n$ and feature dimension $p$ grow proportionally with $n/p \to δ\in (1, \infty)$. We study the existence of M-estimators in single-index models where the response $y_i$ depends on covariates $x_i \sim N(0, I_p)$ through an unknown index ${w} \in \mathbb{R}^p$ and an unknown link function. An explicit expression is derived for the critical threshold $δ_\infty$ that determines the phase transition for the existence of the M-estimator, generalizing the results of Candés & Sur (2020) for binary logistic regression to other single-index models. Furthermore, we investigate the existence of a solution to the nonlinear system of equations governing the asymptotic behavior of the M-estimator when it exists. The existence of solution to this system for $δ> δ_\infty$ remains largely unproven outside the global null in binary logistic regression. We address this gap with a proof that the system admits a solution if and only if $δ> δ_\infty$, providing a comprehensive theoretical foundation for proportional asymptotic results that require as a prerequisite the existence of a solution to the system.
