Diaconis-Ylvisaker prior penalized likelihood for $p/n \to κ\in (0,1)$ logistic regression
Philipp Sterzinger, Ioannis Kosmidis
TL;DR
This work develops a comprehensive high-dimensional inference framework for logistic regression under proportional asymptotics by analysing the maximum Diaconis–Ylvisaker prior penalized likelihood (MDYPL) estimator. By recasting the non-separable DY prior as a logistic model with transformed responses, the authors derive an AMP-based state-evolution description that yields aggregate bias, variance, and asymptotic distributions for the MDYPL estimator and associated test statistics. They establish adjusted Z-statistics and rescaled penalized-likelihood ratio statistics that achieve standard null distributions, extend results to arbitrary covariate covariance, and propose estimation procedures for the unknown constants, with strong empirical support including simulations and a digit-recognition case study. The paper also explores adaptive shrinkage via the DY prior, identifies conditions for aggregate unbiasedness, and provides a conjectured extension to models with intercepts, all implemented in the brglm2 package. Overall, the results offer robust estimation and inference in high-dimensional logistic regression even when ML fails, broadening practical applicability and enabling principled hypothesis testing in complex regimes.
Abstract
We characterise the behavior of the maximum Diaconis--Ylvisaker prior penalized likelihood estimator in high-dimensional logistic regression, where the number of covariates is a fraction $κ\in (0,1)$ of the number of observations $n$, as $n \to \infty$. We construct a rescaled estimator with zero asymptotic aggregate bias and define adjusted $Z$-statistics and rescaled penalized likelihood ratio statistics that exhibit the typical null asymptotic distributions, when the covariates are independent multivariate normal with an arbitrary covariance matrix and the linear predictor has asymptotic variance $γ^2$. While the maximum likelihood estimate asymptotically exists only for a narrow range of $(κ, γ)$ values, the maximum Diaconis--Ylvisaker prior penalized likelihood estimate always exists and can be computed directly using standard maximum likelihood routines. Thus, our asymptotic results extend to $(κ, γ)$ values where the maximum likelihood framework breaks down, with no additional implementation or computational cost. We study the estimator's shrinkage properties, compare the proposed estimation and inference procedures with alternatives that also accommodate proportional asymptotics, and formulate a conjecture -- supported by strong empirical evidence -- that extends our results when the model includes an intercept parameter. Finally, we propose estimation methods for all unknown constants involved in our procedures and demonstrate the theoretical advances through extensive simulation studies and the analysis of digit recognition data.
