The Hessian Screening Rule
Johan Larsson, Jonas Wallin
TL;DR
The paper addresses accelerating high‑dimensional sparse regression along the lasso path by introducing the Hessian Screening Rule, a second‑order predictor screening technique. By viewing screening as a gradient estimation task, it derives a Hessian‑based, second‑order update to predict the next step in the path, augmented with a restricted computation and a unit‑bound term, and combines it with the ever‑active set for robust screening. The approach includes efficient Hessian updates via the sweep operator, warm starts, and extensions to general convex losses and elastic net, along with strategies to reduce KKT checks and integrate Gap Safe screening. Empirical results on simulated and real data show substantial speedups over existing methods, particularly in settings with high predictor correlation, while highlighting memory trade‑offs and practical considerations for very large problems.
Abstract
Predictor screening rules, which discard predictors before fitting a model, have had considerable impact on the speed with which sparse regression problems, such as the lasso, can be solved. In this paper we present a new screening rule for solving the lasso path: the Hessian Screening Rule. The rule uses second-order information from the model to provide both effective screening, particularly in the case of high correlation, as well as accurate warm starts. The proposed rule outperforms all alternatives we study on simulated data sets with both low and high correlation for $\ell_1$-regularized least-squares (the lasso) and logistic regression. It also performs best in general on the real data sets that we examine.
