Table of Contents
Fetching ...

Look-Ahead Screening Rules for the Lasso

Johan Larsson

TL;DR

This paper addresses the computational burden of fitting the lasso across a regularization path in high-dimensional settings. It introduces look-ahead screening, a safe screening strategy based on Gap Safe rules, to discard predictors for a range of $\lambda$ values by exploiting the duality gap $G(\hat{\beta}_\lambda,\hat{\theta}_\lambda;\lambda)$. The approach yields faster solutions of the lasso path, with simulations demonstrating substantial time reductions, particularly at higher signal-to-noise ratios, and early path steps discarding the majority of predictors. The method is solver-agnostic and extensible to other safe and heuristic screening rules, offering practical gains for high-dimensional sparse regression tasks; public code is provided for reproducibility.

Abstract

The lasso is a popular method to induce shrinkage and sparsity in the solution vector (coefficients) of regression problems, particularly when there are many predictors relative to the number of observations. Solving the lasso in this high-dimensional setting can, however, be computationally demanding. Fortunately, this demand can be alleviated via the use of screening rules that discard predictors prior to fitting the model, leading to a reduced problem to be solved. In this paper, we present a new screening strategy: look-ahead screening. Our method uses safe screening rules to find a range of penalty values for which a given predictor cannot enter the model, thereby screening predictors along the remainder of the path. In experiments we show that these look-ahead screening rules outperform the active warm-start version of the Gap Safe rules.

Look-Ahead Screening Rules for the Lasso

TL;DR

This paper addresses the computational burden of fitting the lasso across a regularization path in high-dimensional settings. It introduces look-ahead screening, a safe screening strategy based on Gap Safe rules, to discard predictors for a range of values by exploiting the duality gap . The approach yields faster solutions of the lasso path, with simulations demonstrating substantial time reductions, particularly at higher signal-to-noise ratios, and early path steps discarding the majority of predictors. The method is solver-agnostic and extensible to other safe and heuristic screening rules, offering practical gains for high-dimensional sparse regression tasks; public code is provided for reproducibility.

Abstract

The lasso is a popular method to induce shrinkage and sparsity in the solution vector (coefficients) of regression problems, particularly when there are many predictors relative to the number of observations. Solving the lasso in this high-dimensional setting can, however, be computationally demanding. Fortunately, this demand can be alleviated via the use of screening rules that discard predictors prior to fitting the model, leading to a reduced problem to be solved. In this paper, we present a new screening strategy: look-ahead screening. Our method uses safe screening rules to find a range of penalty values for which a given predictor cannot enter the model, thereby screening predictors along the remainder of the path. In experiments we show that these look-ahead screening rules outperform the active warm-start version of the Gap Safe rules.

Paper Structure

This paper contains 4 sections, 6 equations, 2 figures.

Figures (2)

  • Figure 1: This figure shows the predictors screened at the first step of the lasso path via look-ahead screening for a random sample of 20 predictors from the leukemia data set. A blue square indicates that the corresponding predictor can be discarded at the respective step.
  • Figure 2: Standard box plots of timings to fit a full lasso path to a simulated data set with $n = 100$, $p = 50\,000$, and five true signals.