Adaptive sieving: A dimension reduction technique for sparse optimization problems
Yancheng Yuan, Meixia Lin, Defeng Sun, Kim-Chuan Toh
TL;DR
The paper introduces adaptive sieving (AS), a dimension-reduction strategy for sparse convex composite problems of the form $\min_x \{\Phi(x)+P(x)\}$, enabling solution of sequences of drastically smaller reduced problems by iteratively pruning inactive features. AS is agnostic to the specific regularizer, relies on proximal mappings, and allows inexact solves, with a proximal residual $R(x)$ guiding termination. The authors provide a finite-termination analysis and develop an AS-based path-generation method for solving $\min_x \{\Phi(x)+\lambda P(x)\}$ across a grid of $\lambda$ values, yielding approximate solutions with controlled residuals. Extensive numerical experiments on synthetic and real data demonstrate substantial accelerations (often tens of times) and robust performance across linear and logistic regression models, including Lasso, SLOPE, exclusive lasso, and sparse group lasso. The results highlight AS’s practical impact for large-scale sparse learning and model selection via efficient solution-path generation.
Abstract
In this paper, we propose an adaptive sieving (AS) strategy for solving general sparse machine learning models by effectively exploring the intrinsic sparsity of the solutions, wherein only a sequence of reduced problems with much smaller sizes need to be solved. We further apply the proposed AS strategy to generate solution paths for large-scale sparse optimization problems efficiently. We establish the theoretical guarantees for the proposed AS strategy including its finite termination property. Extensive numerical experiments are presented in this paper to demonstrate the effectiveness and flexibility of the AS strategy to solve large-scale machine learning models.
