Classification by sparse generalized additive models
Felix Abramovich
TL;DR
The paper develops sparse generalized additive model (SpAM) classifiers for binary outcomes by minimizing the logistic loss with sparse group Lasso/Slope penalties on univariate additive components expanded in orthonormal bases. The proposed approach is adaptive to unknown sparsity and smoothness, and the authors prove nearly-minimax misclassification risk across analytic, Sobolev, and Besov function classes under a sparse group restricted eigenvalue condition, with a phase transition between sparse and dense regimes. They provide upper and lower bounds that match up to log factors and validate the theory with simulations and a real-data example (email spam). The work advances nonparametric, scalable classification in high dimensions by enabling simultaneous feature selection and smooth function estimation within a convex optimization framework.
Abstract
We consider (nonparametric) sparse (generalized) additive models (SpAM) for classification. The design of a SpAM classifier is based on minimizing the logistic loss with a sparse group Lasso/Slope-type penalties on the coefficients of univariate additive components' expansions in orthonormal series (e.g., Fourier or wavelets). The resulting classifier is inherently adaptive to the unknown sparsity and smoothness. We show that under certain sparse group restricted eigenvalue condition it is nearly-minimax (up to log-factors) simultaneously across the entire range of analytic, Sobolev and Besov classes. The performance of the proposed classifier is illustrated on a simulated and a real-data examples.
