Geometry of Sparsity-Inducing Norms
Jean-Philippe Chancelier, Michel de Lara, Antoine Deza, Lionel Pournin
TL;DR
The paper addresses finding solutions with a priori sparsity budget $k$ by studying generalized $k$-support norms built from a source norm. It develops a geometric-dual framework that connects exposed faces of convex sets generated by $k$-sparse vectors to sparsity in optimal solutions, establishing dual conditions under which minimizers are $k$-sparse. It provides a detailed face analysis for unit balls of generalized $k$-support norms, including orthant-monotonic and orthant-strictly monotonic cases, and offers a complete geometric description of top-$(q,k)$ and $(p,k)$-support norms, especially in the $p=\infty$ and $1<p<\infty$ regimes. The results illuminate how dual information identifies support, and reveal the intricate face and normal-cone structure that underpins sparsity-inducing regularization. Collectively, the work advances the geometric understanding of sparsity-promoting norms and informs the design of penalties that enforce a fixed sparsity budget in optimization.
Abstract
Sparse optimization seeks an optimal solution with few nonzero entries. To achieve this, it is common to add to the criterion a penalty term proportional to the $\ell_1$-norm, which is recognized as the archetype of sparsity-inducing norms. In this approach, the number of nonzero entries is not controlled a priori. By contrast, in this paper, we focus on finding an optimal solution with at most~$k$ nonzero coordinates (or for short, $k$-sparse vectors), where $k$ is a given sparsity level (or ``sparsity budget''). For this purpose, we study the class of generalized $k$-support norms that arise from a given source norm. When added as a penalty term, we provide conditions under which such generalized $k$-support norms promote $k$-sparse solutions. The result follows from an analysis of the exposed faces of closed convex sets generated by $k$-sparse vectors, and of how primal support identification can be deduced from dual information. Finally, we study some of the geometric properties of the unit balls for the $k$-support norms and their dual norms when the source norm belongs to the family of $\ell_p$-norms.
