Extremal points and sparse optimization for generalized Kantorovich-Rubinstein norms
Marcello Carioni, José A. Iglesias, Daniel Walter
TL;DR
This work addresses inverse-type problems regularized by a generalized Kantorovich-Rubinstein norm to model unbalanced transport between signed Radon measures. The authors derive a precise extremal-point characterization of the KR_p^{\alpha,\beta} unit ball, showing extremals are Dirac masses and rescaled dipoles (δ_x - δ_y)/(β+|x-y|^p) subject to 0<|x-y|^p<2α−β, which enables sharp first-order optimality conditions. Building on this, they develop an accelerated generalized conditional gradient (AGCG) method tailored to infinite-dimensional settings and prove sublinear convergence, with linear convergence under stronger assumptions. Numerical experiments illustrate the method's ability to promote transport near a reference measure while allowing mass creation elsewhere, confirming the theoretical findings. The framework offers a scalable, provably convergent approach to KR-based regularization with potential applications in sparse optimal design and unbalanced transport in imaging and inverse problems.
Abstract
A precise characterization of the extremal points of sublevel sets of nonsmooth penalties provides both detailed information about minimizers, and optimality conditions in general classes of minimization problems involving them. Moreover, it enables the application of accelerated generalized conditional gradient methods for their efficient solution. In this manuscript, this program is adapted to the minimization of a smooth convex fidelity term which is augmented with an unbalanced transport regularization term given in the form of a generalized Kantorovich-Rubinstein norm for Radon measures. More precisely, we show that the extremal points associated to the latter are given by all Dirac delta functionals supported in the spatial domain as well as certain dipoles, i.e., pairs of Diracs with the same mass but with different signs. Subsequently, this characterization is used to derive precise first-order optimality conditions as well as an efficient solution algorithm for which linear convergence is proved under natural assumptions. This behaviour is also reflected in numerical examples for a model problem.
