Learning Difference-of-Convex Regularizers for Inverse Problems: A Flexible Framework with Theoretical Guarantees
Yasi Zhang, Oscar Leong
TL;DR
This work tackles ill-posed inverse problems by learning flexible, DC-structured regularizers that balance modeling power with theoretical guarantees. The regularizers are parameterized as a difference of two Input Convex Neural Networks (IDCNs), enabling expressive priors while permitting optimization via Difference-of-Convex Algorithm (DCA) and Proximal Subgradient Method (PSM) with convergence guarantees under mild smoothness or KL conditions. A star-geometry analysis characterizes when optimal regularizers are DC via alpha-homogeneous gauges and dual mixed volumes, establishing conditions under which the learned regularizers admit DC decompositions. Empirical validation on Computed Tomography (CT) reconstruction demonstrates that the proposed ADCR framework, including DCA and PSM variants, achieves state-of-the-art performance in sparse-view and limited-angle settings, highlighting the practical impact of DC-regularized learning for ill-posed imaging tasks.
Abstract
Learning effective regularization is crucial for solving ill-posed inverse problems, which arise in a wide range of scientific and engineering applications. While data-driven methods that parameterize regularizers using deep neural networks have demonstrated strong empirical performance, they often result in highly nonconvex formulations that lack theoretical guarantees. Recent work has shown that incorporating structured nonconvexity into neural network-based regularizers, such as weak convexity, can strike a balance between empirical performance and theoretical tractability. In this paper, we demonstrate that a broader class of nonconvex functions, difference-of-convex (DC) functions, can yield improved empirical performance while retaining strong convergence guarantees. The DC structure enables the use of well-established optimization algorithms, such as the Difference-of-Convex Algorithm (DCA) and a Proximal Subgradient Method (PSM), which extend beyond standard gradient descent. Furthermore, we provide theoretical insights into the conditions under which optimal regularizers can be expressed as DC functions. Extensive experiments on computed tomography (CT) reconstruction tasks show that our approach achieves strong performance across sparse and limited-view settings, consistently outperforming other weakly supervised learned regularizers. Our code is available at \url{https://github.com/YasminZhang/ADCR}.
