Majorization-Minimization Networks for Inverse Problems: An Application to EEG Imaging
Le Minh Triet Tran, Sarah Reynaud, Ronan Fablet, Adrien Merlini, François Rousseau, Mai Quyen Pham
TL;DR
The paper addresses ill-posed inverse problems by introducing a learned Majorization-Minimization (MM) framework that learns a curvature majorant rather than a full optimizer, preserving MM descent guarantees. The curvature model is parameterized by a lightweight recurrent architecture and can enforce valid quadratic majorants through analytic diagonal bounds for cosine similarity losses and Hessian-vector product-based estimates for general losses. The approach is embedded in a bilevel optimization setup, yielding a provably convergent lower-level solver within the upper-level learning loop. Applied to EEG source imaging, the method achieves superior accuracy, stability, and cross-dataset generalization compared with deep-unrolled and meta-learning baselines, indicating robust optimization geometry transfer across datasets and temporal dynamics.
Abstract
Inverse problems are often ill-posed and require optimization schemes with strong stability and convergence guarantees. While learning-based approaches such as deep unrolling and meta-learning achieve strong empirical performance, they typically lack explicit control over descent and curvature, limiting robustness. We propose a learned Majorization-Minimization (MM) framework for inverse problems within a bilevel optimization setting. Instead of learning a full optimizer, we learn a structured curvature majorant that governs each MM step while preserving classical MM descent guarantees. The majorant is parameterized by a lightweight recurrent neural network and explicitly constrained to satisfy valid MM conditions. For cosine-similarity losses, we derive explicit curvature bounds yielding diagonal majorants. When analytic bounds are unavailable, we rely on efficient Hessian-vector product-based spectral estimation to automatically upper-bound local curvature without forming the Hessian explicitly. Experiments on EEG source imaging demonstrate improved accuracy, stability, and cross-dataset generalization over deep-unrolled and meta-learning baselines.
