Structured Regularization for Constrained Optimization on the SPD Manifold
Andrew Cheng, Melanie Weber
TL;DR
This work introduces a modular, symmetry-based regularization framework for constrained optimization on the SPD manifold, leveraging symmetric gauge functions to design sparsity and ball-neighborhood regularizers. By preserving geodesic convexity and enabling difference-of-convex (DC) structure, the authors enable the use of Euclidean convex-concave procedures (CCCP) to solve constrained geodesically convex problems efficiently, often with simpler subroutines than traditional Riemannian methods. They provide a convergence and complexity analysis, plus a broad set of applications (square roots, Karcher means, optimistic likelihoods, and SPD regression) and extensive experiments demonstrating speed and robustness advantages over standard Riemannian solvers. The approach offers a flexible, scalable pathway to incorporate structure and prior information into SPD optimization, with potential extensions to other Cartan-Hadamard manifolds and a range of regularizers beyond the two primary SG-based classes.
Abstract
Matrix-valued optimization tasks, including those involving symmetric positive definite (SPD) matrices, arise in a wide range of applications in machine learning, data science and statistics. Classically, such problems are solved via constrained Euclidean optimization, where the domain is viewed as a Euclidean space and the structure of the matrices (e.g., positive definiteness) enters as constraints. More recently, geometric approaches that leverage parametrizations of the problem as unconstrained tasks on the corresponding matrix manifold have been proposed. While they exhibit algorithmic benefits in many settings, they cannot directly handle additional constraints, such as inequality or sparsity constraints. A remedy comes in the form of constrained Riemannian optimization methods, notably, Riemannian Frank-Wolfe and Projected Gradient Descent. However, both algorithms require potentially expensive subroutines that can introduce computational bottlenecks in practise. To mitigate these shortcomings, we introduce a class of structured regularizers, based on symmetric gauge functions, which allow for solving constrained optimization on the SPD manifold with faster unconstrained methods. We show that our structured regularizers can be chosen to preserve or induce desirable structure, in particular convexity and "difference of convex" structure. We demonstrate the effectiveness of our approach in numerical experiments.
