Riemannian Optimization on the Oblique Manifold for Sparse Simplex Constraints via Multiplicative Updates
Flavia Esposito, Andersen Ang
TL;DR
The paper tackles sparse simplex-constrained, low-rank optimization by reformulating the problem on the oblique rotation manifold $\mathcal{OB}(r,n)$ with $\bm{H}=\bm{A}\odot\bm{A}$. It develops a Riemannian Multiplicative Update (RMU) algorithm that preserves nonnegativity and the unit-sum constraint through a metric-retraction step and a split-gradient update, with $Q=\bm{W}^T\bm{W}$ and $P=\bm{W}^T\bm{X}$ used to build the Riemannian gradient and handle the non-differentiable $\ell_1$ term via a subgradient. The method is shown to converge to stationary points on $\mathcal{OB}(r,n)$ and to perform favorably against a Riemannian Conjugate Gradient method and Euclidean heuristics in synthetic experiments, particularly at large scales where RMU's lower per-iteration cost and intrinsic constraint handling yield speed and stability advantages. This makes RMU a practical tool for structured low-rank problems in machine learning, signal processing, and computational biology where sparse simplex constraints are important.)
Abstract
Low-rank optimization problems with sparse simplex constraints involve variables that must satisfy nonnegativity, sparsity, and sum-to-one conditions, making their optimization particularly challenging due to the interplay between low-rank structures and constraints. These problems arise in various applications, including machine learning, signal processing, environmental fields, and computational biology. In this paper, we propose a novel manifold optimization approach to tackle these problems efficiently. Our method leverages the geometry of oblique rotation manifolds to reformulate the problem and introduces a new Riemannian optimization method based on Riemannian gradient descent that strictly maintains the simplex constraints. By exploiting the underlying manifold structure, our approach improves optimization efficiency. Experiments on synthetic datasets compared to standard Euclidean and Riemannian methods show the effectiveness of the proposed method.
