Curvature Corrected Nonnegative Manifold Data Factorization
Joyce Chew, Willem Diepeveen, Deanna Needell
TL;DR
The paper addresses the challenge of performing interpretable, low-rank factorization on manifold-valued data by introducing curvature corrected nonnegative manifold data factorization (CC-NMDF) for symmetric Riemannian manifolds. It recasts a Euclidean semi-NMF into a tangent-space framework, then applies curvature corrections to better approximate exact manifold reconstructions while remaining computationally tractable. An alternating, multiplicative-update algorithm is developed, with an initialization scheme via tangent-space K-means and corrections to mitigate factor cancellations; the method is validated on diffusion tensor MRI data, showing improved reconstruction and interpretability over tangent-space NMDF and competitive performance against curvature-corrected SVD variants. The work demonstrates how geometry-aware, low-rank decompositions can yield meaningful, identifiable factors that respect the underlying manifold structure, with practical implications for analyzing non-Euclidean scientific data such as DT-MRI.
Abstract
Data with underlying nonlinear structure are collected across numerous application domains, necessitating new data processing and analysis methods adapted to nonlinear domain structure. Riemannanian manifolds present a rich environment in which to develop such tools, as manifold-valued data arise in a variety of scientific settings, and Riemannian geometry provides a solid theoretical grounding for geometric data analysis. Low-rank approximations, such as nonnegative matrix factorization (NMF), are the foundation of many Euclidean data analysis methods, so adaptations of these factorizations for manifold-valued data are important building blocks for further development of manifold data analysis. In this work, we propose curvature corrected nonnegative manifold data factorization (CC-NMDF) as a geometry-aware method for extracting interpretable factors from manifold-valued data, analogous to nonnegative matrix factorization. We develop an efficient iterative algorithm for computing CC-NMDF and demonstrate our method on real-world diffusion tensor magnetic resonance imaging data.
