Riemannian Flow Matching for Brain Connectivity Matrices via Pullback Geometry
Antoine Collas, Ce Ju, Nicolas Salvy, Bertrand Thirion
TL;DR
This paper introduces DiffeoCFM, a pullback-geometry-based conditional flow matching framework for generating brain connectivity matrices that lie on non-Euclidean manifolds (SPD and correlation matrices). By mapping manifold data to Euclidean space via global diffeomorphisms (log for SPD, normalized Cholesky for Corr), the method trains standard Euclidean CFM and maps samples back, guaranteeing manifold-constraint outputs while avoiding costly Riemannian operations. The approach is theoretically shown to be equivalent to Riemannian CFM on pullback manifolds and is demonstrated on large-scale fMRI and EEG datasets, achieving state-of-the-art quality and classification performance with strong neurophysiological plausibility. The work significantly reduces computational overhead for geometry-aware generative modeling in neuroscience, enabling scalable and reliable synthesis of brain connectivity data for disease analysis and brain-computer interfaces.
Abstract
Generating realistic brain connectivity matrices is key to analyzing population heterogeneity in brain organization, understanding disease, and augmenting data in challenging classification problems. Functional connectivity matrices lie in constrained spaces, such as the set of symmetric positive definite or correlation matrices, that can be modeled as Riemannian manifolds. However, using Riemannian tools typically requires redefining core operations (geodesics, norms, integration), making generative modeling computationally inefficient. In this work, we propose DiffeoCFM, an approach that enables conditional flow matching (CFM) on matrix manifolds by exploiting pullback metrics induced by global diffeomorphisms on Euclidean spaces. We show that Riemannian CFM with such metrics is equivalent to applying standard CFM after data transformation. This equivalence allows efficient vector field learning, and fast sampling with standard ODE solvers. We instantiate DiffeoCFM with two different settings: the matrix logarithm for covariance matrices and the normalized Cholesky decomposition for correlation matrices. We evaluate DiffeoCFM on three large-scale fMRI datasets with more than 4600 scans from 2800 subjects (ADNI, ABIDE, OASIS-3) and two EEG motor imagery datasets with over 30000 trials from 26 subjects (BNCI2014-002 and BNCI2015-001). It enables fast training and achieves state-of-the-art performance, all while preserving manifold constraints. Code: https://github.com/antoinecollas/DiffeoCFM
