Diffeomorphic interpolation for efficient persistence-based topological optimization
Mathieu Carriere, Marc Theveneau, Théo Lacombe
TL;DR
This work tackles the extreme sparsity of gradients in persistence-based topological optimization by introducing a diffeomorphic interpolation that extends the sparse gradient $\nabla L(X)$ to a smooth vector field $\tilde{v}$ on $\mathbb{R}^d$, ensuring descent via a flow $\dot X=-\tilde{v}_t(X)$. Constructed in a Gaussian RKHS, $\tilde{v}$ interpolates the nonzero gradient entries on index set $I$ and preserves topological updates while providing a global, Lipschitz-bounded operator. The approach scales with subsampling, enabling linear-time updates to the full input and allowing re-use on new data, including latent spaces of pre-trained black-box autoencoders, with the ability to sample topologically-regular representations by reversing the learned flow. Empirical results show faster convergence than vanilla gradients, substantial scalability to large point clouds (e.g., the Stanford Bunny) when combined with subsampling, and successful regularization of latent spaces in black-box AE models, yielding improved interpretability. The method offers a practical, theoretically grounded route to integrate topology-aware priors into large-scale data analysis and model regularization tasks.
Abstract
Topological Data Analysis (TDA) provides a pipeline to extract quantitative topological descriptors from structured objects. This enables the definition of topological loss functions, which assert to what extent a given object exhibits some topological properties. These losses can then be used to perform topological optimizationvia gradient descent routines. While theoretically sounded, topological optimization faces an important challenge: gradients tend to be extremely sparse, in the sense that the loss function typically depends on only very few coordinates of the input object, yielding dramatically slow optimization schemes in practice.Focusing on the central case of topological optimization for point clouds, we propose in this work to overcome this limitation using diffeomorphic interpolation, turning sparse gradients into smooth vector fields defined on the whole space, with quantifiable Lipschitz constants. In particular, we show that our approach combines efficiently with subsampling techniques routinely used in TDA, as the diffeomorphism derived from the gradient computed on a subsample can be used to update the coordinates of the full input object, allowing us to perform topological optimization on point clouds at an unprecedented scale. Finally, we also showcase the relevance of our approach for black-box autoencoder (AE) regularization, where we aim at enforcing topological priors on the latent spaces associated to fixed, pre-trained, black-box AE models, and where we show thatlearning a diffeomorphic flow can be done once and then re-applied to new data in linear time (while vanilla topological optimization has to be re-run from scratch). Moreover, reverting the flow allows us to generate data by sampling the topologically-optimized latent space directly, yielding better interpretability of the model.
