Efficient Manifold-Constrained Neural ODE for High-Dimensional Datasets
Muhao Guo, Haoran Li, Yang Weng
TL;DR
The paper addresses learning continuous-time dynamics in high-dimensional data by exploiting underlying manifold structure. It introduces a manifold-Constrained NODE that maps inputs to a latent manifold $\mathcal{M}$ via a structure-preserving encoder and evolves dynamics on $\mathcal{M}$ with a NODE, using a cross-entropy-based alignment loss together with a supervised loss. A theoretical existence result guarantees well-posed dynamics on the manifold, and joint optimization aligns geometry and task performance. Empirically, the method yields higher accuracy, fewer function evaluations, and faster convergence than baselines across image and time-series datasets, demonstrating a scalable, geometry-aware approach for efficient continuous-time modeling. Overall, this framework provides a principled way to incorporate manifold geometry into NODEs to tackle high-dimensional dynamical systems more efficiently.
Abstract
Neural ordinary differential equations (NODE) have garnered significant attention for their design of continuous-depth neural networks and the ability to learn data/feature dynamics. However, for high-dimensional systems, estimating dynamics requires extensive calculations and suffers from high truncation errors for the ODE solvers. To address the issue, one intuitive approach is to consider the non-trivial topological space of the data distribution, i.e., a low-dimensional manifold. Existing methods often rely on knowledge of the manifold for projection or implicit transformation, restricting the ODE solutions on the manifold. Nevertheless, such knowledge is usually unknown in realistic scenarios. Therefore, we propose a novel approach to explore the underlying manifold to restrict the ODE process. Specifically, we employ a structure-preserved encoder to process data and find the underlying graph to approximate the manifold. Moreover, we propose novel methods to combine the NODE learning with the manifold, resulting in significant gains in computational speed and accuracy. Our experimental evaluations encompass multiple datasets, where we compare the accuracy, number of function evaluations (NFEs), and convergence speed of our model against existing baselines. Our results demonstrate superior performance, underscoring the effectiveness of our approach in addressing the challenges of high-dimensional datasets.
