On Vector Field Reconstruction from Noisy ODE in High Ambient Dimension
Hugo Henneuse
TL;DR
This work studies nonparametric reconstruction of the vector field $f$ in autonomous ODEs $y' = f(y)$ in high ambient dimension $D$ when the initial conditions lie on a low-dimensional structure. It proposes a regression-like estimator that alternates flow estimation, derivative estimation, and nearest-neighbor averaging to recover $f$ on the envelope $\operatorname{Env}_f(\mathcal X, T)$ from noisy observations. The authors derive minimax convergence rates that depend on the temporal grid, the number of trajectories, and the mass-concentration parameter $b$ of the initial-value distribution, showing these rates do not depend on $D$ and are optimal up to logarithmic factors; they also provide geometric corollaries under a manifold assumption and validate performance via numerical experiments in both low- and high-dimensional settings. The results offer a simple, scalable baseline for vector-field estimation in high dimensions and connect to manifold learning to mitigate the curse of dimensionality.
Abstract
This work investigates the nonparametric estimation of the vector field of a noisy Ordinary Differential Equation (ODE) in high-dimensional ambient spaces, under the assumption that the initial conditions are sampled from a lower-dimensional structure. Specifically, let \( f:\mathbb{R}^{D}\to\mathbb{R}^{D} \) denote the vector field of the autonomous ODE \( y' = f(y) \). We observe noisy trajectories \( \tilde{y}_{X_i}(t_j) = y_{X_i}(t_j) + \varepsilon_{i,j} \), where \( y_{X_i}(t_j) \) is the solution at time \( t_j \) with initial condition \( y(0)=X_i \), the \( X_i \) are drawn from a \((a,b)\)-standard distribution \( μ\), and \( \varepsilon_{i,j} \) denotes noise. From a minimax perspective, we study the reconstruction of \( f \) within the envelope of trajectories generated by the support of \( μ\). We proposed an estimator combining flow reconstruction with derivative estimation techniques from nonparametric regression. Under mild regularity assumptions on \( f \), we establish convergence rates that depend on the temporal resolution, the number of initial conditions, and the parameter \( b \), which controls the mass concentration of \( μ\). These rates are then shown to be minimax optimal (up to logarithmic factors) and illustrate how the proposed approach mitigates the curse of dimensionality. Additionally, we illustrate the computational and statistical efficiency of our estimator through numerical experiments.
