Optimal time sampling in physics-informed neural networks
Gabriel Turinici
TL;DR
This work analyzes how temporal sampling in physics-informed neural networks (PINNs) should be weighted to minimize the final-time error under a finite computational budget. By examining a linear ODE surrogate and leveraging convergence properties, it proves that the optimal time-weighting is a truncated exponential distribution with rate $4\lambda/3$, where $\lambda$ is a Lyapunov-like exponent, and shows the optimal instantaneous error profile $w(t)\propto e^{-\lambda (T-t)/3}$. Numerical experiments on a linear ODE, Burgers' equation, and the Lorenz system validate the theory: chaotic or highly sensitive regimes favor heavier weighting of early times, while stable or parabolic cases benefit less from such weighting. The results provide a principled basis for time-sampling strategies in PINNs and highlight the practical need to estimate the appropriate rate parameter, potentially via hyperparameter optimization or adaptive schemes. Overall, the paper connects dynamical-system concepts to PINN training efficiency and offers actionable guidance for improving accuracy under limited compute.
Abstract
Physics-informed neural networks (PINN) is a extremely powerful paradigm used to solve equations encountered in scientific computing applications. An important part of the procedure is the minimization of the equation residual which includes, when the equation is time-dependent, a time sampling. It was argued in the literature that the sampling need not be uniform but should overweight initial time instants, but no rigorous explanation was provided for this choice. In the present work we take some prototypical examples and, under standard hypothesis concerning the neural network convergence, we show that the optimal time sampling follows a (truncated) exponential distribution. In particular we explain when is best to use uniform time sampling and when one should not. The findings are illustrated with numerical examples on linear equation, Burgers' equation and the Lorenz system.
