TENG: Time-Evolving Natural Gradient for Solving PDEs With Deep Neural Nets Toward Machine Precision
Zhuo Chen, Jacob McCarran, Esteban Vizcaino, Marin Soljačić, Di Luo
TL;DR
The paper addresses accurate neural PDE solving, particularly for initial-value problems, by introducing Time-Evolving Natural Gradient (TENG), which unifies time-dependent variational principles and optimization-based time integration through repeated $u$-space optimizations with tangent-space projections. By deriving efficient algorithms (TENG-Euler and high-order variants like TENG-Heun) and leveraging sparse updates, TENG achieves machine-precision per-step optimization across PDEs such as the heat, Allen-Cahn, and Burgers equations, outperforming TDVP, OBTI, and PINN baselines in accuracy with competitive runtimes. The work also provides a complexity and error analysis, including a reparameterization-invariance result and connections to Gauss-Newton, and demonstrates substantial empirical gains on multi-dimensional benchmarks. Overall, TENG offers a practical, high-precision framework for neural PDE solvers with potential for broad scientific impact and extension to more complex, real-world problems.
Abstract
Partial differential equations (PDEs) are instrumental for modeling dynamical systems in science and engineering. The advent of neural networks has initiated a significant shift in tackling these complexities though challenges in accuracy persist, especially for initial value problems. In this paper, we introduce the $\textit{Time-Evolving Natural Gradient (TENG)}$, generalizing time-dependent variational principles and optimization-based time integration, leveraging natural gradient optimization to obtain high accuracy in neural-network-based PDE solutions. Our comprehensive development includes algorithms like TENG-Euler and its high-order variants, such as TENG-Heun, tailored for enhanced precision and efficiency. TENG's effectiveness is further validated through its performance, surpassing current leading methods and achieving $\textit{machine precision}$ in step-by-step optimizations across a spectrum of PDEs, including the heat equation, Allen-Cahn equation, and Burgers' equation.
