Stability in Training PINNs for Stiff PDEs: Why Initial Conditions Matter
Baoli Hao, Ulisses Braga-Neto, Chun Liu, Lifan Wang, Ming Zhong
TL;DR
The paper tackles the instability of Physics-Informed Neural Networks (PINNs) on stiff time-dependent PDEs by isolating initial-condition enforcement as the key factor in training stability. It introduces a hard-constraint PINN (HC-PINN) via the transformation $ ilde{u} = oldsymbol{Cpsi} + oldsymbol{Cphi}u_{nn}$ that exactly embeds initial and boundary data, reducing the multi-objective loss to a single PDE-residual loss and enabling implicit-time-stepping-like training behavior. A Neural Tangent Kernel (NTK) analysis provides a theoretical basis for reduced spectral bias under hard constraints, complemented by well-conditioning conditions on $oldsymbol{Cpsi}$ and $oldsymbol{Cphi}$. Empirical results across seven stiff PDEs, including 2D Allen-Cahn, demonstrate substantial improvements in accuracy and stability over baseline PINNs and other variants, supporting the claim that IC enforcement is essential for reliable PINN solvers in stiff regimes. The work also outlines how HC-PINN can be combined with other techniques (e.g., causal PINNs, residual-based attention, time-marching) to further enhance performance.
Abstract
Training Physics-Informed Neural Networks (PINNs) on stiff time-dependent PDEs remains highly unstable. Through rigorous ablation studies, we identify a surprisingly critical factor: the enforcement of initial conditions. We present the first systematic ablation of two core strategies, hard initial-condition constraints and adaptive loss weighting. Across challenging benchmarks (sharp transitions, higher-order derivatives, coupled systems, and high frequency modes), we find that exact enforcement of initial conditions (ICs) is not optional but essential. Our study demonstrates that stability and efficiency in PINN training fundamentally depend on ICs, paving the way toward more reliable PINN solvers in stiff regimes.
