Stiff Transfer Learning for Physics-Informed Neural Networks
Emilien Seiler, Wanzhou Lei, Pavlos Protopapas
TL;DR
Stiff differential equations present a major challenge for data-driven solvers, particularly PINNs, due to disparate time scales that cause training instability and inaccuracies. The authors introduce STL-PINN, which trains a Multi-Head PINN in a low-stiff regime and then transfers to a high-stiff regime to obtain a final, one-shot solution, addressing stiffness-induced failure modes and retraining costs. The method leverages a fixed latent representation $H$ to enable fast transfer via a closed-form update for the head weights and uses a perturbation framework for nonlinear terms, achieving superior accuracy and competitive speed against implicit methods across stiff ODEs/PDEs and scalable reparametrizations. The results demonstrate practical impact for rapid parametric simulations and optimization in stiff systems, with STL-PINN outperforming state-of-the-art PINN baselines and offering a viable alternative to traditional solvers in many settings.
Abstract
Stiff differential equations are prevalent in various scientific domains, posing significant challenges due to the disparate time scales of their components. As computational power grows, physics-informed neural networks (PINNs) have led to significant improvements in modeling physical processes described by differential equations. Despite their promising outcomes, vanilla PINNs face limitations when dealing with stiff systems, known as failure modes. In response, we propose a novel approach, stiff transfer learning for physics-informed neural networks (STL-PINNs), to effectively tackle stiff ordinary differential equations (ODEs) and partial differential equations (PDEs). Our methodology involves training a Multi-Head-PINN in a low-stiff regime, and obtaining the final solution in a high stiff regime by transfer learning. This addresses the failure modes related to stiffness in PINNs while maintaining computational efficiency by computing "one-shot" solutions. The proposed approach demonstrates superior accuracy and speed compared to PINNs-based methods, as well as comparable computational efficiency with implicit numerical methods in solving stiff-parameterized linear and polynomial nonlinear ODEs and PDEs under stiff conditions. Furthermore, we demonstrate the scalability of such an approach and the superior speed it offers for simulations involving initial conditions and forcing function reparametrization.
