Architectural Strategies for the optimization of Physics-Informed Neural Networks
Hemanth Saratchandran, Shin-Fang Chng, Simon Lucey
TL;DR
This work analyzes how neural architectures influence Physics-Informed Neural Network (PINN) optimization through the Neural Tangent Kernel (NTK). It establishes that Gaussian activations yield favorable boundary NTK spectra, with a lower bound on the minimum eigenvalue that scales quartically with layer width, suggesting superior training dynamics for PINNs. The authors further propose equilibrated PINNs, using row-equilibrated weights to condition the loss landscape and improve convergence, supported by theoretical intuition and empirical gains. Across Burgers’ equation, Navier–Stokes, and high-frequency diffusion, Gaussian-based and equilibrated PINNs consistently outperform baselines, offering a practical architectural route to mitigate spectral-bias-related training instability in PINNs.
Abstract
Physics-informed neural networks (PINNs) offer a promising avenue for tackling both forward and inverse problems in partial differential equations (PDEs) by incorporating deep learning with fundamental physics principles. Despite their remarkable empirical success, PINNs have garnered a reputation for their notorious training challenges across a spectrum of PDEs. In this work, we delve into the intricacies of PINN optimization from a neural architecture perspective. Leveraging the Neural Tangent Kernel (NTK), our study reveals that Gaussian activations surpass several alternate activations when it comes to effectively training PINNs. Building on insights from numerical linear algebra, we introduce a preconditioned neural architecture, showcasing how such tailored architectures enhance the optimization process. Our theoretical findings are substantiated through rigorous validation against established PDEs within the scientific literature.
