Error Analysis and Numerical Algorithm for PDE Approximation with Hidden-Layer Concatenated Physics Informed Neural Networks
Yianxia Qian, Yongchao Zhang, Suchuan Dong
TL;DR
The paper introduces HLConcPINN, a physics-informed neural network framework that uses hidden-layer concatenation and an extended block time marching scheme to solve parabolic and hyperbolic PDEs with provable error control. By enabling arbitrary network depth (≥2) and general smooth activations beyond the first two layers, HLConcPINN preserves representation capacity and theoretical guarantees. The authors prove residual decay and total error bounds that relate the solution error to training and quadrature errors, and validate the theory with extensive numerical experiments on the heat, Burgers', wave, and nonlinear Klein-Gordon equations. The approach significantly improves long-time accuracy and offers a versatile, theory-backed alternative to standard PINN formulations for complex, time-dependent PDEs.
Abstract
We present the hidden-layer concatenated physics informed neural network (HLConcPINN) method, which combines hidden-layer concatenated feed-forward neural networks, a modified block time marching strategy, and a physics informed approach for approximating partial differential equations (PDEs). We analyze the convergence properties and establish the error bounds of this method for two types of PDEs: parabolic (exemplified by the heat and Burgers' equations) and hyperbolic (exemplified by the wave and nonlinear Klein-Gordon equations). We show that its approximation error of the solution can be effectively controlled by the training loss for dynamic simulations with long time horizons. The HLConcPINN method in principle allows an arbitrary number of hidden layers not smaller than two and any of the commonly-used smooth activation functions for the hidden layers beyond the first two, with theoretical guarantees. This generalizes several recent neural-network techniques, which have theoretical guarantees but are confined to two hidden layers in the network architecture and the $\tanh$ activation function. Our theoretical analyses subsequently inform the formulation of appropriate training loss functions for these PDEs, leading to physics informed neural network (PINN) type computational algorithms that differ from the standard PINN formulation. Ample numerical experiments are presented based on the proposed algorithm to validate the effectiveness of this method and confirm aspects of the theoretical analyses.
