Neural Tangent Kernel of Neural Networks with Loss Informed by Differential Operators

Weiye Gan; Yicheng Li; Qian Lin; Zuoqiang Shi

Neural Tangent Kernel of Neural Networks with Loss Informed by Differential Operators

Weiye Gan, Yicheng Li, Qian Lin, Zuoqiang Shi

TL;DR

The paper addresses spectral bias in physics-informed neural networks by developing a Neural Tangent Kernel (NTK) framework for deep nets trained with a loss informed by differential operators. It derives that the NTK for the physics-informed loss factors as $K_{\mathcal{T}}^{NT}(x,x') = \mathcal{T}_x\mathcal{T}_{x'}K^{NT}(x,x')$, linking the operator-informed dynamics to the standard NTK, and proves convergence properties of the NTK both at initialization and during training in the wide-network limit. The results show that, in most cases, the differential operator in the loss does not induce a faster eigenvalue decay nor a stronger spectral bias, supported by perturbation arguments and empirical experiments. The findings suggest that improving PINP performance from a spectral-bias perspective is better achieved by balancing the loss components rather than adding higher-order differential operators, with implications for training efficiency and generalization in PDE-solving contexts.

Abstract

Spectral bias is a significant phenomenon in neural network training and can be explained by neural tangent kernel (NTK) theory. In this work, we develop the NTK theory for deep neural networks with physics-informed loss, providing insights into the convergence of NTK during initialization and training, and revealing its explicit structure. We find that, in most cases, the differential operators in the loss function do not induce a faster eigenvalue decay rate and stronger spectral bias. Some experimental results are also presented to verify the theory.

Neural Tangent Kernel of Neural Networks with Loss Informed by Differential Operators

TL;DR

, linking the operator-informed dynamics to the standard NTK, and proves convergence properties of the NTK both at initialization and during training in the wide-network limit. The results show that, in most cases, the differential operator in the loss does not induce a faster eigenvalue decay nor a stronger spectral bias, supported by perturbation arguments and empirical experiments. The findings suggest that improving PINP performance from a spectral-bias perspective is better achieved by balancing the loss components rather than adding higher-order differential operators, with implications for training efficiency and generalization in PDE-solving contexts.

Neural Tangent Kernel of Neural Networks with Loss Informed by Differential Operators

TL;DR

Abstract

Neural Tangent Kernel of Neural Networks with Loss Informed by Differential Operators

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (38)