On the study of frequency control and spectral bias in Wavelet-Based Kolmogorov Arnold networks: A path to physics-informed KANs
Juan Daniel Meshir, Abel Palafox, Edgar Alejandro Guerrero
TL;DR
The paper tackles spectral bias in differential-equation solving networks by introducing Wavelet Kolmogorov Arnold Networks (Wav-KANs) and analyzing them through the Neural Tangent Kernel (NTK). It shows that the decay of NTK eigenvalues can be controlled by the mother wavelet frequency, such as the Morlet wavelet with parameter $b$, enabling balanced learning of low- and high-frequency components. The authors provide theoretical propositions and empirical demonstrations that adjusting $b$ or increasing hidden units mitigates spectral bias, and extend the approach to physics-informed neural networks as Wav-KINNs, which solve Poisson, Heat, Wave, and Helmholtz equations with fewer parameters and without domain decomposition. They report robust performance for several PDE types but note loss-term imbalance can hinder certain problems, indicating future work on loss balancing and a full NTK theory for wavelet-based networks.
Abstract
Spectral bias, the tendency of neural networks to prioritize learning low-frequency components of functions during the initial training stages, poses a significant challenge when approximating solutions with high-frequency details. This issue is particularly pronounced in physics-informed neural networks (PINNs), widely used to solve differential equations that describe physical phenomena. In the literature, contributions such as Wavelet Kolmogorov Arnold Networks (Wav-KANs) have demonstrated promising results in capturing both low- and high-frequency components. Similarly, Fourier features (FF) are often employed to address this challenge. However, the theoretical foundations of Wav-KANs, particularly the relationship between the frequency of the mother wavelet and spectral bias, remain underexplored. A more in-depth understanding of how Wav-KANs manage high-frequency terms could offer valuable insights for addressing oscillatory phenomena encountered in parabolic, elliptic, and hyperbolic differential equations. In this work, we analyze the eigenvalues of the neural tangent kernel (NTK) of Wav-KANs to enhance their ability to converge on high-frequency components, effectively mitigating spectral bias. Our theoretical findings are validated through numerical experiments, where we also discuss the limitations of traditional approaches, such as standard PINNs and Fourier features, in addressing multi-frequency problems.
