Kolmogorov-Arnold Fourier Networks

Jusheng Zhang; Yijia Fan; Kaitong Cai; Keze Wang

Kolmogorov-Arnold Fourier Networks

Jusheng Zhang, Yijia Fan, Kaitong Cai, Keze Wang

TL;DR

KAN offers strong theoretical expressiveness but suffers from parameter explosion and poor high-frequency capture in high dimensions. KAF mitigates these issues by replacing B-spline bases with learnable Random Fourier Features and introducing a GELU-Fourier hybrid activation with adaptive spectral weighting, achieving parameter efficiency and improved spectral representation. The approach demonstrates superior or competitive performance across vision, NLP, audio, and PDE solving, often with fewer parameters and reasonable compute, while maintaining interpretability aspects inspired by Kolmogorov-Arnold theory. These results suggest KAF as a practical, scalable alternative to traditional KAN and MLP-based architectures in high-dimensional learning tasks.

Abstract

Although Kolmogorov-Arnold based interpretable networks (KAN) have strong theoretical expressiveness, they face significant parameter explosion and high-frequency feature capture challenges in high-dimensional tasks. To address this issue, we propose the Kolmogorov-Arnold-Fourier Network (KAF), which effectively integrates trainable Random Fourier Features (RFF) and a novel hybrid GELU-Fourier activation mechanism to balance parameter efficiency and spectral representation capabilities. Our key technical contributions include: (1) merging KAN's dual-matrix structure through matrix association properties to substantially reduce parameters; (2) introducing learnable RFF initialization strategies to eliminate spectral distortion in high-dimensional approximation tasks; (3) implementing an adaptive hybrid activation function that progressively enhances frequency representation during the training process. Comprehensive experiments demonstrate the superiority of our KAF across various domains including vision, NLP, audio processing, and differential equation-solving tasks, effectively combining theoretical interpretability with practical utility and computational efficiency.

Kolmogorov-Arnold Fourier Networks

TL;DR

Abstract

Kolmogorov-Arnold Fourier Networks

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)