Scientific Machine Learning with Kolmogorov-Arnold Networks

Salah A. Faroughi; Farinaz Mostajeran; Amin Hamed Mashhadzadeh; Shirko Faroughi

Scientific Machine Learning with Kolmogorov-Arnold Networks

Salah A. Faroughi, Farinaz Mostajeran, Amin Hamed Mashhadzadeh, Shirko Faroughi

TL;DR

This review analyzes Kolmogorov–Arnol'd networks (KANs) as principled alternatives to traditional multilayer perceptrons in scientific machine learning. It synthesizes progress across data-driven, physics-informed, and deep-operator contexts, highlighting how KANs leverage Kolmogorov–Arnol'd representations to decompose high-dimensional mappings into univariate components, often improving interpretability, convergence, and spectral behavior. Key contributions include architectural innovations (two-layer and deep KANs), basis-function design (splines, Chebyshev, wavelets, RBFs), and comparative analyses showing favorable accuracy and efficiency versus MLPs and PINNs, as well as advances in DeepOKAN for operator learning. The work also identifies challenges (computational cost, hyperparameter sensitivity, framework support) and charts directions toward stronger theory, geometry-aware modeling, and industrial-scale validation, underscoring the potential of KAN-based models to deliver robust, mesh-independent, and physically consistent SciML solutions.

Abstract

The field of scientific machine learning, which originally utilized multilayer perceptrons (MLPs), is increasingly adopting Kolmogorov-Arnold Networks (KANs) for data encoding. This shift is driven by the limitations of MLPs, including poor interpretability, fixed activation functions, and difficulty capturing localized or high-frequency features. KANs address these issues with enhanced interpretability and flexibility, enabling more efficient modeling of complex nonlinear interactions and effectively overcoming the constraints associated with conventional MLP architectures. This review categorizes recent progress in KAN-based models across three distinct perspectives: (i) data-driven learning, (ii) physics-informed modeling, and (iii) deep-operator learning. Each perspective is examined through the lens of architectural design, training strategies, application efficacy, and comparative evaluation against MLP-based counterparts. By benchmarking KANs against MLPs, we highlight consistent improvements in accuracy, convergence, and spectral representation, clarifying KANs' advantages in capturing complex dynamics while learning more effectively. In addition to reviewing recent literature, this work also presents several comparative evaluations that clarify central characteristics of KAN modeling and hint at their potential implications for real-world applications. Finally, this review identifies critical challenges and open research questions in KAN development, particularly regarding computational efficiency, theoretical guarantees, hyperparameter tuning, and algorithm complexity. We also outline future research directions aimed at improving the robustness, scalability, and physical consistency of KAN-based frameworks.

Scientific Machine Learning with Kolmogorov-Arnold Networks

TL;DR

Abstract

Scientific Machine Learning with Kolmogorov-Arnold Networks

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (18)