Relating Piecewise Linear Kolmogorov Arnold Networks to ReLU Networks

Nandi Schoots; Mattia Jacopo Villani; Niels uit de Bos

Relating Piecewise Linear Kolmogorov Arnold Networks to ReLU Networks

Nandi Schoots, Mattia Jacopo Villani, Niels uit de Bos

TL;DR

This work builds an explicit, bidirectional bridge between piecewise linear Kolmogorov-Arnold Networks (KANs) and ReLU networks. It provides constructive conversions in both directions, showing that any ReLU network can be represented as a KAN and vice versa, with quantified effects on depth, width, and nonzero parameters: ReLU→KAN introduces a linear parameter overhead $O(\sum_i n_i)$, while KAN→ReLU preserves parameter count aside from width expansion by a factor up to $k$, the number of segments per activation. The authors derive polyhedral-decomposition bounds for both architectures, establishing that KANs yield a finer partition than ReLUs for a given parameter budget, and prove that any piecewise linear function can be represented as a KAN. This bridge enables transferring ReLU-network theory to KANs (e.g., symmetries, initialisation, generalisation bounds) while enabling KAN interpretability via tractable polyhedral analyses and efficient inference through parameter-efficient representations.

Abstract

Kolmogorov-Arnold Networks are a new family of neural network architectures which holds promise for overcoming the curse of dimensionality and has interpretability benefits (arXiv:2404.19756). In this paper, we explore the connection between Kolmogorov Arnold Networks (KANs) with piecewise linear (univariate real) functions and ReLU networks. We provide completely explicit constructions to convert a piecewise linear KAN into a ReLU network and vice versa.

Relating Piecewise Linear Kolmogorov Arnold Networks to ReLU Networks

TL;DR

, while KAN→ReLU preserves parameter count aside from width expansion by a factor up to

, the number of segments per activation. The authors derive polyhedral-decomposition bounds for both architectures, establishing that KANs yield a finer partition than ReLUs for a given parameter budget, and prove that any piecewise linear function can be represented as a KAN. This bridge enables transferring ReLU-network theory to KANs (e.g., symmetries, initialisation, generalisation bounds) while enabling KAN interpretability via tractable polyhedral analyses and efficient inference through parameter-efficient representations.

Relating Piecewise Linear Kolmogorov Arnold Networks to ReLU Networks

TL;DR

Abstract

Relating Piecewise Linear Kolmogorov Arnold Networks to ReLU Networks

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (25)