Input Convex Kolmogorov Arnold Networks

Thomas Deschatre; Xavier Warin

Input Convex Kolmogorov Arnold Networks

Thomas Deschatre, Xavier Warin

TL;DR

This work introduces Input Convex Kolmogorov Arnold Networks (ICKAN) to approximate convex functions while preserving convexity, leveraging the Kolmogorov-Arnold representation. It develops two main variants, P1-ICKAN (piecewise-linear) and Cubic-ICKAN (Hermite cubic), with optional grid adaptation and universal approximation guarantees for the adapted case. The paper provides convergence theorems, analyzes layer construction, and demonstrates competitive performance against standard ICNNs on function-approximation and toy control tasks. It then extends to Partial ICKAN (PICKAN) for partially convex targets and applies ICKAN to optimal transport via a Brenier potential-based formulation, showing favorable results on synthetic data, especially in higher dimensions and tensorized settings. Overall, ICKANs offer a principled, interpretable alternative to ICNNs with potential practical benefits for separable or convex-structure problems in optimization and transport.

Abstract

This article presents an input convex neural network architecture using Kolmogorov-Arnold networks (ICKAN). Two specific networks are presented: the first is based on a low-order, linear-by-part, representation of functions, and a universal approximation theorem is provided. The second is based on cubic splines, for which only numerical results support convergence. We demonstrate on simple tests that these networks perform competitively with classical input convex neural networks (ICNNs). In a second part, we use the networks to solve some optimal transport problems needing a convex approximation of functions and demonstrate their effectiveness. Comparisons with ICNNs show that cubic ICKANs produce results similar to those of classical ICNNs.

Input Convex Kolmogorov Arnold Networks

TL;DR

Abstract

Input Convex Kolmogorov Arnold Networks

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (21)

Theorems & Definitions (9)