KAN See In the Dark
Aoxiang Ning, Minglong Xue, Jinhong He, Chengyun Song
TL;DR
This paper tackles LLIE by addressing nonlinear degradation factors arising from uneven illumination and noise. It introduces KSID, a diffusion-based LLIE framework that embeds a KAN-Block to learn nonlinear mappings with improved interpretability, augmented by a Frequency Domain Perception Module to stabilize the diffusion process and sharpen high-frequency details. The authors provide a Kolmogorov-Arnold theorem grounded rationale for using KANs, detail a two-phase training scheme, and demonstrate strong quantitative and qualitative performance on LLIE benchmarks, including state-of-the-art results on several datasets. The work highlights the potential of interpretable, non-linear architectures in low-level vision tasks and opens avenues for integrating frequency-domain cues into diffusion-based image enhancement.
Abstract
Existing low-light image enhancement methods are difficult to fit the complex nonlinear relationship between normal and low-light images due to uneven illumination and noise effects. The recently proposed Kolmogorov-Arnold networks (KANs) feature spline-based convolutional layers and learnable activation functions, which can effectively capture nonlinear dependencies. In this paper, we design a KAN-Block based on KANs and innovatively apply it to low-light image enhancement. This method effectively alleviates the limitations of current methods constrained by linear network structures and lack of interpretability, further demonstrating the potential of KANs in low-level vision tasks. Given the poor perception of current low-light image enhancement methods and the stochastic nature of the inverse diffusion process, we further introduce frequency-domain perception for visually oriented enhancement. Extensive experiments demonstrate the competitive performance of our method on benchmark datasets. The code will be available at: https://github.com/AXNing/KSID}{https://github.com/AXNing/KSID.
