The Dawn of KAN in Image-to-Image (I2I) Translation: Integrating Kolmogorov-Arnold Networks with GANs for Unpaired I2I Translation
Arpan Mahara, Naphtali D. Rishe, Liangdong Deng
TL;DR
This paper tackles unpaired image-to-image translation by integrating Kolmogorov-Arnold Networks (KAN) into the CUT framework, replacing the traditional two-layer MLP with a two-layer efficient KAN. The authors design a KAN-CUT model that leverages B-splines, concatenation with GLUs, and PatchNCE-based contrastive learning within a GAN setup to produce higher-quality target-domain images. They demonstrate through experiments on Horse→Zebra and Cat→Dog that KAN-CUT achieves superior FID scores and improved visual fidelity compared to established baselines, validating the potential of KAN in generative AI tasks. The work highlights the broader impact of KAN on representation learning for generative models and suggests further exploration across more datasets and tasks.
Abstract
Image-to-Image translation in Generative Artificial Intelligence (Generative AI) has been a central focus of research, with applications spanning healthcare, remote sensing, physics, chemistry, photography, and more. Among the numerous methodologies, Generative Adversarial Networks (GANs) with contrastive learning have been particularly successful. This study aims to demonstrate that the Kolmogorov-Arnold Network (KAN) can effectively replace the Multi-layer Perceptron (MLP) method in generative AI, particularly in the subdomain of image-to-image translation, to achieve better generative quality. Our novel approach replaces the two-layer MLP with a two-layer KAN in the existing Contrastive Unpaired Image-to-Image Translation (CUT) model, developing the KAN-CUT model. This substitution favors the generation of more informative features in low-dimensional vector representations, which contrastive learning can utilize more effectively to produce high-quality images in the target domain. Extensive experiments, detailed in the results section, demonstrate the applicability of KAN in conjunction with contrastive learning and GANs in Generative AI, particularly for image-to-image translation. This work suggests that KAN could be a valuable component in the broader generative AI domain.
