Cauchy activation function and XNet
Xin Li, Zhihong Xia, Hongkun Zhang
TL;DR
This work introduces the Cauchy activation function, derived from the Cauchy integral theorem, and the XNet architecture to achieve high-precision function approximation with shallower networks. The authors prove a Cauchy Approximation Theorem and a General Approximation Theorem, establishing $O(m^{-k})$ convergence for any $k>0$ and extending universal approximation to higher dimensions. Empirically, Cauchy activation improves performance on MNIST and CIFAR-10, and delivers substantial advantages in solving low- and high-dimensional PDEs, including high-dimensional Allen–Cahn problems, often outperforming PINNs and standard activations while reducing network depth and training time. The results imply significant potential for accurate, efficient scientific computing and CV tasks, where high-order local approximation and rapid convergence are valuable. Overall, the Cauchy activation enables higher precision with simpler architectures, suggesting broad applicability across image processing and computational mathematics.
Abstract
We have developed a novel activation function, named the Cauchy Activation Function. This function is derived from the Cauchy Integral Theorem in complex analysis and is specifically tailored for problems requiring high precision. This innovation has led to the creation of a new class of neural networks, which we call (Comple)XNet, or simply XNet. We will demonstrate that XNet is particularly effective for high-dimensional challenges such as image classification and solving Partial Differential Equations (PDEs). Our evaluations show that XNet significantly outperforms established benchmarks like MNIST and CIFAR-10 in computer vision, and offers substantial advantages over Physics-Informed Neural Networks (PINNs) in both low-dimensional and high-dimensional PDE scenarios.
