Analytic Convolutional Layer: A Step to Analytic Neural Network
Jingmao Cui, Donglai Tao, Linmi Tao, Ruiyang Liu, Yu Cheng
TL;DR
The paper addresses the data-driven parameter explosion and limited interpretability of traditional CNNs by proposing the Analytic Convolutional Layer (ACL), a model-driven layer that blends Analytic Kernel Functions (ACKs) with plain kernels. AKPs parameterize each analytic kernel, enabling adaptive representation of feature spaces while maintaining a compact parameter budget; ACLs can also approximate pretrained kernels with far fewer parameters, supporting compression and interpretability. Experiments across datasets (Oxford Flowers, MNIST, Food-101, CIFAR-10) demonstrate that ACLs achieve competitive accuracy with substantial parameter reductions, and AnaNNs (Analytic Neural Networks) show improved explainability and maintain performance gains in several configurations. The work establishes a foundation for analytic neural networks, enabling principled integration of prior knowledge, interpretability, and potential efficiency advantages, with public code to foster replication and further exploration.
Abstract
The prevailing approach to embedding prior knowledge within convolutional layers typically includes the design of steerable kernels or their modulation using designated kernel banks. In this study, we introduce the Analytic Convolutional Layer (ACL), an innovative model-driven convolutional layer, which is a mosaic of analytical convolution kernels (ACKs) and traditional convolution kernels. ACKs are characterized by mathematical functions governed by analytic kernel parameters (AKPs) learned in training process. Learnable AKPs permit the adaptive update of incorporated knowledge to align with the features representation of data. Our extensive experiments demonstrate that the ACLs not only have a remarkable capacity for feature representation with a reduced number of parameters but also attain increased reliability through the analytical formulation of ACKs. Furthermore, ACLs offer a means for neural network interpretation, thereby paving the way for the intrinsic interpretability of neural network. The source code will be published in company with the paper.
