Convolutional Fully-Connected Capsule Network (CFC-CapsNet): A Novel and Fast Capsule Network
Pouya Shiri, Amirali Baniasadi
TL;DR
This paper addresses the inefficiency of Capsule Networks on complex datasets by introducing the Convolutional Fully-Connected Capsule Network (CFC-CapsNet). It replaces the conventional primary capsule construction with a Convolutional Fully-Connected (CFC) layer that yields significantly fewer capsules yet more expressive representations, resulting in faster training and inference and fewer parameters. The approach, combined with an enhanced class-independent decoder and regularization techniques (capsule dropout and hard training), achieves competitive accuracy on CIFAR-10, SVHN, and Fashion-MNIST, while dramatically reducing parameters (roughly 30% fewer on FMNIST and ~50% on CIFAR-10/SVHN) and speeding up computation (approximately 4x training and 4.5x inference). The work also analyzes parameter sensitivity, robustness to affine transformations, and positions CFC-CapsNet as a practical, lighter variant of CapsNet suitable for real-world applications, with potential extensions to other CapsNet architectures.
Abstract
A Capsule Network (CapsNet) is a relatively new classifier and one of the possible successors of Convolutional Neural Networks (CNNs). CapsNet maintains the spatial hierarchies between the features and outperforms CNNs at classifying images including overlapping categories. Even though CapsNet works well on small-scale datasets such as MNIST, it fails to achieve a similar level of performance on more complicated datasets and real applications. In addition, CapsNet is slow compared to CNNs when performing the same task and relies on a higher number of parameters. In this work, we introduce Convolutional Fully-Connected Capsule Network (CFC-CapsNet) to address the shortcomings of CapsNet by creating capsules using a different method. We introduce a new layer (CFC layer) as an alternative solution to creating capsules. CFC-CapsNet produces fewer, yet more powerful capsules resulting in higher network accuracy. Our experiments show that CFC-CapsNet achieves competitive accuracy, faster training and inference and uses less number of parameters on the CIFAR-10, SVHN and Fashion-MNIST datasets compared to conventional CapsNet.
