DCNFIS: Deep Convolutional Neuro-Fuzzy Inference System
Mojtaba Yeganejou, Kimia Honari, Ryan Kluzinski, Scott Dick, Michael Lipsett, James Miller
TL;DR
DCNFIS addresses interpretability in deep learning by replacing the CNN classifier head with a modified ANFIS, enabling end-to-end training and fuzzy-rule–based explanations. It achieves accuracy on MNIST, Fashion-MNIST, CIFAR-10/100 comparable to the base CNNs and scales to ImageNet with notable gains in interpretability via medoid-based saliency maps, while maintaining competitive performance with state-of-the-art fuzzy methods, as evidenced by $O(N_C \cdot N_V)$ classifier complexity. The approach provides practical explanation mechanisms grounded in fuzzy rules and saliency, enabling debugging and analysis on datasets like Fashion-MNIST and potential counterfactual reasoning, with demonstrated improvements in global class separation in latent spaces. Collectively, the work advances transparent, accurate deep learning by integrating fuzzy inference into CNN backbones and lays groundwork for broader application in large-scale vision tasks and explainable AI research.
Abstract
A key challenge in eXplainable Artificial Intelligence is the well-known tradeoff between the transparency of an algorithm (i.e., how easily a human can directly understand the algorithm, as opposed to receiving a post-hoc explanation), and its accuracy. We report on the design of a new deep network that achieves improved transparency without sacrificing accuracy. We design a deep convolutional neuro-fuzzy inference system (DCNFIS) by hybridizing fuzzy logic and deep learning models and show that DCNFIS performs as accurately as existing convolutional neural networks on four well-known datasets and 3 famous architectures. Our performance comparison with available fuzzy methods show that DCNFIS is now state-of-the-art fuzzy system and outperforms other shallow and deep fuzzy methods to the best of our knowledge. At the end, we exploit the transparency of fuzzy logic by deriving explanations, in the form of saliency maps, from the fuzzy rules encoded in the network to take benefit of fuzzy logic upon regular deep learning methods. We investigate the properties of these explanations in greater depth using the Fashion-MNIST dataset.
