Explaning with trees: interpreting CNNs using hierarchies
Caroline Mazini Rodrigues, Nicolas Boutry, Laurent Najman
TL;DR
This paper introduces xAiTrees, a hierarchical-region explainability framework for CNNs that balances fidelity to model reasoning with human interpretability. It leverages multiscale segmentation (via Binary Partition Trees and Hierarchical Watershed) and both human-based and model-based region attributions to produce explanations, further enhanced by a persistence-based tree shaping and aggregation into a final visualization. A novel Pixel Impact Rate (PIR) metric is proposed to quantify per-pixel explanatory power, and extensive experiments across Cat-vs-Dog, CIFAR-10, and ImageNet demonstrate competitive fidelity with strong bias-detection performance, outperforming several baselines. The work includes a human-subject study validating interpretability gains and provides open-source code to encourage adoption and reproducibility.
Abstract
Challenges persist in providing interpretable explanations for neural network reasoning in explainable AI (xAI). Existing methods like Integrated Gradients produce noisy maps, and LIME, while intuitive, may deviate from the model's reasoning. We introduce a framework that uses hierarchical segmentation techniques for faithful and interpretable explanations of Convolutional Neural Networks (CNNs). Our method constructs model-based hierarchical segmentations that maintain the model's reasoning fidelity and allows both human-centric and model-centric segmentation. This approach offers multiscale explanations, aiding bias identification and enhancing understanding of neural network decision-making. Experiments show that our framework, xAiTrees, delivers highly interpretable and faithful model explanations, not only surpassing traditional xAI methods but shedding new light on a novel approach to enhancing xAI interpretability.
