DXAI: Explaining Classification by Image Decomposition
Elnatan Kadar, Guy Gilboa
TL;DR
The paper tackles explainable AI for image classification when discriminative cues are dense or additive by introducing DXAI, a decomposition-based framework that splits an input $x$ into a class-agnostic part $\psi_{Agnostic}$ and a class-distinct part $\psi_{Distinct}$ so that $x=\psi_{Agnostic}+\psi_{Distinct}$. An approximate solution employs multi-branch style-transfer GANs with an $\alpha$-blending mechanism to isolate class-specific features, guided by a pre-trained classifier $C$ and a shared multi-head discriminator that also performs classification. Experiments on diverse datasets demonstrate high-resolution, multi-channel explanations that capture color and texture cues beyond what heatmaps provide, outperforming several baselines on objective fidelity metrics. The approach offers a new lens for XAI, with potential extensions to diffusion-based generators, while trade-offs include training complexity and the lack of a natural pixel-wise importance ranking.
Abstract
We propose a new way to explain and to visualize neural network classification through a decomposition-based explainable AI (DXAI). Instead of providing an explanation heatmap, our method yields a decomposition of the image into class-agnostic and class-distinct parts, with respect to the data and chosen classifier. Following a fundamental signal processing paradigm of analysis and synthesis, the original image is the sum of the decomposed parts. We thus obtain a radically different way of explaining classification. The class-agnostic part ideally is composed of all image features which do not posses class information, where the class-distinct part is its complementary. This new visualization can be more helpful and informative in certain scenarios, especially when the attributes are dense, global and additive in nature, for instance, when colors or textures are essential for class distinction. Code is available at https://github.com/dxai2024/dxai.
