Occlusion Sensitivity Analysis with Augmentation Subspace Perturbation in Deep Feature Space
Pedro Valois, Koichiro Niinuma, Kazuhiro Fukui
TL;DR
This work addresses the challenge of explainability in vision models by introducing Occlusion Sensitivity Analysis with Deep Feature Augmentation Subspaces (OSA-DAS). The method augments traditional occlusion analysis by generating multiple augmented views and representing both the reference and occluded inputs as low-dimensional subspaces in the deep feature space, then quantifying occlusion impact via subspace distance using canonical angles. Key contributions include a data-augmentation–driven subspace framework, a formalized orthogonal-degree distance for attribution, a gradient-informed masking speedup, and a Minimal Size plus Overall performance metric to evaluate explanations. Experiments on ImageNet-1k across ResNet-50, ViT-B, and Swin-V2 show that OSA-DAS yields more robust, class-agnostic explanations with competitive or superior overall metrics compared to standard interpreters, albeit with higher memory requirements. This approach enhances trust in AI systems by tying explanations to training-time augmentations and deep feature representations, offering a scalable, architecture-agnostic path to reliable model interpretability.
Abstract
Deep Learning of neural networks has gained prominence in multiple life-critical applications like medical diagnoses and autonomous vehicle accident investigations. However, concerns about model transparency and biases persist. Explainable methods are viewed as the solution to address these challenges. In this study, we introduce the Occlusion Sensitivity Analysis with Deep Feature Augmentation Subspace (OSA-DAS), a novel perturbation-based interpretability approach for computer vision. While traditional perturbation methods make only use of occlusions to explain the model predictions, OSA-DAS extends standard occlusion sensitivity analysis by enabling the integration with diverse image augmentations. Distinctly, our method utilizes the output vector of a DNN to build low-dimensional subspaces within the deep feature vector space, offering a more precise explanation of the model prediction. The structural similarity between these subspaces encompasses the influence of diverse augmentations and occlusions. We test extensively on the ImageNet-1k, and our class- and model-agnostic approach outperforms commonly used interpreters, setting it apart in the realm of explainable AI.
