Explaining Human Comparisons using Alignment-Importance Heatmaps
Nhut Truong, Dario Pesenti, Uri Hasson
TL;DR
This paper introduces Alignment Importance Score (AIS) heatmaps to explain human similarity judgments (HSJs) by identifying which DNN feature maps drive alignment with human perception. AIS is computed by mask-based perturbations in the final convolutional layer and selecting a subset of feature maps that maximizes cross-domain predictability of HSJs; heatmaps are then generated to visualize image regions tied to high AIS. Across multiple architectures and datasets, AIS-based pruning improves out-of-sample HSJ prediction and reveals that comparison-relevant information can diverge from visually salient regions. The study also quantifies how AIS heatmaps relate to traditional saliency maps, finding meaningful overlap in some cases (notably Animals) but important divergences in others, and demonstrates broad generalization of AIS across architectures, informing how to build interpretable explanations for human comparisons.
Abstract
We present a computational explainability approach for human comparison tasks, using Alignment Importance Score (AIS) heatmaps derived from deep-vision models. The AIS reflects a feature-map's unique contribution to the alignment between Deep Neural Network's (DNN) representational geometry and that of humans. We first validate the AIS by showing that prediction of out-of-sample human similarity judgments is improved when constructing representations using only higher-scoring AIS feature maps identified from a training set. We then compute image-specific heatmaps that visually indicate the areas that correspond to feature-maps with higher AIS scores. These maps provide an intuitive explanation of which image areas are more important when it is compared to other images in a cohort. We observe a correspondence between these heatmaps and saliency maps produced by a gaze-prediction model. However, in some cases, meaningful differences emerge, as the dimensions relevant for comparison are not necessarily the most visually salient. To conclude, Alignment Importance improves prediction of human similarity judgments from DNN embeddings, and provides interpretable insights into the relevant information in image space.
