Semantic Prioritization in Visual Counterfactual Explanations with Weighted Segmentation and Auto-Adaptive Region Selection

Lintong Zhang; Kang Yin; Seong-Whan Lee

Semantic Prioritization in Visual Counterfactual Explanations with Weighted Segmentation and Auto-Adaptive Region Selection

Lintong Zhang, Kang Yin, Seong-Whan Lee

TL;DR

This work tackles the interpretability gap in fine-grained visual counterfactual explanations by introducing WSAR-Net, a non-generative CE framework that constrains edits to semantically relevant regions and optimizes the editing order. It combines a weighted semantic map, derived from attribution and segmentation, with an auto-adaptive candidate editing sequence to minimize editing of non-semantic units while accelerating class transitions. Key contributions include the weighted semantic map formulation $M_{sem}=S_q\circ M_c$, the sparse editing mechanism via $f(I^*)=(\mathds{1}-\mathbf{a})\circ f(I)+\mathbf{a}\circ P f(I')$, and the dual-stage optimization that scales to multiple distractors through a similarity-guided pruning of edit permutations. Experiments on CUB-200-2011 and Stanford Dogs with ResNet-50 and VGG-16 demonstrate improved semantic coherence (Near-KP, Same-KP) and reduced editing effort, underscoring practical gains in efficient, interpretable counterfactual explanations.

Abstract

In the domain of non-generative visual counterfactual explanations (CE), traditional techniques frequently involve the substitution of sections within a query image with corresponding sections from distractor images. Such methods have historically overlooked the semantic relevance of the replacement regions to the target object, thereby impairing the model's interpretability and hindering the editing workflow. Addressing these challenges, the present study introduces an innovative methodology named as Weighted Semantic Map with Auto-adaptive Candidate Editing Network (WSAE-Net). Characterized by two significant advancements: the determination of an weighted semantic map and the auto-adaptive candidate editing sequence. First, the generation of the weighted semantic map is designed to maximize the reduction of non-semantic feature units that need to be computed, thereby optimizing computational efficiency. Second, the auto-adaptive candidate editing sequences are designed to determine the optimal computational order among the feature units to be processed, thereby ensuring the efficient generation of counterfactuals while maintaining the semantic relevance of the replacement feature units to the target object. Through comprehensive experimentation, our methodology demonstrates superior performance, contributing to a more lucid and in-depth understanding of visual counterfactual explanations.

Semantic Prioritization in Visual Counterfactual Explanations with Weighted Segmentation and Auto-Adaptive Region Selection

TL;DR

Abstract

Semantic Prioritization in Visual Counterfactual Explanations with Weighted Segmentation and Auto-Adaptive Region Selection

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)