Owen-based Semantics and Hierarchy-Aware Explanation (O-Shap)
Xiangyu Zhou, Chenhan Xiao, Yang Weng
TL;DR
The paper tackles the limitations of SHAP in vision tasks arising from feature dependencies by introducing O-Shap, an Owen-value-based, hierarchy-aware explanation framework. It reformulates Shapley axioms for image data into group-level variants (Group Symmetry and Group Dummy) and extends the Owen value to multi-level hierarchies, enabling structure-aware attributions with polynomial-time complexity $O(|N|^{n \cdot \log_{n}{2}})$ under balanced partitions. A semantics-aware segmentation pipeline is proposed to satisfy the positive $T$-Property, ensuring semantic coherence across hierarchy levels. Extensive experiments on five image datasets and one tabular dataset demonstrate superior attribution precision and runtime efficiency compared to seven SHAP variants, highlighting the method’s robustness when structure matters.
Abstract
Shapley value-based methods have become foundational in explainable artificial intelligence (XAI), offering theoretically grounded feature attributions through cooperative game theory. However, in practice, particularly in vision tasks, the assumption of feature independence breaks down, as features (i.e., pixels) often exhibit strong spatial and semantic dependencies. To address this, modern SHAP implementations now include the Owen value, a hierarchical generalization of the Shapley value that supports group attributions. While the Owen value preserves the foundations of Shapley values, its effectiveness critically depends on how feature groups are defined. We show that commonly used segmentations (e.g., axis-aligned or SLIC) violate key consistency properties, and propose a new segmentation approach that satisfies the $T$-property to ensure semantic alignment across hierarchy levels. This hierarchy enables computational pruning while improving attribution accuracy and interpretability. Experiments on image and tabular datasets demonstrate that O-Shap outperforms baseline SHAP variants in attribution precision, semantic coherence, and runtime efficiency, especially when structure matters.
