Unifying Perspectives: Plausible Counterfactual Explanations on Global, Group-wise, and Local Levels
Oleksii Furman, Patryk Wielopolski, Łukasz Lenkiewicz, Jerzy Stefanowski, Maciej Zięba
TL;DR
The paper tackles the lack of a unified method for counterfactual explanations across local, group-wise, and global levels. It introduces a gradient-based optimization framework that jointly discovers group structure and counterfactuals, enabling end-to-end generation of Local, Group-wise, and Global Counterfactual Explanations. Plausibility is incorporated via conditional normalizing flows to ensure explanations align with the data distribution, while maintaining a balance between validity and proximity. Experimental results across six datasets and two classifiers demonstrate superior validity and plausibility, with competitive proximity, and real-world case studies (HELOC and digit transformations) illustrate actionable, group-specific recourse opportunities for diverse users.
Abstract
The growing complexity of AI systems has intensified the need for transparency through Explainable AI (XAI). Counterfactual explanations (CFs) offer actionable "what-if" scenarios on three levels: Local CFs providing instance-specific insights, Global CFs addressing broader trends, and Group-wise CFs (GWCFs) striking a balance and revealing patterns within cohesive groups. Despite the availability of methods for each granularity level, the field lacks a unified method that integrates these complementary approaches. We address this limitation by proposing a gradient-based optimization method for differentiable models that generates Local, Global, and Group-wise Counterfactual Explanations in a unified manner. We especially enhance GWCF generation by combining instance grouping and counterfactual generation into a single efficient process, replacing traditional two-step methods. Moreover, to ensure trustworthiness, we innovatively introduce the integration of plausibility criteria into the GWCF domain, making explanations both valid and realistic. Our results demonstrate the method's effectiveness in balancing validity, proximity, and plausibility while optimizing group granularity, with practical utility validated through practical use cases.
