CohEx: A Generalized Framework for Cohort Explanation
Fanyu Meng, Xin Liu, Zhaodan Kong, Xin Chen
TL;DR
CohEx introduces cohort explanations as a principled middle ground between global and local XAI, defining a generalized, data-driven framework based on supervised clustering to produce context-specific explanations. It iteratively recomputes local importances within cohorts and refines cohort definitions through supervised clustering, balancing generalizability and conciseness while enforcing locality and stability. The approach is evaluated on synthetic, bike-sharing, and MNIST tasks, showing improved locality, generalizability, and interpretable cohort structure compared to baselines like VINE, REPID, GALE, and hierarchical schemes. The work provides a concrete objective and metrics for cohort explanations and highlights practical considerations and future directions for robust, scalable cohort-level explainability in real-world settings.
Abstract
eXplainable Artificial Intelligence (XAI) has garnered significant attention for enhancing transparency and trust in machine learning models. However, the scopes of most existing explanation techniques focus either on offering a holistic view of the explainee model (global explanation) or on individual instances (local explanation), while the middle ground, i.e., cohort-based explanation, is less explored. Cohort explanations offer insights into the explainee's behavior on a specific group or cohort of instances, enabling a deeper understanding of model decisions within a defined context. In this paper, we discuss the unique challenges and opportunities associated with measuring cohort explanations, define their desired properties, and create a generalized framework for generating cohort explanations based on supervised clustering.
