DiCoFlex: Model-agnostic diverse counterfactuals with flexible control
Oleksii Furman, Ulvi Movsum-zada, Patryk Marszalek, Maciej Zięba, Marek Śmieja
TL;DR
DiCoFlex addresses the challenge of generating multiple diverse counterfactual explanations without requiring constant access to the predictive model or retraining. It leverages a conditional normalizing flow (MAF) trained on labeled data to approximate $p_{ heta}(oldsymbol{x}'|oldsymbol{x}, y')$, enabling a single forward pass to produce several counterfactuals while allowing inference-time control over sparsity via the $L_p$ norm and actionability via a feature mask. Training relies on sampling training counterfactuals from a $K$-nearest neighbors distribution $\\hat{q}(oldsymbol{x}'|oldsymbol{x}, y', d)$ and optimizing a KL-based objective that aligns the flow with this distribution, guaranteeing validity by construction. Empirical results on five benchmark tabular datasets show DiCoFlex achieves superior diversity, plausibility, and proximity, with real-time generation and flexible constraint handling, making it a practical tool for sensitive decision domains where stakeholder-guided recourse is important.
Abstract
Counterfactual explanations play a pivotal role in explainable artificial intelligence (XAI) by offering intuitive, human-understandable alternatives that elucidate machine learning model decisions. Despite their significance, existing methods for generating counterfactuals often require constant access to the predictive model, involve computationally intensive optimization for each instance and lack the flexibility to adapt to new user-defined constraints without retraining. In this paper, we propose DiCoFlex, a novel model-agnostic, conditional generative framework that produces multiple diverse counterfactuals in a single forward pass. Leveraging conditional normalizing flows trained solely on labeled data, DiCoFlex addresses key limitations by enabling real-time user-driven customization of constraints such as sparsity and actionability at inference time. Extensive experiments on standard benchmark datasets show that DiCoFlex outperforms existing methods in terms of validity, diversity, proximity, and constraint adherence, making it a practical and scalable solution for counterfactual generation in sensitive decision-making domains.
