Viewing the process of generating counterfactuals as a source of knowledge: a new approach for explaining classifiers
Vincent Lemaire, Nathan Le Boudec, Victor Guyomard, Françoise Fessant
TL;DR
This paper tackles explainability by reframing counterfactual generation as a knowledge-creating process. It builds a knowledge base for the Naive Bayes classifier by encoding the impact of feature changes with $\Delta$ values and demonstrates that changes are additive, enabling tractable trajectory construction toward a target class. The Telco churn case shows how clustering Delta-based profiles yields actionable, trajectory-driven preventive and reactive interventions. Overall, the work provides a practical framework to convert local counterfactual explanations into global, reusable knowledge for explainability, with potential extensions to domains beyond churn.
Abstract
There are now many explainable AI methods for understanding the decisions of a machine learning model. Among these are those based on counterfactual reasoning, which involve simulating features changes and observing the impact on the prediction. This article proposes to view this simulation process as a source of creating a certain amount of knowledge that can be stored to be used, later, in different ways. This process is illustrated in the additive model and, more specifically, in the case of the naive Bayes classifier, whose interesting properties for this purpose are shown.
