Enhancing XAI Narratives through Multi-Narrative Refinement and Knowledge Distillation
Flavio Giorgi, Matteo Silvestri, Cesare Campagnano, Fabrizio Silvestri, Gabriele Tolomei
TL;DR
This work presents Multi-Narrative Refinement (MNR), a two-stage, knowledge-distillation framework that enables small language models to generate high-quality counterfactual narratives for tabular data. By producing multiple drafts (Draft Narrative Generator) and a subsequent Refiner to synthesize and correct them, the approach substantially improves feature-faithfulness metrics (AvgFF, PFF, TF) and narrative quality while reducing compute (memory, time, energy) compared to using a large teacher model alone. The authors provide a formal background for counterfactual explanations and narratives, introduce a structured evaluation protocol combining automated metrics with human judgments, and demonstrate substantial gains on Adult and Titanic datasets. The work offers practical pathways for deploying explanation-generation in real-world, policy-driven contexts, balancing interpretability, efficiency, and robustness while acknowledging social risks and the need for careful governance.
Abstract
Explainable Artificial Intelligence has become a crucial area of research, aiming to demystify the decision-making processes of deep learning models. Among various explainability techniques, counterfactual explanations have been proven particularly promising, as they offer insights into model behavior by highlighting minimal changes that would alter a prediction. Despite their potential, these explanations are often complex and technical, making them difficult for non-experts to interpret. To address this challenge, we propose a novel pipeline that leverages Language Models, large and small, to compose narratives for counterfactual explanations. We employ knowledge distillation techniques along with a refining mechanism to enable Small Language Models to perform comparably to their larger counterparts while maintaining robust reasoning abilities. In addition, we introduce a simple but effective evaluation method to assess natural language narratives, designed to verify whether the models' responses are in line with the factual, counterfactual ground truth. As a result, our proposed pipeline enhances both the reasoning capabilities and practical performance of student models, making them more suitable for real-world use cases.
