Are Good Explainers Secretly Human-in-the-Loop Active Learners?
Emma Thuong Nguyen, Abhishek Ghose
TL;DR
The paper investigates whether using explanations to guide data selection amounts to a human-in-the-loop Active Learning workflow. It offers a formal mathematical framework that maps explanations to retrieval-based data selection, using matrices $A$, $B$, and $C$ and retrieval weights $W$ to minimize the generalization loss $\mathcal{L}_v$ over $\theta$ and $\Psi$. Empirically, it demonstrates initial results on SST-5 with Partition SHAP explanations and a linear SVM, showing the explanation-driven strategy can outperform standard AL methods in early iterations and remain competitive thereafter, while enabling simulation-based evaluation. The work highlights practical implications for evaluating XAI-based data curation and outlines directions for broader datasets, alternative explainers, and user studies to validate the approach.
Abstract
Explainable AI (XAI) techniques have become popular for multiple use-cases in the past few years. Here we consider its use in studying model predictions to gather additional training data. We argue that this is equivalent to Active Learning, where the query strategy involves a human-in-the-loop. We provide a mathematical approximation for the role of the human, and present a general formalization of the end-to-end workflow. This enables us to rigorously compare this use with standard Active Learning algorithms, while allowing for extensions to the workflow. An added benefit is that their utility can be assessed via simulation instead of conducting expensive user-studies. We also present some initial promising results.
