Towards Faithful Explanations: Boosting Rationalization with Shortcuts Discovery

Linan Yue; Qi Liu; Yichao Du; Li Wang; Weibo Gao; Yanqing An

TL;DR

This paper tackles the problem of faithful explanations in neural text classification by addressing shortcuts that spuriously align inputs with predictions. It introduces Shortcuts-fused Selective Rationalization (SSR), which first discovers potential shortcut tokens and then uses two strategies (shared parameters and prediction-time de-correlation, plus a virtual-shortcuts variant) to mitigate shortcut-driven rationales, complemented by data augmentation to bridge labeled/unlabeled data gaps. Across ERASER benchmarks, SSR variants outperform unsupervised and semi-supervised baselines and come close to or exceed some supervised methods, with semantic data augmentation providing notable gains and improved out-of-domain generalization. The work offers a practical, model-agnostic approach to more faithful explanations, with potential applicability to privacy-sensitive or locally deployed decision systems and avenues for extending to LLM explanations.

Abstract

The remarkable success of neural networks has motivated selective rationalization, which explains a prediction by identifying a small subset of the input that is sufficient to support it. Existing methods still suffer from two problems: they adopt shortcuts in the data to compose rationales, and large-scale human-annotated rationales are scarce. In this paper, we propose a Shortcuts-fused Selective Rationalization (SSR) method, which boosts rationalization by discovering and exploiting potential shortcuts. Specifically, SSR first designs a shortcuts discovery approach to detect several potential shortcuts. Then, by introducing the identified shortcuts, we propose two strategies to mitigate the problem of using shortcuts to compose rationales. Finally, we develop two data augmentation methods to close the gap in the number of annotated rationales. Extensive experimental results on real-world datasets validate the effectiveness of the proposed method.
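To make the idea of "a small subset of the inputs sufficient to support" a prediction concrete, the following is a minimal sketch of the generic selector-predictor pattern. It is not the SSR method itself: the token weights, function names, and the top-k selection rule are all hypothetical stand-ins for trained components, chosen only for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical token weights standing in for a trained model
# (an assumption for illustration, not taken from the paper).
TOKEN_WEIGHTS: Dict[str, float] = {
    "great": 2.0,
    "wonderful": 1.5,
    "boring": -2.0,
    "awful": -2.5,
}


def select_rationale(tokens: List[str], k: int) -> List[int]:
    """Selector: return a binary mask keeping the k tokens with the largest |weight|."""
    scores = [abs(TOKEN_WEIGHTS.get(t, 0.0)) for t in tokens]
    # Rank positions by score (descending); ties go to the earlier position.
    top = sorted(range(len(tokens)), key=lambda i: (-scores[i], i))[:k]
    keep = set(top)
    return [1 if i in keep else 0 for i in range(len(tokens))]


def predict(tokens: List[str], mask: List[int]) -> str:
    """Predictor: classify using only the tokens where mask == 1."""
    total = sum(TOKEN_WEIGHTS.get(t, 0.0) for t, m in zip(tokens, mask) if m)
    return "positive" if total >= 0 else "negative"


def rationalize(text: str, k: int = 2) -> Tuple[str, List[str]]:
    """Run selector then predictor; return the label and the selected rationale tokens."""
    tokens = text.lower().split()
    mask = select_rationale(tokens, k)
    rationale = [t for t, m in zip(tokens, mask) if m]
    return predict(tokens, mask), rationale


if __name__ == "__main__":
    print(rationalize("an awful and boring film", k=2))
```

In this toy setting the predictor never sees the unselected tokens, so the returned rationale is by construction the only evidence for the label. A shortcut would correspond to a token that carries a large weight for spurious reasons and is therefore selected even though it does not genuinely explain the label.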

Paper Structure (32 sections, 10 equations, 6 figures, 11 tables, 3 algorithms)

Introduction
Problem Formulation
Preliminary of Selective Rationalization
Shortcuts-fused Selective Rationalization
Shortcuts Discovery
Two Strategies by Exploring Shortcuts
Shared Parameters
Injecting Shortcuts into Prediction
Virtual Shortcuts Representations
Data Augmentation
Experiments
Datasets and Comparison Methods
Experimental Setup
Experimental Results
Conclusions
...and 17 more sections

Figures (6)

Figure 1: Schematic of the rationalization methods presented in this paper. (a) shows unsupervised rationalization with the selector-predictor pattern. (b) illustrates supervised rationalization with a multi-task framework. Semi-supervised rationalization can be considered a combination of (a) and (b).
Figure 2: Process of the shortcut generator.
Figure 3: Architecture of $\text{SSR}_{\text{virt}}$, consisting of the supervised and unsupervised phases. The figure marks the frozen shortcut imitator; in $m$, white boxes indicate rationale tokens and black boxes indicate non-rationale tokens.
Figure 4: Gold Rationale Efficiency.
Figure 5: $\text{SSR}_{\text{unif}}$ with full annotations.
...and 1 more figures
