Topic-aware Causal Intervention for Counterfactual Detection
Thong Nguyen, Truc-My Nguyen
TL;DR
This work tackles CFD by identifying clue-phrase and label biases that limit generalization, especially under data imbalance. It proposes a topic-aware causal framework that integrates a neural topic representation with backdoor adjustment and deconfounded hidden representations, enabling robust CFD predictions across languages and bias-sensitive tasks. The approach achieves state-of-the-art results on CFD benchmarks (SemEval-2020, Amazon-2021) and demonstrates cross-lingual and transferability benefits to related tasks like PI and ISA, with extensive ablations and case studies supporting the effectiveness of causal intervention. The proposed deconfounding strategy, combining a topic-driven global semantics with causal intervention, has practical implications for fairer, more reliable NLP systems in multilingual and bias-prone settings.
Abstract
Counterfactual statements, which describe events that did not or cannot take place, are beneficial to numerous NLP applications. Hence, we consider the problem of counterfactual detection (CFD) and seek to enhance the CFD models. Previous models are reliant on clue phrases to predict counterfactuality, so they suffer from significant performance drop when clue phrase hints do not exist during testing. Moreover, these models tend to predict non-counterfactuals over counterfactuals. To address these issues, we propose to integrate neural topic model into the CFD model to capture the global semantics of the input statement. We continue to causally intervene the hidden representations of the CFD model to balance the effect of the class labels. Extensive experiments show that our approach outperforms previous state-of-the-art CFD and bias-resolving methods in both the CFD and other bias-sensitive tasks.
