Counterfactual Samples Constructing and Training for Commonsense Statements Estimation
Chong Liu, Zaiwen Feng, Lin Liu, Zhenyun Deng, Jiuyong Li, Ruifang Zhai, Debo Cheng, Li Qin
TL;DR
The paper addresses weaknesses in Plausibility Estimation (PE) by introducing Commonsense Counterfactual Samples Generating (CCSG), a model-agnostic method that encourages language-explainable and commonsense-sensitive reasoning. CCSG combines a Counterfactual Samples Constructor (contribution-guided word-level substitution plus dropout-based augmentation) with a sentence-level contrastive training objective, underpinned by a Structural Causal Model (SCM) to mitigate commonsense bias. Evaluations across nine datasets show that CCSG achieves a 3.07% absolute improvement over the prior state of the art and enhances LLM-based knowledge filtering, while ablations demonstrate improvements in bias reduction and explainability. The approach advances PE robustness, interpretability, and reliability, though the authors acknowledge limitations in fictional contexts and ethical considerations for deployment.
Abstract
Plausibility Estimation (PE) plays a crucial role in enabling language models to objectively comprehend the real world. Although large language models (LLMs) demonstrate remarkable capabilities in PE tasks, they sometimes produce trivial commonsense errors due to the complexity of commonsense knowledge. They lack two key traits of an ideal PE model: a) Language-explainable: relying on critical word segments for decisions, and b) Commonsense-sensitive: detecting subtle linguistic variations in commonsense. To address these issues, we propose a novel model-agnostic method, referred to as Commonsense Counterfactual Samples Generating (CCSG). By training PE models with CCSG, we encourage them to focus on critical words, thereby enhancing both their language-explainable and commonsense-sensitive capabilities. Specifically, CCSG generates counterfactual samples by strategically replacing key words and introducing low-level dropout within sentences. These counterfactual samples are then incorporated into a sentence-level contrastive training framework to further enhance the model's learning process. Experimental results across nine diverse datasets demonstrate the effectiveness of CCSG in addressing commonsense reasoning challenges, with CCSG achieving a 3.07% improvement over state-of-the-art (SOTA) methods.
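
The two ingredients described above, key-word replacement and sentence-level contrastive training, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `replace_key_word` and `contrastive_loss`, the per-word contribution scores, the toy embeddings, and the InfoNCE-style loss with a temperature of 0.1 are all assumptions chosen for illustration. The abstract does not specify the exact loss or how contributions are computed.

```python
import math

def replace_key_word(tokens, contributions, substitute):
    """Build a counterfactual sentence by swapping the highest-contribution word.

    `contributions` is a hypothetical per-token importance score; how the
    paper actually computes it is not stated in the abstract.
    """
    key_index = max(range(len(tokens)), key=lambda i: contributions[i])
    counterfactual = list(tokens)
    counterfactual[key_index] = substitute
    return counterfactual

def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def contrastive_loss(anchor, positive, negative, temperature=0.1):
    """InfoNCE-style loss over sentence embeddings (an assumed stand-in).

    The positive plays the role of a lightly perturbed (dropout-like) view of
    the original sentence; the negative plays the role of the counterfactual
    whose key word was replaced.
    """
    pos = math.exp(cosine(anchor, positive) / temperature)
    neg = math.exp(cosine(anchor, negative) / temperature)
    return -math.log(pos / (pos + neg))

# Toy usage with made-up scores and two-dimensional embeddings.
tokens = ["fish", "live", "in", "water"]
scores = [0.10, 0.20, 0.05, 0.65]           # hypothetical contributions
counterfactual = replace_key_word(tokens, scores, "sand")

anchor = [1.0, 0.0]      # embedding of the original sentence (made up)
positive = [0.9, 0.1]    # embedding of the perturbed view (made up)
negative = [0.0, 1.0]    # embedding of the counterfactual (made up)
loss = contrastive_loss(anchor, positive, negative)
```

In this sketch the loss is low when the original sentence sits close to its perturbed view and far from the counterfactual, which is the behavior a commonsense-sensitive encoder would be expected to show.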
