Negation Triplet Extraction with Syntactic Dependency and Semantic Consistency
Yuchen Shi, Deqing Yang, Jingping Liu, Yanghua Xiao, Zongyu Wang, Huimin Xu
TL;DR
This work introduces Negation Triplet Extraction (NTE), a task to extract <subject, cue, scope> triplets from sentences, addressing the missing negation subject in prior negation understanding. It proposes SSENE, an Encoder-Decoder PLM with a syntax-aware encoder and a semantic-consistency auxiliary task, optimizing with a combined loss $\mathcal{L}=\mathcal{L}_1+\alpha\mathcal{L}_2$ where $0<\alpha<1$, and encodes syntactic structure via a dependency-attention mechanism and matrix $\mathbf{M}$. A high-quality Chinese dataset NegComment is constructed from Meituan reviews to evaluate NTE, and extensive experiments show SSENE outperforms baselines in NTE and remains competitive on CDSR, with ablations confirming the benefits of both syntactic information and semantic alignment. The model's ability to leverage distant dependencies and enforce semantic consistency has practical implications for downstream tasks like search, sentiment analysis, and reasoning over negated attributes. The work also demonstrates generalizability to CDSR and contributes a new dataset to advance negation understanding in real-world Chinese text.
Abstract
Previous works of negation understanding mainly focus on negation cue detection and scope resolution, without identifying negation subject which is also significant to the downstream tasks. In this paper, we propose a new negation triplet extraction (NTE) task which aims to extract negation subject along with negation cue and scope. To achieve NTE, we devise a novel Syntax&Semantic-Enhanced Negation Extraction model, namely SSENE, which is built based on a generative pretrained language model (PLM) {of Encoder-Decoder architecture} with a multi-task learning framework. Specifically, the given sentence's syntactic dependency tree is incorporated into the PLM's encoder to discover the correlations between the negation subject, cue and scope. Moreover, the semantic consistency between the sentence and the extracted triplet is ensured by an auxiliary task learning. Furthermore, we have constructed a high-quality Chinese dataset NegComment based on the users' reviews from the real-world platform of Meituan, upon which our evaluations show that SSENE achieves the best NTE performance compared to the baselines. Our ablation and case studies also demonstrate that incorporating the syntactic information helps the PLM's recognize the distant dependency between the subject and cue, and the auxiliary task learning is helpful to extract the negation triplets with more semantic consistency.
