It is Simple Sometimes: A Study On Improving Aspect-Based Sentiment Analysis Performance
Laura Cabello, Uchenna Akujuobi
TL;DR
PFInstruct addresses the need for simple, effective prompt-based learning for ABSA subtasks by adding an NLP-task prefix to the InstructABSA framework. The approach treats all ABSA subtasks as generative tasks and leverages RE/NER prefixes or even noisy prefixes to augment context, achieving improvements across SemEval, ERSA, and SentiHood datasets, including a $+3.28$ F1 on ATE Rest14 and a $+5.43$ F1 average on AOOE across SemEval datasets. A noisy prefix also boosts performance, indicating instruction-following benefits beyond clean prompts, though it introduces higher variance. The method shows competitive results in biomedical ERSA and generalizes to domain shifts, illustrating that a lightweight, prefix-based instruction-tuning strategy can effectively handle ABSA subtasks across diverse domains.
Abstract
Aspect-Based Sentiment Analysis (ABSA) involves extracting opinions from textual data about specific entities and their corresponding aspects through various complementary subtasks. Several prior research has focused on developing ad hoc designs of varying complexities for these subtasks. In this paper, we present a generative framework extensible to any ABSA subtask. We build upon the instruction tuned model proposed by Scaria et al. (2023), who present an instruction-based model with task descriptions followed by in-context examples on ABSA subtasks. We propose PFInstruct, an extension to this instruction learning paradigm by appending an NLP-related task prefix to the task description. This simple approach leads to improved performance across all tested SemEval subtasks, surpassing previous state-of-the-art (SOTA) on the ATE subtask (Rest14) by +3.28 F1-score, and on the AOOE subtask by an average of +5.43 F1-score across SemEval datasets. Furthermore, we explore the impact of the prefix-enhanced prompt quality on the ABSA subtasks and find that even a noisy prefix enhances model performance compared to the baseline. Our method also achieves competitive results on a biomedical domain dataset (ERSA).
