Deciphering Political Entity Sentiment in News with Large Language Models: Zero-Shot and Few-Shot Strategies
Alapan Kuila, Sudeshna Sarkar
TL;DR
The paper addresses entity-centric sentiment analysis in political news and evaluates large language models (LLMs) for predicting sentiment toward specific entities using zero-shot and few-shot prompting. It leverages chain-of-thought prompting with rationale and a self-consistency decoding strategy across multiple open LLMs, comparing against a fine-tuned BERT baseline on two datasets. Results show that LLMs, particularly Falcon-40B, can outperform FT-BERT in many settings, with few-shot learning and self-consistency boosting performance, while the effectiveness of chain-of-thought prompting is inconsistent across datasets. These findings demonstrate the viability of resource-efficient LLMs for entity-centric sentiment tasks in political news and provide practical guidance on prompting strategies and model choice for real-world political NLP applications.
Abstract
Sentiment analysis plays a pivotal role in understanding public opinion, particularly in the political domain where the portrayal of entities in news articles influences public perception. In this paper, we investigate the effectiveness of Large Language Models (LLMs) in predicting entity-specific sentiment from political news articles. Leveraging zero-shot and few-shot strategies, we explore the capability of LLMs to discern sentiment towards political entities in news content. Employing a chain-of-thought (COT) approach augmented with rationale in few-shot in-context learning, we assess whether this method enhances sentiment prediction accuracy. Our evaluation on sentiment-labeled datasets demonstrates that LLMs, outperform fine-tuned BERT models in capturing entity-specific sentiment. We find that learning in-context significantly improves model performance, while the self-consistency mechanism enhances consistency in sentiment prediction. Despite the promising results, we observe inconsistencies in the effectiveness of the COT prompting method. Overall, our findings underscore the potential of LLMs in entity-centric sentiment analysis within the political news domain and highlight the importance of suitable prompting strategies and model architectures.
