Exploiting contextual information to improve stance detection in informal political discourse with LLMs
Arman Engin Sucu, Yixiang Zhou, Mario A. Nascimento, Tony Mullen
TL;DR
The paper tackles political stance detection in informal online discourse and shows that enriching LLM prompts with structured user-profile summaries derived from historical posts substantially boosts accuracy. Through a three-phase methodology and cross-model evaluation across seven LLMs, it demonstrates accuracy gains of roughly 17.5 to 38.5 percentage points, achieving up to 74% on a real-world politics forum dataset. Key findings highlight that a strategically curated set of 10–20 posts yields near-optimal context, that profile-quality matters more than sheer volume, and that different models contribute complementary strengths in profile generation and classification. These results underscore the practical value of user-level context in improving nuanced political NLP tasks and offer guidance for scalable deployment of contextualized stance detectors.
Abstract
This study investigates the use of Large Language Models (LLMs) for political stance detection in informal online discourse, where language is often sarcastic, ambiguous, and context-dependent. We explore whether providing contextual information, specifically user profile summaries derived from historical posts, can improve classification accuracy. Using a real-world political forum dataset, we generate structured profiles that summarize users' ideological leaning, recurring topics, and linguistic patterns. We evaluate seven state-of-the-art LLMs across baseline and context-enriched setups through a comprehensive cross-model evaluation. Our findings show that contextual prompts significantly boost accuracy, with improvements ranging from +17.5\% to +38.5\%, achieving up to 74\% accuracy that surpasses previous approaches. We also analyze how profile size and post selection strategies affect performance, showing that strategically chosen political content yields better results than larger, randomly selected contexts. These findings underscore the value of incorporating user-level context to enhance LLM performance in nuanced political classification tasks.
