Robust Stance Detection: Understanding Public Perceptions in Social Media
Nayoung Kim, David Mosallanezhad, Lu Cheng, Michelle V. Mancenido, Huan Liu
TL;DR
This work tackles the challenge of detecting public stances across changing domains and targets in social media. It introduces STANCE-C3, a two-stage framework that (1) generates domain-counterfactual text to bridge domain gaps via a T5-based generator and (2) employs a supervised contrastive loss to learn cross-target, domain-invariant representations. The approach demonstrates consistent improvements over state-of-the-art baselines in cross-domain and cross-target scenarios on COVID-19-related datasets, with ablation studies underscoring the importance of both counterfactual augmentation and the contrastive objective. The results suggest that domain- and target-robust stance detectors can provide reliable, policy-relevant insights in environments where data are scarce or rapidly shifting, offering practical value for public health and governance contexts.
Abstract
The abundance of social media data has presented opportunities for accurately determining public and group-specific stances around policy proposals or controversial topics. In contrast with sentiment analysis which focuses on identifying prevailing emotions, stance detection identifies precise positions (i.e., supportive, opposing, neutral) relative to a well-defined topic, such as perceptions toward specific global health interventions during the COVID-19 pandemic. Traditional stance detection models, while effective within their specific domain (e.g., attitudes towards masking protocols during COVID-19), often lag in performance when applied to new domains and topics due to changes in data distribution. This limitation is compounded by the scarcity of domain-specific, labeled datasets, which are expensive and labor-intensive to create. A solution we present in this paper combines counterfactual data augmentation with contrastive learning to enhance the robustness of stance detection across domains and topics of interest. We evaluate the performance of current state-of-the-art stance detection models, including a prompt-optimized large language model, relative to our proposed framework succinctly called STANCE-C3 (domain-adaptive Cross-target STANCE detection via Contrastive learning and Counterfactual generation). Empirical evaluations demonstrate STANCE-C3's consistent improvements over the baseline models with respect to accuracy across domains and varying focal topics. Despite the increasing prevalence of general-purpose models such as generative AI, specialized models such as STANCE-C3 provide utility in safety-critical domains wherein precision is highly valued, especially when a nuanced understanding of the concerns of different population segments could result in crafting more impactful public policies.
