Examining the Influence of Political Bias on Large Language Model Performance in Stance Classification
Lynnette Hui Xian Ng, Iain Cruickshank, Roy Ka-Wei Lee
TL;DR
This study investigates how political biases in large language models affect stance classification across three datasets (BASIL, SemEval2016, Elections2016) and seven LLMs using four prompting schemes. It finds statistically significant left-vs-right performance differences at the dataset level ($p<0.01$) and shows that increasing target ambiguity generally reduces stance classification accuracy, while prompting schemes have mixed effects. Notably, prompting the model to reason about political bias (bias CoT) often does not mitigate bias and can amplify it, suggesting data-driven origins of bias. The work highlights dataset instability and the strong influence of data and annotation quality on downstream political-stance tasks, calling for robust data design and prompting strategies in real-world applications.
Abstract
Large Language Models (LLMs) have demonstrated remarkable capabilities in executing tasks based on natural language queries. However, these models, trained on curated datasets, inherently embody biases ranging from racial to national and gender biases. It remains uncertain whether these biases impact the performance of LLMs for certain tasks. In this study, we investigate the political biases of LLMs within the stance classification task, specifically examining whether these models exhibit a tendency to more accurately classify politically-charged stances. Utilizing three datasets, seven LLMs, and four distinct prompting schemes, we analyze the performance of LLMs on politically oriented statements and targets. Our findings reveal a statistically significant difference in the performance of LLMs across various politically oriented stance classification tasks. Furthermore, we observe that this difference primarily manifests at the dataset level, with models and prompting schemes showing statistically similar performances across different stance classification datasets. Lastly, we observe that when there is greater ambiguity in the target the statement is directed towards, LLMs have poorer stance classification accuracy. Code & Dataset: http://doi.org/10.5281/zenodo.12938478
