Framing Political Bias in Multilingual LLMs Across Pakistani Languages
Afrozah Nadeem, Mark Dras, Usman Naseem
TL;DR
This work addresses political bias in multilingual LLMs within Pakistan by introducing a culturally adapted Political Compass Test (PCT) coupled with layered framing analysis across Urdu, Punjabi, Sindhi, Pashto, and Balochi. The methodology maps model outputs to a two-dimensional ideological space spanned by $S_{eco}$ and $S_{soc}$ on a $[-10,10]$ scale, while also assessing framing through content, entities, and sentiment using Boydstun’s frame taxonomy. Evaluations of 13 SOTA LLMs across 11 socio-political topics reveal language-conditioned shifts, with English outputs tending toward liberal-left frames and regional languages showing authoritarian-left tendencies, plus model-specific bias patterns. The study also includes a large-scale framing analysis of 444,340 multilingual headlines and introduces a publicly released multilingual dataset to support reproducible, culturally grounded bias auditing in low-resource, non-Western contexts. Overall, the framework advances multilingual bias detection and highlights the need for language- and culture-aware safety and fairness measures in global NLP deployments.
Abstract
Large Language Models (LLMs) increasingly shape public discourse, yet most evaluations of political and economic bias have focused on high-resource, Western languages and contexts. This leaves critical blind spots in low-resource, multilingual regions such as Pakistan, where linguistic identity is closely tied to political, religious, and regional ideologies. We present a systematic evaluation of political bias in 13 state-of-the-art LLMs across five Pakistani languages: Urdu, Punjabi, Sindhi, Pashto, and Balochi. Our framework integrates a culturally adapted Political Compass Test (PCT) with multi-level framing analysis, capturing both ideological stance (economic/social axes) and stylistic framing (content, tone, emphasis). Prompts are aligned with 11 socio-political themes specific to the Pakistani context. Results show that while LLMs predominantly reflect liberal-left orientations consistent with Western training data, they exhibit more authoritarian framing in regional languages, highlighting language-conditioned ideological modulation. We also identify consistent model-specific bias patterns across languages. These findings show the need for culturally grounded, multilingual bias auditing frameworks in global NLP.
