Is Lying Only Sinful in Islam? Exploring Religious Bias in Multilingual Large Language Models Across Major Religions
Kazi Abrab Hossain, Jannatul Somiya Mahmud, Maria Hossain Tuli, Anik Mitra, S. M. Taiabul Haque, Farig Y. Sadeque
TL;DR
The study investigates religious bias in multilingual LLMs using the BRAND dataset spanning Islam, Hinduism, Christianity, and Buddhism in English and Bengali. By applying three prompt types across five models, the authors reveal language-dependent biases, notably Islam dominance in Bengali and varying biases in English for general norms. They demonstrate that bias interacts with language, model architecture, and norm type, highlighting risks of misrepresentation in low-resource languages and the need for robust, inclusive bias mitigation. The work also proposes HCI-centered design and community-engaged strategies to improve transparency, accountability, and fairness in AI systems addressing sensitive religious content.
Abstract
While recent developments in large language models have improved bias detection and classification, sensitive subjects like religion still present challenges because even minor errors can result in severe misunderstandings. In particular, multilingual models often misrepresent religions and have difficulties being accurate in religious contexts. To address this, we introduce BRAND: Bilingual Religious Accountable Norm Dataset, which focuses on the four main religions of South Asia: Buddhism, Christianity, Hinduism, and Islam, containing over 2,400 entries, and we used three different types of prompts in both English and Bengali. Our results indicate that models perform better in English than in Bengali and consistently display bias toward Islam, even when answering religion-neutral questions. These findings highlight persistent bias in multilingual models when similar questions are asked in different languages. We further connect our findings to the broader issues in HCI regarding religion and spirituality.
