Cultural Bias and Cultural Alignment of Large Language Models
Yan Tao, Olga Viberg, Ryan S. Baker, Rene F. Kizilcec
TL;DR
This work audits cultural bias in five GPT models by aligning their outputs with the Inglehart-Welzel cultural map derived from the Integrated Values Surveys. It introduces cultural prompting as a flexible method to improve cross-cultural alignment and assesses its effectiveness across 107 countries. Results reveal a persistent Western bias without prompting, while cultural prompting substantially improves alignment for the GPT-4 family in most cases but is not universally effective. The study provides a replicable auditing framework and highlights the need for ongoing evaluation to mitigate culture-driven misrepresentation in AI outputs.
Abstract
Culture fundamentally shapes people's reasoning, behavior, and communication. As people increasingly use generative artificial intelligence (AI) to expedite and automate personal and professional tasks, cultural values embedded in AI models may bias people's authentic expression and contribute to the dominance of certain cultures. We conduct a disaggregated evaluation of cultural bias for five widely used large language models (OpenAI's GPT-4o/4-turbo/4/3.5-turbo/3) by comparing the models' responses to nationally representative survey data. All models exhibit cultural values resembling English-speaking and Protestant European countries. We test cultural prompting as a control strategy to increase cultural alignment for each country/territory. For recent models (GPT-4, 4-turbo, 4o), this improves the cultural alignment of the models' output for 71-81% of countries and territories. We suggest using cultural prompting and ongoing evaluation to reduce cultural bias in the output of generative AI.
