UnWEIRDing LLM Entity Recommendations

Aayush Kumar; Sanket Mhatre

UnWEIRDing LLM Entity Recommendations

Aayush Kumar, Sanket Mhatre

TL;DR

The paper investigates cultural biases in LLM-generated recommendations for real-world entities using the WEIRD framework. It evaluates pluralistic prompting strategies across nine models and 44 fine-grained entity types, with a WEIRD-score-based metric to quantify bias. The results show that prompting generally reduces WEIRD bias but effects vary by model and entity type, with some categories like Product remaining highly biased and geographic concentration around the United States persisting. This work highlights the limits of prompt-based mitigation and motivates development of more holistic bias-mitigation strategies to ensure cultural plurality in LLM outputs.

Abstract

Large Language Models have been widely been adopted by users for writing tasks such as sentence completions. While this can improve writing efficiency, prior research shows that LLM-generated suggestions may exhibit cultural biases which may be difficult for users to detect, especially in educational contexts for non-native English speakers. While such prior work has studied the biases in LLM moral value alignment, we aim to investigate cultural biases in LLM recommendations for real-world entities. To do so, we use the WEIRD (Western, Educated, Industrialized, Rich and Democratic) framework to evaluate recommendations by various LLMs across a dataset of fine-grained entities, and apply pluralistic prompt-based strategies to mitigate these biases. Our results indicate that while such prompting strategies do reduce such biases, this reduction is not consistent across different models, and recommendations for some types of entities are more biased than others.

UnWEIRDing LLM Entity Recommendations

TL;DR

Abstract

UnWEIRDing LLM Entity Recommendations

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)