Unequal Opportunities: Examining the Bias in Geographical Recommendations by Large Language Models
Shiran Dudy, Thulasi Tholeti, Resmi Ramachandranpillai, Muhammad Ali, Toby Jia-Jun Li, Ricardo Baeza-Yates
TL;DR
This study audits Large Language Models (LLMs) for geographic biases in city recommendations across relocation, tourism, and business domains using Reddit-derived queries. It analyzes internal and external similarity among six LLMs with 24 prompts, employing metrics like Jaccard, TF-IDF, cosine similarity, and BLEU, alongside distributional measures (Concentration ratio, Theil index) and extrinsic comparisons to external city databases. The results show strong internal consistency in city lists but biased representations favoring affluent, highly educated areas and under-representing historically underserved groups, suggesting a rich-get-richer dynamic with real-world implications for mobility and opportunity. The authors discuss implications for designers, propose mitigations such as follow-up questions and diverse data integration, and call for more inclusive evaluation of LLM-enabled information services.
Abstract
Recent advancements in Large Language Models (LLMs) have made them a popular information-seeking tool among end users. However, the statistical training methods for LLMs have raised concerns about their representation of under-represented topics, potentially leading to biases that could influence real-world decisions and opportunities. These biases could have significant economic, social, and cultural impacts as LLMs become more prevalent, whether through direct interactions--such as when users engage with chatbots or automated assistants--or through their integration into third-party applications (as agents), where the models influence decision-making processes and functionalities behind the scenes. Our study examines the biases present in LLMs recommendations of U.S. cities and towns across three domains: relocation, tourism, and starting a business. We explore two key research questions: (i) How similar LLMs responses are, and (ii) How this similarity might favor areas with certain characteristics over others, introducing biases. We focus on the consistency of LLMs responses and their tendency to over-represent or under-represent specific locations. Our findings point to consistent demographic biases in these recommendations, which could perpetuate a ``rich-get-richer'' effect that widens existing economic disparities.
