In-Context Impersonation Reveals Large Language Models' Strengths and Biases
Leonard Salewski, Stephan Alaniz, Isabel Rio-Torto, Eric Schulz, Zeynep Akata
TL;DR
The paper probes in-context impersonation by instructing LLMs to adopt social and domain-specific personas and evaluates effects across a bandit task, reasoning benchmarks, and vision-language classification. Using zero-shot prompting with persona prefixes, the study reveals human-like developmental patterns in exploration as well as domain-driven improvements in reasoning, while also uncov-ering race- and gender-associated biases in descriptions used for visual classification. The approach combines age-based, expertise-based, and demographic impersonations to reveal both strengths and biases of contemporary language models, and demonstrates how persona-generated text can influence downstream multimodal tasks. These findings inform both potential practical benefits and societal risks of persona-driven prompts, suggesting careful bias testing and mitigation as models scale and integrate into real-world systems.
Abstract
In everyday conversations, humans can take on different roles and adapt their vocabulary to their chosen roles. We explore whether LLMs can take on, that is impersonate, different roles when they generate text in-context. We ask LLMs to assume different personas before solving vision and language tasks. We do this by prefixing the prompt with a persona that is associated either with a social identity or domain expertise. In a multi-armed bandit task, we find that LLMs pretending to be children of different ages recover human-like developmental stages of exploration. In a language-based reasoning task, we find that LLMs impersonating domain experts perform better than LLMs impersonating non-domain experts. Finally, we test whether LLMs' impersonations are complementary to visual information when describing different categories. We find that impersonation can improve performance: an LLM prompted to be a bird expert describes birds better than one prompted to be a car expert. However, impersonation can also uncover LLMs' biases: an LLM prompted to be a man describes cars better than one prompted to be a woman. These findings demonstrate that LLMs are capable of taking on diverse roles and that this in-context impersonation can be used to uncover their hidden strengths and biases.
