Persona is a Double-edged Sword: Mitigating the Negative Impact of Role-playing Prompts in Zero-shot Reasoning Tasks
Junseok Kim, Nakyeong Yang, Kyomin Jung
TL;DR
This work investigates the paradoxical effects of role-playing prompts on zero-shot reasoning and introduces Jekyll & Hyde, a robust framework that automatically generates LLM personas, runs dual solvers with and without personas, and uses a bias-mitigated evaluator to select the better solution. The approach consistently improves reasoning across twelve diverse datasets and multiple backbone models, while reducing sensitivity to persona quality and position bias. Key contributions include automatic persona generation, dual-perspective solving, and a consistency-based evaluation protocol that approaches oracle performance in many cases. The framework highlights the potential of ensemble prompting to strengthen LLM reasoning while providing practical guidance on maintaining stability and efficiency in human-AI systems.
Abstract
Recent studies demonstrate that prompting a role-playing persona to an LLM improves reasoning capability. However, assigning an adequate persona is difficult since LLMs are extremely sensitive to assigned prompts; thus, inaccurately defined personas sometimes hinder LLMs and degrade their reasoning capabilities. In this paper, we first investigate the potential negative impact of injecting persona into language models. Furthermore, we propose a novel framework, Jekyll \& Hyde, which ensembles the outcomes of both role-playing and neutral prompts to enhance the robustness of reasoning ability. Specifically, Jekyll \& Hyde predicts an appropriate persona using an LLM when defining the role-playing prompt. Then, Jekyll \& Hyde collects two potential solutions from role-playing and neutral prompts and selects a better solution using the LLM evaluator. The experimental analysis demonstrates that role-playing prompts sometimes distract LLMs, degrading their reasoning abilities in 7 out of 12 datasets in llama3. Meanwhile, Jekyll \& Hyde improve reasoning capabilities by selecting better choices among the potential solutions on twelve widely-used natural language reasoning datasets. In addition, we reveal that assigning LLM-generated personas obtains more stable results than handcrafted personas.
