From Pixels to Personas: Investigating and Modeling Self-Anthropomorphism in Human-Robot Dialogues
Yu Li, Devamanyu Hazarika, Di Jin, Julia Hirschberg, Yang Liu
TL;DR
The paper investigates self-anthropomorphism (SA) in human–AI dialogues, distinguishing SA from non-self-anthropomorphic (NSA) responses and examining ethical implications across embodiments. It introduces Pix2Persona, a dataset with 143K dialogue turns that pairs original bot responses with SA and NSA variants, enabling systematic study and dynamic SA control. A GPT-4–based classifier validates SA detection (P=81.68%, R=82.57%, F1=81.92%; κ=0.83), and analyses reveal task-driven SA prevalence and distinct linguistic markers. The authors propose a Dual-Capability model (fine-tuned Mistral-7B) and demonstrate transformations SA↔NSA across multiple tasks, achieving competitive performance and enabling adaptive SA levels tailored to ethical and user-experience considerations. Overall, the work provides tools and insights for ethically managing SA in embodied AI, with implications for safer, more engaging human–robot interactions and future research on context-aware anthropomorphism.
Abstract
Self-anthropomorphism in robots manifests itself through their display of human-like characteristics in dialogue, such as expressing preferences and emotions. Our study systematically analyzes self-anthropomorphic expression within various dialogue datasets, outlining the contrasts between self-anthropomorphic and non-self-anthropomorphic responses in dialogue systems. We show significant differences in these two types of responses and propose transitioning from one type to the other. We also introduce Pix2Persona, a novel dataset aimed at developing ethical and engaging AI systems in various embodiments. This dataset preserves the original dialogues from existing corpora and enhances them with paired responses: self-anthropomorphic and non-self-anthropomorphic for each original bot response. Our work not only uncovers a new category of bot responses that were previously under-explored but also lays the groundwork for future studies about dynamically adjusting self-anthropomorphism levels in AI systems to align with ethical standards and user expectations.
