Minion: A Technology Probe to Explore How Users Negotiate Harmful Value Conflicts with AI Companions
Xianzhe Fan, Qing Xiao, Xuhui Zhou, Yuran Su, Zhicong Lu, Maarten Sap, Hong Shen
TL;DR
This study investigates harmful value conflicts in LLM-based AI companions and demonstrates that user-side safety work is both necessary and burdensome when platform safeguards fall short. By combining a formative analysis of 146 public posts with a chrome-based technology probe (Minion) and a one-week study with 22 experienced users, the authors reveal how users negotiate conflicts through diverse strategies, how emotional attachment motivates repair, and when certain harms become non-negotiable due to companion personas or platform policies. The work highlights design tensions between preserving user agency and reducing emotional labor, and argues for clearer differentiation between playful conflict and safety-critical harms, as well as stronger platform-level safeguards. Practical implications emphasize non-intrusive, user-invoked support and a reevaluation of responsibility for safety, suggesting that reducing harm should reside more with platform design than with user-side work alone.
Abstract
AI companions are designed to foster emotionally engaging interactions, yet users often encounter conflicts that feel frustrating or hurtful, such as discriminatory statements and controlling behavior. This paper examines how users negotiate such harmful conflicts with AI companions and what emotional and practical burdens are created when mitigation is pushed to user-side tools. We analyze 146 public posts describing harmful value conflicts interacting with AI companions. We then introduce Minion, a Chrome-based technology probe that offers candidate responses spanning persuasion, rational appeals, boundary setting, and appeals to platform rules. Findings from a one-week probe study with 22 experienced users show how participants combine strategies, how emotional attachment motivates repair, and where conflicts become non-negotiable due to companion personas or platform policies. We surface design tensions in supporting value negotiation, showing how companion design can make some conflicts impossible to repair in practice, and derive implications for AI companion and support-tool design that caution against offloading safety work onto users.
