"It's a Fair Game", or Is It? Examining How Users Navigate Disclosure Risks and Benefits When Using LLM-Based Conversational Agents
Zhiping Zhang, Michelle Jia, Hao-Ping Lee, Bingsheng Yao, Sauvik Das, Ada Lerner, Dakuo Wang, Tianshi Li
TL;DR
The paper addresses privacy risks in LLM-based conversational agents by combining a real-world disclosure analysis of the ShareGPT52K dataset with semi-structured interviews of 19 CA users. It reveals that users routinely trade privacy for utility and convenience, yet hold flawed mental models and encounter dark patterns that undermine awareness and control of privacy risks. Through empirical evidence on memorization risks, interdependent privacy, and human-like nudges, the study advances practical design guidelines and calls for paradigm shifts in technology, policy, and society. The findings emphasize the need for user-centered privacy controls, transparent model operation, and local-model options to meaningfully improve privacy protections in LLM-based CAs.
Abstract
The widespread use of Large Language Model (LLM)-based conversational agents (CAs), especially in high-stakes domains, raises many privacy concerns. Building ethical LLM-based CAs that respect user privacy requires an in-depth understanding of the privacy risks that concern users the most. However, existing research, primarily model-centered, does not provide insight into users' perspectives. To bridge this gap, we analyzed sensitive disclosures in real-world ChatGPT conversations and conducted semi-structured interviews with 19 LLM-based CA users. We found that users are constantly faced with trade-offs between privacy, utility, and convenience when using LLM-based CAs. However, users' erroneous mental models and the dark patterns in system design limited their awareness and comprehension of the privacy risks. Additionally, the human-like interactions encouraged more sensitive disclosures, which complicated users' ability to navigate the trade-offs. We discuss practical design guidelines and the needs for paradigm shifts to protect the privacy of LLM-based CA users.
