PRIV-QA: Privacy-Preserving Question Answering for Cloud Large Language Models
Guangwei Li, Yuansen Zhang, Yinggui Wang, Shoumeng Yan, Lei Wang, Tao Wei
TL;DR
This work tackles privacy risks in cloud-based QA by proposing PRIV-QA, a two-module framework that de-identifies user queries (Hide Module) and recovers them after cloud-model responses (Recover Module). It introduces SensitiveQA, the first bilingual general privacy QA dataset with 57k interactions to evaluate privacy protection and QA quality. Experiments demonstrate high precision/recall for sensitive-information detection (up to 91.36/89.40 in English) and strong extraction-defense rates, while maintaining competitive recovery quality (BLEU/METEOR/ROUGE) with manageable time overhead. The approach offers a practical privacy-utility trade-off for real-world cloud LLM usage, with code and data publicly available.
Abstract
The rapid development of large language models (LLMs) is redefining the landscape of human-computer interaction, and their integration into various user-service applications is becoming increasingly prevalent. However, transmitting user data to cloud-based LLMs presents significant risks of data breaches and unauthorized access to personal identification information. In this paper, we propose a privacy preservation pipeline for protecting privacy and sensitive information during interactions between users and LLMs in practical LLM usage scenarios. We construct SensitiveQA, the first privacy open-ended question-answering dataset. It comprises 57k interactions in Chinese and English, encompassing a diverse range of user-sensitive information within the conversations. Our proposed solution employs a multi-stage strategy aimed at preemptively securing user information while simultaneously preserving the response quality of cloud-based LLMs. Experimental validation underscores our method's efficacy in balancing privacy protection with maintaining robust interaction quality. The code and dataset are available at https://github.com/ligw1998/PRIV-QA.
