GAMA: A General Anonymizing Multi-Agent System for Privacy Preservation Enhanced by Domain Rules and Disproof Mechanism
Hailong Yang, Renhuo Zhao, Guanjin Wang, Zhaohong Deng
TL;DR
The paper tackles privacy risks in LLM-based multi-agent systems by introducing GAMA, a General Anonymizing Multi-Agent framework that partitions agent workspaces into private and public spaces and anonymizes data before exposure to web-based LLMs. It combines an anonymizing mechanism (AMPP) with two enhancement modules—Domain-Rule-based Knowledge Enhancement (DRKE) and Disproof-based Logic Enhancement (DLE)—to preserve semantic and logical integrity under privacy constraints. Empirical results on general QA datasets and privacy-specific QA benchmarks show that GAMA achieves higher task accuracy and lower privacy leakage than baselines, while also exhibiting strong privacy-identification and resilience to re-identification attacks. The approach offers a practical path to safer deployment of LLM-powered MAS in privacy-sensitive domains, with room for autonomous privacy-rule learning and multimodal extensions in future work.
Abstract
With the rapid advancement of Large Language Models (LLMs), LLM-based agents exhibit exceptional abilities in understanding and generating natural language, enabling human-like collaboration and information transmission in LLM-based Multi-Agent Systems (MAS). High-performance LLMs are often hosted on web servers in public cloud environments. When tasks involve private data, MAS cannot securely utilize these LLMs without implementing the agentic privacy-preserving mechanism. To address this challenge, we propose a General Anonymizing Multi-Agent System (GAMA), which divides the agents' workspace into private and public spaces, ensuring privacy through a structured anonymization mechanism. In the private space, agents handle sensitive data, while in the public web space, only anonymized data is utilized. GAMA incorporates two key modules to mitigate semantic loss caused by anonymization: Domain-Rule-based Knowledge Enhancement (DRKE) and Disproof-based Logic Enhancement (DLE). We evaluate GAMA on two general question-answering datasets, a public privacy leakage benchmark, and two customized question-answering datasets related to privacy. The results demonstrate that GAMA outperforms existing baselines on the evaluated datasets in terms of both task accuracy and privacy preservation metrics.
