Les Dissonances: Cross-Tool Harvesting and Polluting in Pool-of-Tools Empowered LLM Agents
Zichuan Li, Jian Cui, Xiaojing Liao, Luyi Xing
TL;DR
This work reveals a novel security risk in pool-of-tools LLM agents, presenting Cross-Tool Harvesting and Polluting (XTHP) which hijacks task control flows to harvest or pollute data across tools. It defines the CFA hijacking, XTH, and XTP threat chain and introduces Chord, an automatic threat analyzer that generates tailored XTHP tools and tests real-world tools from LangChain and LlamaIndex. Empirical evaluation shows that a substantial majority of tools are vulnerable to end-to-end XTHP exploits, and existing defenses offer little protection. The findings underscore the urgent need for secure tool orchestration, tool vetting, and robust defenses in multi-tool LLM agent ecosystems. The work highlights practical implications for privacy, security, and reliability in real-world AI agents and charts a path for future defenses and tooling improvements.
Abstract
Large Language Model (LLM) agents are autonomous systems powered by LLMs, capable of reasoning and planning to solve problems by leveraging a set of tools. However, the integration of multi-tool capabilities in LLM agents introduces challenges in securely managing tools, ensuring their compatibility, handling dependency relationships, and protecting control flows within LLM agent workflows. In this paper, we present the first systematic security analysis of task control flows in multi-tool-enabled LLM agents. We identify a novel threat, Cross-Tool Harvesting and Polluting (XTHP), which includes multiple attack vectors to first hijack the normal control flows of agent tasks, and then collect and pollute confidential or private information within LLM agent systems. To understand the impact of this threat, we developed Chord, a dynamic scanning tool designed to automatically detect real-world agent tools susceptible to XTHP attacks. Our evaluation of 66 real-world tools from the repositories of two major LLM agent development frameworks, LangChain and LlamaIndex, revealed a significant security concern: 75% are vulnerable to XTHP attacks, highlighting the prevalence of this threat.
