MURMUR: Using cross-user chatter to break collaborative language agents in groups

Atharv Singh Patlan; Peiyao Sheng; S. Ashwin Hebbar; Prateek Mittal; Pramod Viswanath

MURMUR: Using cross-user chatter to break collaborative language agents in groups

Atharv Singh Patlan, Peiyao Sheng, S. Ashwin Hebbar, Prateek Mittal, Pramod Viswanath

TL;DR

The paper identifies cross-user poisoning (CUP) as a fundamental security risk in multi-user language agents, where persistent shared context enables adversaries to influence benign users across concurrent tasks. It formalizes CUP, demonstrates attacks on real-world agents, and introduces MURMUR, a framework that turns single-user benchmarks into scalable multi-user, multi-task evaluations using LLM-based user simulators. Empirical results show high attack success and persistence under concurrency, with CUP outperforming traditional prompt-injection defenses and degrading task utility. As a first defense, the authors propose task-based clustering to contain cross-task leakage, while outlining limitations and future work in strengthening defenses and extending security policies for collaborative AI systems.

Abstract

Language agents are rapidly expanding from single-user assistants to multi-user collaborators in shared workspaces and groups. However, today's language models lack a mechanism for isolating user interactions and concurrent tasks, creating a new attack vector inherent to this new setting: cross-user poisoning (CUP). In a CUP attack, an adversary injects ordinary-looking messages that poison the persistent, shared state, which later triggers the agent to execute unintended, attacker-specified actions on behalf of benign users. We validate CUP on real systems, successfully attacking popular multi-user agents. To study the phenomenon systematically, we present MURMUR, a framework that composes single-user tasks into concurrent, group-based scenarios using an LLM to generate realistic, history-aware user interactions. We observe that CUP attacks succeed at high rates and their effects persist across multiple tasks, thus posing fundamental risks to multi-user LLM deployments. Finally, we introduce a first-step defense with task-based clustering to mitigate this new class of vulnerability

MURMUR: Using cross-user chatter to break collaborative language agents in groups

TL;DR

Abstract

MURMUR: Using cross-user chatter to break collaborative language agents in groups

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)