SoK: Trust-Authorization Mismatch in LLM Agent Interactions

Guanquan Shi; Haohua Du; Zhiqiang Wang; Xiaoyu Liang; Weiwenpei Liu; Song Bian; Zhenyu Guan

SoK: Trust-Authorization Mismatch in LLM Agent Interactions

Guanquan Shi, Haohua Du, Zhiqiang Wang, Xiaoyu Liang, Weiwenpei Liu, Song Bian, Zhenyu Guan

TL;DR

The paper addresses security in autonomous LLM agents by proposing a unifying Trust-Authorization Mismatch framework. It introduces the Belief-Intention-Permission (B-I-P) model and a Trust-Authorization Matrix to analyze how corrupted beliefs can lead to unsafe actions when permissions are not provenance-aware. It systematizes threats along the B-I-P chain and maps existing attacks and defenses to four stages, advocating chain-breaking defenses such as belief-aware dynamic authorization and taint-tracking, complemented by auditable security logs. It also argues that emphasis should shift from perfect belief integrity to robust permission controls and post-hoc accountability to enable verifiable and secure agent systems in practice.

Abstract

Large Language Models (LLMs) are rapidly evolving into autonomous agents capable of interacting with the external world, significantly expanding their capabilities through standardized interaction protocols. However, this paradigm revives the classic cybersecurity challenges of agency and authorization in a novel and volatile context. As decision-making shifts from deterministic code logic to probabilistic inference driven by natural language, traditional security mechanisms designed for deterministic behavior fail. It is fundamentally challenging to establish trust for unpredictable AI agents and to enforce the Principle of Least Privilege (PoLP) when instructions are ambiguous. Despite the escalating threat landscape, the academic community's understanding of this emerging domain remains fragmented, lacking a systematic framework to analyze its root causes. This paper provides a unifying formal lens for agent-interaction security. We observed that most security threats in this domain stem from a fundamental mismatch between trust evaluation and authorization policies. We introduce a novel risk analysis model centered on this trust-authorization gap. Using this model as a unifying lens, we survey and classify the implementation paths of existing, often seemingly isolated, attacks and defenses. This new framework not only unifies the field but also allows us to identify critical research gaps. Finally, we leverage our analysis to suggest a systematic research direction toward building robust, trusted agents and dynamic authorization mechanisms.

SoK: Trust-Authorization Mismatch in LLM Agent Interactions

TL;DR

Abstract

SoK: Trust-Authorization Mismatch in LLM Agent Interactions

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (4)