Exposing LLM User Privacy via Traffic Fingerprint Analysis: A Study of Privacy Risks in LLM Agent Interactions

Yixiang Zhang; Xinhao Deng; Zhongyi Gu; Yihao Chen; Ke Xu; Qi Li; Jianping Wu

Exposing LLM User Privacy via Traffic Fingerprint Analysis: A Study of Privacy Risks in LLM Agent Interactions

Yixiang Zhang, Xinhao Deng, Zhongyi Gu, Yihao Chen, Ke Xu, Qi Li, Jianping Wu

TL;DR

This work addresses privacy risks introduced by LLM agents that autonomously plan and invoke external tools. It shows that encrypted network traffic between users and agents preserves distinctive timing and volume patterns (traffic fingerprints) that reveal agent behaviors, identify the specific agent, and infer sensitive user attributes such as occupation. The authors propose AgentPrint, a framework combining diverse prompt generation, multi-view traffic representations (MTAM), CNN-based fingerprinting, and a zero-shot occupation inference over a DWA-based occupation network with RCA and EWMA. Key findings include high classification performance for agent behavior (macro F1 ≈ 0.924) and identity (macro F1 ≈ 0.866) under mixed-flow conditions, as well as substantial privacy leakage in occupation profiling (Top-3 up to 73.9% for high-exposure virtual users and 69.1% for real users). The results underscore that encryption alone cannot safeguard privacy in the era of LLM agents and motivate technical countermeasures and regulatory updates to mitigate traffic-based side-channel leaks.

Abstract

Large Language Models (LLMs) are increasingly deployed as agents that orchestrate tasks and integrate external tools to execute complex workflows. We demonstrate that these interactive behaviors leave distinctive fingerprints in encrypted traffic exchanged between users and LLM agents. By analyzing traffic patterns associated with agent workflows and tool invocations, adversaries can infer agent activities, distinguish specific agents, and even profile sensitive user attributes. To highlight this risk, we develop AgentPrint, which achieves an F1-score of 0.866 in agent identification and attains 73.9% and 69.1% top-3 accuracy in user attribute inference for simulated- and real-user settings, respectively. These results uncover an overlooked risk: the very interactivity that empowers LLM agents also exposes user privacy, underscoring the urgent need for technical countermeasures alongside regulatory and policy safeguards.

Exposing LLM User Privacy via Traffic Fingerprint Analysis: A Study of Privacy Risks in LLM Agent Interactions

TL;DR

Abstract

Exposing LLM User Privacy via Traffic Fingerprint Analysis: A Study of Privacy Risks in LLM Agent Interactions

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)