Measuring the Security of Mobile LLM Agents under Adversarial Prompts from Untrusted Third-Party Channels

Chenghao Du; Quanfeng Huang; Tingxuan Tang; Zihao Wang; Adwait Nadkarni; Yue Xiao

TL;DR

The paper evaluates the security of mobile LLM agents with a realistic adversarial benchmark built on untrusted mobile channels. It systematically tests eight representative agents against eight attack vectors aligned with MITRE ATT&CK Mobile and finds pervasive vulnerability to prompt injections that enable data leakage, cross-app exfiltration, and malware deployment. The study provides end-to-end evidence that current mobile LLM agents are exploitable in practical settings, even when OS defenses are present, and argues for defense-in-depth approaches and mobile-specific benchmarks. For deployment at scale, the findings point to urgent work on intent verification, content isolation, and adversarial resilience.

Abstract

Large Language Models (LLMs) have transformed software development, enabling AI-powered applications known as LLM-based agents that promise to automate tasks across diverse apps and workflows. Yet, the security implications of deploying such agents in adversarial mobile environments remain poorly understood. In this paper, we present the first systematic study of security risks in mobile LLM agents. We design and evaluate a suite of adversarial case studies, ranging from opportunistic manipulations such as pop-up advertisements to advanced, end-to-end workflows involving malware installation and cross-app data exfiltration. Our evaluation covers eight state-of-the-art mobile agents across three architectures, with over 2,000 adversarial and paired benign trials. The results reveal systemic vulnerabilities: low-barrier vectors such as fraudulent ads succeed with over 80% reliability, while even workflows requiring the circumvention of operating-system warnings, such as malware installation, are consistently completed by advanced multi-app agents. By mapping these attacks to the MITRE ATT&CK Mobile framework, we uncover novel privilege-escalation and persistence pathways unique to LLM-driven automation. Collectively, our findings provide the first end-to-end evidence that mobile LLM agents are exploitable in realistic adversarial settings, where untrusted third-party channels (e.g., ads, embedded webviews, cross-app notifications) are an inherent part of the mobile ecosystem.
