Table of Contents
Fetching ...

Adaptive and Resource-efficient Agentic AI Systems for Mobile and Embedded Devices: A Survey

Sicong Liu, Weiye Wu, Xiangrui Xu, Teng Li, Bowen Pang, Bin Guo, Zhiwen Yu

TL;DR

The survey addresses the challenge of deploying foundation-model–powered agents on mobile and embedded devices by proposing a unified view of adaptive, resource-efficient system design. It introduces a taxonomy spanning elastic FM inference, test-time adaptation, and dynamic multimodal fusion, and details techniques for prompt optimization, CoT adaptation, runtime model reconfiguration, memory and KV cache management, and system scheduling. The authors discuss representative agentic applications, case studies, datasets, and inference engines, and highlight open issues in elastic perception–action loops, physical intelligence, online adaptation, and multi-agent collaboration. The work aims to guide algorithm–system–hardware co-design for scalable, robust, and energy-efficient agentic AI in edge environments, catalyzing practical deployments and future research.

Abstract

Foundation models have reshaped AI by unifying fragmented architectures into scalable backbones with multimodal reasoning and contextual adaptation. In parallel, the long-standing notion of AI agents, defined by the sensing-decision-action loop, is entering a new paradigm: with FMs as their cognitive core, agents transcend rule-based behaviors to achieve autonomy, generalization, and self-reflection. This dual shift is reinforced by real-world demands such as autonomous driving, robotics, virtual assistants, and GUI agents, as well as ecosystem advances in embedded hardware, edge computing, mobile deployment platforms, and communication protocols that together enable large-scale deployment. Yet this convergence collides with reality: while applications demand long-term adaptability and real-time interaction, mobile and edge deployments remain constrained by memory, energy, bandwidth, and latency. This creates a fundamental tension between the growing complexity of FMs and the limited resources of deployment environments. This survey provides the first systematic characterization of adaptive, resource-efficient agentic AI systems. We summarize enabling techniques into elastic inference, test-time adaptation, dynamic multimodal integration, and agentic AI applications, and identify open challenges in balancing accuracy-latency-communication trade-offs and sustaining robustness under distribution shifts. We further highlight future opportunities in algorithm-system co-design, cognitive adaptation, and collaborative edge deployment. By mapping FM structures, cognition, and hardware resources, this work establishes a unified perspective toward scalable, adaptive, and resource-efficient agentic AI. We believe this survey can help readers to understand the connections between enabling technologies while promoting further discussions on the fusion of agentic intelligence and intelligent agents.

Adaptive and Resource-efficient Agentic AI Systems for Mobile and Embedded Devices: A Survey

TL;DR

The survey addresses the challenge of deploying foundation-model–powered agents on mobile and embedded devices by proposing a unified view of adaptive, resource-efficient system design. It introduces a taxonomy spanning elastic FM inference, test-time adaptation, and dynamic multimodal fusion, and details techniques for prompt optimization, CoT adaptation, runtime model reconfiguration, memory and KV cache management, and system scheduling. The authors discuss representative agentic applications, case studies, datasets, and inference engines, and highlight open issues in elastic perception–action loops, physical intelligence, online adaptation, and multi-agent collaboration. The work aims to guide algorithm–system–hardware co-design for scalable, robust, and energy-efficient agentic AI in edge environments, catalyzing practical deployments and future research.

Abstract

Foundation models have reshaped AI by unifying fragmented architectures into scalable backbones with multimodal reasoning and contextual adaptation. In parallel, the long-standing notion of AI agents, defined by the sensing-decision-action loop, is entering a new paradigm: with FMs as their cognitive core, agents transcend rule-based behaviors to achieve autonomy, generalization, and self-reflection. This dual shift is reinforced by real-world demands such as autonomous driving, robotics, virtual assistants, and GUI agents, as well as ecosystem advances in embedded hardware, edge computing, mobile deployment platforms, and communication protocols that together enable large-scale deployment. Yet this convergence collides with reality: while applications demand long-term adaptability and real-time interaction, mobile and edge deployments remain constrained by memory, energy, bandwidth, and latency. This creates a fundamental tension between the growing complexity of FMs and the limited resources of deployment environments. This survey provides the first systematic characterization of adaptive, resource-efficient agentic AI systems. We summarize enabling techniques into elastic inference, test-time adaptation, dynamic multimodal integration, and agentic AI applications, and identify open challenges in balancing accuracy-latency-communication trade-offs and sustaining robustness under distribution shifts. We further highlight future opportunities in algorithm-system co-design, cognitive adaptation, and collaborative edge deployment. By mapping FM structures, cognition, and hardware resources, this work establishes a unified perspective toward scalable, adaptive, and resource-efficient agentic AI. We believe this survey can help readers to understand the connections between enabling technologies while promoting further discussions on the fusion of agentic intelligence and intelligent agents.

Paper Structure

This paper contains 84 sections, 1 equation, 35 figures, 17 tables.

Figures (35)

  • Figure 1: Development of FMs (8-bit) and embedded hardware.
  • Figure 2: Related concepts.
  • Figure 3: Cross-level landscape of agentic AI systems.
  • Figure 4: Dynamically adaptive and resource-efficient agentic AI system workflow.
  • Figure 5: Performance comparison of FMs and traditional DL.
  • ...and 30 more figures