Table of Contents
Fetching ...

Reducing Cost of LLM Agents with Trajectory Reduction

Yuan-An Xiao, Pengfei Gao, Chao Peng, Yingfei Xiong

Abstract

Multi-turn agent systems based on Large Language Models (LLMs) have become increasingly popular for software engineering tasks. While LLM agents demonstrate promising effectiveness, the high computational cost of input tokens due to ever-growing trajectories remains a significant efficiency concern. Efficiency has been largely overlooked in existing studies and agent products, and this paper addresses this gap by introducing an inference-time trajectory reduction approach that reduces computational costs. By analyzing existing agent trajectories, we demonstrate that useless, redundant, and expired information is widespread across trajectories. Such waste can be identified and reduced without compromising the agent's performance. We propose a simple yet effective trajectory reduction approach, AgentDiet, which automatically removes such waste during agent execution. We implement AgentDiet on a top-performing coding agent, and our evaluation on two LLMs and two benchmarks shows that AgentDiet can reduce input tokens by 39.9%-59.7% and the total computational cost by 21.1%-35.9%, while maintaining the same agent performance. These results indicate that inference-time trajectory reduction is a promising direction for agent systems.

Reducing Cost of LLM Agents with Trajectory Reduction

Abstract

Multi-turn agent systems based on Large Language Models (LLMs) have become increasingly popular for software engineering tasks. While LLM agents demonstrate promising effectiveness, the high computational cost of input tokens due to ever-growing trajectories remains a significant efficiency concern. Efficiency has been largely overlooked in existing studies and agent products, and this paper addresses this gap by introducing an inference-time trajectory reduction approach that reduces computational costs. By analyzing existing agent trajectories, we demonstrate that useless, redundant, and expired information is widespread across trajectories. Such waste can be identified and reduced without compromising the agent's performance. We propose a simple yet effective trajectory reduction approach, AgentDiet, which automatically removes such waste during agent execution. We implement AgentDiet on a top-performing coding agent, and our evaluation on two LLMs and two benchmarks shows that AgentDiet can reduce input tokens by 39.9%-59.7% and the total computational cost by 21.1%-35.9%, while maintaining the same agent performance. These results indicate that inference-time trajectory reduction is a promising direction for agent systems.

Paper Structure

This paper contains 43 sections, 5 figures, 5 tables, 1 algorithm.

Figures (5)

  • Figure 1: Typical workflow of an LLM agent.
  • Figure 2: A case study of reducing information waste in the trajectory (pytest-dev__pytest-6202).
  • Figure 3: Design of the reflection module in AgentDiet.
  • Figure 4: Histogram of the number of tokens reduced by AgentDiet in each step. The X axis is plotted logarithmically. SbV = SWE-bench Verified. MSbF = Multi-SWE-bench Flash.
  • Figure 5: Histogram of the Step metric on Multi-SWE-bench Flash + Gemini 2.5 Pro between Original (left) and AgentDiet (right).