Table of Contents
Fetching ...

Adaptive Memory Admission Control for LLM Agents

Guilin Zhang, Wei Jiang, Xiejiashan Wang, Aisha Behr, Kai Zhao, Jeffrey Friedman, Xu Chu, Amine Anoun

TL;DR

Findings demonstrate that explicit and interpretable admission control is a critical design principle for scalable and reliable memory in LLM-based agents and proposes Adaptive Memory Admission Control, a framework that treats memory admission as a structured decision problem.

Abstract

LLM-based agents increasingly rely on long-term memory to support multi-session reasoning and interaction, yet current systems provide little control over what information is retained. In practice, agents either accumulate large volumes of conversational content, including hallucinated or obsolete facts, or depend on opaque, fully LLM-driven memory policies that are costly and difficult to audit. As a result, memory admission remains a poorly specified and weakly controlled component in agent architectures. To address this gap, we propose Adaptive Memory Admission Control (A-MAC), a framework that treats memory admission as a structured decision problem. A-MAC decomposes memory value into five complementary and interpretable factors: future utility, factual confidence, semantic novelty, temporal recency, and content type prior. The framework combines lightweight rule-based feature extraction with a single LLM-assisted utility assessment, and learns domain-adaptive admission policies through cross-validated optimization. This design enables transparent and efficient control over long-term memory. Experiments on the LoCoMo benchmark show that A-MAC achieves a superior precision-recall tradeoff, improving F1 to 0.583 while reducing latency by 31% compared to state-of-the-art LLM-native memory systems. Ablation results identify content type prior as the most influential factor for reliable memory admission. These findings demonstrate that explicit and interpretable admission control is a critical design principle for scalable and reliable memory in LLM-based agents.

Adaptive Memory Admission Control for LLM Agents

TL;DR

Findings demonstrate that explicit and interpretable admission control is a critical design principle for scalable and reliable memory in LLM-based agents and proposes Adaptive Memory Admission Control, a framework that treats memory admission as a structured decision problem.

Abstract

LLM-based agents increasingly rely on long-term memory to support multi-session reasoning and interaction, yet current systems provide little control over what information is retained. In practice, agents either accumulate large volumes of conversational content, including hallucinated or obsolete facts, or depend on opaque, fully LLM-driven memory policies that are costly and difficult to audit. As a result, memory admission remains a poorly specified and weakly controlled component in agent architectures. To address this gap, we propose Adaptive Memory Admission Control (A-MAC), a framework that treats memory admission as a structured decision problem. A-MAC decomposes memory value into five complementary and interpretable factors: future utility, factual confidence, semantic novelty, temporal recency, and content type prior. The framework combines lightweight rule-based feature extraction with a single LLM-assisted utility assessment, and learns domain-adaptive admission policies through cross-validated optimization. This design enables transparent and efficient control over long-term memory. Experiments on the LoCoMo benchmark show that A-MAC achieves a superior precision-recall tradeoff, improving F1 to 0.583 while reducing latency by 31% compared to state-of-the-art LLM-native memory systems. Ablation results identify content type prior as the most influential factor for reliable memory admission. These findings demonstrate that explicit and interpretable admission control is a critical design principle for scalable and reliable memory in LLM-based agents.
Paper Structure (16 sections, 4 equations, 3 figures, 5 tables, 1 algorithm)

This paper contains 16 sections, 4 equations, 3 figures, 5 tables, 1 algorithm.

Figures (3)

  • Figure 1: Overview of A-MAC. Candidate memories are extracted from conversation history and evaluated using five complementary signals capturing usefulness, reliability, redundancy, temporal relevance, and persistence. A learned linear admission policy aggregates these signals into a score $\mathcal{S}(m)$ and applies a threshold $\theta$ to determine whether to admit, update, or reject a candidate.
  • Figure 2: Precision-recall tradeoff comparison. A-MAC achieves the best balance between precision and recall, occupying the upper-right region of the curve. Dashed lines indicate F1 iso-contours.
  • Figure 3: Cross-domain F1 performance. Personal conversations achieve higher F1 due to explicit preference statements that align with Type Prior's strengths. The dashed line indicates mean performance across domains.