In-Memory Learning: A Declarative Learning Framework for Large Language Models
Bo Wang, Tianxiang Sun, Hang Yan, Siyin Wang, Qingyuan Cheng, Xipeng Qiu
TL;DR
This work proposes In-Memory Learning (IML), a declarative-memory-inspired framework enabling language-model agents to self-improve without human labels by maintaining and refining memory notes through three phases: induction, revision, and inference. It frames the agent's operation as a POMDP with memory context notes $\phi$ and demonstrates a gradient-like refinement process within memory, including a momentum mechanism, and a dedicated benchmark to assess self-improvement. Experiments on a four-class, 10-dimension truth-table benchmark show model-dependent gains in inference and induction across different LLMs, while highlighting challenges such as local minima and parameter sensitivity. The findings suggest that memory-centric, declarative self-improvement is feasible for LLM-based agents, offering a path toward label-free continual learning, albeit with limitations in multimodality and model capacity.
Abstract
The exploration of whether agents can align with their environment without relying on human-labeled data presents an intriguing research topic. Drawing inspiration from the alignment process observed in intelligent organisms, where declarative memory plays a pivotal role in summarizing past experiences, we propose a novel learning framework. The agents adeptly distill insights from past experiences, refining and updating existing notes to enhance their performance in the environment. This entire process transpires within the memory components and is implemented through natural language, so we character this framework as In-memory Learning. We also delve into the key features of benchmarks designed to evaluate the self-improvement process. Through systematic experiments, we demonstrate the effectiveness of our framework and provide insights into this problem.
