LEGOMem: Modular Procedural Memory for Multi-agent LLM Systems for Workflow Automation
Dongge Han, Camille Couturier, Daniel Madrigal Diaz, Xuchao Zhang, Victor Rühle, Saravan Rajmohan
TL;DR
LEGOMem tackles statelessness in multi-agent LLM workflow systems by introducing modular procedural memory, split into full-task and subtask memories stored in a semantic memory bank. The framework operates offline memory construction and online retrieval to augment planning for the orchestrator and execution guidance for task agents, with three retrieval variants (vanilla, dynamic, query rewrite). Experiments on OfficeBench show that orchestrator memory is crucial for planning and delegation, while fine-grained agent memory boosts execution accuracy, enabling smaller models to approach or surpass larger ones. The work provides a practical framework and a research tool to study memory design for multi-agent workflow automation, with implications for continual learning and scalable tool use.
Abstract
We introduce LEGOMem, a modular procedural memory framework for multi-agent large language model (LLM) systems in workflow automation. LEGOMem decomposes past task trajectories into reusable memory units and flexibly allocates them across orchestrators and task agents to support planning and execution. To explore the design space of memory in multi-agent systems, we use LEGOMem as a lens and conduct a systematic study of procedural memory in multi-agent systems, examining where memory should be placed, how it should be retrieved, and which agents benefit most. Experiments on the OfficeBench benchmark show that orchestrator memory is critical for effective task decomposition and delegation, while fine-grained agent memory improves execution accuracy. We find that even teams composed of smaller language models can benefit substantially from procedural memory, narrowing the performance gap with stronger agents by leveraging prior execution traces for more accurate planning and tool use. These results position LEGOMem as both a practical framework for memory-augmented agent systems and a research tool for understanding memory design in multi-agent workflow automation.
