Enhancing Reasoning with Collaboration and Memory
Julie Michelman, Nasrin Baratalipour, Matthew Abueg
TL;DR
This work investigates how collaboration among multiple LLM agents, diverse reasoning styles, and memory banks can improve reasoning performance. By introducing varied-context exemplars and a summarizer agent, and by comparing frozen and learned memory with different retrieval strategies, the study reveals that random exemplar retrieval and distributed varied-context perspectives often outperform more principled similarity-based retrieval and homogeneous setups. Analogical prompting demonstrates robustness to memory design, while summarizers tend to aid weaker models more than stronger ones. The findings offer practical guidance for building continuous, memory-augmented, multi-agent reasoning systems and highlight the nuanced interactions between memory, prompting, and collaboration in large language models.
Abstract
We envision a continuous collaborative learning system where groups of LLM agents work together to solve reasoning problems, drawing on memory they collectively build to improve performance as they gain experience. This work establishes the foundations for such a system by studying the interoperability of chain-of-thought reasoning styles, multi-agent collaboration, and memory banks. Extending beyond the identical agents of self-consistency, we introduce varied-context agents with diverse exemplars and a summarizer agent in place of voting. We generate frozen and continuously learned memory banks of exemplars and pair them with fixed, random, and similarity-based retrieval mechanisms. Our systematic study reveals where various methods contribute to reasoning performance of two LLMs on three grounded reasoning tasks, showing that random exemplar selection can often beat more principled approaches, and in some tasks, inclusion of any exemplars serves only to distract both weak and strong models.
