Active Inference for Self-Organizing Multi-LLM Systems: A Bayesian Thermodynamic Approach to Adaptation
Rithvik Prakki
TL;DR
The paper presents an active-inference framework that sits above LLM-driven agents to dynamically adapt prompts and search strategies, addressing the rigidity of static prompts. By modeling the environment with three state factors and seven observation modalities and optimizing through variational free energy and expected free energy, the approach enables principled exploration of prompt-search configurations while accounting for thermodynamic costs. Empirical results show the agent learns structured environment dynamics, develops meaningful prompt-search observation mappings, and exhibits a shift from information-gathering to targeted testing, underscoring a robust exploration-exploitation balance. This thermodynamic, information-theoretic perspective extends active inference to high-dimensional, language-driven environments and suggests a path toward more autonomous, self-improving AI systems.
Abstract
This paper introduces a novel approach to creating adaptive language agents by integrating active inference with large language models (LLMs). While LLMs demonstrate remarkable capabilities, their reliance on static prompts limits adaptation to new information and changing environments. We address this by implementing an active inference framework that acts as a cognitive layer above an LLM-based agent, dynamically adjusting prompts and search strategies through principled information-seeking behavior. Our framework models the environment using three state factors (prompt, search, and information states) with seven observation modalities capturing quality metrics. By framing the agent's learning through the free energy principle, we enable systematic exploration of prompt combinations and search strategies. Experimental results demonstrate the effectiveness of this approach, with the agent developing accurate models of environment dynamics evidenced by emergent structure in observation matrices. Action selection patterns reveal sophisticated exploration-exploitation behavior, transitioning from initial information-gathering to targeted prompt testing. The integration of thermodynamic principles with language model capabilities provides a principled framework for creating robust, adaptable agents, extending active inference beyond traditional low-dimensional control problems to high-dimensional, language-driven environments.
