Enabling Agents to Communicate Entirely in Latent Space
Zhuoyun Du, Runze Wang, Huiyu Bai, Zouying Cao, Xiaoyong Zhu, Bo Zheng, Wei Chen, Haochao Ying
TL;DR
The paper addresses the bottleneck of language-based inter-agent communication in LLM-driven systems by proposing Interlat, which transmits entire latent representations (last-layer hidden states) between agents and applies explicit compression. It introduces a training framework with conditional mind separation and plan-alignment losses, plus a latent-space compression stage to produce concise yet information-rich messages. Across ALFWorld experiments, Interlat improves task success and promotes exploratory, multi-path reasoning, while reducing communication latency up to about 24× with minimal performance loss. This work demonstrates the feasibility and benefits of fully latent-space inter-agent communication, offering practical guidance for building more efficient and capable multi-agent systems.
Abstract
While natural language is the de facto communication medium for LLM-based agents, it presents a fundamental constraint. The process of downsampling rich, internal latent states into discrete tokens inherently limits the depth and nuance of information that can be transmitted, thereby hindering collaborative problem-solving. Inspired by human mind-reading, we propose Interlat (Inter-agent Latent Space Communication), a paradigm that leverages the last hidden states of an LLM as a representation of its mind for direct transmission (termed latent communication). An additional compression process further compresses latent communication via entirely latent space reasoning. Experiments demonstrate that Interlat outperforms both fine-tuned chain-of-thought (CoT) prompting and single-agent baselines, promoting more exploratory behavior and enabling genuine utilization of latent information. Further compression not only substantially accelerates inference but also maintains competitive performance through an efficient information-preserving mechanism. We position this work as a feasibility study of entirely latent space inter-agent communication, and our results highlight its potential, offering valuable insights for future research.
