Direct Semantic Communication Between Large Language Models via Vector Translation
Fu-Chun Yang, Jason Eshraghian
TL;DR
Cross-model semantic communication is enabled by learning bidirectional latent translations between distinct LLM representation spaces. The authors introduce a dual-encoder translator and a conservative vector-injection mechanism that injects translated vectors into the target model with a blending strength of $0.3$ to steer generation while preserving stability. They report an average cosine alignment of $0.538$ across five domains and a bidirectional asymmetry of $2.01:1$, indicating general-purpose representations transfer more readily than instruction-tuned ones. A detailed machine-learning-domain case study and multi-domain evaluation demonstrate consistent semantic transfer without destabilizing decoding, suggesting practical pathways for collaborative AI systems that share meaning rather than tokens.
Abstract
In multi-agent settings, such as debate, reflection, or tool-calling, large language models (LLMs) pass messages as plain tokens, discarding most latent semantics. This constrains information transfer and adds unnecessary computational overhead. We form a latent bridge via vector translations, which use learned mappings that enable direct semantic exchange between representation spaces. A dual-encoder translator trained between Llama-2-7B and Mistral-7B-Instruct attains an average cosine alignment of 0.538. Injecting the translated vectors at 30 percent blending strength steers the target model's generation without destabilizing logits. Bidirectional evaluation shows a 2.01:1 transfer asymmetry, indicating that general-purpose models yield more transferable representations than instruction-tuned variants. This conservative injection preserves computational stability while demonstrating that cross-model latent communication is feasible, enabling collaborative AI systems that share meaning rather than tokens.
