Detecting Perspective Shifts in Multi-agent Systems
Eric Bridgeford, Hayden Helm
TL;DR
This work addresses the challenge of detecting perspective shifts in black-box multi-agent systems by introducing the Temporal Data Kernel Perspective Space (TDKPS), a time-aware, low-dimensional embedding of agents. It couples TDkPS with principled agent- and group-level permutation-based tests (including energy-distance and distance-correlation based methods) to detect temporal changes in behavior without requiring access to internal mechanisms. Through simulations and a natural experiment using digital congresspersons and COVID-era public-health queries, the authors demonstrate near-optimal power, robust performance, and substantial computational efficiency for group-level inference. The approach provides a scalable framework for monitoring dynamic agent ecosystems as generative agents continue to proliferate, with implications for safety, reliability, and interpretability in real-world deployments.
Abstract
Generative models augmented with external tools and update mechanisms (or \textit{agents}) have demonstrated capabilities beyond intelligent prompting of base models. As agent use proliferates, dynamic multi-agent systems have naturally emerged. Recent work has investigated the theoretical and empirical properties of low-dimensional representations of agents based on query responses at a single time point. This paper introduces the Temporal Data Kernel Perspective Space (TDKPS), which jointly embeds agents across time, and proposes several novel hypothesis tests for detecting behavioral change at the agent- and group-level in black-box multi-agent systems. We characterize the empirical properties of our proposed tests, including their sensitivity to key hyperparameters, in simulations motivated by a multi-agent system of evolving digital personas. Finally, we demonstrate via natural experiment that our proposed tests detect changes that correlate sensitively, specifically, and significantly with a real exogenous event. As far as we are aware, TDKPS is the first principled framework for monitoring behavioral dynamics in black-box multi-agent systems -- a critical capability as generative agent deployment continues to scale.
