π€ AI Summary
This study addresses the challenge of monitoring dynamic evolution of individual and collective behaviors in black-box multi-agent systems. We propose the Temporal Data Kernel Perspective Space (TDKPS) joint embedding framework, which employs low-dimensional representation learning and kernel methods to map agent response sequences across time steps into a unified, comparable feature space. Subsequently, we design novel hypothesis testing procedures tailored to detect behavioral changes at both agent-level and population-level granularities. To our knowledge, this constitutes the first principled, interpretable statistical inference paradigm for monitoring multi-agent behavioral dynamics. Experiments demonstrate robustness to hyperparameter choices; sensitivity and specificity are validated on simulated digital persona systems; and natural experiments confirm the frameworkβs ability to significantly detect behavioral anomalies strongly associated with real-world exogenous events.
π Abstract
Generative models augmented with external tools and update mechanisms (or extit{agents}) have demonstrated capabilities beyond intelligent prompting of base models. As agent use proliferates, dynamic multi-agent systems have naturally emerged. Recent work has investigated the theoretical and empirical properties of low-dimensional representations of agents based on query responses at a single time point. This paper introduces the Temporal Data Kernel Perspective Space (TDKPS), which jointly embeds agents across time, and proposes several novel hypothesis tests for detecting behavioral change at the agent- and group-level in black-box multi-agent systems. We characterize the empirical properties of our proposed tests, including their sensitivity to key hyperparameters, in simulations motivated by a multi-agent system of evolving digital personas. Finally, we demonstrate via natural experiment that our proposed tests detect changes that correlate sensitively, specifically, and significantly with a real exogenous event. As far as we are aware, TDKPS is the first principled framework for monitoring behavioral dynamics in black-box multi-agent systems -- a critical capability as generative agent deployment continues to scale.