π€ AI Summary
This work addresses the lack of systematic behavioral control mechanisms in existing large language modelβbased multi-agent dialogue systems, which often rely on ad hoc prompt engineering. The authors propose a lightweight policy framework that, for the first time, treats prompts as parameterizable policy actions. By introducing a state-aware dynamic prompting mechanism, the framework enables controllable intervention in multi-agent conversations without requiring model fine-tuning. It comprises five components that jointly generate guiding prompts in real time based on the current dialogue state of each agent. Experimental results on two public-issue discussion scenarios demonstrate significant improvements across key metrics, including responsiveness, rebuttal quality, evidence utilization, non-redundancy, and stance evolution.
π Abstract
Large Language Models (LLMs) have emerged as a new paradigm for multi-agent systems. However, existing research on the behaviour of LLM-based multi-agents relies on ad hoc prompts and lacks a principled policy perspective. Different from reinforcement learning, we investigate whether prompt-as-action can be parameterized so as to construct a lightweight policy which consists of a sequence of state-action pairs to influence conversational behaviours without training. Our framework regards prompts as actions executed by LLMs, and dynamically constructs prompts through five components based on the current state of the agent. To test the effectiveness of parameterized control, we evaluated the dialogue flow based on five indicators: responsiveness, rebuttal, evidence usage, non-repetition, and stance shift. We conduct experiments using different LLM-driven agents in two discussion scenarios related to the general public and show that prompt parameterization can influence the dialogue dynamics. This result shows that policy-parameterised prompts offer a simple and effective mechanism to influence the dialogue process, which will help the research of multi-agent systems in the direction of social simulation.