Optimizing Service Operations via LLM-Powered Multi-Agent Simulation

πŸ“… 2026-04-05
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This study addresses the challenge of optimizing service system performance in the presence of complex and difficult-to-model participant behavioral responses. The authors propose a large language model–based multi-agent simulation framework (LLM-MAS), which formulates the design problem as a stochastic optimization problem with decision-dependent uncertainty. By leveraging prompt engineering to orchestrate agent interactions, the framework simultaneously performs zeroth-order gradient estimation and updates design parameters within a single simulation trajectory. This approach uniquely integrates LLM-driven multi-agent simulation with steady-state performance optimization, incorporating variance reduction techniques and controlled Markov chain modeling. Empirical evaluations in sustainable supply chain and tournament design scenarios demonstrate substantial improvements over black-box optimization baselines, enabling both efficient evaluation of existing designs and discovery of high-performing configurations overlooked by conventional methods.
πŸ“ Abstract
Service system performance depends on how participants respond to design choices, but modeling these responses is hard due to the complexity of human behavior. We introduce an LLM-powered multi-agent simulation (LLM-MAS) framework for optimizing service operations. We pose the problem as stochastic optimization with decision-dependent uncertainty: design choices are embedded in prompts and shape the distribution of outcomes from interacting LLM-powered agents. By embedding key numerical information in prompts and extracting it from LLM-generated text, we model this uncertainty as a controlled Markov chain. We develop an on-trajectory learning algorithm that, on a single simulation run, simultaneously constructs zeroth-order gradient estimates and updates design parameters to optimize steady-state performance. We also incorporate variance reduction techniques. In a sustainable supply chain application, our method outperforms benchmarks, including blackbox optimization and using LLMs as numerical solvers or as role-playing system designers. A case study on optimal contest design with real behavioral data shows that LLM-MAS is both as a cost-effective evaluator of known designs and an exploratory tool that can uncover strong designs overlooked by traditional approaches.
Problem

Research questions and friction points this paper is trying to address.

service operations
human behavior
decision-dependent uncertainty
stochastic optimization
multi-agent simulation
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-powered multi-agent simulation
decision-dependent uncertainty
zeroth-order optimization
controlled Markov chain
on-trajectory learning
Y
Yanyuan Wang
Department of Industrial Engineering and Decision Analytics, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong SAR
Xiaowei Zhang
Xiaowei Zhang
The Hong Kong University of Science and Technology
stochastic simulation and optimizationdecision analyticsreinforcement learning