🤖 AI Summary
Large language models (LLMs) face significant challenges in real-time performance, deployment cost, and practical integration within 4X/ grand strategy games. Method: This paper proposes an LLM+X hierarchical architecture—decoupling macro-level strategic reasoning (e.g., diplomacy, long-term planning) performed by LLMs from micro-level tactical execution handled by lightweight rule-based or symbolic subsystems. We introduce the first hybrid LLM framework tailored for commercial 4X games, specifically Civilization V enhanced with the Vox Populi mod, leveraging Llama 3 and Phi-3 via prompt engineering and instruction tuning, and designing modular interfaces and an automated evaluation framework. Contribution/Results: Across 2,327 full-game episodes, our LLM agent achieves win rates competitive with enhanced rule-based AI, exhibits distinct and diverse strategic styles—both relative to traditional AI and across different LLM variants—and demonstrates feasibility of low-latency, cost-efficient deployment.
📝 Abstract
Large Language Models' capacity to reason in natural language makes them uniquely promising for 4X and grand strategy games, enabling more natural human-AI gameplay interactions such as collaboration and negotiation. However, these games present unique challenges due to their complexity and long-horizon nature, while latency and cost factors may hinder LLMs' real-world deployment. Working on a classic 4X strategy game, Sid Meier's Civilization V with the Vox Populi mod, we introduce Vox Deorum, a hybrid LLM+X architecture. Our layered technical design empowers LLMs to handle macro-strategic reasoning, delegating tactical execution to subsystems (e.g., algorithmic AI or reinforcement learning AI in the future). We validate our approach through 2,327 complete games, comparing two open-source LLMs with a simple prompt against Vox Populi's enhanced AI. Results show that LLMs achieve competitive end-to-end gameplay while exhibiting play styles that diverge substantially from algorithmic AI and from each other. Our work establishes a viable architecture for integrating LLMs in commercial 4X games, opening new opportunities for game design and agentic AI research.