AgentGit: A Version Control Framework for Reliable and Scalable LLM-Powered Multi-Agent Systems

📅 2025-11-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the low reliability and poor scalability of large language model (LLM)-based multi-agent systems in complex tasks, this paper proposes the first version control framework specifically designed for multi-agent behavioral trajectories. Inspired by Git, it introduces commit, branch, and rollback primitives to enable fine-grained agent state management, parallel exploration, and fault recovery. Methodologically, the framework builds an infrastructure layer atop LangGraph, supporting state snapshots of collaborative workflows, cross-branch trajectory comparison, and atomic rollbacks. Its core contribution lies in adapting software engineering’s version control paradigm to multi-agent systems—enabling, for the first time, safe experimentation, iterative debugging, and A/B testing of agent behaviors. Evaluated on a real-world scientific paper abstract analysis task, the framework reduces redundant computation, achieving average improvements of 32.7% in both execution time and token consumption, while concurrently enhancing system stability and collaborative efficiency.

Technology Category

Application Category

📝 Abstract
With the rapid progress of large language models (LLMs), LLM-powered multi-agent systems (MAS) are drawing increasing interest across academia and industry. However, many current MAS frameworks struggle with reliability and scalability, especially on complex tasks. We present AgentGit, a framework that brings Git-like rollback and branching to MAS workflows. Built as an infrastructure layer on top of LangGraph, AgentGit supports state commit, revert, and branching, allowing agents to traverse, compare, and explore multiple trajectories efficiently. To evaluate AgentGit, we designed an experiment that optimizes target agents by selecting better prompts. We ran a multi-step A/B test against three baselines -- LangGraph, AutoGen, and Agno -- on a real-world task: retrieving and analyzing paper abstracts. Results show that AgentGit significantly reduces redundant computation, lowers runtime and token usage, and supports parallel exploration across multiple branches, enhancing both reliability and scalability in MAS development. This work offers a practical path to more robust MAS design and enables error recovery, safe exploration, iterative debugging, and A/B testing in collaborative AI systems.
Problem

Research questions and friction points this paper is trying to address.

Addressing reliability and scalability issues in LLM-powered multi-agent systems
Enabling efficient trajectory comparison and exploration through Git-like version control
Reducing redundant computation and resource usage in complex multi-agent tasks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Git-like rollback and branching for multi-agent workflows
State commit, revert, and branching for trajectory exploration
Parallel exploration across branches to reduce computation redundancy
🔎 Similar Papers
No similar papers found.
Y
Yang Li
University of Hong Kong, Hong Kong SAR
S
Siqi Ping
University of Hong Kong, Hong Kong SAR
X
Xiyu Chen
University of Hong Kong, Hong Kong SAR
X
Xiaojian Qi
University of Hong Kong, Hong Kong SAR
Zigan Wang
Zigan Wang
Tsinghua University
Corporate FinanceForeign ExchangePolitical EconomicsEnvironmental EconomicsEconometrics
Y
Ye Luo
University of Hong Kong, Pokfulam Road, Hong Kong SAR
X
Xiaowei Zhang
Hong Kong University of Science and Technology, Hong Kong SAR