MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale

📅 2024-08-29
🏛️ arXiv.org
📈 Citations: 2
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses Multi-Agent Path Finding (MAPF), an NP-hard combinatorial optimization problem. We propose MAPF-GPT, the first end-to-end foundation model for MAPF built solely on imitation learning and a Transformer architecture. Trained via behavioral cloning on expert trajectory data, it requires no hand-crafted heuristics, inter-agent communication, or online search. Its key contribution is the first demonstration of zero-shot generalization in MAPF: given unseen maps, obstacle configurations, and agent counts, it directly generates collision-free paths without fine-tuning. Extensive experiments on diverse MAPF benchmarks show that MAPF-GPT significantly outperforms existing learnable solvers in solution quality and success rate, while achieving high inference efficiency and strong scalability—enabling real-time deployment on large-scale instances.

Technology Category

Application Category

📝 Abstract
Multi-agent pathfinding (MAPF) is a problem that generally requires finding collision-free paths for multiple agents in a shared environment. Solving MAPF optimally, even under restrictive assumptions, is NP-hard, yet efficient solutions for this problem are critical for numerous applications, such as automated warehouses and transportation systems. Recently, learning-based approaches to MAPF have gained attention, particularly those leveraging deep reinforcement learning. Typically, such learning-based MAPF solvers are augmented with additional components like single-agent planning or communication. Orthogonally, in this work we rely solely on imitation learning that leverages a large dataset of expert MAPF solutions and transformer-based neural network to create a foundation model for MAPF called MAPF-GPT. The latter is capable of generating actions without additional heuristics or communication. MAPF-GPT demonstrates zero-shot learning abilities when solving the MAPF problems that are not present in the training dataset. We show that MAPF-GPT notably outperforms the current best-performing learnable MAPF solvers on a diverse range of problem instances and is computationally efficient during inference.
Problem

Research questions and friction points this paper is trying to address.

Develops imitation learning for multi-agent pathfinding.
Uses transformer-based model without heuristics.
Achieves zero-shot learning in diverse problem instances.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Imitation learning for MAPF
Transformer-based neural network
Zero-shot learning capabilities
🔎 Similar Papers
No similar papers found.
A
A. Andreychuk
AIRI, Moscow, Russia
Konstantin Yakovlev
Konstantin Yakovlev
HSE University
Machine Learning
A
Aleksandr Panov
AIRI, Moscow, Russia; Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, Moscow, Russia; Moscow Institute of Physics and Technology, Dolgoprudny, Russia
Alexey Skrynnik
Alexey Skrynnik
CogAI Lab
Reinforcement learningMachine LearningAIMulti-agent systems