Wolfpack Adversarial Attack for Robust Multi-Agent Reinforcement Learning

📅 2025-02-05
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Multi-agent reinforcement learning (MARL) suffers from brittle cooperation under adversarial attacks, where coordinated perturbations cause catastrophic failure of collaborative behavior. Method: This paper proposes Wolfpack, the first biologically inspired, targeted attack framework for MARL, leveraging multi-agent gradient-based perturbations, adjacency-aware disturbance, and collaboration-aware perturbation to destabilize cooperative policies. To counter such threats, we introduce WALL—a novel defense framework featuring collaboration-stability-oriented adversarial training, integrating centralized training with decentralized execution (CTDE) and collaboration-aware regularization. Contribution/Results: Wolfpack reduces cooperation success rates by 62% on average across benchmarks. WALL achieves 89% task completion under diverse attacks, improving robustness by over 17% relative to state-of-the-art defenses—establishing the first end-to-end attack-defense闭环 in MARL grounded in collaboration stability.

Technology Category

Application Category

📝 Abstract
Traditional robust methods in multi-agent reinforcement learning (MARL) often struggle against coordinated adversarial attacks in cooperative scenarios. To address this limitation, we propose the Wolfpack Adversarial Attack framework, inspired by wolf hunting strategies, which targets an initial agent and its assisting agents to disrupt cooperation. Additionally, we introduce the Wolfpack-Adversarial Learning for MARL (WALL) framework, which trains robust MARL policies to defend against the proposed Wolfpack attack by fostering system-wide collaboration. Experimental results underscore the devastating impact of the Wolfpack attack and the significant robustness improvements achieved by WALL.
Problem

Research questions and friction points this paper is trying to address.

Addresses vulnerability in multi-agent reinforcement learning
Proposes defense against coordinated adversarial attacks
Enhances robustness through system-wide collaboration
Innovation

Methods, ideas, or system contributions that make the work stand out.

Wolfpack Adversarial Attack framework
targets initial and assisting agents
WALL enhances MARL robustness
🔎 Similar Papers
No similar papers found.
S
Sunwoo Lee
Graduate School of Artificial Intelligence, UNIST, Ulsan, South Korea
J
Jaebak Hwang
Graduate School of Artificial Intelligence, UNIST, Ulsan, South Korea
Y
Yonghyeon Jo
Graduate School of Artificial Intelligence, UNIST, Ulsan, South Korea
Seungyul Han
Seungyul Han
Assistant Professor, Graduate School of AI, UNIST
Reinforcement LearningMachine LearningIntelligent ControlSignal Processing