Wolfpack Adversarial Attack for Robust Multi-Agent Reinforcement Learning

📅 2025-02-05

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

Multi-agent reinforcement learning (MARL) suffers from brittle cooperation under adversarial attacks, where coordinated perturbations cause catastrophic failure of collaborative behavior. Method: This paper proposes Wolfpack, the first biologically inspired, targeted attack framework for MARL, leveraging multi-agent gradient-based perturbations, adjacency-aware disturbance, and collaboration-aware perturbation to destabilize cooperative policies. To counter such threats, we introduce WALL—a novel defense framework featuring collaboration-stability-oriented adversarial training, integrating centralized training with decentralized execution (CTDE) and collaboration-aware regularization. Contribution/Results: Wolfpack reduces cooperation success rates by 62% on average across benchmarks. WALL achieves 89% task completion under diverse attacks, improving robustness by over 17% relative to state-of-the-art defenses—establishing the first end-to-end attack-defense闭环 in MARL grounded in collaboration stability.

Technology Category

Application Category

📝 Abstract

Traditional robust methods in multi-agent reinforcement learning (MARL) often struggle against coordinated adversarial attacks in cooperative scenarios. To address this limitation, we propose the Wolfpack Adversarial Attack framework, inspired by wolf hunting strategies, which targets an initial agent and its assisting agents to disrupt cooperation. Additionally, we introduce the Wolfpack-Adversarial Learning for MARL (WALL) framework, which trains robust MARL policies to defend against the proposed Wolfpack attack by fostering system-wide collaboration. Experimental results underscore the devastating impact of the Wolfpack attack and the significant robustness improvements achieved by WALL.

Problem

Research questions and friction points this paper is trying to address.

Addresses vulnerability in multi-agent reinforcement learning

Proposes defense against coordinated adversarial attacks

Enhances robustness through system-wide collaboration

Innovation

Methods, ideas, or system contributions that make the work stand out.

Wolfpack Adversarial Attack framework

targets initial and assisting agents

WALL enhances MARL robustness

🔎 Similar Papers

No similar papers found.

Authors to Follow