Harmonizing Intra-coherence and Inter-divergence in Ensemble Attacks for Adversarial Transferability

📅 2025-05-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the limited transferability of black-box adversarial attacks, this paper pioneers the integration of domain generalization into adversarial example generation, proposing a novel domain-generalization-driven ensemble attack paradigm. To overcome two key limitations of existing methods—insufficient inter-model gradient sharing and static weight assignment—we introduce a consensus gradient direction synthesis mechanism and a dual-harmonized weighting strategy, jointly optimizing intra-model gradient consistency and inter-model gradient diversity. The method integrates singular value decomposition, gradient consistency regularization, and dynamic weighted ensemble. Extensive experiments on ImageNet and CIFAR-10 across heterogeneous architectures—including ResNet, VGG, and ViT—demonstrate an average 12.7% improvement in transfer-based attack success rate over state-of-the-art methods.

Technology Category

Application Category

📝 Abstract
The development of model ensemble attacks has significantly improved the transferability of adversarial examples, but this progress also poses severe threats to the security of deep neural networks. Existing methods, however, face two critical challenges: insufficient capture of shared gradient directions across models and a lack of adaptive weight allocation mechanisms. To address these issues, we propose a novel method Harmonized Ensemble for Adversarial Transferability (HEAT), which introduces domain generalization into adversarial example generation for the first time. HEAT consists of two key modules: Consensus Gradient Direction Synthesizer, which uses Singular Value Decomposition to synthesize shared gradient directions; and Dual-Harmony Weight Orchestrator which dynamically balances intra-domain coherence, stabilizing gradients within individual models, and inter-domain diversity, enhancing transferability across models. Experimental results demonstrate that HEAT significantly outperforms existing methods across various datasets and settings, offering a new perspective and direction for adversarial attack research.
Problem

Research questions and friction points this paper is trying to address.

Improving adversarial example transferability via ensemble attacks
Addressing shared gradient direction capture insufficiency
Developing adaptive weight allocation for model harmony
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses Singular Value Decomposition for shared gradients
Dynamically balances intra-domain and inter-domain harmony
Introduces domain generalization in adversarial example generation
🔎 Similar Papers
No similar papers found.
Z
Zhaoyang Ma
Z
Zhihao Wu
W
Wang Lu
X
Xin Gao
J
Jinghang Yue
Taolin Zhang
Taolin Zhang
Hefei University of Technology
LLMVLLMDeep Learning
Lipo Wang
Lipo Wang
Nanyang Technological University
machine learningbiomedical engineeringoptimization
Y
Youfang Lin
J
Jing Wang