Superpixel Attack - Enhancing Black-Box Adversarial Attack with Image-Driven Division Areas

📅 2025-11-29

🏛️ Applied Informatics

📈 Citations: 0

✨ Influential: 0

career value

226K/year

🤖 AI Summary

Deep learning models deployed in safety-critical applications—such as autonomous driving and facial recognition—are vulnerable to black-box adversarial attacks, where attackers lack access to model internals. To address this, we propose a superpixel-based efficient black-box attack method. Unlike conventional rectangular perturbations, our approach leverages superpixels to achieve image-driven, semantically coherent region segmentation, striking an improved balance between color consistency and spatial compactness. We further design a multi-functional search strategy integrated with iterative perturbation optimization, enabling high-quality adversarial example generation without querying model parameters or gradients. Extensive experiments demonstrate that our method achieves an average 2.10% improvement in attack success rate across multiple state-of-the-art defended models, significantly outperforming existing black-box attack baselines. The source code is publicly available.

Technology Category

Application Category

📝 Abstract

Deep learning models are used in safety-critical tasks such as automated driving and face recognition. However, small perturbations in the model input can significantly change the predictions. Adversarial attacks are used to identify small perturbations that can lead to misclassifications. More powerful black-box adversarial attacks are required to develop more effective defenses. A promising approach to black-box adversarial attacks is to repeat the process of extracting a specific image area and changing the perturbations added to it. Existing attacks adopt simple rectangles as the areas where perturbations are changed in a single iteration. We propose applying superpixels instead, which achieve a good balance between color variance and compactness. We also propose a new search method, versatile search, and a novel attack method, Superpixel Attack, which applies superpixels and performs versatile search. Superpixel Attack improves attack success rates by an average of 2.10% compared with existing attacks. Most models used in this study are robust against adversarial attacks, and this improvement is significant for black-box adversarial attacks. The code is avilable at https://github.com/oe1307/SuperpixelAttack.git.

Problem

Research questions and friction points this paper is trying to address.

Enhancing black-box adversarial attack success rates

Applying superpixels to balance color variance and compactness

Proposing versatile search for improved perturbation effectiveness

Innovation

Methods, ideas, or system contributions that make the work stand out.

Superpixel Attack uses superpixels for perturbation areas

It introduces versatile search method for optimization

The approach improves black-box attack success rates

🔎 Similar Papers

Improving the Robustness of Object Detection and Classification AI models against Adversarial Patch Attacks