Cross-Modality Perturbation Synergy Attack for Person Re-identification

📅 2024-01-18
🏛️ Neural Information Processing Systems
📈 Citations: 14
Influential: 0
📄 PDF
🤖 AI Summary
Existing security research on person re-identification (ReID) focuses predominantly on single-modality (RGB) settings, leaving cross-modal (e.g., RGB-infrared) ReID systems’ vulnerabilities systematically unexplored. Method: This work presents the first systematic security study of cross-modal ReID models and proposes a modality-aware universal adversarial attack. By modeling cross-modal feature disentanglement, the method jointly exploits gradient signals from both infrared and visible-light modalities to co-optimize a universal perturbation—thereby amplifying inter-modal feature discrepancy and degrading discriminative capability. Contribution/Results: The approach explicitly accounts for modality-specific characteristics and distributional shifts between modalities. Evaluated on three major benchmarks—RegDB, SYSU-MM01, and LLCM—it reduces Rank-1 accuracy by over 40% on average, exposing critical robustness bottlenecks in current cross-modal ReID architectures. This work establishes a novel paradigm for security assessment of multimodal vision systems.

Technology Category

Application Category

📝 Abstract
In recent years, there has been significant research focusing on addressing security concerns in single-modal person re-identification (ReID) systems that are based on RGB images. However, the safety of cross-modality scenarios, which are more commonly encountered in practical applications involving images captured by infrared cameras, has not received adequate attention. The main challenge in cross-modality ReID lies in effectively dealing with visual differences between different modalities. For instance, infrared images are typically grayscale, unlike visible images that contain color information. Existing attack methods have primarily focused on the characteristics of the visible image modality, overlooking the features of other modalities and the variations in data distribution among different modalities. This oversight can potentially undermine the effectiveness of these methods in image retrieval across diverse modalities. This study represents the first exploration into the security of cross-modality ReID models and proposes a universal perturbation attack specifically designed for cross-modality ReID. This attack optimizes perturbations by leveraging gradients from diverse modality data, thereby disrupting the discriminator and reinforcing the differences between modalities. We conducted experiments on three widely used cross-modality datasets, namely RegDB, SYSU, and LLCM. The results not only demonstrate the effectiveness of our method but also provide insights for future improvements in the robustness of cross-modality ReID systems. The code will be available at https://github.com/finger-monkey/cmps__attack.
Problem

Research questions and friction points this paper is trying to address.

Addressing security gaps in cross-modality person ReID systems
Overcoming visual differences between RGB and infrared modalities
Developing universal perturbation attacks for multi-modal ReID robustness
Innovation

Methods, ideas, or system contributions that make the work stand out.

Universal perturbation attack for cross-modality ReID
Optimizes perturbations using diverse modality gradients
Disrupts discriminator and reinforces modality differences
🔎 Similar Papers
No similar papers found.
Y
Yunpeng Gong
School of Informatics, Xiamen University
Zhun Zhong
Zhun Zhong
Hefei University of Technology & University of Nottingham
Computer Vision
Zhiming Luo
Zhiming Luo
Xiamen University
Computer VisionDeep LearningMachine Learning
Yansong Qu
Yansong Qu
Purdue University-West Lafayette
Intelligent TransportationAutonomous Driving
R
Rongrong Ji
School of Informforms, Xiamen University
M
Min Jiang
Department of Artificial Intelligence, Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, School of Informatics, Key Laboratory of Digital Protection and Intelligent Processing of Intangible Cultural Heritage of Fujian and Taiwan, Ministry of Culture and Tourism, Xiamen University, Xiamen 361005, Fujian, P.R. China