CoRA: A Collaborative Robust Architecture with Hybrid Fusion for Efficient Perception

๐Ÿ“… 2025-12-15
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
To address feature misalignment and severe performance degradation caused by communication mismatches in collaborative perception, this paper proposes a hybrid architecture integrating feature-level intermediate fusion with semantic-driven object-level correction. We first reveal the complementary nature of intermediate and late fusion, and introduce a novel dual-branch decoupled design: one branch employs selective feature fusion to reduce communication overhead, while the other leverages semantic-guided spatial displacement estimation and a lightweight object-level correction network to robustly rectify pose estimation errors. The proposed jointly optimized hybrid fusion paradigm achieves approximately 19% improvement in AP@0.7 under extreme communication mismatch, while reducing communication volume by over 5ร—โ€”significantly outperforming existing state-of-the-art methods.

Technology Category

Application Category

๐Ÿ“ Abstract
Collaborative perception has garnered significant attention as a crucial technology to overcome the perceptual limitations of single-agent systems. Many state-of-the-art (SOTA) methods have achieved communication efficiency and high performance via intermediate fusion. However, they share a critical vulnerability: their performance degrades under adverse communication conditions due to the misalignment induced by data transmission, which severely hampers their practical deployment. To bridge this gap, we re-examine different fusion paradigms, and recover that the strengths of intermediate and late fusion are not a trade-off, but a complementary pairing. Based on this key insight, we propose CoRA, a novel collaborative robust architecture with a hybrid approach to decouple performance from robustness with low communication. It is composed of two components: a feature-level fusion branch and an object-level correction branch. Its first branch selects critical features and fuses them efficiently to ensure both performance and scalability. The second branch leverages semantic relevance to correct spatial displacements, guaranteeing resilience against pose errors. Experiments demonstrate the superiority of CoRA. Under extreme scenarios, CoRA improves upon its baseline performance by approximately 19% in AP@0.7 with more than 5x less communication volume, which makes it a promising solution for robust collaborative perception.
Problem

Research questions and friction points this paper is trying to address.

Addresses performance degradation in collaborative perception under adverse communication
Proposes hybrid fusion to decouple performance from robustness with low communication
Corrects spatial displacements using semantic relevance to ensure resilience against errors
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hybrid fusion combining intermediate and late fusion
Feature-level fusion branch for efficient critical feature selection
Object-level correction branch for spatial displacement resilience
๐Ÿ”Ž Similar Papers
No similar papers found.
Gong Chen
Gong Chen
Nanjing University
Magnetic imaging
C
Chaokun Zhang
College of Intelligence and Computing, Tianjin University, Tianjin, China
P
Pengcheng Lv
School of Future Technology, Tianjin University, Tianjin, China
X
Xiaohui Xie
Department of Computer Science and Technology, Tsinghua University, Beijing, China