Long-Tailed Visual Recognition via Permutation-Invariant Head-to-Tail Feature Fusion

📅 2025-05-31
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Long-tailed class distribution severely degrades deep models’ recognition performance on tail classes, primarily due to representation space distortion and classifier bias induced by sparse tail-class features. To address this, we propose the Permutation-Invariant Head-to-Tail Feature fusion framework (PIF-H2TF), which enables unbiased semantic transfer from head to tail classes via permutation-symmetric modeling—jointly optimizing both feature representation space and decision boundaries. Our approach innovatively integrates feature reweighting, semantic distillation, and classifier calibration into a plug-and-play module compatible with mainstream backbones and long-tailed learning paradigms. Extensive experiments on standard benchmarks—including ImageNet-LT and iNaturalist—demonstrate significant improvements in tail-class accuracy, achieving new state-of-the-art overall performance without incurring additional inference overhead.

Technology Category

Application Category

📝 Abstract
The imbalanced distribution of long-tailed data presents a significant challenge for deep learning models, causing them to prioritize head classes while neglecting tail classes. Two key factors contributing to low recognition accuracy are the deformed representation space and a biased classifier, stemming from insufficient semantic information in tail classes. To address these issues, we propose permutation-invariant and head-to-tail feature fusion (PI-H2T), a highly adaptable method. PI-H2T enhances the representation space through permutation-invariant representation fusion (PIF), yielding more clustered features and automatic class margins. Additionally, it adjusts the biased classifier by transferring semantic information from head to tail classes via head-to-tail fusion (H2TF), improving tail class diversity. Theoretical analysis and experiments show that PI-H2T optimizes both the representation space and decision boundaries. Its plug-and-play design ensures seamless integration into existing methods, providing a straightforward path to further performance improvements. Extensive experiments on long-tailed benchmarks confirm the effectiveness of PI-H2T.
Problem

Research questions and friction points this paper is trying to address.

Addresses imbalanced data distribution in long-tailed recognition
Improves tail class accuracy via feature fusion
Optimizes representation space and decision boundaries
Innovation

Methods, ideas, or system contributions that make the work stand out.

Permutation-invariant feature fusion enhances representation
Head-to-tail fusion transfers semantic information
Plug-and-play design integrates seamlessly
🔎 Similar Papers
No similar papers found.
M
Mengke Li
College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
Zhikai Hu
Zhikai Hu
Hong Kong Baptist University
Y
Yang Lu
Fujian Key Laboratory of Sensing and Computing for Smart City, School of Informatics, Xiamen University, Xiamen, China
W
Weichao Lan
Inspur Smart City Technology Co., Ltd., Jinan, China
Y
Yiu-ming Cheung
Department of Computer Science, Hong Kong Baptist University, Hong Kong SAR, China
H
Hui Huang
College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China