Long-Tailed Visual Recognition via Permutation-Invariant Head-to-Tail Feature Fusion

📅 2025-05-31

📈 Citations: 0

✨ Influential: 0

career value

221K/year

🤖 AI Summary

Long-tailed class distribution severely degrades deep models’ recognition performance on tail classes, primarily due to representation space distortion and classifier bias induced by sparse tail-class features. To address this, we propose the Permutation-Invariant Head-to-Tail Feature fusion framework (PIF-H2TF), which enables unbiased semantic transfer from head to tail classes via permutation-symmetric modeling—jointly optimizing both feature representation space and decision boundaries. Our approach innovatively integrates feature reweighting, semantic distillation, and classifier calibration into a plug-and-play module compatible with mainstream backbones and long-tailed learning paradigms. Extensive experiments on standard benchmarks—including ImageNet-LT and iNaturalist—demonstrate significant improvements in tail-class accuracy, achieving new state-of-the-art overall performance without incurring additional inference overhead.

Technology Category

Application Category

📝 Abstract

The imbalanced distribution of long-tailed data presents a significant challenge for deep learning models, causing them to prioritize head classes while neglecting tail classes. Two key factors contributing to low recognition accuracy are the deformed representation space and a biased classifier, stemming from insufficient semantic information in tail classes. To address these issues, we propose permutation-invariant and head-to-tail feature fusion (PI-H2T), a highly adaptable method. PI-H2T enhances the representation space through permutation-invariant representation fusion (PIF), yielding more clustered features and automatic class margins. Additionally, it adjusts the biased classifier by transferring semantic information from head to tail classes via head-to-tail fusion (H2TF), improving tail class diversity. Theoretical analysis and experiments show that PI-H2T optimizes both the representation space and decision boundaries. Its plug-and-play design ensures seamless integration into existing methods, providing a straightforward path to further performance improvements. Extensive experiments on long-tailed benchmarks confirm the effectiveness of PI-H2T.

Problem

Research questions and friction points this paper is trying to address.

Addresses imbalanced data distribution in long-tailed recognition

Improves tail class accuracy via feature fusion

Optimizes representation space and decision boundaries

Innovation

Methods, ideas, or system contributions that make the work stand out.

Permutation-invariant feature fusion enhances representation

Head-to-tail fusion transfers semantic information

Plug-and-play design integrates seamlessly

🔎 Similar Papers

No similar papers found.